identifies candidate genes: Topics by Science.gov

Sample records for identifies candidate genes

Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility.

PubMed

Bruse, Shannon; Moreau, Michael; Bromberg, Yana; Jang, Jun-Ho; Wang, Nan; Ha, Hongseok; Picchi, Maria; Lin, Yong; Langley, Raymond J; Qualls, Clifford; Klensney-Tait, Julia; Zabner, Joseph; Leng, Shuguang; Mao, Jenny; Belinsky, Steven A; Xing, Jinchuan; Nyunoya, Toru

2016-01-07

Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15-20 % smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.
Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

PubMed

Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong

2018-03-01

A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F 2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding sites-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.
Computational Analysis of Candidate Disease Genes and Variants for Salt-Sensitive Hypertension in Indigenous Southern Africans

PubMed Central

Tiffin, Nicki; Meintjes, Ayton; Ramesar, Rajkumar; Bajic, Vladimir B.; Rayner, Brian

2010-01-01

Multiple factors underlie susceptibility to essential hypertension, including a significant genetic and ethnic component, and environmental effects. Blood pressure response of hypertensive individuals to salt is heterogeneous, but salt sensitivity appears more prevalent in people of indigenous African origin. The underlying genetics of salt-sensitive hypertension, however, are poorly understood. In this study, computational methods including text- and data-mining have been used to select and prioritize candidate aetiological genes for salt-sensitive hypertension. Additionally, we have compared allele frequencies and copy number variation for single nucleotide polymorphisms in candidate genes between indigenous Southern African and Caucasian populations, with the aim of identifying candidate genes with significant variability between the population groups: identifying genetic variability between population groups can exploit ethnic differences in disease prevalence to aid with prioritisation of good candidate genes. Our top-ranking candidate genes include parathyroid hormone precursor (PTH) and type-1angiotensin II receptor (AGTR1). We propose that the candidate genes identified in this study warrant further investigation as potential aetiological genes for salt-sensitive hypertension. PMID:20886000
Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq

PubMed Central

Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang

2018-01-01

Flax (Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits. PMID:29375606
Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq.

PubMed

Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang

2017-01-01

Flax ( Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development

PubMed Central

Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.

2016-01-01

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499
Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development.

PubMed

Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G

2016-04-05

Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.
ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

PubMed Central

Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

2012-01-01

Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617
Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE

PubMed Central

Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.

2009-01-01

Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438
Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.

PubMed

Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A

2006-06-01

To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.
Candidate genes for obesity-susceptibility show enriched association within a large genome-wide association study for BMI.

PubMed

Vimaleswaran, Karani S; Tachmazidou, Ioanna; Zhao, Jing Hua; Hirschhorn, Joel N; Dudbridge, Frank; Loos, Ruth J F

2012-10-15

Before the advent of genome-wide association studies (GWASs), hundreds of candidate genes for obesity-susceptibility had been identified through a variety of approaches. We examined whether those obesity candidate genes are enriched for associations with body mass index (BMI) compared with non-candidate genes by using data from a large-scale GWAS. A thorough literature search identified 547 candidate genes for obesity-susceptibility based on evidence from animal studies, Mendelian syndromes, linkage studies, genetic association studies and expression studies. Genomic regions were defined to include the genes ±10 kb of flanking sequence around candidate and non-candidate genes. We used summary statistics publicly available from the discovery stage of the genome-wide meta-analysis for BMI performed by the genetic investigation of anthropometric traits consortium in 123 564 individuals. Hypergeometric, rank tail-strength and gene-set enrichment analysis tests were used to test for the enrichment of association in candidate compared with non-candidate genes. The hypergeometric test of enrichment was not significant at the 5% P-value quantile (P = 0.35), but was nominally significant at the 25% quantile (P = 0.015). The rank tail-strength and gene-set enrichment tests were nominally significant for the full set of genes and borderline significant for the subset without SNPs at P < 10(-7). Taken together, the observed evidence for enrichment suggests that the candidate gene approach retains some value. However, the degree of enrichment is small despite the extensive number of candidate genes and the large sample size. Studies that focus on candidate genes have only slightly increased chances of detecting associations, and are likely to miss many true effects in non-candidate genes, at least for obesity-related traits.
Meta-review of protein network regulating obesity between validated obesity candidate genes in the white adipose tissue of high-fat diet-induced obese C57BL/6J mice.

PubMed

Kim, Eunjung; Kim, Eun Jung; Seo, Seung-Won; Hur, Cheol-Goo; McGregor, Robin A; Choi, Myung-Sook

2014-01-01

Worldwide obesity and related comorbidities are increasing, but identifying new therapeutic targets remains a challenge. A plethora of microarray studies in diet-induced obesity models has provided large datasets of obesity associated genes. In this review, we describe an approach to examine the underlying molecular network regulating obesity, and we discuss interactions between obesity candidate genes. We conducted network analysis on functional protein-protein interactions associated with 25 obesity candidate genes identified in a literature-driven approach based on published microarray studies of diet-induced obesity. The obesity candidate genes were closely associated with lipid metabolism and inflammation. Peroxisome proliferator activated receptor gamma (Pparg) appeared to be a core obesity gene, and obesity candidate genes were highly interconnected, suggesting a coordinately regulated molecular network in adipose tissue. In conclusion, the current network analysis approach may help elucidate the underlying molecular network regulating obesity and identify anti-obesity targets for therapeutic intervention.
The Genetic Basis for Variation in Sensitivity to Lead Toxicity in Drosophila melanogaster.

PubMed

Zhou, Shanshan; Morozova, Tatiana V; Hussain, Yasmeen N; Luoma, Sarah E; McCoy, Lenovia; Yamamoto, Akihiko; Mackay, Trudy F C; Anholt, Robert R H

2016-07-01

Lead toxicity presents a worldwide health problem, especially due to its adverse effects on cognitive development in children. However, identifying genes that give rise to individual variation in susceptibility to lead toxicity is challenging in human populations. Our goal was to use Drosophila melanogaster to identify evolutionarily conserved candidate genes associated with individual variation in susceptibility to lead exposure. To identify candidate genes associated with variation in susceptibility to lead toxicity, we measured effects of lead exposure on development time, viability and adult activity in the Drosophila melanogaster Genetic Reference Panel (DGRP) and performed genome-wide association analyses to identify candidate genes. We used mutants to assess functional causality of candidate genes and constructed a genetic network associated with variation in sensitivity to lead exposure, on which we could superimpose human orthologs. We found substantial heritabilities for all three traits and identified candidate genes associated with variation in susceptibility to lead exposure for each phenotype. The genetic architectures that determine variation in sensitivity to lead exposure are highly polygenic. Gene ontology and network analyses showed enrichment of genes associated with early development and function of the nervous system. Drosophila melanogaster presents an advantageous model to study the genetic underpinnings of variation in susceptibility to lead toxicity. Evolutionary conservation of cellular pathways that respond to toxic exposure allows predictions regarding orthologous genes and pathways across phyla. Thus, studies in the D. melanogaster model system can identify candidate susceptibility genes to guide subsequent studies in human populations. Zhou S, Morozova TV, Hussain YN, Luoma SE, McCoy L, Yamamoto A, Mackay TF, Anholt RR. 2016. The genetic basis for variation in sensitivity to lead toxicity in Drosophila melanogaster. Environ Health Perspect 124:1062-1070; http://dx.doi.org/10.1289/ehp.1510513.
Identification of downy mildew resistance gene candidates by positional cloning in maize (Zea mays subsp. mays; Poaceae)1

PubMed Central

Kim, Jae Yoon; Moon, Jun-Cheol; Kim, Hyo Chul; Shin, Seungho; Song, Kitae; Kim, Kyung-Hee; Lee, Byung-Moo

2017-01-01

Premise of the study: Positional cloning in combination with phenotyping is a general approach to identify disease-resistance gene candidates in plants; however, it requires several time-consuming steps including population or fine mapping. Therefore, in the present study, we suggest a new combined strategy to improve the identification of disease-resistance gene candidates. Methods and Results: Downy mildew (DM)–resistant maize was selected from five cultivars using a spreader row technique. Positional cloning and bioinformatics tools were used to identify the DM-resistance quantitative trait locus marker (bnlg1702) and 47 protein-coding gene annotations. Eventually, five DM-resistance gene candidates, including bZIP34, Bak1, and Ppr, were identified by quantitative reverse-transcription PCR (RT-PCR) without fine mapping of the bnlg1702 locus. Conclusions: The combined protocol with the spreader row technique, quantitative trait locus positional cloning, and quantitative RT-PCR was effective for identifying DM-resistance candidate genes. This cloning approach may be applied to other whole-genome-sequenced crops or resistance to other diseases. PMID:28224059
Defining the role of the MADS-box gene, Zea agamous like1, in maize domestication

USDA-ARS?s Scientific Manuscript database

Genomic scans for genes that show the signature of past selection have been widely applied to a number of species and have identified a large number of selection candidate genes. In cultivated maize (Zea mays ssp. mays) selection scans have identified several hundred candidate domestication genes...
Dissecting the organ specificity of insecticide resistance candidate genes in Anopheles gambiae: known and novel candidate genes.

PubMed

Ingham, Victoria A; Jones, Christopher M; Pignatelli, Patricia; Balabanidou, Vasileia; Vontas, John; Wagstaff, Simon C; Moore, Jonathan D; Ranson, Hilary

2014-11-25

The elevated expression of enzymes with insecticide metabolism activity can lead to high levels of insecticide resistance in the malaria vector, Anopheles gambiae. In this study, adult female mosquitoes from an insecticide susceptible and resistant strain were dissected into four different body parts. RNA from each of these samples was used in microarray analysis to determine the enrichment patterns of the key detoxification gene families within the mosquito and to identify additional candidate insecticide resistance genes that may have been overlooked in previous experiments on whole organisms. A general enrichment in the transcription of genes from the four major detoxification gene families (carboxylesterases, glutathione transferases, UDP glucornyltransferases and cytochrome P450s) was observed in the midgut and malpighian tubules. Yet the subset of P450 genes that have previously been implicated in insecticide resistance in An gambiae, show a surprisingly varied profile of tissue enrichment, confirmed by qPCR and, for three candidates, by immunostaining. A stringent selection process was used to define a list of 105 genes that are significantly (p ≤0.001) over expressed in body parts from the resistant versus susceptible strain. Over half of these, including all the cytochrome P450s on this list, were identified in previous whole organism comparisons between the strains, but several new candidates were detected, notably from comparisons of the transcriptomes from dissected abdomen integuments. The use of RNA extracted from the whole organism to identify candidate insecticide resistance genes has a risk of missing candidates if key genes responsible for the phenotype have restricted expression within the body and/or are over expression only in certain tissues. However, as transcription of genes implicated in metabolic resistance to insecticides is not enriched in any one single organ, comparison of the transcriptome of individual dissected body parts cannot be recommended as a preferred means to identify new candidate insecticide resistant genes. Instead the rich data set on in vivo sites of transcription should be consulted when designing follow up qPCR validation steps, or for screening known candidates in field populations.
EnRICH: Extraction and Ranking using Integration and Criteria Heuristics.

PubMed

Zhang, Xia; Greenlee, M Heather West; Serb, Jeanne M

2013-01-15

High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments. To accomplish this, researchers need to apply selection requirements based on their knowledge, which necessitates qualitative integration of heterogeneous data sources and filtration using multiple criteria. A similar approach can also be applied to putative candidate gene relationships. While automation can assist in this routine and imperative procedure, flexibility of data sources and criteria must not be sacrificed. A tool that can optimize the trade-off between automation and flexibility to simultaneously filter and qualitatively integrate data is needed to prioritize candidate genes and generate composite networks from heterogeneous data sources. We developed the java application, EnRICH (Extraction and Ranking using Integration and Criteria Heuristics), in order to alleviate this need. Here we present a case study in which we used EnRICH to integrate and filter multiple candidate gene lists in order to identify potential retinal disease genes. As a result of this procedure, a candidate pool of several hundred genes was narrowed down to five candidate genes, of which four are confirmed retinal disease genes and one is associated with a retinal disease state. We developed a platform-independent tool that is able to qualitatively integrate multiple heterogeneous datasets and use different selection criteria to filter each of them, provided the datasets are tables that have distinct identifiers (required) and attributes (optional). With the flexibility to specify data sources and filtering criteria, EnRICH automatically prioritizes candidate genes or gene relationships for biologists based on their specific requirements. Here, we also demonstrate that this tool can be effectively and easily used to apply highly specific user-defined criteria and can efficiently identify high quality candidate genes from relatively sparse datasets.
Parkinson's disease candidate gene prioritization based on expression profile of midbrain dopaminergic neurons

PubMed Central

2010-01-01

Background Parkinson's disease is the second most common neurodegenerative disorder. The pathological hallmark of the disease is degeneration of midbrain dopaminergic neurons. Genetic association studies have linked 13 human chromosomal loci to Parkinson's disease. Identification of gene(s), as part of the etiology of Parkinson's disease, within the large number of genes residing in these loci can be achieved through several approaches, including screening methods, and considering appropriate criteria. Since several of the indentified Parkinson's disease genes are expressed in substantia nigra pars compact of the midbrain, expression within the neurons of this area could be a suitable criterion to limit the number of candidates and identify PD genes. Methods In this work we have used the combination of findings from six rodent transcriptome analysis studies on the gene expression profile of midbrain dopaminergic neurons and the PARK loci in OMIM (Online Mendelian Inheritance in Man) database, to identify new candidate genes for Parkinson's disease. Results Merging the two datasets, we identified 20 genes within PARK loci, 7 of which are located in an orphan Parkinson's disease locus and one, which had been identified as a disease gene. In addition to identifying a set of candidates for further genetic association studies, these results show that the criteria of expression in midbrain dopaminergic neurons may be used to narrow down the number of genes in PARK loci for such studies. PMID:20716345
The Genetic Basis for Variation in Sensitivity to Lead Toxicity in Drosophila melanogaster

PubMed Central

Zhou, Shanshan; Morozova, Tatiana V.; Hussain, Yasmeen N.; Luoma, Sarah E.; McCoy, Lenovia; Yamamoto, Akihiko; Mackay, Trudy F.C.; Anholt, Robert R.H.

2016-01-01

Background: Lead toxicity presents a worldwide health problem, especially due to its adverse effects on cognitive development in children. However, identifying genes that give rise to individual variation in susceptibility to lead toxicity is challenging in human populations. Objectives: Our goal was to use Drosophila melanogaster to identify evolutionarily conserved candidate genes associated with individual variation in susceptibility to lead exposure. Methods: To identify candidate genes associated with variation in susceptibility to lead toxicity, we measured effects of lead exposure on development time, viability and adult activity in the Drosophila melanogaster Genetic Reference Panel (DGRP) and performed genome-wide association analyses to identify candidate genes. We used mutants to assess functional causality of candidate genes and constructed a genetic network associated with variation in sensitivity to lead exposure, on which we could superimpose human orthologs. Results: We found substantial heritabilities for all three traits and identified candidate genes associated with variation in susceptibility to lead exposure for each phenotype. The genetic architectures that determine variation in sensitivity to lead exposure are highly polygenic. Gene ontology and network analyses showed enrichment of genes associated with early development and function of the nervous system. Conclusions: Drosophila melanogaster presents an advantageous model to study the genetic underpinnings of variation in susceptibility to lead toxicity. Evolutionary conservation of cellular pathways that respond to toxic exposure allows predictions regarding orthologous genes and pathways across phyla. Thus, studies in the D. melanogaster model system can identify candidate susceptibility genes to guide subsequent studies in human populations. Citation: Zhou S, Morozova TV, Hussain YN, Luoma SE, McCoy L, Yamamoto A, Mackay TF, Anholt RR. 2016. The genetic basis for variation in sensitivity to lead toxicity in Drosophila melanogaster. Environ Health Perspect 124:1062–1070; http://dx.doi.org/10.1289/ehp.1510513 PMID:26859824
Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.

PubMed

Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming

2017-01-01

Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici ( Pst ), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapid evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in seretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76 . We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst . The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst , which will enable us to understand molecular mechanisms underlying Pst -wheat interactions, to determine the effectiveness of resistance genes and further to develop durable resistance to stripe rust.

Mutational Landscape of Candidate Genes in Familial Prostate Cancer

PubMed Central

Johnson, Anna M.; Zuhlke, Kimberly A.; Plotts, Chris; McDonnell, Shannon K.; Middha, Sumit; Riska, Shaun M.; Thibodeau, Stephen N.; Douglas, Julie A.; Cooney, Kathleen A.

2014-01-01

Background Family history is a major risk factor for prostate cancer (PCa), suggesting a genetic component to the disease. However, traditional linkage and association studies have failed to fully elucidate the underlying genetic basis of familial PCa. Methods Here we use a candidate gene approach to identify potential PCa susceptibility variants in whole exome sequencing data from familial PCa cases. Six hundred ninety-seven candidate genes were identified based on function, location near a known chromosome 17 linkage signal, and/or previous association with prostate or other cancers. Single nucleotide variants (SNVs) in these candidate genes were identified in whole exome sequence data from 33 PCa cases from 11 multiplex PCa families (3 cases/family). Results Overall, 4856 candidate gene SNVs were identified, including 1052 missense and 10 nonsense variants. Twenty missense variants were shared by all 3 family members in each family in which they were observed. Additionally, 15 missense variants were shared by 2 of 3 family members and predicted to be deleterious by 5 different algorithms. Four missense variants, BLM Gln123Arg, PARP2 Arg283Gln, LRCC46 Ala295Thr and KIF2B Pro91Leu, and 1 nonsense variant, CYP3A43 Arg441Ter, showed complete co-segregation with PCa status. Twelve additional variants displayed partial co-segregation with PCa. Conclusions Forty-three nonsense and shared, missense variants were identified in our candidate genes. Further research is needed to determine the contribution of these variants to PCa susceptibility. PMID:25111073
A large-scale RNA interference screen identifies genes that regulate autophagy at different stages.

PubMed

Guo, Sujuan; Pridham, Kevin J; Virbasius, Ching-Man; He, Bin; Zhang, Liqing; Varmark, Hanne; Green, Michael R; Sheng, Zhi

2018-02-12

Dysregulated autophagy is central to the pathogenesis and therapeutic development of cancer. However, how autophagy is regulated in cancer is not well understood and genes that modulate cancer autophagy are not fully defined. To gain more insights into autophagy regulation in cancer, we performed a large-scale RNA interference screen in K562 human chronic myeloid leukemia cells using monodansylcadaverine staining, an autophagy-detecting approach equivalent to immunoblotting of the autophagy marker LC3B or fluorescence microscopy of GFP-LC3B. By coupling monodansylcadaverine staining with fluorescence-activated cell sorting, we successfully isolated autophagic K562 cells where we identified 336 short hairpin RNAs. After candidate validation using Cyto-ID fluorescence spectrophotometry, LC3B immunoblotting, and quantitative RT-PCR, 82 genes were identified as autophagy-regulating genes. 20 genes have been reported previously and the remaining 62 candidates are novel autophagy mediators. Bioinformatic analyses revealed that most candidate genes were involved in molecular pathways regulating autophagy, rather than directly participating in the autophagy process. Further autophagy flux assays revealed that 57 autophagy-regulating genes suppressed autophagy initiation, whereas 21 candidates promoted autophagy maturation. Our RNA interference screen identifies identified genes that regulate autophagy at different stages, which helps decode autophagy regulation in cancer and offers novel avenues to develop autophagy-related therapies for cancer.
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus

NASA Astrophysics Data System (ADS)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.
An Integrative Genetics Approach to Identify Candidate Genes Regulating BMD: Combining Linkage, Gene Expression, and Association

PubMed Central

Farber, Charles R; van Nas, Atila; Ghazalpour, Anatole; Aten, Jason E; Doss, Sudheer; Sos, Brandon; Schadt, Eric E; Ingram-Drake, Leslie; Davis, Richard C; Horvath, Steve; Smith, Desmond J; Drake, Thomas A; Lusis, Aldons J

2009-01-01

Numerous quantitative trait loci (QTLs) affecting bone traits have been identified in the mouse; however, few of the underlying genes have been discovered. To improve the process of transitioning from QTL to gene, we describe an integrative genetics approach, which combines linkage analysis, expression QTL (eQTL) mapping, causality modeling, and genetic association in outbred mice. In C57BL/6J × C3H/HeJ (BXH) F2 mice, nine QTLs regulating femoral BMD were identified. To select candidate genes from within each QTL region, microarray gene expression profiles from individual F2 mice were used to identify 148 genes whose expression was correlated with BMD and regulated by local eQTLs. Many of the genes that were the most highly correlated with BMD have been previously shown to modulate bone mass or skeletal development. Candidates were further prioritized by determining whether their expression was predicted to underlie variation in BMD. Using network edge orienting (NEO), a causality modeling algorithm, 18 of the 148 candidates were predicted to be causally related to differences in BMD. To fine-map QTLs, markers in outbred MF1 mice were tested for association with BMD. Three chromosome 11 SNPs were identified that were associated with BMD within the Bmd11 QTL. Finally, our approach provides strong support for Wnt9a, Rasd1, or both underlying Bmd11. Integration of multiple genetic and genomic data sets can substantially improve the efficiency of QTL fine-mapping and candidate gene identification. PMID:18767929
Breast Tumors with Elevated Expression of 1q Candidate Genes Confer Poor Clinical Outcome and Sensitivity to Ras/PI3K Inhibition

PubMed Central

Viveka Thangaraj, Soundara; Periasamy, Jayaprakash; Bhaskar Rao, Divya; Barnabas, Georgina D.; Raghavan, Swetha; Ganesan, Kumaresan

2013-01-01

Genomic aberrations are common in cancers and the long arm of chromosome 1 is known for its frequent amplifications in breast cancer. However, the key candidate genes of 1q, and their contribution in breast cancer pathogenesis remain unexplored. We have analyzed the gene expression profiles of 1635 breast tumor samples using meta-analysis based approach and identified clinically significant candidates from chromosome 1q. Seven candidate genes including exonuclease 1 (EXO1) are consistently over expressed in breast tumors, specifically in high grade and aggressive breast tumors with poor clinical outcome. We derived a EXO1 co-expression module from the mRNA profiles of breast tumors which comprises 1q candidate genes and their co-expressed genes. By integrative functional genomics investigation, we identified the involvement of EGFR, RAS, PI3K / AKT, MYC, E2F signaling in the regulation of these selected 1q genes in breast tumors and breast cancer cell lines. Expression of EXO1 module was found as indicative of elevated cell proliferation, genomic instability, activated RAS/AKT/MYC/E2F1 signaling pathways and loss of p53 activity in breast tumors. mRNA–drug connectivity analysis indicates inhibition of RAS/PI3K as a possible targeted therapeutic approach for the patients with activated EXO1 module in breast tumors. Thus, we identified seven 1q candidate genes strongly associated with the poor survival of breast cancer patients and identified the possibility of targeting them with EGFR/RAS/PI3K inhibitors. PMID:24147022
Defining a new candidate gene for amelogenesis imperfecta: from molecular genetics to biochemistry.

PubMed

Urzúa, Blanca; Ortega-Pinto, Ana; Morales-Bozo, Irene; Rojas-Alcayaga, Gonzalo; Cifuentes, Víctor

2011-02-01

Amelogenesis imperfecta is a group of genetic conditions that affect the structure and clinical appearance of tooth enamel. The types (hypoplastic, hypocalcified, and hypomature) are correlated with defects in different stages of the process of enamel synthesis. Autosomal dominant, recessive, and X-linked types have been previously described. These disorders are considered clinically and genetically heterogeneous in etiology, involving a variety of genes, such as AMELX, ENAM, DLX3, FAM83H, MMP-20, KLK4, and WDR72. The mutations identified within these causal genes explain less than half of all cases of amelogenesis imperfecta. Most of the candidate and causal genes currently identified encode proteins involved in enamel synthesis. We think it is necessary to refocus the search for candidate genes using biochemical processes. This review provides theoretical evidence that the human SLC4A4 gene (sodium bicarbonate cotransporter) may be a new candidate gene.
A public platform for the verification of the phenotypic effect of candidate genes for resistance to aflatoxin accumulation and Aspergillus flavus infection in maize.

PubMed

Warburton, Marilyn L; Williams, William Paul; Hawkins, Leigh; Bridges, Susan; Gresham, Cathy; Harper, Jonathan; Ozkan, Seval; Mylroie, J Erik; Shan, Xueyan

2011-07-01

A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of selected maize gene sequences with resistance under field conditions. Resources include a database of genetic and protein sequences associated with the reduction in aflatoxin contamination from previous studies; eight diverse inbred maize lines for polymorphism identification within any maize gene sequence; four Quantitative Trait Loci (QTL) mapping populations and one association mapping panel, all phenotyped for aflatoxin accumulation resistance and associated phenotypes; and capacity for Insertion/Deletion (InDel) and SNP genotyping in the population(s) for mapping. To date, ten genes have been identified as possible candidate genes and put through the candidate gene testing pipeline, and results are presented here to demonstrate the utility of the pipeline.
Walking the interactome for candidate prioritization in exome sequencing studies of Mendelian diseases

DOE PAGES

Smedley, Damian; Kohler, Sebastian; Czeschik, Johanna Christina; ...

2014-07-30

Here, whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging. As a result, we analyze protein–protein association (PPA) networks to identify candidate genes in the vicinity of genes previously implicated in a disease. The analysis, using a random-walk with restart (RWR) method, is adapted to the setting of WES by developing a composite variant-gene relevance score based on the rarity, location and predicted pathogenicity of variants and the RWR evaluation of genes harboring themore » variants. Benchmarking using known disease variants from 88 disease-gene families reveals that the correct gene is ranked among the top 10 candidates in ≥50% of cases, a figure which we confirmed using a prospective study of disease genes identified in 2012 and PPA data produced before that date. In conclusion, we implement our method in a freely available Web server, ExomeWalker, that displays a ranked list of candidates together with information on PPAs, frequency and predicted pathogenicity of the variants to allow quick and effective searches for candidates that are likely to reward closer investigation.« less
Integration of QTL and bioinformatic tools to identify candidate genes for triglycerides in mice[S

PubMed Central

Leduc, Magalie S.; Hageman, Rachael S.; Verdugo, Ricardo A.; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A.; Paigen, Beverly

2011-01-01

To identify genetic loci influencing lipid levels, we performed quantitative trait loci (QTL) analysis between inbred mouse strains MRL/MpJ and SM/J, measuring triglyceride levels at 8 weeks of age in F2 mice fed a chow diet. We identified one significant QTL on chromosome (Chr) 15 and three suggestive QTL on Chrs 2, 7, and 17. We also carried out microarray analysis on the livers of parental strains of 282 F2 mice and used these data to find cis-regulated expression QTL. We then narrowed the list of candidate genes under significant QTL using a “toolbox” of bioinformatic resources, including haplotype analysis; parental strain comparison for gene expression differences and nonsynonymous coding single nucleotide polymorphisms (SNP); cis-regulated eQTL in livers of F2 mice; correlation between gene expression and phenotype; and conditioning of expression on the phenotype. We suggest Slc25a7 as a candidate gene for the Chr 7 QTL and, based on expression differences, five genes (Polr3 h, Cyp2d22, Cyp2d26, Tspo, and Ttll12) as candidate genes for Chr 15 QTL. This study shows how bioinformatics can be used effectively to reduce candidate gene lists for QTL related to complex traits. PMID:21622629
Walking the interactome for candidate prioritization in exome sequencing studies of Mendelian diseases

DOE Office of Scientific and Technical Information (OSTI.GOV)

Smedley, Damian; Kohler, Sebastian; Czeschik, Johanna Christina

Here, whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging. As a result, we analyze protein–protein association (PPA) networks to identify candidate genes in the vicinity of genes previously implicated in a disease. The analysis, using a random-walk with restart (RWR) method, is adapted to the setting of WES by developing a composite variant-gene relevance score based on the rarity, location and predicted pathogenicity of variants and the RWR evaluation of genes harboring themore » variants. Benchmarking using known disease variants from 88 disease-gene families reveals that the correct gene is ranked among the top 10 candidates in ≥50% of cases, a figure which we confirmed using a prospective study of disease genes identified in 2012 and PPA data produced before that date. In conclusion, we implement our method in a freely available Web server, ExomeWalker, that displays a ranked list of candidates together with information on PPAs, frequency and predicted pathogenicity of the variants to allow quick and effective searches for candidates that are likely to reward closer investigation.« less
Identification of Inherited Retinal Disease-Associated Genetic Variants in 11 Candidate Genes.

PubMed

Astuti, Galuh D N; van den Born, L Ingeborgh; Khan, M Imran; Hamel, Christian P; Bocquet, Béatrice; Manes, Gaël; Quinodoz, Mathieu; Ali, Manir; Toomes, Carmel; McKibbin, Martin; El-Asrag, Mohammed E; Haer-Wigman, Lonneke; Inglehearn, Chris F; Black, Graeme C M; Hoyng, Carel B; Cremers, Frans P M; Roosing, Susanne

2018-01-10

Inherited retinal diseases (IRDs) display an enormous genetic heterogeneity. Whole exome sequencing (WES) recently identified genes that were mutated in a small proportion of IRD cases. Consequently, finding a second case or family carrying pathogenic variants in the same candidate gene often is challenging. In this study, we searched for novel candidate IRD gene-associated variants in isolated IRD families, assessed their causality, and searched for novel genotype-phenotype correlations. Whole exome sequencing was performed in 11 probands affected with IRDs. Homozygosity mapping data was available for five cases. Variants with minor allele frequencies ≤ 0.5% in public databases were selected as candidate disease-causing variants. These variants were ranked based on their: (a) presence in a gene that was previously implicated in IRD; (b) minor allele frequency in the Exome Aggregation Consortium database (ExAC); (c) in silico pathogenicity assessment using the combined annotation dependent depletion (CADD) score; and (d) interaction of the corresponding protein with known IRD-associated proteins. Twelve unique variants were found in 11 different genes in 11 IRD probands. Novel autosomal recessive and dominant inheritance patterns were found for variants in Small Nuclear Ribonucleoprotein U5 Subunit 200 ( SNRNP200 ) and Zinc Finger Protein 513 ( ZNF513 ), respectively. Using our pathogenicity assessment, a variant in DEAH-Box Helicase 32 ( DHX32 ) was the top ranked novel candidate gene to be associated with IRDs, followed by eight medium and lower ranked candidate genes. The identification of candidate disease-associated sequence variants in 11 single families underscores the notion that the previously identified IRD-associated genes collectively carry > 90% of the defects implicated in IRDs. To identify multiple patients or families with variants in the same gene and thereby provide extra proof for pathogenicity, worldwide data sharing is needed.
Search for sarcoidosis candidate genes by integration of data from genomic, transcriptomic and proteomic studies.

PubMed

Maver, Ales; Medica, Igor; Peterlin, Borut

2009-12-01

The search for gene candidates in multifactorial diseases such as sarcoidosis can be based on the integration of linkage association data, gene expression data, and protein profile data from genomic, transcriptomic and proteomic studies, respectively. In this study we performed a literature-based search for studies reporting such data, followed by integration of collected information. Different databases were examined--Medline, HugGE Navigator, ArrayExpress and Gene Expression Omnibus (GEO). Candidate genes were defined as genes which were reported in at least 2 different types of omics studies. Genes previously investigated in sarcoidosis were excluded from further analyses. We identified 177 genes associated with sarcoidosis as potential new candidate genes. Subsequently, 9 gene candidates identified to overlap in 2 different types of studies (genomic, transcriptomic and/or proteomic) were consistently reported in at least 3 studies: SERPINB1, FABP4, S100A8, HBEGF, IL7R, LRIG1, PTPN23, DPM2 and NUP214. These genes are involved in regulation of immune response, cellular proliferation, apoptosis, inhibition of protease activity, lipid metabolism. Exact biological functions of HBEGF, LRIG1, PTPN23, DPM2 and NUP214 remain to be completely elucidated. We propose 9 candidate genes: SERPINB1, FABP4, S100A8, HBEGF, IL7R, LRIG1, PTPN23, DPM2 and NUP214, as genes with high potential for association with sarcoidosis.
Candidate gene prioritization by network analysis of differential expression using machine learning approaches

PubMed Central

2010-01-01

Background Discovering novel disease genes is still challenging for diseases for which no prior knowledge - such as known disease genes or disease-related pathways - is available. Performing genetic studies frequently results in large lists of candidate genes of which only few can be followed up for further investigation. We have recently developed a computational method for constitutional genetic disorders that identifies the most promising candidate genes by replacing prior knowledge by experimental data of differential gene expression between affected and healthy individuals. To improve the performance of our prioritization strategy, we have extended our previous work by applying different machine learning approaches that identify promising candidate genes by determining whether a gene is surrounded by highly differentially expressed genes in a functional association or protein-protein interaction network. Results We have proposed three strategies scoring disease candidate genes relying on network-based machine learning approaches, such as kernel ridge regression, heat kernel, and Arnoldi kernel approximation. For comparison purposes, a local measure based on the expression of the direct neighbors is also computed. We have benchmarked these strategies on 40 publicly available knockout experiments in mice, and performance was assessed against results obtained using a standard procedure in genetics that ranks candidate genes based solely on their differential expression levels (Simple Expression Ranking). Our results showed that our four strategies could outperform this standard procedure and that the best results were obtained using the Heat Kernel Diffusion Ranking leading to an average ranking position of 8 out of 100 genes, an AUC value of 92.3% and an error reduction of 52.8% relative to the standard procedure approach which ranked the knockout gene on average at position 17 with an AUC value of 83.7%. Conclusion In this study we could identify promising candidate genes using network based machine learning approaches even if no knowledge is available about the disease or phenotype. PMID:20840752
A Stratified Transcriptomics Analysis of Polygenic Fat and Lean Mouse Adipose Tissues Identifies Novel Candidate Obesity Genes

PubMed Central

Morton, Nicholas M.; Nelson, Yvonne B.; Michailidou, Zoi; Di Rollo, Emma M.; Ramage, Lynne; Hadoke, Patrick W. F.; Seckl, Jonathan R.; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J.; Dunbar, Donald R.

2011-01-01

Background Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. Results To enrich for adipose tissue obesity genes a ‘snap-shot’ pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. Conclusions A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity. PMID:21915269
A stratified transcriptomics analysis of polygenic fat and lean mouse adipose tissues identifies novel candidate obesity genes.

PubMed

Morton, Nicholas M; Nelson, Yvonne B; Michailidou, Zoi; Di Rollo, Emma M; Ramage, Lynne; Hadoke, Patrick W F; Seckl, Jonathan R; Bunger, Lutz; Horvat, Simon; Kenyon, Christopher J; Dunbar, Donald R

2011-01-01

Obesity and metabolic syndrome results from a complex interaction between genetic and environmental factors. In addition to brain-regulated processes, recent genome wide association studies have indicated that genes highly expressed in adipose tissue affect the distribution and function of fat and thus contribute to obesity. Using a stratified transcriptome gene enrichment approach we attempted to identify adipose tissue-specific obesity genes in the unique polygenic Fat (F) mouse strain generated by selective breeding over 60 generations for divergent adiposity from a comparator Lean (L) strain. To enrich for adipose tissue obesity genes a 'snap-shot' pooled-sample transcriptome comparison of key fat depots and non adipose tissues (muscle, liver, kidney) was performed. Known obesity quantitative trait loci (QTL) information for the model allowed us to further filter genes for increased likelihood of being causal or secondary for obesity. This successfully identified several genes previously linked to obesity (C1qr1, and Np3r) as positional QTL candidate genes elevated specifically in F line adipose tissue. A number of novel obesity candidate genes were also identified (Thbs1, Ppp1r3d, Tmepai, Trp53inp2, Ttc7b, Tuba1a, Fgf13, Fmr) that have inferred roles in fat cell function. Quantitative microarray analysis was then applied to the most phenotypically divergent adipose depot after exaggerating F and L strain differences with chronic high fat feeding which revealed a distinct gene expression profile of line, fat depot and diet-responsive inflammatory, angiogenic and metabolic pathways. Selected candidate genes Npr3 and Thbs1, as well as Gys2, a non-QTL gene that otherwise passed our enrichment criteria were characterised, revealing novel functional effects consistent with a contribution to obesity. A focussed candidate gene enrichment strategy in the unique F and L model has identified novel adipose tissue-enriched genes contributing to obesity.
The Terpene Synthase Gene Family of Carrot (Daucus carota L.): Identification of QTLs and Candidate Genes Associated with Terpenoid Volatile Compounds

PubMed Central

Keilwagen, Jens; Lehnert, Heike; Berner, Thomas; Budahn, Holger; Nothnagel, Thomas; Ulrich, Detlef; Dunemann, Frank

2017-01-01

Terpenes are an important group of secondary metabolites in carrots influencing taste and flavor, and some of them might also play a role as bioactive substances with an impact on human physiology and health. Understanding the genetic and molecular basis of terpene synthases (TPS) involved in the biosynthesis of volatile terpenoids will provide insights for improving breeding strategies aimed at quality traits and for developing specific carrot chemotypes possibly useful for pharmaceutical applications. Hence, a combination of terpene metabolite profiling, genotyping-by-sequencing (GBS), and genome-wide association study (GWAS) was used in this work to get insights into the genetic control of terpene biosynthesis in carrots and to identify several TPS candidate genes that might be involved in the production of specific monoterpenes. In a panel of 85 carrot cultivars and accessions, metabolite profiling was used to identify 31 terpenoid volatile organic compounds (VOCs) in carrot leaves and roots, and a GBS approach was used to provide dense genome-wide marker coverage (>168,000 SNPs). Based on this data, a total of 30 quantitative trait loci (QTLs) was identified for 15 terpenoid volatiles. Most QTLs were detected for the monoterpene compounds ocimene, sabinene, β-pinene, borneol and bornyl acetate. We identified four genomic regions on three different carrot chromosomes by GWAS which are both associated with high significance (LOD ≥ 5.91) to distinct monoterpenes and to TPS candidate genes, which have been identified by homology-based gene prediction utilizing RNA-seq data. In total, 65 TPS candidate gene models in carrot were identified and assigned to known plant TPS subfamilies with the exception of TPS-d and TPS-h. TPS-b was identified as largest subfamily with 32 TPS candidate genes. PMID:29170675
A transposon-based genetic screen in mice identifies genes altered in colorectal cancer.

PubMed

Starr, Timothy K; Allaei, Raha; Silverstein, Kevin A T; Staggs, Rodney A; Sarver, Aaron L; Bergemann, Tracy L; Gupta, Mihir; O'Sullivan, M Gerard; Matise, Ilze; Dupuy, Adam J; Collier, Lara S; Powers, Scott; Oberg, Ann L; Asmann, Yan W; Thibodeau, Stephen N; Tessarollo, Lino; Copeland, Neal G; Jenkins, Nancy A; Cormier, Robert T; Largaespada, David A

2009-03-27

Human colorectal cancers (CRCs) display a large number of genetic and epigenetic alterations, some of which are causally involved in tumorigenesis (drivers) and others that have little functional impact (passengers). To help distinguish between these two classes of alterations, we used a transposon-based genetic screen in mice to identify candidate genes for CRC. Mice harboring mutagenic Sleeping Beauty (SB) transposons were crossed with mice expressing SB transposase in gastrointestinal tract epithelium. Most of the offspring developed intestinal lesions, including intraepithelial neoplasia, adenomas, and adenocarcinomas. Analysis of over 16,000 transposon insertions identified 77 candidate CRC genes, 60 of which are mutated and/or dysregulated in human CRC and thus are most likely to drive tumorigenesis. These genes include APC, PTEN, and SMAD4. The screen also identified 17 candidate genes that had not previously been implicated in CRC, including POLI, PTPRK, and RSPO2.
Integrative strategies to identify candidate genes in rodent models of human alcoholism.

PubMed

Treadwell, Julie A

2006-01-01

The search for genes underlying alcohol-related behaviours in rodent models of human alcoholism has been ongoing for many years with only limited success. Recently, new strategies that integrate several of the traditional approaches have provided new insights into the molecular mechanisms underlying ethanol's actions in the brain. We have used alcohol-preferring C57BL/6J (B6) and alcohol-avoiding DBA/2J (D2) genetic strains of mice in an integrative strategy combining high-throughput gene expression screening, genetic segregation analysis, and mapping to previously published quantitative trait loci to uncover candidate genes for the ethanol-preference phenotype. In our study, 2 genes, retinaldehyde binding protein 1 (Rlbp1) and syntaxin 12 (Stx12), were found to be strong candidates for ethanol preference. Such experimental approaches have the power and the potential to greatly speed up the laborious process of identifying candidate genes for the animal models of human alcoholism.
Exome sequencing of a large family identifies potential candidate genes contributing risk to bipolar disorder.

PubMed

Zhang, Tianxiao; Hou, Liping; Chen, David T; McMahon, Francis J; Wang, Jen-Chyong; Rice, John P

2018-03-01

Bipolar disorder is a mental illness with lifetime prevalence of about 1%. Previous genetic studies have identified multiple chromosomal linkage regions and candidate genes that might be associated with bipolar disorder. The present study aimed to identify potential susceptibility variants for bipolar disorder using 6 related case samples from a four-generation family. A combination of exome sequencing and linkage analysis was performed to identify potential susceptibility variants for bipolar disorder. Our study identified a list of five potential candidate genes for bipolar disorder. Among these five genes, GRID1(Glutamate Receptor Delta-1 Subunit), which was previously reported to be associated with several psychiatric disorders and brain related traits, is particularly interesting. Variants with functional significance in this gene were identified from two cousins in our bipolar disorder pedigree. Our findings suggest a potential role for these genes and the related rare variants in the onset and development of bipolar disorder in this one family. Additional research is needed to replicate these findings and evaluate their patho-biological significance. Copyright © 2017 Elsevier B.V. All rights reserved.
Gene Expression Profiling of Gastric Cancer

PubMed Central

Marimuthu, Arivusudar; Jacob, Harrys K.C.; Jakharia, Aniruddha; Subbannayya, Yashwanth; Keerthikumar, Shivakumar; Kashyap, Manoj Kumar; Goel, Renu; Balakrishnan, Lavanya; Dwivedi, Sutopa; Pathare, Swapnali; Dikshit, Jyoti Bajpai; Maharudraiah, Jagadeesha; Singh, Sujay; Sameer Kumar, Ghantasala S; Vijayakumar, M.; Veerendra Kumar, Kariyanakatte Veeraiah; Premalatha, Chennagiri Shrinivasamurthy; Tata, Pramila; Hariharan, Ramesh; Roa, Juan Carlos; Prasad, T.S.K; Chaerkady, Raghothama; Kumar, Rekha Vijay; Pandey, Akhilesh

2015-01-01

Gastric cancer is the second leading cause of cancer death worldwide, both in men and women. A genomewide gene expression analysis was carried out to identify differentially expressed genes in gastric adenocarcinoma tissues as compared to adjacent normal tissues. We used Agilent’s whole human genome oligonucleotide microarray platform representing ~41,000 genes to carry out gene expression analysis. Two-color microarray analysis was employed to directly compare the expression of genes between tumor and normal tissues. Through this approach, we identified several previously known candidate genes along with a number of novel candidate genes in gastric cancer. Testican-1 (SPOCK1) was one of the novel molecules that was 10-fold upregulated in tumors. Using tissue microarrays, we validated the expression of testican-1 by immunohistochemical staining. It was overexpressed in 56% (160/282) of the cases tested. Pathway analysis led to the identification of several networks in which SPOCK1 was among the topmost networks of interacting genes. By gene enrichment analysis, we identified several genes involved in cell adhesion and cell proliferation to be significantly upregulated while those corresponding to metabolic pathways were significantly downregulated. The differentially expressed genes identified in this study are candidate biomarkers for gastric adenoacarcinoma. PMID:27030788

Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis

PubMed Central

Grassi, Elena; Damasco, Christian; Silengo, Lorenzo; Oti, Martin; Provero, Paolo; Di Cunto, Ferdinando

2008-01-01

Background Even in the post-genomic era, the identification of candidate genes within loci associated with human genetic diseases is a very demanding task, because the critical region may typically contain hundreds of positional candidates. Since genes implicated in similar phenotypes tend to share very similar expression profiles, high throughput gene expression data may represent a very important resource to identify the best candidates for sequencing. However, so far, gene coexpression has not been used very successfully to prioritize positional candidates. Methodology/Principal Findings We show that it is possible to reliably identify disease-relevant relationships among genes from massive microarray datasets by concentrating only on genes sharing similar expression profiles in both human and mouse. Moreover, we show systematically that the integration of human-mouse conserved coexpression with a phenotype similarity map allows the efficient identification of disease genes in large genomic regions. Finally, using this approach on 850 OMIM loci characterized by an unknown molecular basis, we propose high-probability candidates for 81 genetic diseases. Conclusion Our results demonstrate that conserved coexpression, even at the human-mouse phylogenetic distance, represents a very strong criterion to predict disease-relevant relationships among human genes. PMID:18369433
Transcriptome profiling of two maize inbreds with distinct responses to Gibberella ear rot disease to identify candidate resistance genes.

PubMed

Kebede, Aida Z; Johnston, Anne; Schneiderman, Danielle; Bosnich, Whynn; Harris, Linda J

2018-02-09

Gibberella ear rot (GER) is one of the most economically important fungal diseases of maize in the temperate zone due to moldy grain contaminated with health threatening mycotoxins. To develop resistant genotypes and control the disease, understanding the host-pathogen interaction is essential. RNA-Seq-derived transcriptome profiles of fungal- and mock-inoculated developing kernel tissues of two maize inbred lines were used to identify differentially expressed transcripts and propose candidate genes mapping within GER resistance quantitative trait loci (QTL). A total of 1255 transcripts were significantly (P ≤ 0.05) up regulated due to fungal infection in both susceptible and resistant inbreds. A greater number of transcripts were up regulated in the former (1174) than the latter (497) and increased as the infection progressed from 1 to 2 days after inoculation. Focusing on differentially expressed genes located within QTL regions for GER resistance, we identified 81 genes involved in membrane transport, hormone regulation, cell wall modification, cell detoxification, and biosynthesis of pathogenesis related proteins and phytoalexins as candidate genes contributing to resistance. Applying droplet digital PCR, we validated the expression profiles of a subset of these candidate genes from QTL regions contributed by the resistant inbred on chromosomes 1, 2 and 9. By screening global gene expression profiles for differentially expressed genes mapping within resistance QTL regions, we have identified candidate genes for gibberella ear rot resistance on several maize chromosomes which could potentially lead to a better understanding of Fusarium resistance mechanisms.
Transcriptome analysis of Brassica napus pod using RNA-Seq and identification of lipid-related candidate genes.

PubMed

Xu, Hai-Ming; Kong, Xiang-Dong; Chen, Fei; Huang, Ji-Xiang; Lou, Xiang-Yang; Zhao, Jian-Yi

2015-10-24

Brassica napus is an important oilseed crop. Dissection of the genetic architecture underlying oil-related biological processes will greatly facilitates the genetic improvement of rapeseed. The differential gene expression during pod development offers a snapshot on the genes responsible for oil accumulation in. To identify candidate genes in the linkage peaks reported previously, we used RNA sequencing (RNA-Seq) technology to analyze the pod transcriptomes of German cultivar Sollux and Chinese inbred line Gaoyou. The RNA samples were collected for RNA-Seq at 5-7, 15-17 and 25-27 days after flowering (DAF). Bioinformatics analysis was performed to investigate differentially expressed genes (DEGs). Gene annotation analysis was integrated with QTL mapping and Brassica napus pod transcriptome profiling to detect potential candidate genes in oilseed. Four hundred sixty five and two thousand, one hundred fourteen candidate DEGs were identified, respectively, between two varieties at the same stages and across different periods of each variety. Then, 33 DEGs between Sollux and Gaoyou were identified as the candidate genes affecting seed oil content by combining those DEGs with the quantitative trait locus (QTL) mapping results, of which, one was found to be homologous to Arabidopsis thaliana lipid-related genes. Intervarietal DEGs of lipid pathways in QTL regions represent important candidate genes for oil-related traits. Integrated analysis of transcriptome profiling, QTL mapping and comparative genomics with other relative species leads to efficient identification of most plausible functional genes underlying oil-content related characters, offering valuable resources for bettering breeding program of Brassica napus. This study provided a comprehensive overview on the pod transcriptomes of two varieties with different oil-contents at the three developmental stages.
Global Transcriptome Changes Underlying Colony Growth in the Opportunistic Human Pathogen Aspergillus fumigatus

PubMed Central

Gibbons, John G.; Beauvais, Anne; Beau, Remi; McGary, Kriston L.

2012-01-01

Aspergillus fumigatus is the most common and deadly pulmonary fungal infection worldwide. In the lung, the fungus usually forms a dense colony of filaments embedded in a polymeric extracellular matrix. To identify candidate genes involved in this biofilm (BF) growth, we used RNA-Seq to compare the transcriptomes of BF and liquid plankton (PL) growth. Sequencing and mapping of tens of millions sequence reads against the A. fumigatus transcriptome identified 3,728 differentially regulated genes in the two conditions. Although many of these genes, including the ones coding for transcription factors, stress response, the ribosome, and the translation machinery, likely reflect the different growth demands in the two conditions, our experiment also identified hundreds of candidate genes for the observed differences in morphology and pathobiology between BF and PL. We found an overrepresentation of upregulated genes in transport, secondary metabolism, and cell wall and surface functions. Furthermore, upregulated genes showed significant spatial structure across the A. fumigatus genome; they were more likely to occur in subtelomeric regions and colocalized in 27 genomic neighborhoods, many of which overlapped with known or candidate secondary metabolism gene clusters. We also identified 1,164 genes that were downregulated. This gene set was not spatially structured across the genome and was overrepresented in genes participating in primary metabolic functions, including carbon and amino acid metabolism. These results add valuable insight into the genetics of biofilm formation in A. fumigatus and other filamentous fungi and identify many relevant, in the context of biofilm biology, candidate genes for downstream functional experiments. PMID:21724936
The Influence of Genetics on Cystic Fibrosis Phenotypes

PubMed Central

Knowles, Michael R.; Drumm, Mitchell

2012-01-01

Technological advances in genetics have made feasible and affordable large studies to identify genetic variants that cause or modify a trait. Genetic studies have been carried out to assess variants in candidate genes, as well as polymorphisms throughout the genome, for their associations with heritable clinical outcomes of cystic fibrosis (CF), such as lung disease, meconium ileus, and CF-related diabetes. The candidate gene approach has identified some predicted relationships, while genome-wide surveys have identified several genes that would not have been obvious disease-modifying candidates, such as a methionine sulfoxide transferase gene that influences intestinal obstruction, or a region on chromosome 11 proximate to genes encoding a transcription factor and an apoptosis controller that associates with lung function. These unforeseen associations thus provide novel insight into disease pathophysiology, as well as suggesting new therapeutic strategies for CF. PMID:23209180
Identifying positive selection candidate loci for high-altitude adaptation in Andean populations

PubMed Central

2009-01-01

High-altitude environments (>2,500 m) provide scientists with a natural laboratory to study the physiological and genetic effects of low ambient oxygen tension on human populations. One approach to understanding how life at high altitude has affected human metabolism is to survey genome-wide datasets for signatures of natural selection. In this work, we report on a study to identify selection-nominated candidate genes involved in adaptation to hypoxia in one highland group, Andeans from the South American Altiplano. We analysed dense microarray genotype data using four test statistics that detect departures from neutrality. Using a candidate gene, single nucleotide polymorphism-based approach, we identified genes exhibiting preliminary evidence of recent genetic adaptation in this population. These included genes that are part of the hypoxia-inducible transcription factor (HIF) pathway, a biochemical pathway involved in oxygen homeostasis, as well as three other genomic regions previously not known to be associated with high-altitude phenotypes. In addition to identifying selection-nominated candidate genes, we also tested whether the HIF pathway shows evidence of natural selection. Our results indicate that the genes of this biochemical pathway as a group show no evidence of having evolved in response to hypoxia in Andeans. Results from particular HIF-targeted genes, however, suggest that genes in this pathway could play a role in Andean adaptation to high altitude, even if the pathway as a whole does not show higher relative rates of evolution. These data suggest a genetic role in high-altitude adaptation and provide a basis for genotype/phenotype association studies that are necessary to confirm the role of putative natural selection candidate genes and gene regions in adaptation to altitude. PMID:20038496
Network-based Analysis of Genome Wide Association Data Provides Novel Candidate Genes for Lipid and Lipoprotein Traits*

PubMed Central

Sharma, Amitabh; Gulbahce, Natali; Pevzner, Samuel J.; Menche, Jörg; Ladenvall, Claes; Folkersen, Lasse; Eriksson, Per; Orho-Melander, Marju; Barabási, Albert-László

2013-01-01

Genome wide association studies (GWAS) identify susceptibility loci for complex traits, but do not identify particular genes of interest. Integration of functional and network information may help in overcoming this limitation and identifying new susceptibility loci. Using GWAS and comorbidity data, we present a network-based approach to predict candidate genes for lipid and lipoprotein traits. We apply a prediction pipeline incorporating interactome, co-expression, and comorbidity data to Global Lipids Genetics Consortium (GLGC) GWAS for four traits of interest, identifying phenotypically coherent modules. These modules provide insights regarding gene involvement in complex phenotypes with multiple susceptibility alleles and low effect sizes. To experimentally test our predictions, we selected four candidate genes and genotyped representative SNPs in the Malmö Diet and Cancer Cardiovascular Cohort. We found significant associations with LDL-C and total-cholesterol levels for a synonymous SNP (rs234706) in the cystathionine beta-synthase (CBS) gene (p = 1 × 10−5 and adjusted-p = 0.013, respectively). Further, liver samples taken from 206 patients revealed that patients with the minor allele of rs234706 had significant dysregulation of CBS (p = 0.04). Despite the known biological role of CBS in lipid metabolism, SNPs within the locus have not yet been identified in GWAS of lipoprotein traits. Thus, the GWAS-based Comorbidity Module (GCM) approach identifies candidate genes missed by GWAS studies, serving as a broadly applicable tool for the investigation of other complex disease phenotypes. PMID:23882023
The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach.

PubMed

Hindumathi, V; Kranthi, T; Rao, S B; Manimaran, P

2014-06-01

With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.
Systems biology approach to late-onset Alzheimer's disease genome-wide association study identifies novel candidate genes validated using brain expression data and Caenorhabditis elegans experiments.

PubMed

Mukherjee, Shubhabrata; Russell, Joshua C; Carr, Daniel T; Burgess, Jeremy D; Allen, Mariet; Serie, Daniel J; Boehme, Kevin L; Kauwe, John S K; Naj, Adam C; Fardo, David W; Dickson, Dennis W; Montine, Thomas J; Ertekin-Taner, Nilufer; Kaeberlein, Matt R; Crane, Paul K

2017-10-01

We sought to determine whether a systems biology approach may identify novel late-onset Alzheimer's disease (LOAD) loci. We performed gene-wide association analyses and integrated results with human protein-protein interaction data using network analyses. We performed functional validation on novel genes using a transgenic Caenorhabditis elegans Aβ proteotoxicity model and evaluated novel genes using brain expression data from people with LOAD and other neurodegenerative conditions. We identified 13 novel candidate LOAD genes outside chromosome 19. Of those, RNA interference knockdowns of the C. elegans orthologs of UBC, NDUFS3, EGR1, and ATP5H were associated with Aβ toxicity, and NDUFS3, SLC25A11, ATP5H, and APP were differentially expressed in the temporal cortex. Network analyses identified novel LOAD candidate genes. We demonstrated a functional role for four of these in a C. elegans model and found enrichment of differentially expressed genes in the temporal cortex. Copyright © 2017 the Alzheimer's Association. Published by Elsevier Inc. All rights reserved.
Rapid Communication: MiR-92a as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.

PubMed

Lai, Y C; Fujikawa, T; Ando, T; Kitahara, G; Koiwa, M; Kubota, C; Miura, N

2017-06-01

Our aim was to identify a suitable microRNA housekeeping gene for real-time PCR analysis of bovine mastitis-related microRNA in milk. We identified , , and as housekeeping gene candidates on the basis of previous Solexa sequencing results. Threshold cycle (CT) values for , , and did not differ between milk from control cows and milk from mastitis-affected cows. NormFinder software identified as the most stable single housekeeping gene. We evaluated the suitability of the housekeeping gene candidates by using them to assess expression levels of the inflammation-related gene . Regardless of the housekeeping gene candidates used for normalization, relative expression levels of were significantly higher in mastitis-affected samples than in control samples. However, of all the housekeeping genes and gene combinations investigated, normalization with alone generated the difference in relative expression between mastitis-affected and control samples with the highest significance. These results suggest that is suitable for use as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.
Identification of Candidate Genes Responsible for Stem Pith Production Using Expression Analysis in Solid-Stemmed Wheat.

PubMed

Oiestad, A J; Martin, J M; Cook, J; Varella, A C; Giroux, M J

2017-07-01

The wheat stem sawfly (WSS) is an economically important pest of wheat in the Northern Great Plains. The primary means of WSS control is resistance associated with the single quantitative trait locus (QTL) , which controls most stem solidness variation. The goal of this study was to identify stem solidness candidate genes via RNA-seq. This study made use of 28 single nucleotide polymorphism (SNP) makers derived from expressed sequence tags (ESTs) linked to contained within a 5.13 cM region. Allele specific expression of EST markers was examined in stem tissue for solid and hollow-stemmed pairs of two spring wheat near isogenic lines (NILs) differing for the QTL. Of the 28 ESTs, 13 were located within annotated genes and 10 had detectable stem expression. Annotated genes corresponding to four of the ESTs were differentially expressed between solid and hollow-stemmed NILs and represent possible stem solidness gene candidates. Further examination of the 5.13 cM region containing the 28 EST markers identified 260 annotated genes. Twenty of the 260 linked genes were up-regulated in hollow NIL stems, while only seven genes were up-regulated in solid NIL stems. An -methyltransferase within the region of interest was identified as a candidate based on differential expression between solid and hollow-stemmed NILs and putative function. Further study of these candidate genes may lead to the identification of the gene(s) controlling stem solidness and an increased ability to select for wheat stem solidness and manage WSS. Copyright © 2017 Crop Science Society of America.
Integrating microarray analysis and the soybean genome to understand the soybeans iron deficiency response

PubMed Central

2009-01-01

Background Soybeans grown in the upper Midwestern United States often suffer from iron deficiency chlorosis, which results in yield loss at the end of the season. To better understand the effect of iron availability on soybean yield, we identified genes in two near isogenic lines with changes in expression patterns when plants were grown in iron sufficient and iron deficient conditions. Results Transcriptional profiles of soybean (Glycine max, L. Merr) near isogenic lines Clark (PI548553, iron efficient) and IsoClark (PI547430, iron inefficient) grown under Fe-sufficient and Fe-limited conditions were analyzed and compared using the Affymetrix® GeneChip® Soybean Genome Array. There were 835 candidate genes in the Clark (PI548553) genotype and 200 candidate genes in the IsoClark (PI547430) genotype putatively involved in soybean's iron stress response. Of these candidate genes, fifty-eight genes in the Clark genotype were identified with a genetic location within known iron efficiency QTL and 21 in the IsoClark genotype. The arrays also identified 170 single feature polymorphisms (SFPs) specific to either Clark or IsoClark. A sliding window analysis of the microarray data and the 7X genome assembly coupled with an iterative model of the data showed the candidate genes are clustered in the genome. An analysis of 5' untranslated regions in the promoter of candidate genes identified 11 conserved motifs in 248 differentially expressed genes, all from the Clark genotype, representing 129 clusters identified earlier, confirming the cluster analysis results. Conclusion These analyses have identified the first genes with expression patterns that are affected by iron stress and are located within QTL specific to iron deficiency stress. The genetic location and promoter motif analysis results support the hypothesis that the differentially expressed genes are co-regulated. The combined results of all analyses lead us to postulate iron inefficiency in soybean is a result of a mutation in a transcription factor(s), which controls the expression of genes required in inducing an iron stress response. PMID:19678937
Exome Sequencing and Linkage Analysis Identified Novel Candidate Genes in Recessive Intellectual Disability Associated with Ataxia.

PubMed

Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia

2015-10-01

Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).
Identification of genes from the Treacher Collins candidate region

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dixon, M.; Dixon, J.; Edwards, S.

Treacher Collins syndrome (TCOF1) is an autosomal dominant disorder of craniofacial development. The TCOF1 locus has previously been mapped to chromosome 5q32-33. The candidate gene region has been defined as being between two flanking markers, ribosomal protein S14 (RPS14) and Annexin 6 (ANX6), by analyzing recombination events in affected individuals. It is estimated that the distance between these flanking markers is 500 kb by three separate analysis methods: (1) radiation hybrid mapping; (2) genetic linkage; and (3) YAC contig analysis. A cosmid contig which spans the candidate gene region for TCOF1 has been constructed by screening the Los Alamos Nationalmore » Laboratory flow-sorted chromosome 5 cosmid library. Cosmids were obtained by using a combination of probes generated from YAC end clones, Alu-PCR fragments from YACs, and asymmetric PCR fragments from both T7 and T3 cosmid ends. Exon amplifications, the selection of genomic coding sequences based upon the presence of functional splice acceptor and donor sites, was used to identify potential exon sequences. Sequences found to be conserved between species were then used to screen cDNA libraries in order to identify candidate genes. To date, four different cDNAs have been isolated from this region and are being analyzed as potential candidate genes for TCOF1. These include the genes encoding plasma glutathione peroxidase (GPX3), heparin sulfate sulfotransferase (HSST), a gene with homology to the ETS family of proteins and one which shows no homology to any known genes. Work is also in progress to identify and characterize additional cDNAs from the candidate gene region.« less
Genetic and Proteomic Interrogation of Lower Confidence Candidate Genes Reveals Signaling Networks in beta-Catenin-Active Cancers | Office of Cancer Genomics

Cancer.gov

Genome-scale expression studies and comprehensive loss-of-function genetic screens have focused almost exclusively on the highest confidence candidate genes. Here, we describe a strategy for characterizing the lower confidence candidates identified by such approaches.
Next-generation sequencing for identification of candidate genes for Fusarium wilt and sterility mosaic disease in pigeonpea (Cajanus cajan).

PubMed

Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Kumar, Vinay; Kale, Sandip M; Sinha, Pallavi; Chitikineni, Annapurna; Pazhamala, Lekha T; Garg, Vanika; Sharma, Mamta; Sameer Kumar, Chanda Venkata; Parupalli, Swathi; Vechalapu, Suryanarayana; Patil, Suyash; Muniswamy, Sonnappa; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Dharmaraj, Pallavi Subbanna; Varshney, Rajeev K

2016-05-01

To map resistance genes for Fusarium wilt (FW) and sterility mosaic disease (SMD) in pigeonpea, sequencing-based bulked segregant analysis (Seq-BSA) was used. Resistant (R) and susceptible (S) bulks from the extreme recombinant inbred lines of ICPL 20096 × ICPL 332 were sequenced. Subsequently, SNP index was calculated between R- and S-bulks with the help of draft genome sequence and reference-guided assembly of ICPL 20096 (resistant parent). Seq-BSA has provided seven candidate SNPs for FW and SMD resistance in pigeonpea. In parallel, four additional genotypes were re-sequenced and their combined analysis with R- and S-bulks has provided a total of 8362 nonsynonymous (ns) SNPs. Of 8362 nsSNPs, 60 were found within the 2-Mb flanking regions of seven candidate SNPs identified through Seq-BSA. Haplotype analysis narrowed down to eight nsSNPs in seven genes. These eight nsSNPs were further validated by re-sequencing 11 genotypes that are resistant and susceptible to FW and SMD. This analysis revealed association of four candidate nsSNPs in four genes with FW resistance and four candidate nsSNPs in three genes with SMD resistance. Further, In silico protein analysis and expression profiling identified two most promising candidate genes namely C.cajan_01839 for SMD resistance and C.cajan_03203 for FW resistance. Identified candidate genomic regions/SNPs will be useful for genomics-assisted breeding in pigeonpea. © 2015 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Identification of candidate transmission-blocking antigen genes in Theileria annulata and related vector-borne apicomplexan parasites.

PubMed

Lempereur, Laetitia; Larcombe, Stephen D; Durrani, Zeeshan; Karagenc, Tulin; Bilgic, Huseyin Bilgin; Bakirci, Serkan; Hacilarlioglu, Selin; Kinnaird, Jane; Thompson, Joanne; Weir, William; Shiels, Brian

2017-06-05

Vector-borne apicomplexan parasites are a major cause of mortality and morbidity to humans and livestock globally. The most important disease syndromes caused by these parasites are malaria, babesiosis and theileriosis. Strategies for control often target parasite stages in the mammalian host that cause disease, but this can result in reservoir infections that promote pathogen transmission and generate economic loss. Optimal control strategies should protect against clinical disease, block transmission and be applicable across related genera of parasites. We have used bioinformatics and transcriptomics to screen for transmission-blocking candidate antigens in the tick-borne apicomplexan parasite, Theileria annulata. A number of candidate antigen genes were identified which encoded amino acid domains that are conserved across vector-borne Apicomplexa (Babesia, Plasmodium and Theileria), including the Pfs48/45 6-cys domain and a novel cysteine-rich domain. Expression profiling confirmed that selected candidate genes are expressed by life cycle stages within infected ticks. Additionally, putative B cell epitopes were identified in the T. annulata gene sequences encoding the 6-cys and cysteine rich domains, in a gene encoding a putative papain-family cysteine peptidase, with similarity to the Plasmodium SERA family, and the gene encoding the T. annulata major merozoite/piroplasm surface antigen, Tams1. Candidate genes were identified that encode proteins with similarity to known transmission blocking candidates in related parasites, while one is a novel candidate conserved across vector-borne apicomplexans and has a potential role in the sexual phase of the life cycle. The results indicate that a 'One Health' approach could be utilised to develop a transmission-blocking strategy effective against vector-borne apicomplexan parasites of animals and humans.
Genetic basis of interindividual susceptibility to cancer cachexia: selection of potential candidate gene polymorphisms for association studies.

PubMed

Johns, N; Tan, B H; MacMillan, M; Solheim, T S; Ross, J A; Baracos, V E; Damaraju, S; Fearon, K C H

2014-12-01

Cancer cachexia is a complex and multifactorial disease. Evolving definitions highlight the fact that a diverse range of biological processes contribute to cancer cachexia. Part of the variation in who will and who will not develop cancer cachexia may be genetically determined. As new definitions, classifications and biological targets continue to evolve, there is a need for reappraisal of the literature for future candidate association studies. This review summarizes genes identified or implicated as well as putative candidate genes contributing to cachexia, identified through diverse technology platforms and model systems to further guide association studies. A systematic search covering 1986-2012 was performed for potential candidate genes / genetic polymorphisms relating to cancer cachexia. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Pathway analysis software was used to reveal possible network associations between genes. Functionality of SNPs/genes was explored based on published literature, algorithms for detecting putative deleterious SNPs and interrogating the database for expression of quantitative trait loci (eQTLs). A total of 154 genes associated with cancer cachexia were identified and explored for functional polymorphisms. Of these 154 genes, 119 had a combined total of 281 polymorphisms with functional and/or clinical significance in terms of cachexia associated with them. Of these, 80 polymorphisms (in 51 genes) were replicated in more than one study with 24 polymorphisms found to influence two or more hallmarks of cachexia (i.e., inflammation, loss of fat mass and/or lean mass and reduced survival). Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides a contemporary basis to select genes and/or polymorphisms for further association studies in cancer cachexia, and to develop their potential as susceptibility biomarkers of cachexia.
RNA-Seq analysis and annotation of a draft blueberry genome assembly identifies candidate genes involved in fruit ripening, biosynthesis of bioactive compounds, and stage-specific alternative splicing.

PubMed

Gupta, Vikas; Estrada, April D; Blakley, Ivory; Reid, Rob; Patel, Ketan; Meyer, Mason D; Andersen, Stig Uggerhøj; Brown, Allan F; Lila, Mary Ann; Loraine, Ann E

2015-01-01

Blueberries are a rich source of antioxidants and other beneficial compounds that can protect against disease. Identifying genes involved in synthesis of bioactive compounds could enable the breeding of berry varieties with enhanced health benefits. Toward this end, we annotated a previously sequenced draft blueberry genome assembly using RNA-Seq data from five stages of berry fruit development and ripening. Genome-guided assembly of RNA-Seq read alignments combined with output from ab initio gene finders produced around 60,000 gene models, of which more than half were similar to proteins from other species, typically the grape Vitis vinifera. Comparison of gene models to the PlantCyc database of metabolic pathway enzymes identified candidate genes involved in synthesis of bioactive compounds, including bixin, an apocarotenoid with potential disease-fighting properties, and defense-related cyanogenic glycosides, which are toxic. Cyanogenic glycoside (CG) biosynthetic enzymes were highly expressed in green fruit, and a candidate CG detoxification enzyme was up-regulated during fruit ripening. Candidate genes for ethylene, anthocyanin, and 400 other biosynthetic pathways were also identified. Homology-based annotation using Blast2GO and InterPro assigned Gene Ontology terms to around 15,000 genes. RNA-Seq expression profiling showed that blueberry growth, maturation, and ripening involve dynamic gene expression changes, including coordinated up- and down-regulation of metabolic pathway enzymes and transcriptional regulators. Analysis of RNA-seq alignments identified developmentally regulated alternative splicing, promoter use, and 3' end formation. We report genome sequence, gene models, functional annotations, and RNA-Seq expression data that provide an important new resource enabling high throughput studies in blueberry.
Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions.

PubMed

Singh, Anuradha; Mantri, Shrikant; Sharma, Monica; Chaudhury, Ashok; Tuli, Rakesh; Roy, Joy

2014-01-16

The cultivated bread wheat (Triticum aestivum L.) possesses unique flour quality, which can be processed into many end-use food products such as bread, pasta, chapatti (unleavened flat bread), biscuit, etc. The present wheat varieties require improvement in processing quality to meet the increasing demand of better quality food products. However, processing quality is very complex and controlled by many genes, which have not been completely explored. To identify the candidate genes whose expressions changed due to variation in processing quality and interaction (quality x development), genome-wide transcriptome studies were performed in two sets of diverse Indian wheat varieties differing for chapatti quality. It is also important to understand the temporal and spatial distributions of their expressions for designing tissue and growth specific functional genomics experiments. Gene-specific two-way ANOVA analysis of expression of about 55 K transcripts in two diverse sets of Indian wheat varieties for chapatti quality at three seed developmental stages identified 236 differentially expressed probe sets (10-fold). Out of 236, 110 probe sets were identified for chapatti quality. Many processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, proteases, were identified, and many other candidate genes related to cellular and molecular functions were also identified. The ANOVA analysis revealed that the expression of 56 of 110 probe sets was involved in interaction (quality x development). Majority of the probe sets showed differential expression at early stage of seed development i.e. temporal expression. Meta-analysis revealed that the majority of the genes expressed in one or a few growth stages indicating spatial distribution of their expressions. The differential expressions of a few candidate genes such as pre-alpha/beta-gliadin and gamma gliadin were validated by RT-PCR. Therefore, this study identified several quality related key genes including many other genes, their interactions (quality x development) and temporal and spatial distributions. The candidate genes identified for processing quality and information on temporal and spatial distributions of their expressions would be useful for designing wheat improvement programs for processing quality either by changing their expression or development of single nucleotide polymorphisms (SNPs) markers.

Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions

PubMed Central

2014-01-01

Background The cultivated bread wheat (Triticum aestivum L.) possesses unique flour quality, which can be processed into many end-use food products such as bread, pasta, chapatti (unleavened flat bread), biscuit, etc. The present wheat varieties require improvement in processing quality to meet the increasing demand of better quality food products. However, processing quality is very complex and controlled by many genes, which have not been completely explored. To identify the candidate genes whose expressions changed due to variation in processing quality and interaction (quality x development), genome-wide transcriptome studies were performed in two sets of diverse Indian wheat varieties differing for chapatti quality. It is also important to understand the temporal and spatial distributions of their expressions for designing tissue and growth specific functional genomics experiments. Results Gene-specific two-way ANOVA analysis of expression of about 55 K transcripts in two diverse sets of Indian wheat varieties for chapatti quality at three seed developmental stages identified 236 differentially expressed probe sets (10-fold). Out of 236, 110 probe sets were identified for chapatti quality. Many processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, proteases, were identified, and many other candidate genes related to cellular and molecular functions were also identified. The ANOVA analysis revealed that the expression of 56 of 110 probe sets was involved in interaction (quality x development). Majority of the probe sets showed differential expression at early stage of seed development i.e. temporal expression. Meta-analysis revealed that the majority of the genes expressed in one or a few growth stages indicating spatial distribution of their expressions. The differential expressions of a few candidate genes such as pre-alpha/beta-gliadin and gamma gliadin were validated by RT-PCR. Therefore, this study identified several quality related key genes including many other genes, their interactions (quality x development) and temporal and spatial distributions. Conclusions The candidate genes identified for processing quality and information on temporal and spatial distributions of their expressions would be useful for designing wheat improvement programs for processing quality either by changing their expression or development of single nucleotide polymorphisms (SNPs) markers. PMID:24433256
Identification and fine mapping of a stay-green gene (Brnye1) in pakchoi (Brassica campestris L. ssp. chinensis).

PubMed

Wang, Nan; Liu, Zhiyong; Zhang, Yun; Li, Chengyu; Feng, Hui

2018-03-01

Using bulked segregant analysis combined with next-generation sequencing, we delimited the Brnye1 gene responsible for the stay-green trait of nye in pakchoi. Sequence analysis identified Bra019346 as the candidate gene. "Stay-green" refers to a plant trait whereby leaves remain green during senescence. This trait is useful in the cultivation of pakchoi (Brassica campestris L. ssp. chinensis), which is marketed as a green leaf product. This study aimed to identify the gene responsible for the stay-green trait in pakchoi. We identified a stay-green mutant in pakchoi, which we termed "nye". Genetic analysis revealed that the stay-green trait is controlled by a single recessive gene, Brnye1. Using the BSA-seq method, a 3.0-Mb candidate region was mapped on chromosome A03, which helped us localize Brnye1 to an 81.01-kb interval between SSR markers SSRWN27 and SSRWN30 via linkage analysis in an F 2 population. We identified 12 genes in this region, 11 of which were annotated based on the Brassica rapa annotation database, and one was a functionally unknown gene. An orthologous gene of the Arabidopsis gene AtNYE1, Bra019346, was identified as the potential candidate for Brnye1. Sequence analysis revealed a 40-bp insertion in the second exon of Bra019346 in nye, which generated the TAA stop codon. A candidate gene-specific Indel marker in 1561 F 2 individuals showed perfect cosegregation with Brnye1 in the nye mutant. These results provide a foundation for uncovering the molecular mechanism of the stay-green trait in pakchoi.
Fine mapping of Restorer-of-fertility in pepper (Capsicum annuum L.) identified a candidate gene encoding a pentatricopeptide repeat (PPR)-containing protein.

PubMed

Jo, Yeong Deuk; Ha, Yeaseong; Lee, Joung-Ho; Park, Minkyu; Bergsma, Alex C; Choi, Hong-Il; Goritschnig, Sandra; Kloosterman, Bjorn; van Dijk, Peter J; Choi, Doil; Kang, Byoung-Cheorl

2016-10-01

Using fine mapping techniques, the genomic region co-segregating with Restorer - of - fertility ( Rf ) in pepper was delimited to a region of 821 kb in length. A PPR gene in this region, CaPPR6 , was identified as a strong candidate for Rf based on expression pattern and characteristics of encoding sequence. Cytoplasmic-genic male sterility (CGMS) has been used for the efficient production of hybrid seeds in peppers (Capsicum annuum L.). Although the mitochondrial candidate genes that might be responsible for cytoplasmic male sterility (CMS) have been identified, the nuclear Restorer-of-fertility (Rf) gene has not been isolated. To identify the genomic region co-segregating with Rf in pepper, we performed fine mapping using an Rf-segregating population consisting of 1068 F2 individuals, based on BSA-AFLP and a comparative mapping approach. Through six cycles of chromosome walking, the co-segregating region harboring the Rf locus was delimited to be within 821 kb of sequence. Prediction of expressed genes in this region based on transcription analysis revealed four candidate genes. Among these, CaPPR6 encodes a pentatricopeptide repeat (PPR) protein with PPR motifs that are repeated 14 times. Characterization of the CaPPR6 protein sequence, based on alignment with other homologs, showed that CaPPR6 is a typical Rf-like (RFL) gene reported to have undergone diversifying selection during evolution. A marker developed from a sequence near CaPPR6 showed a higher prediction rate of the Rf phenotype than those of previously developed markers when applied to a panel of breeding lines of diverse origin. These results suggest that CaPPR6 is a strong candidate for the Rf gene in pepper.
Analysis of shared homozygosity regions in Saudi siblings with attention deficit hyperactivity disorder

PubMed Central

Al Yemni, Eman A.A.; Alnaemi, Faten M.; Abebe, Dejene; Al-Abdulaziz, Basma S.; Al Mubarak, Bashayer R.; Ghaziuddin, Mohammad; Al Tassan, Nada A.

2017-01-01

Aim Genetic and clinical complexities are common features of most psychiatric illnesses that pose a major obstacle in risk-gene identification. Attention deficit hyperactivity disorder (ADHD) is the most prevalent child-onset psychiatric illness, with high heritability. Over the past decade, numerous genetic studies utilizing various approaches, such as genome-wide association, candidate-gene association, and linkage analysis, have identified a multitude of candidate loci/genes. However, such studies have yielded diverse findings that are rarely reproduced, indicating that other genetic determinants have not been discovered yet. In this study, we carried out sib-pair analysis on seven multiplex families with ADHD from Saudi Arabia. We aimed to identify the candidate chromosomal regions and genes linked to the disease. Patients and methods A total of 41 individuals from multiplex families were analyzed for shared regions of homozygosity. Genes within these regions were prioritized according to their potential relevance to ADHD. Results We identified multiple genomic regions spanning different chromosomes to be shared among affected members of each family; these included chromosomes 3, 5, 6, 7, 8, 9, 10, 13, 17, and 18. We also found specific regions on chromosomes 8 and 17 to be shared between affected individuals from more than one family. Among the genes present in the regions reported here were involved in neurotransmission (GRM3, SIGMAR1, CHAT, and SLC18A3) and members of the HLA gene family (HLA-A, HLA-DPA1, and MICC). Conclusion The candidate regions identified in this study highlight the genetic diversity of ADHD. Upon further investigation, these loci may reveal candidate genes that enclose variants associated with ADHD. Although most ADHD studies were conducted in other populations, our study provides insight from an understudied, ethnically interesting population. PMID:28452824
Analysis of shared homozygosity regions in Saudi siblings with attention deficit hyperactivity disorder.

PubMed

Shinwari, Jameela M A; Al Yemni, Eman A A; Alnaemi, Faten M; Abebe, Dejene; Al-Abdulaziz, Basma S; Al Mubarak, Bashayer R; Ghaziuddin, Mohammad; Al Tassan, Nada A

2017-08-01

Genetic and clinical complexities are common features of most psychiatric illnesses that pose a major obstacle in risk-gene identification. Attention deficit hyperactivity disorder (ADHD) is the most prevalent child-onset psychiatric illness, with high heritability. Over the past decade, numerous genetic studies utilizing various approaches, such as genome-wide association, candidate-gene association, and linkage analysis, have identified a multitude of candidate loci/genes. However, such studies have yielded diverse findings that are rarely reproduced, indicating that other genetic determinants have not been discovered yet. In this study, we carried out sib-pair analysis on seven multiplex families with ADHD from Saudi Arabia. We aimed to identify the candidate chromosomal regions and genes linked to the disease. A total of 41 individuals from multiplex families were analyzed for shared regions of homozygosity. Genes within these regions were prioritized according to their potential relevance to ADHD. We identified multiple genomic regions spanning different chromosomes to be shared among affected members of each family; these included chromosomes 3, 5, 6, 7, 8, 9, 10, 13, 17, and 18. We also found specific regions on chromosomes 8 and 17 to be shared between affected individuals from more than one family. Among the genes present in the regions reported here were involved in neurotransmission (GRM3, SIGMAR1, CHAT, and SLC18A3) and members of the HLA gene family (HLA-A, HLA-DPA1, and MICC). The candidate regions identified in this study highlight the genetic diversity of ADHD. Upon further investigation, these loci may reveal candidate genes that enclose variants associated with ADHD. Although most ADHD studies were conducted in other populations, our study provides insight from an understudied, ethnically interesting population.
Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes

PubMed Central

Edmunds, Richard C.; Su, Baofeng; Balhoff, James P.; Eames, B. Frank; Dahdul, Wasila M.; Lapp, Hilmar; Lundberg, John G.; Vision, Todd J.; Dunham, Rex A.; Mabee, Paula M.; Westerfield, Monte

2016-01-01

Phenotypes resulting from mutations in genetic model organisms can help reveal candidate genes for evolutionarily important phenotypic changes in related taxa. Although testing candidate gene hypotheses experimentally in nonmodel organisms is typically difficult, ontology-driven information systems can help generate testable hypotheses about developmental processes in experimentally tractable organisms. Here, we tested candidate gene hypotheses suggested by expert use of the Phenoscape Knowledgebase, specifically looking for genes that are candidates responsible for evolutionarily interesting phenotypes in the ostariophysan fishes that bear resemblance to mutant phenotypes in zebrafish. For this, we searched ZFIN for genetic perturbations that result in either loss of basihyal element or loss of scales phenotypes, because these are the ancestral phenotypes observed in catfishes (Siluriformes). We tested the identified candidate genes by examining their endogenous expression patterns in the channel catfish, Ictalurus punctatus. The experimental results were consistent with the hypotheses that these features evolved through disruption in developmental pathways at, or upstream of, brpf1 and eda/edar for the ancestral losses of basihyal element and scales, respectively. These results demonstrate that ontological annotations of the phenotypic effects of genetic alterations in model organisms, when aggregated within a knowledgebase, can be used effectively to generate testable, and useful, hypotheses about evolutionary changes in morphology. PMID:26500251
Candidate Chemosensory Genes in the Stemborer Sesamia nonagrioides

PubMed Central

Glaser, Nicolas; Gallot, Aurore; Legeai, Fabrice; Montagné, Nicolas; Poivet, Erwan; Harry, Myriam; Calatayud, Paul-André; Jacquin-Joly, Emmanuelle

2013-01-01

The stemborer Sesamia nonagrioides is an important pest of maize in the Mediterranean Basin. Like other moths, this noctuid uses its chemosensory system to efficiently interact with its environment. However, very little is known on the molecular mechanisms that underlie chemosensation in this species. Here, we used next-generation sequencing (454 and Illumina) on different tissues from adult and larvae, including chemosensory organs and female ovipositors, to describe the chemosensory transcriptome of S. nonagrioides and identify key molecular components of the pheromone production and detection systems. We identified a total of 68 candidate chemosensory genes in this species, including 31 candidate binding-proteins and 23 chemosensory receptors. In particular, we retrieved the three co-receptors Orco, IR25a and IR8a necessary for chemosensory receptor functioning. Focusing on the pheromonal communication system, we identified a new pheromone-binding protein in this species, four candidate pheromone receptors and 12 carboxylesterases as candidate acetate degrading enzymes. In addition, we identified enzymes putatively involved in S. nonagrioides pheromone biosynthesis, including a ∆11-desaturase and different acetyltransferases and reductases. RNAseq analyses and RT-PCR were combined to profile gene expression in different tissues. This study constitutes the first large scale description of chemosensory genes in S. nonagrioides. PMID:23781142
A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

PubMed

Ishikawa, Akira

2017-11-27

Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Survey of candidate genes for maize resistance to infection by Aspergillus flavus and/or aflatoxin contamination

Treesearch

Leigh Hawkins; Marilyn Warburton; Juliet Tang; John Tomashek; Dafne Alves Oliveira; Oluwaseun Ogunola; J. Smith; W. Williams

2018-01-01

Many projects have identified candidate genes for resistance to aflatoxin accumulation or Aspergillus flavus infection and growth in maize using genetic mapping, genomics, transcriptomics and/or proteomics studies. However, only a small percentage of these candidates have been validated in field conditions, and their relative contribution to...
CHK2, A Candidate Prostate Cancer Susceptibility Gene

DTIC Science & Technology

2003-01-01

To identify prostate cancer susceptibility genes, we applied a mutation screening of candidate gene approach. We screened for mutations in CHEK2 , the...families, 400 sporadic cases, and 423 unaffected men as control. A total of 28 (4.8%) germline CHEK2 mutations were found among 578 patients and...additional 11 in 9 families. Sixteen of 18 unique CHEK2 mutations identified in this study were not detected among 423 unaffected men, suggesting a
Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture.

PubMed

González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R

2016-01-01

Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.
In silico identification of genetically attenuated vaccine candidate genes for Plasmodium liver stage.

PubMed

Kumar, Hirdesh; Frischknecht, Friedrich; Mair, Gunnar R; Gomes, James

2015-12-01

Genetically attenuated parasites (GAPs) that lack genes essential for the liver stage of the malaria parasite, and therefore cause developmental arrest, have been developed as live vaccines in rodent malaria models and recently been tested in humans. The genes targeted for deletion were often identified by trial and error. Here we present a systematic gene - protein and transcript - expression analyses of several Plasmodium species with the aim to identify candidate genes for the generation of novel GAPs. With a lack of liver stage expression data for human malaria parasites, we used data available for liver stage development of Plasmodium yoelii, a rodent malaria model, to identify proteins expressed in the liver stage but absent from blood stage parasites. An orthology-based search was then employed to identify orthologous proteins in the human malaria parasite Plasmodium falciparum resulting in a total of 310 genes expressed in the liver stage but lacking evidence of protein expression in blood stage parasites. Among these 310 possible GAP candidates, we further studied Plasmodium liver stage proteins by phyletic distribution and functional domain analyses and shortlisted twenty GAP-candidates; these are: fabB/F, fabI, arp, 3 genes encoding subunits of the PDH complex, dnaJ, urm1, rS5, ancp, mcp, arh, gk, lisp2, valS, palm, and four conserved Plasmodium proteins of unknown function. Parasites lacking one or several of these genes might yield new attenuated malaria parasites for experimental vaccination studies. Copyright © 2015 Elsevier B.V. All rights reserved.
Cellular dissection of psoriasis for transcriptome analyses and the post-GWAS era

PubMed Central

2014-01-01

Background Genome-scale studies of psoriasis have been used to identify genes of potential relevance to disease mechanisms. For many identified genes, however, the cell type mediating disease activity is uncertain, which has limited our ability to design gene functional studies based on genomic findings. Methods We identified differentially expressed genes (DEGs) with altered expression in psoriasis lesions (n = 216 patients), as well as candidate genes near susceptibility loci from psoriasis GWAS studies. These gene sets were characterized based upon their expression across 10 cell types present in psoriasis lesions. Susceptibility-associated variation at intergenic (non-coding) loci was evaluated to identify sites of allele-specific transcription factor binding. Results Half of DEGs showed highest expression in skin cells, although the dominant cell type differed between psoriasis-increased DEGs (keratinocytes, 35%) and psoriasis-decreased DEGs (fibroblasts, 33%). In contrast, psoriasis GWAS candidates tended to have highest expression in immune cells (71%), with a significant fraction showing maximal expression in neutrophils (24%, P < 0.001). By identifying candidate cell types for genes near susceptibility loci, we could identify and prioritize SNPs at which susceptibility variants are predicted to influence transcription factor binding. This led to the identification of potentially causal (non-coding) SNPs for which susceptibility variants influence binding of AP-1, NF-κB, IRF1, STAT3 and STAT4. Conclusions These findings underscore the role of innate immunity in psoriasis and highlight neutrophils as a cell type linked with pathogenetic mechanisms. Assignment of candidate cell types to genes emerging from GWAS studies provides a first step towards functional analysis, and we have proposed an approach for generating hypotheses to explain GWAS hits at intergenic loci. PMID:24885462
Genome-Wide Association Study Identifies Candidate Genes for Starch Content Regulation in Maize Kernels

PubMed Central

Liu, Na; Xue, Yadong; Guo, Zhanyong; Li, Weihua; Tang, Jihua

2016-01-01

Kernel starch content is an important trait in maize (Zea mays L.) as it accounts for 65–75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60 to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM) as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001), among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437) is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops. PMID:27512395
Combining mouse mammary gland gene expression and comparative mapping for the identification of candidate genes for QTL of milk production traits in cattle

PubMed Central

Ron, Micha; Israeli, Galit; Seroussi, Eyal; Weller, Joel I; Gregg, Jeffrey P; Shani, Moshe; Medrano, Juan F

2007-01-01

Background Many studies have found segregating quantitative trait loci (QTL) for milk production traits in different dairy cattle populations. However, even for relatively large effects with a saturated marker map the confidence interval for QTL location by linkage analysis spans tens of map units, or hundreds of genes. Combining mapping and arraying has been suggested as an approach to identify candidate genes. Thus, gene expression analysis in the mammary gland of genes positioned in the confidence interval of the QTL can bridge the gap between fine mapping and quantitative trait nucleotide (QTN) determination. Results We hybridized Affymetrix microarray (MG-U74v2), containing 12,488 murine probes, with RNA derived from mammary gland of virgin, pregnant, lactating and involuting C57BL/6J mice in a total of nine biological replicates. We combined microarray data from two additional studies that used the same design in mice with a total of 75 biological replicates. The same filtering and normalization was applied to each microarray data using GeneSpring software. Analysis of variance identified 249 differentially expressed probe sets common to the three experiments along the four developmental stages of puberty, pregnancy, lactation and involution. 212 genes were assigned to their bovine map positions through comparative mapping, and thus form a list of candidate genes for previously identified QTLs for milk production traits. A total of 82 of the genes showed mammary gland-specific expression with at least 3-fold expression over the median representing all tissues tested in GeneAtlas. Conclusion This work presents a web tool for candidate genes for QTL (cgQTL) that allows navigation between the map of bovine milk production QTL, potential candidate genes and their level of expression in mammary gland arrays and in GeneAtlas. Three out of four confirmed genes that affect QTL in livestock (ABCG2, DGAT1, GDF8, IGF2) were over expressed in the target organ. Thus, cgQTL can be used to determine priority of candidate genes for QTN analysis based on differential expression in the target organ. PMID:17584498
A direct molecular link between the autism candidate gene RORa and the schizophrenia candidate MIR137

NASA Astrophysics Data System (ADS)

Devanna, Paolo; Vernes, Sonja C.

2014-02-01

Retinoic acid-related orphan receptor alpha gene (RORa) and the microRNA MIR137 have both recently been identified as novel candidate genes for neuropsychiatric disorders. RORa encodes a ligand-dependent orphan nuclear receptor that acts as a transcriptional regulator and miR-137 is a brain enriched small non-coding RNA that interacts with gene transcripts to control protein levels. Given the mounting evidence for RORa in autism spectrum disorders (ASD) and MIR137 in schizophrenia and ASD, we investigated if there was a functional biological relationship between these two genes. Herein, we demonstrate that miR-137 targets the 3'UTR of RORa in a site specific manner. We also provide further support for MIR137 as an autism candidate by showing that a large number of previously implicated autism genes are also putatively targeted by miR-137. This work supports the role of MIR137 as an ASD candidate and demonstrates a direct biological link between these previously unrelated autism candidate genes.
The cld mutation: narrowing the critical chromosomal region and selecting candidate genes.

PubMed

Péterfy, Miklós; Mao, Hui Z; Doolittle, Mark H

2006-10-01

Combined lipase deficiency (cld) is a recessive, lethal mutation specific to the tw73 haplotype on mouse Chromosome 17. While the cld mutation results in lipase proteins that are inactive, aggregated, and retained in the endoplasmic reticulum (ER), it maps separately from the lipase structural genes. We have narrowed the gene critical region by about 50% using the tw18 haplotype for deletion mapping and a recombinant chromosome used originally to map cld with respect to the phenotypic marker tf. The region now extends from 22 to 25.6 Mbp on the wild-type chromosome, currently containing 149 genes and 50 expressed sequence tags (ESTs). To identify the affected gene, we have selected candidates based on their known role in associated biological processes, cellular components, and molecular functions that best fit with the predicted function of the cld gene. A secondary approach was based on differences in mRNA levels between mutant (cld/cld) and unaffected (+/cld) cells. Using both approaches, we have identified seven functional candidates with an ER localization and/or an involvement in protein maturation and folding that could explain the lipase deficiency, and six expression candidates that exhibit large differences in mRNA levels between mutant and unaffected cells. Significantly, two genes were found to be candidates with regard to both function and expression, thus emerging as the strongest candidates for cld. We discuss the implications of our mapping results and our selection of candidates with respect to other genes, deletions, and mutations occurring in the cld critical region.
A priori and a posteriori approaches for finding genes of evolutionary interest in non-model species: osmoregulatory genes in the kidney transcriptome of the desert rodent Dipodomys spectabilis (banner-tailed kangaroo rat).

PubMed

Marra, Nicholas J; Eo, Soo Hyung; Hale, Matthew C; Waser, Peter M; DeWoody, J Andrew

2012-12-01

One common goal in evolutionary biology is the identification of genes underlying adaptive traits of evolutionary interest. Recently next-generation sequencing techniques have greatly facilitated such evolutionary studies in species otherwise depauperate of genomic resources. Kangaroo rats (Dipodomys sp.) serve as exemplars of adaptation in that they inhabit extremely arid environments, yet require no drinking water because of ultra-efficient kidney function and osmoregulation. As a basis for identifying water conservation genes in kangaroo rats, we conducted a priori bioinformatics searches in model rodents (Mus musculus and Rattus norvegicus) to identify candidate genes with known or suspected osmoregulatory function. We then obtained 446,758 reads via 454 pyrosequencing to characterize genes expressed in the kidney of banner-tailed kangaroo rats (Dipodomys spectabilis). We also determined candidates a posteriori by identifying genes that were overexpressed in the kidney. The kangaroo rat sequences revealed nine different a priori candidate genes predicted from our Mus and Rattus searches, as well as 32 a posteriori candidate genes that were overexpressed in kidney. Mutations in two of these genes, Slc12a1 and Slc12a3, cause human renal diseases that result in the inability to concentrate urine. These genes are likely key determinants of physiological water conservation in desert rodents. Copyright © 2012 Elsevier Inc. All rights reserved.
Identifying candidate genes for Type 2 Diabetes Mellitus and obesity through gene expression profiling in multiple tissues or cells.

PubMed

Chen, Junhui; Meng, Yuhuan; Zhou, Jinghui; Zhuo, Min; Ling, Fei; Zhang, Yu; Du, Hongli; Wang, Xiaoning

2013-01-01

Type 2 Diabetes Mellitus (T2DM) and obesity have become increasingly prevalent in recent years. Recent studies have focused on identifying causal variations or candidate genes for obesity and T2DM via analysis of expression quantitative trait loci (eQTL) within a single tissue. T2DM and obesity are affected by comprehensive sets of genes in multiple tissues. In the current study, gene expression levels in multiple human tissues from GEO datasets were analyzed, and 21 candidate genes displaying high percentages of differential expression were filtered out. Specifically, DENND1B, LYN, MRPL30, POC1B, PRKCB, RP4-655J12.3, HIBADH, and TMBIM4 were identified from the T2DM-control study, and BCAT1, BMP2K, CSRNP2, MYNN, NCKAP5L, SAP30BP, SLC35B4, SP1, BAP1, GRB14, HSP90AB1, ITGA5, and TOMM5 were identified from the obesity-control study. The majority of these genes are known to be involved in T2DM and obesity. Therefore, analysis of gene expression in various tissues using GEO datasets may be an effective and feasible method to determine novel or causal genes associated with T2DM and obesity.
Candidate Gene Identification of Feed Efficiency and Coat Color Traits in a C57BL/6J × Kunming F2 Mice Population Using Genome-Wide Association Study.

PubMed

Miao, Yuanxin; Soudy, Fathia; Xu, Zhong; Liao, Mingxing; Zhao, Shuhong; Li, Xinyun

2017-01-01

Feed efficiency (FE) is a very important trait in livestock industry. Identification of the candidate genes could be of benefit for the improvement of FE trait. Mouse is used as the model for many studies in mammals. In this study, the candidate genes related to FE and coat color were identified using C57BL/6J (C57) × Kunming (KM) F2 mouse population. GWAS results showed that 61 and 2 SNPs were genome-wise suggestive significantly associated with feed conversion ratio (FCR) and feed intake (FI) traits, respectively. Moreover, the Erbin, Msrb2, Ptf1a, and Fgf10 were considered as the candidate genes of FE. The Lpl was considered as the candidate gene of FI. Further, the coat color trait was studied. KM mice are white and C57 ones are black. The GWAS results showed that the most significant SNP was located at chromosome 7, and the closely linked gene was Tyr. Therefore, our study offered useful target genes related to FE in mice; these genes may play similar roles in FE of livestock. Also, we identified the major gene of coat color in mice, which would be useful for better understanding of natural mutation of the coat color in mice.

Identification and characterization of nuclear genes involved in photosynthesis in Populus

PubMed Central

2014-01-01

Background The gap between the real and potential photosynthetic rate under field conditions suggests that photosynthesis could potentially be improved. Nuclear genes provide possible targets for improving photosynthetic efficiency. Hence, genome-wide identification and characterization of the nuclear genes affecting photosynthetic traits in woody plants would provide key insights on genetic regulation of photosynthesis and identify candidate processes for improvement of photosynthesis. Results Using microarray and bulked segregant analysis strategies, we identified differentially expressed nuclear genes for photosynthesis traits in a segregating population of poplar. We identified 515 differentially expressed genes in this population (FC ≥ 2 or FC ≤ 0.5, P < 0.05), 163 up-regulated and 352 down-regulated. Real-time PCR expression analysis confirmed the microarray data. Singular Enrichment Analysis identified 48 significantly enriched GO terms for molecular functions (28), biological processes (18) and cell components (2). Furthermore, we selected six candidate genes for functional examination by a single-marker association approach, which demonstrated that 20 SNPs in five candidate genes significantly associated with photosynthetic traits, and the phenotypic variance explained by each SNP ranged from 2.3% to 12.6%. This revealed that regulation of photosynthesis by the nuclear genome mainly involves transport, metabolism and response to stimulus functions. Conclusions This study provides new genome-scale strategies for the discovery of potential candidate genes affecting photosynthesis in Populus, and for identification of the functions of genes involved in regulation of photosynthesis. This work also suggests that improving photosynthetic efficiency under field conditions will require the consideration of multiple factors, such as stress responses. PMID:24673936
A genomic scan for selection reveals candidates for genes involved in the evolution of cultivated sunflower (Helianthus annuus).

PubMed

Chapman, Mark A; Pashley, Catherine H; Wenzler, Jessica; Hvala, John; Tang, Shunxue; Knapp, Steven J; Burke, John M

2008-11-01

Genomic scans for selection are a useful tool for identifying genes underlying phenotypic transitions. In this article, we describe the results of a genome scan designed to identify candidates for genes targeted by selection during the evolution of cultivated sunflower. This work involved screening 492 loci derived from ESTs on a large panel of wild, primitive (i.e., landrace), and improved sunflower (Helianthus annuus) lines. This sampling strategy allowed us to identify candidates for selectively important genes and investigate the likely timing of selection. Thirty-six genes showed evidence of selection during either domestication or improvement based on multiple criteria, and a sequence-based test of selection on a subset of these loci confirmed this result. In view of what is known about the structure of linkage disequilibrium across the sunflower genome, these genes are themselves likely to have been targeted by selection, rather than being merely linked to the actual targets. While the selection candidates showed a broad range of putative functions, they were enriched for genes involved in amino acid synthesis and protein catabolism. Given that a similar pattern has been detected in maize (Zea mays), this finding suggests that selection on amino acid composition may be a general feature of the evolution of crop plants. In terms of genomic locations, the selection candidates were significantly clustered near quantitative trait loci (QTL) that contribute to phenotypic differences between wild and cultivated sunflower, and specific instances of QTL colocalization provide some clues as to the roles that these genes may have played during sunflower evolution.
The Candidate Cancer Gene Database: a database of cancer driver genes from forward genetic screens in mice.

PubMed

Abbott, Kenneth L; Nyre, Erik T; Abrahante, Juan; Ho, Yen-Yi; Isaksson Vogel, Rachel; Starr, Timothy K

2015-01-01

Identification of cancer driver gene mutations is crucial for advancing cancer therapeutics. Due to the overwhelming number of passenger mutations in the human tumor genome, it is difficult to pinpoint causative driver genes. Using transposon mutagenesis in mice many laboratories have conducted forward genetic screens and identified thousands of candidate driver genes that are highly relevant to human cancer. Unfortunately, this information is difficult to access and utilize because it is scattered across multiple publications using different mouse genome builds and strength metrics. To improve access to these findings and facilitate meta-analyses, we developed the Candidate Cancer Gene Database (CCGD, http://ccgd-starrlab.oit.umn.edu/). The CCGD is a manually curated database containing a unified description of all identified candidate driver genes and the genomic location of transposon common insertion sites (CISs) from all currently published transposon-based screens. To demonstrate relevance to human cancer, we performed a modified gene set enrichment analysis using KEGG pathways and show that human cancer pathways are highly enriched in the database. We also used hierarchical clustering to identify pathways enriched in blood cancers compared to solid cancers. The CCGD is a novel resource available to scientists interested in the identification of genetic drivers of cancer. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
3p22.1p21.31 microdeletion identifies CCK as Asperger syndrome candidate gene and shows the way for therapeutic strategies in chromosome imbalances.

PubMed

Iourov, Ivan Y; Vorsanova, Svetlana G; Voinova, Victoria Y; Yurov, Yuri B

2015-01-01

In contrast to other autism spectrum disorders, chromosome abnormalities are rare in Asperger syndrome (AS) or high-functioning autism. Consequently, AS was occasionally subjected to classical positional cloning. Here, we report on a case of AS associated with a deletion of the short arm of chromosome 3. Further in silico analysis has identified a candidate gene for AS and has suggested a therapeutic strategy for manifestations of the chromosome rearrangement. Using array comparative genomic hybridization, an interstitial deletion of 3p22.1p21.31 (~2.5 Mb in size) in a child with Asperger's syndrome, seborrheic dermatitis and chronic pancreatitis was detected. Original bioinformatic approach to the prioritization of candidate genes/processes identified CCK (cholecystokinin) as a candidate gene for AS. In addition to processes associated with deleted genes, bioinformatic analysis of CCK gene interactome indicated that zinc deficiency might be a pathogenic mechanism in this case. This suggestion was supported by plasma zinc concentration measurements. The increase of zinc intake produced a rise in zinc plasma concentration and the improvement in the patient's condition. Our study supported previous linkage findings and had suggested a new candidate gene in AS. Moreover, bioinformatic analysis identified the pathogenic mechanism, which was used to propose a therapeutic strategy for manifestations of the deletion. The relative success of this strategy allows speculating that therapeutic or dietary normalization of metabolic processes altered by a chromosome imbalance or genomic copy number variations may be a way for treating at least a small proportion of cases of these presumably incurable genetic conditions.
Novel candidate genes may be possible predisposing factors revealed by whole exome sequencing in familial esophageal squamous cell carcinoma.

PubMed

Forouzanfar, Narjes; Baranova, Ancha; Milanizadeh, Saman; Heravi-Moussavi, Alireza; Jebelli, Amir; Abbaszadegan, Mohammad Reza

2017-05-01

Esophageal squamous cell carcinoma is one of the deadliest of all the cancers. Its metastatic properties portend poor prognosis and high rate of recurrence. A more advanced method to identify new molecular biomarkers predicting disease prognosis can be whole exome sequencing. Here, we report the most effective genetic variants of the Notch signaling pathway in esophageal squamous cell carcinoma susceptibility by whole exome sequencing. We analyzed nine probands in unrelated familial esophageal squamous cell carcinoma pedigrees to identify candidate genes. Genomic DNA was extracted and whole exome sequencing performed to generate information about genetic variants in the coding regions. Bioinformatics software applications were utilized to exploit statistical algorithms to demonstrate protein structure and variants conservation. Polymorphic regions were excluded by false-positive investigations. Gene-gene interactions were analyzed for Notch signaling pathway candidates. We identified novel and damaging variants of the Notch signaling pathway through extensive pathway-oriented filtering and functional predictions, which led to the study of 27 candidate novel mutations in all nine patients. Detection of the trinucleotide repeat containing 6B gene mutation (a slice site alteration) in five of the nine probands, but not in any of the healthy samples, suggested that it may be a susceptibility factor for familial esophageal squamous cell carcinoma. Noticeably, 8 of 27 novel candidate gene mutations (e.g. epidermal growth factor, signal transducer and activator of transcription 3, MET) act in a cascade leading to cell survival and proliferation. Our results suggest that the trinucleotide repeat containing 6B mutation may be a candidate predisposing gene in esophageal squamous cell carcinoma. In addition, some of the Notch signaling pathway genetic mutations may act as key contributors to esophageal squamous cell carcinoma.
Comparative analysis of protein interactome networks prioritizes candidate genes with cancer signatures.

PubMed

Li, Yongsheng; Sahni, Nidhi; Yi, Song

2016-11-29

Comprehensive understanding of human cancer mechanisms requires the identification of a thorough list of cancer-associated genes, which could serve as biomarkers for diagnoses and therapies in various types of cancer. Although substantial progress has been made in functional studies to uncover genes involved in cancer, these efforts are often time-consuming and costly. Therefore, it remains challenging to comprehensively identify cancer candidate genes. Network-based methods have accelerated this process through the analysis of complex molecular interactions in the cell. However, the extent to which various interactome networks can contribute to prediction of candidate genes responsible for cancer is still enigmatic. In this study, we evaluated different human protein-protein interactome networks and compared their application to cancer gene prioritization. Our results indicate that network analyses can increase the power to identify novel cancer genes. In particular, such predictive power can be enhanced with the use of unbiased systematic protein interaction maps for cancer gene prioritization. Functional analysis reveals that the top ranked genes from network predictions co-occur often with cancer-related terms in literature, and further, these candidate genes are indeed frequently mutated across cancers. Finally, our study suggests that integrating interactome networks with other omics datasets could provide novel insights into cancer-associated genes and underlying molecular mechanisms.
Transcription map of Xq27: candidates for several X-linked diseases.

PubMed

Zucchi, I; Jones, J; Affer, M; Montagna, C; Redolfi, E; Susani, L; Vezzoni, P; Parvari, R; Schlessinger, D; Whyte, M P; Mumm, S

1999-04-15

Human Xq27 contains candidate regions for several disorders, yet is predicted to be a gene-poor cytogenetic band. We have developed a transcription map for the entire cytogenetic band to facilitate the identification of the relatively small number of expected candidate genes. Two approaches were taken to identify genes: (1) a group of 64 unique STSs that were generated during the physical mapping of the region were used in RT-PCR with RNA from human adult and fetal brain and (2) ESTs that have been broadly mapped to this region of the chromosome were finely mapped using a high-resolution yeast artificial chromosome contig. This combined approach identified four distinct regions of transcriptional activity within the Xq27 band. Among them is a region at the centromeric boundary that contains candidate regions for several rare developmental disorders (X-linked recessive hypoparathyroidism, thoracoabdominal syndrome, albinism-deafness syndrome, and Borjeson-Forssman-Lehman syndrome). Two transcriptionally active regions were identified in the center of Xq27 and include candidate regions for X-linked mental retardation syndrome 6, X-linked progressive cone dystrophy, X-linked retinitis pigmentosa 24, and a prostate cancer susceptibility locus. The fourth region of transcriptional activity encompasses the FMR1 (FRAXA) and FMR2 (FRAXE) genes. The analysis thus suggests clustered transcription in Xq27 and provides candidates for several heritable disorders for which the causative genes have not yet been found. Copyright 1999 Academic Press.
Identification of Enzyme Genes Using Chemical Structure Alignments of Substrate-Product Pairs.

PubMed

Moriya, Yuki; Yamada, Takuji; Okuda, Shujiro; Nakagawa, Zenichi; Kotera, Masaaki; Tokimatsu, Toshiaki; Kanehisa, Minoru; Goto, Susumu

2016-03-28

Although there are several databases that contain data on many metabolites and reactions in biochemical pathways, there is still a big gap in the numbers between experimentally identified enzymes and metabolites. It is supposed that many catalytic enzyme genes are still unknown. Although there are previous studies that estimate the number of candidate enzyme genes, these studies required some additional information aside from the structures of metabolites such as gene expression and order in the genome. In this study, we developed a novel method to identify a candidate enzyme gene of a reaction using the chemical structures of the substrate-product pair (reactant pair). The proposed method is based on a search for similar reactant pairs in a reference database and offers ortholog groups that possibly mediate the given reaction. We applied the proposed method to two experimentally validated reactions. As a result, we confirmed that the histidine transaminase was correctly identified. Although our method could not directly identify the asparagine oxo-acid transaminase, we successfully found the paralog gene most similar to the correct enzyme gene. We also applied our method to infer candidate enzyme genes in the mesaconate pathway. The advantage of our method lies in the prediction of possible genes for orphan enzyme reactions where any associated gene sequences are not determined yet. We believe that this approach will facilitate experimental identification of genes for orphan enzymes.
No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes.

PubMed

Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C

2017-11-15

A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same study identified other variants within those candidate genes that demonstrated genome-wide significant associations with schizophrenia. As such, it is possible that variants within historic schizophrenia candidate genes are associated with schizophrenia at levels above those expected by chance, even if the most-studied specific polymorphisms are not. The present study used association statistics from the largest schizophrenia genome-wide association study conducted to date as input to a gene set analysis to investigate whether variants within schizophrenia candidate genes are enriched for association with schizophrenia. As a group, variants in the most-studied candidate genes were no more associated with schizophrenia than were variants in control sets of noncandidate genes. While a small subset of candidate genes did appear to be significantly associated with schizophrenia, these genes were not particularly noteworthy given the large number of more strongly associated noncandidate genes. The history of schizophrenia research should serve as a cautionary tale to candidate gene investigators examining other phenotypes: our findings indicate that the most investigated candidate gene hypotheses of schizophrenia are not well supported by genome-wide association studies, and it is likely that this will be the case for other complex traits as well. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Using Association Mapping in Teosinte (Zea Mays ssp Parviglumis) to Investigate the Function of Selection-Candidate Genes

USDA-ARS?s Scientific Manuscript database

Large-scale screens of the maize genome identified 48 genes that show the putative signature of artificial selection during maize domestication or improvement. These selection-candidate genes may act as quantitative trait loci (QTL) that control the phenotypic differences between maize and its proge...
Haploinsufficiency of TAB2 Causes Congenital Heart Defects in Humans

PubMed Central

Thienpont, Bernard; Zhang, Litu; Postma, Alex V.; Breckpot, Jeroen; Tranchevent, Léon-Charles; Van Loo, Peter; Møllgård, Kjeld; Tommerup, Niels; Bache, Iben; Tümer, Zeynep; van Engelen, Klaartje; Menten, Björn; Mortier, Geert; Waggoner, Darrel; Gewillig, Marc; Moreau, Yves; Devriendt, Koen; Larsen, Lars Allan

2010-01-01

Congenital heart defects (CHDs) are the most common major developmental anomalies and the most frequent cause for perinatal mortality, but their etiology remains often obscure. We identified a locus for CHDs on 6q24-q25. Genotype-phenotype correlations in 12 patients carrying a chromosomal deletion on 6q delineated a critical 850 kb region on 6q25.1 harboring five genes. Bioinformatics prioritization of candidate genes in this locus for a role in CHDs identified the TGF-β-activated kinase 1/MAP3K7 binding protein 2 gene (TAB2) as the top-ranking candidate gene. A role for this candidate gene in cardiac development was further supported by its conserved expression in the developing human and zebrafish heart. Moreover, a critical, dosage-sensitive role during development was demonstrated by the cardiac defects observed upon titrated knockdown of tab2 expression in zebrafish embryos. To definitively confirm the role of this candidate gene in CHDs, we performed mutation analysis of TAB2 in 402 patients with a CHD, which revealed two evolutionarily conserved missense mutations. Finally, a balanced translocation was identified, cosegregating with familial CHD. Mapping of the breakpoints demonstrated that this translocation disrupts TAB2. Taken together, these data clearly demonstrate a role for TAB2 in human cardiac development. PMID:20493459
Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture

PubMed Central

González-Plaza, Juan J.; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F.; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R.; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R.

2016-01-01

Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species. PMID:26973682
In Silico Gene Prioritization by Integrating Multiple Data Sources

PubMed Central

Zhou, Yingyao; Shields, Robert; Chanda, Sumit K.; Elston, Robert C.; Li, Jing

2011-01-01

Identifying disease genes is crucial to the understanding of disease pathogenesis, and to the improvement of disease diagnosis and treatment. In recent years, many researchers have proposed approaches to prioritize candidate genes by considering the relationship of candidate genes and existing known disease genes, reflected in other data sources. In this paper, we propose an expandable framework for gene prioritization that can integrate multiple heterogeneous data sources by taking advantage of a unified graphic representation. Gene-gene relationships and gene-disease relationships are then defined based on the overall topology of each network using a diffusion kernel measure. These relationship measures are in turn normalized to derive an overall measure across all networks, which is utilized to rank all candidate genes. Based on the informativeness of available data sources with respect to each specific disease, we also propose an adaptive threshold score to select a small subset of candidate genes for further validation studies. We performed large scale cross-validation analysis on 110 disease families using three data sources. Results have shown that our approach consistently outperforms other two state of the art programs. A case study using Parkinson disease (PD) has identified four candidate genes (UBB, SEPT5, GPR37 and TH) that ranked higher than our adaptive threshold, all of which are involved in the PD pathway. In particular, a very recent study has observed a deletion of TH in a patient with PD, which supports the importance of the TH gene in PD pathogenesis. A web tool has been implemented to assist scientists in their genetic studies. PMID:21731658
In Silico Identification of Candidate Genes for Fertility Restoration in Cytoplasmic Male Sterile Perennial Ryegrass (Lolium perenne L.)

PubMed Central

Sykes, Timothy; Yates, Steven; Nagy, Istvan; Asp, Torben; Small, Ian

2017-01-01

Perennial ryegrass (Lolium perenne L.) is widely used for forage production in both permanent and temporary grassland systems. To increase yields in perennial ryegrass, recent breeding efforts have been focused on strategies to more efficiently exploit heterosis by hybrid breeding. Cytoplasmic male sterility (CMS) is a widely applied mechanism to control pollination for commercial hybrid seed production and although CMS systems have been identified in perennial ryegrass, they are yet to be fully characterized. Here, we present a bioinformatics pipeline for efficient identification of candidate restorer of fertility (Rf) genes for CMS. From a high-quality draft of the perennial ryegrass genome, 373 pentatricopeptide repeat (PPR) genes were identified and classified, further identifying 25 restorer of fertility-like PPR (RFL) genes through a combination of DNA sequence clustering and comparison to known Rf genes. This extensive gene family was targeted as the majority of Rf genes in higher plants are RFL genes. These RFL genes were further investigated by phylogenetic analyses, identifying three groups of perennial ryegrass RFLs. These three groups likely represent genomic regions of active RFL generation and identify the probable location of perennial ryegrass PPR-Rf genes. This pipeline allows for the identification of candidate PPR-Rf genes from genomic sequence data and can be used in any plant species. Functional markers for PPR-Rf genes will facilitate map-based cloning of Rf genes and enable the use of CMS as an efficient tool to control pollination for hybrid crop production. PMID:26951780
Novel genes on rat chromosome 10 are linked to body fat mass, preadipocyte number and adipocyte size.

PubMed

Weingarten, A; Turchetti, L; Krohn, K; Klöting, I; Kern, M; Kovacs, P; Stumvoll, M; Blüher, M; Klöting, N

2016-12-01

The genetic architecture of obesity is multifactorial. We have previously identified a quantitative trait locus (QTL) on rat chromosome 10 in a F2 cross of Wistar Ottawa Karlsburg (WOKW) and Dark Agouti (DA) rats responsible for obesity-related traits. The QTL was confirmed in congenic DA.WOKW10 rats. To pinpoint the region carrying causal genes, we established two new subcongenic lines, L1 and L2, with smaller refined segments of chromosome 10 to identify novel candidate genes. All lines were extensively characterized under different diet conditions. We employed transcriptome analysis in visceral adipose tissue (VAT) by RNA-Seq technology to identify potential underlying genes in the segregating regions. Three candidate genes were measured in human paired samples of VAT and subcutaneous (SC) AT (SAT) (N=304) individuals with a wide range of body weight and glucose homeostasis parameters. DA.WOKW and L1 subcongenic lines were protected against body fat gain under high-fat diet (HFD), whereas L2 and DA had significantly more body fat after high-fat feeding. Interestingly, adipocyte size distribution in SAT and epigonadal AT of L1 subcongenic rats did not undergo typical ballooning under HFD and the number of preadipocytes in AT was significantly elevated in L2 compared with L1 and parental rats. Transcriptome analysis identified three candidate genes in VAT on rat chromosome 10. In humans, these candidate genes were differentially expressed between SAT and VAT. Moreover, HID1 mRNA significantly correlates with parameters of obesity and glucose metabolism. Our data suggest novel candidate genes for obesity that map on rat chromosome 10 in an interval 102.2-104.7 Mb and are strongly associated with body fat mass regulation, preadipocyte number and adipocyte size in rats. Among those genes, AT head involution defective (HID1) mRNA expression may be relevant for human fat distribution and glucose homeostasis.
Gene expression meta-analysis identifies chromosomal regions and candidate genes involved in breast cancer metastasis.

PubMed

Thomassen, Mads; Tan, Qihua; Kruse, Torben A

2009-01-01

Breast cancer cells exhibit complex karyotypic alterations causing deregulation of numerous genes. Some of these genes are probably causal for cancer formation and local growth whereas others are causal for the various steps of metastasis. In a fraction of tumors deregulation of the same genes might be caused by epigenetic modulations, point mutations or the influence of other genes. We have investigated the relation of gene expression and chromosomal position, using eight datasets including more than 1200 breast tumors, to identify chromosomal regions and candidate genes possibly causal for breast cancer metastasis. By use of "Gene Set Enrichment Analysis" we have ranked chromosomal regions according to their relation to metastasis. Overrepresentation analysis identified regions with increased expression for chromosome 1q41-42, 8q24, 12q14, 16q22, 16q24, 17q12-21.2, 17q21-23, 17q25, 20q11, and 20q13 among metastasizing tumors and reduced gene expression at 1p31-21, 8p22-21, and 14q24. By analysis of genes with extremely imbalanced expression in these regions we identified DIRAS3 at 1p31, PSD3, LPL, EPHX2 at 8p21-22, and FOS at 14q24 as candidate metastasis suppressor genes. Potential metastasis promoting genes includes RECQL4 at 8q24, PRMT7 at 16q22, GINS2 at 16q24, and AURKA at 20q13.
Analysis of Craniocardiac Malformations in Xenopus using Optical Coherence Tomography

PubMed Central

Deniz, Engin; Jonas, Stephan; Hooper, Michael; N. Griffin, John; Choma, Michael A.; Khokha, Mustafa K.

2017-01-01

Birth defects affect 3% of children in the United States. Among the birth defects, congenital heart disease and craniofacial malformations are major causes of mortality and morbidity. Unfortunately, the genetic mechanisms underlying craniocardiac malformations remain largely uncharacterized. To address this, human genomic studies are identifying sequence variations in patients, resulting in numerous candidate genes. However, the molecular mechanisms of pathogenesis for most candidate genes are unknown. Therefore, there is a need for functional analyses in rapid and efficient animal models of human disease. Here, we coupled the frog Xenopus tropicalis with Optical Coherence Tomography (OCT) to create a fast and efficient system for testing craniocardiac candidate genes. OCT can image cross-sections of microscopic structures in vivo at resolutions approaching histology. Here, we identify optimal OCT imaging planes to visualize and quantitate Xenopus heart and facial structures establishing normative data. Next we evaluate known human congenital heart diseases: cardiomyopathy and heterotaxy. Finally, we examine craniofacial defects by a known human teratogen, cyclopamine. We recapitulate human phenotypes readily and quantify the functional and structural defects. Using this approach, we can quickly test human craniocardiac candidate genes for phenocopy as a critical first step towards understanding disease mechanisms of the candidate genes. PMID:28195132
“Soldier's Heart”: A Genetic Basis for Elevated Cardiovascular Disease Risk Associated with Post-traumatic Stress Disorder

PubMed Central

Pollard, Harvey B.; Shivakumar, Chittari; Starr, Joshua; Eidelman, Ofer; Jacobowitz, David M.; Dalgard, Clifton L.; Srivastava, Meera; Wilkerson, Matthew D.; Stein, Murray B.; Ursano, Robert J.

2016-01-01

“Soldier's Heart,” is an American Civil War term linking post-traumatic stress disorder (PTSD) with increased propensity for cardiovascular disease (CVD). We have hypothesized that there might be a quantifiable genetic basis for this linkage. To test this hypothesis we identified a comprehensive set of candidate risk genes for PTSD, and tested whether any were also independent risk genes for CVD. A functional analysis algorithm was used to identify associated signaling networks. We identified 106 PTSD studies that report one or more polymorphic variants in 87 candidate genes in 83,463 subjects and controls. The top upstream drivers for these PTSD risk genes are predicted to be the glucocorticoid receptor (NR3C1) and Tumor Necrosis Factor alpha (TNFA). We find that 37 of the PTSD candidate risk genes are also candidate independent risk genes for CVD. The association between PTSD and CVD is significant by Fisher's Exact Test (P = 3 × 10−54). We also find 15 PTSD risk genes that are independently associated with Type 2 Diabetes Mellitus (T2DM; also significant by Fisher's Exact Test (P = 1.8 × 10−16). Our findings offer quantitative evidence for a genetic link between post-traumatic stress and cardiovascular disease, Computationally, the common mechanism for this linkage between PTSD and CVD is innate immunity and NFκB-mediated inflammation. PMID:27721742
"Soldier's Heart": A Genetic Basis for Elevated Cardiovascular Disease Risk Associated with Post-traumatic Stress Disorder.

PubMed

Pollard, Harvey B; Shivakumar, Chittari; Starr, Joshua; Eidelman, Ofer; Jacobowitz, David M; Dalgard, Clifton L; Srivastava, Meera; Wilkerson, Matthew D; Stein, Murray B; Ursano, Robert J

2016-01-01

"Soldier's Heart," is an American Civil War term linking post-traumatic stress disorder (PTSD) with increased propensity for cardiovascular disease (CVD). We have hypothesized that there might be a quantifiable genetic basis for this linkage. To test this hypothesis we identified a comprehensive set of candidate risk genes for PTSD, and tested whether any were also independent risk genes for CVD. A functional analysis algorithm was used to identify associated signaling networks. We identified 106 PTSD studies that report one or more polymorphic variants in 87 candidate genes in 83,463 subjects and controls. The top upstream drivers for these PTSD risk genes are predicted to be the glucocorticoid receptor (NR3C1) and Tumor Necrosis Factor alpha (TNFA). We find that 37 of the PTSD candidate risk genes are also candidate independent risk genes for CVD. The association between PTSD and CVD is significant by Fisher's Exact Test ( P = 3 × 10 -54 ). We also find 15 PTSD risk genes that are independently associated with Type 2 Diabetes Mellitus (T2DM; also significant by Fisher's Exact Test ( P = 1.8 × 10 -16 ). Our findings offer quantitative evidence for a genetic link between post-traumatic stress and cardiovascular disease, Computationally, the common mechanism for this linkage between PTSD and CVD is innate immunity and NFκB-mediated inflammation.
A public platform for the verification of the phenotypic effect of candidate genes for resistance to aflatoxin accumulation and Aspergillus flavus infection in maize

USDA-ARS?s Scientific Manuscript database

A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of any maize gene sequence with resistance under field conditions. Reso...

SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

Treesearch

Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart

2016-01-01

Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...
Utilizing Gene Tree Variation to Identify Candidate Effector Genes in Zymoseptoria tritici

PubMed Central

McDonald, Megan C.; McGinness, Lachlan; Hane, James K.; Williams, Angela H.; Milgate, Andrew; Solomon, Peter S.

2016-01-01

Zymoseptoria tritici is a host-specific, necrotrophic pathogen of wheat. Infection by Z. tritici is characterized by its extended latent period, which typically lasts 2 wks, and is followed by extensive host cell death, and rapid proliferation of fungal biomass. This work characterizes the level of genomic variation in 13 isolates, for which we have measured virulence on 11 wheat cultivars with differential resistance genes. Between the reference isolate, IPO323, and the 13 Australian isolates we identified over 800,000 single nucleotide polymorphisms, of which ∼10% had an effect on the coding regions of the genome. Furthermore, we identified over 1700 probable presence/absence polymorphisms in genes across the Australian isolates using de novo assembly. Finally, we developed a gene tree sorting method that quickly identifies groups of isolates within a single gene alignment whose sequence haplotypes correspond with virulence scores on a single wheat cultivar. Using this method, we have identified < 100 candidate effector genes whose gene sequence correlates with virulence toward a wheat cultivar carrying a major resistance gene. PMID:26837952
A role for genetic susceptibility in sporadic focal segmental glomerulosclerosis

PubMed Central

Yu, Haiyang; Artomov, Mykyta; Brähler, Sebastian; Stander, M. Christine; Shamsan, Ghaidan; Sampson, Matthew G.; White, J. Michael; Kretzler, Matthias; Jain, Sanjay; Winkler, Cheryl A.; Mitra, Robi D.; Daly, Mark J.; Shaw, Andrey S.

2016-01-01

Focal segmental glomerulosclerosis (FSGS) is a syndrome that involves kidney podocyte dysfunction and causes chronic kidney disease. Multiple factors including chemical toxicity, inflammation, and infection underlie FSGS; however, highly penetrant disease genes have been identified in a small fraction of patients with a family history of FSGS. Variants of apolipoprotein L1 (APOL1) have been linked to FSGS in African Americans with HIV or hypertension, supporting the proposal that genetic factors enhance FSGS susceptibility. Here, we used sequencing to investigate whether genetics plays a role in the majority of FSGS cases that are identified as primary or sporadic FSGS and have no known cause. Given the limited number of biopsy-proven cases with ethnically matched controls, we devised an analytic strategy to identify and rank potential candidate genes and used an animal model for validation. Nine candidate FSGS susceptibility genes were identified in our patient cohort, and three were validated using a high-throughput mouse method that we developed. Specifically, we introduced a podocyte-specific, doxycycline-inducible transactivator into a murine embryonic stem cell line with an FSGS-susceptible genetic background that allows shRNA-mediated targeting of candidate genes in the adult kidney. Our analysis supports a broader role for genetic susceptibility of both sporadic and familial cases of FSGS and provides a tool to rapidly evaluate candidate FSGS-associated genes. PMID:26901816
Identification of Novel Associations of Candidate Genes with Resistance to Late Blight in Solanum tuberosum Group Phureja

PubMed Central

Álvarez, María F.; Angarita, Myrian; Delgado, María C.; García, Celsa; Jiménez-Gomez, José; Gebhardt, Christiane; Mosquera, Teresa

2017-01-01

The genetic basis of quantitative disease resistance has been studied in crops for several decades as an alternative to R gene mediated resistance. The most important disease in the potato crop is late blight, caused by the oomycete Phytophthora infestans. Quantitative disease resistance (QDR), as any other quantitative trait in plants, can be genetically mapped to understand the genetic architecture. Association mapping using DNA-based markers has been implemented in many crops to dissect quantitative traits. We used an association mapping approach with candidate genes to identify the first genes associated with quantitative resistance to late blight in Solanum tuberosum Group Phureja. Twenty-nine candidate genes were selected from a set of genes that were differentially expressed during the resistance response to late blight in tetraploid European potato cultivars. The 29 genes were amplified and sequenced in 104 accessions of S. tuberosum Group Phureja from Latin America. We identified 238 SNPs in the selected genes and tested them for association with resistance to late blight. The phenotypic data were obtained under field conditions by determining the area under disease progress curve (AUDPC) in two seasons and in two locations. Two genes were associated with QDR to late blight, a potato homolog of thylakoid lumen 15 kDa protein (StTL15A) and a stem 28 kDa glycoprotein (StGP28). Key message: A first association mapping experiment was conducted in Solanum tuberosum Group Phureja germplasm, which identified among 29 candidates two genes associated with quantitative resistance to late blight. PMID:28674545
Photoreceptor dysplasia (pd) in miniature schnauzer dogs: evaluation of candidate genes by molecular genetic analysis.

PubMed

Zhang, Q; Baldwin, V J; Acland, G M; Parshall, C J; Haskel, J; Aguirre, G D; Ray, K

1999-01-01

Photoreceptor dysplasia (pd) is one of a group of at least six distinct autosomal and one X-linked retinal disorders identified in dogs which are collectively known as progressive retinal atrophy (PRA). It is an early onset retinal disease identified in miniature schnauzer dogs, and pedigree analysis and breeding studies have established autosomal recessive inheritance of the disease. Using a gene-based approach, a number of retina-expressed genes, including some members of the phototransduction pathway, have been causally implicated in retinal diseases of humans and other animals. Here we examined seven such potential candidate genes (opsin, RDS/peripherin, ROM1, rod cGMP-gated cation channel alpha-subunit, and three subunits of transducin) for their causal association with the pd locus by testing segregation of intragenic markers with the disease locus, or, in the absence of informative polymorphisms, sequencing of the coding regions of the genes. Based on these results, we have conclusively excluded four photoreceptor-specific genes as candidates for pd by linkage analysis. For three other photoreceptor-specific genes, we did not find any mutation in the coding sequences of the genes and have excluded them provisionally. Formal exclusion would require investigation of the levels of expression of the candidate genes in pd-affected dogs relative to age-matched controls. At present we are building suitable informative pedigrees for the disease locus with a sufficient number of meiosis to be useful for genomewide screening. This should identify markers linked to the disease locus and eventually permit progress toward the identification of the photoreceptor dysplasia gene and the disease-causing mutation.
Identification of Novel Associations of Candidate Genes with Resistance to Late Blight in Solanum tuberosum Group Phureja.

PubMed

Álvarez, María F; Angarita, Myrian; Delgado, María C; García, Celsa; Jiménez-Gomez, José; Gebhardt, Christiane; Mosquera, Teresa

2017-01-01

The genetic basis of quantitative disease resistance has been studied in crops for several decades as an alternative to R gene mediated resistance. The most important disease in the potato crop is late blight, caused by the oomycete Phytophthora infestans. Quantitative disease resistance (QDR), as any other quantitative trait in plants, can be genetically mapped to understand the genetic architecture. Association mapping using DNA-based markers has been implemented in many crops to dissect quantitative traits. We used an association mapping approach with candidate genes to identify the first genes associated with quantitative resistance to late blight in Solanum tuberosum Group Phureja. Twenty-nine candidate genes were selected from a set of genes that were differentially expressed during the resistance response to late blight in tetraploid European potato cultivars. The 29 genes were amplified and sequenced in 104 accessions of S. tuberosum Group Phureja from Latin America. We identified 238 SNPs in the selected genes and tested them for association with resistance to late blight. The phenotypic data were obtained under field conditions by determining the area under disease progress curve (AUDPC) in two seasons and in two locations. Two genes were associated with QDR to late blight, a potato homolog of thylakoid lumen 15 kDa protein ( StTL15A ) and a stem 28 kDa glycoprotein ( StGP28 ). Key message : A first association mapping experiment was conducted in Solanum tuberosum Group Phureja germplasm, which identified among 29 candidates two genes associated with quantitative resistance to late blight.
Adaptation to climate through flowering phenology: a case study in Medicago truncatula.

PubMed

Burgarella, Concetta; Chantret, Nathalie; Gay, Laurène; Prosperi, Jean-Marie; Bonhomme, Maxime; Tiffin, Peter; Young, Nevin D; Ronfort, Joelle

2016-07-01

Local climatic conditions likely constitute an important selective pressure on genes underlying important fitness-related traits such as flowering time, and in many species, flowering phenology and climatic gradients strongly covary. To test whether climate shapes the genetic variation on flowering time genes and to identify candidate flowering genes involved in the adaptation to environmental heterogeneity, we used a large Medicago truncatula core collection to examine the association between nucleotide polymorphisms at 224 candidate genes and both climate variables and flowering phenotypes. Unlike genome-wide studies, candidate gene approaches are expected to enrich for the number of meaningful trait associations because they specifically target genes that are known to affect the trait of interest. We found that flowering time mediates adaptation to climatic conditions mainly by variation at genes located upstream in the flowering pathways, close to the environmental stimuli. Variables related to the annual precipitation regime reflected selective constraints on flowering time genes better than the other variables tested (temperature, altitude, latitude or longitude). By comparing phenotype and climate associations, we identified 12 flowering genes as the most promising candidates responsible for phenological adaptation to climate. Four of these genes were located in the known flowering time QTL region on chromosome 7. However, climate and flowering associations also highlighted largely distinct gene sets, suggesting different genetic architectures for adaptation to climate and flowering onset. © 2016 John Wiley & Sons Ltd.
Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors

PubMed Central

Bii, Victor M.; Trobridge, Grant D.

2016-01-01

Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types. PMID:27792127
Candidate Loci for Yield-Related Traits in Maize Revealed by a Combination of MetaQTL Analysis and Regional Association Mapping

PubMed Central

Chen, Lin; An, Yixin; Li, Yong-xiang; Li, Chunhui; Shi, Yunsu; Song, Yanchun; Zhang, Dengfeng; Wang, Tianyu; Li, Yu

2017-01-01

Maize grain yield and related traits are complex and are controlled by a large number of genes of small effect or quantitative trait loci (QTL). Over the years, a large number of yield-related QTLs have been identified in maize and deposited in public databases. However, integrating and re-analyzing these data and mining candidate loci for yield-related traits has become a major issue in maize. In this study, we collected information on QTLs conferring maize yield-related traits from 33 published studies. Then, 999 of these QTLs were iteratively projected and subjected to meta-analysis to obtain metaQTLs (MQTLs). A total of 76 MQTLs were found across the maize genome. Based on a comparative genomics strategy, several maize orthologs of rice yield-related genes were identified in these MQTL regions. Furthermore, three potential candidate genes (Gene ID: GRMZM2G359974, GRMZM2G301884, and GRMZM2G083894) associated with kernel size and weight within three MQTL regions were identified using regional association mapping, based on the results of the meta-analysis. This strategy, combining MQTL analysis and regional association mapping, is helpful for functional marker development and rapid identification of candidate genes or loci. PMID:29312420
No Association between Personality and Candidate Gene Polymorphisms in a Wild Bird Population

PubMed Central

Durieux, Gillian; Burke, Terry; Dugdale, Hannah L.

2015-01-01

Consistency of between-individual differences in behaviour or personality is a phenomenon in populations that can have ecological consequences and evolutionary potential. One way that behaviour can evolve is to have a genetic basis. Identifying the molecular genetic basis of personality could therefore provide insight into how and why such variation is maintained, particularly in natural populations. Previously identified candidate genes for personality in birds include the dopamine receptor D4 (DRD4), and serotonin transporter (SERT). Studies of wild bird populations have shown that exploratory and bold behaviours are associated with polymorphisms in both DRD4 and SERT. Here we tested for polymorphisms in DRD4 and SERT in the Seychelles warbler (Acrocephalus sechellensis) population on Cousin Island, Seychelles, and then investigated correlations between personality and polymorphisms in these genes. We found no genetic variation in DRD4, but identified four polymorphisms in SERT that clustered into five haplotypes. There was no correlation between bold or exploratory behaviours and SERT polymorphisms/haplotypes. The null result was not due to lack of power, and indicates that there was no association between these behaviours and variation in the candidate genes tested in this population. These null findings provide important data to facilitate representative future meta-analyses on candidate personality genes. PMID:26473495
Integrative Approach to Pain Genetics Identifies Pain Sensitivity Loci across Diseases

PubMed Central

Ruau, David; Dudley, Joel T.; Chen, Rong; Phillips, Nicholas G.; Swan, Gary E.; Lazzeroni, Laura C.; Clark, J. David

2012-01-01

Identifying human genes relevant for the processing of pain requires difficult-to-conduct and expensive large-scale clinical trials. Here, we examine a novel integrative paradigm for data-driven discovery of pain gene candidates, taking advantage of the vast amount of existing disease-related clinical literature and gene expression microarray data stored in large international repositories. First, thousands of diseases were ranked according to a disease-specific pain index (DSPI), derived from Medical Subject Heading (MESH) annotations in MEDLINE. Second, gene expression profiles of 121 of these human diseases were obtained from public sources. Third, genes with expression variation significantly correlated with DSPI across diseases were selected as candidate pain genes. Finally, selected candidate pain genes were genotyped in an independent human cohort and prospectively evaluated for significant association between variants and measures of pain sensitivity. The strongest signal was with rs4512126 (5q32, ABLIM3, P = 1.3×10−10) for the sensitivity to cold pressor pain in males, but not in females. Significant associations were also observed with rs12548828, rs7826700 and rs1075791 on 8q22.2 within NCALD (P = 1.7×10−4, 1.8×10−4, and 2.2×10−4 respectively). Our results demonstrate the utility of a novel paradigm that integrates publicly available disease-specific gene expression data with clinical data curated from MEDLINE to facilitate the discovery of pain-relevant genes. This data-derived list of pain gene candidates enables additional focused and efficient biological studies validating additional candidates. PMID:22685391
A new mutation identified in SPATA16 in two globozoospermic patients.

PubMed

ElInati, Elias; Fossard, Camille; Okutman, Ozlem; Ghédir, Houda; Ibala-Romdhane, Samira; Ray, Pierre F; Saad, Ali; Hennebicq, Sylvianne; Viville, Stéphane

2016-06-01

The aim of this study is to identify potential genes involved in human globozoopsermia. Nineteen globozoospermic patients (previously screened for DPY19L2 mutations with no causative mutation) were recruited in this study and screened for mutations in genes implicated in human globozoospermia SPATA16 and PICK1. Using the candidate gene approach and the determination of Spata16 partners by Glutathione S-transferase (GST) pull-down four genes were also selected and screened for mutations. We identified a novel mutation of SPATA16: deletion of 22.6 Kb encompassing the first coding exon in two unrelated Tunisian patients who presented the same deletion breakpoints. The two patients shared the same haplotype, suggesting a possible ancestral founder effect for this new deletion. Four genes were selected using the candidate gene approach and the GST pull-down (GOPC, PICK1, AGFG1 and IRGC) and were screened for mutation, but no variation was identified. The present study confirms the pathogenicity of the SPATA16 mutations. The fact that no variation was detected in the coding sequence of AFGF1, GOPC, PICK1 and IRGC does not mean that they are not involved in human globozoospermia. A larger globozoospermic cohort must be studied in order to accelerate the process of identifying new genes involved in such phenotypes. Until sufficient numbers of patients have been screened, AFGF1, GOPC, PICK1 and IRGC should still be considered as candidate genes.
Genetic Variation and Recent Positive Selection in Worldwide Human Populations: Evidence from Nearly 1 Million SNPs

PubMed Central

Theunert, Christoph; Pugach, Irina; Li, Jing; Nandineni, Madhusudan R.; Gross, Arnd; Scholz, Markus; Stoneking, Mark

2009-01-01

Background Genome-wide scans of hundreds of thousands of single-nucleotide polymorphisms (SNPs) have resulted in the identification of new susceptibility variants to common diseases and are providing new insights into the genetic structure and relationships of human populations. Moreover, genome-wide data can be used to search for signals of recent positive selection, thereby providing new insights into the genetic adaptations that occurred as modern humans spread out of Africa and around the world. Methodology We genotyped approximately 500,000 SNPs in 255 individuals (5 individuals from each of 51 worldwide populations) from the Human Genome Diversity Panel (HGDP-CEPH). When merged with non-overlapping SNPs typed previously in 250 of these same individuals, the resulting data consist of over 950,000 SNPs. We then analyzed the genetic relationships and ancestry of individuals without assigning them to populations, and we also identified candidate regions of recent positive selection at both the population and regional (continental) level. Conclusions Our analyses both confirm and extend previous studies; in particular, we highlight the impact of various dispersals, and the role of substructure in Africa, on human genetic diversity. We also identified several novel candidate regions for recent positive selection, and a gene ontology (GO) analysis identified several GO groups that were significantly enriched for such candidate genes, including immunity and defense related genes, sensory perception genes, membrane proteins, signal receptors, lipid binding/metabolism genes, and genes involved in the nervous system. Among the novel candidate genes identified are two genes involved in the thyroid hormone pathway that show signals of selection in African Pygmies that may be related to their short stature. PMID:19924308
Population Structure and Domestication Revealed by High-Depth Resequencing of Korean Cultivated and Wild Soybean Genomes†

PubMed Central

Chung, Won-Hyong; Jeong, Namhee; Kim, Jiwoong; Lee, Woo Kyu; Lee, Yun-Gyeong; Lee, Sang-Heon; Yoon, Woongchang; Kim, Jin-Hyun; Choi, Ik-Young; Choi, Hong-Kyu; Moon, Jung-Kyung; Kim, Namshin; Jeong, Soon-Chun

2014-01-01

Despite the importance of soybean as a major crop, genome-wide variation and evolution of cultivated soybeans are largely unknown. Here, we catalogued genome variation in an annual soybean population by high-depth resequencing of 10 cultivated and 6 wild accessions and obtained 3.87 million high-quality single-nucleotide polymorphisms (SNPs) after excluding the sites with missing data in any accession. Nuclear genome phylogeny supported a single origin for the cultivated soybeans. We identified 10-fold longer linkage disequilibrium (LD) in the wild soybean relative to wild maize and rice. Despite the small population size, the long LD and large SNP data allowed us to identify 206 candidate domestication regions with significantly lower diversity in the cultivated, but not in the wild, soybeans. Some of the genes in these candidate regions were associated with soybean homologues of canonical domestication genes. However, several examples, which are likely specific to soybean or eudicot crop plants, were also observed. Consequently, the variation data identified in this study should be valuable for breeding and for identifying agronomically important genes in soybeans. However, the long LD of wild soybeans may hinder pinpointing causal gene(s) in the candidate regions. PMID:24271940
Differences in candidate gene association between European ancestry and African American asthmatic children.

PubMed

Baye, Tesfaye M; Butsch Kovacic, Melinda; Biagini Myers, Jocelyn M; Martin, Lisa J; Lindsey, Mark; Patterson, Tia L; He, Hua; Ericksen, Mark B; Gupta, Jayanta; Tsoras, Anna M; Lindsley, Andrew; Rothenberg, Marc E; Wills-Karp, Marsha; Eissa, N Tony; Borish, Larry; Khurana Hershey, Gurjit K

2011-02-28

Candidate gene case-control studies have identified several single nucleotide polymorphisms (SNPs) that are associated with asthma susceptibility. Most of these studies have been restricted to evaluations of specific SNPs within a single gene and within populations from European ancestry. Recently, there is increasing interest in understanding racial differences in genetic risk associated with childhood asthma. Our aim was to compare association patterns of asthma candidate genes between children of European and African ancestry. Using a custom-designed Illumina SNP array, we genotyped 1,485 children within the Greater Cincinnati Pediatric Clinic Repository and Cincinnati Genomic Control Cohort for 259 SNPs in 28 genes and evaluated their associations with asthma. We identified 14 SNPs located in 6 genes that were significantly associated (p-values <0.05) with childhood asthma in African Americans. Among Caucasians, 13 SNPs in 5 genes were associated with childhood asthma. Two SNPs in IL4 were associated with asthma in both races (p-values <0.05). Gene-gene interaction studies identified race specific sets of genes that best discriminate between asthmatic children and non-allergic controls. We identified IL4 as having a role in asthma susceptibility in both African American and Caucasian children. However, while IL4 SNPs were associated with asthma in asthmatic children with European and African ancestry, the relative contributions of the most replicated asthma-associated SNPs varied by ancestry. These data provides valuable insights into the pathways that may predispose to asthma in individuals with European vs. African ancestry.
Candidate genes and molecular markers associated with heat tolerance in colonial Bentgrass.

PubMed

Jespersen, David; Belanger, Faith C; Huang, Bingru

2017-01-01

Elevated temperature is a major abiotic stress limiting the growth of cool-season grasses during the summer months. The objectives of this study were to determine the genetic variation in the expression patterns of selected genes involved in several major metabolic pathways regulating heat tolerance for two genotypes contrasting in heat tolerance to confirm their status as potential candidate genes, and to identify PCR-based markers associated with candidate genes related to heat tolerance in a colonial (Agrostis capillaris L.) x creeping bentgrass (Agrostis stolonifera L.) hybrid backcross population. Plants were subjected to heat stress in controlled-environmental growth chambers for phenotypic evaluation and determination of genetic variation in candidate gene expression. Molecular markers were developed for genes involved in protein degradation (cysteine protease), antioxidant defense (catalase and glutathione-S-transferase), energy metabolism (glyceraldehyde-3-phosphate dehydrogenase), cell expansion (expansin), and stress protection (heat shock proteins HSP26, HSP70, and HSP101). Kruskal-Wallis analysis, a commonly used non-parametric test used to compare population individuals with or without the gene marker, found the physiological traits of chlorophyll content, electrolyte leakage, normalized difference vegetative index, and turf quality were associated with all candidate gene markers with the exception of HSP101. Differential gene expression was frequently found for the tested candidate genes. The development of candidate gene markers for important heat tolerance genes may allow for the development of new cultivars with increased abiotic stress tolerance using marker-assisted selection.
Candidate genes and molecular markers associated with heat tolerance in colonial Bentgrass

PubMed Central

Jespersen, David; Belanger, Faith C.; Huang, Bingru

2017-01-01

Elevated temperature is a major abiotic stress limiting the growth of cool-season grasses during the summer months. The objectives of this study were to determine the genetic variation in the expression patterns of selected genes involved in several major metabolic pathways regulating heat tolerance for two genotypes contrasting in heat tolerance to confirm their status as potential candidate genes, and to identify PCR-based markers associated with candidate genes related to heat tolerance in a colonial (Agrostis capillaris L.) x creeping bentgrass (Agrostis stolonifera L.) hybrid backcross population. Plants were subjected to heat stress in controlled-environmental growth chambers for phenotypic evaluation and determination of genetic variation in candidate gene expression. Molecular markers were developed for genes involved in protein degradation (cysteine protease), antioxidant defense (catalase and glutathione-S-transferase), energy metabolism (glyceraldehyde-3-phosphate dehydrogenase), cell expansion (expansin), and stress protection (heat shock proteins HSP26, HSP70, and HSP101). Kruskal-Wallis analysis, a commonly used non-parametric test used to compare population individuals with or without the gene marker, found the physiological traits of chlorophyll content, electrolyte leakage, normalized difference vegetative index, and turf quality were associated with all candidate gene markers with the exception of HSP101. Differential gene expression was frequently found for the tested candidate genes. The development of candidate gene markers for important heat tolerance genes may allow for the development of new cultivars with increased abiotic stress tolerance using marker-assisted selection. PMID:28187136
Selection and Validation of Reference Genes for qRT-PCR Expression Analysis of Candidate Genes Involved in Olfactory Communication in the Butterfly Bicyclus anynana

PubMed Central

Arun, Alok; Baumlé, Véronique; Amelot, Gaël; Nieberding, Caroline M.

2015-01-01

Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae), two developmental stages (pupal and adult) and two sexes (male and female), all of which were subjected to two food treatments (food stress and control feeding ad libitum). The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the expression profile of the target candidate genes. PMID:25793735
Selection and validation of reference genes for qRT-PCR expression analysis of candidate genes involved in olfactory communication in the butterfly Bicyclus anynana.

PubMed

Arun, Alok; Baumlé, Véronique; Amelot, Gaël; Nieberding, Caroline M

2015-01-01

Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae), two developmental stages (pupal and adult) and two sexes (male and female), all of which were subjected to two food treatments (food stress and control feeding ad libitum). The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the expression profile of the target candidate genes.
Association of candidate genes with drought tolerance traits in diverse perennial ryegrass accessions

PubMed Central

Jiang, Yiwei

2013-01-01

Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought tolerance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted in 192 diverse perennial ryegrass (Lolium perenne L.) accessions from 43 countries. The panel showed significant variations in leaf wilting, leaf water content, canopy and air temperature difference, and chlorophyll fluorescence under well-watered and drought conditions across six environments. Analysis of 109 simple sequence repeat markers revealed five population structures in the mapping panel. A total of 2520 expression-based sequence readings were obtained for a set of candidate genes involved in antioxidant metabolism, dehydration, water movement across membranes, and signal transduction, from which 346 single nucleotide polymorphisms were identified. Significant associations were identified between a putative LpLEA3 encoding late embryogenesis abundant group 3 protein and a putative LpFeSOD encoding iron superoxide dismutase and leaf water content, as well as between a putative LpCyt Cu-ZnSOD encoding cytosolic copper-zinc superoxide dismutase and chlorophyll fluorescence under drought conditions. Four of these identified significantly associated single nucleotide polymorphisms from these three genes were also translated to amino acid substitutions in different genotypes. These results indicate that allelic variation in these genes may affect whole-plant response to drought stress in perennial ryegrass. PMID:23386684

Transcriptome-wide characterization of candidate genes for improving the water use efficiency of energy crops grown on semiarid land.

PubMed

Fan, Yangyang; Wang, Qian; Kang, Lifang; Liu, Wei; Xu, Qin; Xing, Shilai; Tao, Chengcheng; Song, Zhihong; Zhu, Caiyun; Lin, Cong; Yan, Juan; Li, Jianqiang; Sang, Tao

2015-10-01

Understanding the genetic basis of water use efficiency (WUE) and its roles in plant adaptation to a drought environment is essential for the production of second-generation energy crops in water-deficit marginal land. In this study, RNA-Seq and WUE measurements were performed for 78 individuals of Miscanthus lutarioriparius grown in two common gardens, one located in warm and wet Central China near the native habitats of the species and the other located in the semiarid Loess Plateau, the domestication site of the energy crop. The field measurements showed that WUE of M. lutarioriparius in the semiarid location was significantly higher than that in the wet location. A matrix correlation analysis was conducted between gene expression levels and WUE to identify candidate genes involved in the improvement of WUE from the native to the domestication site. A total of 48 candidate genes were identified and assigned to functional categories, including photosynthesis, stomatal regulation, protein metabolism, and abiotic stress responses. Of these genes, nearly 73% were up-regulated in the semiarid site. It was also found that the relatively high expression variation of the WUE-related genes was affected to a larger extent by environment than by genetic variation. The study demonstrates that transcriptome-wide correlation between physiological phenotypes and expression levels offers an effective means for identifying candidate genes involved in the adaptation to environmental changes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Functional dissection of drought-responsive gene expression patterns in Cynodon dactylon L.

PubMed

Kim, Changsoo; Lemke, Cornelia; Paterson, Andrew H

2009-05-01

Water deficit is one of the main abiotic factors that affect plant productivity in subtropical regions. To identify genes induced during the water stress response in Bermudagrass (Cynodon dactylon), cDNA macroarrays were used. The macroarray analysis identified 189 drought-responsive candidate genes from C. dactylon, of which 120 were up-regulated and 69 were down-regulated. The candidate genes were classified into seven groups by cluster analysis of expression levels across two intensities and three durations of imposed stress. Annotation using BLASTX suggested that up-regulated genes may be involved in proline biosynthesis, signal transduction pathways, protein repair systems, and removal of toxins, while down-regulated genes were mostly related to basic plant metabolism such as photosynthesis and glycolysis. The functional classification of gene ontology (GO) was consistent with the BLASTX results, also suggesting some crosstalk between abiotic and biotic stress. Comparative analysis of cis-regulatory elements from the candidate genes implicated specific elements in drought response in Bermudagrass. Although only a subset of genes was studied, Bermudagrass shared many drought-responsive genes and cis-regulatory elements with other botanical models, supporting a strategy of cross-taxon application of drought-responsive genes, regulatory cues, and physiological-genetic information.
Candidate gene association mapping for winter survival and spring regrowth in perennial ryegrass

USDA-ARS?s Scientific Manuscript database

Perennial ryegrass (Lolium perenne L.) is a widely cultivated cool-season grass species because of its high quality for forage and turf. Susceptibility to freezing damage limits its further use in temperate zones. The objective of this study was to identify candidate genes significantly associated w...
Evaluating Reported Candidate Gene Associations with Polycystic Ovary Syndrome

PubMed Central

Pau, Cindy; Saxena, Richa; Welt, Corrine Kolka

2013-01-01

Objective To replicate variants in candidate genes associated with PCOS in a population of European PCOS and control subjects. Design Case-control association analysis and meta-analysis. Setting Major academic hospital Patients Women of European ancestry with PCOS (n=525) and controls (n=472), aged 18 to 45 years. Intervention Variants previously associated with PCOS in candidate gene studies were genotyped (n=39). Metabolic, reproductive and anthropomorphic parameters were examined as a function of the candidate variants. All genetic association analyses were adjusted for age, BMI and ancestry and were reported after correction for multiple testing. Main Outcome Measure Association of candidate gene variants with PCOS. Results Three variants, rs3797179 (SRD5A1), rs12473543 (POMC), and rs1501299 (ADIPOQ), were nominally associated with PCOS. However, they did not remain significant after correction for multiple testing and none of the variants replicated in a sufficiently powered meta-analysis. Variants in the FBN3 gene (rs17202517 and rs73503752) were associated with smaller waist circumferences and variant rs727428 in the SHBG gene was associated with lower SHBG levels. Conclusion Previously identified variants in candidate genes do not appear to be associated with PCOS risk. PMID:23375202
Mining biological databases for candidate disease genes

NASA Astrophysics Data System (ADS)

Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

2001-07-01

The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).
A Novel Strategy for Selection and Validation of Reference Genes in Dynamic Multidimensional Experimental Design in Yeast

PubMed Central

Cankorur-Cetinkaya, Ayca; Dereli, Elif; Eraslan, Serpil; Karabekmez, Erkan; Dikicioglu, Duygu; Kirdar, Betul

2012-01-01

Background Understanding the dynamic mechanism behind the transcriptional organization of genes in response to varying environmental conditions requires time-dependent data. The dynamic transcriptional response obtained by real-time RT-qPCR experiments could only be correctly interpreted if suitable reference genes are used in the analysis. The lack of available studies on the identification of candidate reference genes in dynamic gene expression studies necessitates the identification and the verification of a suitable gene set for the analysis of transient gene expression response. Principal Findings In this study, a candidate reference gene set for RT-qPCR analysis of dynamic transcriptional changes in Saccharomyces cerevisiae was determined using 31 different publicly available time series transcriptome datasets. Ten of the twelve candidates (TPI1, FBA1, CCW12, CDC19, ADH1, PGK1, GCN4, PDC1, RPS26A and ARF1) we identified were not previously reported as potential reference genes. Our method also identified the commonly used reference genes ACT1 and TDH3. The most stable reference genes from this pool were determined as TPI1, FBA1, CDC19 and ACT1 in response to a perturbation in the amount of available glucose and as FBA1, TDH3, CCW12 and ACT1 in response to a perturbation in the amount of available ammonium. The use of these newly proposed gene sets outperformed the use of common reference genes in the determination of dynamic transcriptional response of the target genes, HAP4 and MEP2, in response to relaxation from glucose and ammonium limitations, respectively. Conclusions A candidate reference gene set to be used in dynamic real-time RT-qPCR expression profiling in yeast was proposed for the first time in the present study. Suitable pools of stable reference genes to be used under different experimental conditions could be selected from this candidate set in order to successfully determine the expression profiles for the genes of interest. PMID:22675547
Multi-breed and multi-trait co-association analysis of meat tenderness and other meat quality traits in three French beef cattle breeds.

PubMed

Ramayo-Caldas, Yuliaxis; Renand, Gilles; Ballester, Maria; Saintilan, Romain; Rocha, Dominique

2016-04-23

Studies to identify markers associated with beef tenderness have focused on Warner-Bratzler shear force (WBSF) but the interplay between the genes associated with WBSF has not been explored. We used the association weight matrix (AWM), a systems biology approach, to identify a set of interacting genes that are co-associated with tenderness and other meat quality traits, and shared across the Charolaise, Limousine and Blonde d'Aquitaine beef cattle breeds. Genome-wide association studies were performed using ~500K single nucleotide polymorphisms (SNPs) and 17 phenotypes measured on more than 1000 animals for each breed. First, this multi-trait approach was applied separately for each breed across 17 phenotypes and second, between- and across-breed comparisons at the AWM and functional levels were performed. Genetic heterogeneity was observed, and most of the variants that were associated with WBSF segregated within rather than across breeds. We identified 206 common candidate genes associated with WBSF across the three breeds. SNPs in these common genes explained between 28 and 30 % of the phenotypic variance for WBSF. A reduced number of common SNPs mapping to the 206 common genes were identified, suggesting that different mutations may target the same genes in a breed-specific manner. Therefore, it is likely that, depending on allele frequencies and linkage disequilibrium patterns, a SNP that is identified for one breed may not be informative for another unrelated breed. Well-known candidate genes affecting beef tenderness were identified. In addition, some of the 206 common genes are located within previously reported quantitative trait loci for WBSF in several cattle breeds. Moreover, the multi-breed co-association analysis detected new candidate genes, regulators and metabolic pathways that are likely involved in the determination of meat tenderness and other meat quality traits in beef cattle. Our results suggest that systems biology approaches that explore associations of correlated traits increase statistical power to identify candidate genes beyond the one-dimensional approach. Further studies on the 206 common genes, their pathways, regulators and interactions will expand our knowledge on the molecular basis of meat tenderness and could lead to the discovery of functional mutations useful for genomic selection in a multi-breed beef cattle context.
Quantitative trait loci and candidate genes associated with starch pasting viscosity characteristics in cassava (Manihot esculenta Crantz).

PubMed

Thanyasiriwat, T; Sraphet, S; Whankaew, S; Boonseng, O; Bao, J; Lightfoot, D A; Tangphatsornruang, S; Triwitayakorn, K

2014-01-01

Starch pasting viscosity is an important quality trait in cassava (Manihot esculenta Crantz) cultivars. The aim here was to identify loci and candidate genes associated with the starch pasting viscosity. Quantitative trait loci (QTL) mapping for seven pasting viscosity parameters was carried out using 100 lines of an F1 mapping population from a cross between two cassava cultivars Huay Bong 60 and Hanatee. Starch samples were obtained from roots of cassava grown in 2008 and 2009 at Rayong, and in 2009 at Lop Buri province, Thailand. The traits showed continuous distribution among the F1 progeny with transgressive variation. Fifteen QTL were identified from mean trait data, with Logarithm of Odds (LOD) values from 2.77-13.01 and phenotype variations explained (PVE) from10.0-48.4%. In addition, 48 QTL were identified in separate environments. The LOD values ranged from 2.55-8.68 and explained 6.6-43.7% of phenotype variation. The loci were located on 19 linkage groups. The most important QTL for pasting temperature (PT) (qPT.1LG1) from mean trait values showed largest effect with highest LOD value (13.01) and PVE (48.4%). The QTL co-localised with PT and pasting time (PTi) loci that were identified in separate environments. Candidate genes were identified within the QTL peak regions. However, the major genes of interest, encoding the family of glycosyl or glucosyl transferases and hydrolases, were located at the periphery of QTL peaks. The loci identified could be effectively applied in breeding programmes to improve cassava starch quality. Alleles of candidate genes should be further studied in order to better understand their effects on starch quality traits. © 2013 German Botanical Society and The Royal Botanical Society of the Netherlands.
Genome-wide association study identified genetic variations and candidate genes for plant architecture component traits in Chinese upland cotton.

PubMed

Su, Junji; Li, Libei; Zhang, Chi; Wang, Caixiang; Gu, Lijiao; Wang, Hantao; Wei, Hengling; Liu, Qibao; Huang, Long; Yu, Shuxun

2018-06-01

Thirty significant associations between 22 SNPs and five plant architecture component traits in Chinese upland cotton were identified via GWAS. Four peak SNP loci located on chromosome D03 were simultaneously associated with more plant architecture component traits. A candidate gene, Gh_D03G0922, might be responsible for plant height in upland cotton. A compact plant architecture is increasingly required for mechanized harvesting processes in China. Therefore, cotton plant architecture is an important trait, and its components, such as plant height, fruit branch length and fruit branch angle, affect the suitability of a cultivar for mechanized harvesting. To determine the genetic basis of cotton plant architecture, a genome-wide association study (GWAS) was performed using a panel composed of 355 accessions and 93,250 single nucleotide polymorphisms (SNPs) identified using the specific-locus amplified fragment sequencing method. Thirty significant associations between 22 SNPs and five plant architecture component traits were identified via GWAS. Most importantly, four peak SNP loci located on chromosome D03 were simultaneously associated with more plant architecture component traits, and these SNPs were harbored in one linkage disequilibrium block. Furthermore, 21 candidate genes for plant architecture were predicted in a 0.95-Mb region including the four peak SNPs. One of these genes (Gh_D03G0922) was near the significant SNP D03_31584163 (8.40 kb), and its Arabidopsis homologs contain MADS-box domains that might be involved in plant growth and development. qRT-PCR showed that the expression of Gh_D03G0922 was upregulated in the apical buds and young leaves of the short and compact cotton varieties, and virus-induced gene silencing (VIGS) proved that the silenced plants exhibited increased PH. These results indicate that Gh_D03G0922 is likely the candidate gene for PH in cotton. The genetic variations and candidate genes identified in this study lay a foundation for cultivating moderately short and compact varieties in future Chinese cotton-breeding programs.
Scanning the genome for gene single nucleotide polymorphisms involved in adaptive population differentiation in white spruce

PubMed Central

Namroud, Marie-Claire; Beaulieu, Jean; Juge, Nicolas; Laroche, Jérôme; Bousquet, Jean

2008-01-01

Conifers are characterized by a large genome size and a rapid decay of linkage disequilibrium, most often within gene limits. Genome scans based on noncoding markers are less likely to detect molecular adaptation linked to genes in these species. In this study, we assessed the effectiveness of a genome-wide single nucleotide polymorphism (SNP) scan focused on expressed genes in detecting local adaptation in a conifer species. Samples were collected from six natural populations of white spruce (Picea glauca) moderately differentiated for several quantitative characters. A total of 534 SNPs representing 345 expressed genes were analysed. Genes potentially under natural selection were identified by estimating the differentiation in SNP frequencies among populations (FST) and identifying outliers, and by estimating local differentiation using a Bayesian approach. Both average expected heterozygosity and population differentiation estimates (HE = 0.270 and FST = 0.006) were comparable to those obtained with other genetic markers. Of all genes, 5.5% were identified as outliers with FST at the 95% confidence level, while 14% were identified as candidates for local adaptation with the Bayesian method. There was some overlap between the two gene sets. More than half of the candidate genes for local adaptation were specific to the warmest population, about 20% to the most arid population, and 15% to the coldest and most humid higher altitude population. These adaptive trends were consistent with the genes’ putative functions and the divergence in quantitative traits noted among the populations. The results suggest that an approach separating the locus and population effects is useful to identify genes potentially under selection. These candidates are worth exploring in more details at the physiological and ecological levels. PMID:18662225
Genetic neuropathology of obsessive psychiatric syndromes

PubMed Central

Jaffe, A E; Deep-Soboslay, A; Tao, R; Hauptman, D T; Kaye, W H; Arango, V; Weinberger, D R; Hyde, T M; Kleinman, J E

2014-01-01

Anorexia nervosa (AN), bulimia nervosa (BN) and obsessive-compulsive disorder (OCD) are complex psychiatric disorders with shared obsessive features, thought to arise from the interaction of multiple genes of small effect with environmental factors. Potential candidate genes for AN, BN and OCD have been identified through clinical association and neuroimaging studies; however, recent genome-wide association studies of eating disorders (ED) so far have failed to report significant findings. In addition, few, if any, studies have interrogated postmortem brain tissue for evidence of expression quantitative trait loci (eQTLs) associated with candidate genes, which has particular promise as an approach to elucidating molecular mechanisms of association. We therefore selected single-nucleotide polymorphisms (SNPs) based on candidate gene studies for AN, BN and OCD from the literature, and examined the association of these SNPs with gene expression across the lifespan in prefrontal cortex of a nonpsychiatric control cohort (N=268). Several risk-predisposing SNPs were significantly associated with gene expression among control subjects. We then measured gene expression in the prefrontal cortex of cases previously diagnosed with obsessive psychiatric disorders, for example, ED (N=15) and OCD/obsessive-compulsive personality disorder or tics (OCD/OCPD/Tic; N=16), and nonpsychiatric controls (N=102) and identified 6 and 286 genes that were differentially expressed between ED compared with controls and OCD cases compared with controls, respectively (false discovery rate (FDR) <5%). However, none of the clinical risk SNPs were among the eQTLs and none were significantly associated with gene expression within the broad obsessive cohort, suggesting larger sample sizes or other brain regions may be required to identify candidate molecular mechanisms of clinical association in postmortem brain data sets. PMID:25180571
Genetic neuropathology of obsessive psychiatric syndromes.

PubMed

Jaffe, A E; Deep-Soboslay, A; Tao, R; Hauptman, D T; Kaye, W H; Arango, V; Weinberger, D R; Hyde, T M; Kleinman, J E

2014-09-02

Anorexia nervosa (AN), bulimia nervosa (BN) and obsessive-compulsive disorder (OCD) are complex psychiatric disorders with shared obsessive features, thought to arise from the interaction of multiple genes of small effect with environmental factors. Potential candidate genes for AN, BN and OCD have been identified through clinical association and neuroimaging studies; however, recent genome-wide association studies of eating disorders (ED) so far have failed to report significant findings. In addition, few, if any, studies have interrogated postmortem brain tissue for evidence of expression quantitative trait loci (eQTLs) associated with candidate genes, which has particular promise as an approach to elucidating molecular mechanisms of association. We therefore selected single-nucleotide polymorphisms (SNPs) based on candidate gene studies for AN, BN and OCD from the literature, and examined the association of these SNPs with gene expression across the lifespan in prefrontal cortex of a nonpsychiatric control cohort (N=268). Several risk-predisposing SNPs were significantly associated with gene expression among control subjects. We then measured gene expression in the prefrontal cortex of cases previously diagnosed with obsessive psychiatric disorders, for example, ED (N=15) and OCD/obsessive-compulsive personality disorder or tics (OCD/OCPD/Tic; N=16), and nonpsychiatric controls (N=102) and identified 6 and 286 genes that were differentially expressed between ED compared with controls and OCD cases compared with controls, respectively (false discovery rate (FDR) <5%). However, none of the clinical risk SNPs were among the eQTLs and none were significantly associated with gene expression within the broad obsessive cohort, suggesting larger sample sizes or other brain regions may be required to identify candidate molecular mechanisms of clinical association in postmortem brain data sets.
Annotating novel genes by integrating synthetic lethals and genomic information

PubMed Central

Schöner, Daniel; Kalisch, Markus; Leisner, Christian; Meier, Lukas; Sohrmann, Marc; Faty, Mahamadou; Barral, Yves; Peter, Matthias; Gruissem, Wilhelm; Bühlmann, Peter

2008-01-01

Background Large scale screening for synthetic lethality serves as a common tool in yeast genetics to systematically search for genes that play a role in specific biological processes. Often the amounts of data resulting from a single large scale screen far exceed the capacities of experimental characterization of every identified target. Thus, there is need for computational tools that select promising candidate genes in order to reduce the number of follow-up experiments to a manageable size. Results We analyze synthetic lethality data for arp1 and jnm1, two spindle migration genes, in order to identify novel members in this process. To this end, we use an unsupervised statistical method that integrates additional information from biological data sources, such as gene expression, phenotypic profiling, RNA degradation and sequence similarity. Different from existing methods that require large amounts of synthetic lethal data, our method merely relies on synthetic lethality information from two single screens. Using a Multivariate Gaussian Mixture Model, we determine the best subset of features that assign the target genes to two groups. The approach identifies a small group of genes as candidates involved in spindle migration. Experimental testing confirms the majority of our candidates and we present she1 (YBL031W) as a novel gene involved in spindle migration. We applied the statistical methodology also to TOR2 signaling as another example. Conclusion We demonstrate the general use of Multivariate Gaussian Mixture Modeling for selecting candidate genes for experimental characterization from synthetic lethality data sets. For the given example, integration of different data sources contributes to the identification of genetic interaction partners of arp1 and jnm1 that play a role in the same biological process. PMID:18194531
Single nucleotide polymorphisms in candidate genes related to daughter pregnancy rate in Holstein cows

USDA-ARS?s Scientific Manuscript database

ABSTRACT: Previously, a candidate gene approach identified 40 SNPs associated with daughter pregnancy rate (DPR) in dairy bulls. We evaluated 39 of these SNPs for relationship to DPR in a separate population of Holstein cows grouped on their predicted transmitting ability for DPR: <= -1 (n=1266) a...
Database of cattle candidate genes and genetic markers for milk production and mastitis

PubMed Central

Ogorevc, J; Kunej, T; Razpet, A; Dovc, P

2009-01-01

A cattle database of candidate genes and genetic markers for milk production and mastitis has been developed to provide an integrated research tool incorporating different types of information supporting a genomic approach to study lactation, udder development and health. The database contains 943 genes and genetic markers involved in mammary gland development and function, representing candidates for further functional studies. The candidate loci were drawn on a genetic map to reveal positional overlaps. For identification of candidate loci, data from seven different research approaches were exploited: (i) gene knockouts or transgenes in mice that result in specific phenotypes associated with mammary gland (143 loci); (ii) cattle QTL for milk production (344) and mastitis related traits (71); (iii) loci with sequence variations that show specific allele-phenotype interactions associated with milk production (24) or mastitis (10) in cattle; (iv) genes with expression profiles associated with milk production (207) or mastitis (107) in cattle or mouse; (v) cattle milk protein genes that exist in different genetic variants (9); (vi) miRNAs expressed in bovine mammary gland (32) and (vii) epigenetically regulated cattle genes associated with mammary gland function (1). Fourty-four genes found by multiple independent analyses were suggested as the most promising candidates and were further in silico analysed for expression levels in lactating mammary gland, genetic variability and top biological functions in functional networks. A miRNA target search for mammary gland expressed miRNAs identified 359 putative binding sites in 3′UTRs of candidate genes. PMID:19508288
Fine mapping of the genic male-sterile ms 1 gene in Capsicum annuum L.

PubMed

Jeong, Kyumi; Choi, Doil; Lee, Jundae

2018-01-01

The genomic region cosegregating with the genic male-sterile ms 1 gene of Capsicum annuum L. was delimited to a region of 869.9 kb on chromosome 5 through fine mapping analysis. A strong candidate gene, CA05g06780, a homolog of the Arabidopsis MALE STERILITY 1 gene that controls pollen development, was identified in this region. Genic male sterility caused by the ms 1 gene has been used for the economically efficient production of massive hybrid seeds in paprika (Capsicum annuum L.), a colored bell-type sweet pepper. Previously, a CAPS marker, PmsM1-CAPS, located about 2-3 cM from the ms 1 locus, was reported. In this study, we constructed a fine map near the ms 1 locus using high-resolution melting (HRM) markers in an F 2 population consisting of 1118 individual plants, which segregated into 867 male-fertile and 251 male-sterile plants. A total of 12 HRM markers linked to the ms 1 locus were developed from 53 primer sets targeting intraspecific SNPs derived by comparing genome-wide sequences obtained by next-generation resequencing analysis. Using this approach, we narrowed down the region cosegregating with the ms 1 gene to 869.9 kb of sequence. Gene prediction analysis revealed 11 open reading frames in this region. A strong candidate gene, CA05g06780, was identified; this gene is a homolog of the Arabidopsis MALE STERILITY 1 (MS1) gene, which encodes a PHD-type transcription factor that regulates pollen and tapetum development. Sequence comparison analysis suggested that the CA05g06780 gene is the strongest candidate for the ms 1 gene of paprika. To summarize, we developed a cosegregated marker, 32187928-HRM, for marker-assisted selection and identified a strong candidate for the ms 1 gene.
Identification of possible genetic polymorphisms involved in cancer cachexia: a systematic review.

PubMed

Tan, Benjamin H L; Ross, James A; Kaasa, Stein; Skorpen, Frank; Fearon, Kenneth C H

2011-04-01

Cancer cachexia is a polygenic and complex syndrome. Genetic variations in regulation of the inflammatory response, muscle and fat metabolic pathways, and pathways in appetite regulation are likely to contribute to the susceptibility or resistance to developing cancer cachexia. A systematic search of Medline and EmBase databases, covering 1986-2008 was performed for potential candidate genes/genetic polymorphisms relating to cancer cachexia. Related genes were then identified using pathway functional analysis software. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Genes with variants which had functional or clinical associations with cachexia and replicated in at least one study were entered into pathway analysis software to reveal possible network associations between genes. A total of 184 polymorphisms with functional or clinical relevance to cancer cachexia were identified in 92 candidate genes. Of these, 42 polymorphisms (in 33 genes) were replicated in more than one study with 13 polymorphisms found to influence two or more hallmarks of cachexia (i.e. inflammation, loss of fat mass and/or lean mass and reduced survival). Thirty-three genes were found to be significantly interconnected in two major networks with four genes (ADIPOQ, IL6, NFKB1 and TLR4) interlinking both networks. Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides an initial framework to select genes/polymorphisms for further study in cancer cachexia, and to develop their potential as susceptibility biomarkers of developing cachexia.
Evolutionary Distance of Amino Acid Sequence Orthologs across Macaque Subspecies: Identifying Candidate Genes for SIV Resistance in Chinese Rhesus Macaques

PubMed Central

Ross, Cody T.; Roodgar, Morteza; Smith, David Glenn

2015-01-01

We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674
[Screening cold-acclimation differential expression candidate genes in the brain of common carp (Cyprinus carpio)].

PubMed

Xu, Li-Hua; Chang, Yu-Mei; Liu, Chun-Lei; Liang, Li-Qun; Liu, Jin-Liang; Chi, Bing-Jie

2011-03-01

In this study, 26 candidate genes were quantified and normalized in the brain cDNA of common carp (Cyprinus carpio) at 23°C and 6°C using double-standard curve method of real-time quantitative PCR. The results showed that five candidates up-regulated in the samples at 6°C (P<0.01) and quantified 2.11, 13.9, 2.52, 7.38, and 1.83 times more than in the samples at 23°C, respectively. Gene function searching indicated that the protein products of these five candidates were elongation of very long chain fatty acids protein, Acyl-CoA desaturase, Transcription initiation factor IIB, Myo-inositol- 1-phosphate synthase, and Blood-brain barrier HT7 antigen individually. Moreover, seven down-regulated candidates were also identified in the same samples at 6°C (P>0.05), and their expression levels were decreased by 21.8%, 25.9%, 16.6%, 23.7%, 15.8%, 16.3%, and 42.5%, respectively, in comparison with the samples at 23°C. These seven down-regulated candidates mainly participated in the inhibition of glycolysis, improvement of cell apoptosis, and intervention of synapse remodeling based on the results of function searching. The five cold-induced genes identified in this study will be used as important elements for fish with cold sensitive through transgenic technology in future.
Target gene analyses of 39 amelogenesis imperfecta kindreds

PubMed Central

Chan, Hui-Chen; Estrella, Ninna M. R. P.; Milkovich, Rachel N.; Kim, Jung-Wook; Simmer, James P.; Hu, Jan C-C.

2012-01-01

Previously, mutational analyses identified six disease-causing mutations in 24 amelogenesis imperfecta (AI) kindreds. We have since expanded the number of AI kindreds to 39, and performed mutation analyses covering the coding exons and adjoining intron sequences for the six proven AI candidate genes [amelogenin (AMELX), enamelin (ENAM), family with sequence similarity 83, member H (FAM83H), WD repeat containing domain 72 (WDR72), enamelysin (MMP20), and kallikrein-related peptidase 4 (KLK4)] and for ameloblastin (AMBN) (a suspected candidate gene). All four of the X-linked AI families (100%) had disease-causing mutations in AMELX, suggesting that AMELX is the only gene involved in the aetiology of X-linked AI. Eighteen families showed an autosomal-dominant pattern of inheritance. Disease-causing mutations were identified in 12 (67%): eight in FAM83H, and four in ENAM. No FAM83H coding-region or splice-junction mutations were identified in three probands with autosomal-dominant hypocalcification AI (ADHCAI), suggesting that a second gene may contribute to the aetiology of ADHCAI. Six families showed an autosomal-recessive pattern of inheritance, and disease-causing mutations were identified in three (50%): two in MMP20, and one in WDR72. No disease-causing mutations were found in 11 families with only one affected member. We conclude that mutation analyses of the current candidate genes for AI have about a 50% chance of identifying the disease-causing mutation in a given kindred. PMID:22243262

Identification of a duplication within the GDF9 gene and novel candidate genes for primary ovarian insufficiency (POI) by a customized high-resolution array comparative genomic hybridization platform.

PubMed

Norling, A; Hirschberg, A L; Rodriguez-Wallberg, K A; Iwarsson, E; Wedell, A; Barbaro, M

2014-08-01

Can high-resolution array comparative genomic hybridization (CGH) analysis of DNA samples from women with primary ovarian insufficiency (POI) improve the diagnosis of the condition and identify novel candidate genes for POI? A mutation affecting the regulatory region of growth differentiation factor 9 (GDF9) was identified for the first time together with several novel candidate genes for POI. Most patients with POI do not receive a molecular diagnosis despite a significant genetic component in the pathogenesis. We performed a case-control study. Twenty-six patients were analyzed by array CGH for identification of copy number variants. Novel changes were investigated in 95 controls and in a separate population of 28 additional patients with POI. The experimental procedures were performed during a 1-year period. DNA samples from 26 patients with POI were analyzed by a customized 1M array-CGH platform with whole genome coverage and probe enrichment targeting 78 genes in sex development. By PCR amplification and sequencing, the breakpoint of an identified partial GDF9 gene duplication was characterized. A multiplex ligation-dependent probe amplification (MLPA) probe set for specific identification of deletions/duplications affecting GDF9 was developed. An MLPA probe set for the identification of additional cases or controls carrying novel candidate regions identified by array-CGH was developed. Sequencing of three candidate genes was performed. Eleven unique copy number changes were identified in a total of 11 patients, including a tandem duplication of 475 bp, containing part of the GDF9 gene promoter region. The duplicated region contains three NOBOX-binding elements and an E-box, important for GDF9 gene regulation. This aberration is likely causative of POI. Fifty-four patients were investigated for copy number changes within GDF9, but no additional cases were found. Ten aberrations constituting novel candidate regions were detected, including a second DNAH6 deletion in a patient with POI. Other identified candidate genes were TSPYL6, SMARCC1, CSPG5 and ZFR2. This is a descriptive study and no functional experiments were performed. The study illustrates the importance of analyzing small copy number changes in addition to sequence alterations in the genetic investigation of patients with POI. Also, promoter regions should be included in the investigation. The study was supported by grants from the Swedish Research council (project no 12198 to A.W. and project no 20324 to A.L.H.), Stockholm County Council (E.I., A.W. and K.R.W.), Foundation Frimurare Barnhuset (A.N., A.W. and M.B.), Karolinska Institutet (A.N., A.L.H., E.I., A.W. and M.B.), Novo Nordic Foundation (A.W.) and Svenska Läkaresällskapet (M.B.). The funding sources had no involvement in the design or analysis of the study. The authors have no competing interests to declare. Not applicable. © The Author 2014. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology.
Genetic analysis of the calcineurin pathway identifies members of the EGR gene family, specifically EGR3, as potential susceptibility candidates in schizophrenia

PubMed Central

Yamada, Kazuo; Gerber, David J.; Iwayama, Yoshimi; Ohnishi, Tetsuo; Ohba, Hisako; Toyota, Tomoko; Aruga, Jun; Minabe, Yoshio; Tonegawa, Susumu; Yoshikawa, Takeo

2007-01-01

The calcineurin cascade is central to neuronal signal transduction, and genes in this network are intriguing candidate schizophrenia susceptibility genes. To replicate and extend our previously reported association between the PPP3CC gene, encoding the calcineurin catalytic γ-subunit, and schizophrenia, we examined 84 SNPs from 14 calcineurin-related candidate genes for genetic association by using 124 Japanese schizophrenic pedigrees. Four of these genes (PPP3CC, EGR2, EGR3, and EGR4) showed nominally significant association with schizophrenia. In a postmortem brain study, EGR1, EGR2, and EGR3 transcripts were shown to be down-regulated in the prefrontal cortex of schizophrenic, but not bipolar, patients. These findings raise a potentially important role for EGR genes in schizophrenia pathogenesis. Because EGR3 is an attractive candidate gene based on its chromosomal location close to PPP3CC within 8p21.3 and its functional link to dopamine, glutamate, and neuregulin signaling, we extended our analysis by resequencing the entire EGR3 genomic interval and detected 15 SNPs. One of these, IVS1 + 607A→G SNP, displayed the strongest evidence for disease association, which was confirmed in 1,140 independent case-control samples. An in vitro promoter assay detected a possible expression-regulatory effect of this SNP. These findings support the previous genetic association of altered calcineurin signaling with schizophrenia pathogenesis and identify EGR3 as a compelling susceptibility gene. PMID:17360599
Reranking candidate gene models with cross-species comparison for improved gene prediction

PubMed Central

Liu, Qian; Crammer, Koby; Pereira, Fernando CN; Roos, David S

2008-01-01

Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models. PMID:18854050
A comprehensive study of the genomic differentiation between temperate Dent and Flint maize.

PubMed

Unterseer, Sandra; Pophaly, Saurabh D; Peis, Regina; Westermeier, Peter; Mayer, Manfred; Seidel, Michael A; Haberer, Georg; Mayer, Klaus F X; Ordas, Bernardo; Pausch, Hubert; Tellier, Aurélien; Bauer, Eva; Schön, Chris-Carolin

2016-07-08

Dent and Flint represent two major germplasm pools exploited in maize breeding. Several traits differentiate the two pools, like cold tolerance, early vigor, and flowering time. A comparative investigation of their genomic architecture relevant for quantitative trait expression has not been reported so far. Understanding the genomic differences between germplasm pools may contribute to a better understanding of the complementarity in heterotic patterns exploited in hybrid breeding and of mechanisms involved in adaptation to different environments. We perform whole-genome screens for signatures of selection specific to temperate Dent and Flint maize by comparing high-density genotyping data of 70 American and European Dent and 66 European Flint inbred lines. We find 2.2 % and 1.4 % of the genes are under selective pressure, respectively, and identify candidate genes associated with agronomic traits known to differ between the two pools. Taking flowering time as an example for the differentiation between Dent and Flint, we investigate candidate genes involved in the flowering network by phenotypic analyses in a Dent-Flint introgression library and find that the Flint haplotypes of the candidates promote earlier flowering. Within the flowering network, the majority of Flint candidates are associated with endogenous pathways in contrast to Dent candidate genes, which are mainly involved in response to environmental factors like light and photoperiod. The diversity patterns of the candidates in a unique panel of more than 900 individuals from 38 European landraces indicate a major contribution of landraces from France, Germany, and Spain to the candidate gene diversity of the Flint elite lines. In this study, we report the investigation of pool-specific differences between temperate Dent and Flint on a genome-wide scale. The identified candidate genes represent a promising source for the functional investigation of pool-specific haplotypes in different genetic backgrounds and for the evaluation of their potential for future crop improvement like the adaptation to specific environments.
Genome-wide association study identifies Loci and candidate genes for body composition and meat quality traits in Beijing-You chickens.

PubMed

Liu, Ranran; Sun, Yanfa; Zhao, Guiping; Wang, Fangjie; Wu, Dan; Zheng, Maiqing; Chen, Jilan; Zhang, Lei; Hu, Yaodong; Wen, Jie

2013-01-01

Body composition and meat quality traits are important economic traits of chickens. The development of high-throughput genotyping platforms and relevant statistical methods have enabled genome-wide association studies in chickens. In order to identify molecular markers and candidate genes associated with body composition and meat quality traits, genome-wide association studies were conducted using the Illumina 60 K SNP Beadchip to genotype 724 Beijing-You chickens. For each bird, a total of 16 traits were measured, including carcass weight (CW), eviscerated weight (EW), dressing percentage, breast muscle weight (BrW) and percentage (BrP), thigh muscle weight and percentage, abdominal fat weight and percentage, dry matter and intramuscular fat contents of breast and thigh muscle, ultimate pH, and shear force of the pectoralis major muscle at 100 d of age. The SNPs that were significantly associated with the phenotypic traits were identified using both simple (GLM) and compressed mixed linear (MLM) models. For nine of ten body composition traits studied, SNPs showing genome wide significance (P<2.59E-6) have been identified. A consistent region on chicken (Gallus gallus) chromosome 4 (GGA4), including seven significant SNPs and four candidate genes (LCORL, LAP3, LDB2, TAPT1), were found to be associated with CW and EW. Another 0.65 Mb region on GGA3 for BrW and BrP was identified. After measuring the mRNA content in beast muscle for five genes located in this region, the changes in GJA1 expression were found to be consistent with that of breast muscle weight across development. It is highly possible that GJA1 is a functional gene for breast muscle development in chickens. For meat quality traits, several SNPs reaching suggestive association were identified and possible candidate genes with their functions were discussed.
QTLs for Seed Vigor-Related Traits Identified in Maize Seeds Germinated under Artificial Aging Conditions

PubMed Central

Han, Zanping; Ku, Lixia; Zhang, Zhenzhen; Zhang, Jun; Guo, ShuLei; Liu, Haiying; Zhao, Ruifang; Ren, Zhenzhen; Zhang, Liangkun; Su, Huihui; Dong, Lei; Chen, Yanhui

2014-01-01

High seed vigor is important for agricultural production due to the associated potential for increased growth and productivity. However, a better understanding of the underlying molecular mechanisms is required because the genetic basis for seed vigor remains unknown. We used single-nucleotide polymorphism (SNP) markers to map quantitative trait loci (QTLs) for four seed vigor traits in two connected recombinant inbred line (RIL) maize populations under four treatment conditions during seed germination. Sixty-five QTLs distributed between the two populations were identified and a meta-analysis was used to integrate genetic maps. Sixty-one initially identified QTLs were integrated into 18 meta-QTLs (mQTLs). Initial QTLs with contribution to phenotypic variation values of R2>10% were integrated into mQTLs. Twenty-three candidate genes for association with seed vigor traits coincided with 13 mQTLs. The candidate genes had functions in the glycolytic pathway and in protein metabolism. QTLs with major effects (R2>10%) were identified under at least one treatment condition for mQTL2, mQTL3-2, and mQTL3-4. Candidate genes included a calcium-dependent protein kinase gene (302810918) involved in signal transduction that mapped in the mQTL3-2 interval associated with germination energy (GE) and germination percentage (GP), and an hsp20/alpha crystallin family protein gene (At5g51440) that mapped in the mQTL3-4 interval associated with GE and GP. Two initial QTLs with a major effect under at least two treatment conditions were identified for mQTL5-2. A cucumisin-like Ser protease gene (At5g67360) mapped in the mQTL5-2 interval associated with GP. The chromosome regions for mQTL2, mQTL3-2, mQTL3-4, and mQTL5-2 may be hot spots for QTLs related to seed vigor traits. The mQTLs and candidate genes identified in this study provide valuable information for the identification of additional quantitative trait genes. PMID:24651614
QTLs for seed vigor-related traits identified in maize seeds germinated under artificial aging conditions.

PubMed

Han, Zanping; Ku, Lixia; Zhang, Zhenzhen; Zhang, Jun; Guo, Shulei; Liu, Haiying; Zhao, Ruifang; Ren, Zhenzhen; Zhang, Liangkun; Su, Huihui; Dong, Lei; Chen, Yanhui

2014-01-01

High seed vigor is important for agricultural production due to the associated potential for increased growth and productivity. However, a better understanding of the underlying molecular mechanisms is required because the genetic basis for seed vigor remains unknown. We used single-nucleotide polymorphism (SNP) markers to map quantitative trait loci (QTLs) for four seed vigor traits in two connected recombinant inbred line (RIL) maize populations under four treatment conditions during seed germination. Sixty-five QTLs distributed between the two populations were identified and a meta-analysis was used to integrate genetic maps. Sixty-one initially identified QTLs were integrated into 18 meta-QTLs (mQTLs). Initial QTLs with contribution to phenotypic variation values of R(2)>10% were integrated into mQTLs. Twenty-three candidate genes for association with seed vigor traits coincided with 13 mQTLs. The candidate genes had functions in the glycolytic pathway and in protein metabolism. QTLs with major effects (R(2)>10%) were identified under at least one treatment condition for mQTL2, mQTL3-2, and mQTL3-4. Candidate genes included a calcium-dependent protein kinase gene (302810918) involved in signal transduction that mapped in the mQTL3-2 interval associated with germination energy (GE) and germination percentage (GP), and an hsp20/alpha crystallin family protein gene (At5g51440) that mapped in the mQTL3-4 interval associated with GE and GP. Two initial QTLs with a major effect under at least two treatment conditions were identified for mQTL5-2. A cucumisin-like Ser protease gene (At5g67360) mapped in the mQTL5-2 interval associated with GP. The chromosome regions for mQTL2, mQTL3-2, mQTL3-4, and mQTL5-2 may be hot spots for QTLs related to seed vigor traits. The mQTLs and candidate genes identified in this study provide valuable information for the identification of additional quantitative trait genes.
Identifying candidate drivers of drug response in heterogeneous cancer by mining high throughput genomics data.

PubMed

Nabavi, Sheida

2016-08-15

With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such as cancer. Integrating, analyzing, and interpreting big and noisy genomics data to obtain biologically meaningful results, however, remains highly challenging. Mining genomics datasets by utilizing advanced computational methods can help to address these issues. To facilitate the identification of a short list of biologically meaningful genes as candidate drivers of anti-cancer drug resistance from an enormous amount of heterogeneous data, we employed statistical machine-learning techniques and integrated genomics datasets. We developed a computational method that integrates gene expression, somatic mutation, and copy number aberration data of sensitive and resistant tumors. In this method, an integrative method based on module network analysis is applied to identify potential driver genes. This is followed by cross-validation and a comparison of the results of sensitive and resistance groups to obtain the final list of candidate biomarkers. We applied this method to the ovarian cancer data from the cancer genome atlas. The final result contains biologically relevant genes, such as COL11A1, which has been reported as a cis-platinum resistant biomarker for epithelial ovarian carcinoma in several recent studies. The described method yields a short list of aberrant genes that also control the expression of their co-regulated genes. The results suggest that the unbiased data driven computational method can identify biologically relevant candidate biomarkers. It can be utilized in a wide range of applications that compare two conditions with highly heterogeneous datasets.
Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.).

PubMed

Taylor, Candy M; Jost, Ricarda; Erskine, William; Nelson, Matthew N

2016-01-01

Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop) using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other potentially more suitable reference genes will be identified for this species in future.
Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.)

PubMed Central

Erskine, William; Nelson, Matthew N.

2016-01-01

Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop) using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other potentially more suitable reference genes will be identified for this species in future. PMID:26872362
Dissecting Vancomycin-Intermediate Resistance in Staphylococcus aureus Using Genome-Wide Association

PubMed Central

Alam, Md Tauqeer; Petit, Robert A.; Crispell, Emily K.; Thornton, Timothy A.; Conneely, Karen N.; Jiang, Yunxuan; Satola, Sarah W.; Read, Timothy D.

2014-01-01

Vancomycin-intermediate Staphylococcus aureus (VISA) is currently defined as having minimal inhibitory concentration (MIC) of 4–8 µg/ml. VISA evolves through changes in multiple genetic loci with at least 16 candidate genes identified in clinical and in vitro-selected VISA strains. We report a whole-genome comparative analysis of 49 vancomycin-sensitive S. aureus and 26 VISA strains. Resistance to vancomycin was determined by broth microdilution, Etest, and population analysis profile-area under the curve (PAP-AUC). Genome-wide association studies (GWAS) of 55,977 single-nucleotide polymorphisms identified in one or more strains found one highly significant association (P = 8.78E-08) between a nonsynonymous mutation at codon 481 (H481) of the rpoB gene and increased vancomycin MIC. Additionally, we used a database of public S. aureus genome sequences to identify rare mutations in candidate genes associated with VISA. On the basis of these data, we proposed a preliminary model called ECM+RMCG for the VISA phenotype as a benchmark for future efforts. The model predicted VISA based on the presence of a rare mutation in a set of candidate genes (walKR, vraSR, graSR, and agrA) and/or three previously experimentally verified mutations (including the rpoB H481 locus) with an accuracy of 81% and a sensitivity of 73%. Further, the level of resistance measured by both Etest and PAP-AUC regressed positively with the number of mutations present in a strain. This study demonstrated 1) the power of GWAS for identifying common genetic variants associated with antibiotic resistance in bacteria and 2) that rare mutations in candidate gene, identified using large genomic data sets, can also be associated with resistance phenotypes. PMID:24787619
Using whole-exome sequencing to identify variants inherited from mosaic parents

PubMed Central

Rios, Jonathan J; Delgado, Mauricio R

2015-01-01

Whole-exome sequencing (WES) has allowed the discovery of genes and variants causing rare human disease. This is often achieved by comparing nonsynonymous variants between unrelated patients, and particularly for sporadic or recessive disease, often identifies a single or few candidate genes for further consideration. However, despite the potential for this approach to elucidate the genetic cause of rare human disease, a majority of patients fail to realize a genetic diagnosis using standard exome analysis methods. Although genetic heterogeneity contributes to the difficulty of exome sequence analysis between patients, it remains plausible that rare human disease is not caused by de novo or recessive variants. Multiple human disorders have been described for which the variant was inherited from a phenotypically normal mosaic parent. Here we highlight the potential for exome sequencing to identify a reasonable number of candidate genes when dominant disease variants are inherited from a mosaic parent. We show the power of WES to identify a limited number of candidate genes using this disease model and how sequence coverage affects identification of mosaic variants by WES. We propose this analysis as an alternative to discover genetic causes of rare human disorders for which typical WES approaches fail to identify likely pathogenic variants. PMID:24986828
Candidate gene association mapping for winter survival and spring regrowth in perennial ryegrass

Treesearch

Xiaoqing Yu; Paula M. Pijut; Stephen Byrne; Torben Asp; Guihua Bai; Yiwei Jiang

2015-01-01

Perennial ryegrass (Lolium perenne L.) is a widely cultivated cool-season grass species because of its high quality for forage and turf. Susceptibility to freezing damage limits its further use in temperate zones. The objective of this study was to identify candidate genes significantly associated with winter survival and spring regrowth in a global...
Single nucleotide polymorphisms in specific candidate genes are associated with phenotypic differences in days open for first lactation in Holstein cows

USDA-ARS?s Scientific Manuscript database

Previously, a candidate gene approach identified 51 single nucleotide polymorphisms (SNP) associated with genetic merit for reproductive traits and 26 associated with genetic merit for production in dairy bulls. We evaluated association of the 77 SNPs with days open (DO) for first lactation in a pop...
Selection on plant male function genes identifies candidates for reproductive isolation of yellow monkeyflowers.

PubMed

Aagaard, Jan E; George, Renee D; Fishman, Lila; Maccoss, Michael J; Swanson, Willie J

2013-01-01

Understanding the genetic basis of reproductive isolation promises insight into speciation and the origins of biological diversity. While progress has been made in identifying genes underlying barriers to reproduction that function after fertilization (post-zygotic isolation), we know much less about earlier acting pre-zygotic barriers. Of particular interest are barriers involved in mating and fertilization that can evolve extremely rapidly under sexual selection, suggesting they may play a prominent role in the initial stages of reproductive isolation. A significant challenge to the field of speciation genetics is developing new approaches for identification of candidate genes underlying these barriers, particularly among non-traditional model systems. We employ powerful proteomic and genomic strategies to study the genetic basis of conspecific pollen precedence, an important component of pre-zygotic reproductive isolation among yellow monkeyflowers (Mimulus spp.) resulting from male pollen competition. We use isotopic labeling in combination with shotgun proteomics to identify more than 2,000 male function (pollen tube) proteins within maternal reproductive structures (styles) of M. guttatus flowers where pollen competition occurs. We then sequence array-captured pollen tube exomes from a large outcrossing population of M. guttatus, and identify those genes with evidence of selective sweeps or balancing selection consistent with their role in pollen competition. We also test for evidence of positive selection on these genes more broadly across yellow monkeyflowers, because a signal of adaptive divergence is a common feature of genes causing reproductive isolation. Together the molecular evolution studies identify 159 pollen tube proteins that are candidate genes for conspecific pollen precedence. Our work demonstrates how powerful proteomic and genomic tools can be readily adapted to non-traditional model systems, allowing for genome-wide screens towards the goal of identifying the molecular basis of genetically complex traits.
Exome Sequence Analysis of 14 Families With High Myopia.

PubMed

Kloss, Bethany A; Tompson, Stuart W; Whisenhunt, Kristina N; Quow, Krystina L; Huang, Samuel J; Pavelec, Derek M; Rosenberg, Thomas; Young, Terri L

2017-04-01

To identify causal gene mutations in 14 families with autosomal dominant (AD) high myopia using exome sequencing. Select individuals from 14 large Caucasian families with high myopia were exome sequenced. Gene variants were filtered to identify potential pathogenic changes. Sanger sequencing was used to confirm variants in original DNA, and to test for disease cosegregation in additional family members. Candidate genes and chromosomal loci previously associated with myopic refractive error and its endophenotypes were comprehensively screened. In 14 high myopia families, we identified 73 rare and 31 novel gene variants as candidates for pathogenicity. In seven of these families, two of the novel and eight of the rare variants were within known myopia loci. A total of 104 heterozygous nonsynonymous rare variants in 104 genes were identified in 10 out of 14 probands. Each variant cosegregated with affection status. No rare variants were identified in genes known to cause myopia or in genes closest to published genome-wide association study association signals for refractive error or its endophenotypes. Whole exome sequencing was performed to determine gene variants implicated in the pathogenesis of AD high myopia. This study provides new genes for consideration in the pathogenesis of high myopia, and may aid in the development of genetic profiling of those at greatest risk for attendant ocular morbidities of this disorder.
Comparison of Expression Profiles in Ovarian Epithelium In Vivo and Ovarian Cancer Identifies Novel Candidate Genes Involved in Disease Pathogenesis

PubMed Central

Emmanuel, Catherine; Gava, Natalie; Kennedy, Catherine; Balleine, Rosemary L.; Sharma, Raghwa; Wain, Gerard; Brand, Alison; Hogg, Russell; Etemadmoghadam, Dariush; George, Joshy; Birrer, Michael J.; Clarke, Christine L.; Chenevix-Trench, Georgia; Bowtell, David D. L.; Harnett, Paul R.; deFazio, Anna

2011-01-01

Molecular events leading to epithelial ovarian cancer are poorly understood but ovulatory hormones and a high number of life-time ovulations with concomitant proliferation, apoptosis, and inflammation, increases risk. We identified genes that are regulated during the estrous cycle in murine ovarian surface epithelium and analysed these profiles to identify genes dysregulated in human ovarian cancer, using publically available datasets. We identified 338 genes that are regulated in murine ovarian surface epithelium during the estrous cycle and dysregulated in ovarian cancer. Six of seven candidates selected for immunohistochemical validation were expressed in serous ovarian cancer, inclusion cysts, ovarian surface epithelium and in fallopian tube epithelium. Most were overexpressed in ovarian cancer compared with ovarian surface epithelium and/or inclusion cysts (EpCAM, EZH2, BIRC5) although BIRC5 and EZH2 were expressed as highly in fallopian tube epithelium as in ovarian cancer. We prioritised the 338 genes for those likely to be important for ovarian cancer development by in silico analyses of copy number aberration and mutation using publically available datasets and identified genes with established roles in ovarian cancer as well as novel genes for which we have evidence for involvement in ovarian cancer. Chromosome segregation emerged as an important process in which genes from our list of 338 were over-represented including two (BUB1, NCAPD2) for which there is evidence of amplification and mutation. NUAK2, upregulated in ovarian surface epithelium in proestrus and predicted to have a driver mutation in ovarian cancer, was examined in a larger cohort of serous ovarian cancer where patients with lower NUAK2 expression had shorter overall survival. In conclusion, defining genes that are activated in normal epithelium in the course of ovulation that are also dysregulated in cancer has identified a number of pathways and novel candidate genes that may contribute to the development of ovarian cancer. PMID:21423607
Signature of genetic associations in oral cancer.

PubMed

Sharma, Vishwas; Nandan, Amrita; Sharma, Amitesh Kumar; Singh, Harpreet; Bharadwaj, Mausumi; Sinha, Dhirendra Narain; Mehrotra, Ravi

2017-10-01

Oral cancer etiology is complex and controlled by multi-factorial events including genetic events. Candidate gene studies, genome-wide association studies, and next-generation sequencing identified various chromosomal loci to be associated with oral cancer. There is no available review that could give us the comprehensive picture of genetic loci identified to be associated with oral cancer by candidate gene studies-based, genome-wide association studies-based, and next-generation sequencing-based approaches. A systematic literature search was performed in the PubMed database to identify the loci associated with oral cancer by exclusive candidate gene studies-based, genome-wide association studies-based, and next-generation sequencing-based study approaches. The information of loci associated with oral cancer is made online through the resource "ORNATE." Next, screening of the loci validated by candidate gene studies and next-generation sequencing approach or by two independent studies within candidate gene studies or next-generation sequencing approaches were performed. A total of 264 loci were identified to be associated with oral cancer by candidate gene studies, genome-wide association studies, and next-generation sequencing approaches. In total, 28 loci, that is, 14q32.33 (AKT1), 5q22.2 (APC), 11q22.3 (ATM), 2q33.1 (CASP8), 11q13.3 (CCND1), 16q22.1 (CDH1), 9p21.3 (CDKN2A), 1q31.1 (COX-2), 7p11.2 (EGFR), 22q13.2 (EP300), 4q35.2 (FAT1), 4q31.3 (FBXW7), 4p16.3 (FGFR3), 1p13.3 (GSTM1-GSTT1), 11q13.2 (GSTP1), 11p15.5 (H-RAS), 3p25.3 (hOGG1), 1q32.1 (IL-10), 4q13.3 (IL-8), 12p12.1 (KRAS), 12q15 (MDM2), 12q13.12 (MLL2), 9q34.3 (NOTCH1), 17p13.1 (p53), 3q26.32 (PIK3CA), 10q23.31 (PTEN), 13q14.2 (RB1), and 5q14.2 (XRCC4), were validated to be associated with oral cancer. "ORNATE" gives a snapshot of genetic loci associated with oral cancer. All 28 loci were validated to be linked to oral cancer for which further fine-mapping followed by gene-by-gene and gene-environment interaction studies is needed to confirm their involvement in modifying oral cancer.
Association of Genetic Loci with Sleep Apnea in European Americans and African-Americans: The Candidate Gene Association Resource (CARe)

PubMed Central

Patel, Sanjay R.; Goodloe, Robert; De, Gourab; Kowgier, Matthew; Weng, Jia; Buxbaum, Sarah G.; Cade, Brian; Fulop, Tibor; Gharib, Sina A.; Gottlieb, Daniel J.; Hillman, David; Larkin, Emma K.; Lauderdale, Diane S.; Li, Li; Mukherjee, Sutapa; Palmer, Lyle; Zee, Phyllis; Zhu, Xiaofeng; Redline, Susan

2012-01-01

Although obstructive sleep apnea (OSA) is known to have a strong familial basis, no genetic polymorphisms influencing apnea risk have been identified in cross-cohort analyses. We utilized the National Heart, Lung, and Blood Institute (NHLBI) Candidate Gene Association Resource (CARe) to identify sleep apnea susceptibility loci. Using a panel of 46,449 polymorphisms from roughly 2,100 candidate genes on a customized Illumina iSelect chip, we tested for association with the apnea hypopnea index (AHI) as well as moderate to severe OSA (AHI≥15) in 3,551 participants of the Cleveland Family Study and two cohorts participating in the Sleep Heart Health Study. Among 647 African-Americans, rs11126184 in the pleckstrin (PLEK) gene was associated with OSA while rs7030789 in the lysophosphatidic acid receptor 1 (LPAR1) gene was associated with AHI using a chip-wide significance threshold of p-value<2×10−6. Among 2,904 individuals of European ancestry, rs1409986 in the prostaglandin E2 receptor (PTGER3) gene was significantly associated with OSA. Consistency of effects between rs7030789 and rs1409986 in LPAR1 and PTGER3 and apnea phenotypes were observed in independent clinic-based cohorts. Novel genetic loci for apnea phenotypes were identified through the use of customized gene chips and meta-analyses of cohort data with replication in clinic-based samples. The identified SNPs all lie in genes associated with inflammation suggesting inflammation may play a role in OSA pathogenesis. PMID:23155414
Evaluation of a functional epigenetic approach to identify promoter region methylation in phaeochromocytoma and neuroblastoma

PubMed Central

Margetts, Caroline D E; Morris, Mark; Astuti, Dewi; Gentle, Dean C; Cascon, Alberto; McRonald, Fiona E; Catchpoole, Daniel; Robledo, Mercedes; Neumann, Hartmut P H; Latif, Farida; Maher, Eamonn R

2008-01-01

The molecular genetics of inherited phaeochromocytoma have received considerable attention, but the somatic genetic and epigenetic events that characterise tumourigenesis in sporadic phaeochromocytomas are less well defined. Previously, we found considerable overlap between patterns of promoter region tumour suppressor gene (TSG) hypermethylation in two neural crest tumours, neuroblastoma and phaeochromocytoma. In order to identify candidate biomarkers and epigenetically inactivated TSGs in phaeochromocytoma and neuroblastoma, we characterised changes in gene expression in three neuroblastoma cell lines after treatment with the demethylating agent 5-azacytidine. Promoter region methylation status was then determined for 28 genes that demonstrated increased expression after demethylation. Three genes HSP47, homeobox A9 (HOXA9) and opioid binding protein (OPCML) were methylated in >10% of phaeochromocytomas (52, 17 and 12% respectively). Two of the genes, epithelial membrane protein 3 (EMP3) and HSP47, demonstrated significantly more frequent methylation in neuroblastoma than phaeochromocytoma. These findings extend epigenotype of phaeochromocytoma and identify candidate genes implicated in sporadic phaeochromocytoma tumourigenesis. PMID:18499731

Meta-analysis and genome-wide interpretation of genetic susceptibility to drug addiction

PubMed Central

2011-01-01

Background Classical genetic studies provide strong evidence for heritable contributions to susceptibility to developing dependence on addictive substances. Candidate gene and genome-wide association studies (GWAS) have sought genes, chromosomal regions and allelic variants likely to contribute to susceptibility to drug addiction. Results Here, we performed a meta-analysis of addiction candidate gene association studies and GWAS to investigate possible functional mechanisms associated with addiction susceptibility. From meta-data retrieved from 212 publications on candidate gene association studies and 5 GWAS reports, we linked a total of 843 haplotypes to addiction susceptibility. We mapped the SNPs in these haplotypes to functional and regulatory elements in the genome and estimated the magnitude of the contributions of different molecular mechanisms to their effects on addiction susceptibility. In addition to SNPs in coding regions, these data suggest that haplotypes in gene regulatory regions may also contribute to addiction susceptibility. When we compared the lists of genes identified by association studies and those identified by molecular biological studies of drug-regulated genes, we observed significantly higher participation in the same gene interaction networks than expected by chance, despite little overlap between the two gene lists. Conclusions These results appear to offer new insights into the genetic factors underlying drug addiction. PMID:21999673
Genome-wide significant localization for working and spatial memory: Identifying genes for psychosis using models of cognition.

PubMed

Knowles, Emma E M; Carless, Melanie A; de Almeida, Marcio A A; Curran, Joanne E; McKay, D Reese; Sprooten, Emma; Dyer, Thomas D; Göring, Harald H; Olvera, Rene; Fox, Peter; Almasy, Laura; Duggirala, Ravi; Kent, Jack W; Blangero, John; Glahn, David C

2014-01-01

It is well established that risk for developing psychosis is largely mediated by the influence of genes, but identifying precisely which genes underlie that risk has been problematic. Focusing on endophenotypes, rather than illness risk, is one solution to this problem. Impaired cognition is a well-established endophenotype of psychosis. Here we aimed to characterize the genetic architecture of cognition using phenotypically detailed models as opposed to relying on general IQ or individual neuropsychological measures. In so doing we hoped to identify genes that mediate cognitive ability, which might also contribute to psychosis risk. Hierarchical factor models of genetically clustered cognitive traits were subjected to linkage analysis followed by QTL region-specific association analyses in a sample of 1,269 Mexican American individuals from extended pedigrees. We identified four genome wide significant QTLs, two for working and two for spatial memory, and a number of plausible and interesting candidate genes. The creation of detailed models of cognition seemingly enhanced the power to detect genetic effects on cognition and provided a number of possible candidate genes for psychosis. © 2013 Wiley Periodicals, Inc.
A novel truncation mutation in CRYBB1 associated with autosomal dominant congenital cataract with nystagmus.

PubMed

Rao, Yan; Dong, Sufang; Li, Zuhua; Yang, Guohua; Peng, Chunyan; Yan, Ming; Zheng, Fang

2017-01-01

To identify the potential candidate genes for a large Chinese family with autosomal dominant congenital cataract (ADCC) and nystagmus, and investigate the possible molecular mechanism underlying the role of the candidate genes in cataractogenesis. We combined the linkage analysis and direct sequencing for the candidate genes in the linkage regions to identify the causative mutation. The molecular and bio-functional properties of the proteins encoded by the candidate genes was further explored with biophysical and biochemical studies of the recombinant wild-type and mutant proteins. We identified a c. C749T (p.Q227X) transversion in exon 6 of CRYBB1 , a cataract-causative gene. This nonsense mutation changes a phylogenetically conserved glutamine to a stop codon and is predicted to truncate the C-terminus of the wild-type protein by 26 amino acids. Comparison of the biophysical and biochemical properties of the recombinant full-length and truncated βB1-crystallins revealed that the mutation led to the insolubility and the phase separation phenomenon of the truncated protein with a changed conformation. Meanwhile, the thermal stability of the truncated βB1-crystallin was significantly decreased, and the mutation diminished the chaperoning ability of αA-crystallin with the mutant under heating stress. Our findings highlight the importance of the C-terminus in βB1-crystallin in maintaining the crystalline function and stability, and provide a novel insight into the molecular mechanism underlying the pathogenesis of human autosomal dominant congenital cataract.
Assays for the Identification and Prioritization of Drug Candidates for Spinal Muscular Atrophy

PubMed Central

Cherry, Jonathan J.; Kobayashi, Dione T.; Lynes, Maureen M.; Naryshkin, Nikolai N.; Tiziano, Francesco Danilo; Zaworski, Phillip G.; Rubin, Lee L.

2014-01-01

Abstract Spinal muscular atrophy (SMA) is an autosomal recessive genetic disorder resulting in degeneration of α-motor neurons of the anterior horn and proximal muscle weakness. It is the leading cause of genetic mortality in children younger than 2 years. It affects ∼1 in 11,000 live births. In 95% of cases, SMA is caused by homozygous deletion of the SMN1 gene. In addition, all patients possess at least one copy of an almost identical gene called SMN2. A single point mutation in exon 7 of the SMN2 gene results in the production of low levels of full-length survival of motor neuron (SMN) protein at amounts insufficient to compensate for the loss of the SMN1 gene. Although no drug treatments are available for SMA, a number of drug discovery and development programs are ongoing, with several currently in clinical trials. This review describes the assays used to identify candidate drugs for SMA that modulate SMN2 gene expression by various means. Specifically, it discusses the use of high-throughput screening to identify candidate molecules from primary screens, as well as the technical aspects of a number of widely used secondary assays to assess SMN messenger ribonucleic acid (mRNA) and protein expression, localization, and function. Finally, it describes the process of iterative drug optimization utilized during preclinical SMA drug development to identify clinical candidates for testing in human clinical trials. PMID:25147906
Genetics of alcoholism.

PubMed

Edenberg, Howard J; Foroud, Tatiana

2014-01-01

Multiple lines of evidence strongly indicate that genetic factors contribute to the risk for alcohol use disorders (AUD). There is substantial heterogeneity in AUD, which complicates studies seeking to identify specific genetic factors. To identify these genetic effects, several different alcohol-related phenotypes have been analyzed, including diagnosis and quantitative measures related to AUDs. Study designs have used candidate gene analyses, genetic linkage studies, genomewide association studies (GWAS), and analyses of rare variants. Two genes that encode enzymes of alcohol metabolism have the strongest effect on AUD: aldehyde dehydrogenase 2 and alcohol dehydrogenase 1B each has strongly protective variants that reduce risk, with odds ratios approximately 0.2-0.4. A number of other genes important in AUD have been identified and replicated, including GABRA2 and alcohol dehydrogenases 1B and 4. GWAS have identified additional candidates. Rare variants are likely also to play a role; studies of these are just beginning. A multifaceted approach to gene identification, targeting both rare and common variations and assembling much larger datasets for meta-analyses, is critical for identifying the key genes and pathways important in AUD. © 2014 Elsevier B.V. All rights reserved.
Quantitative trait loci affecting the 3D skull shape and size in mouse and prioritization of candidate genes in-silico

PubMed Central

Maga, A. Murat; Navarro, Nicolas; Cunningham, Michael L.; Cox, Timothy C.

2015-01-01

We describe the first application of high-resolution 3D micro-computed tomography, together with 3D landmarks and geometric morphometrics, to map QTL responsible for variation in skull shape and size using a backcross between C57BL/6J and A/J inbred strains. Using 433 animals, 53 3D landmarks, and 882 SNPs from autosomes, we identified seven QTL responsible for the skull size (SCS.qtl) and 30 QTL responsible for the skull shape (SSH.qtl). Size, sex, and direction-of-cross were all significant factors and included in the analysis as covariates. All autosomes harbored at least one SSH.qtl, sometimes up to three. Effect sizes of SSH.qtl appeared to be small, rarely exceeding 1% of the overall shape variation. However, they account for significant amount of variation in some specific directions of the shape space. Many QTL have stronger effect on the neurocranium than expected from a random vector that will parcellate uniformly across the four cranial regions. On the contrary, most of QTL have an effect on the palate weaker than expected. Combined interval length of 30 SSH.qtl was about 315 MB and contained 2476 known protein coding genes. We used a bioinformatics approach to filter these candidate genes and identified 16 high-priority candidates that are likely to play a role in the craniofacial development and disorders. Thus, coupling the QTL mapping approach in model organisms with candidate gene enrichment approaches appears to be a feasible way to identify high-priority candidates genes related to the structure or tissue of interest. PMID:25859222
A novel gammaretroviral shuttle vector insertional mutagenesis screen identifies SHARPIN as a breast cancer metastasis gene and prognostic biomarker.

PubMed

Bii, Victor M; Rae, Dustin T; Trobridge, Grant D

2015-11-24

Breast cancer (BC) is the second leading cause of malignancy among U.S. women. Metastasis results in a poor prognosis and increased mortality, but the molecular mechanisms by which metastatic tumors occur are not well understood. Identifying the genes that drive the metastatic process could provide targets for improved therapy and biomarkers to improve BC patient outcomes. Using a forward mutagenesis screen, BC cells mutagenized with a replication-incompetent gammaretroviral vector (γRV) were xenotransplanted into the mammary fat pad of immunodeficient mice. In this approach the vector provirus dysregulates nearby genes, providing a selective advantage to transduced cells to form metastases. Metastatic tumors were analyzed for proviral integration sites to identify nearby candidate metastasis genes. The γRV has a transgene cassette that allows for rescue in bacteria and rapid identification of vector integration sites. Using this approach, we identified the previously described metastasis gene WWTR1 (TAZ), and three other novel candidate metastasis genes including SHARPIN. SHARPIN was independently validated in vivo as a BC metastasis gene. Analysis of patient data showed that SHARPIN expression predicts metastasis-free survival after adjuvant therapy. Our approach has broad potential to identify genes involved in oncogenic processes for BC and other cancers. We show here it can identify both known (WWTR1) and novel (SHARPIN) BC metastasis genes.
An integrative, translational approach to understanding rare and orphan genetically based diseases

PubMed Central

Hoehndorf, Robert; Schofield, Paul N.; Gkoutos, Georgios V.

2013-01-01

PhenomeNet is an approach for integrating phenotypes across species and identifying candidate genes for genetic diseases based on the similarity between a disease and animal model phenotypes. In contrast to ‘guilt-by-association’ approaches, PhenomeNet relies exclusively on the comparison of phenotypes to suggest candidate genes, and can, therefore, be applied to study the molecular basis of rare and orphan diseases for which the molecular basis is unknown. In addition to disease phenotypes from the Online Mendelian Inheritance in Man (OMIM) database, we have now integrated the clinical signs from Orphanet into PhenomeNet. We demonstrate that our approach can efficiently identify known candidate genes for genetic diseases in Orphanet and OMIM. Furthermore, we find evidence that mutations in the HIP1 gene might cause Bassoe syndrome, a rare disorder with unknown genetic aetiology. Our results demonstrate that integration and computational analysis of human disease and animal model phenotypes using PhenomeNet has the potential to reveal novel insights into the pathobiology underlying genetic diseases. PMID:23853703
PINTA: a web server for network-based gene prioritization from expression data

PubMed Central

Nitsch, Daniela; Tranchevent, Léon-Charles; Gonçalves, Joana P.; Vogt, Josef Korbinian; Madeira, Sara C.; Moreau, Yves

2011-01-01

PINTA (available at http://www.esat.kuleuven.be/pinta/; this web site is free and open to all users and there is no login requirement) is a web resource for the prioritization of candidate genes based on the differential expression of their neighborhood in a genome-wide protein–protein interaction network. Our strategy is meant for biological and medical researchers aiming at identifying novel disease genes using disease specific expression data. PINTA supports both candidate gene prioritization (starting from a user defined set of candidate genes) as well as genome-wide gene prioritization and is available for five species (human, mouse, rat, worm and yeast). As input data, PINTA only requires disease specific expression data, whereas various platforms (e.g. Affymetrix) are supported. As a result, PINTA computes a gene ranking and presents the results as a table that can easily be browsed and downloaded by the user. PMID:21602267
Novel Genes Affecting the Interaction between the Cabbage Whitefly and Arabidopsis Uncovered by Genome-Wide Association Mapping

PubMed Central

Broekgaarden, Colette; Bucher, Johan; Bac-Molenaar, Johanna; Keurentjes, Joost J. B.; Kruijer, Willem; Voorrips, Roeland E.; Vosman, Ben

2015-01-01

Plants have evolved a variety of ways to defend themselves against biotic attackers. This has resulted in the presence of substantial variation in defense mechanisms among plants, even within a species. Genome-wide association (GWA) mapping is a useful tool to study the genetic architecture of traits, but has so far only had limited exploitation in studies of plant defense. Here, we study the genetic architecture of defense against the phloem-feeding insect cabbage whitefly (Aleyrodes proletella) in Arabidopsis thaliana. We determined whitefly performance, i.e. the survival and reproduction of whitefly females, on 360 worldwide selected natural accessions and subsequently performed GWA mapping using 214,051 SNPs. Substantial variation for whitefly adult survival and oviposition rate (number of eggs laid per female per day) was observed between the accessions. We identified 39 candidate SNPs for either whitefly adult survival or oviposition rate, all with relatively small effects, underpinning the complex architecture of defense traits. Among the corresponding candidate genes, i.e. genes in linkage disequilibrium (LD) with candidate SNPs, none have previously been identified as a gene playing a role in the interaction between plants and phloem-feeding insects. Whitefly performance on knock-out mutants of a number of candidate genes was significantly affected, validating the potential of GWA mapping for novel gene discovery in plant-insect interactions. Our results show that GWA analysis is a very useful tool to gain insight into the genetic architecture of plant defense against herbivorous insects, i.e. we identified and validated several genes affecting whitefly performance that have not previously been related to plant defense against herbivorous insects. PMID:26699853
Candidate gene biodosimetry markers of exposure to external ionizing radiation in human blood: A systematic review

PubMed Central

Sima, Chao; Amundson, Sally A.; Zenhausern, Frederic

2018-01-01

Purpose To compile a list of genes that have been reported to be affected by external ionizing radiation (IR) and to assess their performance as candidate biomarkers for individual human radiation dosimetry. Methods Eligible studies were identified through extensive searches of the online databases from 1978 to 2017. Original English-language publications of microarray studies assessing radiation-induced changes in gene expression levels in human blood after external IR were included. Genes identified in at least half of the selected studies were retained for bio-statistical analysis in order to evaluate their diagnostic ability. Results 24 studies met the criteria and were included in this study. Radiation-induced expression of 10,170 unique genes was identified and the 31 genes that have been identified in at least 50% of studies (12/24 studies) were selected for diagnostic power analysis. Twenty-seven genes showed a significant Spearman’s correlation with radiation dose. Individually, TNFSF4, FDXR, MYC, ZMAT3 and GADD45A provided the best discrimination of radiation dose < 2 Gy and dose ≥ 2 Gy according to according to their maximized Youden’s index (0.67, 0.55, 0.55, 0.55 and 0.53 respectively). Moreover, 12 combinations of three genes display an area under the Receiver Operating Curve (ROC) curve (AUC) = 1 reinforcing the concept of biomarker combinations instead of looking for an ideal and unique biomarker. Conclusion Gene expression is a promising approach for radiation dosimetry assessment. A list of robust candidate biomarkers has been identified from analysis of the studies published to date, confirming for example the potential of well-known genes such as FDXR and TNFSF4 or highlighting other promising gene such as ZMAT3. However, heterogeneity in protocols and analysis methods will require additional studies to confirm these results. PMID:29879226
Fine-mapping of qGW4.05, a major QTL for kernel weight and size in maize.

PubMed

Chen, Lin; Li, Yong-xiang; Li, Chunhui; Wu, Xun; Qin, Weiwei; Li, Xin; Jiao, Fuchao; Zhang, Xiaojing; Zhang, Dengfeng; Shi, Yunsu; Song, Yanchun; Li, Yu; Wang, Tianyu

2016-04-12

Kernel weight and size are important components of grain yield in cereals. Although some information is available concerning the map positions of quantitative trait loci (QTL) for kernel weight and size in maize, little is known about the molecular mechanisms of these QTLs. qGW4.05 is a major QTL that is associated with kernel weight and size in maize. We combined linkage analysis and association mapping to fine-map and identify candidate gene(s) at qGW4.05. QTL qGW4.05 was fine-mapped to a 279.6-kb interval in a segregating population derived from a cross of Huangzaosi with LV28. By combining the results of regional association mapping and linkage analysis, we identified GRMZM2G039934 as a candidate gene responsible for qGW4.05. Candidate gene-based association mapping was conducted using a panel of 184 inbred lines with variable kernel weights and kernel sizes. Six polymorphic sites in the gene GRMZM2G039934 were significantly associated with kernel weight and kernel size. The results of linkage analysis and association mapping revealed that GRMZM2G039934 is the most likely candidate gene for qGW4.05. These results will improve our understanding of the genetic architecture and molecular mechanisms underlying kernel development in maize.
LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network.

PubMed

Wei, Pi-Jing; Zhang, Di; Xia, Junfeng; Zheng, Chun-Hou

2016-12-23

Cancer is a complex disease which is characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of the next-generation sequencing technology, multiple omics data, such as cancer genomic, epigenomic and transcriptomic data etc., can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contributes to tumorigenesis, from millions of functional neutral passenger mutations. In this paper, in order to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. Then we integrated the mutation data and expression data into the gene-gene interaction network. Lastly, greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. Also, it can identify not only frequently mutated drivers, but also rare candidate driver genes.
Synteny analysis of genes and distribution of loci controlling oil content and fatty acid profile based on QTL alignment map in Brassica napus.

PubMed

Raboanatahiry, Nadia; Chao, Hongbo; Guo, Liangxing; Gan, Jianping; Xiang, Jun; Yan, Mingli; Zhang, Libin; Yu, Longjiang; Li, Maoteng

2017-10-12

Deciphering the genetic architecture of a species is a good way to understand its evolutionary history, but also to tailor its profile for breeding elite cultivars with desirable traits. Aligning QTLs from diverse population in one map and utilizing it for comparison, but also as a basis for multiple analyses assure a stronger evidence to understand the genetic system related to a given phenotype. In this study, 439 genes involved in fatty acid (FA) and triacylglycerol (TAG) biosyntheses were identified in Brassica napus. B. napus genome showed mixed gene loss and insertion compared to B. rapa and B. oleracea, and C genome had more inserted genes. Identified QTLs for oil (OC-QTLs) and fatty acids (FA-QTLs) from nine reported populations were projected on the physical map of the reference genome "Darmor-bzh" to generate a map. Thus, 335 FA-QTLs and OC-QTLs could be highlighted and 82 QTLs were overlapping. Chromosome C3 contained 22 overlapping QTLs with all trait studied except for C18:3. In total, 218 candidate genes which were potentially involved in FA and TAG were identified in 162 QTLs confidence intervals and some of them might affect many traits. Also, 76 among these candidate genes were found inside 57 overlapping QTLs, and candidate genes for oil content were in majority (61/76 genes). Then, sixteen genes were found in overlapping QTLs involving three populations, and the remaining 60 genes were found in overlapping QTLs of two populations. Interaction network and pathway analysis of these candidate genes indicated ten genes that might have strong influence over the other genes that control fatty acids and oil formation. The present results provided new information for genetic basis of FA and TAG formation in B. napus. A map including QTLs from numerous populations was built, which could serve as reference to study the genome profile of B. napus, and new potential genes emerged which might affect seed oil. New useful tracks were showed for the selection of population or/and selection of interesting genes for breeding improvement purpose.
Genetic Basis of Variation in Rice Seed Storage Protein (Albumin, Globulin, Prolamin, and Glutelin) Content Revealed by Genome-Wide Association Analysis.

PubMed

Chen, Pingli; Shen, Zhikang; Ming, Luchang; Li, Yibo; Dan, Wenhan; Lou, Guangming; Peng, Bo; Wu, Bian; Li, Yanhua; Zhao, Da; Gao, Guanjun; Zhang, Qinglu; Xiao, Jinghua; Li, Xianghua; Wang, Gongwei; He, Yuqing

2018-01-01

Rice seed storage protein (SSP) is an important source of nutrition and energy. Understanding the genetic basis of SSP content and mining favorable alleles that control it will be helpful for breeding new improved cultivars. An association analysis for SSP content was performed to identify underlying genes using 527 diverse Oryza sativa accessions grown in two environments. We identified more than 107 associations for five different traits, including the contents of albumin (Alb), globulin (Glo), prolamin (Pro), glutelin (Glu), and total SSP (Total). A total of 28 associations were located at previously reported QTLs or intervals. A lead SNP sf0709447538, associated for Glu content in the indica subpopulation in 2015, was further validated in near isogenic lines NIL(Zhenshan97) and NIL(Delong208), and the Glu phenotype had significantly difference between two NILs. The association region could be target for map-based cloning of the candidate genes. There were 13 associations in regions close to grain-quality-related genes; five lead single nucleotide polymorphisms (SNPs) were located less than 20 kb upstream from grain-quality-related genes ( PG5a , Wx , AGPS2a , RP6 , and, RM1 ). Several starch-metabolism-related genes ( AGPS2a , OsACS6 , PUL , GBSSII , and ISA2 ) were also associated with SSP content. We identified favorable alleles of functional candidate genes, such as RP6 , RM1 , Wx , and other four candidate genes by haplotype analysis and expression pattern. Genotypes of RP6 and RM1 with higher Pro were not identified in japonica and exhibited much higher expression levels in indica group. The lead SNP sf0601764762, repeatedly detected for Alb content in 2 years in the whole association population, was located in the Wx locus that controls the synthesis of amylose. And Alb content was significantly and negatively correlated with amylose content and the level of 2.3 kb Wx pre-mRNA examined in this study. The associations or candidate genes identified would provide new insights into the genetic basis of SSP content that will help in developing rice cultivars with improved grain nutritional quality through marker-assisted breeding.
Mapping QTLs for water-use efficiency reveals the potential candidate genes involved in regulating the trait in apple under drought stress.

PubMed

Wang, Haibo; Zhao, Shuang; Mao, Ke; Dong, Qinglong; Liang, Bowen; Li, Chao; Wei, Zhiwei; Li, Mingjun; Ma, Fengwang

2018-06-26

Improvement of water-use efficiency (WUE) can effectively reduce production losses caused by drought stress. A better understanding of the genetic determination of WUE in crops under drought stress has great potential value for developing cultivars adapted to arid regions. To identify the genetic loci associated with WUE and reveal genes responsible for the trait in apple, we aim to map the quantitative trait loci (QTLs) for carbon isotope composition, the proxy for WUE, applying two contrasting irrigating regimes over the two-year experiment and search for the candidate genes encompassed in the mapped QTLs. We constructed a high-density genetic linkage map with 10,172 markers of apple, using single nucleotide polymorphism (SNP) markers obtained through restriction site-associated DNA sequencing (RADseq) and a final segregating population of 350 seedlings from the cross of Honeycrisp and Qinguan. In total, 33 QTLs were identified for carbon isotope composition in apple under both well-watered and drought-stressed conditions. Three QTLs were stable over 2 years under drought stress on linkage groups LG8, LG15 and LG16, as validated by Kompetitive Allele-Specific PCR (KASP) assays. In those validated QTLs, 258 genes were screened according to their Gene Ontology functional annotations. Among them, 28 genes were identified, which exhibited significant responses to drought stress in 'Honeycrisp' and/or 'Qinguan'. These genes are involved in signaling, photosynthesis, response to stresses, carbohydrate metabolism, protein metabolism and modification, hormone metabolism and transport, transport, respiration, transcriptional regulation, and development regulation. They, especially those for photoprotection and relevant signal transduction, are potential candidate genes connected with WUE regulation in drought-stressed apple. We detected three stable QTLs for carbon isotope composition in apple under drought stress over 2 years, and validated them by KASP assay. Twenty-eight candidate genes encompassed in these QTLs were identified. These stable genetic loci and series of genes provided here serve as a foundation for further studies on marker-assisted selection of high WUE and regulatory mechanism of WUE in apple exposed to drought conditions, respectively.
Identifying the candidate genes involved in the calyx abscission process of 'Kuerlexiangli' (Pyrus sinkiangensis Yu) by digital transcript abundance measurements.

PubMed

Qi, Xiaoxiao; Wu, Jun; Wang, Lifen; Li, Leiting; Cao, Yufen; Tian, Luming; Dong, Xingguang; Zhang, Shaoling

2013-10-23

'Kuerlexiangli' (Pyrus sinkiangensis Yu), a native pear of Xinjiang, China, is an important agricultural fruit and primary export to the international market. However, fruit with persistent calyxes affect fruit shape and quality. Although several studies have looked into the physiological aspects of the calyx abscission process, the underlying molecular mechanisms remain unknown. In order to better understand the molecular basis of the process of calyx abscission, materials at three critical stages of regulation, with 6000 × Flusilazole plus 300 × PBO treatment (calyx abscising treatment) and 50 mg.L-1GA3 treatment (calyx persisting treatment), were collected and cDNA fragments were sequenced using digital transcript abundance measurements to identify candidate genes. Digital transcript abundance measurements was performed using high-throughput Illumina GAII sequencing on seven samples that were collected at three important stages of the calyx abscission process with chemical agent treatments promoting calyx abscission and persistence. Altogether more than 251,123,845 high quality reads were obtained with approximately 8.0 M raw data for each library. The values of 69.85%-71.90% of clean data in the digital transcript abundance measurements could be mapped to the pear genome database. There were 12,054 differentially expressed genes having Gene Ontology (GO) terms and associating with 251 Kyoto Encyclopedia of Genes and Genomes (KEGG) defined pathways. The differentially expressed genes correlated with calyx abscission were mainly involved in photosynthesis, plant hormone signal transduction, cell wall modification, transcriptional regulation, and carbohydrate metabolism. Furthermore, candidate calyx abscission-specific genes, e.g. Inflorescence deficient in abscission gene, were identified. Quantitative real-time PCR was used to confirm the digital transcript abundance measurements results. We identified candidate genes that showed highly dynamic changes in expression during the calyx abscission process. These genes are potential targets for future functional characterization and should be valuable for exploration of the mechanisms of calyx abscission, and eventually for developing methods based on small molecule application to induce calyx abscission in fruit production.
Quantitative Trait Loci Mapping in Brassica rapa Revealed the Structural and Functional Conservation of Genetic Loci Governing Morphological and Yield Component Traits in the A, B, and C Subgenomes of Brassica Species

PubMed Central

Li, Xiaonan; Ramchiary, Nirala; Dhandapani, Vignesh; Choi, Su Ryun; Hur, Yoonkang; Nou, Ill-Sup; Yoon, Moo Kyoung; Lim, Yong Pyo

2013-01-01

Brassica rapa is an important crop species that produces vegetables, oilseed, and fodder. Although many studies reported quantitative trait loci (QTL) mapping, the genes governing most of its economically important traits are still unknown. In this study, we report QTL mapping for morphological and yield component traits in B. rapa and comparative map alignment between B. rapa, B. napus, B. juncea, and Arabidopsis thaliana to identify candidate genes and conserved QTL blocks between them. A total of 95 QTL were identified in different crucifer blocks of the B. rapa genome. Through synteny analysis with A. thaliana, B. rapa candidate genes and intronic and exonic single nucleotide polymorphisms in the parental lines were detected from whole genome resequenced data, a few of which were validated by mapping them to the QTL regions. Semi-quantitative reverse transcriptase PCR analysis showed differences in the expression levels of a few genes in parental lines. Comparative mapping identified five key major evolutionarily conserved crucifer blocks (R, J, F, E, and W) harbouring QTL for morphological and yield components traits between the A, B, and C subgenomes of B. rapa, B. juncea, and B. napus. The information of the identified candidate genes could be used for breeding B. rapa and other related Brassica species. PMID:23223793
Network-based analysis of differentially expressed genes in cerebrospinal fluid (CSF) and blood reveals new candidate genes for multiple sclerosis

PubMed Central

Safari-Alighiarloo, Nahid; Taghizadeh, Mohammad; Tabatabaei, Seyyed Mohammad; Namaki, Saeed

2016-01-01

Background The involvement of multiple genes and missing heritability, which are dominant in complex diseases such as multiple sclerosis (MS), entail using network biology to better elucidate their molecular basis and genetic factors. We therefore aimed to integrate interactome (protein–protein interaction (PPI)) and transcriptomes data to construct and analyze PPI networks for MS disease. Methods Gene expression profiles in paired cerebrospinal fluid (CSF) and peripheral blood mononuclear cells (PBMCs) samples from MS patients, sampled in relapse or remission and controls, were analyzed. Differentially expressed genes which determined only in CSF (MS vs. control) and PBMCs (relapse vs. remission) separately integrated with PPI data to construct the Query-Query PPI (QQPPI) networks. The networks were further analyzed to investigate more central genes, functional modules and complexes involved in MS progression. Results The networks were analyzed and high centrality genes were identified. Exploration of functional modules and complexes showed that the majority of high centrality genes incorporated in biological pathways driving MS pathogenesis. Proteasome and spliceosome were also noticeable in enriched pathways in PBMCs (relapse vs. remission) which were identified by both modularity and clique analyses. Finally, STK4, RB1, CDKN1A, CDK1, RAC1, EZH2, SDCBP genes in CSF (MS vs. control) and CDC37, MAP3K3, MYC genes in PBMCs (relapse vs. remission) were identified as potential candidate genes for MS, which were the more central genes involved in biological pathways. Discussion This study showed that network-based analysis could explicate the complex interplay between biological processes underlying MS. Furthermore, an experimental validation of candidate genes can lead to identification of potential therapeutic targets. PMID:28028462
Cross-platform method for identifying candidate network biomarkers for prostate cancer.

PubMed

Jin, G; Zhou, X; Cui, K; Zhang, X-S; Chen, L; Wong, S T C

2009-11-01

Discovering biomarkers using mass spectrometry (MS) and microarray expression profiles is a promising strategy in molecular diagnosis. Here, the authors proposed a new pipeline for biomarker discovery that integrates disease information for proteins and genes, expression profiles in both genomic and proteomic levels, and protein-protein interactions (PPIs) to discover high confidence network biomarkers. Using this pipeline, a total of 474 molecules (genes and proteins) related to prostate cancer were identified and a prostate-cancer-related network (PCRN) was derived from the integrative information. Thus, a set of candidate network biomarkers were identified from multiple expression profiles composed by eight microarray datasets and one proteomics dataset. The network biomarkers with PPIs can accurately distinguish the prostate patients from the normal ones, which potentially provide more reliable hits of biomarker candidates than conventional biomarker discovery methods.

Candidate innate immune system gene expression in the ecological model Daphnia

PubMed Central

Decaestecker, Ellen; Labbé, Pierrick; Ellegaard, Kirsten; Allen, Judith E.; Little, Tom J.

2011-01-01

The last ten years have witnessed increasing interest in host–pathogen interactions involving invertebrate hosts. The invertebrate innate immune system is now relatively well characterised, but in a limited range of genetic model organisms and under a limited number of conditions. Immune systems have been little studied under real-world scenarios of environmental variation and parasitism. Thus, we have investigated expression of candidate innate immune system genes in the water flea Daphnia, a model organism for ecological genetics, and whose capacity for clonal reproduction facilitates an exceptionally rigorous control of exposure dose or the study of responses at many time points. A unique characteristic of the particular Daphnia clones and pathogen strain combinations used presently is that they have been shown to be involved in specific host–pathogen coevolutionary interactions in the wild. We choose five genes, which are strong candidates to be involved in Daphnia–pathogen interactions, given that they have been shown to code for immune effectors in related organisms. Differential expression of these genes was quantified by qRT-PCR following exposure to the bacterial pathogen Pasteuria ramosa. Constitutive expression levels differed between host genotypes, and some genes appeared to show correlated expression. However, none of the genes appeared to show a major modification of expression level in response to Pasteuria exposure. By applying knowledge from related genetic model organisms (e.g. Drosophila) to models for the study of evolutionary ecology and coevolution (i.e. Daphnia), the candidate gene approach is temptingly efficient. However, our results show that detection of only weak patterns is likely if one chooses target genes for study based on previously identified genome sequences by comparison to homologues from other related organisms. Future work on the Daphnia–Pasteuria system will need to balance a candidate gene approach with more comprehensive approaches to de novo identify immune system genes specific to the Daphnia–Pasteuria interaction. PMID:21550363
Candidate innate immune system gene expression in the ecological model Daphnia.

PubMed

Decaestecker, Ellen; Labbé, Pierrick; Ellegaard, Kirsten; Allen, Judith E; Little, Tom J

2011-10-01

The last ten years have witnessed increasing interest in host-pathogen interactions involving invertebrate hosts. The invertebrate innate immune system is now relatively well characterised, but in a limited range of genetic model organisms and under a limited number of conditions. Immune systems have been little studied under real-world scenarios of environmental variation and parasitism. Thus, we have investigated expression of candidate innate immune system genes in the water flea Daphnia, a model organism for ecological genetics, and whose capacity for clonal reproduction facilitates an exceptionally rigorous control of exposure dose or the study of responses at many time points. A unique characteristic of the particular Daphnia clones and pathogen strain combinations used presently is that they have been shown to be involved in specific host-pathogen coevolutionary interactions in the wild. We choose five genes, which are strong candidates to be involved in Daphnia-pathogen interactions, given that they have been shown to code for immune effectors in related organisms. Differential expression of these genes was quantified by qRT-PCR following exposure to the bacterial pathogen Pasteuria ramosa. Constitutive expression levels differed between host genotypes, and some genes appeared to show correlated expression. However, none of the genes appeared to show a major modification of expression level in response to Pasteuria exposure. By applying knowledge from related genetic model organisms (e.g. Drosophila) to models for the study of evolutionary ecology and coevolution (i.e. Daphnia), the candidate gene approach is temptingly efficient. However, our results show that detection of only weak patterns is likely if one chooses target genes for study based on previously identified genome sequences by comparison to homologues from other related organisms. Future work on the Daphnia-Pasteuria system will need to balance a candidate gene approach with more comprehensive approaches to de novo identify immune system genes specific to the Daphnia-Pasteuria interaction. Copyright © 2011 Elsevier Ltd. All rights reserved.
Genome-Wide Association Study in African Americans with Acute Respiratory Distress Syndrome Identifies the Selectin P Ligand Gene as a Risk Factor.

PubMed

Bime, Christian; Pouladi, Nima; Sammani, Saad; Batai, Ken; Casanova, Nancy; Zhou, Tong; Kempf, Carrie L; Sun, Xiaoguang; Camp, Sara M; Wang, Ting; Kittles, Rick A; Lussier, Yves A; Jones, Tiffanie K; Reilly, John P; Meyer, Nuala J; Christie, Jason D; Karnes, Jason H; Gonzalez-Garay, Manuel; Christiani, David C; Yates, Charles R; Wurfel, Mark M; Meduri, Gianfranco U; Garcia, Joe G N

2018-06-01

Genetic factors are involved in acute respiratory distress syndrome (ARDS) susceptibility. Identification of novel candidate genes associated with increased risk and severity will improve our understanding of ARDS pathophysiology and enhance efforts to develop novel preventive and therapeutic approaches. To identify genetic susceptibility targets for ARDS. A genome-wide association study was performed on 232 African American patients with ARDS and 162 at-risk control subjects. The Identify Candidate Causal SNPs and Pathways platform was used to infer the association of known gene sets with the top prioritized intragenic SNPs. Preclinical validation of SELPLG (selectin P ligand gene) was performed using mouse models of LPS- and ventilator-induced lung injury. Exonic variation within SELPLG distinguishing patients with ARDS from sepsis control subjects was confirmed in an independent cohort. Pathway prioritization analysis identified a nonsynonymous coding SNP (rs2228315) within SELPLG, encoding P-selectin glycoprotein ligand 1, to be associated with increased susceptibility. In an independent cohort, two exonic SELPLG SNPs were significantly associated with ARDS susceptibility. Additional support for SELPLG as an ARDS candidate gene was derived from preclinical ARDS models where SELPLG gene expression in lung tissues was significantly increased in both ventilator-induced (twofold increase) and LPS-induced (5.7-fold increase) murine lung injury models compared with controls. Furthermore, Selplg -/- mice exhibited significantly reduced LPS-induced inflammatory lung injury compared with wild-type C57/B6 mice. Finally, an antibody that neutralizes P-selectin glycoprotein ligand 1 significantly attenuated LPS-induced lung inflammation. These findings identify SELPLG as a novel ARDS susceptibility gene among individuals of European and African descent.
Targeted capture and resequencing of 1040 genes reveal environmentally driven functional variation in grey wolves.

PubMed

Schweizer, Rena M; Robinson, Jacqueline; Harrigan, Ryan; Silva, Pedro; Galverni, Marco; Musiani, Marco; Green, Richard E; Novembre, John; Wayne, Robert K

2016-01-01

In an era of ever-increasing amounts of whole-genome sequence data for individuals and populations, the utility of traditional single nucleotide polymorphisms (SNPs) array-based genome scans is uncertain. We previously performed a SNP array-based genome scan to identify candidate genes under selection in six distinct grey wolf (Canis lupus) ecotypes. Using this information, we designed a targeted capture array for 1040 genes, including all exons and flanking regions, as well as 5000 1-kb nongenic neutral regions, and resequenced these regions in 107 wolves. Selection tests revealed striking patterns of variation within candidate genes relative to noncandidate regions and identified potentially functional variants related to local adaptation. We found 27% and 47% of candidate genes from the previous SNP array study had functional changes that were outliers in sweed and bayenv analyses, respectively. This result verifies the use of genomewide SNP surveys to tag genes that contain functional variants between populations. We highlight nonsynonymous variants in APOB, LIPG and USH2A that occur in functional domains of these proteins, and that demonstrate high correlation with precipitation seasonality and vegetation. We find Arctic and High Arctic wolf ecotypes have higher numbers of genes under selection, which highlight their conservation value and heightened threat due to climate change. This study demonstrates that combining genomewide genotyping arrays with large-scale resequencing and environmental data provides a powerful approach to discern candidate functional variants in natural populations. © 2015 John Wiley & Sons Ltd.
Leishmania genome analysis and high-throughput immunological screening identifies tuzin as a novel vaccine candidate against visceral leishmaniasis.

PubMed

Lakshmi, Bhavana Sethu; Wang, Ruobing; Madhubala, Rentala

2014-06-24

Leishmaniasis is a neglected tropical disease caused by Leishmania species. It is a major health concern affecting 88 countries and threatening 350 million people globally. Unfortunately, there are no vaccines and there are limitations associated with the current therapeutic regimens for leishmaniasis. The emerging cases of drug-resistance further aggravate the situation, demanding rapid drug and vaccine development. The genome sequence of Leishmania, provides access to novel genes that hold potential as chemotherapeutic targets or vaccine candidates. In this study, we selected 19 antigenic genes from about 8000 common Leishmania genes based on the Leishmania major and Leishmania infantum genome information available in the pathogen databases. Potential vaccine candidates thus identified were screened using an in vitro high throughput immunological platform developed in the laboratory. Four candidate genes coding for tuzin, flagellar glycoprotein-like protein (FGP), phospholipase A1-like protein (PLA1) and potassium voltage-gated channel protein (K VOLT) showed a predominant protective Th1 response over disease exacerbating Th2. We report the immunogenic properties and protective efficacy of one of the four antigens, tuzin, as a DNA vaccine against Leishmania donovani challenge. Our results show that administration of tuzin DNA protected BALB/c mice against L. donovani challenge and that protective immunity was associated with higher levels of IFN-γ and IL-12 production in comparison to IL-4 and IL-10. Our study presents a simple approach to rapidly identify potential vaccine candidates using the exhaustive information stored in the genome and an in vitro high-throughput immunological platform. Copyright © 2014. Published by Elsevier Ltd.
Identification of Putative Transmembrane Proteins Involved in Salinity Tolerance in Chenopodium quinoa by Integrating Physiological Data, RNAseq, and SNP Analyses

PubMed Central

Schmöckel, Sandra M.; Lightfoot, Damien J.; Razali, Rozaimi; Tester, Mark; Jarvis, David E.

2017-01-01

Chenopodium quinoa (quinoa) is an emerging crop that produces nutritious grains with the potential to contribute to global food security. Quinoa can also grow on marginal lands, such as soils affected by high salinity. To identify candidate salt tolerance genes in the recently sequenced quinoa genome, we used a multifaceted approach integrating RNAseq analyses with comparative genomics and topology prediction. We identified 219 candidate genes by selecting those that were differentially expressed in response to salinity, were specific to or overrepresented in quinoa relative to other Amaranthaceae species, and had more than one predicted transmembrane domain. To determine whether these genes might underlie variation in salinity tolerance in quinoa and its close relatives, we compared the response to salinity stress in a panel of 21 Chenopodium accessions (14 C. quinoa, 5 C. berlandieri, and 2 C. hircinum). We found large variation in salinity tolerance, with one C. hircinum displaying the highest salinity tolerance. Using genome re-sequencing data from these accessions, we investigated single nucleotide polymorphisms and copy number variation (CNV) in the 219 candidate genes in accessions of contrasting salinity tolerance, and identified 15 genes that could contribute to the differences in salinity tolerance of these Chenopodium accessions. PMID:28680429
Knowledge Discovery in Biological Databases for Revealing Candidate Genes Linked to Complex Phenotypes.

PubMed

Hassani-Pak, Keywan; Rawlings, Christopher

2017-06-13

Genetics and "omics" studies designed to uncover genotype to phenotype relationships often identify large numbers of potential candidate genes, among which the causal genes are hidden. Scientists generally lack the time and technical expertise to review all relevant information available from the literature, from key model species and from a potentially wide range of related biological databases in a variety of data formats with variable quality and coverage. Computational tools are needed for the integration and evaluation of heterogeneous information in order to prioritise candidate genes and components of interaction networks that, if perturbed through potential interventions, have a positive impact on the biological outcome in the whole organism without producing negative side effects. Here we review several bioinformatics tools and databases that play an important role in biological knowledge discovery and candidate gene prioritization. We conclude with several key challenges that need to be addressed in order to facilitate biological knowledge discovery in the future.
Multi-gene panel testing in Korean patients with common genetic generalized epilepsy syndromes.

PubMed

Lee, Cha Gon; Lee, Jeehun; Lee, Munhyang

2018-01-01

Genetic heterogeneity of common genetic generalized epilepsy syndromes is frequently considered. The present study conducted a focused analysis of potential candidate or susceptibility genes for common genetic generalized epilepsy syndromes using multi-gene panel testing with next-generation sequencing. This study included patients with juvenile myoclonic epilepsy, juvenile absence epilepsy, and epilepsy with generalized tonic-clonic seizures alone. We identified pathogenic variants according to the American College of Medical Genetics and Genomics guidelines and identified susceptibility variants using case-control association analyses and family analyses for familial cases. A total of 57 patients were enrolled, including 51 sporadic cases and 6 familial cases. Twenty-two pathogenic and likely pathogenic variants of 16 different genes were identified. CACNA1H was the most frequently observed single gene. Variants of voltage-gated Ca2+ channel genes, including CACNA1A, CACNA1G, and CACNA1H were observed in 32% of variants (n = 7/22). Analyses to identify susceptibility variants using case-control association analysis indicated that KCNMA1 c.400G>C was associated with common genetic generalized epilepsy syndromes. Only 1 family (family A) exhibited a candidate pathogenic variant p.(Arg788His) on CACNA1H, as determined via family analyses. This study identified candidate genetic variants in about a quarter of patients (n = 16/57) and an average of 2.8 variants was identified in each patient. The results reinforced the polygenic disorder with very high locus and allelic heterogeneity of common GGE syndromes. Further, voltage-gated Ca2+ channels are suggested as important contributors to common genetic generalized epilepsy syndromes. This study extends our comprehensive understanding of common genetic generalized epilepsy syndromes.
Association Analysis Suggests SOD2 as a Newly Identified Candidate Gene Associated With Leprosy Susceptibility.

PubMed

Ramos, Geovana Brotto; Salomão, Heloisa; Francio, Angela Schneider; Fava, Vinícius Medeiros; Werneck, Renata Iani; Mira, Marcelo Távora

2016-08-01

Genetic studies have identified several genes and genomic regions contributing to the control of host susceptibility to leprosy. Here, we test variants of the positional and functional candidate gene SOD2 for association with leprosy in 2 independent population samples. Family-based analysis revealed an association between leprosy and allele G of marker rs295340 (P = .042) and borderline evidence of an association between leprosy and alleles C and A of markers rs4880 (P = .077) and rs5746136 (P = .071), respectively. Findings were validated in an independent case-control sample for markers rs295340 (P = .049) and rs4880 (P = .038). These results suggest SOD2 as a newly identified gene conferring susceptibility to leprosy. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.
Pharmacological Validation of Candidate Causal Sleep Genes Identified in an N2 Cross

PubMed Central

Brunner, Joseph I.; Gotter, Anthony L.; Millstein, Joshua; Garson, Susan; Binns, Jacquelyn; Fox, Steven V.; Savitz, Alan T.; Yang, He S.; Fitzpatrick, Karrie; Zhou, Lili; Owens, Joseph R.; Webber, Andrea L.; Vitaterna, Martha H.; Kasarskis, Andrew; Uebele, Victor N.; Turek, Fred; Renger, John J.; Winrow, Christopher J.

2013-01-01

Despite the substantial impact of sleep disturbances on human health and the many years of study dedicated to understanding sleep pathologies, the underlying genetic mechanisms that govern sleep and wake largely remain unknown. Recently, we completed large scale genetic and gene expression analyses in a segregating inbred mouse cross and identified candidate causal genes that regulate the mammalian sleep-wake cycle, across multiple traits including total sleep time, amounts of REM, non-REM, sleep bout duration and sleep fragmentation. Here we describe a novel approach toward validating candidate causal genes, while also identifying potential targets for sleep-related indications. Select small molecule antagonists and agonists were used to interrogate candidate causal gene function in rodent sleep polysomnography assays to determine impact on overall sleep architecture and to evaluate alignment with associated sleep-wake traits. Significant effects on sleep architecture were observed in validation studies using compounds targeting the muscarinic acetylcholine receptor M3 subunit (Chrm3)(wake promotion), nicotinic acetylcholine receptor alpha4 subunit (Chrna4)(wake promotion), dopamine receptor D5 subunit (Drd5)(sleep induction), serotonin 1D receptor (Htr1d)(altered REM fragmentation), glucagon-like peptide-1 receptor (Glp1r)(light sleep promotion and reduction of deep sleep), and Calcium channel, voltage-dependent, T type, alpha 1I subunit (Cacna1i)(increased bout duration slow wave sleep). Taken together, these results show the complexity of genetic components that regulate sleep-wake traits and highlight the importance of evaluating this complex behavior at a systems level. Pharmacological validation of genetically identified putative targets provides a rapid alternative to generating knock out or transgenic animal models, and may ultimately lead towards new therapeutic opportunities. PMID:22091728
Evidence for the importance of personalized molecular profiling in pancreatic cancer.

PubMed

Lili, Loukia N; Matyunina, Lilya V; Walker, L DeEtte; Daneker, George W; McDonald, John F

2014-03-01

There is a growing body of evidence that targeted gene therapy holds great promise for the future treatment of cancer. A crucial step in this therapy is the accurate identification of appropriate candidate genes/pathways for targeted treatment. One approach is to identify variant genes/pathways that are significantly enriched in groups of afflicted individuals relative to control subjects. However, if there are multiple molecular pathways to the same cancer, the molecular determinants of the disease may be heterogeneous among individuals and possibly go undetected by group analyses. In an effort to explore this question in pancreatic cancer, we compared the most significantly differentially expressed genes/pathways between cancer and control patient samples as determined by group versus personalized analyses. We found little to no overlap between genes/pathways identified by gene expression profiling using group analyses relative to those identified by personalized analyses. Our results indicate that personalized and not group molecular profiling is the most appropriate approach for the identification of putative candidates for targeted gene therapy of pancreatic and perhaps other cancers with heterogeneous molecular etiology.
EBF factors drive expression of multiple classes of target genes governing neuronal development.

PubMed

Green, Yangsook S; Vetter, Monica L

2011-04-30

Early B cell factor (EBF) family members are transcription factors known to have important roles in several aspects of vertebrate neurogenesis, including commitment, migration and differentiation. Knowledge of how EBF family members contribute to neurogenesis is limited by a lack of detailed understanding of genes that are transcriptionally regulated by these factors. We performed a microarray screen in Xenopus animal caps to search for targets of EBF transcriptional activity, and identified candidate targets with multiple roles, including transcription factors of several classes. We determined that, among the most upregulated candidate genes with expected neuronal functions, most require EBF activity for some or all of their expression, and most have overlapping expression with ebf genes. We also found that the candidate target genes that had the most strongly overlapping expression patterns with ebf genes were predicted to be direct transcriptional targets of EBF transcriptional activity. The identification of candidate targets that are transcription factor genes, including nscl-1, emx1 and aml1, improves our understanding of how EBF proteins participate in the hierarchy of transcription control during neuronal development, and suggests novel mechanisms by which EBF activity promotes migration and differentiation. Other candidate targets, including pcdh8 and kcnk5, expand our knowledge of the types of terminal differentiated neuronal functions that EBF proteins regulate.
ICSNPathway: identify candidate causal SNPs and pathways from genome-wide association study by one analytical framework.

PubMed

Zhang, Kunlin; Chang, Suhua; Cui, Sijia; Guo, Liyuan; Zhang, Liuyan; Wang, Jing

2011-07-01

Genome-wide association study (GWAS) is widely utilized to identify genes involved in human complex disease or some other trait. One key challenge for GWAS data interpretation is to identify causal SNPs and provide profound evidence on how they affect the trait. Currently, researches are focusing on identification of candidate causal variants from the most significant SNPs of GWAS, while there is lack of support on biological mechanisms as represented by pathways. Although pathway-based analysis (PBA) has been designed to identify disease-related pathways by analyzing the full list of SNPs from GWAS, it does not emphasize on interpreting causal SNPs. To our knowledge, so far there is no web server available to solve the challenge for GWAS data interpretation within one analytical framework. ICSNPathway is developed to identify candidate causal SNPs and their corresponding candidate causal pathways from GWAS by integrating linkage disequilibrium (LD) analysis, functional SNP annotation and PBA. ICSNPathway provides a feasible solution to bridge the gap between GWAS and disease mechanism study by generating hypothesis of SNP → gene → pathway(s). The ICSNPathway server is freely available at http://icsnpathway.psych.ac.cn/.
An integration of genome-wide association study and gene expression profiling to prioritize the discovery of novel susceptibility Loci for osteoporosis-related traits.

PubMed

Hsu, Yi-Hsiang; Zillikens, M Carola; Wilson, Scott G; Farber, Charles R; Demissie, Serkalem; Soranzo, Nicole; Bianchi, Estelle N; Grundberg, Elin; Liang, Liming; Richards, J Brent; Estrada, Karol; Zhou, Yanhua; van Nas, Atila; Moffatt, Miriam F; Zhai, Guangju; Hofman, Albert; van Meurs, Joyce B; Pols, Huibert A P; Price, Roger I; Nilsson, Olle; Pastinen, Tomi; Cupples, L Adrienne; Lusis, Aldons J; Schadt, Eric E; Ferrari, Serge; Uitterlinden, André G; Rivadeneira, Fernando; Spector, Timothy D; Karasik, David; Kiel, Douglas P

2010-06-10

Osteoporosis is a complex disorder and commonly leads to fractures in elderly persons. Genome-wide association studies (GWAS) have become an unbiased approach to identify variations in the genome that potentially affect health. However, the genetic variants identified so far only explain a small proportion of the heritability for complex traits. Due to the modest genetic effect size and inadequate power, true association signals may not be revealed based on a stringent genome-wide significance threshold. Here, we take advantage of SNP and transcript arrays and integrate GWAS and expression signature profiling relevant to the skeletal system in cellular and animal models to prioritize the discovery of novel candidate genes for osteoporosis-related traits, including bone mineral density (BMD) at the lumbar spine (LS) and femoral neck (FN), as well as geometric indices of the hip (femoral neck-shaft angle, NSA; femoral neck length, NL; and narrow-neck width, NW). A two-stage meta-analysis of GWAS from 7,633 Caucasian women and 3,657 men, revealed three novel loci associated with osteoporosis-related traits, including chromosome 1p13.2 (RAP1A, p = 3.6x10(-8)), 2q11.2 (TBC1D8), and 18q11.2 (OSBPL1A), and confirmed a previously reported region near TNFRSF11B/OPG gene. We also prioritized 16 suggestive genome-wide significant candidate genes based on their potential involvement in skeletal metabolism. Among them, 3 candidate genes were associated with BMD in women. Notably, 2 out of these 3 genes (GPR177, p = 2.6x10(-13); SOX6, p = 6.4x10(-10)) associated with BMD in women have been successfully replicated in a large-scale meta-analysis of BMD, but none of the non-prioritized candidates (associated with BMD) did. Our results support the concept of our prioritization strategy. In the absence of direct biological support for identified genes, we highlighted the efficiency of subsequent functional characterization using publicly available expression profiling relevant to the skeletal system in cellular or whole animal models to prioritize candidate genes for further functional validation.
Fine mapping of RYMV3: a new resistance gene to Rice yellow mottle virus from Oryza glaberrima.

PubMed

Pidon, Hélène; Ghesquière, Alain; Chéron, Sophie; Issaka, Souley; Hébrard, Eugénie; Sabot, François; Kolade, Olufisayo; Silué, Drissa; Albar, Laurence

2017-04-01

A new resistance gene against Rice yellow mottle virus was identified and mapped in a 15-kb interval. The best candidate is a CC-NBS-LRR gene. Rice yellow mottle virus (RYMV) disease is a serious constraint to the cultivation of rice in Africa and selection for resistance is considered to be the most effective management strategy. The aim of this study was to characterize the resistance of Tog5307, a highly resistant accession belonging to the African cultivated rice species (Oryza glaberrima), that has none of the previously identified resistance genes to RYMV. The specificity of Tog5307 resistance was analyzed using 18 RYMV isolates. While three of them were able to infect Tog5307 very rapidly, resistance against the others was effective despite infection events attributed to resistance-breakdown or incomplete penetrance of the resistance. Segregation of resistance in an interspecific backcross population derived from a cross between Tog5307 and the susceptible Oryza sativa variety IR64 showed that resistance is dominant and is controlled by a single gene, named RYMV3. RYMV3 was mapped in an approximately 15-kb interval in which two candidate genes, coding for a putative transmembrane protein and a CC-NBS-LRR domain-containing protein, were annotated. Sequencing revealed non-synonymous polymorphisms between Tog5307 and the O. glaberrima susceptible accession CG14 in both candidate genes. An additional resistant O. glaberrima accession, Tog5672, was found to have the Tog5307 genotype for the CC-NBS-LRR gene but not for the putative transmembrane protein gene. Analysis of the cosegregation of Tog5672 resistance with the RYMV3 locus suggests that RYMV3 is also involved in Tog5672 resistance, thereby supporting the CC-NBS-LRR gene as the best candidate for RYMV3.
Ancestry-based stratified analysis of Immunochip data identifies novel associations with celiac disease.

PubMed

Garcia-Etxebarria, Koldo; Jauregi-Miguel, Amaia; Romero-Garmendia, Irati; Plaza-Izurieta, Leticia; Legarda, Maria; Irastorza, Iñaki; Bilbao, Jose Ramon

2016-12-01

To identify candidate genes in celiac disease (CD), we reanalyzed the whole Immunochip CD cohort using a different approach that clusters individuals based on immunoancestry prior to disease association analysis, rather than by geographical origin. We detected 636 new associated SNPs (P<7.02 × 10 -07 ) and identified 5 novel genomic regions, extended 8 others previously identified and also detected 18 isolated signals defined by one or very few significant SNPs. To test whether we could identify putative candidate genes, we performed expression analyses of several genes from the top novel region (chr2:134533564-136169524), from a previously identified locus that is now extended, and a gene marked by an isolated SNP, in duodenum biopsies of active and treated CD patients, and non-celiac controls. In the largest novel region, CCNT2 and R3HDM1 were constitutively underexpressed in disease, even after gluten removal. Moreover, several genes within this region were coexpressed in patients, but not in controls. Other novel genes like KIF21B, REL and SORD also showed altered expression in active disease. Apart from the identification of novel CD loci, these results suggest that ancestry-based stratified analysis is an efficient strategy for association studies in complex diseases.
Axon Regeneration Genes Identified by RNAi Screening in C. elegans

PubMed Central

Nix, Paola; Hammarlund, Marc; Hauth, Linda; Lachnit, Martina; Jorgensen, Erik M.

2014-01-01

Axons of the mammalian CNS lose the ability to regenerate soon after development due to both an inhibitory CNS environment and the loss of cell-intrinsic factors necessary for regeneration. The complex molecular events required for robust regeneration of mature neurons are not fully understood, particularly in vivo. To identify genes affecting axon regeneration in Caenorhabditis elegans, we performed both an RNAi-based screen for defective motor axon regeneration in unc-70/β-spectrin mutants and a candidate gene screen. From these screens, we identified at least 50 conserved genes with growth-promoting or growth-inhibiting functions. Through our analysis of mutants, we shed new light on certain aspects of regeneration, including the role of β-spectrin and membrane dynamics, the antagonistic activity of MAP kinase signaling pathways, and the role of stress in promoting axon regeneration. Many gene candidates had not previously been associated with axon regeneration and implicate new pathways of interest for therapeutic intervention. PMID:24403161
The Calcitonin Receptor Gene Is a Candidate for Regulation of Susceptibility to Herpes simplex Type 1 Neuronal Infection Leading to Encephalitis in Rat

PubMed Central

Abdelmagid, Nada; Bereczky-Veress, Biborka; Guerreiro-Cacais, André Ortlieb; Bergman, Petra; Luhr, Katarina M.; Bergström, Tomas; Sköldenberg, Birgit; Piehl, Fredrik

2012-01-01

Herpes simplex encephalitis (HSE) is a fatal infection of the central nervous system (CNS) predominantly caused by Herpes simplex virus type 1. Factors regulating the susceptibility to HSE are still largely unknown. To identify host gene(s) regulating HSE susceptibility we performed a genome-wide linkage scan in an intercross between the susceptible DA and the resistant PVG rat. We found one major quantitative trait locus (QTL), Hse1, on rat chromosome 4 (confidence interval 24.3–31 Mb; LOD score 29.5) governing disease susceptibility. Fine mapping of Hse1 using recombinants, haplotype mapping and sequencing, as well as expression analysis of all genes in the interval identified the calcitonin receptor gene (Calcr) as the main candidate, which also is supported by functional studies. Thus, using unbiased genetic approach variability in Calcr was identified as potentially critical for infection and viral spread to the CNS and subsequent HSE development. PMID:22761571
Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability

PubMed Central

Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S

2017-01-01

Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1–3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal–parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID. PMID:27457812
Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability.

PubMed

Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S

2017-11-01

Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1-3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal-parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID.

Mapping of Mcs30, a new mammary carcinoma susceptibility quantitative trait locus (QTL30) on rat chromosome 12: identification of fry as a candidate Mcs gene.

PubMed

Ren, Xuefeng; Graham, Jessica C; Jing, Lichen; Mikheev, Andrei M; Gao, Yuan; Lew, Jenny Pan; Xie, Hong; Kim, Andrea S; Shang, Xiuling; Friedman, Cynthia; Vail, Graham; Fang, Ming Zhu; Bromberg, Yana; Zarbl, Helmut

2013-01-01

Rat strains differ dramatically in their susceptibility to mammary carcinogenesis. On the assumption that susceptibility genes are conserved across mammalian species and hence inform human carcinogenesis, numerous investigators have used genetic linkage studies in rats to identify genes responsible for differential susceptibility to carcinogenesis. Using a genetic backcross between the resistant Copenhagen (Cop) and susceptible Fischer 344 (F344) strains, we mapped a novel mammary carcinoma susceptibility (Mcs30) locus to the centromeric region on chromosome 12 (LOD score of ∼8.6 at the D12Rat59 marker). The Mcs30 locus comprises approximately 12 Mbp on the long arm of rat RNO12 whose synteny is conserved on human chromosome 13q12 to 13q13. After analyzing numerous genes comprising this locus, we identified Fry, the rat ortholog of the furry gene of Drosophila melanogaster, as a candidate Mcs gene. We cloned and determined the complete nucleotide sequence of the 13 kbp Fry mRNA. Sequence analysis indicated that the Fry gene was highly conserved across evolution, with 90% similarity of the predicted amino acid sequence among eutherian mammals. Comparison of the Fry sequence in the Cop and F344 strains identified two non-synonymous single nucleotide polymorphisms (SNPs), one of which creates a putative, de novo phosphorylation site. Further analysis showed that the expression of the Fry gene is reduced in a majority of rat mammary tumors. Our results also suggested that FRY activity was reduced in human breast carcinoma cell lines as a result of reduced levels or mutation. This study is the first to identify the Fry gene as a candidate Mcs gene. Our data suggest that the SNPs within the Fry gene contribute to the genetic susceptibility of the F344 rat strain to mammary carcinogenesis. These results provide the foundation for analyzing the role of the human FRY gene in cancer susceptibility and progression.
Identification of a strawberry flavor gene candidate using an integrated genetic-genomic-analytical chemistry approach.

PubMed

Chambers, Alan H; Pillet, Jeremy; Plotto, Anne; Bai, Jinhe; Whitaker, Vance M; Folta, Kevin M

2014-04-17

There is interest in improving the flavor of commercial strawberry (Fragaria × ananassa) varieties. Fruit flavor is shaped by combinations of sugars, acids and volatile compounds. Many efforts seek to use genomics-based strategies to identify genes controlling flavor, and then designing durable molecular markers to follow these genes in breeding populations. In this report, fruit from two cultivars, varying for presence-absence of volatile compounds, along with segregating progeny, were analyzed using GC/MS and RNAseq. Expression data were bulked in silico according to presence/absence of a given volatile compound, in this case γ-decalactone, a compound conferring a peach flavor note to fruits. Computationally sorting reads in segregating progeny based on γ-decalactone presence eliminated transcripts not directly relevant to the volatile, revealing transcripts possibly imparting quantitative contributions. One candidate encodes an omega-6 fatty acid desaturase, an enzyme known to participate in lactone production in fungi, noted here as FaFAD1. This candidate was induced by ripening, was detected in certain harvests, and correlated with γ-decalactone presence. The FaFAD1 gene is present in every genotype where γ-decalactone has been detected, and it was invariably missing in non-producers. A functional, PCR-based molecular marker was developed that cosegregates with the phenotype in F1 and BC1 populations, as well as in many other cultivars and wild Fragaria accessions. Genetic, genomic and analytical chemistry techniques were combined to identify FaFAD1, a gene likely controlling a key flavor volatile in strawberry. The same data may now be re-sorted based on presence/absence of any other volatile to identify other flavor-affecting candidates, leading to rapid generation of gene-specific markers.
Identification of a strawberry flavor gene candidate using an integrated genetic-genomic-analytical chemistry approach

PubMed Central

2014-01-01

Background There is interest in improving the flavor of commercial strawberry (Fragaria × ananassa) varieties. Fruit flavor is shaped by combinations of sugars, acids and volatile compounds. Many efforts seek to use genomics-based strategies to identify genes controlling flavor, and then designing durable molecular markers to follow these genes in breeding populations. In this report, fruit from two cultivars, varying for presence-absence of volatile compounds, along with segregating progeny, were analyzed using GC/MS and RNAseq. Expression data were bulked in silico according to presence/absence of a given volatile compound, in this case γ-decalactone, a compound conferring a peach flavor note to fruits. Results Computationally sorting reads in segregating progeny based on γ-decalactone presence eliminated transcripts not directly relevant to the volatile, revealing transcripts possibly imparting quantitative contributions. One candidate encodes an omega-6 fatty acid desaturase, an enzyme known to participate in lactone production in fungi, noted here as FaFAD1. This candidate was induced by ripening, was detected in certain harvests, and correlated with γ-decalactone presence. The FaFAD1 gene is present in every genotype where γ-decalactone has been detected, and it was invariably missing in non-producers. A functional, PCR-based molecular marker was developed that cosegregates with the phenotype in F1 and BC1 populations, as well as in many other cultivars and wild Fragaria accessions. Conclusions Genetic, genomic and analytical chemistry techniques were combined to identify FaFAD1, a gene likely controlling a key flavor volatile in strawberry. The same data may now be re-sorted based on presence/absence of any other volatile to identify other flavor-affecting candidates, leading to rapid generation of gene-specific markers. PMID:24742080
Identification of candidate genes affecting Δ9-tetrahydrocannabinol biosynthesis in Cannabis sativa

PubMed Central

Marks, M. David; Tian, Li; Wenger, Jonathan P.; Omburo, Stephanie N.; Soto-Fuentes, Wilfredo; He, Ji; Gang, David R.; Weiblen, George D.; Dixon, Richard A.

2009-01-01

RNA isolated from the glands of a Δ9-tetrahydrocannabinolic acid (THCA)-producing strain of Cannabis sativa was used to generate a cDNA library containing over 100 000 expressed sequence tags (ESTs). Sequencing of over 2000 clones from the library resulted in the identification of over 1000 unigenes. Candidate genes for almost every step in the biochemical pathways leading from primary metabolites to THCA were identified. Quantitative PCR analysis suggested that many of the pathway genes are preferentially expressed in the glands. Hexanoyl-CoA, one of the metabolites required for THCA synthesis, could be made via either de novo fatty acids synthesis or via the breakdown of existing lipids. qPCR analysis supported the de novo pathway. Many of the ESTs encode transcription factors and two putative MYB genes were identified that were preferentially expressed in glands. Given the similarity of the Cannabis MYB genes to those in other species with known functions, these Cannabis MYBs may play roles in regulating gland development and THCA synthesis. Three candidates for the polyketide synthase (PKS) gene responsible for the first committed step in the pathway to THCA were characterized in more detail. One of these was identical to a previously reported chalcone synthase (CHS) and was found to have CHS activity. All three could use malonyl-CoA and hexanoyl-CoA as substrates, including the CHS, but reaction conditions were not identified that allowed for the production of olivetolic acid (the proposed product of the PKS activity needed for THCA synthesis). One of the PKS candidates was highly and specifically expressed in glands (relative to whole leaves) and, on the basis of these expression data, it is proposed to be the most likely PKS responsible for olivetolic acid synthesis in Cannabis glands. PMID:19581347
Quantitative Trait Loci for BMD in an SM/J by NZB/BlNJ Intercross Population and Identification of Trps1 as a Probable Candidate Gene

PubMed Central

Ishimori, Naoki; Stylianou, Ioannis M; Korstanje, Ron; Marion, Michael A; Li, Renhua; Donahue, Leah Rae; Rosen, Clifford J; Beamer, Wesley G; Paigen, Beverly; Churchill, Gary A

2008-01-01

Identification of genes that regulate BMD will enhance our understanding of osteoporosis and could provide novel molecular targets for treatment or prevention. We generated a mouse intercross population and carried out a quantitative trait locus (QTL) analysis of 143 female and 124 male F2 progeny from progenitor strains SM/J and NZB/BlNJ using whole body and vertebral areal BMD (aBMD) as measured by DXA. We found that both whole body and vertebral aBMD was affected by two loci on chromosome 9: one with a significant epistatic interaction on distal chromosome 8 and the other with a sex-specific effect. Two additional significant QTLs were identified on chromosome 12, and several suggestive ones were identified on chromosomes 5, 8, 15, and 19. The chromosome 9, 12, and 15 loci have been previously identified in other crosses. SNP-based haplotype analysis of the progenitor strains identified blocks within the QTL region that distinguish the low allele strains from the high allele strains, significantly narrowing the QTL region and reducing the possible candidate genes to 98 for chromosome 9, 31 for chromosome 12, and only 2 for chromosome 15. Trps1 is the most probable candidate gene for the chromosome 15 QTL. The sex-specific effects may help to elucidate the BMD differences between males and females. This study shows the power of statistical modeling to resolve linked QTLs and the use of haplotype analysis in narrowing the list of candidates. PMID:18442308
Partial genome assembly for a candidate division OP11 single cell from an anoxic spring (Zodletone Spring, Oklahoma).

PubMed

Youssef, Noha H; Blainey, Paul C; Quake, Stephen R; Elshahed, Mostafa S

2011-11-01

Members of candidate division OP11 are widely distributed in terrestrial and marine ecosystems, yet little information regarding their metabolic capabilities and ecological role within such habitats is currently available. Here, we report on the microfluidic isolation, multiple-displacement-amplification, pyrosequencing, and genomic analysis of a single cell (ZG1) belonging to candidate division OP11. Genome analysis of the ∼270-kb partial genome assembly obtained showed that it had no particular similarity to a specific phylum. Four hundred twenty-three open reading frames were identified, 46% of which had no function prediction. In-depth analysis revealed a heterotrophic lifestyle, with genes encoding endoglucanase, amylopullulanase, and laccase enzymes, suggesting a capacity for utilization of cellulose, starch, and, potentially, lignin, respectively. Genes encoding several glycolysis enzymes as well as formate utilization were identified, but no evidence for an electron transport chain was found. The presence of genes encoding various components of lipopolysaccharide biosynthesis indicates a Gram-negative bacterial cell wall. The partial genome also provides evidence for antibiotic resistance (β-lactamase, aminoglycoside phosphotransferase), as well as antibiotic production (bacteriocin) and extracellular bactericidal peptidases. Multiple mechanisms for stress response were identified, as were elements of type I and type IV secretion systems. Finally, housekeeping genes identified within the partial genome were used to demonstrate the OP11 affiliation of multiple hitherto unclassified genomic fragments from multiple database-deposited metagenomic data sets. These results provide the first glimpse into the lifestyle of a member of a ubiquitous, yet poorly understood bacterial candidate division.
Mapping eQTLs in the Norfolk Island Genetic Isolate Identifies Candidate Genes for CVD Risk Traits

PubMed Central

Benton, Miles C.; Lea, Rod A.; Macartney-Coxson, Donia; Carless, Melanie A.; Göring, Harald H.; Bellis, Claire; Hanna, Michelle; Eccles, David; Chambers, Geoffrey K.; Curran, Joanne E.; Harper, Jacquie L.; Blangero, John; Griffiths, Lyn R.

2013-01-01

Cardiovascular disease (CVD) affects millions of people worldwide and is influenced by numerous factors, including lifestyle and genetics. Expression quantitative trait loci (eQTLs) influence gene expression and are good candidates for CVD risk. Founder-effect pedigrees can provide additional power to map genes associated with disease risk. Therefore, we identified eQTLs in the genetic isolate of Norfolk Island (NI) and tested for associations between these and CVD risk factors. We measured genome-wide transcript levels of blood lymphocytes in 330 individuals and used pedigree-based heritability analysis to identify heritable transcripts. eQTLs were identified by genome-wide association testing of these transcripts. Testing for association between CVD risk factors (i.e., blood lipids, blood pressure, and body fat indices) and eQTLs revealed 1,712 heritable transcripts (p < 0.05) with heritability values ranging from 0.18 to 0.84. From these, we identified 200 cis-acting and 70 trans-acting eQTLs (p < 1.84 × 10−7) An eQTL-centric analysis of CVD risk traits revealed multiple associations, including 12 previously associated with CVD-related traits. Trait versus eQTL regression modeling identified four CVD risk candidates (NAAA, PAPSS1, NME1, and PRDX1), all of which have known biological roles in disease. In addition, we implicated several genes previously associated with CVD risk traits, including MTHFR and FN3KRP. We have successfully identified a panel of eQTLs in the NI pedigree and used this to implicate several genes in CVD risk. Future studies are required for further assessing the functional importance of these eQTLs and whether the findings here also relate to outbred populations. PMID:24314549
Schizophrenia, vitamin D, and brain development.

PubMed

Mackay-Sim, Alan; Féron, François; Eyles, Darryl; Burne, Thomas; McGrath, John

2004-01-01

Schizophrenia research is invigorated at present by the recent discovery of several plausible candidate susceptibility genes identified from genetic linkage and gene expression studies of brains from persons with schizophrenia. It is a current challenge to reconcile this gathering evidence for specific candidate susceptibility genes with the "neurodevelopmental hypothesis," which posits that schizophrenia arises from gene-environment interactions that disrupt brain development. We make the case here that schizophrenia may result not from numerous genes of small effect, but a few genes of transcriptional regulation acting during brain development. In particular we propose that low vitamin D during brain development interacts with susceptibility genes to alter the trajectory of brain development, probably by epigenetic regulation that alters gene expression throughout adult life. Vitamin D is an attractive "environmental" candidate because it appears to explain several key epidemiological features of schizophrenia. Vitamin D is an attractive "genetic" candidate because its nuclear hormone receptor regulates gene expression and nervous system development. The polygenic quality of schizophrenia, with linkage to many genes of small effect, maybe brought together via this "vitamin D hypothesis." We also discuss the possibility of a broader set of environmental and genetic factors interacting via the nuclear hormone receptors to affect the development of the brain leading to schizophrenia.
Multi-Dimensional Prioritization of Dental Caries Candidate Genes and Its Enriched Dense Network Modules

PubMed Central

Wang, Quan; Jia, Peilin; Cuenco, Karen T.; Feingold, Eleanor; Marazita, Mary L.; Wang, Lily; Zhao, Zhongming

2013-01-01

A number of genetic studies have suggested numerous susceptibility genes for dental caries over the past decade with few definite conclusions. The rapid accumulation of relevant information, along with the complex architecture of the disease, provides a challenging but also unique opportunity to review and integrate the heterogeneous data for follow-up validation and exploration. In this study, we collected and curated candidate genes from four major categories: association studies, linkage scans, gene expression analyses, and literature mining. Candidate genes were prioritized according to the magnitude of evidence related to dental caries. We then searched for dense modules enriched with the prioritized candidate genes through their protein-protein interactions (PPIs). We identified 23 modules comprising of 53 genes. Functional analyses of these 53 genes revealed three major clusters: cytokine network relevant genes, matrix metalloproteinases (MMPs) family, and transforming growth factor-beta (TGF-β) family, all of which have been previously implicated to play important roles in tooth development and carious lesions. Through our extensive data collection and an integrative application of gene prioritization and PPI network analyses, we built a dental caries-specific sub-network for the first time. Our study provided insights into the molecular mechanisms underlying dental caries. The framework we proposed in this work can be applied to other complex diseases. PMID:24146904
A Novel Yeast Genomics Method for Identifying New Breast Cancer Susceptibility Genes

DTIC Science & Technology

2007-05-01

find new candidate genes for breast cancer susceptibility in women and identifying these human genes can further improve monitoring and treatment...breast cancer susceptibility genes in humans that are currently unknown and not deducible from current methodologies. It is a fundamental...template to faithfully repair the broken strand. In human cancer it is loss of HR, rather than NHEJ, that is more important in increasing cancer
A candidate gene study in low HDL-cholesterol families provides evidence for the involvement of the APOA2 gene and the APOA1C3A4 gene cluster.

PubMed

Lilja, Heidi E; Soro, Aino; Ylitalo, Kati; Nuotio, Ilpo; Viikari, Jorma S A; Salomaa, Veikko; Vartiainen, Erkki; Taskinen, Marja-Riitta; Peltonen, Leena; Pajukanta, Päivi

2002-09-01

In patients with premature coronary heart disease, the most common lipoprotein abnormality is high-density lipoprotein (HDL) deficiency. To assess the genetic background of the low HDL-cholesterol trait, we performed a candidate gene study in 25 families with low HDL, collected from the genetically isolated population of Finland. We studied 21 genes encoding essential proteins involved in the HDL metabolism by genotyping intragenic and flanking markers for these genes. We found suggestive evidence for linkage in two candidate regions: Marker D1S2844, in the apolipoprotein A-II (APOA2) region, yielded a LOD score of 2.14 and marker D11S939 flanking the apolipoprotein A-I/C-III/A-IV gene cluster (APOA1C3A4) produced a LOD score of 1.69. Interestingly, we identified potential shared haplotypes in these two regions in a subset of low HDL families. These families also contributed to the obtained positive LOD scores, whereas the rest of the families produced negative LOD scores. None of the remaining candidate regions provided any evidence for linkage. Since only a limited number of loci were tested in this candidate gene study, these LOD scores suggest significant involvement of the APOA2 gene and the APOA1C3A4 gene cluster, or loci in their immediate vicinity, in the pathogenesis of low HDL.
High-throughput discovery of mutations in tef semi-dwarfing genes by next-generation sequencing analysis.

PubMed

Zhu, Qihui; Smith, Shavannor M; Ayele, Mulu; Yang, Lixing; Jogi, Ansuya; Chaluvadi, Srinivasa R; Bennetzen, Jeffrey L

2012-11-01

Tef (Eragrostis tef) is a major cereal crop in Ethiopia. Lodging is the primary constraint to increasing productivity in this allotetraploid species, accounting for losses of ∼15-45% in yield each year. As a first step toward identifying semi-dwarf varieties that might have improved lodging resistance, an ∼6× fosmid library was constructed and used to identify both homeologues of the dw3 semi-dwarfing gene of Sorghum bicolor. An EMS mutagenized population, consisting of ∼21,210 tef plants, was planted and leaf materials were collected into 23 superpools. Two dwarfing candidate genes, homeologues of dw3 of sorghum and rht1 of wheat, were sequenced directly from each superpool with 454 technology, and 120 candidate mutations were identified. Out of 10 candidates tested, six independent mutations were validated by Sanger sequencing, including two predicted detrimental mutations in both dw3 homeologues with a potential to improve lodging resistance in tef through further breeding. This study demonstrates that high-throughput sequencing can identify potentially valuable mutations in under-studied plant species like tef and has provided mutant lines that can now be combined and tested in breeding programs for improved lodging resistance.
Genomic expression analysis of rat chromosome 4 for skeletal traits at femoral neck.

PubMed

Alam, Imranul; Sun, Qiwei; Liu, Lixiang; Koller, Daniel L; Liu, Yunlong; Edenberg, Howard J; Econs, Michael J; Foroud, Tatiana; Turner, Charles H

2008-10-08

Hip fracture is the most devastating osteoporotic fracture type with significant morbidity and mortality. Several studies in humans and animal models identified chromosomal regions linked to hip size and bone mass. Previously, we identified that the region of 4q21-q41 on rat chromosome (Chr) 4 harbors multiple femoral neck quantitative trait loci (QTLs) in inbred Fischer 344 (F344) and Lewis (LEW) rats. The purpose of this study is to identify the candidate genes for femoral neck structure and density by correlating gene expression in the proximal femur with the femoral neck phenotypes linked to the QTLs on Chr 4. RNA was extracted from proximal femora of 4-wk-old rats from F344 and LEW strains, and two other strains, Copenhagen 2331 and Dark Agouti, were used as a negative control. Microarray analysis was performed using Affymetrix Rat Genome 230 2.0 arrays. A total of 99 genes in the 4q21-q41 region were differentially expressed (P < 0.05) among all strains of rats with a false discovery rate <10%. These 99 genes were then ranked based on the strength of correlation between femoral neck phenotypes measured in F2 animals, homozygous for a particular strain's allele at the Chr 4 QTL and the expression level of the gene in that strain. A total of 18 candidate genes were strongly correlated (r(2) > 0.50) with femoral neck width and prioritized for further analysis. Quantitative PCR analysis confirmed 14 of 18 of the candidate genes. Ingenuity pathway analysis revealed several direct or indirect relationships among the candidate genes related to angiogenesis (VEGF), bone growth (FGF2), bone formation (IGF2 and IGF2BP3), and resorption (TNF). This study provides a shortened list of genetic determinants of skeletal traits at the hip and may lead to novel approaches for prevention and treatment of hip fracture.
Genomic expression analysis of rat chromosome 4 for skeletal traits at femoral neck

PubMed Central

Alam, Imranul; Sun, Qiwei; Liu, Lixiang; Koller, Daniel L.; Liu, Yunlong; Edenberg, Howard J.; Econs, Michael J.; Foroud, Tatiana; Turner, Charles H.

2008-01-01

Hip fracture is the most devastating osteoporotic fracture type with significant morbidity and mortality. Several studies in humans and animal models identified chromosomal regions linked to hip size and bone mass. Previously, we identified that the region of 4q21-q41 on rat chromosome (Chr) 4 harbors multiple femoral neck quantitative trait loci (QTLs) in inbred Fischer 344 (F344) and Lewis (LEW) rats. The purpose of this study is to identify the candidate genes for femoral neck structure and density by correlating gene expression in the proximal femur with the femoral neck phenotypes linked to the QTLs on Chr 4. RNA was extracted from proximal femora of 4-wk-old rats from F344 and LEW strains, and two other strains, Copenhagen 2331 and Dark Agouti, were used as a negative control. Microarray analysis was performed using Affymetrix Rat Genome 230 2.0 arrays. A total of 99 genes in the 4q21-q41 region were differentially expressed (P < 0.05) among all strains of rats with a false discovery rate <10%. These 99 genes were then ranked based on the strength of correlation between femoral neck phenotypes measured in F2 animals, homozygous for a particular strain's allele at the Chr 4 QTL and the expression level of the gene in that strain. A total of 18 candidate genes were strongly correlated (r2 > 0.50) with femoral neck width and prioritized for further analysis. Quantitative PCR analysis confirmed 14 of 18 of the candidate genes. Ingenuity pathway analysis revealed several direct or indirect relationships among the candidate genes related to angiogenesis (VEGF), bone growth (FGF2), bone formation (IGF2 and IGF2BP3), and resorption (TNF). This study provides a shortened list of genetic determinants of skeletal traits at the hip and may lead to novel approaches for prevention and treatment of hip fracture. PMID:18728226
Physiological and molecular characterization of drought responses and identification of candidate tolerance genes in cassava

PubMed Central

Turyagyenda, Laban F.; Kizito, Elizabeth B.; Ferguson, Morag; Baguma, Yona; Agaba, Morris; Harvey, Jagger J. W.; Osiru, David S. O.

2013-01-01

Cassava is an important root crop to resource-poor farmers in marginal areas, where its production faces drought stress constraints. Given the difficulties associated with cassava breeding, a molecular understanding of drought tolerance in cassava will help in the identification of markers for use in marker-assisted selection and genes for transgenic improvement of drought tolerance. This study was carried out to identify candidate drought-tolerance genes and expression-based markers of drought stress in cassava. One drought-tolerant (improved variety) and one drought-susceptible (farmer-preferred) cassava landrace were grown in the glasshouse under well-watered and water-stressed conditions. Their morphological, physiological and molecular responses to drought were characterized. Morphological and physiological measurements indicate that the tolerance of the improved variety is based on drought avoidance, through reduction of water loss via partial stomatal closure. Ten genes that have previously been biologically validated as conferring or being associated with drought tolerance in other plant species were confirmed as being drought responsive in cassava. Four genes (MeALDH, MeZFP, MeMSD and MeRD28) were identified as candidate cassava drought-tolerance genes, as they were exclusively up-regulated in the drought-tolerant genotype to comparable levels known to confer drought tolerance in other species. Based on these genes, we hypothesize that the basis of the tolerance at the cellular level is probably through mitigation of the oxidative burst and osmotic adjustment. This study provides an initial characterization of the molecular response of cassava to drought stress resembling field conditions. The drought-responsive genes can now be used as expression-based markers of drought stress tolerance in cassava, and the candidate tolerance genes tested in the context of breeding (as possible quantitative trait loci) and engineering drought tolerance in transgenics. PMID:23519782
Comparative Transcriptional Profiling of the Axolotl Limb Identifies a Tripartite Regeneration-Specific Gene Program

PubMed Central

Knapp, Dunja; Schulz, Herbert; Rascon, Cynthia Alexander; Volkmer, Michael; Scholz, Juliane; Nacu, Eugen; Le, Mu; Novozhilov, Sergey; Tazaki, Akira; Protze, Stephanie; Jacob, Tina; Hubner, Norbert; Habermann, Bianca; Tanaka, Elly M.

2013-01-01

Understanding how the limb blastema is established after the initial wound healing response is an important aspect of regeneration research. Here we performed parallel expression profile time courses of healing lateral wounds versus amputated limbs in axolotl. This comparison between wound healing and regeneration allowed us to identify amputation-specific genes. By clustering the expression profiles of these samples, we could detect three distinguishable phases of gene expression – early wound healing followed by a transition-phase leading to establishment of the limb development program, which correspond to the three phases of limb regeneration that had been defined by morphological criteria. By focusing on the transition-phase, we identified 93 strictly amputation-associated genes many of which are implicated in oxidative-stress response, chromatin modification, epithelial development or limb development. We further classified the genes based on whether they were or were not significantly expressed in the developing limb bud. The specific localization of 53 selected candidates within the blastema was investigated by in situ hybridization. In summary, we identified a set of genes that are expressed specifically during regeneration and are therefore, likely candidates for the regulation of blastema formation. PMID:23658691
Cracking the genomic piggy bank: identifying secrets of the pig genome.

PubMed

Mote, B E; Rothschild, M F

2006-01-01

Though researchers are uncovering valuable information about the pig genome at unprecedented speed, the porcine genome community is barely scratching the surface as to understanding interactions of the biological code. The pig genetic linkage map has nearly 5,000 loci comprised of genes, microsatellites, and amplified fragment length polymorphism markers. Likewise, the physical map is becoming denser with nearly 6,000 markers. The long awaited sequencing efforts are providing multidimensional benefits with sequence available for comparative genomics and identifying single nucleotide polymorphisms for use in linkage and trait association studies. Scientists are using exotic and commercial breeds for quantitative trait loci scans. Additionally, candidate gene studies continue to identify chromosomal regions or genes associated with economically important traits such as growth rate, leanness, feed intake, meat quality, litter size, and disease resistance. The commercial pig industry is actively incorporating these markers in marker-assisted selection along with traditional performance information to improve said traits. Researchers are utilizing novel tools including pig microarrays along with advanced bioinformatics to identify new candidate genes, understand gene function, and piece together gene networks involved in important biological processes. Advances in pig genomics and implications to the pork industry as well as human health are reviewed.
No Association of BDNF, COMT, MAOA, SLC6A3, and SLC6A4 Genes and Depressive Symptoms in a Sample of Healthy Colombian Subjects.

PubMed

González-Giraldo, Yeimy; Camargo, Andrés; López-León, Sandra; Forero, Diego A

2015-01-01

Background. Major depressive disorder (MDD) is the second cause of years lived with disability around the world. A large number of studies have been carried out to identify genetic risk factors for MDD and related endophenotypes, mainly in populations of European and Asian descent, with conflicting results. The main aim of the current study was to analyze the possible association of five candidate genes and depressive symptoms in a Colombian sample of healthy subjects. Methods and Materials. The Spanish adaptation of the Hospital Anxiety and Depression Scale (HADS) was applied to one hundred eighty-eight healthy Colombian subjects. Five functional polymorphisms were genotyped using PCR-based assays: BDNF-Val66Met (rs6265), COMT-Val158Met (rs4680), SLC6A4-HTTLPR (rs4795541), MAOA-uVNTR, and SLC6A3-VNTR (rs28363170). Result. We did not find significant associations with scores of depressive symptoms, derived from the HADS, for any of the five candidate genes (nominal p values >0.05). In addition, we did not find evidence of significant gene-gene interactions. Conclusion. This work is one of the first studies of candidate genes for depressive symptoms in a Latin American sample. Study of additional genetic and epigenetic variants, taking into account other pathophysiological theories, will help to identify novel candidates for MDD in populations around the world.
Genome-Wide Prediction of the Polymorphic Ser Gene Family in Tetrahymena thermophila Based on Motif Analysis

PubMed Central

Ponsuwanna, Patrath; Kümpornsin, Krittikorn; Chookajorn, Thanat

2014-01-01

Even though antigenic variation is employed among parasitic protozoa for host immune evasion, Tetrahymena thermophila, a free-living ciliate, can also change its surface protein antigens. These cysteine-rich glycosylphosphatidylinositol (GPI)-linked surface proteins are encoded by a family of polymorphic Ser genes. Despite the availability of T. thermophila genome, a comprehensive analysis of the Ser family is limited by its high degree of polymorphism. In order to overcome this problem, a new approach was adopted by searching for Ser candidates with common motif sequences, namely length-specific repetitive cysteine pattern and GPI anchor site. The candidate genes were phylogenetically compared with the previously identified Ser genes and classified into subtypes. Ser candidates were often found to be located as tandem arrays of the same subtypes on several chromosomal scaffolds. Certain Ser candidates located in the same chromosomal arrays were transcriptionally expressed at specific T. thermophila developmental stages. These Ser candidates selected by the motif analysis approach can form the foundation for a systematic identification of the entire Ser gene family, which will contribute to the understanding of their function and the basis of T. thermophila antigenic variation. PMID:25133747
Lack of specific alleles for the bovine chemokine (C-X-C) receptor type 4 (CXCR4) gene in West African cattle questions its role as a candidate for trypanotolerance.

PubMed

Álvarez, Isabel; Pérez-Pardal, Lucía; Traoré, Amadou; Fernández, Iván; Goyache, Félix

2016-08-01

A panel of 81 Asian, African and European cattle (Bos taurus and B. indicus) was analysed for the whole sequence of the CXCR4 gene (3844bp), a strong candidate for cattle trypanotolerance. Thirty-one polymorphic sites identified gave 31 different haplotypes. Neutrality tests rejected the hypothesis of either positive or purifying selection. Bayesian phylogenetic tree showed differentiation of haplotypes into two clades gathering genetic variability predating domestication. Related with clades definition, linkage disequilibrium analyses suggested the existence of one only linkage block on the CXCR4 gene. Two tag SNPs identified on exon 2 captured 50% of variability. Whatever the analysis carried out, no clear separation between cattle groups was identified. Most haplotypes identified in West African taurine cattle were also found in European cattle and in Asian and West African zebu. West African taurine samples did not carry unique variants on the CXCR4 gene sequence. The current analysis failed in identifying a causal mutation on the CXCR4 gene underlying a previously reported QTL for cattle trypanotolerance on BTA2. Copyright © 2016 Elsevier B.V. All rights reserved.

Identification of KCNJ11 as a functional candidate gene for bovine meat tenderness.

PubMed

Tizioto, Polyana C; Gasparin, Gustavo; Souza, Marcela M; Mudadu, Mauricio A; Coutinho, Luiz L; Mourão, Gerson B; Tholon, Patricia; Meirelles, Sarah L C; Tullio, Rymer R; Rosa, Antônio N; Alencar, Maurício M; Medeiros, Sérgio R; Siqueira, Fabiane; Feijó, Gelson L D; Nassu, Renata T; Regitano, Luciana C A

2013-12-15

The potassium inwardly rectifying channel, subfamily J, member 11 (KCNJ11) gene was investigated as a candidate for meat tenderness based on the effects reported on muscle for KCNJ11 gene knockout in rat models and its position in a quantitative trait locus (QTL) for meat tenderness in the bovine genome. Sequence variations in the KCNJ11 gene were described by sequencing six amplified fragments, covering almost the entire gene. We identified single nucleotide polymorphisms (SNP) and validated them by different approaches, taking advantage of simultaneous projects that are being developed with the same Nelore population. By sequencing the KCNJ11 in Nelore steers representing extreme phenotypes for Warner-Bratzler shear force (WBSF), it was possible to identify 22 SNPs. We validated two of the identified markers by genotyping the whole population (n = 460). Analysis of association between genotypes and WBSF values revealed a significant additive effect of a SNP at different meat aging times (P ≤ 0.05). In addition, an association between the expression levels of KCNJ11 and WBSF was found, with lower expression levels of KCNJ11 associated with more tender meat (P ≤ 0.05). The results showed that the KCNJ11 gene is a candidate mapped to a QTL for meat tenderness previously identified on BTA15 and may be useful to identify animals with genetic potential to produce tender meat. The effect of KCNJ11 observed on muscle is potentially due to changes in activity of KATP channels, which in turn influence the flow of potassium in the intracellular space, allowing establishment of the membrane potential necessary for muscle contraction.
Selection of reference genes for expression studies with fish myogenic cell cultures.

PubMed

Bower, Neil I; Johnston, Ian A

2009-08-10

Relatively few studies have used cell culture systems to investigate gene expression and the regulation of myogenesis in fish. To produce robust data from quantitative real-time PCR mRNA levels need to be normalised using internal reference genes which have stable expression across all experimental samples. We have investigated the expression of eight candidate genes to identify suitable reference genes for use in primary myogenic cell cultures from Atlantic salmon (Salmo salar L.). The software analysis packages geNorm, Normfinder and Best keeper were used to rank genes according to their stability across 42 samples during the course of myogenic differentiation. Initial results showed several of the candidate genes exhibited stable expression throughout myogenic culture while Sdha was identified as the least stable gene. Further analysis with geNorm, Normfinder and Bestkeeper identified Ef1alpha, Hprt1, Ppia and RNApolII as stably expressed. Comparison of data normalised with the geometric average obtained from combinations of any three of these genes showed no significant differences, indicating that any combination of these genes is valid. The geometric average of any three of Hprt1, Ef1alpha, Ppia and RNApolII is suitable for normalisation of gene expression data in primary myogenic cultures from Atlantic salmon.
Comparative Transcriptome Analysis Identifies Putative Genes Involved in the Biosynthesis of Xanthanolides in Xanthium strumarium L.

PubMed

Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng

2016-01-01

Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides.
A comprehensive approach to identify reliable reference gene candidates to investigate the link between alcoholism and endocrinology in Sprague-Dawley rats.

PubMed

Taki, Faten A; Abdel-Rahman, Abdel A; Zhang, Baohong

2014-01-01

Gender and hormonal differences are often correlated with alcohol dependence and related complications like addiction and breast cancer. Estrogen (E2) is an important sex hormone because it serves as a key protein involved in organism level signaling pathways. Alcoholism has been reported to affect estrogen receptor signaling; however, identifying the players involved in such multi-faceted syndrome is complex and requires an interdisciplinary approach. In many situations, preliminary investigations included a straight forward, yet informative biotechniques such as gene expression analyses using quantitative real time PCR (qRT-PCR). The validity of qRT-PCR-based conclusions is affected by the choice of reliable internal controls. With this in mind, we compiled a list of 15 commonly used housekeeping genes (HKGs) as potential reference gene candidates in rat biological models. A comprehensive comparison among 5 statistical approaches (geNorm, dCt method, NormFinder, BestKeeper, and RefFinder) was performed to identify the minimal number as well the most stable reference genes required for reliable normalization in experimental rat groups that comprised sham operated (SO), ovariectomized rats in the absence (OVX) or presence of E2 (OVXE2). These rat groups were subdivided into subgroups that received alcohol in liquid diet or isocalroic control liquid diet for 12 weeks. Our results showed that U87, 5S rRNA, GAPDH, and U5a were the most reliable gene candidates for reference genes in heart and brain tissue. However, different gene stability ranking was specific for each tissue input combination. The present preliminary findings highlight the variability in reference gene rankings across different experimental conditions and analytic methods and constitute a fundamental step for gene expression assays.
Vitamin D receptor gene Alw I, Fok I, Apa I, and Taq I polymorphisms in patients with urinary stone.

PubMed

Seo, Ill Young; Kang, In-Hong; Chae, Soo-Cheon; Park, Seung Chol; Lee, Young-Jin; Yang, Yun Sik; Ryu, Soo Bang; Rim, Joung Sik

2010-04-01

To evaluate vitamin D receptor (VDR) gene polymorphisms in Korean patients so as to identify the candidate genes associated with urinary stones. Urinary stones are a multifactorial disease that includes various genetic factors. A normal control group of 535 healthy subjects and 278 patients with urinary stones was evaluated. Of 125 patients who presented stone samples, 102 had calcium stones on chemical analysis. The VDR gene Alw I, Fok I, Apa I, and Taq I polymorphisms were evaluated using the polymerase chain reaction-restriction fragment length polymorphism analysis. Allelic and genotypic frequencies were calculated to identify associations in both groups. The haplotype frequencies of the VDR gene polymorphisms for multiple loci were also determined. For the VDR gene Alw I, Fok I, Apa I, and Taq I polymorphisms, there was no statistically significant difference between the patients with urinary stones and the healthy controls. There was also no statistically significant difference between the patients with calcium stones and the healthy controls. A novel haplotype (Ht 4; CTTT) was identified in 13.5% of the patients with urinary stones and in 8.3% of the controls (P = .001). The haplotype frequencies were significantly different between the patients with calcium stones and the controls (P = .004). The VDR gene Alw I, Fok I, Apa I, and Taq I polymorphisms does not seem to be candidate genetic markers for urinary stones in Korean patients. However, 1 novel haplotype of the VDR gene polymorphisms for multiple loci might be a candidate genetic marker. Copyright 2010 Elsevier Inc. All rights reserved.
Genome-Wide Association Mapping Combined with Reverse Genetics Identifies New Effectors of Low Water Potential-Induced Proline Accumulation in Arabidopsis1[W][OPEN

PubMed Central

Verslues, Paul E.; Lasky, Jesse R.; Juenger, Thomas E.; Liu, Tzu-Wen; Kumar, M. Nagaraj

2014-01-01

Arabidopsis (Arabidopsis thaliana) exhibits natural genetic variation in drought response, including varying levels of proline (Pro) accumulation under low water potential. As Pro accumulation is potentially important for stress tolerance and cellular redox control, we conducted a genome-wide association (GWAS) study of low water potential-induced Pro accumulation using a panel of natural accessions and publicly available single-nucleotide polymorphism (SNP) data sets. Candidate genomic regions were prioritized for subsequent study using metrics considering both the strength and spatial clustering of the association signal. These analyses found many candidate regions likely containing gene(s) influencing Pro accumulation. Reverse genetic analysis of several candidates identified new Pro effector genes, including thioredoxins and several genes encoding Universal Stress Protein A domain proteins. These new Pro effector genes further link Pro accumulation to cellular redox and energy status. Additional new Pro effector genes found include the mitochondrial protease LON1, ribosomal protein RPL24A, protein phosphatase 2A subunit A3, a MADS box protein, and a nucleoside triphosphate hydrolase. Several of these new Pro effector genes were from regions with multiple SNPs, each having moderate association with Pro accumulation. This pattern supports the use of summary approaches that incorporate clusters of SNP associations in addition to consideration of individual SNP probability values. Further GWAS-guided reverse genetics promises to find additional effectors of Pro accumulation. The combination of GWAS and reverse genetics to efficiently identify new effector genes may be especially applicable for traits difficult to analyze by other genetic screening methods. PMID:24218491
Pool-based genome-wide association study identified novel candidate regions on BTA9 and 14 for oleic acid percentage in Japanese Black cattle.

PubMed

Kawaguchi, Fuki; Kigoshi, Hiroto; Nakajima, Ayaka; Matsumoto, Yuta; Uemoto, Yoshinobu; Fukushima, Moriyuki; Yoshida, Emi; Iwamoto, Eiji; Akiyama, Takayuki; Kohama, Namiko; Kobayashi, Eiji; Honda, Takeshi; Oyama, Kenji; Mannen, Hideyuki; Sasazaki, Shinji

2018-05-17

Fatty acid composition is an important indicator of beef quality. The objective of this study was to search the potential candidate region for fatty acid composition. We performed pool-based genome-wide association studies (GWAS) for oleic acid percentage (C18:1) in a Japanese Black cattle population from the Hyogo prefecture. GWAS analysis revealed two novel candidate regions on BTA9 and BTA14. The most significant single nucleotide polymorphisms (SNPs) in each region were genotyped in a population (n = 899) to verify their effect on C18:1. Statistical analysis revealed that both SNPs were significantly associated with C18:1 (p = .0080 and .0003), validating the quantitative trait loci (QTLs) detected in GWAS. We subsequently selected VNN1 and LYPLA1 genes as candidate genes from each region on BTA9 and BTA14, respectively. We sequenced full-length coding sequence (CDS) of these genes in eight individuals and identified a nonsynonymous SNP T66M on VNN1 gene as a putative candidate polymorphism. The polymorphism was also significantly associated with C18:1, but the p value (p = .0162) was higher than the most significant SNP on BTA9, suggesting that it would not be responsible for the QTL. Although further investigation will be needed to determine the responsible gene and polymorphism, our findings would contribute to development of selective markers for fatty acid composition in the Japanese Black cattle of Hyogo. © 2018 Japanese Society of Animal Science.
Candidate EDA targets revealed by expression profiling of primary keratinocytes from Tabby mutant mice

PubMed Central

Esibizione, Diana; Cui, Chang-Yi; Schlessinger, David

2009-01-01

EDA, the gene mutated in anhidrotic ectodermal dysplasia, encodes ectodysplasin, a TNF superfamily member that activates NF-kB mediated transcription. To identify EDA target genes, we have earlier used expression profiling to infer genes differentially expressed at various developmental time points in Tabby (Eda-deficient) compared to wild-type mouse skin. To increase the resolution to find genes whose expression may be restricted to epidermal cells, we have now extended studies to primary keratinocyte cultures established from E19 wild-type and Tabby skin. Using microarrays bearing 44,000 gene probes, we found 385 preliminary candidate genes whose expression was significantly affected by Eda loss. By comparing expression profiles to those from Eda-A1 transgenic skin, we restricted the list to 38 “candidate EDA targets”, 14 of which were already known to be expressed in hair follicles or epidermis. We confirmed expression changes for 3 selected genes, Tbx1, Bmp7, and Jag1, both in keratinocytes and in whole skin, by Q-PCR and Western blotting analyses. Thus, by the analysis of keratinocytes, novel candidate pathways downstream of EDA were detected. PMID:18848976
Replication of type 2 diabetes candidate genes variations in three geographically unrelated Indian population groups.

PubMed

Ali, Shafat; Chopra, Rupali; Manvati, Siddharth; Singh, Yoginder Pal; Kaul, Nabodita; Behura, Anita; Mahajan, Ankit; Sehajpal, Prabodh; Gupta, Subash; Dhar, Manoj K; Chainy, Gagan B N; Bhanwer, Amarjit S; Sharma, Swarkar; Bamezai, Rameshwar N K

2013-01-01

Type 2 diabetes (T2D) is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, p<5.5E-04) with T2D susceptibility in combined population. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E-08) in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial) levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR)<1.38 increased to OR = 2.44, (95%CI = 1.67-3.59) when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D.
Replication of Type 2 Diabetes Candidate Genes Variations in Three Geographically Unrelated Indian Population Groups

PubMed Central

Ali, Shafat; Chopra, Rupali; Manvati, Siddharth; Mahajan, Ankit; Sehajpal, Prabodh; Gupta, Subash; Dhar, Manoj K.; Chainy, Gagan B. N.; Bhanwer, Amarjit S.; Sharma, Swarkar; Bamezai, Rameshwar N. K.

2013-01-01

Type 2 diabetes (T2D) is a syndrome of multiple metabolic disorders and is genetically heterogeneous. India comprises one of the largest global populations with highest number of reported type 2 diabetes cases. However, limited information about T2D associated loci is available for Indian populations. It is, therefore, pertinent to evaluate the previously associated candidates as well as identify novel genetic variations in Indian populations to understand the extent of genetic heterogeneity. We chose to do a cost effective high-throughput mass-array genotyping and studied the candidate gene variations associated with T2D in literature. In this case-control candidate genes association study, 91 SNPs from 55 candidate genes have been analyzed in three geographically independent population groups from India. We report the genetic variants in five candidate genes: TCF7L2, HHEX, ENPP1, IDE and FTO, are significantly associated (after Bonferroni correction, p<5.5E−04) with T2D susceptibility in combined population. Interestingly, SNP rs7903146 of the TCF7L2 gene passed the genome wide significance threshold (combined P value = 2.05E−08) in the studied populations. We also observed the association of rs7903146 with blood glucose (fasting and postprandial) levels, supporting the role of TCF7L2 gene in blood glucose homeostasis. Further, we noted that the moderate risk provided by the independently associated loci in combined population with Odds Ratio (OR)<1.38 increased to OR = 2.44, (95%CI = 1.67–3.59) when the risk providing genotypes of TCF7L2, HHEX, ENPP1 and FTO genes were combined, suggesting the importance of gene-gene interactions evaluation in complex disorders like T2D. PMID:23527042
Outlier Analysis Defines Zinc Finger Gene Family DNA Methylation in Tumors and Saliva of Head and Neck Cancer Patients.

PubMed

Gaykalova, Daria A; Vatapalli, Rajita; Wei, Yingying; Tsai, Hua-Ling; Wang, Hao; Zhang, Chi; Hennessey, Patrick T; Guo, Theresa; Tan, Marietta; Li, Ryan; Ahn, Julie; Khan, Zubair; Westra, William H; Bishop, Justin A; Zaboli, David; Koch, Wayne M; Khan, Tanbir; Ochs, Michael F; Califano, Joseph A

2015-01-01

Head and Neck Squamous Cell Carcinoma (HNSCC) is the fifth most common cancer, annually affecting over half a million people worldwide. Presently, there are no accepted biomarkers for clinical detection and surveillance of HNSCC. In this work, a comprehensive genome-wide analysis of epigenetic alterations in primary HNSCC tumors was employed in conjunction with cancer-specific outlier statistics to define novel biomarker genes which are differentially methylated in HNSCC. The 37 identified biomarker candidates were top-scoring outlier genes with prominent differential methylation in tumors, but with no signal in normal tissues. These putative candidates were validated in independent HNSCC cohorts from our institution and TCGA (The Cancer Genome Atlas). Using the top candidates, ZNF14, ZNF160, and ZNF420, an assay was developed for detection of HNSCC cancer in primary tissue and saliva samples with 100% specificity when compared to normal control samples. Given the high detection specificity, the analysis of ZNF DNA methylation in combination with other DNA methylation biomarkers may be useful in the clinical setting for HNSCC detection and surveillance, particularly in high-risk patients. Several additional candidates identified through this work can be further investigated toward future development of a multi-gene panel of biomarkers for the surveillance and detection of HNSCC.
Identifying New Candidate Genes and Chemicals Related to Prostate Cancer Using a Hybrid Network and Shortest Path Approach

PubMed Central

Wang, Meng; Wu, Kai; Lu, Changhong; Kong, Xiangyin

2015-01-01

Prostate cancer is a type of cancer that occurs in the male prostate, a gland in the male reproductive system. Because prostate cancer cells may spread to other parts of the body and can influence human reproduction, understanding the mechanisms underlying this disease is critical for designing effective treatments. The identification of as many genes and chemicals related to prostate cancer as possible will enhance our understanding of this disease. In this study, we proposed a computational method to identify new candidate genes and chemicals based on currently known genes and chemicals related to prostate cancer by applying a shortest path approach in a hybrid network. The hybrid network was constructed according to information concerning chemical-chemical interactions, chemical-protein interactions, and protein-protein interactions. Many of the obtained genes and chemicals are associated with prostate cancer. PMID:26504486
A new gene in A. rubens: A sea star Ig kappa gene.

PubMed

Vincent, Nadine; Osteras, Magne; Otten, Patricia; Leclerc, Michel

2014-12-01

The sea star Asterias rubens reacts specifically to the antigen:HRP (horse-radish peroxydase) and produces an antibody anti-HRP. We previously identified a candidate Ig kappa gene corresponding to this manuscript. We show now the gene referred to as: "sea star Ig kappa gene in its specificity".
A computational approach to candidate gene prioritization for X-linked mental retardation using annotation-based binary filtering and motif-based linear discriminatory analysis

PubMed Central

2011-01-01

Background Several computational candidate gene selection and prioritization methods have recently been developed. These in silico selection and prioritization techniques are usually based on two central approaches - the examination of similarities to known disease genes and/or the evaluation of functional annotation of genes. Each of these approaches has its own caveats. Here we employ a previously described method of candidate gene prioritization based mainly on gene annotation, in accompaniment with a technique based on the evaluation of pertinent sequence motifs or signatures, in an attempt to refine the gene prioritization approach. We apply this approach to X-linked mental retardation (XLMR), a group of heterogeneous disorders for which some of the underlying genetics is known. Results The gene annotation-based binary filtering method yielded a ranked list of putative XLMR candidate genes with good plausibility of being associated with the development of mental retardation. In parallel, a motif finding approach based on linear discriminatory analysis (LDA) was employed to identify short sequence patterns that may discriminate XLMR from non-XLMR genes. High rates (>80%) of correct classification was achieved, suggesting that the identification of these motifs effectively captures genomic signals associated with XLMR vs. non-XLMR genes. The computational tools developed for the motif-based LDA is integrated into the freely available genomic analysis portal Galaxy (http://main.g2.bx.psu.edu/). Nine genes (APLN, ZC4H2, MAGED4, MAGED4B, RAP2C, FAM156A, FAM156B, TBL1X, and UXT) were highlighted as highly-ranked XLMR methods. Conclusions The combination of gene annotation information and sequence motif-orientated computational candidate gene prediction methods highlight an added benefit in generating a list of plausible candidate genes, as has been demonstrated for XLMR. Reviewers: This article was reviewed by Dr Barbara Bardoni (nominated by Prof Juergen Brosius); Prof Neil Smalheiser and Dr Dustin Holloway (nominated by Prof Charles DeLisi). PMID:21668950
IFRD1 Is a Candidate Gene for SMNA on Chromosome 7q22-q23

PubMed Central

Brkanac, Zoran; Spencer, David; Shendure, Jay; Robertson, Peggy D.; Matsushita, Mark; Vu, Tiffany; Bird, Thomas D.; Olson, Maynard V.; Raskind, Wendy H.

2009-01-01

We have established strong linkage evidence that supports mapping autosomal-dominant sensory/motor neuropathy with ataxia (SMNA) to chromosome 7q22-q32. SMNA is a rare neurological disorder whose phenotype encompasses both the central and the peripheral nervous system. In order to identify a gene responsible for SMNA, we have undertaken a comprehensive genomic evaluation of the region of linkage, including evaluation for repeat expansion and small deletions or duplications, capillary sequencing of candidate genes, and massively parallel sequencing of all coding exons. We excluded repeat expansion and small deletions or duplications as causative, and through microarray-based hybrid capture and massively parallel short-read sequencing, we identified a nonsynonymous variant in the human interferon-related developmental regulator gene 1 (IFRD1) as a disease-causing candidate. Sequence conservation, animal models, and protein structure evaluation support the involvement of IFRD1 in SMNA. Mutation analysis of IFRD1 in additional patients with similar phenotypes is needed for demonstration of causality and further evaluation of its importance in neurological diseases. PMID:19409521
Cis-eQTL analysis and functional validation of candidate susceptibility genes for high-grade serous ovarian cancer.

PubMed

Lawrenson, Kate; Li, Qiyuan; Kar, Siddhartha; Seo, Ji-Heui; Tyrer, Jonathan; Spindler, Tassja J; Lee, Janet; Chen, Yibu; Karst, Alison; Drapkin, Ronny; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V; Bean, Yukie; Beckmann, Matthias W; Berchuck, Andrew; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Anne; Chen, Zhihua; Cook, Linda S; Cramer, Daniel W; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T; Edwards, Robert P; Eilber, Ursula; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S; Jakubowska, Anna; James, Paul; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kruger Kjaer, Susanne; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph L; Kiemeney, Lambertus A; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F A G; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Nevanlinna, Heli; McNeish, Ian; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste L; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schildkraut, Joellen M; Schwaab, Ira; Sellers, Thomas A; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston, Lara; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S; van Altena, Anne M; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Monteiro, Alvaro; Pharoah, Paul D; Gayther, Simon A; Freedman, Matthew L

2015-09-22

Genome-wide association studies have reported 11 regions conferring risk of high-grade serous epithelial ovarian cancer (HGSOC). Expression quantitative trait locus (eQTL) analyses can identify candidate susceptibility genes at risk loci. Here we evaluate cis-eQTL associations at 47 regions associated with HGSOC risk (P≤10(-5)). For three cis-eQTL associations (P<1.4 × 10(-3), FDR<0.05) at 1p36 (CDC42), 1p34 (CDCA8) and 2q31 (HOXD9), we evaluate the functional role of each candidate by perturbing expression of each gene in HGSOC precursor cells. Overexpression of HOXD9 increases anchorage-independent growth, shortens population-doubling time and reduces contact inhibition. Chromosome conformation capture identifies an interaction between rs2857532 and the HOXD9 promoter, suggesting this SNP is a leading causal variant. Transcriptomic profiling after HOXD9 overexpression reveals enrichment of HGSOC risk variants within HOXD9 target genes (P=6 × 10(-10) for risk variants (P<10(-4)) within 10 kb of a HOXD9 target gene in ovarian cells), suggesting a broader role for this network in genetic susceptibility to HGSOC.
Cis-eQTL analysis and functional validation of candidate susceptibility genes for high-grade serous ovarian cancer

PubMed Central

Lawrenson, Kate; Li, Qiyuan; Kar, Siddhartha; Seo, Ji-Heui; Tyrer, Jonathan; Spindler, Tassja J.; Lee, Janet; Chen, Yibu; Karst, Alison; Drapkin, Ronny; Aben, Katja K. H.; Anton-Culver, Hoda; Antonenkova, Natalia; Bowtell, David; Webb, Penelope M.; deFazio, Anna; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Berchuck, Andrew; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chenevix-Trench, Georgia; Chen, Anne; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A. T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; James, Paul; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y.; Kruger Kjaer, Susanne; Kelemen, Linda E.; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F. A. G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; Nevanlinna, Heli; McNeish, Ian; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste L.; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Phelan, Catherine M.; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Sellers, Thomas A.; Shu, Xiao-Ou; Shvetsov, Yurii B.; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J.; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Monteiro, Alvaro; Pharoah, Paul D.; Gayther, Simon A.; Freedman, Matthew L.

2015-01-01

Genome-wide association studies have reported 11 regions conferring risk of high-grade serous epithelial ovarian cancer (HGSOC). Expression quantitative trait locus (eQTL) analyses can identify candidate susceptibility genes at risk loci. Here we evaluate cis-eQTL associations at 47 regions associated with HGSOC risk (P≤10−5). For three cis-eQTL associations (P<1.4 × 10−3, FDR<0.05) at 1p36 (CDC42), 1p34 (CDCA8) and 2q31 (HOXD9), we evaluate the functional role of each candidate by perturbing expression of each gene in HGSOC precursor cells. Overexpression of HOXD9 increases anchorage-independent growth, shortens population-doubling time and reduces contact inhibition. Chromosome conformation capture identifies an interaction between rs2857532 and the HOXD9 promoter, suggesting this SNP is a leading causal variant. Transcriptomic profiling after HOXD9 overexpression reveals enrichment of HGSOC risk variants within HOXD9 target genes (P=6 × 10−10 for risk variants (P<10−4) within 10 kb of a HOXD9 target gene in ovarian cells), suggesting a broader role for this network in genetic susceptibility to HGSOC. PMID:26391404
Mapping by sequencing in cotton (Gossypium hirsutum) line MD52ne identified candidate genes for fiber strength and its related quality attributes.

PubMed

Islam, Md S; Zeng, Linghe; Thyssen, Gregory N; Delhom, Christopher D; Kim, Hee Jin; Li, Ping; Fang, David D

2016-06-01

Three QTL regions controlling three fiber quality traits were validated and further fine-mapped with 27 new single nucleotide polymorphism (SNP) markers. Transcriptome analysis suggests that receptor-like kinases found within the validated QTLs are potential candidate genes responsible for superior fiber strength in cotton line MD52ne. Fiber strength, length, maturity and fineness determine the market value of cotton fibers and the quality of spun yarn. Cotton fiber strength has been recognized as a critical quality attribute in the modern textile industry. Fine mapping along with quantitative trait loci (QTL) validation and candidate gene prediction can uncover the genetic and molecular basis of fiber quality traits. Four previously-identified QTLs (qFBS-c3, qSFI-c14, qUHML-c14 and qUHML-c24) related to fiber bundle strength, short fiber index and fiber length, respectively, were validated using an F3 population that originated from a cross of MD90ne × MD52ne. A group of 27 new SNP markers generated from mapping-by-sequencing (MBS) were placed in QTL regions to improve and validate earlier maps. Our refined QTL regions spanned 4.4, 1.8 and 3.7 Mb of physical distance in the Gossypium raimondii reference genome. We performed RNA sequencing (RNA-seq) of 15 and 20 days post-anthesis fiber cells from MD52ne and MD90ne and aligned reads to the G. raimondii genome. The QTL regions contained 21 significantly differentially expressed genes (DEGs) between the two near-isogenic parental lines. SNPs that result in non-synonymous substitutions to amino acid sequences of annotated genes were identified within these DEGs, and mapped. Taken together, transcriptome and amino acid mutation analysis indicate that receptor-like kinase pathway genes are likely candidates for superior fiber strength and length in MD52ne. MBS along with RNA-seq demonstrated a powerful strategy to elucidate candidate genes for the QTLs that control complex traits in a complex genome like tetraploid upland cotton.
The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.

PubMed

Motamayor, Juan C; Mockaitis, Keithanne; Schmutz, Jeremy; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar; Findley, Seth D; Zheng, Ping; Utro, Filippo; Royaert, Stefan; Saski, Christopher; Jenkins, Jerry; Podicheti, Ram; Zhao, Meixia; Scheffler, Brian E; Stack, Joseph C; Feltus, Frank A; Mustiga, Guiliana M; Amores, Freddy; Phillips, Wilbert; Marelli, Jean Philippe; May, Gregory D; Shapiro, Howard; Ma, Jianxin; Bustamante, Carlos D; Schnell, Raymond J; Main, Dorrie; Gilbert, Don; Parida, Laxmi; Kuhn, David N

2013-06-03

Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits.
The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color

PubMed Central

2013-01-01

Background Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. Results We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. Conclusions We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits. PMID:23731509

Positive selection on human gamete-recognition genes

PubMed Central

Stover, Daryn A.; Guerra, Vanessa; Mozaffari, Sahar V.; Ober, Carole; Mugal, Carina F.; Kaj, Ingemar

2018-01-01

Coevolution of genes that encode interacting proteins expressed on the surfaces of sperm and eggs can lead to variation in reproductive compatibility between mates and reproductive isolation between members of different species. Previous studies in mice and other mammals have focused in particular on evidence for positive or diversifying selection that shapes the evolution of genes that encode sperm-binding proteins expressed in the egg coat or zona pellucida (ZP). By fitting phylogenetic models of codon evolution to data from the 1000 Genomes Project, we identified candidate sites evolving under diversifying selection in the human genes ZP3 and ZP2. We also identified one candidate site under positive selection in C4BPA, which encodes a repetitive protein similar to the mouse protein ZP3R that is expressed in the sperm head and binds to the ZP at fertilization. Results from several additional analyses that applied population genetic models to the same data were consistent with the hypothesis of selection on those candidate sites leading to coevolution of sperm- and egg-expressed genes. By contrast, we found no candidate sites under selection in a fourth gene (ZP1) that encodes an egg coat structural protein not directly involved in sperm binding. Finally, we found that two of the candidate sites (in C4BPA and ZP2) were correlated with variation in family size and birth rate among Hutterite couples, and those two candidate sites were also in linkage disequilibrium in the same Hutterite study population. All of these lines of evidence are consistent with predictions from a previously proposed hypothesis of balancing selection on epistatic interactions between C4BPA and ZP3 at fertilization that lead to the evolution of co-adapted allele pairs. Such patterns also suggest specific molecular traits that may be associated with both natural reproductive variation and clinical infertility. PMID:29340252
Utilization of gene mapping and candidate gene mutation screening for diagnosing clinically equivocal conditions: a Norrie disease case study.

PubMed

Chini, Vasiliki; Stambouli, Danai; Nedelea, Florina Mihaela; Filipescu, George Alexandru; Mina, Diana; Kambouris, Marios; El-Shantil, Hatem

2014-06-01

Prenatal diagnosis was requested for an undiagnosed eye disease showing X-linked inheritance in a family. No medical records existed for the affected family members. Mapping of the X chromosome and candidate gene mutation screening identified a c.C267A[p.F89L] mutation in NPD previously described as possibly causing Norrie disease. The detection of the c.C267A[p.F89L] variant in another unrelated family confirms the pathogenic nature of the mutation for the Norrie disease phenotype. Gene mapping, haplotype analysis, and candidate gene screening have been previously utilized in research applications but were applied here in a diagnostic setting due to the scarcity of available clinical information. The clinical diagnosis and mutation identification were critical for providing proper genetic counseling and prenatal diagnosis for this family.
Identification of Immunity Related Genes to Study the Physalis peruviana – Fusarium oxysporum Pathosystem

PubMed Central

Enciso-Rodríguez, Felix E.; González, Carolina; Rodríguez, Edwin A.; López, Camilo E.; Landsman, David; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2013-01-01

The Cape gooseberry ( Physalis peruviana L) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercosporaphysalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins that encode conserved domains related to plant immunity including: NBS (Nucleotide Binding Site), CC (Coiled-Coil), TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity related gene candidates in P . peruviana which have the typical resistance gene (R-gene) architecture, 17 Receptor like kinase (RLKs) candidates related to PAMP-Triggered Immunity (PTI), eight (TIR-NBS-LRR, or TNL) and nine (CC–NBS-LRR, or CNL) candidates related to Effector-Triggered Immunity (ETI) genes among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primers pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions on six genotypes that included resistant and susceptible materials. From these, we selected 17 single band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker revealing a nonsynonymous mutation in the predicted LRR domain suggesting functional roles for resistance. PMID:23844210
Identification of immunity related genes to study the Physalis peruviana--Fusarium oxysporum pathosystem.

PubMed

Enciso-Rodríguez, Felix E; González, Carolina; Rodríguez, Edwin A; López, Camilo E; Landsman, David; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2013-01-01

The Cape gooseberry (Physalisperuviana L) is an Andean exotic fruit with high nutritional value and appealing medicinal properties. However, its cultivation faces important phytosanitary problems mainly due to pathogens like Fusarium oxysporum, Cercosporaphysalidis and Alternaria spp. Here we used the Cape gooseberry foliar transcriptome to search for proteins that encode conserved domains related to plant immunity including: NBS (Nucleotide Binding Site), CC (Coiled-Coil), TIR (Toll/Interleukin-1 Receptor). We identified 74 immunity related gene candidates in P. peruviana which have the typical resistance gene (R-gene) architecture, 17 Receptor like kinase (RLKs) candidates related to PAMP-Triggered Immunity (PTI), eight (TIR-NBS-LRR, or TNL) and nine (CC-NBS-LRR, or CNL) candidates related to Effector-Triggered Immunity (ETI) genes among others. These candidate genes were categorized by molecular function (98%), biological process (85%) and cellular component (79%) using gene ontology. Some of the most interesting predicted roles were those associated with binding and transferase activity. We designed 94 primers pairs from the 74 immunity-related genes (IRGs) to amplify the corresponding genomic regions on six genotypes that included resistant and susceptible materials. From these, we selected 17 single band amplicons and sequenced them in 14 F. oxysporum resistant and susceptible genotypes. Sequence polymorphisms were analyzed through preliminary candidate gene association, which allowed the detection of one SNP at the PpIRG-63 marker revealing a nonsynonymous mutation in the predicted LRR domain suggesting functional roles for resistance.
Molecular evolution of candidate male reproductive genes in the brown algal model Ectocarpus.

PubMed

Lipinska, Agnieszka P; Van Damme, Els J M; De Clerck, Olivier

2016-01-05

Evolutionary studies of genes that mediate recognition between sperm and egg contribute to our understanding of reproductive isolation and speciation. Surface receptors involved in fertilization are targets of sexual selection, reinforcement, and other evolutionary forces including positive selection. This observation was made across different lineages of the eukaryotic tree from land plants to mammals, and is particularly evident in free-spawning animals. Here we use the brown algal model species Ectocarpus (Phaeophyceae) to investigate the evolution of candidate gamete recognition proteins in a distant major phylogenetic group of eukaryotes. Male gamete specific genes were identified by comparing transcriptome data covering different stages of the Ectocarpus life cycle and screened for characteristics expected from gamete recognition receptors. Selected genes were sequenced in a representative number of strains from distant geographical locations and varying stages of reproductive isolation, to search for signatures of adaptive evolution. One of the genes (Esi0130_0068) showed evidence of selective pressure. Interestingly, that gene displayed domain similarities to the receptor for egg jelly (REJ) protein involved in sperm-egg recognition in sea urchins. We have identified a male gamete specific gene with similarity to known gamete recognition receptors and signatures of adaptation. Altogether, this gene could contribute to gamete interaction during reproduction as well as reproductive isolation in Ectocarpus and is therefore a good candidate for further functional evaluation.
Influence of SNPs in nutrient-sensitive candidate genes and gene-diet interactions on blood lipids: the DiOGenes study.

PubMed

Brahe, Lena K; Ängquist, Lars; Larsen, Lesli H; Vimaleswaran, Karani S; Hager, Jörg; Viguerie, Nathalie; Loos, Ruth J F; Handjieva-Darlenska, Teodora; Jebb, Susan A; Hlavaty, Petr; Larsen, Thomas M; Martinez, J Alfredo; Papadaki, Angeliki; Pfeiffer, Andreas F H; van Baak, Marleen A; Sørensen, Thorkild I A; Holst, Claus; Langin, Dominique; Astrup, Arne; Saris, Wim H M

2013-09-14

Blood lipid response to a given dietary intervention could be determined by the effect of diet, gene variants or gene-diet interactions. The objective of the present study was to investigate whether variants in presumed nutrient-sensitive genes involved in lipid metabolism modified lipid profile after weight loss and in response to a given diet, among overweight European adults participating in the Diet Obesity and Genes study. By multiple linear regressions, 240 SNPs in twenty-four candidate genes were investigated for SNP main and SNP-diet interaction effects on total cholesterol, LDL-cholesterol, HDL-cholesterol and TAG after an 8-week low-energy diet (only main effect) ,and a 6-month ad libitum weight maintenance diet, with different contents of dietary protein or glycaemic index. After adjusting for multiple testing, a SNP-dietary protein interaction effect on TAG was identified for lipin 1 (LPIN1) rs4315495, with a decrease in TAG of 20.26 mmol/l per A-allele/protein unit (95% CI 20.38, 20.14, P=0.000043). In conclusion, we investigated SNP-diet interactions for blood lipid profiles for 240 SNPs in twenty-four candidate genes, selected for their involvement in lipid metabolism pathways, and identified one significant interaction between LPIN1 rs4315495 and dietary protein for TAG concentration.
An Expressed Sequence Tag collection from the male antennae of the Noctuid moth Spodoptera littoralis: a resource for olfactory and pheromone detection research

PubMed Central

2011-01-01

Background Nocturnal insects such as moths are ideal models to study the molecular bases of olfaction that they use, among examples, for the detection of mating partners and host plants. Knowing how an odour generates a neuronal signal in insect antennae is crucial for understanding the physiological bases of olfaction, and also could lead to the identification of original targets for the development of olfactory-based control strategies against herbivorous moth pests. Here, we describe an Expressed Sequence Tag (EST) project to characterize the antennal transcriptome of the noctuid pest model, Spodoptera littoralis, and to identify candidate genes involved in odour/pheromone detection. Results By targeting cDNAs from male antennae, we biased gene discovery towards genes potentially involved in male olfaction, including pheromone reception. A total of 20760 ESTs were obtained from a normalized library and were assembled in 9033 unigenes. 6530 were annotated based on BLAST analyses and gene prediction software identified 6738 ORFs. The unigenes were compared to the Bombyx mori proteome and to ESTs derived from Lepidoptera transcriptome projects. We identified a large number of candidate genes involved in odour and pheromone detection and turnover, including 31 candidate chemosensory receptor genes, but also genes potentially involved in olfactory modulation. Conclusions Our project has generated a large collection of antennal transcripts from a Lepidoptera. The normalization process, allowing enrichment in low abundant genes, proved to be particularly relevant to identify chemosensory receptors in a species for which no genomic data are available. Our results also suggest that olfactory modulation can take place at the level of the antennae itself. These EST resources will be invaluable for exploring the mechanisms of olfaction and pheromone detection in S. littoralis, and for ultimately identifying original targets to fight against moth herbivorous pests. PMID:21276261
Loci and candidate genes conferring resistance to soybean cyst nematode HG type 2.5.7.

PubMed

Zhao, Xue; Teng, Weili; Li, Yinghui; Liu, Dongyuan; Cao, Guanglu; Li, Dongmei; Qiu, Lijuan; Zheng, Hongkun; Han, Yingpeng; Li, Wenbin

2017-06-14

Soybean (Glycine max L. Merr.) cyst nematode (SCN, Heterodera glycines I,) is a major pest of soybean worldwide. The most effective strategy to control this pest involves the use of resistant cultivars. The aim of the present study was to investigate the genome-wide genetic architecture of resistance to SCN HG Type 2.5.7 (race 1) in landrace and elite cultivated soybeans. A total of 200 diverse soybean accessions were screened for resistance to SCN HG Type 2.5.7 and genotyped through sequencing using the Specific Locus Amplified Fragment Sequencing (SLAF-seq) approach with a 6.14-fold average sequencing depth. A total of 33,194 SNPs were identified with minor allele frequencies (MAF) over 4%, covering 97% of all the genotypes. Genome-wide association mapping (GWAS) revealed thirteen SNPs associated with resistance to SCN HG Type 2.5.7. These SNPs were distributed on five chromosomes (Chr), including Chr7, 8, 14, 15 and 18. Four SNPs were novel resistance loci and nine SNPs were located near known QTL. A total of 30 genes were identified as candidate genes underlying SCN resistance. A total of sixteen novel soybean accessions were identified with significant resistance to HG Type 2.5.7. The beneficial alleles and candidate genes identified by GWAS might be valuable for improving marker-assisted breeding efficiency and exploring the molecular mechanisms underlying SCN resistance.
confFuse: High-Confidence Fusion Gene Detection across Tumor Entities.

PubMed

Huang, Zhiqin; Jones, David T W; Wu, Yonghe; Lichter, Peter; Zapatka, Marc

2017-01-01

Background: Fusion genes play an important role in the tumorigenesis of many cancers. Next-generation sequencing (NGS) technologies have been successfully applied in fusion gene detection for the last several years, and a number of NGS-based tools have been developed for identifying fusion genes during this period. Most fusion gene detection tools based on RNA-seq data report a large number of candidates (mostly false positives), making it hard to prioritize candidates for experimental validation and further analysis. Selection of reliable fusion genes for downstream analysis becomes very important in cancer research. We therefore developed confFuse, a scoring algorithm to reliably select high-confidence fusion genes which are likely to be biologically relevant. Results: confFuse takes multiple parameters into account in order to assign each fusion candidate a confidence score, of which score ≥8 indicates high-confidence fusion gene predictions. These parameters were manually curated based on our experience and on certain structural motifs of fusion genes. Compared with alternative tools, based on 96 published RNA-seq samples from different tumor entities, our method can significantly reduce the number of fusion candidates (301 high-confidence from 8,083 total predicted fusion genes) and keep high detection accuracy (recovery rate 85.7%). Validation of 18 novel, high-confidence fusions detected in three breast tumor samples resulted in a 100% validation rate. Conclusions: confFuse is a novel downstream filtering method that allows selection of highly reliable fusion gene candidates for further downstream analysis and experimental validations. confFuse is available at https://github.com/Zhiqin-HUANG/confFuse.
Identification of cancer genes that are independent of dominant proliferation and lineage programs

PubMed Central

Selfors, Laura M.; Stover, Daniel G.; Harris, Isaac S.; Brugge, Joan S.; Coloff, Jonathan L.

2017-01-01

Large, multidimensional cancer datasets provide a resource that can be mined to identify candidate therapeutic targets for specific subgroups of tumors. Here, we analyzed human breast cancer data to identify transcriptional programs associated with tumors bearing specific genetic driver alterations. Using an unbiased approach, we identified thousands of genes whose expression was enriched in tumors with specific genetic alterations. However, expression of the vast majority of these genes was not enriched if associations were analyzed within individual breast tumor molecular subtypes, across multiple tumor types, or after gene expression was normalized to account for differences in proliferation or tumor lineage. Together with linear modeling results, these findings suggest that most transcriptional programs associated with specific genetic alterations in oncogenes and tumor suppressors are highly context-dependent and are predominantly linked to differences in proliferation programs between distinct breast cancer subtypes. We demonstrate that such proliferation-dependent gene expression dominates tumor transcriptional programs relative to matched normal tissues. However, we also identified a relatively small group of cancer-associated genes that are both proliferation- and lineage-independent. A subset of these genes are attractive candidate targets for combination therapy because they are essential in breast cancer cell lines, druggable, enriched in stem-like breast cancer cells, and resistant to chemotherapy-induced down-regulation. PMID:29229826
ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

PubMed

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes

PubMed Central

Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

2015-01-01

Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614
Discovering genetic variants in Crohn's disease by exploring genomic regions enriched of weak association signals.

PubMed

D'Addabbo, Annarita; Palmieri, Orazio; Maglietta, Rosalia; Latiano, Anna; Mukherjee, Sayan; Annese, Vito; Ancona, Nicola

2011-08-01

A meta-analysis has re-analysed previous genome-wide association scanning definitively confirming eleven genes and further identifying 21 new loci. However, the identified genes/loci still explain only the minority of genetic predisposition of Crohn's disease. To identify genes weakly involved in disease predisposition by analysing chromosomal regions enriched of single nucleotide polymorphisms with modest statistical association. We utilized the WTCCC data set evaluating 1748 CD and 2938 controls. The identification of candidate genes/loci was performed by a two-step procedure: first of all chromosomal regions enriched of weak association signals were localized; subsequently, weak signals clustered in gene regions were identified. The statistical significance was assessed by non parametric permutation tests. The cytoband enrichment analysis highlighted 44 regions (P≤0.05) enriched with single nucleotide polymorphisms significantly associated with the trait including 23 out of 31 previously confirmed and replicated genes. Importantly, we highlight further 20 novel chromosomal regions carrying approximately one hundred genes/loci with modest association. Amongst these we find compelling functional candidate genes such as MAPT, GRB2 and CREM, LCT, and IL12RB2. Our study suggests a different statistical perspective to discover genes weakly associated with a given trait, although further confirmatory functional studies are needed. Copyright © 2011 Editrice Gastroenterologica Italiana S.r.l. All rights reserved.
Identifying candidate driver genes by integrative ovarian cancer genomics data

NASA Astrophysics Data System (ADS)

Lu, Xinguo; Lu, Jibo

2017-08-01

Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.
Genomic analysis of Meckel–Gruber syndrome in Arabs reveals marked genetic heterogeneity and novel candidate genes

PubMed Central

Shaheen, Ranad; Faqeih, Eissa; Alshammari, Muneera J; Swaid, Abdulrahman; Al-Gazali, Lihadh; Mardawi, Elham; Ansari, Shinu; Sogaty, Sameera; Seidahmed, Mohammed Z; AlMotairi, Muhammed I; Farra, Chantal; Kurdi, Wesam; Al-Rasheed, Shatha; Alkuraya, Fowzan S

2013-01-01

Meckel–Gruber syndrome (MKS, OMIM #249000) is a multiple congenital malformation syndrome that represents the severe end of the ciliopathy phenotypic spectrum. Despite the relatively common occurrence of this syndrome among Arabs, little is known about its genetic architecture in this population. This is a series of 18 Arab families with MKS, who were evaluated clinically and studied using autozygome-guided mutation analysis and exome sequencing. We show that autozygome-guided candidate gene analysis identified the underlying mutation in the majority (n=12, 71%). Exome sequencing revealed a likely pathogenic mutation in three novel candidate MKS disease genes. These include C5orf42, Ellis–van-Creveld disease gene EVC2 and SEC8 (also known as EXOC4), which encodes an exocyst protein with an established role in ciliogenesis. This is the largest and most comprehensive genomic study on MKS in Arabs and the results, in addition to revealing genetic and allelic heterogeneity, suggest that previously reported disease genes and the novel candidates uncovered by this study account for the overwhelming majority of MKS patients in our population. PMID:23169490
Mapping a candidate gene (MdMYB10) for red flesh and foliage colour in apple

PubMed Central

Chagné, David; Carlisle, Charmaine M; Blond, Céline; Volz, Richard K; Whitworth, Claire J; Oraguzie, Nnadozie C; Crowhurst, Ross N; Allan, Andrew C; Espley, Richard V; Hellens, Roger P; Gardiner, Susan E

2007-01-01

Background Integrating plant genomics and classical breeding is a challenge for both plant breeders and molecular biologists. Marker-assisted selection (MAS) is a tool that can be used to accelerate the development of novel apple varieties such as cultivars that have fruit with anthocyanin through to the core. In addition, determining the inheritance of novel alleles, such as the one responsible for red flesh, adds to our understanding of allelic variation. Our goal was to map candidate anthocyanin biosynthetic and regulatory genes in a population segregating for the red flesh phenotypes. Results We have identified the Rni locus, a major genetic determinant of the red foliage and red colour in the core of apple fruit. In a population segregating for the red flesh and foliage phenotype we have determined the inheritance of the Rni locus and DNA polymorphisms of candidate anthocyanin biosynthetic and regulatory genes. Simple Sequence Repeats (SSRs) and Single Nucleotide Polymorphisms (SNPs) in the candidate genes were also located on an apple genetic map. We have shown that the MdMYB10 gene co-segregates with the Rni locus and is on Linkage Group (LG) 09 of the apple genome. Conclusion We have performed candidate gene mapping in a fruit tree crop and have provided genetic evidence that red colouration in the fruit core as well as red foliage are both controlled by a single locus named Rni. We have shown that the transcription factor MdMYB10 may be the gene underlying Rni as there were no recombinants between the marker for this gene and the red phenotype in a population of 516 individuals. Associating markers derived from candidate genes with a desirable phenotypic trait has demonstrated the application of genomic tools in a breeding programme of a horticultural crop species. PMID:17608951
Antennal transcriptome analysis of the piercing moth Oraesia emarginata (Lepidoptera: Noctuidae)

PubMed Central

Feng, Bo; Guo, Qianshuang; Zheng, Kaidi; Qin, Yuanxia; Du, Yongjun

2017-01-01

The piercing fruit moth Oraesia emarginata is an economically significant pest; however, our understanding of its olfactory mechanisms in infestation is limited. The present study conducted antennal transcriptome analysis of olfactory genes using real-time quantitative reverse transcription PCR analysis (RT-qPCR). We identified a total of 104 candidate chemosensory genes from several gene families, including 35 olfactory receptors (ORs), 41 odorant-binding proteins, 20 chemosensory proteins, 6 ionotropic receptors, and 2 sensory neuron membrane proteins. Seven candidate pheromone receptors (PRs) and 3 candidate pheromone-binding proteins (PBPs) for sex pheromone recognition were found. OemaOR29 and OemaPBP1 had the highest fragments per kb per million fragments (FPKM) values in all ORs and OBPs, respectively. Eighteen olfactory genes were upregulated in females, including 5 candidate PRs, and 20 olfactory genes were upregulated in males, including 2 candidate PRs (OemaOR29 and 4) and 2 PBPs (OemaPBP1 and 3). These genes may have roles in mediating sex-specific behaviors. Most candidate olfactory genes of sex pheromone recognition (except OemaOR29 and OemaPBP3) in O. emarginata were not clustered with those of studied noctuid species (type I pheromone). In addition, OemaOR29 was belonged to cluster PRIII, which comprise proteins that recognize type II pheromones instead of type I pheromones. The structure and function of olfactory genes that encode sex pheromones in O. emarginata might thus differ from those of other studied noctuids. The findings of the present study may help explain the molecular mechanism underlying olfaction and the evolution of olfactory genes encoding sex pheromones in O. emarginata. PMID:28614384
Using Zebrafish to Test the Genetic Basis of Human Craniofacial Diseases.

PubMed

Machado, R Grecco; Eames, B Frank

2017-10-01

Genome-wide association studies (GWASs) opened an innovative and productive avenue to investigate the molecular basis of human craniofacial disease. However, GWASs identify candidate genes only; they do not prove that any particular one is the functional villain underlying disease or just an unlucky genomic bystander. Genetic manipulation of animal models is the best approach to reveal which genetic loci identified from human GWASs are functionally related to specific diseases. The purpose of this review is to discuss the potential of zebrafish to resolve which candidate genetic loci are mechanistic drivers of craniofacial diseases. Many anatomic, embryonic, and genetic features of craniofacial development are conserved among zebrafish and mammals, making zebrafish a good model of craniofacial diseases. Also, the ability to manipulate gene function in zebrafish was greatly expanded over the past 20 y, enabling systems such as Gateway Tol2 and CRISPR-Cas9 to test gain- and loss-of-function alleles identified from human GWASs in coding and noncoding regions of DNA. With the optimization of genetic editing methods, large numbers of candidate genes can be efficiently interrogated. Finding the functional villains that underlie diseases will permit new treatments and prevention strategies and will increase understanding of how gene pathways operate during normal development.
Whole-Exome Sequencing Identifies Novel Variants for Tooth Agenesis.

PubMed

Dinckan, N; Du, R; Petty, L E; Coban-Akdemir, Z; Jhangiani, S N; Paine, I; Baugh, E H; Erdem, A P; Kayserili, H; Doddapaneni, H; Hu, J; Muzny, D M; Boerwinkle, E; Gibbs, R A; Lupski, J R; Uyguner, Z O; Below, J E; Letra, A

2018-01-01

Tooth agenesis is a common craniofacial abnormality in humans and represents failure to develop 1 or more permanent teeth. Tooth agenesis is complex, and variations in about a dozen genes have been reported as contributing to the etiology. Here, we combined whole-exome sequencing, array-based genotyping, and linkage analysis to identify putative pathogenic variants in candidate disease genes for tooth agenesis in 10 multiplex Turkish families. Novel homozygous and heterozygous variants in LRP6, DKK1, LAMA3, and COL17A1 genes, as well as known variants in WNT10A, were identified as likely pathogenic in isolated tooth agenesis. Novel variants in KREMEN1 were identified as likely pathogenic in 2 families with suspected syndromic tooth agenesis. Variants in more than 1 gene were identified segregating with tooth agenesis in 2 families, suggesting oligogenic inheritance. Structural modeling of missense variants suggests deleterious effects to the encoded proteins. Functional analysis of an indel variant (c.3607+3_6del) in LRP6 suggested that the predicted resulting mRNA is subject to nonsense-mediated decay. Our results support a major role for WNT pathways genes in the etiology of tooth agenesis while revealing new candidate genes. Moreover, oligogenic cosegregation was suggestive for complex inheritance and potentially complex gene product interactions during development, contributing to improved understanding of the genetic etiology of familial tooth agenesis.
Bioinformatics-Based Identification of Candidate Genes from QTLs Associated with Cell Wall Traits in Populus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ranjan, Priya; Yin, Tongming; Zhang, Xinye

2009-11-01

Quantitative trait locus (QTL) studies are an integral part of plant research and are used to characterize the genetic basis of phenotypic variation observed in structured populations and inform marker-assisted breeding efforts. These QTL intervals can span large physical regions on a chromosome comprising hundreds of genes, thereby hampering candidate gene identification. Genome history, evolution, and expression evidence can be used to narrow the genes in the interval to a smaller list that is manageable for detailed downstream functional genomics characterization. Our primary motivation for the present study was to address the need for a research methodology that identifies candidatemore » genes within a broad QTL interval. Here we present a bioinformatics-based approach for subdividing candidate genes within QTL intervals into alternate groups of high probability candidates. Application of this approach in the context of studying cell wall traits, specifically lignin content and S/G ratios of stem and root in Populus plants, resulted in manageable sets of genes of both known and putative cell wall biosynthetic function. These results provide a roadmap for future experimental work leading to identification of new genes controlling cell wall recalcitrance and, ultimately, in the utility of plant biomass as an energy feedstock.« less

Array-based comparative genomic hybridization-guided identification of reference genes for normalization of real-time quantitative polymerase chain reaction assay data for lymphomas, histiocytic sarcomas, and osteosarcomas of dogs.

PubMed

Tsai, Pei-Chien; Breen, Matthew

2012-09-01

To identify suitable reference genes for normalization of real-time quantitative PCR (RT-qPCR) assay data for common tumors of dogs. Malignant lymph node (n = 8), appendicular osteosarcoma (9), and histiocytic sarcoma (12) samples and control samples of various nonneoplastic canine tissues. Array-based comparative genomic hybridization (aCGH) data were used to guide selection of 9 candidate reference genes. Expression stability of candidate reference genes and 4 commonly used reference genes was determined for tumor samples with RT-qPCR assays and 3 software programs. LOC611555 was the candidate reference gene with the highest expression stability among the 3 tumor types. Of the commonly used reference genes, expression stability of HPRT was high in histiocytic sarcoma samples, and expression stability of Ubi and RPL32 was high in osteosarcoma samples. Some of the candidate reference genes had higher expression stability than did the commonly used reference genes. Data for constitutively expressed genes with high expression stability are required for normalization of RT-qPCR assay results. Without such data, accurate quantification of gene expression in tumor tissue samples is difficult. Results of the present study indicated LOC611555 may be a useful RT-qPCR assay reference gene for multiple tissue types. Some commonly used reference genes may be suitable for normalization of gene expression data for tumors of dogs, such as lymphomas, osteosarcomas, or histiocytic sarcomas.
Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function.

PubMed

Chasman, Daniel I; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; O'Seaghdha, Conall M; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D; Gierman, Hinco J; Feitosa, Mary F; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; de Andrade, Mariza; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S; van Duijn, Cornelia M; Borecki, Ingrid B; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M; Kao, W H Linda; Fox, Caroline S; Köttgen, Anna

2012-12-15

In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10(-9)) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10(-4)-2.2 × 10(-7). Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general.
Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function

PubMed Central

Chasman, Daniel I.; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A.; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; O'Seaghdha, Conall M.; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V.; O'Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D.; Gierman, Hinco J.; Feitosa, Mary F.; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A.; de Andrade, Mariza; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K.; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S.; van Duijn, Cornelia M.; Borecki, Ingrid B.; Kardia, Sharon L.R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M.; Kao, W.H. Linda; Fox, Caroline S.; Köttgen, Anna

2012-01-01

In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10−9) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10−4–2.2 × 10−7. Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general. PMID:22962313
Mutant Alleles of Photoperiod-1 in Wheat (Triticum aestivum L.) That Confer a Late Flowering Phenotype in Long Days

PubMed Central

Shaw, Lindsay M.; Turner, Adrian S.; Herry, Laurence; Griffiths, Simon; Laurie, David A.

2013-01-01

Flowering time in wheat and barley is known to be modified by mutations in the Photoperiod-1 (Ppd-1) gene. Semi-dominant Ppd-1a mutations conferring an early flowering phenotype are well documented in wheat but gene sequencing has also identified candidate loss of function mutations for Ppd-A1 and Ppd-D1. By analogy to the recessive ppd-H1 mutation in barley, loss of function mutations in wheat are predicted to delay flowering under long day conditions. To test this experimentally, introgression lines were developed in the spring wheat variety ‘Paragon’. Plants lacking a Ppd-B1 gene were identified from a gamma irradiated ‘Paragon’ population. These were crossed with the other introgression lines to generate plants with candidate loss of function mutations on one, two or three genomes. Lines lacking Ppd-B1 flowered 10 to 15 days later than controls under long days. Candidate loss of function Ppd-A1 alleles delayed flowering by 1 to 5 days while candidate loss of function Ppd-D1 alleles did not affect flowering time. Loss of Ppd-A1 gave an enhanced effect, and loss of Ppd-D1 became detectable in lines where Ppd-B1 was absent, indicating effects may be buffered by functional Ppd-1 alleles on other genomes. Expression analysis revealed that delayed flowering was associated with reduced expression of the TaFT1 gene and increased expression of TaCO1. A survey of the GEDIFLUX wheat collection grown in the UK and North Western Europe between the 1940s and 1980s and the A.E. Watkins global collection of landraces from the 1920s and 1930s showed that the identified candidate loss of function mutations for Ppd-D1 were common and widespread, while the identified candidate Ppd-A1 loss of function mutation was rare in countries around the Mediterranean and in the Far East but was common in North Western Europe. This may reflect a possible benefit of the latter in northern locations. PMID:24244507
From genomes to vaccines: Leishmania as a model.

PubMed Central

Almeida, Renata; Norrish, Alan; Levick, Mark; Vetrie, David; Freeman, Tom; Vilo, Jaak; Ivens, Alasdair; Lange, Uta; Stober, Carmel; McCann, Sharon; Blackwell, Jenefer M

2002-01-01

The 35 Mb genome of Leishmania should be sequenced by late 2002. It contains approximately 8500 genes that will probably translate into more than 10 000 proteins. In the laboratory we have been piloting strategies to try to harness the power of the genome-proteome for rapid screening of new vaccine candidate. To this end, microarray analysis of 1094 unique genes identified using an EST analysis of 2091 cDNA clones from spliced leader libraries prepared from different developmental stages of Leishmania has been employed. The plan was to identify amastigote-expressed genes that could be used in high-throughput DNA-vaccine screens to identify potential new vaccine candidates. Despite the lack of transcriptional regulation that polycistronic transcription in Leishmania dictates, the data provide evidence for a high level of post-transcriptional regulation of RNA abundance during the developmental cycle of promastigotes in culture and in lesion-derived amastigotes of Leishmania major. This has provided 147 candidates from the 1094 unique genes that are specifically upregulated in amastigotes and are being used in vaccine studies. Using DNA vaccination, it was demonstrated that pooling strategies can work to identify protective vaccines, but it was found that some potentially protective antigens are masked by other disease-exacerbatory antigens in the pool. A total of 100 new vaccine candidates are currently being tested separately and in pools to extend this analysis, and to facilitate retrospective bioinformatic analysis to develop predictive algorithms for sequences that constitute potentially protective antigens. We are also working with other members of the Leishmania Genome Network to determine whether RNA expression determined by microarray analyses parallels expression at the protein level. We believe we are making good progress in developing strategies that will allow rapid translation of the sequence of Leishmania into potential interventions for disease control in humans. PMID:11839176
Genome-wide association studies and epistasis analyses of candidate genes related to age at menarche and age at natural menopause in a Korean population.

PubMed

Pyun, Jung-A; Kim, Sunshin; Cho, Nam H; Koh, InSong; Lee, Jong-Young; Shin, Chol; Kwack, KyuBum

2014-05-01

The aim of this study was to identify polymorphisms and gene-gene interactions that are significantly associated with age at menarche and age at menopause in a Korean population. A total of 3,452 and 1,827 women participated in studies of age at menarche and age at natural menopause, respectively. Linear regression analyses adjusted for residence area were used to perform genome-wide association studies (GWAS), candidate gene association studies, and interactions between the candidate genes for age at menarche and age at natural menopause. In GWAS, four single nucleotide polymorphisms (SNPs; rs7528241, rs1324329, rs11597068, and rs6495785) were strongly associated with age at natural menopause (lowest P = 9.66 × 10). However, GWAS of age at menarche did not reveal any strong associations. In candidate gene association studies, SNPs with P < 0.01 were selected to test their synergistic interactions. For age at natural menopause, there was a significant interaction between intronic SNPs on ADAM metallopeptidase with thrombospondin type I motif 9 (ADAMTS9) and SMAD family member 3 (SMAD3) genes (P = 9.52 × 10). For age at menarche, there were three significant interactions between three intronic SNPs on follicle-stimulating hormone receptor (FSHR) gene and one SNP located at the 3' flanking region of insulin-like growth factor 2 receptor (IGF2R) gene (lowest P = 1.95 × 10). Novel SNPs and synergistic interactions between candidate genes are significantly associated with age at menarche and age at natural menopause in a Korean population.
GENE EXPRESSION PATTERNS ASSOCIATED WITH INFERTILITY IN HUMAN AND RODENT MODELS

EPA Science Inventory

Modern genomic technologies such as DNA arrays provide the means to investigate molecular interactions at an unprecedented level, and arrays have been used to carry out gene expression profiling as a means of identifying candidate genes involved in molecular mechanisms underlying...
Exploring Valid Reference Genes for Quantitative Real-time PCR Analysis in Plutella xylostella (Lepidoptera: Plutellidae)

PubMed Central

Fu, Wei; Xie, Wen; Zhang, Zhuo; Wang, Shaoli; Wu, Qingjun; Liu, Yong; Zhou, Xiaomao; Zhou, Xuguo; Zhang, Youjun

2013-01-01

Abstract: Quantitative real-time PCR (qRT-PCR), a primary tool in gene expression analysis, requires an appropriate normalization strategy to control for variation among samples. The best option is to compare the mRNA level of a target gene with that of reference gene(s) whose expression level is stable across various experimental conditions. In this study, expression profiles of eight candidate reference genes from the diamondback moth, Plutella xylostella, were evaluated under diverse experimental conditions. RefFinder, a web-based analysis tool, integrates four major computational programs including geNorm, Normfinder, BestKeeper, and the comparative ΔCt method to comprehensively rank the tested candidate genes. Elongation factor 1 (EF1) was the most suited reference gene for the biotic factors (development stage, tissue, and strain). In contrast, although appropriate reference gene(s) do exist for several abiotic factors (temperature, photoperiod, insecticide, and mechanical injury), we were not able to identify a single universal reference gene. Nevertheless, a suite of candidate reference genes were specifically recommended for selected experimental conditions. Our finding is the first step toward establishing a standardized qRT-PCR analysis of this agriculturally important insect pest. PMID:23983612
Digital transcriptome analysis of putative sex-determination genes in papaya (Carica papaya).

PubMed

Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

2012-01-01

Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Y(h)) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Y(h) chromosome, implying a loss of many genes on the Y(h) chromosome. Nevertheless, candidate Y(h) chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya.
Digital Transcriptome Analysis of Putative Sex-Determination Genes in Papaya (Carica papaya)

PubMed Central

Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

2012-01-01

Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Yh) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Yh chromosome, implying a loss of many genes on the Yh chromosome. Nevertheless, candidate Yh chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya. PMID:22815863
Non-coding cancer driver candidates identified with a sample- and position-specific model of the somatic mutation rate

PubMed Central

Juul, Malene; Bertl, Johanna; Guo, Qianyun; Nielsen, Morten Muhlig; Świtnicki, Michał; Hornshøj, Henrik; Madsen, Tobias; Hobolth, Asger; Pedersen, Jakob Skou

2017-01-01

Non-coding mutations may drive cancer development. Statistical detection of non-coding driver regions is challenged by a varying mutation rate and uncertainty of functional impact. Here, we develop a statistically founded non-coding driver-detection method, ncdDetect, which includes sample-specific mutational signatures, long-range mutation rate variation, and position-specific impact measures. Using ncdDetect, we screened non-coding regulatory regions of protein-coding genes across a pan-cancer set of whole-genomes (n = 505), which top-ranked known drivers and identified new candidates. For individual candidates, presence of non-coding mutations associates with altered expression or decreased patient survival across an independent pan-cancer sample set (n = 5454). This includes an antigen-presenting gene (CD1A), where 5’UTR mutations correlate significantly with decreased survival in melanoma. Additionally, mutations in a base-excision-repair gene (SMUG1) correlate with a C-to-T mutational-signature. Overall, we find that a rich model of mutational heterogeneity facilitates non-coding driver identification and integrative analysis points to candidates of potential clinical relevance. DOI: http://dx.doi.org/10.7554/eLife.21778.001 PMID:28362259
Using the methylome to identify aggressive Barrett’s esophagus — EDRN Public Portal

Cancer.gov

OVERALL STRATEGY: Our strategy will consist of using HumanMethylation450 arrays to identify methylation profiles and/or candidate methylated genes that distinguish BE from BE+LGD, BE+HGD and EAC (Aim 1). We will then assess whether these genes are predictive markers for aggressive BE (Aim 2)
Using Single-nucleotide Polymorphisms and Genetic Mapping to find Candidate Genes that Influence Varroa-Specific Hygiene

USDA-ARS?s Scientific Manuscript database

Varroa-sensitive hygienic (VSH) behavior is one of two behaviors identified that are most important for controlling the growth of Varroa mite populations in bee hives. A study was conducted to map quantitative trait loci (QTL) that influence VSH so that resistance genes could be identified. Crosses ...
The genetic characteristics of congenital hypothyroidism in China by comprehensive screening of 21 candidate genes.

PubMed

Sun, Feng; Zhang, Jun-Xiu; Yang, Chang-Yi; Gao, Guan-Qi; Zhu, Wen-Bin; Han, Bing; Zhang, Le-Le; Wan, Yue-Yue; Ye, Xiao-Ping; Ma, Yu-Ru; Zhang, Man-Man; Yang, Liu; Zhang, Qian-Yue; Liu, Wei; Guo, Cui-Cui; Chen, Gang; Zhao, Shuang-Xia; Song, Ke-Yi; Song, Huai-Dong

2018-06-01

Congenital hypothyroidism (CH), the most common neonatal metabolic disorder, is characterized by impaired neurodevelopment. Although several candidate genes have been associated with CH, comprehensive screening of causative genes has been limited. One hundred ten patients with primary CH were recruited in this study. All exons and exon-intron boundaries of 21 candidate genes for CH were analyzed by next-generation sequencing. And the inheritance pattern of causative genes was analyzed by the study of family pedigrees. Our results showed that 57 patients (51.82%) carried biallelic mutations (containing compound heterozygous mutations and homozygous mutations) in six genes ( DUOX2 , DUOXA2 , DUOXA1 , TG , TPO and TSHR ) involved in thyroid hormone synthesis. Autosomal recessive inheritance of CH caused by mutations in DUOX2 , DUOXA2 , TG and TPO was confirmed by analysis of 22 family pedigrees. Notably, eight mutations in four genes ( FOXE1 , NKX2-1 , PAX8 and HHEX ) that lead to thyroid dysgenesis were identified in eight probands. These mutations were heterozygous in all cases and hypothyroidism was not observed in parents of these probands. Most cases of congenital hypothyroidism in China were caused by thyroid dyshormonogenesis rather than thyroid dysgenesis. This study identified previously reported causative genes for 57/110 Chinese patients and revealed DUOX2 was the most frequently mutated gene in these patients. Our study expanded the mutation spectrum of CH in Chinese patients, which was significantly different from Western countries. © 2018 The authors.
Copy number variants analysis in a cohort of isolated and syndromic developmental delay/intellectual disability reveals novel genomic disorders, position effects and candidate disease genes.

PubMed

Di Gregorio, E; Riberi, E; Belligni, E F; Biamino, E; Spielmann, M; Ala, U; Calcia, A; Bagnasco, I; Carli, D; Gai, G; Giordano, M; Guala, A; Keller, R; Mandrile, G; Arduino, C; Maffè, A; Naretto, V G; Sirchia, F; Sorasio, L; Ungari, S; Zonta, A; Zacchetti, G; Talarico, F; Pappi, P; Cavalieri, S; Giorgio, E; Mancini, C; Ferrero, M; Brussino, A; Savin, E; Gandione, M; Pelle, A; Giachino, D F; De Marchi, M; Restagno, G; Provero, P; Cirillo Silengo, M; Grosso, E; Buxbaum, J D; Pasini, B; De Rubeis, S; Brusco, A; Ferrero, G B

2017-10-01

Array-comparative genomic hybridization (array-CGH) is a widely used technique to detect copy number variants (CNVs) associated with developmental delay/intellectual disability (DD/ID). Identification of genomic disorders in DD/ID. We performed a comprehensive array-CGH investigation of 1,015 consecutive cases with DD/ID and combined literature mining, genetic evidence, evolutionary constraint scores, and functional information in order to assess the pathogenicity of the CNVs. We identified non-benign CNVs in 29% of patients. Amongst the pathogenic variants (11%), detected with a yield consistent with the literature, we found rare genomic disorders and CNVs spanning known disease genes. We further identified and discussed 51 cases with likely pathogenic CNVs spanning novel candidate genes, including genes encoding synaptic components and/or proteins involved in corticogenesis. Additionally, we identified two deletions spanning potential Topological Associated Domain (TAD) boundaries probably affecting the regulatory landscape. We show how phenotypic and genetic analyses of array-CGH data allow unraveling complex cases, identifying rare disease genes, and revealing unexpected position effects. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
The Naïve Murine Cornea as a Model System to Identify Novel Endogenous Regulators of Lymphangiogenesis: TRAIL and rtPA.

PubMed

Regenfuß, Birgit; Dreisow, Marie-Luise; Hos, Deniz; Masli, Sharmila; Bock, Felix; Cursiefen, Claus

2015-06-01

In the murine cornea, which is an established model for analyzing pathologic lymphatic vessel growth, phenotypic heterogeneity of the endogenous lymphatic vessels in the limbus of the cornea was previously described. In this study, the cornea of BALB/c, C57BL/6, and FVB mice with different limbal lymphangiogenic phenotypes was analyzed to identify novel candidates potentially influencing lymphatic vessel growth. Pathway specific expression analysis of the cornea was performed to identify novel candidate genes. Corneal protein expression of the respective candidates was analyzed by fluorescent immunohistochemistry. The effect of the candidates on proliferation of human dermal lymphatic endothelial cells (HDLECs) was analyzed by BrdU proliferation ELISA. Thirteen genes were differentially regulated in corneas of mouse strains with more endogenous limbal lymphatic vessels (high-lymphangiogenic) (C57BL/6) compared to mouse strains with less endogenous limbal lymphatic vessels (low-lymphangiogenic) (BALB/c, FVB). Two candidates, Tumor necrosis factor (ligand) superfamily member 10 (Tnfsf10/Trail) and Plasminogen activator, tissue (Plat/tPA) were expressed in the cornea of BALB/c and C57BL/6 mice on the protein level. In vitro, Trail and recombinant tPA inhibited the proliferation of human dermal lymphatic endothelial cells. Molecular analysis of the naive cornea in mouse strains with different limbal lymphatic phenotypes is a valuable model to identify novel endogenous regulators of lymphangiogenesis.
Unexpected identification of a recurrent mutation in the DLX3 gene causing amelogenesis imperfecta.

PubMed

Kim, Y-J; Seymen, F; Koruyucu, M; Kasimoglu, Y; Gencay, K; Shin, T J; Hyun, H-K; Lee, Z H; Kim, J-W

2016-05-01

To identify the molecular genetic aetiology of a family with autosomal dominant amelogenesis imperfecta (AI). DNA samples were collected from a six-generation family, and the candidate gene approach was used to screen for the enamelin (ENAM) gene. Whole-exome sequencing and linkage analysis with SNP array data identified linked regions, and candidate gene screening was performed. Mutational analysis revealed a mutation (c.561_562delCT and p.Tyr188Glnfs*13) in the DLX3 gene. After finding a recurrent DLX3 mutation, the clinical phenotype of the family members was re-examined. The proband's mother had pulp elongation in the third molars. The proband had not hair phenotype, but her cousin had curly hair at birth. In this study, we identified a recurrent 2-bp deletional DLX3 mutation in a new family. The clinical phenotype was the mildest one associated with the DLX3 mutations. These results will advance the understanding of the functional role of DLX3 in developmental processes. © 2016 The Authors. Oral Diseases Published by John Wiley & Sons Ltd.
Omics and Environmental Science Genomic Approaches With Natural Fish Populations From Polluted Environments

PubMed Central

Bozinovic, Goran; Oleksiak, Marjorie F.

2010-01-01

Transcriptomics and population genomics are two complementary genomic approaches that can be used to gain insight into pollutant effects in natural populations. Transcriptomics identify altered gene expression pathways while population genomics approaches more directly target the causative genomic polymorphisms. Neither approach is restricted to a pre-determined set of genes or loci. Instead, both approaches allow a broad overview of genomic processes. Transcriptomics and population genomic approaches have been used to explore genomic responses in populations of fish from polluted environments and have identified sets of candidate genes and loci that appear biologically important in response to pollution. Often differences in gene expression or loci between polluted and reference populations are not conserved among polluted populations suggesting a biological complexity that we do not yet fully understand. As genomic approaches become less expensive with the advent of new sequencing and genotyping technologies, they will be more widely used in complimentary studies. However, while these genomic approaches are immensely powerful for identifying candidate gene and loci, the challenge of determining biological mechanisms that link genotypes and phenotypes remains. PMID:21072843
MMTV insertional mutagenesis identifies genes, gene families and pathways involved in mammary cancer.

PubMed

Theodorou, Vassiliki; Kimm, Melanie A; Boer, Mandy; Wessels, Lodewyk; Theelen, Wendy; Jonkers, Jos; Hilkens, John

2007-06-01

We performed a high-throughput retroviral insertional mutagenesis screen in mouse mammary tumor virus (MMTV)-induced mammary tumors and identified 33 common insertion sites, of which 17 genes were previously not known to be associated with mammary cancer and 13 had not previously been linked to cancer in general. Although members of the Wnt and fibroblast growth factors (Fgf) families were frequently tagged, our exhaustive screening for MMTV insertion sites uncovered a new repertoire of candidate breast cancer oncogenes. We validated one of these genes, Rspo3, as an oncogene by overexpression in a p53-deficient mammary epithelial cell line. The human orthologs of the candidate oncogenes were frequently deregulated in human breast cancers and associated with several tumor parameters. Computational analysis of all MMTV-tagged genes uncovered specific gene families not previously associated with cancer and showed a significant overrepresentation of protein domains and signaling pathways mainly associated with development and growth factor signaling. Comparison of all tagged genes in MMTV and Moloney murine leukemia virus-induced malignancies showed that both viruses target mostly different genes that act predominantly in distinct pathways.
Sporulation genes associated with sporulation efficiency in natural isolates of yeast.

PubMed

Tomar, Parul; Bhatia, Aatish; Ramdas, Shweta; Diao, Liyang; Bhanot, Gyan; Sinha, Himanshu

2013-01-01

Yeast sporulation efficiency is a quantitative trait and is known to vary among experimental populations and natural isolates. Some studies have uncovered the genetic basis of this variation and have identified the role of sporulation genes (IME1, RME1) and sporulation-associated genes (FKH2, PMS1, RAS2, RSF1, SWS2), as well as non-sporulation pathway genes (MKT1, TAO3) in maintaining this variation. However, these studies have been done mostly in experimental populations. Sporulation is a response to nutrient deprivation. Unlike laboratory strains, natural isolates have likely undergone multiple selections for quick adaptation to varying nutrient conditions. As a result, sporulation efficiency in natural isolates may have different genetic factors contributing to phenotypic variation. Using Saccharomyces cerevisiae strains in the genetically and environmentally diverse SGRP collection, we have identified genetic loci associated with sporulation efficiency variation in a set of sporulation and sporulation-associated genes. Using two independent methods for association mapping and correcting for population structure biases, our analysis identified two linked clusters containing 4 non-synonymous mutations in genes - HOS4, MCK1, SET3, and SPO74. Five regulatory polymorphisms in five genes such as MLS1 and CDC10 were also identified as putative candidates. Our results provide candidate genes contributing to phenotypic variation in the sporulation efficiency of natural isolates of yeast.

Sporulation Genes Associated with Sporulation Efficiency in Natural Isolates of Yeast

PubMed Central

Ramdas, Shweta; Diao, Liyang; Bhanot, Gyan; Sinha, Himanshu

2013-01-01

Yeast sporulation efficiency is a quantitative trait and is known to vary among experimental populations and natural isolates. Some studies have uncovered the genetic basis of this variation and have identified the role of sporulation genes (IME1, RME1) and sporulation-associated genes (FKH2, PMS1, RAS2, RSF1, SWS2), as well as non-sporulation pathway genes (MKT1, TAO3) in maintaining this variation. However, these studies have been done mostly in experimental populations. Sporulation is a response to nutrient deprivation. Unlike laboratory strains, natural isolates have likely undergone multiple selections for quick adaptation to varying nutrient conditions. As a result, sporulation efficiency in natural isolates may have different genetic factors contributing to phenotypic variation. Using Saccharomyces cerevisiae strains in the genetically and environmentally diverse SGRP collection, we have identified genetic loci associated with sporulation efficiency variation in a set of sporulation and sporulation-associated genes. Using two independent methods for association mapping and correcting for population structure biases, our analysis identified two linked clusters containing 4 non-synonymous mutations in genes – HOS4, MCK1, SET3, and SPO74. Five regulatory polymorphisms in five genes such as MLS1 and CDC10 were also identified as putative candidates. Our results provide candidate genes contributing to phenotypic variation in the sporulation efficiency of natural isolates of yeast. PMID:23874994
Genetic dissection and validation of candidate genes for flag leaf size in rice (Oryza sativa L.).

PubMed

Tang, Xinxin; Gong, Rong; Sun, Wenqiang; Zhang, Chaopu; Yu, Sibin

2018-04-01

Two major loci with functional candidate genes were identified and validated affecting flag leaf size, which offer desirable genes to improve leaf architecture and photosynthetic capacity in rice. Leaf size is a major determinant of plant architecture and yield potential in crops. However, the genetic and molecular mechanisms regulating leaf size remain largely elusive. In this study, quantitative trait loci (QTLs) for flag leaf length and flag leaf width in rice were detected with high-density single nucleotide polymorphism genotyping of a chromosomal segment substitution line (CSSL) population, in which each line carries one or a few chromosomal segments from the japonica cultivar Nipponbare in a common background of the indica variety Zhenshan 97. In total, 14 QTLs for flag leaf length and nine QTLs for flag leaf width were identified in the CSSL population. Among them, qFW4-2 for flag leaf width was mapped to a 37-kb interval, with the most likely candidate gene being the previously characterized NAL1. Another major QTL for both flag leaf width and length was delimited by substitution mapping to a small region of 13.5 kb that contains a single gene, Ghd7.1. Mutants of Ghd7.1 generated using CRISPR/CAS9 approach showed reduced leaf size. Allelic variation analyses also validated Ghd7.1 as a functional candidate gene for leaf size, photosynthetic capacity and other yield-related traits. These results provide useful genetic information for the improvement of leaf size and yield in rice breeding programs.
Genetics of cortisol secretion and depressive symptoms: a candidate gene and genome wide association approach.

PubMed

Velders, Fleur P; Kuningas, Maris; Kumari, Meena; Dekker, Marieke J; Uitterlinden, Andre G; Kirschbaum, Clemens; Hek, Karin; Hofman, Albert; Verhulst, Frank C; Kivimaki, Mika; Van Duijn, Cornelia M; Walker, Brian R; Tiemeier, Henning

2011-08-01

Depressive patients often have altered cortisol secretion, but few studies have investigated genetic variants in relation to both cortisol secretion and depression. To identify genes related to both these conditions, we: (1) tested the association of single nucleotide polymorphisms (SNPs) in hypothalamic-pituitary-adrenal-axis (HPA-axis) candidate genes with a summary measure of total cortisol secretion during the day (cortisol(AUC)), (2) performed a genome wide association study (GWAS) of cortisol(AUC), and (3) tested the association of identified cortisol-related SNPs with depressive symptoms. We analyzed data on candidate SNPs for the HPA-axis, genome-wide scans, cortisol secretion (n=1711) and depressive symptoms (the Centre for Epidemiology Studies Depression Scale, CES-D) (n=2928) in elderly persons of the Rotterdam Study. We used data from the Whitehall II study (n=2836) to replicate the GWAS findings. Of the 1456 SNPs in 33 candidate genes, minor alleles of 4 SNPs (rs9470080, rs9394309, rs7748266 and rs1360780) in the FKBP5 gene were associated with a decreased cortisol(AUC) (p<1×10(-4) after correction for multiple testing using permutations). These SNPs were also associated with an increased risk of depressive symptoms (rs9470080: OR 1.19 (95%CI 1.0; 1.4)). The GWAS for cortisol yielded 2 SNPs with p-values of 1×10(-06) (rs8062512, rs2252459), but these associations could not be replicated. These results suggest that variation in the FKBP5 gene is associated with both cortisol(AUC) and the likelihood of depressive symptoms. Copyright © 2011 Elsevier Ltd. All rights reserved.
Identification of QTN and candidate genes for Salinity Tolerance at the Germination and Seedling Stages in Rice by Genome-Wide Association Analyses.

PubMed

Naveed, Shahzad Amir; Zhang, Fan; Zhang, Jian; Zheng, Tian-Qing; Meng, Li-Jun; Pang, Yun-Long; Xu, Jian-Long; Li, Zhi-Kang

2018-04-25

To facilitate developing rice varieties tolerant to salt stress, a panel of 208 rice mini-core accessions collected from 25 countries were evaluated for 13 traits associated with salt tolerance (ST) at the germination and seedling stages. The rice panel showed tremendous variation for all measured ST traits and eight accessions showing high levels of ST at either and/or both the germination and seedling stages. Using 395,553 SNP markers covering ~372 Mb of the rice genome and multi-locus mixed linear models, 20 QTN associated with 11 ST traits were identified by GWAS, including 6 QTN affecting ST at the germination stage and 14 QTN for ST at the seedling stage. The integration of bioinformatic with haplotype analyses for the ST QTN lets us identify 22 candidate genes for nine important ST QTN (qGR3, qSNK1, qSNK12, qSNC1, qSNC6, qRNK2, qSDW9a, qSST5 and qSST9). These candidate genes included three known ST genes (SKC1, OsTZF1 and OsEATB) for QTN qSNK1 qSST5 and qSST9. Candidate genes showed significant phenotypic differences in ST traits were detected between or among 2-4 major haplotypes. Thus, our results provided useful materials and genetic information for improving rice ST in future breeding and for molecular dissection of ST in rice.
Screening of the Filamin C Gene in a Large Cohort of Hypertrophic Cardiomyopathy Patients.

PubMed

Gómez, Juan; Lorca, Rebeca; Reguero, Julian R; Morís, César; Martín, María; Tranche, Salvador; Alonso, Belén; Iglesias, Sara; Alvarez, Victoria; Díaz-Molina, Beatriz; Avanzas, Pablo; Coto, Eliecer

2017-04-01

Recent exome sequencing studies identified filamin C ( FLNC ) as a candidate gene for hypertrophic cardiomyopathy (HCM). Our aim was to determine the rate of FLNC candidate variants in a large cohort of HCM patients who were also sequenced for the main sarcomere genes. A total of 448 HCM patients were next generation-sequenced (semiconductor chip technology) for the MYH7, MYBPC3 , TNNT2 , TNNI3 , ACTC1 , TNNC1 , MYL2 , MYL3 , TPM1 , and FLNC genes. We also sequenced 450 healthy controls from the same population. Based on the reported population frequencies, bioinformatic criteria, and familial segregation, we identified 20 FLNC candidate variants (13 new; 1 nonsense; and 19 missense) in 22 patients. Compared with the patients, only 1 of the control's missense variants was nonreported ( P =0.007; Fisher exact probability test). Based on the familial segregation and the reported functional studies, 6 of the candidate variants (in 7 patients) were finally classified as likely pathogenic, 10 as variants of uncertain significance, and 4 as likely benign. We provide a compelling evidence of the involvement of FLNC in the development of HCM. Most of the FLNC variants were associated with mild forms of HCM and a reduced penetrance, with few affected in the families to confirm the segregation. Our work, together with others who found FLNC variants among patients with dilated and restrictive cardiomyopathies, pointed to this gene as an important cause of structural cardiomyopathies. © 2017 American Heart Association, Inc.
Composite selection signals can localize the trait specific genomic regions in multi-breed populations of cattle and sheep

PubMed Central

2014-01-01

Background Discerning the traits evolving under neutral conditions from those traits evolving rapidly because of various selection pressures is a great challenge. We propose a new method, composite selection signals (CSS), which unifies the multiple pieces of selection evidence from the rank distribution of its diverse constituent tests. The extreme CSS scores capture highly differentiated loci and underlying common variants hauling excess haplotype homozygosity in the samples of a target population. Results The data on high-density genotypes were analyzed for evidence of an association with either polledness or double muscling in various cohorts of cattle and sheep. In cattle, extreme CSS scores were found in the candidate regions on autosome BTA-1 and BTA-2, flanking the POLL locus and MSTN gene, for polledness and double muscling, respectively. In sheep, the regions with extreme scores were localized on autosome OAR-2 harbouring the MSTN gene for double muscling and on OAR-10 harbouring the RXFP2 gene for polledness. In comparison to the constituent tests, there was a partial agreement between the signals at the four candidate loci; however, they consistently identified additional genomic regions harbouring no known genes. Persuasively, our list of all the additional significant CSS regions contains genes that have been successfully implicated to secondary phenotypic diversity among several subpopulations in our data. For example, the method identified a strong selection signature for stature in cattle capturing selective sweeps harbouring UQCC-GDF5 and PLAG1-CHCHD7 gene regions on BTA-13 and BTA-14, respectively. Both gene pairs have been previously associated with height in humans, while PLAG1-CHCHD7 has also been reported for stature in cattle. In the additional analysis, CSS identified significant regions harbouring multiple genes for various traits under selection in European cattle including polledness, adaptation, metabolism, growth rate, stature, immunity, reproduction traits and some other candidate genes for dairy and beef production. Conclusions CSS successfully localized the candidate regions in validation datasets as well as identified previously known and novel regions for various traits experiencing selection pressure. Together, the results demonstrate the utility of CSS by its improved power, reduced false positives and high-resolution of selection signals as compared to individual constituent tests. PMID:24636660
An Integration of Genome-Wide Association Study and Gene Expression Profiling to Prioritize the Discovery of Novel Susceptibility Loci for Osteoporosis-Related Traits

PubMed Central

Demissie, Serkalem; Soranzo, Nicole; Bianchi, Estelle N.; Grundberg, Elin; Liang, Liming; Richards, J. Brent; Estrada, Karol; Zhou, Yanhua; van Nas, Atila; Moffatt, Miriam F.; Zhai, Guangju; Hofman, Albert; van Meurs, Joyce B.; Pols, Huibert A. P.; Price, Roger I.; Nilsson, Olle; Pastinen, Tomi; Cupples, L. Adrienne; Lusis, Aldons J.; Schadt, Eric E.; Ferrari, Serge; Uitterlinden, André G.

2010-01-01

Osteoporosis is a complex disorder and commonly leads to fractures in elderly persons. Genome-wide association studies (GWAS) have become an unbiased approach to identify variations in the genome that potentially affect health. However, the genetic variants identified so far only explain a small proportion of the heritability for complex traits. Due to the modest genetic effect size and inadequate power, true association signals may not be revealed based on a stringent genome-wide significance threshold. Here, we take advantage of SNP and transcript arrays and integrate GWAS and expression signature profiling relevant to the skeletal system in cellular and animal models to prioritize the discovery of novel candidate genes for osteoporosis-related traits, including bone mineral density (BMD) at the lumbar spine (LS) and femoral neck (FN), as well as geometric indices of the hip (femoral neck-shaft angle, NSA; femoral neck length, NL; and narrow-neck width, NW). A two-stage meta-analysis of GWAS from 7,633 Caucasian women and 3,657 men, revealed three novel loci associated with osteoporosis-related traits, including chromosome 1p13.2 (RAP1A, p = 3.6×10−8), 2q11.2 (TBC1D8), and 18q11.2 (OSBPL1A), and confirmed a previously reported region near TNFRSF11B/OPG gene. We also prioritized 16 suggestive genome-wide significant candidate genes based on their potential involvement in skeletal metabolism. Among them, 3 candidate genes were associated with BMD in women. Notably, 2 out of these 3 genes (GPR177, p = 2.6×10−13; SOX6, p = 6.4×10−10) associated with BMD in women have been successfully replicated in a large-scale meta-analysis of BMD, but none of the non-prioritized candidates (associated with BMD) did. Our results support the concept of our prioritization strategy. In the absence of direct biological support for identified genes, we highlighted the efficiency of subsequent functional characterization using publicly available expression profiling relevant to the skeletal system in cellular or whole animal models to prioritize candidate genes for further functional validation. PMID:20548944
Exome sequencing of oral squamous cell carcinoma in users of Arabian snuff reveals novel candidates for driver genes.

PubMed

Al-Hebshi, Nezar Noor; Li, Shiyong; Nasher, Akram Thabet; El-Setouhy, Maged; Alsanosi, Rashad; Blancato, Jan; Loffredo, Christopher

2016-07-15

The study sought to identify genetic aberrations driving oral squamous cell carcinoma (OSCC) development among users of shammah, an Arabian preparation of smokeless tobacco. Twenty archival OSCC samples, 15 of which with a history of shammah exposure, were whole-exome sequenced at an average depth of 127×. Somatic mutations were identified using a novel, matched controls-independent filtration algorithm. CODEX and Exomedepth coupled with a novel, Database of Genomic Variant-based filter were employed to call somatic gene-copy number variations. Significantly mutated genes were identified with Oncodrive FM and the Youn and Simon's method. Candidate driver genes were nominated based on Gene Set Enrichment Analysis. The observed mutational spectrum was similar to that reported by the TCGA project. In addition to confirming known genes of OSCC (TP53, CDKNA2, CASP8, PIK3CA, HRAS, FAT1, TP63, CCND1 and FADD) the analysis identified several candidate novel driver events including mutations of NOTCH3, CSMD3, CRB1, CLTCL1, OSMR and TRPM2, amplification of the proto-oncogenes FOSL1, RELA, TRAF6, MDM2, FRS2 and BAG1, and deletion of the recently described tumor suppressor SMARCC1. Analysis also revealed significantly altered pathways not previously implicated in OSCC including Oncostatin-M signalling pathway, AP-1 and C-MYB transcription networks and endocytosis. There was a trend for higher number of mutations, amplifications and driver events in samples with history of shammah exposure particularly those that tested EBV positive, suggesting an interaction between tobacco exposure and EBV. The work provides further evidence for the genetic heterogeneity of oral cancer and suggests shammah-associated OSCC is characterized by extensive amplification of oncogenes. © 2016 UICC.
Genomic signatures of fine-scale local selection in Atlantic salmon suggest involvement of sexual maturation, energy homeostasis and immune defence-related genes.

PubMed

Pritchard, Victoria L; Mäkinen, Hannu; Vähä, Juha-Pekka; Erkinaro, Jaakko; Orell, Panu; Primmer, Craig R

2018-06-01

Elucidating the genetic basis of adaptation to the local environment can improve our understanding of how the diversity of life has evolved. In this study, we used a dense SNP array to identify candidate loci potentially underlying fine-scale local adaptation within a large Atlantic salmon (Salmo salar) population. By combining outlier, gene-environment association and haplotype homozygosity analyses, we identified multiple regions of the genome with strong evidence for diversifying selection. Several of these candidate regions had previously been identified in other studies, demonstrating that the same loci could be adaptively important in Atlantic salmon at subdrainage, regional and continental scales. Notably, we identified signals consistent with local selection around genes associated with variation in sexual maturation, energy homeostasis and immune defence. These included the large-effect age-at-maturity gene vgll3, the known obesity gene mc4r, and major histocompatibility complex II. Most strikingly, we confirmed a genomic region on Ssa09 that was extremely differentiated among subpopulations and that is also a candidate for local selection over the global range of Atlantic salmon. This region colocalized with a haplotype strongly associated with spawning ecotype in sockeye salmon (Oncorhynchus nerka), with circumstantial evidence that the same gene (six6) may be the selective target in both cases. The phenotypic effect of this region in Atlantic salmon remains cryptic, although allelic variation is related to upstream catchment area and covaries with timing of the return spawning migration. Our results further inform management of Atlantic salmon and open multiple avenues for future research. © 2018 John Wiley & Sons Ltd.
Implications of genome wide association studies for addiction: are our a priori assumptions all wrong?

PubMed

Hall, F Scott; Drgonova, Jana; Jain, Siddharth; Uhl, George R

2013-12-01

Substantial genetic contributions to addiction vulnerability are supported by data from twin studies, linkage studies, candidate gene association studies and, more recently, Genome Wide Association Studies (GWAS). Parallel to this work, animal studies have attempted to identify the genes that may contribute to responses to addictive drugs and addiction liability, initially focusing upon genes for the targets of the major drugs of abuse. These studies identified genes/proteins that affect responses to drugs of abuse; however, this does not necessarily mean that variation in these genes contributes to the genetic component of addiction liability. One of the major problems with initial linkage and candidate gene studies was an a priori focus on the genes thought to be involved in addiction based upon the known contributions of those proteins to drug actions, making the identification of novel genes unlikely. The GWAS approach is systematic and agnostic to such a priori assumptions. From the numerous GWAS now completed several conclusions may be drawn: (1) addiction is highly polygenic; each allelic variant contributing in a small, additive fashion to addiction vulnerability; (2) unexpected, compared to our a priori assumptions, classes of genes are most important in explaining addiction vulnerability; (3) although substantial genetic heterogeneity exists, there is substantial convergence of GWAS signals on particular genes. This review traces the history of this research; from initial transgenic mouse models based upon candidate gene and linkage studies, through the progression of GWAS for addiction and nicotine cessation, to the current human and transgenic mouse studies post-GWAS. © 2013.
Finding gene regulatory network candidates using the gene expression knowledge base.

PubMed

Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

2014-12-10

Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.
Single Nucleotide Polymorphisms in IL8 and TLR4 Genes as Candidates for Digital Dermatitis Resistance/Susceptibility in Holstein Cattle.

PubMed

El-Shafaey, El-Sayed; Ateya, Ahmed; Ramadan, Hazem; Saleh, Rasha; Elseady, Yousef; Abo El Fadl, Eman; El-Khodery, Sabry

2017-04-03

Relatedness between single nucleotide polymorphisms in IL8 and TLR4 genes and digital dermatitis resistance/susceptibility was investigated in seventy Holstein dairy cows. Animals were assigned into two groups, affected group (n = 35) and resistant group (n = 35) based on clinical signs and previous history of farm clinical records. Blood samples were collected for DNA extraction to ampliy fragments of 267-bp and 382-bp for IL8 and TLR4 genes, respectively. PCR-DNA sequencing revealed three SNPs in each of IL8 and TLR4 genes. The identified SNPs associated with digital dermatitis resistance were C94T, A220G, and T262A for IL8 and C118T for TLR4. However, the G349C and C355A SNPs in TLR4 gene were associated with digital dermatitis susceptibility. Chi-square analysis for comparison the distribution of all identified SNPs in both IL8 and TLR4 genes between resistant and affected animals showed no significant variation among the identified SNPs in IL8 gene. Meanwhile, there was a significant variation in case of TLR4 gene. As a pilot study, the present results revealed that identified SNPs in IL8 and TLR4 genes can be used as a genetic marker and predisposing factor for resistance/susceptibility to digital dermatitis in dairy cows. However, TLR4 gene may be a potential candidate for such disease.
Comparative Transcriptome Analysis Identifies Putative Genes Involved in the Biosynthesis of Xanthanolides in Xanthium strumarium L.

PubMed Central

Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng

2016-01-01

Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides. PMID:27625674
Comparative genomics identifies candidate genes for infectious salmon anemia (ISA) resistance in Atlantic salmon (Salmo salar).

PubMed

Li, Jieying; Boroevich, Keith A; Koop, Ben F; Davidson, William S

2011-04-01

Infectious salmon anemia (ISA) has been described as the hoof and mouth disease of salmon farming. ISA is caused by a lethal and highly communicable virus, which can have a major impact on salmon aquaculture, as demonstrated by an outbreak in Chile in 2007. A quantitative trait locus (QTL) for ISA resistance has been mapped to three microsatellite markers on linkage group (LG) 8 (Chr 15) on the Atlantic salmon genetic map. We identified bacterial artificial chromosome (BAC) clones and three fingerprint contigs from the Atlantic salmon physical map that contains these markers. We made use of the extensive BAC end sequence database to extend these contigs by chromosome walking and identified additional two markers in this region. The BAC end sequences were used to search for conserved synteny between this segment of LG8 and the fish genomes that have been sequenced. An examination of the genes in the syntenic segments of the tetraodon and medaka genomes identified candidates for association with ISA resistance in Atlantic salmon based on differential expression profiles from ISA challenges or on the putative biological functions of the proteins they encode. One gene in particular, HIV-EP2/MBP-2, caught our attention as it may influence the expression of several genes that have been implicated in the response to infection by infectious salmon anemia virus (ISAV). Therefore, we suggest that HIV-EP2/MBP-2 is a very strong candidate for the gene associated with the ISAV resistance QTL in Atlantic salmon and is worthy of further study.
A genome scan for selection signatures comparing farmed Atlantic salmon with two wild populations: Testing colocalization among outlier markers, candidate genes, and quantitative trait loci for production traits.

PubMed

Liu, Lei; Ang, Keng Pee; Elliott, J A K; Kent, Matthew Peter; Lien, Sigbjørn; MacDonald, Danielle; Boulding, Elizabeth Grace

2017-03-01

Comparative genome scans can be used to identify chromosome regions, but not traits, that are putatively under selection. Identification of targeted traits may be more likely in recently domesticated populations under strong artificial selection for increased production. We used a North American Atlantic salmon 6K SNP dataset to locate genome regions of an aquaculture strain (Saint John River) that were highly diverged from that of its putative wild founder population (Tobique River). First, admixed individuals with partial European ancestry were detected using STRUCTURE and removed from the dataset. Outlier loci were then identified as those showing extreme differentiation between the aquaculture population and the founder population. All Arlequin methods identified an overlapping subset of 17 outlier loci, three of which were also identified by BayeScan. Many outlier loci were near candidate genes and some were near published quantitative trait loci (QTLs) for growth, appetite, maturity, or disease resistance. Parallel comparisons using a wild, nonfounder population (Stewiacke River) yielded only one overlapping outlier locus as well as a known maturity QTL. We conclude that genome scans comparing a recently domesticated strain with its wild founder population can facilitate identification of candidate genes for traits known to have been under strong artificial selection.
Identifying candidate genes for 2p15p16.1 microdeletion syndrome using clinical, genomic, and functional analysis

PubMed Central

Bagheri, Hani; Badduke, Chansonette; Qiao, Ying; Colnaghi, Rita; Abramowicz, Iga; Alcantara, Diana; Dunham, Christopher; Wen, Jiadi; Wildin, Robert S.; Nowaczyk, Malgorzata J.M.; Eichmeyer, Jennifer; Lehman, Anna; Maranda, Bruno; Martell, Sally; Shan, Xianghong; Lewis, Suzanne M.E.; O’Driscoll, Mark; Gregory-Evans, Cheryl Y.

2016-01-01

The 2p15p16.1 microdeletion syndrome has a core phenotype consisting of intellectual disability, microcephaly, hypotonia, delayed growth, common craniofacial features, and digital anomalies. So far, more than 20 cases of 2p15p16.1 microdeletion syndrome have been reported in the literature; however, the size of the deletions and their breakpoints vary, making it difficult to identify the candidate genes. Recent reports pointed to 4 genes (XPO1, USP34, BCL11A, and REL) that were included, alone or in combination, in the smallest deletions causing the syndrome. Here, we describe 8 new patients with the 2p15p16.1 deletion and review all published cases to date. We demonstrate functional deficits for the above 4 candidate genes using patients’ lymphoblast cell lines (LCLs) and knockdown of their orthologs in zebrafish. All genes were dosage sensitive on the basis of reduced protein expression in LCLs. In addition, deletion of XPO1, a nuclear exporter, cosegregated with nuclear accumulation of one of its cargo molecules (rpS5) in patients’ LCLs. Other pathways associated with these genes (e.g., NF-κB and Wnt signaling as well as the DNA damage response) were not impaired in patients’ LCLs. Knockdown of xpo1a, rel, bcl11aa, and bcl11ab resulted in abnormal zebrafish embryonic development including microcephaly, dysmorphic body, hindered growth, and small fins as well as structural brain abnormalities. Our multifaceted analysis strongly implicates XPO1, REL, and BCL11A as candidate genes for 2p15p16.1 microdeletion syndrome. PMID:27699255
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

USGS Publications Warehouse

Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon

2016-01-01

Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.
Genome-wide detection of selection signatures in Chinese indigenous Laiwu pigs revealed candidate genes regulating fat deposition in muscle.

PubMed

Chen, Minhui; Wang, Jiying; Wang, Yanping; Wu, Ying; Fu, Jinluan; Liu, Jian-Feng

2018-05-18

Currently, genome-wide scans for positive selection signatures in commercial breed have been investigated. However, few studies have focused on selection footprints of indigenous breeds. Laiwu pig is an invaluable Chinese indigenous pig breed with extremely high proportion of intramuscular fat (IMF), and an excellent model to detect footprint as the result of natural and artificial selection for fat deposition in muscle. In this study, based on GeneSeek Genomic profiler Porcine HD data, three complementary methods, F ST , iHS (integrated haplotype homozygosity score) and CLR (composite likelihood ratio), were implemented to detect selection signatures in the whole genome of Laiwu pigs. Totally, 175 candidate selected regions were obtained by at least two of the three methods, which covered 43.75 Mb genomic regions and corresponded to 1.79% of the genome sequence. Gene annotation of the selected regions revealed a list of functionally important genes for feed intake and fat deposition, reproduction, and immune response. Especially, in accordance to the phenotypic features of Laiwu pigs, among the candidate genes, we identified several genes, NPY1R, NPY5R, PIK3R1 and JAKMIP1, involved in the actions of two sets of neurons, which are central regulators in maintaining the balance between food intake and energy expenditure. Our results identified a number of regions showing signatures of selection, as well as a list of functionally candidate genes with potential effect on phenotypic traits, especially fat deposition in muscle. Our findings provide insights into the mechanisms of artificial selection of fat deposition and further facilitate follow-up functional studies.
Identification of Human Disease Genes from Interactome Network Using Graphlet Interaction

PubMed Central

Yang, Lun; Wei, Dong-Qing; Qi, Ying-Xin; Jiang, Zong-Lai

2014-01-01

Identifying genes related to human diseases, such as cancer and cardiovascular disease, etc., is an important task in biomedical research because of its applications in disease diagnosis and treatment. Interactome networks, especially protein-protein interaction networks, had been used to disease genes identification based on the hypothesis that strong candidate genes tend to closely relate to each other in some kinds of measure on the network. We proposed a new measure to analyze the relationship between network nodes which was called graphlet interaction. The graphlet interaction contained 28 different isomers. The results showed that the numbers of the graphlet interaction isomers between disease genes in interactome networks were significantly larger than random picked genes, while graphlet signatures were not. Then, we designed a new type of score, based on the network properties, to identify disease genes using graphlet interaction. The genes with higher scores were more likely to be disease genes, and all candidate genes were ranked according to their scores. Then the approach was evaluated by leave-one-out cross-validation. The precision of the current approach achieved 90% at about 10% recall, which was apparently higher than the previous three predominant algorithms, random walk, Endeavour and neighborhood based method. Finally, the approach was applied to predict new disease genes related to 4 common diseases, most of which were identified by other independent experimental researches. In conclusion, we demonstrate that the graphlet interaction is an effective tool to analyze the network properties of disease genes, and the scores calculated by graphlet interaction is more precise in identifying disease genes. PMID:24465923
Characterisation of the macrophage transcriptome in glomerulonephritis-susceptible and -resistant rat strains

PubMed Central

Maratou, Klio; Behmoaras, Jacques; Fewings, Chris; Srivastava, Prashant; D’Souza, Zelpha; Smith, Jennifer; Game, Laurence; Cook, Terence; Aitman, Tim

2010-01-01

Crescentic glomerulonephritis (CRGN) is a major cause of rapidly progressive renal failure for which the underlying genetic basis is unknown. WKY rats show marked susceptibility to CRGN, while Lewis rats are resistant. Glomerular injury and crescent formation are macrophage-dependent and mainly explained by seven quantitative trait loci (Crgn1-7). Here, we used microarray analysis in basal and lipopolysaccharide (LPS)-stimulated macrophages to identify genes that reside on pathways predisposing WKY rats to CRGN. We detected 97 novel positional candidates for the uncharacterised Crgn3-7. We identified 10 additional secondary effector genes with profound differences in expression between the two strains (>5-fold change, <1% False Discovery Rate) for basal and LPS-stimulated macrophages. Moreover, we identified 8 genes with differentially expressed alternatively spliced isoforms, by using an in depth analysis at probe-level that allowed us to discard false positives due to polymorphisms between the two rat strains. Pathway analysis identified several common linked pathways, enriched for differentially expressed genes, which affect macrophage activation. In summary, our results identify distinct macrophage transcriptome profiles between two rat strains that differ in susceptibility to glomerulonephritis, provide novel positional candidates for Crgn3-7, and define groups of genes that play a significant role in differential regulation of macrophage activity. PMID:21179115

Integrative analysis of gene expression and DNA methylation using unsupervised feature extraction for detecting candidate cancer biomarkers.

PubMed

Moon, Myungjin; Nakai, Kenta

2018-04-01

Currently, cancer biomarker discovery is one of the important research topics worldwide. In particular, detecting significant genes related to cancer is an important task for early diagnosis and treatment of cancer. Conventional studies mostly focus on genes that are differentially expressed in different states of cancer; however, noise in gene expression datasets and insufficient information in limited datasets impede precise analysis of novel candidate biomarkers. In this study, we propose an integrative analysis of gene expression and DNA methylation using normalization and unsupervised feature extractions to identify candidate biomarkers of cancer using renal cell carcinoma RNA-seq datasets. Gene expression and DNA methylation datasets are normalized by Box-Cox transformation and integrated into a one-dimensional dataset that retains the major characteristics of the original datasets by unsupervised feature extraction methods, and differentially expressed genes are selected from the integrated dataset. Use of the integrated dataset demonstrated improved performance as compared with conventional approaches that utilize gene expression or DNA methylation datasets alone. Validation based on the literature showed that a considerable number of top-ranked genes from the integrated dataset have known relationships with cancer, implying that novel candidate biomarkers can also be acquired from the proposed analysis method. Furthermore, we expect that the proposed method can be expanded for applications involving various types of multi-omics datasets.
Systematic analysis of copy number variation associated with congenital diaphragmatic hernia.

PubMed

Zhu, Qihui; High, Frances A; Zhang, Chengsheng; Cerveira, Eliza; Russell, Meaghan K; Longoni, Mauro; Joy, Maliackal P; Ryan, Mallory; Mil-Homens, Adam; Bellfy, Lauren; Coletti, Caroline M; Bhayani, Pooja; Hila, Regis; Wilson, Jay M; Donahoe, Patricia K; Lee, Charles

2018-05-15

Congenital diaphragmatic hernia (CDH), characterized by malformation of the diaphragm and hypoplasia of the lungs, is one of the most common and severe birth defects, and is associated with high morbidity and mortality rates. There is growing evidence demonstrating that genetic factors contribute to CDH, although the pathogenesis remains largely elusive. Single-nucleotide polymorphisms have been studied in recent whole-exome sequencing efforts, but larger copy number variants (CNVs) have not yet been studied on a large scale in a case control study. To capture CNVs within CDH candidate regions, we developed and tested a targeted array comparative genomic hybridization platform to identify CNVs within 140 regions in 196 patients and 987 healthy controls, and identified six significant CNVs that were either unique to patients or enriched in patients compared with controls. These CDH-associated CNVs reveal high-priority candidate genes including HLX , LHX1 , and HNF1B We also discuss CNVs that are present in only one patient in the cohort but have additional evidence of pathogenicity, including extremely rare large and/or de novo CNVs. The candidate genes within these predicted disease-causing CNVs form functional networks with other known CDH genes and play putative roles in DNA binding/transcription regulation and embryonic development. These data substantiate the importance of CNVs in the etiology of CDH, identify CDH candidate genes and pathways, and highlight the importance of ongoing analysis of CNVs in the study of CDH and other structural birth defects. Copyright © 2018 the Author(s). Published by PNAS.
Polymorphisms in the AOX2 gene are associated with the rooting ability of olive cuttings.

PubMed

Hedayati, Vahideh; Mousavi, Amir; Razavi, Khadijeh; Cultrera, Nicolò; Alagna, Fiammetta; Mariotti, Roberto; Hosseini-Mazinani, Mehdi; Baldoni, Luciana

2015-07-01

Different rooting ability candidate genes were tested on an olive cross progeny. Our results demonstrated that only the AOX2 gene was strongly induced. OeAOX2 was fully characterised and correlated to phenotypical traits. The formation of adventitious roots is a key step in the vegetative propagation of trees crop species, and this ability is under strict genetic control. While numerous studies have been carried out to identify genes controlling adventitious root formation, only a few loci have been characterised. In this work, candidate genes that were putatively involved in rooting ability were identified in olive (Olea europaea L.) by similarity with orthologs identified in other plant species. The mRNA levels of these genes were analysed by real-time PCR during root induction in high- (HR) and low-rooting (LR) individuals. Interestingly, alternative oxidase 2 (AOX2), which was previously reported to be a functional marker for rooting in olive cuttings, showed a strong induction in HR individuals. From the OeAOX2 full-length gene, alleles and effective polymorphisms were distinguished and analysed in the cross progeny, which were segregated based on rooting. The results revealed a possible correlation between two single nucleotide polymorphisms of OeAOX2 gene and rooting ability.
Identification of candidate genes in Populus cell wall biosynthesis using text-mining, co-expression network and comparative genomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yang, Xiaohan; Ye, Chuyu; Bisaria, Anjali

2011-01-01

Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical in reducing biomass recalcitrance, a major hindrance in efficient generation of ethanol from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes that had the experimental evidences supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database and additionalmore » genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and re-construct a network that captured additional network neighbors, expanding to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high likelihood candidates for functional genomics in relation to cell wall biosynthesis.« less
A Genome-Wide Association Study for Culm Cellulose Content in Barley Reveals Candidate Genes Co-Expressed with Members of the CELLULOSE SYNTHASE A Gene Family

PubMed Central

Houston, Kelly; Burton, Rachel A.; Sznajder, Beata; Rafalski, Antoni J.; Dhugga, Kanwarpal S.; Mather, Diane E.; Taylor, Jillian; Steffenson, Brian J.; Waugh, Robbie; Fincher, Geoffrey B.

2015-01-01

Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two – rowed and 288 six – rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with CELLULOSE SYNTHASE A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the genes that contribute to cellulose content in cereal culms and to a greater understanding of the interactions between them. PMID:26154104
A genome-wide association study of corneal astigmatism: The CREAM Consortium.

PubMed

Shah, Rupal L; Li, Qing; Zhao, Wanting; Tedja, Milly S; Tideman, J Willem L; Khawaja, Anthony P; Fan, Qiao; Yazar, Seyhan; Williams, Katie M; Verhoeven, Virginie J M; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J; Pärssinen, Olavi; Wedenoja, Juho; Biino, Ginevra; Concas, Maria Pina; Uitterlinden, André; Rivadeneira, Fernando; Jaddoe, Vincent W V; Hysi, Pirro G; Sim, Xueling; Tan, Nicholas; Tham, Yih-Chung; Sensaki, Sonoko; Hofman, Albert; Vingerling, Johannes R; Jonas, Jost B; Mitchell, Paul; Hammond, Christopher J; Höhn, René; Baird, Paul N; Wong, Tien-Yin; Cheng, Chinfsg-Yu; Teo, Yik Ying; Mackey, David A; Williams, Cathy; Saw, Seang-Mei; Klaver, Caroline C W; Guggenheim, Jeremy A; Bailey-Wilson, Joan E

2018-01-01

To identify genes and genetic markers associated with corneal astigmatism. A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts were performed using VEGAS2 and MAGMA software. Additionally, estimates of single nucleotide polymorphism (SNP)-based heritability for corneal and refractive astigmatism and the spherical equivalent were calculated for Europeans using LD score regression. The meta-analysis of all cohorts identified a genome-wide significant locus near the platelet-derived growth factor receptor alpha ( PDGFRA ) gene: top SNP: rs7673984, odds ratio=1.12 (95% CI:1.08-1.16), p=5.55×10 -9 . No other genome-wide significant loci were identified in the combined analysis or European/Asian ancestry-specific analyses. Gene-based analysis identified three novel candidate genes for corneal astigmatism in Europeans-claudin-7 ( CLDN7 ), acid phosphatase 2, lysosomal ( ACP2 ), and TNF alpha-induced protein 8 like 3 ( TNFAIP8L3 ). In addition to replicating a previously identified genome-wide significant locus for corneal astigmatism near the PDGFRA gene, gene-based analysis identified three novel candidate genes, CLDN7 , ACP2 , and TNFAIP8L3 , that warrant further investigation to understand their role in the pathogenesis of corneal astigmatism. The much lower number of genetic variants and genes demonstrating an association with corneal astigmatism compared to published spherical equivalent GWAS analyses suggest a greater influence of rare genetic variants, non-additive genetic effects, or environmental factors in the development of astigmatism.
Progress Report for DOE DE-FG03-98ER20317 ''Regulation of the floral homeotic gene AGAMOUS'' Current and Final Funding Period: September 1, 2002, to December 31, 2002

DOE Office of Scientific and Technical Information (OSTI.GOV)

Weigel, D.

2003-03-11

OAK-B135 Results obtained during this funding period: (1) Phylogenetic footprinting of AG regulatory sequences Sequences necessary and sufficient for AGAMOUS (AG) expression in the center of Arabidopsis flowers are located in the second intron, which is about 3 kb in size. This intron contains binding sites for two transcription factors, LEAFY (LFY) and WUSCHEL (WUS), which are direct activators of AG. We used the new method of phylogenetic shadowing to identify new regulatory elements. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested sixmore » of these motifs and found that they are all functionally important for activity of AG regulatory sequences in A. thaliana. (2) Repression of AG by MADS box genes A candidate for repressing AG in the shoot apical meristem has been the MADS box gene FUL, since it is expressed in the shoot apical meristem and since an activated version (FUL:VP16) leads to ectopic AG expression in the shoot apical meristem. However, there is no ectopic AG expression in full single mutants. We therefore started to generate VP16 fusions of several other MADS box genes expressed in the shoot apical meristem, to determine which of these might be candidates for FUL redundant genes. We found that AGL6:VP16 has a similar phenotype as FUL:VP16, suggesting that AGL6 and FUL interact. We are now testing this hypothesis. (3) Two candidate AG regulators, WOW and ULA Because the phylogenetic footprinting project has identified several new candidate regulatory motifs, of which at least one (the CCAATCA motif) has rather strong effects, we had decided to put the analysis of WOW and ULA on hold, and to focus on using the newly identified motifs as tools. We conduct ed yeast one-hybrid screen with two of the conserved motifs, and identified several classes of transcription factors that can interact with them. One of these is encoded by the PAN gene, previously known to be expressed in a domain that overlaps the AG domain, but not known before to regulate AG. (4) New genetic modifiers of AG This part of the project was concluded in the previous funding period.« less
Candidate gene association studies in syndromic and non-syndromic cleft lip and palate

DOE Office of Scientific and Technical Information (OSTI.GOV)

Daack-Hirsch, S.; Basart, A.; Frischmeyer, P.

1994-09-01

Using ongoing case ascertainment through a birth defects registry, we have collected 219 nuclear families with non-syndromic cleft lip and/or palate and 111 families with a collection of syndromic forms. Syndromic cases include 24 with recognized forms and 72 with unrecognized syndromes. Candidate gene studies as well as genome-wide searches for evidence of microdeletions and isodisomy are currently being carried out. Candidate gene association studies, to date, have made use of PCR-based polymorphisms for TGFA, MSX1, CLPG13 (a CA repeat associated with a human homologue of a locus that results in craniofacial dysmorphogenesis in the mouse) and an STRP foundmore » in a Van der Woude syndrome microdeletion. Control tetranucleotide repeats, which insure that population-based differences are not responsible for any observed associations, are also tested. Studies of the syndromic cases have included the same list of candidate genes searching for evidence of microdeletions and a genome-wide search using tri- and tetranucleotide polymorphic markers to search for isodisomy or structural rearrangements. Significant associations have previously been identified for TGFA, and, in this report, identified for MSX1 and nonsyndromic cleft palate only (p = 0.04, uncorrected). Preliminary results of the genome-wide scan for isodisomy has returned no true positives and there has been no evidence for microdeletion cases.« less
A comparative analysis of genetic diversity of candidate genes associated with type 2 diabetes in worldwide populations.

PubMed

Gong, Xian; Zhang, Chao; Yiliyasi·Aisa, Yiliyasi·Aisa; Shi, Ying; Yang, Xue-wei; NuersimanguliAosiman, NuersimanguliAosiman; Guan, Ya-qun; Xu, Shu-hua

2016-06-20

Over the last decade, a larger number of type 2 diabetes mellitus (T2DM) susceptible candidate genes have been reported by numerous genome-wide association studies (GWAS). Understanding the genetic diversity of these candidate genes among worldwide populations not only facilitates to elucidating the genetic mechanism of T2DM, but also provides guidance to further studies of pathogenesis of T2DM in any certain population. In this study, we identified 170 genes or genomic regions associated with T2DM by searching the GWAS databases and related literatures. We next analyzed the genetic diversity of these genes (or genomic regions) among present-day human populations by curetting the 1000 Genomes Projects phase1 dataset covering 14 worldwide populations. We further compared the characteristics of T2DM genes in different populations. No significant differences of genetic diversity were observed among the 14 worldwide populations between the T2DM candidate genes and the non-T2DM genes in terms of overall pattern. However, we observed some genes, such as IL20RA, RNMTL1-NXN, NOTCH2, ADRA2A-BTBD7P2, TBC1D4, RBM38-HMGB1P1, UBE2E2, and PPARD, show considerable differentiation between populations. In particular, IL20RA (FST=0.1521) displays the greatest population difference which is mainly contributed by that between Africans and non-Africans. Moreover, we revealed genetic differences between East Asians and Europeans on some candidate genes such as DGKB-AGMO (FST=0.173) and JAZF1 (FST=0.182). Our results indicate that some T2DM susceptible candidate genes harbor highly-differentiated variants between populations. These analyses, despite preliminary, should advance our understanding of the population difference of susceptibility to T2DM and provide insightful reference that future studies can relay on.
Immunogenetic mechanisms leading to thyroid autoimmunity: recent advances in identifying susceptibility genes and regions.

PubMed

Brand, Oliver J; Gough, Stephen C L

2011-12-01

The autoimmune thyroid diseases (AITD) include Graves' disease (GD) and Hashimoto's thyroiditis (HT), which are characterised by a breakdown in immune tolerance to thyroid antigens. Unravelling the genetic architecture of AITD is vital to better understanding of AITD pathogenesis, required to advance therapeutic options in both disease management and prevention. The early whole-genome linkage and candidate gene association studies provided the first evidence that the HLA region and CTLA-4 represented AITD risk loci. Recent improvements in; high throughput genotyping technologies, collection of larger disease cohorts and cataloguing of genome-scale variation have facilitated genome-wide association studies and more thorough screening of candidate gene regions. This has allowed identification of many novel AITD risk genes and more detailed association mapping. The growing number of confirmed AITD susceptibility loci, implicates a number of putative disease mechanisms most of which are tightly linked with aspects of immune system function. The unprecedented advances in genetic study will allow future studies to identify further novel disease risk genes and to identify aetiological variants within specific gene regions, which will undoubtedly lead to a better understanding of AITD patho-physiology.
Immunogenetic Mechanisms Leading to Thyroid Autoimmunity: Recent Advances in Identifying Susceptibility Genes and Regions

PubMed Central

Brand, Oliver J; Gough, Stephen C.L

2011-01-01

The autoimmune thyroid diseases (AITD) include Graves’ disease (GD) and Hashimoto’s thyroiditis (HT), which are characterised by a breakdown in immune tolerance to thyroid antigens. Unravelling the genetic architecture of AITD is vital to better understanding of AITD pathogenesis, required to advance therapeutic options in both disease management and prevention. The early whole-genome linkage and candidate gene association studies provided the first evidence that the HLA region and CTLA-4 represented AITD risk loci. Recent improvements in; high throughput genotyping technologies, collection of larger disease cohorts and cataloguing of genome-scale variation have facilitated genome-wide association studies and more thorough screening of candidate gene regions. This has allowed identification of many novel AITD risk genes and more detailed association mapping. The growing number of confirmed AITD susceptibility loci, implicates a number of putative disease mechanisms most of which are tightly linked with aspects of immune system function. The unprecedented advances in genetic study will allow future studies to identify further novel disease risk genes and to identify aetiological variants within specific gene regions, which will undoubtedly lead to a better understanding of AITD patho-physiology. PMID:22654554
Selection of reference genes for expression analysis in the entomophthoralean fungus Pandora neoaphidis.

PubMed

Chen, Chun; Xie, Tingna; Ye, Sudan; Jensen, Annette Bruun; Eilenberg, Jørgen

2016-01-01

The selection of suitable reference genes is crucial for accurate quantification of gene expression and can add to our understanding of host-pathogen interactions. To identify suitable reference genes in Pandora neoaphidis, an obligate aphid pathogenic fungus, the expression of three traditional candidate genes including 18S rRNA(18S), 28S rRNA(28S) and elongation factor 1 alpha-like protein (EF1), were measured by quantitative polymerase chain reaction at different developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae), and under different nutritional conditions. We calculated the expression stability of candidate reference genes using four algorithms including geNorm, NormFinder, BestKeeper and Delta Ct. The analysis results revealed that the comprehensive ranking of candidate reference genes from the most stable to the least stable was 18S (1.189), 28S (1.414) and EF1 (3). The 18S was, therefore, the most suitable reference gene for real-time RT-PCR analysis of gene expression under all conditions. These results will support further studies on gene expression in P. neoaphidis. Copyright © 2015 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Robust and Comprehensive Analysis of 20 Osteoporosis Candidate Genes by Very High-Density Single-Nucleotide Polymorphism Screen Among 405 White Nuclear Families Identified Significant Association and Gene–Gene Interaction

PubMed Central

Xiong, Dong-Hai; Shen, Hui; Zhao, Lan-Juan; Xiao, Peng; Yang, Tie-Lin; Guo, Yan; Wang, Wei; Guo, Yan-Fang; Liu, Yong-Jun; Recker, Robert R; Deng, Hong-Wen

2007-01-01

Many “novel” osteoporosis candidate genes have been proposed in recent years. To advance our knowledge of their roles in osteoporosis, we screened 20 such genes using a set of high-density SNPs in a large family-based study. Our efforts led to the prioritization of those osteoporosis genes and the detection of gene–gene interactions. Introduction We performed large-scale family-based association analyses of 20 novel osteoporosis candidate genes using 277 single nucleotide polymorphisms (SNPs) for the quantitative trait BMD variation and the qualitative trait osteoporosis (OP) at three clinically important skeletal sites: spine, hip, and ultradistal radius (UD). Materials and Methods One thousand eight hundred seventy-three subjects from 405 white nuclear families were genotyped and analyzed with an average density of one SNP per 4 kb across the 20 genes. We conducted association analyses by SNP- and haplotype-based family-based association test (FBAT) and performed gene–gene interaction analyses using multianalytic approaches such as multifactor-dimensionality reduction (MDR) and conditional logistic regression. Results and Conclusions We detected four genes (DBP, LRP5, CYP17, and RANK) that showed highly suggestive associations (10,000-permutation derived empirical global p ≤ 0.01) with spine BMD/OP; four genes (CYP19, RANK, RANKL, and CYP17) highly suggestive for hip BMD/OP; and four genes (CYP19, BMP2, RANK, and TNFR2) highly suggestive for UD BMD/OP. The associations between BMP2 with UD BMD and those between RANK with OP at the spine, hip, and UD also met the experiment-wide stringent criterion (empirical global p ≤ 0.0007). Sex-stratified analyses further showed that some of the significant associations in the total sample were driven by either male or female subjects. In addition, we identified and validated a two-locus gene–gene interaction model involving GCR and ESR2, for which prior biological evidence exists. Our results suggested the prioritization of osteoporosis candidate genes from among the many proposed in recent years and revealed the significant gene–gene interaction effects influencing osteoporosis risk. PMID:17002564
Exploring digenic inheritance in arrhythmogenic cardiomyopathy.

PubMed

König, Eva; Volpato, Claudia Béu; Motta, Benedetta Maria; Blankenburg, Hagen; Picard, Anne; Pramstaller, Peter; Casella, Michela; Rauhe, Werner; Pompilio, Giulio; Meraviglia, Viviana; Domingues, Francisco S; Sommariva, Elena; Rossini, Alessandra

2017-12-08

Arrhythmogenic cardiomyopathy (ACM) is an inherited genetic disorder, characterized by the substitution of heart muscle with fibro-fatty tissue and severe ventricular arrhythmias, often leading to heart failure and sudden cardiac death. ACM is considered a monogenic disorder, but the low penetrance of mutations identified in patients suggests the involvement of additional genetic or environmental factors. We used whole exome sequencing to investigate digenic inheritance in two ACM families where previous diagnostic tests have revealed a PKP2 mutation in all affected and some healthy individuals. In family members with PKP2 mutations we determined all genes that harbor variants in affected but not in healthy carriers or vice versa. We computationally prioritized the most likely candidates, focusing on known ACM genes and genes related to PKP2 through protein interactions, functional relationships, or shared biological processes. We identified four candidate genes in family 1, namely DAG1, DAB2IP, CTBP2 and TCF25, and eleven candidate genes in family 2. The most promising gene in the second family is TTN, a gene previously associated with ACM, in which the affected individual harbors two rare deleterious-predicted missense variants, one of which is located in the protein's only serine kinase domain. In this study we report genes that might act as digenic players in ACM pathogenesis, on the basis of co-segregation with PKP2 mutations. Validation in larger cohorts is still required to prove the utility of this model.
Genes contributing to the development of alcoholism: an overview.

PubMed

Edenberg, Howard J

2012-01-01

Genetic factors (i.e., variations in specific genes) account for a substantial portion of the risk for alcoholism. However, identifying those genes and the specific variations involved is challenging. Researchers have used both case-control and family studies to identify genes related to alcoholism risk. In addition, different strategies such as candidate gene analyses and genome-wide association studies have been used. The strongest effects have been found for specific variants of genes that encode two enzymes involved in alcohol metabolism-alcohol dehydrogenase and aldehyde dehydrogenase. Accumulating evidence indicates that variations in numerous other genes have smaller but measurable effects.
Discovery of new candidate genes related to brain development using protein interaction information.

PubMed

Chen, Lei; Chu, Chen; Kong, Xiangyin; Huang, Tao; Cai, Yu-Dong

2015-01-01

Human brain development is a dramatic process composed of a series of complex and fine-tuned spatiotemporal gene expressions. A good comprehension of this process can assist us in developing the potential of our brain. However, we have only limited knowledge about the genes and gene functions that are involved in this biological process. Therefore, a substantial demand remains to discover new brain development-related genes and identify their biological functions. In this study, we aimed to discover new brain-development related genes by building a computational method. We referred to a series of computational methods used to discover new disease-related genes and developed a similar method. In this method, the shortest path algorithm was executed on a weighted graph that was constructed using protein-protein interactions. New candidate genes fell on at least one of the shortest paths connecting two known genes that are related to brain development. A randomization test was then adopted to filter positive discoveries. Of the final identified genes, several have been reported to be associated with brain development, indicating the effectiveness of the method, whereas several of the others may have potential roles in brain development.
A novel candidate gene for mouse and human preaxial polydactyly with altered expression in limbs of Hemimelic extra-toes mutant mice.

PubMed

Clark, R M; Marker, P C; Kingsley, D M

2000-07-01

Polydactyly is a common malformation of vertebrate limbs. In humans a major locus for nonsyndromic pre-axial polydactyly (PPD) has been mapped previously to 7q36. The mouse Hemimelic extra-toes (Hx) mutation maps to a homologous chromosome segment and has been proposed to affect a homologous gene. To understand the molecular changes underlying PPD, we used a positional cloning approach to identify the gene or genes disrupted by the Hx mutation and a closely linked limb mutation, Hammertoe (Hm). High resolution genetic mapping identified a small candidate interval for the mouse mutations located 1.2 cM distal to the Shh locus. The nonrecombinant interval was completely cloned in bacterial artificial chromosomes and searched for genes using a combination of exon trapping, sample sequencing, and mapping of known genes. Two novel genes, Lmbr1 and Lmbr2, are entirely within the candidate interval we defined genetically. The open reading frame of both genes is intact in mutant mice, but the expression of the Lmbr1 gene is dramatically altered in developing limbs of Hx mutant mice. The correspondence between the spatial and temporal changes in Lmbr1 expression and the embryonic onset of the Hx mutant phenotype suggests that the mouse Hx mutation may be a regulatory allele of Lmbr1. The human ortholog of Lmbr1 maps within the recently described interval for human PPD, strengthening the possibility that both mouse and human limb abnormalities are due to defects in the same highly conserved gene.
Identification of prostate cancer modifier pathways using parental strain expression mapping

PubMed Central

Xu, Qing; Majumder, Pradip K.; Ross, Kenneth; Shim, Yeonju; Golub, Todd R.; Loda, Massimo; Sellers, William R.

2007-01-01

Inherited genetic risk factors play an important role in cancer. However, other than the Mendelian fashion cancer susceptibility genes found in familial cancer syndromes, little is known about risk modifiers that control individual susceptibility. Here we developed a strategy, parental strain expression mapping, that utilizes the homogeneity of inbred mice and genome-wide mRNA expression analyses to directly identify candidate germ-line modifier genes and pathways underlying phenotypic differences among murine strains exposed to transgenic activation of AKT1. We identified multiple candidate modifier pathways and, specifically, the glycolysis pathway as a candidate negative modulator of AKT1-induced proliferation. In keeping with the findings in the murine models, in multiple human prostate expression data set, we found that enrichment of glycolysis pathways in normal tissues was associated with decreased rates of cancer recurrence after prostatectomy. Together, these data suggest that parental strain expression mapping can directly identify germ-line modifier pathways of relevance to human disease. PMID:17978178
Structural and Functional Analysis of the GRAS Gene Family in Grapevine Indicates a Role of GRAS Proteins in the Control of Development and Stress Responses

PubMed Central

Grimplet, Jérôme; Agudelo-Romero, Patricia; Teixeira, Rita T.; Martinez-Zapater, Jose M.; Fortes, Ana M.

2016-01-01

GRAS transcription factors are involved in many processes of plant growth and development (e.g., axillary shoot meristem formation, root radial patterning, nodule morphogenesis, arbuscular development) as well as in plant disease resistance and abiotic stress responses. However, little information is available concerning this gene family in grapevine (Vitis vinifera L.), an economically important woody crop. We performed a model curation of GRAS genes identified in the latest genome annotation leading to the identification of 52 genes. Gene models were improved and three new genes were identified that could be grapevine- or woody-plant specific. Phylogenetic analysis showed that GRAS genes could be classified into 13 groups that mapped on the 19 V. vinifera chromosomes. Five new subfamilies, previously not characterized in other species, were identified. Multiple sequence alignment showed typical GRAS domain in the proteins and new motifs were also described. As observed in other species, both segmental and tandem duplications contributed significantly to the expansion and evolution of the GRAS gene family in grapevine. Expression patterns across a variety of tissues and upon abiotic and biotic conditions revealed possible divergent functions of GRAS genes in grapevine development and stress responses. By comparing the information available for tomato and grapevine GRAS genes, we identified candidate genes that might constitute conserved transcriptional regulators of both climacteric and non-climacteric fruit ripening. Altogether this study provides valuable information and robust candidate genes for future functional analysis aiming at improving the quality of fleshy fruits. PMID:27065316
A data science approach to candidate gene selection of pain regarded as a process of learning and neural plasticity.

PubMed

Ultsch, Alfred; Kringel, Dario; Kalso, Eija; Mogil, Jeffrey S; Lötsch, Jörn

2016-12-01

The increasing availability of "big data" enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 535 genes identified empirically as relevant to pain with the knowledge about the functions of thousands of genes. Starting from an accepted description of chronic pain as displaying systemic features described by the terms "learning" and "neuronal plasticity," a functional genomics analysis proposed that among the functions of the 535 "pain genes," the biological processes "learning or memory" (P = 8.6 × 10) and "nervous system development" (P = 2.4 × 10) are statistically significantly overrepresented as compared with the annotations to these processes expected by chance. After establishing that the hypothesized biological processes were among important functional genomics features of pain, a subset of n = 34 pain genes were found to be annotated with both Gene Ontology terms. Published empirical evidence supporting their involvement in chronic pain was identified for almost all these genes, including 1 gene identified in March 2016 as being involved in pain. By contrast, such evidence was virtually absent in a randomly selected set of 34 other human genes. Hence, the present computational functional genomics-based method can be used for candidate gene selection, providing an alternative to established methods.

Validation of candidate genes associated with cardiovascular risk factors in psychiatric patients

PubMed Central

Windemuth, Andreas; de Leon, Jose; Goethe, John W.; Schwartz, Harold I.; Woolley, Stephen; Susce, Margaret; Kocherla, Mohan; Bogaard, Kali; Holford, Theodore R.; Seip, Richard L.; Ruaño, Gualberto

2016-01-01

The purpose of this study was to identify genetic variants predictive of cardiovascular risk factors in a psychiatric population treated with second generation antipsychotics (SGA). 924 patients undergoing treatment for severe mental illness at four US hospitals were genotyped at 1.2 million single nucleotide polymorphisms. Patients were assessed for fasting serum lipid (low density lipoprotein cholesterol [LDLc], high density lipoprotein cholesterol [HDLc], and triglycerides) and obesity phenotypes (body mass index, BMI). Thirteen candidate genes from previous studies of the same phenotypes in non-psychiatric populations were tested for association. We confirmed 8 of the 13 candidate genes at the 95% confidence level. An increased genetic effect size was observed for triglycerides in the psychiatric population compared to that in the cardiovascular population. PMID:21851846
The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color

USDA-ARS?s Scientific Manuscript database

Background: Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. Results: We describe the sequencing and assembly of...
Comparative Genomics and Immunoinformatics Approach for the Identification of Vaccine Candidates for Enterohemorrhagic Escherichia coli O157:H7

PubMed Central

García-Angulo, Víctor A.; Kalita, Anjana; Kalita, Mridul; Lozano, Luis

2014-01-01

Enterohemorrhagic Escherichia coli (EHEC) O157:H7 strains are major human food-borne pathogens, responsible for bloody diarrhea and hemolytic-uremic syndrome worldwide. Thus far, there is no vaccine for humans against EHEC infections. In this study, a comparative genomics analysis was performed to identify EHEC-specific antigens useful as potential vaccines. The genes present in both EHEC EDL933 and Sakai strains but absent in nonpathogenic E. coli K-12 and HS strains were subjected to an in silico analysis to identify secreted or surface-expressed proteins. We obtained a total of 65 gene-encoding protein candidates, which were subjected to immunoinformatics analysis. Our criteria of selection aided in categorizing the candidates as high, medium, and low priority. Three members of each group were randomly selected and cloned into pVAX-1. Candidates were pooled accordingly to their priority group and tested for immunogenicity against EHEC O157:H7 using a murine model of gastrointestinal infection. The high-priority (HP) pool, containing genes encoding a Lom-like protein (pVAX-31), a putative pilin subunit (pVAX-12), and a fragment of the type III secretion structural protein EscC (pVAX-56.2), was able to induce the production of EHEC IgG and sIgA in sera and feces. HP candidate-immunized mice displayed elevated levels of Th2 cytokines and diminished cecum colonization after wild-type challenge. Individually tested HP vaccine candidates showed that pVAX-12 and pVAX-56.2 significantly induced Th2 cytokines and production of fecal EHEC sIgA, with pVAX-56.2 reducing EHEC cecum colonization. We describe here a bioinformatics approach able to identify novel vaccine candidates potentially useful for preventing EHEC O157:H7 infections. PMID:24595137
Genome-wide scan for selection signatures in six cattle breeds in South Africa.

PubMed

Makina, Sithembile O; Muchadeyi, Farai C; van Marle-Köster, Este; Taylor, Jerry F; Makgahlela, Mahlako L; Maiwashe, Azwihangwisi

2015-11-26

The detection of selection signatures in breeds of livestock species can contribute to the identification of regions of the genome that are, or have been, functionally important and, as a consequence, have been targeted by selection. This study used two approaches to detect signatures of selection within and between six cattle breeds in South Africa, including Afrikaner (n = 44), Nguni (n = 54), Drakensberger (n = 47), Bonsmara (n = 44), Angus (n = 31) and Holstein (n = 29). The first approach was based on the detection of genomic regions in which haplotypes have been driven towards complete fixation within breeds. The second approach identified regions of the genome that had very different allele frequencies between populations (F ST). Forty-seven candidate genomic regions were identified as harbouring putative signatures of selection using both methods. Twelve of these candidate selected regions were shared among the breeds and ten were validated by previous studies. Thirty-three of these regions were successfully annotated and candidate genes were identified. Among these genes the keratin genes (KRT222, KRT24, KRT25, KRT26, and KRT27) and one heat shock protein gene (HSPB9) on chromosome 19 between 42,896,570 and 42,897,840 bp were detected for the Nguni breed. These genes were previously associated with adaptation to tropical environments in Zebu cattle. In addition, a number of candidate genes associated with the nervous system (WNT5B, FMOD, PRELP, and ATP2B), immune response (CYM, CDC6, and CDK10), production (MTPN, IGFBP4, TGFB1, and AJAP1) and reproductive performance (ADIPOR2, OVOS2, and RBBP8) were also detected as being under selection. The results presented here provide a foundation for detecting mutations that underlie genetic variation of traits that have economic importance for cattle breeds in South Africa.
Mapping autosomal recessive intellectual disability: combined microarray and exome sequencing identifies 26 novel candidate genes in 192 consanguineous families.

PubMed

Harripaul, R; Vasli, N; Mikhailov, A; Rafiq, M A; Mittal, K; Windpassinger, C; Sheikh, T I; Noor, A; Mahmood, H; Downey, S; Johnson, M; Vleuten, K; Bell, L; Ilyas, M; Khan, F S; Khan, V; Moradi, M; Ayaz, M; Naeem, F; Heidari, A; Ahmed, I; Ghadami, S; Agha, Z; Zeinali, S; Qamar, R; Mozhdehipanah, H; John, P; Mir, A; Ansar, M; French, L; Ayub, M; Vincent, J B

2018-04-01

Approximately 1% of the global population is affected by intellectual disability (ID), and the majority receive no molecular diagnosis. Previous studies have indicated high levels of genetic heterogeneity, with estimates of more than 2500 autosomal ID genes, the majority of which are autosomal recessive (AR). Here, we combined microarray genotyping, homozygosity-by-descent (HBD) mapping, copy number variation (CNV) analysis, and whole exome sequencing (WES) to identify disease genes/mutations in 192 multiplex Pakistani and Iranian consanguineous families with non-syndromic ID. We identified definite or candidate mutations (or CNVs) in 51% of families in 72 different genes, including 26 not previously reported for ARID. The new ARID genes include nine with loss-of-function mutations (ABI2, MAPK8, MPDZ, PIDD1, SLAIN1, TBC1D23, TRAPPC6B, UBA7 and USP44), and missense mutations include the first reports of variants in BDNF or TET1 associated with ID. The genes identified also showed overlap with de novo gene sets for other neuropsychiatric disorders. Transcriptional studies showed prominent expression in the prenatal brain. The high yield of AR mutations for ID indicated that this approach has excellent clinical potential and should inform clinical diagnostics, including clinical whole exome and genome sequencing, for populations in which consanguinity is common. As with other AR disorders, the relevance will also apply to outbred populations.
Identification and validation of reference genes for qRT-PCR studies of the obligate aphid pathogenic fungus Pandora neoaphidis during different developmental stages.

PubMed

Zhang, Shutao; Chen, Chun; Xie, Tingna; Ye, Sudan

2017-01-01

The selection of stable reference genes is a critical step for the accurate quantification of gene expression. To identify and validate the reference genes in Pandora neoaphidis-an obligate aphid pathogenic fungus-the expression of 13classical candidate reference genes were evaluated by quantitative real-time reverse transcriptase polymerase chain reaction(qPCR) at four developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae). Four statistical algorithms, including geNorm, NormFinder, BestKeeper and Delta Ct method were used to rank putative reference genes according to their expression stability and indicate the best reference gene or combination of reference genes for accurate normalization. The analysis of comprehensive ranking revealed that ACT1and 18Swas the most stably expressed genes throughout the developmental stages. To further validate the suitability of the reference genes identified in this study, the expression of cell division control protein 25 (CDC25) and Chitinase 1(CHI1) genes were used to further confirm the validated candidate reference genes. Our study presented the first systematic study of reference gene(s) selection for P. neoaphidis study and provided guidelines to obtain more accurate qPCR results for future developmental efforts.
Functional genome-wide siRNA screen identifies KIAA0586 as mutated in Joubert syndrome

PubMed Central

Roosing, Susanne; Hofree, Matan; Kim, Sehyun; Scott, Eric; Copeland, Brett; Romani, Marta; Silhavy, Jennifer L; Rosti, Rasim O; Schroth, Jana; Mazza, Tommaso; Miccinilli, Elide; Zaki, Maha S; Swoboda, Kathryn J; Milisa-Drautz, Joanne; Dobyns, William B; Mikati, Mohamed A; İncecik, Faruk; Azam, Matloob; Borgatti, Renato; Romaniello, Romina; Boustany, Rose-Mary; Clericuzio, Carol L; D'Arrigo, Stefano; Strømme, Petter; Boltshauser, Eugen; Stanzial, Franco; Mirabelli-Badenier, Marisol; Moroni, Isabella; Bertini, Enrico; Emma, Francesco; Steinlin, Maja; Hildebrandt, Friedhelm; Johnson, Colin A; Freilinger, Michael; Vaux, Keith K; Gabriel, Stacey B; Aza-Blanc, Pedro; Heynen-Genel, Susanne; Ideker, Trey; Dynlacht, Brian D; Lee, Ji Eun; Valente, Enza Maria; Kim, Joon; Gleeson, Joseph G

2015-01-01

Defective primary ciliogenesis or cilium stability forms the basis of human ciliopathies, including Joubert syndrome (JS), with defective cerebellar vermis development. We performed a high-content genome-wide small interfering RNA (siRNA) screen to identify genes regulating ciliogenesis as candidates for JS. We analyzed results with a supervised-learning approach, using SYSCILIA gold standard, Cildb3.0, a centriole siRNA screen and the GTex project, identifying 591 likely candidates. Intersection of this data with whole exome results from 145 individuals with unexplained JS identified six families with predominantly compound heterozygous mutations in KIAA0586. A c.428del base deletion in 0.1% of the general population was found in trans with a second mutation in an additional set of 9 of 163 unexplained JS patients. KIAA0586 is an orthologue of chick Talpid3, required for ciliogenesis and Sonic hedgehog signaling. Our results uncover a relatively high frequency cause for JS and contribute a list of candidates for future gene discoveries in ciliopathies. DOI: http://dx.doi.org/10.7554/eLife.06602.001 PMID:26026149
Changing the Game: Using Integrative Genomics to Probe Virulence Mechanisms of the Stem Rust Pathogen Puccinia graminis f. sp. tritici.

PubMed

Figueroa, Melania; Upadhyaya, Narayana M; Sperschneider, Jana; Park, Robert F; Szabo, Les J; Steffenson, Brian; Ellis, Jeff G; Dodds, Peter N

2016-01-01

The recent resurgence of wheat stem rust caused by new virulent races of Puccinia graminis f. sp. tritici (Pgt) poses a threat to food security. These concerns have catalyzed an extensive global effort toward controlling this disease. Substantial research and breeding programs target the identification and introduction of new stem rust resistance (Sr) genes in cultivars for genetic protection against the disease. Such resistance genes typically encode immune receptor proteins that recognize specific components of the pathogen, known as avirulence (Avr) proteins. A significant drawback to deploying cultivars with single Sr genes is that they are often overcome by evolution of the pathogen to escape recognition through alterations in Avr genes. Thus, a key element in achieving durable rust control is the deployment of multiple effective Sr genes in combination, either through conventional breeding or transgenic approaches, to minimize the risk of resistance breakdown. In this situation, evolution of pathogen virulence would require changes in multiple Avr genes in order to bypass recognition. However, choosing the optimal Sr gene combinations to deploy is a challenge that requires detailed knowledge of the pathogen Avr genes with which they interact and the virulence phenotypes of Pgt existing in nature. Identifying specific Avr genes from Pgt will provide screening tools to enhance pathogen virulence monitoring, assess heterozygosity and propensity for mutation in pathogen populations, and confirm individual Sr gene functions in crop varieties carrying multiple effective resistance genes. Toward this goal, much progress has been made in assembling a high quality reference genome sequence for Pgt, as well as a Pan-genome encompassing variation between multiple field isolates with diverse virulence spectra. In turn this has allowed prediction of Pgt effector gene candidates based on known features of Avr genes in other plant pathogens, including the related flax rust fungus. Upregulation of gene expression in haustoria and evidence for diversifying selection are two useful parameters to identify candidate Avr genes. Recently, we have also applied machine learning approaches to agnostically predict candidate effectors. Here, we review progress in stem rust pathogenomics and approaches currently underway to identify Avr genes recognized by wheat Sr genes.
Analysis of 60 reported glioma risk SNPs replicates published GWAS findings but fails to replicate associations from published candidate-gene studies.

PubMed

Walsh, Kyle M; Anderson, Erik; Hansen, Helen M; Decker, Paul A; Kosel, Matt L; Kollmeyer, Thomas; Rice, Terri; Zheng, Shichun; Xiao, Yuanyuan; Chang, Jeffrey S; McCoy, Lucie S; Bracci, Paige M; Wiemels, Joe L; Pico, Alexander R; Smirnov, Ivan; Lachance, Daniel H; Sicotte, Hugues; Eckel-Passow, Jeanette E; Wiencke, John K; Jenkins, Robert B; Wrensch, Margaret R

2013-02-01

Genomewide association studies (GWAS) and candidate-gene studies have implicated single-nucleotide polymorphisms (SNPs) in at least 45 different genes as putative glioma risk factors. Attempts to validate these associations have yielded variable results and few genetic risk factors have been consistently replicated. We conducted a case-control study of Caucasian glioma cases and controls from the University of California San Francisco (810 cases, 512 controls) and the Mayo Clinic (852 cases, 789 controls) in an attempt to replicate previously reported genetic risk factors for glioma. Sixty SNPs selected from the literature (eight from GWAS and 52 from candidate-gene studies) were successfully genotyped on an Illumina custom genotyping panel. Eight SNPs in/near seven different genes (TERT, EGFR, CCDC26, CDKN2A, PHLDB1, RTEL1, TP53) were significantly associated with glioma risk in the combined dataset (P < 0.05), with all associations in the same direction as in previous reports. Several SNP associations showed considerable differences across histologic subtype. All eight successfully replicated associations were first identified by GWAS, although none of the putative risk SNPs from candidate-gene studies was associated in the full case-control sample (all P values > 0.05). Although several confirmed associations are located near genes long known to be involved in gliomagenesis (e.g., EGFR, CDKN2A, TP53), these associations were first discovered by the GWAS approach and are in noncoding regions. These results highlight that the deficiencies of the candidate-gene approach lay in selecting both appropriate genes and relevant SNPs within these genes. © 2012 WILEY PERIODICALS, INC.
A Case of Two Sisters Suffering from 46,XY Gonadal Dysgenesis and Carrying a Mutation of a Novel Candidate Sex-Determining Gene STARD8 on the X Chromosome.

PubMed

Ilaslan, Erkut; Calvel, Pierre; Nowak, Dominika; Szarras-Czapnik, Maria; Slowikowska-Hilczer, Jolanta; Spik, Anna; Sararols, Pauline; Nef, Serge; Jaruzelska, Jadwiga; Kusz-Zamelczyk, Kamila

2018-06-08

Identification of novel genes involved in sexual development is crucial for understanding disorders of sex development (DSD). Here, we propose a member of the START domain family, the X chromosome STARD8, as a DSD candidate gene. We have identified a missense mutation of this gene in 2 sisters with 46,XY gonadal dysgenesis, inherited from their heterozygous mother. Gonadal tissue of one of the sisters contained Leydig cells overloaded with cholesterol droplets, i.e., structures previously identified in 46,XY DSD patients carrying mutations in the STAR gene encoding another START domain family member, which is crucial for steroidogenesis. Based on the phenotypes of our patients, we propose a dual role of STARD8 in sexual development, namely in testes determination and testosterone synthesis. However, further studies are needed to confirm the involvement of STARD8 in sexual development. © 2018 S. Karger AG, Basel.
Differences in Brain Transcriptomes of Closely Related Baikal Coregonid Species

PubMed Central

Bychenko, Oksana S.; Sukhanova, Lyubov V.; Azhikina, Tatyana L.; Skvortsov, Timofey A.; Belomestnykh, Tuyana V.; Sverdlov, Eugene D.

2014-01-01

The aim of this work was to get deeper insight into genetic factors involved in the adaptive divergence of closely related species, specifically two representatives of Baikal coregonids—Baikal whitefish (Coregonus baicalensis Dybowski) and Baikal omul (Coregonus migratorius Georgi)—that diverged from a common ancestor as recently as 10–20 thousand years ago. Using the Serial Analysis of Gene Expression method, we obtained libraries of short representative cDNA sequences (tags) from the brains of Baikal whitefish and omul. A comparative analysis of the libraries revealed quantitative differences among ~4% tags of the fishes under study. Based on the similarity of these tags with cDNA of known organisms, we identified candidate genes taking part in adaptive divergence. The most important candidate genes related to the adaptation of Baikal whitefish and Baikal omul, identified in this work, belong to the genes of cell metabolism, nervous and immune systems, protein synthesis, and regulatory genes as well as to DTSsa4 Tc1-like transposons which are widespread among fishes. PMID:24719892
Indel-seq: a fast-forward genetics approach for identification of trait-associated putative candidate genomic regions and its application in pigeonpea (Cajanus cajan).

PubMed

Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Sinha, Pallavi; Kale, Sandip M; Parupalli, Swathi; Kumar, Vinay; Chitikineni, Annapurna; Vechalapu, Suryanarayana; Sameer Kumar, Chanda Venkata; Sharma, Mamta; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Muniswamy, Sonnappa; Varshney, Rajeev K

2017-07-01

Identification of candidate genomic regions associated with target traits using conventional mapping methods is challenging and time-consuming. In recent years, a number of single nucleotide polymorphism (SNP)-based mapping approaches have been developed and used for identification of candidate/putative genomic regions. However, in the majority of these studies, insertion-deletion (Indel) were largely ignored. For efficient use of Indels in mapping target traits, we propose Indel-seq approach, which is a combination of whole-genome resequencing (WGRS) and bulked segregant analysis (BSA) and relies on the Indel frequencies in extreme bulks. Deployment of Indel-seq approach for identification of candidate genomic regions associated with fusarium wilt (FW) and sterility mosaic disease (SMD) resistance in pigeonpea has identified 16 Indels affecting 26 putative candidate genes. Of these 26 affected putative candidate genes, 24 genes showed effect in the upstream/downstream of the genic region and two genes showed effect in the genes. Validation of these 16 candidate Indels in other FW- and SMD-resistant and FW- and SMD-susceptible genotypes revealed a significant association of five Indels (three for FW and two for SMD resistance). Comparative analysis of Indel-seq with other genetic mapping approaches highlighted the importance of the approach in identification of significant genomic regions associated with target traits. Therefore, the Indel-seq approach can be used for quick and precise identification of candidate genomic regions for any target traits in any crop species. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Comprehensive genomic analysis of patients with disorders of cerebral cortical development.

PubMed

Wiszniewski, Wojciech; Gawlinski, Pawel; Gambin, Tomasz; Bekiesinska-Figatowska, Monika; Obersztyn, Ewa; Antczak-Marach, Dorota; Akdemir, Zeynep Hande Coban; Harel, Tamar; Karaca, Ender; Jurek, Marta; Sobecka, Katarzyna; Nowakowska, Beata; Kruk, Malgorzata; Terczynska, Iwona; Goszczanska-Ciuchta, Alicja; Rudzka-Dybala, Mariola; Jamroz, Ewa; Pyrkosz, Antoni; Jakubiuk-Tomaszuk, Anna; Iwanowski, Piotr; Gieruszczak-Bialek, Dorota; Piotrowicz, Malgorzata; Sasiadek, Maria; Kochanowska, Iwona; Gurda, Barbara; Steinborn, Barbara; Dawidziuk, Mateusz; Castaneda, Jennifer; Wlasienko, Pawel; Bezniakow, Natalia; Jhangiani, Shalini N; Hoffman-Zacharska, Dorota; Bal, Jerzy; Szczepanik, Elzbieta; Boerwinkle, Eric; Gibbs, Richard A; Lupski, James R

2018-04-30

Malformations of cortical development (MCDs) manifest with structural brain anomalies that lead to neurologic sequelae, including epilepsy, cerebral palsy, developmental delay, and intellectual disability. To investigate the underlying genetic architecture of patients with disorders of cerebral cortical development, a cohort of 54 patients demonstrating neuroradiologic signs of MCDs was investigated. Individual genomes were interrogated for single-nucleotide variants (SNV) and copy number variants (CNV) with whole-exome sequencing and chromosomal microarray studies. Variation affecting known MCDs-associated genes was found in 16/54 cases, including 11 patients with SNV, 2 patients with CNV, and 3 patients with both CNV and SNV, at distinct loci. Diagnostic pathogenic SNV and potentially damaging variants of unknown significance (VUS) were identified in two groups of seven individuals each. We demonstrated that de novo variants are important among patients with MCDs as they were identified in 10/16 individuals with a molecular diagnosis. Three patients showed changes in known MCDs genes and a clinical phenotype beyond the usual characteristics observed, i.e., phenotypic expansion, for a particular known disease gene clinical entity. We also discovered 2 likely candidate genes, CDH4, and ASTN1, with human and animal studies supporting their roles in brain development, and 5 potential candidate genes. Our findings emphasize genetic heterogeneity of MCDs disorders and postulate potential novel candidate genes involved in cerebral cortical development.
A Molecular Portrait of De Novo Genes in Yeasts.

PubMed

Vakirlis, Nikolaos; Hebert, Alex S; Opulente, Dana A; Achaz, Guillaume; Hittinger, Chris Todd; Fischer, Gilles; Coon, Joshua J; Lafontaine, Ingrid

2018-03-01

New genes, with novel protein functions, can evolve "from scratch" out of intergenic sequences. These de novo genes can integrate the cell's genetic network and drive important phenotypic innovations. Therefore, identifying de novo genes and understanding how the transition from noncoding to coding occurs are key problems in evolutionary biology. However, identifying de novo genes is a difficult task, hampered by the presence of remote homologs, fast evolving sequences and erroneously annotated protein coding genes. To overcome these limitations, we developed a procedure that handles the usual pitfalls in de novo gene identification and predicted the emergence of 703 de novo gene candidates in 15 yeast species from 2 genera whose phylogeny spans at least 100 million years of evolution. We validated 85 candidates by proteomic data, providing new translation evidence for 25 of them through mass spectrometry experiments. We also unambiguously identified the mutations that enabled the transition from noncoding to coding for 30 Saccharomyces de novo genes. We established that de novo gene origination is a widespread phenomenon in yeasts, only a few being ultimately maintained by selection. We also found that de novo genes preferentially emerge next to divergent promoters in GC-rich intergenic regions where the probability of finding a fortuitous and transcribed ORF is the highest. Finally, we found a more than 3-fold enrichment of de novo genes at recombination hot spots, which are GC-rich and nucleosome-free regions, suggesting that meiotic recombination contributes to de novo gene emergence in yeasts.
Candidate Proteins, Metabolites and Transcripts in the Biomarkers for Spinal Muscular Atrophy (BforSMA) Clinical Study

PubMed Central

Finkel, Richard S.; Crawford, Thomas O.; Swoboda, Kathryn J.; Kaufmann, Petra; Juhasz, Peter; Li, Xiaohong; Guo, Yu; Li, Rebecca H.; Trachtenberg, Felicia; Forrest, Suzanne J.; Kobayashi, Dione T.; Chen, Karen S.; Joyce, Cynthia L.; Plasterer, Thomas

2012-01-01

Background Spinal Muscular Atrophy (SMA) is a neurodegenerative motor neuron disorder resulting from a homozygous mutation of the survival of motor neuron 1 (SMN1) gene. The gene product, SMN protein, functions in RNA biosynthesis in all tissues. In humans, a nearly identical gene, SMN2, rescues an otherwise lethal phenotype by producing a small amount of full-length SMN protein. SMN2 copy number inversely correlates with disease severity. Identifying other novel biomarkers could inform clinical trial design and identify novel therapeutic targets. Objective: To identify novel candidate biomarkers associated with disease severity in SMA using unbiased proteomic, metabolomic and transcriptomic approaches. Materials and Methods: A cross-sectional single evaluation was performed in 108 children with genetically confirmed SMA, aged 2–12 years, manifesting a broad range of disease severity and selected to distinguish factors associated with SMA type and present functional ability independent of age. Blood and urine specimens from these and 22 age-matched healthy controls were interrogated using proteomic, metabolomic and transcriptomic discovery platforms. Analyte associations were evaluated against a primary measure of disease severity, the Modified Hammersmith Functional Motor Scale (MHFMS) and to a number of secondary clinical measures. Results A total of 200 candidate biomarkers correlate with MHFMS scores: 97 plasma proteins, 59 plasma metabolites (9 amino acids, 10 free fatty acids, 12 lipids and 28 GC/MS metabolites) and 44 urine metabolites. No transcripts correlated with MHFMS. Discussion In this cross-sectional study, “BforSMA” (Biomarkers for SMA), candidate protein and metabolite markers were identified. No transcript biomarker candidates were identified. Additional mining of this rich dataset may yield important insights into relevant SMA-related pathophysiology and biological network associations. Additional prospective studies are needed to confirm these findings, demonstrate sensitivity to change with disease progression, and assess potential impact on clinical trial design. Trial Registry Clinicaltrials.gov NCT00756821. PMID:22558154
Integrated Systems Biology Analysis of Transcriptomes Reveals Candidate Genes for Acidity Control in Developing Fruits of Sweet Orange (Citrus sinensis L. Osbeck).

PubMed

Huang, Dingquan; Zhao, Yihong; Cao, Minghao; Qiao, Liang; Zheng, Zhi-Liang

2016-01-01

Organic acids, such as citrate and malate, are important contributors for the sensory traits of fleshy fruits. Although their biosynthesis has been illustrated, regulatory mechanisms of acid accumulation remain to be dissected. To provide transcriptional architecture and identify candidate genes for citrate accumulation in fruits, we have selected for transcriptome analysis four varieties of sweet orange (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). Fruits of these varieties at 45 days post anthesis (DPA), which corresponds to Stage I (cell division), had similar acidity, but they displayed differential acid accumulation at 142 DPA (Stage II, cell expansion). Transcriptomes of fruits at 45 and 142 DPA were profiled using RNA sequencing and analyzed with three different algorithms (Pearson correlation, gene coexpression network and surrogate variable analysis). Our network analysis shows that the acid-correlated genes belong to three distinct network modules. Several of these candidate fruit acidity genes encode regulatory proteins involved in transport (such as AHA10), degradation (such as APD2) and transcription (such as AIL6) and act as hubs in the citrate accumulation gene networks. Taken together, our integrated systems biology analysis has provided new insights into the fruit citrate accumulation gene network and led to the identification of candidate genes likely associated with the fruit acidity control.
Integrated Systems Biology Analysis of Transcriptomes Reveals Candidate Genes for Acidity Control in Developing Fruits of Sweet Orange (Citrus sinensis L. Osbeck)

PubMed Central

Huang, Dingquan; Zhao, Yihong; Cao, Minghao; Qiao, Liang; Zheng, Zhi-Liang

2016-01-01

Organic acids, such as citrate and malate, are important contributors for the sensory traits of fleshy fruits. Although their biosynthesis has been illustrated, regulatory mechanisms of acid accumulation remain to be dissected. To provide transcriptional architecture and identify candidate genes for citrate accumulation in fruits, we have selected for transcriptome analysis four varieties of sweet orange (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). Fruits of these varieties at 45 days post anthesis (DPA), which corresponds to Stage I (cell division), had similar acidity, but they displayed differential acid accumulation at 142 DPA (Stage II, cell expansion). Transcriptomes of fruits at 45 and 142 DPA were profiled using RNA sequencing and analyzed with three different algorithms (Pearson correlation, gene coexpression network and surrogate variable analysis). Our network analysis shows that the acid-correlated genes belong to three distinct network modules. Several of these candidate fruit acidity genes encode regulatory proteins involved in transport (such as AHA10), degradation (such as APD2) and transcription (such as AIL6) and act as hubs in the citrate accumulation gene networks. Taken together, our integrated systems biology analysis has provided new insights into the fruit citrate accumulation gene network and led to the identification of candidate genes likely associated with the fruit acidity control. PMID:27092171
Effector-mediated discovery of a novel resistance gene against Bremia lactucae in a nonhost lettuce species.

PubMed

Giesbers, Anne K J; Pelgrom, Alexandra J E; Visser, Richard G F; Niks, Rients E; Van den Ackerveken, Guido; Jeuken, Marieke J W

2017-11-01

Candidate effectors from lettuce downy mildew (Bremia lactucae) enable high-throughput germplasm screening for the presence of resistance (R) genes. The nonhost species Lactuca saligna comprises a source of B. lactucae R genes that has hardly been exploited in lettuce breeding. Its cross-compatibility with the host species L. sativa enables the study of inheritance of nonhost resistance (NHR). We performed transient expression of candidate RXLR effector genes from B. lactucae in a diverse Lactuca germplasm set. Responses to two candidate effectors (BLR31 and BLN08) were genetically mapped and tested for co-segregation with disease resistance. BLN08 induced a hypersensitive response (HR) in 55% of the L. saligna accessions, but responsiveness did not co-segregate with resistance to Bl:24. BLR31 triggered an HR in 5% of the L. saligna accessions, and revealed a novel R gene providing complete B. lactucae race Bl:24 resistance. Resistant hybrid plants that were BLR31 nonresponsive indicated other unlinked R genes and/or nonhost QTLs. We have identified a candidate avirulence effector of B. lactucae (BLR31) and its cognate R gene in L. saligna. Concurrently, our results suggest that R genes are not required for NHR of L. saligna. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Zinc-dependent global transcriptional control, transcriptional deregulation, and higher gene copy number for genes in metal homeostasis of the hyperaccumulator Arabidopsis halleri.

PubMed

Talke, Ina N; Hanikenne, Marc; Krämer, Ute

2006-09-01

The metal hyperaccumulator Arabidopsis halleri exhibits naturally selected zinc (Zn) and cadmium (Cd) hypertolerance and accumulates extraordinarily high Zn concentrations in its leaves. With these extreme physiological traits, A. halleri phylogenetically belongs to the sister clade of Arabidopsis thaliana. Using a combination of genome-wide cross species microarray analysis and real-time reverse transcription-PCR, a set of candidate genes is identified for Zn hyperaccumulation, Zn and Cd hypertolerance, and the adjustment of micronutrient homeostasis in A. halleri. Eighteen putative metal homeostasis genes are newly identified to be more highly expressed in A. halleri than in A. thaliana, and 11 previously identified candidate genes are confirmed. The encoded proteins include HMA4, known to contribute to root-shoot transport of Zn in A. thaliana. Expression of either AtHMA4 or AhHMA4 confers cellular Zn and Cd tolerance to yeast (Saccharomyces cerevisiae). Among further newly implicated proteins are IRT3 and ZIP10, which have been proposed to contribute to cytoplasmic Zn influx, and FRD3 required for iron partitioning in A. thaliana. In A. halleri, the presence of more than a single genomic copy is a hallmark of several highly expressed candidate genes with possible roles in metal hyperaccumulation and metal hypertolerance. Both A. halleri and A. thaliana exert tight regulatory control over Zn homeostasis at the transcript level. Zn hyperaccumulation in A. halleri involves enhanced partitioning of Zn from roots into shoots. The transcriptional regulation of marker genes suggests that in the steady state, A. halleri roots, but not the shoots, act as physiologically Zn deficient under conditions of moderate Zn supply.
New candidate loci identified by array-CGH in a cohort of 100 children presenting with syndromic obesity.

PubMed

Vuillaume, Marie-Laure; Naudion, Sophie; Banneau, Guillaume; Diene, Gwenaelle; Cartault, Audrey; Cailley, Dorothée; Bouron, Julie; Toutain, Jérôme; Bourrouillou, Georges; Vigouroux, Adeline; Bouneau, Laurence; Nacka, Fabienne; Kieffer, Isabelle; Arveiler, Benoit; Knoll-Gellida, Anja; Babin, Patrick J; Bieth, Eric; Jouret, Béatrice; Julia, Sophie; Sarda, Pierre; Geneviève, David; Faivre, Laurence; Lacombe, Didier; Barat, Pascal; Tauber, Maithé; Delrue, Marie-Ange; Rooryck, Caroline

2014-08-01

Syndromic obesity is defined by the association of obesity with one or more feature(s) including developmental delay, dysmorphic traits, and/or congenital malformations. Over 25 syndromic forms of obesity have been identified. However, most cases remain of unknown etiology. The aim of this study was to identify new candidate loci associated with syndromic obesity to find new candidate genes and to better understand molecular mechanisms involved in this pathology. We performed oligonucleotide microarray-based comparative genomic hybridization in a cohort of 100 children presenting with syndromic obesity of unknown etiology, after exhaustive clinical, biological, and molecular studies. Chromosomal copy number variations were detected in 42% of the children in our cohort, with 23% of patients with potentially pathogenic copy number variants. Our results support that chromosomal rearrangements are frequently associated with syndromic obesity with a variety of contributory genes having relevance to either obesity or developmental delay. A list of inherited or apparently de novo duplications and deletions including their enclosed genes and not previously linked to syndromic obesity was established. Proteins encoded by several of these genes are involved in lipid metabolism (ACOXL, MSMO1, MVD, and PDZK1) linked with nervous system function (BDH1 and LINGO2), neutral lipid storage (PLIN2), energy homeostasis and metabolic processes (CDH13, CNTNAP2, CPPED1, NDUFA4, PTGS2, and SOCS6). © 2014 Wiley Periodicals, Inc.

Identification of Linkages between EDCs in Personal Care Products and Breast Cancer through Data Integration Combined with Gene Network Analysis.

PubMed

Jeong, Hyeri; Kim, Jongwoon; Kim, Youngjun

2017-09-30

Approximately 1000 chemicals have been reported to possibly have endocrine disrupting effects, some of which are used in consumer products, such as personal care products (PCPs) and cosmetics. We conducted data integration combined with gene network analysis to: (i) identify causal molecular mechanisms between endocrine disrupting chemicals (EDCs) used in PCPs and breast cancer; and (ii) screen candidate EDCs associated with breast cancer. Among EDCs used in PCPs, four EDCs having correlation with breast cancer were selected, and we curated 27 common interacting genes between those EDCs and breast cancer to perform the gene network analysis. Based on the gene network analysis, ESR1, TP53, NCOA1, AKT1, and BCL6 were found to be key genes to demonstrate the molecular mechanisms of EDCs in the development of breast cancer. Using GeneMANIA, we additionally predicted 20 genes which could interact with the 27 common genes. In total, 47 genes combining the common and predicted genes were functionally grouped with the gene ontology and KEGG pathway terms. With those genes, we finally screened candidate EDCs for their potential to increase breast cancer risk. This study highlights that our approach can provide insights to understand mechanisms of breast cancer and identify potential EDCs which are in association with breast cancer.
Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease

PubMed Central

Fernández, Maria V.; Budde, John; Del-Aguila, Jorge L.; Ibañez, Laura; Deming, Yuetiva; Harari, Oscar; Norton, Joanne; Morris, John C.; Goate, Alison M.; Cruchaga, Carlos

2018-01-01

Gene-based tests to study the combined effect of rare variants on a particular phenotype have been widely developed for case-control studies, but their evolution and adaptation for family-based studies, especially studies of complex incomplete families, has been slower. In this study, we have performed a practical examination of all the latest gene-based methods available for family-based study designs using both simulated and real datasets. We examined the performance of several collapsing, variance-component, and transmission disequilibrium tests across eight different software packages and 22 models utilizing a cohort of 285 families (N = 1,235) with late-onset Alzheimer disease (LOAD). After a thorough examination of each of these tests, we propose a methodological approach to identify, with high confidence, genes associated with the tested phenotype and we provide recommendations to select the best software and model for family-based gene-based analyses. Additionally, in our dataset, we identified PTK2B, a GWAS candidate gene for sporadic AD, along with six novel genes (CHRD, CLCN2, HDLBP, CPAMD8, NLRP9, and MAS1L) as candidate genes for familial LOAD. PMID:29670507
Unsupervised text mining for assessing and augmenting GWAS results.

PubMed

Ailem, Melissa; Role, François; Nadif, Mohamed; Demenais, Florence

2016-04-01

Text mining can assist in the analysis and interpretation of large-scale biomedical data, helping biologists to quickly and cheaply gain confirmation of hypothesized relationships between biological entities. We set this question in the context of genome-wide association studies (GWAS), an actively emerging field that contributed to identify many genes associated with multifactorial diseases. These studies allow to identify groups of genes associated with the same phenotype, but provide no information about the relationships between these genes. Therefore, our objective is to leverage unsupervised text mining techniques using text-based cosine similarity comparisons and clustering applied to candidate and random gene vectors, in order to augment the GWAS results. We propose a generic framework which we used to characterize the relationships between 10 genes reported associated with asthma by a previous GWAS. The results of this experiment showed that the similarities between these 10 genes were significantly stronger than would be expected by chance (one-sided p-value<0.01). The clustering of observed and randomly selected gene also allowed to generate hypotheses about potential functional relationships between these genes and thus contributed to the discovery of new candidate genes for asthma. Copyright © 2016 Elsevier Inc. All rights reserved.
Repressed expression of a gene for a basic helix-loop-helix protein causes a white flower phenotype in carnation

PubMed Central

Totsuka, Akane; Okamoto, Emi; Miyahara, Taira; Kouno, Takanobu; Cano, Emilio A.; Sasaki, Nobuhiro; Watanabe, Aiko; Tasaki, Keisuke; Nishihara, Masahiro; Ozeki, Yoshihiro

2018-01-01

In a previous study, two genes responsible for white flower phenotypes in carnation were identified. These genes encoded enzymes involved in anthocyanin synthesis, namely, flavanone 3-hydroxylase (F3H) and dihydroflavonol 4-reductase (DFR), and showed reduced expression in the white flower phenotypes. Here, we identify another candidate gene for white phenotype in carnation flowers using an RNA-seq analysis followed by RT-PCR. This candidate gene encodes a transcriptional regulatory factor of the basic helix-loop-helix (bHLH) type. In the cultivar examined here, both F3H and DFR genes produced active enzyme proteins; however, expression of DFR and of genes for enzymes involved in the downstream anthocyanin synthetic pathway from DFR was repressed in the absence of bHLH expression. Occasionally, flowers of the white flowered cultivar used here have red speckles and stripes on the white petals. We found that expression of bHLH occurred in these red petal segments and induced expression of DFR and the following downstream enzymes. Our results indicate that a member of the bHLH superfamily is another gene involved in anthocyanin synthesis in addition to structural genes encoding enzymes. PMID:29681756
Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease.

PubMed

Fernández, Maria V; Budde, John; Del-Aguila, Jorge L; Ibañez, Laura; Deming, Yuetiva; Harari, Oscar; Norton, Joanne; Morris, John C; Goate, Alison M; Cruchaga, Carlos

2018-01-01

Gene-based tests to study the combined effect of rare variants on a particular phenotype have been widely developed for case-control studies, but their evolution and adaptation for family-based studies, especially studies of complex incomplete families, has been slower. In this study, we have performed a practical examination of all the latest gene-based methods available for family-based study designs using both simulated and real datasets. We examined the performance of several collapsing, variance-component, and transmission disequilibrium tests across eight different software packages and 22 models utilizing a cohort of 285 families ( N = 1,235) with late-onset Alzheimer disease (LOAD). After a thorough examination of each of these tests, we propose a methodological approach to identify, with high confidence, genes associated with the tested phenotype and we provide recommendations to select the best software and model for family-based gene-based analyses. Additionally, in our dataset, we identified PTK2B , a GWAS candidate gene for sporadic AD, along with six novel genes ( CHRD, CLCN2, HDLBP, CPAMD8, NLRP9 , and MAS1L ) as candidate genes for familial LOAD.
A genome-wide scan for signatures of selection in Azeri and Khuzestani buffalo breeds.

PubMed

Mokhber, Mahdi; Moradi-Shahrbabak, Mohammad; Sadeghi, Mostafa; Moradi-Shahrbabak, Hossein; Stella, Alessandra; Nicolzzi, Ezequiel; Rahmaninia, Javad; Williams, John L

2018-06-11

Identification of genomic regions that have been targets of selection may shed light on the genetic history of livestock populations and help to identify variation controlling commercially important phenotypes. The Azeri and Kuzestani buffalos are the most common indigenous Iranian breeds which have been subjected to divergent selection and are well adapted to completely different regions. Examining the genetic structure of these populations may identify genomic regions associated with adaptation to the different environments and production goals. A set of 385 water buffalo samples from Azeri (N = 262) and Khuzestani (N = 123) breeds were genotyped using the Axiom® Buffalo Genotyping 90 K Array. The unbiased fixation index method (F ST ) was used to detect signatures of selection. In total, 13 regions with outlier F ST values (0.1%) were identified. Annotation of these regions using the UMD3.1 Bos taurus Genome Assembly was performed to find putative candidate genes and QTLs within the selected regions. Putative candidate genes identified include FBXO9, NDFIP1, ACTR3, ARHGAP26, SERPINF2, BOLA-DRB3, BOLA-DQB, CLN8, and MYOM2. Candidate genes identified in regions potentially under selection were associated with physiological pathways including milk production, cytoskeleton organization, growth, metabolic function, apoptosis and domestication-related changes include immune and nervous system development. The QTL identified are involved in economically important traits in buffalo related to milk composition, udder structure, somatic cell count, meat quality, and carcass and body weight.
Genome-Wide Identification of Differentially Expressed Genes Associated with the High Yielding of Oleoresin in Secondary Xylem of Masson Pine (Pinus massoniana Lamb) by Transcriptomic Analysis

PubMed Central

Liu, Qinghua; Zhou, Zhichun; Wei, Yongcheng; Shen, Danyu; Feng, Zhongping; Hong, Shanping

2015-01-01

Masson pine is an important timber and resource for oleoresin in South China. Increasing yield of oleoresin in stems can raise economic benefits and enhance the resistance to bark beetles. However, the genetic mechanisms for regulating the yield of oleoresin were still unknown. Here, high-throughput sequencing technology was used to investigate the transcriptome and compare the gene expression profiles of high and low oleoresin-yielding genotypes. A total of 40,690,540 reads were obtained and assembled into 137,499 transcripts from the secondary xylem tissues. We identified 84,842 candidate unigenes based on sequence annotation using various databases and 96 unigenes were candidates for terpenoid backbone biosynthesis in pine. By comparing the expression profiles of high and low oleoresin-yielding genotypes, 649 differentially expressed genes (DEGs) were identified. GO enrichment analysis of DEGs revealed that multiple pathways were related to high yield of oleoresin. Nine candidate genes were validated by QPCR analysis. Among them, the candidate genes encoding geranylgeranyl diphosphate synthase (GGPS) and (-)-alpha/beta-pinene synthase were up-regulated in the high oleoresin-yielding genotype, while tricyclene synthase revealed lower expression level, which was in good agreement with the GC/MS result. In addition, DEG encoding ABC transporters, pathogenesis-related proteins (PR5 and PR9), phosphomethylpyrimidine synthase, non-specific lipid-transfer protein-like protein and ethylene responsive transcription factors (ERFs) were also confirmed to be critical for the biosynthesis of oleoresin. The next-generation sequencing strategy used in this study has proven to be a powerful means for analyzing transcriptome variation related to the yield of oleoresin in masson pine. The candidate genes encoding GGPS, (-)-alpha/beta-pinene, tricyclene synthase, ABC transporters, non-specific lipid-transfer protein-like protein, phosphomethylpyrimidine synthase, ERFs and pathogen responses may play important roles in regulating the yield of oleoresin. These DEGs are worthy of special attention in future studies. PMID:26167875
A high-resolution genetic, physical, and comparative gene map of the doublefoot (Dbf) region of mouse chromosome 1 and the region of conserved synteny on human chromosome 2q35.

PubMed

Hayes, C; Rump, A; Cadman, M R; Harrison, M; Evans, E P; Lyon, M F; Morriss-Kay, G M; Rosenthal, A; Brown, S D

2001-12-01

The mouse doublefoot (Dbf) mutant exhibits preaxial polydactyly in association with craniofacial defects. This mutation has previously been mapped to mouse chromosome 1. We have used a positional cloning strategy, coupled with a comparative sequencing approach using available human draft sequence, to identify putative candidates for the Dbf gene in the mouse and in homologous human region. We have constructed a high-resolution genetic map of the region, localizing the mutation to a 0.4-cM (+/-0.0061) interval on mouse chromosome 1. Furthermore, we have constructed contiguous BAC/PAC clone maps across the mouse and human Dbf region. Using existing markers and additional sequence tagged sites, which we have generated, we have anchored the physical map to the genetic map. Through the comparative sequencing of these clones we have identified 35 genes within this interval, indicating that the region is gene-rich. From this we have identified several genes that are known to be differentially expressed in the developing mid-gestation mouse embryo, some in the developing embryonic limb buds. These genes include those encoding known developmental signaling molecules such as WNT proteins and IHH, and we provide evidence that these genes are candidates for the Dbf mutation.
Application of selection mapping to identify genomic regions associated with dairy production in sheep.

PubMed

Gutiérrez-Gil, Beatriz; Arranz, Juan Jose; Pong-Wong, Ricardo; García-Gámez, Elsa; Kijas, James; Wiener, Pamela

2014-01-01

In Europe, especially in Mediterranean areas, the sheep has been traditionally exploited as a dual purpose species, with income from both meat and milk. Modernization of husbandry methods and the establishment of breeding schemes focused on milk production have led to the development of "dairy breeds." This study investigated selective sweeps specifically related to dairy production in sheep by searching for regions commonly identified in different European dairy breeds. With this aim, genotypes from 44,545 SNP markers covering the sheep autosomes were analysed in both European dairy and non-dairy sheep breeds using two approaches: (i) identification of genomic regions showing extreme genetic differentiation between each dairy breed and a closely related non-dairy breed, and (ii) identification of regions with reduced variation (heterozygosity) in the dairy breeds using two methods. Regions detected in at least two breeds (breed pairs) by the two approaches (genetic differentiation and at least one of the heterozygosity-based analyses) were labeled as core candidate convergence regions and further investigated for candidate genes. Following this approach six regions were detected. For some of them, strong candidate genes have been proposed (e.g. ABCG2, SPP1), whereas some other genes designated as candidates based on their association with sheep and cattle dairy traits (e.g. LALBA, DGAT1A) were not associated with a detectable sweep signal. Few of the identified regions were coincident with QTL previously reported in sheep, although many of them corresponded to orthologous regions in cattle where QTL for dairy traits have been identified. Due to the limited number of QTL studies reported in sheep compared with cattle, the results illustrate the potential value of selection mapping to identify genomic regions associated with dairy traits in sheep.
Fourteen-Genome Comparison Identifies DNA Markers for Severe-Disease-Associated Strains of Clostridium difficile▿†

PubMed Central

Forgetta, Vincenzo; Oughton, Matthew T.; Marquis, Pascale; Brukner, Ivan; Blanchette, Ruth; Haub, Kevin; Magrini, Vince; Mardis, Elaine R.; Gerding, Dale N.; Loo, Vivian G.; Miller, Mark A.; Mulvey, Michael R.; Rupnik, Maja; Dascal, Andre; Dewar, Ken

2011-01-01

Clostridium difficile is a common cause of infectious diarrhea in hospitalized patients. A severe and increased incidence of C. difficile infection (CDI) is associated predominantly with the NAP1 strain; however, the existence of other severe-disease-associated (SDA) strains and the extensive genetic diversity across C. difficile complicate reliable detection and diagnosis. Comparative genome analysis of 14 sequenced genomes, including those of a subset of NAP1 isolates, allowed the assessment of genetic diversity within and between strain types to identify DNA markers that are associated with severe disease. Comparative genome analysis of 14 isolates, including five publicly available strains, revealed that C. difficile has a core genome of 3.4 Mb, comprising ∼3,000 genes. Analysis of the core genome identified candidate DNA markers that were subsequently evaluated using a multistrain panel of 177 isolates, representing more than 50 pulsovars and 8 toxinotypes. A subset of 117 isolates from the panel had associated patient data that allowed assessment of an association between the DNA markers and severe CDI. We identified 20 candidate DNA markers for species-wide detection and 10,683 single nucleotide polymorphisms (SNPs) associated with the predominant SDA strain (NAP1). A species-wide detection candidate marker, the sspA gene, was found to be the same across 177 sequenced isolates and lacked significant similarity to those of other species. Candidate SNPs in genes CD1269 and CD1265 were found to associate more closely with disease severity than currently used diagnostic markers, as they were also present in the toxin A-negative and B-positive (A-B+) strain types. The genetic markers identified illustrate the potential of comparative genomics for the discovery of diagnostic DNA-based targets that are species specific or associated with multiple SDA strains. PMID:21508155
Application of Selection Mapping to Identify Genomic Regions Associated with Dairy Production in Sheep

PubMed Central

Gutiérrez-Gil, Beatriz; Arranz, Juan Jose; Pong-Wong, Ricardo; García-Gámez, Elsa; Kijas, James; Wiener, Pamela

2014-01-01

In Europe, especially in Mediterranean areas, the sheep has been traditionally exploited as a dual purpose species, with income from both meat and milk. Modernization of husbandry methods and the establishment of breeding schemes focused on milk production have led to the development of “dairy breeds.” This study investigated selective sweeps specifically related to dairy production in sheep by searching for regions commonly identified in different European dairy breeds. With this aim, genotypes from 44,545 SNP markers covering the sheep autosomes were analysed in both European dairy and non-dairy sheep breeds using two approaches: (i) identification of genomic regions showing extreme genetic differentiation between each dairy breed and a closely related non-dairy breed, and (ii) identification of regions with reduced variation (heterozygosity) in the dairy breeds using two methods. Regions detected in at least two breeds (breed pairs) by the two approaches (genetic differentiation and at least one of the heterozygosity-based analyses) were labeled as core candidate convergence regions and further investigated for candidate genes. Following this approach six regions were detected. For some of them, strong candidate genes have been proposed (e.g. ABCG2, SPP1), whereas some other genes designated as candidates based on their association with sheep and cattle dairy traits (e.g. LALBA, DGAT1A) were not associated with a detectable sweep signal. Few of the identified regions were coincident with QTL previously reported in sheep, although many of them corresponded to orthologous regions in cattle where QTL for dairy traits have been identified. Due to the limited number of QTL studies reported in sheep compared with cattle, the results illustrate the potential value of selection mapping to identify genomic regions associated with dairy traits in sheep. PMID:24788864
Systems genetics of intravenous cocaine self-administration in the BXD recombinant inbred mouse panel

PubMed Central

Dickson, Price E.; Miller, Mellessa M.; Calton, Michele A.; Bubier, Jason A.; Cook, Melloni N.; Goldowitz, Daniel; Chesler, Elissa J.; Mittleman, Guy

2015-01-01

Rationale Cocaine addiction is a major public health problem with a substantial genetic basis for which the biological mechanisms remain largely unknown. Systems genetics is a powerful method for discovering novel mechanisms underlying complex traits, and intravenous drug self-administration (IVSA) is the gold standard for assessing volitional drug use in preclinical studies. We have integrated these approaches to identify novel genes and networks underling cocaine use in mice. Methods Mice from 39 BXD strains acquired cocaine IVSA (0.56 mg/kg/infusion). Mice from 29 BXD strains completed a full dose-response curve (0.032 – 1.8 mg/kg/infusion). Results We identified independent genetic correlations between cocaine IVSA and measures of environmental exploration and cocaine sensitization. We identified genome-wide significant QTL on chromosomes 7 and 11 associated with shifts in the dose-response curve and on chromosome 16 associated with sessions to acquire cocaine IVSA. Using publicly available gene expression data from the nucleus accumbens, midbrain, and prefrontal cortex of drug-naïve mice, we identified Aplp1 and Cyfip2 as positional candidates underlying the behavioral QTL on chromosomes 7 and 11, respectively. A genome-wide significant trans-eQTL linking Fam53b (a GWAS candidate for human cocaine dependence) on chromosome 7 to the cocaine IVSA behavioral QTL on chromosome 11 was identified in the midbrain; Fam53b and Cyfip2 were co-expressed genome-wide significantly in the midbrain. This finding indicates that cocaine IVSA studies using mice can identify genes involved in human cocaine use. Conclusions These data provide novel candidate genes underlying cocaine IVSA in mice, and suggest mechanisms driving human cocaine use. PMID:26581503
Systematic Prioritization and Integrative Analysis of Copy Number Variations in Schizophrenia Reveal Key Schizophrenia Susceptibility Genes

PubMed Central

Luo, Xiongjian; Huang, Liang; Han, Leng; Luo, Zhenwu; Hu, Fang; Tieu, Roger; Gan, Lin

2014-01-01

Schizophrenia is a common mental disorder with high heritability and strong genetic heterogeneity. Common disease-common variants hypothesis predicts that schizophrenia is attributable in part to common genetic variants. However, recent studies have clearly demonstrated that copy number variations (CNVs) also play pivotal roles in schizophrenia susceptibility and explain a proportion of missing heritability. Though numerous CNVs have been identified, many of the regions affected by CNVs show poor overlapping among different studies, and it is not known whether the genes disrupted by CNVs contribute to the risk of schizophrenia. By using cumulative scoring, we systematically prioritized the genes affected by CNVs in schizophrenia. We identified 8 top genes that are frequently disrupted by CNVs, including NRXN1, CHRNA7, BCL9, CYFIP1, GJA8, NDE1, SNAP29, and GJA5. Integration of genes affected by CNVs with known schizophrenia susceptibility genes (from previous genetic linkage and association studies) reveals that many genes disrupted by CNVs are also associated with schizophrenia. Further protein-protein interaction (PPI) analysis indicates that protein products of genes affected by CNVs frequently interact with known schizophrenia-associated proteins. Finally, systematic integration of CNVs prioritization data with genetic association and PPI data identifies key schizophrenia candidate genes. Our results provide a global overview of genes impacted by CNVs in schizophrenia and reveal a densely interconnected molecular network of de novo CNVs in schizophrenia. Though the prioritized top genes represent promising schizophrenia risk genes, further work with different prioritization methods and independent samples is needed to confirm these findings. Nevertheless, the identified key candidate genes may have important roles in the pathogenesis of schizophrenia, and further functional characterization of these genes may provide pivotal targets for future therapeutics and diagnostics. PMID:24664977
Gene expression atlas of pigeonpea and its application to gain insights into genes associated with pollen fertility implicated in seed formation

PubMed Central

Pazhamala, Lekha T.; Purohit, Shilp; Saxena, Rachit K.; Garg, Vanika; Krishnamurthy, L.; Verdier, Jerome

2017-01-01

Abstract Pigeonpea (Cajanus cajan) is an important grain legume of the semi-arid tropics, mainly used for its protein rich seeds. To link the genome sequence information with agronomic traits resulting from specific developmental processes, a Cajanus cajan gene expression atlas (CcGEA) was developed using the Asha genotype. Thirty tissues/organs representing developmental stages from germination to senescence were used to generate 590.84 million paired-end RNA-Seq data. The CcGEA revealed a compendium of 28 793 genes with differential, specific, spatio-temporal and constitutive expression during various stages of development in different tissues. As an example to demonstrate the application of the CcGEA, a network of 28 flower-related genes analysed for cis-regulatory elements and splicing variants has been identified. In addition, expression analysis of these candidate genes in male sterile and male fertile genotypes suggested their critical role in normal pollen development leading to seed formation. Gene network analysis also identified two regulatory genes, a pollen-specific SF3 and a sucrose–proton symporter, that could have implications for improvement of agronomic traits such as seed production and yield. In conclusion, the CcGEA provides a valuable resource for pigeonpea to identify candidate genes involved in specific developmental processes and to understand the well-orchestrated growth and developmental process in this resilient crop. PMID:28338822
Identifying metabolic enzymes with multiple types of association evidence

PubMed Central

Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M

2006-01-01

Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130
Identification of genes related to proliferative diabetic retinopathy through RWR algorithm based on protein-protein interaction network.

PubMed

Zhang, Jian; Suo, Yan; Liu, Min; Xu, Xun

2018-06-01

Proliferative diabetic retinopathy (PDR) is one of the most common complications of diabetes and can lead to blindness. Proteomic studies have provided insight into the pathogenesis of PDR and a series of PDR-related genes has been identified but are far from fully characterized because the experimental methods are expensive and time consuming. In our previous study, we successfully identified 35 candidate PDR-related genes through the shortest-path algorithm. In the current study, we developed a computational method using the random walk with restart (RWR) algorithm and the protein-protein interaction (PPI) network to identify potential PDR-related genes. After some possible genes were obtained by the RWR algorithm, a three-stage filtration strategy, which includes the permutation test, interaction test and enrichment test, was applied to exclude potential false positives caused by the structure of PPI network, the poor interaction strength, and the limited similarity on gene ontology (GO) terms and biological pathways. As a result, 36 candidate genes were discovered by the method which was different from the 35 genes reported in our previous study. A literature review showed that 21 of these 36 genes are supported by previous experiments. These findings suggest the robustness and complementary effects of both our efforts using different computational methods, thus providing an alternative method to study PDR pathogenesis. Copyright © 2017 Elsevier B.V. All rights reserved.
Candidate genes that have facilitated freshwater adaptation by palaemonid prawns in the genus Macrobrachium: identification and expression validation in a model species (M. koombooloomba).

PubMed

Rahi, Md Lifat; Amin, Shorash; Mather, Peter B; Hurwood, David A

2017-01-01

The endemic Australian freshwater prawn, Macrobrachium koombooloomba , provides a model for exploring genes involved with freshwater adaptation because it is one of the relatively few Macrobrachium species that can complete its entire life cycle in freshwater. The present study was conducted to identify potential candidate genes that are likely to contribute to effective freshwater adaptation by M. koombooloomba using a transcriptomics approach. De novo assembly of 75 bp paired end 227,564,643 high quality Illumina raw reads from 6 different cDNA libraries revealed 125,917 contigs of variable lengths (200-18,050 bp) with an N50 value of 1597. In total, 31,272 (24.83%) of the assembled contigs received significant blast hits, of which 27,686 and 22,560 contigs were mapped and functionally annotated, respectively. CEGMA (Core Eukaryotic Genes Mapping Approach) based transcriptome quality assessment revealed 96.37% completeness. We identified 43 different potential genes that are likely to be involved with freshwater adaptation in M. koombooloomba . Identified candidate genes included: 25 genes for osmoregulation, five for cell volume regulation, seven for stress tolerance, three for body fluid (haemolymph) maintenance, eight for epithelial permeability and water channel regulation, nine for egg size control and three for larval development. RSEM (RNA-Seq Expectation Maximization) based abundance estimation revealed that 6,253, 5,753 and 3,795 transcripts were expressed (at TPM value ≥10) in post larvae, juveniles and adults, respectively. Differential gene expression (DGE) analysis showed that 15 genes were expressed differentially in different individuals but these genes apparently were not involved with freshwater adaptation but rather were involved in growth, development and reproductive maturation. The genomic resources developed here will be useful for better understanding the molecular basis of freshwater adaptation in Macrobrachium prawns and other crustaceans more broadly.
Candidate genes that have facilitated freshwater adaptation by palaemonid prawns in the genus Macrobrachium: identification and expression validation in a model species (M. koombooloomba)

PubMed Central

Amin, Shorash; Mather, Peter B.; Hurwood, David A.

2017-01-01

Background The endemic Australian freshwater prawn, Macrobrachium koombooloomba, provides a model for exploring genes involved with freshwater adaptation because it is one of the relatively few Macrobrachium species that can complete its entire life cycle in freshwater. Methods The present study was conducted to identify potential candidate genes that are likely to contribute to effective freshwater adaptation by M. koombooloomba using a transcriptomics approach. De novo assembly of 75 bp paired end 227,564,643 high quality Illumina raw reads from 6 different cDNA libraries revealed 125,917 contigs of variable lengths (200–18,050 bp) with an N50 value of 1597. Results In total, 31,272 (24.83%) of the assembled contigs received significant blast hits, of which 27,686 and 22,560 contigs were mapped and functionally annotated, respectively. CEGMA (Core Eukaryotic Genes Mapping Approach) based transcriptome quality assessment revealed 96.37% completeness. We identified 43 different potential genes that are likely to be involved with freshwater adaptation in M. koombooloomba. Identified candidate genes included: 25 genes for osmoregulation, five for cell volume regulation, seven for stress tolerance, three for body fluid (haemolymph) maintenance, eight for epithelial permeability and water channel regulation, nine for egg size control and three for larval development. RSEM (RNA-Seq Expectation Maximization) based abundance estimation revealed that 6,253, 5,753 and 3,795 transcripts were expressed (at TPM value ≥10) in post larvae, juveniles and adults, respectively. Differential gene expression (DGE) analysis showed that 15 genes were expressed differentially in different individuals but these genes apparently were not involved with freshwater adaptation but rather were involved in growth, development and reproductive maturation. Discussion The genomic resources developed here will be useful for better understanding the molecular basis of freshwater adaptation in Macrobrachium prawns and other crustaceans more broadly. PMID:28194319
Identification of novel candidate drivers connecting different dysfunctional levels for lung adenocarcinoma using protein-protein interactions and a shortest path approach

NASA Astrophysics Data System (ADS)

Chen, Lei; Huang, Tao; Zhang, Yu-Hang; Jiang, Yang; Zheng, Mingyue; Cai, Yu-Dong

2016-07-01

Tumors are formed by the abnormal proliferation of somatic cells with disordered growth regulation under the influence of tumorigenic factors. Recently, the theory of “cancer drivers” connects tumor initiation with several specific mutations in the so-called cancer driver genes. According to the differentiation of four basic levels between tumor and adjacent normal tissues, the cancer drivers can be divided into the following: (1) Methylation level, (2) microRNA level, (3) mutation level, and (4) mRNA level. In this study, a computational method is proposed to identify novel lung adenocarcinoma drivers based on dysfunctional genes on the methylation, microRNA, mutation and mRNA levels. First, a large network was constructed using protein-protein interactions. Next, we searched all of the shortest paths connecting dysfunctional genes on different levels and extracted new candidate genes lying on these paths. Finally, the obtained candidate genes were filtered by a permutation test and an additional strict selection procedure involving a betweenness ratio and an interaction score. Several candidate genes remained, which are deemed to be related to two different levels of cancer. The analyses confirmed our assertions that some have the potential to contribute to the tumorigenesis process on multiple levels.
Computational exploration of cis-regulatory modules in rhythmic expression data using the "Exploration of Distinctive CREs and CRMs" (EDCC) and "CRM Network Generator" (CNG) programs.

PubMed

Bekiaris, Pavlos Stephanos; Tekath, Tobias; Staiger, Dorothee; Danisman, Selahattin

2018-01-01

Understanding the effect of cis-regulatory elements (CRE) and clusters of CREs, which are called cis-regulatory modules (CRM), in eukaryotic gene expression is a challenge of computational biology. We developed two programs that allow simple, fast and reliable analysis of candidate CREs and CRMs that may affect specific gene expression and that determine positional features between individual CREs within a CRM. The first program, "Exploration of Distinctive CREs and CRMs" (EDCC), correlates candidate CREs and CRMs with specific gene expression patterns. For pairs of CREs, EDCC also determines positional preferences of the single CREs in relation to each other and to the transcriptional start site. The second program, "CRM Network Generator" (CNG), prioritizes these positional preferences using a neural network and thus allows unbiased rating of the positional preferences that were determined by EDCC. We tested these programs with data from a microarray study of circadian gene expression in Arabidopsis thaliana. Analyzing more than 1.5 million pairwise CRE combinations, we found 22 candidate combinations, of which several contained known clock promoter elements together with elements that had not been identified as relevant to circadian gene expression before. CNG analysis further identified positional preferences of these CRE pairs, hinting at positional information that may be relevant for circadian gene expression. Future wet lab experiments will have to determine which of these combinations confer daytime specific circadian gene expression.

Computational exploration of cis-regulatory modules in rhythmic expression data using the “Exploration of Distinctive CREs and CRMs” (EDCC) and “CRM Network Generator” (CNG) programs

PubMed Central

Staiger, Dorothee

2018-01-01

Understanding the effect of cis-regulatory elements (CRE) and clusters of CREs, which are called cis-regulatory modules (CRM), in eukaryotic gene expression is a challenge of computational biology. We developed two programs that allow simple, fast and reliable analysis of candidate CREs and CRMs that may affect specific gene expression and that determine positional features between individual CREs within a CRM. The first program, “Exploration of Distinctive CREs and CRMs” (EDCC), correlates candidate CREs and CRMs with specific gene expression patterns. For pairs of CREs, EDCC also determines positional preferences of the single CREs in relation to each other and to the transcriptional start site. The second program, “CRM Network Generator” (CNG), prioritizes these positional preferences using a neural network and thus allows unbiased rating of the positional preferences that were determined by EDCC. We tested these programs with data from a microarray study of circadian gene expression in Arabidopsis thaliana. Analyzing more than 1.5 million pairwise CRE combinations, we found 22 candidate combinations, of which several contained known clock promoter elements together with elements that had not been identified as relevant to circadian gene expression before. CNG analysis further identified positional preferences of these CRE pairs, hinting at positional information that may be relevant for circadian gene expression. Future wet lab experiments will have to determine which of these combinations confer daytime specific circadian gene expression. PMID:29298348
Genome-wide association study of CSF biomarkers Abeta1-42, t-tau, and p-tau181p in the ADNI cohort.

PubMed

Kim, S; Swaminathan, S; Shen, L; Risacher, S L; Nho, K; Foroud, T; Shaw, L M; Trojanowski, J Q; Potkin, S G; Huentelman, M J; Craig, D W; DeChairo, B M; Aisen, P S; Petersen, R C; Weiner, M W; Saykin, A J

2011-01-04

CSF levels of Aβ1-42, t-tau, and p-tau181p are potential early diagnostic markers for probable Alzheimer disease (AD). The influence of genetic variation on these markers has been investigated for candidate genes but not on a genome-wide basis. We report a genome-wide association study (GWAS) of CSF biomarkers (Aβ1-42, t-tau, p-tau181p, p-tau181p/Aβ1-42, and t-tau/Aβ1-42). A total of 374 non-Hispanic Caucasian participants in the Alzheimer's Disease Neuroimaging Initiative cohort with quality-controlled CSF and genotype data were included in this analysis. The main effect of single nucleotide polymorphisms (SNPs) under an additive genetic model was assessed on each of 5 CSF biomarkers. The p values of all SNPs for each CSF biomarker were adjusted for multiple comparisons by the Bonferroni method. We focused on SNPs with corrected p<0.01 (uncorrected p<3.10×10(-8)) and secondarily examined SNPs with uncorrected p values less than 10(-5) to identify potential candidates. Four SNPs in the regions of the APOE, LOC100129500, TOMM40, and EPC2 genes reached genome-wide significance for associations with one or more CSF biomarkers. SNPs in CCDC134, ABCG2, SREBF2, and NFATC4, although not reaching genome-wide significance, were identified as potential candidates. In addition to known candidate genes, APOE, TOMM40, and one hypothetical gene LOC100129500 partially overlapping APOE; one novel gene, EPC2, and several other interesting genes were associated with CSF biomarkers that are related to AD. These findings, especially the new EPC2 results, require replication in independent cohorts.
Genetic analyses of bolting in bulb onion (Allium cepa L.).

PubMed

Baldwin, Samantha; Revanna, Roopashree; Pither-Joyce, Meeghan; Shaw, Martin; Wright, Kathryn; Thomson, Susan; Moya, Leire; Lee, Robyn; Macknight, Richard; McCallum, John

2014-03-01

We present the first evidence for a QTL conditioning an adaptive trait in bulb onion, and the first linkage and population genetics analyses of candidate genes involved in photoperiod and vernalization physiology. Economic production of bulb onion (Allium cepa L.) requires adaptation to photoperiod and temperature such that a bulb is formed in the first year and a flowering umbel in the second. 'Bolting', or premature flowering before bulb maturation, is an undesirable trait strongly selected against by breeders during adaptation of germplasm. To identify genome regions associated with adaptive traits we conducted linkage mapping and population genetic analyses of candidate genes, and QTL analysis of bolting using a low-density linkage map. We performed tagged amplicon sequencing of ten candidate genes, including the FT-like gene family, in eight diverse populations to identify polymorphisms and seek evidence of differentiation. Low nucleotide diversity and negative estimates of Tajima's D were observed for most genes, consistent with purifying selection. Significant population differentiation was observed only in AcFT2 and AcSOC1. Selective genotyping in a large 'Nasik Red × CUDH2150' F2 family revealed genome regions on chromosomes 1, 3 and 6 associated (LOD > 3) with bolting. Validation genotyping of two F2 families grown in two environments confirmed that a QTL on chromosome 1, which we designate AcBlt1, consistently conditions bolting susceptibility in this cross. The chromosome 3 region, which coincides with a functionally characterised acid invertase, was not associated with bolting in other environments, but showed significant association with bulb sucrose content in this and other mapping pedigrees. These putative QTL and candidate genes were placed on the onion map, enabling future comparative studies of adaptive traits.
Lack of haplotype structuring for two candidate genes for trypanotolerance in cattle.

PubMed

Álvarez, I; Pérez-Pardal, L; Traoré, A; Fernández, I; Goyache, F

2016-04-01

Bovine trypanotolerance is a heritable trait associated to the ability of the individuals to control parasitaemia and anaemia. The INHBA (BTA4) and TICAM1 (BTA7) genes are strong candidates for trypanotolerance-related traits. The coding sequence of both genes (3951 bp in total) were analysed in a panel including 79 Asian, African and European cattle (Bos taurus and B. indicus) to identify naturally occurring polymorphisms on both genes. In general, the genetic diversity was low. Nineteen of the 33 mutations identified were found just one time. Seventeen different haplotypes were defined for the TICAM1 gene, and 9 and 12 were defined for the exon 1 and the exon 2 of the INHBA gene, respectively. There was no clear separation between cattle groups. The most frequent haplotypes identified in West African taurine samples were also identified in other cattle groups including Asian zebu and European cattle. Phylogenetic trees and principal component analysis confirmed that divergence among the cattle groups analysed was poor, particularly for the INHBA sequences. The European cattle subset had the lowest values of haplotype diversity for both the exon1 (monomorphic) and the exon2 (0.077 ± 0.066) of the INHBA gene. Neutrality tests, in general, did not suggest that the analysed genes were under positive selection. The assessed scenario would be consistent with the identification of recent mutations in evolutionary terms. © 2015 Blackwell Verlag GmbH.
Identification of reference genes for RT-qPCR in the Antarctic moss Sanionia uncinata under abiotic stress conditions

PubMed Central

Park, Mira; Hong, Soon Gyu; Park, Hyun; Lee, Byeong-ha

2018-01-01

Sanionia uncinata is a dominant moss species in the maritime Antarctic. Due to its high adaptability to harsh environments, this extremophile plant has been considered a good target for studying the molecular adaptation mechanisms of plants to a variety of environmental stresses. Despite the importance of S. uncinata as a representative Antarctic plant species for the identification and characterization of genes associated with abiotic stress tolerance, suitable reference genes, which are critical for RT-qPCR analyses, have not yet been identified. In this report, 11 traditionally used and 6 novel candidate reference genes were selected from transcriptome data of S. uncinata and the expression stability of these genes was evaluated under various abiotic stress conditions using three statistical algorithms; geNorm, NormFinder, and BestKeeper. The stability ranking analysis selected the best reference genes depending on the stress conditions. Among the 17 candidates, the most stable references were POB1 and UFD2 for cold stress, POB1 and AKB for drought treatment, and UFD2 and AKB for the field samples from a different water contents in Antarctica. Overall, novel genes POB1 and AKB were the most reliable references across all samples, irrespective of experimental conditions. In addition, 6 novel candidate genes including AKB, POB1 and UFD2, were more stable than the housekeeping genes traditionally used for internal controls, indicating that transcriptome data can be useful for identifying novel robust normalizers. The reference genes validated in this study will be useful for improving the accuracy of RT-qPCR analysis for gene expression studies of S. uncinata in Antarctica and for further functional genomic analysis of bryophytes. PMID:29920565
Identification of Differentially Expressed Genes in Blood Cells of Narcolepsy Patients

PubMed Central

Tanaka, Susumu; Honda, Yutaka; Honda, Makoto

2007-01-01

Study Objective: A close association between the human leukocyte antigen (HLA)-DRB1*1501/DQB1*0602 and abnormalities in some inflammatory cytokines have been demonstrated in narcolepsy. Specific alterations in the immune system have been suggested to occur in this disorder. We attempted to identify alterations in gene expression underlying the abnormalities in the blood cells of narcoleptic patients. Designs: Total RNA from 12 narcolepsy-cataplexy patients and from 12 age- and sex-matched healthy controls were pooled. The pooled samples were initially screened for candidate genes for narcolepsy by differential display analysis using annealing control primers (ACP). The second screening of the samples was carried out by semiquantitative PCR using gene-specific primers. Finally, the expression levels of the candidate genes were further confirmed by quantitative real-time PCR using a new set of samples (20 narcolepsy-cataplexy patients and 20 healthy controls). Results: The second screening revealed differential expression of 4 candidate genes. Among them, MX2 was confirmed as a significantly down-regulated gene in the white blood cells of narcoleptic patients by quantitative real-time PCR. Conclusion: We found the MX2 gene to be significantly less expressed in comparison with normal subjects in the white blood cells of narcoleptic patients. This gene is relevant to the immune system. Although differential display analysis using ACP technology has a limitation in that it does not help in determining the functional mechanism underlying sleep/wakefulness dysregulation, it is useful for identifying novel genetic factors related to narcolepsy, such as HLA molecules. Further studies are required to explore the functional relationship between the MX2 gene and narcolepsy pathophysiology. Citation: Tanaka S; Honda Y; Honda M. Identification of differentially expressed genes in blood cells of narcolepsy patients. SLEEP 2007;30(8):974-979. PMID:17702266
Google Goes Cancer: Improving Outcome Prediction for Cancer Patients by Network-Based Ranking of Marker Genes

PubMed Central

Roy, Janine; Aust, Daniela; Knösel, Thomas; Rümmele, Petra; Jahnke, Beatrix; Hentrich, Vera; Rückert, Felix; Niedergethmann, Marco; Weichert, Wilko; Bahra, Marcus; Schlitt, Hans J.; Settmacher, Utz; Friess, Helmut; Büchler, Markus; Saeger, Hans-Detlev; Schroeder, Michael; Pilarsky, Christian; Grützmann, Robert

2012-01-01

Predicting the clinical outcome of cancer patients based on the expression of marker genes in their tumors has received increasing interest in the past decade. Accurate predictors of outcome and response to therapy could be used to personalize and thereby improve therapy. However, state of the art methods used so far often found marker genes with limited prediction accuracy, limited reproducibility, and unclear biological relevance. To address this problem, we developed a novel computational approach to identify genes prognostic for outcome that couples gene expression measurements from primary tumor samples with a network of known relationships between the genes. Our approach ranks genes according to their prognostic relevance using both expression and network information in a manner similar to Google's PageRank. We applied this method to gene expression profiles which we obtained from 30 patients with pancreatic cancer, and identified seven candidate marker genes prognostic for outcome. Compared to genes found with state of the art methods, such as Pearson correlation of gene expression with survival time, we improve the prediction accuracy by up to 7%. Accuracies were assessed using support vector machine classifiers and Monte Carlo cross-validation. We then validated the prognostic value of our seven candidate markers using immunohistochemistry on an independent set of 412 pancreatic cancer samples. Notably, signatures derived from our candidate markers were independently predictive of outcome and superior to established clinical prognostic factors such as grade, tumor size, and nodal status. As the amount of genomic data of individual tumors grows rapidly, our algorithm meets the need for powerful computational approaches that are key to exploit these data for personalized cancer therapies in clinical practice. PMID:22615549
Integrated microarray and ChIP analysis identifies multiple Foxa2 dependent target genes in the notochord.

PubMed

Tamplin, Owen J; Cox, Brian J; Rossant, Janet

2011-12-15

The node and notochord are key tissues required for patterning of the vertebrate body plan. Understanding the gene regulatory network that drives their formation and function is therefore important. Foxa2 is a key transcription factor at the top of this genetic hierarchy and finding its targets will help us to better understand node and notochord development. We performed an extensive microarray-based gene expression screen using sorted embryonic notochord cells to identify early notochord-enriched genes. We validated their specificity to the node and notochord by whole mount in situ hybridization. This provides the largest available resource of notochord-expressed genes, and therefore candidate Foxa2 target genes in the notochord. Using existing Foxa2 ChIP-seq data from adult liver, we were able to identify a set of genes expressed in the notochord that had associated regions of Foxa2-bound chromatin. Given that Foxa2 is a pioneer transcription factor, we reasoned that these sites might represent notochord-specific enhancers. Candidate Foxa2-bound regions were tested for notochord specific enhancer function in a zebrafish reporter assay and 7 novel notochord enhancers were identified. Importantly, sequence conservation or predictive models could not have readily identified these regions. Mutation of putative Foxa2 binding elements in two of these novel enhancers abrogated reporter expression and confirmed their Foxa2 dependence. The combination of highly specific gene expression profiling and genome-wide ChIP analysis is a powerful means of understanding developmental pathways, even for small cell populations such as the notochord. Copyright © 2011 Elsevier Inc. All rights reserved.
QTL-seq for rapid identification of candidate genes for flowering time in broccoli × cabbage.

PubMed

Shu, Jinshuai; Liu, Yumei; Zhang, Lili; Li, Zhansheng; Fang, Zhiyuan; Yang, Limei; Zhuang, Mu; Zhang, Yangyong; Lv, Honghao

2018-04-01

A major QTL controlling early flowering in broccoli × cabbage was identified by marker analysis and next-generation sequencing, corresponding to GRF6 gene conditioning flowering time in Arabidopsis. Flowering is an important agronomic trait for hybrid production in broccoli and cabbage, but the genetic mechanism underlying this process is unknown. In this study, segregation analysis with BC 1 P1, BC 1 P2, F 2 , and F 2:3 populations derived from a cross between two inbred lines "195" (late-flowering) and "93219" (early flowering) suggested that flowering time is a quantitative trait. Next, employing a next-generation sequencing-based whole-genome QTL-seq strategy, we identified a major genomic region harboring a robust flowering time QTL using an F 2 mapping population, designated Ef2.1 on cabbage chromosome 2 for early flowering. Ef2.1 was further validated by indel (insertion or deletion) marker-based classical QTL mapping, explaining 51.5% (LOD = 37.67) and 54.0% (LOD = 40.5) of the phenotypic variation in F 2 and F 2:3 populations, respectively. Combined QTL-seq and classical QTL analysis narrowed down Ef1.1 to a 228-kb genomic region containing 29 genes. A cabbage gene, Bol024659, was identified in this region, which is a homolog of GRF6, a major gene regulating flowering in Arabidopsis, and was designated BolGRF6. qRT-PCR study of the expression level of BolGRF6 revealed significantly higher expression in the early flowering genotypes. Taken together, our results provide support for BolGRF6 as a possible candidate gene for early flowering in the broccoli line 93219. The identified candidate genomic regions and genes may be useful for molecular breeding to improve broccoli and cabbage flowering times.
Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma

PubMed Central

Chambers, John C; Zhang, Weihua; Sehmi, Joban; Li, Xinzhong; Wass, Mark N; Van der Harst, Pim; Holm, Hilma; Sanna, Serena; Kavousi, Maryam; Baumeister, Sebastian E; Coin, Lachlan J; Deng, Guohong; Gieger, Christian; Heard-Costa, Nancy L; Hottenga, Jouke-Jan; Kühnel, Brigitte; Kumar, Vinod; Lagou, Vasiliki; Liang, Liming; Luan, Jian’an; Vidal, Pedro Marques; Leach, Irene Mateo; O’Reilly, Paul F; Peden, John F; Rahmioglu, Nilufer; Soininen, Pasi; Speliotes, Elizabeth K; Yuan, Xin; Thorleifsson, Gudmar; Alizadeh, Behrooz Z; Atwood, Larry D; Borecki, Ingrid B; Brown, Morris J; Charoen, Pimphen; Cucca, Francesco; Das, Debashish; de Geus, Eco J C; Dixon, Anna L; Döring, Angela; Ehret, Georg; Eyjolfsson, Gudmundur I; Farrall, Martin; Forouhi, Nita G; Friedrich, Nele; Goessling, Wolfram; Gudbjartsson, Daniel F; Harris, Tamara B; Hartikainen, Anna-Liisa; Heath, Simon; Hirschfield, Gideon M; Hofman, Albert; Homuth, Georg; Hyppönen, Elina; Janssen, Harry L A; Johnson, Toby; Kangas, Antti J; Kema, Ido P; Kühn, Jens P; Lai, Sandra; Lathrop, Mark; Lerch, Markus M; Li, Yun; Liang, T Jake; Lin, Jing-Ping; Loos, Ruth J F; Martin, Nicholas G; Moffatt, Miriam F; Montgomery, Grant W; Munroe, Patricia B; Musunuru, Kiran; Nakamura, Yusuke; O’Donnell, Christopher J; Olafsson, Isleifur; Penninx, Brenda W; Pouta, Anneli; Prins, Bram P; Prokopenko, Inga; Puls, Ralf; Ruokonen, Aimo; Savolainen, Markku J; Schlessinger, David; Schouten, Jeoffrey N L; Seedorf, Udo; Sen-Chowdhry, Srijita; Siminovitch, Katherine A; Smit, Johannes H; Spector, Timothy D; Tan, Wenting; Teslovich, Tanya M; Tukiainen, Taru; Uitterlinden, Andre G; Van der Klauw, Melanie M; Vasan, Ramachandran S; Wallace, Chris; Wallaschofski, Henri; Wichmann, H-Erich; Willemsen, Gonneke; Würtz, Peter; Xu, Chun; Yerges-Armstrong, Laura M; Abecasis, Goncalo R; Ahmadi, Kourosh R; Boomsma, Dorret I; Caulfield, Mark; Cookson, William O; van Duijn, Cornelia M; Froguel, Philippe; Matsuda, Koichi; McCarthy, Mark I; Meisinger, Christa; Mooser, Vincent; Pietiläinen, Kirsi H; Schumann, Gunter; Snieder, Harold; Sternberg, Michael J E; Stolk, Ronald P; Thomas, Howard C; Thorsteinsdottir, Unnur; Uda, Manuela; Waeber, Gérard; Wareham, Nicholas J; Waterworth, Dawn M; Watkins, Hugh; Whitfield, John B; Witteman, Jacqueline C M; Wolffenbuttel, Bruce H R; Fox, Caroline S; Ala-Korpela, Mika; Stefansson, Kari; Vollenweider, Peter; Völzke, Henry; Schadt, Eric E; Scott, James; Järvelin, Marjo-Riitta; Elliott, Paul; Kooner, Jaspal S

2012-01-01

Concentrations of liver enzymes in plasma are widely used as indicators of liver disease. We carried out a genome-wide association study in 61,089 individuals, identifying 42 loci associated with concentrations of liver enzymes in plasma, of which 32 are new associations (P = 10−8 to P = 10−190). We used functional genomic approaches including metabonomic profiling and gene expression analyses to identify probable candidate genes at these regions. We identified 69 candidate genes, including genes involved in biliary transport (ATP8B1 and ABCB11), glucose, carbohydrate and lipid metabolism (FADS1, FADS2, GCKR, JMJD1C, HNF1A, MLXIPL, PNPLA3, PPP1R3B, SLC2A2 and TRIB1), glycoprotein biosynthesis and cell surface glycobiology (ABO, ASGR1, FUT2, GPLD1 and ST3GAL4), inflammation and immunity (CD276, CDH6, GCKR, HNF1A, HPR, ITGA1, RORA and STAT4) and glutathione metabolism (GSTT1, GSTT2 and GGT), as well as several genes of uncertain or unknown function (including ABHD12, EFHD1, EFNA1, EPHA2, MICAL3 and ZNF827). Our results provide new insight into genetic mechanisms and pathways influencing markers of liver function. PMID:22001757
Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma.

PubMed

Chambers, John C; Zhang, Weihua; Sehmi, Joban; Li, Xinzhong; Wass, Mark N; Van der Harst, Pim; Holm, Hilma; Sanna, Serena; Kavousi, Maryam; Baumeister, Sebastian E; Coin, Lachlan J; Deng, Guohong; Gieger, Christian; Heard-Costa, Nancy L; Hottenga, Jouke-Jan; Kühnel, Brigitte; Kumar, Vinod; Lagou, Vasiliki; Liang, Liming; Luan, Jian'an; Vidal, Pedro Marques; Mateo Leach, Irene; O'Reilly, Paul F; Peden, John F; Rahmioglu, Nilufer; Soininen, Pasi; Speliotes, Elizabeth K; Yuan, Xin; Thorleifsson, Gudmar; Alizadeh, Behrooz Z; Atwood, Larry D; Borecki, Ingrid B; Brown, Morris J; Charoen, Pimphen; Cucca, Francesco; Das, Debashish; de Geus, Eco J C; Dixon, Anna L; Döring, Angela; Ehret, Georg; Eyjolfsson, Gudmundur I; Farrall, Martin; Forouhi, Nita G; Friedrich, Nele; Goessling, Wolfram; Gudbjartsson, Daniel F; Harris, Tamara B; Hartikainen, Anna-Liisa; Heath, Simon; Hirschfield, Gideon M; Hofman, Albert; Homuth, Georg; Hyppönen, Elina; Janssen, Harry L A; Johnson, Toby; Kangas, Antti J; Kema, Ido P; Kühn, Jens P; Lai, Sandra; Lathrop, Mark; Lerch, Markus M; Li, Yun; Liang, T Jake; Lin, Jing-Ping; Loos, Ruth J F; Martin, Nicholas G; Moffatt, Miriam F; Montgomery, Grant W; Munroe, Patricia B; Musunuru, Kiran; Nakamura, Yusuke; O'Donnell, Christopher J; Olafsson, Isleifur; Penninx, Brenda W; Pouta, Anneli; Prins, Bram P; Prokopenko, Inga; Puls, Ralf; Ruokonen, Aimo; Savolainen, Markku J; Schlessinger, David; Schouten, Jeoffrey N L; Seedorf, Udo; Sen-Chowdhry, Srijita; Siminovitch, Katherine A; Smit, Johannes H; Spector, Timothy D; Tan, Wenting; Teslovich, Tanya M; Tukiainen, Taru; Uitterlinden, Andre G; Van der Klauw, Melanie M; Vasan, Ramachandran S; Wallace, Chris; Wallaschofski, Henri; Wichmann, H-Erich; Willemsen, Gonneke; Würtz, Peter; Xu, Chun; Yerges-Armstrong, Laura M; Abecasis, Goncalo R; Ahmadi, Kourosh R; Boomsma, Dorret I; Caulfield, Mark; Cookson, William O; van Duijn, Cornelia M; Froguel, Philippe; Matsuda, Koichi; McCarthy, Mark I; Meisinger, Christa; Mooser, Vincent; Pietiläinen, Kirsi H; Schumann, Gunter; Snieder, Harold; Sternberg, Michael J E; Stolk, Ronald P; Thomas, Howard C; Thorsteinsdottir, Unnur; Uda, Manuela; Waeber, Gérard; Wareham, Nicholas J; Waterworth, Dawn M; Watkins, Hugh; Whitfield, John B; Witteman, Jacqueline C M; Wolffenbuttel, Bruce H R; Fox, Caroline S; Ala-Korpela, Mika; Stefansson, Kari; Vollenweider, Peter; Völzke, Henry; Schadt, Eric E; Scott, James; Järvelin, Marjo-Riitta; Elliott, Paul; Kooner, Jaspal S

2011-10-16

Concentrations of liver enzymes in plasma are widely used as indicators of liver disease. We carried out a genome-wide association study in 61,089 individuals, identifying 42 loci associated with concentrations of liver enzymes in plasma, of which 32 are new associations (P = 10(-8) to P = 10(-190)). We used functional genomic approaches including metabonomic profiling and gene expression analyses to identify probable candidate genes at these regions. We identified 69 candidate genes, including genes involved in biliary transport (ATP8B1 and ABCB11), glucose, carbohydrate and lipid metabolism (FADS1, FADS2, GCKR, JMJD1C, HNF1A, MLXIPL, PNPLA3, PPP1R3B, SLC2A2 and TRIB1), glycoprotein biosynthesis and cell surface glycobiology (ABO, ASGR1, FUT2, GPLD1 and ST3GAL4), inflammation and immunity (CD276, CDH6, GCKR, HNF1A, HPR, ITGA1, RORA and STAT4) and glutathione metabolism (GSTT1, GSTT2 and GGT), as well as several genes of uncertain or unknown function (including ABHD12, EFHD1, EFNA1, EPHA2, MICAL3 and ZNF827). Our results provide new insight into genetic mechanisms and pathways influencing markers of liver function.
TOM: a web-based integrated approach for identification of candidate disease genes.

PubMed

Rossi, Simona; Masotti, Daniele; Nardini, Christine; Bonora, Elena; Romeo, Giovanni; Macii, Enrico; Benini, Luca; Volinia, Stefano

2006-07-01

The massive production of biological data by means of highly parallel devices like microarrays for gene expression has paved the way to new possible approaches in molecular genetics. Among them the possibility of inferring biological answers by querying large amounts of expression data. Based on this principle, we present here TOM, a web-based resource for the efficient extraction of candidate genes for hereditary diseases. The service requires the previous knowledge of at least another gene responsible for the disease and the linkage area, or else of two disease associated genetic intervals. The algorithm uses the information stored in public resources, including mapping, expression and functional databases. Given the queries, TOM will select and list one or more candidate genes. This approach allows the geneticist to bypass the costly and time consuming tracing of genetic markers through entire families and might improve the chance of identifying disease genes, particularly for rare diseases. We present here the tool and the results obtained on known benchmark and on hereditary predisposition to familial thyroid cancer. Our algorithm is available at http://www-micrel.deis.unibo.it/~tom/.
Evidence of linkage and association on chromosome 20 for late-onset Alzheimer disease.

PubMed

Goddard, Katrina A B; Olson, Jane M; Payami, Haydeh; van der Voet, Monique; Kuivaniemi, Helena; Tromp, Gerard

2004-06-01

Recently, we reported evidence of linkage on chromosome 20 for Alzheimer disease (AD) using a novel statistical approach to incorporate covariates (e.g., age, ApoE genotype) into the analysis. These results suggest that very elderly subjects (>85 years), and individuals who carry an epsilon2 allele at the ApoE locus are more likely to be linked to this candidate region. The region on chromosome 20 includes a strong candidate gene, cystatin C (CST3), which has previously been associated with AD in case-control studies. We investigated these findings further by genotyping additional markers to narrow the candidate region, and to identify evidence of linkage disequilibrium as additional support for a susceptibility locus on chromosome 20. We selected 43 elderly sibships (89 subjects) from the NIMH AD Genetics Initiative based on current age older than 84 years, and identified 129 unrelated control subjects who were older than 84 years from the Oregon Brain Aging Study to conduct linkage and association studies in this region. Fourteen additional markers were evaluated, including 4 markers located within or near CST3. We narrowed the candidate region on chromosome 20 to an 11.8-cM region between markers D20S174 and D20S471, which includes the CST3 candidate gene. In addition, we observed evidence of association for markers located near the CST3 candidate gene, with P values between 0.002 and 0.08 for two-locus haplotypes. These results support the presence of a susceptibility locus for AD in the vicinity of CST3 for very elderly subjects with AD.
Exome Sequencing in Suspected Monogenic Dyslipidemias

PubMed Central

Stitziel, Nathan O.; Peloso, Gina M.; Abifadel, Marianne; Cefalu, Angelo B.; Fouchier, Sigrid; Motazacker, M. Mahdi; Tada, Hayato; Larach, Daniel B.; Awan, Zuhier; Haller, Jorge F.; Pullinger, Clive R.; Varret, Mathilde; Rabès, Jean-Pierre; Noto, Davide; Tarugi, Patrizia; Kawashiri, Masa-aki; Nohara, Atsushi; Yamagishi, Masakazu; Risman, Marjorie; Deo, Rahul; Ruel, Isabelle; Shendure, Jay; Nickerson, Deborah A.; Wilson, James G.; Rich, Stephen S.; Gupta, Namrata; Farlow, Deborah N.; Neale, Benjamin M.; Daly, Mark J.; Kane, John P.; Freeman, Mason W.; Genest, Jacques; Rader, Daniel J.; Mabuchi, Hiroshi; Kastelein, John J.P.; Hovingh, G. Kees; Averna, Maurizio R.; Gabriel, Stacey; Boileau, Catherine; Kathiresan, Sekar

2015-01-01

Background Exome sequencing is a promising tool for gene mapping in Mendelian disorders. We utilized this technique in an attempt to identify novel genes underlying monogenic dyslipidemias. Methods and Results We performed exome sequencing on 213 selected family members from 41 kindreds with suspected Mendelian inheritance of extreme levels of low-density lipoprotein (LDL) cholesterol (after candidate gene sequencing excluded known genetic causes for high LDL cholesterol families) or high-density lipoprotein (HDL) cholesterol. We used standard analytic approaches to identify candidate variants and also assigned a polygenic score to each individual in order to account for their burden of common genetic variants known to influence lipid levels. In nine families, we identified likely pathogenic variants in known lipid genes (ABCA1, APOB, APOE, LDLR, LIPA, and PCSK9); however, we were unable to identify obvious genetic etiologies in the remaining 32 families despite follow-up analyses. We identified three factors that limited novel gene discovery: (1) imperfect sequencing coverage across the exome hid potentially causal variants; (2) large numbers of shared rare alleles within families obfuscated causal variant identification; and (3) individuals from 15% of families carried a significant burden of common lipid-related alleles, suggesting complex inheritance can masquerade as monogenic disease. Conclusions We identified the genetic basis of disease in nine of 41 families; however, none of these represented novel gene discoveries. Our results highlight the promise and limitations of exome sequencing as a discovery technique in suspected monogenic dyslipidemias. Considering the confounders identified may inform the design of future exome sequencing studies. PMID:25632026
Identification of candidate genes associated with fibromyalgia susceptibility in southern Spanish women: the al-Ándalus project.

PubMed

Estévez-López, Fernando; Camiletti-Moirón, Daniel; Aparicio, Virginia A; Segura-Jiménez, Víctor; Álvarez-Gallardo, Inmaculada C; Soriano-Maldonado, Alberto; Borges-Cosic, Milkana; Acosta-Manzano, Pedro; Geenen, Rinie; Delgado-Fernández, Manuel; Martínez-González, Luis J; Ruiz, Jonatan R; Álvarez-Cubero, María J

2018-02-27

Candidate-gene studies on fibromyalgia susceptibility often include a small number of single nucleotide polymorphisms (SNPs), which is a limitation. Moreover, there is a paucity of evidence in Europe. Therefore, we compared genotype frequencies of candidate SNPs in a well-characterised sample of Spanish women with fibromyalgia and healthy non-fibromyalgia women. A total of 314 women with a diagnosis of fibromyalgia (cases) and 112 non-fibromyalgia healthy (controls) women participated in this candidate-gene study. Buccal swabs were collected for DNA extraction. Using TaqMan™ OpenArray™, we analysed 61 SNPs of 33 genes related to fibromyalgia susceptibility, symptoms, or potential mechanisms. We observed that the rs841 and rs1799971 GG genotype was more frequently observed in fibromyalgia than in controls (p = 0.04 and p = 0.02, respectively). The rs2097903 AT/TT genotypes were also more often present in the fibromyalgia participants than in their control peers (p = 0.04). There were no differences for the remaining SNPs. We identified, for the first time, associations of the rs841 (guanosine triphosphate cyclohydrolase 1 gene) and rs2097903 (catechol-O-methyltransferase gene) SNPs with higher risk of fibromyalgia susceptibility. We also confirmed that the rs1799971 SNP (opioid receptor μ1 gene) might confer genetic risk of fibromyalgia. We did not adjust for multiple comparisons, which would be too stringent and yield to non-significant differences in the genotype frequencies between cases and controls. Our findings may be biologically meaningful and informative, and should be further investigated in other populations. Of particular interest is to replicate the present study in a larger independent sample to confirm or refute our findings. On the other hand, by including 61 SNPs of 33 candidate-genes with a strong rationale (they were previously investigated in relation to fibromyalgia susceptibility, symptoms or potential mechanisms), the present research is the most comprehensive candidate-gene study on fibromyalgia susceptibility to date.
Transcriptional Profiling and Identification of Heat-Responsive Genes in Perennial Ryegrass by RNA-Sequencing

PubMed Central

Wang, Kehua; Liu, Yanrong; Tian, Jinli; Huang, Kunyong; Shi, Tianran; Dai, Xiaoxia; Zhang, Wanjun

2017-01-01

Perennial ryegrass (Lolium perenne) is one of the most widely used forage and turf grasses in the world due to its desirable agronomic qualities. However, as a cool-season perennial grass species, high temperature is a major factor limiting its performance in warmer and transition regions. In this study, a de novo transcriptome was generated using a cDNA library constructed from perennial ryegrass leaves subjected to short-term heat stress treatment. Then the expression profiling and identification of perennial ryegrass heat response genes by digital gene expression analyses was performed. The goal of this work was to produce expression profiles of high temperature stress responsive genes in perennial ryegrass leaves and further identify the potentially important candidate genes with altered levels of transcript, such as those genes involved in transcriptional regulation, antioxidant responses, plant hormones and signal transduction, and cellular metabolism. The de novo assembly of perennial ryegrass transcriptome in this study obtained more total and annotated unigenes compared to previously published ones. Many DEGs identified were genes that are known to respond to heat stress in plants, including HSFs, HSPs, and antioxidant related genes. In the meanwhile, we also identified four gene candidates mainly involved in C4 carbon fixation, and one TOR gene. Their exact roles in plant heat stress response need to dissect further. This study would be important by providing the gene resources for improving heat stress tolerance in both perennial ryegrass and other cool-season perennial grass plants. PMID:28680431
Refining the Candidate Environment: Interpersonal Stress, the Serotonin Transporter Polymorphism, and Gene-Environment Interactions in Major Depression.

PubMed

Vrshek-Schallhorn, Suzanne; Mineka, Susan; Zinbarg, Richard E; Craske, Michelle G; Griffith, James W; Sutton, Jonathan; Redei, Eva E; Wolitzky-Taylor, Kate; Hammen, Constance; Adam, Emma K

2014-05-01

Meta-analytic evidence supports a gene-environment (G×E) interaction between life stress and the serotonin transporter polymorphism (5-HTTLPR) on depression, but few studies have examined factors that influence detection of this effect, despite years of inconsistent results. We propose that the "candidate environment" (akin to a candidate gene) is key. Theory and evidence implicate major stressful life events (SLEs)-particularly major interpersonal SLEs-as well as chronic family stress. Participants ( N = 400) from the Youth Emotion Project (which began with 627 high school juniors oversampled for high neuroticism) completed up to five annual diagnostic and life stress interviews and provided DNA samples. A significant G×E effect for major SLEs and S -carrier genotype was accounted for significantly by major interpersonal SLEs but not significantly by major non-interpersonal SLEs. S -carrier genotype and chronic family stress also significantly interacted. Identifying such candidate environments may facilitate future G×E research in depression and psychopathology more broadly.
Comparative molecular analyses of select pH- and osmoregulatory genes in three freshwater crayfish Cherax quadricarinatus, C. destructor and C. cainii.

PubMed

Ali, Muhammad Y; Pavasovic, Ana; Dammannagoda, Lalith K; Mather, Peter B; Prentis, Peter J

2017-01-01

Systemic acid-base balance and osmotic/ionic regulation in decapod crustaceans are in part maintained by a set of transport-related enzymes such as carbonic anhydrase (CA), Na + /K + -ATPase (NKA), H + -ATPase (HAT), Na + /K + /2Cl - cotransporter (NKCC), Na + /Cl - /HCO[Formula: see text] cotransporter (NBC), Na + /H + exchanger (NHE), Arginine kinase (AK), Sarcoplasmic Ca +2 -ATPase (SERCA) and Calreticulin (CRT). We carried out a comparative molecular analysis of these genes in three commercially important yet eco-physiologically distinct freshwater crayfish , Cherax quadricarinatus, C. destructor and C. cainii , with the aim to identify mutations in these genes and determine if observed patterns of mutations were consistent with the action of natural selection. We also conducted a tissue-specific expression analysis of these genes across seven different organs, including gills, hepatopancreas, heart, kidney, liver, nerve and testes using NGS transcriptome data. The molecular analysis of the candidate genes revealed a high level of sequence conservation across the three Cherax sp. Hyphy analysis revealed that all candidate genes showed patterns of molecular variation consistent with neutral evolution. The tissue-specific expression analysis showed that 46% of candidate genes were expressed in all tissue types examined, while approximately 10% of candidate genes were only expressed in a single tissue type. The largest number of genes was observed in nerve (84%) and gills (78%) and the lowest in testes (66%). The tissue-specific expression analysis also revealed that most of the master genes regulating pH and osmoregulation (CA, NKA, HAT, NKCC, NBC, NHE) were expressed in all tissue types indicating an important physiological role for these genes outside of osmoregulation in other tissue types. The high level of sequence conservation observed in the candidate genes may be explained by the important role of these genes as well as potentially having a number of other basic physiological functions in different tissue types.
Sarcoidosis Related Novel Candidate Genes Identified by Multi-Omics Integrative Analyses.

PubMed

Hočevar, Keli; Maver, Aleš; Kunej, Tanja; Peterlin, Borut

2018-05-01

Sarcoidosis is a multifactorial systemic disease characterized by granulomatous inflammation and greatly impacting on global public health. The etiology and mechanisms of sarcoidosis are not fully understood. Recent high-throughput biological research has generated vast amounts of multi-omics big data on sarcoidosis, but their significance remains to be determined. We sought to identify novel candidate regions, and genes consistently altered in heterogeneous omics studies so as to reveal the underlying molecular mechanisms. We conducted a comprehensive integrative literature analysis on global data on sarcoidosis, including genomic, transcriptomic, proteomic, and phenomic studies. We performed positional integration analysis of 38 eligible datasets originating from 17 different biological layers. Using the integration interval length of 50 kb, we identified 54 regions reaching significance value p ≤ 0.0001 and 15 regions with significance value p ≤ 0.00001, when applying more stringent criteria. Secondary literature analysis of the top 20 regions, with the most significant accumulation of signals, revealed several novel candidate genes for which associations with sarcoidosis have not yet been established, but have considerable support for their involvement based on omic data. These new plausible candidate genes include NELFE, CFB, EGFL7, AGPAT2, FKBPL, NRC3, and NEU1. Furthermore, annotated data were prepared to enable custom visualization and browsing of these sarcoidosis related omics evidence in the University of California Santa Cruz (UCSC) Genome Browser. Further multi-omics approaches are called for sarcoidosis biomarkers and diagnostic and therapeutic innovation. Our approach for harnessing multi-omics data and the findings presented herein reflect important steps toward understanding the etiology and underlying pathological mechanisms of sarcoidosis.
Restriction site polymorphism-based candidate gene mapping for seedling drought tolerance in cowpea [Vigna unguiculata (L.) Walp.].

PubMed

Muchero, Wellington; Ehlers, Jeffrey D; Roberts, Philip A

2010-02-01

Quantitative trait loci (QTL) studies provide insight into the complexity of drought tolerance mechanisms. Molecular markers used in these studies also allow for marker-assisted selection (MAS) in breeding programs, enabling transfer of genetic factors between breeding lines without complete knowledge of their exact nature. However, potential for recombination between markers and target genes limit the utility of MAS-based strategies. Candidate gene mapping offers an alternative solution to identify trait determinants underlying QTL of interest. Here, we used restriction site polymorphisms to investigate co-location of candidate genes with QTL for seedling drought stress-induced premature senescence identified previously in cowpea. Genomic DNA isolated from 113 F(2:8) RILs of drought-tolerant IT93K503-1 and drought susceptible CB46 genotypes was digested with combinations of EcoR1 and HpaII, Mse1, or Msp1 restriction enzymes and amplified with primers designed from 13 drought-responsive cDNAs. JoinMap 3.0 and MapQTL 4.0 software were used to incorporate polymorphic markers onto the AFLP map and to analyze their association with the drought response QTL. Seven markers co-located with peaks of previously identified QTL. Isolation, sequencing, and blast analysis of these markers confirmed their significant homology with drought or other abiotic stress-induced expressed sequence tags (EST) from cowpea and other plant systems. Further, homology with coding sequences for a multidrug resistance protein 3 and a photosystem I assembly protein ycf3 was revealed in two of these candidates. These results provide a platform for the identification and characterization of genetic trait determinants underlying seedling drought tolerance in cowpea.

An Arrayed Genome-Scale Lentiviral-Enabled Short Hairpin RNA Screen Identifies Lethal and Rescuer Gene Candidates

PubMed Central

Bhinder, Bhavneet; Antczak, Christophe; Ramirez, Christina N.; Shum, David; Liu-Sullivan, Nancy; Radu, Constantin; Frattini, Mark G.

2013-01-01

Abstract RNA interference technology is becoming an integral tool for target discovery and validation.; With perhaps the exception of only few studies published using arrayed short hairpin RNA (shRNA) libraries, most of the reports have been either against pooled siRNA or shRNA, or arrayed siRNA libraries. For this purpose, we have developed a workflow and performed an arrayed genome-scale shRNA lethality screen against the TRC1 library in HeLa cells. The resulting targets would be a valuable resource of candidates toward a better understanding of cellular homeostasis. Using a high-stringency hit nomination method encompassing criteria of at least three active hairpins per gene and filtered for potential off-target effects (OTEs), referred to as the Bhinder–Djaballah analysis method, we identified 1,252 lethal and 6 rescuer gene candidates, knockdown of which resulted in severe cell death or enhanced growth, respectively. Cross referencing individual hairpins with the TRC1 validated clone database, 239 of the 1,252 candidates were deemed independently validated with at least three validated clones. Through our systematic OTE analysis, we have identified 31 microRNAs (miRNAs) in lethal and 2 in rescuer genes; all having a seed heptamer mimic in the corresponding shRNA hairpins and likely cause of the OTE observed in our screen, perhaps unraveling a previously unknown plausible essentiality of these miRNAs in cellular viability. Taken together, we report on a methodology for performing large-scale arrayed shRNA screens, a comprehensive analysis method to nominate high-confidence hits, and a performance assessment of the TRC1 library highlighting the intracellular inefficiencies of shRNA processing in general. PMID:23198867
Genetic dissection of growth, wood basic density and gene expression in interspecific backcrosses of Eucalyptus grandis and E. urophylla

PubMed Central

2012-01-01

Background F1 hybrid clones of Eucalyptus grandis and E. urophylla are widely grown for pulp and paper production in tropical and subtropical regions. Volume growth and wood quality are priority objectives in Eucalyptus tree improvement. The molecular basis of quantitative variation and trait expression in eucalypt hybrids, however, remains largely unknown. The recent availability of a draft genome sequence (http://www.phytozome.net) and genome-wide genotyping platforms, combined with high levels of genetic variation and high linkage disequilibrium in hybrid crosses, greatly facilitate the detection of quantitative trait loci (QTLs) as well as underlying candidate genes for growth and wood property traits. In this study, we used Diversity Arrays Technology markers to assess the genetic architecture of volume growth (diameter at breast height, DBH) and wood basic density in four-year-old progeny of an interspecific backcross pedigree of E. grandis and E. urophylla. In addition, we used Illumina RNA-Seq expression profiling in the E. urophylla backcross family to identify cis- and trans-acting polymorphisms (eQTLs) affecting transcript abundance of genes underlying QTLs for wood basic density. Results A total of five QTLs for DBH and 12 for wood basic density were identified in the two backcross families. Individual QTLs for DBH and wood basic density explained 3.1 to 12.2% of phenotypic variation. Candidate genes underlying QTLs for wood basic density on linkage groups 8 and 9 were found to share trans-acting eQTLs located on linkage groups 4 and 10, which in turn coincided with QTLs for wood basic density suggesting that these QTLs represent segregating components of an underlying transcriptional network. Conclusion This is the first demonstration of the use of next-generation expression profiling to quantify transcript abundance in a segregating tree population and identify candidate genes potentially affecting wood property variation. The QTLs identified in this study provide a resource for identifying candidate genes and developing molecular markers for marker-assisted breeding of volume growth and wood basic density. Our results suggest that integrated analysis of transcript and trait variation in eucalypt hybrids can be used to dissect the molecular basis of quantitative variation in wood property traits. PMID:22817272
Rice-arsenate interactions in hydroponics: a three-gene model for tolerance.

PubMed

Norton, Gareth J; Nigar, Meher; Williams, Paul N; Dasgupta, Tapash; Meharg, Andrew A; Price, Adam H

2008-01-01

In this study, the genetic mapping of the tolerance of root growth to 13.3 muM arsenate [As(V)] using the BalaxAzucena population is improved, and candidate genes for further study are identified. A remarkable three-gene model of tolerance is advanced, which appears to involve epistatic interaction between three major genes, two on chromosome 6 and one on chromosome 10. Any combination of two of these genes inherited from the tolerant parent leads to the plant having tolerance. Lists of potential positional candidate genes are presented. These are then refined using whole genome transcriptomics data and bioinformatics. Physiological evidence is also provided that genes related to phosphate transport are unlikely to be behind the genetic loci conferring tolerance. These results offer testable hypotheses for genes related to As(V) tolerance that might offer strategies for mitigating arsenic (As) accumulation in consumed rice.
Rice–arsenate interactions in hydroponics: a three-gene model for tolerance

PubMed Central

Norton, Gareth J.; Nigar, Meher; Dasgupta, Tapash; Meharg, Andrew A.; Price, Adam H.

2008-01-01

In this study, the genetic mapping of the tolerance of root growth to 13.3 μM arsenate [As(V)] using the Bala×Azucena population is improved, and candidate genes for further study are identified. A remarkable three-gene model of tolerance is advanced, which appears to involve epistatic interaction between three major genes, two on chromosome 6 and one on chromosome 10. Any combination of two of these genes inherited from the tolerant parent leads to the plant having tolerance. Lists of potential positional candidate genes are presented. These are then refined using whole genome transcriptomics data and bioinformatics. Physiological evidence is also provided that genes related to phosphate transport are unlikely to be behind the genetic loci conferring tolerance. These results offer testable hypotheses for genes related to As(V) tolerance that might offer strategies for mitigating arsenic (As) accumulation in consumed rice. PMID:18453529
[BIOINFORMATIC SEARCH AND PHYLOGENETIC ANALYSIS OF THE CELLULOSE SYNTHASE GENES OF FLAX (LINUM USITATISSIMUM)].

PubMed

Pydiura, N A; Bayer, G Ya; Galinousky, D V; Yemets, A I; Pirko, Ya V; Podvitski, T A; Anisimova, N V; Khotyleva, L V; Kilchevsky, A V; Blume, Ya B

2015-01-01

A bioinformatic search of sequences encoding cellulose synthase genes in the flax genome, and their comparison to dicots orthologs was carried out. The analysis revealed 32 cellulose synthase gene candidates, 16 of which are highly likely to encode cellulose synthases, and the remaining 16--cellulose synthase-like proteins (Csl). Phylogenetic analysis of gene products of cellulose synthase genes allowed distinguishing 6 groups of cellulose synthase genes of different classes: CesA1/10, CesA3, CesA4, CesA5/6/2/9, CesA7 and CesA8. Paralogous sequences within classes CesA1/10 and CesA5/6/2/9 which are associated with the primary cell wall formation are characterized by a greater similarity within these classes than orthologous sequences. Whereas the genes controlling the biosynthesis of secondary cell wall cellulose form distinct clades: CesA4, CesA7, and CesA8. The analysis of 16 identified flax cellulose synthase gene candidates shows the presence of at least 12 different cellulose synthase gene variants in flax genome which are represented in all six clades of cellulose synthase genes. Thus, at this point genes of all ten known cellulose synthase classes are identify in flax genome, but their correct classification requires additional research.
Identification of candidate genes for familial early-onset essential tremor.

PubMed

Liu, Xinmin; Hernandez, Nora; Kisselev, Sergey; Floratos, Aris; Sawle, Ashley; Ionita-Laza, Iuliana; Ottman, Ruth; Louis, Elan D; Clark, Lorraine N

2016-07-01

Essential tremor (ET) is one of the most common causes of tremor in humans. Despite its high heritability and prevalence, few susceptibility genes for ET have been identified. To identify ET genes, whole-exome sequencing was performed in 37 early-onset ET families with an autosomal-dominant inheritance pattern. We identified candidate genes for follow-up functional studies in five ET families. In two independent families, we identified variants predicted to affect function in the nitric oxide (NO) synthase 3 gene (NOS3) that cosegregated with disease. NOS3 is highly expressed in the central nervous system (including cerebellum), neurons and endothelial cells, and is one of three enzymes that converts l-arginine to the neurotransmitter NO. In one family, a heterozygous variant, c.46G>A (p.(Gly16Ser)), in NOS3, was identified in three affected ET cases and was absent in an unaffected family member; and in a second family, a heterozygous variant, c.164C>T (p.(Pro55Leu)), was identified in three affected ET cases (dizygotic twins and their mother). Both variants result in amino-acid substitutions of highly conserved amino-acid residues that are predicted to be deleterious and damaging by in silico analysis. In three independent families, variants predicted to affect function were also identified in other genes, including KCNS2 (KV9.2), HAPLN4 (BRAL2) and USP46. These genes are highly expressed in the cerebellum and Purkinje cells, and influence function of the gamma-amino butyric acid (GABA)-ergic system. This is in concordance with recent evidence that the pathophysiological process in ET involves cerebellar dysfunction and possibly cerebellar degeneration with a reduction in Purkinje cells, and a decrease in GABA-ergic tone.
Unsupervised, statistically-based systems biology approach for unraveling the genetics of complex traits: A demonstration with ethanol metabolism.

PubMed

Lusk, Ryan; Saba, Laura M; Vanderlinden, Lauren A; Zidek, Vaclav; Silhavy, Jan; Pravenec, Michal; Hoffman, Paula L; Tabakoff, Boris

2018-04-24

A statistical pipeline was developed and used for determining candidate genes and candidate gene co-expression networks involved in two alcohol (i.e., ethanol) metabolism phenotypes, namely alcohol clearance and acetate area under the curve (AUC) in a recombinant inbred (HXB/BXH) rat panel. The approach was also used to provide an indication of how ethanol metabolism can impact the normal function of the identified networks. RNA was extracted from alcohol-naïve liver tissue of 30 strains of HXB/BXH recombinant inbred rats. The reconstructed transcripts were quantitated and data was used to construct gene co-expression modules and networks. A separate group of rats, comprising the same 30 strains, were injected with ethanol (2 gm/kg) for measurement of blood ethanol and acetate levels. These data were used for QTL analysis of the rate of ethanol disappearance and circulating acetate levels. The analysis pipeline required calculation of the module eigengene values, the correction of these values with ethanol metabolism rates and acetate levels across the rat strains and the determination of the eigengene QTLs. For a module to be considered a candidate for determining phenotype, the module eigengene values had to have significant correlation with the strain phenotypic values and the module eigengene QTLs had to overlap the phenotypic QTLs. Of the 658 transcript co-expression modules generated from liver RNA sequencing data, a single module satisfied all criteria for being a candidate for determining the alcohol clearance trait. This module contained two alcohol dehydrogenase genes, including the gene whose product was previously shown to be responsible for the majority of alcohol elimination in the rat. This module was also the only module identified as a candidate for influencing circulating acetate levels. This module was also linked to the process of generation and utilization of retinoic acid as related to the autonomous immune response. We propose that our analytical pipeline can successfully identify genetic regions and transcripts which predispose a particular phenotype and our analysis provides functional context for co-expression module components. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
The genomic architecture and association genetics of adaptive characters using a candidate SNP approach in boreal black spruce

PubMed Central

2013-01-01

Background The genomic architecture of adaptive traits remains poorly understood in non-model plants. Various approaches can be used to bridge this gap, including the mapping of quantitative trait loci (QTL) in pedigrees, and genetic association studies in non-structured populations. Here we present results on the genomic architecture of adaptive traits in black spruce, which is a widely distributed conifer of the North American boreal forest. As an alternative to the usual candidate gene approach, a candidate SNP approach was developed for association testing. Results A genetic map containing 231 gene loci was used to identify QTL that were related to budset timing and to tree height assessed over multiple years and sites. Twenty-two unique genomic regions were identified, including 20 that were related to budset timing and 6 that were related to tree height. From results of outlier detection and bulk segregant analysis for adaptive traits using DNA pool sequencing of 434 genes, 52 candidate SNPs were identified and subsequently tested in genetic association studies for budset timing and tree height assessed over multiple years and sites. A total of 34 (65%) SNPs were significantly associated with budset timing, or tree height, or both. Although the percentages of explained variance (PVE) by individual SNPs were small, several significant SNPs were shared between sites and among years. Conclusions The sharing of genomic regions and significant SNPs between budset timing and tree height indicates pleiotropic effects. Significant QTLs and SNPs differed quite greatly among years, suggesting that different sets of genes for the same characters are involved at different stages in the tree’s life history. The functional diversity of genes carrying significant SNPs and low observed PVE further indicated that a large number of polymorphisms are involved in adaptive genetic variation. Accordingly, for undomesticated species such as black spruce with natural populations of large effective size and low linkage disequilibrium, efficient marker systems that are predictive of adaptation should require the survey of large numbers of SNPs. Candidate SNP approaches like the one developed in the present study could contribute to reducing these numbers. PMID:23724860
Convergent functional genomics of schizophrenia: from comprehensive understanding to genetic risk prediction

PubMed Central

Ayalew, M; Le-Niculescu, H; Levey, D F; Jain, N; Changala, B; Patel, S D; Winiger, E; Breier, A; Shekhar, A; Amdur, R; Koller, D; Nurnberger, J I; Corvin, A; Geyer, M; Tsuang, M T; Salomon, D; Schork, N J; Fanous, A H; O'Donovan, M C; Niculescu, A B

2012-01-01

We have used a translational convergent functional genomics (CFG) approach to identify and prioritize genes involved in schizophrenia, by gene-level integration of genome-wide association study data with other genetic and gene expression studies in humans and animal models. Using this polyevidence scoring and pathway analyses, we identify top genes (DISC1, TCF4, MBP, MOBP, NCAM1, NRCAM, NDUFV2, RAB18, as well as ADCYAP1, BDNF, CNR1, COMT, DRD2, DTNBP1, GAD1, GRIA1, GRIN2B, HTR2A, NRG1, RELN, SNAP-25, TNIK), brain development, myelination, cell adhesion, glutamate receptor signaling, G-protein–coupled receptor signaling and cAMP-mediated signaling as key to pathophysiology and as targets for therapeutic intervention. Overall, the data are consistent with a model of disrupted connectivity in schizophrenia, resulting from the effects of neurodevelopmental environmental stress on a background of genetic vulnerability. In addition, we show how the top candidate genes identified by CFG can be used to generate a genetic risk prediction score (GRPS) to aid schizophrenia diagnostics, with predictive ability in independent cohorts. The GRPS also differentiates classic age of onset schizophrenia from early onset and late-onset disease. We also show, in three independent cohorts, two European American and one African American, increasing overlap, reproducibility and consistency of findings from single-nucleotide polymorphisms to genes, then genes prioritized by CFG, and ultimately at the level of biological pathways and mechanisms. Finally, we compared our top candidate genes for schizophrenia from this analysis with top candidate genes for bipolar disorder and anxiety disorders from previous CFG analyses conducted by us, as well as findings from the fields of autism and Alzheimer. Overall, our work maps the genomic and biological landscape for schizophrenia, providing leads towards a better understanding of illness, diagnostics and therapeutics. It also reveals the significant genetic overlap with other major psychiatric disorder domains, suggesting the need for improved nosology. PMID:22584867
Systematic identification and validation of candidate genes for detection of circulating tumor cells in peripheral blood specimens of colorectal cancer patients.

PubMed

Findeisen, Peter; Röckel, Matthias; Nees, Matthias; Röder, Christian; Kienle, Peter; Von Knebel Doeberitz, Magnus; Kalthoff, Holger; Neumaier, Michael

2008-11-01

The presence of tumor cells in peripheral blood is being regarded increasingly as a clinically relevant prognostic factor for colorectal cancer patients. Current molecular methods are very sensitive but due to low specificity their diagnostic value is limited. This study was undertaken in order to systematically identify and validate new colorectal cancer (CRC) marker genes for improved detection of minimal residual disease in peripheral blood mononuclear cells of colorectal cancer patients. Marker genes with upregulated gene expression in colorectal cancer tissue and cell lines were identified using microarray experiments and publicly available gene expression data. A systematic iterative approach was used to reduce a set of 346 candidate genes, reportedly associated with CRC to a selection of candidate genes that were then further validated by relative quantitative real-time RT-PCR. Analytical sensitivity of RT-PCR assays was determined by spiking experiments with CRC cells. Diagnostic sensitivity as well as specificity was tested on a control group consisting of 18 CRC patients compared to 12 individuals without malignant disease. From a total of 346-screened genes only serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 5 (SERPINB5) showed significantly elevated transcript levels in peripheral venous blood specimens of tumor patients when compared to the nonmalignant control group. These results were confirmed by analysis of an enlarged collective consisting of 63 CRC patients and 36 control individuals without malignant disease. In conclusion SERPINB5 seems to be a promising marker for detection of circulating tumor cells in peripheral blood of colorectal cancer patients.
A whole genome SNP genotyping by DNA microarray and candidate gene association study for kidney stone disease

PubMed Central

2014-01-01

Background Kidney stone disease (KSD) is a complex disorder with unknown etiology in majority of the patients. Genetic and environmental factors may cause the disease. In the present study, we used DNA microarray to genotype single nucleotide polymorphisms (SNP) and performed candidate gene association analysis to determine genetic variations associated with the disease. Methods A whole genome SNP genotyping by DNA microarray was initially conducted in 101 patients and 105 control subjects. A set of 104 candidate genes reported to be involved in KSD, gathered from public databases and candidate gene association study databases, were evaluated for their variations associated with KSD. Results Altogether 82 SNPs distributed within 22 candidate gene regions showed significant differences in SNP allele frequencies between the patient and control groups (P < 0.05). Of these, 4 genes including BGLAP, AHSG, CD44, and HAO1, encoding osteocalcin, fetuin-A, CD44-molecule and glycolate oxidase 1, respectively, were further assessed for their associations with the disease because they carried high proportion of SNPs with statistical differences of allele frequencies between the patient and control groups within the gene. The total of 26 SNPs showed significant differences of allele frequencies between the patient and control groups and haplotypes associated with disease risk were identified. The SNP rs759330 located 144 bp downstream of BGLAP where it is a predicted microRNA binding site at 3′UTR of PAQR6 – a gene encoding progestin and adipoQ receptor family member VI, was genotyped in 216 patients and 216 control subjects and found to have significant differences in its genotype and allele frequencies (P = 0.0007, OR 2.02 and P = 0.0001, OR 2.02, respectively). Conclusions Our results suggest that these candidate genes are associated with KSD and PAQR6 comes into our view as the most potent candidate since associated SNP rs759330 is located in the miRNA binding site and may affect mRNA expression level. PMID:24886237
Pivotal role of the muscle-contraction pathway in cryptorchidism and evidence for genomic connections with cardiomyopathy pathways in RASopathies.

PubMed

Cannistraci, Carlo V; Ogorevc, Jernej; Zorc, Minja; Ravasi, Timothy; Dovc, Peter; Kunej, Tanja

2013-02-14

Cryptorchidism is the most frequent congenital disorder in male children; however the genetic causes of cryptorchidism remain poorly investigated. Comparative integratomics combined with systems biology approach was employed to elucidate genetic factors and molecular pathways underlying testis descent. Literature mining was performed to collect genomic loci associated with cryptorchidism in seven mammalian species. Information regarding the collected candidate genes was stored in MySQL relational database. Genomic view of the loci was presented using Flash GViewer web tool (http://gmod.org/wiki/Flashgviewer/). DAVID Bioinformatics Resources 6.7 was used for pathway enrichment analysis. Cytoscape plug-in PiNGO 1.11 was employed for protein-network-based prediction of novel candidate genes. Relevant protein-protein interactions were confirmed and visualized using the STRING database (version 9.0). The developed cryptorchidism gene atlas includes 217 candidate loci (genes, regions involved in chromosomal mutations, and copy number variations) identified at the genomic, transcriptomic, and proteomic level. Human orthologs of the collected candidate loci were presented using a genomic map viewer. The cryptorchidism gene atlas is freely available online: http://www.integratomics-time.com/cryptorchidism/. Pathway analysis suggested the presence of twelve enriched pathways associated with the list of 179 literature-derived candidate genes. Additionally, a list of 43 network-predicted novel candidate genes was significantly associated with four enriched pathways. Joint pathway analysis of the collected and predicted candidate genes revealed the pivotal importance of the muscle-contraction pathway in cryptorchidism and evidence for genomic associations with cardiomyopathy pathways in RASopathies. The developed gene atlas represents an important resource for the scientific community researching genetics of cryptorchidism. The collected data will further facilitate development of novel genetic markers and could be of interest for functional studies in animals and human. The proposed network-based systems biology approach elucidates molecular mechanisms underlying co-presence of cryptorchidism and cardiomyopathy in RASopathies. Such approach could also aid in molecular explanation of co-presence of diverse and apparently unrelated clinical manifestations in other syndromes.
Identification of Candidate Genes Underlying an Iron Efficiency Quantitative Trait Locus in Soybean1

PubMed Central

Peiffer, Gregory A.; King, Keith E.; Severin, Andrew J.; May, Gregory D.; Cianzio, Silvia R.; Lin, Shun Fu; Lauter, Nicholas C.; Shoemaker, Randy C.

2012-01-01

Prevalent on calcareous soils in the United States and abroad, iron deficiency is among the most common and severe nutritional stresses in plants. In soybean (Glycine max) commercial plantings, the identification and use of iron-efficient genotypes has proven to be the best form of managing this soil-related plant stress. Previous studies conducted in soybean identified a significant iron efficiency quantitative trait locus (QTL) explaining more than 70% of the phenotypic variation for the trait. In this research, we identified candidate genes underlying this QTL through molecular breeding, mapping, and transcriptome sequencing. Introgression mapping was performed using two related near-isogenic lines in which a region located on soybean chromosome 3 required for iron efficiency was identified. The region corresponds to the previously reported iron efficiency QTL. The location was further confirmed through QTL mapping conducted in this study. Transcriptome sequencing and quantitative real-time-polymerase chain reaction identified two genes encoding transcription factors within the region that were significantly induced in soybean roots under iron stress. The two induced transcription factors were identified as homologs of the subgroup lb basic helix-loop-helix (bHLH) genes that are known to regulate the strategy I response in Arabidopsis (Arabidopsis thaliana). Resequencing of these differentially expressed genes unveiled a significant deletion within a predicted dimerization domain. We hypothesize that this deletion disrupts the Fe-DEFICIENCY-INDUCED TRANSCRIPTION FACTOR (FIT)/bHLH heterodimer that has been shown to induce known iron acquisition genes. PMID:22319075
Integrated computational biology analysis to evaluate target genes for chronic myelogenous leukemia.

PubMed

Zheng, Yu; Wang, Yu-Ping; Cao, Hongbao; Chen, Qiusheng; Zhang, Xi

2018-06-05

Although hundreds of genes have been linked to chronic myelogenous leukemia (CML), many of the results lack reproducibility. In the present study, data across multiple modalities were integrated to evaluate 579 CML candidate genes, including literature‑based CML‑gene relation data, Gene Expression Omnibus RNA expression data and pathway‑based gene‑gene interaction data. The expression data included samples from 76 patients with CML and 73 healthy controls. For each target gene, four metrics were proposed and tested with case/control classification. The effectiveness of the four metrics presented was demonstrated by the high classification accuracy (94.63%; P<2x10‑4). Cross metric analysis suggested nine top candidate genes for CML: Epidermal growth factor receptor, tumor protein p53, catenin β 1, janus kinase 2, tumor necrosis factor, abelson murine leukemia viral oncogene homolog 1, vascular endothelial growth factor A, B‑cell lymphoma 2 and proto‑oncogene tyrosine‑protein kinase. In addition, 145 CML candidate pathways enriched with 485 out of 579 genes were identified (P<8.2x10‑11; q=0.005). In conclusion, weighted genetic networks generated using computational biology may be complementary to biological experiments for the evaluation of known or novel CML target genes.
Dynamic changes in transcriptome and cell wall composition underlying brassinosteroid-mediated lignification of switchgrass suspension cells.

PubMed

Rao, Xiaolan; Shen, Hui; Pattathil, Sivakumar; Hahn, Michael G; Gelineo-Albersheim, Ivana; Mohnen, Debra; Pu, Yunqiao; Ragauskas, Arthur J; Chen, Xin; Chen, Fang; Dixon, Richard A

2017-01-01

Plant cell walls contribute the majority of plant biomass that can be used to produce transportation fuels. However, the complexity and variability in composition and structure of cell walls, particularly the presence of lignin, negatively impacts their deconstruction for bioenergy. Metabolic and genetic changes associated with secondary wall development in the biofuel crop switchgrass ( Panicum virgatum ) have yet to be reported. Our previous studies have established a cell suspension system for switchgrass, in which cell wall lignification can be induced by application of brassinolide (BL). We have now collected cell wall composition and microarray-based transcriptome profiles for BL-induced and non-induced suspension cultures to provide an overview of the dynamic changes in transcriptional reprogramming during BL-induced cell wall modification. From this analysis, we have identified changes in candidate genes involved in cell wall precursor synthesis, cellulose, hemicellulose, and pectin formation and ester-linkage generation. We have also identified a large number of transcription factors with expression correlated with lignin biosynthesis genes, among which are candidates for control of syringyl (S) lignin accumulation. Together, this work provides an overview of the dynamic compositional changes during brassinosteroid-induced cell wall remodeling, and identifies candidate genes for future plant genetic engineering to overcome cell wall recalcitrance.
Genome-Wide association study identifies candidate genes for Parkinson's disease in an Ashkenazi Jewish population

PubMed Central

2011-01-01

Background To date, nine Parkinson disease (PD) genome-wide association studies in North American, European and Asian populations have been published. The majority of studies have confirmed the association of the previously identified genetic risk factors, SNCA and MAPT, and two studies have identified three new PD susceptibility loci/genes (PARK16, BST1 and HLA-DRB5). In a recent meta-analysis of datasets from five of the published PD GWAS an additional 6 novel candidate genes (SYT11, ACMSD, STK39, MCCC1/LAMP3, GAK and CCDC62/HIP1R) were identified. Collectively the associations identified in these GWAS account for only a small proportion of the estimated total heritability of PD suggesting that an 'unknown' component of the genetic architecture of PD remains to be identified. Methods We applied a GWAS approach to a relatively homogeneous Ashkenazi Jewish (AJ) population from New York to search for both 'rare' and 'common' genetic variants that confer risk of PD by examining any SNPs with allele frequencies exceeding 2%. We have focused on a genetic isolate, the AJ population, as a discovery dataset since this cohort has a higher sharing of genetic background and historically experienced a significant bottleneck. We also conducted a replication study using two publicly available datasets from dbGaP. The joint analysis dataset had a combined sample size of 2,050 cases and 1,836 controls. Results We identified the top 57 SNPs showing the strongest evidence of association in the AJ dataset (p < 9.9 × 10-5). Six SNPs located within gene regions had positive signals in at least one other independent dbGaP dataset: LOC100505836 (Chr3p24), LOC153328/SLC25A48 (Chr5q31.1), UNC13B (9p13.3), SLCO3A1(15q26.1), WNT3(17q21.3) and NSF (17q21.3). We also replicated published associations for the gene regions SNCA (Chr4q21; rs3775442, p = 0.037), PARK16 (Chr1q32.1; rs823114 (NUCKS1), p = 6.12 × 10-4), BST1 (Chr4p15; rs12502586, p = 0.027), STK39 (Chr2q24.3; rs3754775, p = 0.005), and LAMP3 (Chr3; rs12493050, p = 0.005) in addition to the two most common PD susceptibility genes in the AJ population LRRK2 (Chr12q12; rs34637584, p = 1.56 × 10-4) and GBA (Chr1q21; rs2990245, p = 0.015). Conclusions We have demonstrated the utility of the AJ dataset in PD candidate gene and SNP discovery both by replication in dbGaP datasets with a larger sample size and by replicating association of previously identified PD susceptibility genes. Our GWAS study has identified candidate gene regions for PD that are implicated in neuronal signalling and the dopamine pathway. PMID:21812969
A family with X-linked anophthalmia: exclusion of SOX3 as a candidate gene.

PubMed

Slavotinek, Anne; Lee, Stephen S; Hamilton, Steven P

2005-10-01

We report on a four-generation family with X-linked anophthalmia in four affected males and show that this family has LOD scores consistent with linkage to Xq27, the third family reported to be linked to the ANOP1 locus. We sequenced the SOX3 gene at Xq27 as a candidate gene for the X-linked anophthalmia based on the high homology of this gene to SOX2, a gene previously mutated in bilateral anophthlamia. However, no amino acid sequence alterations were identified in SOX3. We have improved the definition of the phenotype in males with anophthalmia linked to the ANOP1 locus, as microcephaly, ocular colobomas, and severe renal malformations have not been described in families linked to ANOP1. (c) 2005 Wiley-Liss, Inc.
A mutation in the MATP gene causes the cream coat colour in the horse

PubMed Central

Mariat, Denis; Taourit, Sead; Guérin, Gérard

2003-01-01

In horses, basic colours such as bay or chestnut may be partially diluted to buckskin and palomino, or extremely diluted to cream, a nearly white colour with pink skin and blue eyes. This dilution is expected to be controlled by one gene and we used both candidate gene and positional cloning strategies to identify the "cream mutation". A horse panel including reference colours was established and typed for different markers within or in the neighbourhood of two candidate genes. Our data suggest that the causal mutation, a G to A transition, is localised in exon 2 of the MATP gene leading to an aspartic acid to asparagine substitution in the encoded protein. This conserved mutation was also described in mice and humans, but not in medaka. PMID:12605854
Genome-wide DNA methylation analysis identifies MEGF10 as a novel epigenetically repressed candidate tumor suppressor gene in neuroblastoma.

PubMed

Charlet, Jessica; Tomari, Ayumi; Dallosso, Anthony R; Szemes, Marianna; Kaselova, Martina; Curry, Thomas J; Almutairi, Bader; Etchevers, Heather C; McConville, Carmel; Malik, Karim T A; Brown, Keith W

2017-04-01

Neuroblastoma is a childhood cancer in which many children still have poor outcomes, emphasising the need to better understand its pathogenesis. Despite recent genome-wide mutation analyses, many primary neuroblastomas do not contain recognizable driver mutations, implicating alternate molecular pathologies such as epigenetic alterations. To discover genes that become epigenetically deregulated during neuroblastoma tumorigenesis, we took the novel approach of comparing neuroblastomas to neural crest precursor cells, using genome-wide DNA methylation analysis. We identified 93 genes that were significantly differentially methylated of which 26 (28%) were hypermethylated and 67 (72%) were hypomethylated. Concentrating on hypermethylated genes to identify candidate tumor suppressor loci, we found the cell engulfment and adhesion factor gene MEGF10 to be epigenetically repressed by DNA hypermethylation or by H3K27/K9 methylation in neuroblastoma cell lines. MEGF10 showed significantly down-regulated expression in neuroblastoma tumor samples; furthermore patients with the lowest-expressing tumors had reduced relapse-free survival. Our functional studies showed that knock-down of MEGF10 expression in neuroblastoma cell lines promoted cell growth, consistent with MEGF10 acting as a clinically relevant, epigenetically deregulated neuroblastoma tumor suppressor gene. © 2016 The Authors. Molecular Carcinogenesis Published by Wiley Periodicals, Inc. © 2016 The Authors. Molecular Carcinogenesis Published by Wiley Periodicals, Inc.
Genome‐wide DNA methylation analysis identifies MEGF10 as a novel epigenetically repressed candidate tumor suppressor gene in neuroblastoma

PubMed Central

Charlet, Jessica; Tomari, Ayumi; Dallosso, Anthony R.; Szemes, Marianna; Kaselova, Martina; Curry, Thomas J.; Almutairi, Bader; Etchevers, Heather C.; McConville, Carmel; Malik, Karim T. A.

2016-01-01

Neuroblastoma is a childhood cancer in which many children still have poor outcomes, emphasising the need to better understand its pathogenesis. Despite recent genome‐wide mutation analyses, many primary neuroblastomas do not contain recognizable driver mutations, implicating alternate molecular pathologies such as epigenetic alterations. To discover genes that become epigenetically deregulated during neuroblastoma tumorigenesis, we took the novel approach of comparing neuroblastomas to neural crest precursor cells, using genome‐wide DNA methylation analysis. We identified 93 genes that were significantly differentially methylated of which 26 (28%) were hypermethylated and 67 (72%) were hypomethylated. Concentrating on hypermethylated genes to identify candidate tumor suppressor loci, we found the cell engulfment and adhesion factor gene MEGF10 to be epigenetically repressed by DNA hypermethylation or by H3K27/K9 methylation in neuroblastoma cell lines. MEGF10 showed significantly down‐regulated expression in neuroblastoma tumor samples; furthermore patients with the lowest‐expressing tumors had reduced relapse‐free survival. Our functional studies showed that knock‐down of MEGF10 expression in neuroblastoma cell lines promoted cell growth, consistent with MEGF10 acting as a clinically relevant, epigenetically deregulated neuroblastoma tumor suppressor gene. © 2016 The Authors. Molecular Carcinogenesis Published by Wiley Periodicals, Inc. PMID:27862318

Complex Landscape of Germline Variants in Brazilian Patients With Hereditary and Early Onset Breast Cancer.

PubMed

Torrezan, Giovana T; de Almeida, Fernanda G Dos Santos R; Figueiredo, Márcia C P; Barros, Bruna D de Figueiredo; de Paula, Cláudia A A; Valieris, Renan; de Souza, Jorge E S; Ramalho, Rodrigo F; da Silva, Felipe C C; Ferreira, Elisa N; de Nóbrega, Amanda F; Felicio, Paula S; Achatz, Maria I; de Souza, Sandro J; Palmero, Edenir I; Carraro, Dirce M

2018-01-01

Pathogenic variants in known breast cancer (BC) predisposing genes explain only about 30% of Hereditary Breast Cancer (HBC) cases, whereas the underlying genetic factors for most families remain unknown. Here, we used whole-exome sequencing (WES) to identify genetic variants associated to HBC in 17 patients of Brazil with familial BC and negative for causal variants in major BC risk genes ( BRCA1/2, TP53 , and CHEK2 c.1100delC). First, we searched for rare variants in 27 known HBC genes and identified two patients harboring truncating pathogenic variants in ATM and BARD1 . For the remaining 15 negative patients, we found a substantial vast number of rare genetic variants. Thus, for selecting the most promising variants we used functional-based variant prioritization, followed by NGS validation, analysis in a control group, cosegregation analysis in one family and comparison with previous WES studies, shrinking our list to 23 novel BC candidate genes, which were evaluated in an independent cohort of 42 high-risk BC patients. Rare and possibly damaging variants were identified in 12 candidate genes in this cohort, including variants in DNA repair genes ( ERCC1 and SXL4 ) and other cancer-related genes ( NOTCH2, ERBB2, MST1R , and RAF1 ). Overall, this is the first WES study applied for identifying novel genes associated to HBC in Brazilian patients, in which we provide a set of putative BC predisposing genes. We also underpin the value of using WES for assessing the complex landscape of HBC susceptibility, especially in less characterized populations.
Co-expression analysis identifies CRC and AP1 the regulator of Arabidopsis fatty acid biosynthesis.

PubMed

Han, Xinxin; Yin, Linlin; Xue, Hongwei

2012-07-01

Fatty acids (FAs) play crucial rules in signal transduction and plant development, however, the regulation of FA metabolism is still poorly understood. To study the relevant regulatory network, fifty-eight FA biosynthesis genes including de novo synthases, desaturases and elongases were selected as "guide genes" to construct the co-expression network. Calculation of the correlation between all Arabidopsis thaliana (L.) genes with each guide gene by Arabidopsis co-expression dating mining tools (ACT) identifies 797 candidate FA-correlated genes. Gene ontology (GO) analysis of these co-expressed genes showed they are tightly correlated to photosynthesis and carbohydrate metabolism, and function in many processes. Interestingly, 63 transcription factors (TFs) were identified as candidate FA biosynthesis regulators and 8 TF families are enriched. Two TF genes, CRC and AP1, both correlating with 8 FA guide genes, were further characterized. Analyses of the ap1 and crc mutant showed the altered total FA composition of mature seeds. The contents of palmitoleic acid, stearic acid, arachidic acid and eicosadienoic acid are decreased, whereas that of oleic acid is increased in ap1 and crc seeds, which is consistent with the qRT-PCR analysis revealing the suppressed expression of the corresponding guide genes. In addition, yeast one-hybrid analysis and electrophoretic mobility shift assay (EMSA) revealed that CRC can bind to the promoter regions of KCS7 and KCS15, indicating that CRC may directly regulate FA biosynthesis. © 2012 Institute of Botany, Chinese Academy of Sciences.
Novel candidate genes for alcoholism--transcriptomic analysis of prefrontal medial cortex, hippocampus and nucleus accumbens of Warsaw alcohol-preferring and non-preferring rats.

PubMed

Stankiewicz, Adrian M; Goscik, Joanna; Dyr, Wanda; Juszczak, Grzegorz R; Ryglewicz, Danuta; Swiergiel, Artur H; Wieczorek, Marek; Stefanski, Roman

2015-12-01

Animal models provide opportunity to study neurobiological aspects of human alcoholism. Changes in gene expression have been implicated in mediating brain functions, including reward system and addiction. The current study aimed to identify genes that may underlie differential ethanol preference in Warsaw High Preferring (WHP) and Warsaw Low Preferring (WLP) rats. Microarray analysis comparing gene expression in nucleus accumbens (NAc), hippocampus (HP) and medial prefrontal cortex (mPFC) was performed in male WHP and WLP rats bred for differences in ethanol preference. Differential and stable between biological repeats expression of 345, 254 and 129 transcripts in NAc, HP and mPFC was detected. Identified genes and processes included known mediators of ethanol response (Mx2, Fam111a, Itpr1, Gabra4, Agtr1a, LTP/LTD, renin-angiotensin signaling pathway), toxicity (Sult1c2a, Ces1, inflammatory response), as well as genes involved in regulation of important addiction-related brain systems such as dopamine, tachykinin or acetylcholine (Gng7, Tac4, Slc5a7). The identified candidate genes may underlie differential ethanol preference in an animal model of alcoholism. Names of genes are written in italics, while names of proteins are written in standard font. Names of human genes/proteins are written in all capital letters. Names of rodent genes/proteins are written in capital letter followed by small letters. Copyright © 2015 Elsevier Inc. All rights reserved.
The Acid Phosphatase-Encoding Gene GmACP1 Contributes to Soybean Tolerance to Low-Phosphorus Stress

PubMed Central

Hao, Derong; Wang, Hui; Kan, Guizhen; Jin, Hangxia; Yu, Deyue

2014-01-01

Phosphorus (P) is essential for all living cells and organisms, and low-P stress is a major factor constraining plant growth and yield worldwide. In plants, P efficiency is a complex quantitative trait involving multiple genes, and the mechanisms underlying P efficiency are largely unknown. Combining linkage analysis, genome-wide and candidate-gene association analyses, and plant transformation, we identified a soybean gene related to P efficiency, determined its favorable haplotypes and developed valuable functional markers. First, six major genomic regions associated with P efficiency were detected by performing genome-wide associations (GWAs) in various environments. A highly significant region located on chromosome 8, qPE8, was identified by both GWAs and linkage mapping and explained 41% of the phenotypic variation. Then, a regional mapping study was performed with 40 surrounding markers in 192 diverse soybean accessions. A strongly associated haplotype (P = 10−7) consisting of the markers Sat_233 and BARC-039899-07603 was identified, and qPE8 was located in a region of approximately 250 kb, which contained a candidate gene GmACP1 that encoded an acid phosphatase. GmACP1 overexpression in soybean hairy roots increased P efficiency by 11–20% relative to the control. A candidate-gene association analysis indicated that six natural GmACP1 polymorphisms explained 33% of the phenotypic variation. The favorable alleles and haplotypes of GmACP1 associated with increased transcript expression correlated with higher enzyme activity. The discovery of the optimal haplotype of GmACP1 will now enable the accurate selection of soybeans with higher P efficiencies and improve our understanding of the molecular mechanisms underlying P efficiency in plants. PMID:24391523
Whole-genome sequence analyses of Western Central African Pygmy hunter-gatherers reveal a complex demographic history and identify candidate genes under positive natural selection

PubMed Central

Hsieh, PingHsun; Veeramah, Krishna R.; Lachance, Joseph; Tishkoff, Sarah A.; Wall, Jeffrey D.; Hammer, Michael F.; Gutenkunst, Ryan N.

2016-01-01

African Pygmies practicing a mobile hunter-gatherer lifestyle are phenotypically and genetically diverged from other anatomically modern humans, and they likely experienced strong selective pressures due to their unique lifestyle in the Central African rainforest. To identify genomic targets of adaptation, we sequenced the genomes of four Biaka Pygmies from the Central African Republic and jointly analyzed these data with the genome sequences of three Baka Pygmies from Cameroon and nine Yoruba famers. To account for the complex demographic history of these populations that includes both isolation and gene flow, we fit models using the joint allele frequency spectrum and validated them using independent approaches. Our two best-fit models both suggest ancient divergence between the ancestors of the farmers and Pygmies, 90,000 or 150,000 yr ago. We also find that bidirectional asymmetric gene flow is statistically better supported than a single pulse of unidirectional gene flow from farmers to Pygmies, as previously suggested. We then applied complementary statistics to scan the genome for evidence of selective sweeps and polygenic selection. We found that conventional statistical outlier approaches were biased toward identifying candidates in regions of high mutation or low recombination rate. To avoid this bias, we assigned P-values for candidates using whole-genome simulations incorporating demography and variation in both recombination and mutation rates. We found that genes and gene sets involved in muscle development, bone synthesis, immunity, reproduction, cell signaling and development, and energy metabolism are likely to be targets of positive natural selection in Western African Pygmies or their recent ancestors. PMID:26888263
Evaluation and Validation of Housekeeping Genes as Reference for Gene Expression Studies in Pigeonpea (Cajanus cajan) Under Drought Stress Conditions

PubMed Central

Sinha, Pallavi; Singh, Vikas K.; Suryanarayana, V.; Krishnamurthy, L.; Saxena, Rachit K.; Varshney, Rajeev K.

2015-01-01

Gene expression analysis using quantitative real-time PCR (qRT-PCR) is a very sensitive technique and its sensitivity depends on the stable performance of reference gene(s) used in the study. A number of housekeeping genes have been used in various expression studies in many crops however, their expression were found to be inconsistent under different stress conditions. As a result, species specific housekeeping genes have been recommended for different expression studies in several crop species. However, such specific housekeeping genes have not been reported in the case of pigeonpea (Cajanus cajan) despite the fact that genome sequence has become available for the crop. To identify the stable housekeeping genes in pigeonpea for expression analysis under drought stress conditions, the relative expression variations of 10 commonly used housekeeping genes (EF1α, UBQ10, GAPDH, 18SrRNA, 25SrRNA, TUB6, ACT1, IF4α, UBC and HSP90) were studied on root, stem and leaves tissues of Asha (ICPL 87119). Three statistical algorithms geNorm, NormFinder and BestKeeper were used to define the stability of candidate genes. geNorm analysis identified IF4α and TUB6 as the most stable housekeeping genes however, NormFinder analysis determined IF4α and HSP90 as the most stable housekeeping genes under drought stress conditions. Subsequently validation of the identified candidate genes was undertaken in qRT-PCR based gene expression analysis of uspA gene which plays an important role for drought stress conditions in pigeonpea. The relative quantification of the uspA gene varied according to the internal controls (stable and least stable genes), thus highlighting the importance of the choice of as well as validation of internal controls in such experiments. The identified stable and validated housekeeping genes will facilitate gene expression studies in pigeonpea especially under drought stress conditions. PMID:25849964
Evaluation and validation of housekeeping genes as reference for gene expression studies in pigeonpea (Cajanus cajan) under drought stress conditions.

PubMed

Sinha, Pallavi; Singh, Vikas K; Suryanarayana, V; Krishnamurthy, L; Saxena, Rachit K; Varshney, Rajeev K

2015-01-01

Gene expression analysis using quantitative real-time PCR (qRT-PCR) is a very sensitive technique and its sensitivity depends on the stable performance of reference gene(s) used in the study. A number of housekeeping genes have been used in various expression studies in many crops however, their expression were found to be inconsistent under different stress conditions. As a result, species specific housekeeping genes have been recommended for different expression studies in several crop species. However, such specific housekeeping genes have not been reported in the case of pigeonpea (Cajanus cajan) despite the fact that genome sequence has become available for the crop. To identify the stable housekeeping genes in pigeonpea for expression analysis under drought stress conditions, the relative expression variations of 10 commonly used housekeeping genes (EF1α, UBQ10, GAPDH, 18SrRNA, 25SrRNA, TUB6, ACT1, IF4α, UBC and HSP90) were studied on root, stem and leaves tissues of Asha (ICPL 87119). Three statistical algorithms geNorm, NormFinder and BestKeeper were used to define the stability of candidate genes. geNorm analysis identified IF4α and TUB6 as the most stable housekeeping genes however, NormFinder analysis determined IF4α and HSP90 as the most stable housekeeping genes under drought stress conditions. Subsequently validation of the identified candidate genes was undertaken in qRT-PCR based gene expression analysis of uspA gene which plays an important role for drought stress conditions in pigeonpea. The relative quantification of the uspA gene varied according to the internal controls (stable and least stable genes), thus highlighting the importance of the choice of as well as validation of internal controls in such experiments. The identified stable and validated housekeeping genes will facilitate gene expression studies in pigeonpea especially under drought stress conditions.
A candidate multimodal functional genetic network for thermal adaptation

PubMed Central

Pathak, Rachana; Prajapati, Indira; Bankston, Shannon; Thompson, Aprylle; Usher, Jaytriece; Isokpehi, Raphael D.

2014-01-01

Vertebrate ectotherms such as reptiles provide ideal organisms for the study of adaptation to environmental thermal change. Comparative genomic and exomic studies can recover markers that diverge between warm and cold adapted lineages, but the genes that are functionally related to thermal adaptation may be difficult to identify. We here used a bioinformatics genome-mining approach to predict and identify functions for suitable candidate markers for thermal adaptation in the chicken. We first established a framework of candidate functions for such markers, and then compiled the literature on genes known to adapt to the thermal environment in different lineages of vertebrates. We then identified them in the genomes of human, chicken, and the lizard Anolis carolinensis, and established a functional genetic interaction network in the chicken. Surprisingly, markers initially identified from diverse lineages of vertebrates such as human and fish were all in close functional relationship with each other and more associated than expected by chance. This indicates that the general genetic functional network for thermoregulation and/or thermal adaptation to the environment might be regulated via similar evolutionarily conserved pathways in different vertebrate lineages. We were able to identify seven functions that were statistically overrepresented in this network, corresponding to four of our originally predicted functions plus three unpredicted functions. We describe this network as multimodal: central regulator genes with the function of relaying thermal signal (1), affect genes with different cellular functions, namely (2) lipoprotein metabolism, (3) membrane channels, (4) stress response, (5) response to oxidative stress, (6) muscle contraction and relaxation, and (7) vasodilation, vasoconstriction and regulation of blood pressure. This network constitutes a novel resource for the study of thermal adaptation in the closely related nonavian reptiles and other vertebrate ectotherms. PMID:25289178
Rrp1b, a New Candidate Susceptibility Gene for Breast Cancer Progression and Metastasis

PubMed Central

Crawford, Nigel P. S; Qian, Xiaolan; Ziogas, Argyrios; Papageorge, Alex G; Boersma, Brenda J; Walker, Renard C; Lukes, Luanne; Rowe, William L; Zhang, Jinghui; Ambs, Stefan; Lowy, Douglas R; Anton-Culver, Hoda; Hunter, Kent W

2007-01-01

A novel candidate metastasis modifier, ribosomal RNA processing 1 homolog B (Rrp1b), was identified through two independent approaches. First, yeast two-hybrid, immunoprecipitation, and functional assays demonstrated a physical and functional interaction between Rrp1b and the previous identified metastasis modifier Sipa1. In parallel, using mouse and human metastasis gene expression data it was observed that extracellular matrix (ECM) genes are common components of metastasis predictive signatures, suggesting that ECM genes are either important markers or causal factors in metastasis. To investigate the relationship between ECM genes and poor prognosis in breast cancer, expression quantitative trait locus analysis of polyoma middle-T transgene-induced mammary tumor was performed. ECM gene expression was found to be consistently associated with Rrp1b expression. In vitro expression of Rrp1b significantly altered ECM gene expression, tumor growth, and dissemination in metastasis assays. Furthermore, a gene signature induced by ectopic expression of Rrp1b in tumor cells predicted survival in a human breast cancer gene expression dataset. Finally, constitutional polymorphism within RRP1B was found to be significantly associated with tumor progression in two independent breast cancer cohorts. These data suggest that RRP1B may be a novel susceptibility gene for breast cancer progression and metastasis. PMID:18081427
Transcriptomic Analysis of the Regulation of Rhizome Formation in Temperate and Tropical Lotus (Nelumbo nucifera).

PubMed

Yang, Mei; Zhu, Lingping; Pan, Cheng; Xu, Liming; Liu, Yanling; Ke, Weidong; Yang, Pingfang

2015-08-17

Rhizome is the storage organ of lotus derived from modified stems. The development of rhizome is a complex process and depends on the balanced expression of the genes that is controlled by environmental and endogenous factors. However, little is known about the mechanism that regulates rhizome girth enlargement. In this study, using RNA-seq, transcriptomic analyses were performed at three rhizome developmental stages-the stolon, middle swelling and later swelling stage -in the cultivars 'ZO' (temperate lotus with enlarged rhizome) and 'RL' (tropical lotus with stolon). About 348 million high-quality reads were generated, and 88.5% of the data were mapped to the reference genome. Of 26783 genes identified, 24069 genes were previously predicted in the reference, and 2714 genes were novel transcripts. Moreover, 8821 genes were differentially expressed between the cultivars at the three stages. Functional analysis identified that these genes were significantly enriched in pathways carbohydrate metabolism and plant hormone signal transduction. Twenty-two genes involved in photoperiod pathway, starch metabolism and hormone signal transduction were candidate genes inducing rhizome girth enlargement. Comparative transcriptomic analysis detected several differentially expressed genes and potential candidate genes required for rhizome girth enlargement, which lay a foundation for future studies on molecular mechanisms underlying rhizome formation.
Transcriptomic Analysis of the Regulation of Rhizome Formation in Temperate and Tropical Lotus (Nelumbo nucifera)

PubMed Central

Yang, Mei; Zhu, Lingping; Pan, Cheng; Xu, Liming; Liu, Yanling; Ke, Weidong; Yang, Pingfang

2015-01-01

Rhizome is the storage organ of lotus derived from modified stems. The development of rhizome is a complex process and depends on the balanced expression of the genes that is controlled by environmental and endogenous factors. However, little is known about the mechanism that regulates rhizome girth enlargement. In this study, using RNA-seq, transcriptomic analyses were performed at three rhizome developmental stages—the stolon, middle swelling and later swelling stage —in the cultivars ‘ZO’ (temperate lotus with enlarged rhizome) and ‘RL’ (tropical lotus with stolon). About 348 million high-quality reads were generated, and 88.5% of the data were mapped to the reference genome. Of 26783 genes identified, 24069 genes were previously predicted in the reference, and 2714 genes were novel transcripts. Moreover, 8821 genes were differentially expressed between the cultivars at the three stages. Functional analysis identified that these genes were significantly enriched in pathways carbohydrate metabolism and plant hormone signal transduction. Twenty-two genes involved in photoperiod pathway, starch metabolism and hormone signal transduction were candidate genes inducing rhizome girth enlargement. Comparative transcriptomic analysis detected several differentially expressed genes and potential candidate genes required for rhizome girth enlargement, which lay a foundation for future studies on molecular mechanisms underlying rhizome formation. PMID:26279185
Analysis of X chromosome genomic DNA sequence copy number variation associated with premature ovarian failure (POF)

PubMed Central

Quilter, C.R.; Karcanias, A.C.; Bagga, M.R.; Duncan, S.; Murray, A.; Conway, G.S.; Sargent, C.A.; Affara, N.A.

2013-01-01

BACKGROUND Premature ovarian failure (POF) is a heterogeneous disease defined as amenorrhoea for >6 months before age 40, with an FSH serum level >40 mIU/ml (menopausal levels). While there is a strong genetic association with POF, familial studies have also indicated that idiopathic POF may also be genetically linked. Conventional cytogenetic analyses have identified regions of the X chromosome that are strongly associated with ovarian function, as well as several POF candidate genes. Cryptic chromosome abnormalities that have been missed might be detected by array comparative genomic hybridization. METHODS In this study, samples from 42 idiopathic POF patients were subjected to a complete end-to-end X/Y chromosome tiling path array to achieve a detailed copy number variation (CNV) analysis of X chromosome involvement in POF. The arrays also contained a 1 Mb autosomal tiling path as a reference control. Quantitative PCR for selected genes contained within the CNVs was used to confirm the majority of the changes detected. The expression pattern of some of these genes in human tissue RNA was examined by reverse transcription (RT)–PCR. RESULTS A number of CNVs were identified on both Xp and Xq, with several being shared among the POF cases. Some CNVs fall within known polymorphic CNV regions, and others span previously identified POF candidate regions and genes. CONCLUSIONS The new data reported in this study reveal further discrete X chromosome intervals not previously associated with the disease and therefore implicate new clusters of candidate genes. Further studies will be required to elucidate their involvement in POF. PMID:20570974
Candidate-gene association study of mothers with pre-eclampsia, and their infants, analyzing 775 SNPs in 190 genes.

PubMed

Goddard, Katrina A B; Tromp, Gerard; Romero, Roberto; Olson, Jane M; Lu, Qing; Xu, Zhiying; Parimi, Neeta; Nien, Jyh Kae; Gomez, Ricardo; Behnke, Ernesto; Solari, Margarita; Espinoza, Jimmy; Santolaya, Joaquin; Chaiworapongsa, Tinnakorn; Lenk, Guy M; Volkenant, Kimberly; Anant, Madan Kumar; Salisbury, Benjamin A; Carr, Janet; Lee, Min Soeb; Vovis, Gerald F; Kuivaniemi, Helena

2007-01-01

Pre-eclampsia (PE) affects 5-7% of pregnancies in the US, and is a leading cause of maternal death and perinatal morbidity and mortality worldwide. To identify genes with a role in PE, we conducted a large-scale association study evaluating 775 SNPs in 190 candidate genes selected for a potential role in obstetrical complications. SNP discovery was performed by DNA sequencing, and genotyping was carried out in a high-throughput facility using the MassARRAY(TM) System. Women with PE (n = 394) and their offspring (n = 324) were compared with control women (n = 602) and their offspring (n = 631) from the same hospital-based population. Haplotypes were estimated for each gene using the EM algorithm, and empirical p values were obtained for a logistic regression-based score test, adjusted for significant covariates. An interaction model between maternal and offspring genotypes was also evaluated. The most significant findings for association with PE were COL1A1 (p = 0.0011) and IL1A (p = 0.0014) for the maternal genotype, and PLAUR (p = 0.0008) for the offspring genotype. Common candidate genes for PE, including MTHFR and NOS3, were not significantly associated with PE. For the interaction model, SNPs within IGF1 (p = 0.0035) and IL4R (p = 0.0036) gave the most significant results. This study is one of the most comprehensive genetic association studies of PE to date, including an evaluation of offspring genotypes that have rarely been considered in previous studies. Although we did not identify statistically significant evidence of association for any of the candidate loci evaluated here after adjusting for multiple testing using the false discovery rate, additional compelling evidence exists, including multiple SNPs with nominally significant p values in COL1A1 and the IL1A region, and previous reports of association for IL1A, to support continued interest in these genes as candidates for PE. Identification of the genetic regulators of PE may have broader implications, since women with PE are at increased risk of death from cardiovascular diseases later in life.
Lumbosacral stenosis in Labrador retriever military working dogs - an exomic exploratory study.

PubMed

Mukherjee, Meenakshi; Jones, Jeryl C; Yao, Jianbo

2017-01-01

Canine lumbosacral stenosis is defined as narrowing of the caudal lumbar and/or sacral vertebral canal. A risk factor for neurologic problems in many large sized breeds, lumbosacral stenosis can also cause early retirement in Labrador retriever military working dogs. Though vital for conservative management of the condition, early detection is complicated by the ambiguous nature of clinical signs of lumbosacral stenosis in stoic and high-drive Labrador retriever military working dogs. Though clinical diagnoses of lumbosacral stenosis using CT imaging are standard, they are usually not performed unless dogs present with clinical symptoms. Understanding the underlying genomic mechanisms would be beneficial in developing early detection methods for lumbosacral stenosis, which could prevent premature retirement in working dogs. The exomes of 8 young Labrador retriever military working dogs (4 affected and 4 unaffected by lumbosacral stenosis, phenotypically selected by CT image analyses from 40 dogs with no reported clinical signs of the condition) were sequenced to identify and annotate exonic variants between dogs negative and positive for lumbosacral stenosis. Two-hundred and fifty-two variants were detected to be homozygous for the wild allele and either homozygous or heterozygous for the variant allele. Seventeen non-disruptive variants were detected that could affect protein effectiveness in 7 annotated (SCN1B, RGS9BP, ASXL3, TTR, LRRC16B, PTPRO, ZBBX) and 3 predicted genes (EEF1A1, DNAJA1, ZFX). No exonic variants were detected in any of the canine orthologues for human lumbar spinal stenosis candidate genes. TTR (transthyretin) gene could be a possible candidate for lumbosacral stenosis in Labrador retrievers based on previous human studies that have reported an association between human lumbar spinal stenosis and transthyretin protein amyloidosis. Other genes identified with exonic variants in this study but with no known published association with lumbosacral stenosis and/or lumbar spinal stenosis could also be candidate genes for future canine lumbosacral stenosis studies but their roles remain currently unknown. Human lumbar spinal stenosis candidate genes also cannot be ruled out as lumbosacral stenosis candidate genes. More definitive genetic investigations of this condition are needed before any genetic test for lumbosacral stenosis in Labrador retriever can be developed.
Cancer in silico drug discovery: a systems biology tool for identifying candidate drugs to target specific molecular tumor subtypes.

PubMed

San Lucas, F Anthony; Fowler, Jerry; Chang, Kyle; Kopetz, Scott; Vilar, Eduardo; Scheet, Paul

2014-12-01

Large-scale cancer datasets such as The Cancer Genome Atlas (TCGA) allow researchers to profile tumors based on a wide range of clinical and molecular characteristics. Subsequently, TCGA-derived gene expression profiles can be analyzed with the Connectivity Map (CMap) to find candidate drugs to target tumors with specific clinical phenotypes or molecular characteristics. This represents a powerful computational approach for candidate drug identification, but due to the complexity of TCGA and technology differences between CMap and TCGA experiments, such analyses are challenging to conduct and reproduce. We present Cancer in silico Drug Discovery (CiDD; scheet.org/software), a computational drug discovery platform that addresses these challenges. CiDD integrates data from TCGA, CMap, and Cancer Cell Line Encyclopedia (CCLE) to perform computational drug discovery experiments, generating hypotheses for the following three general problems: (i) determining whether specific clinical phenotypes or molecular characteristics are associated with unique gene expression signatures; (ii) finding candidate drugs to repress these expression signatures; and (iii) identifying cell lines that resemble the tumors being studied for subsequent in vitro experiments. The primary input to CiDD is a clinical or molecular characteristic. The output is a biologically annotated list of candidate drugs and a list of cell lines for in vitro experimentation. We applied CiDD to identify candidate drugs to treat colorectal cancers harboring mutations in BRAF. CiDD identified EGFR and proteasome inhibitors, while proposing five cell lines for in vitro testing. CiDD facilitates phenotype-driven, systematic drug discovery based on clinical and molecular data from TCGA. ©2014 American Association for Cancer Research.
Combining Genotype, Phenotype, and Environment to Infer Potential Candidate Genes.

PubMed

Talbot, Benoit; Chen, Ting-Wen; Zimmerman, Shawna; Joost, Stéphane; Eckert, Andrew J; Crow, Taylor M; Semizer-Cuming, Devrim; Seshadri, Chitra; Manel, Stéphanie

2017-03-01

Population genomic analysis can be an important tool in understanding local adaptation. Identification of potential adaptive loci in such analyses is usually based on the survey of a large genomic dataset in combination with environmental variables. Phenotypic data are less commonly incorporated into such studies, although combining a genome scan analysis with a phenotypic trait analysis can greatly improve the insights obtained from each analysis individually. Here, we aimed to identify loci potentially involved in adaptation to climate in 283 Loblolly pine (Pinus taeda) samples from throughout the species' range in the southeastern United States. We analyzed associations between phenotypic, molecular, and environmental variables from datasets of 3082 single nucleotide polymorphism (SNP) loci and 3 categories of phenotypic traits (gene expression, metabolites, and whole-plant traits). We found only 6 SNP loci that displayed potential signals of local adaptation. Five of the 6 identified SNPs are linked to gene expression traits for lignin development, and 1 is linked with whole-plant traits. We subsequently compared the 6 candidate genes with environmental variables and found a high correlation in only 3 of them (R2 > 0.2). Our study highlights the need for a combination of genotypes, phenotypes, and environmental variables, and for an appropriate sampling scheme and study design, to improve confidence in the identification of potential candidate genes. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
A genome-wide association study suggests new candidate genes for milk production traits in Chinese Holstein cattle.

PubMed

Yue, S J; Zhao, Y Q; Gu, X R; Yin, B; Jiang, Y L; Wang, Z H; Shi, K R

2017-12-01

A genome-wide association study (GWAS) was conducted on 15 milk production traits in Chinese Holstein. The experimental population consisted of 445 cattle, each genotyped by the GGP (GeneSeek genomic profiling)-BovineLD V3 SNP chip, which had 26 151 public SNPs in its manifest file. After data cleaning, 20 326 SNPs were retained for the GWAS. The phenotypes were estimated breeding values of traits, provided by a public dairy herd improvement program center that had been collected once a month for 3 years. Two statistical models, a fixed-effect linear regression model and a mixed-effect linear model, were used to estimate the association effects of SNPs on each of the phenotypes. Genome-wide significant and suggestive thresholds were set at 2.46E-06 and 4.95E-05 respectively. The two statistical models concurrently identified two genome-wide significant (P < 0.05) SNPs on milk production traits in this Chinese Holstein population. The positional candidate genes, which were the ones closest to these two identified SNPs, were EEF2K (eukaryotic elongation factor 2 kinase) and KLHL1 (kelch like family member 1). These two genes could serve as new candidate genes for milk yield and lactation persistence, yet their roles need to be verified in further function studies. © 2017 Stichting International Foundation for Animal Genetics.
Analysis of the QTL for sleep homeostasis in mice: Homer1a is a likely candidate.

PubMed

Mackiewicz, M; Paigen, B; Naidoo, N; Pack, A I

2008-03-14

Electroencephalographic oscillations in the frequency range of 0.5-4 Hz, characteristic of slow-wave sleep (SWS), are often referred to as the delta oscillation or delta power. Delta power reflects sleep intensity and correlates with the homeostatic response to sleep loss. A published survey of inbred strains of mice demonstrated that the time course of accumulation of delta power varied among inbred strains, and the segregation of the rebound of delta power in BxD recombinant inbred strains identified a genomic region on chromosome 13 referred to as the delta power in SWS (or Dps1). The quantitative trait locus (QTL) contains genes that modify the accumulation of delta power after sleep deprivation. Here, we narrow the QTL using interval-specific haplotype analysis and present a comprehensive annotation of the remaining genes in the Dps1 region with sequence comparisons to identify polymorphisms within the coding and regulatory regions. We established the expression pattern of selected genes located in the Dps1 interval in sleep and wakefulness in B6 and D2 parental strains. Taken together, these steps reduced the number of potential candidate genes that may underlie the accumulation of delta power after sleep deprivation and explain the Dps1 QTL. The strongest candidate gene is Homer1a, which is supported by expression differences between sleep and wakefulness and the SNP polymorphism in the upstream regulatory regions.
Bacterial reference genes for gene expression studies by RT-qPCR: survey and analysis.

PubMed

Rocha, Danilo J P; Santos, Carolina S; Pacheco, Luis G C

2015-09-01

The appropriate choice of reference genes is essential for accurate normalization of gene expression data obtained by the method of reverse transcription quantitative real-time PCR (RT-qPCR). In 2009, a guideline called the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) highlighted the importance of the selection and validation of more than one suitable reference gene for obtaining reliable RT-qPCR results. Herein, we searched the recent literature in order to identify the bacterial reference genes that have been most commonly validated in gene expression studies by RT-qPCR (in the first 5 years following publication of the MIQE guidelines). Through a combination of different search parameters with the text mining tool MedlineRanker, we identified 145 unique bacterial genes that were recently tested as candidate reference genes. Of these, 45 genes were experimentally validated and, in most of the cases, their expression stabilities were verified using the software tools geNorm and NormFinder. It is noteworthy that only 10 of these reference genes had been validated in two or more of the studies evaluated. An enrichment analysis using Gene Ontology classifications demonstrated that genes belonging to the functional categories of DNA Replication (GO: 0006260) and Transcription (GO: 0006351) rendered a proportionally higher number of validated reference genes. Three genes in the former functional class were also among the top five most stable genes identified through an analysis of gene expression data obtained from the Pathosystems Resource Integration Center. These results may provide a guideline for the initial selection of candidate reference genes for RT-qPCR studies in several different bacterial species.
A Novel Candidate Region for Genetic Adaptation to High Altitude in Andean Populations

PubMed Central

Lippold, Sebastian; de Filippo, Cesare; Tang, Kun; López Herráez, David; Li, Jing; Stoneking, Mark

2015-01-01

Humans living at high altitude (≥2,500 meters above sea level) have acquired unique abilities to survive the associated extreme environmental conditions, including hypoxia, cold temperature, limited food availability and high levels of free radicals and oxidants. Long-term inhabitants of the most elevated regions of the world have undergone extensive physiological and/or genetic changes, particularly in the regulation of respiration and circulation, when compared to lowland populations. Genome scans have identified candidate genes involved in altitude adaption in the Tibetan Plateau and the Ethiopian highlands, in contrast to populations from the Andes, which have not been as intensively investigated. In the present study, we focused on three indigenous populations from Bolivia: two groups of Andean natives, Aymara and Quechua, and the low-altitude control group of Guarani from the Gran Chaco lowlands. Using pooled samples, we identified a number of SNPs exhibiting large allele frequency differences over 900,000 genotyped SNPs. A region in chromosome 10 (within the cytogenetic bands q22.3 and q23.1) was significantly differentiated between highland and lowland groups. We resequenced ~1.5 Mb surrounding the candidate region and identified strong signals of positive selection in the highland populations. A composite of multiple signals like test localized the signal to FAM213A and a related enhancer; the product of this gene acts as an antioxidant to lower oxidative stress and may help to maintain bone mass. The results suggest that positive selection on the enhancer might increase the expression of this antioxidant, and thereby prevent oxidative damage. In addition, the most significant signal in a relative extended haplotype homozygosity analysis was localized to the SFTPD gene, which encodes a surfactant pulmonary-associated protein involved in normal respiration and innate host defense. Our study thus identifies two novel candidate genes and associated pathways that may be involved in high-altitude adaptation in Andean populations. PMID:25961286

Ontology based molecular signatures for immune cell types via gene expression analysis

PubMed Central

2013-01-01

Background New technologies are focusing on characterizing cell types to better understand their heterogeneity. With large volumes of cellular data being generated, innovative methods are needed to structure the resulting data analyses. Here, we describe an ‘Ontologically BAsed Molecular Signature’ (OBAMS) method that identifies novel cellular biomarkers and infers biological functions as characteristics of particular cell types. This method finds molecular signatures for immune cell types based on mapping biological samples to the Cell Ontology (CL) and navigating the space of all possible pairwise comparisons between cell types to find genes whose expression is core to a particular cell type’s identity. Results We illustrate this ontological approach by evaluating expression data available from the Immunological Genome project (IGP) to identify unique biomarkers of mature B cell subtypes. We find that using OBAMS, candidate biomarkers can be identified at every strata of cellular identity from broad classifications to very granular. Furthermore, we show that Gene Ontology can be used to cluster cell types by shared biological processes in order to find candidate genes responsible for somatic hypermutation in germinal center B cells. Moreover, through in silico experiments based on this approach, we have identified genes sets that represent genes overexpressed in germinal center B cells and identify genes uniquely expressed in these B cells compared to other B cell types. Conclusions This work demonstrates the utility of incorporating structured ontological knowledge into biological data analysis – providing a new method for defining novel biomarkers and providing an opportunity for new biological insights. PMID:24004649
Genomic approaches for the elucidation of genes and gene networks underlying cardiovascular traits.

PubMed

Adriaens, M E; Bezzina, C R

2018-06-22

Genome-wide association studies have shed light on the association between natural genetic variation and cardiovascular traits. However, linking a cardiovascular trait associated locus to a candidate gene or set of candidate genes for prioritization for follow-up mechanistic studies is all but straightforward. Genomic technologies based on next-generation sequencing technology nowadays offer multiple opportunities to dissect gene regulatory networks underlying genetic cardiovascular trait associations, thereby aiding in the identification of candidate genes at unprecedented scale. RNA sequencing in particular becomes a powerful tool when combined with genotyping to identify loci that modulate transcript abundance, known as expression quantitative trait loci (eQTL), or loci modulating transcript splicing known as splicing quantitative trait loci (sQTL). Additionally, the allele-specific resolution of RNA-sequencing technology enables estimation of allelic imbalance, a state where the two alleles of a gene are expressed at a ratio differing from the expected 1:1 ratio. When multiple high-throughput approaches are combined with deep phenotyping in a single study, a comprehensive elucidation of the relationship between genotype and phenotype comes into view, an approach known as systems genetics. In this review, we cover key applications of systems genetics in the broad cardiovascular field.
Integrated multi-cohort transcriptional meta-analysis of neurodegenerative diseases.

PubMed

Li, Matthew D; Burns, Terry C; Morgan, Alexander A; Khatri, Purvesh

2014-09-04

Neurodegenerative diseases share common pathologic features including neuroinflammation, mitochondrial dysfunction and protein aggregation, suggesting common underlying mechanisms of neurodegeneration. We undertook a meta-analysis of public gene expression data for neurodegenerative diseases to identify a common transcriptional signature of neurodegeneration. Using 1,270 post-mortem central nervous system tissue samples from 13 patient cohorts covering four neurodegenerative diseases, we identified 243 differentially expressed genes, which were similarly dysregulated in 15 additional patient cohorts of 205 samples including seven neurodegenerative diseases. This gene signature correlated with histologic disease severity. Metallothioneins featured prominently among differentially expressed genes, and functional pathway analysis identified specific convergent themes of dysregulation. MetaCore network analyses revealed various novel candidate hub genes (e.g. STAU2). Genes associated with M1-polarized macrophages and reactive astrocytes were strongly enriched in the meta-analysis data. Evaluation of genes enriched in neurons revealed 70 down-regulated genes, over half not previously associated with neurodegeneration. Comparison with aging brain data (3 patient cohorts, 221 samples) revealed 53 of these to be unique to neurodegenerative disease, many of which are strong candidates to be important in neuropathogenesis (e.g. NDN, NAP1L2). ENCODE ChIP-seq analysis predicted common upstream transcriptional regulators not associated with normal aging (REST, RBBP5, SIN3A, SP2, YY1, ZNF143, IKZF1). Finally, we removed genes common to neurodegeneration from disease-specific gene signatures, revealing uniquely robust immune response and JAK-STAT signaling in amyotrophic lateral sclerosis. Our results implicate pervasive bioenergetic deficits, M1-type microglial activation and gliosis as unifying themes of neurodegeneration, and identify numerous novel genes associated with neurodegenerative processes.
Identification of genetic elements in metabolism by high-throughput mouse phenotyping.

PubMed

Rozman, Jan; Rathkolb, Birgit; Oestereicher, Manuela A; Schütt, Christine; Ravindranath, Aakash Chavan; Leuchtenberger, Stefanie; Sharma, Sapna; Kistler, Martin; Willershäuser, Monja; Brommage, Robert; Meehan, Terrence F; Mason, Jeremy; Haselimashhadi, Hamed; Hough, Tertius; Mallon, Ann-Marie; Wells, Sara; Santos, Luis; Lelliott, Christopher J; White, Jacqueline K; Sorg, Tania; Champy, Marie-France; Bower, Lynette R; Reynolds, Corey L; Flenniken, Ann M; Murray, Stephen A; Nutter, Lauryl M J; Svenson, Karen L; West, David; Tocchini-Valentini, Glauco P; Beaudet, Arthur L; Bosch, Fatima; Braun, Robert B; Dobbie, Michael S; Gao, Xiang; Herault, Yann; Moshiri, Ala; Moore, Bret A; Kent Lloyd, K C; McKerlie, Colin; Masuya, Hiroshi; Tanaka, Nobuhiko; Flicek, Paul; Parkinson, Helen E; Sedlacek, Radislav; Seong, Je Kyung; Wang, Chi-Kuang Leo; Moore, Mark; Brown, Steve D; Tschöp, Matthias H; Wurst, Wolfgang; Klingenspor, Martin; Wolf, Eckhard; Beckers, Johannes; Machicao, Fausto; Peter, Andreas; Staiger, Harald; Häring, Hans-Ulrich; Grallert, Harald; Campillos, Monica; Maier, Holger; Fuchs, Helmut; Gailus-Durner, Valerie; Werner, Thomas; Hrabe de Angelis, Martin

2018-01-18

Metabolic diseases are a worldwide problem but the underlying genetic factors and their relevance to metabolic disease remain incompletely understood. Genome-wide research is needed to characterize so-far unannotated mammalian metabolic genes. Here, we generate and analyze metabolic phenotypic data of 2016 knockout mouse strains under the aegis of the International Mouse Phenotyping Consortium (IMPC) and find 974 gene knockouts with strong metabolic phenotypes. 429 of those had no previous link to metabolism and 51 genes remain functionally completely unannotated. We compared human orthologues of these uncharacterized genes in five GWAS consortia and indeed 23 candidate genes are associated with metabolic disease. We further identify common regulatory elements in promoters of candidate genes. As each regulatory element is composed of several transcription factor binding sites, our data reveal an extensive metabolic phenotype-associated network of co-regulated genes. Our systematic mouse phenotype analysis thus paves the way for full functional annotation of the genome.
Detection of Significant Pneumococcal Meningitis Biomarkers by Ego Network.

PubMed

Wang, Qian; Lou, Zhifeng; Zhai, Liansuo; Zhao, Haibin

2017-06-01

To identify significant biomarkers for detection of pneumococcal meningitis based on ego network. Based on the gene expression data of pneumococcal meningitis and global protein-protein interactions (PPIs) data recruited from open access databases, the authors constructed a differential co-expression network (DCN) to identify pneumococcal meningitis biomarkers in a network view. Here EgoNet algorithm was employed to screen the significant ego networks that could accurately distinguish pneumococcal meningitis from healthy controls, by sequentially seeking ego genes, searching candidate ego networks, refinement of candidate ego networks and significance analysis to identify ego networks. Finally, the functional inference of the ego networks was performed to identify significant pathways for pneumococcal meningitis. By differential co-expression analysis, the authors constructed the DCN that covered 1809 genes and 3689 interactions. From the DCN, a total of 90 ego genes were identified. Starting from these ego genes, three significant ego networks (Module 19, Module 70 and Module 71) that could predict clinical outcomes for pneumococcal meningitis were identified by EgoNet algorithm, and the corresponding ego genes were GMNN, MAD2L1 and TPX2, respectively. Pathway analysis showed that these three ego networks were related to CDT1 association with the CDC6:ORC:origin complex, inactivation of APC/C via direct inhibition of the APC/C complex pathway, and DNA strand elongation, respectively. The authors successfully screened three significant ego modules which could accurately predict the clinical outcomes for pneumococcal meningitis and might play important roles in host response to pathogen infection in pneumococcal meningitis.
Association Between Germline Mutation in VSIG10L and Familial Barrett Neoplasia.

PubMed

Fecteau, Ryan E; Kong, Jianping; Kresak, Adam; Brock, Wendy; Song, Yeunjoo; Fujioka, Hisashi; Elston, Robert; Willis, Joseph E; Lynch, John P; Markowitz, Sanford D; Guda, Kishore; Chak, Amitabh

2016-10-01

Esophageal adenocarcinoma and its precursor lesion Barrett esophagus have seen a dramatic increase in incidence over the past 4 decades yet marked genetic heterogeneity of this disease has precluded advances in understanding its pathogenesis and improving treatment. To identify novel disease susceptibility variants in a familial syndrome of esophageal adenocarcinoma and Barrett esophagus, termed familial Barrett esophagus, by using high-throughput sequencing in affected individuals from a large, multigenerational family. We performed whole exome sequencing (WES) from peripheral lymphocyte DNA on 4 distant relatives from our multiplex, multigenerational familial Barrett esophagus family to identify candidate disease susceptibility variants. Gene variants were filtered, verified, and segregation analysis performed to identify a single candidate variant. Gene expression analysis was done with both quantitative real-time polymerase chain reaction and in situ RNA hybridization. A 3-dimensional organotypic cell culture model of esophageal maturation was utilized to determine the phenotypic effects of our gene variant. We used electron microscopy on esophageal mucosa from an affected family member carrying the gene variant to assess ultrastructural changes. Identification of a novel, germline disease susceptibility variant in a previously uncharacterized gene. A multiplex, multigenerational family with 14 members affected (3 members with esophageal adenocarcinoma and 11 with Barrett esophagus) was identified, and whole-exome sequencing identified a germline mutation (S631G) at a highly conserved serine residue in the uncharacterized gene VSIG10L that segregated in affected members. Transfection of S631G variant into a 3-dimensional organotypic culture model of normal esophageal squamous cells dramatically inhibited epithelial maturation compared with the wild-type. VSIG10L exhibited high expression in normal squamous esophagus with marked loss of expression in Barrett-associated lesions. Electron microscopy of squamous esophageal mucosa harboring the S631G variant revealed dilated intercellular spaces and reduced desmosomes. This study presents VSIG10L as a candidate familial Barrett esophagus susceptibility gene, with a putative role in maintaining normal esophageal homeostasis. Further research assessing VSIG10L function may reveal pathways important for esophageal maturation and the pathogenesis of Barrett esophagus and esophageal adenocarcinoma.
Association Between Germline Mutation in VSIG10L and Familial Barrett Neoplasia

PubMed Central

Fecteau, Ryan E.; Kong, Jianping; Kresak, Adam; Brock, Wendy; Song, Yeunjoo; Fujioka, Hisashi; Elston, Robert; Willis, Joseph E.; Lynch, John P.; Markowitz, Sanford D.; Guda, Kishore; Chak, Amitabh

2016-01-01

IMPORTANCE Esophageal adenocarcinoma and its precursor lesion Barrett esophagus have seen a dramatic increase in incidence over the past 4 decades yet marked genetic heterogeneity of this disease has precluded advances in understanding its pathogenesis and improving treatment. OBJECTIVE To identify novel disease susceptibility variants in a familial syndrome of esophageal adenocarcinoma and Barrett esophagus, termed familial Barrett esophagus, by using high-throughput sequencing in affected individuals from a large, multigenerational family. DESIGN, SETTING, AND PARTICIPANTS We performed whole exome sequencing (WES) from peripheral lymphocyte DNA on 4 distant relatives from our multiplex, multigenerational familial Barrett esophagus family to identify candidate disease susceptibility variants. Gene variants were filtered, verified, and segregation analysis performed to identify a single candidate variant. Gene expression analysis was done with both quantitative real-time polymerase chain reaction and in situ RNA hybridization. A 3-dimensional organotypic cell culture model of esophageal maturation was utilized to determine the phenotypic effects of our gene variant. We used electron microscopy on esophageal mucosa from an affected family member carrying the gene variant to assess ultrastructural changes. MAIN OUTCOMES AND MEASURES Identification of a novel, germline disease susceptibility variant in a previously uncharacterized gene. RESULTS A multiplex, multigenerational family with 14 members affected (3 members with esophageal adenocarcinoma and 11 with Barrett esophagus) was identified, and whole-exome sequencing identified a germline mutation (S631G) at a highly conserved serine residue in the uncharacterized gene VSIG10L that segregated in affected members. Transfection of S631G variant into a 3-dimensional organotypic culture model of normal esophageal squamous cells dramatically inhibited epithelial maturation compared with the wild-type. VSIG10L exhibited high expression in normal squamous esophagus with marked loss of expression in Barrett-associated lesions. Electron microscopy of squamous esophageal mucosa harboring the S631G variant revealed dilated intercellular spaces and reduced desmosomes. CONCLUSIONS AND RELEVANCE This study presents VSIG10L as a candidate familial Barrett esophagus susceptibility gene, with a putative role in maintaining normal esophageal homeostasis. Further research assessing VSIG10L function may reveal pathways important for esophageal maturation and the pathogenesis of Barrett esophagus and esophageal adenocarcinoma. PMID:27467440
Genetic approaches to understanding post-traumatic stress disorder

PubMed Central

Almli, Lynn M.; Fani, Negar; Smith, Alicia K.; Ressler, Kerry J.

2015-01-01

Post-traumatic stress disorder (PTSD) is increasingly recognized as both a disorder of enormous mental health and societal burden, but also as an anxiety disorder that may be particularly understandable from a scientific perspective. Specifically, PTSD can be conceptualized as a disorder of fear and stress dysregulation, and the neural circuitry underlying these pathways in both animals and humans are becoming increasingly well understood. Furthermore, PTSD is the only disorder in psychiatry in which the initiating factor, the trauma exposure, can be identified. Thus, the pathophysiology of the fear and stress response underlying PTSD can be examined and potentially interrupted. Twin studies have shown that the development of PTSD following a trauma is heritable, and that genetic risk factors may account for up to 30–40% of this heritability. A current goal is to understand the gene pathways that are associated with PTSD, and how those genes act on the fear/stress circuitry to mediate risk vs. resilience for PTSD. This review will examine gene pathways that have recently been analysed, primarily through candidate gene studies (including neuroimaging studies of candidate genes), in addition to genome-wide associations and the epigenetic regulation of PTSD. Future and on-going studies are utilizing larger and collaborative cohorts to identify novel gene candidates through genome-wide association and other powerful genomic approaches. Identification of PTSD biological pathways strengthens the hope of progress in the mechanistic understanding of a model psychiatric disorder and allows for the development of targeted treatments and interventions. PMID:24103155
Identification and Evolutionary Analysis of Potential Candidate Genes in a Human Eating Disorder.

PubMed

Sabbagh, Ubadah; Mullegama, Saman; Wyckoff, Gerald J

2016-01-01

The purpose of this study was to find genes linked with eating disorders and associated with both metabolic and neural systems. Our operating hypothesis was that there are genetic factors underlying some eating disorders resting in both those pathways. Specifically, we are interested in disorders that may rest in both sleep and metabolic function, generally called Night Eating Syndrome (NES). A meta-analysis of the Gene Expression Omnibus targeting the mammalian nervous system, sleep, and obesity studies was performed, yielding numerous genes of interest. Through a text-based analysis of the results, a number of potential candidate genes were identified. VGF, in particular, appeared to be relevant both to obesity and, broadly, to brain or neural development. VGF is a highly connected protein that interacts with numerous targets via proteolytically digested peptides. We examined VGF from an evolutionary perspective to determine whether other available evidence supported a role for the gene in human disease. We conclude that some of the already identified variants in VGF from human polymorphism studies may contribute to eating disorders and obesity. Our data suggest that there is enough evidence to warrant eGWAS and GWAS analysis of these genes in NES patients in a case-control study.
New mutations in BBS genes in small consanguineous families with Bardet-Biedl syndrome: Detection of candidate regions by homozygosity mapping

PubMed Central

Pereiro, Ines; Piñeiro-Gallego, Teresa; Baiget, Montserrat; Borrego, Salud; Ayuso, Carmen; Searby, Charles; Nishimura, Darryl

2010-01-01

Purpose Bardet-Biedl syndrome (BBS, OMIM 209900) is a rare multi-organ disorder in which BBS patients manifest a variable phenotype that includes retinal dystrophy, polydactyly, mental delay, obesity, and also reproductive tract and renal abnormalities. Mutations in 14 genes (BBS1–BBS14) are found in 70% of the patients, indicating that additional mutations in known and new BBS genes remain to be identified. Therefore, the molecular diagnosis of this complex disorder is a challenging task. Methods In this study we show the use of the genome-wide homozygosity mapping strategy in the mutation detection of nine Caucasian BBS families, eight of them consanguineous and one from the same geographic area with no proven consanguinity. Results We identified the disease-causing mutation in six of the families studied, five of which had novel sequence variants in BBS3, BBS6, and BBS12. This is the first null mutation reported in BBS3. Furthermore, this approach defined homozygous candidate regions that could harbor potential candidate genes for BBS in three of the families. Conclusions These findings further underline the importance of homozygosity mapping as a useful technology for diagnosis in small consanguineous families with a complex disease like BBS. PMID:20142850
Identification and validation of reference genes for quantitative real-time PCR normalization and its applications in lycium.

PubMed

Zeng, Shaohua; Liu, Yongliang; Wu, Min; Liu, Xiaomin; Shen, Xiaofei; Liu, Chunzhao; Wang, Ying

2014-01-01

Lycium barbarum and L. ruthenicum are extensively used as traditional Chinese medicinal plants. Next generation sequencing technology provides a powerful tool for analyzing transcriptomic profiles of gene expression in non-model species. Such gene expression can then be confirmed with quantitative real-time polymerase chain reaction (qRT-PCR). Therefore, use of systematically identified suitable reference genes is a prerequisite for obtaining reliable gene expression data. Here, we calculated the expression stability of 18 candidate reference genes across samples from different tissues and grown under salt stress using geNorm and NormFinder procedures. The geNorm-determined rank of reference genes was similar to those defined by NormFinder with some differences. Both procedures confirmed that the single most stable reference gene was ACNTIN1 for L. barbarum fruits, H2B1 for L. barbarum roots, and EF1α for L. ruthenicum fruits. PGK3, H2B2, and PGK3 were identified as the best stable reference genes for salt-treated L. ruthenicum leaves, roots, and stems, respectively. H2B1 and GAPDH1+PGK1 for L. ruthenicum and SAMDC2+H2B1 for L. barbarum were the best single and/or combined reference genes across all samples. Finally, expression of salt-responsive gene NAC, fruit ripening candidate gene LrPG, and anthocyanin genes were investigated to confirm the validity of the selected reference genes. Suitable reference genes identified in this study provide a foundation for accurately assessing gene expression and further better understanding of novel gene function to elucidate molecular mechanisms behind particular biological/physiological processes in Lycium.
Integration of biological data by kernels on graph nodes allows prediction of new genes involved in mitotic chromosome condensation

PubMed Central

Hériché, Jean-Karim; Lees, Jon G.; Morilla, Ian; Walter, Thomas; Petrova, Boryana; Roberti, M. Julia; Hossain, M. Julius; Adler, Priit; Fernández, José M.; Krallinger, Martin; Haering, Christian H.; Vilo, Jaak; Valencia, Alfonso; Ranea, Juan A.; Orengo, Christine; Ellenberg, Jan

2014-01-01

The advent of genome-wide RNA interference (RNAi)–based screens puts us in the position to identify genes for all functions human cells carry out. However, for many functions, assay complexity and cost make genome-scale knockdown experiments impossible. Methods to predict genes required for cell functions are therefore needed to focus RNAi screens from the whole genome on the most likely candidates. Although different bioinformatics tools for gene function prediction exist, they lack experimental validation and are therefore rarely used by experimentalists. To address this, we developed an effective computational gene selection strategy that represents public data about genes as graphs and then analyzes these graphs using kernels on graph nodes to predict functional relationships. To demonstrate its performance, we predicted human genes required for a poorly understood cellular function—mitotic chromosome condensation—and experimentally validated the top 100 candidates with a focused RNAi screen by automated microscopy. Quantitative analysis of the images demonstrated that the candidates were indeed strongly enriched in condensation genes, including the discovery of several new factors. By combining bioinformatics prediction with experimental validation, our study shows that kernels on graph nodes are powerful tools to integrate public biological data and predict genes involved in cellular functions of interest. PMID:24943848
[Discovery of the target genes inhibited by formic acid in Candida shehatae].

PubMed

Cai, Peng; Xiong, Xujie; Xu, Yong; Yong, Qiang; Zhu, Junjun; Shiyuan, Yu

2014-01-04

At transcriptional level, the inhibitory effects of formic acid was investigated on Candida shehatae, a model yeast strain capable of fermenting xylose to ethanol. Thereby, the target genes were regulated by formic acid and the transcript profiles were discovered. On the basis of the transcriptome data of C. shehatae metabolizing glucose and xylose, the genes responsible for ethanol fermentation were chosen as candidates by the combined method of yeast metabolic pathway analysis and manual gene BLAST search. These candidates were then quantitatively detected by RQ-PCR technique to find the regulating genes under gradient doses of formic acid. By quantitative analysis of 42 candidate genes, we finally identified 10 and 5 genes as markedly down-regulated and up-regulated targets by formic acid, respectively. With regard to gene transcripts regulated by formic acid in C. shehatae, the markedly down-regulated genes ranking declines as follows: xylitol dehydrogenase (XYL2), acetyl-CoA synthetase (ACS), ribose-5-phosphate isomerase (RKI), transaldolase (TAL), phosphogluconate dehydrogenase (GND1), transketolase (TKL), glucose-6-phosphate dehydrogenase (ZWF1), xylose reductase (XYL1), pyruvate dehydrogenase (PDH) and pyruvate decarboxylase (PDC); and a declining rank for up-regulated gens as follows: fructose-bisphosphate aldolase (ALD), glucokinase (GLK), malate dehydrogenase (MDH), 6-phosphofructokinase (PFK) and alcohol dehydrogenase (ADH).
Genome-Wide Association Study of Seed Dormancy and the Genomic Consequences of Improvement Footprints in Rice (Oryza sativa L.)

PubMed Central

Lu, Qing; Niu, Xiaojun; Zhang, Mengchen; Wang, Caihong; Xu, Qun; Feng, Yue; Yang, Yaolong; Wang, Shan; Yuan, Xiaoping; Yu, Hanyong; Wang, Yiping; Chen, Xiaoping; Liang, Xuanqiang; Wei, Xinghua

2018-01-01

Seed dormancy is an important agronomic trait affecting grain yield and quality because of pre-harvest germination and is influenced by both environmental and genetic factors. However, our knowledge of the factors controlling seed dormancy remains limited. To better reveal the molecular mechanism underlying this trait, a genome-wide association study was conducted in an indica-only population consisting of 453 accessions genotyped using 5,291 SNPs. Nine known and new significant SNPs were identified on eight chromosomes. These lead SNPs explained 34.9% of the phenotypic variation, and four of them were designed as dCAPS markers in the hope of accelerating molecular breeding. Moreover, a total of 212 candidate genes was predicted and eight candidate genes showed plant tissue-specific expression in expression profile data from different public bioinformatics databases. In particular, LOC_Os03g10110, which had a maize homolog involved in embryo development, was identified as a candidate regulator for further biological function investigations. Additionally, a polymorphism information content ratio method was used to screen improvement footprints and 27 selective sweeps were identified, most of which harbored domestication-related genes. Further studies suggested that three significant SNPs were adjacent to the candidate selection signals, supporting the accuracy of our genome-wide association study (GWAS) results. These findings show that genome-wide screening for selective sweeps can be used to identify new improvement-related DNA regions, although the phenotypes are unknown. This study enhances our knowledge of the genetic variation in seed dormancy, and the new dormancy-associated SNPs will provide real benefits in molecular breeding. PMID:29354150
How immunogenetically different are domestic pigs from wild boars: a perspective from single-nucleotide polymorphisms of 19 immunity-related candidate genes.

PubMed

Chen, Shanyuan; Gomes, Rui; Costa, Vânia; Santos, Pedro; Charneca, Rui; Zhang, Ya-ping; Liu, Xue-hong; Wang, Shao-qing; Bento, Pedro; Nunes, Jose-Luis; Buzgó, József; Varga, Gyula; Anton, István; Zsolnai, Attila; Beja-Pereira, Albano

2013-10-01

The coexistence of wild boars and domestic pigs across Eurasia makes it feasible to conduct comparative genetic or genomic analyses for addressing how genetically different a domestic species is from its wild ancestor. To test whether there are differences in patterns of genetic variability between wild and domestic pigs at immunity-related genes and to detect outlier loci putatively under selection that may underlie differences in immune responses, here we analyzed 54 single-nucleotide polymorphisms (SNPs) of 19 immunity-related candidate genes on 11 autosomes in three pairs of wild boar and domestic pig populations from China, Iberian Peninsula, and Hungary. Our results showed no statistically significant differences in allele frequency and heterozygosity across SNPs between three pairs of wild and domestic populations. This observation was more likely due to the widespread and long-lasting gene flow between wild boars and domestic pigs across Eurasia. In addition, we detected eight coding SNPs from six genes as outliers being under selection consistently by three outlier tests (BayeScan2.1, FDIST2, and Arlequin3.5). Among four non-synonymous outlier SNPs, one from TLR4 gene was identified as being subject to positive (diversifying) selection and three each from CD36, IFNW1, and IL1B genes were suggested as under balancing selection. All of these four non-synonymous variants were predicted as being benign by PolyPhen-2. Our results were supported by other independent lines of evidence for positive selection or balancing selection acting on these four immune genes (CD36, IFNW1, IL1B, and TLR4). Our study showed an example applying a candidate gene approach to identify functionally important mutations (i.e., outlier loci) in wild and domestic pigs for subsequent functional experiments.
Modulators of the microRNA biogenesis pathway via arrayed lentiviral enabled RNAi screening for drug and biomarker discovery

PubMed Central

Shum, David; Bhinder, Bhavneet; Djaballah, Hakim

2013-01-01

MicroRNAs (miRNAs) are small endogenous and conserved non-coding RNA molecules that regulate gene expression. Although the first miRNA was discovered well over sixteen years ago, little is known about their biogenesis and it is only recently that we have begun to understand their scope and diversity. For this purpose, we performed an RNAi screen aimed at identifying genes involved in their biogenesis pathway with a potential use as biomarkers. Using a previously developed miRNA 21 (miR-21) EGFP-based biosensor cell based assay monitoring green fluorescence enhancements, we performed an arrayed short hairpin RNA (shRNA) screen against a lentiviral particle ready TRC1 library covering 16,039 genes in 384-well plate format, and interrogating the genome one gene at a time building a panoramic view of endogenous miRNA activity. Using the BDA method for RNAi data analysis, we nominate 497 gene candidates the knockdown of which increased the EGFP fluorescence and yielding an initial hit rate of 3.09%; of which only 22, with reported validated clones, are deemed high-confidence gene candidates. An unexpected and surprising result was that only DROSHA was identified as a hit out of the seven core essential miRNA biogenesis genes; suggesting that perhaps intracellular shRNA processing into the correct duplex may be cell dependent and with differential outcome. Biological classification revealed several major control junctions among them genes involved in transport and vesicular trafficking. In summary, we report on 22 high confidence gene candidate regulators of miRNA biogenesis with potential use in drug and biomarker discovery. PMID:23977983
Genomic convergence to identify candidate genes for Alzheimer disease on chromosome 10

PubMed Central

Liang, Xueying; Slifer, Michael; Martin, Eden R.; Schnetz-Boutaud, Nathalie; Bartlett, Jackie; Anderson, Brent; Züchner, Stephan; Gwirtsman, Harry; Gilbert, John R.; Pericak-Vance, Margaret A.; Haines, Jonathan L.

2009-01-01

A broad region of chromosome 10 (chr10) has engendered continued interest in the etiology of late-onset Alzheimer Disease (LOAD) from both linkage and candidate gene studies. However, there is a very extensive heterogeneity on chr10. We converged linkage analysis and gene expression data using the concept of genomic convergence that suggests that genes showing positive results across multiple different data types are more likely to be involved in AD. We identified and examined 28 genes on chr10 for association with AD in a Caucasian case-control dataset of 506 cases and 558 controls with substantial clinical information. The cases were all LOAD (minimum age at onset ≥ 60 years). Both single marker and haplotypic associations were tested in the overall dataset and 8 subsets defined by age, gender, ApoE and clinical status. PTPLA showed allelic, genotypic and haplotypic association in the overall dataset. SORCS1 was significant in the overall data sets (p=0.0025) and most significant in the female subset (allelic association p=0.00002, a 3-locus haplotype had p=0.0005). Odds Ratio of SORCS1 in the female subset was 1.7 (p<0.0001). SORCS1 is an interesting candidate gene involved in the Aβ pathway. Therefore, genetic variations in PTPLA and SORCS1 may be associated and have modest effect to the risk of AD by affecting Aβ pathway. The replication of the effect of these genes in different study populations and search for susceptible variants and functional studies of these genes are necessary to get a better understanding of the roles of the genes in Alzheimer disease. PMID:19241460
Selection of Valid Reference Genes for Reverse Transcription Quantitative PCR Analysis in Heliconius numata (Lepidoptera: Nymphalidae)

PubMed Central

Chouteau, Mathieu; Whibley, Annabel; Joron, Mathieu; Llaurens, Violaine

2016-01-01

Identifying the genetic basis of adaptive variation is challenging in non-model organisms and quantitative real time PCR. is a useful tool for validating predictions regarding the expression of candidate genes. However, comparing expression levels in different conditions requires rigorous experimental design and statistical analyses. Here, we focused on the neotropical passion-vine butterflies Heliconius, non-model species studied in evolutionary biology for their adaptive variation in wing color patterns involved in mimicry and in the signaling of their toxicity to predators. We aimed at selecting stable reference genes to be used for normalization of gene expression data in RT-qPCR analyses from developing wing discs according to the minimal guidelines described in Minimum Information for publication of Quantitative Real-Time PCR Experiments (MIQE). To design internal RT-qPCR controls, we studied the stability of expression of nine candidate reference genes (actin, annexin, eF1α, FK506BP, PolyABP, PolyUBQ, RpL3, RPS3A, and tubulin) at two developmental stages (prepupal and pupal) using three widely used programs (GeNorm, NormFinder and BestKeeper). Results showed that, despite differences in statistical methods, genes RpL3, eF1α, polyABP, and annexin were stably expressed in wing discs in late larval and pupal stages of Heliconius numata. This combination of genes may be used as a reference for a reliable study of differential expression in wings for instance for genes involved in important phenotypic variation, such as wing color pattern variation. Through this example, we provide general useful technical recommendations as well as relevant statistical strategies for evolutionary biologists aiming to identify candidate-genes involved adaptive variation in non-model organisms. PMID:27271971
Identification of Linkages between EDCs in Personal Care Products and Breast Cancer through Data Integration Combined with Gene Network Analysis

PubMed Central

Kim, Jongwoon

2017-01-01

Approximately 1000 chemicals have been reported to possibly have endocrine disrupting effects, some of which are used in consumer products, such as personal care products (PCPs) and cosmetics. We conducted data integration combined with gene network analysis to: (i) identify causal molecular mechanisms between endocrine disrupting chemicals (EDCs) used in PCPs and breast cancer; and (ii) screen candidate EDCs associated with breast cancer. Among EDCs used in PCPs, four EDCs having correlation with breast cancer were selected, and we curated 27 common interacting genes between those EDCs and breast cancer to perform the gene network analysis. Based on the gene network analysis, ESR1, TP53, NCOA1, AKT1, and BCL6 were found to be key genes to demonstrate the molecular mechanisms of EDCs in the development of breast cancer. Using GeneMANIA, we additionally predicted 20 genes which could interact with the 27 common genes. In total, 47 genes combining the common and predicted genes were functionally grouped with the gene ontology and KEGG pathway terms. With those genes, we finally screened candidate EDCs for their potential to increase breast cancer risk. This study highlights that our approach can provide insights to understand mechanisms of breast cancer and identify potential EDCs which are in association with breast cancer. PMID:28973975
Genetic control of functional traits related to photosynthesis and water use efficiency in Pinus pinaster Ait. drought response: integration of genome annotation, allele association and QTL detection for candidate gene identification.

PubMed

de Miguel, Marina; Cabezas, José-Antonio; de María, Nuria; Sánchez-Gómez, David; Guevara, María-Ángeles; Vélez, María-Dolores; Sáez-Laguna, Enrique; Díaz, Luis-Manuel; Mancha, Jose-Antonio; Barbero, María-Carmen; Collada, Carmen; Díaz-Sala, Carmen; Aranda, Ismael; Cervera, María-Teresa

2014-06-12

Understanding molecular mechanisms that control photosynthesis and water use efficiency in response to drought is crucial for plant species from dry areas. This study aimed to identify QTL for these traits in a Mediterranean conifer and tested their stability under drought. High density linkage maps for Pinus pinaster were used in the detection of QTL for photosynthesis and water use efficiency at three water irrigation regimes. A total of 28 significant and 27 suggestive QTL were found. QTL detected for photochemical traits accounted for the higher percentage of phenotypic variance. Functional annotation of genes within the QTL suggested 58 candidate genes for the analyzed traits. Allele association analysis in selected candidate genes showed three SNPs located in a MYB transcription factor that were significantly associated with efficiency of energy capture by open PSII reaction centers and specific leaf area. The integration of QTL mapping of functional traits, genome annotation and allele association yielded several candidate genes involved with molecular control of photosynthesis and water use efficiency in response to drought in a conifer species. The results obtained highlight the importance of maintaining the integrity of the photochemical machinery in P. pinaster drought response.

Microarray-assisted fine-mapping of quantitative trait loci for cold tolerance in rice.

PubMed

Liu, Fengxia; Xu, Wenying; Song, Qian; Tan, Lubin; Liu, Jiayong; Zhu, Zuofeng; Fu, Yongcai; Su, Zhen; Sun, Chuanqing

2013-05-01

Many important agronomic traits, including cold stress resistance, are complex and controlled by quantitative trait loci (QTLs). Isolation of these QTLs will greatly benefit the agricultural industry but it is a challenging task. This study explored an integrated strategy by combining microarray with QTL-mapping in order to identify cold-tolerant QTLs from a cold-tolerant variety IL112 at early-seedling stage. All the early seedlings of IL112 survived normally for 9 d at 4-5°C, while Guichao2 (GC2), an indica cultivar, died after 4 d under the same conditions. Using the F2:3 population derived from the progeny of GC2 and IL112, we identified seven QTLs for cold tolerance. Furthermore, we performed Affymetrix rice whole-genome array hybridization and obtained the expression profiles of IL112 and GC2 under both low-temperature and normal conditions. Four genes were selected as cold QTL-related candidates, based on microarray data mining and QTL-mapping. One candidate gene, LOC_Os07g22494, was shown to be highly associated with cold tolerance in a number of rice varieties and in the F2:3 population, and its overexpression transgenic rice plants displayed strong tolerance to low temperature at early-seedling stage. The results indicated that overexpression of this gene (LOC_Os07g22494) could increase cold tolerance in rice seedlings. Therefore, this study provides a promising strategy for identifying candidate genes in defined QTL regions.
Identification of essential genes in Streptococcus pneumoniae by allelic replacement mutagenesis.

PubMed

Song, Jae-Hoon; Ko, Kwan Soo; Lee, Ji-Young; Baek, Jin Yang; Oh, Won Sup; Yoon, Ha Sik; Jeong, Jin-Yong; Chun, Jongsik

2005-06-30

To find potential targets of novel antimicrobial agents, we identified essential genes of Streptococcus pneumoniae using comparative genomics and allelic replacement mutagenesis. We compared the genome of S. pneumoniae R6 with those of Bacillus subtilis, Enterococcus faecalis, Escherichia coli, and Staphylococcus aureus, and selected 693 candidate target genes with > 40% amino acid sequence identity to the corresponding genes in at least two of the other species. The 693 genes were disrupted and 133 were found to be essential for growth. Of these, 32 encoded proteins of unknown function, and we were able to identify orthologues of 22 of these genes by genomic comparisons. The experimental method used in this study is easy to perform, rapid and efficient for identifying essential genes of bacterial pathogens.
Candidate Gene Approach for Parasite Resistance in Sheep – Variation in Immune Pathway Genes and Association with Fecal Egg Count

PubMed Central

Periasamy, Kathiravan; Pichler, Rudolf; Poli, Mario; Cristel, Silvina; Cetrá, Bibiana; Medus, Daniel; Basar, Muladno; A. K., Thiruvenkadan; Ramasamy, Saravanan; Ellahi, Masroor Babbar; Mohammed, Faruque; Teneva, Atanaska; Shamsuddin, Mohammed; Podesta, Mario Garcia; Diallo, Adama

2014-01-01

Sheep chromosome 3 (Oar3) has the largest number of QTLs reported to be significantly associated with resistance to gastro-intestinal nematodes. This study aimed to identify single nucleotide polymorphisms (SNPs) within candidate genes located in sheep chromosome 3 as well as genes involved in major immune pathways. A total of 41 SNPs were identified across 38 candidate genes in a panel of unrelated sheep and genotyped in 713 animals belonging to 22 breeds across Asia, Europe and South America. The variations and evolution of immune pathway genes were assessed in sheep populations across these macro-environmental regions that significantly differ in the diversity and load of pathogens. The mean minor allele frequency (MAF) did not vary between Asian and European sheep reflecting the absence of ascertainment bias. Phylogenetic analysis revealed two major clusters with most of South Asian, South East Asian and South West Asian breeds clustering together while European and South American sheep breeds clustered together distinctly. Analysis of molecular variance revealed strong phylogeographic structure at loci located in immune pathway genes, unlike microsatellite and genome wide SNP markers. To understand the influence of natural selection processes, SNP loci located in chromosome 3 were utilized to reconstruct haplotypes, the diversity of which showed significant deviations from selective neutrality. Reduced Median network of reconstructed haplotypes showed balancing selection in force at these loci. Preliminary association of SNP genotypes with phenotypes recorded 42 days post challenge revealed significant differences (P<0.05) in fecal egg count, body weight change and packed cell volume at two, four and six SNP loci respectively. In conclusion, the present study reports strong phylogeographic structure and balancing selection operating at SNP loci located within immune pathway genes. Further, SNP loci identified in the study were found to have potential for future large scale association studies in naturally exposed sheep populations. PMID:24533078
Identification of homogeneous genetic architecture of multiple genetically correlated traits by block clustering of genome-wide associations.

PubMed

Gupta, Mayetri; Cheung, Ching-Lung; Hsu, Yi-Hsiang; Demissie, Serkalem; Cupples, L Adrienne; Kiel, Douglas P; Karasik, David

2011-06-01

Genome-wide association studies (GWAS) using high-density genotyping platforms offer an unbiased strategy to identify new candidate genes for osteoporosis. It is imperative to be able to clearly distinguish signal from noise by focusing on the best phenotype in a genetic study. We performed GWAS of multiple phenotypes associated with fractures [bone mineral density (BMD), bone quantitative ultrasound (QUS), bone geometry, and muscle mass] with approximately 433,000 single-nucleotide polymorphisms (SNPs) and created a database of resulting associations. We performed analysis of GWAS data from 23 phenotypes by a novel modification of a block clustering algorithm followed by gene-set enrichment analysis. A data matrix of standardized regression coefficients was partitioned along both axes--SNPs and phenotypes. Each partition represents a distinct cluster of SNPs that have similar effects over a particular set of phenotypes. Application of this method to our data shows several SNP-phenotype connections. We found a strong cluster of association coefficients of high magnitude for 10 traits (BMD at several skeletal sites, ultrasound measures, cross-sectional bone area, and section modulus of femoral neck and shaft). These clustered traits were highly genetically correlated. Gene-set enrichment analyses indicated the augmentation of genes that cluster with the 10 osteoporosis-related traits in pathways such as aldosterone signaling in epithelial cells, role of osteoblasts, osteoclasts, and chondrocytes in rheumatoid arthritis, and Parkinson signaling. In addition to several known candidate genes, we also identified PRKCH and SCNN1B as potential candidate genes for multiple bone traits. In conclusion, our mining of GWAS results revealed the similarity of association results between bone strength phenotypes that may be attributed to pleiotropic effects of genes. This knowledge may prove helpful in identifying novel genes and pathways that underlie several correlated phenotypes, as well as in deciphering genetic and phenotypic modularity underlying osteoporosis risk. Copyright © 2011 American Society for Bone and Mineral Research.
Genetic factors regulating lung vasculature and immune cell functions associate with resistance to pneumococcal infection.

PubMed

Jonczyk, Magda S; Simon, Michelle; Kumar, Saumya; Fernandes, Vitor E; Sylvius, Nicolas; Mallon, Ann-Marie; Denny, Paul; Andrew, Peter W

2014-01-01

Streptococcus pneumoniae is an important human pathogen responsible for high mortality and morbidity worldwide. The susceptibility to pneumococcal infections is controlled by as yet unknown genetic factors. To elucidate these factors could help to develop new medical treatments and tools to identify those most at risk. In recent years genome wide association studies (GWAS) in mice and humans have proved successful in identification of causal genes involved in many complex diseases for example diabetes, systemic lupus or cholesterol metabolism. In this study a GWAS approach was used to map genetic loci associated with susceptibility to pneumococcal infection in 26 inbred mouse strains. As a result four candidate QTLs were identified on chromosomes 7, 13, 18 and 19. Interestingly, the QTL on chromosome 7 was located within S. pneumoniae resistance QTL (Spir1) identified previously in a linkage study of BALB/cOlaHsd and CBA/CaOlaHsd F2 intercrosses. We showed that only a limited number of genes encoded within the QTLs carried phenotype-associated polymorphisms (22 genes out of several hundred located within the QTLs). These candidate genes are known to regulate TGFβ signalling, smooth muscle and immune cells functions. Interestingly, our pulmonary histopathology and gene expression data demonstrated, lung vasculature plays an important role in resistance to pneumococcal infection. Therefore we concluded that the cumulative effect of these candidate genes on vasculature and immune cells functions as contributory factors in the observed differences in susceptibility to pneumococcal infection. We also propose that TGFβ-mediated regulation of fibroblast differentiation plays an important role in development of invasive pneumococcal disease. Gene expression data submitted to the NCBI Gene Expression Omnibus Accession No: GSE49533 SNP data submitted to NCBI dbSNP Short Genetic Variation http://www.ncbi.nlm.nih.gov/projects/SNP/snp_viewTable.cgi?handle=MUSPNEUMONIA.
Gene expression atlas of pigeonpea and its application to gain insights into genes associated with pollen fertility implicated in seed formation.

PubMed

Pazhamala, Lekha T; Purohit, Shilp; Saxena, Rachit K; Garg, Vanika; Krishnamurthy, L; Verdier, Jerome; Varshney, Rajeev K

2017-04-01

Pigeonpea (Cajanus cajan) is an important grain legume of the semi-arid tropics, mainly used for its protein rich seeds. To link the genome sequence information with agronomic traits resulting from specific developmental processes, a Cajanus cajan gene expression atlas (CcGEA) was developed using the Asha genotype. Thirty tissues/organs representing developmental stages from germination to senescence were used to generate 590.84 million paired-end RNA-Seq data. The CcGEA revealed a compendium of 28 793 genes with differential, specific, spatio-temporal and constitutive expression during various stages of development in different tissues. As an example to demonstrate the application of the CcGEA, a network of 28 flower-related genes analysed for cis-regulatory elements and splicing variants has been identified. In addition, expression analysis of these candidate genes in male sterile and male fertile genotypes suggested their critical role in normal pollen development leading to seed formation. Gene network analysis also identified two regulatory genes, a pollen-specific SF3 and a sucrose-proton symporter, that could have implications for improvement of agronomic traits such as seed production and yield. In conclusion, the CcGEA provides a valuable resource for pigeonpea to identify candidate genes involved in specific developmental processes and to understand the well-orchestrated growth and developmental process in this resilient crop. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Parsing the genetic heterogeneity of chromosome 12q susceptibility genes for Alzheimer disease by family-based association analysis.

PubMed

Lin, Ping-I; Martin, Eden R; Browning-Large, Carrie A; Schmechel, Donald E; Welsh-Bohmer, Kathleen A; Doraiswamy, P Murali; Gilbert, John R; Haines, Jonathan L; Pericak-Vance, Margaret A

2006-07-01

Previous linkage studies have suggested that chromosome 12 may harbor susceptibility genes for late-onset Alzheimer disease (LOAD). No risk genes on chromosome 12 have been conclusively identified yet. We have reported that the linkage evidence for LOAD in a 12q region was significantly increased in autopsy-confirmed families particularly for those showing no linkage to alpha-T catenin gene, a LOAD candidate gene on chromosome 10 [LOD score increased from 0.1 in the autopsy-confirmed subset to 4.19 in the unlinked subset (optimal subset); p<0.0001 for the increase in LOD score], indicating a one-LOD support interval spanning 6 Mb. To further investigate this finding and to identify potential candidate LOAD risk genes for follow-up analysis, we analyzed 99 single nucleotide polymorphisms in this region, for the overall sample, the autopsy-confirmed subset, and the optimal subset, respectively, for comparison. We saw no significant association (p<0.01) in the overall sample. In the autopsy-confirmed subset, the best finding was obtained in the activation transcription factor 7 (ATF7) gene (single-locus association, p=0.002; haplotype association global, p=0.007). In the optimal subset, the best finding was obtained in the hypothetical protein FLJ20436 (FLJ20436) gene (single-locus association, p=0.0026). These results suggest that subset and covariate analyses may be one approach to help identify novel susceptibility genes on chromosome 12q for LOAD.
Genome-Wide Identification and Expression Analyses of Aquaporin Gene Family during Development and Abiotic Stress in Banana

PubMed Central

Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang

2015-01-01

Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars. PMID:26307965
Combined analysis of DNA methylome and transcriptome reveal novel candidate genes with susceptibility to bovine Staphylococcus aureus subclinical mastitis.

PubMed

Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying

2016-07-14

Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis.
Combined analysis of DNA methylome and transcriptome reveal novel candidate genes with susceptibility to bovine Staphylococcus aureus subclinical mastitis

PubMed Central

Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying

2016-01-01

Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis. PMID:27411928
No novel, high penetrant gene might remain to be found in Japanese patients with unknown MODY.

PubMed

Horikawa, Yukio; Hosomichi, Kazuyoshi; Enya, Mayumi; Ishiura, Hiroyuki; Suzuki, Yutaka; Tsuji, Shoji; Sugano, Sumio; Inoue, Ituro; Takeda, Jun

2018-07-01

MODY 5 and 6 have been shown to be low-penetrant MODYs. As the genetic background of unknown MODY is assumed to be similar, a new analytical strategy is applied here to elucidate genetic predispositions to unknown MODY. We examined to find whether there are major MODY gene loci remaining to be identified using SNP linkage analysis in Japanese. Whole-exome sequencing was performed with seven families with typical MODY. Candidates for novel MODY genes were examined combined with in silico network analysis. Some peaks were found only in either parametric or non-parametric analysis; however, none of these peaks showed a LOD score greater than 3.7, which is approved to be the significance threshold of evidence for linkage. Exome sequencing revealed that three mutated genes were common among 3 families and 42 mutated genes were common in two families. Only one of these genes, MYO5A, having rare amino acid mutations p.R849Q and p.V1601G, was involved in the biological network of known MODY genes through the intermediary of the INS. Although only one promising candidate gene, MYO5A, was identified, no novel, high penetrant MODY genes might remain to be found in Japanese MODY.
Constitutional downregulation of SEMA5A expression in autism.

PubMed

Melin, M; Carlsson, B; Anckarsater, H; Rastam, M; Betancur, C; Isaksson, A; Gillberg, C; Dahl, N

2006-01-01

There is strong evidence for the importance of genetic factors in idiopathic autism. The results from independent twin and family studies suggest that the disorder is caused by the action of several genes, possibly acting epistatically. We have used cDNA microarray technology for the identification of constitutional changes in the gene expression profile associated with idiopathic autism. Samples were obtained and analyzed from 6 affected subjects belonging to multiplex autism families and from 6 healthy controls. We assessed the expression levels for approximately 7,700 genes by cDNA microarrays using mRNA derived from Epstein-Barr virus-transformed B lymphocytes. The microarray data were analyzed in order to identify up- or downregulation of specific genes. A common pattern with nine downregulated genes was identified among samples derived from individuals with autism when compared to controls. Four of these nine genes encode proteins involved in biological processes associated with brain function or the immune system, and are consequently considered as candidates for genes associated with autism. Quantitative real-time PCR confirms the downregulation of the gene encoding SEMA5A, a protein involved in axonal guidance. Epstein-Barr virus should be considered as a possible source for altered expression, but our consistent results make us suggest SEMA5A as a candidate gene in the etiology of idiopathic autism.
Constitutional downregulation of SEMA5A expression in autism

PubMed Central

Melin, Malin; Carlsson, Birgit; Anckarsäter, Henrik; Rastam, Maria; Betancur, Catalina; Isaksson, Anders; Gillberg, Christopher; Dahl, Niklas

2006-01-01

There is strong evidence for the importance of genetic factors in idiopathic autism. The results from independent twin and family studies suggest that the disorder is caused by the action of several genes, possibly acting epistatically. We have used cDNA microarray technology for the identification of constitutional changes in the gene expression profile associated with idiopathic autism. Samples were obtained and analyzed from six affected subjects belonging to multiplex autism families and from six healthy controls. We assessed the expression levels for approximately 7,700 genes by cDNA microarrays using mRNA derived from Epstein Barr virus (EBV)-transformed B-lymphocytes. The microarray data was analyzed in order to identify up- or down-regulation of specific genes. A common pattern with nine down-regulated genes was identified among samples derived from individuals with autism when compared to controls. Four of these nine genes encode proteins involved in biological processes associated with brain function or the immune system, and are consequently considered as candidates for genes associated with autism. Quantitative realtime PCR confirms the down-regulation of the gene encoding SEMA5A, a protein involved in axonal guidance. EBV should be considered as a possible source for altered expression but our consistent results make us suggest SEMA5A a candidate gene in the etiology of idiopathic autism. PMID:17028446
Whole-genome resequencing of 292 pigeonpea accessions identifies genomic regions associated with domestication and agronomic traits.

PubMed

Varshney, Rajeev K; Saxena, Rachit K; Upadhyaya, Hari D; Khan, Aamir W; Yu, Yue; Kim, Changhoon; Rathore, Abhishek; Kim, Dongseon; Kim, Jihun; An, Shaun; Kumar, Vinay; Anuradha, Ghanta; Yamini, Kalinati Narasimhan; Zhang, Wei; Muniswamy, Sonnappa; Kim, Jong-So; Penmetsa, R Varma; von Wettberg, Eric; Datta, Swapan K

2017-07-01

Pigeonpea (Cajanus cajan), a tropical grain legume with low input requirements, is expected to continue to have an important role in supplying food and nutritional security in developing countries in Asia, Africa and the tropical Americas. From whole-genome resequencing of 292 Cajanus accessions encompassing breeding lines, landraces and wild species, we characterize genome-wide variation. On the basis of a scan for selective sweeps, we find several genomic regions that were likely targets of domestication and breeding. Using genome-wide association analysis, we identify associations between several candidate genes and agronomically important traits. Candidate genes for these traits in pigeonpea have sequence similarity to genes functionally characterized in other plants for flowering time control, seed development and pod dehiscence. Our findings will allow acceleration of genetic gains for key traits to improve yield and sustainability in pigeonpea.
DRD4 and DAT1 in ADHD: Functional neurobiology to pharmacogenetics

PubMed Central

Turic, Darko; Swanson, James; Sonuga-Barke, Edmund

2010-01-01

Attention deficit/hyperactivity disorder (ADHD) is a common and potentially very impairing neuropsychiatric disorder of childhood. Statistical genetic studies of twins have shown ADHD to be highly heritable, with the combination of genes and gene by environment interactions accounting for around 80% of phenotypic variance. The initial molecular genetic studies where candidates were selected because of the efficacy of dopaminergic compounds in the treatment of ADHD were remarkably successful and provided strong evidence for the role of DRD4 and DAT1 variants in the pathogenesis of ADHD. However, the recent application of non-candidate gene strategies (eg, genome-wide association scans) has failed to identify additional genes with substantial genetic main effects, and the effects for DRD4 and DAT1 have not been replicated. This is the usual pattern observed for most other physical and mental disorders evaluated with current state-of-the-art methods. In this paper we discuss future strategies for genetic studies in ADHD, highlighting both the pitfalls and possible solutions relating to candidate gene studies, genome-wide studies, defining the phenotype, and statistical approaches. PMID:23226043
Islander: A database of precisely mapped genomic islands in tRNA and tmRNA genes

DOE PAGES

Hudson, Corey M.; Lau, Britney Y.; Williams, Kelly P.

2014-11-05

Genomic islands are mobile DNAs that are major agents of bacterial and archaeal evolution. Integration into prokaryotic chromosomes usually occurs site-specifically at tRNA or tmRNA gene (together, tDNA) targets, catalyzed by tyrosine integrases. This splits the target gene, yet sequences within the island restore the disrupted gene; the regenerated target and its displaced fragment precisely mark the endpoints of the island. We applied this principle to search for islands in genomic DNA sequences. Our algorithm identifies tDNAs, finds fragments of those tDNAs in the same replicon and removes unlikely candidate islands through a series of filters. A search for islandsmore » in 2168 whole prokaryotic genomes produced 3919 candidates. The website Islander (recently moved to http://bioinformatics.sandia.gov/islander/) presents these precisely mapped candidate islands, the gene content and the island sequence. The algorithm further insists that each island encode an integrase, and attachment site sequence identity is carefully noted; therefore, the database also serves in the study of integrase site-specificity and its evolution.« less
Genetic findings in anorexia and bulimia nervosa.

PubMed

Hinney, Anke; Scherag, Susann; Hebebrand, Johannes

2010-01-01

Anorexia nervosa (AN) and bulimia nervosa (BN) are complex disorders associated with disordered eating behavior. Heritability estimates derived from twin and family studies are high, so that substantial genetic influences on the etiology can be assumed for both. As the monoaminergic neurotransmitter systems are involved in eating disorders (EDs), candidate gene studies have centered on related genes; additionally, genes relevant for body weight regulation have been considered as candidates. Unfortunately, this approach has yielded very few positive results; confirmed associations or findings substantiated in meta-analyses are scant. None of these associations can be considered unequivocally validated. Systematic genome-wide approaches have been performed to identify genes with no a priori evidence for their relevance in EDs. Family-based scans revealed linkage peaks in single chromosomal regions for AN and BN. Analyses of candidate genes in one of these regions led to the identification of genetic variants associated with AN. Currently, an international consortium is conducting a genome-wide association study for AN, which will hopefully lead to the identification of the first genome-wide significant markers. Copyright © 2010 Elsevier Inc. All rights reserved.
Transcriptome and proteome data reveal candidate genes for pollinator attraction in sexually deceptive orchids.

PubMed

Sedeek, Khalid E M; Qi, Weihong; Schauer, Monica A; Gupta, Alok K; Poveda, Lucy; Xu, Shuqing; Liu, Zhong-Jian; Grossniklaus, Ueli; Schiestl, Florian P; Schlüter, Philipp M

2013-01-01

Sexually deceptive orchids of the genus Ophrys mimic the mating signals of their pollinator females to attract males as pollinators. This mode of pollination is highly specific and leads to strong reproductive isolation between species. This study aims to identify candidate genes responsible for pollinator attraction and reproductive isolation between three closely related species, O. exaltata, O. sphegodes and O. garganica. Floral traits such as odour, colour and morphology are necessary for successful pollinator attraction. In particular, different odour hydrocarbon profiles have been linked to differences in specific pollinator attraction among these species. Therefore, the identification of genes involved in these traits is important for understanding the molecular basis of pollinator attraction by sexually deceptive orchids. We have created floral reference transcriptomes and proteomes for these three Ophrys species using a combination of next-generation sequencing (454 and Solexa), Sanger sequencing, and shotgun proteomics (tandem mass spectrometry). In total, 121 917 unique transcripts and 3531 proteins were identified. This represents the first orchid proteome and transcriptome from the orchid subfamily Orchidoideae. Proteome data revealed proteins corresponding to 2644 transcripts and 887 proteins not observed in the transcriptome. Candidate genes for hydrocarbon and anthocyanin biosynthesis were represented by 156 and 61 unique transcripts in 20 and 7 genes classes, respectively. Moreover, transcription factors putatively involved in the regulation of flower odour, colour and morphology were annotated, including Myb, MADS and TCP factors. Our comprehensive data set generated by combining transcriptome and proteome technologies allowed identification of candidate genes for pollinator attraction and reproductive isolation among sexually deceptive orchids. This includes genes for hydrocarbon and anthocyanin biosynthesis and regulation, and the development of floral morphology. These data will serve as an invaluable resource for research in orchid floral biology, enabling studies into the molecular mechanisms of pollinator attraction and speciation.
Transcriptome and Proteome Data Reveal Candidate Genes for Pollinator Attraction in Sexually Deceptive Orchids

PubMed Central

Sedeek, Khalid E. M.; Qi, Weihong; Schauer, Monica A.; Gupta, Alok K.; Poveda, Lucy; Xu, Shuqing; Liu, Zhong-Jian; Grossniklaus, Ueli; Schiestl, Florian P.; Schlüter, Philipp M.

2013-01-01

Background Sexually deceptive orchids of the genus Ophrys mimic the mating signals of their pollinator females to attract males as pollinators. This mode of pollination is highly specific and leads to strong reproductive isolation between species. This study aims to identify candidate genes responsible for pollinator attraction and reproductive isolation between three closely related species, O. exaltata, O. sphegodes and O. garganica. Floral traits such as odour, colour and morphology are necessary for successful pollinator attraction. In particular, different odour hydrocarbon profiles have been linked to differences in specific pollinator attraction among these species. Therefore, the identification of genes involved in these traits is important for understanding the molecular basis of pollinator attraction by sexually deceptive orchids. Results We have created floral reference transcriptomes and proteomes for these three Ophrys species using a combination of next-generation sequencing (454 and Solexa), Sanger sequencing, and shotgun proteomics (tandem mass spectrometry). In total, 121 917 unique transcripts and 3531 proteins were identified. This represents the first orchid proteome and transcriptome from the orchid subfamily Orchidoideae. Proteome data revealed proteins corresponding to 2644 transcripts and 887 proteins not observed in the transcriptome. Candidate genes for hydrocarbon and anthocyanin biosynthesis were represented by 156 and 61 unique transcripts in 20 and 7 genes classes, respectively. Moreover, transcription factors putatively involved in the regulation of flower odour, colour and morphology were annotated, including Myb, MADS and TCP factors. Conclusion Our comprehensive data set generated by combining transcriptome and proteome technologies allowed identification of candidate genes for pollinator attraction and reproductive isolation among sexually deceptive orchids. This includes genes for hydrocarbon and anthocyanin biosynthesis and regulation, and the development of floral morphology. These data will serve as an invaluable resource for research in orchid floral biology, enabling studies into the molecular mechanisms of pollinator attraction and speciation. PMID:23734209
A genome-wide association study of corneal astigmatism: The CREAM Consortium

PubMed Central

Shah, Rupal L.; Li, Qing; Zhao, Wanting; Tedja, Milly S.; Tideman, J. Willem L.; Khawaja, Anthony P.; Fan, Qiao; Yazar, Seyhan; Williams, Katie M.; Verhoeven, Virginie J.M.; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J.; Pärssinen, Olavi; Wedenoja, Juho; Biino, Ginevra; Concas, Maria Pina; Uitterlinden, André; Rivadeneira, Fernando; Jaddoe, Vincent W.V.; Hysi, Pirro G.; Sim, Xueling; Tan, Nicholas; Tham, Yih-Chung; Sensaki, Sonoko; Hofman, Albert; Vingerling, Johannes R.; Jonas, Jost B.; Mitchell, Paul; Hammond, Christopher J.; Höhn, René; Baird, Paul N.; Wong, Tien-Yin; Cheng, Chinfsg-Yu; Teo, Yik Ying; Mackey, David A.; Williams, Cathy; Saw, Seang-Mei; Klaver, Caroline C.W.; Bailey-Wilson, Joan E.

2018-01-01

Purpose To identify genes and genetic markers associated with corneal astigmatism. Methods A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts were performed using VEGAS2 and MAGMA software. Additionally, estimates of single nucleotide polymorphism (SNP)-based heritability for corneal and refractive astigmatism and the spherical equivalent were calculated for Europeans using LD score regression. Results The meta-analysis of all cohorts identified a genome-wide significant locus near the platelet-derived growth factor receptor alpha (PDGFRA) gene: top SNP: rs7673984, odds ratio=1.12 (95% CI:1.08–1.16), p=5.55×10−9. No other genome-wide significant loci were identified in the combined analysis or European/Asian ancestry-specific analyses. Gene-based analysis identified three novel candidate genes for corneal astigmatism in Europeans—claudin-7 (CLDN7), acid phosphatase 2, lysosomal (ACP2), and TNF alpha-induced protein 8 like 3 (TNFAIP8L3). Conclusions In addition to replicating a previously identified genome-wide significant locus for corneal astigmatism near the PDGFRA gene, gene-based analysis identified three novel candidate genes, CLDN7, ACP2, and TNFAIP8L3, that warrant further investigation to understand their role in the pathogenesis of corneal astigmatism. The much lower number of genetic variants and genes demonstrating an association with corneal astigmatism compared to published spherical equivalent GWAS analyses suggest a greater influence of rare genetic variants, non-additive genetic effects, or environmental factors in the development of astigmatism. PMID:29422769

Combining Human Epigenetics and Sleep Studies in Caenorhabditis elegans: A Cross-Species Approach for Finding Conserved Genes Regulating Sleep.

PubMed

Huang, Huiyan; Zhu, Yong; Eliot, Melissa N; Knopik, Valerie S; McGeary, John E; Carskadon, Mary A; Hart, Anne C

2017-06-01

We aimed to test a combined approach to identify conserved genes regulating sleep and to explore the association between DNA methylation and sleep length. We identified candidate genes associated with shorter versus longer sleep duration in college students based on DNA methylation using Illumina Infinium HumanMethylation450 BeadChip arrays. Orthologous genes in Caenorhabditis elegans were identified, and we examined whether their loss of function affected C. elegans sleep. For genes whose perturbation affected C. elegans sleep, we subsequently undertook a small pilot study to re-examine DNA methylation in an independent set of human participants with shorter versus longer sleep durations. Eighty-seven out of 485,577 CpG sites had significant differential methylation in young adults with shorter versus longer sleep duration, corresponding to 52 candidate genes. We identified 34 C. elegans orthologs, including NPY/flp-18 and flp-21, which are known to affect sleep. Loss of five additional genes alters developmentally timed C. elegans sleep (B4GALT6/bre-4, DOCK180/ced-5, GNB2L1/rack-1, PTPRN2/ida-1, ZFYVE28/lst-2). For one of these genes, ZFYVE28 (also known as hLst2), the pilot replication study again found decreased DNA methylation associated with shorter sleep duration at the same two CpG sites in the first intron of ZFYVE28. Using an approach that combines human epigenetics and C. elegans sleep studies, we identified five genes that play previously unidentified roles in C. elegans sleep. We suggest sleep duration in humans may be associated with differential DNA methylation at specific sites and that the conserved genes identified here likely play roles in C. elegans sleep and in other species. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.
Transcriptome Sequencing of Codonopsis pilosula and Identification of Candidate Genes Involved in Polysaccharide Biosynthesis

PubMed Central

Gao, Jian Ping; Wang, Dong; Cao, Ling Ya; Sun, Hai Feng

2015-01-01

Background Codonopsis pilosula (Franch.) Nannf. is one of the most widely used medicinal plants. Although chemical and pharmacological studies have shown that codonopsis polysaccharides (CPPs) are bioactive compounds and that their composition is variable, their biosynthetic pathways remain largely unknown. Next-generation sequencing is an efficient and high-throughput technique that allows the identification of candidate genes involved in secondary metabolism. Principal Findings To identify the components involved in CPP biosynthesis, a transcriptome library, prepared using root and other tissues, was assembled with the help of Illumina sequencing. A total of 9.2 Gb of clean nucleotides was obtained comprising 91,175,044 clean reads, 102,125 contigs, and 45,511 unigenes. After aligning the sequences to the public protein databases, 76.1% of the unigenes were annotated. Among these annotated unigenes, 26,189 were assigned to Gene Ontology categories, 11,415 to Clusters of Orthologous Groups, and 18,848 to Kyoto Encyclopedia of Genes and Genomes pathways. Analysis of abundance of transcripts in the library showed that genes, including those encoding metallothionein, aquaporin, and cysteine protease that are related to stress responses, were in the top list. Among genes involved in the biosynthesis of CPP, those responsible for the synthesis of UDP-L-arabinose and UDP-xylose were highly expressed. Significance To our knowledge, this is the first study to provide a public transcriptome dataset prepared from C. pilosula and an outline of the biosynthetic pathway of polysaccharides in a medicinal plant. Identified candidate genes involved in CPP biosynthesis provide understanding of the biosynthesis and regulation of CPP at the molecular level. PMID:25719364
Molecular cloning and characterization of a cytochrome P450 taxoid 9á-hydroxylase in Ginkgo biloba cells.

PubMed

Zhang, Nan; Han, Zhentai; Sun, Guiling; Hoffman, Angela; Wilson, Iain W; Yang, Yanfang; Gao, Qian; Wu, Jianqiang; Xie, Dan; Dai, Jungui; Qiu, Deyou

2014-01-17

Taxol is a well-known effective anticancer compound. Due to the inability to synthesize sufficient quantities of taxol to satisfy commercial demand, a biotechnological approach for a large-scale cell or cell-free system for its production is highly desirable. Several important genes in taxol biosynthesis are currently still unknown and have been shown to be difficult to isolate directly from Taxus, including the gene encoding taxoid 9α-hydroxylase. Ginkgo biloba suspension cells exhibit taxoid hydroxylation activity and provides an alternate means of identifying genes encoding enzymes with taxoid 9α-hydroxylation activity. Through analysis of high throughput RNA sequencing data from G. biloba, we identified two candidate genes with high similarity to Taxus CYP450s. Using in vitro cell-free protein synthesis assays and LC-MS analysis, we show that one candidate that belongs to the CYP716B, a subfamily whose biochemical functions have not been previously studied, possessed 9α-hydroxylation activity. This work will aid future identification of the taxoid 9α-hydroxylase gene from Taxus sp. Copyright © 2013 Elsevier Inc. All rights reserved.
Candidate Gene Identification with SNP Marker-Based Fine Mapping of Anthracnose Resistance Gene Co-4 in Common Bean.

PubMed

Burt, Andrew J; William, H Manilal; Perry, Gregory; Khanal, Raja; Pauls, K Peter; Kelly, James D; Navabi, Alireza

2015-01-01

Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris). Alleles at the Co-4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08) where Co-4 is localized. Three SCAR markers with known linkage to Co-4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK-4 loci found in previous studies. It is possible that the Co-4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases.
Candidate Gene Identification with SNP Marker-Based Fine Mapping of Anthracnose Resistance Gene Co-4 in Common Bean

PubMed Central

Burt, Andrew J.; William, H. Manilal; Perry, Gregory; Khanal, Raja; Pauls, K. Peter; Kelly, James D.; Navabi, Alireza

2015-01-01

Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris). Alleles at the Co–4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08) where Co–4 is localized. Three SCAR markers with known linkage to Co–4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK–4 loci found in previous studies. It is possible that the Co–4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases. PMID:26431031
The Role of a Novel TRMT1 Gene Mutation and Rare GRM1 Gene Defect in Intellectual Disability in Two Azeri Families

PubMed Central

Kahrizi, Kimia; Musante, Luciana; Fattahi, Zohreh; Hosseini, Masoumeh; Maqsoud, Fariba; Farajollahi, Reza; Wienker, Thomas F.; Ropers, H. Hilger; Najmabadi, Hossein

2015-01-01

Cognitive impairment or intellectual disability (ID) is a widespread neurodevelopmental disorder characterized by low IQ (below 70). ID is genetically heterogeneous and is estimated to affect 1–3% of the world’s population. In affected children from consanguineous families, autosomal recessive inheritance is common, and identifying the underlying genetic cause is an important issue in clinical genetics. In the framework of a larger project, aimed at identifying candidate genes for autosomal recessive intellectual disorder (ARID), we recently carried out single nucleotide polymorphism-based genome-wide linkage analysis in several families from Ardabil province in Iran. The identification of homozygosity-by-descent loci in these families, in combination with whole exome sequencing, led us to identify possible causative homozygous changes in two families. In the first family, a missense variant was found in GRM1 gene, while in the second family, a frameshift alteration was identified in TRMT1, both of which were found to co-segregate with the disease. GRM1, a known causal gene for autosomal recessive spinocerebellar ataxia (SCAR13, MIM#614831), encodes the metabotropic glutamate receptor1 (mGluR1). This gene plays an important role in synaptic plasticity and cerebellar development. Conversely, the TRMT1 gene encodes a tRNA methyltransferase that dimethylates a single guanine residue at position 26 of most tRNAs using S-adenosyl methionine as the methyl group donor. We recently presented TRMT1 as a candidate gene for ARID in a consanguineous Iranian family (Najmabadi et al., 2011). We believe that this second Iranian family with a biallelic loss-of-function mutation in TRMT1 gene supports the idea that this gene likely has function in development of the disorder. PMID:26308914
The Role of a Novel TRMT1 Gene Mutation and Rare GRM1 Gene Defect in Intellectual Disability in Two Azeri Families.

PubMed

Davarniya, Behzad; Hu, Hao; Kahrizi, Kimia; Musante, Luciana; Fattahi, Zohreh; Hosseini, Masoumeh; Maqsoud, Fariba; Farajollahi, Reza; Wienker, Thomas F; Ropers, H Hilger; Najmabadi, Hossein

2015-01-01

Cognitive impairment or intellectual disability (ID) is a widespread neurodevelopmental disorder characterized by low IQ (below 70). ID is genetically heterogeneous and is estimated to affect 1-3% of the world's population. In affected children from consanguineous families, autosomal recessive inheritance is common, and identifying the underlying genetic cause is an important issue in clinical genetics. In the framework of a larger project, aimed at identifying candidate genes for autosomal recessive intellectual disorder (ARID), we recently carried out single nucleotide polymorphism-based genome-wide linkage analysis in several families from Ardabil province in Iran. The identification of homozygosity-by-descent loci in these families, in combination with whole exome sequencing, led us to identify possible causative homozygous changes in two families. In the first family, a missense variant was found in GRM1 gene, while in the second family, a frameshift alteration was identified in TRMT1, both of which were found to co-segregate with the disease. GRM1, a known causal gene for autosomal recessive spinocerebellar ataxia (SCAR13, MIM#614831), encodes the metabotropic glutamate receptor1 (mGluR1). This gene plays an important role in synaptic plasticity and cerebellar development. Conversely, the TRMT1 gene encodes a tRNA methyltransferase that dimethylates a single guanine residue at position 26 of most tRNAs using S-adenosyl methionine as the methyl group donor. We recently presented TRMT1 as a candidate gene for ARID in a consanguineous Iranian family (Najmabadi et al., 2011). We believe that this second Iranian family with a biallelic loss-of-function mutation in TRMT1 gene supports the idea that this gene likely has function in development of the disorder.
Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

PubMed Central

2013-01-01

Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs) and Support Vector Machines (SVMs) were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression. PMID:23369200
Genomic analysis of human lung fibroblasts exposed to vanadium pentoxide to identify candidate genes for occupational bronchitis

PubMed Central

Ingram, Jennifer L; Antao-Menezes, Aurita; Turpin, Elizabeth A; Wallace, Duncan G; Mangum, James B; Pluta, Linda J; Thomas, Russell S; Bonner, James C

2007-01-01

Background Exposure to vanadium pentoxide (V2O5) is a cause of occupational bronchitis. We evaluated gene expression profiles in cultured human lung fibroblasts exposed to V2O5 in vitro in order to identify candidate genes that could play a role in inflammation, fibrosis, and repair during the pathogenesis of V2O5-induced bronchitis. Methods Normal human lung fibroblasts were exposed to V2O5 in a time course experiment. Gene expression was measured at various time points over a 24 hr period using the Affymetrix Human Genome U133A 2.0 Array. Selected genes that were significantly changed in the microarray experiment were validated by RT-PCR. Results V2O5 altered more than 1,400 genes, of which ~300 were induced while >1,100 genes were suppressed. Gene ontology categories (GO) categories unique to induced genes included inflammatory response and immune response, while GO catogories unique to suppressed genes included ubiquitin cycle and cell cycle. A dozen genes were validated by RT-PCR, including growth factors (HBEGF, VEGF, CTGF), chemokines (IL8, CXCL9, CXCL10), oxidative stress response genes (SOD2, PIPOX, OXR1), and DNA-binding proteins (GAS1, STAT1). Conclusion Our study identified a variety of genes that could play pivotal roles in inflammation, fibrosis and repair during V2O5-induced bronchitis. The induction of genes that mediate inflammation and immune responses, as well as suppression of genes involved in growth arrest appear to be important to the lung fibrotic reaction to V2O5. PMID:17459161
Linkage study of nonsyndromic cleft lip with or without cleft palate using candidate genes and mapped polymorphic markers

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stein, J.D.; Nelson, L.D.; Conner, B.J.

1994-09-01

Nonsyndromic cleft lip with or without cleft palate (CL(P)) involves fusion or growth failure of facial primordia during development. Complex segregation analysis of clefting populations suggest that an autosomal dominant gene may play a role in this common craniofacial disorder. We have ascertained 16 multigenerational families with CL(P) and tested linkage to 29 candidate genes and 139 mapped short tandem repeat markers. The candidate genes were selected based on their expression in craniofacial development or were identified through murine models. These include: TGF{alpha}, TGF{beta}1, TGF{beta}2, TGF{beta}3, EGF, EGFR, GRAS, cMyc, FGFR, Jun, JunB, PDFG{alpha}, PDGF{beta}, IGF2R, GCR Hox7, Hox8, Hox2B,more » twirler, 5 collagen and 3 extracellular matrix genes. Linkage was tested assuming an autosomal dominant model with sex-specific decreased penetrance. Linkage to all of the candidate loci was excluded in 11 families. RARA was tested and was not informative. However, haplotype analysis of markers flanking RARA on 17q allowed exclusion of this candidate locus. We have previously excluded linkage to 61 STR markers in 11 families. Seventy-eight mapped short tandem repeat markers have recently been tested in 16 families and 30 have been excluded. The remaining are being analyzed and an exclusion map is being developed based on the entire study results.« less
Diet and Colorectal Cancer: Analysis of a Candidate Pathway Using SNPS, Haplotypes, and Multi-Gene Assessment

PubMed Central

Slattery, Martha L.; Lundgreen, Abbie; Herrick, Jennifer S.; Caan, Bette J.; Potter, John D.; Wolff, Roger K.

2012-01-01

There is considerable biologic plausibility to the hypothesis that genetic variability in pathways involved in insulin signaling and energy homeostasis may modulate dietary risk associated with colorectal cancer. We utilized data from 2 population-based case-control studies of colon (n = 1,574 cases, 1,970 controls) and rectal (n = 791 cases, 999 controls) cancer to evaluate genetic variation in candidate SNPs identified from 9 genes in a candidate pathway: PDK1, RP6KA1, RPS6KA2, RPS6KB1, RPS6KB2, PTEN, FRAP1 (mTOR), TSC1, TSC2, Akt1, PIK3CA, and PRKAG2 with dietary intake of total energy, carbohydrates, fat, and fiber. We employed SNP, haplotype, and multiple-gene analysis to evaluate associations. PDK1 interacted with dietary fat for both colon and rectal cancer and with dietary carbohydrates for colon cancer. Statistically significant interaction with dietary carbohydrates and rectal cancer was detected by haplotype analysis of PDK1. Evaluation of dietary interactions with multiple genes in this candidate pathway showed several interactions with pairs of genes: Akt1 and PDK1, PDK1 and PTEN, PDK1 and TSC1, and PRKAG2 and PTEN. Analyses show that genetic variation influences risk of colorectal cancer associated with diet and illustrate the importance of evaluating dietary interactions beyond the level of single SNPs or haplotypes when a biologically relevant candidate pathway is examined. PMID:21999454
A comprehensive meta-analysis of plant morphology, yield, stay-green, and virus disease resistance QTL in maize (Zea mays L.).

PubMed

Wang, Yijun; Xu, Jing; Deng, Dexiang; Ding, Haidong; Bian, Yunlong; Yin, Zhitong; Wu, Yarong; Zhou, Bo; Zhao, Ye

2016-02-01

The meta-QTL and candidate genes will facilitate the elucidation of molecular bases underlying agriculturally important traits and open new avenues for functional markers development and elite alleles introgression in maize breeding program. A large number of QTLs attributed to grain productivity and other agriculturally important traits have been identified and deposited in public repositories. The integration of fruitful QTL becomes a major issue in current plant genomics. To this end, we first collected QTL for six agriculturally important traits in maize, including yield, plant height, ear height, leaf angle, stay-green, and maize rough dwarf disease resistance. The meta-analysis method was then employed to retrieve 113 meta-QTL. Additionally, we also isolated candidate genes for target traits by the bioinformatic technique. Several candidates, including some well-characterized genes, GA3ox2 for plant height, lg1 and lg4 for leaf angle, zfl1 and zfl2 for flowering time, were co-localized with established meta-QTL intervals. Intriguingly, in a relatively narrow meta-QTL region, the maize ortholog of rice yield-related gene GW8/OsSPL16 was believed to be a candidate for yield. Leveraging results presented in this study will provide further insights into the genetic architecture of maize agronomic traits. Moreover, the meta-QTL and candidate genes reported here could be harnessed for the enhancement of stress tolerance and yield performance in maize and translation to other crops.
Genome-wide screening of indicator genes for assessing the potential carcinogenic risk of Nanjing city drinking water.

PubMed

Zhang, Rui; Cheng, Shupei; Li, Aimin; Sun, Jie; Zhang, Yan; Zhang, Xuxiang

2011-07-01

Effects of all pollutants existing in the Nanjing city drinking water (DWNC) on mouse gene transcription levels were measured to assess the DWNC carcinogenic risks and to identify candidate indicator genes for assessing and early warning the cancer risks. Transcriptional expression levels of 14,000 hepatic genes for the treatment group mice (Mus musculus, ICR) fed with DWNC for 90 days were detected using the GeneChip(®) Mouse Genome 430A 2.0 array. The analysis indicated that the transcriptional levels of 294 genes were up-regulated and 542 ones were down-regulated. Of these genes, 12 ones identified to be involved in at least five different types of cancers were further analyzed. An interrogation by Kyoto Encyclopedia of Genes and Genomes (KEGG) revealed that three (including ITGAV, CCND1 and SMAD2) of the 12 genes were mapped to pathway in cancer. Gene Ontology (GO) function annotation also showed that they were associated with the functional categories of cell cycle regulation, adhesion, apoptosis, signal transduction and so on which are closely implicated in tumorigenesis and progression. The correlations between the aberrant expressions of them and the genesis and progression of cancers have been further documented by a number of scientific researches. These results might demonstrate that the potential toxicity and carcinogenic risks were associated with DWNC. Moreover, ITGAV, CCND1 and SMAD2 were identified as the most likely candidate indicator genes for the assessment of the combined carcinogenic risk of all pollutants existing in DWNC.
Mapping and Genetic Structure Analysis of the Anthracnose Resistance Locus Co-1HY in the Common Bean (Phaseolus vulgaris L.).

PubMed

Chen, Mingli; Wu, Jing; Wang, Lanfen; Mantri, Nitin; Zhang, Xiaoyan; Zhu, Zhendong; Wang, Shumin

2017-01-01

Anthracnose is a destructive disease of the common bean (Phaseolus vulgaris L.). The Andean cultivar Hongyundou has been demonstrated to possess strong resistance to anthracnose race 81. To study the genetics of this resistance, the Hongyundou cultivar was crossed with a susceptible genotype Jingdou. Segregation of resistance for race 81 was assessed in the F2 population and F2:3 lines under controlled conditions. Results indicate that Hongyundou carries a single dominant gene for anthracnose resistance. An allele test by crossing Hongyundou with another resistant cultivar revealed that the resistance gene is in the Co-1 locus (therefore named Co-1HY). The physical distance between this locus and the two flanking markers was 46 kb, and this region included four candidate genes, namely, Phvul.001G243500, Phvul.001G243600, Phvul.001G243700 and Phvul.001G243800. These candidate genes encoded serine/threonine-protein kinases. Expression analysis of the four candidate genes in the resistant and susceptible cultivars under control condition and inoculated treatment revealed that all the four candidate genes are expressed at significantly higher levels in the resistant genotype than in susceptible genotype. Phvul.001G243600 and Phvul.001G243700 are expressed nearly 15-fold and 90-fold higher in the resistant genotype than in the susceptible parent before inoculation, respectively. Four candidate genes will provide useful information for further research into the resistance mechanism of anthracnose in common bean. The closely linked flanking markers identified here may be useful for transferring the resistance allele Co-1HY from Hongyundou to elite anthracnose susceptible common bean lines.
Mapping and Genetic Structure Analysis of the Anthracnose Resistance Locus Co-1HY in the Common Bean (Phaseolus vulgaris L.)

PubMed Central

Wang, Lanfen; Mantri, Nitin; Zhang, Xiaoyan; Zhu, Zhendong; Wang, Shumin

2017-01-01

Anthracnose is a destructive disease of the common bean (Phaseolus vulgaris L.). The Andean cultivar Hongyundou has been demonstrated to possess strong resistance to anthracnose race 81. To study the genetics of this resistance, the Hongyundou cultivar was crossed with a susceptible genotype Jingdou. Segregation of resistance for race 81 was assessed in the F2 population and F2:3 lines under controlled conditions. Results indicate that Hongyundou carries a single dominant gene for anthracnose resistance. An allele test by crossing Hongyundou with another resistant cultivar revealed that the resistance gene is in the Co-1 locus (therefore named Co-1HY). The physical distance between this locus and the two flanking markers was 46 kb, and this region included four candidate genes, namely, Phvul.001G243500, Phvul.001G243600, Phvul.001G243700 and Phvul.001G243800. These candidate genes encoded serine/threonine-protein kinases. Expression analysis of the four candidate genes in the resistant and susceptible cultivars under control condition and inoculated treatment revealed that all the four candidate genes are expressed at significantly higher levels in the resistant genotype than in susceptible genotype. Phvul.001G243600 and Phvul.001G243700 are expressed nearly 15-fold and 90-fold higher in the resistant genotype than in the susceptible parent before inoculation, respectively. Four candidate genes will provide useful information for further research into the resistance mechanism of anthracnose in common bean. The closely linked flanking markers identified here may be useful for transferring the resistance allele Co-1HY from Hongyundou to elite anthracnose susceptible common bean lines. PMID:28076395
Molecular insight into the association between cartilage regeneration and ear wound healing in genetic mouse models: targeting new genes in regeneration.

PubMed

Rai, Muhammad Farooq; Schmidt, Eric J; McAlinden, Audrey; Cheverud, James M; Sandell, Linda J

2013-11-06

Tissue regeneration is a complex trait with few genetic models available. Mouse strains LG/J and MRL are exceptional healers. Using recombinant inbred strains from a large (LG/J, healer) and small (SM/J, nonhealer) intercross, we have previously shown a positive genetic correlation between ear wound healing, knee cartilage regeneration, and protection from osteoarthritis. We hypothesize that a common set of genes operates in tissue healing and articular cartilage regeneration. Taking advantage of archived histological sections from recombinant inbred strains, we analyzed expression of candidate genes through branched-chain DNA technology directly from tissue lysates. We determined broad-sense heritability of candidates, Pearson correlation of candidates with healing phenotypes, and Ward minimum variance cluster analysis for strains. A bioinformatic assessment of allelic polymorphisms within and near candidate genes was also performed. The expression of several candidates was significantly heritable among strains. Although several genes correlated with both ear wound healing and cartilage healing at a marginal level, the expression of four genes representing DNA repair (Xrcc2, Pcna) and Wnt signaling (Axin2, Wnt16) pathways was significantly positively correlated with both phenotypes. Cluster analysis accurately classified healers and nonhealers for seven out of eight strains based on gene expression. Specific sequence differences between LG/J and SM/J were identified as potential causal polymorphisms. Our study suggests a common genetic basis between tissue healing and osteoarthritis susceptibility. Mapping genetic variations causing differences in diverse healing responses in multiple tissues may reveal generic healing processes in pursuit of new therapeutic targets designed to induce or enhance regeneration and, potentially, protection from osteoarthritis.
Replication and validation of genetic polymorphisms associated with survival after allogeneic blood or marrow transplant

PubMed Central

Karaesmen, Ezgi; Rizvi, Abbas A.; Preus, Leah M.; McCarthy, Philip L.; Pasquini, Marcelo C.; Onel, Kenan; Zhu, Xiaochun; Spellman, Stephen; Haiman, Christopher A.; Stram, Daniel O.; Pooler, Loreall; Sheng, Xin; Zhu, Qianqian; Yan, Li; Liu, Qian; Hu, Qiang; Webb, Amy; Brock, Guy; Clay-Gilmour, Alyssa I.; Battaglia, Sebastiano; Tritchler, David; Liu, Song; Hahn, Theresa

2017-01-01

Multiple candidate gene-association studies of non-HLA single-nucleotide polymorphisms (SNPs) and outcomes after blood or marrow transplant (BMT) have been conducted. We identified 70 publications reporting 45 SNPs in 36 genes significantly associated with disease-related mortality, progression-free survival, transplant-related mortality, and/or overall survival after BMT. Replication and validation of these SNP associations were performed using DISCOVeRY-BMT (Determining the Influence of Susceptibility COnveying Variants Related to one-Year mortality after BMT), a well-powered genome-wide association study consisting of 2 cohorts, totaling 2888 BMT recipients with acute myeloid leukemia, acute lymphoblastic leukemia, or myelodysplastic syndrome, and their HLA-matched unrelated donors, reported to the Center for International Blood and Marrow Transplant Research. Gene-based tests were used to assess the aggregate effect of SNPs on outcome. None of the previously reported significant SNPs replicated at P < .05 in DISCOVeRY-BMT. Validation analyses showed association with one previously reported donor SNP at P < .05 and survival; more associations would be anticipated by chance alone. No gene-based tests were significant at P < .05. Functional annotation with publicly available data shows these candidate SNPs most likely do not have biochemical function; only 13% of candidate SNPs correlate with gene expression or are predicted to impact transcription factor binding. Of these, half do not impact the candidate gene of interest; the other half correlate with expression of multiple genes. These findings emphasize the peril of pursing candidate approaches and the importance of adequately powered tests of unbiased genome-wide associations with BMT clinical outcomes given the ultimate goal of improving patient outcomes. PMID:28811306
A Gene-Oriented Haplotype Comparison Reveals Recently Selected Genomic Regions in Temperate and Tropical Maize Germplasm

PubMed Central

Zhang, Jie; Li, Yongxiang; Zheng, Jun; Zhang, Hongwei; Yang, Xiaohong; Wang, Jianhua; Wang, Guoying

2017-01-01

The extensive genetic variation present in maize (Zea mays) germplasm makes it possible to detect signatures of positive artificial selection that occurred during temperate and tropical maize improvement. Here we report an analysis of 532,815 polymorphisms from a maize association panel consisting of 368 diverse temperate and tropical inbred lines. We developed a gene-oriented approach adapting exonic polymorphisms to identify recently selected alleles by comparing haplotypes across the maize genome. This analysis revealed evidence of selection for more than 1100 genomic regions during recent improvement, and included regulatory genes and key genes with visible mutant phenotypes. We find that selected candidate target genes in temperate maize are enriched in biosynthetic processes, and further examination of these candidates highlights two cases, sucrose flux and oil storage, in which multiple genes in a common pathway can be cooperatively selected. Finally, based on available parallel gene expression data, we hypothesize that some genes were selected for regulatory variations, resulting in altered gene expression. PMID:28099470
QTL-seq approach identified genomic regions and diagnostic markers for rust and late leaf spot resistance in groundnut (Arachis hypogaea L.).

PubMed

Pandey, Manish K; Khan, Aamir W; Singh, Vikas K; Vishwakarma, Manish K; Shasidhar, Yaduru; Kumar, Vinay; Garg, Vanika; Bhat, Ramesh S; Chitikineni, Annapurna; Janila, Pasupuleti; Guo, Baozhu; Varshney, Rajeev K

2017-08-01

Rust and late leaf spot (LLS) are the two major foliar fungal diseases in groundnut, and their co-occurrence leads to significant yield loss in addition to the deterioration of fodder quality. To identify candidate genomic regions controlling resistance to rust and LLS, whole-genome resequencing (WGRS)-based approach referred as 'QTL-seq' was deployed. A total of 231.67 Gb raw and 192.10 Gb of clean sequence data were generated through WGRS of resistant parent and the resistant and susceptible bulks for rust and LLS. Sequence analysis of bulks for rust and LLS with reference-guided resistant parent assembly identified 3136 single-nucleotide polymorphisms (SNPs) for rust and 66 SNPs for LLS with the read depth of ≥7 in the identified genomic region on pseudomolecule A03. Detailed analysis identified 30 nonsynonymous SNPs affecting 25 candidate genes for rust resistance, while 14 intronic and three synonymous SNPs affecting nine candidate genes for LLS resistance. Subsequently, allele-specific diagnostic markers were identified for three SNPs for rust resistance and one SNP for LLS resistance. Genotyping of one RIL population (TAG 24 × GPBD 4) with these four diagnostic markers revealed higher phenotypic variation for these two diseases. These results suggest usefulness of QTL-seq approach in precise and rapid identification of candidate genomic regions and development of diagnostic markers for breeding applications. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Identification of candidate genes for drought tolerance in coffee by high-throughput sequencing in the shoot apex of different Coffea arabica cultivars.

PubMed

Mofatto, Luciana Souto; Carneiro, Fernanda de Araújo; Vieira, Natalia Gomes; Duarte, Karoline Estefani; Vidal, Ramon Oliveira; Alekcevetch, Jean Carlos; Cotta, Michelle Guitton; Verdeil, Jean-Luc; Lapeyre-Montes, Fabienne; Lartaud, Marc; Leroy, Thierry; De Bellis, Fabien; Pot, David; Rodrigues, Gustavo Costa; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães; Andrade, Alan Carvalho; Marraccini, Pierre

2016-04-19

Drought is a widespread limiting factor in coffee plants. It affects plant development, fruit production, bean development and consequently beverage quality. Genetic diversity for drought tolerance exists within the coffee genus. However, the molecular mechanisms underlying the adaptation of coffee plants to drought are largely unknown. In this study, we compared the molecular responses to drought in two commercial cultivars (IAPAR59, drought-tolerant and Rubi, drought-susceptible) of Coffea arabica grown in the field under control (irrigation) and drought conditions using the pyrosequencing of RNA extracted from shoot apices and analysing the expression of 38 candidate genes. Pyrosequencing from shoot apices generated a total of 34.7 Mbp and 535,544 reads enabling the identification of 43,087 clusters (41,512 contigs and 1,575 singletons). These data included 17,719 clusters (16,238 contigs and 1,575 singletons) exclusively from 454 sequencing reads, along with 25,368 hybrid clusters assembled with 454 sequences. The comparison of DNA libraries identified new candidate genes (n = 20) presenting differential expression between IAPAR59 and Rubi and/or drought conditions. Their expression was monitored in plagiotropic buds, together with those of other (n = 18) candidates genes. Under drought conditions, up-regulated expression was observed in IAPAR59 but not in Rubi for CaSTK1 (protein kinase), CaSAMT1 (SAM-dependent methyltransferase), CaSLP1 (plant development) and CaMAS1 (ABA biosynthesis). Interestingly, the expression of lipid-transfer protein (nsLTP) genes was also highly up-regulated under drought conditions in IAPAR59. This may have been related to the thicker cuticle observed on the abaxial leaf surface in IAPAR59 compared to Rubi. The full transcriptome assembly of C. arabica, followed by functional annotation, enabled us to identify differentially expressed genes related to drought conditions. Using these data, candidate genes were selected and their differential expression profiles were confirmed by qPCR experiments in plagiotropic buds of IAPAR59 and Rubi under drought conditions. As regards the genes up-regulated under drought conditions, specifically in the drought-tolerant IAPAR59, several corresponded to orphan genes but also to genes coding proteins involved in signal transduction pathways, as well as ABA and lipid metabolism, for example. The identification of these genes should help advance our understanding of the genetic determinism of drought tolerance in coffee.

An integrated approach of comparative genomics and heritability analysis of pig and human on obesity trait: evidence for candidate genes on human chromosome 2.

PubMed

Kim, Jaemin; Lee, Taeheon; Kim, Tae-Hun; Lee, Kyung-Tai; Kim, Heebal

2012-12-19

Traditional candidate gene approach has been widely used for the study of complex diseases including obesity. However, this approach is largely limited by its dependence on existing knowledge of presumed biology of the phenotype under investigation. Our combined strategy of comparative genomics and chromosomal heritability estimate analysis of obesity traits, subscapular skinfold thickness and back-fat thickness in Korean cohorts and pig (Sus scrofa), may overcome the limitations of candidate gene analysis and allow us to better understand genetic predisposition to human obesity. We found common genes including FTO, the fat mass and obesity associated gene, identified from significant SNPs by association studies of each trait. These common genes were related to blood pressure and arterial stiffness (P = 1.65E-05) and type 2 diabetes (P = 0.00578). Through the estimation of variance of genetic component (heritability) for each chromosome by SNPs, we observed a significant positive correlation (r = 0.479) between genetic contributions of human and pig to obesity traits. Furthermore, we noted that human chromosome 2 (syntenic to pig chromosomes 3 and 15) was most important in explaining the phenotypic variance for obesity. Obesity genetics still awaits further discovery. Navigating syntenic regions suggests obesity candidate genes on chromosome 2 that are previously known to be associated with obesity-related diseases: MRPL33, PARD3B, ERBB4, STK39, and ZNF385B.
Large-scale functional RNAi screen in C. elegans identifies genes that regulate the dysfunction of mutant polyglutamine neurons

PubMed Central

2012-01-01

Background A central goal in Huntington's disease (HD) research is to identify and prioritize candidate targets for neuroprotective intervention, which requires genome-scale information on the modifiers of early-stage neuron injury in HD. Results Here, we performed a large-scale RNA interference screen in C. elegans strains that express N-terminal huntingtin (htt) in touch receptor neurons. These neurons control the response to light touch. Their function is strongly impaired by expanded polyglutamines (128Q) as shown by the nearly complete loss of touch response in adult animals, providing an in vivo model in which to manipulate the early phases of expanded-polyQ neurotoxicity. In total, 6034 genes were examined, revealing 662 gene inactivations that either reduce or aggravate defective touch response in 128Q animals. Several genes were previously implicated in HD or neurodegenerative disease, suggesting that this screen has effectively identified candidate targets for HD. Network-based analysis emphasized a subset of high-confidence modifier genes in pathways of interest in HD including metabolic, neurodevelopmental and pro-survival pathways. Finally, 49 modifiers of 128Q-neuron dysfunction that are dysregulated in the striatum of either R/2 or CHL2 HD mice, or both, were identified. Conclusions Collectively, these results highlight the relevance to HD pathogenesis, providing novel information on the potential therapeutic targets for neuroprotection in HD. PMID:22413862
Large-scale functional RNAi screen in C. elegans identifies genes that regulate the dysfunction of mutant polyglutamine neurons.

PubMed

Lejeune, François-Xavier; Mesrob, Lilia; Parmentier, Frédéric; Bicep, Cedric; Vazquez-Manrique, Rafael P; Parker, J Alex; Vert, Jean-Philippe; Tourette, Cendrine; Neri, Christian

2012-03-13

A central goal in Huntington's disease (HD) research is to identify and prioritize candidate targets for neuroprotective intervention, which requires genome-scale information on the modifiers of early-stage neuron injury in HD. Here, we performed a large-scale RNA interference screen in C. elegans strains that express N-terminal huntingtin (htt) in touch receptor neurons. These neurons control the response to light touch. Their function is strongly impaired by expanded polyglutamines (128Q) as shown by the nearly complete loss of touch response in adult animals, providing an in vivo model in which to manipulate the early phases of expanded-polyQ neurotoxicity. In total, 6034 genes were examined, revealing 662 gene inactivations that either reduce or aggravate defective touch response in 128Q animals. Several genes were previously implicated in HD or neurodegenerative disease, suggesting that this screen has effectively identified candidate targets for HD. Network-based analysis emphasized a subset of high-confidence modifier genes in pathways of interest in HD including metabolic, neurodevelopmental and pro-survival pathways. Finally, 49 modifiers of 128Q-neuron dysfunction that are dysregulated in the striatum of either R/2 or CHL2 HD mice, or both, were identified. Collectively, these results highlight the relevance to HD pathogenesis, providing novel information on the potential therapeutic targets for neuroprotection in HD. © 2012 Lejeune et al; licensee BioMed Central Ltd.
Leaf morphology in Cowpea [Vigna unguiculata (L.) Walp]: QTL analysis, physical mapping and identifying a candidate gene using synteny with model legume species

PubMed Central

2012-01-01

Background Cowpea [Vigna unguiculata (L.) Walp] exhibits a considerable variation in leaf shape. Although cowpea is mostly utilized as a dry grain and animal fodder crop, cowpea leaves are also used as a high-protein pot herb in many countries of Africa. Results Leaf morphology was studied in the cowpea RIL population, Sanzi (sub-globose leaf shape) x Vita 7 (hastate leaf shape). A QTL for leaf shape, Hls (hastate leaf shape), was identified on the Sanzi x Vita 7 genetic map spanning from 56.54 cM to 67.54 cM distance on linkage group 15. SNP marker 1_0910 was the most significant over the two experiments, accounting for 74.7% phenotypic variance (LOD 33.82) in a greenhouse experiment and 71.5% phenotypic variance (LOD 30.89) in a field experiment. The corresponding Hls locus was positioned on the cowpea consensus genetic map on linkage group 4, spanning from 25.57 to 35.96 cM. A marker-trait association of the Hls region identified SNP marker 1_0349 alleles co-segregating with either the hastate or sub-globose leaf phenotype. High co-linearity was observed for the syntenic Hls region in Medicago truncatula and Glycine max. One syntenic locus for Hls was identified on Medicago chromosome 7 while syntenic regions for Hls were identified on two soybean chromosomes, 3 and 19. In all three syntenic loci, an ortholog for the EZA1/SWINGER (AT4G02020.1) gene was observed and is the candidate gene for the Hls locus. The Hls locus was identified on the cowpea physical map via SNP markers 1_0910, 1_1013 and 1_0992 which were identified in three BAC contigs; contig926, contig821 and contig25. Conclusions This study has demonstrated how integrated genomic resources can be utilized for a candidate gene approach. Identification of genes which control leaf morphology may be utilized to improve the quality of cowpea leaves for vegetable and or forage markets as well as contribute to more fundamental research understanding the control of leaf shape in legumes. PMID:22691139
Leaf morphology in Cowpea [Vigna unguiculata (L.) Walp]: QTL analysis, physical mapping and identifying a candidate gene using synteny with model legume species.

PubMed

Pottorff, Marti; Ehlers, Jeffrey D; Fatokun, Christian; Roberts, Philip A; Close, Timothy J

2012-06-12

Cowpea [Vigna unguiculata (L.) Walp] exhibits a considerable variation in leaf shape. Although cowpea is mostly utilized as a dry grain and animal fodder crop, cowpea leaves are also used as a high-protein pot herb in many countries of Africa. Leaf morphology was studied in the cowpea RIL population, Sanzi (sub-globose leaf shape) x Vita 7 (hastate leaf shape). A QTL for leaf shape, Hls (hastate leaf shape), was identified on the Sanzi x Vita 7 genetic map spanning from 56.54 cM to 67.54 cM distance on linkage group 15. SNP marker 1_0910 was the most significant over the two experiments, accounting for 74.7% phenotypic variance (LOD 33.82) in a greenhouse experiment and 71.5% phenotypic variance (LOD 30.89) in a field experiment. The corresponding Hls locus was positioned on the cowpea consensus genetic map on linkage group 4, spanning from 25.57 to 35.96 cM. A marker-trait association of the Hls region identified SNP marker 1_0349 alleles co-segregating with either the hastate or sub-globose leaf phenotype. High co-linearity was observed for the syntenic Hls region in Medicago truncatula and Glycine max. One syntenic locus for Hls was identified on Medicago chromosome 7 while syntenic regions for Hls were identified on two soybean chromosomes, 3 and 19. In all three syntenic loci, an ortholog for the EZA1/SWINGER (AT4G02020.1) gene was observed and is the candidate gene for the Hls locus. The Hls locus was identified on the cowpea physical map via SNP markers 1_0910, 1_1013 and 1_0992 which were identified in three BAC contigs; contig926, contig821 and contig25. This study has demonstrated how integrated genomic resources can be utilized for a candidate gene approach. Identification of genes which control leaf morphology may be utilized to improve the quality of cowpea leaves for vegetable and or forage markets as well as contribute to more fundamental research understanding the control of leaf shape in legumes.
Evolutionary transgenomics: prospects and challenges.

PubMed

Correa, Raul; Baum, David A

2015-01-01

Many advances in our understanding of the genetic basis of species differences have arisen from transformation experiments, which allow us to study the effect of genes from one species (the donor) when placed in the genetic background of another species (the recipient). Such interspecies transformation experiments are usually focused on candidate genes - genes that, based on work in model systems, are suspected to be responsible for certain phenotypic differences between the donor and recipient species. We suggest that the high efficiency of transformation in a few plant species, most notably Arabidopsis thaliana, combined with the small size of typical plant genes and their cis-regulatory regions allow implementation of a screening strategy that does not depend upon a priori candidate gene identification. This approach, transgenomics, entails moving many large genomic inserts of a donor species into the wild type background of a recipient species and then screening for dominant phenotypic effects. As a proof of concept, we recently conducted a transgenomic screen that analyzed more than 1100 random, large genomic inserts of the Alabama gladecress Leavenworthia alabamica for dominant phenotypic effects in the A. thaliana background. This screen identified one insert that shortens fruit and decreases A. thaliana fertility. In this paper we discuss the principles of transgenomic screens and suggest methods to help minimize the frequencies of false positive and false negative results. We argue that, because transgenomics avoids committing in advance to candidate genes it has the potential to help us identify truly novel genes or cryptic functions of known genes. Given the valuable knowledge that is likely to be gained, we believe the time is ripe for the plant evolutionary community to invest in transgenomic screens, at least in the mustard family Brassicaceae where many species are amenable to efficient transformation.
Construction of a β-galactosidase-gene-based fusion is convenient for screening candidate genes involved in regulation of pyrrolnitrin biosynthesis in Pseudomonas chlororaphis G05.

PubMed

Luo, Wangtai; Miao, Jing; Feng, Zhibin; Lu, Ruiyang; Sun, Xiaoqiang; Zhang, Baoshen; Ding, Weiqiu; Lu, Yang; Wang, Yanhua; Chi, Xiaoyan; Ge, Yihe

2018-05-28

In our recent work, we found that pyrrolnitrin, and not phenazines, pyrrolnitrin contributed to the suppression of the mycelia growth of Fusarium graminearum that causes heavy Fusarium head blight (FHB) disease in cereal crops. However, pyrrolnitrin production of Pseudomonas chlororaphis G05 in King's B medium was very low. Although a few regulatory genes mediating the prnABCD (the prn operon, pyrrolnitrin biosynthetic locus) expression have been identified, it is not enough for us to enhance pyrrolnitrin production by systematically constructing a genetically-engineered strain. To obtain new candidate genes involved in regulation of the prn operon expression, we successfully constructed a fusion mutant G05ΔphzΔprn::lacZ, in which most of the coding regions of the prn operon and the phzABCDEFG (the phz operon, phenazine biosynthetic locus) were deleted, and the promoter region plus the first thirty condons of the prnA was in-frame fused with the truncated lacZ gene on its chromosome. The expression of the fused lacZ reporter gene driven by the promoter of the prn operon made it easy for us to detect the level of the prn expression in terms of the color variation of colonies on LB agar plates supplemented with 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside (X-Gal). With this fusion mutant as a recipient strain, mini-Tn5-based random insertional mutagenesis was then conducted. By picking up colonies with color change, it is possible for us to screen and identify new candidate genes involved in regulation of the prn expression. Identification of additional regulatory genes in further work could reasonably be expected to increase pyrrolnitrin production in G05 and to improve its biological control function.
Different waves of effector genes with contrasted genomic location are expressed by Leptosphaeria maculans during cotyledon and stem colonization of oilseed rape.

PubMed

Gervais, Julie; Plissonneau, Clémence; Linglin, Juliette; Meyer, Michel; Labadie, Karine; Cruaud, Corinne; Fudal, Isabelle; Rouxel, Thierry; Balesdent, Marie-Hélène

2017-10-01

Leptosphaeria maculans, the causal agent of stem canker disease, colonizes oilseed rape (Brassica napus) in two stages: a short and early colonization stage corresponding to cotyledon or leaf colonization, and a late colonization stage during which the fungus colonizes systemically and symptomlessly the plant during several months before stem canker appears. To date, the determinants of the late colonization stage are poorly understood; L. maculans may either successfully escape plant defences, leading to stem canker development, or the plant may develop an 'adult-stage' resistance reducing canker incidence. To obtain an insight into these determinants, we performed an RNA-sequencing (RNA-seq) pilot project comparing fungal gene expression in infected cotyledons and in symptomless or necrotic stems. Despite the low fraction of fungal material in infected stems, sufficient fungal transcripts were detected and a large number of fungal genes were expressed, thus validating the feasibility of the approach. Our analysis showed that all avirulence genes previously identified are under-expressed during stem colonization compared with cotyledon colonization. A validation RNA-seq experiment was then performed to investigate the expression of candidate effector genes during systemic colonization. Three hundred and seven 'late' effector candidates, under-expressed in the early colonization stage and over-expressed in the infected stems, were identified. Finally, our analysis revealed a link between the regulation of expression of effectors and their genomic location: the 'late' effector candidates, putatively involved in systemic colonization, are located in gene-rich genomic regions, whereas the 'early' effector genes, over-expressed in the early colonization stage, are located in gene-poor regions of the genome. © 2016 BSPP AND JOHN WILEY & SONS LTD.
Transposon mutagenesis identifies chromatin modifiers cooperating with Ras in thyroid tumorigenesis and detects ATXN7 as a cancer gene.

PubMed

Montero-Conde, Cristina; Leandro-Garcia, Luis J; Chen, Xu; Oler, Gisele; Ruiz-Llorente, Sergio; Ryder, Mabel; Landa, Iñigo; Sanchez-Vega, Francisco; La, Konnor; Ghossein, Ronald A; Bajorin, Dean F; Knauf, Jeffrey A; Riordan, Jesse D; Dupuy, Adam J; Fagin, James A

2017-06-20

Oncogenic RAS mutations are present in 15-30% of thyroid carcinomas. Endogenous expression of mutant Ras is insufficient to initiate thyroid tumorigenesis in murine models, indicating that additional genetic alterations are required. We used Sleeping Beauty (SB) transposon mutagenesis to identify events that cooperate with Hras G12V in thyroid tumor development. Random genomic integration of SB transposons primarily generated loss-of-function events that significantly increased thyroid tumor penetrance in Tpo-Cre/homozygous FR-Hras G12V mice. The thyroid tumors closely phenocopied the histological features of human RAS-driven, poorly differentiated thyroid cancers. Characterization of transposon insertion sites in the SB-induced tumors identified 45 recurrently mutated candidate cancer genes. These mutation profiles were remarkably concordant with mutated cancer genes identified in a large series of human poorly differentiated and anaplastic thyroid cancers screened by next-generation sequencing using the MSK-IMPACT panel of cancer genes, which we modified to include all SB candidates. The disrupted genes primarily clustered in chromatin remodeling functional nodes and in the PI3K pathway. ATXN7 , a component of a multiprotein complex with histone acetylase activity, scored as a significant SB hit. It was recurrently mutated in advanced human cancers and significantly co-occurred with RAS or NF1 mutations. Expression of ATXN7 mutants cooperated with oncogenic RAS to induce thyroid cell proliferation, pointing to ATXN7 as a previously unrecognized cancer gene.
Transposon mutagenesis identifies chromatin modifiers cooperating with Ras in thyroid tumorigenesis and detects ATXN7 as a cancer gene

PubMed Central

Montero-Conde, Cristina; Leandro-Garcia, Luis J.; Chen, Xu; Oler, Gisele; Ruiz-Llorente, Sergio; Ryder, Mabel; Landa, Iñigo; Sanchez-Vega, Francisco; La, Konnor; Ghossein, Ronald A.; Bajorin, Dean F.; Knauf, Jeffrey A.; Riordan, Jesse D.; Dupuy, Adam J.; Fagin, James A.

2017-01-01

Oncogenic RAS mutations are present in 15–30% of thyroid carcinomas. Endogenous expression of mutant Ras is insufficient to initiate thyroid tumorigenesis in murine models, indicating that additional genetic alterations are required. We used Sleeping Beauty (SB) transposon mutagenesis to identify events that cooperate with HrasG12V in thyroid tumor development. Random genomic integration of SB transposons primarily generated loss-of-function events that significantly increased thyroid tumor penetrance in Tpo-Cre/homozygous FR-HrasG12V mice. The thyroid tumors closely phenocopied the histological features of human RAS-driven, poorly differentiated thyroid cancers. Characterization of transposon insertion sites in the SB-induced tumors identified 45 recurrently mutated candidate cancer genes. These mutation profiles were remarkably concordant with mutated cancer genes identified in a large series of human poorly differentiated and anaplastic thyroid cancers screened by next-generation sequencing using the MSK-IMPACT panel of cancer genes, which we modified to include all SB candidates. The disrupted genes primarily clustered in chromatin remodeling functional nodes and in the PI3K pathway. ATXN7, a component of a multiprotein complex with histone acetylase activity, scored as a significant SB hit. It was recurrently mutated in advanced human cancers and significantly co-occurred with RAS or NF1 mutations. Expression of ATXN7 mutants cooperated with oncogenic RAS to induce thyroid cell proliferation, pointing to ATXN7 as a previously unrecognized cancer gene. PMID:28584132
Analysis of Differentially Expressed Genes and Signaling Pathways Related to Intramuscular Fat Deposition in Skeletal Muscle of Sex-Linked Dwarf Chickens

PubMed Central

Ye, Yaqiong; Lin, Shumao; Mu, Heping; Tang, Xiaohong; Ou, Yangdan; Chen, Jian; Ma, Yongjiang; Li, Yugu

2014-01-01

Intramuscular fat (IMF) plays an important role in meat quality. However, the molecular mechanisms underlying IMF deposition in skeletal muscle have not been addressed for the sex-linked dwarf (SLD) chicken. In this study, potential candidate genes and signaling pathways related to IMF deposition in chicken leg muscle tissue were characterized using gene expression profiling of both 7-week-old SLD and normal chickens. A total of 173 differentially expressed genes (DEGs) were identified between the two breeds. Subsequently, 6 DEGs related to lipid metabolism or muscle development were verified in each breed based on gene ontology (GO) analysis. In addition, KEGG pathway analysis of DEGs indicated that some of them (GHR, SOCS3, and IGF2BP3) participate in adipocytokine and insulin signaling pathways. To investigate the role of the above signaling pathways in IMF deposition, the gene expression of pathway factors and other downstream genes were measured by using qRT-PCR and Western blot analyses. Collectively, the results identified potential candidate genes related to IMF deposition and suggested that IMF deposition in skeletal muscle of SLD chicken is regulated partially by pathways of adipocytokine and insulin and other downstream signaling pathways (TGF-β/SMAD3 and Wnt/catenin-β pathway). PMID:24757673
RNA expression of genes involved in cytarabine metabolism and transport predicts cytarabine response in acute myeloid leukemia.

PubMed

Abraham, Ajay; Varatharajan, Savitha; Karathedath, Sreeja; Philip, Chepsy; Lakshmi, Kavitha M; Jayavelu, Ashok Kumar; Mohanan, Ezhilpavai; Janet, Nancy Beryl; Srivastava, Vivi M; Shaji, Ramachandran V; Zhang, Wei; Abraham, Aby; Viswabandya, Auro; George, Biju; Chandy, Mammen; Srivastava, Alok; Mathews, Vikram; Balasubramanian, Poonkuzhali

2015-07-01

Variation in terms of outcome and toxic side effects of treatment exists among acute myeloid leukemia (AML) patients on chemotherapy with cytarabine (Ara-C) and daunorubicin (Dnr). Candidate Ara-C metabolizing gene expression in primary AML cells is proposed to account for this variation. Ex vivo Ara-C sensitivity was determined in primary AML samples using MTT assay. mRNA expression of candidate Ara-C metabolizing genes were evaluated by RQPCR analysis. Global gene expression profiling was carried out for identifying differentially expressed genes between exvivo Ara-C sensitive and resistant samples. Wide interindividual variations in ex vivo Ara-C cytotoxicity were observed among samples from patients with AML and were stratified into sensitive, intermediately sensitive and resistant, based on IC50 values obtained by MTT assay. RNA expression of deoxycytidine kinase (DCK), human equilibrative nucleoside transporter-1 (ENT1) and ribonucleotide reductase M1 (RRM1) were significantly higher and cytidine deaminase (CDA) was significantly lower in ex vivo Ara-C sensitive samples. Higher DCK and RRM1 expression in AML patient's blast correlated with better DFS. Ara-C resistance index (RI), a mathematically derived quotient was proposed based on candidate gene expression pattern. Ara-C ex vivo sensitive samples were found to have significantly lower RI compared with resistant as well as samples from patients presenting with relapse. Patients with low RI supposedly highly sensitive to Ara-C were found to have higher incidence of induction death (p = 0.002; RR: 4.35 [95% CI: 1.69-11.22]). Global gene expression profiling undertaken to find out additional contributors of Ara-C resistance identified many apoptosis as well as metabolic pathway genes to be differentially expressed between Ara-C resistant and sensitive samples. This study highlights the importance of evaluating expression of candidate Ara-C metabolizing genes in predicting ex vivo drug response as well as treatment outcome. RI could be a predictor of ex vivo Ara-C response irrespective of cytogenetic and molecular risk groups and a potential biomarker for AML treatment outcome and toxicity. Original submitted 22 December 2014; Revision submitted 9 April 2015.
Genetic Variants Identified from Epilepsy of Unknown Etiology in Chinese Children by Targeted Exome Sequencing

PubMed Central

Wang, Yimin; Du, Xiaonan; Bin, Rao; Yu, Shanshan; Xia, Zhezhi; Zheng, Guo; Zhong, Jianmin; Zhang, Yunjian; Jiang, Yong-hui; Wang, Yi

2017-01-01

Genetic factors play a major role in the etiology of epilepsy disorders. Recent genomics studies using next generation sequencing (NGS) technique have identified a large number of genetic variants including copy number (CNV) and single nucleotide variant (SNV) in a small set of genes from individuals with epilepsy. These discoveries have contributed significantly to evaluate the etiology of epilepsy in clinic and lay the foundation to develop molecular specific treatment. However, the molecular basis for a majority of epilepsy patients remains elusive, and furthermore, most of these studies have been conducted in Caucasian children. Here we conducted a targeted exome-sequencing of 63 trios of Chinese epilepsy families using a custom-designed NGS panel that covers 412 known and candidate genes for epilepsy. We identified pathogenic and likely pathogenic variants in 15 of 63 (23.8%) families in known epilepsy genes including SCN1A, CDKL5, STXBP1, CHD2, SCN3A, SCN9A, TSC2, MBD5, POLG and EFHC1. More importantly, we identified likely pathologic variants in several novel candidate genes such as GABRE, MYH1, and CLCN6. Our results provide the evidence supporting the application of custom-designed NGS panel in clinic and indicate a conserved genetic susceptibility for epilepsy between Chinese and Caucasian children. PMID:28074849
Using whole-exome sequencing to investigate the genetic bases of lysosomal storage diseases of unknown etiology.

PubMed

Wang, Nan; Zhang, Yeting; Gedvilaite, Erika; Loh, Jui Wan; Lin, Timothy; Liu, Xiuping; Liu, Chang-Gong; Kumar, Dibyendu; Donnelly, Robert; Raymond, Kimiyo; Schuchman, Edward H; Sleat, David E; Lobel, Peter; Xing, Jinchuan

2017-11-01

Lysosomes are membrane-bound, acidic eukaryotic cellular organelles that play important roles in the degradation of macromolecules. Mutations that cause the loss of lysosomal protein function can lead to a group of disorders categorized as the lysosomal storage diseases (LSDs). Suspicion of LSD is frequently based on clinical and pathologic findings, but in some cases, the underlying genetic and biochemical defects remain unknown. Here, we performed whole-exome sequencing (WES) on 14 suspected LSD cases to evaluate the feasibility of using WES for identifying causal mutations. By examining 2,157 candidate genes potentially associated with lysosomal function, we identified eight variants in five genes as candidate disease-causing variants in four individuals. These included both known and novel mutations. Variants were corroborated by targeted sequencing and, when possible, functional assays. In addition, we identified nonsense mutations in two individuals in genes that are not known to have lysosomal function. However, mutations in these genes could have resulted in phenotypes that were diagnosed as LSDs. This study demonstrates that WES can be used to identify causal mutations in suspected LSD cases. We also demonstrate cases where a confounding clinical phenotype may potentially reflect more than one lysosomal protein defect. © 2017 Wiley Periodicals, Inc.
Systematic screening of isogenic cancer cells identifies DUSP6 as context-specific synthetic lethal target in melanoma

PubMed Central

Wittig-Blaich, Stephanie; Wittig, Rainer; Schmidt, Steffen; Lyer, Stefan; Bewerunge-Hudler, Melanie; Gronert-Sum, Sabine; Strobel-Freidekind, Olga; Müller, Carolin; List, Markus; Jaskot, Aleksandra; Christiansen, Helle; Hafner, Mathias; Schadendorf, Dirk; Block, Ines; Mollenhauer, Jan

2017-01-01

Next-generation sequencing has dramatically increased genome-wide profiling options and conceptually initiates the possibility for personalized cancer therapy. State-of-the-art sequencing studies yield large candidate gene sets comprising dozens or hundreds of mutated genes. However, few technologies are available for the systematic downstream evaluation of these results to identify novel starting points of future cancer therapies. We improved and extended a site-specific recombination-based system for systematic analysis of the individual functions of a large number of candidate genes. This was facilitated by a novel system for the construction of isogenic constitutive and inducible gain- and loss-of-function cell lines. Additionally, we demonstrate the construction of isogenic cell lines with combinations of the traits for advanced functional in vitro analyses. In a proof-of-concept experiment, a library of 108 isogenic melanoma cell lines was constructed and 8 genes were identified that significantly reduced viability in a discovery screen and in an independent validation screen. Here, we demonstrate the broad applicability of this recombination-based method and we proved its potential to identify new drug targets via the identification of the tumor suppressor DUSP6 as potential synthetic lethal target in melanoma cell lines with BRAF V600E mutations and high DUSP6 expression. PMID:28423600
Adipose and muscle tissue gene expression of two genes (NCAPG and LCORL) located in a chromosomal region associated with cattle feed intake and gain

USDA-ARS?s Scientific Manuscript database

A region on bovine chromosome 6 has been implicated in cattle birth weight, growth, and length. Non-SMC conodensin I complex subunit G (NCAPG) and ligand dependent nuclear receptor corepressor-like protein (LCORL) are positional candidate genes within this region. Previously identified genetic mark...
Identifying gene networks underlying the neurobiology of ethanol and alcoholism.

PubMed

Wolen, Aaron R; Miles, Michael F

2012-01-01

For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.
Identification of Treatment Targets in a Genetic Mouse Model of Voluntary Methamphetamine Drinking.

PubMed

Phillips, T J; Mootz, J R K; Reed, C

2016-01-01

Methamphetamine has powerful stimulant and euphoric effects that are experienced as rewarding and encourage use. Methamphetamine addiction is associated with debilitating illnesses, destroyed relationships, child neglect, violence, and crime; but after many years of research, broadly effective medications have not been identified. Individual differences that may impact not only risk for developing a methamphetamine use disorder but also affect treatment response have not been fully considered. Human studies have identified candidate genes that may be relevant, but lack of control over drug history, the common use or coabuse of multiple addictive drugs, and restrictions on the types of data that can be collected in humans are barriers to progress. To overcome some of these issues, a genetic animal model comprised of lines of mice selectively bred for high and low voluntary methamphetamine intake was developed to identify risk and protective alleles for methamphetamine consumption, and identify therapeutic targets. The mu opioid receptor gene was supported as a target for genes within a top-ranked transcription factor network associated with level of methamphetamine intake. In addition, mice that consume high levels of methamphetamine were found to possess a nonfunctional form of the trace amine-associated receptor 1 (TAAR1). The Taar1 gene is within a mouse chromosome 10 quantitative trait locus for methamphetamine consumption, and TAAR1 function determines sensitivity to aversive effects of methamphetamine that may curb intake. The genes, gene interaction partners, and protein products identified in this genetic mouse model represent treatment target candidates for methamphetamine addiction. © 2016 Elsevier Inc. All rights reserved.
The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

PubMed

Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

2013-10-01

The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
Localization of a translocation breakpoint involved in Smith-Lemli-Opitz syndrome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Alley, T.L.; Gray, B.A.; Lee, S.

1994-09-01

Smith-Lemli-Opitz syndrome (SLOS) is a multiple congenital anomaly/mental retardation syndrome, with features including toe syndactyly, genital anomalies, unusual facies, and occasional organ malformations. The gene(s) for this autosomal recessive disorder has not been mapped. Recent biochemical studies suggest that the defect may involve the penultimate step in cholesterol synthesis, as patients have low serum cholesterol and increased 7-dehydrocholesterol (7-DHC) levels. However, the enzyme putatively involved (7-DHC reductase) has not been isolated. We identified an SLOS patient with a de novo balanced chromosome translocation [t(7;20)(q32.1;q13.2)], and we propose that the translocation interrupts one of the patient`s SLOS alleles. We are pursuingmore » positional cloning to identify the SLOS gene. Using fluorescence in situ hybridization (FISH), we recently identified a chromosome 7 yeast artificial chromosome (YAC) that spans the breakpoint and places it onto physical and genetic maps. We are in the process of narrowing this region via overlapping YACs and YAC subclones, from which we will isolate candidate cDNAs. Any candidate gene disrupted by the translocation and mutated on the other allele will be proven to be the SLOS gene. Functional analysis of an SLOS cDNA may also determine its relationship to cholesterol metabolism and the observed biochemical abnormalities.« less

Some links on this page may take you to non-federal websites. Their policies may differ from this site.