Li, Angsheng; Yin, Xianchen; Pan, Yicheng
2016-01-01
In this study, we propose a method for constructing cell sample networks from gene expression profiles, and a structural entropy minimisation principle for detecting natural structure of networks and for identifying cancer cell subtypes. Our method establishes a three-dimensional gene map of cancer cell types and subtypes. The identified subtypes are defined by a unique gene expression pattern, and a three-dimensional gene map is established by defining the unique gene expression pattern for each identified subtype for cancers, including acute leukaemia, lymphoma, multi-tissue, lung cancer and healthy tissue. Our three-dimensional gene map demonstrates that a true tumour type may be divided into subtypes, each defined by a unique gene expression pattern. Clinical data analyses demonstrate that most cell samples of an identified subtype share similar survival times, survival indicators and International Prognostic Index (IPI) scores and indicate that distinct subtypes identified by our algorithms exhibit different overall survival times, survival ratios and IPI scores. Our three-dimensional gene map establishes a high-definition, one-to-one map between the biologically and medically meaningful tumour subtypes and the gene expression patterns, and identifies remarkable cells that form singleton submodules. PMID:26842724
Lathe, R
1977-09-01
The firA (Ts)200 mutation not only eliminates the resistance to rifampin of certain genetically resistant strains, but, moreover, renders ribonucleic acid synthesis thermolabile. The firA gene has been mapped by P1 tranduction and is located extremely close to the structural gene for deoxyribonucleic acid polymerase III at 4 min on the Escherichia coli linkage map.
Lathe, R
1977-01-01
The firA (Ts)200 mutation not only eliminates the resistance to rifampin of certain genetically resistant strains, but, moreover, renders ribonucleic acid synthesis thermolabile. The firA gene has been mapped by P1 tranduction and is located extremely close to the structural gene for deoxyribonucleic acid polymerase III at 4 min on the Escherichia coli linkage map. PMID:330494
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hamann, J.; Van Lier, R.A.W.; Hartmann, E.
1996-02-15
This article reports on the structure and genetic mapping of the human CD97 gene, a homologue to the secretin receptor superfamily of cell surface proteins. The detailed organization of the gene, which maps to the short arm of chromosome 19, is given. 18 refs., 1 fig., 1 tab.
Nagle, D L; Martin-DeLeon, P; Hough, R B; Bućan, M
1994-01-01
We are studying the chromosomal structure of three developmental mutations, dominant spotting (W), patch (Ph), and rump white (Rw) on mouse chromosome 5. These mutations are clustered in a region containing three genes encoding tyrosine kinase receptors (Kit, Pdgfra, and Flk1). Using probes for these genes and for a closely linked locus, D5Mn125, we established a high-resolution physical map covering approximately 2.8 Mb. The entire chromosomal segment mapped in this study is deleted in the W19H mutation. The map indicates the position of the Ph deletion, which encompasses not more than 400 kb around and including the Pdgfra gene. The map also places the distal breakpoint of the Rw inversion to a limited chromosomal segment between Kit and Pdgfra. In light of the structure of the Ph-W-Rw region, we interpret the previously published complementation analyses as indicating that the pigmentation defect in Rw/+ heterozygotes could be due to the disruption of Kit and/or Pdgfra regulatory sequences, whereas the gene(s) responsible for the recessive lethality of Rw/Rw embryos is not closely linked to the Ph and W loci and maps proximally to the W19H deletion. The structural analysis of chromosomal rearrangements associated with W19H, Ph, and Rw combined with the high-resolution physical mapping points the way toward the definition of these mutations in molecular terms and isolation of homologous genes on human chromosome 4. Images PMID:8041773
Lan, DaoLiang; Xiong, XianRong; Wei, YanLi; Xu, Tong; Zhong, JinCheng; Zhi, XiangDong; Wang, Yong; Li, Jian
2014-09-01
RNA-Seq, a high-throughput (HT) sequencing technique, has been used effectively in large-scale transcriptomic studies, and is particularly useful for improving gene structure information and mining of new genes. In this study, RNA-Seq HT technology was employed to analyze the transcriptome of yak ovary. After Illumina-Solexa deep sequencing, 26826516 clean reads with a total of 4828772880 bp were obtained from the ovary library. Alignment analysis showed that 16992 yak genes mapped to the yak genome and 3734 of these genes were involved in alternative splicing. Gene structure refinement analysis showed that 7340 genes that were annotated in the yak genome could be extended at the 5' or 3' ends based on the alignments been the transcripts and the genome sequence. Novel transcript prediction analysis identified 6321 new transcripts with lengths ranging from 180 to 14884 bp, and 2267 of them were predicted to code proteins. BLAST analysis of the new transcripts showed that 1200?4933 mapped to the non-redundant (nr), nucleotide (nt) and/or SwissProt sequence databases. Comparative statistical analysis of the new mapped transcripts showed that the majority of them were similar to genes in Bos taurus (41.4%), Bos grunniens mutus (33.0%), Ovis aries (6.3%), Homo sapiens (2.8%), Mus musculus (1.6%) and other species. Functional analysis showed that these expressed genes were involved in various Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes pathways. GO analysis of the new transcripts found that the largest proportion of them was associated with reproduction. The results of this study will provide a basis for describing the normal transcriptome map of yak ovary and for future studies on yak breeding performance. Moreover, the results confirmed that RNA-Seq HT technology is highly advantageous in improving gene structure information and mining of new genes, as well as in providing valuable data to expand the yak genome information.
The Choice between MapMan and Gene Ontology for Automated Gene Function Prediction in Plant Science
Klie, Sebastian; Nikoloski, Zoran
2012-01-01
Since the introduction of the Gene Ontology (GO), the analysis of high-throughput data has become tightly coupled with the use of ontologies to establish associations between knowledge and data in an automated fashion. Ontologies provide a systematic description of knowledge by a controlled vocabulary of defined structure in which ontological concepts are connected by pre-defined relationships. In plant science, MapMan and GO offer two alternatives for ontology-driven analyses. Unlike GO, initially developed to characterize microbial systems, MapMan was specifically designed to cover plant-specific pathways and processes. While the dependencies between concepts in MapMan are modeled as a tree, in GO these are captured in a directed acyclic graph. Therefore, the difference in ontologies may cause discrepancies in data reduction, visualization, and hypothesis generation. Here provide the first systematic comparative analysis of GO and MapMan for the case of the model plant species Arabidopsis thaliana (Arabidopsis) with respect to their structural properties and difference in distributions of information content. In addition, we investigate the effect of the two ontologies on the specificity and sensitivity of automated gene function prediction via the coupling of co-expression networks and the guilt-by-association principle. Automated gene function prediction is particularly needed for the model plant Arabidopsis in which only half of genes have been functionally annotated based on sequence similarity to known genes. The results highlight the need for structured representation of species-specific biological knowledge, and warrants caution in the design principles employed in future ontologies. PMID:22754563
Lengeler, J
1975-01-01
Mutants of Escherichia coli K-12 unable to grow on any of the three naturally occurring hexitols D-manitol, D-glucitol, and galactitol and, among these specifically, mutants with altered transport and phosphorylating activity have been isolated. Different isolation procedures have been utilized, including suicide by D-[3H]mannitol, chemotaxis, and resistance to the toxic hexitol analogue 2-deoxy-arabino-hexitol. Mutations thus obtained have been mapped in four distinct operons. (i) Mutations affecting an enzyme II-complexmt1 activity of the phosphoenolpyruvate-dependent phosphotransferase system all map in gene mtlA. This gene has previously been shown (Solomon and Lin, 1972) to be part of an operon, mtl, located at 71 min on the E. coli linkage map containing, in addition to mtlA, the cis-dominant regulatory gene mtlC and mtlD, the structural gene for the enzyme D-mannitol-1-phosphate dehydrogenase. The gene order in this operon, induced by D-mannitol, is mtlC A D. (ii) Mutations in gene gutA affecting a second enzyme II-complexgut of the phosphotransferase system map at 51 min, clustered in operon gutC A D together with the cis-dominant regulatory gene gutC and the structural gene gutD for the enzyme D-glucitol-6-phosphate dehydrogenase. The gut operon, previously called sbl or srl, is induced by D-glucitol. (iii) Mutations affecting the transport and catabolism of galactitol are clustered in a third operon, gatC A D, located at 40.5 min. This operon again contains a cis-dominant regulatory gene, gatC, the structural gene gatD for galactitol-1-phosphate dehydrogenase, and gene gatA coding for a thrid hexitol-specific enzyme II-complexgat. Other genes coding for two additional enzymes involved in galactitol catabolism apparently are not linked to gatC A D. (iv) A fourth class of mutants pleiotropically negative for hexitol growth and transport maps in the pts operon. Triple-negative mutants (mtlA gutA gatA) do not have further transport or phosphorylating activity for any of the three hexitols. PMID:1100602
EMAP and EMAGE: a framework for understanding spatially organized data.
Baldock, Richard A; Bard, Jonathan B L; Burger, Albert; Burton, Nicolas; Christiansen, Jeff; Feng, Guanjie; Hill, Bill; Houghton, Derek; Kaufman, Matthew; Rao, Jianguo; Sharpe, James; Ross, Allyson; Stevenson, Peter; Venkataraman, Shanmugasundaram; Waterhouse, Andrew; Yang, Yiya; Davidson, Duncan R
2003-01-01
The Edinburgh MouseAtlas Project (EMAP) is a time-series of mouse-embryo volumetric models. The models provide a context-free spatial framework onto which structural interpretations and experimental data can be mapped. This enables collation, comparison, and query of complex spatial patterns with respect to each other and with respect to known or hypothesized structure. The atlas also includes a time-dependent anatomical ontology and mapping between the ontology and the spatial models in the form of delineated anatomical regions or tissues. The models provide a natural, graphical context for browsing and visualizing complex data. The Edinburgh Mouse Atlas Gene-Expression Database (EMAGE) is one of the first applications of the EMAP framework and provides a spatially mapped gene-expression database with associated tools for data mapping, submission, and query. In this article, we describe the underlying principles of the Atlas and the gene-expression database, and provide a practical introduction to the use of the EMAP and EMAGE tools, including use of new techniques for whole body gene-expression data capture and mapping.
Gene transfer and gene mapping in mammalian cells in culture.
Shows, T B; Sakaguchi, A Y
1980-01-01
The ability to transfer mammalian genes parasexually has opened new possibilities for gene mapping and fine structure mapping and offers great potential for contributing to several aspects of mammalian biology, including gene expression and genetic engineering. The DNA transferred has ranged from whole genomes to single genes and smaller segments of DNA. The transfer of whole genomes by cell fusion forms cell hybrids, which has promoted the extensive mapping of human and mouse genes. Transfer, by cell fusion, of rearranged chromosomes has contributed significantly to determining close linkage and the assignment of genes to specific chromosomal regions. Transfer of single chromosomes has been achieved utilizing microcells fused to recipient cells. Metaphase chromosomes have been isolated and used to transfer single-to-multigenic DNA segments. DNA-mediated gene transfer, simulating bacterial transformation, has achieved transfer of single-copy genes. By utilizing DNA cleaved with restriction endonucleases, gene transfer is being empolyed as a bioassay for the purification of genes. Gene mapping and the fate of transferred genes can be examined now at the molecular level using sequence-specific probles. Recently, single genes have been cloned into eucaryotic and procaryotic vectors for transfer into mammalian cells. Moreover, recombinant libraries in which entire mammalian genomes are represented collectively are a rich new source of transferable genes. Methodology for transferring mammalian genetic information and applications for mapping mammalian genes is presented and prospects for the future discussed.
Characterization of the Structural Gene Promoter of Aedes aegypti Densovirus
Ward, Todd W.; Kimmick, Michael W.; Afanasiev, Boris N.; Carlson, Jonathan O.
2001-01-01
Aedes aegypti densonucleosis virus (AeDNV) has two promoters that have been shown to be active by reporter gene expression analysis (B. N. Afanasiev, Y. V. Koslov, J. O. Carlson, and B. J. Beaty, Exp. Parasitol. 79:322–339, 1994). Northern blot analysis of cells infected with AeDNV revealed two transcripts 1,200 and 3,500 nucleotides in length that are assumed to express the structural protein (VP) gene and nonstructural protein genes, respectively. Primer extension was used to map the transcriptional start site of the structural protein gene. Surprisingly, the structural protein gene transcript began at an initiator consensus sequence, CAGT, 60 nucleotides upstream from the map unit 61 TATAA sequence previously thought to define the promoter. Constructs with the β-galactosidase gene fused to the structural protein gene were used to determine elements necessary for promoter function. Deletion or mutation of the initiator sequence, CAGT, reduced protein expression by 93%, whereas mutation of the TATAA sequence at map unit 61 had little effect. An additional open reading frame was observed upstream of the structural protein gene that can express β-galactosidase at a low level (20% of that of VP fusions). Expression of the AeDNV structural protein gene was shown to be stimulated by the major nonstructural protein NS1 (Afanasiev et al., Exp. parasitol., 1994). To determine the sequences required for transactivation, expression of structural protein gene–β-galactosidase gene fusion constructs differing in AeDNV genome content was measured with and without NS1. The presence of NS1 led to an 8- to 10-fold increase in expression when either genomic end was present, compared to a 2-fold increase with a construct lacking the genomic ends. An even higher (37-fold) increase in expression occurred with both genomic ends present; however, this was in part due to template replication as shown by Southern blot analysis. These data indicate the location and importance of various elements necessary for efficient protein expression and transactivation from the structural protein gene promoter of AeDNV. PMID:11152505
Phenotypic assessments of peanut nested association mapping (NAM) populations
USDA-ARS?s Scientific Manuscript database
Nested association mapping (NAM) is a valuable innovation and multi-parental mapping population strategy in peanut genetics which increases the power to map quantitative trait loci and assists in extending the gene pool of elite peanut lines. In the peanut research community, two structured mapping ...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ferrari, S.; Finelli, P.; Rocchi, M.
The human genome contains a large number of sequences related to the cDNA for High Mobility Group 1 protein (HMG1), which so far has hampered the cloning and mapping of the active HMG1 gene. We show that the human HMG1 gene contains introns, while the HMG1-related sequences do not and most likely are retrotransposed pseudogenes. We identified eight YACs from the ICI and CEPH libraries that contain the human HMG1 gene. The HMG1 gene is similar in structure to the previously characterized murine homologue and maps to human chromosome 13 and q12, as determined by in situ hybridization. The mousemore » Hmg1 gene maps to the telomeric region of murine Chromosome 5, which is syntenic to the human 13q12 band. 18 refs., 3 figs.« less
Fraenkel, D. G.; Banerjee, Santimoy
1972-01-01
Genes for three enzymes of intermediary sugar metabolism in E. coli, zwf (glucose 6-phosphate dehydrogenase, constitutive), edd (gluconate 6-phosphate dehydrase, inducible), and eda (2-keto-3-deoxygluconate 6-phosphate aldolase, differently inducible) are closely linked on the E. coli genetic map, the overall gene order being man... old... eda. edd. zwf... cheB... uvrC... his. One class of apparent revertants of an eda mutant strain contains a secondary mutation in edd, and some of these mutations are deletions extending into zwf. We have used a series of spontaneous edd-zwf deletions to map a series of point mutants in zwf and thus report the first fine structure map of a gene for a constitutive enzyme (zwf). PMID:4560065
ACTG: novel peptide mapping onto gene models.
Choi, Seunghyuk; Kim, Hyunwoo; Paek, Eunok
2017-04-15
In many proteogenomic applications, mapping peptide sequences onto genome sequences can be very useful, because it allows us to understand origins of the gene products. Existing software tools either take the genomic position of a peptide start site as an input or assume that the peptide sequence exactly matches the coding sequence of a given gene model. In case of novel peptides resulting from genomic variations, especially structural variations such as alternative splicing, these existing tools cannot be directly applied unless users supply information about the variant, either its genomic position or its transcription model. Mapping potentially novel peptides to genome sequences, while allowing certain genomic variations, requires introducing novel gene models when aligning peptide sequences to gene structures. We have developed a new tool called ACTG (Amino aCids To Genome), which maps peptides to genome, assuming all possible single exon skipping, junction variation allowing three edit distances from the original splice sites, exon extension and frame shift. In addition, it can also consider SNVs (single nucleotide variations) during mapping phase if a user provides the VCF (variant call format) file as an input. Available at http://prix.hanyang.ac.kr/ACTG/search.jsp . eunokpaek@hanyang.ac.kr. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Casjens, S.; Eppler, K.; Sampson, L.; Parr, R.; Wyckoff, E.
1991-01-01
The mechanism by which dsDNA is packaged by viruses is not yet understood in any system. Bacteriophage P22 has been a productive system in which to study the molecular genetics of virus particle assembly and DNA packaging. Only five phage encoded proteins, the products of genes 3, 2, 1, 8 and 5, are required for packaging the virus chromosome inside the coat protein shell. We report here the construction of a detailed genetic and physical map of these genes, the neighboring gene 4 and a portion of gene 10, in which 289 conditional lethal amber, opal, temperature sensitive and cold sensitive mutations are mapped into 44 small (several hundred base pair) intervals of known sequence. Knowledge of missense mutant phenotypes and information on the location of these mutations allows us to begin the assignment of partial protein functions to portions of these genes. The map and mapping strains will be of use in the further genetic dissection of the P22 DNA packaging and prohead assembly processes. PMID:2029965
Ash, S; Johnson, C; Shohat, M; Shohat, T; Schlesinger, M
1994-08-01
The properdin deficiency gene has been localized to Xp21.1-Xcen; however, it is not clear whether the mutation responsible for the disease co-maps exactly with the structural properdin gene. Based on a recent study on a total of six families, the gene was found linked to DXS255 (theta = 0.00). As only a few families have been studied, it is not known whether the same gene is responsible for the disease in all families. In order to better localize the disease gene in Israel, we studied a Tunisian Jewish family with properdin deficiency for linkage with various X-markers. A maximum lod score of 1.93 at theta = 0.00 was calculated with the DXS7 probe while there was one recombination with DXS255. This study helps to better localize the properdin deficiency gene to Xp11.3-p21.1 proximal to DXS255 locus and confirms that there is no indication of genetic heterogeneity. Whether the properdin structural gene (PFC) and properdin deficiency locus are one and the same await demonstration of mutations in the structural gene in patients with properdin deficiency.
Marcus, Jeffrey M; Hughes, Tia M
2009-06-01
Structured inquiry approaches, in which students receive a Drosophila strain of unknown genotype to analyze and map the constituent mutations, are a common feature of many genetics teaching laboratories. The required crosses frustrate many students because they are aware that they are participating in a fundamentally trivial exercise, as the map locations of the genes are already established and have been recalculated thousands of times by generations of students. We modified the traditional structured inquiry approach to include a novel research experience for the students in our undergraduate genetics laboratories. Students conducted crosses with Drosophila strains carrying P[lacW] transposon insertions in genes without documented recombination map positions, representing a large number of unique, but equivalent genetic unknowns. Using the eye color phenotypes associated with the inserts as visible markers, it is straightforward to calculate recombination map positions for the interrupted loci. Collectively, our students mapped 95 genetic loci on chromosomes 2 and 3. In most cases, the calculated 95% confidence interval for meiotic map location overlapped with the predicted map position based on cytology. The research experience evoked positive student responses and helped students better understand the nature of scientific research for little additional cost or instructor effort.
Telomere Organization in the Ligninolytic Basidiomycete Pleurotus ostreatus▿ †
Pérez, Gúmer; Pangilinan, Jasmyn; Pisabarro, Antonio G.; Ramírez, Lucía
2009-01-01
Telomeres are structural and functional chromosome regions that are essential for the cell cycle to proceed normally. They are, however, difficult to map genetically and to identify in genome-wide sequence programs because of their structure and repetitive nature. We studied the telomeric and subtelomeric organization in the basidiomycete Pleurotus ostreatus using a combination of molecular and bioinformatics tools that permitted us to determine 19 out of the 22 telomeres expected in this fungus. The telomeric repeating unit in P. ostreatus is TTAGGG, and the numbers of repetitions of this unit range between 25 and 150. The mapping of the telomere restriction fragments to linkage groups 6 and 7 revealed polymorphisms compatible with those observed by pulsed field gel electrophoresis separation of the corresponding chromosomes. The subtelomeric regions in Pleurotus contain genes similar to those described in other eukaryotic systems. The presence of a cluster of laccase genes in chromosome 6 and a bipartite structure containing a Het-related protein and an alcohol dehydrogenase are especially relevant; this bipartite structure is characteristic of the Pezizomycotina fungi Neurospora crassa and Aspergillus terreus. As far as we know, this is the first report describing the presence of such structures in basidiomycetes and the location of a laccase gene cluster in the subtelomeric region, where, among others, species-specific genes allowing the organism to adapt rapidly to the environment usually map. PMID:19114509
Rustenholz, Camille; Choulet, Frédéric; Laugier, Christel; Safár, Jan; Simková, Hana; Dolezel, Jaroslav; Magni, Federica; Scalabrin, Simone; Cattonaro, Federica; Vautrin, Sonia; Bellec, Arnaud; Bergès, Hélène; Feuillet, Catherine; Paux, Etienne
2011-12-01
To improve our understanding of the organization and regulation of the wheat (Triticum aestivum) gene space, we established a transcription map of a wheat chromosome (3B) by hybridizing a newly developed wheat expression microarray with bacterial artificial chromosome pools from a new version of the 3B physical map as well as with cDNA probes derived from 15 RNA samples. Mapping data for almost 3,000 genes showed that the gene space spans the whole chromosome 3B with a 2-fold increase of gene density toward the telomeres due to an increase in the number of genes in islands. Comparative analyses with rice (Oryza sativa) and Brachypodium distachyon revealed that these gene islands are composed mainly of genes likely originating from interchromosomal gene duplications. Gene Ontology and expression profile analyses for the 3,000 genes located along the chromosome revealed that the gene islands are enriched significantly in genes sharing the same function or expression profile, thereby suggesting that genes in islands acquired shared regulation during evolution. Only a small fraction of these clusters of cofunctional and coexpressed genes was conserved with rice and B. distachyon, indicating a recent origin. Finally, genes with the same expression profiles in remote islands (coregulation islands) were identified suggesting long-distance regulation of gene expression along the chromosomes in wheat.
USDA-ARS?s Scientific Manuscript database
We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE (“Assessing Changes to Exons”) converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detect...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bozza, M.; Gerard, C.; Kolakowski, L.F. Jr.
1995-06-10
Macrophage migration inhibitory factor, MIF, is a cytokine released by T-lymphocytes, macrophages, and the pituitary gland that serves to integrate peripheral and central inflammatory responses. Ubiquitous expression and developmental regulation suggest that MIF may have additional roles outside of the immune system. Here we report the structure and chromosomal location of the mouse Mif gene and the partial characterization of five Mif pseudogenes. The mouse Mif gene spans less than 0.7 kb of chromosomal DNA and is composed of three exons. A comparison between the mouse and the human genes shows a similar gene structure and common regulatory elements inmore » both promoter regions. The mouse Mif gene maps to the middle region of chromosome 10, between Bcr and S100b, which have been mapped to human chromosomes 22q11 and 21q22.3, respectively. The entire sequence of two pseudogenes demonstrates the absence of introns, the presence of the 5{prime} untranslated region of the cDNA, a 3{prime} poly(A) tail, and the lack of sequence similarity with untranscribed regions of the gene. The five pseudogenes are highly homologous to the cDNA, but contain a variable number of mutations that would produce mutated or truncated MIF-like proteins. Phylogenetic analyses of MIF genes and pseudogenes indicate several independent genetic events that can account for multiple genomic integrations. Three of the Mif pseudogenes were also mapped by interspecific backcross to chromosomes 1, 9, and 17. These results suggest that Mif pseudogenes originated by retrotransposition. 46 refs., 5 figs., 1 tab.« less
Genomic Rearrangements in Arabidopsis Considered as Quantitative Traits.
Imprialou, Martha; Kahles, André; Steffen, Joshua G; Osborne, Edward J; Gan, Xiangchao; Lempe, Janne; Bhomra, Amarjit; Belfield, Eric; Visscher, Anne; Greenhalgh, Robert; Harberd, Nicholas P; Goram, Richard; Hein, Jotun; Robert-Seilaniantz, Alexandre; Jones, Jonathan; Stegle, Oliver; Kover, Paula; Tsiantis, Miltos; Nordborg, Magnus; Rätsch, Gunnar; Clark, Richard M; Mott, Richard
2017-04-01
To understand the population genetics of structural variants and their effects on phenotypes, we developed an approach to mapping structural variants that segregate in a population sequenced at low coverage. We avoid calling structural variants directly. Instead, the evidence for a potential structural variant at a locus is indicated by variation in the counts of short-reads that map anomalously to that locus. These structural variant traits are treated as quantitative traits and mapped genetically, analogously to a gene expression study. Association between a structural variant trait at one locus, and genotypes at a distant locus indicate the origin and target of a transposition. Using ultra-low-coverage (0.3×) population sequence data from 488 recombinant inbred Arabidopsis thaliana genomes, we identified 6502 segregating structural variants. Remarkably, 25% of these were transpositions. While many structural variants cannot be delineated precisely, we validated 83% of 44 predicted transposition breakpoints by polymerase chain reaction. We show that specific structural variants may be causative for quantitative trait loci for germination and resistance to infection by the fungus Albugo laibachii , isolate Nc14. Further we show that the phenotypic heritability attributable to read-mapping anomalies differs from, and, in the case of time to germination and bolting, exceeds that due to standard genetic variation. Genes within structural variants are also more likely to be silenced or dysregulated. This approach complements the prevalent strategy of structural variant discovery in fewer individuals sequenced at high coverage. It is generally applicable to large populations sequenced at low-coverage, and is particularly suited to mapping transpositions. Copyright © 2017 by the Genetics Society of America.
Cui, Junjie; Luo, Shaobo; Niu, Yu; Huang, Rukui; Wen, Qingfang; Su, Jianwen; Miao, Nansheng; He, Weiming; Dong, Zhensheng; Cheng, Jiaowen; Hu, Kailin
2018-01-01
Genetic mapping is a basic tool necessary for anchoring assembled scaffold sequences and for identifying QTLs controlling important traits. Though bitter gourd (Momordica charantia) is both consumed and used as a medicinal, research on its genomics and genetic mapping is severely limited. Here, we report the construction of a restriction site associated DNA (RAD)-based genetic map for bitter gourd using an F2 mapping population comprising 423 individuals derived from two cultivated inbred lines, the gynoecious line ‘K44’ and the monoecious line ‘Dali-11.’ This map comprised 1,009 SNP markers and spanned a total genetic distance of 2,203.95 cM across the 11 linkage groups. It anchored a total of 113 assembled scaffolds that covered about 251.32 Mb (85.48%) of the 294.01 Mb assembled genome. In addition, three horticulturally important traits including sex expression, fruit epidermal structure, and immature fruit color were evaluated using a combination of qualitative and quantitative data. As a result, we identified three QTL/gene loci responsible for these traits in three environments. The QTL/gene gy/fffn/ffn, controlling sex expression involved in gynoecy, first female flower node, and female flower number was detected in the reported region. Particularly, two QTLs/genes, Fwa/Wr and w, were found to be responsible for fruit epidermal structure and white immature fruit color, respectively. This RAD-based genetic map promotes the assembly of the bitter gourd genome and the identified genetic loci will accelerate the cloning of relevant genes in the future. PMID:29706980
Polytene Chromosomes - A Portrait of Functional Organization of the Drosophila Genome.
Zykova, Tatyana Yu; Levitsky, Victor G; Belyaeva, Elena S; Zhimulev, Igor F
2018-04-01
This mini-review is devoted to the problem genetic meaning of main polytene chromosome structures - bands and interbands. Generally, densely packed chromatin forms black bands, moderately condensed regions form grey loose bands, whereas decondensed regions of the genome appear as interbands. Recent progress in the annotation of the Drosophila genome and epigenome has made it possible to compare the banding pattern and the structural organization of genes, as well as their activity. This was greatly aided by our ability to establish the borders of bands and interbands on the physical map, which allowed to perform comprehensive side-by-side comparisons of cytology, genetic and epigenetic maps and to uncover the association between the morphological structures and the functional domains of the genome. These studies largely conclude that interbands 5'-ends of housekeeping genes that are active across all cell types. Interbands are enriched with proteins involved in transcription and nucleosome remodeling, as well as with active histone modifications. Notably, most of the replication origins map to interband regions. As for grey loose bands adjacent to interbands, they typically host the bodies of house-keeping genes. Thus, the bipartite structure composed of an interband and an adjacent grey band functions as a standalone genetic unit. Finally, black bands harbor tissue-specific genes with narrow temporal and tissue expression profiles. Thus, the uniform and permanent activity of interbands combined with the inactivity of genes in bands forms the basis of the universal banding pattern observed in various Drosophila tissues.
Chromosomal localization and structure of the human type II IMP dehydrogenase gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Glesne, D.; Huberman, E.; Collart, F.
1994-05-01
We determined the chromosomal localization and structure of the gene encoding human type II inosine 5{prime}-monophosphate dehydrogenase (IMPDH, EC 1.1.1.205), an enzyme associated with cellular proliferation, malignant transformation, and differentiation. Using polymerase chain reaction (PCR) primers specific for type II IMPDH, we screened a panel of human-Chinese hamster cell somatic hybrids and a separate deletion panel of chromosome 3 hybrids and localized the gene to 3p21.1{yields}p24.2. Two overlapping yeast artificial chromosome clones containing the full gene for type II IMPDH were isolated and a physical map of 117 kb of human genomic DNA in this region of chromosome 3 wasmore » constructed. The gene for type II IMPDH was localized and oriented on this map and found to span no more than 12.5 kb.« less
2012-01-01
Background Structured association mapping is proving to be a powerful strategy to find genetic polymorphisms associated with disease. However, these algorithms are often distributed as command line implementations that require expertise and effort to customize and put into practice. Because of the difficulty required to use these cutting-edge techniques, geneticists often revert to simpler, less powerful methods. Results To make structured association mapping more accessible to geneticists, we have developed an automatic processing system called Auto-SAM. Auto-SAM enables geneticists to run structured association mapping algorithms automatically, using parallelization. Auto-SAM includes algorithms to discover gene-networks and find population structure. Auto-SAM can also run popular association mapping algorithms, in addition to five structured association mapping algorithms. Conclusions Auto-SAM is available through GenAMap, a front-end desktop visualization tool. GenAMap and Auto-SAM are implemented in JAVA; binaries for GenAMap can be downloaded from http://sailing.cs.cmu.edu/genamap. PMID:22471660
Analysis of multiplex gene expression maps obtained by voxelation.
An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios
2009-04-29
Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental results confirm the hypothesis that genes with similar gene expression maps might have similar gene functions. The voxelation data takes into account the location information of gene expression level in mouse brain, which is novel in related research. The proposed approach can potentially be used to predict gene functions and provide helpful suggestions to biologists.
Su, Zhipeng; Zhu, Jiawen; Xu, Zhuofei; Xiao, Ran; Zhou, Rui; Li, Lu; Chen, Huanchun
2016-01-01
Actinobacillus pleuropneumoniae is the pathogen of porcine contagious pleuropneumoniae, a highly contagious respiratory disease of swine. Although the genome of A. pleuropneumoniae was sequenced several years ago, limited information is available on the genome-wide transcriptional analysis to accurately annotate the gene structures and regulatory elements. High-throughput RNA sequencing (RNA-seq) has been applied to study the transcriptional landscape of bacteria, which can efficiently and accurately identify gene expression regions and unknown transcriptional units, especially small non-coding RNAs (sRNAs), UTRs and regulatory regions. The aim of this study is to comprehensively analyze the transcriptome of A. pleuropneumoniae by RNA-seq in order to improve the existing genome annotation and promote our understanding of A. pleuropneumoniae gene structures and RNA-based regulation. In this study, we utilized RNA-seq to construct a single nucleotide resolution transcriptome map of A. pleuropneumoniae. More than 3.8 million high-quality reads (average length ~90 bp) from a cDNA library were generated and aligned to the reference genome. We identified 32 open reading frames encoding novel proteins that were mis-annotated in the previous genome annotations. The start sites for 35 genes based on the current genome annotation were corrected. Furthermore, 51 sRNAs in the A. pleuropneumoniae genome were discovered, of which 40 sRNAs were never reported in previous studies. The transcriptome map also enabled visualization of 5'- and 3'-UTR regions, in which contained 11 sRNAs. In addition, 351 operons covering 1230 genes throughout the whole genome were identified. The RNA-Seq based transcriptome map validated annotated genes and corrected annotations of open reading frames in the genome, and led to the identification of many functional elements (e.g. regions encoding novel proteins, non-coding sRNAs and operon structures). The transcriptional units described in this study provide a foundation for future studies concerning the gene functions and the transcriptional regulatory architectures of this pathogen. PMID:27018591
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eipers, P.G.
1992-01-01
The gene for the human p58[sup clk[minus]1] protein kinase, a cell division control-related gene, has been mapped by somatic cell hybrid analyses, in situ localization with the chromosomal gene, and nested polymerase chain reaction amplification of microdissected chromosomes. These studies indicate that the expressed p58[sup clk[minus]1] chromosomal gene maps to 1p36, while a highly related p58[sup clk[minus]1] sequence of unknown nature maps to chromosome 15. Assignment of a p34[sup cdc2]-related gene to 1p36 region, including neuroblastoma, ductal carcinoma of the breast, malignant melanoma, Merkel cell carcinoma and endocrine neoplasia among others. Aberrant expression of this protein kinase negatively regulates normalmore » cellular growth. The p58[sup clk[minus]1] protein contains a central domain of 299 amino acids that is 46% identical to human p34[sup cdc2], the master mitotic protein kinase. This dissertation details the complete structure of the p58[sup clk[minus]1] chromosomal gene, including its putative promoter region, transcriptional start sites, exonic sequences, and intron/exon boundary sequences. The gene is 10 kb in size and contains 12 exons and 11 introns. Interestingly, the rather large 2.0 kb 3[prime] untranslated region is interrupted by an intron that separates a region containing numerous AUUUA destabilization motifs from the coding region. Furthermore, the expression of this gene in normal human tissues, as well as several human tumor cell samples and lines, is examined. The origin of multiple human transcripts from the same chromosomal gene, and the possible differential stability of these various transcripts, is discussed with regard to the transcriptional and post-transcriptional regulation of this gene. This is the first report of the chromosomal gene structure of a member of the p34[sup cdc2] supergene family.« less
Kujur, Alice; Bajaj, Deepak; Saxena, Maneesha S.; Tripathi, Shailesh; Upadhyaya, Hari D.; Gowda, C.L.L.; Singh, Sube; Jain, Mukesh; Tyagi, Akhilesh K.; Parida, Swarup K.
2013-01-01
We developed 1108 transcription factor gene-derived microsatellite (TFGMS) and 161 transcription factor functional domain-associated microsatellite (TFFDMS) markers from 707 TFs of chickpea. The robust amplification efficiency (96.5%) and high intra-specific polymorphic potential (34%) detected by markers suggest their immense utilities in efficient large-scale genotyping applications, including construction of both physical and functional transcript maps and understanding population structure. Candidate gene-based association analysis revealed strong genetic association of TFFDMS markers with three major seed and pod traits. Further, TFGMS markers in the 5′ untranslated regions of TF genes showing differential expression during seed development had higher trait association potential. The significance of TFFDMS markers was demonstrated by correlating their allelic variation with amino acid sequence expansion/contraction in the functional domain and alteration of secondary protein structure encoded by genes. The seed weight-associated markers were validated through traditional bi-parental genetic mapping. The determination of gene-specific linkage disequilibrium (LD) patterns in desi and kabuli based on single nucleotide polymorphism-microsatellite marker haplotypes revealed extended LD decay, enhanced LD resolution and trait association potential of genes. The evolutionary history of a strong seed-size/weight-associated TF based on natural variation and haplotype sharing among desi, kabuli and wild unravelled useful information having implication for seed-size trait evolution during chickpea domestication. PMID:23633531
RNA-Seq Based Transcriptional Map of Bovine Respiratory Disease Pathogen “Histophilus somni 2336”
Kumar, Ranjit; Lawrence, Mark L.; Watt, James; Cooksey, Amanda M.; Burgess, Shane C.; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify “novel” genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method. The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations. PMID:22276113
RNA-seq based transcriptional map of bovine respiratory disease pathogen "Histophilus somni 2336".
Kumar, Ranjit; Lawrence, Mark L; Watt, James; Cooksey, Amanda M; Burgess, Shane C; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify "novel" genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method.The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations.
JPRS Report, Science and Technology USSR: Life Sciences.
1990-07-16
4 1 VETERINARY MEDICINE Primary Structure of RNA Polymerase Gene of Foot-and-Mouth Disease Virus ( FMDV ...neering were used to obtain cDNA corresponding to the Primary Structure of RNA Polymerase Gene of RNA polymerase gene to FMDV A 2 2 , with a map of the...Foot-and-Mouth Disease Virus ( FMDV ) A22 primary nucleotide sequence of the cDNA provided. 18400538F Moscow BIOORGANICHESKA YA Analysis of the data
Cloning, structure, and chromosome localization of the mouse glutaryl-CoA dehydrogenase gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Koeller, D.M.; DiGiulio, A.; Frerman, F.E.
Glutaryl-CoA dehydrogenase (GCDH) is a nuclear-encoded, mitochondrial matrix enzyme. In humans, deficiency of GCDH leads to glutaric acidemia type I, and inherited disorder of amino acid metabolism characterized by a progressive neurodegenerative disease. In this report we describe the cloning and structure of the mouse GCDH (Gcdh) gene and cDNA and its chromosomal localization. The mouse Gcdh cDNA is 1.75 kb long and contains and open reading frame of 438 amino acids. The amino acid sequences of mouse, human, and pig GCDH are highly conserved. The mouse Gcdh gene contains 11 exons and spans 7 kb of genomic DNA. Gcdhmore » was mapped by backcross analysis to mouse chromosome 8 within a region that is homologous to a region of human chromosome 19, where the human gene was previously mapped. 14 refs., 3 figs.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Omel`yanchuk, L.V.
1995-12-01
A lethal insertion of an element P[lArB], which caused nondisjunction and structural abnormalities in chromosomes in the neuroblasts of homozygous larvae, was found. The insertion was mapped to region 57B1-12 of the polytene map of chromosome 2 of Drosophila. The expression of the corresponding gene was found in testes, ovaries, and neural ganglia. 8 refs., 6 figs.
The Organization of Repetitive DNA in the Genomes of Amazonian Lizard Species in the Family Teiidae.
Carvalho, Natalia D M; Pinheiro, Vanessa S S; Carmo, Edson J; Goll, Leonardo G; Schneider, Carlos H; Gross, Maria C
2015-01-01
Repetitive DNA is the largest fraction of the eukaryote genome and comprises tandem and dispersed sequences. It presents variations in relation to its composition, number of copies, distribution, dynamics, and genome organization, and participates in the evolutionary diversification of different vertebrate species. Repetitive sequences are usually located in the heterochromatin of centromeric and telomeric regions of chromosomes, contributing to chromosomal structures. Therefore, the aim of this study was to physically map repetitive DNA sequences (5S rDNA, telomeric sequences, tropomyosin gene 1, and retroelements Rex1 and SINE) of mitotic chromosomes of Amazonian species of teiids (Ameiva ameiva, Cnemidophorus sp. 1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin) to understand their genome organization and karyotype evolution. The mapping of repetitive sequences revealed a distinct pattern in Cnemidophorus sp. 1, whereas the other species showed all sequences interspersed in the heterochromatic region. Physical mapping of the tropomyosin 1 gene was performed for the first time in lizards and showed that in addition to being functional, this gene has a structural function similar to the mapped repetitive elements as it is located preferentially in centromeric regions and termini of chromosomes. © 2016 S. Karger AG, Basel.
Chopra, Rupali; Ali, Shafat; Srivastava, Amit K; Aggarwal, Shweta; Kumar, Bhupender; Manvati, Siddharth; Kalaiarasan, Ponnusamy; Jena, Mamta; Garg, Vijay K; Bhattacharya, Sambit N; Bamezai, Rameshwar N K
2013-01-01
Leprosy is a chronic infectious disease caused by Mycobacterium Leprae, where the host genetic background plays an important role toward the disease pathogenesis. Various studies have identified a number of human genes in association with leprosy or its clinical forms. However, non-replication of results has hinted at the heterogeneity among associations between different population groups, which could be due to differently evolved LD structures and differential frequencies of SNPs within the studied regions of the genome. A need for systematic and saturated mapping of the associated regions with the disease is warranted to unravel the observed heterogeneity in different populations. Mapping of the PARK2 and PACRG gene regulatory region with 96 SNPs, with a resolution of 1 SNP per 1 Kb for PARK2 gene regulatory region in a North Indian population, showed an involvement of 11 SNPs in determining the susceptibility towards leprosy. The association was replicated in a geographically distinct and unrelated population from Orissa in eastern India. In vitro reporter assays revealed that the two significantly associated SNPs, located 63.8 kb upstream of PARK2 gene and represented in a single BIN of 8 SNPs, influenced the gene expression. A comparison of BINs between Indian and Vietnamese populations revealed differences in the BIN structures, explaining the heterogeneity and also the reason for non-replication of the associated genomic region in different populations.
Chopra, Rupali; Aggarwal, Shweta; Kumar, Bhupender; Manvati, Siddharth; Kalaiarasan, Ponnusamy; Jena, Mamta; Garg, Vijay K.; Bhattacharya, Sambit N.; Bamezai, Rameshwar N. K.
2013-01-01
Leprosy is a chronic infectious disease caused by Mycobacterium Leprae, where the host genetic background plays an important role toward the disease pathogenesis. Various studies have identified a number of human genes in association with leprosy or its clinical forms. However, non-replication of results has hinted at the heterogeneity among associations between different population groups, which could be due to differently evolved LD structures and differential frequencies of SNPs within the studied regions of the genome. A need for systematic and saturated mapping of the associated regions with the disease is warranted to unravel the observed heterogeneity in different populations. Mapping of the PARK2 and PACRG gene regulatory region with 96 SNPs, with a resolution of 1 SNP per 1 Kb for PARK2 gene regulatory region in a North Indian population, showed an involvement of 11 SNPs in determining the susceptibility towards leprosy. The association was replicated in a geographically distinct and unrelated population from Orissa in eastern India. In vitro reporter assays revealed that the two significantly associated SNPs, located 63.8 kb upstream of PARK2 gene and represented in a single BIN of 8 SNPs, influenced the gene expression. A comparison of BINs between Indian and Vietnamese populations revealed differences in the BIN structures, explaining the heterogeneity and also the reason for non-replication of the associated genomic region in different populations. PMID:23861666
Li, Xiaonan; Ramchiary, Nirala; Dhandapani, Vignesh; Choi, Su Ryun; Hur, Yoonkang; Nou, Ill-Sup; Yoon, Moo Kyoung; Lim, Yong Pyo
2013-01-01
Brassica rapa is an important crop species that produces vegetables, oilseed, and fodder. Although many studies reported quantitative trait loci (QTL) mapping, the genes governing most of its economically important traits are still unknown. In this study, we report QTL mapping for morphological and yield component traits in B. rapa and comparative map alignment between B. rapa, B. napus, B. juncea, and Arabidopsis thaliana to identify candidate genes and conserved QTL blocks between them. A total of 95 QTL were identified in different crucifer blocks of the B. rapa genome. Through synteny analysis with A. thaliana, B. rapa candidate genes and intronic and exonic single nucleotide polymorphisms in the parental lines were detected from whole genome resequenced data, a few of which were validated by mapping them to the QTL regions. Semi-quantitative reverse transcriptase PCR analysis showed differences in the expression levels of a few genes in parental lines. Comparative mapping identified five key major evolutionarily conserved crucifer blocks (R, J, F, E, and W) harbouring QTL for morphological and yield components traits between the A, B, and C subgenomes of B. rapa, B. juncea, and B. napus. The information of the identified candidate genes could be used for breeding B. rapa and other related Brassica species. PMID:23223793
Natural Allelic Diversity, Genetic Structure and Linkage Disequilibrium Pattern in Wild Chickpea
Kujur, Alice; Das, Shouvik; Badoni, Saurabh; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2014-01-01
Characterization of natural allelic diversity and understanding the genetic structure and linkage disequilibrium (LD) pattern in wild germplasm accessions by large-scale genotyping of informative microsatellite and single nucleotide polymorphism (SNP) markers is requisite to facilitate chickpea genetic improvement. Large-scale validation and high-throughput genotyping of genome-wide physically mapped 478 genic and genomic microsatellite markers and 380 transcription factor gene-derived SNP markers using gel-based assay, fluorescent dye-labelled automated fragment analyser and matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass array have been performed. Outcome revealed their high genotyping success rate (97.5%) and existence of a high level of natural allelic diversity among 94 wild and cultivated Cicer accessions. High intra- and inter-specific polymorphic potential and wider molecular diversity (11–94%) along with a broader genetic base (13–78%) specifically in the functional genic regions of wild accessions was assayed by mapped markers. It suggested their utility in monitoring introgression and transferring target trait-specific genomic (gene) regions from wild to cultivated gene pool for the genetic enhancement. Distinct species/gene pool-wise differentiation, admixed domestication pattern, and differential genome-wide recombination and LD estimates/decay observed in a six structured population of wild and cultivated accessions using mapped markers further signifies their usefulness in chickpea genetics, genomics and breeding. PMID:25222488
Chan, Wen-Ling; Yang, Wen-Kuang; Huang, Hsien-Da; Chang, Jan-Gowth
2013-01-01
RNA interference (RNAi) is a gene silencing process within living cells, which is controlled by the RNA-induced silencing complex with a sequence-specific manner. In flies and mice, the pseudogene transcripts can be processed into short interfering RNAs (siRNAs) that regulate protein-coding genes through the RNAi pathway. Following these findings, we construct an innovative and comprehensive database to elucidate siRNA-mediated mechanism in human transcribed pseudogenes (TPGs). To investigate TPG producing siRNAs that regulate protein-coding genes, we mapped the TPGs to small RNAs (sRNAs) that were supported by publicly deep sequencing data from various sRNA libraries and constructed the TPG-derived siRNA-target interactions. In addition, we also presented that TPGs can act as a target for miRNAs that actually regulate the parental gene. To enable the systematic compilation and updating of these results and additional information, we have developed a database, pseudoMap, capturing various types of information, including sequence data, TPG and cognate annotation, deep sequencing data, RNA-folding structure, gene expression profiles, miRNA annotation and target prediction. As our knowledge, pseudoMap is the first database to demonstrate two mechanisms of human TPGs: encoding siRNAs and decoying miRNAs that target the parental gene. pseudoMap is freely accessible at http://pseudomap.mbc.nctu.edu.tw/. Database URL: http://pseudomap.mbc.nctu.edu.tw/
Wang, Chaolong; Zöllner, Sebastian; Rosenberg, Noah A.
2012-01-01
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure. PMID:22927824
Wang, Chaolong; Zöllner, Sebastian; Rosenberg, Noah A
2012-08-01
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure.
Mapping and annotating obesity-related genes in pig and human genomes.
Martelli, Pier Luigi; Fontanesi, Luca; Piovesan, Damiano; Fariselli, Piero; Casadio, Rita
2014-01-01
Background. Obesity is a major health problem in both developed and emerging countries. Obesity is a complex disease whose etiology involves genetic factors in strong interplay with environmental determinants and lifestyle. The discovery of genetic factors and biological pathways underlying human obesity is hampered by the difficulty in controlling the genetic background of human cohorts. Animal models are then necessary to further dissect the genetics of obesity. Pig has emerged as one of the most attractive models, because of the similarity with humans in the mechanisms regulating the fat deposition. Results. We collected the genes related to obesity in humans and to fat deposition traits in pig. We localized them on both human and pig genomes, building a map useful to interpret comparative studies on obesity. We characterized the collected genes structurally and functionally with BAR+ and mapped them on KEGG pathways and on STRING protein interaction network. Conclusions. The collected set consists of 361 obesity related genes in human and pig genomes. All genes were mapped on the human genome, and 54 could not be localized on the pig genome (release 2012). Only for 3 human genes there is no counterpart in pig, confirming that this animal is a good model for human obesity studies. Obesity related genes are mostly involved in regulation and signaling processes/pathways and relevant connection emerges between obesity-related genes and diseases such as cancer and infectious diseases.
LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures.
Ryan, Michael; Diekhans, Mark; Lien, Stephanie; Liu, Yun; Karchin, Rachel
2009-06-01
LS-SNP/PDB is a new WWW resource for genome-wide annotation of human non-synonymous (amino acid changing) SNPs. It serves high-quality protein graphics rendered with UCSF Chimera molecular visualization software. The system is kept up-to-date by an automated, high-throughput build pipeline that systematically maps human nsSNPs onto Protein Data Bank structures and annotates several biologically relevant features. LS-SNP/PDB is available at (http://ls-snp.icm.jhu.edu/ls-snp-pdb) and via links from protein data bank (PDB) biology and chemistry tabs, UCSC Genome Browser Gene Details and SNP Details pages and PharmGKB Gene Variants Downloads/Cross-References pages.
Nelson, Matthew N.; Moolhuijzen, Paula M.; Boersma, Jeffrey G.; Chudy, Magdalena; Lesniewska, Karolina; Bellgard, Matthew; Oliver, Richard P.; Święcicki, Wojciech; Wolko, Bogdan; Cowling, Wallace A.; Ellwood, Simon R.
2010-01-01
We have developed a dense reference genetic map of Lupinus angustifolius (2n = 40) based on a set of 106 publicly available recombinant inbred lines derived from a cross between domesticated and wild parental lines. The map comprised 1090 loci in 20 linkage groups and three small clusters, drawing together data from several previous mapping publications plus almost 200 new markers, of which 63 were gene-based markers. A total of 171 mainly gene-based, sequence-tagged site loci served as bridging points for comparing the Lu. angustifolius genome with the genome sequence of the model legume, Lotus japonicus via BLASTn homology searching. Comparative analysis indicated that the genomes of Lu. angustifolius and Lo. japonicus are highly diverged structurally but with significant regions of conserved synteny including the region of the Lu. angustifolius genome containing the pod-shatter resistance gene, lentus. We discuss the potential of synteny analysis for identifying candidate genes for domestication traits in Lu. angustifolius and in improving our understanding of Fabaceae genome evolution. PMID:20133394
Ashburner, M.; Tsubota, S.; Woodruff, R. C.
1982-01-01
Exchange mapping locates the dominant mutation Scutoid to the right of Adh on chromosome arm 2L of D. melanogaster. However, deletion mapping indicates that Sco is to the left of Adh. The phenotype of Sco is sensitive to mutation, or deletion, of noc+ and of three genes, el, l(2)br22, and l(2)br29 mapping immediately distal to noc. The four contiguous loci, el, l(2)br22, l(2)br29 and noc, although separable by deletion end points, interact, because certain (or all) alleles of these four loci show partial failure of complementation, or even negative complementation. The simplest hypothesis is that Sco is a small reciprocal transposition, the genes noc, osp, and Adh exchanging places with three genes normally mapping proximal to them: l(2)br34, l(2)br35 and rd. The Sco phenotype is thought to result from a position effect at the newly created noc/l(2)br28 junction. PMID:6816673
Function does not follow form in gene regulatory circuits.
Payne, Joshua L; Wagner, Andreas
2015-08-20
Gene regulatory circuits are to the cell what arithmetic logic units are to the chip: fundamental components of information processing that map an input onto an output. Gene regulatory circuits come in many different forms, distinct structural configurations that determine who regulates whom. Studies that have focused on the gene expression patterns (functions) of circuits with a given structure (form) have examined just a few structures or gene expression patterns. Here, we use a computational model to exhaustively characterize the gene expression patterns of nearly 17 million three-gene circuits in order to systematically explore the relationship between circuit form and function. Three main conclusions emerge. First, function does not follow form. A circuit of any one structure can have between twelve and nearly thirty thousand distinct gene expression patterns. Second, and conversely, form does not follow function. Most gene expression patterns can be realized by more than one circuit structure. And third, multifunctionality severely constrains circuit form. The number of circuit structures able to drive multiple gene expression patterns decreases rapidly with the number of these patterns. These results indicate that it is generally not possible to infer circuit function from circuit form, or vice versa.
He, Wenyin; Sun, Xiaofang; Liu, Lian; Li, Man; Jin, Hua; Wang, Wei-Hua
2014-01-01
Chromosomal anomalies in human embryos produced by in vitro fertilization are very common, which include numerical (aneuploidy) and structural (deletion, duplication or others) anomalies. Our previous study indicated that chromosomal deletion(s) is the most common structural anomaly accounting for approximately 8% of euploid blastocysts. It is still unknown if these deletions in human euploid blastocysts have clinical significance. In this study, we analyzed 15 previously diagnosed euploid blastocysts that had chromosomal deletion(s) using Agilent oligonucleotide DNA microarray platform and localized the gene location in each deletion. Then, we used OMIM gene map and phenotype database to investigate if these deletions are related with some important genes that cause genetic diseases, especially developmental delay or intellectual disability. As results, we found that the detectable chromosomal deletion size with Agilent microarray is above 2.38 Mb, while the deletions observed in human blastocysts are between 11.6 to 103 Mb. With OMIM gene map and phenotype database information, we found that deletions can result in loss of 81-464 genes. Out of these genes, 34-149 genes are related with known genetic problems. Furthermore, we found that 5 out of 15 samples lost genes in the deleted region, which were related to developmental delay and/or intellectual disability. In conclusion, our data indicates that all human euploid blastocysts with chromosomal deletion(s) are abnormal and transfer of these embryos may cause birth defects and/or developmental and intellectual disabilities. Therefore, the embryos with chromosomal deletion revealed by DNA microarray should not be transferred to the patients, or further gene map and/or phenotype seeking is necessary before making a final decision.
Fine structure of OXI1, the mitochondrial gene coding for subunit II of yeast cytochrome c oxidase.
Weiss-Brummer, B; Guba, R; Haid, A; Schweyen, R J
1979-12-01
Genetic and biochemical studies have been performed with 110 mutants which are defective in cytochrome a·a3 and map in the regions on mit DNA previously designated OXI1 and OXI2. With 88 mutations allocated to OXI1 fine structure mapping was achieved by the analysis of rho (-) deletions. The order of six groups of mutational sites (A 1, A2, B 1, B2, C 1, C2) thus determined was confirmed by oxi i x oxi j recombination analysis.Analysis of mitochondrially translated polypeptides of oxil mutants by SDS-polyacrylamide electrophoresis reveals three classes of mutant patterns: i) similar to wild-tpye (19 mutants); ii) lacking SU II of cytochrome c oxidase (53 mutants); iii) lacking this subunit and exhibiting a single new polypeptide of lower Mr (16 mutants). Mutations of each of these classes are scattered over the OXI1 region without any detectable clustering; this is consistent with the assumption that all oxil mutations studied are within the same gene.New polypeptides observed in oxil mutants of class iii) vary in Mr in the range from 10,500 to 33,000. Those of Mr 17,000 to 33,000 are shown to be antigenically related to subunit II of cytochrome c oxidase. Colinearity is established between the series of new polypeptides of Mr values increasing from 10,500 to 31,500 and the order of the respective mutational sites on the map, e.g. mutations mapping in A 1 generate the smallest and mutations mapping in C2 the largest mutant fragments.From these data we conclude that i) all mutations allocated to the OXI1 region are in the same gene; ii) this gene codes for subunit II of cytochrome c oxidase; iii) the direction of translation is from CAP to 0X12. Out of 19 mutants allocated to OXI2 three exhibit a new polypeptide; these and all the other oxi2 mutants lack subunit III of cytochrome oxidase. This result provides preliminary evidence that the OXI2 region harbours the structural gene for this subunit III.
Huebner, K; Druck, T; Croce, C M; Thiesen, H J
1991-01-01
cDNA clones encoding zinc finger structures were isolated by screening Molt4 and Jurkat cDNA libraries with zinc finger consensus sequences. Candidate clones were partially sequenced to verify the presence of zinc finger-encoding regions; nonoverlapping cDNA clones were chosen on the basis of sequences and genomic hybridization pattern. Zinc finger structure-encoding clones, which were designated by the term "Kox" and a number from 1 to 32 and which were apparently unique (i.e., distinct from each other and distinct from those isolated by other laboratories), were chosen for mapping in the human genome. DNAs from rodent-human somatic cell hybrids retaining defined complements of human chromosomes were analyzed for the presence of each of the Kox genes. Correlation between the presence of specific human chromosome regions and specific Kox genes established the chromosomal locations. Multiple Kox loci were mapped to 7q (Kox 18 and 25 and a locus detected by both Kox 8 cDNA and Kox 27 cDNA), 8q24 5' to the myc locus (Kox 9 and 32), 10cen----q24 (Kox 2, 15, 19, 21, 30, and 31), 12q13-qter (Kox 1 and 20), 17p13 (Kox 11 and 26), and 19q (Kox 5, 6, 10, 22, 24, and 28). Single Kox loci were mapped to 7p22 (Kox 3), 18q12 (Kox 17), 19p (Kox 13), 22q11 between IG lambda and BCR-1 (locus detected by both Kox 8 cDNA and Kox 27 cDNA), and Xp (Kox 14). Several of the Kox loci map to regions in which other zinc finger structure-encoding loci have already been localized, indicating possible zinc finger gene clusters. In addition, Kox genes at 8q24, 17p13, and 22q11--and perhaps other Kox genes--are located near recurrent chromosomal translocation breakpoints. Others, such as those on 7p and 7q, may be near regions specifically active in T cells. Images Figure 4 Figure 5 Figure 2 Figure 3 PMID:2014798
Genome-Wide Structural Variation Detection by Genome Mapping on Nanochannel Arrays.
Mak, Angel C Y; Lai, Yvonne Y Y; Lam, Ernest T; Kwok, Tsz-Piu; Leung, Alden K Y; Poon, Annie; Mostovoy, Yulia; Hastie, Alex R; Stedman, William; Anantharaman, Thomas; Andrews, Warren; Zhou, Xiang; Pang, Andy W C; Dai, Heng; Chu, Catherine; Lin, Chin; Wu, Jacob J K; Li, Catherine M L; Li, Jing-Woei; Yim, Aldrin K Y; Chan, Saki; Sibert, Justin; Džakula, Željko; Cao, Han; Yiu, Siu-Ming; Chan, Ting-Fung; Yip, Kevin Y; Xiao, Ming; Kwok, Pui-Yan
2016-01-01
Comprehensive whole-genome structural variation detection is challenging with current approaches. With diploid cells as DNA source and the presence of numerous repetitive elements, short-read DNA sequencing cannot be used to detect structural variation efficiently. In this report, we show that genome mapping with long, fluorescently labeled DNA molecules imaged on nanochannel arrays can be used for whole-genome structural variation detection without sequencing. While whole-genome haplotyping is not achieved, local phasing (across >150-kb regions) is routine, as molecules from the parental chromosomes are examined separately. In one experiment, we generated genome maps from a trio from the 1000 Genomes Project, compared the maps against that derived from the reference human genome, and identified structural variations that are >5 kb in size. We find that these individuals have many more structural variants than those published, including some with the potential of disrupting gene function or regulation. Copyright © 2016 by the Genetics Society of America.
Bastien, C.; Machlin, S.; Zhang, Y.; Donaldson, K.; Hanson, R. S.
1989-01-01
Restriction maps of genes required for the synthesis of active methanol dehydrogenase in Methylobacterium organophilum XX and Methylobacterium sp. strain AM1 have been completed and compared. In these two species of pink-pigmented, type II methylotrophs, 15 genes were identified that were required for the expression of methanol dehydrogenase activity. None of these genes were required for the synthesis of the prosthetic group of methanol dehydrogenase, pyrroloquinoline quinone. The structural gene required for the synthesis of cytochrome cL, an electron acceptor uniquely required for methanol dehydrogenase, and the genes encoding small basic peptides that copurified with methanol dehydrogenases were closely linked to the methanol dehydrogenase structural genes. A cloned 22-kilobase DNA insert from Methylsporovibrio methanica 81Z, an obligate type II methanotroph, complemented mutants that contained lesions in four genes closely linked to the methanol dehydrogenase structural genes. The methanol dehydrogenase and cytochrome cL structural genes were found to be transcribed independently in M. organophilum XX. Only two of the genes required for methanol dehydrogenase synthesis in this bacterium were found to be cotranscribed. PMID:16348074
Genome sequence, comparative analysis and haplotype structure of the domestic dog.
Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S
2005-12-08
Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.
Shpakovskiĭ, G V; Lebedenko, E N
1998-01-01
Plasmid pYUK3 bearing the fet5+ gene of Schizosaccharomyces pombe was isolated from a genomic library of the fission yeast, and a detailed physical map of the whole genomic insert (ca. 9.6 Kbp) was constructed. The primary structure of the fet5+ gene and its flanking regions is established. The gene contains a single 45-bp intron in its distal part. A typical TATA-box (TATAAG) was found in the 5'-noncoding region ca. 50 bp upstream of the putative start of transcription, and the 3'-noncoding region contains AT-rich palindromes, which are probably involved in termination of the fet5+ transcription. A previously unidentified gene of Sz. pombe encoding a protein with some similarity to one of the transcriptional activators from the TBP (TATA-binding protein) group of SPT factors of transcription was found in the vicinity of the fet5+ gene. Taking into account that cDNA of the fet5(+)-gene was isolated as a suppressor of the genetic-defect of nuclear RNA polymerases I-III (Bioorg. Khim., 1997, vol. 23, No 3, pp. 234-237), this vicinity may be the first evidence of possible clustering, in the genome of the fission yeast, of genes participating in transcription regulation.
Polster, Robert; Petropoulos, Christos J; Bonhoeffer, Sebastian; Guillaume, Frédéric
2016-12-01
The genotype-phenotype (GP) map is a central concept in evolutionary biology as it describes the mapping of molecular genetic variation onto phenotypic trait variation. Our understanding of that mapping remains partial, especially when trying to link functional clustering of pleiotropic gene effects with patterns of phenotypic trait co-variation. Only on rare occasions have studies been able to fully explore that link and tend to show poor correspondence between modular structures within the GP map and among phenotypes. By dissecting the structure of the GP map of the replicative capacity of HIV-1 in 15 drug environments, we provide a detailed view of that mapping from mutational pleiotropic variation to phenotypic co-variation, including epistatic effects of a set of amino-acid substitutions in the reverse transcriptase and protease genes. We show that epistasis increases the pleiotropic degree of single mutations and provides modularity to the GP map of drug resistance in HIV-1. Moreover, modules of epistatic pleiotropic effects within the GP map match the phenotypic modules of correlated replicative capacity among drug classes. Epistasis thus increases the evolvability of cross-resistance in HIV by providing more drug- and class-specific pleiotropic profiles to the main effects of the mutations. We discuss the implications for the evolution of cross-resistance in HIV. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
A high-density genetic map of Arachis duranensis, a diploid ancestor of cultivated peanut
2012-01-01
Background Cultivated peanut (Arachis hypogaea) is an allotetraploid species whose ancestral genomes are most likely derived from the A-genome species, A. duranensis, and the B-genome species, A. ipaensis. The very recent (several millennia) evolutionary origin of A. hypogaea has imposed a bottleneck for allelic and phenotypic diversity within the cultigen. However, wild diploid relatives are a rich source of alleles that could be used for crop improvement and their simpler genomes can be more easily analyzed while providing insight into the structure of the allotetraploid peanut genome. The objective of this research was to establish a high-density genetic map of the diploid species A. duranensis based on de novo generated EST databases. Arachis duranensis was chosen for mapping because it is the A-genome progenitor of cultivated peanut and also in order to circumvent the confounding effects of gene duplication associated with allopolyploidy in A. hypogaea. Results More than one million expressed sequence tag (EST) sequences generated from normalized cDNA libraries of A. duranensis were assembled into 81,116 unique transcripts. Mining this dataset, 1236 EST-SNP markers were developed between two A. duranensis accessions, PI 475887 and Grif 15036. An additional 300 SNP markers also were developed from genomic sequences representing conserved legume orthologs. Of the 1536 SNP markers, 1054 were placed on a genetic map. In addition, 598 EST-SSR markers identified in A. hypogaea assemblies were included in the map along with 37 disease resistance gene candidate (RGC) and 35 other previously published markers. In total, 1724 markers spanning 1081.3 cM over 10 linkage groups were mapped. Gene sequences that provided mapped markers were annotated using similarity searches in three different databases, and gene ontology descriptions were determined using the Medicago Gene Atlas and TAIR databases. Synteny analysis between A. duranensis, Medicago and Glycine revealed significant stretches of conserved gene clusters spread across the peanut genome. A higher level of colinearity was detected between A. duranensis and Glycine than with Medicago. Conclusions The first high-density, gene-based linkage map for A. duranensis was generated that can serve as a reference map for both wild and cultivated Arachis species. The markers developed here are valuable resources for the peanut, and more broadly, to the legume research community. The A-genome map will have utility for fine mapping in other peanut species and has already had application for mapping a nematode resistance gene that was introgressed into A. hypogaea from A. cardenasii. PMID:22967170
[Multiplexing mapping of human cDNAs]. Final report, September 1, 1991--February 28, 1994
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
Using PCR with automated product analysis, 329 human brain cDNA sequences have been assigned to individual human chromosomes. Primers were designed from single-pass cDNA sequences expressed sequence tags (ESTs). Primers were used in PCR reactions with DNA from somatic cell hybrid mapping panels as templates, often with multiplexing. Many ESTs mapped match sequence database records. To evaluate of these matches, the position of the primers relative to the matching region (In), the BLAST scores and the Poisson probability values of the EST/sequence record match were determined. In cases where the gene product was stringently identified by the sequence match hadmore » already been mapped, the gene locus determined by EST was consistent with the previous position which strongly supports the validity of assigning unknown genes to human chromosomes based on the EST sequence matches. In the present cases mapping the ESTs to a chromosome can also be considered to have mapped the known gene product: rolipram-sensitive cAMP phosphodiesterase, chromosome 1; protein phosphatase 2A{beta}, chromosome 4; alpha-catenin, chromosome 5; the ELE1 oncogene, chromosome 10q11.2 or q2.1-q23; MXII protein, chromosome l0q24-qter; ribosomal protein L18a homologue, chromosome 14; ribosomal protein L3, chromosome 17; and moesin, Xp11-cen. There were also ESTs mapped that were closely related to non-human sequence records. These matches therefore can be considered to identify human counterparts of known gene products, or members of known gene families. Examples of these include membrane proteins, translation-associated proteins, structural proteins, and enzymes. These data then demonstrate that single pass sequence information is sufficient to design PCR primers useful for assigning cDNA sequences to human chromosomes. When the EST sequence matches previous sequence database records, the chromosome assignments of the EST can be used to make preliminary assignments of the human gene to a chromosome.« less
Skogsberg, J; Kannisto, K; Roshani, L; Gagne, E; Hamsten, A; Larsson, C; Ehrenborg, E
2000-07-01
Peroxisome proliferator activated receptors (PPARs) are nuclear receptors regulating the expression of genes involved in lipid and glucose metabolism. Three different PPARs; alpha (PPARA), gamma (PPARG) and delta (PPARD) have been characterized and they are distinguished from each other by tissue distribution and cell activation. In this study, the structure and detailed chromosomal localization of the human PPARD gene was determined. Three genomic clones containing the PPARD gene was isolated from a human P1 library. The gene spans approximately 85 kb of DNA and consists of 9 exons and 8 introns with exons ranging in size from 84 bp to 2.3 kb and introns ranging from 180 bp to 50 kb. All splice acceptor and donor sites conform to the consensus sequences including the AG-GT motif. Although PPARD lacks a TATA box, the gene is transcribed from a unique start site located 380 bp upstream of the ATG initiation codon. The 5' and 3' ends were mapped by rapid amplification of cDNA ends and the mRNA size of PPARD based upon the structure of the gene is 3803 bp. In addition, the chromosomal sublocalization of PPARD was determined by radiation hybrid mapping. The PPARD gene is located at 14 cR from the colipase gene and 15 cR from the serine kinase gene at chromosomal region 6p21.2.
Characterization and fine mapping of qkc7.03: a major locus for kernel cracking in maize.
Yang, Mingtao; Chen, Lin; Wu, Xun; Gao, Xing; Li, Chunhui; Song, Yanchun; Zhang, Dengfeng; Shi, Yunsu; Li, Yu; Li, Yong-Xiang; Wang, Tianyu
2018-02-01
A major locus conferring kernel cracking in maize was characterized and fine mapped to an interval of 416.27 kb. Meanwhile, combining the results of transcriptomic analysis, the candidate gene was inferred. Seed development requires a proper structural and physiological balance between the maternal tissues and the internal structures of the seeds. In maize, kernel cracking is a disorder in this balance that seriously limits quality and yield and is characterized by a cracked pericarp at the kernel top and endosperm everting. This study elucidated the genetic basis and characterization of kernel cracking. Primarily, a near isogenic line (NIL) with a B73 background exhibited steady kernel cracking across environments. Therefore, deprived mapping populations were developed from this NIL and its recurrent parent B73. A major locus on chromosome 7, qkc7.03, was identified to be associated with the cracking performance. According to a progeny test of recombination events, qkc7.03 was fine mapped to a physical interval of 416.27 kb. In addition, obvious differences were observed in embryo development and starch granule arrangement within the endosperm between the NIL and its recurrent parent upon the occurrence of kernel cracking. Moreover, compared to its recurrent parent, the transcriptome of the NIL showed a significantly down-regulated expression of genes related to zeins, carbohydrate synthesis and MADS-domain transcription factors. The transcriptomic analysis revealed ten annotated genes within the target region of qkc7.03, and only GRMZM5G899476 was differently expressed between the NIL and its recurrent parent, indicating that this gene might be a candidate gene for kernel cracking. The results of this study facilitate the understanding of the potential mechanism underlying kernel cracking in maize.
Song, Mengfei; Wei, Qingzhen; Wang, Jing; Fu, Wenyuan; Qin, Xiaodong; Lu, Xiumei; Cheng, Feng; Yang, Kang; Zhang, Lu; Yu, Xiaqing; Li, Ji; Chen, Jinfeng; Lou, Qunfeng
2018-01-01
Leaf color mutants in higher plants are ideal materials for investigating the structure and function of photosynthetic system. In this study, we identified a cucumber vyl (virescent-yellow leaf) mutant in the mutant library, which exhibited reduced pigment contents and delayed chloroplast development process. F2 and BC1 populations were constructed from the cross between vyl mutant and cucumber inbred line ‘Hazerd’ to identify that the vyl trait is controlled by a simply recessive gene designated as CsVYL. The CsVYL gene was mapped to a 3.8 cM interval on chromosome 4 using these 80 F2 individuals and BSA (bulked segregation analysis) approach. Fine genetic map was conducted with 1542 F2 plants and narrowed down the vyl locus to an 86.3 kb genomic region, which contains a total of 11 genes. Sequence alignment between the wild type (WT) and vyl only identified one single nucleotide mutation (C→T) in the first exon of gene Csa4G637110, which encodes a DnaJ-like zinc finger protein. Gene Expression analysis confirmed the differences in transcription level of Csa4G637110 between wild type and mutant plants. Map-based cloning of the CsVYL gene could accelerate the study of chloroplast development and chlorophyll synthesis of cucumber. PMID:29681911
Mapping copy number variation by population-scale genome sequencing.
Mills, Ryan E; Walter, Klaudia; Stewart, Chip; Handsaker, Robert E; Chen, Ken; Alkan, Can; Abyzov, Alexej; Yoon, Seungtai Chris; Ye, Kai; Cheetham, R Keira; Chinwalla, Asif; Conrad, Donald F; Fu, Yutao; Grubert, Fabian; Hajirasouliha, Iman; Hormozdiari, Fereydoun; Iakoucheva, Lilia M; Iqbal, Zamin; Kang, Shuli; Kidd, Jeffrey M; Konkel, Miriam K; Korn, Joshua; Khurana, Ekta; Kural, Deniz; Lam, Hugo Y K; Leng, Jing; Li, Ruiqiang; Li, Yingrui; Lin, Chang-Yun; Luo, Ruibang; Mu, Xinmeng Jasmine; Nemesh, James; Peckham, Heather E; Rausch, Tobias; Scally, Aylwyn; Shi, Xinghua; Stromberg, Michael P; Stütz, Adrian M; Urban, Alexander Eckehart; Walker, Jerilyn A; Wu, Jiantao; Zhang, Yujun; Zhang, Zhengdong D; Batzer, Mark A; Ding, Li; Marth, Gabor T; McVean, Gil; Sebat, Jonathan; Snyder, Michael; Wang, Jun; Ye, Kenny; Eichler, Evan E; Gerstein, Mark B; Hurles, Matthew E; Lee, Charles; McCarroll, Steven A; Korbel, Jan O
2011-02-03
Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.
Philipp, W J; Poulet, S; Eiglmeier, K; Pascopella, L; Balasubramanian, V; Heym, B; Bergh, S; Bloom, B R; Jacobs, W R; Cole, S T
1996-01-01
An integrated map of the genome of the tubercle bacillus, Mycobacterium tuberculosis, was constructed by using a twin-pronged approach. Pulsed-field gel electrophoretic analysis enabled cleavage sites for Asn I and Dra I to be positioned on the 4.4-Mb circular chromosome, while, in parallel, clones from two cosmid libraries were ordered into contigs by means of fingerprinting and hybridization mapping. The resultant contig map was readily correlated with the physical map of the genome via the landmarked restriction sites. Over 165 genes and markers were localized on the integrated map, thus enabling comparisons with the leprosy bacillus, Mycobacterium leprae, to be undertaken. Mycobacterial genomes appear to have evolved as mosaic structures since extended segments with conserved gene order and organization are interspersed with different flanking regions. Repetitive sequences and insertion elements are highly abundant in M. tuberculosis, but the distribution of IS6110 is apparently nonrandom. Images Fig. 1 Fig. 2 PMID:8610181
Chen, Xiaobo; Wang, Ji; Zhu, Ming; Jia, Haihong; Liu, Dongdong; Hao, Lili; Guo, Xingqi
2015-11-01
Mitogen-activated protein kinase (MAPK) cascades mediate various responses in plants. As the top component, MAP3Ks deserve more attention; however, little is known about the role of MAP3Ks, especially in cotton, a worldwide economic crop. In this study, a gene encoding a putative Raf-like MAP3K, GhMAP3K40, was isolated. GhMAP3K40 expression was induced by stress and multiple signal molecules. The plants overexpressing GhMAP3K40 had an enhanced tolerance to drought and salt stress at the germination stage. However, at the seedling stage, the transgenic plants suffered more severe damage after drought, exposure to pathogens and oxidative stress. The defence-related genes and the antioxidant system were activated in transgenic palnts, suggesting that GhMAP3K40 positively regulate the defence response. The transgenic plants were less able to prevent pathogenic invasion, which was due to defects in the cell structure of the leaves. The root system of the control plants were stronger compared with the transgenic plants. These results indicated a negative role of GhMAP3K40 in growth and development and GhMAP3K40 possibly caused the defects by down-regulating the lignin biosynthesis. Overall, these results suggest that GhMAP3K40 may positively regulate defence response but cause reduced tolerance to biotic and abiotic stress by negatively regulating growth and development. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.
1997-01-01
A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064
Badoni, Saurabh; Das, Sweta; Sayal, Yogesh K.; Gopalakrishnan, S.; Singh, Ashok K.; Rao, Atmakuri R.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.
2016-01-01
We developed genome-wide 84634 ISM (intron-spanning marker) and 16510 InDel-fragment length polymorphism-based ILP (intron-length polymorphism) markers from genes physically mapped on 12 rice chromosomes. These genic markers revealed much higher amplification-efficiency (80%) and polymorphic-potential (66%) among rice accessions even by a cost-effective agarose gel-based assay. A wider level of functional molecular diversity (17–79%) and well-defined precise admixed genetic structure was assayed by 3052 genome-wide markers in a structured population of indica, japonica, aromatic and wild rice. Six major grain weight QTLs (11.9–21.6% phenotypic variation explained) were mapped on five rice chromosomes of a high-density (inter-marker distance: 0.98 cM) genetic linkage map (IR 64 x Sonasal) anchored with 2785 known/candidate gene-derived ISM and ILP markers. The designing of multiple ISM and ILP markers (2 to 4 markers/gene) in an individual gene will broaden the user-preference to select suitable primer combination for efficient assaying of functional allelic variation/diversity and realistic estimation of differential gene expression profiles among rice accessions. The genomic information generated in our study is made publicly accessible through a user-friendly web-resource, “Oryza ISM-ILP marker” database. The known/candidate gene-derived ISM and ILP markers can be enormously deployed to identify functionally relevant trait-associated molecular tags by optimal-resource expenses, leading towards genomics-assisted crop improvement in rice. PMID:27032371
Clément, D; Lanaud, C; Sabau, X; Fouet, O; Le Cunff, L; Ruiz, E; Risterucci, A M; Glaszmann, J C; Piffanelli, P
2004-05-01
We have constructed and validated the first cocoa ( Theobroma cacao L.) BAC library, with the aim of developing molecular resources to study the structure and evolution of the genome of this perennial crop. This library contains 36,864 clones with an average insert size of 120 kb, representing approximately ten haploid genome equivalents. It was constructed from the genotype Scavina-6 (Sca-6), a Forastero clone highly resistant to cocoa pathogens and a parent of existing mapping populations. Validation of the BAC library was carried out with a set of 13 genetically-anchored single copy and one duplicated markers. An average of nine BAC clones per probe was identified, giving an initial experimental estimation of the genome coverage represented in the library. Screening of the library with a set of resistance gene analogues (RGAs), previously mapped in cocoa and co-localizing with QTL for resistance to Phytophthora traits, confirmed at the physical level the tight clustering of RGAs in the cocoa genome and provided the first insights into the relationships between genetic and physical distances in the cocoa genome. This library represents an available BAC resource for structural genomic studies or map-based cloning of genes corresponding to important QTLs for agronomic traits such as resistance genes to major cocoa pathogens like Phytophthora spp ( palmivora and megakarya), Crinipellis perniciosa and Moniliophthora roreri.
Carpenter, Margaret A; Shaw, Martin; Cooper, Rebecca D; Frew, Tonya J; Butler, Ruth C; Murray, Sarah R; Moya, Leire; Coyne, Clarice J; Timmerman-Vaughan, Gail M
2017-08-01
Although starch consists of large macromolecules composed of glucose units linked by α-1,4-glycosidic linkages with α-1,6-glycosidic branchpoints, variation in starch structural and functional properties is found both within and between species. Interest in starch genetics is based on the importance of starch in food and industrial processes, with the potential of genetics to provide novel starches. The starch metabolic pathway is complex but has been characterized in diverse plant species, including pea. To understand how allelic variation in the pea starch metabolic pathway affects starch structure and percent amylose, partial sequences of 25 candidate genes were characterized for polymorphisms using a panel of 92 diverse pea lines. Variation in the percent amylose composition of extracted seed starch and (amylopectin) chain length distribution, one measure of starch structure, were characterized for these lines. Association mapping was undertaken to identify polymorphisms associated with the variation in starch chain length distribution and percent amylose, using a mixed linear model that incorporated population structure and kinship. Associations were found for polymorphisms in seven candidate genes plus Mendel's r locus (which conditions the round versus wrinkled seed phenotype). The genes with associated polymorphisms are involved in the substrate supply, chain elongation and branching stages of the pea carbohydrate and starch metabolic pathways. The association of polymorphisms in carbohydrate and starch metabolic genes with variation in amylopectin chain length distribution and percent amylose may help to guide manipulation of pea seed starch structural and functional properties through plant breeding.
Bajaj, Deepak; Das, Shouvik; Upadhyaya, Hari D.; Ranjan, Rajeev; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
The study identified 9045 high-quality SNPs employing both genome-wide GBS- and candidate gene-based SNP genotyping assays in 172, including 93 cultivated (desi and kabuli) and 79 wild chickpea accessions. The GWAS in a structured population of 93 sequenced accessions detected 15 major genomic loci exhibiting significant association with seed coat color. Five seed color-associated major genomic loci underlying robust QTLs mapped on a high-density intra-specific genetic linkage map were validated by QTL mapping. The integration of association and QTL mapping with gene haplotype-specific LD mapping and transcript profiling identified novel allelic variants (non-synonymous SNPs) and haplotypes in a MATE secondary transporter gene regulating light/yellow brown and beige seed coat color differentiation in chickpea. The down-regulation and decreased transcript expression of beige seed coat color-associated MATE gene haplotype was correlated with reduced proanthocyanidins accumulation in the mature seed coats of beige than light/yellow brown seed colored desi and kabuli accessions for their coloration/pigmentation. This seed color-regulating MATE gene revealed strong purifying selection pressure primarily in LB/YB seed colored desi and wild Cicer reticulatum accessions compared with the BE seed colored kabuli accessions. The functionally relevant molecular tags identified have potential to decipher the complex transcriptional regulatory gene function of seed coat coloration and for understanding the selective sweep-based seed color trait evolutionary pattern in cultivated and wild accessions during chickpea domestication. The genome-wide integrated approach employed will expedite marker-assisted genetic enhancement for developing cultivars with desirable seed coat color types in chickpea. PMID:26635822
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bastien, C.; Machlin, S.; Zhang, Y.
Restriction maps of genes required for the synthesis of active methanol dehydrogenase in Methylobacterium organophilum XX and Methylobacterium sp. strain AM1 have been completed and compared. In these two species of pink-pigmented, type II methylotrophs, 15 genes were identified that were required for the expression of methanol dehydrogenase activity. None of these genes were required for the synthesis of the prosthetic group of methanol dehydrogenase, pyrroloquinoline quinone. The structural gene required for the synthesis of cytochrome c{sub L}, an electron acceptor uniquely required for methanol dehydrogenase, and the genes encoding small basic peptides that copurified with methanol dehydrogenases were closelymore » linked to the methanol dehydrogenase structural genes. A cloned 22-kilobase DNA insert from Methylsporovibrio methanica 81Z, an obligate type II methanotroph, complemented mutants that contained lesions in four genes closely linked to the methanol dehydrogenase structural genes. The methanol dehydrogenase and cytochrome c{sub L} structural genes were found to be transcribed independently in M. organophilum XX. Only two of the genes required for methanol dehydrogenase synthesis in this bacterium were found to be cotranscribed.« less
Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones
Imanishi, Tadashi; Itoh, Takeshi; Suzuki, Yutaka; O'Donovan, Claire; Fukuchi, Satoshi; Koyanagi, Kanako O; Barrero, Roberto A; Tamura, Takuro; Yamaguchi-Kabata, Yumi; Tanino, Motohiko; Yura, Kei; Miyazaki, Satoru; Ikeo, Kazuho; Homma, Keiichi; Kasprzyk, Arek; Nishikawa, Tetsuo; Hirakawa, Mika; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Ashurst, Jennifer; Jia, Libin; Nakao, Mitsuteru; Thomas, Michael A; Mulder, Nicola; Karavidopoulou, Youla; Jin, Lihua; Kim, Sangsoo; Yasuda, Tomohiro; Lenhard, Boris; Eveno, Eric; Suzuki, Yoshiyuki; Yamasaki, Chisato; Takeda, Jun-ichi; Gough, Craig; Hilton, Phillip; Fujii, Yasuyuki; Sakai, Hiroaki; Tanaka, Susumu; Amid, Clara; Bellgard, Matthew; Bonaldo, Maria de Fatima; Bono, Hidemasa; Bromberg, Susan K; Brookes, Anthony J; Bruford, Elspeth; Carninci, Piero; Chelala, Claude; Couillault, Christine; de Souza, Sandro J.; Debily, Marie-Anne; Devignes, Marie-Dominique; Dubchak, Inna; Endo, Toshinori; Estreicher, Anne; Eyras, Eduardo; Fukami-Kobayashi, Kaoru; R. Gopinath, Gopal; Graudens, Esther; Hahn, Yoonsoo; Han, Michael; Han, Ze-Guang; Hanada, Kousuke; Hanaoka, Hideki; Harada, Erimi; Hashimoto, Katsuyuki; Hinz, Ursula; Hirai, Momoki; Hishiki, Teruyoshi; Hopkinson, Ian; Imbeaud, Sandrine; Inoko, Hidetoshi; Kanapin, Alexander; Kaneko, Yayoi; Kasukawa, Takeya; Kelso, Janet; Kersey, Paul; Kikuno, Reiko; Kimura, Kouichi; Korn, Bernhard; Kuryshev, Vladimir; Makalowska, Izabela; Makino, Takashi; Mano, Shuhei; Mariage-Samson, Regine; Mashima, Jun; Matsuda, Hideo; Mewes, Hans-Werner; Minoshima, Shinsei; Nagai, Keiichi; Nagasaki, Hideki; Nagata, Naoki; Nigam, Rajni; Ogasawara, Osamu; Ohara, Osamu; Ohtsubo, Masafumi; Okada, Norihiro; Okido, Toshihisa; Oota, Satoshi; Ota, Motonori; Ota, Toshio; Otsuki, Tetsuji; Piatier-Tonneau, Dominique; Poustka, Annemarie; Ren, Shuang-Xi; Saitou, Naruya; Sakai, Katsunaga; Sakamoto, Shigetaka; Sakate, Ryuichi; Schupp, Ingo; Servant, Florence; Sherry, Stephen; Shiba, Rie; Shimizu, Nobuyoshi; Shimoyama, Mary; Simpson, Andrew J; Soares, Bento; Steward, Charles; Suwa, Makiko; Suzuki, Mami; Takahashi, Aiko; Tamiya, Gen; Tanaka, Hiroshi; Taylor, Todd; Terwilliger, Joseph D; Unneberg, Per; Veeramachaneni, Vamsi; Watanabe, Shinya; Wilming, Laurens; Yasuda, Norikazu; Yoo, Hyang-Sook; Stodolsky, Marvin; Makalowski, Wojciech; Go, Mitiko; Nakai, Kenta; Takagi, Toshihisa; Kanehisa, Minoru; Sakaki, Yoshiyuki; Quackenbush, John; Okazaki, Yasushi; Hayashizaki, Yoshihide; Hide, Winston; Chakraborty, Ranajit; Nishikawa, Ken; Sugawara, Hideaki; Tateno, Yoshio; Chen, Zhu; Oishi, Michio; Tonellato, Peter; Apweiler, Rolf; Okubo, Kousaku; Wagner, Lukas; Wiemann, Stefan; Strausberg, Robert L; Isogai, Takao; Auffray, Charles; Nomura, Nobuo; Sugano, Sumio
2004-01-01
The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology. PMID:15103394
The cld mutation: narrowing the critical chromosomal region and selecting candidate genes.
Péterfy, Miklós; Mao, Hui Z; Doolittle, Mark H
2006-10-01
Combined lipase deficiency (cld) is a recessive, lethal mutation specific to the tw73 haplotype on mouse Chromosome 17. While the cld mutation results in lipase proteins that are inactive, aggregated, and retained in the endoplasmic reticulum (ER), it maps separately from the lipase structural genes. We have narrowed the gene critical region by about 50% using the tw18 haplotype for deletion mapping and a recombinant chromosome used originally to map cld with respect to the phenotypic marker tf. The region now extends from 22 to 25.6 Mbp on the wild-type chromosome, currently containing 149 genes and 50 expressed sequence tags (ESTs). To identify the affected gene, we have selected candidates based on their known role in associated biological processes, cellular components, and molecular functions that best fit with the predicted function of the cld gene. A secondary approach was based on differences in mRNA levels between mutant (cld/cld) and unaffected (+/cld) cells. Using both approaches, we have identified seven functional candidates with an ER localization and/or an involvement in protein maturation and folding that could explain the lipase deficiency, and six expression candidates that exhibit large differences in mRNA levels between mutant and unaffected cells. Significantly, two genes were found to be candidates with regard to both function and expression, thus emerging as the strongest candidates for cld. We discuss the implications of our mapping results and our selection of candidates with respect to other genes, deletions, and mutations occurring in the cld critical region.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Farah, S.B.; Ramos, C.F.; Bortoletto, R.K.
1994-09-01
Microdeletions in proximal and distal Yq11 found {open_quotes}de novo{close_quotes} in males with idiopathic azoospermia or a severe oligospermia suggested that these Y mutations interrupt the gene structure(s) of a spermatogenesis function located in Yq11 and defined earlier as AZF. Using an extended interval map, dividing Yq11 into 22 subintervals (D1-D22) a third class of microdeletions could now be mapped in middle Yq11. It was found {open_quotes}de novo{close_quotes} in two sterile males with idiopathic azoospermia. Histological analysis of testes tissue sections of both males reveals arrest of spermatogenesis during the pachytene stage. Y-mutations in proximal Yq11 were usually associated to anmore » arrest of spermatogenesis before the proliferation phase of spermatogonia; the corresponding Y gene was defined as {open_quotes}AZFa{close_quotes}. It is, therefore, assumed that the new class of microdeletions, observed now in the middle of Yq11, disrupt another Y spermatogenesis gene expressed during the spermatocyte stage and defined earlier as {open_quotes}AZFb{close_quotes}. Experimental evidence presented during the meeting will try to confirm this view discussing a series of Y genes (AZFa,b,c,...) in Yq11 functioning at different stages during human spermatogenesis.« less
Data on the genome-wide identification of CNL R-genes in Setaria italica (L.) P. Beauv.
Andersen, Ethan J; Nepal, Madhav P
2017-08-01
We report data associated with the identification of 242 disease resistance genes (R-genes) in the genome of Setaria italica as presented in "Genetic diversity of disease resistance genes in foxtail millet ( Setaria italica L.)" (Andersen and Nepal, 2017) [1]. Our data describe the structure and evolution of the Coiled-coil, Nucleotide-binding site, Leucine-rich repeat (CNL) R-genes in foxtail millet. The CNL genes were identified through rigorous extraction and analysis of recently available plant genome sequences using cutting-edge analytical software. Data visualization includes gene structure diagrams, chromosomal syntenic maps, a chromosomal density plot, and a maximum-likelihood phylogenetic tree comparing Sorghum bicolor , Panicum virgatum , Setaria italica , and Arabidopsis thaliana . Compilation of InterProScan annotations, Gene Ontology (GO) annotations, and Basic Local Alignment Search Tool (BLAST) results for the 242 R-genes identified in the foxtail millet genome are also included in tabular format.
A roadmap for functional structural variants in the soybean genome
USDA-ARS?s Scientific Manuscript database
Gene structural variation (SV) has recently emerged as a key genetic mechanism underlying several important phenotypic traits in crop species. We screened a panel of 41 soybean accessions serving as parents in a soybean nested association mapping population for deletions and duplications in over 53...
2012-01-01
Background The first draft assembly and gene prediction of the grapevine genome (8X base coverage) was made available to the scientific community in 2007, and functional annotation was developed on this gene prediction. Since then additional Sanger sequences were added to the 8X sequences pool and a new version of the genomic sequence with superior base coverage (12X) was produced. Results In order to more efficiently annotate the function of the genes predicted in the new assembly, it is important to build on as much of the previous work as possible, by transferring 8X annotation of the genome to the 12X version. The 8X and 12X assemblies and gene predictions of the grapevine genome were compared to answer the question, “Can we uniquely map 8X predicted genes to 12X predicted genes?” The results show that while the assemblies and gene structure predictions are too different to make a complete mapping between them, most genes (18,725) showed a one-to-one relationship between 8X predicted genes and the last version of 12X predicted genes. In addition, reshuffled genomic sequence structures appeared. These highlight regions of the genome where the gene predictions need to be taken with caution. Based on the new grapevine gene functional annotation and in-depth functional categorization, twenty eight new molecular networks have been created for VitisNet while the existing networks were updated. Conclusions The outcomes of this study provide a functional annotation of the 12X genes, an update of VitisNet, the system of the grapevine molecular networks, and a new functional categorization of genes. Data are available at the VitisNet website (http://www.sdstate.edu/ps/research/vitis/pathways.cfm). PMID:22554261
Pingault, Lise; Choulet, Frédéric; Alberti, Adriana; Glover, Natasha; Wincker, Patrick; Feuillet, Catherine; Paux, Etienne
2015-02-10
Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of the genome structure on gene organization, function, and regulation. However, because of the lack of a reference genome sequence, such studies have long been hampered and our knowledge of the wheat gene space is still limited. The access to the reference sequence of the wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before. By combining this sequence with RNA-seq data, we construct a fine transcriptome map of the chromosome 3B. More than 8,800 transcription sites are identified, that are distributed throughout the entire chromosome. Expression level, expression breadth, alternative splicing as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level. Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.
The structure of the human interferon alpha/beta receptor gene.
Lutfalla, G; Gardiner, K; Proudhon, D; Vielh, E; Uzé, G
1992-02-05
Using the cDNA coding for the human interferon alpha/beta receptor (IFNAR), the IFNAR gene has been physically mapped relative to the other loci of the chromosome 21q22.1 region. 32,906 base pairs covering the IFNAR gene have been cloned and sequenced. Primer extension and solution hybridization-ribonuclease protection have been used to determine that the transcription of the gene is initiated in a broad region of 20 base pairs. Some aspects of the polymorphism of the gene, including noncoding sequences, have been analyzed; some are allelic differences in the coding sequence that induce amino acid variations in the resulting protein. The exon structure of the IFNAR gene and of that of the available genes for the receptors of the cytokine/growth hormone/prolactin/interferon receptor family have been compared with the predictions for the secondary structure of those receptors. From this analysis, we postulate a common origin and propose an hypothesis for the divergence from the immunoglobulin superfamily.
10. international mouse genome conference
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meisler, M.H.
Ten years after hosting the First International Mammalian Genome Conference in Paris in 1986, Dr. Jean-Louis Guenet presided over the Tenth Conference at the Pasteur Institute, October 7--10, 1996. The 1986 conference was a satellite to the Human Gene Mapping Workshop and had approximately 50 attendees. The 1996 meeting was attended by 300 scientists from around the world. In the interim, the number of mapped loci in the mouse increased from 1,000 to over 20,000. This report contains a listing of the program and its participants, and two articles that review the meeting and the role of the laboratory mousemore » in the Human Genome project. More than 200 papers were presented at the conference covering the following topics: International mouse chromosome committee meetings; Mutant generation and identification; Physical and genetic maps; New technology and resources; Chromatin structure and gene regulation; Rate and hamster genetic maps; Informatics and databases; and Quantitative trait analysis.« less
A maximum entropy model for chromatin structure
NASA Astrophysics Data System (ADS)
Farre, Pau; Emberly, Eldon; Emberly Group Team
The DNA inside the nucleus of eukaryotic cells shows a variety of conserved structures at different length scales These structures are formed by interactions between protein complexes that bind to the DNA and regulate gene activity. Recent high throughput sequencing techniques allow for the measurement both of the genome wide contact map of the folded DNA within a cell (HiC) and where various proteins are bound to the DNA (ChIP-seq). In this talk I will present a maximum-entropy method capable of both predicting HiC contact maps from binding data, and binding data from HiC contact maps. This method results in an intuitive Ising-type model that is able to predict how altering the presence of binding factors can modify chromosome conformation, without the need of polymer simulations.
Braberg, Hannes; Moehle, Erica A.; Shales, Michael; Guthrie, Christine; Krogan, Nevan J.
2014-01-01
We have achieved a residue-level resolution of genetic interaction mapping – a technique that measures how the function of one gene is affected by the alteration of a second gene – by analyzing point mutations. Here, we describe how to interpret point mutant genetic interactions, and outline key applications for the approach, including interrogation of protein interaction interfaces and active sites, and examination of post-translational modifications. Genetic interaction analysis has proven effective for characterizing cellular processes; however, to date, systematic high-throughput genetic interaction screens have relied on gene deletions or knockdowns, which limits the resolution of gene function analysis and poses problems for multifunctional genes. Our point mutant approach addresses these issues, and further provides a tool for in vivo structure-function analysis that complements traditional biophysical methods. We also discuss the potential for genetic interaction mapping of point mutations in human cells and its application to personalized medicine. PMID:24842270
Mapping cis- and trans-regulatory effects across multiple tissues in twins
Grundberg, Elin; Small, Kerrin S.; Hedman, Åsa K.; Nica, Alexandra C.; Buil, Alfonso; Keildson, Sarah; Bell, Jordana T.; Yang, Tsun-Po; Meduri, Eshwar; Barrett, Amy; Nisbett, James; Sekowska, Magdalena; Wilk, Alicja; Shin, So-Youn; Glass, Daniel; Travers, Mary; Min, Josine L.; Ring, Sue; Ho, Karen; Thorleifsson, Gudmar; Kong, Augustine; Thorsteindottir, Unnur; Ainali, Chrysanthi; Dimas, Antigone S.; Hassanali, Neelam; Ingle, Catherine; Knowles, David; Krestyaninova, Maria; Lowe, Christopher E.; Di Meglio, Paola; Montgomery, Stephen B.; Parts, Leopold; Potter, Simon; Surdulescu, Gabriela; Tsaprouni, Loukia; Tsoka, Sophia; Bataille, Veronique; Durbin, Richard; Nestle, Frank O.; O’Rahilly, Stephen; Soranzo, Nicole; Lindgren, Cecilia M.; Zondervan, Krina T.; Ahmadi, Kourosh R.; Schadt, Eric E.; Stefansson, Kari; Smith, George Davey; McCarthy, Mark I.; Deloukas, Panos; Dermitzakis, Emmanouil T.; Spector, Tim D.
2013-01-01
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes. PMID:22941192
Kulski, Jerzy K; Shiina, Takashi; Anzai, Tatsuya; Kohara, Sakae; Inoko, Hidetoshi
2002-12-01
The major histocompatibility complex (MHC) genomic region is composed of a group of linked genes involved functionally with the adaptive and innate immune systems. The class I and class II genes are intrinsic features of the MHC and have been found in all the jawed vertebrates studied so far. The MHC genomic regions of the human and the chicken (B locus) have been fully sequenced and mapped, and the mouse MHC sequence is almost finished. Information on the MHC genomic structures (size, complexity, genic and intergenic composition and organization, gene order and number) of other vertebrates is largely limited or nonexistent. Therefore, we are mapping, sequencing and analyzing the MHC genomic regions of different human haplotypes and at least eight nonhuman species. Here, we review our progress with these sequences and compare the human MHC structure with that of the nonhuman primates (chimpanzee and rhesus macaque), other mammals (pigs, mice and rats) and nonmammalian vertebrates such as birds (chicken and quail), bony fish (medaka, pufferfish and zebrafish) and cartilaginous fish (nurse shark). This comparison reveals a complex MHC structure for mammals and a relatively simpler design for nonmammalian animals with a hypothetical prototypic structure for the shark. In the mammalian MHC, there are two to five different class I duplication blocks embedded within a framework of conserved nonclass I and/or nonclass II genes. With a few exceptions, the class I framework genes are absent from the MHC of birds, bony fish and sharks. Comparative genomics of the MHC reveal a highly plastic region with major structural differences between the mammalian and nonmammalian vertebrates. Additional genomic data are needed on animals of the reptilia, crocodilia and marsupial classes to find the origins of the class I framework genes and examples of structures that may be intermediate between the simple and complex MHC organizations of birds and mammals, respectively.
Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P
2013-03-21
Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group, we provide experimental evidence suggesting that the identified candidates do regulate the target genes predicted by GFlasso. Thus, this structured association analysis of a yeast eQTL dataset via GFlasso, coupled with extensive bioinformatics analysis, discovers a novel regulation pattern between multiple eQTL hotspots and functional gene modules. Furthermore, this analysis demonstrates the potential of GFlasso as a powerful computational tool for eQTL studies that exploit the rich structural information among expression traits due to correlation, regulation, or other forms of biological dependencies.
Gene networks associated with conditional fear in mice identified using a systems genetics approach
2011-01-01
Background Our understanding of the genetic basis of learning and memory remains shrouded in mystery. To explore the genetic networks governing the biology of conditional fear, we used a systems genetics approach to analyze a hybrid mouse diversity panel (HMDP) with high mapping resolution. Results A total of 27 behavioral quantitative trait loci were mapped with a false discovery rate of 5%. By integrating fear phenotypes, transcript profiling data from hippocampus and striatum and also genotype information, two gene co-expression networks correlated with context-dependent immobility were identified. We prioritized the key markers and genes in these pathways using intramodular connectivity measures and structural equation modeling. Highly connected genes in the context fear modules included Psmd6, Ube2a and Usp33, suggesting an important role for ubiquitination in learning and memory. In addition, we surveyed the architecture of brain transcript regulation and demonstrated preservation of gene co-expression modules in hippocampus and striatum, while also highlighting important differences. Rps15a, Kif3a, Stard7, 6330503K22RIK, and Plvap were among the individual genes whose transcript abundance were strongly associated with fear phenotypes. Conclusion Application of our multi-faceted mapping strategy permits an increasingly detailed characterization of the genetic networks underlying behavior. PMID:21410935
Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai
2015-11-24
Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.
Zhang, Qianqian; Guldbrandtsen, Bernt; Calus, Mario P L; Lund, Mogens Sandø; Sahana, Goutam
2016-08-17
There is growing interest in the role of rare variants in the variation of complex traits due to increasing evidence that rare variants are associated with quantitative traits. However, association methods that are commonly used for mapping common variants are not effective to map rare variants. Besides, livestock populations have large half-sib families and the occurrence of rare variants may be confounded with family structure, which makes it difficult to disentangle their effects from family mean effects. We compared the power of methods that are commonly applied in human genetics to map rare variants in cattle using whole-genome sequence data and simulated phenotypes. We also studied the power of mapping rare variants using linear mixed models (LMM), which are the method of choice to account for both family relationships and population structure in cattle. We observed that the power of the LMM approach was low for mapping a rare variant (defined as those that have frequencies lower than 0.01) with a moderate effect (5 to 8 % of phenotypic variance explained by multiple rare variants that vary from 5 to 21 in number) contributing to a QTL with a sample size of 1000. In contrast, across the scenarios studied, statistical methods that are specialized for mapping rare variants increased power regardless of whether multiple rare variants or a single rare variant underlie a QTL. Different methods for combining rare variants in the test single nucleotide polymorphism set resulted in similar power irrespective of the proportion of total genetic variance explained by the QTL. However, when the QTL variance is very small (only 0.1 % of the total genetic variance), these specialized methods for mapping rare variants and LMM generally had no power to map the variants within a gene with sample sizes of 1000 or 5000. We observed that the methods that combine multiple rare variants within a gene into a meta-variant generally had greater power to map rare variants compared to LMM. Therefore, it is recommended to use rare variant association mapping methods to map rare genetic variants that affect quantitative traits in livestock, such as bovine populations.
Genetic Characterization of the SufJ Frameshift Suppressor in SALMONELLA TYPHIMURIUM
Bossi, Lionello; Kohno, Tadahiko; Roth, John R.
1983-01-01
A new suppressor of +1 frameshift mutations has been isolated in Salmonella typhimurium. This suppressor, sufJ, maps at minute 89 on the Salmonella genetic map between the argH and rpo(rif) loci, closely linked to the gene for the ochre suppressor tyrU(supM). The suppressor mutation is dominant to its wild-type allele, consistent with the suppressor phenotype being caused by an altered tRNA species. The sufJ map position coincides with that of a threonine tRNA(ACC/U) gene; the suppressor has been shown to read the related fourbase codons ACCU, ACCC, ACCA.—The ability of sufJ to correct one particular mutation depends on the presence of a hisT mutation which causes a defect in tRNA modification. This requirement is allele specific, since other frameshift mutations can be corrected by sufJ regardless of the state of the hisT locus.—Strains carrying both a sufJ and a hisT mutation are acutely sensitive to growth inhibition by uracil; the inhibition is reversed by arginine. This behavior is characteristic of strains with mutations affecting the arginine-uracil biosynthetic enzyme carbamyl phosphate synthetase. The combination of two mutations affecting tRNA structure may reduce expression of the structural gene for this enzyme (pyrA). PMID:6188650
Woodruff, R. C.; Ashburner, M.
1979-01-01
The position of the structural gene coding for alcohol dehydrogenase (ADH) in Drosophila melanogaster has been shown to be within polytene chromosome bands 35B1 and 35B3, most probably within 35B2. The genetic and cytological properties of twelve deficiencies in polytene chromosome region 34–35 have been characterized, eleven of which include Adh. Also mapped cytogenetically are seven other recessive visible mutant loci. Flies heterozygous for overlapping deficiencies that include both the Adh locus and that for the outspread mutant (osp: a recessive wing phenotype) are homozygous viable and show a complete ADH negative phenotype and strong osp phenotype. These deficiencies probably include two polytene chromosome bands, 35B2 and 35B3. PMID:115743
Ming, Ray; Yu, Qingyi; Moore, Paul H
2007-06-01
Sex determination is an intriguing system in trioecious papaya. Over the past seven decades various hypotheses, based on the knowledge and information available at the time, have been proposed to explain the genetics of the papaya's sex determination. These include a single gene with three alleles, a group of closely linked genes, a genic balance of sex chromosome over autosomes, classical XY chromosomes, and regulatory elements of the flower development pathway. Recent advancements in genomic technology make it possible to characterize the genomic region involved in sex determination at the molecular level. High density linkage mapping validated the hypothesis that predicted recombination suppression at the sex determination locus. Physical mapping and sample sequencing of the non-recombination region led to the conclusion that sex determination is controlled by a pair of primitive sex chromosomes with a small male-specific region (MSY) of the Y chromosome. We now postulate that two sex determination genes control the sex determination pathway. One, a feminizing or stamen suppressor gene, causes stamen abortion before or at flower inception while the other, a masculinizing or carpel suppressor gene, causes carpel abortion at a later flower developmental stage. Detailed physical mapping is beginning to reveal structural details about the sex determination region and sequencing is expected to uncover candidate sex determining genes. Cloning of the sex determination genes and understanding the sex determination process could have profound application in papaya production.
Miller, Hilary C.; O’Meally, Denis; Ezaz, Tariq; Amemiya, Chris; Marshall-Graves, Jennifer A.; Edwards, Scott
2015-01-01
Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general. PMID:25953959
Characterization of cDNAs and genomic DNAs for human threonyl- and cysteinyl-tRNA synthetases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cruzen, M.E.
1993-01-01
Techniques of molecular biology were used to clone, sequence and map two human aminoacyl-tRNA synthetase (aaRS) cDNAs: threonyl-tRNA synthetase (ThrRS) a class II enzyme and cysteinyl-tRNA synthetase (CysRS) a class I enzyme. The predicted protein sequence of human ThrRS is highly homologous to that of lower eukaryotic and prokaryotic ThRSs, particularly in the regions containing the three structural motifs common to all class II synthetases. Signature regions 1 and 2, which characterize the class IIa subgroup (SerRS, ThrRS and HisRS) are highly conserved from bacteria to human. Structural predictions for human ThrRS based on the known structure of the closelymore » related SerRS from E.coli implicate strongly conserved residues in the signature sequences to be important in substrate binding. The amino terminal 100 residues of the deduced amino acid sequence of ThrRS shares structural similarity to SerRS consistent with forming an antiparallel helix implicated in tRNA binding. The 5' untranslated sequence of the human ThrRS gene shares short stretches of common sequence with the gene for hamster HisRS including a binding site for the promoter specific transcription factor sp-1. The deduced amino acid sequence of human CysRS has a high degree of sequence identify to E. coli CysRS. Human CysRS possesses the classic characteristics of a class I synthetase and is most closely related to the MetRS subgroup. The amino terminal half of human CysRS can be modeled as a nucleotide binding fold and shares significant sequence and structural similarity to the other enzymes in this subgroup. The CysRS structural gene (CARS) was mapped to human chromosome 11p15.5 by fluorescent in situ hybridization. CARS is the first aaRS gene to be mapped to chromosome 11. The steady state of both CysRS and ThrRs mRNA were quantitated in several human tissues. Message levels for these enzymes appear to be subjected to differential regulation in different cell types.« less
Mapping the polysaccharide degradation potential of Aspergillus niger
2012-01-01
Background The degradation of plant materials by enzymes is an industry of increasing importance. For sustainable production of second generation biofuels and other products of industrial biotechnology, efficient degradation of non-edible plant polysaccharides such as hemicellulose is required. For each type of hemicellulose, a complex mixture of enzymes is required for complete conversion to fermentable monosaccharides. In plant-biomass degrading fungi, these enzymes are regulated and released by complex regulatory structures. In this study, we present a methodology for evaluating the potential of a given fungus for polysaccharide degradation. Results Through the compilation of information from 203 articles, we have systematized knowledge on the structure and degradation of 16 major types of plant polysaccharides to form a graphical overview. As a case example, we have combined this with a list of 188 genes coding for carbohydrate-active enzymes from Aspergillus niger, thus forming an analysis framework, which can be queried. Combination of this information network with gene expression analysis on mono- and polysaccharide substrates has allowed elucidation of concerted gene expression from this organism. One such example is the identification of a full set of extracellular polysaccharide-acting genes for the degradation of oat spelt xylan. Conclusions The mapping of plant polysaccharide structures along with the corresponding enzymatic activities is a powerful framework for expression analysis of carbohydrate-active enzymes. Applying this network-based approach, we provide the first genome-scale characterization of all genes coding for carbohydrate-active enzymes identified in A. niger. PMID:22799883
Mapping the polysaccharide degradation potential of Aspergillus niger.
Andersen, Mikael R; Giese, Malene; de Vries, Ronald P; Nielsen, Jens
2012-07-16
The degradation of plant materials by enzymes is an industry of increasing importance. For sustainable production of second generation biofuels and other products of industrial biotechnology, efficient degradation of non-edible plant polysaccharides such as hemicellulose is required. For each type of hemicellulose, a complex mixture of enzymes is required for complete conversion to fermentable monosaccharides. In plant-biomass degrading fungi, these enzymes are regulated and released by complex regulatory structures. In this study, we present a methodology for evaluating the potential of a given fungus for polysaccharide degradation. Through the compilation of information from 203 articles, we have systematized knowledge on the structure and degradation of 16 major types of plant polysaccharides to form a graphical overview. As a case example, we have combined this with a list of 188 genes coding for carbohydrate-active enzymes from Aspergillus niger, thus forming an analysis framework, which can be queried. Combination of this information network with gene expression analysis on mono- and polysaccharide substrates has allowed elucidation of concerted gene expression from this organism. One such example is the identification of a full set of extracellular polysaccharide-acting genes for the degradation of oat spelt xylan. The mapping of plant polysaccharide structures along with the corresponding enzymatic activities is a powerful framework for expression analysis of carbohydrate-active enzymes. Applying this network-based approach, we provide the first genome-scale characterization of all genes coding for carbohydrate-active enzymes identified in A. niger.
Construction of an SSR and RAD-Marker Based Molecular Linkage Map of Vigna vexillata (L.) A. Rich
Chankaew, Sompong; Kaga, Akito; Naito, Ken; Ehara, Hiroshi; Tomooka, Norihiko
2015-01-01
Vigna vexillata (L.) A. Rich. (tuber cowpea) is an underutilized crop for consuming its tuber and mature seeds. Wild form of V. vexillata is a pan-tropical perennial herbaceous plant which has been used by local people as a food. Wild V. vexillata has also been considered as useful gene(s) source for V. unguiculata (cowpea), since it was reported to have various resistance gene(s) for insects and diseases of cowpea. To exploit the potential of V. vexillata, an SSR-based linkage map of V. vexillata was developed. A total of 874 SSR markers successfully amplified single DNA fragment in V. vexillata among 1,336 SSR markers developed from Vigna angularis (azuki bean), V. unguiculata and Phaseolus vulgaris (common bean). An F2 population of 300 plants derived from a cross between salt resistant (V1) and susceptible (V5) accessions was used for mapping. A genetic linkage map was constructed using 82 polymorphic SSR markers loci, which could be assigned to 11 linkage groups spanning 511.5 cM in length with a mean distance of 7.2 cM between adjacent markers. To develop higher density molecular linkage map and to confirm SSR markers position in a linkage map, RAD markers were developed and a combined SSR and RAD markers linkage map of V. vexillata was constructed. A total of 559 (84 SSR and 475 RAD) markers loci could be assigned to 11 linkage groups spanning 973.9 cM in length with a mean distance of 1.8 cM between adjacent markers. Linkage and genetic position of all SSR markers in an SSR linkage map were confirmed. When an SSR genetic linkage map of V. vexillata was compared with those of V. radiata and V. unguiculata, it was suggested that the structure of V. vexillata chromosome was considerably differentiated. This map is the first SSR and RAD marker-based V. vexillata linkage map which can be used for the mapping of useful traits. PMID:26398819
Defining the location of promoter-associated R-loops at near-nucleotide resolution using bisDRIP-seq
Dumelie, Jason G
2017-01-01
R-loops are features of chromatin consisting of a strand of DNA hybridized to RNA, as well as the expelled complementary DNA strand. R-loops are enriched at promoters where they have recently been shown to have important roles in modifying gene expression. However, the location of promoter-associated R-loops and the genomic domains they perturb to modify gene expression remain unclear. To resolve this issue, we developed a bisulfite-based approach, bisDRIP-seq, to map R-loops across the genome at near-nucleotide resolution in MCF-7 cells. We found the location of promoter-associated R-loops is dependent on the presence of introns. In intron-containing genes, R-loops are bounded between the transcription start site and the first exon-intron junction. In intronless genes, the 3' boundary displays gene-specific heterogeneity. Moreover, intronless genes are often associated with promoter-associated R-loop formation. Together, these studies provide a high-resolution map of R-loops and identify gene structure as a critical determinant of R-loop formation. PMID:29072160
Zhou, Gaofeng; Jian, Jianbo; Wang, Penghao; Li, Chengdao; Tao, Ye; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark; Yang, Huaan
2018-01-01
An ultra-high density genetic map containing 34,574 sequence-defined markers was developed in Lupinus angustifolius. Markers closely linked to nine genes of agronomic traits were identified. A physical map was improved to cover 560.5 Mb genome sequence. Lupin (Lupinus angustifolius L.) is a recently domesticated legume grain crop. In this study, we applied the restriction-site associated DNA sequencing (RADseq) method to genotype an F 9 recombinant inbred line population derived from a wild type × domesticated cultivar (W × D) cross. A high density linkage map was developed based on the W × D population. By integrating sequence-defined DNA markers reported in previous mapping studies, we established an ultra-high density consensus genetic map, which contains 34,574 markers consisting of 3508 loci covering 2399 cM on 20 linkage groups. The largest gap in the entire consensus map was 4.73 cM. The high density W × D map and the consensus map were used to develop an improved physical map, which covered 560.5 Mb of genome sequence data. The ultra-high density consensus linkage map, the improved physical map and the markers linked to genes of breeding interest reported in this study provide a common tool for genome sequence assembly, structural genomics, comparative genomics, functional genomics, QTL mapping, and molecular plant breeding in lupin.
An integrated map of structural variation in 2,504 human genomes.
Sudmant, Peter H; Rausch, Tobias; Gardner, Eugene J; Handsaker, Robert E; Abyzov, Alexej; Huddleston, John; Zhang, Yan; Ye, Kai; Jun, Goo; Fritz, Markus Hsi-Yang; Konkel, Miriam K; Malhotra, Ankit; Stütz, Adrian M; Shi, Xinghua; Casale, Francesco Paolo; Chen, Jieming; Hormozdiari, Fereydoun; Dayama, Gargi; Chen, Ken; Malig, Maika; Chaisson, Mark J P; Walter, Klaudia; Meiers, Sascha; Kashin, Seva; Garrison, Erik; Auton, Adam; Lam, Hugo Y K; Mu, Xinmeng Jasmine; Alkan, Can; Antaki, Danny; Bae, Taejeong; Cerveira, Eliza; Chines, Peter; Chong, Zechen; Clarke, Laura; Dal, Elif; Ding, Li; Emery, Sarah; Fan, Xian; Gujral, Madhusudan; Kahveci, Fatma; Kidd, Jeffrey M; Kong, Yu; Lameijer, Eric-Wubbo; McCarthy, Shane; Flicek, Paul; Gibbs, Richard A; Marth, Gabor; Mason, Christopher E; Menelaou, Androniki; Muzny, Donna M; Nelson, Bradley J; Noor, Amina; Parrish, Nicholas F; Pendleton, Matthew; Quitadamo, Andrew; Raeder, Benjamin; Schadt, Eric E; Romanovitch, Mallory; Schlattl, Andreas; Sebra, Robert; Shabalin, Andrey A; Untergasser, Andreas; Walker, Jerilyn A; Wang, Min; Yu, Fuli; Zhang, Chengsheng; Zhang, Jing; Zheng-Bradley, Xiangqun; Zhou, Wanding; Zichner, Thomas; Sebat, Jonathan; Batzer, Mark A; McCarroll, Steven A; Mills, Ryan E; Gerstein, Mark B; Bashir, Ali; Stegle, Oliver; Devine, Scott E; Lee, Charles; Eichler, Evan E; Korbel, Jan O
2015-10-01
Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.
The major resistance gene cluster in lettuce is highly duplicated and spans several megabases.
Meyers, B C; Chin, D B; Shen, K A; Sivaramakrishnan, S; Lavelle, D O; Zhang, Z; Michelmore, R W
1998-01-01
At least 10 Dm genes conferring resistance to the oomycete downy mildew fungus Bremia lactucae map to the major resistance cluster in lettuce. We investigated the structure of this cluster in the lettuce cultivar Diana, which contains Dm3. A deletion breakpoint map of the chromosomal region flanking Dm3 was saturated with a variety of molecular markers. Several of these markers are components of a family of resistance gene candidates (RGC2) that encode a nucleotide binding site and a leucine-rich repeat region. These motifs are characteristic of plant disease resistance genes. Bacterial artificial chromosome clones were identified by using duplicated restriction fragment length polymorphism markers from the region, including the nucleotide binding site-encoding region of RGC2. Twenty-two distinct members of the RGC2 family were characterized from the bacterial artificial chromosomes; at least two additional family members exist. The RGC2 family is highly divergent; the nucleotide identity was as low as 53% between the most distantly related copies. These RGC2 genes span at least 3.5 Mb. Eighteen members were mapped on the deletion breakpoint map. A comparison between the phylogenetic and physical relationships of these sequences demonstrated that closely related copies are physically separated from one another and indicated that complex rearrangements have shaped this region. Analysis of low-copy genomic sequences detected no genes, including RGC2, in the Dm3 region, other than sequences related to retrotransposons and transposable elements. The related but divergent family of RGC2 genes may act as a resource for the generation of new resistance phenotypes through infrequent recombination or unequal crossing over. PMID:9811791
2011-01-01
Background The identification of genes or quantitative trait loci that are expressed in response to different environmental factors such as temperature and light, through functional mapping, critically relies on precise modeling of the covariance structure. Previous work used separable parametric covariance structures, such as a Kronecker product of autoregressive one [AR(1)] matrices, that do not account for interaction effects of different environmental factors. Results We implement a more robust nonparametric covariance estimator to model these interactions within the framework of functional mapping of reaction norms to two signals. Our results from Monte Carlo simulations show that this estimator can be useful in modeling interactions that exist between two environmental signals. The interactions are simulated using nonseparable covariance models with spatio-temporal structural forms that mimic interaction effects. Conclusions The nonparametric covariance estimator has an advantage over separable parametric covariance estimators in the detection of QTL location, thus extending the breadth of use of functional mapping in practical settings. PMID:21269481
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hanson, R.S.
Broad host range plasmid vectors useful for cloning genes from bacteria that grow on methane and methanol were constructed. We have cloned and mapped nineteen genes required for the growth of Methylobacterium organophilum strain XX on methanol. Nineteen genes were found in seven linkage groups on the M. organophilum genome and were separated by 40 kb or more. Eleven genes were required for the synthesis of methanol dehydrogenase (MDH) and were located in three unlinked gene clusters. The MDH structural gene was localized on a 2.5 kb DNA fragment. The gene was sequenced and contains a 175 bp untranslated leadermore » sequence, a signal sequence and the structural gene. MDH messenger RNA (mRNA) has a half life of approximately 20 min. and is present at approximately 2% of the cellular mRNA. The structural gene for the ..gamma.. subunit of methane monoxygenases has been cloned from Methylosporovibrio. Methane monooxygenase subunits have been purified by Prof. J. Lipscomb's laboratory and are being sequenced to construct DNA probes to identify cloned subunit genes. New facultative methylotrophic bacteria were isolated and characterized. Several amino acid auxotrophs have been isolated. 11 refs.« less
Gutiérrez, Gabriel; Millán-Zambrano, Gonzalo; Medina, Daniel A; Jordán-Pla, Antonio; Pérez-Ortín, José E; Peñate, Xenia; Chávez, Sebastián
2017-12-07
TFIIS stimulates RNA cleavage by RNA polymerase II and promotes the resolution of backtracking events. TFIIS acts in the chromatin context, but its contribution to the chromatin landscape has not yet been investigated. Co-transcriptional chromatin alterations include subtle changes in nucleosome positioning, like those expected to be elicited by TFIIS, which are elusive to detect. The most popular method to map nucleosomes involves intensive chromatin digestion by micrococcal nuclease (MNase). Maps based on these exhaustively digested samples miss any MNase-sensitive nucleosomes caused by transcription. In contrast, partial digestion approaches preserve such nucleosomes, but introduce noise due to MNase sequence preferences. A systematic way of correcting this bias for massively parallel sequencing experiments is still missing. To investigate the contribution of TFIIS to the chromatin landscape, we developed a refined nucleosome-mapping method in Saccharomyces cerevisiae. Based on partial MNase digestion and a sequence-bias correction derived from naked DNA cleavage, the refined method efficiently mapped nucleosomes in promoter regions rich in MNase-sensitive structures. The naked DNA correction was also important for mapping gene body nucleosomes, particularly in those genes whose core promoters contain a canonical TATA element. With this improved method, we analyzed the global nucleosomal changes caused by lack of TFIIS. We detected a general increase in nucleosomal fuzziness and more restricted changes in nucleosome occupancy, which concentrated in some gene categories. The TATA-containing genes were preferentially associated with decreased occupancy in gene bodies, whereas the TATA-like genes did so with increased fuzziness. The detected chromatin alterations correlated with functional defects in nascent transcription, as revealed by genomic run-on experiments. The combination of partial MNase digestion and naked DNA correction of the sequence bias is a precise nucleosomal mapping method that does not exclude MNase-sensitive nucleosomes. This method is useful for detecting subtle alterations in nucleosome positioning produced by lack of TFIIS. Their analysis revealed that TFIIS generally contributed to nucleosome positioning in both gene promoters and bodies. The independent effect of lack of TFIIS on nucleosome occupancy and fuzziness supports the existence of alternative chromatin dynamics during transcription elongation.
A Roadmap for Functional Structural Variants in the Soybean Genome
Anderson, Justin E.; Kantar, Michael B.; Kono, Thomas Y.; Fu, Fengli; Stec, Adrian O.; Song, Qijian; Cregan, Perry B.; Specht, James E.; Diers, Brian W.; Cannon, Steven B.; McHale, Leah K.; Stupar, Robert M.
2014-01-01
Gene structural variation (SV) has recently emerged as a key genetic mechanism underlying several important phenotypic traits in crop species. We screened a panel of 41 soybean (Glycine max) accessions serving as parents in a soybean nested association mapping population for deletions and duplications in more than 53,000 gene models. Array hybridization and whole genome resequencing methods were used as complementary technologies to identify SV in 1528 genes, or approximately 2.8%, of the soybean gene models. Although SV occurs throughout the genome, SV enrichment was noted in families of biotic defense response genes. Among accessions, SV was nearly eightfold less frequent for gene models that have retained paralogs since the last whole genome duplication event, compared with genes that have not retained paralogs. Increases in gene copy number, similar to that described at the Rhg1 resistance locus, account for approximately one-fourth of the genic SV events. This assessment of soybean SV occurrence presents a target list of genes potentially responsible for rapidly evolving and/or adaptive traits. PMID:24855315
Bärlund, M; Nupponen, N N; Karhu, R; Tanner, M M; Paavola, P; Kallioniemi, O P; Kallioniemi, A
1998-01-01
Defining boundaries of chromosomal rearrangements at the molecular level would benefit from landmarks that link the cytogenetic map to physical, genetic, and transcript maps, as well as from large-insert FISH probes for such loci to detect numerical and structural rearrangements in metaphase or interphase cells. Here, we determined the locations of 24 genetically mapped CEPH-Mega YACs along the FLpter scale (fractional length from p-telomere) by quantitative fluorescence in situ hybridization analysis. This generated a set of cytogenetically mapped probes for chromosome 17 with an average spacing of about 5 cM. We then developed large-insert YAC, BAC, PAC, or P1 clones to the following 24 known genes, and determined refined map locations along the same FLpter scale: pter-TP53-TOP3-cen-TNFAIP1-ERBB2-TOP2A- BRCA1-TCF11-NME1-HLF-ZNF147/CL N80-BCL5/MPO/SFRS1-TBX2-PECAM1-DDX5/ PRKCA-ICAM2-GH1/PRKAR1A-GRB2-CDK3 /FKHL13-qter. Taken together, these 48 cytogenetically mapped large-insert probes provide tools for the molecular analysis of chromosome 17 rearrangements, such as mapping amplification, deletion, and translocation breakpoints in this chromosome, in cancer and other diseases.
Creating and validating cis-regulatory maps of tissue-specific gene expression regulation
O'Connor, Timothy R.; Bailey, Timothy L.
2014-01-01
Predicting which genomic regions control the transcription of a given gene is a challenge. We present a novel computational approach for creating and validating maps that associate genomic regions (cis-regulatory modules–CRMs) with genes. The method infers regulatory relationships that explain gene expression observed in a test tissue using widely available genomic data for ‘other’ tissues. To predict the regulatory targets of a CRM, we use cross-tissue correlation between histone modifications present at the CRM and expression at genes within 1 Mbp of it. To validate cis-regulatory maps, we show that they yield more accurate models of gene expression than carefully constructed control maps. These gene expression models predict observed gene expression from transcription factor binding in the CRMs linked to that gene. We show that our maps are able to identify long-range regulatory interactions and improve substantially over maps linking genes and CRMs based on either the control maps or a ‘nearest neighbor’ heuristic. Our results also show that it is essential to include CRMs predicted in multiple tissues during map-building, that H3K27ac is the most informative histone modification, and that CAGE is the most informative measure of gene expression for creating cis-regulatory maps. PMID:25200088
Structure and Expression of Genes for Flavivirus Immunogens.
1985-09-01
the same order in YFV i.e., C-M-E-NSI--- NS3---NS5 and an open reading frame extends at least through the C-M-E-NS1 coding region, consistent with...been determined (Castle et al., 1985). Comparison of these results shows that 1) the six major JEV genes mapped thus far occur in the same order in YFV ...pre-M proteins and 3) the predicted structures of the E, NSI and ns2a proteins of JEV and YFV exhibit a high degree of relatedness. The E proteins
Zhang, Chengxin; Zheng, Wei; Freddolino, Peter L; Zhang, Yang
2018-03-10
Homology-based transferal remains the major approach to computational protein function annotations, but it becomes increasingly unreliable when the sequence identity between query and template decreases below 30%. We propose a novel pipeline, MetaGO, to deduce Gene Ontology attributes of proteins by combining sequence homology-based annotation with low-resolution structure prediction and comparison, and partner's homology-based protein-protein network mapping. The pipeline was tested on a large-scale set of 1000 non-redundant proteins from the CAFA3 experiment. Under the stringent benchmark conditions where templates with >30% sequence identity to the query are excluded, MetaGO achieves average F-measures of 0.487, 0.408, and 0.598, for Molecular Function, Biological Process, and Cellular Component, respectively, which are significantly higher than those achieved by other state-of-the-art function annotations methods. Detailed data analysis shows that the major advantage of the MetaGO lies in the new functional homolog detections from partner's homology-based network mapping and structure-based local and global structure alignments, the confidence scores of which can be optimally combined through logistic regression. These data demonstrate the power of using a hybrid model incorporating protein structure and interaction networks to deduce new functional insights beyond traditional sequence homology-based referrals, especially for proteins that lack homologous function templates. The MetaGO pipeline is available at http://zhanglab.ccmb.med.umich.edu/MetaGO/. Copyright © 2018. Published by Elsevier Ltd.
Aokic, Jun-ya; Kawase, Junya; Hamada, Kazuhisa; Fujimoto, Hiroshi; Yamamoto, Ikki; Usuki, Hironori
2018-01-01
Greater amberjack (Seriola dumerili) is distributed in tropical and temperate waters worldwide and is an important aquaculture fish. We carried out de novo sequencing of the greater amberjack genome to construct a reference genome sequence to identify single nucleotide polymorphisms (SNPs) for breeding amberjack by marker-assisted or gene-assisted selection as well as to identify functional genes for biological traits. We obtained 200 times coverage and constructed a high-quality genome assembly using next generation sequencing technology. The assembled sequences were aligned onto a yellowtail (Seriola quinqueradiata) radiation hybrid (RH) physical map by sequence homology. A total of 215 of the longest amberjack sequences, with a total length of 622.8 Mbp (92% of the total length of the genome scaffolds), were lined up on the yellowtail RH map. We resequenced the whole genomes of 20 greater amberjacks and mapped the resulting sequences onto the reference genome sequence. About 186,000 nonredundant SNPs were successfully ordered on the reference genome. Further, we found differences in the genome structural variations between two greater amberjack populations using BreakDancer. We also analyzed the greater amberjack transcriptome and mapped the annotated sequences onto the reference genome sequence. PMID:29785397
Schönhals, E M; Ortega, F; Barandalla, L; Aragones, A; Ruiz de Galarreta, J I; Liao, J-C; Sanetomo, R; Walkemeier, B; Tacke, E; Ritter, E; Gebhardt, C
2016-04-01
SNPs in candidate genes Pain - 1, InvCD141 (invertases), SSIV (starch synthase), StCDF1 (transcription factor), LapN (leucine aminopeptidase), and cytoplasm type are associated with potato tuber yield, starch content and/or starch yield. Tuber yield (TY), starch content (TSC), and starch yield (TSY) are complex characters of high importance for the potato crop in general and for industrial starch production in particular. DNA markers associated with superior alleles of genes that control the natural variation of TY, TSC, and TSY could increase precision and speed of breeding new cultivars optimized for potato starch production. Diagnostic DNA markers are identified by association mapping in populations of tetraploid potato varieties and advanced breeding clones. A novel association mapping population of 282 genotypes including varieties, breeding clones and Andean landraces was assembled and field evaluated in Northern Spain for TY, TSC, TSY, tuber number (TN) and tuber weight (TW). The landraces had lower mean values of TY, TW, TN, and TSY. The population was genotyped for 183 microsatellite alleles, 221 single nucleotide polymorphisms (SNPs) in fourteen candidate genes and eight known diagnostic markers for TSC and TSY. Association test statistics including kinship and population structure reproduced five known marker-trait associations of candidate genes and discovered new ones, particularly for tuber yield and starch yield. The inclusion of landraces increased the number of detected marker-trait associations. Integration of the present association mapping results with previous QTL linkage mapping studies for TY, TSC, TSY, TW, TN, and tuberization revealed some hot spots of QTL for these traits in the potato genome. The genomic positions of markers linked or associated with QTL for complex tuber traits suggest high multiplicity and genome wide distribution of the underlying genes.
Genomic characterization of putative allergen genes in peach/almond and their synteny with apple
Chen, Lin; Zhang, Shuiming; Illa, Eudald; Song, Lijuan; Wu, Shandong; Howad, Werner; Arús, Pere; Weg, Eric van de; Chen, Kunsong; Gao, Zhongshan
2008-01-01
Background Fruits from several species of the Rosaceae family are reported to cause allergic reactions in certain populations. The allergens identified belong to mainly four protein families: pathogenesis related 10 proteins, thaumatin-like proteins, lipid transfer proteins and profilins. These families of putative allergen genes in apple (Mal d 1 to 4) have been mapped on linkage maps and subsequent genetic study on allelic diversity and hypoallergenic traits has been carried out recently. In peach (Prunus persica), these allergen gene families are denoted as Pru p 1 to 4 and for almond (Prunus dulcis)Pru du 1 to 4. Genetic analysis using current molecular tools may be helpful to establish the cause of allergenicity differences observed among different peach cultivars. This study was to characterize putative peach allergen genes for their genomic sequences and linkage map positions, and to compare them with previously characterized homologous genes in apple (Malus domestica). Results Eight Pru p/du 1 genes were identified, four of which were new. All the Pru p/du 1 genes were mapped in a single bin on the top of linkage group 1 (G1). Five Pru p/du 2 genes were mapped on four different linkage groups, two very similar Pru p/du 2.01 genes (A and B) were on G3, Pru p/du 2.02 on G7,Pru p/du 2.03 on G8 and Pru p/du 2.04 on G1. There were differences in the intron and exon structure in these Pru p/du 2 genes and in their amino acid composition. Three Pru p/du 3 genes (3.01–3.03) containing an intron and a mini exon of 10 nt were mapped in a cluster on G6. Two Pru p/du 4 genes (Pru p/du 4.01 and 4.02) were located on G1 and G7, respectively. The Pru p/du 1 cluster on G1 aligned to the Mal d 1 clusters on LG16; Pru p/du 2.01A and B on G3 to Mal d 2.01A and B on LG9; the Pru p/du 3 cluster on G6 to Mal d 3.01 on LG12; Pru p/du 4.01 on G1 to Mal d 4.03 on LG2; and Pru p/du 4.02 on G7 to Mal d 4.02 on LG2. Conclusion A total of 18 putative peach/almond allergen genes have been mapped on five linkage groups. Their positions confirm the high macro-synteny between peach/almond and apple. The insight gained will help to identify key genes causing differences in allergenicity among different cultivars of peach and other Prunus species. PMID:19014629
A new yeast gene with a myosin-like heptad repeat structure.
Kölling, R; Nguyen, T; Chen, E Y; Botstein, D
1993-03-01
We isolated a gene encoding a 218 kDa myosin-like protein from Saccharomyces cerevisiae using a monoclonal antibody directed against human platelet myosin as a probe. The protein sequence encoded by the MLP1 gene (for myosin-like protein) contains extensive stretches of a heptad-repeat pattern suggesting that the protein can form coiled coils typical of myosins. Immunolocalization experiments using affinity-purified antibodies raised against a TrpE-MLP1 fusion protein showed a dot-like structure adjacent to the nucleus in yeast cells bearing the MLP1 gene on a multicopy plasmid. In mouse epithelial cells the yeast anti-MLP1 antibodies stained the nucleus. Mutants bearing disruptions of the MLP1 gene were viable, but more sensitive to ultraviolet light than wild-type strains, suggesting an involvement of MLP1 in DNA repair. The MLP1 gene was mapped to chromosome 11, 25 cM from met1.
A fruit quality gene map of Prunus
2009-01-01
Background Prunus fruit development, growth, ripening, and senescence includes major biochemical and sensory changes in texture, color, and flavor. The genetic dissection of these complex processes has important applications in crop improvement, to facilitate maximizing and maintaining stone fruit quality from production and processing through to marketing and consumption. Here we present an integrated fruit quality gene map of Prunus containing 133 genes putatively involved in the determination of fruit texture, pigmentation, flavor, and chilling injury resistance. Results A genetic linkage map of 211 markers was constructed for an intraspecific peach (Prunus persica) progeny population, Pop-DG, derived from a canning peach cultivar 'Dr. Davis' and a fresh market cultivar 'Georgia Belle'. The Pop-DG map covered 818 cM of the peach genome and included three morphological markers, 11 ripening candidate genes, 13 cold-responsive genes, 21 novel EST-SSRs from the ChillPeach database, 58 previously reported SSRs, 40 RAFs, 23 SRAPs, 14 IMAs, and 28 accessory markers from candidate gene amplification. The Pop-DG map was co-linear with the Prunus reference T × E map, with 39 SSR markers in common to align the maps. A further 158 markers were bin-mapped to the reference map: 59 ripening candidate genes, 50 cold-responsive genes, and 50 novel EST-SSRs from ChillPeach, with deduced locations in Pop-DG via comparative mapping. Several candidate genes and EST-SSRs co-located with previously reported major trait loci and quantitative trait loci for chilling injury symptoms in Pop-DG. Conclusion The candidate gene approach combined with bin-mapping and availability of a community-recognized reference genetic map provides an efficient means of locating genes of interest in a target genome. We highlight the co-localization of fruit quality candidate genes with previously reported fruit quality QTLs. The fruit quality gene map developed here is a valuable tool for dissecting the genetic architecture of fruit quality traits in Prunus crops. PMID:19995417
1985-01-01
We have determined the DNA sequence of a gene encoding a thymus leukemia (TL) antigen in the BALB/c mouse, and have more definitively mapped the cloned BALB/c Tla-region class I gene clusters. Analysis of the sequence shows that the Tla gene is less closely related to the H-2 genes than H-2 genes are to one another or to a Qa-2,3-region genes. The Tla gene, 17.3A, contains an apparent gene conversion. Comparison of the BALB/c Tla genes with those from C57BL shows that BALB/c has more Tla-region class I genes, and that one of the genes absent in C57BL is gene 17.3A. PMID:3894562
Savage, Jeanne E; Jansen, Philip R; Stringer, Sven; Watanabe, Kyoko; Bryois, Julien; de Leeuw, Christiaan A; Nagel, Mats; Awasthi, Swapnil; Barr, Peter B; Coleman, Jonathan R I; Grasby, Katrina L; Hammerschlag, Anke R; Kaminski, Jakob A; Karlsson, Robert; Krapohl, Eva; Lam, Max; Nygaard, Marianne; Reynolds, Chandra A; Trampush, Joey W; Young, Hannah; Zabaneh, Delilah; Hägg, Sara; Hansell, Narelle K; Karlsson, Ida K; Linnarsson, Sten; Montgomery, Grant W; Muñoz-Manchado, Ana B; Quinlan, Erin B; Schumann, Gunter; Skene, Nathan G; Webb, Bradley T; White, Tonya; Arking, Dan E; Avramopoulos, Dimitrios; Bilder, Robert M; Bitsios, Panos; Burdick, Katherine E; Cannon, Tyrone D; Chiba-Falek, Ornit; Christoforou, Andrea; Cirulli, Elizabeth T; Congdon, Eliza; Corvin, Aiden; Davies, Gail; Deary, Ian J; DeRosse, Pamela; Dickinson, Dwight; Djurovic, Srdjan; Donohoe, Gary; Conley, Emily Drabant; Eriksson, Johan G; Espeseth, Thomas; Freimer, Nelson A; Giakoumaki, Stella; Giegling, Ina; Gill, Michael; Glahn, David C; Hariri, Ahmad R; Hatzimanolis, Alex; Keller, Matthew C; Knowles, Emma; Koltai, Deborah; Konte, Bettina; Lahti, Jari; Le Hellard, Stephanie; Lencz, Todd; Liewald, David C; London, Edythe; Lundervold, Astri J; Malhotra, Anil K; Melle, Ingrid; Morris, Derek; Need, Anna C; Ollier, William; Palotie, Aarno; Payton, Antony; Pendleton, Neil; Poldrack, Russell A; Räikkönen, Katri; Reinvang, Ivar; Roussos, Panos; Rujescu, Dan; Sabb, Fred W; Scult, Matthew A; Smeland, Olav B; Smyrnis, Nikolaos; Starr, John M; Steen, Vidar M; Stefanis, Nikos C; Straub, Richard E; Sundet, Kjetil; Tiemeier, Henning; Voineskos, Aristotle N; Weinberger, Daniel R; Widen, Elisabeth; Yu, Jin; Abecasis, Goncalo; Andreassen, Ole A; Breen, Gerome; Christiansen, Lene; Debrabant, Birgit; Dick, Danielle M; Heinz, Andreas; Hjerling-Leffler, Jens; Ikram, M Arfan; Kendler, Kenneth S; Martin, Nicholas G; Medland, Sarah E; Pedersen, Nancy L; Plomin, Robert; Polderman, Tinca J C; Ripke, Stephan; van der Sluis, Sophie; Sullivan, Patrick F; Vrieze, Scott I; Wright, Margaret J; Posthuma, Danielle
2018-06-25
Intelligence is highly heritable 1 and a major determinant of human health and well-being 2 . Recent genome-wide meta-analyses have identified 24 genomic loci linked to variation in intelligence 3-7 , but much about its genetic underpinnings remains to be discovered. Here, we present a large-scale genetic association study of intelligence (n = 269,867), identifying 205 associated genomic loci (190 new) and 1,016 genes (939 new) via positional mapping, expression quantitative trait locus (eQTL) mapping, chromatin interaction mapping, and gene-based association analysis. We find enrichment of genetic effects in conserved and coding regions and associations with 146 nonsynonymous exonic variants. Associated genes are strongly expressed in the brain, specifically in striatal medium spiny neurons and hippocampal pyramidal neurons. Gene set analyses implicate pathways related to nervous system development and synaptic structure. We confirm previous strong genetic correlations with multiple health-related outcomes, and Mendelian randomization analysis results suggest protective effects of intelligence for Alzheimer's disease and ADHD and bidirectional causation with pleiotropic effects for schizophrenia. These results are a major step forward in understanding the neurobiology of cognitive function as well as genetically related neurological and psychiatric disorders.
Chen, Yao; Mohammadi, Moosa; Flanagan, John G.
2009-01-01
Summary Graded guidance labels are widely used in neural map formation, but it is not well understood which potential strategy leads to their graded expression. In midbrain tectal map development, FGFs can induce an entire midbrain, but their protein distribution is unclear, nor is it known whether they may act instructively to produce graded gene expression. Using a receptor-alkaline phosphatase fusion probe, we find a long-range posterior>anterior FGF protein gradient spanning the midbrain. Heparan sulfate proteoglycan (HSPG) is required for this gradient. To test whether graded FGF concentrations can instruct graded gene expression, a quantitative tectal explant assay was developed. Engrailed-2 and ephrin-As, normally in posterior>anterior tectal gradients, showed graded upregulation. Moreover, EphAs, normally in anterior>posterior countergradients, showed coordinately graded downregulation. These results provide a mechanism to establish graded mapping labels, and more generally provide a developmental strategy to coordinately induce a structure and pattern its cell properties in gradients. PMID:19555646
Zhang, Linlin
2017-01-01
The optix gene has been implicated in butterfly wing pattern adaptation by genetic association, mapping, and expression studies. The actual developmental function of this gene has remained unclear, however. Here we used CRISPR/Cas9 genome editing to show that optix plays a fundamental role in nymphalid butterfly wing pattern development, where it is required for determination of all chromatic coloration. optix knockouts in four species show complete replacement of color pigments with melanins, with corresponding changes in pigment-related gene expression, resulting in black and gray butterflies. We also show that optix simultaneously acts as a switch gene for blue structural iridescence in some butterflies, demonstrating simple regulatory coordination of structural and pigmentary coloration. Remarkably, these optix knockouts phenocopy the recurring “black and blue” wing pattern archetype that has arisen on many independent occasions in butterflies. Here we demonstrate a simple genetic basis for structural coloration, and show that optix plays a deeply conserved role in butterfly wing pattern development. PMID:28923944
Zhang, Linlin; Mazo-Vargas, Anyi; Reed, Robert D
2017-10-03
The optix gene has been implicated in butterfly wing pattern adaptation by genetic association, mapping, and expression studies. The actual developmental function of this gene has remained unclear, however. Here we used CRISPR/Cas9 genome editing to show that optix plays a fundamental role in nymphalid butterfly wing pattern development, where it is required for determination of all chromatic coloration. optix knockouts in four species show complete replacement of color pigments with melanins, with corresponding changes in pigment-related gene expression, resulting in black and gray butterflies. We also show that optix simultaneously acts as a switch gene for blue structural iridescence in some butterflies, demonstrating simple regulatory coordination of structural and pigmentary coloration. Remarkably, these optix knockouts phenocopy the recurring "black and blue" wing pattern archetype that has arisen on many independent occasions in butterflies. Here we demonstrate a simple genetic basis for structural coloration, and show that optix plays a deeply conserved role in butterfly wing pattern development.
Miller, Hilary C; O'Meally, Denis; Ezaz, Tariq; Amemiya, Chris; Marshall-Graves, Jennifer A; Edwards, Scott
2015-05-07
Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general. Copyright © 2015 Miller et al.
Construction of an integrated genetic map for Capsicum baccatum L.
Moulin, M M; Rodrigues, R; Ramos, H C C; Bento, C S; Sudré, C P; Gonçalves, L S A; Viana, A P
2015-06-18
Capsicum baccatum L. is one of the five Capsicum domesticated species and has multiple uses in the food, pharmaceutical and cosmetic industries. This species is also a valuable source of genes for chili pepper breeding, especially genes for disease resistance and fruit quality. However, knowledge of the genetic structure of C. baccatum is limited. A reference map for C. baccatum (2n = 2x = 24) based on 42 microsatellite, 85 inter-simple sequence repeat, and 56 random amplified polymorphic DNA markers was constructed using an F2 population consisting of 203 individuals. The map was generated using the JoinMap software (version 4.0) and the linkage groups were formed and ordered using a LOD score of 3.0 and maximum of 40% recombination. The genetic map consisted of 12 major and four minor linkage groups covering a total genome distance of 2547.5 cM with an average distance of 14.25 cM between markers. Of the 152 pairs of microsatellite markers available for Capsicum annuum, 62 were successfully transferred to C. baccatum, generating polymorphism. Forty-two of these markers were mapped, allowing the introduction of C. baccatum in synteny studies with other species of the genus Capsicum.
A Spatial Framework for Understanding Population Structure and Admixture.
Bradburd, Gideon S; Ralph, Peter L; Coop, Graham M
2016-01-01
Geographic patterns of genetic variation within modern populations, produced by complex histories of migration, can be difficult to infer and visually summarize. A general consequence of geographically limited dispersal is that samples from nearby locations tend to be more closely related than samples from distant locations, and so genetic covariance often recapitulates geographic proximity. We use genome-wide polymorphism data to build "geogenetic maps," which, when applied to stationary populations, produces a map of the geographic positions of the populations, but with distances distorted to reflect historical rates of gene flow. In the underlying model, allele frequency covariance is a decreasing function of geogenetic distance, and nonlocal gene flow such as admixture can be identified as anomalously strong covariance over long distances. This admixture is explicitly co-estimated and depicted as arrows, from the source of admixture to the recipient, on the geogenetic map. We demonstrate the utility of this method on a circum-Tibetan sampling of the greenish warbler (Phylloscopus trochiloides), in which we find evidence for gene flow between the adjacent, terminal populations of the ring species. We also analyze a global sampling of human populations, for which we largely recover the geography of the sampling, with support for significant histories of admixture in many samples. This new tool for understanding and visualizing patterns of population structure is implemented in a Bayesian framework in the program SpaceMix.
A Spatial Framework for Understanding Population Structure and Admixture
Bradburd, Gideon S.; Ralph, Peter L.; Coop, Graham M.
2016-01-01
Geographic patterns of genetic variation within modern populations, produced by complex histories of migration, can be difficult to infer and visually summarize. A general consequence of geographically limited dispersal is that samples from nearby locations tend to be more closely related than samples from distant locations, and so genetic covariance often recapitulates geographic proximity. We use genome-wide polymorphism data to build “geogenetic maps,” which, when applied to stationary populations, produces a map of the geographic positions of the populations, but with distances distorted to reflect historical rates of gene flow. In the underlying model, allele frequency covariance is a decreasing function of geogenetic distance, and nonlocal gene flow such as admixture can be identified as anomalously strong covariance over long distances. This admixture is explicitly co-estimated and depicted as arrows, from the source of admixture to the recipient, on the geogenetic map. We demonstrate the utility of this method on a circum-Tibetan sampling of the greenish warbler (Phylloscopus trochiloides), in which we find evidence for gene flow between the adjacent, terminal populations of the ring species. We also analyze a global sampling of human populations, for which we largely recover the geography of the sampling, with support for significant histories of admixture in many samples. This new tool for understanding and visualizing patterns of population structure is implemented in a Bayesian framework in the program SpaceMix. PMID:26771578
Structure and chromosomal localization of the human PD-1 gene (PDCD1)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shinohara, T.; Ishida, Y.; Kawaichi, M.
1994-10-01
A cDNA encoding mouse PD-1, a member of the immunoglobulin superfamily, was previously isolated from apoptosis-induced cells by subtractive hybridization. To determine the structure and chromosomal location of the human PD-1 gene, we screened a human T cell cDNA library by mouse PD-1 probe and isolated a cDNA coding for the human PD-1 protein. The deduced amino acid sequence of human PD-1 was 60% identical to the mouse counterpart, and a putative tyrosine kinase-association motif was well conserved. The human PD-1 gene was mapped to 2q37.3 by chromosomal in situ hybridization. 7 refs., 3 figs.
Jiang, Yiwei
2013-01-01
Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought tolerance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted in 192 diverse perennial ryegrass (Lolium perenne L.) accessions from 43 countries. The panel showed significant variations in leaf wilting, leaf water content, canopy and air temperature difference, and chlorophyll fluorescence under well-watered and drought conditions across six environments. Analysis of 109 simple sequence repeat markers revealed five population structures in the mapping panel. A total of 2520 expression-based sequence readings were obtained for a set of candidate genes involved in antioxidant metabolism, dehydration, water movement across membranes, and signal transduction, from which 346 single nucleotide polymorphisms were identified. Significant associations were identified between a putative LpLEA3 encoding late embryogenesis abundant group 3 protein and a putative LpFeSOD encoding iron superoxide dismutase and leaf water content, as well as between a putative LpCyt Cu-ZnSOD encoding cytosolic copper-zinc superoxide dismutase and chlorophyll fluorescence under drought conditions. Four of these identified significantly associated single nucleotide polymorphisms from these three genes were also translated to amino acid substitutions in different genotypes. These results indicate that allelic variation in these genes may affect whole-plant response to drought stress in perennial ryegrass. PMID:23386684
2011-01-01
Background Technological advances are progressively increasing the application of genomics to a wider array of economically and ecologically important species. High-density maps enriched for transcribed genes facilitate the discovery of connections between genes and phenotypes. We report the construction of a high-density linkage map of expressed genes for the heterozygous genome of Eucalyptus using Single Feature Polymorphism (SFP) markers. Results SFP discovery and mapping was achieved using pseudo-testcross screening and selective mapping to simultaneously optimize linkage mapping and microarray costs. SFP genotyping was carried out by hybridizing complementary RNA prepared from 4.5 year-old trees xylem to an SFP array containing 103,000 25-mer oligonucleotide probes representing 20,726 unigenes derived from a modest size expressed sequence tags collection. An SFP-mapping microarray with 43,777 selected candidate SFP probes representing 15,698 genes was subsequently designed and used to genotype SFPs in a larger subset of the segregating population drawn by selective mapping. A total of 1,845 genes were mapped, with 884 of them ordered with high likelihood support on a framework map anchored to 180 microsatellites with average density of 1.2 cM. Using more probes per unigene increased by two-fold the likelihood of detecting segregating SFPs eventually resulting in more genes mapped. In silico validation showed that 87% of the SFPs map to the expected location on the 4.5X draft sequence of the Eucalyptus grandis genome. Conclusions The Eucalyptus 1,845 gene map is the most highly enriched map for transcriptional information for any forest tree species to date. It represents a major improvement on the number of genes previously positioned on Eucalyptus maps and provides an initial glimpse at the gene space for this global tree genome. A general protocol is proposed to build high-density transcript linkage maps in less characterized plant species by SFP genotyping with a concurrent objective of reducing microarray costs. HIgh-density gene-rich maps represent a powerful resource to assist gene discovery endeavors when used in combination with QTL and association mapping and should be especially valuable to assist the assembly of reference genome sequences soon to come for several plant and animal species. PMID:21492453
Structure, Expression, Chromosomal Location and Product of the Gene Encoding Adh2 in Petunia
Gregerson, R. G.; Cameron, L.; McLean, M.; Dennis, P.; Strommer, J.
1993-01-01
In most higher plants the genes encoding alcohol dehydrogenase comprise a small gene family, usually with two members. The Adh1 gene of Petunia has been cloned and analyzed, but a second identifiable gene was not recovered from any of three genomic libraries. We have therefore employed the polymerase chain reaction to obtain the major portion of a second Adh gene. From sequence, mapping and northern data we conclude this gene encodes ADH2, the major anaerobically inducible Adh gene of Petunia. The availability of both Adh1 and Adh2 from Petunia has permitted us to compare their structures and patterns of expression to those of the well-studied Adh genes of maize, of which one is highly expressed developmentally, while both are induced in response to hypoxia. Despite their evolutionary distance, evidenced by deduced amino acid sequence as well as taxonomic classification, the pairs of genes are regulated in strikingly similar ways in maize and Petunia. Our findings suggest a significant biological basis for the regulatory strategy employed by these distant species for differential expression of multiple Adh genes. PMID:8096485
Ji, Shuiwang
2013-07-11
The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development. In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in situ hybridization gene expression data to a two-dimensional space by preserving both the global and the local structures. Our results show that the developing brain anatomy is largely preserved in the reduced gene expression space. To provide a quantitative analysis, we cluster the reduced data into groups and measure the consistency with neuroanatomy at multiple levels. Our results show that the clusters in the low-dimensional space are more consistent with neuroanatomy than those in the original space. Gene expression patterns and developing brain anatomy are closely related. Dimensionality reduction and visual exploration facilitate the study of this relationship.
Nucleosome Positioning and NDR Structure at RNA Polymerase III Promoters
NASA Astrophysics Data System (ADS)
Helbo, Alexandra Søgaard; Lay, Fides D.; Jones, Peter A.; Liang, Gangning; Grønbæk, Kirsten
2017-02-01
Chromatin is structurally involved in the transcriptional regulation of all genes. While the nucleosome positioning at RNA polymerase II (pol II) promoters has been extensively studied, less is known about the chromatin structure at pol III promoters in human cells. We use a high-resolution analysis to show substantial differences in chromatin structure of pol II and pol III promoters, and between subtypes of pol III genes. Notably, the nucleosome depleted region at the transcription start site of pol III genes extends past the termination sequences, resulting in nucleosome free gene bodies. The +1 nucleosome is located further downstream than at pol II genes and furthermore displays weak positioning. The variable position of the +1 location is seen not only within individual cell populations and between cell types, but also between different pol III promoter subtypes, suggesting that the +1 nucleosome may be involved in the transcriptional regulation of pol III genes. We find that expression and DNA methylation patterns correlate with distinct accessibility patterns, where DNA methylation associates with the silencing and inaccessibility at promoters. Taken together, this study provides the first high-resolution map of nucleosome positioning and occupancy at human pol III promoters at specific loci and genome wide.
Linkage Map of Escherichia coli K-12, Edition 10: The Traditional Map
Berlyn, Mary K. B.
1998-01-01
This map is an update of the edition 9 map by Berlyn et al. (M. K. B. Berlyn, K. B. Low, and K. E. Rudd, p. 1715–1902, in F. C. Neidhardt et al., ed., Escherichia coli and Salmonella: cellular and molecular biology, 2nd ed., vol. 2, 1996). It uses coordinates established by the completed sequence, expressed as 100 minutes for the entire circular map, and adds new genes discovered and established since 1996 and eliminates those shown to correspond to other known genes. The latter are included as synonyms. An alphabetical list of genes showing map location, synonyms, the protein or RNA product of the gene, phenotypes of mutants, and reference citations is provided. In addition to genes known to correspond to gene sequences, other genes, often older, that are described by phenotype and older mapping techniques and that have not been correlated with sequences are included. PMID:9729611
Gaber, Richard F.; Mathison, Lorilee; Edelman, Irv; Culbertson, Michael R.
1983-01-01
Five previously unmapped frameshift suppressor genes have been located on the yeast genetic map. In addition, we have further characterized the map positions of two suppressors whose approximate locations were determined in an earlier study. These results represent the completion of genetic mapping studies on all 25 of the known frameshift suppressor genes in yeast.—The approximate location of each suppressor gene was initially determined through the use of a set of mapping strains containing 61 signal markers distributed throughout the yeast genome. Standard meiotic linkage was assayed in crosses between strains carrying the suppressors and the mapping strains. Subsequent to these approximate linkage determinations, each suppressor gene was more precisely located in multi-point crosses. The implications of these mapping results for the genomic distribution of frameshift suppressor genes, which include both glycine and proline tRNA genes, are discussed. PMID:17246112
Pengelly, Reuben J; Tapper, William; Gibson, Jane; Knut, Marcin; Tearle, Rick; Collins, Andrew; Ennis, Sarah
2015-09-03
An understanding of linkage disequilibrium (LD) structures in the human genome underpins much of medical genetics and provides a basis for disease gene mapping and investigating biological mechanisms such as recombination and selection. Whole genome sequencing (WGS) provides the opportunity to determine LD structures at maximal resolution. We compare LD maps constructed from WGS data with LD maps produced from the array-based HapMap dataset, for representative European and African populations. WGS provides up to 5.7-fold greater SNP density than array-based data and achieves much greater resolution of LD structure, allowing for identification of up to 2.8-fold more regions of intense recombination. The absence of ascertainment bias in variant genotyping improves the population representativeness of the WGS maps, and highlights the extent of uncaptured variation using array genotyping methodologies. The complete capture of LD patterns using WGS allows for higher genome-wide association study (GWAS) power compared to array-based GWAS, with WGS also allowing for the analysis of rare variation. The impact of marker ascertainment issues in arrays has been greatest for Sub-Saharan African populations where larger sample sizes and substantially higher marker densities are required to fully resolve the LD structure. WGS provides the best possible resource for LD mapping due to the maximal marker density and lack of ascertainment bias. WGS LD maps provide a rich resource for medical and population genetics studies. The increasing availability of WGS data for large populations will allow for improved research utilising LD, such as GWAS and recombination biology studies.
Primary structure and mapping of the hupA gene of Salmonella typhimurium.
Higgins, N P; Hillyard, D
1988-01-01
In bacteria, the complex nucleoid structure is folded and maintained by negative superhelical tension and a set of type II DNA-binding proteins, also called histonelike proteins. The most abundant type II DNA-binding protein is HU. Southern blot analysis showed that Salmonella typhimurium contained two HU genes that corresponded to Escherichia coli genes hupA (encoding HU-2 protein) and hupB (encoding HU-1). Salmonella hupA was cloned, and the nucleotide sequence of the gene was determined. Comparison of hupA of E. coli and S. typhimurium revealed that the HU-2 proteins were identical and that there was high conservation of nucleotide sequences outside the coding frames of the genes. A 300-member genomic library of S. typhimurium was constructed by using random transposition of MudP, a specialized chimeric P22-Mu phage that packages chromosomal DNA unidirectionally from its insertion point. Oligonucleotide hybridization against the library identified one MudP insertion that lies within 28 kilobases of hupA; the MudP was 12% linked to purH at 90.5 min on the standard map. Plasmids expressing HU-2 had a surprising phenotype; they caused growth arrest when they were introduced into E. coli strains bearing a himA or hip mutation. These results suggest that IHF and HU have interactive roles in bacteria. Images PMID:3056912
Gillen, K L; Hughes, K T
1991-01-01
The complex regulation of flagellin gene expression in Salmonella typhimurium was characterized in vivo by using lac transcriptional fusions to the two flagellin structural genes (fliC [H1] and fljB [H2]). Phase variation was measured as the rate of switching of flagellin gene expression. Switching frequencies varied from 1/500 per cell per generation to 1/10,000 per cell per generation depending on the particular insertion and the direction of switching. There is a 4- to 20-fold bias in favor of switching from the fljB(On) to the fljB(Off) orientation. Random Tn10dTc insertions were isolated which failed to express flagellin. While most of these insertions mapped to loci known to be required for flagellin expression, several new loci were identified. The presence of functional copies of all of the genes responsible for complete flagellar assembly, except the hook-associated proteins (flgK, flgL, and fliD gene products), were required for expression of the fliC or fljB flagellin genes. Two novel loci involved in negative regulation of fliC and fljB in fla mutant backgrounds were identified. One of these loci, designated the flgR locus, mapped to the flg operon at 23 min on the Salmonella linkage map. An flgR insertion mutation resulted in relief of repression of the fliC and fljB genes in all fla mutant backgrounds except for mutants in the positive regulatory loci (flhC, flhD, and fliA genes). PMID:1848842
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mucklow, S.; Hartnell, A.; Crocker, P.R.
1995-07-20
Sialoadhesin is a cell-cell interaction molecule expressed by subpopulations of tissue macrophages. It contains 17 immunoglobulin (Ig)-like domains and is structurally related to CD22, MAG, and CD33. These molecules establish a distinct family of sialic acid-dependent adhesion molecules, the sialoadhesin family. We have mapped the rodent sialoadhesin gene, Sn, to chromosome 2F-H1 by in situ hybridization (ISH) and shown linkage to Il1b and four other markers by backcross linkage analysis. We have also used ISH and a human-mouse somatic cell hybrid panel to localize the human sialoadhesin gene, SN, to the conserved syntenic region on human chromosome 20p13. This demonstratesmore » that the sialoadhesin gene is not linked to the other members of the sialoadhesin family, CD22, MAG. and CD33, which have been independently mapped to the distal region of mouse chromosome 7 and to human chromosome 19q13.1-3. 19 refs., 1 fig.« less
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2017-04-01
Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Comparative physical mapping between wheat chromosome arm 2BL and rice chromosome 4.
Lee, Tong Geon; Lee, Yong Jin; Kim, Dae Yeon; Seo, Yong Weon
2010-12-01
Physical maps of chromosomes provide a framework for organizing and integrating diverse genetic information. DNA microarrays are a valuable technique for physical mapping and can also be used to facilitate the discovery of single feature polymorphisms (SFPs). Wheat chromosome arm 2BL was physically mapped using a Wheat Genome Array onto near-isogenic lines (NILs) with the aid of wheat-rice synteny and mapped wheat EST information. Using high variance probe set (HVP) analysis, 314 HVPs constituting genes present on 2BL were identified. The 314 HVPs were grouped into 3 categories: HVPs that match only rice chromosome 4 (298 HVPs), those that match only wheat ESTs mapped on 2BL (1), and those that match both rice chromosome 4 and wheat ESTs mapped on 2BL (15). All HVPs were converted into gene sets, which represented either unique rice gene models or mapped wheat ESTs that matched identified HVPs. Comparative physical maps were constructed for 16 wheat gene sets and 271 rice gene sets. Of the 271 rice gene sets, 257 were mapped to the 18-35 Mb regions on rice chromosome 4. Based on HVP analysis and sequence similarity between the gene models in the rice chromosomes and mapped wheat ESTs, the outermost rice gene model that limits the translocation breakpoint to orthologous regions was identified.
The standard operating procedure of the DOE-JGI Metagenome Annotation Pipeline (MAP v.4)
Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos; ...
2016-02-24
The DOE-JGI Metagenome Annotation Pipeline (MAP v.4) performs structural and functional annotation for metagenomic sequences that are submitted to the Integrated Microbial Genomes with Microbiomes (IMG/M) system for comparative analysis. The pipeline runs on nucleotide sequences provide d via the IMG submission site. Users must first define their analysis projects in GOLD and then submit the associated sequence datasets consisting of scaffolds/contigs with optional coverage information and/or unassembled reads in fasta and fastq file formats. The MAP processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNAs, as well as CRISPR elements. Structural annotation ismore » followed by functional annotation including assignment of protein product names and connection to various protein family databases.« less
The standard operating procedure of the DOE-JGI Metagenome Annotation Pipeline (MAP v.4)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Huntemann, Marcel; Ivanova, Natalia N.; Mavromatis, Konstantinos
The DOE-JGI Metagenome Annotation Pipeline (MAP v.4) performs structural and functional annotation for metagenomic sequences that are submitted to the Integrated Microbial Genomes with Microbiomes (IMG/M) system for comparative analysis. The pipeline runs on nucleotide sequences provide d via the IMG submission site. Users must first define their analysis projects in GOLD and then submit the associated sequence datasets consisting of scaffolds/contigs with optional coverage information and/or unassembled reads in fasta and fastq file formats. The MAP processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNAs, as well as CRISPR elements. Structural annotation ismore » followed by functional annotation including assignment of protein product names and connection to various protein family databases.« less
Genetic diversity and accession structure in European Cynara cardunculus collections
Fernández, Juan A.; Sonnante, Gabriella; Egea-Gilabert, Catalina
2017-01-01
Understanding the distribution of genetic variations and accession structures is an important factor for managing genetic resources, but also for using proper germplasm in association map analyses and breeding programs. The globe artichoke is the fourth most important horticultural crop in Europe. Here, we report the results of a molecular analysis of a collection including globe artichoke and leafy cardoon germplasm present in the Italian, French and Spanish gene banks. The aims of this study were to: (i) assess the diversity present in European collections, (ii) determine the population structure, (iii) measure the genetic distance between accessions; (iv) cluster the accessions; (v) properly distinguish accessions present in the different national collections carrying the same name; and (vi) understand the diversity distribution in relation to the gene bank and the geographic origin of the germplasm. A total of 556 individuals grouped into 174 accessions of distinct typologies were analyzed by different types of molecular markers, i.e. dominant (ISSR and AFLP) and co-dominant (SSR). The data of the two crops (globe artichoke and leafy cardoon) were analyzed jointly and separately to compute, among other aims, the gene diversity, heterozygosity (He, Ho), fixation indexes, AMOVA, genetic distance and structure. The findings underline the huge diversity present in the analyzed material, and the existence of alleles that are able to discriminate among accessions. The accessions were clustered not only on the basis of their typology, but also on the basis of the gene bank they come from. Probably, the environmental conditions of the different field gene banks affected germplasm conservation. These outcomes will be useful in plant breeding to select accessions and to fingerprint varieties. Moreover, the results highlight the particular attention that should be paid to the method used to conserve the Cynara cardunculus germplasm and suggest to the preference of using accessions from different gene banks to run an association map. PMID:28570688
Holland, M J; Holland, J P; Thill, G P; Jackson, K A
1981-02-10
Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5- noncoding portions of these glycolytic genes.
Cloning and Characterization of the Scalloped Region of Drosophila Melanogaster
Campbell, S. D.; Duttaroy, A.; Katzen, A. L.; Chovnick, A.
1991-01-01
Viable mutants of the scalloped gene (sd) of Drosophila melanogaster exhibit defects that can include gapping of the wing margin and ectopic bristle formation on the wing. Lethal sd alleles characterized in the present study now implicate this gene in a genetic function essential for normal development. In order to further characterize the developmental role of this gene, we have undertaken to clone and characterize the region where sd maps. A P[ry(+)] transposon insertion at 13F associated with sd([ry+2216]) served as the starting point for a 42-kb chromosomal walk. Molecular lesions associated with viable and lethal sd alleles were characterized by genomic hybridization analysis as a means of defining the extent of the gene. DNA rearrangements associated with 11 viable sd alleles map to a 2-kb interval which appears to be a ``hot spot'' for P element activity. Four of five recessive lethal sd mutations were mapped by denaturing gradient gel electrophoresis to a region 12-14 kb away from the region of viable lesions. In a sd(+) genotype, at least two structurally related and developmentally regulated transcripts hybridize to the genomic region where several sd lethal alleles have been localized. A viable mutation, sd(58), used for comparison in the transcript analysis, makes at least two slightly smaller transcripts that also hybridize to this region. Preliminary analysis of cDNA clones has identified three structurally related transcripts that hybridize to this genomic region. The 5' end of these transcripts extends into the 2-kb genomic region wherein DNA rearrangements were seen in the P element rearrangements. We favor the view that the transcripts represented by these cDNA clones are products of the sd gene. If this is true, the sd gene would include genomic sequences extending over at least 14 kb of the described chromosomal walk, and would appear to be subject to alternative splicing. PMID:1706292
Digital transcriptome analysis of putative sex-determination genes in papaya (Carica papaya).
Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo
2012-01-01
Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Y(h)) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Y(h) chromosome, implying a loss of many genes on the Y(h) chromosome. Nevertheless, candidate Y(h) chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya.
Digital Transcriptome Analysis of Putative Sex-Determination Genes in Papaya (Carica papaya)
Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo
2012-01-01
Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Yh) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Yh chromosome, implying a loss of many genes on the Yh chromosome. Nevertheless, candidate Yh chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya. PMID:22815863
Structure and expression of dna methyltransferase genes from apomictic and sexual Boechera species.
Taşkin, Kemal Melik; Özbilen, Aslıhan; Sezer, Fatih; Hürkan, Kaan; Güneş, Şebnem
2017-04-01
In this study, we determined the structure of DNA methyltransferase (DNMT) genes in apomict and sexual Boechera species and investigated the expression levels during seed development. Protein and DNA sequences of diploid sexual Boechera stricta DNMT genes obtained from Phytozome 10.3 were used to identify the homologues in apomicts, Boechera holboellii and Boechera divaricarpa. Geneious R8 software was used to map the short-paired reads library of B. holboellii whole genome or B. divaricarpa transcriptome reads to the reference gene sequences. We determined three DNMT genes; for Boechera spp. METHYLTRANSFERASE1 (MET1), CHROMOMETHYLASE 3 (CMT3) and DOMAINS REARRANGED METHYLTRANSFERASE 1/2 (DRM2). We examined the structure of these genes with bioinformatic tools and compared with other DNMT genes in plants. We also examined the levels of expression in silique tissues after fertilization by semi-quantitative PCR. The structure of DNMT proteins in apomict and sexual Boechera species share common features. However, the expression levels of DNMT genes were different in apomict and sexual Boechera species. We found that DRM2 was upregulated in apomictic Boechera species after fertilization. Phylogenetic trees showed that three genes are conserved among green algae, monocotyledons and dicotyledons. Our results indicated a deregulation of DNA methylation machinery during seed development in apomicts. Copyright © 2016 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Pathak, Sayan D.; Haynor, David R.; Thompson, Carol L.; Lein, Ed; Hawrylycz, Michael
2009-02-01
Understanding the geography of genetic expression in the mouse brain has opened previously unexplored avenues in neuroinformatics. The Allen Brain Atlas (www.brain-map.org) (ABA) provides genome-wide colorimetric in situ hybridization (ISH) gene expression images at high spatial resolution, all mapped to a common three-dimensional 200μm3 spatial framework defined by the Allen Reference Atlas (ARA) and is a unique data set for studying expression based structural and functional organization of the brain. The goal of this study was to facilitate an unbiased data-driven structural partitioning of the major structures in the mouse brain. We have developed an algorithm that uses nonnegative matrix factorization (NMF) to perform parts based analysis of ISH gene expression images. The standard NMF approach and its variants are limited in their ability to flexibly integrate prior knowledge, in the context of spatial data. In this paper, we introduce spatial connectivity as an additional regularization in NMF decomposition via the use of Markov Random Fields (mNMF). The mNMF algorithm alternates neighborhood updates with iterations of the standard NMF algorithm to exploit spatial correlations in the data. We present the algorithm and show the sub-divisions of hippocampus and somatosensory-cortex obtained via this approach. The results are compared with established neuroanatomic knowledge. We also highlight novel gene expression based sub divisions of the hippocampus identified by using the mNMF algorithm.
The ecoresponsive genome of Daphnia pulex
DOE Office of Scientific and Technical Information (OSTI.GOV)
Colbourne, John K.; Pfrender, Michael E.; Gilbert, Donald
2011-02-04
This document provides supporting material related to the sequencing of the ecoresponsive genome of Daphnia pulex. This material includes information on materials and methods and supporting text, as well as supplemental figures, tables, and references. The coverage of materials and methods addresses genome sequence, assembly, and mapping to chromosomes, gene inventory, attributes of a compact genome, the origin and preservation of Daphnia pulex genes, implications of Daphnia's genome structure, evolutionary diversification of duplicated genes, functional significance of expanded gene families, and ecoresponsive genes. Supporting text covers chromosome studies, gene homology among Daphnia genomes, micro-RNA and transposable elements and the 46more » Daphnia pulex opsins. 36 figures, 50 tables, 183 references.« less
Reinprecht, Yarmilla; Yadegari, Zeinab; Perry, Gregory E.; Siddiqua, Mahbuba; Wright, Lori C.; McClean, Phillip E.; Pauls, K. Peter
2013-01-01
Legumes contain a variety of phytochemicals derived from the phenylpropanoid pathway that have important effects on human health as well as seed coat color, plant disease resistance and nodulation. However, the information about the genes involved in this important pathway is fragmentary in common bean (Phaseolus vulgaris L.). The objectives of this research were to isolate genes that function in and control the phenylpropanoid pathway in common bean, determine their genomic locations in silico in common bean and soybean, and analyze sequences of the 4CL gene family in two common bean genotypes. Sequences of phenylpropanoid pathway genes available for common bean or other plant species were aligned, and the conserved regions were used to design sequence-specific primers. The PCR products were cloned and sequenced and the gene sequences along with common bean gene-based (g) markers were BLASTed against the Glycine max v.1.0 genome and the P. vulgaris v.1.0 (Andean) early release genome. In addition, gene sequences were BLASTed against the OAC Rex (Mesoamerican) genome sequence assembly. In total, fragments of 46 structural and regulatory phenylpropanoid pathway genes were characterized in this way and placed in silico on common bean and soybean sequence maps. The maps contain over 250 common bean g and SSR (simple sequence repeat) markers and identify the positions of more than 60 additional phenylpropanoid pathway gene sequences, plus the putative locations of seed coat color genes. The majority of cloned phenylpropanoid pathway gene sequences were mapped to one location in the common bean genome but had two positions in soybean. The comparison of the genomic maps confirmed previous studies, which show that common bean and soybean share genomic regions, including those containing phenylpropanoid pathway gene sequences, with conserved synteny. Indels identified in the comparison of Andean and Mesoamerican common bean 4CL gene sequences might be used to develop inter-pool phenylpropanoid pathway gene-based markers. We anticipate that the information obtained by this study will simplify and accelerate selections of common bean with specific phenylpropanoid pathway alleles to increase the contents of beneficial phenylpropanoids in common bean and other legumes. PMID:24046770
Identification and characterization of a novel serine-threonine kinase gene from the Xp22 region.
Montini, E; Andolfi, G; Caruso, A; Buchner, G; Walpole, S M; Mariani, M; Consalez, G; Trump, D; Ballabio, A; Franco, B
1998-08-01
Eukaryotic protein kinases are part of a large and expanding family of proteins. Through our transcriptional mapping effort in the Xp22 region, we have isolated and sequenced the full-length transcript of STK9, a novel cDNA highly homologous to serine-threonine kinases. A number of human genetic disorders have been mapped to the region where STK9 has been localized including Nance-Horan (NH) syndrome, oral-facial-digital syndrome type 1 (OFD1), and a novel locus for nonsyndromic sensorineural deafness (DFN6). To evaluate the possible involvement of STK9 in any of the above-mentioned disorders, a 2416-bp full-length cDNA was assembled. The entire genomic structure of the gene, which is composed of 20 coding exons, was determined. Northern analysis revealed a transcript larger than 9.5 kb in several tissues including brain, lung, and kidney. The mouse homologue (Stk9) was identified and mapped in the mouse in the region syntenic to human Xp. This location is compatible with the location of the Xcat mutant, which shows congenital cataracts very similar to those observed in NH patients. Sequence homologies, expression pattern, and mapping information in both human and mouse make STK9 a candidate gene for the above-mentioned disorders. Copyright 1998 Academic Press.
Guillet-Claude, Carine; Isabel, Nathalie; Pelgas, Betty; Bousquet, Jean
2004-12-01
Class I knox genes code for transcription factors that play an essential role in plant growth and development as central regulators of meristem cell identity. Based on the analysis of new cDNA sequences from various tissues and genomic DNA sequences, we identified a highly diversified group of class I knox genes in conifers. Phylogenetic analyses of complete amino acid sequences from various seed plants indicated that all conifer sequences formed a monophyletic group. Within conifers, four subgroups here named genes KN1 to KN4 were well delineated, each regrouping pine and spruce sequences. KN4 was sister group to KN3, which was sister group to KN1 and KN2. Genetic mapping on the genomes of two divergent Picea species indicated that KN1 and KN2 are located close to each other on the same linkage group, whereas KN3 and KN4 mapped on different linkage groups, correlating the more ancient divergence of these two genes. The proportion of synonymous and nonsynonymous substitutions suggested intense purifying selection for the four genes. However, rates of substitution per year indicated an evolution in two steps: faster rates were noted after gene duplications, followed subsequently by lower rates. Positive directional selection was detected for most of the internal branches harboring an accelerated rate of evolution. In addition, many sites with highly significant amino acid rate shift were identified between these branches. However, the tightly linked KN1 and KN2 did not diverge as much from each other. The implications of the correlation between phylogenetic, structural, and functional information are discussed in relation to the diversification of the knox-I gene family in conifers.
Mapping eQTL Networks with Mixed Graphical Markov Models
Tur, Inma; Roverato, Alberto; Castelo, Robert
2014-01-01
Expression quantitative trait loci (eQTL) mapping constitutes a challenging problem due to, among other reasons, the high-dimensional multivariate nature of gene-expression traits. Next to the expression heterogeneity produced by confounding factors and other sources of unwanted variation, indirect effects spread throughout genes as a result of genetic, molecular, and environmental perturbations. From a multivariate perspective one would like to adjust for the effect of all of these factors to end up with a network of direct associations connecting the path from genotype to phenotype. In this article we approach this challenge with mixed graphical Markov models, higher-order conditional independences, and q-order correlation graphs. These models show that additive genetic effects propagate through the network as function of gene–gene correlations. Our estimation of the eQTL network underlying a well-studied yeast data set leads to a sparse structure with more direct genetic and regulatory associations that enable a straightforward comparison of the genetic control of gene expression across chromosomes. Interestingly, it also reveals that eQTLs explain most of the expression variability of network hub genes. PMID:25271303
Mapping the Schizophrenia Genes by Neuroimaging: The Opportunities and the Challenges
2018-01-01
Schizophrenia (SZ) is a heritable brain disease originating from a complex interaction of genetic and environmental factors. The genes underpinning the neurobiology of SZ are largely unknown but recent data suggest strong evidence for genetic variations, such as single nucleotide polymorphisms, making the brain vulnerable to the risk of SZ. Structural and functional brain mapping of these genetic variations are essential for the development of agents and tools for better diagnosis, treatment and prevention of SZ. Addressing this, neuroimaging methods in combination with genetic analysis have been increasingly used for almost 20 years. So-called imaging genetics, the opportunities of this approach along with its limitations for SZ research will be outlined in this invited paper. While the problems such as reproducibility, genetic effect size, specificity and sensitivity exist, opportunities such as multivariate analysis, development of multisite consortia for large-scale data collection, emergence of non-candidate gene (hypothesis-free) approach of neuroimaging genetics are likely to contribute to a rapid progress for gene discovery besides to gene validation studies that are related to SZ. PMID:29324666
Trapitz, P; Glätzer, K H; Bünemann, H
1992-11-01
The understanding of structure and function of the so-called fertility genes of Drosophila is very limited due to their unusual size--several megabases--and their location on the heterochromatic Y chromosome. Since mapping of these genes has mainly been done by classical cytogenetic analyses using a small number of cytologically visible lampbrush loops as the sole markers for particular fertility genes, the resolution of the genetic map of the Y chromosome is restricted to 3-5 Mb. Here we demonstrate that a substantially finer subdivision of the megabase-sized fertility genes in the subtelomeric regions of the Y chromosome of Drosophila hydei can be achieved by a combination of digestion with restriction enzymes having 6 bp recognition sequences, and pulsed field gel electrophoresis. The physical subdivision is based upon large conserved fragments of repetitive DNA in the size range from 50 up to 1600 kb and refers to the long-range organization of several families of repetitive DNA involved in Y chromosomal transcription processes in primary spermatocytes. We conclude from our results that at least five different families of repetitive DNA specifically transcribed on the lampbrush loops nooses and threads are organized as extended clusters of several hundred kb, essentially free of interspersed non-repetitive sequences.
Petti, Carloalberto; Hirano, Ko; Stork, Jozsef; DeBolt, Seth
2015-09-01
Here, we show a mechanism for expansion regulation through mutations in the green revolution gene gibberellin20 (GA20)-oxidase and show that GAs control biosynthesis of the plants main structural polymer cellulose. Within a 12,000 mutagenized Sorghum bicolor plant population, we identified a single cellulose-deficient and male gametophyte-dysfunctional mutant named dwarf1-1 (dwf1-1). Through the Sorghum propinquum male/dwf1-1 female F2 population, we mapped dwf1-1 to a frameshift in GA20-oxidase. Assessment of GAs in dwf1-1 revealed ablation of GA. GA ablation was antagonistic to the expression of three specific cellulose synthase genes resulting in cellulose deficiency and growth dwarfism, which were complemented by exogenous bioactive gibberellic acid application. Using quantitative polymerase chain reaction, we found that GA was positively regulating the expression of a subset of specific cellulose synthase genes. To cross reference data from our mapped Sorghum sp. allele with another monocotyledonous plant, a series of rice (Oryza sativa) mutants involved in GA biosynthesis and signaling were isolated, and these too displayed cellulose deficit. Taken together, data support a model whereby suppressed expansion in green revolution GA genes involves regulation of cellulose biosynthesis. © 2015 American Society of Plant Biologists. All Rights Reserved.
Kelemen, Arpad; Vasilakos, Athanasios V; Liang, Yulan
2009-09-01
Comprehensive evaluation of common genetic variations through association of single-nucleotide polymorphism (SNP) structure with common complex disease in the genome-wide scale is currently a hot area in human genome research due to the recent development of the Human Genome Project and HapMap Project. Computational science, which includes computational intelligence (CI), has recently become the third method of scientific enquiry besides theory and experimentation. There have been fast growing interests in developing and applying CI in disease mapping using SNP and haplotype data. Some of the recent studies have demonstrated the promise and importance of CI for common complex diseases in genomic association study using SNP/haplotype data, especially for tackling challenges, such as gene-gene and gene-environment interactions, and the notorious "curse of dimensionality" problem. This review provides coverage of recent developments of CI approaches for complex diseases in genetic association study with SNP/haplotype data.
Drath, Miriam; Baier, Kerstin; Forchhammer, Karl
2009-05-01
Methionine aminopeptidases (MetAPs or MAPs, encoded by map genes) are ubiquitous and pivotal enzymes for protein maturation in all living organisms. Whereas most bacteria harbour only one map gene, many cyanobacterial genomes contain two map paralogues, the genome of Synechocystis sp. PCC 6803 even three. The physiological function of multiple map paralogues remains elusive so far. This communication reports for the first time differential MetAP function in a cyanobacterium. In Synechocystis sp. PCC 6803, the universally conserved mapC gene (sll0555) is predominantly expressed in exponentially growing cells and appears to be a housekeeping gene. By contrast, expression of mapA (slr0918) and mapB (slr0786) genes increases during stress conditions. The mapB paralogue is only transiently expressed, whereas the widely distributed mapA gene appears to be the major MetAP during stress conditions. A mapA-deficient Synechocystis mutant shows a subtle impairment of photosystem II properties even under non-stressed conditions. In particular, the binding site for the quinone Q(B) is affected, indicating specific N-terminal methionine processing requirements of photosystem II components. MAP-A-specific processing becomes essential under certain stress conditions, since the mapA-deficient mutant is severely impaired in surviving conditions of prolonged nitrogen starvation and high light exposure.
Chromosome map of the thermophilic archaebacterium Thermococcus celer
NASA Technical Reports Server (NTRS)
Noll, K. M.; Woese, C. R. (Principal Investigator)
1989-01-01
A physical map for the chromosome of the thermophilic archaebacterium Thermococcus celer Vu13 has been constructed. Thirty-four restriction endonucleases were tested for their ability to generate large restriction fragments from the chromosome of T. celer. Of these, the enzymes NheI, SpeI, and XbaI yielded the fewest fragments when analyzed by pulsed-field electrophoresis. NheI and SpeI each gave 5 fragments, while XbaI gave 12. The size of the T. celer chromosome was determined from the sum of the apparent sizes of restriction fragments derived from single and double digests by using these enzymes and was found to be 1,890 +/- 27 kilobase pairs. Partial and complete digests allowed the order of all but three small (less than 15 kilobase pairs) fragments to be deduced. These three fragments were assigned positions by using hybridization probes derived from these restriction fragments. The positions of the other fragments were confirmed by using hybridization probes derived in the same manner. The positions of the 5S, 16S, and 23S rRNA genes as well as the 7S RNA gene were located on this map by using cloned portions of these genes as hybridization probes. The 5S rRNA gene was localized 48 to 196 kilobases from the 5' end of the 16S gene. The 7S RNA gene was localized 190 to 504 kilobases from the 3' end of the 23S gene. These analyses demonstrated that the chromosome of T. celer is a single, circular DNA molecule. This is the first such demonstration of the structure of an archaebacterial chromosome.
Diotel, Nicolas; Rodriguez Viales, Rebecca; Armant, Olivier; März, Martin; Ferg, Marco; Rastegar, Sepand; Strähle, Uwe
2015-01-01
The zebrafish has become a model to study adult vertebrate neurogenesis. In particular, the adult telencephalon has been an intensely studied structure in the zebrafish brain. Differential expression of transcriptional regulators (TRs) is a key feature of development and tissue homeostasis. Here we report an expression map of 1,202 TR genes in the telencephalon of adult zebrafish. Our results are summarized in a database with search and clustering functions to identify genes expressed in particular regions of the telencephalon. We classified 562 genes into 13 distinct patterns, including genes expressed in the proliferative zone. The remaining 640 genes displayed unique and complex patterns of expression and could thus not be grouped into distinct classes. The neurogenic ventricular regions express overlapping but distinct sets of TR genes, suggesting regional differences in the neurogenic niches in the telencephalon. In summary, the small telencephalon of the zebrafish shows a remarkable complexity in TR gene expression. The adult zebrafish telencephalon has become a model to study neurogenesis. We established the expression pattern of more than 1200 transcription regulators (TR) in the adult telencephalon. The neurogenic regions express overlapping but distinct sets of TR genes suggesting regional differences in the neurogenic potential. J. Comp. Neurol. 523:1202–1221, 2015. © 2015 Wiley Periodicals, Inc. PMID:25556858
Diotel, Nicolas; Rodriguez Viales, Rebecca; Armant, Olivier; März, Martin; Ferg, Marco; Rastegar, Sepand; Strähle, Uwe
2015-06-01
The zebrafish has become a model to study adult vertebrate neurogenesis. In particular, the adult telencephalon has been an intensely studied structure in the zebrafish brain. Differential expression of transcriptional regulators (TRs) is a key feature of development and tissue homeostasis. Here we report an expression map of 1,202 TR genes in the telencephalon of adult zebrafish. Our results are summarized in a database with search and clustering functions to identify genes expressed in particular regions of the telencephalon. We classified 562 genes into 13 distinct patterns, including genes expressed in the proliferative zone. The remaining 640 genes displayed unique and complex patterns of expression and could thus not be grouped into distinct classes. The neurogenic ventricular regions express overlapping but distinct sets of TR genes, suggesting regional differences in the neurogenic niches in the telencephalon. In summary, the small telencephalon of the zebrafish shows a remarkable complexity in TR gene expression. The adult zebrafish telencephalon has become a model to study neurogenesis. We established the expression pattern of more than 1200 transcription regulators (TR) in the adult telencephalon. The neurogenic regions express overlapping but distinct sets of TR genes suggesting regional differences in the neurogenic potential. © 2015 Wiley Periodicals, Inc.
Taye, Mengistie; Kim, Jaemin; Yoon, Sook Hee; Lee, Wonseok; Hanotte, Olivier; Dessie, Tadelle; Kemp, Stephen; Mwai, Okeyo Ally; Caetano-Anolles, Kelsey; Cho, Seoae; Oh, Sung Jong; Lee, Hak-Kyo; Kim, Heebal
2017-02-09
Africa is home to numerous cattle breeds whose diversity has been shaped by subtle combinations of human and natural selection. African Sanga cattle are an intermediate type of cattle resulting from interbreeding between Bos taurus and Bos indicus subspecies. Recently, research has asserted the potential of Sanga breeds for commercial beef production with better meat quality as compared to Bos indicus breeds. Here, we identified meat quality related gene regions that are positively selected in Ankole (Sanga) cattle breeds as compared to indicus (Boran, Ogaden, and Kenana) breeds using cross-population (XP-EHH and XP-CLR) statistical methods. We identified 238 (XP-EHH) and 213 (XP-CLR) positively selected genes, of which 97 were detected from both statistics. Among the genes obtained, we primarily reported those involved in different biological process and pathways associated with meat quality traits. Genes (CAPZB, COL9A2, PDGFRA, MAP3K5, ZNF410, and PKM2) involved in muscle structure and metabolism affect meat tenderness. Genes (PLA2G2A, PARK2, ZNF410, MAP2K3, PLCD3, PLCD1, and ROCK1) related to intramuscular fat (IMF) are involved in adipose metabolism and adipogenesis. MB and SLC48A1 affect meat color. In addition, we identified genes (TIMP2, PKM2, PRKG1, MAP3K5, and ATP8A1) related to feeding efficiency. Among the enriched Gene Ontology Biological Process (GO BP) terms, actin cytoskeleton organization, actin filament-based process, and protein ubiquitination are associated with meat tenderness whereas cellular component organization, negative regulation of actin filament depolymerization and negative regulation of protein complex disassembly are involved in adipocyte regulation. The MAPK pathway is responsible for cell proliferation and plays an important role in hyperplastic growth, which has a positive effect on meat tenderness. Results revealed several candidate genes positively selected in Ankole cattle in relation to meat quality characteristics. The genes identified are involved in muscle structure and metabolism, and adipose metabolism and adipogenesis. These genes help in the understanding of the biological mechanisms controlling beef quality characteristics in African Ankole cattle. These results provide a basis for further research on the genomic characteristics of Ankole and other Sanga cattle breeds for quality beef.
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
A map of copy number variations in Chinese populations.
Lou, Haiyi; Li, Shilin; Yang, Yajun; Kang, Longli; Zhang, Xin; Jin, Wenfei; Wu, Bailin; Jin, Li; Xu, Shuhua
2011-01-01
It has been shown that the human genome contains extensive copy number variations (CNVs). Investigating the medical and evolutionary impacts of CNVs requires the knowledge of locations, sizes and frequency distribution of them within and between populations. However, CNV study of Chinese minorities, which harbor the majority of genetic diversity of Chinese populations, has been underrepresented considering the same efforts in other populations. Here we constructed, to our knowledge, a first CNV map in seven Chinese populations representing the major linguistic groups in China with 1,440 CNV regions identified using Affymetrix SNP 6.0 Array. Considerable differences in distributions of CNV regions between populations and substantial population structures were observed. We showed that ∼35% of CNV regions identified in minority ethnic groups are not shared by Han Chinese population, indicating that the contribution of the minorities to genetic architecture of Chinese population could not be ignored. We further identified highly differentiated CNV regions between populations. For example, a common deletion in Dong and Zhuang (44.4% and 50%), which overlaps two keratin-associated protein genes contributing to the structure of hair fibers, was not observed in Han Chinese. Interestingly, the most differentiated CNV deletion between HapMap CEU and YRI containing CCL3L1 gene reported in previous studies was also the highest differentiated regions between Tibetan and other populations. Besides, by jointly analyzing CNVs and SNPs, we found a CNV region containing gene CTDSPL were in almost perfect linkage disequilibrium between flanking SNPs in Tibetan while not in other populations except HapMap CHD. Furthermore, we found the SNP taggability of CNVs in Chinese populations was much lower than that in European populations. Our results suggest the necessity of a full characterization of CNVs in Chinese populations, and the CNV map we constructed serves as a useful resource in further evolutionary and medical studies.
A Map of Copy Number Variations in Chinese Populations
Yang, Yajun; Kang, Longli; Zhang, Xin; Jin, Wenfei; Wu, Bailin; Jin, Li; Xu, Shuhua
2011-01-01
It has been shown that the human genome contains extensive copy number variations (CNVs). Investigating the medical and evolutionary impacts of CNVs requires the knowledge of locations, sizes and frequency distribution of them within and between populations. However, CNV study of Chinese minorities, which harbor the majority of genetic diversity of Chinese populations, has been underrepresented considering the same efforts in other populations. Here we constructed, to our knowledge, a first CNV map in seven Chinese populations representing the major linguistic groups in China with 1,440 CNV regions identified using Affymetrix SNP 6.0 Array. Considerable differences in distributions of CNV regions between populations and substantial population structures were observed. We showed that ∼35% of CNV regions identified in minority ethnic groups are not shared by Han Chinese population, indicating that the contribution of the minorities to genetic architecture of Chinese population could not be ignored. We further identified highly differentiated CNV regions between populations. For example, a common deletion in Dong and Zhuang (44.4% and 50%), which overlaps two keratin-associated protein genes contributing to the structure of hair fibers, was not observed in Han Chinese. Interestingly, the most differentiated CNV deletion between HapMap CEU and YRI containing CCL3L1 gene reported in previous studies was also the highest differentiated regions between Tibetan and other populations. Besides, by jointly analyzing CNVs and SNPs, we found a CNV region containing gene CTDSPL were in almost perfect linkage disequilibrium between flanking SNPs in Tibetan while not in other populations except HapMap CHD. Furthermore, we found the SNP taggability of CNVs in Chinese populations was much lower than that in European populations. Our results suggest the necessity of a full characterization of CNVs in Chinese populations, and the CNV map we constructed serves as a useful resource in further evolutionary and medical studies. PMID:22087296
Genome Structure of the Legume, Lotus japonicus
Sato, Shusei; Nakamura, Yasukazu; Kaneko, Takakazu; Asamizu, Erika; Kato, Tomohiko; Nakao, Mitsuteru; Sasamoto, Shigemi; Watanabe, Akiko; Ono, Akiko; Kawashima, Kumiko; Fujishiro, Tsunakazu; Katoh, Midori; Kohara, Mitsuyo; Kishida, Yoshie; Minami, Chiharu; Nakayama, Shinobu; Nakazaki, Naomi; Shimizu, Yoshimi; Shinpo, Sayaka; Takahashi, Chika; Wada, Tsuyuko; Yamada, Manabu; Ohmido, Nobuko; Hayashi, Makoto; Fukui, Kiichi; Baba, Tomoya; Nakamichi, Tomoko; Mori, Hirotada; Tabata, Satoshi
2008-01-01
The legume Lotus japonicus has been widely used as a model system to investigate the genetic background of legume-specific phenomena such as symbiotic nitrogen fixation. Here, we report structural features of the L. japonicus genome. The 315.1-Mb sequences determined in this and previous studies correspond to 67% of the genome (472 Mb), and are likely to cover 91.3% of the gene space. Linkage mapping anchored 130-Mb sequences onto the six linkage groups. A total of 10 951 complete and 19 848 partial structures of protein-encoding genes were assigned to the genome. Comparative analysis of these genes revealed the expansion of several functional domains and gene families that are characteristic of L. japonicus. Synteny analysis detected traces of whole-genome duplication and the presence of synteny blocks with other plant genomes to various degrees. This study provides the first opportunity to look into the complex and unique genetic system of legumes. PMID:18511435
Neuhaus, H; Link, G
1987-01-01
The trnK gene endocing the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stemloop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.
Saturation of an intra-gene pool linkage map: toward unified consensus linkage map in common bean
USDA-ARS?s Scientific Manuscript database
Map-based cloning to find genes of interest and marker assisted selection (MAS) requires good genetic maps with high reproducible markers. In this study, we saturated the linkage map of the intra-gene pool population of common bean DOR364×BAT477 (DB) by evaluating 2,706 molecular markers in includin...
Evolution of substrate specificity in a retained enzyme driven by gene loss
Juárez-Vázquez, Ana Lilia; Edirisinghe, Janaka N; Verduzco-Castro, Ernesto A; Michalska, Karolina; Wu, Chenggang; Noda-García, Lianet; Babnigg, Gyorgy; Endres, Michael; Medina-Ruíz, Sofía; Santoyo-Flores, Julián; Carrillo-Tripp, Mauricio; Ton-That, Hung; Joachimiak, Andrzej; Henry, Christopher S; Barona-Gómez, Francisco
2017-01-01
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Our results show how gene loss can drive the evolution of substrate specificity from retained enzymes. DOI: http://dx.doi.org/10.7554/eLife.22679.001 PMID:28362260
Evolution of Substrate Specificity in A Retained Enzyme Driven by Gene Loss
Juarez-Vazquez, Ana L.; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.; ...
2017-03-31
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. Here, we apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We also observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to amore » monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. These results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Evolution of Substrate Specificity in A Retained Enzyme Driven by Gene Loss
DOE Office of Scientific and Technical Information (OSTI.GOV)
Juarez-Vazquez, Ana L.; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. Here, we apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We also observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to amore » monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. These results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Szpirer, C; Szpirer, J; Tissir, F; Stephanova, E; Vanvooren, P; Kurtz, T W; Iwai, N; Inagami, T; Pravenec, M; Kren, V; Klinga-Levan, K; Levan, G
1997-09-01
Seven genes were regionally localized on rat Chromosome (Chr) 1, from 1p11 to 1q42, and two of these genes were also included in a linkage map. This mapping work integrates the genetic linkage map and the cytogenetic map, and allows us to orient the linkage map with respect to the centromere, and to deduce the approximate position of the centromere in the linkage map. These mapping data also indicate that the Slc9a3 gene, encoding the Na+/H+ exchanger 3, is an unlikely candidate for the blood pressure loci assigned to rat Chr 1. These new localizations expand comparative mapping between rat Chr 1 and mouse or human chromosomes.
Nambeesan, Savithri U; Mandel, Jennifer R; Bowers, John E; Marek, Laura F; Ebert, Daniel; Corbi, Jonathan; Rieseberg, Loren H; Knapp, Steven J; Burke, John M
2015-03-11
Shoot branching is an important determinant of plant architecture and influences various aspects of growth and development. Selection on branching has also played an important role in the domestication of crop plants, including sunflower (Helianthus annuus L.). Here, we describe an investigation of the genetic basis of variation in branching in sunflower via association mapping in a diverse collection of cultivated sunflower lines. Detailed phenotypic analyses revealed extensive variation in the extent and type of branching within the focal population. After correcting for population structure and kinship, association analyses were performed using a genome-wide collection of SNPs to identify genomic regions that influence a variety of branching-related traits. This work resulted in the identification of multiple previously unidentified genomic regions that contribute to variation in branching. Genomic regions that were associated with apical and mid-apical branching were generally distinct from those associated with basal and mid-basal branching. Homologs of known branching genes from other study systems (i.e., Arabidopsis, rice, pea, and petunia) were also identified from the draft assembly of the sunflower genome and their map positions were compared to those of associations identified herein. Numerous candidate branching genes were found to map in close proximity to significant branching associations. In sunflower, variation in branching is genetically complex and overall branching patterns (i.e., apical vs. basal) were found to be influenced by distinct genomic regions. Moreover, numerous candidate branching genes mapped in close proximity to significant branching associations. Although the sunflower genome exhibits localized islands of elevated linkage disequilibrium (LD), these non-random associations are known to decay rapidly elsewhere. The subset of candidate genes that co-localized with significant associations in regions of low LD represents the most promising target for future functional analyses.
Chapman, Natalie H; Bonnet, Julien; Grivet, Laurent; Lynn, James; Graham, Neil; Smith, Rebecca; Sun, Guiping; Walley, Peter G; Poole, Mervin; Causse, Mathilde; King, Graham J; Baxter, Charles; Seymour, Graham B
2012-08-01
Fruit firmness in tomato (Solanum lycopersicum) is determined by a number of factors including cell wall structure, turgor, and cuticle properties. Firmness is a complex polygenic trait involving the coregulation of many genes and has proved especially challenging to unravel. In this study, a quantitative trait locus (QTL) for fruit firmness was mapped to tomato chromosome 2 using the Zamir Solanum pennellii interspecific introgression lines (ILs) and fine-mapped in a population consisting of 7,500 F2 and F3 lines from IL 2-3 and IL 2-4. This firmness QTL contained five distinct subpeaks, Fir(s.p.)QTL2.1 to Fir(s.p.)QTL2.5, and an effect on a distal region of IL 2-4 that was nonoverlapping with IL 2-3. All these effects were located within an 8.6-Mb region. Using genetic markers, each subpeak within this combinatorial locus was mapped to a physical location within the genome, and an ethylene response factor (ERF) underlying Fir(s.p.)QTL2.2 and a region containing three pectin methylesterase (PME) genes underlying Fir(s.p.)QTL2.5 were nominated as QTL candidate genes. Statistical models used to explain the observed variability between lines indicated that these candidates and the nonoverlapping portion of IL 2-4 were sufficient to account for the majority of the fruit firmness effects. Quantitative reverse transcription-polymerase chain reaction was used to quantify the expression of each candidate gene. ERF showed increased expression associated with soft fruit texture in the mapping population. In contrast, PME expression was tightly linked with firm fruit texture. Analysis of a range of recombinant lines revealed evidence for an epistatic interaction that was associated with this combinatorial locus.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Overhauser, J.; Mewar, R.; Rojas, K.
1993-02-01
Somatic cell hybrids containing different deleted regions of chromosome 18 derived form patients with balanced translocations or terminal deletions were used to create a deletion mapping panel. Twenty-four sequence-tagged sites (STSs) for 17 genes and 7 anonymous polymorphic DNA fragments were identified. These STSs were used to map the 24 loci to 18 defined regions of chromosome 18. Both ERV1, previously mapped to 18q22-q23, and YES1, previously mapped to 18q21.3, were found to map to 18p11.21-pter. Several genes previously mapped to 18q21 were found to be in the order cen-SSAV1-DCC-FECH-GRP-BCL2-PLANH2-tel. The precise mapping of genes to chromosome 18 should helpmore » in determining whether these genes may be involved in the etiology of specific chromosomal syndromes associated with chromosome 18. The mapping of the poloymorphic loci will assist in the integration of the physical map with the recombination map of chromosome 18. 43 refs., 2 figs., 1 tab.« less
Gu, Ganyu; Smith, Leif; Liu, Aixin; Lu, Shi-En
2011-01-01
A striking feature of Burkholderia contaminans strain MS14 is the production of a glycolipopeptide named occidiofungin. Occidiofungin has a broad range of antifungal activities against plant and animal pathogens. In this study, a complete covalent structure characterization and identification of the whole genomic DNA region for the occidiofungin gene (ocf) cluster are described. Discovery of the presence of 2,4-diaminobutyric acid and 3-chloro-β-hydroxytyrosine and elucidation of the structure of a novel C18 fatty amino acid residue have been achieved. In addition, seven additional putative open reading frames (the genes from ocfI to ocfN [ocfI-N] and ORF16) were identified. Transcription of all the putative genes ocfI-N identified in the region except ORF16 was regulated by both ambR1 and ambR2. Elucidation of the structure and the ocf gene cluster provides insight into the biosynthesis of occidiofungin and promotes future aims at understanding the biosynthetic machinery. This work provides new avenues for optimizing the production and synthesis of structural analogs of occidiofungin. PMID:21742901
Engin, H. Billur; Guney, Emre; Keskin, Ozlem; Oliva, Baldo; Gursoy, Attila
2013-01-01
Blocking specific protein interactions can lead to human diseases. Accordingly, protein interactions and the structural knowledge on interacting surfaces of proteins (interfaces) have an important role in predicting the genotype-phenotype relationship. We have built the phenotype specific sub-networks of protein-protein interactions (PPIs) involving the relevant genes responsible for lung and brain metastasis from primary tumor in breast cancer. First, we selected the PPIs most relevant to metastasis causing genes (seed genes), by using the “guilt-by-association” principle. Then, we modeled structures of the interactions whose complex forms are not available in Protein Databank (PDB). Finally, we mapped mutations to interface structures (real and modeled), in order to spot the interactions that might be manipulated by these mutations. Functional analyses performed on these sub-networks revealed the potential relationship between immune system-infectious diseases and lung metastasis progression, but this connection was not observed significantly in the brain metastasis. Besides, structural analyses showed that some PPI interfaces in both metastasis sub-networks are originating from microbial proteins, which in turn were mostly related with cell adhesion. Cell adhesion is a key mechanism in metastasis, therefore these PPIs may be involved in similar molecular pathways that are shared by infectious disease and metastasis. Finally, by mapping the mutations and amino acid variations on the interface regions of the proteins in the metastasis sub-networks we found evidence for some mutations to be involved in the mechanisms differentiating the type of the metastasis. PMID:24278371
Characterization of a microdissection library from human chromosome region 3p14
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bardenheuer, W.; Szymanski, S.; Lux, A.
1994-01-15
Structural alterations in human chromosome region 3p14-p23 resulting in the inactivation of one or more tumor suppressor genes are thought to play a pathogenic role in small cell lung cancer, renal cell carcinoma, and other human neoplasms. To identify putative tumor suppressor genes, 428 recombinant clones from a microdissection library specific for human chromosome region 3p14 were isolated and characterized. Ninety-six of these (22.5%) were human single-copy DNA sequences, 57 of which were unique sequence clones. Forty-four of these were mapped to the microdissected region using a cell hybrid mapping panel. Within this mapping panel, four probes detected two newmore » chromosome breakpoints that were previously indistinguishable from the translocation breakpoint t(3;8) in 3p14.2 in hereditary renal cell carcinoma. One probe maps to the homozygously deleted region of the small cell lung cancer cell line U2020. In addition, microdissection clones have been shown to be suitable for isolation of yeast artificial chromosomes. 52 refs., 3 figs., 2 tabs.« less
Talukder, Zahirul I; Hulke, Brent S; Qi, Lili; Scheffler, Brian E; Pegadaraju, Venkatramana; McPhee, Kevin; Gulya, Thomas J
2014-01-01
Functional markers for Sclerotinia basal stalk rot resistance in sunflower were obtained using gene-level information from the model species Arabidopsis thaliana. Sclerotinia stalk rot, caused by Sclerotinia sclerotiorum, is one of the most destructive diseases of sunflower (Helianthus annuus L.) worldwide. Markers for genes controlling resistance to S. sclerotiorum will enable efficient marker-assisted selection (MAS). We sequenced eight candidate genes homologous to Arabidopsis thaliana defense genes known to be associated with Sclerotinia disease resistance in a sunflower association mapping population evaluated for Sclerotinia stalk rot resistance. The total candidate gene sequence regions covered a concatenated length of 3,791 bp per individual. A total of 187 polymorphic sites were detected for all candidate gene sequences, 149 of which were single nucleotide polymorphisms (SNPs) and 38 were insertions/deletions. Eight SNPs in the coding regions led to changes in amino acid codons. Linkage disequilibrium decay throughout the candidate gene regions declined on average to an r (2) = 0.2 for genetic intervals of 120 bp, but extended up to 350 bp with r (2) = 0.1. A general linear model with modification to account for population structure was found the best fitting model for this population and was used for association mapping. Both HaCOI1-1 and HaCOI1-2 were found to be strongly associated with Sclerotinia stalk rot resistance and explained 7.4 % of phenotypic variation in this population. These SNP markers associated with Sclerotinia stalk rot resistance can potentially be applied to the selection of favorable genotypes, which will significantly improve the efficiency of MAS during the development of stalk rot resistant cultivars.
Garris, Amanda J; McCouch, Susan R; Kresovich, Stephen
2003-01-01
To assess the usefulness of linkage disequilibrium mapping in an autogamous, domesticated species, we have characterized linkage disequilibrium in the candidate region for xa5, a recessive gene conferring race-specific resistance to bacterial blight in rice. This trait and locus have good mapping information, a tractable phenotype, and available sequence data, but no cloned gene. We sampled 13 short segments from the 70-kb candidate region in 114 accessions of Oryza sativa. Five additional segments were sequenced from the adjacent 45-kb region in resistant accessions to estimate the distance at which linkage disequilibrium decays. The data show significant linkage disequilibrium between sites 100 kb apart. The presence of the xa5 resistant reaction in two ecotypes and in accessions with different haplotypes in the candidate region may indicate multiple origins or genetic heterogeneity for resistance. In addition, genetic differentiation between ecotypes emphasizes the need for controlling for population structure in the design of linkage disequilibrium studies in rice. PMID:14573486
2013-01-01
Background The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development. Results In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in situ hybridization gene expression data to a two-dimensional space by preserving both the global and the local structures. Our results show that the developing brain anatomy is largely preserved in the reduced gene expression space. To provide a quantitative analysis, we cluster the reduced data into groups and measure the consistency with neuroanatomy at multiple levels. Our results show that the clusters in the low-dimensional space are more consistent with neuroanatomy than those in the original space. Conclusions Gene expression patterns and developing brain anatomy are closely related. Dimensionality reduction and visual exploration facilitate the study of this relationship. PMID:23845024
Comparative Reannotation of 21 Aspergillus Genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Salamov, Asaf; Riley, Robert; Kuo, Alan
2013-03-08
We used comparative gene modeling to reannotate 21 Aspergillus genomes. Initial automatic annotation of individual genomes may contain some errors of different nature, e.g. missing genes, incorrect exon-intron structures, 'chimeras', which fuse 2 or more real genes or alternatively splitting some real genes into 2 or more models. The main premise behind the comparative modeling approach is that for closely related genomes most orthologous families have the same conserved gene structure. The algorithm maps all gene models predicted in each individual Aspergillus genome to the other genomes and, for each locus, selects from potentially many competing models, the one whichmore » most closely resembles the orthologous genes from other genomes. This procedure is iterated until no further change in gene models is observed. For Aspergillus genomes we predicted in total 4503 new gene models ( ~;;2percent per genome), supported by comparative analysis, additionally correcting ~;;18percent of old gene models. This resulted in a total of 4065 more genes with annotated PFAM domains (~;;3percent increase per genome). Analysis of a few genomes with EST/transcriptomics data shows that the new annotation sets also have a higher number of EST-supported splice sites at exon-intron boundaries.« less
Zhang, Weihua; Collins, Andrew; Gibson, Jane; Tapper, William J.; Hunt, Sarah; Deloukas, Panos; Bentley, David R.; Morton, Newton E.
2004-01-01
Genetic maps in linkage disequilibrium (LD) units play the same role for association mapping as maps in centimorgans provide at much lower resolution for linkage mapping. Association mapping of genes determining disease susceptibility and other phenotypes is based on the theory of LD, here applied to relations with three phenomena. To test the theory, markers at high density along a 10-Mb continuous segment of chromosome 20q were studied in African-American, Asian, and Caucasian samples. Population structure, whether created by pooling samples from divergent populations or by the mating pattern in a mixed population, is accurately bioassayed from genotype frequencies. The effective bottleneck time for Eurasians is substantially less than for migration out of Africa, reflecting later bottlenecks. The classical dependence of allele frequency on mutation age does not hold for the generally shorter time span of inbreeding and LD. Limitation of the classical theory to mutation age justifies the assumption of constant time in a LD map, except for alleles that were rare at the effective bottleneck time or have arisen since. This assumption is derived from the Malecot model and verified in all samples. Tested measures of relative efficiency, support intervals, and localization error determine the operating characteristics of LD maps that are applicable to every sexually reproducing species, with implications for association mapping, high-resolution linkage maps, evolutionary inference, and identification of recombinogenic sequences. PMID:15604137
Zhang, Weihua; Collins, Andrew; Gibson, Jane; Tapper, William J; Hunt, Sarah; Deloukas, Panos; Bentley, David R; Morton, Newton E
2004-12-28
Genetic maps in linkage disequilibrium (LD) units play the same role for association mapping as maps in centimorgans provide at much lower resolution for linkage mapping. Association mapping of genes determining disease susceptibility and other phenotypes is based on the theory of LD, here applied to relations with three phenomena. To test the theory, markers at high density along a 10-Mb continuous segment of chromosome 20q were studied in African-American, Asian, and Caucasian samples. Population structure, whether created by pooling samples from divergent populations or by the mating pattern in a mixed population, is accurately bioassayed from genotype frequencies. The effective bottleneck time for Eurasians is substantially less than for migration out of Africa, reflecting later bottlenecks. The classical dependence of allele frequency on mutation age does not hold for the generally shorter time span of inbreeding and LD. Limitation of the classical theory to mutation age justifies the assumption of constant time in a LD map, except for alleles that were rare at the effective bottleneck time or have arisen since. This assumption is derived from the Malecot model and verified in all samples. Tested measures of relative efficiency, support intervals, and localization error determine the operating characteristics of LD maps that are applicable to every sexually reproducing species, with implications for association mapping, high-resolution linkage maps, evolutionary inference, and identification of recombinogenic sequences.
Genome-wide diversity and selective pressure in the human rhinovirus
Kistler, Amy L; Webster, Dale R; Rouskin, Silvi; Magrini, Vince; Credle, Joel J; Schnurr, David P; Boushey, Homer A; Mardis, Elaine R; Li, Hao; DeRisi, Joseph L
2007-01-01
Background The human rhinoviruses (HRV) are one of the most common and diverse respiratory pathogens of humans. Over 100 distinct HRV serotypes are known, yet only 6 genomes are available. Due to the paucity of HRV genome sequence, little is known about the genetic diversity within HRV or the forces driving this diversity. Previous comparative genome sequence analyses indicate that recombination drives diversification in multiple genera of the picornavirus family, yet it remains unclear if this holds for HRV. Results To resolve this and gain insight into the forces driving diversification in HRV, we generated a representative set of 34 fully sequenced HRVs. Analysis of these genomes shows consistent phylogenies across the genome, conserved non-coding elements, and only limited recombination. However, spikes of genetic diversity at both the nucleotide and amino acid level are detectable within every locus of the genome. Despite this, the HRV genome as a whole is under purifying selective pressure, with islands of diversifying pressure in the VP1, VP2, and VP3 structural genes and two non-structural genes, the 3C protease and 3D polymerase. Mapping diversifying residues in these factors onto available 3-dimensional structures revealed the diversifying capsid residues partition to the external surface of the viral particle in statistically significant proximity to antigenic sites. Diversifying pressure in the pleconaril binding site is confined to a single residue known to confer drug resistance (VP1 191). In contrast, diversifying pressure in the non-structural genes is less clear, mapping both nearby and beyond characterized functional domains of these factors. Conclusion This work provides a foundation for understanding HRV genetic diversity and insight into the underlying biology driving evolution in HRV. It expands our knowledge of the genome sequence space that HRV reference serotypes occupy and how the pattern of genetic diversity across HRV genomes differs from other picornaviruses. It also reveals evidence of diversifying selective pressure in both structural genes known to interact with the host immune system and in domains of unassigned function in the non-structural 3C and 3D genes, raising the possibility that diversification of undiscovered functions in these essential factors may influence HRV fitness and evolution. PMID:17477878
Fraser, Ross M; Allan, James; Simmen, Martin W
2006-12-08
Nucleosome positioning signals embedded within the DNA sequence have the potential to influence the detailed structure of the higher-order chromatin fibre. In two previous studies of long stretches of DNA, encompassing the chicken beta-globin and ovine beta-lactoglobulin genes, respectively, we mapped the relative affinity of every site for the core histone octamer. In both cases a periodic arrangement of the in vitro positioning sites suggests that they might influence the folding of a nucleosome chain into higher-order structure; this hypothesis was borne out in the case of the beta-lactoglobulin gene, where the distribution of the in vitro positioning sites is related to the positions nucleosomes actually occupy in sheep liver cells. Here, we have exploited the in vitro nucleosome positioning datasets to simulate nucleosomal organisation using in silico approaches. We use the high-resolution, quantitative positioning maps to define a one-dimensional positioning energy lattice, which can be populated with a defined number of nucleosomes. Monte Carlo techniques are employed to simulate the behaviour of the model at equilibrium to produce a set of configurations, which provide a probability-based occupancy map. Employing a variety of techniques we show that the occupancy maps are a sensitive function of the histone octamer density (nucleosome repeat length) and find that a minimal change in this property can produce dramatic localised changes in structure. Although simulations generally give rise to regular periodic nucleosomal arrangements, they often show octamer density-dependent discontinuities, which tend to co-localise with sequences that adopt distinctive chromatin structure in vivo. Furthermore, the overall organisation of simulated chromatin structures are more closely related to the situation in vivo than is the original in vitro positioning data, particularly at a nucleosome density corresponding to the in vivo state. Although our model is simplified, we argue that it provides a unique insight into the influence that DNA sequence can have in determining chromatin structure and could serve as a useful basis for the incorporation of other parameters.
Importance of MAP Kinases during Protoperithecial Morphogenesis in Neurospora crassa
Jeffree, Chris E.; Oborny, Radek; Boonyarungsrit, Patid; Read, Nick D.
2012-01-01
In order to produce multicellular structures filamentous fungi combine various morphogenetic programs that are fundamentally different from those used by plants and animals. The perithecium, the female sexual fruitbody of Neurospora crassa, differentiates from the vegetative mycelium in distinct morphological stages, and represents one of the more complex multicellular structures produced by fungi. In this study we defined the stages of protoperithecial morphogenesis in the N. crassa wild type in greater detail than has previously been described; compared protoperithecial morphogenesis in gene-deletion mutants of all nine mitogen-activated protein (MAP) kinases conserved in N. crassa; confirmed that all three MAP kinase cascades are required for sexual development; and showed that the three different cascades each have distinctly different functions during this process. However, only MAP kinases equivalent to the budding yeast pheromone response and cell wall integrity pathways, but not the osmoregulatory pathway, were essential for vegetative cell fusion. Evidence was obtained for MAP kinase signaling cascades performing roles in extracellular matrix deposition, hyphal adhesion, and envelopment during the construction of fertilizable protoperithecia. PMID:22900028
Xie, Weilong; Perry, Gregory; Martin, C Joe; Shim, Youn-Seb; Navabi, Alireza; Pauls, K Peter
2017-07-01
Common beans (Phaseolus vulgaris) are excellent sources of dietary folates, but different varieties contain different amounts of these compounds. Genes coding for dihydroneopterin aldolase (DHNA) and aminodeoxychorismate synthase (ADCS) of the folate synthesis pathway were characterized by PCR amplification, BAC clone sequencing, and whole genome sequencing. All DHNA and ADCS genes in the Mesoamerican cultivar OAC Rex were isolated and compared with those genes in the genome of Andean genotype G19833. Both genotypes have two functional DHNA genes and one pseudo gene. PvDHNA1 and PvDHNA2 proteins have similar secondary structures and conserved residues as DHNA homologs in Staphylococcus aureus and Arabidopsis. Sequence analysis and synteny mapping indicated that PvDHNA1 might be a duplicated and transposed copy of PvDHNA2. There is only one ADCS gene (PvADCS) identified in the bean genome and it is identical in OAC Rex and G19833. PvADCS has the conserved motifs required for catalytic activity similar to other plant ADCS homologs. DHNA and ADCS gene-specific markers were developed, mapped, and compared to their physical locations on chromosomes 1 and 7, respectively. The gene-specific markers developed in this study should be useful for detection and selection of varieties with enhanced folate contents in bean breeding programs.
A network approach to analyzing highly recombinant malaria parasite genes.
Larremore, Daniel B; Clauset, Aaron; Buckee, Caroline O
2013-01-01
The var genes of the human malaria parasite Plasmodium falciparum present a challenge to population geneticists due to their extreme diversity, which is generated by high rates of recombination. These genes encode a primary antigen protein called PfEMP1, which is expressed on the surface of infected red blood cells and elicits protective immune responses. Var gene sequences are characterized by pronounced mosaicism, precluding the use of traditional phylogenetic tools that require bifurcating tree-like evolutionary relationships. We present a new method that identifies highly variable regions (HVRs), and then maps each HVR to a complex network in which each sequence is a node and two nodes are linked if they share an exact match of significant length. Here, networks of var genes that recombine freely are expected to have a uniformly random structure, but constraints on recombination will produce network communities that we identify using a stochastic block model. We validate this method on synthetic data, showing that it correctly recovers populations of constrained recombination, before applying it to the Duffy Binding Like-α (DBLα) domain of var genes. We find nine HVRs whose network communities map in distinctive ways to known DBLα classifications and clinical phenotypes. We show that the recombinational constraints of some HVRs are correlated, while others are independent. These findings suggest that this micromodular structuring facilitates independent evolutionary trajectories of neighboring mosaic regions, allowing the parasite to retain protein function while generating enormous sequence diversity. Our approach therefore offers a rigorous method for analyzing evolutionary constraints in var genes, and is also flexible enough to be easily applied more generally to any highly recombinant sequences.
A Network Approach to Analyzing Highly Recombinant Malaria Parasite Genes
Larremore, Daniel B.; Clauset, Aaron; Buckee, Caroline O.
2013-01-01
The var genes of the human malaria parasite Plasmodium falciparum present a challenge to population geneticists due to their extreme diversity, which is generated by high rates of recombination. These genes encode a primary antigen protein called PfEMP1, which is expressed on the surface of infected red blood cells and elicits protective immune responses. Var gene sequences are characterized by pronounced mosaicism, precluding the use of traditional phylogenetic tools that require bifurcating tree-like evolutionary relationships. We present a new method that identifies highly variable regions (HVRs), and then maps each HVR to a complex network in which each sequence is a node and two nodes are linked if they share an exact match of significant length. Here, networks of var genes that recombine freely are expected to have a uniformly random structure, but constraints on recombination will produce network communities that we identify using a stochastic block model. We validate this method on synthetic data, showing that it correctly recovers populations of constrained recombination, before applying it to the Duffy Binding Like-α (DBLα) domain of var genes. We find nine HVRs whose network communities map in distinctive ways to known DBLα classifications and clinical phenotypes. We show that the recombinational constraints of some HVRs are correlated, while others are independent. These findings suggest that this micromodular structuring facilitates independent evolutionary trajectories of neighboring mosaic regions, allowing the parasite to retain protein function while generating enormous sequence diversity. Our approach therefore offers a rigorous method for analyzing evolutionary constraints in var genes, and is also flexible enough to be easily applied more generally to any highly recombinant sequences. PMID:24130474
Transcriptional analysis of Penaeus stylirostris densovirus genes
USDA-ARS?s Scientific Manuscript database
Penaeus stylirostris densovirus (PstDNV) genome contains three open reading frames (ORFs), left, middle, and right, which encode a non-structural (NS) protein, an unknown protein, and a capsid protein (CP), respectively. Transcription mapping revealed that P2, P11 and P61 promoters transcribe the le...
Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K
2017-04-01
There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Sahoo, Dipak K; Abeysekara, Nilwala S; Cianzio, Silvia R; Robertson, Alison E; Bhattacharyya, Madan K
2017-01-01
Phytophthora sojae Kaufmann and Gerdemann, which causes Phytophthora root rot, is a widespread pathogen that limits soybean production worldwide. Development of Phytophthora resistant cultivars carrying Phytophthora resistance Rps genes is a cost-effective approach in controlling this disease. For this mapping study of a novel Rps gene, 290 recombinant inbred lines (RILs) (F7 families) were developed by crossing the P. sojae resistant cultivar PI399036 with the P. sojae susceptible AR2 line, and were phenotyped for responses to a mixture of three P. sojae isolates that overcome most of the known Rps genes. Of these 290 RILs, 130 were homozygous resistant, 12 heterzygous and segregating for Phytophthora resistance, and 148 were recessive homozygous and susceptible. From this population, 59 RILs homozygous for Phytophthora sojae resistance and 61 susceptible to a mixture of P. sojae isolates R17 and Val12-11 or P7074 that overcome resistance encoded by known Rps genes mapped to Chromosome 18 were selected for mapping novel Rps gene. A single gene accounted for the 1:1 segregation of resistance and susceptibility among the RILs. The gene encoding the Phytophthora resistance mapped to a 5.8 cM interval between the SSR markers BARCSOYSSR_18_1840 and Sat_064 located in the lower arm of Chromosome 18. The gene is mapped 2.2 cM proximal to the NBSRps4/6-like sequence that was reported to co-segregate with the Phytophthora resistance genes Rps4 and Rps6. The gene is mapped to a highly recombinogenic, gene-rich genomic region carrying several nucleotide binding site-leucine rich repeat (NBS-LRR)-like genes. We named this novel gene as Rps12, which is expected to be an invaluable resource in breeding soybeans for Phytophthora resistance.
Shirak, Andrey; Seroussi, Eyal; Cnaani, Avner; Howe, Aimee E; Domokhovsky, Raisa; Zilberman, Noam; Kocher, Thomas D; Hulata, Gideon; Ron, Micha
2006-11-01
Recent studies have revealed that the major genes of the mammalian sex determination pathway are also involved in sex determination of fish. Several studies have reported QTL in various species and strains of tilapia, regions contributing to sex determination have been identified on linkage groups 1, 3, and 23. Genes contributing to sex-specific mortality have been detected on linkage groups 2, 6, and 23. To test whether the same genes might control sex determination in mammals and fishes, we mapped 11 genes that are considered putative master key regulators of sex determination: Amh, Cyp19, Dax1, Dmrt2, Dmrta2, Fhl3l, Foxl2, Ixl, Lhx9, Sf1, and Sox8. We identified polymorphisms in noncoding regions of these genes and genotyped these sites for 90 individuals of an F2 mapping family. Mapping of Dax1 joined LG16 and LG21 into a single linkage group. The Amh and Dmrta2 genes were mapped to two distinct regions of LG23. The Amh gene was mapped 5 cM from UNH879 within a QTL region for sex determination and 2 cM from UNH216 within a QTL region for sex-specific mortality. Dmrta2 was mapped 4 cM from UNH848 within another QTL region for sex determination. Cyp19 was mapped to LG1 far from a previously reported QTL region for sex determination on this chromosome. Seven other candidate genes mapped to LG4, -11, -12, -14, and -17.
Interconnected microbiomes and resistomes in low-income human habitats
Pehrsson, Erica C.; Tsukayama, Pablo; Patel, Sanket; Mejía-Bautista, Melissa; Sosa-Soto, Giordano; Navarrete, Karla M.; Calderon, Maritza; Cabrera, Lilia; Hoyos-Arango, William; Bertoli, M. Teresita; Berg, Douglas E.; Gilman, Robert H.; Dantas, Gautam
2016-01-01
Summary Antibiotic-resistant infections annually claim hundreds of thousands of lives worldwide. This problem is exacerbated by resistance gene exchange between pathogens and benign microbes from diverse habitats. Mapping resistance gene dissemination between humans and their environment is a public health priority. We characterized the bacterial community structure and resistance exchange networks of hundreds of interconnected human fecal and environmental samples from two low-income Latin American communities. We found that resistomes across habitats are generally structured by bacterial phylogeny along ecological gradients, but identified key resistance genes that cross habitat boundaries and determined their association with mobile genetic elements. We also assessed the effectiveness of widely-used excreta management strategies in reducing fecal bacteria and resistance genes in these settings representative of low- and middle-income countries. Our results lay the foundation for quantitative risk assessment and surveillance of resistance dissemination across interconnected habitats in settings representing over two-thirds of the world’s population. PMID:27172044
Structural and functional partitioning of bread wheat chromosome 3B.
Choulet, Frédéric; Alberti, Adriana; Theil, Sébastien; Glover, Natasha; Barbe, Valérie; Daron, Josquin; Pingault, Lise; Sourdille, Pierre; Couloux, Arnaud; Paux, Etienne; Leroy, Philippe; Mangenot, Sophie; Guilhot, Nicolas; Le Gouis, Jacques; Balfourier, Francois; Alaux, Michael; Jamilloux, Véronique; Poulain, Julie; Durand, Céline; Bellec, Arnaud; Gaspin, Christine; Safar, Jan; Dolezel, Jaroslav; Rogers, Jane; Vandepoele, Klaas; Aury, Jean-Marc; Mayer, Klaus; Berges, Hélène; Quesneville, Hadi; Wincker, Patrick; Feuillet, Catherine
2014-07-18
We produced a reference sequence of the 1-gigabase chromosome 3B of hexaploid bread wheat. By sequencing 8452 bacterial artificial chromosomes in pools, we assembled a sequence of 774 megabases carrying 5326 protein-coding genes, 1938 pseudogenes, and 85% of transposable elements. The distribution of structural and functional features along the chromosome revealed partitioning correlated with meiotic recombination. Comparative analyses indicated high wheat-specific inter- and intrachromosomal gene duplication activities that are potential sources of variability for adaption. In addition to providing a better understanding of the organization, function, and evolution of a large and polyploid genome, the availability of a high-quality sequence anchored to genetic maps will accelerate the identification of genes underlying important agronomic traits. Copyright © 2014, American Association for the Advancement of Science.
Fujisawa, Takatomo; Narikawa, Rei; Okamoto, Shinobu; Ehira, Shigeki; Yoshimura, Hidehisa; Suzuki, Iwane; Masuda, Tatsuru; Mochimaru, Mari; Takaichi, Shinichi; Awai, Koichiro; Sekine, Mitsuo; Horikawa, Hiroshi; Yashiro, Isao; Omata, Seiha; Takarada, Hiromi; Katano, Yoko; Kosugi, Hiroki; Tanikawa, Satoshi; Ohmori, Kazuko; Sato, Naoki; Ikeuchi, Masahiko; Fujita, Nobuyuki; Ohmori, Masayuki
2010-01-01
A filamentous non-N2-fixing cyanobacterium, Arthrospira (Spirulina) platensis, is an important organism for industrial applications and as a food supply. Almost the complete genome of A. platensis NIES-39 was determined in this study. The genome structure of A. platensis is estimated to be a single, circular chromosome of 6.8 Mb, based on optical mapping. Annotation of this 6.7 Mb sequence yielded 6630 protein-coding genes as well as two sets of rRNA genes and 40 tRNA genes. Of the protein-coding genes, 78% are similar to those of other organisms; the remaining 22% are currently unknown. A total 612 kb of the genome comprise group II introns, insertion sequences and some repetitive elements. Group I introns are located in a protein-coding region. Abundant restriction-modification systems were determined. Unique features in the gene composition were noted, particularly in a large number of genes for adenylate cyclase and haemolysin-like Ca2+-binding proteins and in chemotaxis proteins. Filament-specific genes were highlighted by comparative genomic analysis. PMID:20203057
Chihara, Carol J.; Song, Chunyan; LaMonte, Greg; Fetalvero, Kristina; Hinchman, Kristy; Phan, Helen; Pineda, Mario; Robinson, Kelly; Schneider, Gregory P.
2005-01-01
The omega (ome) gene product is a modifier of larval cuticle protein 5 and its alleles (and duplicates) in the third instar of Drosophila melanogaster. Using deletion mapping the locus mapped to 70F-71A on the left arm of chromosome 3. A homozygote null mutant (ome 1) shows a pleiotropic phenotype that affected the size, developmental time of the flies, and the fertility (or perhaps the behavior) of homozygous mutant males. The omega gene was verified as producing a dipeptidyl peptidase IV (DPPIV) by genetic analysis, substrate specificity and pH optimum. The identity of the gene was confirmed as CG32145 (cytology 70F4) in the Celera Database (Berkeley Drosophila Genome Project), which is consistent with its deletion map position. The genomic structure of the gene is described and the decrease in DPPIV activity in the mutant ome1 is shown to be due to the gene CG32145 (omega). The D. melanogaster omega DPPIV enzyme was partially purified and characterized. The exons of the ome1 mutant were sequenced and a base substitution mutation in exon 4 was identified that would yield a truncated protein caused by a stop codon. A preliminary study of the compartmentalization of the omega DPPIV enzyme in several organs is also reported. Abbreviations: DPPIV dipeptidyl peptidase IV LCP5 & LCP6 third instar larval cuticle proteins 5 & 6 ome & ome1 omega locus name (CG32145) and mutant allele in D. melanogaster pNA paranotroanilide PMID:17119608
YouGenMap: a web platform for dynamic multi-comparative mapping and visualization of genetic maps
Keith Batesole; Kokulapalan Wimalanathan; Lin Liu; Fan Zhang; Craig S. Echt; Chun Liang
2014-01-01
Comparative genetic maps are used in examination of genome organization, detection of conserved gene order, and exploration of marker order variations. YouGenMap is an open-source web tool that offers dynamic comparative mapping capability of users' own genetic mapping between 2 or more map sets. Users' genetic map data and optional gene annotations are...
Cuenca, José; Aleza, Pablo; Vicent, Antonio; Brunel, Dominique; Ollitrault, Patrick; Navarro, Luis
2013-01-01
Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR) to map a genome region linked to Alternaria brown spot (ABS) resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria alternata. This pathogen produces ACT-toxin, which induces necrotic lesions on fruit and young leaves, defoliation and fruit drop in susceptible genotypes. It is a strong concern for triploid breeding programs aiming to produce seedless mandarin cultivars. The monolocus dominant inheritance of susceptibility, proposed on the basis of diploid population studies, was corroborated in triploid progeny. Bulk segregant analysis coupled with genome scan using a large set of genetically mapped SNP markers and targeted genetic mapping by half tetrad analysis, using SSR and SNP markers, allowed locating a 3.3 Mb genomic region linked to ABS resistance near the centromere of chromosome III. Clusters of resistance genes were identified by gene ontology analysis of this genomic region. Some of these genes are good candidates to control the dominant susceptibility to the ACT-toxin. SSR and SNP markers were developed for efficient early marker-assisted selection of ABS resistant hybrids.
Cuenca, José; Aleza, Pablo; Vicent, Antonio; Brunel, Dominique; Ollitrault, Patrick; Navarro, Luis
2013-01-01
Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR) to map a genome region linked to Alternaria brown spot (ABS) resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria alternata. This pathogen produces ACT-toxin, which induces necrotic lesions on fruit and young leaves, defoliation and fruit drop in susceptible genotypes. It is a strong concern for triploid breeding programs aiming to produce seedless mandarin cultivars. The monolocus dominant inheritance of susceptibility, proposed on the basis of diploid population studies, was corroborated in triploid progeny. Bulk segregant analysis coupled with genome scan using a large set of genetically mapped SNP markers and targeted genetic mapping by half tetrad analysis, using SSR and SNP markers, allowed locating a 3.3 Mb genomic region linked to ABS resistance near the centromere of chromosome III. Clusters of resistance genes were identified by gene ontology analysis of this genomic region. Some of these genes are good candidates to control the dominant susceptibility to the ACT-toxin. SSR and SNP markers were developed for efficient early marker-assisted selection of ABS resistant hybrids. PMID:24116149
Mapping Flagellar Genes in Chlamydomonas Using Restriction Fragment Length Polymorphisms
Ranum, LPW.; Thompson, M. D.; Schloss, J. A.; Lefebvre, P. A.; Silflow, C. D.
1988-01-01
To correlate cloned nuclear DNA sequences with previously characterized mutations in Chlamydomonas and, to gain insight into the organization of its nuclear genome, we have begun to map molecular markers using restriction fragment length polymorphisms (RFLPs). A Chlamydomonas reinhardtii strain (CC-29) containing phenotypic markers on nine of the 19 linkage groups was crossed to the interfertile species Chlamydomonas smithii. DNA from each member of 22 randomly selected tetrads was analyzed for the segregation of RFLPs associated with cloned genes detected by hybridization with radioactive DNA probes. The current set of markers allows the detection of linkage to new molecular markers over approximately 54% of the existing genetic map. This study focused on mapping cloned flagellar genes and genes whose transcripts accumulate after deflagellation. Twelve different molecular clones have been assigned to seven linkage groups. The α-1 tubulin gene maps to linkage group III and is linked to the genomic sequence homologous to pcf6-100, a cDNA clone whose corresponding transcript accumulates after deflagellation. The α-2 tubulin gene maps to linkage group IV. The two β-tubulin genes are linked, with the β-1 gene being approximately 12 cM more distal from the centromere than the β-2 gene. A clone corresponding to a 73-kD dynein protein maps to the opposite arm of the same linkage group. The gene corresponding to the cDNA clone pcf6-187, whose mRNA accumulates after deflagellation, maps very close to the tightly linked pf-26 and pf-1 mutations on linkage group V. PMID:2906025
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hanson, R.S.
In the past several years researchers have identified at least 20 genes whose products were required for the oxidation of methanol to formaldehyde in three different facultative methylotrophic bacteria. These genes include structural genes for a cytochrome c{sub L} (mox G) and is a specific electron acceptor for methanol dehydrogenase (MDH), and the two structural genes that encode the large subunit (mox F) and smaller subunit (mox I) of MDH. Other genes are required for the synthesis of the prosthetic group of MDH, Pyrroloquinoline quinone (PQQ), and proteins required for assembly of the active MDH in the periplasm. Three genesmore » are believed to be required for incorporation of calcium into the MDH tetramer. The principal investigator`s group has studied the regulation of methanol oxidation in the pink-pigmented-facultative methylotroph Methylobacterium organophilum XX. The authors have mapped several genes and have sequenced the mox F gene and sequences upstream of mox F. The authors had tentatively identified several genes required for the transcription of the MDH structural genes in three methylotrophs. In the previous proposal, the P.I. proposed to establish an in-vitro transcription/translation system to study the function of the regulatory gene products. Further studies demonstrated that the regulation of transcription of these genes was far more complex than imagined at that time and the research plan was modified to determine the number and function of the regulatory genes using genetic approaches.« less
Electron microscopic studies of bacteriophage M13 DNA replication. [Escherichia coli
DOE Office of Scientific and Technical Information (OSTI.GOV)
Allison, D.P.; Ganesan, A.T.; Olson, A.C.
Intracellular forms of M13 phage DNA isolated after infection of Escherichia coli with wild-type phage have been studied by electron microscopy and ultracentrifugation. The data indicate the involvement of rolling-circle intermediates in single-stranded DNA synthesis. In addition to single-stranded, circular DNA, we observed covalently closed and nicked replicative-form (RF) DNAs, dimer RF DNAs, concatenated RF DNAs, RF DNAs with single-stranded tails (sigma, rolling circles), and, occasionally, RF DNAs with theta structures. The tails in sigma molecules are always single stranded and are never longer than the DNA from mature phage; the proportion of sigma to other RF molecules does notmore » change significantly with time after infection. The origin of single-stranded DNA synthesis has been mapped by electron microscopy at a unique location on RF DNA by use of partial denaturation mapping and restriction endonuclease digestion. This location is between gene IV and gene II, and synthesis proceeds in a counterclockwise direction on the conventional genetic map.« less
Physical Model of the Genotype-to-Phenotype Map of Proteins
NASA Astrophysics Data System (ADS)
Tlusty, Tsvi; Libchaber, Albert; Eckmann, Jean-Pierre
2017-04-01
How DNA is mapped to functional proteins is a basic question of living matter. We introduce and study a physical model of protein evolution which suggests a mechanical basis for this map. Many proteins rely on large-scale motion to function. We therefore treat protein as learning amorphous matter that evolves towards such a mechanical function: Genes are binary sequences that encode the connectivity of the amino acid network that makes a protein. The gene is evolved until the network forms a shear band across the protein, which allows for long-range, soft modes required for protein function. The evolution reduces the high-dimensional sequence space to a low-dimensional space of mechanical modes, in accord with the observed dimensional reduction between genotype and phenotype of proteins. Spectral analysis of the space of 1 06 solutions shows a strong correspondence between localization around the shear band of both mechanical modes and the sequence structure. Specifically, our model shows how mutations are correlated among amino acids whose interactions determine the functional mode.
Reddy, Umesh K.; Abburi, Lavanya; Abburi, Venkata Lakshmi; Saminathan, Thangasamy; Cantrell, Robert; Vajja, Venkata Gopinath; Reddy, Rishi; Tomason, Yan R.; Levi, Amnon; Wehner, Todd C.; Nimmakayala, Padma
2015-01-01
Our genetic diversity study uses microsatellites of known map position to estimate genome level population structure and linkage disequilibrium, and to identify genomic regions that have undergone selection during watermelon domestication and improvement. Thirty regions that showed evidence of selective sweep were scanned for the presence of candidate genes using the watermelon genome browser (www.icugi.org). We localized selective sweeps in intergenic regions, close to the promoters, and within the exons and introns of various genes. This study provided an evidence of convergent evolution for the presence of diverse ecotypes with special reference to American and European ecotypes. Our search for location of linked markers in the whole-genome draft sequence revealed that BVWS00358, a GA repeat microsatellite, is the GAGA type transcription factor located in the 5′ untranslated regions of a structure and insertion element that expresses a Cys2His2 Zinc finger motif, with presumed biological processes related to chitin response and transcriptional regulation. In addition, BVWS01708, an ATT repeat microsatellite, located in the promoter of a DTW domain-containing protein (Cla002761); and 2 other simple sequence repeats that association mapping link to fruit length and rind thickness. PMID:25425675
Cain-Hom, Carol; Splinter, Erik; van Min, Max; Simonis, Marieke; van de Heijning, Monique; Martinez, Maria; Asghari, Vida
2017-01-01
Abstract Cre/LoxP technology is widely used in the field of mouse genetics for spatial and/or temporal regulation of gene function. For Cre lines generated via pronuclear microinjection of a Cre transgene construct, the integration site is random and in most cases not known. Integration of a transgene can disrupt an endogenous gene, potentially interfering with interpretation of the phenotype. In addition, knowledge of where the transgene is integrated is important for planning of crosses between animals carrying a conditional allele and a given Cre allele in case the alleles are on the same chromosome. We have used targeted locus amplification (TLA) to efficiently map the transgene location in seven previously published Cre and CreERT2 transgenic lines. In all lines, transgene insertion was associated with structural changes of variable complexity, illustrating the importance of testing for rearrangements around the integration site. In all seven lines the exact integration site and breakpoint sequences were identified. Our methods, data and genotyping assays can be used as a resource for the mouse community and our results illustrate the power of the TLA method to not only efficiently map the integration site of any transgene, but also provide additional information regarding the transgene integration events. PMID:28053125
2002-10-01
there is a mutation in the p53 gene itself (4, 5). Interestingly, -80% of p53 mutations are missense changes that lead to single amino acid...substitutions, a feature that distinguishes p53 from other tumor suppressor genes (e.g., APC, NF1, BRCAJ) (6). The incidence of p53 mutations and the types of...intronic promoter is contained within the human mutation hotspot maps of p53: correlation with p53 protein structural and mdm2 gene . Nucleic Acids Res
DSSTox chemical-index files for exposure-related ...
The Distributed Structure-Searchable Toxicity (DSSTox) ARYEXP and GEOGSE files are newly published, structure-annotated files of the chemical-associated and chemical exposure-related summary experimental content contained in the ArrayExpress Repository and Gene Expression Omnibus (GEO) Series (based on data extracted on September 20, 2008). ARYEXP and GEOGSE contain 887 and 1064 unique chemical substances mapped to 1835 and 2381 chemical exposure-related experiment accession IDs, respectively. The standardized files allow one to assess, compare and search the chemical content in each resource, in the context of the larger DSSTox toxicology data network, as well as across large public cheminformatics resources such as PubChem (http://pubchem.ncbi.nlm.nih.gov). The Distributed Structure-Searchable Toxicity (DSSTox) ARYEXP and GEOGSE files are newly published, structure-annotated files of the chemical-associated and chemical exposure-related summary experimental content contained in the ArrayExpress Repository and Gene Expression Omnibus (GEO) Series (based on data extracted on September 20, 2008). ARYEXP and GEOGSE contain 887 and 1064 unique chemical substances mapped to 1835 and 2381 chemical exposure-related experiment accession IDs, respectively. The standardized files allow one to assess, compare and search the chemical content in each resource, in the context of the larger DSSTox toxicology data network, as well as across large public cheminformatics resourc
Evolution of substrate specificity in a retained enzyme driven by gene loss
Juárez-Vázquez, Ana Lilia; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.; ...
2017-03-31
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence oftrpandhisgenes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficientmore » form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Finally, our results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Evolution of substrate specificity in a retained enzyme driven by gene loss
DOE Office of Scientific and Technical Information (OSTI.GOV)
Juárez-Vázquez, Ana Lilia; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.
The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence oftrpandhisgenes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficientmore » form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Finally, our results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Spencer, S.R.; Taylor, J.B.; Cowell, I.G.
The soluble glutathione transferases (GSTs) are a family of dimeric isoenymes catalyzing the conjugation of glutathione to hydrophobic electropiles. Their subunits can be grouped into four families, alpha, mu, pi, and theta, on the basis of their primary structures. In man, the pi class is represented by a single gene, GSTP1-1 (GST[pi]) localized to human chromosome 11, band q13. The oncogenes INT2, HSTF1, and PRAD1 are also localized at 11q13, and together with the GSTP1 locus and other gene loci mapped to 11q13, i.e., BCL1 and EMS1, they form a unit of DNA approximately 2000-2500 kb, known as the 11q13more » amplicon, which is often amplified in a range of solid tumors. Any gene locus at 11q13 is of interest because it may influence tumorigenesis. 14 refs., 1 fig.« less
A draft annotation and overview of the human genome
Wright, Fred A; Lemon, William J; Zhao, Wei D; Sears, Russell; Zhuo, Degen; Wang, Jian-Ping; Yang, Hee-Yung; Baer, Troy; Stredney, Don; Spitzner, Joe; Stutz, Al; Krahe, Ralf; Yuan, Bo
2001-01-01
Background The recent draft assembly of the human genome provides a unified basis for describing genomic structure and function. The draft is sufficiently accurate to provide useful annotation, enabling direct observations of previously inferred biological phenomena. Results We report here a functionally annotated human gene index placed directly on the genome. The index is based on the integration of public transcript, protein, and mapping information, supplemented with computational prediction. We describe numerous global features of the genome and examine the relationship of various genetic maps with the assembly. In addition, initial sequence analysis reveals highly ordered chromosomal landscapes associated with paralogous gene clusters and distinct functional compartments. Finally, these annotation data were synthesized to produce observations of gene density and number that accord well with historical estimates. Such a global approach had previously been described only for chromosomes 21 and 22, which together account for 2.2% of the genome. Conclusions We estimate that the genome contains 65,000-75,000 transcriptional units, with exon sequences comprising 4%. The creation of a comprehensive gene index requires the synthesis of all available computational and experimental evidence. PMID:11516338
Quraishi, Umar Masood; Abrouk, Michael; Murat, Florent; Pont, Caroline; Foucrier, Séverine; Desmaizieres, Gregory; Confolent, Carole; Rivière, Nathalie; Charmet, Gilles; Paux, Etienne; Murigneux, Alain; Guerreiro, Laurent; Lafarge, Stéphane; Le Gouis, Jacques; Feuillet, Catherine; Salse, Jerome
2011-03-01
Monitoring nitrogen use efficiency (NUE) in plants is becoming essential to maintain yield while reducing fertilizer usage. Optimized NUE application in major crops is essential for long-term sustainability of agriculture production. Here, we report the precise identification of 11 major chromosomal regions controlling NUE in wheat that co-localise with key developmental genes such as Ppd (photoperiod sensitivity), Vrn (vernalization requirement), Rht (reduced height) and can be considered as robust markers from a molecular breeding perspective. Physical mapping, sequencing, annotation and candidate gene validation of an NUE metaQTL on wheat chromosome 3B allowed us to propose that a glutamate synthase (GoGAT) gene that is conserved structurally and functionally at orthologous positions in rice, sorghum and maize genomes may contribute to NUE in wheat and other cereals. We propose an evolutionary model for the NUE locus in cereals from a common ancestral region, involving species specific shuffling events such as gene deletion, inversion, transposition and the invasion of repetitive elements. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
High resolution physical mapping of single gene fragments on pachytene chromosome 4 and 7 of Rosa.
Kirov, Ilya V; Van Laere, Katrijn; Khrustaleva, Ludmila I
2015-07-02
Rosaceae is a family containing many economically important fruit and ornamental species. Although fluorescence in situ hybridization (FISH)-based physical mapping of plant genomes is a valuable tool for map-based cloning, comparative genomics and evolutionary studies, no studies using high resolution physical mapping have been performed in this family. Previously we proved that physical mapping of single-copy genes as small as 1.1 kb is possible on mitotic metaphase chromosomes of Rosa wichurana using Tyramide-FISH. In this study we aimed to further improve the physical map of Rosa wichurana by applying high resolution FISH to pachytene chromosomes. Using high resolution Tyramide-FISH and multicolor Tyramide-FISH, 7 genes (1.7-3 kb) were successfully mapped on pachytene chromosomes 4 and 7 of Rosa wichurana. Additionally, by using multicolor Tyramide-FISH three closely located genes were simultaneously visualized on chromosome 7. A detailed map of heterochromatine/euchromatine patterns of chromosome 4 and 7 was developed with indication of the physical position of these 7 genes. Comparison of the gene order between Rosa wichurana and Fragaria vesca revealed a poor collinearity for chromosome 7, but a perfect collinearity for chromosome 4. High resolution physical mapping of short probes on pachytene chromosomes of Rosa wichurana was successfully performed for the first time. Application of Tyramide-FISH on pachytene chromosomes allowed the mapping resolution to be increased up to 20 times compared to mitotic metaphase chromosomes. High resolution Tyramide-FISH and multicolor Tyramide-FISH might become useful tools for further physical mapping of single-copy genes and for the integration of physical and genetic maps of Rosa wichurana and other members of the Rosaceae.
Huang, Xin; Gollin, Susanne M.; Raja, Siva; Godfrey, Tony E.
2002-01-01
Amplification of chromosomal band 11q13 is a common event in human cancer. It has been reported in about 45% of head and neck carcinomas and in other cancers including esophageal, breast, liver, lung, and bladder cancer. To understand the mechanism of 11q13 amplification and to identify the potential oncogene(s) driving it, we have fine-mapped the structure of the amplicon in oral squamous cell carcinoma cell lines and localized the proximal and distal breakpoints. A 5-Mb physical map of the region has been prepared from which sequence is available. We quantified copy number of sequence-tagged site markers at 42–550 kb intervals along the length of the amplicon and defined the amplicon core and breakpoints by using TaqMan-based quantitative microsatellite analysis. The core of the amplicon maps to a 1.5-Mb region. The proximal breakpoint localizes to two intervals between sequence-tagged site markers, 550 kb and 160 kb in size, and the distal breakpoint maps to a 250 kb interval. The cyclin D1 gene maps to the amplicon core, as do two new expressed sequence tag clusters. We have analyzed one of these expressed sequence tag clusters and now report that it contains a previously uncharacterized gene, TAOS1 (tumor amplified and overexpressed sequence 1), which is both amplified and overexpressed in oral cancer cells. The data suggest that TAOS1 may be an amplification-dependent candidate oncogene with a role in the development and/or progression of human tumors, including oral squamous cell carcinomas. The approach described here should be useful for characterizing amplified genomic regions in a wide variety of tumors. PMID:12172009
Reconstitutional Mutagenesis of the Maize P Gene by Short-Range Ac Transpositions
Moreno, M. A.; Chen, J.; Greenblatt, I.; Dellaporta, S. L.
1992-01-01
The tendency for Ac to transpose over short intervals has been utilized to develop insertional mutagenesis and fine structure genetic mapping strategies in maize. We recovered excisions of Ac from the P gene and insertions into nearby chromosomal sites. These closely linked Ac elements reinserted into the P gene, reconstituting over 250 unstable variegated alleles. Reconstituted alleles condition a variety of variegation patterns that reflect the position and orientation of Ac within the P gene. Molecular mapping and DNA sequence analyses have shown that reinsertion sites are dispersed throughout a 12.3-kb chromosomal region in the promoter, exons and introns of the P gene, but in some regions insertions sites were clustered in a nonrandom fashion. Transposition profiles and target site sequence data obtained from these studies have revealed several features of Ac transposition including its preference for certain target sites. These results clearly demonstrate the tendency of Ac to transpose to nearby sites in both proximal and distal directions from the donor site. With minor modifications, reconstitutional mutagenesis should be applicable to many Ac-induced mutations in maize and in other plant species and can possibly be extended to other eukaryotic transposon systems as well. PMID:1325389
Jones, David B; Jerry, Dean R; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R
2013-11-20
The silver-lipped pearl oyster, Pinctada maxima, is an important tropical aquaculture species extensively farmed for the highly sought "South Sea" pearls. Traditional breeding programs have been initiated for this species in order to select for improved pearl quality, but many economic traits under selection are complex, polygenic and confounded with environmental factors, limiting the accuracy of selection. The incorporation of a marker-assisted selection (MAS) breeding approach would greatly benefit pearl breeding programs by allowing the direct selection of genes responsible for pearl quality. However, before MAS can be incorporated, substantial genomic resources such as genetic linkage maps need to be generated. The construction of a high-density genetic linkage map for P. maxima is not only essential for unravelling the genomic architecture of complex pearl quality traits, but also provides indispensable information on the genome structure of pearl oysters. A total of 1,189 informative genome-wide single nucleotide polymorphisms (SNPs) were incorporated into linkage map construction. The final linkage map consisted of 887 SNPs in 14 linkage groups, spans a total genetic distance of 831.7 centimorgans (cM), and covers an estimated 96% of the P. maxima genome. Assessment of sex-specific recombination across all linkage groups revealed limited overall heterochiasmy between the sexes (i.e. 1.15:1 F/M map length ratio). However, there were pronounced localised differences throughout the linkage groups, whereby male recombination was suppressed near the centromeres compared to female recombination, but inflated towards telomeric regions. Mean values of LD for adjacent SNP pairs suggest that a higher density of markers will be required for powerful genome-wide association studies. Finally, numerous nacre biomineralization genes were localised providing novel positional information for these genes. This high-density SNP genetic map is the first comprehensive linkage map for any pearl oyster species. It provides an essential genomic tool facilitating studies investigating the genomic architecture of complex trait variation and identifying quantitative trait loci for economically important traits useful in genetic selection programs within the P. maxima pearling industry. Furthermore, this map provides a foundation for further research aiming to improve our understanding of the dynamic process of biomineralization, and pearl oyster evolution and synteny.
NASA Astrophysics Data System (ADS)
Mittal, Shikha; Banduni, Pooja; Mallikarjuna, Mallana G.; Rao, Atmakuri R.; Jain, Prashant A.; Dash, Prasanta K.; Thirunavukkarasu, Nepolean
2018-05-01
Drought is one of the major threats to maize production. In order to improve the production and to breed tolerant hybrids, understanding the genes and regulatory mechanisms during drought stress is important. Transcription factors (TFs) play a major role in gene regulation and many TFs have been identified in response to drought stress. In our experiment, a set of 15 major TF families comprising 1436 genes was structurally and functionally characterized using in-silico tools and a gene expression assay. All 1436 genes were mapped on 10 chromosome of maize. The functional annotation indicated the involvement of these genes in ABA signaling, ROS scavenging, photosynthesis, stomatal regulation, and sucrose metabolism. Duplication was identified as the primary force in divergence and expansion of TF families. Phylogenetic relationship was developed individually for each TF family as well as combined TF families. Phylogenetic analysis grouped the TF family of genes into TF-specific and mixed groups. Phylogenetic analysis of genes belonging to various TF families suggested that the origin of TFs occurred in the lineage of maize evolution. Gene structure analysis revealed that more number of genes were intron-rich as compared to intronless genes. Drought-responsive CRE’s such as ABREA, ABREB, DRE1 and DRECRTCOREAT have been identified. Expression and interaction analyses identified leaf-specific bZIP TF, GRMZM2G140355, as a potential contributor toward drought tolerance in maize. We also analyzed protein-protein interaction network of 269 drought-responsive genes belonging to different drought-related TFs. The information generated on structural and functional characteristics, expression and interaction of the drought-related TF families will be useful to decipher the drought tolerance mechanisms and to derive drought-tolerant genotypes in maize.
A global interaction network maps a wiring diagram of cellular function
Costanzo, Michael; VanderSluis, Benjamin; Koch, Elizabeth N.; Baryshnikova, Anastasia; Pons, Carles; Tan, Guihong; Wang, Wen; Usaj, Matej; Hanchard, Julia; Lee, Susan D.; Pelechano, Vicent; Styles, Erin B.; Billmann, Maximilian; van Leeuwen, Jolanda; van Dyk, Nydia; Lin, Zhen-Yuan; Kuzmin, Elena; Nelson, Justin; Piotrowski, Jeff S.; Srikumar, Tharan; Bahr, Sondra; Chen, Yiqun; Deshpande, Raamesh; Kurat, Christoph F.; Li, Sheena C.; Li, Zhijian; Usaj, Mojca Mattiazzi; Okada, Hiroki; Pascoe, Natasha; Luis, Bryan-Joseph San; Sharifpoor, Sara; Shuteriqi, Emira; Simpkins, Scott W.; Snider, Jamie; Suresh, Harsha Garadi; Tan, Yizhao; Zhu, Hongwei; Malod-Dognin, Noel; Janjic, Vuk; Przulj, Natasa; Troyanskaya, Olga G.; Stagljar, Igor; Xia, Tian; Ohya, Yoshikazu; Gingras, Anne-Claude; Raught, Brian; Boutros, Michael; Steinmetz, Lars M.; Moore, Claire L.; Rosebrock, Adam P.; Caudy, Amy A.; Myers, Chad L.; Andrews, Brenda; Boone, Charles
2017-01-01
We generated a global genetic interaction network for Saccharomyces cerevisiae, constructing over 23 million double mutants, identifying ~550,000 negative and ~350,000 positive genetic interactions. This comprehensive network maps genetic interactions for essential gene pairs, highlighting essential genes as densely connected hubs. Genetic interaction profiles enabled assembly of a hierarchical model of cell function, including modules corresponding to protein complexes and pathways, biological processes, and cellular compartments. Negative interactions connected functionally related genes, mapped core bioprocesses, and identified pleiotropic genes, whereas positive interactions often mapped general regulatory connections among gene pairs, rather than shared functionality. The global network illustrates how coherent sets of genetic interactions connect protein complex and pathway modules to map a functional wiring diagram of the cell. PMID:27708008
Ron, Micha; Israeli, Galit; Seroussi, Eyal; Weller, Joel I; Gregg, Jeffrey P; Shani, Moshe; Medrano, Juan F
2007-01-01
Background Many studies have found segregating quantitative trait loci (QTL) for milk production traits in different dairy cattle populations. However, even for relatively large effects with a saturated marker map the confidence interval for QTL location by linkage analysis spans tens of map units, or hundreds of genes. Combining mapping and arraying has been suggested as an approach to identify candidate genes. Thus, gene expression analysis in the mammary gland of genes positioned in the confidence interval of the QTL can bridge the gap between fine mapping and quantitative trait nucleotide (QTN) determination. Results We hybridized Affymetrix microarray (MG-U74v2), containing 12,488 murine probes, with RNA derived from mammary gland of virgin, pregnant, lactating and involuting C57BL/6J mice in a total of nine biological replicates. We combined microarray data from two additional studies that used the same design in mice with a total of 75 biological replicates. The same filtering and normalization was applied to each microarray data using GeneSpring software. Analysis of variance identified 249 differentially expressed probe sets common to the three experiments along the four developmental stages of puberty, pregnancy, lactation and involution. 212 genes were assigned to their bovine map positions through comparative mapping, and thus form a list of candidate genes for previously identified QTLs for milk production traits. A total of 82 of the genes showed mammary gland-specific expression with at least 3-fold expression over the median representing all tissues tested in GeneAtlas. Conclusion This work presents a web tool for candidate genes for QTL (cgQTL) that allows navigation between the map of bovine milk production QTL, potential candidate genes and their level of expression in mammary gland arrays and in GeneAtlas. Three out of four confirmed genes that affect QTL in livestock (ABCG2, DGAT1, GDF8, IGF2) were over expressed in the target organ. Thus, cgQTL can be used to determine priority of candidate genes for QTN analysis based on differential expression in the target organ. PMID:17584498
Everts-van der Wind, Annelie; Kata, Srinivas R.; Band, Mark R.; Rebeiz, Mark; Larkin, Denis M.; Everts, Robin E.; Green, Cheryl A.; Liu, Lei; Natarajan, Shreedhar; Goldammer, Tom; Lee, Jun Heon; McKay, Stephanie; Womack, James E.; Lewin, Harris A.
2004-01-01
A second-generation 5000 rad radiation hybrid (RH) map of the cattle genome was constructed primarily using cattle ESTs that were targeted to gaps in the existing cattle–human comparative map, as well as to sparsely populated map intervals. A total of 870 targeted markers were added, bringing the number of markers mapped on the RH5000 panel to 1913. Of these, 1463 have significant BLASTN hits (E < e–5) against the human genome sequence. A cattle–human comparative map was created using human genome sequence coordinates of the paired orthologs. One-hundred and ninety-five conserved segments (defined by two or more genes) were identified between the cattle and human genomes, of which 31 are newly discovered and 34 were extended singletons on the first-generation map. The new map represents an improvement of 20% genome-wide comparative coverage compared with the first-generation map. Analysis of gene content within human genome regions where there are gaps in the comparative map revealed gaps with both significantly greater and significantly lower gene content. The new, more detailed cattle–human comparative map provides an improved resource for the analysis of mammalian chromosome evolution, the identification of candidate genes for economically important traits, and for proper alignment of sequence contigs on cattle chromosomes. PMID:15231756
Khan, Sabaz Ali; Chibon, Pierre-Yves; de Vos, Ric C.H.; Schipper, Bert A.; Walraven, Evert; Beekwilder, Jules; van Dijk, Thijs; Finkers, Richard; Visser, Richard G.F.; van de Weg, Eric W.; Bovy, Arnaud; Cestaro, Alessandro; Velasco, Riccardo; Jacobsen, Evert; Schouten, Henk J.
2012-01-01
Apple (Malus×domestica Borkh) is among the main sources of phenolic compounds in the human diet. The genetic basis of the quantitative variations of these potentially beneficial phenolic compounds was investigated. A segregating F1 population was used to map metabolite quantitative trait loci (mQTLs). Untargeted metabolic profiling of peel and flesh tissues of ripe fruits was performed using liquid chromatography–mass spectrometry (LC-MS), resulting in the detection of 418 metabolites in peel and 254 in flesh. In mQTL mapping using MetaNetwork, 669 significant mQTLs were detected: 488 in the peel and 181 in the flesh. Four linkage groups (LGs), LG1, LG8, LG13, and LG16, were found to contain mQTL hotspots, mainly regulating metabolites that belong to the phenylpropanoid pathway. The genetics of annotated metabolites was studied in more detail using MapQTL®. A number of quercetin conjugates had mQTLs on LG1 or LG13. The most important mQTL hotspot with the largest number of metabolites was detected on LG16: mQTLs for 33 peel-related and 17 flesh-related phenolic compounds. Structural genes involved in the phenylpropanoid biosynthetic pathway were located, using the apple genome sequence. The structural gene leucoanthocyanidin reductase (LAR1) was in the mQTL hotspot on LG16, as were seven transcription factor genes. The authors believe that this is the first time that a QTL analysis was performed on such a high number of metabolites in an outbreeding plant species. PMID:22330898
UniGene Tabulator: a full parser for the UniGene format.
Lenzi, Luca; Frabetti, Flavia; Facchin, Federica; Casadei, Raffaella; Vitale, Lorenza; Canaider, Silvia; Carinci, Paolo; Zannotti, Maria; Strippoli, Pierluigi
2006-10-15
UniGene Tabulator 1.0 provides a solution for full parsing of UniGene flat file format; it implements a structured graphical representation of each data field present in UniGene following import into a common database managing system usable in a personal computer. This database includes related tables for sequence, protein similarity, sequence-tagged site (STS) and transcript map interval (TXMAP) data, plus a summary table where each record represents a UniGene cluster. UniGene Tabulator enables full local management of UniGene data, allowing parsing, querying, indexing, retrieving, exporting and analysis of UniGene data in a relational database form, usable on Macintosh (OS X 10.3.9 or later) and Windows (2000, with service pack 4, XP, with service pack 2 or later) operating systems-based computers. The current release, including both the FileMaker runtime applications, is freely available at http://apollo11.isto.unibo.it/software/
Coleman, M P; Németh, A H; Campbell, L; Raut, C P; Weissenbach, J; Davies, K E
1994-05-15
The genes ARAF1, SYN1, TIMP, and PFC are clustered within 70 kb of one another, and, as reported in the accompanying paper (J. Knight et al., 1994, Genomics 21: 180-187), at least four more genes map within 400 kb: a cluster of Krüppel-type zinc finger genes (including ZNF21, ZNF41, and ZNF81) and ELK-1, a member of the ets oncogene superfamily. This gene-rich region is of particular interest because of the large number of disease genes mapping to Xp11.23: at least three eye diseases (retinitis pigmentosa type 2, congenital stationary night blindness CSNB1, and Aland Island eye disease), Wiskott-Aldrich syndrome, X-linked nephrolithiasis, and a translocation breakpoint associated with synovial sarcoma. We have constructed a 1.8-Mb YAC contig in this region, confirming the link between TIMP and OATL1 reported by Knight et al. (1994) and extending the map in the distal direction. To investigate the likelihood that more genes are located within this region, we have carried out detailed mapping of rare-cutter restriction sites in these YACs and identified seven CpG islands. At least six of these islands are located over 50 kb from any known gene locations, suggesting that the region contains at least this many as yet unidentified genes. We have also mapped the physical locations of six highly polymorphic CA repeats within the contig, thus integrating the physical, genetic, and transcriptional maps of the region and facilitating the mapping and identification of disease genes.(ABSTRACT TRUNCATED AT 250 WORDS)
RatMap--rat genome tools and data.
Petersen, Greta; Johnson, Per; Andersson, Lars; Klinga-Levan, Karin; Gómez-Fabre, Pedro M; Ståhl, Fredrik
2005-01-01
The rat genome database RatMap (http://ratmap.org or http://ratmap.gen.gu.se) has been one of the main resources for rat genome information since 1994. The database is maintained by CMB-Genetics at Goteborg University in Sweden and provides information on rat genes, polymorphic rat DNA-markers and rat quantitative trait loci (QTLs), all curated at RatMap. The database is under the supervision of the Rat Gene and Nomenclature Committee (RGNC); thus much attention is paid to rat gene nomenclature. RatMap presents information on rat idiograms, karyotypes and provides a unified presentation of the rat genome sequence and integrated rat linkage maps. A set of tools is also available to facilitate the identification and characterization of rat QTLs, as well as the estimation of exon/intron number and sizes in individual rat genes. Furthermore, comparative gene maps of rat in regard to mouse and human are provided.
Ndeve, Arsenio Daniel; Huynh, Bao-Lam; Matthews, William Charles; Roberts, Philip Alan
2018-01-01
Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance. PMID:29300744
Santos, Jansen Rodrigo Pereira; Ndeve, Arsenio Daniel; Huynh, Bao-Lam; Matthews, William Charles; Roberts, Philip Alan
2018-01-01
Cowpea is one of the most important food and forage legumes in drier regions of the tropics and subtropics. However, cowpea yield worldwide is markedly below the known potential due to abiotic and biotic stresses, including parasitism by root-knot nematodes (Meloidogyne spp., RKN). Two resistance genes with dominant effect, Rk and Rk2, have been reported to provide resistance against RKN in cowpea. Despite their description and use in breeding for resistance to RKN and particularly genetic mapping of the Rk locus, the exact genes conferring resistance to RKN remain unknown. In the present work, QTL mapping using recombinant inbred line (RIL) population 524B x IT84S-2049 segregating for a newly mapped locus and analysis of the transcriptome changes in two cowpea near-isogenic lines (NIL) were used to identify candidate genes for Rk and the newly mapped locus. A major QTL, designated QRk-vu9.1, associated with resistance to Meloidogyne javanica reproduction, was detected and mapped on linkage group LG9 at position 13.37 cM using egg production data. Transcriptome analysis on resistant and susceptible NILs 3 and 9 days after inoculation revealed up-regulation of 109 and 98 genes and down-regulation of 110 and 89 genes, respectively, out of 19,922 unique genes mapped to the common bean reference genome. Among the differentially expressed genes, four and nine genes were found within the QRk-vu9.1 and QRk-vu11.1 QTL intervals, respectively. Six of these genes belong to the TIR-NBS-LRR family of resistance genes and three were upregulated at one or more time-points. Quantitative RT-PCR validated gene expression to be positively correlated with RNA-seq expression pattern for eight genes. Future functional analysis of these cowpea genes will enhance our understanding of Rk-mediated resistance and identify the specific gene responsible for the resistance.
Wu, Ping; Ng, Chen Siang; Yan, Jie; Lai, Yung-Chih; Chen, Chih-Kuan; Lai, Yu-Ting; Wu, Siao-Man; Chen, Jiun-Jie; Luo, Weiqi; Widelitz, Randall B.; Li, Wen-Hsiung; Chuong, Cheng-Ming
2015-01-01
Avian integumentary organs include feathers, scales, claws, and beaks. They cover the body surface and play various functions to help adapt birds to diverse environments. These keratinized structures are mainly composed of corneous materials made of α-keratins, which exist in all vertebrates, and β-keratins, which only exist in birds and reptiles. Here, members of the keratin gene families were used to study how gene family evolution contributes to novelty and adaptation, focusing on tissue morphogenesis. Using chicken as a model, we applied RNA-seq and in situ hybridization to map α- and β-keratin genes in various skin appendages at embryonic developmental stages. The data demonstrate that temporal and spatial α- and β-keratin expression is involved in establishing the diversity of skin appendage phenotypes. Embryonic feathers express a higher proportion of β-keratin genes than other skin regions. In feather filament morphogenesis, β-keratins show intricate complexity in diverse substructures of feather branches. To explore functional interactions, we used a retrovirus transgenic system to ectopically express mutant α- or antisense β-keratin forms. α- and β-keratins show mutual dependence and mutations in either keratin type results in disrupted keratin networks and failure to form proper feather branches. Our data suggest that combinations of α- and β-keratin genes contribute to the morphological and structural diversity of different avian skin appendages, with feather-β-keratins conferring more possible composites in building intrafeather architecture complexity, setting up a platform of morphological evolution of functional forms in feathers. PMID:26598683
Integrating Evolutionary Game Theory into Mechanistic Genotype-Phenotype Mapping.
Zhu, Xuli; Jiang, Libo; Ye, Meixia; Sun, Lidan; Gragnoli, Claudia; Wu, Rongling
2016-05-01
Natural selection has shaped the evolution of organisms toward optimizing their structural and functional design. However, how this universal principle can enhance genotype-phenotype mapping of quantitative traits has remained unexplored. Here we show that the integration of this principle and functional mapping through evolutionary game theory gains new insight into the genetic architecture of complex traits. By viewing phenotype formation as an evolutionary system, we formulate mathematical equations to model the ecological mechanisms that drive the interaction and coordination of its constituent components toward population dynamics and stability. Functional mapping provides a procedure for estimating the genetic parameters that specify the dynamic relationship of competition and cooperation and predicting how genes mediate the evolution of this relationship during trait formation. Copyright © 2016 Elsevier Ltd. All rights reserved.
Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger.
Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J
2009-02-04
Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method.
MHC class I-associated peptides derive from selective regions of the human genome.
Pearson, Hillary; Daouda, Tariq; Granados, Diana Paola; Durette, Chantal; Bonneil, Eric; Courcelles, Mathieu; Rodenbrock, Anja; Laverdure, Jean-Philippe; Côté, Caroline; Mader, Sylvie; Lemieux, Sébastien; Thibault, Pierre; Perreault, Claude
2016-12-01
MHC class I-associated peptides (MAPs) define the immune self for CD8+ T lymphocytes and are key targets of cancer immunosurveillance. Here, the goals of our work were to determine whether the entire set of protein-coding genes could generate MAPs and whether specific features influence the ability of discrete genes to generate MAPs. Using proteogenomics, we have identified 25,270 MAPs isolated from the B lymphocytes of 18 individuals who collectively expressed 27 high-frequency HLA-A,B allotypes. The entire MAP repertoire presented by these 27 allotypes covered only 10% of the exomic sequences expressed in B lymphocytes. Indeed, 41% of expressed protein-coding genes generated no MAPs, while 59% of genes generated up to 64 MAPs, often derived from adjacent regions and presented by different allotypes. We next identified several features of transcripts and proteins associated with efficient MAP production. From these data, we built a logistic regression model that predicts with good accuracy whether a gene generates MAPs. Our results show preferential selection of MAPs from a limited repertoire of proteins with distinctive features. The notion that the MHC class I immunopeptidome presents only a small fraction of the protein-coding genome for monitoring by the immune system has profound implications in autoimmunity and cancer immunology.
MHC class I–associated peptides derive from selective regions of the human genome
Pearson, Hillary; Granados, Diana Paola; Durette, Chantal; Bonneil, Eric; Courcelles, Mathieu; Rodenbrock, Anja; Laverdure, Jean-Philippe; Côté, Caroline; Thibault, Pierre
2016-01-01
MHC class I–associated peptides (MAPs) define the immune self for CD8+ T lymphocytes and are key targets of cancer immunosurveillance. Here, the goals of our work were to determine whether the entire set of protein-coding genes could generate MAPs and whether specific features influence the ability of discrete genes to generate MAPs. Using proteogenomics, we have identified 25,270 MAPs isolated from the B lymphocytes of 18 individuals who collectively expressed 27 high-frequency HLA-A,B allotypes. The entire MAP repertoire presented by these 27 allotypes covered only 10% of the exomic sequences expressed in B lymphocytes. Indeed, 41% of expressed protein-coding genes generated no MAPs, while 59% of genes generated up to 64 MAPs, often derived from adjacent regions and presented by different allotypes. We next identified several features of transcripts and proteins associated with efficient MAP production. From these data, we built a logistic regression model that predicts with good accuracy whether a gene generates MAPs. Our results show preferential selection of MAPs from a limited repertoire of proteins with distinctive features. The notion that the MHC class I immunopeptidome presents only a small fraction of the protein-coding genome for monitoring by the immune system has profound implications in autoimmunity and cancer immunology. PMID:27841757
Reconstruction of an SSR-based Magnaporthe oryzae physical map to locate avirulence gene AvrPi12.
Li, Tonghui; Wen, Jianqiang; Zhang, Yaling; Correll, James; Wang, Ling; Pan, Qinghua
2018-05-31
Pathogen avirulence (Avr) genes can evolve rapidly when challenged by the widespread deployment of host genes for resistance. They can be effectively isolated by positional cloning provided a robust and well-populated genetic map is available. An updated, SSR-based physical map of the rice blast pathogen Magnaporthe oryzae (Mo) has been constructed based on 116 of the 120 SSRs used to assemble the last map, along with 18 newly developed ones. A comparison between the two versions of the map has revealed an altered marker content and order within most of the Mo chromosomes. The avirulence gene AvrPi12 was mapped in a population of 219 progeny derived from a cross between the two Mo isolates CHL42 and CHL357. A bulked segregant analysis indicated that the gene was located on chromosome 6, a conclusion borne out by an analysis of the pattern of segregation shown by individual isolates. Six additional PCR-based markers were developed to improve the map resolution in the key region. AvrPi12 was finally located within the sub-telomeric region of chromosome 6, distal to the SSR locus LSM6-5. The improved SSR-based linkage map should be useful as a platform for gene mapping and isolation in Mo. It was used to establish the location of AvrPi12, thereby providing a starting point for its positional cloning.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Richard, C.W. III; Berg, D.J.; Meeker, T.C.
1993-05-01
The authors describe a high-resolution radiation hybrid (RH) map of the distal short arm of human chromosome 11 containing the Beckwith-Weidemann gene and the associated embryonal tumor disease loci. Thirteen human 11p15 genes and 17 new anonymous probes were mapped by a statistical analysis of the cosegregation of markers in 102 rodent-human radiation hybrids retaining fragments of human chromosome 11. The 17 anonymous probes were generated from lambda phage containing human 11p15.5 inserts, by using ALU-PCR. A comprehensive map of all 30 loci and a framework map of nine clusters of loci ordered at odds of 1,000:1 were constructed bymore » a multipoint maximum-likelihood approach by using the computer program RHMAP. This RH map localizes one new gene to chromosome 11p15 (WEE1), provides more precise order information for several 11p15 genes (CTSD, H19, HPX,.ST5, RNH, and SMPD1), confirms previous map orders for other 11p15 genes (CALCA, PTH, HBBC, TH, HRAS, and DRD4), and maps 17 new anonymous probes within the 11p15.5 region. This RH map should prove useful in better defining the positions of the Beckwith-Weidemann and associated embryonal tumor disease-gene loci. 41 refs., 1 fig., 2 tabs.« less
Homology of aspartyl- and lysyl-tRNA synthetases.
Gampel, A; Tzagoloff, A
1989-01-01
The yeast nuclear gene MSD1 coding for mitochondrial aspartyl-tRNA synthetase has been cloned and sequenced. The identity of the gene is confirmed by the following evidence. (i) The primary structure of the protein derived from the gene sequence is similar to that of the yeast cytoplasmic aspartyl-tRNA synthetase. (ii) In situ disruption of MSD1 in a respiratory-competent haploid strain of yeast induces a pleiotropic phenotype consistent with a lesion in mitochondrial protein synthesis. (iii) Mitochondria from a mutant with a disrupted chromosomal copy of MSD1 are unable to acylate mitochondrial aspartyl-tRNA. The primary structures of the cytoplasmic and mitochondrial aspartyl-tRNA synthetases are similar to the yeast cytoplasmic lysyl-tRNA synthetase, suggesting that the two types of synthetases may have a common evolutionary origin. Searches of the current protein banks also have revealed a high degree of sequence similarity of the lysyl-tRNA synthetase to the product of the Escherichia coli herC gene and to the partial sequence of a protein encoded by an unidentified reading frame located adjacent to the E. coli frdA gene. Based on the sequence similarities and the map positions of the herC and frdA loci, we propose herC to be the structural gene of the constitutively expressed lysyl-tRNA synthetase of E. coli and the unidentified reading frame to be the structural gene of the heat-inducible lysyl-tRNA synthetase. Images PMID:2668951
Galeano, Carlos H.; Fernandez, Andrea C.; Franco-Herrera, Natalia; Cichy, Karen A.; McClean, Phillip E.; Vanderleyden, Jos; Blair, Matthew W.
2011-01-01
Map-based cloning and fine mapping to find genes of interest and marker assisted selection (MAS) requires good genetic maps with reproducible markers. In this study, we saturated the linkage map of the intra-gene pool population of common bean DOR364×BAT477 (DB) by evaluating 2,706 molecular markers including SSR, SNP, and gene-based markers. On average the polymorphism rate was 7.7% due to the narrow genetic base between the parents. The DB linkage map consisted of 291 markers with a total map length of 1,788 cM. A consensus map was built using the core mapping populations derived from inter-gene pool crosses: DOR364×G19833 (DG) and BAT93×JALO EEP558 (BJ). The consensus map consisted of a total of 1,010 markers mapped, with a total map length of 2,041 cM across 11 linkage groups. On average, each linkage group on the consensus map contained 91 markers of which 83% were single copy markers. Finally, a synteny analysis was carried out using our highly saturated consensus maps compared with the soybean pseudo-chromosome assembly. A total of 772 marker sequences were compared with the soybean genome. A total of 44 syntenic blocks were identified. The linkage group Pv6 presented the most diverse pattern of synteny with seven syntenic blocks, and Pv9 showed the most consistent relations with soybean with just two syntenic blocks. Additionally, a co-linear analysis using common bean transcript map information against soybean coding sequences (CDS) revealed the relationship with 787 soybean genes. The common bean consensus map has allowed us to map a larger number of markers, to obtain a more complete coverage of the common bean genome. Our results, combined with synteny relationships provide tools to increase marker density in selected genomic regions to identify closely linked polymorphic markers for indirect selection, fine mapping or for positional cloning. PMID:22174773
Curtiss, W C; Vournakis, J N
1984-01-01
Eukaryotic 5S rRNA sequences from 34 diverse species were compared by the following method: (1) The sequences were aligned; (2) the positions of substitutions were located by comparison of all possible pairs of sequences; (3) the substitution sites were mapped to an assumed general base pairing model; and (4) the R-Y model of base stacking was used to study stacking pattern relationships in the structure. An analysis of the sequence and structure variability in each region of the molecule is presented. It was found that the degree of base substitution varies over a wide range, from absolute conservation to occurrence of over 90% of the possible observable substitutions. The substitutions are located primarily in stem regions of the 5S rRNA secondary structure. More than 88% of the substitutions in helical regions maintain base pairing. The disruptive substitutions are primarily located at the edges of helical regions, resulting in shortening of the helical regions and lengthening of the adjacent nonpaired regions. Base stacking patterns determined by the R-Y model are mapped onto the general secondary structure. Intrastrand and interstrand stacking could stabilize alternative coaxial structures and limit the conformational flexibility of nonpaired regions. Two short contiguous regions are 100% conserved in all species. This may reflect evolutionary constraints imposed at the DNA level by the requirement for binding of a 5S gene transcription initiation factor during gene expression.
HDAPD: a web tool for searching the disease-associated protein structures
2010-01-01
Background The protein structures of the disease-associated proteins are important for proceeding with the structure-based drug design to against a particular disease. Up until now, proteins structures are usually searched through a PDB id or some sequence information. However, in the HDAPD database presented here the protein structure of a disease-associated protein can be directly searched through the associated disease name keyed in. Description The search in HDAPD can be easily initiated by keying some key words of a disease, protein name, protein type, or PDB id. The protein sequence can be presented in FASTA format and directly copied for a BLAST search. HDAPD is also interfaced with Jmol so that users can observe and operate a protein structure with Jmol. The gene ontological data such as cellular components, molecular functions, and biological processes are provided once a hyperlink to Gene Ontology (GO) is clicked. Further, HDAPD provides a link to the KEGG map such that where the protein is placed and its relationship with other proteins in a metabolic pathway can be found from the map. The latest literatures namely titles, journals, authors, and abstracts searched from PubMed for the protein are also presented as a length controllable list. Conclusions Since the HDAPD data content can be routinely updated through a PHP-MySQL web page built, the new database presented is useful for searching the structures for some disease-associated proteins that may play important roles in the disease developing process for performing the structure-based drug design to against the diseases. PMID:20158919
Entering the Next Dimension: Plant Genomes in 3D.
Sotelo-Silveira, Mariana; Chávez Montes, Ricardo A; Sotelo-Silveira, Jose R; Marsch-Martínez, Nayelli; de Folter, Stefan
2018-04-24
After linear sequences of genomes and epigenomic landscape data, the 3D organization of chromatin in the nucleus is the next level to be explored. Different organisms present a general hierarchical organization, with chromosome territories at the top. Chromatin interaction maps, obtained by chromosome conformation capture (3C)-based methodologies, for eight plant species reveal commonalities, but also differences, among them and with animals. The smallest structures, found in high-resolution maps of the Arabidopsis genome, are single genes. Epigenetic marks (histone modification and DNA methylation), transcriptional activity, and chromatin interaction appear to be correlated, and whether structure is the cause or consequence of the function of interacting regions is being actively investigated. Copyright © 2018 Elsevier Ltd. All rights reserved.
Fine-Scale Map of Encyclopedia of DNA Elements Regions in the Korean Population
Yoo, Yeon-Kyeong; Ke, Xiayi; Hong, Sungwoo; Jang, Hye-Yoon; Park, Kyunghee; Kim, Sook; Ahn, TaeJin; Lee, Yeun-Du; Song, Okryeol; Rho, Na-Young; Lee, Moon Sue; Lee, Yeon-Su; Kim, Jaeheup; Kim, Young J.; Yang, Jun-Mo; Song, Kyuyoung; Kimm, Kyuchan; Weir, Bruce; Cardon, Lon R.; Lee, Jong-Eun; Hwang, Jung-Joo
2006-01-01
The International HapMap Project aims to generate detailed human genome variation maps by densely genotyping single-nucleotide polymorphisms (SNPs) in CEPH, Chinese, Japanese, and Yoruba samples. This will undoubtedly become an important facility for genetic studies of diseases and complex traits in the four populations. To address how the genetic information contained in such variation maps is transferable to other populations, the Korean government, industries, and academics have launched the Korean HapMap project to genotype high-density Encyclopedia of DNA Elements (ENCODE) regions in 90 Korean individuals. Here we show that the LD pattern, block structure, haplotype diversity, and recombination rate are highly concordant between Korean and the two HapMap Asian samples, particularly Japanese. The availability of information from both Chinese and Japanese samples helps to predict more accurately the possible performance of HapMap markers in Korean disease-gene studies. Tagging SNPs selected from the two HapMap Asian maps, especially the Japanese map, were shown to be very effective for Korean samples. These results demonstrate that the HapMap variation maps are robust in related populations and will serve as an important resource for the studies of the Korean population in particular. PMID:16702437
Chromosomal mapping of canine-derived BAC clones to the red fox and American mink genomes.
Kukekova, Anna V; Vorobieva, Nadegda V; Beklemisheva, Violetta R; Johnson, Jennifer L; Temnykh, Svetlana V; Yudkin, Dmitry V; Trut, Lyudmila N; Andre, Catherine; Galibert, Francis; Aguirre, Gustavo D; Acland, Gregory M; Graphodatsky, Alexander S
2009-01-01
High-quality sequencing of the dog (Canis lupus familiaris) genome has enabled enormous progress in genetic mapping of canine phenotypic variation. The red fox (Vulpes vulpes), another canid species, also exhibits a wide range of variation in coat color, morphology, and behavior. Although the fox genome has not yet been sequenced, canine genomic resources have been used to construct a meiotic linkage map of the red fox genome and begin genetic mapping in foxes. However, a more detailed gene-specific comparative map between the dog and fox genomes is required to establish gene order within homologous regions of dog and fox chromosomes and to refine breakpoints between homologous chromosomes of the 2 species. In the current study, we tested whether canine-derived gene-containing bacterial artificial chromosome (BAC) clones can be routinely used to build a gene-specific map of the red fox genome. Forty canine BAC clones were mapped to the red fox genome by fluorescence in situ hybridization (FISH). Each clone was uniquely assigned to a single fox chromosome, and the locations of 38 clones agreed with cytogenetic predictions. These results clearly demonstrate the utility of FISH mapping for construction of a whole-genome gene-specific map of the red fox. The further possibility of using canine BAC clones to map genes in the American mink (Mustela vison) genome was also explored. Much lower success was obtained for this more distantly related farm-bred species, although a few BAC clones were mapped to the predicted chromosomal locations.
A fine structure genomic map of the region of 12q13 containing SAS and CDK4
DOE Office of Scientific and Technical Information (OSTI.GOV)
Linder, C.Y.; Elkahloun, A.G.; Su, Y.A.
1994-09-01
We have recently adapted a method, originally described by Rackwitz, to the rapid restriction mapping of multiple cosmid DNA samples. Linearization of the cosmids at the lambda cohesive site using lambda terminase is followed by partial digestion with selected restriction enzymes and hybridization to oligonucleotides specific for the right or left hand termini. Partial digestions are performed in a microtiter plate thus allowing up to 12 cosmid clones to be digested with one restriction enzyme. We have applied this rapid restriction mapping method to cosmids derived from a region of chromosome 12q13 that has recently been shown to be amplifiedmore » in a variety of cancers including malignant fibrous histiocytoma, fibrosarcoma, liposarcoma, osteosarcoma and brain tumors. A small segment of this amplification unit containing three genes, SAS (a membrane protein), CDK4 (a cyclin dependent kinase) and OS-9 (a recently described cDNA) has been analyzed with the system described above. This fine structure genomic map will be useful for completing the expression map of this region as well as characterizing its pattern of amplification in tumor specimens.« less
Kim, Sang Hoon; Pajarillo, Edward Alain B; Balolong, Marilen P; Lee, Ji Yoon; Kang, Dae-Kyung
2016-06-28
In this study, the global proteome of the IPEC-J2 cell line was evaluated using ultra-high performance liquid chromatography coupled to a quadrupole Q Exactive™ Orbitrap mass spectrometer. Proteins were isolated from highly confluent IPEC-J2 cells in biological replicates and analyzed by label-free mass spectrometry prior to matching against a porcine genomic dataset. The results identified 1,517 proteins, accounting for 7.35% of all genes in the porcine genome. The highly abundant proteins detected, such as actin, annexin A2, and AHNAK nucleoprotein, are involved in structural integrity, signaling mechanisms, and cellular homeostasis. The high abundance of heat shock proteins indicated their significance in cellular defenses, barrier function, and gut homeostasis. Pathway analysis and annotation using the Kyoto Encyclopedia of Genes and Genomes database resulted in a putative protein network map of the regulation of immunological responses and structural integrity in the cell line. The comprehensive proteome analysis of IPEC-J2 cells provides fundamental insights into overall protein expression and pathway dynamics that might be useful in cell adhesion studies and immunological applications.
[Genome-wide identification and expression analysis of the WRKY gene family in peach].
Gu, Yan-bing; Ji, Zhi-rui; Chi, Fu-mei; Qiao, Zhuang; Xu, Cheng-nan; Zhang, Jun-xiang; Zhou, Zong-shan; Dong, Qing-long
2016-03-01
The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles.
Khare, Sangeeta; Lawhon, Sara D.; Drake, Kenneth L.; Nunes, Jairo E. S.; Figueiredo, Josely F.; Rossetti, Carlos A.; Gull, Tamara; Everts, Robin E.; Lewin, Harris A.; Galindo, Cristi L.; Garner, Harold R.; Adams, Leslie Garry
2012-01-01
Survival and persistence of Mycobacterium avium subsp. paratuberculosis (MAP) in the intestinal mucosa is associated with host immune tolerance. However, the initial events during MAP interaction with its host that lead to pathogen survival, granulomatous inflammation, and clinical disease progression are poorly defined. We hypothesize that immune tolerance is initiated upon initial contact of MAP with the intestinal Peyer's patch. To test our hypothesis, ligated ileal loops in neonatal calves were infected with MAP. Intestinal tissue RNAs were collected (0.5, 1, 2, 4, 8 and 12 hrs post-infection), processed, and hybridized to bovine gene expression microarrays. By comparing the gene transcription responses of calves infected with the MAP, informative complex patterns of expression were clearly visible. To interpret these complex data, changes in the gene expression were further analyzed by dynamic Bayesian analysis, and genes were grouped into the specific pathways and gene ontology categories to create a holistic model. This model revealed three different phases of responses: i) early (30 min and 1 hr post-infection), ii) intermediate (2, 4 and 8 hrs post-infection), and iii) late (12 hrs post-infection). We describe here the data that include expression profiles for perturbed pathways, as well as, mechanistic genes (genes predicted to have regulatory influence) that are associated with immune tolerance. In the Early Phase of MAP infection, multiple pathways were initiated in response to MAP invasion via receptor mediated endocytosis and changes in intestinal permeability. During the Intermediate Phase, perturbed pathways involved the inflammatory responses, cytokine-cytokine receptor interaction, and cell-cell signaling. During the Late Phase of infection, gene responses associated with immune tolerance were initiated at the level of T-cell signaling. Our study provides evidence that MAP infection resulted in differentially regulated genes, perturbed pathways and specifically modified mechanistic genes contributing to the colonization of Peyer's patch. PMID:22912686
Sharma, Akanksha; Sharma, Niharika; Bhalla, Prem; Singh, Mohan
2017-01-01
Comparative genomics have facilitated the mining of biological information from a genome sequence, through the detection of similarities and differences with genomes of closely or more distantly related species. By using such comparative approaches, knowledge can be transferred from the model to non-model organisms and insights can be gained in the structural and evolutionary patterns of specific genes. In the absence of sequenced genomes for allergenic grasses, this study was aimed at understanding the structure, organisation and expression profiles of grass pollen allergens using the genomic data from Brachypodium distachyon as it is phylogenetically related to the allergenic grasses. Combining genomic data with the anther RNA-Seq dataset revealed 24 pollen allergen genes belonging to eight allergen groups mapping on the five chromosomes in B. distachyon. High levels of anther-specific expression profiles were observed for the 24 identified putative allergen-encoding genes in Brachypodium. The genomic evidence suggests that gene encoding the group 5 allergen, the most potent trigger of hay fever and allergic asthma originated as a pollen specific orphan gene in a common grass ancestor of Brachypodium and Triticiae clades. Gene structure analysis showed that the putative allergen-encoding genes in Brachypodium either lack or contain reduced number of introns. Promoter analysis of the identified Brachypodium genes revealed the presence of specific cis-regulatory sequences likely responsible for high anther/pollen-specific expression. With the identification of putative allergen-encoding genes in Brachypodium, this study has also described some important plant gene families (e.g. expansin superfamily, EF-Hand family, profilins etc) for the first time in the model plant Brachypodium. Altogether, the present study provides new insights into structural characterization and evolution of pollen allergens and will further serve as a base for their functional characterization in related grass species.
NASA Astrophysics Data System (ADS)
Kikuchi, Shoshi
2009-02-01
Completion of the high-precision genome sequence analysis of rice led to the collection of about 35,000 full-length cDNA clones and the determination of their complete sequences. Mapping of these full-length cDNA sequences has given us information on (1) the number of genes expressed in the rice genome; (2) the start and end positions and exon-intron structures of rice genes; (3) alternative transcripts; (4) possible encoded proteins; (5) non-protein-coding (np) RNAs; (6) the density of gene localization on the chromosome; (7) setting the parameters of gene prediction programs; and (8) the construction of a microarray system that monitors global gene expression. Manual curation for rice gene annotation by using mapping information on full-length cDNA and EST assemblies has revealed about 32,000 expressed genes in the rice genome. Analysis of major gene families, such as those encoding membrane transport proteins (pumps, ion channels, and secondary transporters), along with the evolution from bacteria to higher animals and plants, reveals how gene numbers have increased through adaptation to circumstances. Family-based gene annotation also gives us a new way of comparing organisms. Massive amounts of data on gene expression under many kinds of physiological conditions are being accumulated in rice oligoarrays (22K and 44K) based on full-length cDNA sequences. Cluster analyses of genes that have the same promoter cis-elements, that have similar expression profiles, or that encode enzymes in the same metabolic pathways or signal transduction cascades give us clues to understanding the networks of gene expression in rice. As a tool for that purpose, we recently developed "RiCES", a tool for searching for cis-elements in the promoter regions of clustered genes.
Shao, Yafang; Jin, Liang; Zhang, Gan; Lu, Yan; Shen, Yun; Bao, Jinsong
2011-03-01
Phytochemicals such as phenolics and flavonoids in rice grain are antioxidants that are associated with reduced risk of developing chronic diseases including cardiovascular disease, type-2 diabetes and some cancers. Understanding the genetic basis of these traits is necessary for the improvement of nutritional quality by breeding. Association mapping based on linkage disequilibrium has emerged as a powerful strategy for identifying genes or quantitative trait loci (QTL) underlying complex traits in plants. In this study, genome-wide association mapping using models controlling both population structure (Q) and relative kinship (K) were performed to identify the marker loci/QTLs underlying the naturally occurring variations of grain color and nutritional quality traits in 416 rice germplasm accessions including red and black rice. A total of 41 marker loci were identified for all the traits, and it was confirmed that Ra (i.e., Prp-b for purple pericarp) and Rc (brown pericarp and seed coat) genes were main-effect loci for rice grain color and nutritional quality traits. RM228, RM339, fgr (fragrance gene) and RM316 were important markers associated with most of the traits. Association mapping for the traits of the 361 white or non-pigmented rice accessions (i.e., excluding the red and black rice) revealed a total of 11 markers for four color parameters, and one marker (RM346) for phenolic content. Among them, Wx gene locus was identified for the color parameters of lightness (L*), redness (a*) and hue angle (H (o)). Our study suggested that the markers identified in this study can feasibly be used to improve nutritional quality or health benefit properties of rice by marker-assisted selection if the co-segregations of the marker-trait associations are validated in segregating populations.
Global mapping of DNA conformational flexibility on Saccharomyces cerevisiae.
Menconi, Giulia; Bedini, Andrea; Barale, Roberto; Sbrana, Isabella
2015-04-01
In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3'UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3'-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites.
Global Mapping of DNA Conformational Flexibility on Saccharomyces cerevisiae
Menconi, Giulia; Bedini, Andrea; Barale, Roberto; Sbrana, Isabella
2015-01-01
In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3’UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3’-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites. PMID:25860149
Ganal, Martin W.; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S.; Charcosset, Alain; Clarke, Joseph D.; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D.; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C.; Falque, Matthieu
2011-01-01
SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding. PMID:22174790
Qian, Wei; Fan, Guiyan; Liu, Dandan; Zhang, Helong; Wang, Xiaowu; Wu, Jian; Xu, Zhaosheng
2017-04-04
Cultivated spinach (Spinacia oleracea L.) is one of the most widely cultivated types of leafy vegetable in the world, and it has a high nutritional value. Spinach is also an ideal plant for investigating the mechanism of sex determination because it is a dioecious species with separate male and female plants. Some reports on the sex labeling and localization of spinach in the study of molecular markers have surfaced. However, there have only been two reports completed on the genetic map of spinach. The lack of rich and reliable molecular markers and the shortage of high-density linkage maps are important constraints in spinach research work. In this study, a high-density genetic map of spinach based on the Specific-locus Amplified Fragment Sequencing (SLAF-seq) technique was constructed; the sex-determining gene was also finely mapped. Through bio-information analysis, 50.75 Gb of data in total was obtained, including 207.58 million paired-end reads. Finally, 145,456 high-quality SLAF markers were obtained, with 27,800 polymorphic markers and 4080 SLAF markers were finally mapped onto the genetic map after linkage analysis. The map spanned 1,125.97 cM with an average distance of 0.31 cM between the adjacent marker loci. It was divided into 6 linkage groups corresponding to the number of spinach chromosomes. Besides, the combination of Bulked Segregation Analysis (BSA) with SLAF-seq technology(super-BSA) was employed to generate the linkage markers with the sex-determining gene. Combined with the high-density genetic map of spinach, the sex-determining gene X/Y was located at the position of the linkage group (LG) 4 (66.98 cM-69.72 cM and 75.48 cM-92.96 cM), which may be the ideal region for the sex-determining gene. A high-density genetic map of spinach based on the SLAF-seq technique was constructed with a backcross (BC 1 ) population (which is the highest density genetic map of spinach reported at present). At the same time, the sex-determining gene X/Y was mapped to LG4 with super-BSA. This map will offer a suitable basis for further study of spinach, such as gene mapping, map-based cloning of Specific genes, quantitative trait locus (QTL) mapping and marker-assisted selection (MAS). It will also provide an efficient reference for studies on the mechanism of sex determination in other dioecious plants.
Lalucque, Hervé; Malagnac, Fabienne; Green, Kimberly; Gautier, Valérie; Grognet, Pierre; Chan Ho Tong, Laetitia; Scott, Barry; Silar, Philippe
2017-01-15
Filamentous ascomycetes produce complex multicellular structures during sexual reproduction. Little is known about the genetic pathways enabling the construction of such structures. Here, with a combination of classical and reverse genetic methods, as well as genetic mosaic and graft analyses, we identify and provide evidence for key roles for two genes during the formation of perithecia, the sexual fruiting bodies, of the filamentous fungus Podospora anserina. Data indicate that the proteins coded by these two genes function cell-non-autonomously and that their activity depends upon conserved cysteines, making them good candidate for being involved in the transmission of a reactive oxygen species (ROS) signal generated by the PaNox1 NADPH oxidase inside the maturing fruiting body towards the PaMpk1 MAP kinase, which is located inside the underlying mycelium, in which nutrients are stored. These data provide important new insights to our understanding of how fungi build multicellular structures. Copyright © 2016 Elsevier Inc. All rights reserved.
Mondal, Suvendu; Badigannavar, Anand M
2018-05-09
A consensus rust QTL was identified within a 1.25 cM map interval of A03 chromosome in cultivated peanut. This map interval contains a TIR-NB-LRR R gene and four pathogenesis-related genes. Disease resistance in plants is manifested due to the specific interaction between the R gene product and its cognate avirulence gene product (AVR) in the pathogen. Puccinia arachidis Speg. causes rust disease and inflicts economic damages to peanut. Till now, no experimental evidence is known for the action of R gene in peanut for rust resistance. A fine mapping approach towards the development of closely linked markers for rust resistance gene was undertaken in this study. Phenotyping of an RIL population at five environments for field rust score and subsequent QTL analysis has identified a 1.25 cM map interval that harbored a consensus major Rust_QTL in A03 chromosome. This Rust_QTL is flanked by two SSR markers: FRS72 and SSR_GO340445. Both the markers clearly identified strong association of the mapped region with rust reaction in both resistant and susceptible genotypes from a collection of 95 cultivated peanut germplasm. This 1.25 cM map interval contained 331.7 kb in the physical map of A. duranensis and had a TIR-NB-LRR category R gene (Aradu.Z87JB) and four glucan endo-1,3 β glucosidase genes (Aradu.RKA6 M, Aradu.T44NR, Aradu.IWV86 and Aradu.VG51Q). Another resistance gene analog was also found in the vicinity of mapped Rust_QTL. The sequence between SSR markers, FRS72 and FRS49, contains an LRR-PK (Aradu.JG217) which is equivalent to RHG4 in soybean. Probably, the protein kinase domain in AhRHG4 acts as an integrated decoy for the cognate AVR from Puccinia arachidis and helps the TIR-NB-LRR R-protein to initiate a controlled program cell death in resistant peanut plants.
Fingerprinting Soybean Germplasm and Its Utility in Genomic Research
Song, Qijian; Hyten, David L.; Jia, Gaofeng; Quigley, Charles V.; Fickus, Edward W.; Nelson, Randall L.; Cregan, Perry B.
2015-01-01
The United States Department of Agriculture, Soybean Germplasm Collection includes 18,480 domesticated soybean and 1168 wild soybean accessions introduced from 84 countries or developed in the United States. This collection was genotyped with the SoySNP50K BeadChip containing greater than 50K single-nucleotide polymorphisms. Redundant accessions were identified in the collection, and distinct genetic backgrounds of soybean from different geographic origins were observed that could be a unique resource for soybean genetic improvement. We detected a dramatic reduction of genetic diversity based on linkage disequilibrium and haplotype structure analyses of the wild, landrace, and North American cultivar populations and identified candidate regions associated with domestication and selection imposed by North American breeding. We constructed the first soybean haplotype block maps in the wild, landrace, and North American cultivar populations and observed that most recombination events occurred in the regions between haplotype blocks. These haplotype maps are crucial for association mapping aimed at the identification of genes controlling traits of economic importance. A case-control association test delimited potential genomic regions along seven chromosomes that most likely contain genes controlling seed weight in domesticated soybean. The resulting dataset will facilitate germplasm utilization, identification of genes controlling important traits, and will accelerate the creation of soybean varieties with improved seed yield and quality. PMID:26224783
Structure–function mapping of a heptameric module in the nuclear pore complex
Fernandez-Martinez, Javier; Phillips, Jeremy; Sekedat, Matthew D.; Diaz-Avalos, Ruben; Velazquez-Muriel, Javier; Franke, Josef D.; Williams, Rosemary; Stokes, David L.; Chait, Brian T.
2012-01-01
The nuclear pore complex (NPC) is a multiprotein assembly that serves as the sole mediator of nucleocytoplasmic exchange in eukaryotic cells. In this paper, we use an integrative approach to determine the structure of an essential component of the yeast NPC, the ∼600-kD heptameric Nup84 complex, to a precision of ∼1.5 nm. The configuration of the subunit structures was determined by satisfaction of spatial restraints derived from a diverse set of negative-stain electron microscopy and protein domain–mapping data. Phenotypic data were mapped onto the complex, allowing us to identify regions that stabilize the NPC’s interaction with the nuclear envelope membrane and connect the complex to the rest of the NPC. Our data allow us to suggest how the Nup84 complex is assembled into the NPC and propose a scenario for the evolution of the Nup84 complex through a series of gene duplication and loss events. This work demonstrates that integrative approaches based on low-resolution data of sufficient quality can generate functionally informative structures at intermediate resolution. PMID:22331846
Busslinger, M; Portmann, R; Irminger, J C; Birnstiel, M L
1980-01-01
The DNA sequences of the entire structural H4, H3, H2A and H2B genes and of their 5' flanking regions have been determined in the histone DNA clone h19 of the sea urchin Psammechinus miliaris. In clone h19 the polarity of transcription and the relative arrangement of the histone genes is identical to that in clone h22 of the same species. The histone proteins encoded by h19 DNA differ in their primary structure from those encoded by clone h22 and have been compared to histone protein sequences of other sea urchin species as well as other eukaryotes. A comparative analysis of the 5' flanking DNA sequences of the structural histone genes in both clones revealed four ubiquitous sequence motifs; a pentameric element GATCC, followed at short distance by the Hogness box GTATAAATAG, a conserved sequence PyCATTCPu, in or near which the 5' ends of the mRNAs map in h22 DNA and lastly a sequence A, containing the initiation codon. These sequences are also found, sometimes in modified version, in front of other eukaryotic genes transcribed by polymerase II. When prelude sequences of isocoding histone genes in clone h19 and h22 are compared areas of homology are seen to extend beyond the ubiquitous sequence motifs towards the divergent AT-rich spacer and terminate between approximately 140 and 240 nucleotides away from the structural gene. These prelude regions contain quite large conservative sequence blocks which are specific for each type of histone genes. Images PMID:7443547
Bakera, Beata; Makowska, Bogna; Groszyk, Jolanta; Niziołek, Michał; Orczyk, Wacław; Bolibok-Brągoszewska, Hanna; Hromada-Judycka, Aneta; Rakoczy-Trojanowska, Monika
2015-08-01
Benzoxazinoids (BX) are major secondary metabolites of gramineous plants that play an important role in disease resistance and allelopathy. They also have many other unique properties including anti-bacterial and anti-fungal activity, and the ability to reduce alfa-amylase activity. The biosynthesis and modification of BX are controlled by the genes Bx1 ÷ Bx10, GT and glu, and the majority of these Bx genes have been mapped in maize, wheat and rye. However, the genetic basis of BX biosynthesis remains largely uncharacterized apart from some data from maize and wheat. The aim of this study was to isolate, sequence and characterize five genes (ScBx1, ScBx2, ScBx3, ScBx4 and ScBx5) encoding enzymes involved in the synthesis of DIBOA, an important defense compound of rye. Using a modified 3D procedure of BAC library screening, seven BAC clones containing all of the ScBx genes were isolated and sequenced. Bioinformatic analyses of the resulting contigs were used to examine the structure and other features of these genes, including their promoters, introns and 3'UTRs. Comparative analysis showed that the ScBx genes are similar to those of other Poaceae species, especially to the TaBx genes. The polymorphisms present both in the coding sequences and non-coding regions of ScBx in relation to other Bx genes are predicted to have an impact on the expression, structure and properties of the encoded proteins.
Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar
2014-12-01
Lysophosphatidyl acyltransferase (LPAT) is one of the major triacylglycerol synthesis enzymes, controlling the metabolic flow of lysophosphatidic acid to phosphatidic acid. Experimental studies in Arabidopsis have shown that LPAT activity is exhibited primarily by three distinct isoforms, namely the plastid-located LPAT1, the endoplasmic reticulum-located LPAT2, and the soluble isoform of LPAT (solLPAT). In this study, 24 putative genes representing all LPAT isoforms were identified from the analysis of 11 complete genomes including green algae, red algae, diatoms and higher plants. We observed LPAT1 and solLPAT genes to be ubiquitously present in nearly all genomes examined, whereas LPAT2 genes to have evolved more recently in the plant lineage. Phylogenetic analysis indicated that LPAT1, LPAT2 and solLPAT have convergently evolved through separate evolutionary paths and belong to three different gene families, which was further evidenced by their wide divergence at gene structure and sequence level. The genome distribution supports the hypothesis that each gene encoding a LPAT is not duplicated. Mapping of exon-intron structure of LPAT genes to the domain structure of proteins across different algal and plant species indicates that exon shuffling plays no role in the evolution of LPAT genes. Besides the previously defined motifs, several conserved consensus sequences were discovered which could be useful to distinguish different LPAT isoforms. Taken together, this study will enable the generation of experimental approximations to better understand the functional role of algal LPAT in lipid accumulation.
Hallerman, E M; Nave, A; Kashi, Y; Holzer, Z; Soller, M; Beckmann, J S
1987-01-01
Two bovine populations, a Holstein-Friesian dairy stock and a synthetic (Baladi X Hereford X Simmental X Charolais) beef stock, were screened for restriction fragment length polymorphisms (RFLPs) at the growth hormone and prolactin genes. Most RFLPs at the growth hormone gene are apparently the consequence of an insertion/deletion event which was localized to a region downstream of the structural gene. The restriction map for the genomic region including the growth hormone gene was extended. Two HindIII RFLPs at the growth hormone locus, as well as several RFLPs at the prolactin gene, seemed to be the consequence of a series of point mutations. The results are discussed in terms of the possibility that minor genomic variability underlies quantitative genetic variation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martinson, J.J.; Clegg, J.B.; Boyce, A.J.
1994-09-01
Analysis of the {alpha}-globin gene complex in Oceania has revealed many different rearrangements which remove one of the adult globin genes. Frequencies of these deletion chromosomes are elevated by malarial resistance conferred by the resulting {alpha}-thalassaemia. One particular deletion chromosome, designated -{alpha}{sup 3.7}III, is found at high levels in Melanesia and Polynesia: RFLP haplotype analysis shows that this deletion is always found on chromosomes bearing the IIIa haplotype and is likely to be the product of one single rearrangement event. A subset of the -{alpha}{sup 3.7}III chromosomes carries a more recent mutation which generates the haemoglobin variant HbJ{sup Tongariki}. Wemore » have characterized the allelic variation at the 3{prime}HVR VNTR locus located 6 kb from the globin genes in each of these groups of chromosomes. We have determined the internal structure of these alleles by RFLP mapping of PCR-amplified DNA: within each group, the allelic diversity results from the insertion and/or deletion of small {open_quotes}motifs{close_quotes} of up to 6 adjacent repeats. Mapping of 3{prime}HVR alleles associated with other haplotypes reveals that these are composed of repeat arrays that are substantially different to those derived from IIIa chromosomes, indicating that interchromosomal recombination between heterologous haplotypes does not account for any of the diversity seen to date. We have recently shown that allelic size variation at the two VNTR loci flanking the {alpha}-globin complex is very closely linked to the haplotypes known to be present at this locus. Here we show that, within a haplotype, VNTR alleles are very closely related to each other on the basis of internal structure and demonstrate that intrachromosomal mutation processes involving small numbers of tandem repeats are the main cause of variation at this locus.« less
Spielmann, A; Stutz, E
1983-10-25
The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.
QuickMap: a public tool for large-scale gene therapy vector insertion site mapping and analysis.
Appelt, J-U; Giordano, F A; Ecker, M; Roeder, I; Grund, N; Hotz-Wagenblatt, A; Opelz, G; Zeller, W J; Allgayer, H; Fruehauf, S; Laufs, S
2009-07-01
Several events of insertional mutagenesis in pre-clinical and clinical gene therapy studies have created intense interest in assessing the genomic insertion profiles of gene therapy vectors. For the construction of such profiles, vector-flanking sequences detected by inverse PCR, linear amplification-mediated-PCR or ligation-mediated-PCR need to be mapped to the host cell's genome and compared to a reference set. Although remarkable progress has been achieved in mapping gene therapy vector insertion sites, public reference sets are lacking, as are the possibilities to quickly detect non-random patterns in experimental data. We developed a tool termed QuickMap, which uniformly maps and analyzes human and murine vector-flanking sequences within seconds (available at www.gtsg.org). Besides information about hits in chromosomes and fragile sites, QuickMap automatically determines insertion frequencies in +/- 250 kb adjacency to genes, cancer genes, pseudogenes, transcription factor and (post-transcriptional) miRNA binding sites, CpG islands and repetitive elements (short interspersed nuclear elements (SINE), long interspersed nuclear elements (LINE), Type II elements and LTR elements). Additionally, all experimental frequencies are compared with the data obtained from a reference set, containing 1 000 000 random integrations ('random set'). Thus, for the first time a tool allowing high-throughput profiling of gene therapy vector insertion sites is available. It provides a basis for large-scale insertion site analyses, which is now urgently needed to discover novel gene therapy vectors with 'safe' insertion profiles.
Lee, M H; Hazard, S; Carpten, J D; Yi, S; Cohen, J; Gerhardt, G T; Salen, G; Patel, S B
2001-02-01
Cerebrotendinous xanthomatosis (CTX) is a rare autosomal recessive disorder of bile acid biosynthesis. Clinically, CTX patients present with tendon xanthomas, juvenile cataracts, and progressive neurological dysfunction and can be diagnosed by the detection of elevated plasma cholestanol levels. CTX is caused by mutations affecting the sterol 27-hydroxylase gene (CYP27 ). CTX has been identified in a number of populations, but seems to have a higher prevalence in the Japanese, Sephardic Jewish, and Italian populations. We have assembled 12 previously unreported pedigrees from the United States. The CYP27 locus had been previously mapped to chromosome 2q33-qter. We performed linkage analyses and found no evidence of genetic heterogeneity. All CTX patients showed segregation with the CYP27 locus, and haplotype analysis and recombinant events allowed us to precisely map CYP27 to chromosome 2q35, between markers D2S1371 and D2S424. Twenty-three mutations were identified from 13 probands analyzed thus far; 11 were compound heterozygotes and 2 had homozygous mutations. Of these, five are novel mutations [Trp100Stop, Pro408Ser, Gln428Stop, a 10-base pair (bp) deletion in exon 1, and a 2-bp deletion in exon 6 of the CYP27 gene]. Three-dimensional structural modeling of sterol 27-hydroxylase showed that, while the majority of the missense mutations disrupt the heme-binding and adrenodoxin-binding domains critical for enzyme activity, two missense mutations (Arg94Trp/Gln and Lys226Arg) are clearly located outside these sites and may identify a potential substrate-binding or other protein contact site.
Miller, Marcia M.; Taylor, Robert L.
2016-01-01
Nearly all genes presently mapped to chicken chromosome 16 (GGA 16) have either a demonstrated role in immune responses or are considered to serve in immunity by reason of sequence homology with immune system genes defined in other species. The genes are best described in regional units. Among these, the best known is the polymorphic major histocompatibility complex-B (MHC-B) region containing genes for classical peptide antigen presentation. Nearby MHC-B is a small region containing two CD1 genes, which encode molecules known to bind lipid antigens and which will likely be found in chickens to present lipids to specialized T cells, as occurs with CD1 molecules in other species. Another region is the MHC-Y region, separated from MHC-B by an intervening region of tandem repeats. Like MHC-B, MHC-Y is polymorphic. It contains specialized class I and class II genes and c-type lectin-like genes. Yet another region, separated from MHC-Y by the single nucleolar organizing region (NOR) in the chicken genome, contains olfactory receptor genes and scavenger receptor genes, which are also thought to contribute to immunity. The structure, distribution, linkages and patterns of polymorphism in these regions, suggest GGA 16 evolves as a microchromosome devoted to immune defense. Many GGA 16 genes are polymorphic and polygenic. At the moment most disease associations are at the haplotype level. Roles of individual MHC genes in disease resistance are documented in only a very few instances. Provided suitable experimental stocks persist, the availability of increasingly detailed maps of GGA 16 genes combined with new means for detecting genetic variability will lead to investigations defining the contributions of individual loci and more applications for immunogenetics in breeding healthy poultry. PMID:26740135
Bhattarai, Dinesh; Chen, Xing; Ur Rehman, Zia; Hao, Xingjie; Ullah, Farman; Dad, Rahim; Talpur, Hira Sajjad; Kadariya, Ishwari; Cui, Lu; Fan, Mingxia; Zhang, Shujun
2017-02-01
The objective of the studies presented in this Research Communication was to investigate the association of single nucleotide polymorphisms present in the MAP4K4 gene with different milk traits in dairy cows. Based on previous QTL fine mapping results on bovine chromosome 11, the MAP4K4 gene was selected as a candidate gene to evaluate its effect on somatic cell count and milk traits in ChineseHolstein cows. Milk production traits including milk yield, fat percentage, and protein percentage of each cow were collected using 305 d lactation records. Association between MAP4K4 genotype and different traits and Somatic Cell Score (SCS) was performed using General Linear Regression Model of R. Two SNPs at exon 18 (c.2061T > G and c.2196T > C) with genotype TT in both SNPs were found significantly higher for somatic SCS. We found the significant effect of exon 18 (c.2061T > G) on protein percentage, milk yield and SCS. We identified SNPs at different location of MAP4K4 gene of the cattle and several of them were significantly associated with the somatic cell score and other different milk traits. Thus, MAP4K4 gene could be a useful candidate gene for selection of dairy cattle against mastitis and the identified polymorphisms might potentially be strong genetic markers.
Tlapakova, Tereza; Krylov, Vladimir; Macha, Jaroslav
2005-01-01
Two paralogous mitochondrial malate dehydrogenase 2 (Mdh2) genes of Xenopus laevis have been cloned and sequenced, revealing 95% identity. Fluorescence in-situ hybridization (FISH) combined with tyramide amplification discriminates both genes; Mdh2a was localized into chromosome q3 and Mdh2b into chromosome q8. One kb cDNA probes detect both genes with 85% accuracy. The remaining signals were on the paralogous counterpart. Introns interrupt coding sequences at the same nucleotide as defined for mouse. Restriction polymorphism has been detected in the first intron of Mdh2a, while the individual variability in intron 6 of Mdh2b gene is represented by an insertion of incomplete retrotransposon L1Xl. Rates of nucleotide substitutions indicate that both genes are under similar evolutionary constraints. X. laevis Mdh2 genes can be used as markers for physical mapping and linkage analysis.
Cain-Hom, Carol; Splinter, Erik; van Min, Max; Simonis, Marieke; van de Heijning, Monique; Martinez, Maria; Asghari, Vida; Cox, J Colin; Warming, Søren
2017-05-05
Cre/LoxP technology is widely used in the field of mouse genetics for spatial and/or temporal regulation of gene function. For Cre lines generated via pronuclear microinjection of a Cre transgene construct, the integration site is random and in most cases not known. Integration of a transgene can disrupt an endogenous gene, potentially interfering with interpretation of the phenotype. In addition, knowledge of where the transgene is integrated is important for planning of crosses between animals carrying a conditional allele and a given Cre allele in case the alleles are on the same chromosome. We have used targeted locus amplification (TLA) to efficiently map the transgene location in seven previously published Cre and CreERT2 transgenic lines. In all lines, transgene insertion was associated with structural changes of variable complexity, illustrating the importance of testing for rearrangements around the integration site. In all seven lines the exact integration site and breakpoint sequences were identified. Our methods, data and genotyping assays can be used as a resource for the mouse community and our results illustrate the power of the TLA method to not only efficiently map the integration site of any transgene, but also provide additional information regarding the transgene integration events. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
A high resolution spatiotemporal atlas of gene expression of the developing mouse brain
Thompson, Carol L.; Ng, Lydia; Menon, Vilas; Martinez, Salvador; Lee, Chang-Kyu; Glattfelder, Katie; Sunkin, Susan M.; Henry, Alex; Lau, Christopher; Dang, Chinh; Garcia-Lopez, Raquel; Martinez-Ferre, Almudena; Pombero, Ana; Rubenstein, John L.R.; Wakeman, Wayne B.; Hohmann, John; Dee, Nick; Sodt, Andrew J.; Young, Rob; Smith, Kimberly; Nguyen, Thuc-Nghi; Kidney, Jolene; Kuan, Leonard; Jeromin, Andreas; Kaykas, Ajamete; Miller, Jeremy; Page, Damon; Orta, Geri; Bernard, Amy; Riley, Zackery; Smith, Simon; Wohnoutka, Paul; Hawrylycz, Mike; Puelles, Luis; Jones, Allan R.
2015-01-01
SUMMARY To provide a temporal framework for the genoarchitecture of brain development, in situ hybridization data were generated for embryonic and postnatal mouse brain at 7 developmental stages for ~2100 genes, processed with an automated informatics pipeline and manually annotated. This resource comprises 434,946 images, 7 reference atlases, an ontogenetic ontology, and tools to explore co-expression of genes across neurodevelopment. Gene sets coinciding with developmental phenomena were identified. A temporal shift in the principles governing the molecular organization of the brain was detected, with transient neuromeric, plate-based organization of the brain present at E11.5 and E13.5. Finally, these data provided a transcription factor code that discriminates brain structures and identifies the developmental age of a tissue, providing a foundation for eventual genetic manipulation or tracking of specific brain structures over development. The resource is available as the Allen Developing Mouse Brain Atlas (developingmouse.brain-map.org). PMID:24952961
Phage phenomics: Physiological approaches to characterize novel viral proteins
Sanchez, Savannah E. [San Diego State Univ., San Diego, CA (United States); Cuevas, Daniel A. [San Diego State Univ., San Diego, CA (United States); Rostron, Jason E. [San Diego State Univ., San Diego, CA (United States); Liang, Tiffany Y. [San Diego State Univ., San Diego, CA (United States); Pivaroff, Cullen G. [San Diego State Univ., San Diego, CA (United States); Haynes, Matthew R. [San Diego State Univ., San Diego, CA (United States); Nulton, Jim [San Diego State Univ., San Diego, CA (United States); Felts, Ben [San Diego State Univ., San Diego, CA (United States); Bailey, Barbara A. [San Diego State Univ., San Diego, CA (United States); Salamon, Peter [San Diego State Univ., San Diego, CA (United States); Edwards, Robert A. [San Diego State Univ., San Diego, CA (United States); Argonne National Lab. (ANL), Argonne, IL (United States); Burgin, Alex B. [Broad Institute, Cambridge, MA (United States); Segall, Anca M. [San Diego State Univ., San Diego, CA (United States); Rohwer, Forest [San Diego State Univ., San Diego, CA (United States)
2018-06-21
Current investigations into phage-host interactions are dependent on extrapolating knowledge from (meta)genomes. Interestingly, 60 - 95% of all phage sequences share no homology to current annotated proteins. As a result, a large proportion of phage genes are annotated as hypothetical. This reality heavily affects the annotation of both structural and auxiliary metabolic genes. Here we present phenomic methods designed to capture the physiological response(s) of a selected host during expression of one of these unknown phage genes. Multi-phenotype Assay Plates (MAPs) are used to monitor the diversity of host substrate utilization and subsequent biomass formation, while metabolomics provides bi-product analysis by monitoring metabolite abundance and diversity. Both tools are used simultaneously to provide a phenotypic profile associated with expression of a single putative phage open reading frame (ORF). Thus, representative results for both methods are compared, highlighting the phenotypic profile differences of a host carrying either putative structural or metabolic phage genes. In addition, the visualization techniques and high throughput computational pipelines that facilitated experimental analysis are presented.
Evolving phenotypic networks in silico.
François, Paul
2014-11-01
Evolved gene networks are constrained by natural selection. Their structures and functions are consequently far from being random, as exemplified by the multiple instances of parallel/convergent evolution. One can thus ask if features of actual gene networks can be recovered from evolutionary first principles. I review a method for in silico evolution of small models of gene networks aiming at performing predefined biological functions. I summarize the current implementation of the algorithm, insisting on the construction of a proper "fitness" function. I illustrate the approach on three examples: biochemical adaptation, ligand discrimination and vertebrate segmentation (somitogenesis). While the structure of the evolved networks is variable, dynamics of our evolved networks are usually constrained and present many similar features to actual gene networks, including properties that were not explicitly selected for. In silico evolution can thus be used to predict biological behaviours without a detailed knowledge of the mapping between genotype and phenotype. Copyright © 2014 The Author. Published by Elsevier Ltd.. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sanchez, Savannah E.; Cuevas, Daniel A.; Rostron, Jason E.
Current investigations into phage-host interactions are dependent on extrapolating knowledge from (meta)genomes. Interestingly, 60 - 95% of all phage sequences share no homology to current annotated proteins. As a result, a large proportion of phage genes are annotated as hypothetical. This reality heavily affects the annotation of both structural and auxiliary metabolic genes. Here we present phenomic methods designed to capture the physiological response(s) of a selected host during expression of one of these unknown phage genes. Multi-phenotype Assay Plates (MAPs) are used to monitor the diversity of host substrate utilization and subsequent biomass formation, while metabolomics provides bi-product analysismore » by monitoring metabolite abundance and diversity. Both tools are used simultaneously to provide a phenotypic profile associated with expression of a single putative phage open reading frame (ORF). Thus, representative results for both methods are compared, highlighting the phenotypic profile differences of a host carrying either putative structural or metabolic phage genes. In addition, the visualization techniques and high throughput computational pipelines that facilitated experimental analysis are presented.« less
Andres, Ryan J; Bowman, Daryl T; Kaur, Baljinder; Kuraparthy, Vasu
2014-01-01
A major leaf shape locus (L) was mapped with molecular markers and genomically targeted to a small region in the D-genome of cotton. By using expression analysis and candidate gene mapping, two LMI1 -like genes are identified as possible candidates for leaf shape trait in cotton. Leaf shape in cotton is an important trait that influences yield, flowering rates, disease resistance, lint trash, and the efficacy of foliar chemical application. The leaves of okra leaf cotton display a significantly enhanced lobing pattern, as well as ectopic outgrowths along the lobe margins when compared with normal leaf cotton. These phenotypes are the hallmark characteristics of mutations in various known modifiers of leaf shape that culminate in the mis/over-expression of Class I KNOX genes. To better understand the molecular and genetic processes underlying leaf shape in cotton, a normal leaf accession (PI607650) was crossed to an okra leaf breeding line (NC05AZ21). An F2 population of 236 individuals confirmed the incompletely dominant single gene nature of the okra leaf shape trait in Gossypium hirsutum L. Molecular mapping with simple sequence repeat markers localized the leaf shape gene to 5.4 cM interval in the distal region of the short arm of chromosome 15. Orthologous mapping of the closely linked markers with the sequenced diploid D-genome (Gossypium raimondii) tentatively resolved the leaf shape locus to a small genomic region. RT-PCR-based expression analysis and candidate gene mapping indicated that the okra leaf shape gene (L (o) ) in cotton might be an upstream regulator of Class I KNOX genes. The linked molecular markers and delineated genomic region in the sequenced diploid D-genome will assist in the future high-resolution mapping and map-based cloning of the leaf shape gene in cotton.
Fine-mapping of qGW4.05, a major QTL for kernel weight and size in maize.
Chen, Lin; Li, Yong-xiang; Li, Chunhui; Wu, Xun; Qin, Weiwei; Li, Xin; Jiao, Fuchao; Zhang, Xiaojing; Zhang, Dengfeng; Shi, Yunsu; Song, Yanchun; Li, Yu; Wang, Tianyu
2016-04-12
Kernel weight and size are important components of grain yield in cereals. Although some information is available concerning the map positions of quantitative trait loci (QTL) for kernel weight and size in maize, little is known about the molecular mechanisms of these QTLs. qGW4.05 is a major QTL that is associated with kernel weight and size in maize. We combined linkage analysis and association mapping to fine-map and identify candidate gene(s) at qGW4.05. QTL qGW4.05 was fine-mapped to a 279.6-kb interval in a segregating population derived from a cross of Huangzaosi with LV28. By combining the results of regional association mapping and linkage analysis, we identified GRMZM2G039934 as a candidate gene responsible for qGW4.05. Candidate gene-based association mapping was conducted using a panel of 184 inbred lines with variable kernel weights and kernel sizes. Six polymorphic sites in the gene GRMZM2G039934 were significantly associated with kernel weight and kernel size. The results of linkage analysis and association mapping revealed that GRMZM2G039934 is the most likely candidate gene for qGW4.05. These results will improve our understanding of the genetic architecture and molecular mechanisms underlying kernel development in maize.
2012-01-01
Background Cultivated peanut or groundnut (Arachis hypogaea L.) is an important oilseed crop with an allotetraploid genome (AABB, 2n = 4x = 40). Both the low level of genetic variation within the cultivated gene pool and its polyploid nature limit the utilization of molecular markers to explore genome structure and facilitate genetic improvement. Nevertheless, a wealth of genetic diversity exists in diploid Arachis species (2n = 2x = 20), which represent a valuable gene pool for cultivated peanut improvement. Interspecific populations have been used widely for genetic mapping in diploid species of Arachis. However, an intraspecific mapping strategy was essential to detect chromosomal rearrangements among species that could be obscured by mapping in interspecific populations. To develop intraspecific reference linkage maps and gain insights into karyotypic evolution within the genus, we comparatively mapped the A- and B-genome diploid species using intraspecific F2 populations. Exploring genome organization among diploid peanut species by comparative mapping will enhance our understanding of the cultivated tetraploid peanut genome. Moreover, new sources of molecular markers that are highly transferable between species and developed from expressed genes will be required to construct saturated genetic maps for peanut. Results A total of 2,138 EST-SSR (expressed sequence tag-simple sequence repeat) markers were developed by mining a tetraploid peanut EST assembly including 101,132 unigenes (37,916 contigs and 63,216 singletons) derived from 70,771 long-read (Sanger) and 270,957 short-read (454) sequences. A set of 97 SSR markers were also developed by mining 9,517 genomic survey sequences of Arachis. An SSR-based intraspecific linkage map was constructed using an F2 population derived from a cross between K 9484 (PI 298639) and GKBSPSc 30081 (PI 468327) in the B-genome species A. batizocoi. A high degree of macrosynteny was observed when comparing the homoeologous linkage groups between A (A. duranensis) and B (A. batizocoi) genomes. Comparison of the A- and B-genome genetic linkage maps also showed a total of five inversions and one major reciprocal translocation between two pairs of chromosomes under our current mapping resolution. Conclusions Our findings will contribute to understanding tetraploid peanut genome origin and evolution and eventually promote its genetic improvement. The newly developed EST-SSR markers will enrich current molecular marker resources in peanut. PMID:23140574
Wang, Xiaohong; Zheng, Zhi-Ming
2016-01-01
Papillomaviruses are a family of small, non-enveloped DNA tumor viruses. Knowing a complete transcription map from each papillomavirus genome can provide guidance for various papillomavirus studies. This unit provides detailed protocols to construct a transcription map of human papillomavirus type 18. The same approach can be easily adapted to other transcription map studies of any other papillomavirus genotype due to the high degree of conservation in the genome structure, organization and gene expression among papillomaviruses. The focused methods are 5’- and 3’- rapid amplification of cDNA ends (RACE), which are the techniques commonly used in molecular biology to obtain the full length RNA transcript or to map a transcription start site (TSS) or an RNA polyadenylation (pA) cleavage site. Primer walking RT-PCR is a method for studying splicing junction of RACE products. In addition, RNase protection assay and primer extension are also introduced as alternative methods in the mapping analysis. PMID:26855281
NASA Astrophysics Data System (ADS)
Gibbs, Holly C.; Dodson, Colin R.; Bai, Yuqiang; Lekven, Arne C.; Yeh, Alvin T.
2014-12-01
During embryogenesis, presumptive brain compartments are patterned by dynamic networks of gene expression. The spatiotemporal dynamics of these networks, however, have not been characterized with sufficient resolution for us to understand the regulatory logic resulting in morphogenetic cellular behaviors that give the brain its shape. We have developed a new, integrated approach using ultrashort pulse microscopy [a high-resolution, two-photon fluorescence (2PF)-optical coherence microscopy (OCM) platform using 10-fs pulses] and image registration to study brain patterning and morphogenesis in zebrafish embryos. As a demonstration, we used time-lapse 2PF to capture midbrain-hindbrain boundary morphogenesis and a wnt1 lineage map from embryos during brain segmentation. We then performed in situ hybridization to deposit NBT/BCIP, where wnt1 remained actively expressed, and reimaged the embryos with combined 2PF-OCM. When we merged these datasets using morphological landmark registration, we found that the mechanism of boundary formation differs along the dorsoventral axis. Dorsally, boundary sharpening is dominated by changes in gene expression, while ventrally, sharpening may be accomplished by lineage sorting. We conclude that the integrated visualization of lineage reporter and gene expression domains simultaneously with brain morphology will be useful for understanding how changes in gene expression give rise to proper brain compartmentalization and structure.
Gibbs, Holly C; Dodson, Colin R; Bai, Yuqiang; Lekven, Arne C; Yeh, Alvin T
2014-12-01
During embryogenesis, presumptive brain compartments are patterned by dynamic networks of gene expression. The spatiotemporal dynamics of these networks, however, have not been characterized with sufficient resolution for us to understand the regulatory logic resulting in morphogenetic cellular behaviors that give the brain its shape. We have developed a new, integrated approach using ultrashort pulse microscopy [a high-resolution, two-photon fluorescence (2PF)-optical coherence microscopy (OCM) platform using 10-fs pulses] and image registration to study brain patterning and morphogenesis in zebrafish embryos. As a demonstration, we used time-lapse 2PF to capture midbrain-hindbrain boundary morphogenesis and a wnt1 lineage map from embryos during brain segmentation. We then performed in situ hybridization to deposit NBT/BCIP, where wnt1 remained actively expressed, and reimaged the embryos with combined 2PF-OCM. When we merged these datasets using morphological landmark registration, we found that the mechanism of boundary formation differs along the dorsoventral axis. Dorsally, boundary sharpening is dominated by changes in gene expression, while ventrally, sharpening may be accomplished by lineage sorting. We conclude that the integrated visualization of lineage reporter and gene expression domains simultaneously with brain morphology will be useful for understanding how changes in gene expression give rise to proper brain compartmentalization and structure.
Fluorescent in situ hybridisation to amphioxus chromosomes.
Castro, Luis Filipe Costa; Holland, Peter William Harold
2002-12-01
We describe an efficient protocol for mapping genes and other DNA sequences to amphioxus chromosomes using fluorescent in situ hybridisation. We apply this method to identify the number and location of ribosomal DNA gene clusters and telomere sequences in metaphase spreads of Branchiostoma floridae. We also describe how the locations of two single copy genes can be mapped relative to each other, and demonstrate this by mapping an amphioxus Pax gene relative to a homologue of the Notch gene. These methods have great potential for performing comparative genomics between amphioxus and vertebrates.
Resistance genes in barley (Hordeum vulgare L.) and their identification with molecular markers.
Chełkowski, Jerzy; Tyrka, Mirosław; Sobkiewicz, Andrzej
2003-01-01
Current information on barley resistance genes available from scientific papers and on-line databases is summarised. The recent literature contains information on 107 major resistance genes (R genes) against fungal pathogens (excluding powdery mildew), pathogenic viruses and aphids identified in Hordeum vulgare accessions. The highest number of resistance genes was identified against Puccinia hordei, Rhynchosporium secalis, and the viruses BaYMV and BaMMV, with 17, 14 and 13 genes respectively. There is still a lot of confusion regarding symbols for R genes against powdery mildew. Among the 23 loci described to date, two regions Mla and Mlo comprise approximately 31 and 25 alleles. Over 50 R genes have already been localised and over 30 mapped on 7 barley chromosomes. Four barley R genes have been cloned recently: Mlo, Rpg1, Mla1 and Mla6, and their structures (sequences) are available. The paper presents a catalogue of barley resistance gene symbols, their chromosomalocation and the list of available DNA markers useful in characterising cultivars and breeding accessions.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, C.H.; Wei, Li-Na; Copeland, N.G.
We have isolated and characterized overlapping genomic clones containing the complete transcribed region of a newly isolated mouse cDNA encoding an orphan receptor expressed specifically in midgestation embryos and adult testis. This gene spans a distance of more than 50 kb and is organized into 13 exons. The transcription initiation site is located at the 158th nucleotide upstream from the translation initiation codon. All the exon/intron junction sequences follow the GT/AG rule. Based upon Northern blot analysis and the size of the transcribed region of the gene, its transcript was determined to be approximately 2.5 kb. Within approximately 500 hpmore » upstream from the transcription initiation site, several immune response regulatory elements were identified but no TATA box was located. This gene was mapped to the distal region of mouse chromosome 10 and its locus has been designated Tr2-11. Immunohistochemical studies show that the Tr2-11 protein is present mainly in advanced germ cell populations of mature testes and that Tr2-11 gene expression is dramatically decreased in vitamin A-depleted animals. 23 refs., 7 figs.« less
Gao, Shi Gang; Zhou, Fei Hong; Liu, Tong; Li, Ying Ying; Chen, Jie
2013-03-01
Mitogen-activated protein kinase (MAPK) cascades are highly conserved signal transduction pathways, which play a wide variety of important roles in extracellular signal transduction. The first MAPK gene of the maize pathogen Curvularia lunata, Clk1, was isolated via a PCR-based approach with a primer pair designed on the basis of conserved regions of known MAPKs. Southern blot analysis showed that the gene existed in the genome as a single copy. The predicted amino acid sequence (352 amino acids) was highly homologous with MAP kinases of other phytopathogenic fungi. Flanking regions of Clk1 were obtained through RACE and genomic walking technology. To understand the role of Clk1 in C. lunata, targeted gene disruption was adopted to construct Clk1 mutants. It was found that mutants lacking functional domain of Clk1 were not able to produce conidia but tended to form a few special chlamydospore-shaped structures. Clk1 mutants grew slower in adverse environments (at 24°C), produced less cell degrading enzymes (CWDEs) than the wild type, and they were almost unable to infect maize leaves via artificial wounds. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Chromosomal localization of actin genes in the malaria mosquito Anopheles darlingi
BRIDI, L. C.; SHARAKHOVA, M. V.; SHARAKHOV, I. V.; CORDEIRO, J.; AZEVEDO, G. M.; TADEI, W. P.; RAFAEL, M. S.
2012-01-01
Physical and genetic maps have been used for chromosomal localization of genes in vectors of infectious diseases. The availability of polytene chromosomes in malaria mosquitoes provides a unique opportunity to precisely map genes of interest. We report physical mapping of two actin genes on polytene chromosomes of the major malaria vector in Amazon Anopheles darlingi. The clones with the actin genes sequences were obtained from a cDNA library constructed from RNA isolated from adult females and males of An. darlingi. Each of the two clones was mapped to a unique site on the chromosomal arm 2L in subdivisions 21A (clone pl05-A04) and 23B (clone pl17-G06). The obtained results together with previous mapping data provide a suitable basis for comparative genomics and for establishing chromosomal homologies among major malaria vectors. PMID:22804344
RatMap—rat genome tools and data
Petersen, Greta; Johnson, Per; Andersson, Lars; Klinga-Levan, Karin; Gómez-Fabre, Pedro M.; Ståhl, Fredrik
2005-01-01
The rat genome database RatMap (http://ratmap.org or http://ratmap.gen.gu.se) has been one of the main resources for rat genome information since 1994. The database is maintained by CMB–Genetics at Göteborg University in Sweden and provides information on rat genes, polymorphic rat DNA-markers and rat quantitative trait loci (QTLs), all curated at RatMap. The database is under the supervision of the Rat Gene and Nomenclature Committee (RGNC); thus much attention is paid to rat gene nomenclature. RatMap presents information on rat idiograms, karyotypes and provides a unified presentation of the rat genome sequence and integrated rat linkage maps. A set of tools is also available to facilitate the identification and characterization of rat QTLs, as well as the estimation of exon/intron number and sizes in individual rat genes. Furthermore, comparative gene maps of rat in regard to mouse and human are provided. PMID:15608244
DOE Office of Scientific and Technical Information (OSTI.GOV)
Plomp, M; Malkin, A J
2008-06-02
Atomic force microscopy provides a unique capability to image high-resolution architecture and structural dynamics of pathogens (e.g. viruses, bacteria and bacterial spores) at near molecular resolution in native conditions. Further development of atomic force microscopy in order to enable the correlation of pathogen protein surface structures with specific gene products is essential to understand the mechanisms of the pathogen life cycle. We have applied an AFM-based immunolabeling technique for the proteomic mapping of macromolecular structures through the visualization of the binding of antibodies, conjugated with nanogold particles, to specific epitopes on Bacillus spore surfaces. This information is generated while simultaneouslymore » acquiring the surface morphology of the pathogen. The immunospecificity of this labeling method was established through the utilization of specific polyclonal and monoclonal antibodies that target spore coat and exosporium epitopes of Bacillus atrophaeus and Bacillus anthracis spores.« less
Structural genes for thiamine biosynthetic enzymes (thiCEFGH) in Escherichia coli K-12.
Vander Horn, P B; Backstrom, A D; Stewart, V; Begley, T P
1993-01-01
Escherichia coli K-12 synthesizes thiamine pyrophosphate (vitamin B1) de novo. Two precursors [4-methyl-5-(beta-hydroxyethyl)thiazole monophosphate and 4-amino-5-hydroxymethyl-2-methylpyrimidine pyrophosphate] are coupled to form thiamine monophosphate, which is then phosphorylated to make thiamine pyrophosphate. Previous studies have identified two classes of thi mutations, clustered at 90 min on the genetic map, which result in requirements for the thiazole or the hydroxymethylpryimidine. We report here our initial molecular genetic analysis of the thi cluster. We cloned the thi cluster genes and examined their organization, structure, and function by a combination of phenotypic testing, complementation analysis, polypeptide expression, and DNA sequencing. We found five tightly linked genes, designated thiCEFGH. The thiC gene product is required for the synthesis of the hydroxymethylpyrimidine. The thiE, thiF, thiG, and thiH gene products are required for synthesis of the thiazole. These mutants did not respond to 1-deoxy-D-threo-2-pentulose, indicating that they are blocked in the conversion of this precursor compound to the thiazole itself. Images PMID:8432721
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sankar, P.; Lee, J.H.; Shanmugam, K.T.
1985-04-01
Escherichia coli has two unlinked genes that code for hydrogenase synthesis and activity. The DNA fragments containing the two genes (hydA and hydB) were cloned into a plasmid vector, pBR322. The plasmids containing the hyd genes (pSE-290 and pSE-111 carrying the hydA and hydB genes, respectively) were used to genetically map a total of 51 mutant strains with defects in hydrogenase activity. A total of 37 mutants carried a mutation in the hydB gene, whereas the remaining 14 hyd were hydA. This complementation analysis also established the presence of two new genes, so far unidentified, one coding for formate dehydrogenase-2more » (fdv) and another producing an electron transport protein (fhl) coupling formate dehydrogenase-2 to hydrogenase. Three of the four genes, hydB, fhl, and fdv, may constitute a single operon, and all three genes are carried by a 5.6-kilobase-pair chromosomal DNA insert in plasmid pSE-128. Plasmids carrying a part of this 5.6-kilobase-pair DNA (pSE-130) or fragments derived from this DNA in different orientations (pSE-126 and pSE-129) inhibited the production of active formate hydrogenlyase. This inhibition occurred even in a prototrophic E. coli, strain K-10, but only during an early induction period. These results, based on complementation analysis with cloned DNA fragments, show that both hydA and hydB genes are essential for the production of active hydrogenase. For the expression of active formate hydrogenlyase, two other gene products, fhl and fdv are also needed. All four genes map between 58 and 59 min in the E. coli chromosome.« less
Chandran, Anil Kumar Nalini; Lee, Gang-Seob; Yoo, Yo-Han; Yoon, Ung-Han; Ahn, Byung-Ohg; Yun, Doh-Won; Kim, Jin-Hyun; Choi, Hong-Kyu; An, GynHeung; Kim, Tae-Ho; Jung, Ki-Hong
2016-12-01
Rice is one of the most important food crops for humans. To improve the agronomical traits of rice, the functions of more than 1,000 rice genes have been recently characterized and summarized. The completed, map-based sequence of the rice genome has significantly accelerated the functional characterization of rice genes, but progress remains limited in assigning functions to all predicted non-transposable element (non-TE) genes, estimated to number 37,000-41,000. The International Rice Functional Genomics Consortium (IRFGC) has generated a huge number of gene-indexed mutants by using mutagens such as T-DNA, Tos17 and Ds/dSpm. These mutants have been identified by 246,566 flanking sequence tags (FSTs) and cover 65 % (25,275 of 38,869) of the non-TE genes in rice, while the mutation ratio of TE genes is 25.7 %. In addition, almost 80 % of highly expressed non-TE genes have insertion mutations, indicating that highly expressed genes in rice chromosomes are more likely to have mutations by mutagens such as T-DNA, Ds, dSpm and Tos17. The functions of around 2.5 % of rice genes have been characterized, and studies have mainly focused on transcriptional and post-transcriptional regulation. Slow progress in characterizing the function of rice genes is mainly due to a lack of clues to guide functional studies or functional redundancy. These limitations can be partially solved by a well-categorized functional classification of FST genes. To create this classification, we used the diverse overviews installed in the MapMan toolkit. Gene Ontology (GO) assignment to FST genes supplemented the limitation of MapMan overviews. The functions of 863 of 1,022 known genes can be evaluated by current FST lines, indicating that FST genes are useful resources for functional genomic studies. We assigned 16,169 out of 29,624 FST genes to 34 MapMan classes, including major three categories such as DNA, RNA and protein. To demonstrate the MapMan application on FST genes, transcriptome analysis was done from a rice mutant of 1-deoxy-D-xylulose 5-phosphate reductoisomerase (DXR) gene with FST. Mapping of 756 down-regulated genes in dxr mutants and their annotation in terms of various MapMan overviews revealed candidate genes downstream of DXR-mediating light signaling pathway in diverse functional classes such as the methyl-D-erythritol 4-phosphatepathway (MEP) pathway overview, photosynthesis, secondary metabolism and regulatory overview. This report provides a useful guide for systematic phenomics and further applications to enhance the key agronomic traits of rice.
Transcription forms and remodels supercoiling domains unfolding large-scale chromatin structures
Naughton, Catherine; Avlonitis, Nicolaos; Corless, Samuel; Prendergast, James G.; Mati, Ioulia K.; Eijk, Paul P.; Cockroft, Scott L.; Bradley, Mark; Ylstra, Bauke; Gilbert, Nick
2013-01-01
DNA supercoiling is an inherent consequence of twisting DNA and is critical for regulating gene expression and DNA replication. However, DNA supercoiling at a genomic scale in human cells is uncharacterized. To map supercoiling we used biotinylated-trimethylpsoralen as a DNA structure probe to show the genome is organized into supercoiling domains. Domains are formed and remodeled by RNA polymerase and topoisomerase activities and are flanked by GC-AT boundaries and CTCF binding sites. Under-wound domains are transcriptionally active, enriched in topoisomerase I, “open” chromatin fibers and DNaseI sites, but are depleted of topoisomerase II. Furthermore DNA supercoiling impacts on additional levels of chromatin compaction as under-wound domains are cytologically decondensed, topologically constrained, and decompacted by transcription of short RNAs. We suggest that supercoiling domains create a topological environment that facilitates gene activation providing an evolutionary purpose for clustering genes along chromosomes. PMID:23416946
Mapping of the 3q27 region involved in Dup(3q) syndrome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rizzu, P.; Baldini, A.; Overhauser, J.
1994-09-01
The duplication 3q syndrome is characterized by partial trisomy of a segment of the long arm of chromosome 3. We have previously found that 3q26.3-3q27 is the minimal region of trisomy overlap. This critical region (CR) is delimited by two patient chromosome breakpoints, approximately 10 cM apart. In order to identify the gene(s) responsible for the Dup(3q) phenotype, we are generating a physical map of the region and identifying expressed sequences. First, we have generated a cytological map using two- and three-color fluorescence in situ hybridization on metaphase and interphase chromosomes. Results allowed us to determine the centromere-telomere orientation, ordermore » and relative distances of six cosmid clones mapped to the CR. Because some of the markers used are part of the consensus chromosome 3 map, our data were easily integrated with existing mapping information. Subsequently, we have included in the map YAC clones positive for polymorphic PCR markers identified by CEPH-Genethon, as well as newly isolated YACs. We have assigned them to the critical region 7 of the Genethon polymorphic markers and linked them to three YAC contigs. Currently our map includes two of the five genes known to map in this region. Interestingly, we found that these two functionally related genes (kininogen and histidin-rich glycoprotein) map to the same 1 Mb genomic fragment. As the physical map is being constructed we are searching for expressed sequences. Positive cDNAs have been found and their characterization is in progress. In conclusion, we will present an integrated map of 3q27 that includes genetic, physical and cytological information as well as gene annotation. As Dup(3q) syndrome is likely to be a contiguous gene syndrome, such a map will be necessary for our understanding of this multiple congenital anomaly.« less
Hayes, C; Rump, A; Cadman, M R; Harrison, M; Evans, E P; Lyon, M F; Morriss-Kay, G M; Rosenthal, A; Brown, S D
2001-12-01
The mouse doublefoot (Dbf) mutant exhibits preaxial polydactyly in association with craniofacial defects. This mutation has previously been mapped to mouse chromosome 1. We have used a positional cloning strategy, coupled with a comparative sequencing approach using available human draft sequence, to identify putative candidates for the Dbf gene in the mouse and in homologous human region. We have constructed a high-resolution genetic map of the region, localizing the mutation to a 0.4-cM (+/-0.0061) interval on mouse chromosome 1. Furthermore, we have constructed contiguous BAC/PAC clone maps across the mouse and human Dbf region. Using existing markers and additional sequence tagged sites, which we have generated, we have anchored the physical map to the genetic map. Through the comparative sequencing of these clones we have identified 35 genes within this interval, indicating that the region is gene-rich. From this we have identified several genes that are known to be differentially expressed in the developing mid-gestation mouse embryo, some in the developing embryonic limb buds. These genes include those encoding known developmental signaling molecules such as WNT proteins and IHH, and we provide evidence that these genes are candidates for the Dbf mutation.
Yamada, Takahisa; Muramatsu, Youji; Taniguchi, Yukio; Sasaki, Yoshiyuki
Our previous study detected 291 and 77 genes showing early embryonic death-associated elevation and reduction of expression, respectively, in the fetal placenta of the cow carrying somatic nuclear transfer-derived cloned embryo. In this study, we mapped the 10 genes showing the elevation and the 10 genes doing the reduction most significantly, using somatic cell hybrid and bovine draft genome sequence. We then compared the mapped positions for these genes with the genomic locations of bovine quantitative trait loci for still-birth and/or abortion. Among the mapped genes, peptidylglycine alpha-amidating monooxygenase (PAM), spectrin, beta, nonerythrocytic 1 (SPTBNI), and an unknown novel gene containing AU277832 expressed sequence tag were intriguing, in that the mapped positions were consistent with the genomic locations of bovine still-birth and/or abortion quantitative trait loci, and thus identified as positional candidates for bovine placental genes responsible for the early embryonic death during the pregnancy attempted by somatic nuclear transfer-derived cloning.
Joslin, A C; Green, R; German, J B; Lange, M C
2014-09-01
Advances in the development of bioinformatic tools continue to improve investigators' ability to interrogate, organize, and derive knowledge from large amounts of heterogeneous information. These tools often require advanced technical skills not possessed by life scientists. User-friendly, low-barrier-to-entry methods of visualizing nutrigenomics information are yet to be developed. We utilized concept mapping software from the Institute for Human and Machine Cognition to create a conceptual model of diet and health-related data that provides a foundation for future nutrigenomics ontologies describing published nutrient-gene/polymorphism-phenotype data. In this model, maps containing phenotype, nutrient, gene product, and genetic polymorphism interactions are visualized as triples of two concepts linked together by a linking phrase. These triples, or "knowledge propositions," contextualize aggregated data and information into easy-to-read knowledge maps. Maps of these triples enable visualization of genes spanning the One-Carbon Metabolism (OCM) pathway, their sequence variants, and multiple literature-mined associations including concepts relevant to nutrition, phenotypes, and health. The concept map development process documents the incongruity of information derived from pathway databases versus literature resources. This conceptual model highlights the importance of incorporating information about genes in upstream pathways that provide substrates, as well as downstream pathways that utilize products of the pathway under investigation, in this case OCM. Other genes and their polymorphisms, such as TCN2 and FUT2, although not directly involved in OCM, potentially alter OCM pathway functionality. These upstream gene products regulate substrates such as B12. Constellations of polymorphisms affecting the functionality of genes along OCM, together with substrate and cofactor availability, may impact resultant phenotypes. These conceptual maps provide a foundational framework for development of nutrient-gene/polymorphism-phenotype ontologies and systems visualization.
Fajardo-Ortiz, David; Duran, Luis; Moreno, Laura; Ochoa, Hector; Castaño, Victor M
2014-09-03
We explored how the knowledge translation and innovation processes are structured when theyresult in innovations, as in the case of liposomal doxorubicin research. In order to map the processes, a literature network analysis was made through Cytoscape and semantic analysis was performed by GOPubmed which is based in the controlled vocabularies MeSH (Medical Subject Headings) and GO (Gene Ontology). We found clusters related to different stages of the technological development (invention, innovation and imitation) and the knowledge translation process (preclinical, translational and clinical research), and we were able to map the historic emergence of Doxil as a paradigmatic nanodrug. This research could be a powerful methodological tool for decision-making and innovation management in drug delivery research.
Qiu, Y C; Zhou, R H; Kong, X Y; Zhang, S S; Jia, J Z
2005-11-01
A powdery mildew resistance gene from Triticum urartu Tum. accession UR206 was successfully transferred into hexaploid wheat (Triticum aestivum L.) through crossing and backcrossing. The F1 plants, which had 28 chromosomes and an average of 5.32 bivalents and 17.36 univalents in meiotic pollen mother cells (PMC), were obtained through embryos rescued owing to shriveling of endosperm in hybrid seed of cross Chinese Spring (CS) x UR206. Hybrid seeds were produced through backcrossing F1 with common wheat parents. The derivative lines had normal chromosome numbers and powdery mildew resistance similar to the donor UR206, indicating that the powdery mildew resistance gene originating from T. urartu accession UR206 was successfully transferred and expressed in a hexaploid wheat background. Genetic analysis indicated that a single dominant gene controlled the powdery mildew resistance at the seedling stage. To map and tag the powdery mildew resistance gene, 143 F2 individuals derived from a cross UR206 x UR203 were used to construct a linkage map. The resistant gene was mapped on the chromosome 7AL based on the mapped microsatellite makers. The map spanned 52.1 cM and the order of these microsatellite loci agreed well with the established microsatellite map of chromosome arm 7AL. The resistance gene was flanked by the microsatellite loci Xwmc273 and Xpsp3003, with the genetic distances of 2.2 cM and 3.8 cM, respectively. On the basis of the origin and chromosomal location of the gene, it was temporarily designated PmU.
Saxena, Maneesha S.; Bajaj, Deepak; Das, Shouvik; Kujur, Alice; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2014-01-01
The identification and fine mapping of robust quantitative trait loci (QTLs)/genes governing important agro-morphological traits in chickpea still lacks systematic efforts at a genome-wide scale involving wild Cicer accessions. In this context, an 834 simple sequence repeat and single-nucleotide polymorphism marker-based high-density genetic linkage map between cultivated and wild parental accessions (Cicer arietinum desi cv. ICC 4958 and Cicer reticulatum wild cv. ICC 17160) was constructed. This inter-specific genetic map comprising eight linkage groups spanned a map length of 949.4 cM with an average inter-marker distance of 1.14 cM. Eleven novel major genomic regions harbouring 15 robust QTLs (15.6–39.8% R2 at 4.2–15.7 logarithm of odds) associated with four agro-morphological traits (100-seed weight, pod and branch number/plant and plant hairiness) were identified and mapped on chickpea chromosomes. Most of these QTLs showed positive additive gene effects with effective allelic contribution from ICC 4958, particularly for increasing seed weight (SW) and pod and branch number. One robust SW-influencing major QTL region (qSW4.2) has been narrowed down by combining QTL mapping with high-resolution QTL region-specific association analysis, differential expression profiling and gene haplotype-based association/LD mapping. This enabled to delineate a strong SW-regulating ABI3VP1 transcription factor (TF) gene at trait-specific QTL interval and consequently identified favourable natural allelic variants and superior high seed weight-specific haplotypes in the upstream regulatory region of this gene showing increased transcript expression during seed development. The genes (TFs) harbouring diverse trait-regulating QTLs, once validated and fine-mapped by our developed rapid integrated genomic approach and through gene/QTL map-based cloning, can be utilized as potential candidates for marker-assisted genetic enhancement of chickpea. PMID:25335477
Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger
Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J
2009-01-01
Background Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). Results 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. Conclusion This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method. PMID:19193216
DOE Office of Scientific and Technical Information (OSTI.GOV)
Machlin, S.M.; Hanson, R.S.
The nucleotide sequence of a cloned 2.5-kilobase-pair SmaI fragment containing the methanol dehydrogenase (MDH) structural gene from Methylobacterium organophilum XX was determined. A single open reading frame with a coding capacity of 626 amino acids (molecular weight, 66,000) was identified on one stand, and N-terminal sequencing of purified MDH revealed that 27 of these residues constituted a putative signal peptide. Primer extension mapping of in vivo transcripts indicated that the start of mRNA synthesis was 160 to 170 base pairs upstream of the ATG codon. Northern (RNA) blot analysis further demonstrated that the transcript was 2.1 kilobase pairs in lengthmore » and therefore appeared to encode only MDH.« less
Canine RD3 mutation establishes rod cone dysplasia type 2 (rcd2) as ortholog of human and murine rd3
Kukekova, Anna V.; Goldstein, Orly; Johnson, Jennifer L.; Richardson, Malcolm A.; Pearce-Kelling, Susan E.; Swaroop, Anand; Friedman, James S.; Aguirre, Gustavo D.; Acland, Gregory M.
2009-01-01
Rod cone dysplasia type 2 (rcd2) is an autosomal recessive disorder that segregates in collie dogs. Linkage disequilibrium and meiotic linkage mapping were combined to take advantage of population structure within this breed, and to fine map rcd2 to a 230 kb candidate region that included the gene C1orf36 responsible for human and murine rd3, and within which all affected dogs were homozygous for one haplotype. In one of three identified canine retinal RD3 splice variants, an insertion was found that cosegregates with rcd2, and is predicted to alter the last 61 codons of the normal open reading frame and further extend the ORF. Thus combined meiotic linkage and LD mapping within a single canine breed can yield critical reduction of the disease interval when appropriate advantage is taken of within breed population structure. This should permit a similar approach to tackle other hereditary traits that segregate in single closed populations. PMID:19130129
Blenda, Anna; Fang, David D.; Rami, Jean-François; Garsmeur, Olivier; Luo, Feng; Lacape, Jean-Marc
2012-01-01
A consensus genetic map of tetraploid cotton was constructed using six high-density maps and after the integration of a sequence-based marker redundancy check. Public cotton SSR libraries (17,343 markers) were curated for sequence redundancy using 90% as a similarity cutoff. As a result, 20% of the markers (3,410) could be considered as redundant with some other markers. The marker redundancy information had been a crucial part of the map integration process, in which the six most informative interspecific Gossypium hirsutum×G. barbadense genetic maps were used for assembling a high density consensus (HDC) map for tetraploid cotton. With redundant markers being removed, the HDC map could be constructed thanks to the sufficient number of collinear non-redundant markers in common between the component maps. The HDC map consists of 8,254 loci, originating from 6,669 markers, and spans 4,070 cM, with an average of 2 loci per cM. The HDC map presents a high rate of locus duplications, as 1,292 markers among the 6,669 were mapped in more than one locus. Two thirds of the duplications are bridging homoeologous AT and DT chromosomes constitutive of allopolyploid cotton genome, with an average of 64 duplications per AT/DT chromosome pair. Sequences of 4,744 mapped markers were used for a mutual blast alignment (BBMH) with the 13 major scaffolds of the recently released Gossypium raimondii genome indicating high level of homology between the diploid D genome and the tetraploid cotton genetic map, with only a few minor possible structural rearrangements. Overall, the HDC map will serve as a valuable resource for trait QTL comparative mapping, map-based cloning of important genes, and better understanding of the genome structure and evolution of tetraploid cotton. PMID:23029214
Zhang, Shu-Dong; Gant, Timothy W
2009-07-31
Connectivity mapping is a process to recognize novel pharmacological and toxicological properties in small molecules by comparing their gene expression signatures with others in a database. A simple and robust method for connectivity mapping with increased specificity and sensitivity was recently developed, and its utility demonstrated using experimentally derived gene signatures. This paper introduces sscMap (statistically significant connections' map), a Java application designed to undertake connectivity mapping tasks using the recently published method. The software is bundled with a default collection of reference gene-expression profiles based on the publicly available dataset from the Broad Institute Connectivity Map 02, which includes data from over 7000 Affymetrix microarrays, for over 1000 small-molecule compounds, and 6100 treatment instances in 5 human cell lines. In addition, the application allows users to add their custom collections of reference profiles and is applicable to a wide range of other 'omics technologies. The utility of sscMap is two fold. First, it serves to make statistically significant connections between a user-supplied gene signature and the 6100 core reference profiles based on the Broad Institute expanded dataset. Second, it allows users to apply the same improved method to custom-built reference profiles which can be added to the database for future referencing. The software can be freely downloaded from http://purl.oclc.org/NET/sscMap.
Sargent, D J; Rys, A; Nier, S; Simpson, D W; Tobutt, K R
2007-01-01
We have developed 46 primer pairs from exon sequences flanking polymorphic introns of 23 Fragaria gene sequences and one Malus sequence deposited in the EMBL database. Sequencing of a set of the PCR products amplified with the novel primer pairs in diploid Fragaria showed the products to be homologous to the sequences from which the primers were originally designed. By scoring the segregation of the 24 genes in two diploid Fragaria progenies FV x FN (F. vesca x F. nubicola F(2)) and 815 x 903BC (F. vesca x F. viridis BC(1)) 29 genetic loci at discrete positions on the seven linkage groups previously characterised could be mapped, bringing to 35 the total number of known function genes mapped in Fragaria. Twenty primer pairs, representing 14 genes, amplified a product of the expected size in both Malus and Prunus. To demonstrate the applicability of these gene-specific loci to comparative mapping in Rosaceae, five markers that displayed clear polymorphism between the parents of a Malus and a Prunus mapping population were selected. The markers were then scored and mapped in at least one of the two additional progenies.
Basler, Tina; Jeckstadt, Sabine; Valentin-Weigand, Peter; Goethe, Ralph
2006-03-01
Mycobacterium avium subspecies paratuberculosis (MAP) causes a chronic enteritis in ruminants. In addition, MAP is presently the most favored pathogen linked to Crohn's disease. In this study, we were interested in dissecting the molecular mechanisms of macrophage activation or deactivation after infection with MAP. By subtractive hybridization of cDNAs, we identified the immune-responsive gene 1 (IRG1), which was expressed substantially higher in lipopolysaccharide (LPS)-stimulated than in MAP-infected murine macrophage cell lines. A nuclear run-on transcription assay revealed that the IRG1 gene was activated transcriptionally in LPS-stimulated and MAP-infected macrophages with higher expression in LPS-stimulated cells. Analysis of post-transcriptional regulation demonstrated that IRG1 mRNA stability was increased in LPS-stimulated but not in MAP-infected macrophages. Furthermore, IRG1 gene expression of macrophages infected with the nonpathogenic Mycobacterium smegmatis differed from those of LPS-stimulated and MAP-infected macrophages. At 2 h postinfection, M. smegmatis-induced IRG1 gene expression was as low as in MAP-infected, and 8 h postinfection, it increased nearly to the level in LPS-stimulated macrophages. Transient transfection experiments revealed similar IRG1 promoter activities in MAP- and M. smegmatis-infected cells. Northern analysis demonstrated increased IRG1 mRNA stability in M. smegmatis-infected macrophages. IRG1 mRNA stabilization was p38 mitogen-activated protein kinase-independent. Inhibition of protein synthesis revealed that constitutively expressed factors seemed to be responsible for IRG1 mRNA destabilization. Thus, our data demonstrate that transcriptional and post-transcriptional mechanisms are responsible for a differential IRG1 gene expression in murine macrophages treated with LPS, MAP, and M. smegmatis.
Cell-cycle dynamics of chromosomal organisation at single-cell resolution
Nagano, Takashi; Lubling, Yaniv; Várnai, Csilla; Dudley, Carmel; Leung, Wing; Baran, Yael; Mendelson-Cohen, Netta; Wingett, Steven; Fraser, Peter; Tanay, Amos
2017-01-01
Summary Chromosomes in proliferating metazoan cells undergo dramatic structural metamorphoses every cell cycle, alternating between highly condensed mitotic structures facilitating chromosome segregation, and decondensed interphase structures accommodating transcription, gene silencing and DNA replication. Here we use single-cell Hi-C to study chromosome conformations in thousands of individual cells, and discover a continuum of cis-interaction profiles that finely position individual cells along the cell cycle. We show that chromosomal compartments, topological associated domains (TADs), contact insulation and long-range loops, all defined by bulk Hi-C maps, are governed by distinct cell-cycle dynamics. In particular, DNA replication correlates with build-up of compartments and reduction in TAD insulation, while loops are generally stable from G1 through S and G2. Whole-genome 3D structural models reveal a radial architecture of chromosomal compartments with distinct epigenomic signatures. Our single-cell data thereby allow for re-interpretation of chromosome conformation maps through the prism of the cell cycle. PMID:28682332
Harnessing cell-to-cell variations to probe bacterial structure and biophysics
NASA Astrophysics Data System (ADS)
Cass, Julie A.
Advances in microscopy and biotechnology have given us novel insights into cellular biology and physics. While bacteria were long considered to be relatively unstructured, the development of fluorescence microscopy techniques, and spatially and temporally resolved high-throughput quantitative studies, have uncovered that the bacterial cell is highly organized, and its structure rigorously maintained. In this thesis I will describe our gateTool software, designed to harness cell-to-cell variations to probe bacterial structure, and discuss two exciting aspects of structure that we have employed gateTool to investigate: (i) chromosome organization and the cellular mechanisms for controlling DNA dynamics, and (ii) the study of cell wall synthesis, and how the genes in the synthesis pathway impact cellular shape. In the first project, we develop a spatial and temporal mapping of cell-cycle-dependent chromosomal organization, and use this quantitative map to discover that chromosomal loci segregate from midcell with universal dynamics. In the second project, I describe preliminary time- lapse and snapshot imaging analysis suggesting phentoypical coherence across peptidoglycan synthesis pathways.
Millot, Benjamin; Montoliu, Lluís; Fontaine, Marie-Louise; Mata, Teresa; Devinoy, Eve
2003-01-01
The upstream regulatory regions of the mouse and rabbit whey acidic protein (WAP) genes have been used extensively to target the efficient expression of foreign genes into the mammary gland of transgenic animals. Therefore both regions have been studied to elucidate fully the mechanisms controlling WAP gene expression. Three DNase I-hypersensitive sites (HSS0, HSS1 and HSS2) have been described upstream of the rabbit WAP gene in the lactating mammary gland and correspond to important regulatory regions. These sites are surrounded by variable chromatin structures during mammary-gland development. In the present study, we describe the upstream sequence of the mouse WAP gene. Analysis of genomic sequences shows that the mouse WAP gene is situated between two widely expressed genes (Cpr2 and Ramp3). We show that the hypersensitive sites found upstream of the rabbit WAP gene are also detected in the mouse WAP gene. Further, they encompass functional signal transducer and activator of transcription 5-binding sites, as has been observed in the rabbit. A new hypersensitive site (HSS3), not specific to the mammary gland, was mapped 8 kb upstream of the rabbit WAP gene. Unlike the three HSSs described above, HSS3 is also detected in the liver, but similar to HSS1, it does not depend on lactogenic hormone treatments during cell culture. The region surrounding HSS3 encompasses a potential matrix attachment region, which is also conserved upstream of the mouse WAP gene and contains a functional transcription factor Ets-1 (E26 transformation-specific-1)-binding site. Finally, we demonstrate for the first time that variations in the chromatin structure are dependent on prolactin alone. PMID:12580766
Molecular cloning, structure, and chromosomal localization of the mouse LIM/homeobox gene Lhx5
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bertuzzi, S.; Sheng, Hui Z.; Westphal, H.
1996-09-01
Lhx5, the mouse ortholog of the Xenopus Xlim-5, is a LIM/homeobox gene expressed in the central nervous system during both embryonic development and adulthood. During development its domain of expression is mainly localized at the most anterior portion of the neural tube, and it precedes the morphological differentiation of the forebrain; for this reason we believe that Lhx5 could play an important role in forebrain patterning. Here we present the structural organization and the chromosomal localization of the Lhx5 gene. The gene is composed of five exons spanning more than 10 kb of genomic sequence. The first and second LIMmore » domains are encoded by the first and second exon, while the codons of the homeobox are split between the third and the fourth exons. The structure of Lhx5 is similar to that of other LIM/homeodomain proteins, Lxh1/lim1 and Lhx3/lim3, but differs from that of other LIM genes, such as mec3 and LMO1/Rbtn1, in which the codons for the LIM domains are interrupted by introns. We have mapped Lhx5 to the central region of mouse chromosome 5. 38 refs., 4 figs.« less
Nelson, Justin; Simpkins, Scott W; Safizadeh, Hamid; Li, Sheena C; Piotrowski, Jeff S; Hirano, Hiroyuki; Yashiroda, Yoko; Osada, Hiroyuki; Yoshida, Minoru; Boone, Charles; Myers, Chad L
2018-04-01
Chemical-genomic approaches that map interactions between small molecules and genetic perturbations offer a promising strategy for functional annotation of uncharacterized bioactive compounds. We recently developed a new high-throughput platform for mapping chemical-genetic (CG) interactions in yeast that can be scaled to screen large compound collections, and we applied this system to generate CG interaction profiles for more than 13 000 compounds. When integrated with the existing global yeast genetic interaction network, CG interaction profiles can enable mode-of-action prediction for previously uncharacterized compounds as well as discover unexpected secondary effects for known drugs. To facilitate future analysis of these valuable data, we developed a public database and web interface named MOSAIC. The website provides a convenient interface for querying compounds, bioprocesses (Gene Ontology terms) and genes for CG information including direct CG interactions, bioprocesses and gene-level target predictions. MOSAIC also provides access to chemical structure information of screened molecules, chemical-genomic profiles and the ability to search for compounds sharing structural and functional similarity. This resource will be of interest to chemical biologists for discovering new small molecule probes with specific modes-of-action as well as computational biologists interested in analysing CG interaction networks. MOSAIC is available at http://mosaic.cs.umn.edu. hisyo@riken.jp, yoshidam@riken.jp, charlie.boone@utoronto.ca or chadm@umn.edu. Supplementary data are available at Bioinformatics online.
2012-01-01
Background High-density linkage maps facilitate the mapping of target genes and the construction of partial linkage maps around target loci to develop markers for marker-assisted selection (MAS). MAS is quite challenging in conifers because of their large, complex, and poorly-characterized genomes. Our goal was to construct a high-density linkage map to facilitate the identification of markers that are tightly linked to a major recessive male-sterile gene (ms1) for MAS in C. japonica, a species that is important in Japanese afforestation but which causes serious social pollinosis problems. Results We constructed a high-density saturated genetic linkage map for C. japonica using expressed sequence-derived co-dominant single nucleotide polymorphism (SNP) markers, most of which were genotyped using the GoldenGate genotyping assay. A total of 1261 markers were assigned to 11 linkage groups with an observed map length of 1405.2 cM and a mean distance between two adjacent markers of 1.1 cM; the number of linkage groups matched the basic chromosome number in C. japonica. Using this map, we located ms1 on the 9th linkage group and constructed a partial linkage map around the ms1 locus. This enabled us to identify a marker (hrmSNP970_sf) that is closely linked to the ms1 gene, being separated from it by only 0.5 cM. Conclusions Using the high-density map, we located the ms1 gene on the 9th linkage group and constructed a partial linkage map around the ms1 locus. The map distance between the ms1 gene and the tightly linked marker was only 0.5 cM. The identification of markers that are tightly linked to the ms1 gene will facilitate the early selection of male-sterile trees, which should expedite C. japonica breeding programs aimed at alleviating pollinosis problems without harming productivity. PMID:22424262
2012-01-01
Background Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. Results To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. Conclusions Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants. PMID:23102090
Pavy, Nathalie; Pelgas, Betty; Laroche, Jérôme; Rigault, Philippe; Isabel, Nathalie; Bousquet, Jean
2012-10-26
Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants.
A cis-Regulatory Mutation of PDSS2 Causes Silky-Feather in Chickens
Feng, Chungang; Gao, Yu; Dorshorst, Ben; Song, Chi; Gu, Xiaorong; Li, Qingyuan; Li, Jinxiu; Liu, Tongxin; Rubin, Carl-Johan; Zhao, Yiqiang; Wang, Yanqiang; Fei, Jing; Li, Huifang; Chen, Kuanwei; Qu, Hao; Shu, Dingming; Ashwell, Chris; Da, Yang; Andersson, Leif; Hu, Xiaoxiang; Li, Ning
2014-01-01
Silky-feather has been selected and fixed in some breeds due to its unique appearance. This phenotype is caused by a single recessive gene (hookless, h). Here we map the silky-feather locus to chromosome 3 by linkage analysis and subsequently fine-map it to an 18.9 kb interval using the identical by descent (IBD) method. Further analysis reveals that a C to G transversion located upstream of the prenyl (decaprenyl) diphosphate synthase, subunit 2 (PDSS2) gene is causing silky-feather. All silky-feather birds are homozygous for the G allele. The silky-feather mutation significantly decreases the expression of PDSS2 during feather development in vivo. Consistent with the regulatory effect, the C to G transversion is shown to remarkably reduce PDSS2 promoter activity in vitro. We report a new example of feather structure variation associated with a spontaneous mutation and provide new insight into the PDSS2 function. PMID:25166907
Korinsak, Siripar; Tangphatsornruang, Sithichoke; Pootakham, Wirulda; Wanchana, Samart; Plabpla, Anucha; Jantasuriyarat, Chatchawan; Patarapuwadol, Sujin; Vanavichit, Apichart; Toojinda, Theerayut
2018-05-15
Magnaporthe oryzae is a fungal pathogen causing blast disease in many plant species. In this study, seventy three isolates of M. oryzae collected from rice (Oryza sativa) in 1996-2014 were genotyped using a genotyping-by-sequencing approach to detect genetic variation. An association study was performed to identify single nucleotide polymorphisms (SNPs) associated with virulence genes using 831 selected SNP and infection phenotypes on local and improved rice varieties. Population structure analysis revealed eight subpopulations. The division into eight groups was not related to the degree of virulence. Association mapping showed five SNPs associated with fungal virulence on chromosome 1, 2, 3, 4 and 7. The SNP on chromosome 1 was associated with virulence against RD6-Pi7 and IRBL7-M which might be linked to the previously reported AvrPi7. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kawagoe, Kazuyoshi; Takeda, Junji; Kinoshita, Taroh
Many membrane proteins are anchored to the cell membrane by glycosylphosphatidylinositol (GPI). The core structure and biosynthesis of the GPI anchor are well conserved in eukaryote cells. We previously cloned a human PIGA gene that participates in GPI anchor biosynthesis. We have now cloned complementary and genomic DNA of Pig-a, the murine homologue of PIGA, and compared its function and gene structure with those of PIGA. The deduced amino acid sequence of mouse PIG-A is 88% identical with that of human PIG-A. Transfection of Pig-a cDNA complemented the defects of both a PIG-A-deficient murine cell line and a PIG-A-deficient humanmore » cell line, demonstrating that functions of mouse and human PIG-A are conserved. Like human PIGA, the chromosomal Pig-a gene has six exons and spans approximately 16 kb. Moreover, Pig-a was mapped to X-F3/4, which is syntenic to human Xp22.1, where PIGA is located. Thus, murine Pig-a provides a good animal model to study paroxysmal nocturnal hemoglobinuria, a disease caused by a somatic mutation of PIGA. Database analysis demonstrated that a yeast gene, SPT14, is homologous to Pig-a and PIGA and that these genes are members of a glycosyltransferase gene family.« less
Temperature-responsive in vitro RNA structurome of Yersinia pseudotuberculosis.
Righetti, Francesco; Nuss, Aaron M; Twittenhoff, Christian; Beele, Sascha; Urban, Kristina; Will, Sebastian; Bernhart, Stephan H; Stadler, Peter F; Dersch, Petra; Narberhaus, Franz
2016-06-28
RNA structures are fundamentally important for RNA function. Dynamic, condition-dependent structural changes are able to modulate gene expression as shown for riboswitches and RNA thermometers. By parallel analysis of RNA structures, we mapped the RNA structurome of Yersinia pseudotuberculosis at three different temperatures. This human pathogen is exquisitely responsive to host body temperature (37 °C), which induces a major metabolic transition. Our analysis profiles the structure of more than 1,750 RNAs at 25 °C, 37 °C, and 42 °C. Average mRNAs tend to be unstructured around the ribosome binding site. We searched for 5'-UTRs that are folded at low temperature and identified novel thermoresponsive RNA structures from diverse gene categories. The regulatory potential of 16 candidates was validated. In summary, we present a dynamic bacterial RNA structurome and find that the expression of virulence-relevant functions in Y. pseudotuberculosis and reprogramming of its metabolism in response to temperature is associated with a restructuring of numerous mRNAs.
Adaptation of video game UVW mapping to 3D visualization of gene expression patterns
NASA Astrophysics Data System (ADS)
Vize, Peter D.; Gerth, Victor E.
2007-01-01
Analysis of gene expression patterns within an organism plays a critical role in associating genes with biological processes in both health and disease. During embryonic development the analysis and comparison of different gene expression patterns allows biologists to identify candidate genes that may regulate the formation of normal tissues and organs and to search for genes associated with congenital diseases. No two individual embryos, or organs, are exactly the same shape or size so comparing spatial gene expression in one embryo to that in another is difficult. We will present our efforts in comparing gene expression data collected using both volumetric and projection approaches. Volumetric data is highly accurate but difficult to process and compare. Projection methods use UV mapping to align texture maps to standardized spatial frameworks. This approach is less accurate but is very rapid and requires very little processing. We have built a database of over 180 3D models depicting gene expression patterns mapped onto the surface of spline based embryo models. Gene expression data in different models can easily be compared to determine common regions of activity. Visualization software, both Java and OpenGL optimized for viewing 3D gene expression data will also be demonstrated.
The molecular architecture of human N-acetylgalactosamine kinase.
Thoden, James B; Holden, Hazel M
2005-09-23
Galactokinase plays a key role in normal galactose metabolism by catalyzing the conversion of alpha-d-galactose to galactose 1-phosphate. Within recent years, the three-dimensional structures of human galactokinase and two bacterial forms of the enzyme have been determined. Originally, the gene encoding galactokinase in humans was mapped to chromosome 17. An additional gene, encoding a protein with sequence similarity to galactokinase, was subsequently mapped to chromosome 15. Recent reports have shown that this second gene (GALK2) encodes an enzyme with greater activity against GalNAc than galactose. This enzyme, GalNAc kinase, has been implicated in a salvage pathway for the reutilization of free GalNAc derived from the degradation of complex carbohydrates. Here we report the first structural analysis of a GalNAc kinase. The structure of the human enzyme was solved in the presence of MnAMPPNP and GalNAc or MgATP and GalNAc (which resulted in bound products in the active site). The enzyme displays a distinctly bilobal appearance with its active site wedged between the two domains. The N-terminal region is dominated by a seven-stranded mixed beta-sheet, whereas the C-terminal motif contains two layers of anti-parallel beta-sheet. The overall topology displayed by GalNAc kinase places it into the GHMP superfamily of enzymes, which generally function as small molecule kinases. From this investigation, the geometry of the GalNAc kinase active site before and after catalysis has been revealed, and the determinants of substrate specificity have been defined on a molecular level.
Eckelt, Elke; Jarek, Michael; Frömke, Cornelia; Meens, Jochen; Goethe, Ralph
2014-12-06
Maintenance of metal homeostasis is crucial in bacterial pathogenicity as metal starvation is the most important mechanism in the nutritional immunity strategy of host cells. Thus, pathogenic bacteria have evolved sensitive metal scavenging systems to overcome this particular host defence mechanism. The ruminant pathogen Mycobacterium avium ssp. paratuberculosis (MAP) displays a unique gut tropism and causes a chronic progressive intestinal inflammation. MAP possesses eight conserved lineage specific large sequence polymorphisms (LSP), which distinguish MAP from its ancestral M. avium ssp. hominissuis or other M. avium subspecies. LSP14 and LSP15 harbour many genes proposed to be involved in metal homeostasis and have been suggested to substitute for a MAP specific, impaired mycobactin synthesis. In the present study, we found that a LSP14 located putative IrtAB-like iron transporter encoded by mptABC was induced by zinc but not by iron starvation. Heterologous reporter gene assays with the lacZ gene under control of the mptABC promoter in M. smegmatis (MSMEG) and in a MSMEG∆furB deletion mutant revealed a zinc dependent, metalloregulator FurB mediated expression of mptABC via a conserved mycobacterial FurB recognition site. Deep sequencing of RNA from MAP cultures treated with the zinc chelator TPEN revealed that 70 genes responded to zinc limitation. Remarkably, 45 of these genes were located on a large genomic island of approximately 90 kb which harboured LSP14 and LSP15. Thirty-five of these genes were predicted to be controlled by FurB, due to the presence of putative binding sites. This clustering of zinc responsive genes was exclusively found in MAP and not in other mycobacteria. Our data revealed a particular genomic signature for MAP given by a unique zinc specific locus, thereby suggesting an exceptional relevance of zinc for the metabolism of MAP. MAP seems to be well adapted to maintain zinc homeostasis which might contribute to the peculiarity of MAP pathogenicity.
Bajaj, Deepak; Das, Shouvik; Badoni, Saurabh; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We identified 82489 high-quality genome-wide SNPs from 93 wild and cultivated Cicer accessions through integrated reference genome- and de novo-based GBS assays. High intra- and inter-specific polymorphic potential (66–85%) and broader natural allelic diversity (6–64%) detected by genome-wide SNPs among accessions signify their efficacy for monitoring introgression and transferring target trait-regulating genomic (gene) regions/allelic variants from wild to cultivated Cicer gene pools for genetic improvement. The population-specific assignment of wild Cicer accessions pertaining to the primary gene pool are more influenced by geographical origin/phenotypic characteristics than species/gene-pools of origination. The functional significance of allelic variants (non-synonymous and regulatory SNPs) scanned from transcription factors and stress-responsive genes in differentiating wild accessions (with potential known sources of yield-contributing and stress tolerance traits) from cultivated desi and kabuli accessions, fine-mapping/map-based cloning of QTLs and determination of LD patterns across wild and cultivated gene-pools are suitably elucidated. The correlation between phenotypic (agromorphological traits) and molecular diversity-based admixed domestication patterns within six structured populations of wild and cultivated accessions via genome-wide SNPs was apparent. This suggests utility of whole genome SNPs as a potential resource for identifying naturally selected trait-regulating genomic targets/functional allelic variants adaptive to diverse agroclimatic regions for genetic enhancement of cultivated gene-pools. PMID:26208313
Consistency of gene starts among Burkholderia genomes
2011-01-01
Background Evolutionary divergence in the position of the translational start site among orthologous genes can have significant functional impacts. Divergence can alter the translation rate, degradation rate, subcellular location, and function of the encoded proteins. Results Existing Genbank gene maps for Burkholderia genomes suggest that extensive divergence has occurred--53% of ortholog sets based on Genbank gene maps had inconsistent gene start sites. However, most of these inconsistencies appear to be gene-calling errors. Evolutionary divergence was the most plausible explanation for only 17% of the ortholog sets. Correcting probable errors in the Genbank gene maps decreased the percentage of ortholog sets with inconsistent starts by 68%, increased the percentage of ortholog sets with extractable upstream intergenic regions by 32%, increased the sequence similarity of intergenic regions and predicted proteins, and increased the number of proteins with identifiable signal peptides. Conclusions Our findings highlight an emerging problem in comparative genomics: single-digit percent errors in gene predictions can lead to double-digit percentages of inconsistent ortholog sets. The work demonstrates a simple approach to evaluate and improve the quality of gene maps. PMID:21342528
DOE Office of Scientific and Technical Information (OSTI.GOV)
Suzuki, Kazuo; Yasunami, Michio; Matsuda, Yoichi
1996-09-01
Embryonic TEA domain-containing factor (ETF) belongs to the family of proteins structurally related to transcriptional enhancer factor-1 (TEF-1) and is implicated in neural development. Isolation and characterization of the cosmid clones encoding the mouse ETF gene (Etdf) revealed that Etdf spans approximately 17.9 kb and consists of 12 exons. The exon-intron structure of Etdf closely resembles that of the Drosophila scalloped gene, indicating that these genes may have evolved from a common ancestor. Then multiple transcription initiation sites revealed by S1 protection and primer extension analyses are consistent with the absence of the canonical TATA and CAAT boxes in themore » 5{prime}-flanking region, which contains many potential regulatory sequences, such as the E-box, N-box, Sp1 element, GATA-1 element, TAATGARAT element, and B2 short interspersed element (SINE) as well as several direct and inverted repeat sequences. The Etdf locus was assigned to the proximal region of mouse chromosome 7 using fluorescence in situ hybridization and linkage mapping analyses. These results provide the molecular basis for studying the regulation, in vivo function, and evolution of Etdf. 29 refs., 5 figs., 1 tab.« less
Suzuki, K; Yasunami, M; Matsuda, Y; Maeda, T; Kobayashi, H; Terasaki, H; Ohkubo, H
1996-09-01
Embryonic TEA domain-containing factor (ETF) belongs to the family of proteins structurally related to transcriptional enhancer factor-1 (TEF-1) and is implicated in neural development. Isolation and characterization of the cosmid clones encoding the mouse ETF gene (Etdf) revealed that Etdf spans approximately 17.9 kb and consists of 12 exons. The exon-intron structure of Etdf closely resembles that of the Drosophila scalloped gene, indicating that these genes may have evolved from a common ancestor. The multiple transcription initiation sites revealed by S1 protection and primer extension analyses are consistent with the absence of the canonical TATA and CAAT boxes in the 5'-flanking region, which contains many potential regulatory sequences, such as the E-box, N-box, Sp1 element, GATA-1 element, TAATGARAT element, and B2 short interspersed element (SINE) as well as several direct and inverted repeat sequences. The Etdf locus was assigned to the proximal region of mouse chromosome 7 using fluorescence in situ hybridization and linkage mapping analyses. These results provide the molecular basis for studying the regulation, in vivo function, and evolution of Etdf.
John, Anulekha Mary; C, George Priya Doss; Ebenazer, Andrew; Seshadri, Mandalam Subramaniam; Nair, Aravindan; Rajaratnam, Simon; Pai, Rekha
2013-01-01
Various missense mutations in the VHL gene have been reported among patients with familial bilateral pheochromocytoma. However, the p.Arg82Leu mutation in the VHL gene described here among patients with familial bilateral pheochromocytoma, has never been reported previously in a germline configuration. Interestingly, long-term follow-up of these patients indicated that the mutation might have had little impact on the normal function of the VHL gene, since all of them have remained asymptomatic. We further attempted to correlate this information with the results obtained by in silico analysis of this mutation using SIFT, PhD-SNP SVM profile, MutPred, PolyPhen2, and SNPs&GO prediction tools. To gain, new mechanistic insight into the structural effect, we mapped the mutation on to 3D structure (PDB ID 1LM8). Further, we analyzed the structural level changes in time scale level with respect to native and mutant protein complexes by using 12 ns molecular dynamics simulation method. Though these methods predict the mutation to have a pathogenic potential, it remains to be seen if these patients will eventually develop symptomatic disease. PMID:23626751
Models for loosely linked gene duplicates suggest lengthy persistence of both copies.
O'Hely, Martin; Wockner, Leesa
2007-06-21
Consider the appearance of a duplicate copy of a gene at a locus linked loosely, if at all, to the locus at which the gene is usually found. If all copies of the gene are subject to non-functionalizing mutations, then two fates are possible: loss of functional copies at the duplicate locus (loss of duplicate expression), or loss of functional copies at the original locus (map change). This paper proposes a simple model to address the probability of map change, the time taken for a map change and/or loss of duplicate expression, and considers where in the spectrum between loss of duplicate expression and map change such a duplicate complex is likely to be found. The findings are: the probability of map change is always half the reciprocal of the population size N, the time for a map change to occur is order NlogN generations, and that there is a marked tendency for duplicates to remain near equi-frequency with the gene at the original locus for a large portion of that time. This is in excellent agreement with simulations.
Genome structure and primitive sex chromosome revealed in Populus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tuskan, Gerald A; Yin, Tongming; Gunter, Lee E
We constructed a comprehensive genetic map for Populus and ordered 332 Mb of sequence scaffolds along the 19 haploid chromosomes in order to compare chromosomal regions among diverse members of the genus. These efforts lead us to conclude that chromosome XIX in Populus is evolving into a sex chromosome. Consistent segregation distortion in favor of the sub-genera Tacamahaca alleles provided evidence of divergent selection among species, particularly at the proximal end of chromosome XIX. A large microsatellite marker (SSR) cluster was detected in the distorted region even though the genome-wide distribute SSR sites was uniform across the physical map. Themore » differences between the genetic map and physical sequence data suggested recombination suppression was occurring in the distorted region. A gender-determination locus and an overabundance of NBS-LRR genes were also co-located to the distorted region and were put forth as the cause for divergent selection and recombination suppression. This hypothesis was verified by using fine-scale mapping of an integrated scaffold in the vicinity of the gender-determination locus. As such it appears that chromosome XIX in Populus is in the process of evolving from an autosome into a sex chromosome and that NBS-LRR genes may play important role in the chromosomal diversification process in Populus.« less
Allen Brain Atlas-Driven Visualizations: a web-based gene expression energy visualization tool.
Zaldivar, Andrew; Krichmar, Jeffrey L
2014-01-01
The Allen Brain Atlas-Driven Visualizations (ABADV) is a publicly accessible web-based tool created to retrieve and visualize expression energy data from the Allen Brain Atlas (ABA) across multiple genes and brain structures. Though the ABA offers their own search engine and software for researchers to view their growing collection of online public data sets, including extensive gene expression and neuroanatomical data from human and mouse brain, many of their tools limit the amount of genes and brain structures researchers can view at once. To complement their work, ABADV generates multiple pie charts, bar charts and heat maps of expression energy values for any given set of genes and brain structures. Such a suite of free and easy-to-understand visualizations allows for easy comparison of gene expression across multiple brain areas. In addition, each visualization links back to the ABA so researchers may view a summary of the experimental detail. ABADV is currently supported on modern web browsers and is compatible with expression energy data from the Allen Mouse Brain Atlas in situ hybridization data. By creating this web application, researchers can immediately obtain and survey numerous amounts of expression energy data from the ABA, which they can then use to supplement their work or perform meta-analysis. In the future, we hope to enable ABADV across multiple data resources.
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
Peng, Jiajie; Uygun, Sahra; Kim, Taehyong; ...
2015-02-14
Background: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. Results: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstratemore » that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. Conclusions: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited.« less
van den Broek, Evert; van Lieshout, Stef; Rausch, Christian; Ylstra, Bauke; van de Wiel, Mark A; Meijer, Gerrit A; Fijneman, Remond J A; Abeln, Sanne
2016-01-01
Development of cancer is driven by somatic alterations, including numerical and structural chromosomal aberrations. Currently, several computational methods are available and are widely applied to detect numerical copy number aberrations (CNAs) of chromosomal segments in tumor genomes. However, there is lack of computational methods that systematically detect structural chromosomal aberrations by virtue of the genomic location of CNA-associated chromosomal breaks and identify genes that appear non-randomly affected by chromosomal breakpoints across (large) series of tumor samples. 'GeneBreak' is developed to systematically identify genes recurrently affected by the genomic location of chromosomal CNA-associated breaks by a genome-wide approach, which can be applied to DNA copy number data obtained by array-Comparative Genomic Hybridization (CGH) or by (low-pass) whole genome sequencing (WGS). First, 'GeneBreak' collects the genomic locations of chromosomal CNA-associated breaks that were previously pinpointed by the segmentation algorithm that was applied to obtain CNA profiles. Next, a tailored annotation approach for breakpoint-to-gene mapping is implemented. Finally, dedicated cohort-based statistics is incorporated with correction for covariates that influence the probability to be a breakpoint gene. In addition, multiple testing correction is integrated to reveal recurrent breakpoint events. This easy-to-use algorithm, 'GeneBreak', is implemented in R ( www.cran.r-project.org ) and is available from Bioconductor ( www.bioconductor.org/packages/release/bioc/html/GeneBreak.html ).
Chromosomal Mapping of Canine-Derived BAC Clones to the Red Fox and American Mink Genomes
Vorobieva, Nadegda V.; Beklemisheva, Violetta R.; Johnson, Jennifer L.; Temnykh, Svetlana V.; Yudkin, Dmitry V.; Trut, Lyudmila N.; Andre, Catherine; Galibert, Francis; Aguirre, Gustavo D.; Acland, Gregory M.; Graphodatsky, Alexander S.
2009-01-01
High-quality sequencing of the dog (Canis lupus familiaris) genome has enabled enormous progress in genetic mapping of canine phenotypic variation. The red fox (Vulpes vulpes), another canid species, also exhibits a wide range of variation in coat color, morphology, and behavior. Although the fox genome has not yet been sequenced, canine genomic resources have been used to construct a meiotic linkage map of the red fox genome and begin genetic mapping in foxes. However, a more detailed gene-specific comparative map between the dog and fox genomes is required to establish gene order within homologous regions of dog and fox chromosomes and to refine breakpoints between homologous chromosomes of the 2 species. In the current study, we tested whether canine-derived gene–containing bacterial artificial chromosome (BAC) clones can be routinely used to build a gene-specific map of the red fox genome. Forty canine BAC clones were mapped to the red fox genome by fluorescence in situ hybridization (FISH). Each clone was uniquely assigned to a single fox chromosome, and the locations of 38 clones agreed with cytogenetic predictions. These results clearly demonstrate the utility of FISH mapping for construction of a whole-genome gene-specific map of the red fox. The further possibility of using canine BAC clones to map genes in the American mink (Mustela vison) genome was also explored. Much lower success was obtained for this more distantly related farm-bred species, although a few BAC clones were mapped to the predicted chromosomal locations. PMID:19546120
2013-01-01
Background As for other major crops, achieving a complete wheat genome sequence is essential for the application of genomics to breeding new and improved varieties. To overcome the complexities of the large, highly repetitive and hexaploid wheat genome, the International Wheat Genome Sequencing Consortium established a chromosome-based strategy that was validated by the construction of the physical map of chromosome 3B. Here, we present improved strategies for the construction of highly integrated and ordered wheat physical maps, using chromosome 1BL as a template, and illustrate their potential for evolutionary studies and map-based cloning. Results Using a combination of novel high throughput marker assays and an assembly program, we developed a high quality physical map representing 93% of wheat chromosome 1BL, anchored and ordered with 5,489 markers including 1,161 genes. Analysis of the gene space organization and evolution revealed that gene distribution and conservation along the chromosome results from the superimposition of the ancestral grass and recent wheat evolutionary patterns, leading to a peak of synteny in the central part of the chromosome arm and an increased density of non-collinear genes towards the telomere. With a density of about 11 markers per Mb, the 1BL physical map provides 916 markers, including 193 genes, for fine mapping the 40 QTLs mapped on this chromosome. Conclusions Here, we demonstrate that high marker density physical maps can be developed in complex genomes such as wheat to accelerate map-based cloning, gain new insights into genome evolution, and provide a foundation for reference sequencing. PMID:23800011
Mitchelson, K R
1996-01-01
The small single-copy region (SSCR) of the chloroplast genome of many higher plants typically contain ndh genes encoding proteins that share homology with subunits of the respiratory-chain reduced nicotinamide adenine dinucleotide (NADH) dehydrogenase complex of mitochondria. A map of the lettuce chloroplast SSCR has been determined by Southern cross-hybridization, taking advantage of the high degree of homology between a tobacco small single-copy fragment and a corresponding lettuce chloroplast fragment. The gene order of the SSCR of lettuce and tobacco chloroplasts is similar. The cross-hybridization method can rapidly create a primary gene map of unknown chloroplast fragments, thus providing detailed information of the localization and arrangement of genes and conserved open reading frame regions.
Sánchez-Mir, Laura; Salat-Canela, Clàudia; Paulo, Esther; Carmona, Mercè; Ayté, José; Oliva, Baldo; Hidalgo, Elena
2018-02-01
Stress-dependent activation of signaling cascades is often mediated by phosphorylation events, but the exact nature and role of these phosphorelays are frequently poorly understood. Here, we review which are the consequences of the stress-dependent phosphorylation of a transcription factor on gene activation. In fission yeast, the MAP kinase Sty1 is activated upon several environmental hazards and promotes cell adaptation and survival, greatly through activation of a gene program mediated by the transcription factor Atf1. Although described decades ago, the role of the phosphorylation of Atf1 by Sty1 is still a matter of debate. We present here a brief review of recent data, obtained through the characterization of several phosphorylation mutant derivatives of Atf1, demonstrating that Atf1 phosphorylation does not stabilize the factor nor stimulates its binding to DNA. Rather, it provides a structural platform of interaction with the transcriptional machinery. Based on these findings, future work will establish how this phosphorylated trans-activation domain promotes the massive gene expression shift allowing cellular adaptation to stress.
Campbell, Raymond; Pont, Simon D A; Morris, Jenny A; McKenzie, Gaynor; Sharma, Sanjeev Kumar; Hedley, Pete E; Ramsay, Gavin; Bryan, Glenn J; Taylor, Mark A
2014-09-01
Genome-wide QTL analysis of potato tuber carotenoid content was investigated in populations of Solanum tuberosum Group Phureja that segregate for flesh colour, revealing a novel major QTL on chromosome 9. The carotenoid content of edible plant storage organs is a key nutritional and quality trait. Although the structural genes that encode the biosynthetic enzymes are well characterised, much less is known about the factors that determine overall storage organ content. In this study, genome-wide QTL mapping, in concert with an efficient 'genetical genomics' analysis using bulked samples, has been employed to investigate the genetic architecture of potato tuber carotenoid content. Two diploid populations of Solanum tuberosum Group Phureja were genotyped (AFLP, SSR and DArT markers) and analysed for their tuber carotenoid content over two growing seasons. Common to both populations were QTL that explained relatively small proportions of the variation in constituent carotenoids and a major QTL on chromosome 3 explaining up to 71 % of the variation in carotenoid content. In one of the populations (01H15), a second major carotenoid QTL was identified on chromosome 9, explaining up to 20 % of the phenotypic variation. Whereas the major chromosome 3 QTL was likely to be due to an allele of a gene encoding β-carotene hydroxylase, no known carotenoid biosynthetic genes are located in the vicinity of the chromosome 9 QTL. A unique expression profiling strategy using phenotypically distinct bulks comprised individuals with similar carotenoid content provided further support for the QTL mapping to chromosome 9. This study shows the potential of using the potato genome sequence to link genetic maps to data arising from eQTL approaches to enhance the discovery of candidate genes underlying QTLs.
Bruining, Hilgo; Matsui, Asuka; Oguro-Ando, Asami; Kahn, René S; Van't Spijker, Heleen M; Akkermans, Guus; Stiedl, Oliver; van Engeland, Herman; Koopmans, Bastijn; van Lith, Hein A; Oppelaar, Hugo; Tieland, Liselotte; Nonkes, Lourens J; Yagi, Takeshi; Kaneko, Ryosuke; Burbach, J Peter H; Yamamoto, Nobuhiko; Kas, Martien J
2015-10-01
Quantitative genetic analysis of basic mouse behaviors is a powerful tool to identify novel genetic phenotypes contributing to neurobehavioral disorders. Here, we analyzed genetic contributions to single-trial, long-term social and nonsocial recognition and subsequently studied the functional impact of an identified candidate gene on behavioral development. Genetic mapping of single-trial social recognition was performed in chromosome substitution strains, a sophisticated tool for detecting quantitative trait loci (QTL) of complex traits. Follow-up occurred by generating and testing knockout (KO) mice of a selected QTL candidate gene. Functional characterization of these mice was performed through behavioral and neurological assessments across developmental stages and analyses of gene expression and brain morphology. Chromosome substitution strain 14 mapping studies revealed an overlapping QTL related to long-term social and object recognition harboring Pcdh9, a cell-adhesion gene previously associated with autism spectrum disorder. Specific long-term social and object recognition deficits were confirmed in homozygous (KO) Pcdh9-deficient mice, while heterozygous mice only showed long-term social recognition impairment. The recognition deficits in KO mice were not associated with alterations in perception, multi-trial discrimination learning, sociability, behavioral flexibility, or fear memory. Rather, KO mice showed additional impairments in sensorimotor development reflected by early touch-evoked biting, rotarod performance, and sensory gating deficits. This profile emerged with structural changes in deep layers of sensory cortices, where Pcdh9 is selectively expressed. This behavior-to-gene study implicates Pcdh9 in cognitive functions required for long-term social and nonsocial recognition. This role is supported by the involvement of Pcdh9 in sensory cortex development and sensorimotor phenotypes. Copyright © 2015 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Regional gene mapping using mixed radiation hybrids and reverse chromosome painting.
Lin, J Y; Bedford, J S
1997-11-01
We describe a new approach for low-resolution physical mapping using pooled DNA probe from mixed (non-clonal) populations of human-CHO cell hybrids and reverse chromosome painting. This mapping method is based on a process in which the human chromosome fragments bearing a complementing gene were selectively retained in a large non-clonal population of CHO-human hybrid cells during a series of 12- to 15-Gy gamma irradiations each followed by continuous growth selection. The location of the gene could then be identified by reverse chromosome painting on normal human metaphase spreads using biotinylated DNA from this population of "enriched" hybrid cells. We tested the validity of this method by correctly mapping the complementing human HPRT gene, whose location is well established. We then demonstrated the method's usefulness by mapping the chromosome location of a human gene which complemented the defect responsible for the hypersensitivity to ionizing radiation in CHO irs-20 cells. This method represents an efficient alternative to conventional concordance analysis in somatic cell hybrids where detailed chromosome analysis of numerous hybrid clones is necessary. Using this approach, it is possible to localize a gene for which there is no prior sequence or linkage information to a subchromosomal region, thus facilitating association with known mapping landmarks (e.g. RFLP, YAC or STS contigs) for higher-resolution mapping.
Lgn1, a gene that determines susceptibility to Legionella pneumophila, maps to mouse chromosome 13
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dietrich, W.F.; Damron, D.M.; Lander, E.S.
1995-04-10
The intracellular pathogen Legionella pneumophila is unable to replicate in macrophages derived from most inbred mouse strains. Here, we report the mapping of a gene, called Lgn1, that determines whether mouse macrophages are permissive for the intracellular replication of L. pneumophila. Although Lgn1 has been previously reported to map to mouse chromosome 15, we show here that it actually maps to chromosome 13, between D13Mit128 and D13Mit70. In the absence of any regional candidates for Lgn1, this map position will facilitate positional cloning attempts directed at this gene. 22 refs., 2 figs., 2 tabs.
Molecular Structure and Transformation of the Glucose Dehydrogenase Gene in Drosophila Melanogaster
Whetten, R.; Organ, E.; Krasney, P.; Cox-Foster, D.; Cavener, D.
1988-01-01
We have precisely mapped and sequenced the three 5' exons of the Drosophila melanogaster Gld gene and have identified the start sites for transcription and translation. The first exon is composed of 335 nucleotides and does not contain any putative translation start codons. The second exon is separated from the first exon by 8 kb and contains the Gld translation start codon. The inferred amino acid sequence of the amino terminus contains two unusual features: three tandem repeats of serine-alanine, and a relatively high density of cysteine residues. P element-mediated transformation experiments demonstrated that a 17.5-kb genomic fragment contains the functional and regulatory components of the Gld gene. PMID:3143620
Transcriptional Regulatory Networks in Saccharomyces cerevisiae
NASA Astrophysics Data System (ADS)
Lee, Tong Ihn; Rinaldi, Nicola J.; Robert, François; Odom, Duncan T.; Bar-Joseph, Ziv; Gerber, Georg K.; Hannett, Nancy M.; Harbison, Christopher T.; Thompson, Craig M.; Simon, Itamar; Zeitlinger, Julia; Jennings, Ezra G.; Murray, Heather L.; Gordon, D. Benjamin; Ren, Bing; Wyrick, John J.; Tagne, Jean-Bosco; Volkert, Thomas L.; Fraenkel, Ernest; Gifford, David K.; Young, Richard A.
2002-10-01
We have determined how most of the transcriptional regulators encoded in the eukaryote Saccharomyces cerevisiae associate with genes across the genome in living cells. Just as maps of metabolic networks describe the potential pathways that may be used by a cell to accomplish metabolic processes, this network of regulator-gene interactions describes potential pathways yeast cells can use to regulate global gene expression programs. We use this information to identify network motifs, the simplest units of network architecture, and demonstrate that an automated process can use motifs to assemble a transcriptional regulatory network structure. Our results reveal that eukaryotic cellular functions are highly connected through networks of transcriptional regulators that regulate other transcriptional regulators.
Rodriguez, L; Lampen, J O; MacKay, V L
1981-01-01
Saccharomyces cerevisiae revertant strain D10-ER1 has been shown to contain thermosensitive forms of the large (glycoprotein) and small (carbohydrate-free) invertases and a very low level of the small enzyme, along with a wild-type level of the large form (T. Mizunaga et al., Mol. Cell. Biol. 1:460-468, 1981). These characteristics cosegregated in crosses of the revertant strain with wild-type sucrose-fermenting (SUC1) or nonfermenting (suc0) strains. In addition, there is tight linkage between sucrose and maltose fermentation in revertant D10-ER1 (characteristic of the SUC1 and MAL1 genes). From this we infer that a single reversion event is responsible for the several changes observed in D10-ER1, and that this mutation maps within or very close to the SUC1 gene present in the ancestor strain 4059-358D. The revertant SUC1 allele in D10-ER1 (termed SUC1-R1) was expressed independently of the wild-type SUC1 gene when both were present in diploid cells. Diploids carrying only the wild-type or the mutant genes synthesized invertases with the characteristics of the parental Suc+ haploids. The possibility that a modifier gene was responsible for the alterations in the invertases of revertant D10-ER1 was ruled out by appropriate crosses. We conclude that SUC1 is a structural gene that codes for both the large and the small forms of invertase and suggest that SUC2 through SUC5 are structural genes as well. PMID:6765604
Iqbal, Muhammad Javed; Mamidi, Sujan; Ahsan, Rubina; Kianian, Shahryar F; Coyne, Clarice J; Hamama, Anwar A; Narina, Satya S; Bhardwaj, Harbans L
2012-08-01
White lupin (Lupinus albus L.) has been around since 300 B.C. and is recognized for its ability to grow on poor soils and application as green manure in addition to seed harvest. The seed has very high levels of protein (33-47 %) and oil (6-13 %). It also has many secondary metabolites that are potentially of nutraceutical value to animals and humans. Despite such a great potential, lupins role in modern agriculture began only in the twentieth century. Although a large collection of Lupinus germplasm accessions is available worldwide, rarely have they been genetically characterized. Additionally, scarce genomic resources in terms of recombinant populations and genome information have been generated for L. albus. With the advancement in association mapping methods, the natural populations have the potential to replace the recombinant populations in gene mapping and marker-trait associations. Therefore, we studied the genetic similarity, population structure and marker-trait association in a USDA germplasm collection for their current and future application in this crop improvement. A total of 122 PI (Plant Inventory) lines were screened with 18 AFLP primer pairs that generated 2,277 fragments. A subset of 892 polymorphic markers with MAF >0.05 (minor allele frequency) were used for association mapping. The cluster analysis failed to group accessions on the basis of their passport information, and a weak structure and low linkage disequilibrium (LD) were observed indicating the usefulness of the collection for association mapping. Moreover, we were also able to identify two markers (a p value of 1.53 × 10(-4) and 2.3 × 10(-4)) that explained 22.69 and 20.5 % of seed weight variation determined using R (LR) (2) . The implications of lack of geographic clustering, population structure, low LD and the ability of AFLP to map seed weight trait using association mapping and the usefulness of the PI collections in breeding programs are discussed.
Genetic organization of the unc-22 IV gene and the adjacent region in Caenorhabditis elegans.
Rogalski, T M; Baillie, D L
1985-01-01
The genetic organization of the region immediately adjacent to the unc-22 IV gene in Caenorhabditis elegans has been studied. We have identified twenty essential genes in this interval of approximately 1.5-map units on Linkage Group IV. The mutations that define these genes were positioned by recombination mapping and complementation with several deficiencies. With few exceptions, the positions obtained by these two methods agreed. Eight of the twenty essential genes identified are represented by more than one allele. Three possible internal deletions of the unc-22 gene have been located by intra-genic mapping. In addition, the right end point of a deficiency or an inversion affecting the adjacent genes let-56 and unc-22 has been positioned inside the unc-22 gene.
ERK1 and ERK2 Map Kinases: Specific Roles or Functional Redundancy?
Buscà, Roser; Pouysségur, Jacques; Lenormand, Philippe
2016-01-01
The MAP kinase signaling cascade Ras/Raf/MEK/ERK has been involved in a large variety of cellular and physiological processes that are crucial for life. Many pathological situations have been associated to this pathway. More than one isoform has been described at each level of the cascade. In this review we devoted our attention to ERK1 and ERK2, which are the effector kinases of the pathway. Whether ERK1 and ERK2 specify functional differences or are in contrast functionally redundant, constitutes an ongoing debate despite the huge amount of studies performed to date. In this review we compiled data on ERK1 vs. ERK2 gene structures, protein sequences, expression levels, structural and molecular mechanisms of activation and substrate recognition. We have also attempted to perform a rigorous analysis of studies regarding the individual roles of ERK1 and ERK2 by the means of morpholinos, siRNA, and shRNA silencing as well as gene disruption or gene replacement in mice. Finally, we comment on a recent study of gene and protein evolution of ERK isoforms as a distinct approach to address the same question. Our review permits the evaluation of the relevance of published studies in the field especially when measurements of global ERK activation are taken into account. Our analysis favors the hypothesis of ERK1 and ERK2 exhibiting functional redundancy and points to the concept of the global ERK quantity, and not isoform specificity, as being the essential determinant to achieve ERK function. PMID:27376062
Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion
Žitnik, Marinka; Zupan, Blaž
2015-01-01
Abstract Epistatic miniarray profile (E-MAP) is a popular large-scale genetic interaction discovery platform. E-MAPs benefit from quantitative output, which makes it possible to detect subtle interactions with greater precision. However, due to the limits of biotechnology, E-MAP studies fail to measure genetic interactions for up to 40% of gene pairs in an assay. Missing measurements can be recovered by computational techniques for data imputation, in this way completing the interaction profiles and enabling downstream analysis algorithms that could otherwise be sensitive to missing data values. We introduce a new interaction data imputation method called network-guided matrix completion (NG-MC). The core part of NG-MC is low-rank probabilistic matrix completion that incorporates prior knowledge presented as a collection of gene networks. NG-MC assumes that interactions are transitive, such that latent gene interaction profiles inferred by NG-MC depend on the profiles of their direct neighbors in gene networks. As the NG-MC inference algorithm progresses, it propagates latent interaction profiles through each of the networks and updates gene network weights toward improved prediction. In a study with four different E-MAP data assays and considered protein–protein interaction and gene ontology similarity networks, NG-MC significantly surpassed existing alternative techniques. Inclusion of information from gene networks also allowed NG-MC to predict interactions for genes that were not included in original E-MAP assays, a task that could not be considered by current imputation approaches. PMID:25658751
THREaD Mapper Studio: a novel, visual web server for the estimation of genetic linkage maps
Cheema, Jitender; Ellis, T. H. Noel; Dicks, Jo
2010-01-01
The estimation of genetic linkage maps is a key component in plant and animal research, providing both an indication of the genetic structure of an organism and a mechanism for identifying candidate genes associated with traits of interest. Because of this importance, several computational solutions to genetic map estimation exist, mostly implemented as stand-alone software packages. However, the estimation process is often largely hidden from the user. Consequently, problems such as a program crashing may occur that leave a user baffled. THREaD Mapper Studio (http://cbr.jic.ac.uk/threadmapper) is a new web site that implements a novel, visual and interactive method for the estimation of genetic linkage maps from DNA markers. The rationale behind the web site is to make the estimation process as transparent and robust as possible, while also allowing users to use their expert knowledge during analysis. Indeed, the 3D visual nature of the tool allows users to spot features in a data set, such as outlying markers and potential structural rearrangements that could cause problems with the estimation procedure and to account for them in their analysis. Furthermore, THREaD Mapper Studio facilitates the visual comparison of genetic map solutions from third party software, aiding users in developing robust solutions for their data sets. PMID:20494977
Okamura-Oho, Yuko; Shimokawa, Kazuro; Nishimura, Masaomi; Takemoto, Satoko; Sato, Akira; Furuichi, Teiichi; Yokota, Hideo
2014-01-01
Using a recently invented technique for gene expression mapping in the whole-anatomy context, termed transcriptome tomography, we have generated a dataset of 36,000 maps of overall gene expression in the adult-mouse brain. Here, using an informatics approach, we identified a broad co-expression network that follows an inverse power law and is rich in functional interaction and gene-ontology terms. Our framework for the integrated analysis of expression maps and graphs of co-expression networks revealed that groups of combinatorially expressed genes, which regulate cell differentiation during development, were present in the adult brain and each of these groups was associated with a discrete cell types. These groups included non-coding genes of unknown function. We found that these genes specifically linked developmentally conserved groups in the network. A previously unrecognized robust expression pattern covering the whole brain was related to the molecular anatomy of key biological processes occurring in particular areas. PMID:25382412
Wen, Qing; Kim, Chang-Sik; Hamilton, Peter W; Zhang, Shu-Dong
2016-05-11
Gene expression connectivity mapping has gained much popularity recently with a number of successful applications in biomedical research testifying its utility and promise. Previously methodological research in connectivity mapping mainly focused on two of the key components in the framework, namely, the reference gene expression profiles and the connectivity mapping algorithms. The other key component in this framework, the query gene signature, has been left to users to construct without much consensus on how this should be done, albeit it has been an issue most relevant to end users. As a key input to the connectivity mapping process, gene signature is crucially important in returning biologically meaningful and relevant results. This paper intends to formulate a standardized procedure for constructing high quality gene signatures from a user's perspective. We describe a two-stage process for making quality gene signatures using gene expression data as initial inputs. First, a differential gene expression analysis comparing two distinct biological states; only the genes that have passed stringent statistical criteria are considered in the second stage of the process, which involves ranking genes based on statistical as well as biological significance. We introduce a "gene signature progression" method as a standard procedure in connectivity mapping. Starting from the highest ranked gene, we progressively determine the minimum length of the gene signature that allows connections to the reference profiles (drugs) being established with a preset target false discovery rate. We use a lung cancer dataset and a breast cancer dataset as two case studies to demonstrate how this standardized procedure works, and we show that highly relevant and interesting biological connections are returned. Of particular note is gefitinib, identified as among the candidate therapeutics in our lung cancer case study. Our gene signature was based on gene expression data from Taiwan female non-smoker lung cancer patients, while there is evidence from independent studies that gefitinib is highly effective in treating women, non-smoker or former light smoker, advanced non-small cell lung cancer patients of Asian origin. In summary, we introduced a gene signature progression method into connectivity mapping, which enables a standardized procedure for constructing high quality gene signatures. This progression method is particularly useful when the number of differentially expressed genes identified is large, and when there is a need to prioritize them to be included in the query signature. The results from two case studies demonstrate that the approach we have developed is capable of obtaining pertinent candidate drugs with high precision.
Mapping of the Tuple1 gene to mouse chromosome 16A-B1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mattei, M.G.; Halford, S.; Scambler, P.J.
The human TUPLE1 gene encodes a putative transcriptional regulator and maps to chromosome 22, and therefore may play a role in Di-George syndrome (DGS), relo-cardio-facial syndrome (VCFS), or a related pathology. The murine TUPLE1 gene has also been cloned and shows strong sequence similarity to TUPLE1. Comparative mapping is useful in the study of chromosome evolution and is sometimes able to indicate possible mouse mutations that are potential models of human genetic disorders. As TIPLE1 is a candidate gene for the haploinsufficient phenotype in DGS, we mapped TUPLE1 to mouse chromosome 16A-B1. 6 refs., 1 fig.
Sineokiĭ, S P; Pogosov, V Z; Iankovskiĭ, N K; Krylov, V N
1976-01-01
123 Amber mutants of lambdoid bacteriophage phi81 are isolated and distributed into 19 complementation groups. Deletion mapping made possible to locate 5 gene groups on the genetic map of bacteriophage phi81 and to determine a region of possible location of mm' sticky ends on the prophage genetic map. A gene of phage phi81 is localized, which controls the adsorption specificity, and which functional similarity to a respective gene of phage phi80 is demonstrated.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.
2006-01-09
Chloroplast genome structure, gene order and content arehighly conserved in land plants. We sequenced the complete chloroplastgenome sequence of Trachelium caeruleum (Campanulaceae) a member of anangiosperm family known for highly rearranged chloroplast genomes. Thetotal genome size is 162,321 bp with an IR of 27,273 bp, LSC of 100,113bp and SSC of 7,661 bp. The genome encodes 115 unique genes, with 19duplicated in the IR, a tRNA (trnI-CAU) duplicated once in the LSC and aprotein coding gene (psbJ) duplicated twice, for a total of 137 genes.Four genes (ycf15, rpl23, infA and accD) are truncated and likelynonfunctional; three others (clpP, ycf1 andmore » ycf2) are so highly divergedthat they may now be pseudogenes. The most conspicuous feature of theTrachelium genome is the presence of eighteen internally unrearrangedblocks of genes that have been inverted or relocated within the genome,relative to the typical gene order of most angiosperm chloroplastgenomes. Recombination between repeats or tRNAs has been suggested as twomeans of chloroplast genome rearrangements. We compared the relativenumber of repeats in Trachelium to eight other angiosperm chloroplastgenomes, and evaluated the location of repeats and tRNAs in relation torearrangements. Trachelium has the highest number and largest repeats,which are concentrated near inversion endpoints or other rearrangements.tRNAs occur at many but not all inversion endpoints. There is likely nosingle mechanism responsible for the remarkable number of alterations inthis genome, but both repeats and tRNAs are clearly associated with theserearrangements. Land plant chloroplast genomes are highly conserved instructure, gene order and content. The chloroplast genomes of ferns, thegymnosperm Ginkgo, and most angiosperms are nearly collinear, reflectingthe gene order in lineages that diverged from lycopsids and the ancestralchloroplast gene order over 350 million years ago (Raubeson and Jansen,1992). Although earlier mapping studies identified a number of taxa inwhich several rearrangements have occurred (reviewed in Raubeson andJansen, 2005), an extraordinary number of chloroplast genome alterationsare concentrated in several families in the angiosperm order Asterales(sensu APGII, Bremer et al., 2003). Gene mapping studies ofrepresentatives of the Campanulaceae (Cosner, 1993; Cosner et al.,1997,2004) and Lobeliaceae (Knox et al., 1993; Knox and Palmer, 1999)identified large inversions, contraction and expansion of the invertedrepeat regions, and several insertions and deletions in the cpDNAs ofthese closely related taxa. Detailed restriction site and gene mapping ofthe chloroplast genome of Trachelium caeruleum (Campanulaceae) identifiedseven to ten large inversions, families of repeats associated withrearrangements, possible transpositions, and even the disruption ofoperons (Cosner et al., 1997). Seventeen other members of theCampanulaceae were mapped and exhibit many additional rearrangements(Cosner et al., 2004). What happened in this lineage that made itsusceptible to so many chloroplast genome rearrangements? How do normallyvery conserved chloroplast genomes change? The cause of rearrangements inthis group is unclear based on the limited resolution available withmapping techniques. Several mechanisms have been proposed to explain howrearrangements occur: recombination between repeats, transposition, ortemporary instability due to loss of the inverted repeat (Raubeson andJansen, 2005). Sequencing whole chloroplast genomes within theCampanulaceae offers a unique opportunity to examine both the extent andmechanisms of rearrangements within a phylogenetic framework.We reporthere the first complete chloroplast genome sequence of a member of theCampanulaceae, Trachelium caeruleum. This work will serve as a benchmarkfor subsequent, comparative sequencing and analysis of other members ofthis family and close relatives, with the goal of further understandingchloroplast genome evolution. We confirmed features previously identifiedthrough mapping, and discovered many additional structural changes,including several partial to entire gene duplications, deterioration ofat least four normally conserved chloroplast genes into gene fragments,and the nature and position of numerous repeat elements at or nearinversion endpoints. The focus of this paper is on analyses of sequencesat or near these rearrangements in Trachelium caeruleum. Inversions arebelieved to occur due to the presence of repeat elements subject tohomologous recombination (Palmer, 1991; Knox et al., 1993). Repeats mayfacilitate inversions or other genome rearrangements (Achaz et al.,2003), and higher incidences of repeats have been correlated with greaternumbers of rearrangements (Rocha, 2003). Alternatively, repeats mayproliferate within a genome asa result of DNA strand repair mechanismsfollowing a rearrangement event such as an inversion. Gene« less
The Human Genome Initiative: First Steps.
ERIC Educational Resources Information Center
Newman, Alan R.
1990-01-01
Described is the basic biology involved in mapping chromosomes as presented at a symposium at a recent meeting of the American Chemical Association which focused on the Human Genome Initiative. Different types of gene maps and techniques used to produce gene maps are discussed. (CW)
Learning the Structure of Biomedical Relationships from Unstructured Text
Percha, Bethany; Altman, Russ B.
2015-01-01
The published biomedical research literature encompasses most of our understanding of how drugs interact with gene products to produce physiological responses (phenotypes). Unfortunately, this information is distributed throughout the unstructured text of over 23 million articles. The creation of structured resources that catalog the relationships between drugs and genes would accelerate the translation of basic molecular knowledge into discoveries of genomic biomarkers for drug response and prediction of unexpected drug-drug interactions. Extracting these relationships from natural language sentences on such a large scale, however, requires text mining algorithms that can recognize when different-looking statements are expressing similar ideas. Here we describe a novel algorithm, Ensemble Biclustering for Classification (EBC), that learns the structure of biomedical relationships automatically from text, overcoming differences in word choice and sentence structure. We validate EBC's performance against manually-curated sets of (1) pharmacogenomic relationships from PharmGKB and (2) drug-target relationships from DrugBank, and use it to discover new drug-gene relationships for both knowledge bases. We then apply EBC to map the complete universe of drug-gene relationships based on their descriptions in Medline, revealing unexpected structure that challenges current notions about how these relationships are expressed in text. For instance, we learn that newer experimental findings are described in consistently different ways than established knowledge, and that seemingly pure classes of relationships can exhibit interesting chimeric structure. The EBC algorithm is flexible and adaptable to a wide range of problems in biomedical text mining. PMID:26219079
Song, Zhaojun; Ye, Yongjie; Zhang, Zhi; Shen, Jieliang; Hu, Zhenming; Wang, Zhigang; Zheng, Jiazhuang
2018-02-12
Various gene delivery systems have been widely studied for the acute spinal cord injury (SCI) treatment. In the present study, a novel type of brain-derived neurotrophic factor (BDNF)-loaded cationic nanobubbles (CNBs) conjugated with MAP-2 antibody (mAb MAP-2 /BDNF/CNBs) was prepared to provide low-intensity focused ultrasound (LIFU)-targeted gene therapy. In vitro experiments, the ultrasound-targeted tranfection to BDNF overexpressioin in neurons and efficiently inhibition neuronal apoptosis have been demonstrated, and the elaborately designed mAb MAP-2 /BDNF/CNBs can specifically target to the neurons. Furthermore, in a acute SCI rat model, LIFU-mediated mAb MAP-2 /BDNF/CNBs transfection significantly increased BDNF expression, attenuated histological injury, decreased neurons loss, inhibited neuronal apoptosis in injured spinal cords, and increased BBB scores in SCI rats. LIFU-mediated mAb MAP-2 /BDNF/CNBs destruction significantly increase transfection efficiency of BDNF gene both in vitro and in vivo, and has a significant neuroprotective effect on the injured spinal cord. Therefore, the combination of LIFU irradiation and gene therapy through mAb MAP-2 /BDNF/CNBs can be considered as a novel non-invasive and targeted treatment for gene therapy of SCI. Copyright © 2018 Elsevier Inc. All rights reserved.
Maria C. Mateo-Sanchez; Niko Balkenhol; Samuel Cushman; Trinidad Perez; Ana Dominguez; Santiago Saura
2015-01-01
Most current methods to assess connectivity begin with landscape resistance maps. The prevailing resistance models are commonly based on expert opinion and, more recently, on a direct transformation of habitat suitability. However, habitat associations are not necessarily accurate indicators of dispersal, and thus may fail as a surrogate of resistance to...
Filling the gap: Micro-C accesses the nucleosomal fiber at 100-1000 bp resolution.
Mozziconacci, Julien; Koszul, Romain
2015-08-21
The fine three-dimensional structure of the nucleosomal fiber has remained elusive to genome-wide chromosome conformation capture (3C) approaches. A new study mapping contacts at the single nucleosome level (Micro-C) reveals topological interacting domains along budding yeast chromosomes. These domains encompass one to five consecutive genes and are delimited by highly active promoters.
Spielmann, A; Stutz, E
1983-01-01
The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2. PMID:6314279
Srivastava, Rishi; Singh, Mohar; Bajaj, Deepak; Parida, Swarup K.
2016-01-01
Development and large-scale genotyping of user-friendly informative genome/gene-derived InDel markers in natural and mapping populations is vital for accelerating genomics-assisted breeding applications of chickpea with minimal resource expenses. The present investigation employed a high-throughput whole genome next-generation resequencing strategy in low and high pod number parental accessions and homozygous individuals constituting the bulks from each of two inter-specific mapping populations [(Pusa 1103 × ILWC 46) and (Pusa 256 × ILWC 46)] to develop non-erroneous InDel markers at a genome-wide scale. Comparing these high-quality genomic sequences, 82,360 InDel markers with reference to kabuli genome and 13,891 InDel markers exhibiting differentiation between low and high pod number parental accessions and bulks of aforementioned mapping populations were developed. These informative markers were structurally and functionally annotated in diverse coding and non-coding sequence components of genome/genes of kabuli chickpea. The functional significance of regulatory and coding (frameshift and large-effect mutations) InDel markers for establishing marker-trait linkages through association/genetic mapping was apparent. The markers detected a greater amplification (97%) and intra-specific polymorphic potential (58–87%) among a diverse panel of cultivated desi, kabuli, and wild accessions even by using a simpler cost-efficient agarose gel-based assay implicating their utility in large-scale genetic analysis especially in domesticated chickpea with narrow genetic base. Two high-density inter-specific genetic linkage maps generated using aforesaid mapping populations were integrated to construct a consensus 1479 InDel markers-anchored high-resolution (inter-marker distance: 0.66 cM) genetic map for efficient molecular mapping of major QTLs governing pod number and seed yield per plant in chickpea. Utilizing these high-density genetic maps as anchors, three major genomic regions harboring each of pod number and seed yield robust QTLs (15–28% phenotypic variation explained) were identified on chromosomes 2, 4, and 6. The integration of genetic and physical maps at these QTLs mapped on chromosomes scaled-down the long major QTL intervals into high-resolution short pod number and seed yield robust QTL physical intervals (0.89–2.94 Mb) which were essentially got validated in multiple genetic backgrounds of two chickpea mapping populations. The genome-wide InDel markers including natural allelic variants and genomic loci/genes delineated at major six especially in one colocalized novel congruent robust pod number and seed yield robust QTLs mapped on a high-density consensus genetic map were found most promising in chickpea. These functionally relevant molecular tags can drive marker-assisted genetic enhancement to develop high-yielding cultivars with increased seed/pod number and yield in chickpea. PMID:27695461
Zhang, Yanxin; Wang, Linhai; Gao, Yuan; Li, Donghua; Yu, Jingyin; Zhou, Rong; Zhang, Xiurong
2018-06-14
As an important oil crop, growth habit of sesame (Sesamum indicum L.) is naturally indeterminate, which brings about asynchronous maturity of capsules and causes loss of yield. The genetic basis of determinate growth habit in sesame was investigated by classical genetic analysis through multiple populations, results revealed that it was controlled by an unique recessive gene. The genotyping by sequencing (GBS) approach was employed for high-throughput SNP identification and genotyping in the F 2 population, then a high density bin map was constructed, the map was 1086.403 cM in length, which consisted of 1184 bins (13,679 SNPs), with an average of 0.918 cM between adjacent bins. Based on bin mapping in conjunction with SSR markers analysis in targeted region, the novel sesame determinacy gene was mapped on LG09 in a genome region of 41 kb. This study dissected genetic basis of determinate growth habit in sesame, constructed a new high-density bin map and mapped a novel determinacy gene. Results of this study demonstrate that we employed an optimized approach to get fine-accuracy, high-resolution and high-efficiency mapping result in sesame. The findings provided important foundation for sesame determinacy gene cloning and were expected to be applied in breeding for cultivars suited to mechanized production.
A comprehensive whole-genome integrated cytogenetic map for the alpaca (Lama pacos).
Avila, Felipe; Baily, Malorie P; Perelman, Polina; Das, Pranab J; Pontius, Joan; Chowdhary, Renuka; Owens, Elaine; Johnson, Warren E; Merriwether, David A; Raudsepp, Terje
2014-01-01
Genome analysis of the alpaca (Lama pacos, LPA) has progressed slowly compared to other domestic species. Here, we report the development of the first comprehensive whole-genome integrated cytogenetic map for the alpaca using fluorescence in situ hybridization (FISH) and CHORI-246 BAC library clones. The map is comprised of 230 linearly ordered markers distributed among all 36 alpaca autosomes and the sex chromosomes. For the first time, markers were assigned to LPA14, 21, 22, 28, and 36. Additionally, 86 genes from 15 alpaca chromosomes were mapped in the dromedary camel (Camelus dromedarius, CDR), demonstrating exceptional synteny and linkage conservation between the 2 camelid genomes. Cytogenetic mapping of 191 protein-coding genes improved and refined the known Zoo-FISH homologies between camelids and humans: we discovered new homologous synteny blocks (HSBs) corresponding to HSA1-LPA/CDR11, HSA4-LPA/CDR31 and HSA7-LPA/CDR36, and revised the location of breakpoints for others. Overall, gene mapping was in good agreement with the Zoo-FISH and revealed remarkable evolutionary conservation of gene order within many human-camelid HSBs. Most importantly, 91 FISH-mapped markers effectively integrated the alpaca whole-genome sequence and the radiation hybrid maps with physical chromosomes, thus facilitating the improvement of the sequence assembly and the discovery of genes of biological importance. © 2015 S. Karger AG, Basel.
Ohmido, Nobuko; Fukui, Kiichi; Kinoshita, Toshiro
2010-01-01
Fluorescence in situ hybridization (FISH) is an effective method for the physical mapping of genes and repetitive DNA sequences on chromosomes. Physical mapping of unique nucleotide sequences on specific rice chromosome regions was performed using a combination of chromosome identification and highly sensitive FISH. Increases in the detection sensitivity of smaller DNA sequences and improvements in spatial resolution have ushered in a new phase in FISH technology. Thus, it is now possible to perform in situ hybridization on somatic chromosomes, pachytene chromosomes, and even on extended DNA fibers (EDFs). Pachytene-FISH allows the integration of genetic linkage maps and quantitative chromosome maps. Visualization methods using FISH can reveal the spatial organization of the centromere, heterochromatin/euchromatin, and the terminal structures of rice chromosomes. Furthermore, EDF-FISH and the DNA combing technique can resolve a spatial distance of 1 kb between adjacent DNA sequences, and the detection of even a 300-bp target is now feasible. The copy numbers of various repetitive sequences and the sizes of various DNA molecules were quantitatively measured using the molecular combing technique. This review describes the significance of these advances in molecular cytology in rice and discusses future applications in plant studies using visualization techniques.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaerrman, C.; Holmgren, G.; Forsman, K.
1997-01-15
Amelogenesis imperfecta (Al) is a clinically and genetically heterogeneous group of inherited enamel defects. We recently mapped a locus for autosomal dominant local hypoplastic amelogenesis imperfecta (AIH2) to the long arm of chromosome 4. The disease gene was localized to a 17.6-cM region between the markers D4S392 and D4S395. The albumin gene (ALB), located in the same interval, was a candidate gene for autosomal dominant AI (ADAI) since albumin has a potential role in enamel maturation. Here we describe refined mapping of the AIH2 locus and the construction of marker maps by radiation hybrid mapping and yeast artificial chromosome (YAC)-basedmore » sequence tagged site-content mapping. A radiation hybrid map consisting of 11 microsatellite markers in the 5-cM interval between D4S409 and D4S1558 was constructed. Recombinant haplotypes in six Swedish ADAI families suggest that the disease gene is located in the interval between D4S2421 and ALB. ALB is therefore not likely to be the disease-causing gene. Affected members in all six families share the same allele haplotypes, indicating a common ancestral mutation in all families. The AIH2 critical region is less than 4 cM and spans a physical distance of approximately 4 Mb as judged from radiation hybrid maps. A YAC contig over the AIH2 critical region including several potential candidate genes was constructed. 35 refs., 4 figs., 1 tab.« less
Kulaeva, Olga A; Zhernakov, Aleksandr I; Afonin, Alexey M; Boikov, Sergei S; Sulima, Anton S; Tikhonovich, Igor A; Zhukov, Vladimir A
2017-01-01
Pea (Pisum sativum L.) is the oldest model object of plant genetics and one of the most agriculturally important legumes in the world. Since the pea genome has not been sequenced yet, identification of genes responsible for mutant phenotypes or desirable agricultural traits is usually performed via genetic mapping followed by candidate gene search. Such mapping is best carried out using gene-based molecular markers, as it opens the possibility for exploiting genome synteny between pea and its close relative Medicago truncatula Gaertn., possessing sequenced and annotated genome. In the last 5 years, a large number of pea gene-based molecular markers have been designed and mapped owing to the rapid evolution of "next-generation sequencing" technologies. However, the access to the complete set of markers designed worldwide is limited because the data are not uniformed and therefore hard to use. The Pea Marker Database was designed to combine the information about pea markers in a form of user-friendly and practical online tool. Version 1 (PMD1) comprises information about 2484 genic markers, including their locations in linkage groups, the sequences of corresponding pea transcripts and the names of related genes in M. truncatula. Version 2 (PMD2) is an updated version comprising 15944 pea markers in the same format with several advanced features. To test the performance of the PMD, fine mapping of pea symbiotic genes Sym13 and Sym27 in linkage groups VII and V, respectively, was carried out. The results of mapping allowed us to propose the Sen1 gene (a homologue of SEN1 gene of Lotus japonicus (Regel) K. Larsen) as the best candidate gene for Sym13, and to narrow the list of possible candidate genes for Sym27 to ten, thus proving PMD to be useful for pea gene mapping and cloning. All information contained in PMD1 and PMD2 is available at www.peamarker.arriam.ru.
Liu, P N; Miao, H; Lu, H W; Cui, J Y; Tian, G L; Wehner, T C; Gu, X F; Zhang, S P
2017-08-31
Powdery mildew (PM) of cucumber (Cucumis sativus), caused by Podosphaera xanthii, is a major foliar disease worldwide and resistance is one of the main objectives in cucumber breeding programs. The resistance to PM in cucumber stem is important to the resistance for the whole plant. In this study, genetic analysis and gene mapping were implemented with cucumber inbred lines NCG-122 (with resistance to PM in the stem) and NCG-121 (with susceptibility in the stem). Genetic analysis showed that resistance to PM in the stem of NCG-122 was qualitative and controlled by a single-recessive nuclear gene (pm-s). Susceptibility was dominant to resistance. In the initial genetic mapping of the pm-s gene, 10 SSR markers were discovered to be linked to pm-s, which was mapped to chromosome 5 (Chr.5) of cucumber. The pm-s gene's closest flanking markers were SSR20486 and SSR06184/SSR13237 with genetic distances of 0.9 and 1.8 cM, respectively. One hundred and fifty-seven pairs of new SSR primers were exploited by the sequence information in the initial mapping region of pm-s. The analysis on the F 2 mapping population using the new molecular markers showed that 17 SSR markers were confirmed to be linked to the pm-s gene. The two closest flanking markers, pmSSR27and pmSSR17, were 0.1 and 0.7 cM from pm-s, respectively, confirming the location of this gene on Chr.5. The physical length of the genomic region containing pm-s was 135.7 kb harboring 21 predicted genes. Among these genes, the gene Csa5G623470 annotated as encoding Mlo-related protein was defined as the most probable candidate gene for the pm-s. The results of this study will provide a basis for marker-assisted selection, and make the benefit for the cloning of the resistance gene.
Global Mapping of the Yeast Genetic Interaction Network
NASA Astrophysics Data System (ADS)
Tong, Amy Hin Yan; Lesage, Guillaume; Bader, Gary D.; Ding, Huiming; Xu, Hong; Xin, Xiaofeng; Young, James; Berriz, Gabriel F.; Brost, Renee L.; Chang, Michael; Chen, YiQun; Cheng, Xin; Chua, Gordon; Friesen, Helena; Goldberg, Debra S.; Haynes, Jennifer; Humphries, Christine; He, Grace; Hussein, Shamiza; Ke, Lizhu; Krogan, Nevan; Li, Zhijian; Levinson, Joshua N.; Lu, Hong; Ménard, Patrice; Munyana, Christella; Parsons, Ainslie B.; Ryan, Owen; Tonikian, Raffi; Roberts, Tania; Sdicu, Anne-Marie; Shapiro, Jesse; Sheikh, Bilal; Suter, Bernhard; Wong, Sharyl L.; Zhang, Lan V.; Zhu, Hongwei; Burd, Christopher G.; Munro, Sean; Sander, Chris; Rine, Jasper; Greenblatt, Jack; Peter, Matthias; Bretscher, Anthony; Bell, Graham; Roth, Frederick P.; Brown, Grant W.; Andrews, Brenda; Bussey, Howard; Boone, Charles
2004-02-01
A genetic interaction network containing ~1000 genes and ~4000 interactions was mapped by crossing mutations in 132 different query genes into a set of ~4700 viable gene yeast deletion mutants and scoring the double mutant progeny for fitness defects. Network connectivity was predictive of function because interactions often occurred among functionally related genes, and similar patterns of interactions tended to identify components of the same pathway. The genetic network exhibited dense local neighborhoods; therefore, the position of a gene on a partially mapped network is predictive of other genetic interactions. Because digenic interactions are common in yeast, similar networks may underlie the complex genetics associated with inherited phenotypes in other organisms.
High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).
Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C
2016-03-01
Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. © 2015 John Wiley & Sons Ltd.
Kumar, P Natraj; Sujatha, K; Laha, G S; Rao, K Srinivasa; Mishra, B; Viraktamath, B C; Hari, Y; Reddy, C S; Balachandran, S M; Ram, T; Madhav, M Sheshu; Rani, N Shobha; Neeraja, C N; Reddy, G Ashok; Shaik, H; Sundaram, R M
2012-02-01
Broadening of the genetic base for identification and transfer of genes for resistance to insect pests and diseases from wild relatives of rice is an important strategy in resistance breeding programs across the world. An accession of Oryza nivara, International Rice Germplasm Collection (IRGC) accession number 105710, was identified to exhibit high level and broad-spectrum resistance to Xanthomonas oryzae pv. oryzae. In order to study the genetics of resistance and to tag and map the resistance gene or genes present in IRGC 105710, it was crossed with the bacterial blight (BB)-susceptible varieties 'TN1' and 'Samba Mahsuri' (SM) and then backcrossed to generate backcross mapping populations. Analysis of these populations and their progeny testing revealed that a single dominant gene controls resistance in IRGC 105710. The BC(1)F(2) population derived from the cross IRGC 105710/TN1//TN1 was screened with a set of 72 polymorphic simple-sequence repeat (SSR) markers distributed across the rice genome and the resistance gene was coarse mapped on chromosome 7 between the SSR markers RM5711 and RM6728 at a genetic distance of 17.0 and 19.3 centimorgans (cM), respectively. After analysis involving 49 SSR markers located between the genomic interval spanned by RM5711 and RM6728, and BC(2)F(2) population consisting of 2,011 individuals derived from the cross IRGC 105710/TN1//TN1, the gene was fine mapped between two SSR markers (RMWR7.1 and RMWR7.6) located at a genetic distance of 0.9 and 1.2 cM, respectively, from the gene and flanking it. The linkage distances were validated in a BC(1)F(2) mapping population derived from the cross IRGC 105710/SM//2 × SM. The BB resistance gene present in the O. nivara accession was identified to be novel based on its unique map location on chromosome 7 and wider spectrum of BB resistance; this gene has been named Xa33. The genomic region between the two closely flanking SSR markers was in silico analyzed for putatively expressed candidate genes. In total, eight genes were identified in the region and a putative gene encoding serinethreonine kinase appears to be a candidate for the Xa33 gene.
CTCF-Mediated Human 3D Genome Architecture Reveals Chromatin Topology for Transcription.
Tang, Zhonghui; Luo, Oscar Junhong; Li, Xingwang; Zheng, Meizhen; Zhu, Jacqueline Jufen; Szalaj, Przemyslaw; Trzaskoma, Pawel; Magalska, Adriana; Wlodarczyk, Jakub; Ruszczycki, Blazej; Michalski, Paul; Piecuch, Emaly; Wang, Ping; Wang, Danjuan; Tian, Simon Zhongyuan; Penrad-Mobayed, May; Sachs, Laurent M; Ruan, Xiaoan; Wei, Chia-Lin; Liu, Edison T; Wilczynski, Grzegorz M; Plewczynski, Dariusz; Li, Guoliang; Ruan, Yijun
2015-12-17
Spatial genome organization and its effect on transcription remains a fundamental question. We applied an advanced chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) strategy to comprehensively map higher-order chromosome folding and specific chromatin interactions mediated by CCCTC-binding factor (CTCF) and RNA polymerase II (RNAPII) with haplotype specificity and nucleotide resolution in different human cell lineages. We find that CTCF/cohesin-mediated interaction anchors serve as structural foci for spatial organization of constitutive genes concordant with CTCF-motif orientation, whereas RNAPII interacts within these structures by selectively drawing cell-type-specific genes toward CTCF foci for coordinated transcription. Furthermore, we show that haplotype variants and allelic interactions have differential effects on chromosome configuration, influencing gene expression, and may provide mechanistic insights into functions associated with disease susceptibility. 3D genome simulation suggests a model of chromatin folding around chromosomal axes, where CTCF is involved in defining the interface between condensed and open compartments for structural regulation. Our 3D genome strategy thus provides unique insights in the topological mechanism of human variations and diseases. Copyright © 2015 Elsevier Inc. All rights reserved.
Structural forms of the human amylase locus and their relationships to SNPs, haplotypes, and obesity
Usher, Christina L; Handsaker, Robert E; Esko, Tõnu; Tuke, Marcus A; Weedon, Michael N; Hastie, Alex R; Cao, Han; Moon, Jennifer E; Kashin, Seva; Fuchsberger, Christian; Metspalu, Andres; Pato, Carlos N; Pato, Michele T; McCarthy, Mark I; Boehnke, Michael; Altshuler, David M; Frayling, Timothy M; Hirschhorn, Joel N; McCarroll, Steven A
2016-01-01
Hundreds of genes reside in structurally complex, poorly understood regions of the human genome1-3. One such region contains the three amylase genes (AMY2B, AMY2A, and AMY1) responsible for digesting starch into sugar. The copy number of AMY1 is reported to be the genome’s largest influence on obesity4, though genome-wide association studies for obesity have found this locus unremarkable. Using whole genome sequence analysis3,5, droplet digital PCR6, and genome mapping7, we identified eight common structural haplotypes of the amylase locus that suggest its mutational history. We found that AMY1 copy number in individuals’ genomes is generally even (rather than odd) and partially correlates to nearby SNPs, which do not associate with BMI. We measured amylase gene copy number in 1,000 obese or lean Estonians and in two other cohorts totaling ~3,500 individuals. We had 99% power to detect the lower bound of the reported effects on BMI4, yet found no association. PMID:26098870
Ashbrook, David G; Williams, Robert W; Lu, Lu; Stein, Jason L; Hibar, Derrek P; Nichols, Thomas E; Medland, Sarah E; Thompson, Paul M; Hager, Reinmar
2014-10-03
Variation in hippocampal volume has been linked to significant differences in memory, behavior, and cognition among individuals. To identify genetic variants underlying such differences and associated disease phenotypes, multinational consortia such as ENIGMA have used large magnetic resonance imaging (MRI) data sets in human GWAS studies. In addition, mapping studies in mouse model systems have identified genetic variants for brain structure variation with great power. A key challenge is to understand how genetically based differences in brain structure lead to the propensity to develop specific neurological disorders. We combine the largest human GWAS of brain structure with the largest mammalian model system, the BXD recombinant inbred mouse population, to identify novel genetic targets influencing brain structure variation that are linked to increased risk for neurological disorders. We first use a novel cross-species, comparative analysis using mouse and human genetic data to identify a candidate gene, MGST3, associated with adult hippocampus size in both systems. We then establish the coregulation and function of this gene in a comprehensive systems-analysis. We find that MGST3 is associated with hippocampus size and is linked to a group of neurodegenerative disorders, such as Alzheimer's.
Wyrwa, Katarzyna; Książkiewicz, Michał; Szczepaniak, Anna; Susek, Karolina; Podkowiński, Jan; Naganowska, Barbara
2016-09-01
Narrow-leafed lupin (Lupinus angustifolius L.) has recently been considered a reference genome for the Lupinus genus. In the present work, genetic and cytogenetic maps of L. angustifolius were supplemented with 30 new molecular markers representing lupin genome regions, harboring genes involved in nitrogen fixation during the symbiotic interaction of legumes and soil bacteria (Rhizobiaceae). Our studies resulted in the precise localization of bacterial artificial chromosomes (BACs) carrying sequence variants for early nodulin 40, nodulin 26, nodulin 45, aspartate aminotransferase P2, asparagine synthetase, cytosolic glutamine synthetase, and phosphoenolpyruvate carboxylase. Together with previously mapped chromosomes, the integrated L. angustifolius map encompasses 73 chromosome markers, including 5S ribosomal DNA (rDNA) and 45S rDNA, and anchors 20 L. angustifolius linkage groups to corresponding chromosomes. Chromosomal identification using BAC fluorescence in situ hybridization identified two BAC clones as narrow-leafed lupin centromere-specific markers, which served as templates for preliminary studies of centromere composition within the genus. Bioinformatic analysis of these two BACs revealed that centromeric/pericentromeric regions of narrow-leafed lupin chromosomes consisted of simple sequence repeats ordered into tandem repeats containing the trinucleotide and pentanucleotide simple sequence repeats AGG and GATAC, structured into long arrays. Moreover, cross-genus microsynteny analysis revealed syntenic patterns of 31 single-locus BAC clones among several legume species. The gene and chromosome level findings provide evidence of ancient duplication events that must have occurred very early in the divergence of papilionoid lineages. This work provides a strong foundation for future comparative mapping among legumes and may facilitate understanding of mechanisms involved in shaping legume chromosomes.
Mapping asthma-associated variants in admixed populations
Mersha, Tesfaye B.
2015-01-01
Admixed populations arise when two or more previously isolated populations interbreed. Mapping asthma susceptibility loci in an admixed population using admixture mapping (AM) involves screening the genome of individuals of mixed ancestry for chromosomal regions that have a higher frequency of alleles from a parental population with higher asthma risk as compared with parental population with lower asthma risk. AM takes advantage of the admixture created in populations of mixed ancestry to identify genomic regions where an association exists between genetic ancestry and asthma (in contrast to between the genotype of the marker and asthma). The theory behind AM is that chromosomal segments of affected individuals contain a significantly higher-than-average proportion of alleles from the high-risk parental population and thus are more likely to harbor disease–associated loci. Criteria to evaluate the applicability of AM as a gene mapping approach include: (1) the prevalence of the disease differences in ancestral populations from which the admixed population was formed; (2) a measurable difference in disease-causing alleles between the parental populations; (3) reduced linkage disequilibrium (LD) between unlinked loci across chromosomes and strong LD between neighboring loci; (4) a set of markers with noticeable allele-frequency differences between parental populations that contributes to the admixed population (single nucleotide polymorphisms (SNPs) are the markers of choice because they are abundant, stable, relatively cheap to genotype, and informative with regard to the LD structure of chromosomal segments); and (5) there is an understanding of the extent of segmental chromosomal admixtures and their interactions with environmental factors. Although genome-wide association studies have contributed greatly to our understanding of the genetic components of asthma, the large and increasing degree of admixture in populations across the world create many challenges for further efforts to map disease-causing genes. This review, summarizes the historical context of admixed populations and AM, and considers current opportunities to use AM to map asthma genes. In addition, we provide an overview of the potential limitations and future directions of AM in biomedical research, including joint admixture and association mapping for asthma and asthma-related disorders. PMID:26483834
Taketa, Shin; Mascher, Martin; Yuo, Takahisa; Beier, Sebastian; Taudien, Stefan; Morgante, Michele
2016-01-01
Inflorescence architecture in small-grain cereals has a direct effect on yield and is an important selection target in breeding for yield improvement. We analyzed the recessive mutation laxatum-a (lax-a) in barley (Hordeum vulgare), which causes pleiotropic changes in spike development, resulting in (1) extended rachis internodes conferring a more relaxed inflorescence, (2) broadened base of the lemma awns, (3) thinner grains that are largely exposed due to reduced marginal growth of the palea and lemma, and (4) and homeotic conversion of lodicules into two stamenoid structures. Map-based cloning enforced by mapping-by-sequencing of the mutant lax-a locus enabled the identification of a homolog of BLADE-ON-PETIOLE1 (BOP1) and BOP2 as the causal gene. Interestingly, the recently identified barley uniculme4 gene also is a BOP1/2 homolog and has been shown to regulate tillering and leaf sheath development. While the Arabidopsis (Arabidopsis thaliana) BOP1 and BOP2 genes act redundantly, the barley genes contribute independent effects in specifying the developmental growth of vegetative and reproductive organs, respectively. Analysis of natural genetic diversity revealed strikingly different haplotype diversity for the two paralogous barley genes, likely affected by the respective genomic environments, since no indication for an active selection process was detected. PMID:27208226
Boucher, Benjamin; Lee, Anna Y.; Hallett, Michael; Jenna, Sarah
2016-01-01
A genetic interaction (GI) is defined when the mutation of one gene modifies the phenotypic expression associated with the mutation of a second gene. Genome-wide efforts to map GIs in yeast revealed structural and functional properties of a GI network. This provided insights into the mechanisms underlying the robustness of yeast to genetic and environmental insults, and also into the link existing between genotype and phenotype. While a significant conservation of GIs and GI network structure has been reported between distant yeast species, such a conservation is not clear between unicellular and multicellular organisms. Structural and functional characterization of a GI network in these latter organisms is consequently of high interest. In this study, we present an in-depth characterization of ~1.5K GIs in the nematode Caenorhabditis elegans. We identify and characterize six distinct classes of GIs by examining a wide-range of structural and functional properties of genes and network, including co-expression, phenotypical manifestations, relationship with protein-protein interaction dense subnetworks (PDS) and pathways, molecular and biological functions, gene essentiality and pleiotropy. Our study shows that GI classes link genes within pathways and display distinctive properties, specifically towards PDS. It suggests a model in which pathways are composed of PDS-centric and PDS-independent GIs coordinating molecular machines through two specific classes of GIs involving pleiotropic and non-pleiotropic connectors. Our study provides the first in-depth characterization of a GI network within pathways of a multicellular organism. It also suggests a model to understand better how GIs control system robustness and evolution. PMID:26871911
2013-03-14
SUPPLEMENTARY NOTES 14. ABSTRACT Autism is an extremely common and heterogeneous neurodevelopmental disorder. While genetic factors are known to play...AFRL-SA-WP-TR-2013-0013 Comprehensive Clinical Phenotyping and Genetic Mapping for the Discovery of Autism Susceptibility Genes...Genetic Mapping for the Discovery of Autism Susceptibility Genes 5a. CONTRACT NUMBER N/A 5b. GRANT NUMBER N/A 5c. PROGRAM ELEMENT NUMBER N/A 6
Quan, X; Laes, J F; Ravoet, M; Van Vooren, P; Szpirer, J; Szpirer, C
2000-01-01
The centromeric region of rat chromosome 2 (2q1) harbors unidentified quantitative trait loci of genes that control tumor growth or development. To improve the mapping of this chromosome region, we microdissected it and generated 10 new microsatellite markers, which we included in the linkage map and/or radiation hybrid map of 2q1, together with other known markers, including four genes: Pcsk1 (protein convertase 1), Dhfr (dihydrofolate reductase), Ndub13 (NADH ubiquinone oxidoreductase subunit b13), and Ccnb1 (cyclin B1). To generate anchor points between the different maps, the gene Ndub13 and the microsatellite markers D2Ulb25 and D2Mit1 were also localized cytogenetically. The radiation map generated in region 2q1 extends its centromeric end of about 150 cR. Copyright 2000 S. Karger AG, Basel
Xiaoqing Yu; Guihua Bai; Shuwei Liu; Na Luo; Ying Wang; Douglas S. Richmond; Paula M. Pijut; Scott A. Jackson; Jianming Yu; Yiwei Jiang
2013-01-01
Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought tolerance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted in 192 diverse...
2013-01-01
Background Cucumber is an important vegetable crop that is susceptible to many pathogens, but no disease resistance (R) genes have been cloned. The availability of whole genome sequences provides an excellent opportunity for systematic identification and characterization of the nucleotide binding and leucine-rich repeat (NB-LRR) type R gene homolog (RGH) sequences in the genome. Cucumber has a very narrow genetic base making it difficult to construct high-density genetic maps. Development of a consensus map by synthesizing information from multiple segregating populations is a method of choice to increase marker density. As such, the objectives of the present study were to identify and characterize NB-LRR type RGHs, and to develop a high-density, integrated cucumber genetic-physical map anchored with RGH loci. Results From the Gy14 draft genome, 70 NB-containing RGHs were identified and characterized. Most RGHs were in clusters with uneven distribution across seven chromosomes. In silico analysis indicated that all 70 RGHs had EST support for gene expression. Phylogenetic analysis classified 58 RGHs into two clades: CNL and TNL. Comparative analysis revealed high-degree sequence homology and synteny in chromosomal locations of these RGH members between the cucumber and melon genomes. Fifty-four molecular markers were developed to delimit 67 of the 70 RGHs, which were integrated into a genetic map through linkage analysis. A 1,681-locus cucumber consensus map including 10 gene loci and spanning 730.0 cM in seven linkage groups was developed by integrating three component maps with a bin-mapping strategy. Physically, 308 scaffolds with 193.2 Mbp total DNA sequences were anchored onto this consensus map that covered 52.6% of the 367 Mbp cucumber genome. Conclusions Cucumber contains relatively few NB-LRR RGHs that are clustered and unevenly distributed in the genome. All RGHs seem to be transcribed and shared significant sequence homology and synteny with the melon genome suggesting conservation of these RGHs in the Cucumis lineage. The 1,681-locus consensus genetic-physical map developed and the RGHs identified and characterized herein are valuable genomics resources that may have many applications such as quantitative trait loci identification, map-based gene cloning, association mapping, marker-assisted selection, as well as assembly of a more complete cucumber genome. PMID:23531125
NASA Astrophysics Data System (ADS)
Li, Qi; Qi, Mingjun; Nie, Hongtao; Kong, Lingfeng; Yu, Hong
2016-06-01
Gene-centromere mapping is an essential prerequisite for understanding the composition and structure of genomes. Half-tetrad analysis is a powerful tool for mapping genes and understanding chromosomal behavior during meiosis. The Japanese scallop ( Patinopecten yessoensis), a cold-tolerant species inhabiting the northwestern Pacific coast, is a commercially important marine bivalve in Asian countries. In this study, inheritance of 32 informative microsatellite loci was examined in 70-h D-shaped larvae of three induced meiogynogenetic diploid families of P. yessoensis for centromere mapping using half-tetrad analysis. The ratio of gynogenetic diploids was proven to be 100%, 100% and 96% in the three families, respectively. Inheritance analysis in the control crosses showed that 51 of the 53 genotypic ratios observed were in accordance with Mendelian expectations at the 5% level after Bonferroni correction. Seven of the 32 microsatellite loci showed the existence of null alleles in control crosses. The second division segregation frequency ( y) of the microsatellite loci ranged from 0.07 to 0.85 with a mean of 0.38, suggesting the existence of positive interference after a single chiasma formation in some chromosomes in the scallop. Microsatellite-centromere distances ranged from 4 cM to 42 cM under the assumption of complete interference. Information on the positions of centromeres in relation to the microsatellite loci will represent a contribution towards the assembly of genetic maps in the commercially important scallop species.
Sharma, Akanksha; Sharma, Niharika; Bhalla, Prem; Singh, Mohan
2017-01-01
Comparative genomics have facilitated the mining of biological information from a genome sequence, through the detection of similarities and differences with genomes of closely or more distantly related species. By using such comparative approaches, knowledge can be transferred from the model to non-model organisms and insights can be gained in the structural and evolutionary patterns of specific genes. In the absence of sequenced genomes for allergenic grasses, this study was aimed at understanding the structure, organisation and expression profiles of grass pollen allergens using the genomic data from Brachypodium distachyon as it is phylogenetically related to the allergenic grasses. Combining genomic data with the anther RNA-Seq dataset revealed 24 pollen allergen genes belonging to eight allergen groups mapping on the five chromosomes in B. distachyon. High levels of anther-specific expression profiles were observed for the 24 identified putative allergen-encoding genes in Brachypodium. The genomic evidence suggests that gene encoding the group 5 allergen, the most potent trigger of hay fever and allergic asthma originated as a pollen specific orphan gene in a common grass ancestor of Brachypodium and Triticiae clades. Gene structure analysis showed that the putative allergen-encoding genes in Brachypodium either lack or contain reduced number of introns. Promoter analysis of the identified Brachypodium genes revealed the presence of specific cis-regulatory sequences likely responsible for high anther/pollen-specific expression. With the identification of putative allergen-encoding genes in Brachypodium, this study has also described some important plant gene families (e.g. expansin superfamily, EF-Hand family, profilins etc) for the first time in the model plant Brachypodium. Altogether, the present study provides new insights into structural characterization and evolution of pollen allergens and will further serve as a base for their functional characterization in related grass species. PMID:28103252
Discovery and characterization of two new stem rust resistance genes in Aegilops sharonensis.
Yu, Guotai; Champouret, Nicolas; Steuernagel, Burkhard; Olivera, Pablo D; Simmons, Jamie; Williams, Cole; Johnson, Ryan; Moscou, Matthew J; Hernández-Pinzón, Inmaculada; Green, Phon; Sela, Hanan; Millet, Eitan; Jones, Jonathan D G; Ward, Eric R; Steffenson, Brian J; Wulff, Brande B H
2017-06-01
We identified two novel wheat stem rust resistance genes, Sr-1644-1Sh and Sr-1644-5Sh in Aegilops sharonensis that are effective against widely virulent African races of the wheat stem rust pathogen. Stem rust is one of the most important diseases of wheat in the world. When single stem rust resistance (Sr) genes are deployed in wheat, they are often rapidly overcome by the pathogen. To this end, we initiated a search for novel sources of resistance in diverse wheat relatives and identified the wild goatgrass species Aegilops sharonesis (Sharon goatgrass) as a rich reservoir of resistance to wheat stem rust. The objectives of this study were to discover and map novel Sr genes in Ae. sharonensis and to explore the possibility of identifying new Sr genes by genome-wide association study (GWAS). We developed two biparental populations between resistant and susceptible accessions of Ae. sharonensis and performed QTL and linkage analysis. In an F 6 recombinant inbred line and an F 2 population, two genes were identified that mapped to the short arm of chromosome 1S sh , designated as Sr-1644-1Sh, and the long arm of chromosome 5S sh , designated as Sr-1644-5Sh. The gene Sr-1644-1Sh confers a high level of resistance to race TTKSK (a member of the Ug99 race group), while the gene Sr-1644-5Sh conditions strong resistance to TRTTF, another widely virulent race found in Yemen. Additionally, GWAS was conducted on 125 diverse Ae. sharonensis accessions for stem rust resistance. The gene Sr-1644-1Sh was detected by GWAS, while Sr-1644-5Sh was not detected, indicating that the effectiveness of GWAS might be affected by marker density, population structure, low allele frequency and other factors.
Khowaja, Farkhanda S; Norton, Gareth J; Courtois, Brigitte; Price, Adam H
2009-01-01
Background Meta-analysis of QTLs combines the results of several QTL detection studies and provides narrow confidence intervals for meta-QTLs, permitting easier positional candidate gene identification. It is usually applied to multiple mapping populations, but can be applied to one. Here, a meta-analysis of drought related QTLs in the Bala × Azucena mapping population compiles data from 13 experiments and 25 independent screens providing 1,650 individual QTLs separated into 5 trait categories; drought avoidance, plant height, plant biomass, leaf morphology and root traits. A heat map of the overlapping 1 LOD confidence intervals provides an overview of the distribution of QTLs. The programme BioMercator is then used to conduct a formal meta-analysis at example QTL clusters to illustrate the value of meta-analysis of QTLs in this population. Results The heat map graphically illustrates the genetic complexity of drought related traits in rice. QTLs can be linked to their physical position on the rice genome using Additional file 1 provided. Formal meta-analysis on chromosome 1, where clusters of QTLs for all trait categories appear close, established that the sd1 semi-dwarfing gene coincided with a plant height meta-QTL, that the drought avoidance meta-QTL was not likely to be associated with this gene, and that this meta-QTL was not pleiotropic with close meta-QTLs for leaf morphology and root traits. On chromosome 5, evidence suggests that a drought avoidance meta-QTL was pleiotropic with leaf morphology and plant biomass meta-QTLs, but not with meta-QTLs for root traits and plant height 10 cM lower down. A region of dense root QTL activity graphically visible on chromosome 9 was dissected into three meta-QTLs within a space of 35 cM. The confidence intervals for meta-QTLs obtained ranged from 5.1 to 14.5 cM with an average of 9.4 cM, which is approximately 180 genes in rice. Conclusion The meta-analysis is valuable in providing improved ability to dissect the complex genetic structure of traits, and distinguish between pleiotropy and close linkage. It also provides relatively small target regions for the identification of positional candidate genes. PMID:19545420
Heterogeneous Stock Rat: A Unique Animal Model for Mapping Genes Influencing Bone Fragility
Alam, Imranul; Koller, Daniel L.; Sun, Qiwei; Roeder, Ryan K.; Cañete, Toni; Blázquez, Gloria; López-Aumatell, Regina; Martínez-Membrives, Esther; Vicens-Costa, Elia; Mont, Carme; Díaz, Sira; Tobeña, Adolf; Fernández-Teruel, Alberto; Whitley, Adam; Strid, Pernilla; Diez, Margarita; Johannesson, Martina; Flint, Jonathan; Econs, Michael J.; Turner, Charles H.; Foroud, Tatiana
2011-01-01
Previously, we demonstrated that skeletal mass, structure and biomechanical properties vary considerably among 11 different inbred rat strains. Subsequently, we performed quantitative trait loci (QTL) analysis in 4 inbred rat strains (F344, LEW, COP and DA) for different bone phenotypes and identified several candidate genes influencing various bone traits. The standard approach to narrowing QTL intervals down to a few candidate genes typically employs the generation of congenic lines, which is time consuming and often not successful. A potential alternative approach is to use a highly genetically informative animal model resource capable of delivering very high-resolution gene mapping such as Heterogeneous stock (HS) rat. HS rat was derived from eight inbred progenitors: ACI/N, BN/SsN, BUF/N, F344/N, M520/N, MR/N, WKY/N and WN/N. The genetic recombination pattern generated across 50 generations in these rats has been shown to deliver ultra-high even gene-level resolution for complex genetic studies. The purpose of this study is to investigate the usefulness of the HS rat model for fine mapping and identification of genes underlying bone fragility phenotypes. We compared bone geometry, density and strength phenotypes at multiple skeletal sites in HS rats with those obtained from 5 of the 8 progenitor inbred strains. In addition, we estimated the heritability for different bone phenotypes in these rats and employed principal component analysis to explore relationships among bone phenotypes in the HS rats. Our study demonstrates that significant variability exists for different skeletal phenotypes in HS rats compared with their inbred progenitors. In addition, we estimated high heritability for several bone phenotypes and biologically interpretable factors explaining significant overall variability, suggesting that the HS rat model could be a unique genetic resource for rapid and efficient discovery of the genetic determinants of bone fragility. PMID:21334473
Heterogeneous stock rat: a unique animal model for mapping genes influencing bone fragility.
Alam, Imranul; Koller, Daniel L; Sun, Qiwei; Roeder, Ryan K; Cañete, Toni; Blázquez, Gloria; López-Aumatell, Regina; Martínez-Membrives, Esther; Vicens-Costa, Elia; Mont, Carme; Díaz, Sira; Tobeña, Adolf; Fernández-Teruel, Alberto; Whitley, Adam; Strid, Pernilla; Diez, Margarita; Johannesson, Martina; Flint, Jonathan; Econs, Michael J; Turner, Charles H; Foroud, Tatiana
2011-05-01
Previously, we demonstrated that skeletal mass, structure and biomechanical properties vary considerably among 11 different inbred rat strains. Subsequently, we performed quantitative trait loci (QTL) analysis in four inbred rat strains (F344, LEW, COP and DA) for different bone phenotypes and identified several candidate genes influencing various bone traits. The standard approach to narrowing QTL intervals down to a few candidate genes typically employs the generation of congenic lines, which is time consuming and often not successful. A potential alternative approach is to use a highly genetically informative animal model resource capable of delivering very high resolution gene mapping such as Heterogeneous stock (HS) rat. HS rat was derived from eight inbred progenitors: ACI/N, BN/SsN, BUF/N, F344/N, M520/N, MR/N, WKY/N and WN/N. The genetic recombination pattern generated across 50 generations in these rats has been shown to deliver ultra-high even gene-level resolution for complex genetic studies. The purpose of this study is to investigate the usefulness of the HS rat model for fine mapping and identification of genes underlying bone fragility phenotypes. We compared bone geometry, density and strength phenotypes at multiple skeletal sites in HS rats with those obtained from five of the eight progenitor inbred strains. In addition, we estimated the heritability for different bone phenotypes in these rats and employed principal component analysis to explore relationships among bone phenotypes in the HS rats. Our study demonstrates that significant variability exists for different skeletal phenotypes in HS rats compared with their inbred progenitors. In addition, we estimated high heritability for several bone phenotypes and biologically interpretable factors explaining significant overall variability, suggesting that the HS rat model could be a unique genetic resource for rapid and efficient discovery of the genetic determinants of bone fragility. Copyright © 2010 Elsevier Inc. All rights reserved.
Lemieux, Jacob E; Kyes, Sue A; Otto, Thomas D; Feller, Avi I; Eastman, Richard T; Pinches, Robert A; Berriman, Matthew; Su, Xin-zhuan; Newbold, Chris I
2013-01-01
Spatial relationships within the eukaryotic nucleus are essential for proper nuclear function. In Plasmodium falciparum, the repositioning of chromosomes has been implicated in the regulation of the expression of genes responsible for antigenic variation, and the formation of a single, peri-nuclear nucleolus results in the clustering of rDNA. Nevertheless, the precise spatial relationships between chromosomes remain poorly understood, because, until recently, techniques with sufficient resolution have been lacking. Here we have used chromosome conformation capture and second-generation sequencing to study changes in chromosome folding and spatial positioning that occur during switches in var gene expression. We have generated maps of chromosomal spatial affinities within the P. falciparum nucleus at 25 Kb resolution, revealing a structured nucleolus, an absence of chromosome territories, and confirming previously identified clustering of heterochromatin foci. We show that switches in var gene expression do not appear to involve interaction with a distant enhancer, but do result in local changes at the active locus. These maps reveal the folding properties of malaria chromosomes, validate known physical associations, and characterize the global landscape of spatial interactions. Collectively, our data provide critical information for a better understanding of gene expression regulation and antigenic variation in malaria parasites. PMID:23980881
Genetic map of artichoke × wild cardoon: toward a consensus map for Cynara cardunculus.
Sonnante, Gabriella; Gatto, Angela; Morgese, Anita; Montemurro, Francesco; Sarli, Giulio; Blanco, Emanuela; Pignone, Domenico
2011-11-01
An integrated consensus linkage map is proposed for globe artichoke. Maternal and paternal genetic maps were constructed on the basis of an F(1) progeny derived from crossing an artichoke genotype (Mola) with its progenitor, the wild cardoon (Tolfa), using EST-derived SSRs, genomic SSRs, AFLPs, ten genes, and two morphological traits. For most genes, mainly belonging to the chlorogenic acid pathway, new markers were developed. Five of these were SNP markers analyzed through high-resolution melt technology. From the maternal (Mola) and paternal (Tolfa) maps, an integrated map was obtained, containing 337 molecular and one morphological markers ordered in 17 linkage groups (LGs), linked between Mola and Tolfa. The integrated map covers 1,488.8 cM, with an average distance of 4.4 cM between markers. The map was aligned with already existing maps for artichoke, and 12 LGs were linked via 31 bridge markers. LG numbering has been proposed. A total of 124 EST-SSRs and two genes were mapped here for the first time, providing a framework for the construction of a functional map in artichoke. The establishment of a consensus map represents a necessary condition to plan a complete sequencing of the globe artichoke genome.
Phage phenomics: Physiological approaches to characterize novel viral proteins
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sanchez, Savannah E.; Cuevas, Daniel A.; Rostron, Jason E.
Current investigations into phage-host interactions are dependent on extrapolating knowledge from (meta)genomes. Interestingly, 60 - 95% of all phage sequences share no homology to current annotated proteins. As a result, a large proportion of phage genes are annotated as hypothetical. This reality heavily affects the annotation of both structural and auxiliary metabolic genes. Here we present phenomic methods designed to capture the physiological response(s) of a selected host during expression of one of these unknown phage genes. Multi-phenotype Assay Plates (MAPs) are used to monitor the diversity of host substrate utilization and subsequent biomass formation, while metabolomics provides bi-product analysismore » by monitoring metabolite abundance and diversity. Both tools are used simultaneously to provide a phenotypic profile associated with expression of a single putative phage open reading frame (ORF). Thus, representative results for both methods are compared, highlighting the phenotypic profile differences of a host carrying either putative structural or metabolic phage genes. In addition, the visualization techniques and high throughput computational pipelines that facilitated experimental analysis are presented.« less
Phage phenomics: Physiological approaches to characterize novel viral proteins
Sanchez, Savannah E.; Cuevas, Daniel A.; Rostron, Jason E.; ...
2015-06-11
Current investigations into phage-host interactions are dependent on extrapolating knowledge from (meta)genomes. Interestingly, 60 - 95% of all phage sequences share no homology to current annotated proteins. As a result, a large proportion of phage genes are annotated as hypothetical. This reality heavily affects the annotation of both structural and auxiliary metabolic genes. Here we present phenomic methods designed to capture the physiological response(s) of a selected host during expression of one of these unknown phage genes. Multi-phenotype Assay Plates (MAPs) are used to monitor the diversity of host substrate utilization and subsequent biomass formation, while metabolomics provides bi-product analysismore » by monitoring metabolite abundance and diversity. Both tools are used simultaneously to provide a phenotypic profile associated with expression of a single putative phage open reading frame (ORF). Thus, representative results for both methods are compared, highlighting the phenotypic profile differences of a host carrying either putative structural or metabolic phage genes. In addition, the visualization techniques and high throughput computational pipelines that facilitated experimental analysis are presented.« less
Zhang, Yunxia; Cheng, Chunyan; Li, Ji; Yang, Shuqiong; Wang, Yunzhu; Li, Ziang; Chen, Jinfeng; Lou, Qunfeng
2015-09-25
Differentiation and copy number of repetitive sequences affect directly chromosome structure which contributes to reproductive isolation and speciation. Comparative cytogenetic mapping has been verified an efficient tool to elucidate the differentiation and distribution of repetitive sequences in genome. In present study, the distinct chromosomal structures of five Cucumis species were revealed through genomic in situ hybridization (GISH) technique and comparative cytogenetic mapping of major satellite repeats. Chromosome structures of five Cucumis species were investigated using GISH and comparative mapping of specific satellites. Southern hybridization was employed to study the proliferation of satellites, whose structural characteristics were helpful for analyzing chromosome evolution. Preferential distribution of repetitive DNAs at the subtelomeric regions was found in C. sativus, C hystrix and C. metuliferus, while majority was positioned at the pericentromeric heterochromatin regions in C. melo and C. anguria. Further, comparative GISH (cGISH) through using genomic DNA of other species as probes revealed high homology of repeats between C. sativus and C. hystrix. Specific satellites including 45S rDNA, Type I/II, Type III, Type IV, CentM and telomeric repeat were then comparatively mapped in these species. Type I/II and Type IV produced bright signals at the subtelomeric regions of C. sativus and C. hystrix simultaneously, which might explain the significance of their amplification in the divergence of Cucumis subgenus from the ancient ancestor. Unique positioning of Type III and CentM only at the centromeric domains of C. sativus and C. melo, respectively, combining with unique southern bands, revealed rapid evolutionary patterns of centromeric DNA in Cucumis. Obvious interstitial telomeric repeats were observed in chromosomes 1 and 2 of C. sativus, which might provide evidence of the fusion hypothesis of chromosome evolution from x = 12 to x = 7 in Cucumis species. Besides, the significant correlation was found between gene density along chromosome and GISH band intensity in C. sativus and C. melo. In summary, comparative cytogenetic mapping of major satellites and GISH revealed the distinct differentiation of chromosome structure during species formation. The evolution of repetitive sequences was the main force for the divergence of Cucumis species from common ancestor.
Studies with infections fragments of phage DNA. Final report, January 1, 1970--June 30, 1976
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schachtele, C. F.
The minute, virulent and structurally intricate Bacillus subtilis bacteriophage phi 29 was utilized to study in vivo viral development. Purified strands of phi 29 DNA were used to analyze transcription of the viral genome. Early mRNA hybridizes to the light DNA strand which controls DNA replication and other early functions. Late mRNA hybridizes to the heavy DNA strand which codes for phage structural proteins. The temporal sequence of specific viral protein synthesis was analyzed by gel electrophoresis and was shown to directly correlate with the RNA transcription pattern. The genes carried by phi 29 have been marked with ts andmore » sus mutations and mapped by appropriate crosses yielding a linear map of 17 cistrons. Fragments of the phi 29 DNA were shown to retain their biological activity and marker rescue studies indicated that gene transfer could be performed with pieces having a molecular weight of less than 1 million daltons. Mutant infection under nonpermissive conditions and the analysis of precursor structures has allowed the formation of a tentative morphogenetic pathway leading to the formation of infectious particles. Work with phi 29 has established this virus as an advantageous model system for studying a variety of problems in molecular biology and approximately a dozen laboratories in the country and abroad are working with this phage.« less
Differential deposition of H2A.Z in rice seedling tissue during the day-night cycle.
Zhang, Kang; Xu, Wenying; Wang, Chunchao; Yi, Xin; Su, Zhen
2017-03-04
Chromatin structure has an important role in modulating gene expression. The incorporation of histone variants into the nucleosome leads to important changes in the chromatin structure. The histone variant H2A.Z is highly conserved between different species of fungi, animals, and plants. However, dynamic changes to H2A.Z in rice have not been reported during the day-night cycle. In this study, we generated genome wide maps of H2A.Z for day and night time in harvested seedling tissues by combining chromatin immunoprecipitation and high-throughput sequencing. The analysis results for the H2A.Z data sets detected 7099 genes with higher depositions of H2A.Z in seedling tissues harvested at night compared with seedling tissues harvested during the day, whereas 4597 genes had higher H2A.Z depositions in seedlings harvested during the day. The gene expression profiles data suggested that H2A.Z probably negatively regulated gene expression during the day-night cycle and was involved in many important biologic processes. In general, our results indicated that H2A.Z may play an important role in plant responses to the diurnal oscillation process.
Draye, Xavier; Lin, Yann-Rong; Qian, Xiao-yin; Bowers, John E.; Burow, Gloria B.; Morrell, Peter L.; Peterson, Daniel G.; Presting, Gernot G.; Ren, Shu-xin; Wing, Rod A.; Paterson, Andrew H.
2001-01-01
The small genome of sorghum (Sorghum bicolor L. Moench.) provides an important template for study of closely related large-genome crops such as maize (Zea mays) and sugarcane (Saccharum spp.), and is a logical complement to distantly related rice (Oryza sativa) as a “grass genome model.” Using a high-density RFLP map as a framework, a robust physical map of sorghum is being assembled by integrating hybridization and fingerprint data with comparative data from related taxa such as rice and using new methods to resolve genomic duplications into locus-specific groups. By taking advantage of allelic variation revealed by heterologous probes, the positions of corresponding loci on the wheat (Triticum aestivum), rice, maize, sugarcane, and Arabidopsis genomes are being interpolated on the sorghum physical map. Bacterial artificial chromosomes for the small genome of rice are shown to close several gaps in the sorghum contigs; the emerging rice physical map and assembled sequence will further accelerate progress. An important motivation for developing genomic tools is to relate molecular level variation to phenotypic diversity. “Diversity maps,” which depict the levels and patterns of variation in different gene pools, shed light on relationships of allelic diversity with chromosome organization, and suggest possible locations of genomic regions that are under selection due to major gene effects (some of which may be revealed by quantitative trait locus mapping). Both physical maps and diversity maps suggest interesting features that may be integrally related to the chromosomal context of DNA—progress in cytology promises to provide a means to elucidate such relationships. We seek to provide a detailed picture of the structure, function, and evolution of the genome of sorghum and its relatives, together with molecular tools such as locus-specific sequence-tagged site DNA markers and bacterial artificial chromosome contigs that will have enduring value for many aspects of genome analysis. PMID:11244113
Signal recognition particle RNA in dinoflagellates and the Perkinsid Perkinsus marinus.
Zhang, Huan; Campbell, David A; Sturm, Nancy R; Rosenblad, Magnus A; Dungan, Christopher F; Lin, Senjie
2013-09-01
In dinoflagellates and perkinsids, the molecular structure of the protein translocating machinery is unclear. Here, we identified several types of full-length signal recognition particle (SRP) RNA genes from Karenia brevis (dinoflagellate) and Perkinsus marinus (perkinsid). We also identified the four SRP S-domain proteins, but not the two Alu domain proteins, from P. marinus and several dinoflagellates. We mapped both ends of SRP RNA transcripts from K. brevis and P. marinus, and obtained the 3' end from four other dinoflagellates. The lengths of SRP RNA are predicted to be ∼260-300 nt in dinoflagellates and 280-285 nt in P. marinus. Although these SRP RNA sequences are substantially variable, the predicted structures are similar. The genomic organization of the SRP RNA gene differs among species. In K. brevis, this gene is located downstream of the spliced leader (SL) RNA, either as SL RNA-SRP RNA-tRNA gene tandem repeats, or within a SL RNA-SRP RNA-tRNA-U6-5S rRNA gene cluster. In other dinoflagellates, SRP RNA does not cluster with SL RNA or 5S rRNA genes. The majority of P. marinus SRP RNA genes array as tandem repeats without the above-mentioned small RNA genes. Our results capture a snapshot of a potentially complex evolutionary history of SRP RNA in alveolates. Copyright © 2013 Elsevier GmbH. All rights reserved.
Chromosome I duplications in Caenorhabditis elegans
DOE Office of Scientific and Technical Information (OSTI.GOV)
McKim, K.S.; Rose, A.M.
1990-01-01
We have isolated and characterized 76 duplications of chromosome I in the genome of Caenorhabditis elegans. The region studied is the 20 map unit left half of the chromosome. Sixty-two duplications were induced with gamma radiation and 14 arose spontaneously. The latter class was apparently the result of spontaneous breaks within the parental duplication. The majority of duplications behave as if they are free. Three duplications are attached to identifiable sequences from other chromosomes. The duplication breakpoints have been mapped by complementation analysis relative to genes on chromosome I. Nineteen duplication breakpoints and seven deficiency breakpoints divide the left halfmore » of the chromosome into 24 regions. We have studied the relationship between duplication size and segregational stability. While size is an important determinant of mitotic stability, it is not the only one. We observed clear exceptions to a size-stability correlation. In addition to size, duplication stability may be influenced by specific sequences or chromosome structure. The majority of the duplications were stable enough to be powerful tools for gene mapping. Therefore the duplications described here will be useful in the genetic characterization of chromosome I and the techniques we have developed can be adapted to other regions of the genome.« less
Transcriptional atlas of cardiogenesis maps congenital heart disease interactome.
Li, Xing; Martinez-Fernandez, Almudena; Hartjes, Katherine A; Kocher, Jean-Pierre A; Olson, Timothy M; Terzic, Andre; Nelson, Timothy J
2014-07-01
Mammalian heart development is built on highly conserved molecular mechanisms with polygenetic perturbations resulting in a spectrum of congenital heart diseases (CHD). However, knowledge of cardiogenic ontogeny that regulates proper cardiogenesis remains largely based on candidate-gene approaches. Mapping the dynamic transcriptional landscape of cardiogenesis from a genomic perspective is essential to integrate the knowledge of heart development into translational applications that accelerate disease discovery efforts toward mechanistic-based treatment strategies. Herein, we designed a time-course transcriptome analysis to investigate the genome-wide dynamic expression landscape of innate murine cardiogenesis ranging from embryonic stem cells to adult cardiac structures. This comprehensive analysis generated temporal and spatial expression profiles, revealed stage-specific gene functions, and mapped the dynamic transcriptome of cardiogenesis to curated pathways. Reconciling known genetic underpinnings of CHD, we deconstructed a disease-centric dynamic interactome encoded within this cardiogenic atlas to identify stage-specific developmental disturbances clustered on regulation of epithelial-to-mesenchymal transition (EMT), BMP signaling, NF-AT signaling, TGFb-dependent EMT, and Notch signaling. Collectively, this cardiogenic transcriptional landscape defines the time-dependent expression of cardiac ontogeny and prioritizes regulatory networks at the interface between health and disease. Copyright © 2014 the American Physiological Society.
Zeng, Shaohua; Xiao, Gong; Wang, Gan; Wang, Ying; Peng, Ming; Huang, Hongwen
2015-01-01
Red-fleshed kiwifruit (Actinidia chinensis Planch. ‘Hongyang’) is a promising commercial cultivar due to its nutritious value and unique flesh color, derived from vitamin C and anthocyanins. In this study, we obtained transcriptome data of ‘Hongyang’ from seven developmental stages using Illumina sequencing. We mapped 39–54 million reads to the recently sequenced kiwifruit genome and other databases to define gene structure, to analyze alternative splicing, and to quantify gene transcript abundance at different developmental stages. The transcript profiles throughout red kiwifruit development were constructed and analyzed, with a focus on the biosynthesis and metabolism of compounds such as phytohormones, sugars, starch and L-ascorbic acid, which are indispensable for the development and formation of quality fruit. Candidate genes for these pathways were identified through MapMan and phylogenetic analysis. The transcript levels of genes involved in sucrose and starch metabolism were consistent with the change in soluble sugar and starch content throughout kiwifruit development. The metabolism of L-ascorbic acid was very active, primarily through the L-galactose pathway. The genes responsible for the accumulation of anthocyanin in red kiwifruit were identified, and their expression levels were investigated during kiwifruit development. This survey of gene expression during kiwifruit development paves the way for further investigation of the development of this uniquely colored and nutritious fruit and reveals which factors are needed for high quality fruit formation. This transcriptome data and its analysis will be useful for improving kiwifruit genome annotation, for basic fruit molecular biology research, and for kiwifruit breeding and improvement. PMID:26301713
2000-04-01
Genes, LOH Mapping, Chromosome 17, Physical Mapping, Genetic Mapping, CDNA Screening, Humans, Anatomical 81 Samples, Mutation Detection, Breast Cancer...According to the established model for LOH involving tumor suppressor genes, the allele remaining in the tumor sample would harbor the deleterious mutation ...sequencing on an AB1373A sequencer (Applied Biosystems, Foster City, CA). As none of the samples we have sequenced have revealed any mutations , we have
Clark, R M; Marker, P C; Kingsley, D M
2000-07-01
Polydactyly is a common malformation of vertebrate limbs. In humans a major locus for nonsyndromic pre-axial polydactyly (PPD) has been mapped previously to 7q36. The mouse Hemimelic extra-toes (Hx) mutation maps to a homologous chromosome segment and has been proposed to affect a homologous gene. To understand the molecular changes underlying PPD, we used a positional cloning approach to identify the gene or genes disrupted by the Hx mutation and a closely linked limb mutation, Hammertoe (Hm). High resolution genetic mapping identified a small candidate interval for the mouse mutations located 1.2 cM distal to the Shh locus. The nonrecombinant interval was completely cloned in bacterial artificial chromosomes and searched for genes using a combination of exon trapping, sample sequencing, and mapping of known genes. Two novel genes, Lmbr1 and Lmbr2, are entirely within the candidate interval we defined genetically. The open reading frame of both genes is intact in mutant mice, but the expression of the Lmbr1 gene is dramatically altered in developing limbs of Hx mutant mice. The correspondence between the spatial and temporal changes in Lmbr1 expression and the embryonic onset of the Hx mutant phenotype suggests that the mouse Hx mutation may be a regulatory allele of Lmbr1. The human ortholog of Lmbr1 maps within the recently described interval for human PPD, strengthening the possibility that both mouse and human limb abnormalities are due to defects in the same highly conserved gene.
CARHTA GENE: multipopulation integrated genetic and radiation hybrid mapping.
de Givry, Simon; Bouchez, Martin; Chabrier, Patrick; Milan, Denis; Schiex, Thomas
2005-04-15
CAR(H)(T)A GENE: is an integrated genetic and radiation hybrid (RH) mapping tool which can deal with multiple populations, including mixtures of genetic and RH data. CAR(H)(T)A GENE: performs multipoint maximum likelihood estimations with accelerated expectation-maximization algorithms for some pedigrees and has sophisticated algorithms for marker ordering. Dedicated heuristics for framework mapping are also included. CAR(H)(T)A GENE: can be used as a C++ library, through a shell command and a graphical interface. The XML output for companion tools is integrated. The program is available free of charge from www.inra.fr/bia/T/CarthaGene for Linux, Windows and Solaris machines (with Open Source). tschiex@toulouse.inra.fr.
Albaugh, Matthew D; Orr, Catherine; Chaarani, Bader; Althoff, Robert R; Allgaier, Nicholas; D'Alberto, Nicholas; Hudson, Kelsey; Mackey, Scott; Spechler, Philip A; Banaschewski, Tobias; Brühl, Rüdiger; Bokde, Arun L W; Bromberg, Uli; Büchel, Christian; Cattrell, Anna; Conrod, Patricia J; Desrivières, Sylvane; Flor, Herta; Frouin, Vincent; Gallinat, Jürgen; Goodman, Robert; Gowland, Penny; Grimmer, Yvonne; Heinz, Andreas; Kappel, Viola; Martinot, Jean-Luc; Paillère Martinot, Marie-Laure; Nees, Frauke; Orfanos, Dimitri Papadopoulos; Penttila, Jani; Poustka, Luise; Paus, Tomáš; Smolka, Michael N; Struve, Maren; Walter, Henrik; Whelan, Robert; Schumann, Gunter; Garavan, Hugh; Potter, Alexandra S
2017-11-01
Neuroimaging studies of attention-deficit/hyperactivity disorder (ADHD) have most commonly reported volumetric abnormalities in the basal ganglia, cerebellum, and prefrontal cortices. Few studies have examined the relationship between ADHD symptomatology and brain structure in population-based samples. We investigated the relationship between dimensional measures of ADHD symptomatology, brain structure, and reaction time variability-an index of lapses in attention. We also tested for associations between brain structural correlates of ADHD symptomatology and maps of dopaminergic gene expression. Psychopathology and imaging data were available for 1538 youths. Parent ratings of ADHD symptoms were obtained using the Development and Well-Being Assessment and the Strengths and Difficulties Questionnaire (SDQ). Self-reports of ADHD symptoms were assessed using the youth version of the SDQ. Reaction time variability was available in a subset of participants. For each measure, whole-brain voxelwise regressions with gray matter volume were calculated. Parent ratings of ADHD symptoms (Development and Well-Being Assessment and SDQ), adolescent self-reports of ADHD symptoms on the SDQ, and reaction time variability were each negatively associated with gray matter volume in an overlapping region of the ventromedial prefrontal cortex. Maps of DRD1 and DRD2 gene expression were associated with brain structural correlates of ADHD symptomatology. This is the first study to reveal relationships between ventromedial prefrontal cortex structure and multi-informant measures of ADHD symptoms in a large population-based sample of adolescents. Our results indicate that ventromedial prefrontal cortex structure is a biomarker for ADHD symptomatology. These findings extend previous research implicating the default mode network and dopaminergic dysfunction in ADHD. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Assortative mating and fragmentation within dog breeds.
Björnerfeldt, Susanne; Hailer, Frank; Nord, Maria; Vilà, Carles
2008-01-28
There are around 400 internationally recognized dog breeds in the world today, with a remarkable diversity in size, shape, color and behavior. Breeds are considered to be uniform groups with similar physical characteristics, shaped by selection rooted in human preferences. This has led to a large genetic difference between breeds and a large extent of linkage disequilibrium within breeds. These characteristics are important for association mapping of candidate genes for diseases and therefore make dogs ideal models for gene mapping of human disorders. However, genetic uniformity within breeds may not always be the case. We studied patterns of genetic diversity within 164 poodles and compared it to 133 dogs from eight other breeds. Our analyses revealed strong population structure within poodles, with differences among some poodle groups as pronounced as those among other well-recognized breeds. Pedigree analysis going three generations back in time confirmed that subgroups within poodles result from assortative mating imposed by breed standards as well as breeder preferences. Matings have not taken place at random or within traditionally identified size classes in poodles. Instead, a novel set of five poodle groups was identified, defined by combinations of size and color, which is not officially recognized by the kennel clubs. Patterns of genetic diversity in other breeds suggest that assortative mating leading to fragmentation may be a common feature within many dog breeds. The genetic structure observed in poodles is the result of local mating patterns, implying that breed fragmentation may be different in different countries. Such pronounced structuring within dog breeds can increase the power of association mapping studies, but also represents a serious problem if ignored. In dog breeding, individuals are selected on the basis of morphology, behaviour, working or show purposes, as well as geographic population structure. The same processes which have historically created dog breeds are still ongoing, and create further subdivision within current dog breeds.
Assortative mating and fragmentation within dog breeds
2008-01-01
Background There are around 400 internationally recognized dog breeds in the world today, with a remarkable diversity in size, shape, color and behavior. Breeds are considered to be uniform groups with similar physical characteristics, shaped by selection rooted in human preferences. This has led to a large genetic difference between breeds and a large extent of linkage disequilibrium within breeds. These characteristics are important for association mapping of candidate genes for diseases and therefore make dogs ideal models for gene mapping of human disorders. However, genetic uniformity within breeds may not always be the case. We studied patterns of genetic diversity within 164 poodles and compared it to 133 dogs from eight other breeds. Results Our analyses revealed strong population structure within poodles, with differences among some poodle groups as pronounced as those among other well-recognized breeds. Pedigree analysis going three generations back in time confirmed that subgroups within poodles result from assortative mating imposed by breed standards as well as breeder preferences. Matings have not taken place at random or within traditionally identified size classes in poodles. Instead, a novel set of five poodle groups was identified, defined by combinations of size and color, which is not officially recognized by the kennel clubs. Patterns of genetic diversity in other breeds suggest that assortative mating leading to fragmentation may be a common feature within many dog breeds. Conclusion The genetic structure observed in poodles is the result of local mating patterns, implying that breed fragmentation may be different in different countries. Such pronounced structuring within dog breeds can increase the power of association mapping studies, but also represents a serious problem if ignored. In dog breeding, individuals are selected on the basis of morphology, behaviour, working or show purposes, as well as geographic population structure. The same processes which have historically created dog breeds are still ongoing, and create further subdivision within current dog breeds. PMID:18226210
Bashatwah, Rasha M; Khanfar, Mohammad A; Bardaweel, Sanaa K
2018-05-08
Inorganic polyphosphate (polyP) is present in all living forms of life. Studied mainly in prokaryotes, polyP and its associated enzymes are vital in diverse metabolic activities, in some structural functions, and most importantly in stress responses. Bacterial species, including many pathogens, encode a homolog of a major polyP synthesis enzyme, Poly Phosphate Kinase (PPK) with 2 different genes coding for PPK1 and PPK2. Genetic deletion of the ppk1 gene leads to reduced polyP levels and the consequent loss of virulence and stress adaptation responses. This far, no PPK1 homolog has been identified in higher-order eukaryotes, and, therefore, PPK1 represents a novel target for chemotherapy. The aim of the current study is to investigate PPK1 from Escherichia coli with comprehensive understanding of the enzyme's structure and binding sites, which were used to design pharmacophores and screen a library of compounds for potential discovery of selective PPK1 inhibitors. Verification of the resultant inhibitors activities was conducted using a combination of mutagenic and chemical biological approaches. The metabolic phenotypic maps of the wild type E. coli (WT) and ppk1 knockout mutant were generated and compared with the metabolic map of the chemically inhibited WT. In addition, biofilm formation ability was measured in WT, ppk1 knockout mutant, and the chemically inhibited WT. The results demonstrated that chemical inhibition of PPK1, with the designed inhibitors, was equivalent to gene deletion in altering specific metabolic pathways, changing the metabolic fingerprint, and suppressing the ability of E. coli to form a biofilm. Copyright © 2018 John Wiley & Sons, Ltd.
NABIC marker database: A molecular markers information network of agricultural crops.
Kim, Chang-Kug; Seol, Young-Joo; Lee, Dong-Jun; Jeong, In-Seon; Yoon, Ung-Han; Lee, Gang-Seob; Hahn, Jang-Ho; Park, Dong-Suk
2013-01-01
In 2013, National Agricultural Biotechnology Information Center (NABIC) reconstructs a molecular marker database for useful genetic resources. The web-based marker database consists of three major functional categories: map viewer, RSN marker and gene annotation. It provides 7250 marker locations, 3301 RSN marker property, 3280 molecular marker annotation information in agricultural plants. The individual molecular marker provides information such as marker name, expressed sequence tag number, gene definition and general marker information. This updated marker-based database provides useful information through a user-friendly web interface that assisted in tracing any new structures of the chromosomes and gene positional functions using specific molecular markers. The database is available for free at http://nabic.rda.go.kr/gere/rice/molecularMarkers/
MC EMiNEM maps the interaction landscape of the Mediator.
Niederberger, Theresa; Etzold, Stefanie; Lidschreiber, Michael; Maier, Kerstin C; Martin, Dietmar E; Fröhlich, Holger; Cramer, Patrick; Tresch, Achim
2012-01-01
The Mediator is a highly conserved, large multiprotein complex that is involved essentially in the regulation of eukaryotic mRNA transcription. It acts as a general transcription factor by integrating regulatory signals from gene-specific activators or repressors to the RNA Polymerase II. The internal network of interactions between Mediator subunits that conveys these signals is largely unknown. Here, we introduce MC EMiNEM, a novel method for the retrieval of functional dependencies between proteins that have pleiotropic effects on mRNA transcription. MC EMiNEM is based on Nested Effects Models (NEMs), a class of probabilistic graphical models that extends the idea of hierarchical clustering. It combines mode-hopping Monte Carlo (MC) sampling with an Expectation-Maximization (EM) algorithm for NEMs to increase sensitivity compared to existing methods. A meta-analysis of four Mediator perturbation studies in Saccharomyces cerevisiae, three of which are unpublished, provides new insight into the Mediator signaling network. In addition to the known modular organization of the Mediator subunits, MC EMiNEM reveals a hierarchical ordering of its internal information flow, which is putatively transmitted through structural changes within the complex. We identify the N-terminus of Med7 as a peripheral entity, entailing only local structural changes upon perturbation, while the C-terminus of Med7 and Med19 appear to play a central role. MC EMiNEM associates Mediator subunits to most directly affected genes, which, in conjunction with gene set enrichment analysis, allows us to construct an interaction map of Mediator subunits and transcription factors.
Shin, Min-Kyoung; Shin, Seung Won; Jung, Myunghwan; Park, Hongtae; Park, Hyun-Eui; Yoo, Han Sang
2015-07-01
Mycobacterium avium subsp. paratuberculosis (MAP) is the causative agent of Johne's disease, which causes considerable economic loss in the dairy industry and has a possible relationship to Crohn's disease (CD) in humans. As MAP has been detected in retail pasteurized milk samples, its transmission via milk is of concern. Despite its possible role in the etiology of CD, there have been few studies examining the interactions between MAP and human cells. In the current study, we applied Ingenuity Pathway Analysis to the transcription profiles generated from a murine model with MAP infection as part of a previously conducted study. Twenty-one genes were selected as potential host immune responses, compared with the transcriptional profiles in naturally MAP-infected cattle, and validated in MAP-infected human monocyte-derived macrophage THP-1 cells. Of these, the potential host responses included up-regulation of genes related to immune response (CD14, S100A8, S100A9, LTF, HP and CHCIL3), up-regulation of Th1-polarizing factor (CCL4, CCL5, CXCL9 and CXCL10), down-regulation of genes related to metabolism (ELANE, IGF1, TCF7L2 and MPO) and no significant response of other genes (GADD45a, GPNMB, HMOX1, IFNG and NQO1) in THP-1 cells infected with MAP. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Novotny, Peter; Tang, Xiaojia; Kalari, Krishna R.; Gorodkin, Jan
2014-01-01
Traditional mutation assessment methods generally focus on predicting disruptive changes in protein-coding regions rather than non-coding regulatory regions like untranslated regions (UTRs) of mRNAs. The UTRs, however, are known to have many sequence and structural motifs that can regulate translational and transcriptional efficiency and stability of mRNAs through interaction with RNA-binding proteins and other non-coding RNAs like microRNAs (miRNAs). In a recent study, transcriptomes of tumor cells harboring mutant and wild-type KRAS (V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) genes in patients with non-small cell lung cancer (NSCLC) have been sequenced to identify single nucleotide variations (SNVs). About 40% of the total SNVs (73,717) identified were mapped to UTRs, but omitted in the previous analysis. To meet this obvious demand for analysis of the UTRs, we designed a comprehensive pipeline to predict the effect of SNVs on two major regulatory elements, secondary structure and miRNA target sites. Out of 29,290 SNVs in 6462 genes, we predict 472 SNVs (in 408 genes) affecting local RNA secondary structure, 490 SNVs (in 447 genes) affecting miRNA target sites and 48 that do both. Together these disruptive SNVs were present in 803 different genes, out of which 188 (23.4%) were previously known to be cancer-associated. Notably, this ratio is significantly higher (one-sided Fisher's exact test p-value = 0.032) than the ratio (20.8%) of known cancer-associated genes (n = 1347) in our initial data set (n = 6462). Network analysis shows that the genes harboring disruptive SNVs were involved in molecular mechanisms of cancer, and the signaling pathways of LPS-stimulated MAPK, IL-6, iNOS, EIF2 and mTOR. In conclusion, we have found hundreds of SNVs which are highly disruptive with respect to changes in the secondary structure and miRNA target sites within UTRs. These changes hold the potential to alter the expression of known cancer genes or genes linked to cancer-associated pathways. PMID:24416147
Sabarinathan, Radhakrishnan; Wenzel, Anne; Novotny, Peter; Tang, Xiaojia; Kalari, Krishna R; Gorodkin, Jan
2014-01-01
Traditional mutation assessment methods generally focus on predicting disruptive changes in protein-coding regions rather than non-coding regulatory regions like untranslated regions (UTRs) of mRNAs. The UTRs, however, are known to have many sequence and structural motifs that can regulate translational and transcriptional efficiency and stability of mRNAs through interaction with RNA-binding proteins and other non-coding RNAs like microRNAs (miRNAs). In a recent study, transcriptomes of tumor cells harboring mutant and wild-type KRAS (V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) genes in patients with non-small cell lung cancer (NSCLC) have been sequenced to identify single nucleotide variations (SNVs). About 40% of the total SNVs (73,717) identified were mapped to UTRs, but omitted in the previous analysis. To meet this obvious demand for analysis of the UTRs, we designed a comprehensive pipeline to predict the effect of SNVs on two major regulatory elements, secondary structure and miRNA target sites. Out of 29,290 SNVs in 6462 genes, we predict 472 SNVs (in 408 genes) affecting local RNA secondary structure, 490 SNVs (in 447 genes) affecting miRNA target sites and 48 that do both. Together these disruptive SNVs were present in 803 different genes, out of which 188 (23.4%) were previously known to be cancer-associated. Notably, this ratio is significantly higher (one-sided Fisher's exact test p-value = 0.032) than the ratio (20.8%) of known cancer-associated genes (n = 1347) in our initial data set (n = 6462). Network analysis shows that the genes harboring disruptive SNVs were involved in molecular mechanisms of cancer, and the signaling pathways of LPS-stimulated MAPK, IL-6, iNOS, EIF2 and mTOR. In conclusion, we have found hundreds of SNVs which are highly disruptive with respect to changes in the secondary structure and miRNA target sites within UTRs. These changes hold the potential to alter the expression of known cancer genes or genes linked to cancer-associated pathways.
He, Huagang; Zhu, Shanying; Jiang, Zhengning; Ji, Yaoyong; Wang, Feng; Zhao, Renhui; Bie, Tongde
2016-04-01
The powdery mildew resistance gene Pm21 was physically and comparatively mapped by newly developed markers. Seven candidate genes were verified to be required for Pm21 -mediated resistance to wheat powdery mildew. Pm21, a gene derived from wheat wild relative Dasypyrum villosum, has been transferred into common wheat and widely utilized in wheat resistance breeding for powdery mildew. Previously, Pm21 has been located to the bin FL0.45-0.58 of 6VS by using deletion stocks. However, its fine mapping is still a hard work. In the present study, 30 gene-derived 6VS-specific markers were obtained based on the collinearity among genomes of Brachypodium distachyon, Oryza and Triticeae, and then physically and comparatively mapped in the bin FL0.45-0.58 and its nearby chromosome region. According to the maps, the bin FL0.45-0.58 carrying Pm21 was closely flanked by the markers 6VS-03 and 6VS-23, which further narrowed the orthologous regions to 1.06 Mb in Brachypodium and 1.38 Mb in rice, respectively. Among the conserved genes shared by Brachypodium and rice, four serine/threonine protein kinase genes (DvMPK1, DvMLPK, DvUPK and DvPSYR1), one protein phosphatase gene (DvPP2C) and two transcription factor genes (DvGATA and DvWHY) were confirmed to be required for Pm21-mediated resistance to wheat powdery mildew by barley stripe mosaic virus-induced gene silencing (BSMV-VIGS) and transcriptional pattern analyses. In summary, this study gives new insights into the genetic basis of the Pm21 locus and the disease resistance pathways mediated by Pm21.
DNA Probe Pooling for Rapid Delineation of Chromosomal Breakpoints
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Chun-Mei; Kwan, Johnson; Baumgartner, Adolf
2009-01-30
Structural chromosome aberrations are hallmarks of many human genetic diseases. The precise mapping of translocation breakpoints in tumors is important for identification of genes with altered levels of expression, prediction of tumor progression, therapy response, or length of disease-free survival as well as the preparation of probes for detection of tumor cells in peripheral blood. Similarly, in vitro fertilization (IVF) and preimplantation genetic diagnosis (PGD) for carriers of balanced, reciprocal translocations benefit from accurate breakpoint maps in the preparation of patient-specific DNA probes followed by a selection of normal or balanced oocytes or embryos. We expedited the process of breakpointmore » mapping and preparation of case-specific probes by utilizing physically mapped bacterial artificial chromosome (BAC) clones. Historically, breakpoint mapping is based on the definition of the smallest interval between proximal and distal probes. Thus, many of the DNA probes prepared for multi-clone and multi-color mapping experiments do not generate additional information. Our pooling protocol described here with examples from thyroid cancer research and PGD accelerates the delineation of translocation breakpoints without sacrificing resolution. The turnaround time from clone selection to mapping results using tumor or IVF patient samples can be as short as three to four days.« less
Ma, Xin; Fu, Yongcai; Zhao, Xinhui; Jiang, Liyun; Zhu, Zuofeng; Gu, Ping; Xu, Wenying; Su, Zhen; Sun, Chuanqing; Tan, Lubin
2016-01-01
Oryza nivara, an annual wild AA-genome species of rice, is an important gene pool for broadening the genetic diversity of cultivated rice (O. sativa L.). Towards identifying and utilizing favourable alleles from O. nivara, we developed a set of introgression lines (ILs) by introducing O. nivara segments into the elite indica rice variety 93-11 background through advanced backcrossing and repeated selfing. Using whole-genome resequencing, a high-density genetic map containing 1,070 bin-markers was constructed for the 131 ILs, with an average length of 349 kb per bin. The 131 ILs cover 95% of O. nivara genome, providing a relatively complete genomic library for introgressing O. nivara alleles for trait improvement. Using this high-density bin-map, QTL mapping for 13 yield-related traits was performed and a total of 65 QTLs were detected across two environments. At ~36.9% of detected QTLs, the alleles from O. nivara conferred improving effects on yield-associated traits. Six cloned genes, Sh4/SHA1, Bh4, Sd1, TE/TAD1, GS3 and FZP, colocalised in the peak intervals of 9 QTLs. In conclusion, we developed new genetic materials for exploration and use of beneficial alleles from wild rice and provided a basis for future fine mapping and cloning of the favourable O. nivara-derived QTLs. PMID:27251022
Chen, Shisheng; Guo, Yan; Briggs, Jordan; Dubach, Felix; Chao, Shiaoman; Zhang, Wenjun; Rouse, Matthew N; Dubcovsky, Jorge
2018-03-01
The new stem rust resistance gene Sr60 was fine-mapped to the distal region of chromosome arm 5A m S, and the TTKSK-effective gene SrTm5 could be a new allele of Sr22. The emergence and spread of new virulent races of the wheat stem rust pathogen (Puccinia graminis f. sp. tritici; Pgt), including the Ug99 race group, is a serious threat to global wheat production. In this study, we mapped and characterized two stem rust resistance genes from diploid wheat Triticum monococcum accession PI 306540. We mapped SrTm5, a previously postulated gene effective to Ug99, on chromosome arm 7A m L, completely linked to Sr22. SrTm5 displayed a different race specificity compared to Sr22 indicating that they are distinct. Sequencing of the Sr22 homolog in PI 306540 revealed a novel haplotype. Characterization of the segregating populations with Pgt race QFCSC revealed an additional resistance gene on chromosome arm 5A m S that was assigned the official name Sr60. This gene was also effective against races QTHJC and SCCSC but not against TTKSK (a Ug99 group race). Using two large mapping populations (4046 gametes), we mapped Sr60 within a 0.44 cM interval flanked by sequenced-based markers GH724575 and CJ942731. These two markers delimit a 54.6-kb region in Brachypodium distachyon chromosome 4 and a 430-kb region in the Chinese Spring reference genome. Both regions include a leucine-rich repeat protein kinase (LRRK123.1) that represents a potential candidate gene. Three CC-NBS-LRR genes were found in the colinear Brachypodium region but not in the wheat genome. We are currently developing a Bacterial Artificial Chromosome library of PI 306540 to determine which of these candidate genes are present in the T. monococcum genome and to complete the cloning of Sr60.
USDA-ARS?s Scientific Manuscript database
To better understand maize endosperm filling and maturation, we developed a novel functional genomics platform that combined Bulked Segregant RNA and Exome sequencing (BSREx-seq) to map causative mutations and identify candidate genes within mapping intervals. Using gamma-irradiation of B73 maize to...
Taroncher-Oldenburg, Gaspar; Anderson, Donald M.
2000-01-01
Genes showing differential expression related to the early G1 phase of the cell cycle during synchronized circadian growth of the toxic dinoflagellate Alexandrium fundyense were identified and characterized by differential display (DD). The determination in our previous work that toxin production in Alexandrium is relegated to a narrow time frame in early G1 led to the hypothesis that transcriptionally up- or downregulated genes during this subphase of the cell cycle might be related to toxin biosynthesis. Three genes, encoding S-adenosylhomocysteine hydrolase (Sahh), methionine aminopeptidase (Map), and a histone-like protein (HAf), were isolated. Sahh was downregulated, while Map and HAf were upregulated, during the early G1 phase of the cell cycle. Sahh and Map encoded amino acid sequences with about 90 and 70% similarity to those encoded by several eukaryotic and prokaryotic Sahh and Map genes, respectively. The partial Map sequence also contained three cobalt binding motifs characteristic of all Map genes. HAf encoded an amino acid sequence with 60% similarity to those of two histone-like proteins from the dinoflagellate Crypthecodinium cohnii Biecheler. This study documents the potential of applying DD to the identification of genes that are related to physiological processes or cell cycle events in phytoplankton under conditions where small sample volumes represent an experimental constraint. The identification of an additional 21 genes with various cell cycle-related DD patterns also provides evidence for the importance of pretranslational or transcriptional regulation in dinoflagellates, contrary to previous reports suggesting the possibility that translational mechanisms are the primary means of circadian regulation in this group of organisms. PMID:10788388
Wang, Chun Ming; Lo, Loong Chueng; Feng, Felicia; Gong, Ping; Li, Jian; Zhu, Ze Yuan; Lin, Grace; Yue, Gen Hua
2008-03-25
Barramundi (Lates calcarifer) is an important farmed marine food fish species. Its first generation linkage map has been applied to map QTL for growth traits. To identify genes located in QTL responsible for specific traits, genomic large insert libraries are of crucial importance. We reported herein a bacterial artificial chromosome (BAC) library and the mapping of BAC clones to the linkage map. This BAC library consisted of 49,152 clones with an average insert size of 98 kb, representing 6.9-fold haploid genome coverage. Screening the library with 24 microsatellites and 15 ESTs/genes demonstrated that the library had good genome coverage. In addition, 62 novel microsatellites each isolated from 62 BAC clones were mapped onto the first generation linkage map. A total of 86 BAC clones were anchored on the linkage map with at least one BAC clone on each linkage group. We have constructed the first BAC library for L. calcarifer and mapped 86 BAC clones to the first generation linkage map. This BAC library and the improved linkage map with 302 DNA markers not only supply an indispensable tool to the integration of physical and linkage maps, the fine mapping of QTL and map based cloning genes located in QTL of commercial importance, but also contribute to comparative genomic studies and eventually whole genome sequencing.
Contribution of radiation hybrids to genome mapping in domestic animals.
Faraut, T; de Givry, S; Hitte, C; Lahbib-Mansais, Y; Morisson, M; Milan, D; Schiex, T; Servin, B; Vignal, A; Galibert, F; Yerle, M
2009-01-01
Radiation hybrid mapping has emerged in the end of the 1990 s as a successful and complementary approach to map genomes, essentially because of its ability to bridge the gaps between genetic and clone-based physical maps, but also using comparative mapping approaches, between 'gene-rich' and 'gene-poor' maps. Since its early development in human, radiation hybrid mapping played a pivotal role in the process of mapping animal genomes, especially mammalian ones. We review here all the different steps involved in radiation hybrid mapping from the constitution of panels to the construction of maps. A description of its contribution to whole genome maps with a special emphasis on domestic animals will also be presented. Finally, current applications of radiation hybrid mapping in the context of whole genome assemblies will be described. (c) 2009 S. Karger AG, Basel.
Moraxella catarrhalis synthesizes an autotransporter that is an acid phosphatase.
Hoopman, Todd C; Wang, Wei; Brautigam, Chad A; Sedillo, Jennifer L; Reilly, Thomas J; Hansen, Eric J
2008-02-01
Moraxella catarrhalis O35E was shown to synthesize a 105-kDa protein that has similarity to both acid phosphatases and autotransporters. The N-terminal portion of the M. catarrhalis acid phosphatase A (MapA) was most similar (the BLAST probability score was 10(-10)) to bacterial class A nonspecific acid phosphatases. The central region of the MapA protein had similarity to passenger domains of other autotransporter proteins, whereas the C-terminal portion of MapA resembled the translocation domain of conventional autotransporters. Cloning and expression of the M. catarrhalis mapA gene in Escherichia coli confirmed the presence of acid phosphatase activity in the MapA protein. The MapA protein was shown to be localized to the outer membrane of M. catarrhalis and was not detected either in the soluble cytoplasmic fraction from disrupted M. catarrhalis cells or in the spent culture supernatant fluid from M. catarrhalis. Use of the predicted MapA translocation domain in a fusion construct with the passenger domain from another predicted M. catarrhalis autotransporter confirmed the translocation ability of this MapA domain. Inactivation of the mapA gene in M. catarrhalis strain O35E reduced the acid phosphatase activity expressed by this organism, and this mutation could be complemented in trans with the wild-type mapA gene. Nucleotide sequence analysis of the mapA gene from six M. catarrhalis strains showed that this protein was highly conserved among strains of this pathogen. Site-directed mutagenesis of a critical histidine residue (H233A) in the predicted active site of the acid phosphatase domain in MapA eliminated acid phosphatase activity in the recombinant MapA protein. This is the first description of an autotransporter protein that expresses acid phosphatase activity.
Moraxella catarrhalis Synthesizes an Autotransporter That Is an Acid Phosphatase▿
Hoopman, Todd C.; Wang, Wei; Brautigam, Chad A.; Sedillo, Jennifer L.; Reilly, Thomas J.; Hansen, Eric J.
2008-01-01
Moraxella catarrhalis O35E was shown to synthesize a 105-kDa protein that has similarity to both acid phosphatases and autotransporters. The N-terminal portion of the M. catarrhalis acid phosphatase A (MapA) was most similar (the BLAST probability score was 10−10) to bacterial class A nonspecific acid phosphatases. The central region of the MapA protein had similarity to passenger domains of other autotransporter proteins, whereas the C-terminal portion of MapA resembled the translocation domain of conventional autotransporters. Cloning and expression of the M. catarrhalis mapA gene in Escherichia coli confirmed the presence of acid phosphatase activity in the MapA protein. The MapA protein was shown to be localized to the outer membrane of M. catarrhalis and was not detected either in the soluble cytoplasmic fraction from disrupted M. catarrhalis cells or in the spent culture supernatant fluid from M. catarrhalis. Use of the predicted MapA translocation domain in a fusion construct with the passenger domain from another predicted M. catarrhalis autotransporter confirmed the translocation ability of this MapA domain. Inactivation of the mapA gene in M. catarrhalis strain O35E reduced the acid phosphatase activity expressed by this organism, and this mutation could be complemented in trans with the wild-type mapA gene. Nucleotide sequence analysis of the mapA gene from six M. catarrhalis strains showed that this protein was highly conserved among strains of this pathogen. Site-directed mutagenesis of a critical histidine residue (H233A) in the predicted active site of the acid phosphatase domain in MapA eliminated acid phosphatase activity in the recombinant MapA protein. This is the first description of an autotransporter protein that expresses acid phosphatase activity. PMID:18065547
Kujur, Alice; Upadhyaya, Hari D.; Shree, Tanima; Bajaj, Deepak; Das, Shouvik; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We discovered 26785 and 16573 high-quality SNPs differentiating two parental genotypes of a RIL mapping population using reference desi and kabuli genome-based GBS assay. Of these, 3625 and 2177 SNPs have been integrated into eight desi and kabuli chromosomes, respectively in order to construct ultra-high density (0.20–0.37 cM) intra-specific chickpea genetic linkage maps. One of these constructed high-resolution genetic map has potential to identify 33 major genomic regions harbouring 35 robust QTLs (PVE: 17.9–39.7%) associated with three agronomic traits, which were mapped within <1 cM mean marker intervals on desi chromosomes. The extended LD (linkage disequilibrium) decay (~15 cM) in chromosomes of genetic maps have encouraged us to use a rapid integrated approach (comparative QTL mapping, QTL-region specific haplotype/LD-based trait association analysis, expression profiling and gene haplotype-based association mapping) rather than a traditional QTL map-based cloning method to narrow-down one major seed weight (SW) robust QTL region. It delineated favourable natural allelic variants and superior haplotype-containing one seed-specific candidate embryo defective gene regulating SW in chickpea. The ultra-high-resolution genetic maps, QTLs/genes and alleles/haplotypes-related genomic information generated and integrated strategy for rapid QTL/gene identification developed have potential to expedite genomics-assisted breeding applications in crop plants, including chickpea for their genetic enhancement. PMID:25942004
Ramiah, K; van Reenen, C A; Dicks, L M T
2007-05-30
Expression of the mucus adhesion genes Mub and MapA, adhesion-like factor EF-Tu and bacteriocin gene plaA by Lactobacillus plantarum 423, grown in the presence of bile, pancreatin and at low pH, was studied by real-time PCR. Mub, MapA and EF-Tu were up-regulated in the presence of mucus, proportional to increasing concentrations. Expression of MapA was up-regulated in the presence of 3.0 g/l bile and 3.0 g/l pancreatin at pH 6.5. Similar results were recorded in the presence of 10.0 g/l bile and 10.0 g/l pancreatin at pH 6.5. Expression of Mub was down-regulated in the presence of bile and pancreatin, whilst the expression of EF-Tu and plaA remained unchanged. Expression of Mub and MapA remained unchanged at pH 4.0, whilst expression of EF-Tu and plaA were up-regulated. Expression of MapA was down-regulated in the presence of 1.0 g/l l-cysteine HCl, suggesting that the gene is regulated by transcription attenuation that involves cysteine.
Famoso, Adam N.; Zhao, Keyan; Clark, Randy T.; Tung, Chih-Wei; Wright, Mark H.; Bustamante, Carlos; Kochian, Leon V.; McCouch, Susan R.
2011-01-01
Aluminum (Al) toxicity is a primary limitation to crop productivity on acid soils, and rice has been demonstrated to be significantly more Al tolerant than other cereal crops. However, the mechanisms of rice Al tolerance are largely unknown, and no genes underlying natural variation have been reported. We screened 383 diverse rice accessions, conducted a genome-wide association (GWA) study, and conducted QTL mapping in two bi-parental populations using three estimates of Al tolerance based on root growth. Subpopulation structure explained 57% of the phenotypic variation, and the mean Al tolerance in Japonica was twice that of Indica. Forty-eight regions associated with Al tolerance were identified by GWA analysis, most of which were subpopulation-specific. Four of these regions co-localized with a priori candidate genes, and two highly significant regions co-localized with previously identified QTLs. Three regions corresponding to induced Al-sensitive rice mutants (ART1, STAR2, Nrat1) were identified through bi-parental QTL mapping or GWA to be involved in natural variation for Al tolerance. Haplotype analysis around the Nrat1 gene identified susceptible and tolerant haplotypes explaining 40% of the Al tolerance variation within the aus subpopulation, and sequence analysis of Nrat1 identified a trio of non-synonymous mutations predictive of Al sensitivity in our diversity panel. GWA analysis discovered more phenotype–genotype associations and provided higher resolution, but QTL mapping identified critical rare and/or subpopulation-specific alleles not detected by GWA analysis. Mapping using Indica/Japonica populations identified QTLs associated with transgressive variation where alleles from a susceptible aus or indica parent enhanced Al tolerance in a tolerant Japonica background. This work supports the hypothesis that selectively introgressing alleles across subpopulations is an efficient approach for trait enhancement in plant breeding programs and demonstrates the fundamental importance of subpopulation in interpreting and manipulating the genetics of complex traits in rice. PMID:21829395
DOE Office of Scientific and Technical Information (OSTI.GOV)
Flejter, W.L.; McDaniel, L.D.; Johns, D.
1992-01-01
Cultured cells from individuals afflicted with the genetically heterogeneous autosomal recessive disorder xeroderma pigmentosum (XP) exhibit sensitivity to UV radiation and defective nucleotide excision repair. Complementation of these mutant phenotypes after the introduction of single human chromosomes from repair-proficient cells into XP cells has provided a means of mapping the genes involved in this disease. The authors now report the phenotypic correction of XP cells from genetic complementation group D (XP-D) by a single human chromosome designated Tneo. Detailed molecular characterization of Tneo revealed a rearranged structure involving human chromosomes 16 and 19, including the excision repair cross-complementing 2 (ERCC2)more » gene from the previously described human DNA repair gene cluster at 19q13.2-q13.3. Direct transfer of a cosmid bearing the ERCC2 gene conferred UV resistance to XP-D cells.« less
Map-Based Cloning of Genes Important for Maize Anther Development
NASA Astrophysics Data System (ADS)
Anaya, Y.; Walbot, V.; Nan, G.
2012-12-01
Map-Based cloning for maize mutant MS13 . Scientists still do not understand what decides the fate of a cell in plants. Many maize genes are important for anther development and when they are disrupted, the anthers do not shed pollen, i.e. male sterile. Since the maize genome has been fully sequenced, we conduct map-based cloning using a bulk segregant analysis strategy. Using PCR (polymerase chain reaction), we look for biomarkers that are linked to our gene of interest, Male Sterile 13 (MS13). Recombinations occur more often if the biomarkers are further away from the gene, therefore we can estimate where the gene is and design more PCR primers to get closer to our gene. Genetic and molecular analysis will help distinguish the role of key genes in setting cell fates before meiosis and for being in charge of the switch from mitosis to meiosis.
Fine mapping of regulatory loci for mammalian gene expression using radiation hybrids
Park, Christopher C; Ahn, Sangtae; Bloom, Joshua S; Lin, Andy; Wang, Richard T; Wu, Tongtong; Sekar, Aswin; Khan, Arshad H; Farr, Christine J; Lusis, Aldons J; Leahy, Richard M; Lange, Kenneth; Smith, Desmond J
2010-01-01
We mapped regulatory loci for nearly all protein-coding genes in mammals using comparative genomic hybridization and expression array measurements from a panel of mouse–hamster radiation hybrid cell lines. The large number of breaks in the mouse chromosomes and the dense genotyping of the panel allowed extremely sharp mapping of loci. As the regulatory loci result from extra gene dosage, we call them copy number expression quantitative trait loci, or ceQTLs. The −2log10P support interval for the ceQTLs was <150 kb, containing an average of <2–3 genes. We identified 29,769 trans ceQTLs with −log10P > 4, including 13 hotspots each regulating >100 genes in trans. Further, this work identifies 2,761 trans ceQTLs harboring no known genes, and provides evidence for a mode of gene expression autoregulation specific to the X chromosome. PMID:18362883
Warburton, Marilyn L; Williams, William Paul; Hawkins, Leigh; Bridges, Susan; Gresham, Cathy; Harper, Jonathan; Ozkan, Seval; Mylroie, J Erik; Shan, Xueyan
2011-07-01
A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of selected maize gene sequences with resistance under field conditions. Resources include a database of genetic and protein sequences associated with the reduction in aflatoxin contamination from previous studies; eight diverse inbred maize lines for polymorphism identification within any maize gene sequence; four Quantitative Trait Loci (QTL) mapping populations and one association mapping panel, all phenotyped for aflatoxin accumulation resistance and associated phenotypes; and capacity for Insertion/Deletion (InDel) and SNP genotyping in the population(s) for mapping. To date, ten genes have been identified as possible candidate genes and put through the candidate gene testing pipeline, and results are presented here to demonstrate the utility of the pipeline.
NASA Astrophysics Data System (ADS)
Ye, Weiming; Li, Pengfei; Huang, Xuhui; Xia, Qinzhi; Mi, Yuanyuan; Chen, Runsheng; Hu, Gang
2010-10-01
Exploring the principle and relationship of gene transcriptional regulations (TR) has been becoming a generally researched issue. So far, two major mathematical methods, ordinary differential equation (ODE) method and Boolean map (BM) method have been widely used for these purposes. It is commonly believed that simplified BMs are reasonable approximations of more realistic ODEs, and both methods may reveal qualitatively the same essential features though the dynamical details of both systems may show some differences. In this Letter we exhaustively enumerated all the 3-gene networks and many autonomous randomly constructed TR networks with more genes by using both the ODE and BM methods. In comparison we found that both methods provide practically identical results in most of cases of steady solutions. However, to our great surprise, most of network structures showing periodic cycles with the BM method possess only stationary states in ODE descriptions. These observations strongly suggest that many periodic oscillations and other complicated oscillatory states revealed by the BM rule may be related to the computational errors of variable and time discretizations and rarely have correspondence in realistic biology transcriptional regulatory circuits.
Wang, Ji; Kang, Rongyan; Huang, He; Xi, Xueyan; Wang, Bei; Wang, Jianwei; Zhao, Zhendong
2014-01-01
HCV infection induces autophagy, but how this occurs is unclear. Here, we report the induction of autophagy by the structural HCV core protein and subsequent endoplasmic reticular (ER) stress in Huh7 hepatoma cells. During ER stress, both the EIF2AK3 and ATF6 pathways of the unfolded protein response (UPR) were activated by HCV core protein. Then, these pathways upregulated transcription factors ATF4 and DDIT3. The ERN1-XBP1 pathway was not activated. Through ATF4 in the EIF2AK3 pathway, the autophagy gene ATG12 was upregulated. DDIT3 upregulated the transcription of autophagy gene MAP1LC3B (LC3B) by directly binding to the –253 to –99 base region of the LC3B promoter, contributing to the development of autophagy. Collectively, these data suggest not only a novel role for the HCV core protein in autophagy but also offer new insight into detailed molecular mechanisms with respect to HCV-induced autophagy, specifically how downstream UPR molecules regulate key autophagic gene expression. PMID:24589849
Martín, A C; López, R; García, P
1996-06-01
Cp-1, a bacteriophage infecting Streptococcus pneumoniae, has a linear double-stranded DNA genome, with a terminal protein covalently linked to its 5' ends, that replicates by the protein-priming mechanism. We describe here the complete DNA sequence and transcriptional map of the Cp-1 genome. These analyses have led to the firm assignment of 10 genes and the localization of 19 additional open reading frames in the 19,345-bp Cp-1 DNA. Striking similarities and differences between some of these proteins and those of the Bacillus subtilis phage phi 29, a system that also replicates its DNA by the protein-priming mechanism, have been revealed. The genes coding for structural proteins and assembly factors are located in the central part of the Cp-1 genome. Several proteins corresponding to the predicted gene products were identified by in vitro and in vivo expression of the cloned genes. Mature major head protein from the virion particles results from hydrolysis of the primary gene product at the His-49 residue, whereas the phage gene is expressed in Escherichia coli without modification. We have also identified two open reading frames coding for proteins that show high degrees of similarity to the N- and C-terminal regions, respectively, of the single tail protein identified in phi 29. Sequencing and primer extension analysis suggest transcription of a small RNA showing a secondary structure similar to that of the prohead RNA required for the ATP-dependent packaging of phi 29 DNA. On the basis of its temporal expression, transcription of the Cp-1 genome takes place in two stages, early and late. Combined Northern (RNA) blot and primer extension experiments allowed us to map the 5' initiation sites of the transcripts, and we found that only three genes were transcribed from right to left. These analyses reveal that there are also noticeable differences between Cp-l and phi 29 in transcriptional organization. Considered together, the observations reported here provide new tangible evidence on phylogenetic relationships between B. subtilis and S. pneumoniae.
A High-Density Admixture Map for Disease Gene Discovery in African Americans
Smith, Michael W. ; Patterson, Nick ; Lautenberger, James A. ; Truelove, Ann L. ; McDonald, Gavin J. ; Waliszewska, Alicja ; Kessing, Bailey D. ; Malasky, Michael J. ; Scafe, Charles ; Le, Ernest ; De Jager, Philip L. ; Mignault, Andre A. ; Yi, Zeng ; de Thé, Guy ; Essex, Myron ; Sankalé, Jean-Louis ; Moore, Jason H. ; Poku, Kwabena ; Phair, John P. ; Goedert, James J. ; Vlahov, David ; Williams, Scott M. ; Tishkoff, Sarah A. ; Winkler, Cheryl A. ; De La Vega, Francisco M. ; Woodage, Trevor ; Sninsky, John J. ; Hafler, David A. ; Altshuler, David ; Gilbert, Dennis A. ; O’Brien, Stephen J. ; Reich, David
2004-01-01
Admixture mapping (also known as “mapping by admixture linkage disequilibrium,” or MALD) provides a way of localizing genes that cause disease, in admixed ethnic groups such as African Americans, with ∼100 times fewer markers than are required for whole-genome haplotype scans. However, it has not been possible to perform powerful scans with admixture mapping because the method requires a dense map of validated markers known to have large frequency differences between Europeans and Africans. To create such a map, we screened through databases containing ∼450,000 single-nucleotide polymorphisms (SNPs) for which frequencies had been estimated in African and European population samples. We experimentally confirmed the frequencies of the most promising SNPs in a multiethnic panel of unrelated samples and identified 3,011 as a MALD map (1.2 cM average spacing). We estimate that this map is ∼70% informative in differentiating African versus European origins of chromosomal segments. This map provides a practical and powerful tool, which is freely available without restriction, for screening for disease genes in African American patient cohorts. The map is especially appropriate for those diseases that differ in incidence between the parental African and European populations. PMID:15088270
Mapping genes to human chromosome 19
DOE Office of Scientific and Technical Information (OSTI.GOV)
Connolly, Sarah
1996-05-01
For this project, 22 Expressed Sequence Tags (ESTs) were fine mapped to regions of human chromosome 19. An EST is a short DNA sequence that occurs once in the genome and corresponds to a single expressed gene. {sup 32}P-radiolabeled probes were made by polymerase chain reaction for each EST and hybridized to filters containing a chromosome 19-specific cosmid library. The location of the ESTs on the chromosome was determined by the location of the ordered cosmid to which the EST hybridized. Of the 22 ESTs that were sublocalized, 6 correspond to known genes, and 16 correspond to anonymous genes. Thesemore » localized ESTs may serve as potential candidates for disease genes, as well as markers for future physical mapping.« less
Winchester, Catherine L; Ohzeki, Hiromitsu; Vouyiouklis, Demetrius A; Thompson, Rhiannon; Penninger, Josef M; Yamagami, Keiji; Norrie, John D; Hunter, Robert; Pratt, Judith A; Morris, Brian J
2012-11-15
Schizophrenia is a debilitating psychiatric disease with a strong genetic contribution, potentially linked to altered glutamatergic function in brain regions such as the prefrontal cortex (PFC). Here, we report converging evidence to support a functional candidate gene for schizophrenia. In post-mortem PFC from patients with schizophrenia, we detected decreased expression of MKK7/MAP2K7-a kinase activated by glutamatergic activity. While mice lacking one copy of the Map2k7 gene were overtly normal in a variety of behavioural tests, these mice showed a schizophrenia-like cognitive phenotype of impaired working memory. Additional support for MAP2K7 as a candidate gene came from a genetic association study. A substantial effect size (odds ratios: ~1.9) was observed for a common variant in a cohort of case and control samples collected in the Glasgow area and also in a replication cohort of samples of Northern European descent (most significant P-value: 3 × 10(-4)). While some caution is warranted until these association data are further replicated, these results are the first to implicate the candidate gene MAP2K7 in genetic risk for schizophrenia. Complete sequencing of all MAP2K7 exons did not reveal any non-synonymous mutations. However, the MAP2K7 haplotype appeared to have functional effects, in that it influenced the level of expression of MAP2K7 mRNA in human PFC. Taken together, the results imply that reduced function of the MAP2K7-c-Jun N-terminal kinase (JNK) signalling cascade may underlie some of the neurochemical changes and core symptoms in schizophrenia.
Agarwal, Gaurav; Clevenger, Josh; Pandey, Manish K; Wang, Hui; Shasidhar, Yaduru; Chu, Ye; Fountain, Jake C; Choudhary, Divya; Culbreath, Albert K; Liu, Xin; Huang, Guodong; Wang, Xingjun; Deshmukh, Rupesh; Holbrook, C Corley; Bertioli, David J; Ozias-Akins, Peggy; Jackson, Scott A; Varshney, Rajeev K; Guo, Baozhu
2018-04-10
Whole-genome resequencing (WGRS) of mapping populations has facilitated development of high-density genetic maps essential for fine mapping and candidate gene discovery for traits of interest in crop species. Leaf spots, including early leaf spot (ELS) and late leaf spot (LLS), and Tomato spotted wilt virus (TSWV) are devastating diseases in peanut causing significant yield loss. We generated WGRS data on a recombinant inbred line population, developed a SNP-based high-density genetic map, and conducted fine mapping, candidate gene discovery and marker validation for ELS, LLS and TSWV. The first sequence-based high-density map was constructed with 8869 SNPs assigned to 20 linkage groups, representing 20 chromosomes, for the 'T' population (Tifrunner × GT-C20) with a map length of 3120 cM and an average distance of 1.45 cM. The quantitative trait locus (QTL) analysis using high-density genetic map and multiple season phenotyping data identified 35 main-effect QTLs with phenotypic variation explained (PVE) from 6.32% to 47.63%. Among major-effect QTLs mapped, there were two QTLs for ELS on B05 with 47.42% PVE and B03 with 47.38% PVE, two QTLs for LLS on A05 with 47.63% and B03 with 34.03% PVE and one QTL for TSWV on B09 with 40.71% PVE. The epistasis and environment interaction analyses identified significant environmental effects on these traits. The identified QTL regions had disease resistance genes including R-genes and transcription factors. KASP markers were developed for major QTLs and validated in the population and are ready for further deployment in genomics-assisted breeding in peanut. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Comparative mapping in the Fagaceae and beyond with EST-SSRs
2012-01-01
Background Genetic markers and linkage mapping are basic prerequisites for comparative genetic analyses, QTL detection and map-based cloning. A large number of mapping populations have been developed for oak, but few gene-based markers are available for constructing integrated genetic linkage maps and comparing gene order and QTL location across related species. Results We developed a set of 573 expressed sequence tag-derived simple sequence repeats (EST-SSRs) and located 397 markers (EST-SSRs and genomic SSRs) on the 12 oak chromosomes (2n = 2x = 24) on the basis of Mendelian segregation patterns in 5 full-sib mapping pedigrees of two species: Quercus robur (pedunculate oak) and Quercus petraea (sessile oak). Consensus maps for the two species were constructed and aligned. They showed a high degree of macrosynteny between these two sympatric European oaks. We assessed the transferability of EST-SSRs to other Fagaceae genera and a subset of these markers was mapped in Castanea sativa, the European chestnut. Reasonably high levels of macrosynteny were observed between oak and chestnut. We also obtained diversity statistics for a subset of EST-SSRs, to support further population genetic analyses with gene-based markers. Finally, based on the orthologous relationships between the oak, Arabidopsis, grape, poplar, Medicago, and soybean genomes and the paralogous relationships between the 12 oak chromosomes, we propose an evolutionary scenario of the 12 oak chromosomes from the eudicot ancestral karyotype. Conclusions This study provides map locations for a large set of EST-SSRs in two oak species of recognized biological importance in natural ecosystems. This first step toward the construction of a gene-based linkage map will facilitate the assignment of future genome scaffolds to pseudo-chromosomes. This study also provides an indication of the potential utility of new gene-based markers for population genetics and comparative mapping within and beyond the Fagaceae. PMID:22931513
Khare, Sangeeta; Drake, Kenneth L.; Lawhon, Sara D.; Nunes, Jairo E. S.; Figueiredo, Josely F.; Rossetti, Carlos A.; Gull, Tamara; Everts, Robin E.; Lewin, Harris. A.; Adams, Leslie Garry
2016-01-01
It has long been a quest in ruminants to understand how two very similar mycobacterial species, Mycobacterium avium ssp. paratuberculosis (MAP) and Mycobacterium avium ssp. avium (MAA) lead to either a chronic persistent infection or a rapid-transient infection, respectively. Here, we hypothesized that when the host immune response is activated by MAP or MAA, the outcome of the infection depends on the early activation of signaling molecules and host temporal gene expression. To test our hypothesis, ligated jejuno-ileal loops including Peyer’s patches in neonatal calves were inoculated with PBS, MAP, or MAA. A temporal analysis of the host transcriptome profile was conducted at several times post-infection (0.5, 1, 2, 4, 8 and 12 hours). When comparing the transcriptional responses of calves infected with the MAA versus MAP, discordant patterns of mucosal expression were clearly evident, and the numbers of unique transcripts altered were moderately less for MAA-infected tissue than were mucosal tissues infected with the MAP. To interpret these complex data, changes in the gene expression were further analyzed by dynamic Bayesian analysis. Bayesian network modeling identified mechanistic genes, gene-to-gene relationships, pathways and Gene Ontologies (GO) biological processes that are involved in specific cell activation during infection. MAP and MAA had significant different pathway perturbation at 0.5 and 12 hours post inoculation. Inverse processes were observed between MAP and MAA response for epithelial cell proliferation, negative regulation of chemotaxis, cell-cell adhesion mediated by integrin and regulation of cytokine-mediated signaling. MAP inoculated tissue had significantly lower expression of phagocytosis receptors such as mannose receptor and complement receptors. This study reveals that perturbation of genes and cellular pathways during MAP infection resulted in host evasion by mucosal membrane barrier weakening to access entry in the ileum, inhibition of Ca signaling associated with decreased phagosome-lysosome fusion as well as phagocytosis inhibition, bias toward Th2 cell immune response accompanied by cell recruitment, cell proliferation and cell differentiation; leading to persistent infection. Contrarily, MAA infection was related to cellular responses associated with activation of molecular pathways that release chemicals and cytokines involved with containment of infection and a strong bias toward Th1 immune response, resulting in a transient infection. PMID:27653506
Khare, Sangeeta; Drake, Kenneth L; Lawhon, Sara D; Nunes, Jairo E S; Figueiredo, Josely F; Rossetti, Carlos A; Gull, Tamara; Everts, Robin E; Lewin, Harris A; Adams, Leslie Garry
It has long been a quest in ruminants to understand how two very similar mycobacterial species, Mycobacterium avium ssp. paratuberculosis (MAP) and Mycobacterium avium ssp. avium (MAA) lead to either a chronic persistent infection or a rapid-transient infection, respectively. Here, we hypothesized that when the host immune response is activated by MAP or MAA, the outcome of the infection depends on the early activation of signaling molecules and host temporal gene expression. To test our hypothesis, ligated jejuno-ileal loops including Peyer's patches in neonatal calves were inoculated with PBS, MAP, or MAA. A temporal analysis of the host transcriptome profile was conducted at several times post-infection (0.5, 1, 2, 4, 8 and 12 hours). When comparing the transcriptional responses of calves infected with the MAA versus MAP, discordant patterns of mucosal expression were clearly evident, and the numbers of unique transcripts altered were moderately less for MAA-infected tissue than were mucosal tissues infected with the MAP. To interpret these complex data, changes in the gene expression were further analyzed by dynamic Bayesian analysis. Bayesian network modeling identified mechanistic genes, gene-to-gene relationships, pathways and Gene Ontologies (GO) biological processes that are involved in specific cell activation during infection. MAP and MAA had significant different pathway perturbation at 0.5 and 12 hours post inoculation. Inverse processes were observed between MAP and MAA response for epithelial cell proliferation, negative regulation of chemotaxis, cell-cell adhesion mediated by integrin and regulation of cytokine-mediated signaling. MAP inoculated tissue had significantly lower expression of phagocytosis receptors such as mannose receptor and complement receptors. This study reveals that perturbation of genes and cellular pathways during MAP infection resulted in host evasion by mucosal membrane barrier weakening to access entry in the ileum, inhibition of Ca signaling associated with decreased phagosome-lysosome fusion as well as phagocytosis inhibition, bias toward Th2 cell immune response accompanied by cell recruitment, cell proliferation and cell differentiation; leading to persistent infection. Contrarily, MAA infection was related to cellular responses associated with activation of molecular pathways that release chemicals and cytokines involved with containment of infection and a strong bias toward Th1 immune response, resulting in a transient infection.
Construction of the first genetic linkage map of Japanese gentian (Gentianaceae)
2012-01-01
Background Japanese gentians (Gentiana triflora and Gentiana scabra) are amongst the most popular floricultural plants in Japan. However, genomic resources for Japanese gentians have not yet been developed, mainly because of the heterozygous genome structure conserved by outcrossing, the long juvenile period, and limited knowledge about the inheritance of important traits. In this study, we developed a genetic linkage map to improve breeding programs of Japanese gentians. Results Enriched simple sequence repeat (SSR) libraries from a G. triflora double haploid line yielded almost 20,000 clones using 454 pyrosequencing technology, 6.7% of which could be used to design SSR markers. To increase the number of molecular markers, we identified three putative long terminal repeat (LTR) sequences using the recently developed inter-primer binding site (iPBS) method. We also developed retrotransposon microsatellite amplified polymorphism (REMAP) markers combining retrotransposon and inter-simple sequence repeat (ISSR) markers. In addition to SSR and REMAP markers, modified amplified fragment length polymorphism (AFLP) and random amplification polymorphic DNA (RAPD) markers were developed. Using 93 BC1 progeny from G. scabra backcrossed with a G. triflora double haploid line, 19 linkage groups were constructed with a total of 263 markers (97 SSR, 97 AFLP, 39 RAPD, and 30 REMAP markers). One phenotypic trait (stem color) and 10 functional markers related to genes controlling flower color, flowering time and cold tolerance were assigned to the linkage map, confirming its utility. Conclusions This is the first reported genetic linkage map for Japanese gentians and for any species belonging to the family Gentianaceae. As demonstrated by mapping of functional markers and the stem color trait, our results will help to explain the genetic basis of agronomic important traits, and will be useful for marker-assisted selection in gentian breeding programs. Our map will also be an important resource for further genetic analyses such as mapping of quantitative trait loci and map-based cloning of genes in this species. PMID:23186361
2012-01-01
Background Single nucleotide polymorphism (SNP) validation and large-scale genotyping are required to maximize the use of DNA sequence variation and determine the functional relevance of candidate genes for complex stress tolerance traits through genetic association in rice. We used the bead array platform-based Illumina GoldenGate assay to validate and genotype SNPs in a select set of stress-responsive genes to understand their functional relevance and study the population structure in rice. Results Of the 384 putative SNPs assayed, we successfully validated and genotyped 362 (94.3%). Of these 325 (84.6%) showed polymorphism among the 91 rice genotypes examined. Physical distribution, degree of allele sharing, admixtures and introgression, and amino acid replacement of SNPs in 263 abiotic and 62 biotic stress-responsive genes provided clues for identification and targeted mapping of trait-associated genomic regions. We assessed the functional and adaptive significance of validated SNPs in a set of contrasting drought tolerant upland and sensitive lowland rice genotypes by correlating their allelic variation with amino acid sequence alterations in catalytic domains and three-dimensional secondary protein structure encoded by stress-responsive genes. We found a strong genetic association among SNPs in the nine stress-responsive genes with upland and lowland ecological adaptation. Higher nucleotide diversity was observed in indica accessions compared with other rice sub-populations based on different population genetic parameters. The inferred ancestry of 16% among rice genotypes was derived from admixed populations with the maximum between upland aus and wild Oryza species. Conclusions SNPs validated in biotic and abiotic stress-responsive rice genes can be used in association analyses to identify candidate genes and develop functional markers for stress tolerance in rice. PMID:22921105
Rebscher, Nicole; Deichmann, Christina; Sudhop, Stefanie; Fritzenwanker, Jens Holger; Green, Stephen; Hassel, Monika
2009-10-01
We have analyzed the evolution of fibroblast growth factor receptor (FGFR) tyrosine kinase genes throughout a wide range of animal phyla. No evidence for an FGFR gene was found in Porifera, but we tentatively identified an FGFR gene in the placozoan Trichoplax adhaerens. The gene encodes a protein with three immunoglobulin-like domains, a single-pass transmembrane, and a split tyrosine kinase domain. By superimposing intron positions of 20 FGFR genes from Placozoa, Cnidaria, Protostomia, and Deuterostomia over the respective protein domain structure, we identified ten ancestral introns and three conserved intron groups. Our analysis shows (1) that the position of ancestral introns correlates to the modular structure of FGFRs, (2) that the acidic domain very likely evolved in the last common ancestor of triploblasts, (3) that splicing of IgIII was enabled by a triploblast-specific insertion, and (4) that IgI is subject to substantial loss or duplication particularly in quickly evolving genomes. Moreover, intron positions in the catalytic domain of FGFRs map to the borders of protein subdomains highly conserved in other serine/threonine kinases. Nevertheless, these introns were introduced in metazoan receptor tyrosine kinases exclusively. Our data support the view that protein evolution dating back to the Cambrian explosion took place in such a short time window that only subtle changes in the domain structure are detectable in extant representatives of animal phyla. We propose that the first multidomain FGFR originated in the last common ancestor of Placozoa, Cnidaria, and Bilateria. Additional domains were introduced mainly in the ancestor of triploblasts and in the Ecdysozoa.
Fine Mapping of Resistance Genes from Five Brown Stem Rot Resistance Sources in Soybean.
Rincker, Keith; Hartman, Glen L; Diers, Brian W
2016-03-01
Brown stem rot (BSR) of soybean [ (L.) Merr.] caused by (Allington & Chamb.) T.C. Harr. & McNew can be controlled effectively with genetic host resistance. Three BSR resistance genes , , and , have been identified and mapped to a large region on chromosome 16. Marker-assisted selection (MAS) will be more efficient and gene cloning will be facilitated with a narrowed genomic interval containing an gene. The objective of this study was to fine map the positions of genes from five sources. Mapping populations were developed by crossing the resistant sources 'Bell', PI 84946-2, PI 437833, PI 437970, L84-5873, and PI 86150 with either the susceptible cultivar Colfax or Century 84. Plants identified as having a recombination event near genes were selected and individually harvested to create recombinant lines. Progeny from recombinant lines were tested in a root-dip assay and evaluated for foliar and stem BSR symptom development. Overall, 4878 plants were screened for recombination, and progeny from 52 recombinant plants were evaluated with simple-sequence repeat (SSR) genetic markers and assessed for symptom development. Brown stem rot resistance was mapped to intervals ranging from 0.34 to 0.04 Mb in the different sources. In all sources, resistance was fine mapped to intervals inclusive of BARCSOYSSR_16_1114 and BARCSOYSSR_16_1115, which provides further evidence that one locus provides BSR resistance in soybean. Copyright © 2016 Crop Science Society of America.
USDA-ARS?s Scientific Manuscript database
Molecular mapping of new blast resistance genes is important for developing resistant rice cultivars using marker-assisted selection. In this study, 259 recombinant inbred lines (RILs) were developed from a cross between Nipponbare and 93-11, and were used to construct a 1165.8-cM linkage map with 1...
A Plain English Map of the Human Glycolysis Enzymes.
ERIC Educational Resources Information Center
Offner, Susan
1999-01-01
Presents a plain English map of the gene coding for the glycolysis enzymes in humans to be used as a teaching tool. The map can be used to illustrate that every reaction in a cell requires an enzyme, and that every enzyme is a protein coded for by a gene somewhere on the chromosomes. (WRM)
Majoros, William H.; Campbell, Michael S.; Holt, Carson; DeNardo, Erin K.; Ware, Doreen; Allen, Andrew S.; Yandell, Mark; Reddy, Timothy E.
2017-01-01
Abstract Motivation: The accurate interpretation of genetic variants is critical for characterizing genotype–phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. Results: We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE (‘Assessing Changes to Exons’) converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. Availability and Implementation: ACE is written in open-source C ++ and Perl and is available from geneprediction.org/ACE Contact: myandell@genetics.utah.edu or tim.reddy@duke.edu Supplementary information: Supplementary information is available at Bioinformatics online. PMID:28011790
Majoros, William H; Campbell, Michael S; Holt, Carson; DeNardo, Erin K; Ware, Doreen; Allen, Andrew S; Yandell, Mark; Reddy, Timothy E
2017-05-15
The accurate interpretation of genetic variants is critical for characterizing genotype-phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE ('Assessing Changes to Exons') converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. ACE is written in open-source C ++ and Perl and is available from geneprediction.org/ACE. myandell@genetics.utah.edu or tim.reddy@duke.edu. Supplementary information is available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Feltus, F Alex
2014-06-01
Understanding the control of any trait optimally requires the detection of causal genes, gene interaction, and mechanism of action to discover and model the biochemical pathways underlying the expressed phenotype. Functional genomics techniques, including RNA expression profiling via microarray and high-throughput DNA sequencing, allow for the precise genome localization of biological information. Powerful genetic approaches, including quantitative trait locus (QTL) and genome-wide association study mapping, link phenotype with genome positions, yet genetics is less precise in localizing the relevant mechanistic information encoded in DNA. The coupling of salient functional genomic signals with genetically mapped positions is an appealing approach to discover meaningful gene-phenotype relationships. Techniques used to define this genetic-genomic convergence comprise the field of systems genetics. This short review will address an application of systems genetics where RNA profiles are associated with genetically mapped genome positions of individual genes (eQTL mapping) or as gene sets (co-expression network modules). Both approaches can be applied for knowledge independent selection of candidate genes (and possible control mechanisms) underlying complex traits where multiple, likely unlinked, genomic regions might control specific complex traits. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Gibbons, John G.; Beauvais, Anne; Beau, Remi; McGary, Kriston L.
2012-01-01
Aspergillus fumigatus is the most common and deadly pulmonary fungal infection worldwide. In the lung, the fungus usually forms a dense colony of filaments embedded in a polymeric extracellular matrix. To identify candidate genes involved in this biofilm (BF) growth, we used RNA-Seq to compare the transcriptomes of BF and liquid plankton (PL) growth. Sequencing and mapping of tens of millions sequence reads against the A. fumigatus transcriptome identified 3,728 differentially regulated genes in the two conditions. Although many of these genes, including the ones coding for transcription factors, stress response, the ribosome, and the translation machinery, likely reflect the different growth demands in the two conditions, our experiment also identified hundreds of candidate genes for the observed differences in morphology and pathobiology between BF and PL. We found an overrepresentation of upregulated genes in transport, secondary metabolism, and cell wall and surface functions. Furthermore, upregulated genes showed significant spatial structure across the A. fumigatus genome; they were more likely to occur in subtelomeric regions and colocalized in 27 genomic neighborhoods, many of which overlapped with known or candidate secondary metabolism gene clusters. We also identified 1,164 genes that were downregulated. This gene set was not spatially structured across the genome and was overrepresented in genes participating in primary metabolic functions, including carbon and amino acid metabolism. These results add valuable insight into the genetics of biofilm formation in A. fumigatus and other filamentous fungi and identify many relevant, in the context of biofilm biology, candidate genes for downstream functional experiments. PMID:21724936
Tulpová, Zuzana; Luo, Ming-Cheng; Toegelová, Helena; Visendi, Paul; Hayashi, Satomi; Vojta, Petr; Paux, Etienne; Kilian, Andrzej; Abrouk, Michaël; Bartoš, Jan; Hajdúch, Marián; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana
2018-03-08
Bread wheat (Triticum aestivum L.) is a staple food for a significant part of the world's population. The growing demand on its production can be satisfied by improving yield and resistance to biotic and abiotic stress. Knowledge of the genome sequence would aid in discovering genes and QTLs underlying these traits and provide a basis for genomics-assisted breeding. Physical maps and BAC clones associated with them have been valuable resources from which to generate a reference genome of bread wheat and to assist map-based gene cloning. As a part of a joint effort coordinated by the International Wheat Genome Sequencing Consortium, we have constructed a BAC-based physical map of bread wheat chromosome arm 7DS consisting of 895 contigs and covering 94% of its estimated length. By anchoring BAC contigs to one radiation hybrid map and three high resolution genetic maps, we assigned 73% of the assembly to a distinct genomic position. This map integration, interconnecting a total of 1713 markers with ordered and sequenced BAC clones from a minimal tiling path, provides a tool to speed up gene cloning in wheat. The process of physical map assembly included the integration of the 7DS physical map with a whole-genome physical map of Aegilops tauschii and a 7DS Bionano genome map, which together enabled efficient scaffolding of physical-map contigs, even in the non-recombining region of the genetic centromere. Moreover, this approach facilitated a comparison of bread wheat and its ancestor at BAC-contig level and revealed a reconstructed region in the 7DS pericentromere. Copyright © 2018. Published by Elsevier B.V.
cudaMap: a GPU accelerated program for gene expression connectivity mapping.
McArt, Darragh G; Bankhead, Peter; Dunne, Philip D; Salto-Tellez, Manuel; Hamilton, Peter; Zhang, Shu-Dong
2013-10-11
Modern cancer research often involves large datasets and the use of sophisticated statistical techniques. Together these add a heavy computational load to the analysis, which is often coupled with issues surrounding data accessibility. Connectivity mapping is an advanced bioinformatic and computational technique dedicated to therapeutics discovery and drug re-purposing around differential gene expression analysis. On a normal desktop PC, it is common for the connectivity mapping task with a single gene signature to take > 2h to complete using sscMap, a popular Java application that runs on standard CPUs (Central Processing Units). Here, we describe new software, cudaMap, which has been implemented using CUDA C/C++ to harness the computational power of NVIDIA GPUs (Graphics Processing Units) to greatly reduce processing times for connectivity mapping. cudaMap can identify candidate therapeutics from the same signature in just over thirty seconds when using an NVIDIA Tesla C2050 GPU. Results from the analysis of multiple gene signatures, which would previously have taken several days, can now be obtained in as little as 10 minutes, greatly facilitating candidate therapeutics discovery with high throughput. We are able to demonstrate dramatic speed differentials between GPU assisted performance and CPU executions as the computational load increases for high accuracy evaluation of statistical significance. Emerging 'omics' technologies are constantly increasing the volume of data and information to be processed in all areas of biomedical research. Embracing the multicore functionality of GPUs represents a major avenue of local accelerated computing. cudaMap will make a strong contribution in the discovery of candidate therapeutics by enabling speedy execution of heavy duty connectivity mapping tasks, which are increasingly required in modern cancer research. cudaMap is open source and can be freely downloaded from http://purl.oclc.org/NET/cudaMap.
Evidence for Transcript Networks Composed of Chimeric RNAs in Human Cells
Borel, Christelle; Mudge, Jonathan M.; Howald, Cédric; Foissac, Sylvain; Ucla, Catherine; Chrast, Jacqueline; Ribeca, Paolo; Martin, David; Murray, Ryan R.; Yang, Xinping; Ghamsari, Lila; Lin, Chenwei; Bell, Ian; Dumais, Erica; Drenkow, Jorg; Tress, Michael L.; Gelpí, Josep Lluís; Orozco, Modesto; Valencia, Alfonso; van Berkum, Nynke L.; Lajoie, Bryan R.; Vidal, Marc; Stamatoyannopoulos, John; Batut, Philippe; Dobin, Alex; Harrow, Jennifer; Hubbard, Tim; Dekker, Job; Frankish, Adam; Salehi-Ashtiani, Kourosh; Reymond, Alexandre; Antonarakis, Stylianos E.; Guigó, Roderic; Gingeras, Thomas R.
2012-01-01
The classic organization of a gene structure has followed the Jacob and Monod bacterial gene model proposed more than 50 years ago. Since then, empirical determinations of the complexity of the transcriptomes found in yeast to human has blurred the definition and physical boundaries of genes. Using multiple analysis approaches we have characterized individual gene boundaries mapping on human chromosomes 21 and 22. Analyses of the locations of the 5′ and 3′ transcriptional termini of 492 protein coding genes revealed that for 85% of these genes the boundaries extend beyond the current annotated termini, most often connecting with exons of transcripts from other well annotated genes. The biological and evolutionary importance of these chimeric transcripts is underscored by (1) the non-random interconnections of genes involved, (2) the greater phylogenetic depth of the genes involved in many chimeric interactions, (3) the coordination of the expression of connected genes and (4) the close in vivo and three dimensional proximity of the genomic regions being transcribed and contributing to parts of the chimeric RNAs. The non-random nature of the connection of the genes involved suggest that chimeric transcripts should not be studied in isolation, but together, as an RNA network. PMID:22238572
Rouppe van der Voort, J N; van Eck, H J; van Zandvoort, P M; Overmars, H; Helder, J; Bakker, J
1999-07-01
A mapping strategy is described for the construction of a linkage map of a non-inbred species in which individual offspring genotypes are not amenable to marker analysis. After one extra generation of random mating, the segregating progeny was propagated, and bulked populations of offspring were analyzed. Although the resulting population structure is different from that of commonly used mapping populations, we show that the maximum likelihood formula for a normal F2 is applicable for the estimation of recombination. This "pseudo-F2" mapping strategy, in combination with the development of an AFLP assay for single cysts, facilitated the construction of a linkage map for the potato cyst nematode Globodera rostochiensis. Using 12 pre-selected AFLP primer combinations, a total of 66 segregating markers were identified, 62 of which were mapped to nine linkage groups. These 62 AFLP markers are randomly distributed and cover about 65% of the genome. An estimate of the physical size of the Globodera genome was obtained from comparisons of the number of AFLP fragments obtained with the values for Caenorhabditis elegans. The methodology presented here resulted in the first genomic map for a cyst nematode. The low value of the kilobase/centimorgan (kb/cM) ratio for the Globodera genome will facilitate map-based cloning of genes that mediate the interaction between the nematode and its host plant.
Johnston, Christopher; Douarre, Pierre E; Soulimane, Tewfik; Pletzer, Daniel; Weingart, Helge; MacSharry, John; Coffey, Aidan; Sleator, Roy D; O'Mahony, Jim
2013-06-01
Subunit and DNA-based vaccines against Mycobacterium avium ssp. paratuberculosis (MAP) attempt to overcome inherent issues associated with whole-cell formulations. However, these vaccines can be hampered by poor expression of recombinant antigens from a number of disparate hosts. The high G+C content of MAP invariably leads to a codon bias throughout gene expression. To investigate if the codon bias affects recombinant MAP antigen expression, the open reading frame of a MAP-specific antigen MptD (MAP3733c) was codon optimised for expression against a Lactobacillus salivarius host. Of the total 209 codons which constitute MAP3733c, 172 were modified resulting in a reduced G+C content from 61% for the native gene to 32.7% for the modified form. Both genes were placed under the transcriptional control of the PnisA promoter; allowing controlled heterologous expression in L. salivarius. Expression was monitored using fluorescence microscopy and microplate fluorometry via GFP tags translationally fused to the C-termini of the two MptD genes. A > 37-fold increase in expression was observed for the codon-optimised MAP3733synth variant over the native gene. Due to the low cost and improved expression achieved, codon optimisation significantly improves the potential of L. salivarius as an oral vaccine stratagem against Johne's disease. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Chhuneja, Parveen; Yadav, Bharat; Stirnweis, Daniel; Hurni, Severine; Kaur, Satinder; Elkot, Ahmed Fawzy; Keller, Beat; Wicker, Thomas; Sehgal, Sunish; Gill, Bikram S; Singh, Kuldeep
2015-10-01
A novel powdery mildew resistance gene and a new allele of Pm1 were identified and fine mapped. DNA markers suitable for marker-assisted selection have been identified. Powdery mildew caused by Blumeria graminis is one of the most important foliar diseases of wheat and causes significant yield losses worldwide. Diploid A genome species are an important genetic resource for disease resistance genes. Two powdery mildew resistance genes, identified in Triticum boeoticum (A(b)A(b)) accession pau5088, PmTb7A.1 and PmTb7A.2 were mapped on chromosome 7AL. In the present study, shotgun sequence assembly data for chromosome 7AL were utilised for fine mapping of these Pm resistance genes. Forty SSR, 73 resistance gene analogue-based sequence-tagged sites (RGA-STS) and 36 single nucleotide polymorphism markers were designed for fine mapping of PmTb7A.1 and PmTb7A.2. Twenty-one RGA-STS, 8 SSR and 13 SNP markers were mapped to 7AL. RGA-STS markers Ta7AL-4556232 and 7AL-4426363 were linked to the PmTb7A.1 and PmTb7A.2, at a genetic distance of 0.6 and 6.0 cM, respectively. The present investigation established that PmTb7A.1 is a new powdery mildew resistance gene that confers resistance to a broad range of Bgt isolates, whereas PmTb7A.2 most probably is a new allele of Pm1 based on chromosomal location and screening with Bgt isolates showing differential reaction on lines with different Pm1 alleles. The markers identified to be linked to the two Pm resistance genes are robust and can be used for marker-assisted introgression of these genes to hexaploid wheat.
Berdugo-Cely, Jhon; Valbuena, Raúl Iván; Sánchez-Betancourt, Erika; Barrero, Luz Stella; Yockteng, Roxana
2017-01-01
The potato (Solanum tuberosum L.) is the fourth most important crop food in the world and Colombia has one of the most important collections of potato germplasm in the world (the Colombian Central Collection-CCC). Little is known about its potential as a source of genetic diversity for molecular breeding programs. In this study, we analyzed 809 Andigenum group accessions from the CCC using 5968 SNPs to determine: 1) the genetic diversity and population structure of the Andigenum germplasm and 2) the usefulness of this collection to map qualitative traits across the potato genome. The genetic structure analysis based on principal components, cluster analyses, and Bayesian inference revealed that the CCC can be subdivided into two main groups associated with their ploidy level: Phureja (diploid) and Andigena (tetraploid). The Andigena population was more genetically diverse but less genetically substructured than the Phureja population (three vs. five subpopulations, respectively). The association mapping analysis of qualitative morphological data using 4666 SNPs showed 23 markers significantly associated with nine morphological traits. The present study showed that the CCC is a highly diverse germplasm collection genetically and phenotypically, useful to implement association mapping in order to identify genes related to traits of interest and to assist future potato genetic breeding programs.
Berdugo-Cely, Jhon; Valbuena, Raúl Iván; Sánchez-Betancourt, Erika; Barrero, Luz Stella
2017-01-01
The potato (Solanum tuberosum L.) is the fourth most important crop food in the world and Colombia has one of the most important collections of potato germplasm in the world (the Colombian Central Collection-CCC). Little is known about its potential as a source of genetic diversity for molecular breeding programs. In this study, we analyzed 809 Andigenum group accessions from the CCC using 5968 SNPs to determine: 1) the genetic diversity and population structure of the Andigenum germplasm and 2) the usefulness of this collection to map qualitative traits across the potato genome. The genetic structure analysis based on principal components, cluster analyses, and Bayesian inference revealed that the CCC can be subdivided into two main groups associated with their ploidy level: Phureja (diploid) and Andigena (tetraploid). The Andigena population was more genetically diverse but less genetically substructured than the Phureja population (three vs. five subpopulations, respectively). The association mapping analysis of qualitative morphological data using 4666 SNPs showed 23 markers significantly associated with nine morphological traits. The present study showed that the CCC is a highly diverse germplasm collection genetically and phenotypically, useful to implement association mapping in order to identify genes related to traits of interest and to assist future potato genetic breeding programs. PMID:28257509
Casey, Maura E; Meade, Kieran G; Nalpas, Nicolas C; Taraktsoglou, Maria; Browne, John A; Killick, Kate E; Park, Stephen D E; Gormley, Eamonn; Hokamp, Karsten; Magee, David A; MacHugh, David E
2015-01-01
Johne's disease, caused by infection with Mycobacterium avium subsp. paratuberculosis, (MAP), is a chronic intestinal disease of ruminants with serious economic consequences for cattle production in the United States and elsewhere. During infection, MAP bacilli are phagocytosed and subvert host macrophage processes, resulting in subclinical infections that can lead to immunopathology and dissemination of disease. Analysis of the host macrophage transcriptome during infection can therefore shed light on the molecular mechanisms and host-pathogen interplay associated with Johne's disease. Here, we describe results of an in vitro study of the bovine monocyte-derived macrophage (MDM) transcriptome response during MAP infection using RNA-seq. MDM were obtained from seven age- and sex-matched Holstein-Friesian cattle and were infected with MAP across a 6-h infection time course with non-infected controls. We observed 245 and 574 differentially expressed (DE) genes in MAP-infected versus non-infected control samples (adjusted P value ≤0.05) at 2 and 6 h post-infection, respectively. Functional analyses of these DE genes, including biological pathway enrichment, highlighted potential functional roles for genes that have not been previously described in the host response to infection with MAP bacilli. In addition, differential expression of pro- and anti-inflammatory cytokine genes, such as those associated with the IL-10 signaling pathway, and other immune-related genes that encode proteins involved in the bovine macrophage response to MAP infection emphasize the balance between protective host immunity and bacilli survival and proliferation. Systematic comparisons of RNA-seq gene expression results with Affymetrix(®) microarray data generated from the same experimental samples also demonstrated that RNA-seq represents a superior technology for studying host transcriptional responses to intracellular infection.
Casey, Maura E.; Meade, Kieran G.; Nalpas, Nicolas C.; Taraktsoglou, Maria; Browne, John A.; Killick, Kate E.; Park, Stephen D. E.; Gormley, Eamonn; Hokamp, Karsten; Magee, David A.; MacHugh, David E.
2015-01-01
Johne’s disease, caused by infection with Mycobacterium avium subsp. paratuberculosis, (MAP), is a chronic intestinal disease of ruminants with serious economic consequences for cattle production in the United States and elsewhere. During infection, MAP bacilli are phagocytosed and subvert host macrophage processes, resulting in subclinical infections that can lead to immunopathology and dissemination of disease. Analysis of the host macrophage transcriptome during infection can therefore shed light on the molecular mechanisms and host-pathogen interplay associated with Johne’s disease. Here, we describe results of an in vitro study of the bovine monocyte-derived macrophage (MDM) transcriptome response during MAP infection using RNA-seq. MDM were obtained from seven age- and sex-matched Holstein-Friesian cattle and were infected with MAP across a 6-h infection time course with non-infected controls. We observed 245 and 574 differentially expressed (DE) genes in MAP-infected versus non-infected control samples (adjusted P value ≤0.05) at 2 and 6 h post-infection, respectively. Functional analyses of these DE genes, including biological pathway enrichment, highlighted potential functional roles for genes that have not been previously described in the host response to infection with MAP bacilli. In addition, differential expression of pro- and anti-inflammatory cytokine genes, such as those associated with the IL-10 signaling pathway, and other immune-related genes that encode proteins involved in the bovine macrophage response to MAP infection emphasize the balance between protective host immunity and bacilli survival and proliferation. Systematic comparisons of RNA-seq gene expression results with Affymetrix® microarray data generated from the same experimental samples also demonstrated that RNA-seq represents a superior technology for studying host transcriptional responses to intracellular infection. PMID:25699042
A New Chicken Genome Assembly Provides Insight into Avian Genome Structure.
Warren, Wesley C; Hillier, LaDeana W; Tomlinson, Chad; Minx, Patrick; Kremitzki, Milinn; Graves, Tina; Markovic, Chris; Bouk, Nathan; Pruitt, Kim D; Thibaud-Nissen, Francoise; Schneider, Valerie; Mansour, Tamer A; Brown, C Titus; Zimin, Aleksey; Hawken, Rachel; Abrahamsen, Mitch; Pyrkosz, Alexis B; Morisson, Mireille; Fillon, Valerie; Vignal, Alain; Chow, William; Howe, Kerstin; Fulton, Janet E; Miller, Marcia M; Lovell, Peter; Mello, Claudio V; Wirthlin, Morgan; Mason, Andrew S; Kuo, Richard; Burt, David W; Dodgson, Jerry B; Cheng, Hans H
2017-01-05
The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3), built from combined long single molecule sequencing technology, finished BACs, and improved physical maps. In overall assembled bases, we see a gain of 183 Mb, including 16.4 Mb in placed chromosomes with a corresponding gain in the percentage of intact repeat elements characterized. Of the 1.21 Gb genome, we include three previously missing autosomes, GGA30, 31, and 33, and improve sequence contig length 10-fold over the previous Gallus_gallus-4.0. Despite the significant base representation improvements made, 138 Mb of sequence is not yet located to chromosomes. When annotated for gene content, Gallus_gallus-5.0 shows an increase of 4679 annotated genes (2768 noncoding and 1911 protein-coding) over those in Gallus_gallus-4.0. We also revisited the question of what genes are missing in the avian lineage, as assessed by the highest quality avian genome assembly to date, and found that a large fraction of the original set of missing genes are still absent in sequenced bird species. Finally, our new data support a detailed map of MHC-B, encompassing two segments: one with a highly stable gene copy number and another in which the gene copy number is highly variable. The chicken model has been a critical resource for many other fields of study, and this new reference assembly will substantially further these efforts. Copyright © 2017 Warren et al.
Guyon, Richard; Senger, Fabrice; Rakotomanga, Michaelle; Sadequi, Naoual; Volckaert, Filip A M; Hitte, Christophe; Galibert, Francis
2010-10-01
The selective breeding of fish for aquaculture purposes requires the understanding of the genetic basis of traits such as growth, behaviour, resistance to pathogens and sex determinism. Access to well-developed genomic resources is a prerequisite to improve the knowledge of these traits. Having this aim in mind, a radiation hybrid (RH) panel of European sea bass (Dicentrarchus labrax) was constructed from splenocytes irradiated at 3000 rad, allowing the construction of a 1581 marker RH map. A total of 1440 gene markers providing ~4400 anchors with the genomes of three-spined stickleback, medaka, pufferfish and zebrafish, helped establish synteny relationships with these model species. The identification of Conserved Segments Ordered (CSO) between sea bass and model species allows the anticipation of the position of any sea bass gene from its location in model genomes. Synteny relationships between sea bass and gilthead seabream were addressed by mapping 37 orthologous markers. The sea bass genetic linkage map was integrated in the RH map through the mapping of 141 microsatellites. We are thus able to present the first complete gene map of sea bass. It will facilitate linkage studies and the identification of candidate genes and Quantitative Trait Loci (QTL). The RH map further positions sea bass as a genetic and evolutionary model of Perciformes and supports their ongoing aquaculture expansion. Copyright © 2010 Elsevier Inc. All rights reserved.
Cloud computing-based TagSNP selection algorithm for human genome data.
Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling
2015-01-05
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.
Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data
Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling
2015-01-01
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used. PMID:25569088
Toor, Puneet Inder; Kaur, Satinder; Bansal, Mitaly; Yadav, Bharat; Chhuneja, Parveen
2016-12-01
A pair of stripe rust and leaf rust resistance genes was introgressed from Aegilops caudata, a nonprogenitor diploid species with the CC genome, to cultivated wheat. Inheritance and genetic mapping of stripe rust resistance gene in backcrossrecombinant inbred line (BC-RIL) population derived from the cross of a wheat-Ae. caudata introgression line (IL) T291- 2(pau16060) with wheat cv. PBW343 is reported here. Segregation of BC-RILs for stripe rust resistance depicted a single major gene conditioning adult plant resistance (APR) with stripe rust reaction varying from TR-20MS in resistant RILs signifying the presence of some minor genes as well. Genetic association with leaf rust resistance revealed that two genes are located at a recombination distance of 13%. IL T291-2 had earlier been reported to carry introgressions on wheat chromosomes 2D, 3D, 4D, 5D, 6D and 7D. Genetic mapping indicated the introgression of stripe rust resistance gene on wheat chromosome 5DS in the region carrying leaf rust resistance gene LrAc, but as an independent introgression. Simple sequence repeat (SSR) and sequence-tagged site (STS) markers designed from the survey sequence data of 5DS enriched the target region harbouring stripe and leaf rust resistance genes. Stripe rust resistance locus, temporarily designated as YrAc, mapped at the distal most end of 5DS linked with a group of four colocated SSRs and two resistance gene analogue (RGA)-STS markers at a distance of 5.3 cM. LrAc mapped at a distance of 9.0 cM from the YrAc and at 2.8 cM from RGA-STS marker Ta5DS_2737450, YrAc and LrAc appear to be the candidate genes for marker-assisted enrichment of the wheat gene pool for rust resistance.
Le Cunff, Loïc; Garsmeur, Olivier; Raboin, Louis Marie; Pauquet, Jérome; Telismart, Hugues; Selvi, Athiappan; Grivet, Laurent; Philippe, Romain; Begum, Dilara; Deu, Monique; Costet, Laurent; Wing, Rod; Glaszmann, Jean Christophe; D'Hont, Angélique
2008-01-01
The genome of modern sugarcane cultivars is highly polyploid (∼12x), aneuploid, of interspecific origin, and contains 10 Gb of DNA. Its size and complexity represent a major challenge for the isolation of agronomically important genes. Here we report on the first attempt to isolate a gene from sugarcane by map-based cloning, targeting a durable major rust resistance gene (Bru1). We describe the genomic strategies that we have developed to overcome constraints associated with high polyploidy in the successive steps of map-based cloning approaches, including diploid/polyploid syntenic shuttle mapping with two model diploid species (sorghum and rice) and haplotype-specific chromosome walking. Their applications allowed us (i) to develop a high-resolution map including markers at 0.28 and 0.14 cM on both sides and 13 markers cosegregating with Bru1 and (ii) to develop a physical map of the target haplotype that still includes two gaps at this stage due to the discovery of an insertion specific to this haplotype. These approaches will pave the way for the development of future map-based cloning approaches for sugarcane and other complex polyploid species. PMID:18757946
Jenkins, Z A; Henry, H M; Galloway, S M; Dodds, K G; Montgomery, G W
1997-01-01
Three genes--parathyroid hormone-like hormone (PTHLH), insulin-like growth factor 1 (IGF 1), and retinoic acid receptor gamma (RARG)--have been mapped to sheep (Ovis aries) chromosome 3 (OAR 3). The order and genetic distances between loci on OAR 3 are similar to those on cattle (Bos taurus) chromosome 5, as expected from their close evolutionary relationship. The OAR 3 linkage map shows conserved synteny with human chromosome 12, but there are at least two rearrangements in gene order between the species.
The gene coding for glial cell line derived neurotrophic factor (GDNF) maps to chromosome 5p12-p13.1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schindelhauer, D.; Schuffenhauer, S.; Meitinger, T.
1995-08-10
The gene coding for glial cell line derived neurotrophic factor (GDNF) has biological properties that may have potential as a treatment for Parkinson`s and motoneuron diseases. Using the NIGMS Mapping Panel 2, we have localized the GDNF gene to human chromosome 5p12-p13.1. Large NruI and NotI fragments on chromosome 5 will facilitate the construction of a long-range map of the region. 26 refs., 1 fig., 1 tab.
Genomic stability in the archaeae Haloferax volcanii and Haloferax mediterranei.
López-García, P; St Jean, A; Amils, R; Charlebois, R L
1995-01-01
Through hybridization of available probes, we have added nine genes to the macrorestriction map of the Haloferax mediterranei chromosome and five genes to the contig map of Haloferax volcanii. Additionally, we hybridized 17 of the mapped cosmid clones from H. volcanii to the H. mediterranei genome. The resulting 35-point chromosomal comparison revealed only two inversions and a few translocations. Forces known to promote rearrangement, common in the haloarchaea, have been ineffective in changing global gene order throughout the nearly 10(7) years of these species' divergent evolution. PMID:7868620
Abdel Moniem, H E M; Schemerhorn, B J; DeWoody, J A; Holland, J D
2016-10-01
Landscape connectivity, the degree to which the landscape structure facilitates or impedes organismal movement and gene flow, is increasingly important to conservationists and land managers. Metrics for describing the undulating shape of continuous habitat surfaces can expand the usefulness of continuous gradient surfaces that describe habitat and predict the flow of organisms and genes. We adopted a landscape gradient model of habitat and used surface metrics of connectivity to model the genetic continuity between populations of the banded longhorn beetle [Typocerus v. velutinus (Olivier)] collected at 17 sites across a fragmentation gradient in Indiana, USA. We tested the hypothesis that greater habitat connectivity facilitates gene flow between beetle populations against a null model of isolation by distance (IBD). We used next-generation sequencing to develop 10 polymorphic microsatellite loci and genotype the individual beetles to assess the population genetic structure. Isolation by distance did not explain the population genetic structure. The surface metrics model of habitat connectivity explained the variance in genetic dissimilarities 30 times better than the IBD model. We conclude that surface metrology of habitat maps is a powerful extension of landscape genetics in heterogeneous landscapes. © 2016 John Wiley & Sons Ltd.
Mapping the core of the Arabidopsis circadian clock defines the network structure of the oscillator.
Huang, W; Pérez-García, P; Pokhilko, A; Millar, A J; Antoshechkin, I; Riechmann, J L; Mas, P
2012-04-06
In many organisms, the circadian clock is composed of functionally coupled morning and evening oscillators. In Arabidopsis, oscillator coupling relies on a core loop in which the evening oscillator component TIMING OF CAB EXPRESSION 1 (TOC1) was proposed to activate a subset of morning-expressed oscillator genes. Here, we show that TOC1 does not function as an activator but rather as a general repressor of oscillator gene expression. Repression occurs through TOC1 rhythmic association to the promoters of the oscillator genes. Hormone-dependent induction of TOC1 and analysis of RNA interference plants show that TOC1 prevents the activation of morning-expressed genes at night. Our study overturns the prevailing model of the Arabidopsis circadian clock, showing that the morning and evening oscillator loops are connected through the repressing activity of TOC1.
Rykowski, M C; Parmelee, S J; Agard, D A; Sedat, J W
1988-08-12
We have aligned the molecular map of the Notch locus to the cytological features of the salivary gland polytene chromosomes of D. melanogaster in order to determine the interphase chromatin structure of this gene. Using high-resolution in situ hybridization and computer-aided optical microscope data collection and image analysis, we have determined that the coding portions and introns of the Notch gene, which is not expressed in this tissue, are all contained within the polytene chromosome band 3C7. The portion of the Notch gene that resides 5' to the start of transcription lies in an open chromatin conformation, the interband between bands 3C6 and 3C7. Our data are most consistent with condensation of the chromosomal DNA into 30 nm fibers in this polytene band.
Mapping the malaria parasite druggable genome by using in vitro evolution and chemogenomics.
Cowell, Annie N; Istvan, Eva S; Lukens, Amanda K; Gomez-Lorenzo, Maria G; Vanaerschot, Manu; Sakata-Kato, Tomoyo; Flannery, Erika L; Magistrado, Pamela; Owen, Edward; Abraham, Matthew; LaMonte, Gregory; Painter, Heather J; Williams, Roy M; Franco, Virginia; Linares, Maria; Arriaga, Ignacio; Bopp, Selina; Corey, Victoria C; Gnädig, Nina F; Coburn-Flynn, Olivia; Reimer, Christin; Gupta, Purva; Murithi, James M; Moura, Pedro A; Fuchs, Olivia; Sasaki, Erika; Kim, Sang W; Teng, Christine H; Wang, Lawrence T; Akidil, Aslı; Adjalley, Sophie; Willis, Paul A; Siegel, Dionicio; Tanaseichuk, Olga; Zhong, Yang; Zhou, Yingyao; Llinás, Manuel; Ottilie, Sabine; Gamo, Francisco-Javier; Lee, Marcus C S; Goldberg, Daniel E; Fidock, David A; Wirth, Dyann F; Winzeler, Elizabeth A
2018-01-12
Chemogenetic characterization through in vitro evolution combined with whole-genome analysis can identify antimalarial drug targets and drug-resistance genes. We performed a genome analysis of 262 Plasmodium falciparum parasites resistant to 37 diverse compounds. We found 159 gene amplifications and 148 nonsynonymous changes in 83 genes associated with drug-resistance acquisition, where gene amplifications contributed to one-third of resistance acquisition events. Beyond confirming previously identified multidrug-resistance mechanisms, we discovered hitherto unrecognized drug target-inhibitor pairs, including thymidylate synthase and a benzoquinazolinone, farnesyltransferase and a pyrimidinedione, and a dipeptidylpeptidase and an arylurea. This exploration of the P. falciparum resistome and druggable genome will likely guide drug discovery and structural biology efforts, while also advancing our understanding of resistance mechanisms available to the malaria parasite. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
2011-01-01
Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005
Deciphering RNA regulatory elements in trypanosomatids: one piece at a time or genome-wide?
Gazestani, Vahid H; Lu, Zhiquan; Salavati, Reza
2014-05-01
Morphological and metabolic changes in the life cycle of Trypanosoma brucei are accomplished by precise regulation of hundreds of genes. In the absence of transcriptional control, RNA-binding proteins (RBPs) shape the structure of gene regulatory maps in this organism, but our knowledge about their target RNAs, binding sites, and mechanisms of action is far from complete. Although recent technological advances have revolutionized the RBP-based approaches, the main framework for the RNA regulatory element (RRE)-based approaches has not changed over the last two decades in T. brucei. In this Opinion, after highlighting the current challenges in RRE inference, we explain some genome-wide solutions that can significantly boost our current understanding about gene regulatory networks in T. brucei. Copyright © 2014 Elsevier Ltd. All rights reserved.
Sodium Channel Mutations and Susceptibility to Heart Failure and Atrial Fibrillation
Olson, Timothy M.; Michels, Virginia V.; Ballew, Jeffrey D.; Reyna, Sandra P.; Karst, Margaret L.; Herron, Kathleen J.; Horton, Steven C.; Rodeheffer, Richard J.; Anderson, Jeffrey L.
2007-01-01
Context Dilated cardiomyopathy (DCM), a genetically heterogeneous disorder, causes heart failure and rhythm disturbances. The majority of identified DCM genes encode structural proteins of the contractile apparatus and cytoskeleton. Recently, genetic defects in calcium and potassium regulation have been discovered in patients with DCM, implicating an alternative disease mechanism. The full spectrum of genetic defects in DCM, however, has not been established. Objectives To identify a novel gene for DCM at a previously mapped locus, define the spectrum of mutations in this gene within a DCM cohort, and determine the frequency of DCM among relatives inheriting a mutation in this gene. Design, Setting, and Participants Refined mapping of a DCM locus on chromosome 3p in a multigenerational family and mutation scanning in 156 unrelated pro-bands with DCM, prospectively identified at the Mayo Clinic between 1987 and 2004. Relatives underwent screening echocardiography and electrocardiography and DNA sample procurement. Main Outcome Measure Correlation of identified mutations with cardiac phenotype. Results Refined locus mapping revealed SCN5A, encoding the cardiac sodium channel, as a candidate gene. Mutation scans identified a missense mutation (D1275N) that cosegregated with an age-dependent, variably expressed phenotype of DCM, atrial fibrillation, impaired automaticity, and conduction delay. In the DCM cohort, additional missense (T220I, R814W, D1595H) and truncation (2550-2551insTG) SCN5A mutations, segregating with cardiac disease or arising de novo, were discovered in unrelated probands. Among individuals with an SCN5A mutation 27% had early features of DCM (mean age at diagnosis, 20.3 years), 38% had DCM (mean age at diagnosis, 47.9 years), and 43% had atrial fibrillation (mean age at diagnosis, 27.8 years). Conclusions Heritable SCN5A defects are associated with susceptibility to early-onset DCM and atrial fibrillation. Similar or even identical mutations may lead to heart failure, arrhythmia, or both. PMID:15671429
Yuan, Congying; Wang, Meinan; Skinner, Danniel Z; See, Deven R; Xia, Chongjing; Guo, Xinhong; Chen, Xianming
2018-01-01
Puccinia striiformis f. sp. tritici, the wheat stripe rust pathogen, is a dikaryotic, biotrophic, and macrocyclic fungus. Genetic study of P. striiformis f. sp. tritici virulence was not possible until the recent discovery of Berberis spp. and Mahonia spp. as alternate hosts. To determine inheritance of virulence and map virulence genes, a segregating population of 119 isolates was developed by self-fertilizing P. striiformis f. sp. tritici isolate 08-220 (race PSTv-11) on barberry leaves under controlled greenhouse conditions. The progeny isolates were phenotyped on a set of 29 wheat lines with single genes for race-specific resistance and genotyped with simple sequence repeat (SSR) markers, single nucleotide polymorphism (SNP) markers derived from secreted protein genes, and SNP markers from genotyping-by-sequencing (GBS). Using the GBS technique, 10,163 polymorphic GBS-SNP markers were identified. Clustering and principal component analysis grouped these markers into six genetic groups, and a genetic map, consisting of six linkage groups, was constructed with 805 markers. The six clusters or linkage groups resulting from these analyses indicated a haploid chromosome number of six in P. striiformis f. sp. tritici. Through virulence testing of the progeny isolates, the parental isolate was found to be homozygous for the avirulence loci corresponding to resistance genes Yr5, Yr10, Yr15, Yr24, Yr32, YrSP, YrTr1, Yr45, and Yr53 and homozygous for the virulence locus corresponding to resistance gene Yr41. Segregation was observed for virulence phenotypes in response to the remaining 19 single-gene lines. A single dominant gene or two dominant genes with different nonallelic gene interactions were identified for each of the segregating virulence phenotypes. Of 27 dominant virulence genes identified, 17 were mapped to two chromosomes. Markers tightly linked to some of the virulence loci may facilitate further studies to clone these genes. The virulence genes and their inheritance information are useful for understanding the host-pathogen interactions and for selecting effective resistance genes or gene combinations for developing stripe rust resistant wheat cultivars.
Raboanatahiry, Nadia; Chao, Hongbo; Guo, Liangxing; Gan, Jianping; Xiang, Jun; Yan, Mingli; Zhang, Libin; Yu, Longjiang; Li, Maoteng
2017-10-12
Deciphering the genetic architecture of a species is a good way to understand its evolutionary history, but also to tailor its profile for breeding elite cultivars with desirable traits. Aligning QTLs from diverse population in one map and utilizing it for comparison, but also as a basis for multiple analyses assure a stronger evidence to understand the genetic system related to a given phenotype. In this study, 439 genes involved in fatty acid (FA) and triacylglycerol (TAG) biosyntheses were identified in Brassica napus. B. napus genome showed mixed gene loss and insertion compared to B. rapa and B. oleracea, and C genome had more inserted genes. Identified QTLs for oil (OC-QTLs) and fatty acids (FA-QTLs) from nine reported populations were projected on the physical map of the reference genome "Darmor-bzh" to generate a map. Thus, 335 FA-QTLs and OC-QTLs could be highlighted and 82 QTLs were overlapping. Chromosome C3 contained 22 overlapping QTLs with all trait studied except for C18:3. In total, 218 candidate genes which were potentially involved in FA and TAG were identified in 162 QTLs confidence intervals and some of them might affect many traits. Also, 76 among these candidate genes were found inside 57 overlapping QTLs, and candidate genes for oil content were in majority (61/76 genes). Then, sixteen genes were found in overlapping QTLs involving three populations, and the remaining 60 genes were found in overlapping QTLs of two populations. Interaction network and pathway analysis of these candidate genes indicated ten genes that might have strong influence over the other genes that control fatty acids and oil formation. The present results provided new information for genetic basis of FA and TAG formation in B. napus. A map including QTLs from numerous populations was built, which could serve as reference to study the genome profile of B. napus, and new potential genes emerged which might affect seed oil. New useful tracks were showed for the selection of population or/and selection of interesting genes for breeding improvement purpose.
Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B., Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj
2013-01-01
The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants. PMID:23691254
Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B, Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj
2013-01-01
The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants.
Molecular mapping of stripe rust resistance gene Yr76 in winter club wheat cultivar Tyee
USDA-ARS?s Scientific Manuscript database
Tyee, one of the wheat cultivars used to differentiate races of Puccinia striiformis f. sp. tritici (Pst) in the United States, was identified to have a single gene for all-stage resistance, tentatively named YrTye. To map the gene, Tyee was crossed with ‘Avocet Susceptible’ (AvS). Genetic analysi...
Zhang, Zhen; Shang, Haihong; Shi, Yuzhen; Huang, Long; Li, Junwen; Ge, Qun; Gong, Juwu; Liu, Aiying; Chen, Tingting; Wang, Dan; Wang, Yanling; Palanga, Koffi Kibalou; Muhammad, Jamshed; Li, Weijie; Lu, Quanwei; Deng, Xiaoying; Tan, Yunna; Song, Weiwu; Cai, Juan; Li, Pengtao; Rashid, Harun or; Gong, Wankui; Yuan, Youlu
2016-04-11
Upland Cotton (Gossypium hirsutum) is one of the most important worldwide crops it provides natural high-quality fiber for the industrial production and everyday use. Next-generation sequencing is a powerful method to identify single nucleotide polymorphism markers on a large scale for the construction of a high-density genetic map for quantitative trait loci mapping. In this research, a recombinant inbred lines population developed from two upland cotton cultivars 0-153 and sGK9708 was used to construct a high-density genetic map through the specific locus amplified fragment sequencing method. The high-density genetic map harbored 5521 single nucleotide polymorphism markers which covered a total distance of 3259.37 cM with an average marker interval of 0.78 cM without gaps larger than 10 cM. In total 18 quantitative trait loci of boll weight were identified as stable quantitative trait loci and were detected in at least three out of 11 environments and explained 4.15-16.70 % of the observed phenotypic variation. In total, 344 candidate genes were identified within the confidence intervals of these stable quantitative trait loci based on the cotton genome sequence. These genes were categorized based on their function through gene ontology analysis, Kyoto Encyclopedia of Genes and Genomes analysis and eukaryotic orthologous groups analysis. This research reported the first high-density genetic map for Upland Cotton (Gossypium hirsutum) with a recombinant inbred line population using single nucleotide polymorphism markers developed by specific locus amplified fragment sequencing. We also identified quantitative trait loci of boll weight across 11 environments and identified candidate genes within the quantitative trait loci confidence intervals. The results of this research would provide useful information for the next-step work including fine mapping, gene functional analysis, pyramiding breeding of functional genes as well as marker-assisted selection.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sharma, V.; Bonnycastle, L.; Poorkai, P.
1994-09-01
We have constructed a yeast artificial chromosome (YAC) contig of chromosome 14q24.3 which encompasses the chromosome 14 Alzheimer`s disease locus (AD3). Determined by linkage analysis of early-onset Alzheimer`s disease kindreds, this interval is bounded by the genetic markers D14S61-D14S63 and spans approximately 15 centimorgans. The contig consists of 29 markers and 74 YACs of which 57 are defined by one or more sequence tagged sites (STSs). The STS markers comprise 5 genes, 16 short tandem repeat polymorphisms and 8 cDNA clones. An additional number of genes, expressed sequence tags and cDNA fragments have been identified and localized to the contigmore » by hybridization and sequence analysis of anonymous clones isolated by cDNA direct selection techniques. A minimal contig of about 15 YACs averaging 0.5-1.5 megabase in length will span this interval and is, at first approximation, in rough agreement with the genetic map. For two regions of the contig, our coverage has relied on L1/THE fingerprint and Alu-PCR hybridization data of YACs provided by CEPH/Genethon. We are currently developing sequence tagged sites from these to confirm the overlaps revealed by the fingerprint data. Among the genes which map to the contig are transforming growth factor beta 3, c-fos, and heat shock protein 2A (HSPA2). C-fos is not a candidate gene for AD3 based on the sequence analysis of affected and unaffected individuals. HSPA2 maps to the proximal edge of the contig and Calmodulin 1, a candidate gene from 4q24.3, maps outside of the region. The YAC contig is a framework physical map from which cosmid or P1 clone contigs can be constructed. As more genes and cDNAs are mapped, a highly resolved transcription map will emerge, a necessary step towards positionally cloning the AD3 gene.« less
Sequencing of cDNA Clones from the Genetic Map of Tomato (Lycopersicon esculentum)
Ganal, Martin W.; Czihal, Rosemarie; Hannappel, Ulrich; Kloos, Dorothee-U.; Polley, Andreas; Ling, Hong-Qing
1998-01-01
The dense RFLP linkage map of tomato (Lycopersicon esculentum) contains >300 anonymous cDNA clones. Of those clones, 272 were partially or completely sequenced. The sequences were compared at the DNA and protein level to known genes in databases. For 57% of the clones, a significant match to previously described genes was found. The information will permit the conversion of those markers to STS markers and allow their use in PCR-based mapping experiments. Furthermore, it will facilitate the comparative mapping of genes across distantly related plant species by direct comparison of DNA sequences and map positions. [cDNA sequence data reported in this paper have been submitted to the EMBL database under accession nos. AA824695–AA825005 and the dbEST_Id database under accession nos. 1546519–1546862.] PMID:9724330
Lu, D; Yang, H; Raizada, M K
1996-12-01
Angiotensin II (Ang II) stimulates expression of tyrosine hydroxylase and norepinephrine transporter genes in brain neurons; however, the signal-transduction mechanism is not clearly defined. This study was conducted to determine the involvement of the mitogen-activated protein (MAP) kinase signaling pathway in Ang II stimulation of these genes. MAP kinase was localized in the perinuclear region of the neuronal soma. Ang II caused activation of MAP kinase and its subsequent translocation from the cytoplasmic to nuclear compartment, both effects being mediated by AT1 receptor subtype. Ang II also stimulated SRE- and AP1-binding activities and fos gene expression and its translocation in a MAP kinase-dependent process. These observations are the first demonstration of a downstream signaling pathway involving MAP kinase in Ang II-mediated neuromodulation in noradrenergic neurons.
2013-01-01
Background The wheat genome sequence is an essential tool for advanced genomic research and improvements. The generation of a high-quality wheat genome sequence is challenging due to its complex 17 Gb polyploid genome. To overcome these difficulties, sequencing through the construction of BAC-based physical maps of individual chromosomes is employed by the wheat genomics community. Here, we present the construction of the first comprehensive physical map of chromosome 1BS, and illustrate its unique gene space organization and evolution. Results Fingerprinted BAC clones were assembled into 57 long scaffolds, anchored and ordered with 2,438 markers, covering 83% of chromosome 1BS. The BAC-based chromosome 1BS physical map and gene order of the orthologous regions of model grass species were consistent, providing strong support for the reliability of the chromosome 1BS assembly. The gene space for chromosome 1BS spans the entire length of the chromosome arm, with 76% of the genes organized in small gene islands, accompanied by a two-fold increase in gene density from the centromere to the telomere. Conclusions This study provides new evidence on common and chromosome-specific features in the organization and evolution of the wheat genome, including a non-uniform distribution of gene density along the centromere-telomere axis, abundance of non-syntenic genes, the degree of colinearity with other grass genomes and a non-uniform size expansion along the centromere-telomere axis compared with other model cereal genomes. The high-quality physical map constructed in this study provides a solid basis for the assembly of a reference sequence of chromosome 1BS and for breeding applications. PMID:24359668
Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin
2017-10-24
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed ( Brassica napus ). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B . napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B . napus and its parental lines and for molecular breeding studies of bZIP genes in B . napus .
Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Xu, Xinfu; Wang, Rui; Li, Jiana
2017-01-01
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus. PMID:29064393
Chang, Meiping; Smith, Sarah; Thorpe, Andrew; Barratt, Michael J; Karim, Farzana
2010-09-16
We have previously used the rat 4 day Complete Freund's Adjuvant (CFA) model to screen compounds with potential to reduce osteoarthritic pain. The aim of this study was to identify genes altered in this model of osteoarthritic pain and use this information to infer analgesic potential of compounds based on their own gene expression profiles using the Connectivity Map approach. Using microarrays, we identified differentially expressed genes in L4 and L5 dorsal root ganglia (DRG) from rats that had received intraplantar CFA for 4 days compared to matched, untreated control animals. Analysis of these data indicated that the two groups were distinguishable by differences in genes important in immune responses, nerve growth and regeneration. This list of differentially expressed genes defined a "CFA signature". We used the Connectivity Map approach to identify pharmacologic agents in the Broad Institute Build02 database that had gene expression signatures that were inversely related ('negatively connected') with our CFA signature. To test the predictive nature of the Connectivity Map methodology, we tested phenoxybenzamine (an alpha adrenergic receptor antagonist) - one of the most negatively connected compounds identified in this database - for analgesic activity in the CFA model. Our results indicate that at 10 mg/kg, phenoxybenzamine demonstrated analgesia comparable to that of Naproxen in this model. Evaluation of phenoxybenzamine-induced analgesia in the current study lends support to the utility of the Connectivity Map approach for identifying compounds with analgesic properties in the CFA model.
Characterization of two rice MADS box genes that control flowering time.
Kang, H G; Jang, S; Chung, J E; Cho, Y G; An, G
1997-08-31
Plants contain a variety of the MADS box genes that encode regulatory proteins and play important roles in both the formation of flower meristem and the determination of floral organ identity. We have characterized two flower-specific cDNAs from rice, designated OsMADS7 and OsMADS8. The cDNAs displayed the structure of a typical plant MADS box gene, which consists of the MADS domain, I region, K domain, and C-terminal region. These genes were classified as members of the AGL2 gene family based on sequence homology. The OsMADS7 and 8 proteins were most homologous to OM1 and FBP2, respectively. The OsMADS7 and 8 transcripts were detectable primarily in carpels and also weakly in anthers. During flower development, the OsMADS genes started to express at the young flower stage and the expression continued to the late stage of flower development. The OsMADS7 and 8 genes were mapped on the long arms of the chromosome 8 and 9, respectively. To study the functions of the genes, the cDNA clones were expressed ectopically using the CaMV 35S promoter in a heterologous tobacco plant system. Transgenic plants expressing the OsMADS genes exhibited the phenotype of early flowering and dwarfism. The strength of the phenotypes was proportional to the levels of transgene expression and the phenotypes were co-inherited with the kanamycin resistant gene to the next generation. These results indicate that OsMADS7 and 8 are structurally related to the AGL2 family and are involved in controlling flowering time.
Trifonova, E A; Eremina, E R; Urnov, F D; Stepanov, V A
2012-01-01
The structure of the haplotypes and linkage disequilibrium (LD) of the methylenetetrahydrofolate reductase gene (MTHFR) in 9 population groups from Northern Eurasia and populations of the international HapMap project was investigated in the present study. The data suggest that the architecture of LD in the human genome is largely determined by the evolutionary history of populations; however, the results of phylogenetic and haplotype analyses seems to suggest that in fact there may be a common "old" mechanism for the formation of certain patterns of LD. Variability in the structure of LD and the level of diversity of MTHFRhaplotypes cause a certain set of tagSNPs with an established prognostic significance for each population. In our opinion, the results obtained in the present study are of considerable interest for understanding multiple genetic phenomena: namely, the association of interpopulation differences in the patterns of LD with structures possessing a genetic susceptibility to complex diseases, and the functional significance of the pleiotropicMTHFR gene effect. Summarizing the results of this study, a conclusion can be made that the genetic variability analysis with emphasis on the structure of LD in human populations is a powerful tool that can make a significant contribution to such areas of biomedical science as human evolutionary biology, functional genomics, genetics of complex diseases, and pharmacogenomics.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Slaugenhaupt, S.A.; Liebert, C.B.; Altherr, M.R.
The pineal hormone melatonin elicits potent circadian and reproductive effects in mammals. The authors report the chromosomal location of the gene for the Mel{sub 1a}-melatonin receptor that likely mediates these circadian and reproductive actions. PCR analysis of human-rodent somatic cell hybrids showed that the receptor gene (MTNR1A) maps to human chromosome 4q35.1. An interspecific backcross analysis revealed that the mouse gene (Mtnr1a) maps to the proximal portion of chromosome 8. These loci may be involved in genetically based circadian and neuroendocrine disorders. 14 refs., 1 fig.
Dash, Debasis; Mukerji, Mitali
2014-01-01
Admixture mapping has been enormously resourceful in identifying genetic variations linked to phenotypes, adaptation, and diseases. In this study through analysis of copy number variable regions (CNVRs), we report extensive restructuring in the genomes of the recently admixed African-Indian population (OG-W-IP) that inhabits a highly saline environment in Western India. The study included subjects from OG-W-IP (OG), five different Indian and three HapMap populations that were genotyped using Affymetrix version 6.0 arrays. Copy number variations (CNVs) detected using Birdsuite were used to define CNVRs. Population structure with respect to CNVRs was delineated using random forest approach. OG genomes have a surprising excess of CNVs in comparison to other studied populations. Individual ancestry proportions computed using STRUCTURE also reveals a unique genetic component in OGs. Population structure analysis with CNV genotypes indicates OG to be distant from both the African and Indian ancestral populations. Interestingly, it shows genetic proximity with respect to CNVs to only one Indian population IE-W-LP4, which also happens to reside in the same geographical region. We also observe a significant enrichment of molecular processes related to ion binding and receptor activity in genes encompassing OG-specific CNVRs. Our results suggest that retention of CNVRs from ancestral natives and de novo acquisition of CNVRs could accelerate the process of adaptation especially in an extreme environment. Additionally, this population would be enormously useful for dissecting genes and delineating the involvement of CNVs in salt adaptation. PMID:25398783
Davis, G L; McMullen, M D; Baysdorfer, C; Musket, T; Grant, D; Staebell, M; Xu, G; Polacco, M; Koster, L; Melia-Hancock, S; Houchins, K; Chao, S; Coe, E H
1999-01-01
We have constructed a 1736-locus maize genome map containing1156 loci probed by cDNAs, 545 probed by random genomic clones, 16 by simple sequence repeats (SSRs), 14 by isozymes, and 5 by anonymous clones. Sequence information is available for 56% of the loci with 66% of the sequenced loci assigned functions. A total of 596 new ESTs were mapped from a B73 library of 5-wk-old shoots. The map contains 237 loci probed by barley, oat, wheat, rice, or tripsacum clones, which serve as grass genome reference points in comparisons between maize and other grass maps. Ninety core markers selected for low copy number, high polymorphism, and even spacing along the chromosome delineate the 100 bins on the map. The average bin size is 17 cM. Use of bin assignments enables comparison among different maize mapping populations and experiments including those involving cytogenetic stocks, mutants, or quantitative trait loci. Integration of nonmaize markers in the map extends the resources available for gene discovery beyond the boundaries of maize mapping information into the expanse of map, sequence, and phenotype information from other grass species. This map provides a foundation for numerous basic and applied investigations including studies of gene organization, gene and genome evolution, targeted cloning, and dissection of complex traits. PMID:10388831
HFE gene: Structure, function, mutations, and associated iron abnormalities.
Barton, James C; Edwards, Corwin Q; Acton, Ronald T
2015-12-15
The hemochromatosis gene HFE was discovered in 1996, more than a century after clinical and pathologic manifestations of hemochromatosis were reported. Linked to the major histocompatibility complex (MHC) on chromosome 6p, HFE encodes the MHC class I-like protein HFE that binds beta-2 microglobulin. HFE influences iron absorption by modulating the expression of hepcidin, the main controller of iron metabolism. Common HFE mutations account for ~90% of hemochromatosis phenotypes in whites of western European descent. We review HFE mapping and cloning, structure, promoters and controllers, and coding region mutations, HFE protein structure, cell and tissue expression and function, mouse Hfe knockouts and knockins, and HFE mutations in other mammals with iron overload. We describe the pertinence of HFE and HFE to mechanisms of iron homeostasis, the origin and fixation of HFE polymorphisms in European and other populations, and the genetic and biochemical basis of HFE hemochromatosis and iron overload. Copyright © 2015 Elsevier B.V. All rights reserved.
Bowman, Shaun M; Piwowar, Amy; Ciocca, Maria; Free, Stephen J
2005-01-01
Two Neurospora mutants with a phenotype that includes a tight colonial growth pattern, an inability to form conidia and an inability to form protoperithecia have been isolated and characterized. The relevant mutations were mapped to the same locus on the sequenced Neurospora genome. The mutations responsible for the mutant phenotype then were identified by examining likely candidate genes from the mutant genomes at the mapped locus with PCR amplification and a sequencing assay. The results demonstrate that a map and sequence strategy is a feasible way to identify mutant genes in Neurospora. The gene responsible for the phenotype is a putative alpha-1,2-mannosyltransferase gene. The mutant cell wall has an altered composition demonstrating that the gene functions in cell wall biosynthesis. The results demonstrate that the mnt-1 gene is required for normal cell wall biosynthesis, morphology and for the regulation of asexual development.
Identification and genetic mapping of a homeobox gene to the 4p16. 1 region of human chromosome 4
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stadler, H.S.; Padanilam, B.J.; Solursh, M.
1992-12-01
A human craniofacial cDNA library was screened with a degenerate oligonucleotide probe based on the conserved third helix of homeobox genes. From this screening, we identified a homeobox gene, H6, which shared only 57-65% amino acid identity to previously reported homeodomains. H6 was physically mapped to the 4P16.1 region by using somatic cell hybrids containing specific deletions of human chromosome 4. Linkage data from a single-stranded conformational polymorphism derived from the 3[prime] untranslated region of the H6 cDNA placed this homeobox gene more than 20 centimorgans proximal of the previously mapped HOX7 gene on chromosome 4. Identity comparisons of themore » H6 Homeodomain with previously reported homeodomains reveal the highest identities to be with the Nk class of homeobox genes in Drosophila melanogaster. 53 refs., 5 figs., 2 tabs.« less
Silva, C; Garcia-Mas, J; Sánchez, A M; Arús, P; Oliveira, M M
2005-03-01
Blooming time is one of the most important agronomic traits in almond. Biochemical and molecular events underlying flowering regulation must be understood before methods to stimulate late flowering can be developed. Attempts to elucidate the genetic control of this process have led to the identification of a major gene (Lb) and quantitative trait loci (QTLs) linked to observed phenotypic differences, but although this gene and these QTLs have been placed on the Prunus reference genetic map, their sequences and specific functions remain unknown. The aim of our investigation was to associate these loci with known genes using a candidate gene approach. Two almond cDNAs and eight Prunus expressed sequence tags were selected as candidate genes (CGs) since their sequences were highly identical to those of flowering regulatory genes characterized in other species. The CGs were amplified from both parental lines of the mapping population using specific primers. Sequence comparison revealed DNA polymorphisms between the parental lines, mainly of the single nucleotide type. Polymorphisms were used to develop co-dominant cleaved amplified polymorphic sequence markers or length polymorphisms based on insertion/deletion events for mapping the candidate genes on the Prunus reference map. Ten candidate genes were assigned to six linkage groups in the Prunus genome. The positions of two of these were compatible with the regions where two QTLs for blooming time were detected. One additional candidate was localized close to the position of the Evergrowing gene, which determines a non-deciduous behaviour in peach.
A circular genetic map of Erwinia carotovora subsp. atroseptica 3-2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nikolaichik, E.A.; Pesnyakevich, A.G.
1995-08-01
A circular genetic map of Erwinia carotovora subsp. atroseptica 3-2 was constructed on the basis of the R471a plasmid and Tn5 and Tn9 using Hfr-like donors. Forty-six genes, including phytopathogenicity genes, were located on the basis of interrupted mating experiment results and analysis of coinheritance of markers on a map of 183 min in length. The similarity and differences of chromosomal genetic maps of Erwinia genus bacteria are discussed. 23 refs., 2 figs., 4 tabs.
The control of lambda DNA terminase synthesis.
Murialdo, H; Davidson, A; Chow, S; Gold, M
1987-01-01
Nu1 and A, the genes coding for bacteriophage lambda DNA terminase, rank among the most poorly translated genes expressed in E. coli. To understand the reason for this low level of translation the genes were cloned into plasmids and their expression measured. In addition, the wild type DNA sequences immediately preceding the genes were reduced and modified. It was found that the elements that control translation are contained in the 100 base pairs upstream from the initiation codon. Interchanging these upstream sequences with those of an efficiently translated gene dramatically increased the translation of terminase subunits. It seems unlikely that the rare codons present in the genes, and any feature of their mRNA secondary structure play a role in the control of their translation. The elimination of cos from plasmids containing Nu1 and A also resulted in an increase in terminase production. This result suggests a role for cos in the control of late gene expression. The terminase subunit overproducer strains are potentially very useful for the design of improved DNA packaging and cosmid mapping techniques. Images PMID:3029667
Sharma, Prem N; Torii, Akihide; Takumi, Shigeo; Mori, Naoki; Nakamura, Chiharu
2004-01-01
Brown planthopper (BPH) (Nilaparvata lugens Stål) is a significant insect pest of rice (Oryza sativa L.). We constructed a gene-pyramided japonica line, in which two BPH resistance genes Bph1 and Bph2 on the long arm of chromosome 12 independently derived from two indica resistance lines were combined through the recombinant selection. The gene-pyramiding was achieved based on the previously constructed high-resolution linkage maps of the two genes. Two co-dominant and four dominant PCR-based markers flanking the loci were used to select for a homozygous recombinant line in a segregating population that was derived from a cross between the parental homozygous single-gene introgression lines. BPH bioassay showed that the resistance level of the pyramided line was equivalent to that of the Bph1-single introgression line, which showed a higher level of resistance than the Bph2-single introgression line. The pyramid line should provide a useful experimental means for studying the fine structure of the chromosomal region covering these two major BPH resistance genes.
Eggenhofer, Elke; Rachel, Reinhard; Haslbeck, Martin; Scharf, Birgit
2006-01-01
The flagella of the soil bacterium Sinorhizobium meliloti differ from the enterobacterial paradigm in the complex filament structure and modulation of the flagellar rotary speed. The mode of motility control in S. meliloti has a molecular corollary in two novel periplasmic motility proteins, MotC and MotE, that are present in addition to the ubiquitous MotA/MotB energizing proton channel. A fifth motility gene is located in the mot operon downstream of the motB and motC genes. Its gene product was originally designated MotD, a cytoplasmic motility protein having an unknown function. We report here reassignment of MotD as FliK, the regulator of flagellar hook length. The FliK gene is one of the few flagellar genes not annotated in the contiguous flagellar regulon of S. meliloti. Characteristic for its class, the 475-residue FliK protein contains a conserved, compactly folded Flg hook domain in its carboxy-terminal region. Deletion of fliK leads to formation of prolonged flagellar hooks (polyhooks) with missing filament structures. Extragenic suppressor mutations all mapped in the cytoplasmic region of the transmembrane export protein FlhB and restored assembly of a flagellar filament, and thus motility, in the presence of polyhooks. The structural properties of FliK are consistent with its function as a substrate specificity switch of the flagellar export apparatus for switching from rod/hook-type substrates to filament-type substrates. PMID:16513744
High-resolution genetic mapping of allelic variants associated with cell wall chemistry in Populus.
Muchero, Wellington; Guo, Jianjun; DiFazio, Stephen P; Chen, Jin-Gui; Ranjan, Priya; Slavov, Gancho T; Gunter, Lee E; Jawdy, Sara; Bryan, Anthony C; Sykes, Robert; Ziebell, Angela; Klápště, Jaroslav; Porth, Ilga; Skyba, Oleksandr; Unda, Faride; El-Kassaby, Yousry A; Douglas, Carl J; Mansfield, Shawn D; Martin, Joel; Schackwitz, Wendy; Evans, Luke M; Czarnecki, Olaf; Tuskan, Gerald A
2015-01-23
QTL cloning for the discovery of genes underlying polygenic traits has historically been cumbersome in long-lived perennial plants like Populus. Linkage disequilibrium-based association mapping has been proposed as a cloning tool, and recent advances in high-throughput genotyping and whole-genome resequencing enable marker saturation to levels sufficient for association mapping with no a priori candidate gene selection. Here, multiyear and multienvironment evaluation of cell wall phenotypes was conducted in an interspecific P. trichocarpa x P. deltoides pseudo-backcross mapping pedigree and two partially overlapping populations of unrelated P. trichocarpa genotypes using pyrolysis molecular beam mass spectrometry, saccharification, and/ or traditional wet chemistry. QTL mapping was conducted using a high-density genetic map with 3,568 SNP markers. As a fine-mapping approach, chromosome-wide association mapping targeting a QTL hot-spot on linkage group XIV was performed in the two P. trichocarpa populations. Both populations were genotyped using the 34 K Populus Infinium SNP array and whole-genome resequencing of one of the populations facilitated marker-saturation of candidate intervals for gene identification. Five QTLs ranging in size from 0.6 to 1.8 Mb were mapped on linkage group XIV for lignin content, syringyl to guaiacyl (S/G) ratio, 5- and 6-carbon sugars using the mapping pedigree. Six candidate loci exhibiting significant associations with phenotypes were identified within QTL intervals. These associations were reproducible across multiple environments, two independent genotyping platforms, and different plant growth stages. cDNA sequencing for allelic variants of three of the six loci identified polymorphisms leading to variable length poly glutamine (PolyQ) stretch in a transcription factor annotated as an ANGUSTIFOLIA C-terminus Binding Protein (CtBP) and premature stop codons in a KANADI transcription factor as well as a protein kinase. Results from protoplast transient expression assays suggested that each of the polymorphisms conferred allelic differences in the activation of cellulose, hemicelluloses, and lignin pathway marker genes. This study illustrates the utility of complementary QTL and association mapping as tools for gene discovery with no a priori candidate gene selection. This proof of concept in a perennial organism opens up opportunities for discovery of novel genetic determinants of economically important but complex traits in plants.
Serial analysis of gene expression (SAGE) in normal human trabecular meshwork.
Liu, Yutao; Munro, Drew; Layfield, David; Dellinger, Andrew; Walter, Jeffrey; Peterson, Katherine; Rickman, Catherine Bowes; Allingham, R Rand; Hauser, Michael A
2011-04-08
To identify the genes expressed in normal human trabecular meshwork tissue, a tissue critical to the pathogenesis of glaucoma. Total RNA was extracted from human trabecular meshwork (HTM) harvested from 3 different donors. Extracted RNA was used to synthesize individual SAGE (serial analysis of gene expression) libraries using the I-SAGE Long kit from Invitrogen. Libraries were analyzed using SAGE 2000 software to extract the 17 base pair sequence tags. The extracted sequence tags were mapped to the genome using SAGE Genie map. A total of 298,834 SAGE tags were identified from all HTM libraries (96,842, 88,126, and 113,866 tags, respectively). Collectively, there were 107,325 unique tags. There were 10,329 unique tags with a minimum of 2 counts from a single library. These tags were mapped to known unique Unigene clusters. Approximately 29% of the tags (orphan tags) did not map to a known Unigene cluster. Thirteen percent of the tags mapped to at least 2 Unigene clusters. Sequence tags from many glaucoma-related genes, including myocilin, optineurin, and WD repeat domain 36, were identified. This is the first time SAGE analysis has been used to characterize the gene expression profile in normal HTM. SAGE analysis provides an unbiased sampling of gene expression of the target tissue. These data will provide new and valuable information to improve understanding of the biology of human aqueous outflow.
Radiation hybrid mapping of genes in the lithium-sensitive wnt signaling pathway.
Rhoads, A R; Karkera, J D; Detera-Wadleigh, S D
1999-09-01
Lithium, an effective drug in the treatment of bipolar disorder, has been proposed to disrupt the Wnt signaling pathway. To facilitate analysis of the possible involvement of elements of the Wnt pathway in human bipolar disorder, a high resolution radiation hybrid mapping (RHM) of these genes was performed. A fine physical location has been obtained for Wnt 7A, frizzled 3, 4 and 5, dishevelled 1, 2 and 3, GSK3beta, axin, alpha-catenin, the Armadillo repeat-containing genes (delta-catenin and ARVCF), and a frizzled-like protein (frpHE) using the Stanford Human Genome Center (SHGC) G3 panel. Most of these genes were previously mapped by fluorescence in situ hybridization (FISH). Frizzled 4, axin and frpHE did not have a previous chromosomal assignment and were linked by RHM to chromosome markers, SHGC-35131 at 11q22.1, NIB1488 at 16p13.3 and D7S2919 at 7p15.2, respectively. Interestingly, some of these genes were found to map within potential regions underlying susceptibility to bipolar disorder and schizophrenia as well as disorders of neurodevelopmental origin. This alternative approach of establishing the precise location of selected genetic components of a candidate pathway and determining if they map within previously defined susceptibility loci should help to identify plausible candidate genes that warrant further analysis through association and mutational scanning.
A chromatin link to caste identity in the carpenter ant Camponotus floridanus
Simola, Daniel F.; Ye, Chaoyang; Mutti, Navdeep S.; Dolezal, Kelly; Bonasio, Roberto; Liebig, Jürgen; Reinberg, Danny; Berger, Shelley L.
2013-01-01
In many ant species, sibling larvae follow alternative ontogenetic trajectories that generate striking variation in morphology and behavior among adults. These organism-level outcomes are often determined by environmental rather than genetic factors. Therefore, epigenetic mechanisms may mediate the expression of adult polyphenisms. We produced the first genome-wide maps of chromatin structure in a eusocial insect and found that gene-proximal changes in histone modifications, notably H3K27 acetylation, discriminate two female worker and male castes in Camponotus floridanus ants and partially explain differential gene expression between castes. Genes showing coordinated changes in H3K27ac and RNA implicate muscle development, neuronal regulation, and sensory responses in modulating caste identity. Binding sites of the acetyltransferase CBP harbor the greatest caste variation in H3K27ac, are enriched with motifs for conserved transcription factors, and show evolutionary expansion near developmental and neuronal genes. These results suggest that environmental effects on caste identity may be mediated by differential recruitment of CBP to chromatin. We propose that epigenetic mechanisms that modify chromatin structure may help orchestrate the generation and maintenance of polyphenic caste morphology and social behavior in ants. PMID:23212948
Chen, Frank; Spano, Anthony; Goodman, Benjamin E.; Blasier, Kiev R.; Sabat, Agnes; Jeffery, Erin; Norris, Andrew; Shabanowitz, Jeffrey; Hunt, Donald F.; Lebedev, Nikolai
2010-01-01
The gene transfer agent of Rhodobacter capsulatus (GTA) is a unique phage-like particle that exchanges genetic information between members of this same species of bacterium. Besides being an excellent tool for genetic mapping, the GTA has a number of advantages for biotechnological and nanoengineering purposes. To facilitate the GTA purification and identify the proteins involved in GTA expression, assembly and regulation, in the present work we construct and transform into R. capsulatus Y262 a gene coding for a C-terminally His-tagged capsid protein. The constructed protein was expressed in the cells, assembled into chimeric GTA particles inside the cells and excreted from the cells into surrounding medium. Transmission electron micrographs of phosphotungstate-stained, NiNTA-purified chimeric GTA confirm that its structure is similar to normal GTA particles, with many particles composed both of a head and a tail. The mass spectrometric proteomic analysis of polypeptides present in the GTA recovered outside the cells shows that GTA is composed of at least 9 proteins represented in the GTA gene cluster including proteins coded for by Orf’s 3, 5, 6–9, 11, 13, and 15. PMID:19105630
Chen, Frank; Spano, Anthony; Goodman, Benjamin E; Blasier, Kiev R; Sabat, Agnes; Jeffery, Erin; Norris, Andrew; Shabanowitz, Jeffrey; Hunt, Donald F; Lebedev, Nikolai
2009-02-01
The gene transfer agent of Rhodobacter capsulatus (GTA) is a unique phage-like particle that exchanges genetic information between members of this same species of bacterium. Besides being an excellent tool for genetic mapping, the GTA has a number of advantages for biotechnological and nanoengineering purposes. To facilitate the GTA purification and identify the proteins involved in GTA expression, assembly and regulation, in the present work we construct and transform into R. capsulatus Y262 a gene coding for a C-terminally His-tagged capsid protein. The constructed protein was expressed in the cells, assembled into chimeric GTA particles inside the cells and excreted from the cells into surrounding medium. Transmission electron micrographs of phosphotungstate-stained, NiNTA-purified chimeric GTA confirm that its structure is similar to normal GTA particles, with many particles composed both of a head and a tail. The mass spectrometric proteomic analysis of polypeptides present in the GTA recovered outside the cells shows that GTA is composed of at least 9 proteins represented in the GTA gene cluster including proteins coded for by Orf's 3, 5, 6-9, 11, 13, and 15.
Structure and transcriptional regulation of the major intrinsic protein gene family in grapevine.
Wong, Darren Chern Jan; Zhang, Li; Merlin, Isabelle; Castellarin, Simone D; Gambetta, Gregory A
2018-04-11
The major intrinsic protein (MIP) family is a family of proteins, including aquaporins, which facilitate water and small molecule transport across plasma membranes. In plants, MIPs function in a huge variety of processes including water transport, growth, stress response, and fruit development. In this study, we characterize the structure and transcriptional regulation of the MIP family in grapevine, describing the putative genome duplication events leading to the family structure and characterizing the family's tissue and developmental specific expression patterns across numerous preexisting microarray and RNAseq datasets. Gene co-expression network (GCN) analyses were carried out across these datasets and the promoters of each family member were analyzed for cis-regulatory element structure in order to provide insight into their transcriptional regulation. A total of 29 Vitis vinifera MIP family members (excluding putative pseudogenes) were identified of which all but two were mapped onto Vitis vinifera chromosomes. In this study, segmental duplication events were identified for five plasma membrane intrinsic protein (PIP) and four tonoplast intrinsic protein (TIP) genes, contributing to the expansion of PIPs and TIPs in grapevine. Grapevine MIP family members have distinct tissue and developmental expression patterns and hierarchical clustering revealed two primary groups regardless of the datasets analyzed. Composite microarray and RNA-seq gene co-expression networks (GCNs) highlighted the relationships between MIP genes and functional categories involved in cell wall modification and transport, as well as with other MIPs revealing a strong co-regulation within the family itself. Some duplicated MIP family members have undergone sub-functionalization and exhibit distinct expression patterns and GCNs. Cis-regulatory element (CRE) analyses of the MIP promoters and their associated GCN members revealed enrichment for numerous CREs including AP2/ERFs and NACs. Combining phylogenetic analyses, gene expression profiling, gene co-expression network analyses, and cis-regulatory element enrichment, this study provides a comprehensive overview of the structure and transcriptional regulation of the grapevine MIP family. The study highlights the duplication and sub-functionalization of the family, its strong coordinated expression with genes involved in growth and transport, and the putative classes of TFs responsible for its regulation.