Science.gov

Sample records for additional genomic regions

  1. Detection and removal of PCR duplicates in population genomic ddRAD studies by addition of a degenerate base region (DBR) in sequencing adapters.

    PubMed

    Schweyen, Hannah; Rozenberg, Andrey; Leese, Florian

    2014-10-01

    Restriction-site associated DNA sequencing (RAD) has emerged as a powerful marker system for studying genome-wide DNA polymorphisms using next-generation sequencing. A recent technical facilitation of RAD is double-digest RAD (ddRAD), which utilizes two restriction enzymes for library preparation. The more flexible and balanced ddRAD allows analysis of genomic loci in hundreds of individuals. However, in contrast to paired-end sequencing of traditional RAD libraries, PCR duplicates cannot be detected with ddRAD. This is a concern because duplicates can contribute substantially to read coverage data and erroneously inflate the proportion of homozygous loci (allele dropout). Allele dropout can bias population genetic parameter inference and complicate the detection of outlier loci under selection. Here we outline a simple and straightforward approach to detecting PCR duplicates from ddRAD libraries. Our approach introduces a degenerate base region (DBR, 12,288 unique combinations) in the sequencing adapter. We demonstrate the high efficiency and low rate of false positives in simulations. In addition, a pilot study was performed to test this approach on six aquatic invertebrates, sequenced on a HiSeq 2500 sequencer. The reads of the ddRAD libraries consisted of 33.48% PCR duplicates distributed on 19.40% of the loci. A disproportionate number of PCR duplicates were detected in only 4.66% of the loci. While this should not be a concern for general parameter inference, outlier loci detection in particular would be improved by the DBR technique. Given the easy and straightforward application of the technique in other RAD protocols as well, we suggest that DBR regions should generally be included in PCR-based RAD studies.

  2. Genome of turbot rhabdovirus exhibits unusual non-coding regions and an additional ORF that could be expressed in fish cell.

    PubMed

    Zhu, Ruo-Lin; Lei, Xiao-Ying; Ke, Fei; Yuan, Xiu-Ping; Zhang, Qi-Ya

    2011-02-01

    Genomic sequence of Scophthalmus maximus rhabdovirus (SMRV) isolated from diseased turbot has been characterized. The complete genome of SMRV comprises 11,492 nucleotides and encodes five typical rhabdovirus genes N, P, M, G and L. In addition, two open reading frames (ORF) are predicted overlapping with P gene, one upstream of P and smaller than P (temporarily called Ps), and another in P gene which may encodes a protein similar to the vesicular stomatitis virus C protein. The C ORF is contained within the P ORF. The five typical proteins share the highest sequence identities (48.9%) with the corresponding proteins of rhabdoviruses in genus Vesiculovirus. Phylogenetic analysis of partial L protein sequence indicates that SMRV is close to genus Vesiculovirus. The first 13 nucleotides at the ends of the SMRV genome are absolutely inverse complementarity. The gene junctions between the five genes show conserved polyadenylation signal (CATGA(7)) and intergenic dinucleotide (CT) followed by putative transcription initiation sequence A(A/G)(C/G)A(A/G/T), which are different from known rhabdoviruses. The entire Ps ORF was cloned and expressed, and used to generate polyclonal antibody in mice. One obvious band could be detected in SMRV-infected carp leucocyte cells (CLCs) by anti-Ps/C serum via Western blot, and the subcellular localization of Ps-GFP fusion protein exhibited cytoplasm distribution as multiple punctuate or doughnut shaped foci of uneven size.

  3. Atypical regions in large genomic DNA sequences

    SciTech Connect

    Scherer, S. |; McPeek, M.S.; Speed, T.P.

    1994-07-19

    Large genomic DNA sequences contain regions with distinctive patterns of sequence organization. The authors describe a method using logarithms of probabilities based on seventh-order Markov chains to rapidly identify genomic sequences that do not resemble models of genome organization built from compilations of octanucleotide usage. Data bases have been constructed from Escherichia coli and Saccharomyces cerevisiae DNA sequences of >1000 nt and human sequences of >10,000 nt. Atypical genes and clusters of genes have been located in bacteriophage, yeast, and primate DNA sequences. The authors consider criteria for statistical significance of the results, offer possible explanations for the observed variation in genome organization, and give additional applications of these methods in DNA sequence analysis.

  4. The opossum MHC genomic region revisited.

    PubMed

    Krasnec, Katina V; Sharp, Alana R; Williams, Tracey L; Miller, Robert D

    2015-04-01

    The gray short-tailed opossum Monodelphis domestica is one of the few marsupial species for which a high quality whole genome sequence is available and the major histocompatibility complex (MHC) region has been annotated. Previous analyses revealed only a single locus within the opossum MHC region, designated Modo-UA1, with the features expected for encoding a functionally classical class I α-chain. Nine other class I genes found within the MHC are highly divergent and have features usually associated with non-classical roles. The original annotation, however, was based on an early version of the opossum genome assembly. More recent analyses of allelic variation in individual opossums revealed too many Modo-UA1 sequences per individual to be accounted for by a single MHC class I locus found in the genome assembly. A reanalysis of a later generation assembly, MonDom5, revealed the presence of two additional loci, now designated Modo-UA3 and UA4, in a region that was expanded and more complete than in the earlier assembly. Modo-UA1, UA3, and UA4 are all transcribed, although Modo-UA4 transcripts are rarer. Modo-UA4 is also relatively non-polymorphic. Evidence presented support the accuracy of the later assembly and the existence of three related class I genes in the opossum, making opossums more typical of mammals and most tetrapods by having multiple apparent classical MHC class I loci.

  5. Unraveling Additive from Nonadditive Effects Using Genomic Relationship Matrices

    PubMed Central

    Muñoz, Patricio R.; Resende, Marcio F. R.; Gezan, Salvador A.; Resende, Marcos Deon Vilela; de los Campos, Gustavo; Kirst, Matias; Huber, Dudley; Peter, Gary F.

    2014-01-01

    The application of quantitative genetics in plant and animal breeding has largely focused on additive models, which may also capture dominance and epistatic effects. Partitioning genetic variance into its additive and nonadditive components using pedigree-based models (P-genomic best linear unbiased predictor) (P-BLUP) is difficult with most commonly available family structures. However, the availability of dense panels of molecular markers makes possible the use of additive- and dominance-realized genomic relationships for the estimation of variance components and the prediction of genetic values (G-BLUP). We evaluated height data from a multifamily population of the tree species Pinus taeda with a systematic series of models accounting for additive, dominance, and first-order epistatic interactions (additive by additive, dominance by dominance, and additive by dominance), using either pedigree- or marker-based information. We show that, compared with the pedigree, use of realized genomic relationships in marker-based models yields a substantially more precise separation of additive and nonadditive components of genetic variance. We conclude that the marker-based relationship matrices in a model including additive and nonadditive effects performed better, improving breeding value prediction. Moreover, our results suggest that, for tree height in this population, the additive and nonadditive components of genetic variance are similar in magnitude. This novel result improves our current understanding of the genetic control and architecture of a quantitative trait and should be considered when developing breeding strategies. PMID:25324160

  6. Harnessing genomics to improve health in the Eastern Mediterranean Region - an executive course in genomics policy.

    PubMed

    Acharya, Tara; Rab, Mohammed Abdur; Singer, Peter A; Daar, Abdallah S

    2005-01-21

    BACKGROUND: While innovations in medicine, science and technology have resulted in improved health and quality of life for many people, the benefits of modern medicine continue to elude millions of people in many parts of the world. To assess the potential of genomics to address health needs in EMR, the World Health Organization's Eastern Mediterranean Regional Office and the University of Toronto Joint Centre for Bioethics jointly organized a Genomics and Public Health Policy Executive Course, held September 20th-23rd, 2003, in Muscat, Oman. The 4-day course was sponsored by WHO-EMRO with additional support from the Canadian Program in Genomics and Global Health. The overall objective of the course was to collectively explore how to best harness genomics to improve health in the region. This article presents the course findings and recommendations for genomics policy in EMR. METHODS: The course brought together senior representatives from academia, biotechnology companies, regulatory bodies, media, voluntary, and legal organizations to engage in discussion. Topics covered included scientific advances in genomics, followed by innovations in business models, public sector perspectives, ethics, legal issues and national innovation systems. RESULTS: A set of recommendations, summarized below, was formulated for the Regional Office, the Member States and for individuals.* Advocacy for genomics and biotechnology for political leadership;* Networking between member states to share information, expertise, training, and regional cooperation in biotechnology; coordination of national surveys for assessment of health biotechnology innovation systems, science capacity, government policies, legislation and regulations, intellectual property policies, private sector activity;* Creation in each member country of an effective National Body on genomics, biotechnology and health to:- formulate national biotechnology strategies- raise biotechnology awareness- encourage teaching and

  7. Characterization of copy number variation in genomic regions containing STR loci using array comparative genomic hybridization.

    PubMed

    Repnikova, Elena A; Rosenfeld, Jill A; Bailes, Andrea; Weber, Cecilia; Erdman, Linda; McKinney, Aimee; Ramsey, Sarah; Hashimoto, Sayaka; Lamb Thrush, Devon; Astbury, Caroline; Reshmi, Shalini C; Shaffer, Lisa G; Gastier-Foster, Julie M; Pyatt, Robert E

    2013-09-01

    Short tandem repeat (STR) loci are commonly used in forensic casework, familial analysis for human identification, and for monitoring hematopoietic cell engraftment after bone marrow transplant. Unexpected genetic variation leading to sequence and length differences in STR loci can complicate STR typing, and presents challenges in casework interpretation. Copy number variation (CNV) is a relatively recently identified form of genetic variation consisting of genomic regions present at variable copy numbers within an individual compared to a reference genome. Large scale population studies have demonstrated that likely all individuals carry multiple regions with CNV of 1kb in size or greater in their genome. To date, no study correlating genomic regions containing STR loci with CNV has been conducted. In this study, we analyzed results from 32,850 samples sent for clinical array comparative genomic hybridization (CGH) analysis for the presence of CNV at regions containing the 13 CODIS (Combined DNA Index System) STR, and the Amelogenin X (AMELX) and Amelogenin Y (AMELY) loci. Thirty-two individuals with CNV involving STR loci on chromosomes 2, 4, 7, 11, 12, 13, 16, and 21, and twelve with CNV involving the AMELX/AMELY loci were identified. These results were correlated with data from publicly available databases housing information on CNV identified in normal populations and additional clinical cases. These collective results demonstrate the presence of CNV in regions containing 9 of the 13 CODIS STR and AMELX/Y loci. Further characterization of STR profiles within regions of CNV, additional cataloging of these variants in multiple populations, and contributing such examples to the public domain will provide valuable information for reliable use of these loci.

  8. Selecting Hypomethylated Genomic Regions Using MRE-Seq.

    PubMed

    Wischnitzki, Elisabeth; Burg, Kornel; Berenyi, Maria; Sehr, Eva Maria

    2016-01-01

    Here, we describe a method capable of filtering the hypomethylated part of plant genomes, the so-called hypomethylome. The principle of the method is based on the filtration and sequence analysis of small DNA fragments generated by methylation-sensitive four-cutter restriction endonucleases, possessing ((5me))CpG motifs in their recognition sites. The majority of these fragments represent genes and their flanking regions containing also regulatory elements-the gene space of the genome. Besides the enrichment of the gene space, another advantage of the method is the simultaneous depletion of repetitive elements due to their methylated nature and its easy application on complex and large plant genomes. Additionally to the wet lab procedure, we describe how to analyze the data using bioinformatics methods and how to apply the method to comparative studies. PMID:27557762

  9. Histone modifications predispose genome regions to breakage and translocation

    PubMed Central

    Burman, Bharat; Zhang, Zhuzhu Z.; Pegoraro, Gianluca; Lieb, Jason D.; Misteli, Tom

    2015-01-01

    Chromosome translocations are well-established hallmarks of cancer cells and often occur at nonrandom sites in the genome. The molecular features that define recurrent chromosome breakpoints are largely unknown. Using a combination of bioinformatics, biochemical analysis, and cell-based assays, we identify here specific histone modifications as facilitators of chromosome breakage and translocations. We show enrichment of several histone modifications over clinically relevant translocation-prone genome regions. Experimental modulation of histone marks sensitizes genome regions to breakage by endonuclease challenge or irradiation and promotes formation of chromosome translocations of endogenous gene loci. Our results demonstrate that histone modifications predispose genome regions to chromosome breakage and translocations. PMID:26104467

  10. Genomic in situ hybridization analysis of Thinopyrum chromatin in a wheat-Th. intermedium partial amphiploid and six derived chromosome addition lines

    PubMed

    Chen; Conner; Laroche; Ji; Armstrong; Fedak

    1999-12-01

    The genomic origin of alien chromosomes present in a wheat-Thinopyrum intermedium partial amphiploid TAF46 (2n = 8x = 56) and six derived chromosome addition lines were analyzed by genomic in situ hybridization (GISH) using S genomic DNA from Pseudoroegneria strigosa (2n = 2x = 14, SS) as a probe. The GISH analysis clearly showed that the chromosome complement of the partial amphiploid TAF46 consists of an entire wheat genome plus one synthetic genome consisting of a mixture of six S genome chromosomes and eight J (=E) genome chromosomes derived from Th. intermedium (2n = 6x = 42, JJJ(s)J(s)SS). There were no Js genome chromosomes present in TAF46. The J genome chromosomes present in TAF46 displayed a unique GISH hybridization pattern with the S genomic DNA probe, in which S genome DNA strongly hybridized at the terminal regions and weakly hybridized over the remaining parts of the chromosomes. This provides a diagnostic marker for distinguishing J genome chromosomes from Js or S genome or wheat ABD genome chromosomes. The genomic origin of the alien chromosomes present in the six derived chromosome addition lines were identified by their characteristic GISH hybridization patterns with S genomic DNA probe. GISH analysis showed that addition lines L1, L2, L3, and L5 carried one pair of J genome chromosomes, while addition lines L4 and L7 each carried one pair of S genome chromosomes. GISH patterns detected by the S genome probe on addition line of L1 were identical to those of the J genome chromosomes present in the partial amphiploid TAF46, suggesting that these chromosomes were not structurally altered when they were transferred from TAF46 to addition lines.

  11. Admixture mapping identifies introgressed genomic regions in North American canids.

    PubMed

    vonHoldt, Bridgett M; Kays, Roland; Pollinger, John P; Wayne, Robert K

    2016-06-01

    Hybrid zones typically contain novel gene combinations that can be tested by natural selection in a unique genetic context. Parental haplotypes that increase fitness can introgress beyond the hybrid zone, into the range of parental species. We used the Affymetrix canine SNP genotyping array to identify genomic regions tagged by multiple ancestry informative markers that are more frequent in an admixed population than expected. We surveyed a hybrid zone formed in the last 100 years as coyotes expanded their range into eastern North America. Concomitant with expansion, coyotes hybridized with wolves and some populations became more wolflike, such that coyotes in the northeast have the largest body size of any coyote population. Using a set of 3102 ancestry informative markers, we identified 60 differentially introgressed regions in 44 canines across this admixture zone. These regions are characterized by an excess of exogenous ancestry and, in northeastern coyotes, are enriched for genes affecting body size and skeletal proportions. Further, introgressed wolf-derived alleles have penetrated into Southern US coyote populations. Because no wolves currently exist in this area, these alleles are unlikely to have originated from recent hybridization. Instead, they probably originated from intraspecific gene flow or ancient admixture. We show that grey wolf and coyote admixture has far-reaching effects and, in addition to phenotypically transforming admixed populations, allows for the differential movement of alleles from different parental species to be tested in new genomic backgrounds.

  12. Admixture mapping identifies introgressed genomic regions in North American canids.

    PubMed

    vonHoldt, Bridgett M; Kays, Roland; Pollinger, John P; Wayne, Robert K

    2016-06-01

    Hybrid zones typically contain novel gene combinations that can be tested by natural selection in a unique genetic context. Parental haplotypes that increase fitness can introgress beyond the hybrid zone, into the range of parental species. We used the Affymetrix canine SNP genotyping array to identify genomic regions tagged by multiple ancestry informative markers that are more frequent in an admixed population than expected. We surveyed a hybrid zone formed in the last 100 years as coyotes expanded their range into eastern North America. Concomitant with expansion, coyotes hybridized with wolves and some populations became more wolflike, such that coyotes in the northeast have the largest body size of any coyote population. Using a set of 3102 ancestry informative markers, we identified 60 differentially introgressed regions in 44 canines across this admixture zone. These regions are characterized by an excess of exogenous ancestry and, in northeastern coyotes, are enriched for genes affecting body size and skeletal proportions. Further, introgressed wolf-derived alleles have penetrated into Southern US coyote populations. Because no wolves currently exist in this area, these alleles are unlikely to have originated from recent hybridization. Instead, they probably originated from intraspecific gene flow or ancient admixture. We show that grey wolf and coyote admixture has far-reaching effects and, in addition to phenotypically transforming admixed populations, allows for the differential movement of alleles from different parental species to be tested in new genomic backgrounds. PMID:27106273

  13. Enhancer scanning to locate regulatory regions in genomic loci

    PubMed Central

    Buckley, Melissa; Gjyshi, Anxhela; Mendoza-Fandiño, Gustavo; Baskin, Rebekah; Carvalho, Renato S.; Carvalho, Marcelo A.; Woods, Nicholas T.; Monteiro, Alvaro N.A.

    2016-01-01

    The present protocol provides a rapid, streamlined and scalable strategy to systematically scan genomic regions for the presence of transcriptional regulatory regions active in a specific cell type. It creates genomic tiles spanning a region of interest that are subsequently cloned by recombination into a luciferase reporter vector containing the Simian Virus 40 promoter. Tiling clones are transfected into specific cell types to test for the presence of transcriptional regulatory regions. The protocol includes testing of different SNP (single nucleotide polymorphism) alleles to determine their effect on regulatory activity. This procedure provides a systematic framework to identify candidate functional SNPs within a locus during functional analysis of genome-wide association studies. This protocol adapts and combines previous well-established molecular biology methods to provide a streamlined strategy, based on automated primer design and recombinational cloning to rapidly go from a genomic locus to a set of candidate functional SNPs in eight weeks. PMID:26658467

  14. Comparative mitochondrial genomics of snakes: extraordinary substitution rate dynamics and functionality of the duplicate control region

    PubMed Central

    Jiang, Zhi J; Castoe, Todd A; Austin, Christopher C; Burbrink, Frank T; Herron, Matthew D; McGuire, Jimmy A; Parkinson, Christopher L; Pollock, David D

    2007-01-01

    Background The mitochondrial genomes of snakes are characterized by an overall evolutionary rate that appears to be one of the most accelerated among vertebrates. They also possess other unusual features, including short tRNAs and other genes, and a duplicated control region that has been stably maintained since it originated more than 70 million years ago. Here, we provide a detailed analysis of evolutionary dynamics in snake mitochondrial genomes to better understand the basis of these extreme characteristics, and to explore the relationship between mitochondrial genome molecular evolution, genome architecture, and molecular function. We sequenced complete mitochondrial genomes from Slowinski's corn snake (Pantherophis slowinskii) and two cottonmouths (Agkistrodon piscivorus) to complement previously existing mitochondrial genomes, and to provide an improved comparative view of how genome architecture affects molecular evolution at contrasting levels of divergence. Results We present a Bayesian genetic approach that suggests that the duplicated control region can function as an additional origin of heavy strand replication. The two control regions also appear to have different intra-specific versus inter-specific evolutionary dynamics that may be associated with complex modes of concerted evolution. We find that different genomic regions have experienced substantial accelerated evolution along early branches in snakes, with different genes having experienced dramatic accelerations along specific branches. Some of these accelerations appear to coincide with, or subsequent to, the shortening of various mitochondrial genes and the duplication of the control region and flanking tRNAs. Conclusion Fluctuations in the strength and pattern of selection during snake evolution have had widely varying gene-specific effects on substitution rates, and these rate accelerations may have been functionally related to unusual changes in genomic architecture. The among-lineage and

  15. Telomere maintenance through recruitment of internal genomic regions

    PubMed Central

    Seo, Beomseok; Kim, Chuna; Hills, Mark; Sung, Sanghyun; Kim, Hyesook; Kim, Eunkyeong; Lim, Daisy S.; Oh, Hyun-Seok; Choi, Rachael Mi Jung; Chun, Jongsik; Shim, Jaegal; Lee, Junho

    2015-01-01

    Cells surviving crisis are often tumorigenic and their telomeres are commonly maintained through the reactivation of telomerase. However, surviving cells occasionally activate a recombination-based mechanism called alternative lengthening of telomeres (ALT). Here we establish stably maintained survivors in telomerase-deleted Caenorhabditis elegans that escape from sterility by activating ALT. ALT survivors trans-duplicate an internal genomic region, which is already cis-duplicated to chromosome ends, across the telomeres of all chromosomes. These ‘Template for ALT' (TALT) regions consist of a block of genomic DNA flanked by telomere-like sequences, and are different between two genetic background. We establish a model that an ancestral duplication of a donor TALT region to a proximal telomere region forms a genomic reservoir ready to be incorporated into telomeres on ALT activation. PMID:26382656

  16. Genome Sequences of Five Additional Brevibacillus laterosporus Bacteriophages

    PubMed Central

    Merrill, Bryan D.; Berg, Jordan A.; Graves, Kiel A.; Ward, Andy T.; Hilton, Jared A.; Wake, Braden N.; Grose, Julianne H.; Breakwell, Donald P.

    2015-01-01

    Brevibacillus laterosporus has been isolated from many different environments, including beehives, and produces compounds that are toxic to many organisms. Five B. laterosporus phages have been isolated previously. Here, we announce five additional phages that infect this bacterium, including the first B. laterosporus siphoviruses to be discovered. PMID:26494658

  17. The shared genomic architecture of human nucleolar organizer regions

    PubMed Central

    Floutsakou, Ioanna; Agrawal, Saumya; Nguyen, Thong T.; Seoighe, Cathal; Ganley, Austen R.D.; McStay, Brian

    2013-01-01

    The short arms of the five acrocentric human chromosomes harbor sequences that direct the assembly and function of the nucleolus, one of the key functional domains of the nucleus, yet they are absent from the current human genome assembly. Here we describe the genomic architecture of these human nucleolar organizers. Sequences distal and proximal to ribosomal gene arrays are conserved among the acrocentric chromosomes, suggesting they are sites of frequent recombination. Although previously believed to be heterochromatic, characterization of these two flanking regions reveals that they share a complex genomic architecture similar to other euchromatic regions of the genome, but they have distinct genomic characteristics. Proximal sequences are almost entirely segmentally duplicated, similar to the regions bordering centromeres. In contrast, the distal sequence is predominantly unique to the acrocentric short arms and is dominated by a very large inverted repeat. We show that the distal element is localized to the periphery of the nucleolus, where it appears to anchor the ribosomal gene repeats. This, combined with its complex chromatin structure and transcriptional activity, suggests that this region is involved in nucleolar organization. Our results provide a platform for investigating the role of NORs in nucleolar formation and function, and open the door for determining the role of these regions in the well-known empirical association of nucleoli with pathology. PMID:23990606

  18. Analysis of Human Accelerated DNA Regions Using Archaic Hominin Genomes

    PubMed Central

    Burbano, Hernán A.; Green, Richard E.; Maricic, Tomislav; Lalueza-Fox, Carles; de la Rasilla, Marco; Rosas, Antonio; Kelso, Janet; Pollard, Katherine S.; Lachmann, Michael; Pääbo, Svante

    2012-01-01

    Several previous comparisons of the human genome with other primate and vertebrate genomes identified genomic regions that are highly conserved in vertebrate evolution but fast-evolving on the human lineage. These human accelerated regions (HARs) may be regions of past adaptive evolution in humans. Alternatively, they may be the result of non-adaptive processes, such as biased gene conversion. We captured and sequenced DNA from a collection of previously published HARs using DNA from an Iberian Neandertal. Combining these new data with shotgun sequence from the Neandertal and Denisova draft genomes, we determine at least one archaic hominin allele for 84% of all positions within HARs. We find that 8% of HAR substitutions are not observed in the archaic hominins and are thus recent in the sense that the derived allele had not come to fixation in the common ancestor of modern humans and archaic hominins. Further, we find that recent substitutions in HARs tend to have come to fixation faster than substitutions elsewhere in the genome and that substitutions in HARs tend to cluster in time, consistent with an episodic rather than a clock-like process underlying HAR evolution. Our catalog of sequence changes in HARs will help prioritize them for functional studies of genomic elements potentially responsible for modern human adaptations. PMID:22412940

  19. Attenuation of Monkeypox Virus by Deletion of Genomic Regions

    PubMed Central

    Lopera, Juan G.; Falendysz, Elizabeth A.; Rocke, Tonie E.; Osorio, Jorge E.

    2015-01-01

    Monkeypox virus (MPXV) is an emerging pathogen from Africa that causes disease similar to smallpox. Two clades with different geographic distributions and virulence have been described. Here, we utilized bioinformatic tools to identify genomic regions in MPXV containing multiple virulence genes and explored their roles in pathogenicity; two selected regions were then deleted singularly or in combination. In vitro and in vivo studies indicated that these regions play a significant role in MPXV replication, tissue spread, and mortality in mice. Interestingly, while deletion of either region led to decreased virulence in mice, one region had no effect on in vitro replication. Deletion of both regions simultaneously also reduced cell culture replication and significantly increased the attenuation in vivo over either single deletion. Attenuated MPXV with genomic deletions present a safe and efficacious tool in the study of MPX pathogenesis and in the identification of genetic factors associated with virulence. PMID:25462353

  20. Attenuation of monkeypox virus by deletion of genomic regions

    USGS Publications Warehouse

    Lopera, Juan G.; Falendysz, Elizabeth A.; Rocke, Tonie E.; Osorio, Jorge E.

    2015-01-01

    Monkeypox virus (MPXV) is an emerging pathogen from Africa that causes disease similar to smallpox. Two clades with different geographic distributions and virulence have been described. Here, we utilized bioinformatic tools to identify genomic regions in MPXV containing multiple virulence genes and explored their roles in pathogenicity; two selected regions were then deleted singularly or in combination. In vitro and in vivostudies indicated that these regions play a significant role in MPXV replication, tissue spread, and mortality in mice. Interestingly, while deletion of either region led to decreased virulence in mice, one region had no effect on in vitro replication. Deletion of both regions simultaneously also reduced cell culture replication and significantly increased the attenuation in vivo over either single deletion. Attenuated MPXV with genomic deletions present a safe and efficacious tool in the study of MPX pathogenesis and in the identification of genetic factors associated with virulence.

  1. On the Additive and Dominant Variance and Covariance of Individuals Within the Genomic Selection Scope

    PubMed Central

    Vitezica, Zulma G.; Varona, Luis; Legarra, Andres

    2013-01-01

    Genomic evaluation models can fit additive and dominant SNP effects. Under quantitative genetics theory, additive or “breeding” values of individuals are generated by substitution effects, which involve both “biological” additive and dominant effects of the markers. Dominance deviations include only a portion of the biological dominant effects of the markers. Additive variance includes variation due to the additive and dominant effects of the markers. We describe a matrix of dominant genomic relationships across individuals, D, which is similar to the G matrix used in genomic best linear unbiased prediction. This matrix can be used in a mixed-model context for genomic evaluations or to estimate dominant and additive variances in the population. From the “genotypic” value of individuals, an alternative parameterization defines additive and dominance as the parts attributable to the additive and dominant effect of the markers. This approach underestimates the additive genetic variance and overestimates the dominance variance. Transforming the variances from one model into the other is trivial if the distribution of allelic frequencies is known. We illustrate these results with mouse data (four traits, 1884 mice, and 10,946 markers) and simulated data (2100 individuals and 10,000 markers). Variance components were estimated correctly in the model, considering breeding values and dominance deviations. For the model considering genotypic values, the inclusion of dominant effects biased the estimate of additive variance. Genomic models were more accurate for the estimation of variance components than their pedigree-based counterparts. PMID:24121775

  2. Genomic organization of the S-locus region of Brassica.

    PubMed

    Shiba, Hiroshi; Kenmochi, Masayuki; Sugihara, Minoru; Iwano, Megumi; Kawasaki, Shinji; Suzuki, Go; Watanabe, Masao; Isogai, Akira; Takayama, Seiji

    2003-03-01

    To gain some insights into the structure of the S-locus and the mechanisms that have kept its diversity, a 75-kb genomic fragment containing the self-incompatibility (S) locus region was isolated from the S12-haplotype of Brassica rapa and compared with those of other S-haplotypes. The region around the S determinant genes was highly polymorphic and filled with S-haplotype-specific intergenic sequences. The diverse genomic structure must contribute to the suppression of recombination at the S-locus.

  3. Genome-wide identification of hypoxia-induced enhancer regions

    PubMed Central

    Preston, Jessica L.; Randel, Melissa A.; Johnson, Eric A.

    2015-01-01

    Here we present a genome-wide method for de novo identification of enhancer regions. This approach enables massively parallel empirical investigation of DNA sequences that mediate transcriptional activation and provides a platform for discovery of regulatory modules capable of driving context-specific gene expression. The method links fragmented genomic DNA to the transcription of randomer molecule identifiers and measures the functional enhancer activity of the library by massively parallel sequencing. We transfected a Drosophila melanogaster library into S2 cells in normoxia and hypoxia, and assayed 4,599,881 genomic DNA fragments in parallel. The locations of the enhancer regions strongly correlate with genes up-regulated after hypoxia and previously described enhancers. Novel enhancer regions were identified and integrated with RNAseq data and transcription factor motifs to describe the hypoxic response on a genome-wide basis as a complex regulatory network involving multiple stress-response pathways. This work provides a novel method for high-throughput assay of enhancer activity and the genome-scale identification of 31 hypoxia-activated enhancers in Drosophila. PMID:26713262

  4. Can metabolomics in addition to genomics add to prognostic and predictive information in breast cancer?

    PubMed

    Howell, Anthony

    2010-11-16

    Genomic data from breast cancers provide additional prognostic and predictive information that is beginning to be used for patient management. The question arises whether additional information derived from other 'omic' approaches such as metabolomics can provide additional information. In an article published this month in BMC Cancer, Borgan et al. add metabolomic information to genomic measures in breast tumours and demonstrate, for the first time, that it may be possible to further define subgroups of patients which could be of value clinically. See research article: http://www.biomedcentral.com/1471-2407/10/628.

  5. Nucleotide diversity analysis highlights functionally important genomic regions

    PubMed Central

    Tatarinova, Tatiana V.; Chekalin, Evgeny; Nikolsky, Yuri; Bruskin, Sergey; Chebotarov, Dmitry; McNally, Kenneth L.; Alexandrov, Nickolai

    2016-01-01

    We analyzed functionality and relative distribution of genetic variants across the complete Oryza sativa genome, using the 40 million single nucleotide polymorphisms (SNPs) dataset from the 3,000 Rice Genomes Project (http://snp-seek.irri.org), the largest and highest density SNP collection for any higher plant. We have shown that the DNA-binding transcription factors (TFs) are the most conserved group of genes, whereas kinases and membrane-localized transporters are the most variable ones. TFs may be conserved because they belong to some of the most connected regulatory hubs that modulate transcription of vast downstream gene networks, whereas signaling kinases and transporters need to adapt rapidly to changing environmental conditions. In general, the observed profound patterns of nucleotide variability reveal functionally important genomic regions. As expected, nucleotide diversity is much higher in intergenic regions than within gene bodies (regions spanning gene models), and protein-coding sequences are more conserved than untranslated gene regions. We have observed a sharp decline in nucleotide diversity that begins at about 250 nucleotides upstream of the transcription start and reaches minimal diversity exactly at the transcription start. We found the transcription termination sites to have remarkably symmetrical patterns of SNP density, implying presence of functional sites near transcription termination. Also, nucleotide diversity was significantly lower near 3′ UTRs, the area rich with regulatory regions. PMID:27774999

  6. Regions of Homozygosity in the Porcine Genome: Consequence of Demography and the Recombination Landscape

    PubMed Central

    Bosse, Mirte; Megens, Hendrik-Jan; Madsen, Ole; Paudel, Yogesh; Frantz, Laurent A. F.; Schook, Lawrence B.; Crooijmans, Richard P. M. A.; Groenen, Martien A. M.

    2012-01-01

    Inbreeding has long been recognized as a primary cause of fitness reduction in both wild and domesticated populations. Consanguineous matings cause inheritance of haplotypes that are identical by descent (IBD) and result in homozygous stretches along the genome of the offspring. Size and position of regions of homozygosity (ROHs) are expected to correlate with genomic features such as GC content and recombination rate, but also direction of selection. Thus, ROHs should be non-randomly distributed across the genome. Therefore, demographic history may not fully predict the effects of inbreeding. The porcine genome has a relatively heterogeneous distribution of recombination rate, making Sus scrofa an excellent model to study the influence of both recombination landscape and demography on genomic variation. This study utilizes next-generation sequencing data for the analysis of genomic ROH patterns, using a comparative sliding window approach. We present an in-depth study of genomic variation based on three different parameters: nucleotide diversity outside ROHs, the number of ROHs in the genome, and the average ROH size. We identified an abundance of ROHs in all genomes of multiple pigs from commercial breeds and wild populations from Eurasia. Size and number of ROHs are in agreement with known demography of the populations, with population bottlenecks highly increasing ROH occurrence. Nucleotide diversity outside ROHs is high in populations derived from a large ancient population, regardless of current population size. In addition, we show an unequal genomic ROH distribution, with strong correlations of ROH size and abundance with recombination rate and GC content. Global gene content does not correlate with ROH frequency, but some ROH hotspots do contain positive selected genes in commercial lines and wild populations. This study highlights the importance of the influence of demography and recombination on homozygosity in the genome to understand the effects of

  7. Evolutionary history of the ABCB2 genomic region in teleosts

    USGS Publications Warehouse

    Palti, Y.; Rodriguez, M.F.; Gahr, S.A.; Hansen, J.D.

    2007-01-01

    Gene duplication, silencing and translocation have all been implicated in shaping the unique genomic architecture of the teleost MH regions. Previously, we demonstrated that trout possess five unlinked regions encoding MH genes. One of these regions harbors ABCB2 which in all other vertebrate classes is found in the MHC class II region. In this study, we sequenced a BAC contig for the trout ABCB2 region. Analysis of this region revealed the presence of genes homologous to those located in the human class II (ABCB2, BRD2, ??DAA), extended class II (RGL2, PHF1, SYGP1) and class III (PBX2, Notch-L) regions. The organization and syntenic relationships of this region were then compared to similar regions in humans, Tetraodon and zebrafish to learn more about the evolutionary history of this region. Our analysis indicates that this region was generated during the teleost-specific duplication event while also providing insight about potential MH paralogous regions in teleosts. ?? 2006 Elsevier Ltd. All rights reserved.

  8. GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research.

    PubMed

    Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

    2016-06-01

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome), extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a), as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions) of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04), Fedora (23), CentOS (7.1.1503) and Mac OS X (10.7). Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/).

  9. GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research.

    PubMed

    Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

    2016-06-01

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome), extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a), as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions) of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04), Fedora (23), CentOS (7.1.1503) and Mac OS X (10.7). Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/). PMID

  10. GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research

    PubMed Central

    Zhang, Hao; van Diepeningen, Anne D.; van der Lee, Theo A. J.; Waalwijk, Cees; de Hoog, G. Sybren

    2016-01-01

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome), extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a), as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions) of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04), Fedora (23), CentOS (7.1.1503) and Mac OS X (10.7). Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/). PMID

  11. Nucleolar organizer regions: genomic 'dark matter' requiring illumination.

    PubMed

    McStay, Brian

    2016-07-15

    Nucleoli form around tandem arrays of a ribosomal gene repeat, termed nucleolar organizer regions (NORs). During metaphase, active NORs adopt a characteristic undercondensed morphology. Recent evidence indicates that the HMG-box-containing DNA-binding protein UBF (upstream binding factor) is directly responsible for this morphology and provides a mitotic bookmark to ensure rapid nucleolar formation beginning in telophase in human cells. This is likely to be a widely employed strategy, as UBF is present throughout metazoans. In higher eukaryotes, NORs are typically located within regions of chromosomes that form perinucleolar heterochromatin during interphase. Typically, the genomic architecture of NORs and the chromosomal regions within which they lie is very poorly described, yet recent evidence points to a role for context in their function. In Arabidopsis, NOR silencing appears to be controlled by sequences outside the rDNA (ribosomal DNA) array. Translocations reveal a role for context in the expression of the NOR on the X chromosome in Drosophila Recent work has begun on characterizing the genomic architecture of human NORs. A role for distal sequences located in perinucleolar heterochromatin has been inferred, as they exhibit a complex transcriptionally active chromatin structure. Links between rDNA genomic stability and aging in Saccharomyces cerevisiae are now well established, and indications are emerging that this is important in aging and replicative senescence in higher eukaryotes. This, combined with the fact that rDNA arrays are recombinational hot spots in cancer cells, has focused attention on DNA damage responses in NORs. The introduction of DNA double-strand breaks into rDNA arrays leads to a dramatic reorganization of nucleolar structure. Damaged rDNA repeats move from the nucleolar interior to form caps at the nucleolar periphery, presumably to facilitate repair, suggesting that the chromosomal context of human NORs contributes to their genomic

  12. Nucleolar organizer regions: genomic 'dark matter' requiring illumination.

    PubMed

    McStay, Brian

    2016-07-15

    Nucleoli form around tandem arrays of a ribosomal gene repeat, termed nucleolar organizer regions (NORs). During metaphase, active NORs adopt a characteristic undercondensed morphology. Recent evidence indicates that the HMG-box-containing DNA-binding protein UBF (upstream binding factor) is directly responsible for this morphology and provides a mitotic bookmark to ensure rapid nucleolar formation beginning in telophase in human cells. This is likely to be a widely employed strategy, as UBF is present throughout metazoans. In higher eukaryotes, NORs are typically located within regions of chromosomes that form perinucleolar heterochromatin during interphase. Typically, the genomic architecture of NORs and the chromosomal regions within which they lie is very poorly described, yet recent evidence points to a role for context in their function. In Arabidopsis, NOR silencing appears to be controlled by sequences outside the rDNA (ribosomal DNA) array. Translocations reveal a role for context in the expression of the NOR on the X chromosome in Drosophila Recent work has begun on characterizing the genomic architecture of human NORs. A role for distal sequences located in perinucleolar heterochromatin has been inferred, as they exhibit a complex transcriptionally active chromatin structure. Links between rDNA genomic stability and aging in Saccharomyces cerevisiae are now well established, and indications are emerging that this is important in aging and replicative senescence in higher eukaryotes. This, combined with the fact that rDNA arrays are recombinational hot spots in cancer cells, has focused attention on DNA damage responses in NORs. The introduction of DNA double-strand breaks into rDNA arrays leads to a dramatic reorganization of nucleolar structure. Damaged rDNA repeats move from the nucleolar interior to form caps at the nucleolar periphery, presumably to facilitate repair, suggesting that the chromosomal context of human NORs contributes to their genomic

  13. Recurrent Somatic Mutations in Regulatory Regions of Human Cancer Genomes

    PubMed Central

    Melton, Collin; Reuter, Jason A.; Spacek, Damek V.; Snyder, Michael

    2015-01-01

    Aberrant regulation of gene expression in cancer can promote survival and proliferation of cancer cells. Here we integrate TCGA whole genome sequencing data of 436 patients from eight cancer subtypes with ENCODE and other regulatory annotations to identify point mutations in regulatory regions. We find evidence for positive selection of mutations in transcription factor binding sites, consistent with these sites regulating important cancer cell functions. Using a novel method that adjusts for sample- and genomic locus-specific mutation rate, we identify recurrently mutated sites across cancer patients. Mutated regulatory sites include known sites in the TERT promoter and many novel sites, including a subset in proximity to cancer genes. In reporter assays, two novel sites display decreased enhancer activity upon mutation. These data demonstrate that many regulatory regions contain mutations under selective pressure and suggest a larger role for regulatory mutations in cancer than previously appreciated. PMID:26053494

  14. Global assessment of genomic regions required for growth in Mycobacterium tuberculosis.

    PubMed

    Zhang, Yanjia J; Ioerger, Thomas R; Huttenhower, Curtis; Long, Jarukit E; Sassetti, Christopher M; Sacchettini, James C; Rubin, Eric J

    2012-09-01

    Identifying genomic elements required for viability is central to our understanding of the basic physiology of bacterial pathogens. Recently, the combination of high-density mutagenesis and deep sequencing has allowed for the identification of required and conditionally required genes in many bacteria. Genes, however, make up only a part of the complex genomes of important bacterial pathogens. Here, we use an unbiased analysis to comprehensively identify genomic regions, including genes, domains, and intergenic elements, required for the optimal growth of Mycobacterium tuberculosis, a major global health pathogen. We found that several proteins jointly contain both domains required for optimal growth and domains that are dispensable. In addition, many non-coding regions, including regulatory elements and non-coding RNAs, are critical for mycobacterial growth. Our analysis shows that the genetic requirements for growth are more complex than can be appreciated using gene-centric analysis.

  15. Genomic Regions Associated with Multiple Sclerosis Are Active in B Cells

    PubMed Central

    Disanto, Giulio; Sandve, Geir Kjetil; Berlanga-Taylor, Antonio J.; Morahan, Julia M.; Dobson, Ruth; Giovannoni, Gavin; Ramagopalan, Sreeram V.

    2012-01-01

    More than 50 genomic regions have now been shown to influence the risk of multiple sclerosis (MS). However, the mechanisms of action, and the cell types in which these associated variants act at the molecular level remain largely unknown. This is especially true for associated regions containing no known genes. Given the evidence for a role for B cells in MS, we hypothesized that MS associated genomic regions co-localized with regions which are functionally active in B cells. We used publicly available data on 1) MS associated regions and single nucleotide polymorphisms (SNPs) and 2) chromatin profiling in B cells as well as three additional cell types thought to be unrelated to MS (hepatocytes, fibroblasts and keratinocytes). Genomic intervals and SNPs were tested for overlap using the Genomic Hyperbrowser. We found that MS associated regions are significantly enriched in strong enhancer, active promoter and strong transcribed regions (p = 0.00005) and that this overlap is significantly higher in B cells than control cells. In addition, MS associated SNPs also land in active promoter (p = 0.00005) and enhancer regions more than expected by chance (strong enhancer p = 0.0006; weak enhancer p = 0.00005). These results confirm the important role of the immune system and specifically B cells in MS and suggest that MS risk variants exert a gene regulatory role. Previous studies assessing MS risk variants in T cells may be missing important effects in B cells. Similar analyses in other immunological cell types relevant to MS and functional studies are necessary to fully elucidate how genes contribute to MS pathogenesis. PMID:22396755

  16. Estimating Additive and Non-Additive Genetic Variances and Predicting Genetic Merits Using Genome-Wide Dense Single Nucleotide Polymorphism Markers

    PubMed Central

    Su, Guosheng; Christensen, Ole F.; Ostersen, Tage; Henryon, Mark; Lund, Mogens S.

    2012-01-01

    Non-additive genetic variation is usually ignored when genome-wide markers are used to study the genetic architecture and genomic prediction of complex traits in human, wild life, model organisms or farm animals. However, non-additive genetic effects may have an important contribution to total genetic variation of complex traits. This study presented a genomic BLUP model including additive and non-additive genetic effects, in which additive and non-additive genetic relation matrices were constructed from information of genome-wide dense single nucleotide polymorphism (SNP) markers. In addition, this study for the first time proposed a method to construct dominance relationship matrix using SNP markers and demonstrated it in detail. The proposed model was implemented to investigate the amounts of additive genetic, dominance and epistatic variations, and assessed the accuracy and unbiasedness of genomic predictions for daily gain in pigs. In the analysis of daily gain, four linear models were used: 1) a simple additive genetic model (MA), 2) a model including both additive and additive by additive epistatic genetic effects (MAE), 3) a model including both additive and dominance genetic effects (MAD), and 4) a full model including all three genetic components (MAED). Estimates of narrow-sense heritability were 0.397, 0.373, 0.379 and 0.357 for models MA, MAE, MAD and MAED, respectively. Estimated dominance variance and additive by additive epistatic variance accounted for 5.6% and 9.5% of the total phenotypic variance, respectively. Based on model MAED, the estimate of broad-sense heritability was 0.506. Reliabilities of genomic predicted breeding values for the animals without performance records were 28.5%, 28.8%, 29.2% and 29.5% for models MA, MAE, MAD and MAED, respectively. In addition, models including non-additive genetic effects improved unbiasedness of genomic predictions. PMID:23028912

  17. Chromosome region-specific libraries for human genome analysis

    SciTech Connect

    Kao, Fa-Ten.

    1991-01-01

    We have made important progress since the beginning of the current grant year. We have further developed the microdissection and PCR- assisted microcloning techniques using the linker-adaptor method. We have critically evaluated the microdissection libraries constructed by this microtechnology and proved that they are of high quality. We further demonstrated that these microdissection clones are useful in identifying corresponding YAC clones for a thousand-fold expansion of the genomic coverage and for contig construction. We are also improving the technique of cloning the dissected fragments in test tube by the TDT method. We are applying both of these PCR cloning technique to human chromosomes 2 and 5 to construct region-specific libraries for physical mapping purposes of LLNL and LANL. Finally, we are exploring efficient procedures to use unique sequence microclones to isolate cDNA clones from defined chromosomal regions as valuable resources for identifying expressed gene sequences in the human genome. We believe that we are making important progress under the auspices of this DOE human genome program grant and we will continue to make significant contributions in the coming year. 4 refs., 4 figs.

  18. Genome-wide assessment of recurrent genomic imbalances in canine leukemia identifies evolutionarily conserved regions for subtype differentiation.

    PubMed

    Roode, Sarah C; Rotroff, Daniel; Avery, Anne C; Suter, Steven E; Bienzle, Dorothee; Schiffman, Joshua D; Motsinger-Reif, Alison; Breen, Matthew

    2015-12-01

    Leukemia in dogs is a heterogeneous disease with survival ranging from days to years, depending on the subtype. Strides have been made in both human and canine leukemia to improve classification and understanding of pathogenesis through immunophenotyping, yet classification and choosing appropriate therapy remains challenging. In this study, we assessed 123 cases of canine leukemia (28 ALLs, 24 AMLs, 25 B-CLLs, and 46 T-CLLs) using high-resolution oligonucleotide array comparative genomic hybridization (oaCGH) to detect DNA copy number alterations (CNAs). For the first time, such data were used to identify recurrent CNAs and inclusive genes that may be potential drivers of subtype-specific pathogenesis. We performed predictive modeling to identify CNAs that could reliably differentiate acute subtypes (ALL vs. AML) and chronic subtypes (B-CLL vs. T-CLL) and used this model to differentiate cases with up to 83.3 and 95.8 % precision, respectively, based on CNAs at only one to three genomic regions. In addition, CGH datasets for canine and human leukemia were compared to reveal evolutionarily conserved copy number changes between species, including the shared gain of HSA 21q in ALL and ∼25 Mb of shared gain of HSA 12 and loss of HSA 13q14 in CLL. These findings support the use of canine leukemia as a relevant in vivo model for human leukemia and justify the need to further explore the conserved genomic regions of interest for their clinical impact.

  19. The control region of maternally and paternally inherited mitochondrial genomes of three species of the sea mussel genus Mytilus.

    PubMed

    Cao, Liqin; Ort, Brian S; Mizi, Athanasia; Pogson, Grant; Kenchington, Elen; Zouros, Eleftherios; Rodakis, George C

    2009-03-01

    Species of the mussel genus Mytilus possess maternally and paternally transmitted mitochondrial genomes. In the interbreeding taxa Mytilus edulis and M. galloprovincialis, several genomes of both types have been fully sequenced. The genome consists of the coding part (which, in addition to protein and RNA genes, contains several small noncoding sequences) and the main control region (CR), which in turn consists of three distinct parts: the first variable (VD1), the conserved (CD), and the second variable (VD2) domain. The maternal and paternal genomes are very similar in gene content and organization, even though they differ by >20% in primary sequence. They differ even more at VD1 and VD2, yet they are remarkably similar at CD. The complete sequence of a genome from the closely related species M. trossulus was previously reported and found to consist of a maternal-like coding part and a paternal-like and a maternal-like CR. From this and from the fact that it was extracted from a male individual, it was inferred that this is a genome that switched from maternal to paternal transmission. Here we provide clear evidence that this genome is the maternal genome of M. trossulus. We have found that in this genome the tRNA(Gln) in the coding region is apparently defective and that an intact copy of this tRNA occurs in the CR, that one of the two conserved domains is missing essential motifs, and that one of the two first variable domains has a high rate of divergence. These features may explain the large size and mosaic structure of the CR of the maternal genome of M. trossulus. We have also obtained CR sequences of the maternal and paternal genomes of M. californianus, a more distantly related species. We compare the control regions from all three species, focusing on the divergence among genomes of different species origin and among genomes of different transmission routes.

  20. A sea urchin genome project: sequence scan, virtual map, and additional resources.

    PubMed

    Cameron, R A; Mahairas, G; Rast, J P; Martinez, P; Biondi, T R; Swartzell, S; Wallace, J C; Poustka, A J; Livingston, B T; Wray, G A; Ettensohn, C A; Lehrach, H; Britten, R J; Davidson, E H; Hood, L

    2000-08-15

    Results of a first-stage Sea Urchin Genome Project are summarized here. The species chosen was Strongylocentrotus purpuratus, a research model of major importance in developmental and molecular biology. A virtual map of the genome was constructed by sequencing the ends of 76,020 bacterial artificial chromosome (BAC) recombinants (average length, 125 kb). The BAC-end sequence tag connectors (STCs) occur an average of 10 kb apart, and, together with restriction digest patterns recorded for the same BAC clones, they provide immediate access to contigs of several hundred kilobases surrounding any gene of interest. The STCs survey >5% of the genome and provide the estimate that this genome contains approximately 27,350 protein-coding genes. The frequency distribution and canonical sequences of all middle and highly repetitive sequence families in the genome were obtained from the STCs as well. The 500-kb Hox gene complex of this species is being sequenced in its entirety. In addition, arrayed cDNA libraries of >10(5) clones each were constructed from every major stage of embryogenesis, several individual cell types, and adult tissues and are available to the community. The accumulated STC data and an expanding expressed sequence tag database (at present including >12, 000 sequences) have been reported to GenBank and are accessible on public web sites.

  1. Genome-wide association and genome partitioning reveal novel genomic regions underlying variation in gastrointestinal nematode burden in a wild bird.

    PubMed

    Wenzel, Marius A; James, Marianne C; Douglas, Alex; Piertney, Stuart B

    2015-08-01

    Identifying the genetic architecture underlying complex phenotypes is a notoriously difficult problem that often impedes progress in understanding adaptive eco-evolutionary processes in natural populations. Host-parasite interactions are fundamentally important drivers of evolutionary processes, but a lack of understanding of the genes involved in the host's response to chronic parasite insult makes it particularly difficult to understand the mechanisms of host life history trade-offs and the adaptive dynamics involved. Here, we examine the genetic basis of gastrointestinal nematode (Trichostrongylus tenuis) burden in 695 red grouse (Lagopus lagopus scotica) individuals genotyped at 384 genome-wide SNPs. We first use genome-wide association to identify individual SNPs associated with nematode burden. We then partition genome-wide heritability to identify chromosomes with greater heritability than expected from gene content, due to harbouring a multitude of additive SNPs with individually undetectable effects. We identified five SNPs on five chromosomes that accounted for differences of up to 556 worms per bird, but together explained at best 4.9% of the phenotypic variance. These SNPs were closely linked to genes representing a range of physiological processes including the immune system, protein degradation and energy metabolism. Genome partitioning indicated genome-wide heritability of up to 29% and three chromosomes with excess heritability of up to 4.3% (total 8.9%). These results implicate SNPs and novel genomic regions underlying nematode burden in this system and suggest that this phenotype is somewhere between being based on few large-effect genes (oligogenic) and based on a large number of genes with small individual but large combined effects (polygenic). PMID:26179597

  2. A Genome-Wide Association Study Identifies Multiple Regions Associated with Head Size in Catfish

    PubMed Central

    Geng, Xin; Liu, Shikai; Yao, Jun; Bao, Lisui; Zhang, Jiaren; Li, Chao; Wang, Ruijia; Sha, Jin; Zeng, Peng; Zhi, Degui; Liu, Zhanjiang

    2016-01-01

    Skull morphology is fundamental to evolution and the biological adaptation of species to their environments. With aquaculture fish species, head size is also important for economic reasons because it has a direct impact on fillet yield. However, little is known about the underlying genetic basis of head size. Catfish is the primary aquaculture species in the United States. In this study, we performed a genome-wide association study using the catfish 250K SNP array with backcross hybrid catfish to map the QTL for head size (head length, head width, and head depth). One significantly associated region on linkage group (LG) 7 was identified for head length. In addition, LGs 7, 9, and 16 contain suggestively associated regions for head length. For head width, significantly associated regions were found on LG9, and additional suggestively associated regions were identified on LGs 5 and 7. No region was found associated with head depth. Head size genetic loci were mapped in catfish to genomic regions with candidate genes involved in bone development. Comparative analysis indicated that homologs of several candidate genes are also involved in skull morphology in various other species ranging from amphibian to mammalian species, suggesting possible evolutionary conservation of those genes in the control of skull morphologies. PMID:27558670

  3. Exploring the diploid wheat ancestral A genome through sequence comparison at the high-molecular-weight glutenin locus region.

    PubMed

    Dong, Lingli; Huo, Naxin; Wang, Yi; Deal, Karin; Luo, Ming-Cheng; Wang, Daowen; Anderson, Olin D; Gu, Yong Qiang

    2012-12-01

    The polyploid nature of hexaploid wheat (T. aestivum, AABBDD) often represents a great challenge in various aspects of research including genetic mapping, map-based cloning of important genes, and sequencing and accurately assembly of its genome. To explore the utility of ancestral diploid species of polyploid wheat, sequence variation of T. urartu (A(u)A(u)) was analyzed by comparing its 277-kb large genomic region carrying the important Glu-1 locus with the homologous regions from the A genomes of the diploid T. monococcum (A(m)A(m)), tetraploid T. turgidum (AABB), and hexaploid T. aestivum (AABBDD). Our results revealed that in addition to a high degree of the gene collinearity, nested retroelement structures were also considerably conserved among the A(u) genome and the A genomes in polyploid wheats, suggesting that the majority of the repetitive sequences in the A genomes of polyploid wheats originated from the diploid A(u) genome. The difference in the compared region between A(u) and A is mainly caused by four differential TE insertion and two deletion events between these genomes. The estimated divergence time of A genomes calculated on nucleotide substitution rate in both shared TEs and collinear genes further supports the closer evolutionary relationship of A to A(u) than to A(m). The structure conservation in the repetitive regions promoted us to develop repeat junction markers based on the A(u) sequence for mapping the A genome in hexaploid wheat. Eighty percent of these repeat junction markers were successfully mapped to the corresponding region in hexaploid wheat, suggesting that T. urartu could serve as a useful resource for developing molecular markers for genetic and breeding studies in hexaploid wheat.

  4. Genome Scans for Transmission Ratio Distortion Regions in Mice

    PubMed Central

    Casellas, Joaquim; Gularte, Rodrigo J.; Farber, Charles R.; Varona, Luis; Mehrabian, Margarete; Schadt, Eric E.; Lusis, Aldon J.; Attie, Alan D.; Yandell, Brian S.; Medrano, Juan F.

    2012-01-01

    Transmission ratio distortion (TRD) is the departure from the expected genotypic frequencies under Mendelian inheritance. This departure can be due to multiple physiological mechanisms during gametogenesis, fertilization, fetal and embryonic development, and early neonatal life. Although a few TRD loci have been reported in mouse, inheritance patterns have never been evaluated for TRD. In this article, we developed a Bayesian binomial model accounting for additive and dominant deviation TRD mechanisms. Moreover, this model was used to perform genome-wide scans for TRD quantitative trait loci (QTL) on six F2 mouse crosses involving between 296 and 541 mice and between 72 and 1854 genetic markers. Statistical significance of each model was checked at each genetic marker with Bayes factors. Genome scans revealed overdominance TRD QTL located in mouse chromosomes 1, 2, 12, 13, and 14 and additive TRD QTL in mouse chromosomes 2, 3, and 15, although these results did not replicate across mouse crosses. This research contributes new statistical tools for the analysis of specific genetic patterns involved in TRD in F2 populations, our results suggesting a relevant incidence of TRD phenomena in mouse with important implications for both statistical analyses and biological research. PMID:22367040

  5. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates

    PubMed Central

    Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.

    2015-01-01

    Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089

  6. Genomic Regions Associated With Interspecies Communication in Dogs Contain Genes Related to Human Social Disorders

    PubMed Central

    Persson, Mia E.; Wright, Dominic; Roth, Lina S. V.; Batakis, Petros; Jensen, Per

    2016-01-01

    Unlike their wolf ancestors, dogs have unique social skills for communicating and cooperating with humans. Previously, significant heritabilities for human-directed social behaviors have been found in laboratory beagles. Here, a Genome-Wide Association Study identified two genomic regions associated with dog’s human-directed social behaviors. We recorded the propensity of laboratory beagles, bred, kept and handled under standardized conditions, to initiate physical interactions with a human during an unsolvable problem-task, and 190 individuals were genotyped with an HD Canine SNP-chip. One genetic marker on chromosome 26 within the SEZ6L gene was significantly associated with time spent close to, and in physical contact with, the human. Two suggestive markers on chromosome 26, located within the ARVCF gene, were also associated with human contact seeking. Strikingly, four additional genes present in the same linkage blocks affect social abilities in humans, e.g., SEZ6L has been associated with autism and COMT affects aggression in adolescents with ADHD. This is, to our knowledge, the first genome-wide study presenting candidate genomic regions for dog sociability and inter-species communication. These results advance our understanding of dog domestication and raise the use of the dog as a novel model system for human social disorders. PMID:27685260

  7. Evidence of structural genomic region recombination in Hepatitis C virus

    PubMed Central

    Cristina, Juan; Colina, Rodney

    2006-01-01

    Background/Aim Hepatitis C virus (HCV) has been the subject of intense research and clinical investigation as its major role in human disease has emerged. Although homologous recombination has been demonstrated in many members of the family Flaviviridae, to which HCV belongs, there have been few studies reporting recombination on natural populations of HCV. Recombination break-points have been identified in non structural proteins of the HCV genome. Given the implications that recombination has for RNA virus evolution, it is clearly important to determine the extent to which recombination plays a role in HCV evolution. In order to gain insight into these matters, we have performed a phylogenetic analysis of 89 full-length HCV strains from all types and sub-types, isolated all over the world, in order to detect possible recombination events. Method Putative recombinant sequences were identified with the use of SimPlot program. Recombination events were confirmed by bootscaning, using putative recombinant sequence as a query. Results Two crossing over events were identified in the E1/E2 structural region of an intra-typic (1a/1c) recombinant strain. Conclusion Only one of 89 full-length strains studied resulted to be a recombinant HCV strain, revealing that homologous recombination does not play an extensive roll in HCV evolution. Nevertheless, this mechanism can not be denied as a source for generating genetic diversity in natural populations of HCV, since a new intra-typic recombinant strain was found. Moreover, the recombination break-points were found in the structural region of the HCV genome. PMID:16813646

  8. Multiple Comparison Analysis of Two New Genomic Sequences of ILTV Strains from China with Other Strains from Different Geographic Regions.

    PubMed

    Zhao, Yan; Kong, Congcong; Wang, Yunfeng

    2015-01-01

    To date, twenty complete genome sequences of ILTV strains have been published in GenBank, including one strain from China, and nineteen strains from Australian and the United States. To investigate the genomic information on ILTVs from different geographic regions, two additional individual complete genome sequences of WG and K317 strains from China were determined. The genomes of WG and K317 strains were 153,505 and 153,639 bp in length, respectively. Alignments performed on the amino acid sequences of the twelve glycoproteins showed that 13 out of 116 mutational sites were present only among the Chinese strain WG and the Australian strains SA2 and A20. The phylogenetic tree analysis suggested that the WG strain established close relationships with the Australian strain SA2. The recombination events were detected and confirmed in different subregions of the WG strain with the sequences of SA2 and K317 strains as parental. In this study, two new complete genome sequences of Chinese ILTV strains were used in comparative analysis with other complete genome sequences of ILTV strains from China, the United States, and Australia. The analysis of genome comparison, phylogenetic trees, and recombination events showed close relationships among the Chinese strain WG and the Australian strains SA2. The information of the two new complete genome sequences from China will help to facilitate the analysis of phylogenetic relationships and the molecular differences among ILTV strains from different geographic regions.

  9. Multiple Comparison Analysis of Two New Genomic Sequences of ILTV Strains from China with Other Strains from Different Geographic Regions

    PubMed Central

    Zhao, Yan; Kong, Congcong; Wang, Yunfeng

    2015-01-01

    To date, twenty complete genome sequences of ILTV strains have been published in GenBank, including one strain from China, and nineteen strains from Australian and the United States. To investigate the genomic information on ILTVs from different geographic regions, two additional individual complete genome sequences of WG and K317 strains from China were determined. The genomes of WG and K317 strains were 153,505 and 153,639 bp in length, respectively. Alignments performed on the amino acid sequences of the twelve glycoproteins showed that 13 out of 116 mutational sites were present only among the Chinese strain WG and the Australian strains SA2 and A20. The phylogenetic tree analysis suggested that the WG strain established close relationships with the Australian strain SA2. The recombination events were detected and confirmed in different subregions of the WG strain with the sequences of SA2 and K317 strains as parental. In this study, two new complete genome sequences of Chinese ILTV strains were used in comparative analysis with other complete genome sequences of ILTV strains from China, the United States, and Australia. The analysis of genome comparison, phylogenetic trees, and recombination events showed close relationships among the Chinese strain WG and the Australian strains SA2. The information of the two new complete genome sequences from China will help to facilitate the analysis of phylogenetic relationships and the molecular differences among ILTV strains from different geographic regions. PMID:26186451

  10. DoriC 5.0: an updated database of oriC regions in both bacterial and archaeal genomes.

    PubMed

    Gao, Feng; Luo, Hao; Zhang, Chun-Ting

    2013-01-01

    Replication of chromosomes is one of the central events in the cell cycle. Chromosome replication begins at specific sites, called origins of replication (oriCs), for all three domains of life. However, the origins of replication still remain unknown in a considerably large number of bacterial and archaeal genomes completely sequenced so far. The availability of increasing complete bacterial and archaeal genomes has created challenges and opportunities for identification of their oriCs in silico, as well as in vivo. Based on the Z-curve theory, we have developed a web-based system Ori-Finder to predict oriCs in bacterial genomes with high accuracy and reliability by taking advantage of comparative genomics, and the predicted oriC regions have been organized into an online database DoriC, which is publicly available at http://tubic.tju.edu.cn/doric/ since 2007. Five years after we constructed DoriC, the database has significant advances over the number of bacterial genomes, increasing about 4-fold. Additionally, oriC regions in archaeal genomes identified by in vivo experiments, as well as in silico analyses, have also been added to the database. Consequently, the latest release of DoriC contains oriCs for >1500 bacterial genomes and 81 archaeal genomes, respectively.

  11. DNA Replication Control Is Linked to Genomic Positioning of Control Regions in Escherichia coli

    PubMed Central

    Frimodt-Møller, Jakob; Charbon, Godefroid; Krogfelt, Karen A.; Løbner-Olesen, Anders

    2016-01-01

    Chromosome replication in Escherichia coli is in part controlled by three non-coding genomic sequences, DARS1, DARS2, and datA that modulate the activity of the initiator protein DnaA. The relative distance from oriC to the non-coding regions are conserved among E. coli species, despite large variations in genome size. Here we use a combination of i) site directed translocation of each region to new positions on the bacterial chromosome and ii) random transposon mediated translocation followed by culture evolution, to show genetic evidence for the importance of position. Here we provide evidence that the genomic locations of these regulatory sequences are important for cell cycle control and bacterial fitness. In addition, our work shows that the functionally redundant DARS1 and DARS2 regions play different roles in replication control. DARS1 is mainly involved in maintaining the origin concentration, whether DARS2 is also involved in maintaining single cell synchrony. PMID:27589233

  12. DNA Replication Control Is Linked to Genomic Positioning of Control Regions in Escherichia coli.

    PubMed

    Frimodt-Møller, Jakob; Charbon, Godefroid; Krogfelt, Karen A; Løbner-Olesen, Anders

    2016-09-01

    Chromosome replication in Escherichia coli is in part controlled by three non-coding genomic sequences, DARS1, DARS2, and datA that modulate the activity of the initiator protein DnaA. The relative distance from oriC to the non-coding regions are conserved among E. coli species, despite large variations in genome size. Here we use a combination of i) site directed translocation of each region to new positions on the bacterial chromosome and ii) random transposon mediated translocation followed by culture evolution, to show genetic evidence for the importance of position. Here we provide evidence that the genomic locations of these regulatory sequences are important for cell cycle control and bacterial fitness. In addition, our work shows that the functionally redundant DARS1 and DARS2 regions play different roles in replication control. DARS1 is mainly involved in maintaining the origin concentration, whether DARS2 is also involved in maintaining single cell synchrony. PMID:27589233

  13. Conservation of Microstructure between a Sequenced Region of the Genome of Rice and Multiple Segments of the Genome of Arabidopsis thaliana

    PubMed Central

    Mayer, Klaus; Murphy, George; Tarchini, Renato; Wambutt, Rolf; Volckaert, Guido; Pohl, Thomas; Düsterhöft, Andreas; Stiekema, Willem; Entian, Karl-Dieter; Terryn, Nancy; Lemcke, Kai; Haase, Dirk; Hall, Caroline R.; van Dodeweerd, Anne-Marie; Tingey, Scott V.; Mewes, Hans-Werner; Bevan, Michael W.; Bancroft, Ian

    2001-01-01

    The nucleotide sequence was determined for a 340-kb segment of rice chromosome 2, revealing 56 putative protein-coding genes. This represents a density of one gene per 6.1 kb, which is higher than was reported for a previously sequenced segment of the rice genome. Sixteen of the putative genes were supported by matches to ESTs. The predicted products of 29 of the putative genes showed similarity to known proteins, and a further 17 genes showed similarity only to predicted or hypothetical proteins identified in genome sequence data. The region contains a few transposable elements: one retrotransposon, and one transposon. The segment of the rice genome studied had previously been identified as representing a part of rice chromosome 2 that may be homologous to a segment of Arabidopsis chromosome 4. We confirmed the conservation of gene content and order between the two genome segments. In addition, we identified a further four segments of the Arabidopsis genome that contain conserved gene content and order. In total, 22 of the 56 genes identified in the rice genome segment were represented in this set of Arabidopsis genome segments, with at least five genes present, in conserved order, in each segment. These data are consistent with the hypothesis that the Arabidopsis genome has undergone multiple duplication events. Our results demonstrate that conservation of the genome microstructure can be identified even between monocot and dicot species. However, the frequent occurrence of duplication, and subsequent microstructure divergence, within plant genomes may necessitate the integration of subsets of genes present in multiple redundant segments to deduce evolutionary relationships and identify orthologous genes. PMID:11435398

  14. Additives

    NASA Technical Reports Server (NTRS)

    Smalheer, C. V.

    1973-01-01

    The chemistry of lubricant additives is discussed to show what the additives are chemically and what functions they perform in the lubrication of various kinds of equipment. Current theories regarding the mode of action of lubricant additives are presented. The additive groups discussed include the following: (1) detergents and dispersants, (2) corrosion inhibitors, (3) antioxidants, (4) viscosity index improvers, (5) pour point depressants, and (6) antifouling agents.

  15. Selection for Unequal Densities of Sigma70 Promoter-like Signalsin Different Regions of Large Bacterial Genomes

    SciTech Connect

    Huerta, Araceli M.; Francino, M. Pilar; Morett, Enrique; Collado-Vides, Julio

    2006-03-01

    The evolutionary processes operating in the DNA regions that participate in the regulation of gene expression are poorly understood. In Escherichia coli, we have established a sequence pattern that distinguishes regulatory from nonregulatory regions. The density of promoter-like sequences, that are recognizable by RNA polymerase and may function as potential promoters, is high within regulatory regions, in contrast to coding regions and regions located between convergently-transcribed genes. Moreover, functional promoter sites identified experimentally are often found in the subregions of highest density of promoter-like signals, even when individual sites with higher binding affinity for RNA polymerase exist elsewhere within the regulatory region. In order to investigate the generality of this pattern, we have used position weight matrices describing the -35 and -10 promoter boxes of E. coli to search for these motifs in 43 additional genomes belonging to most established bacterial phyla, after specific calibration of the matrices according to the base composition of the noncoding regions of each genome. We have found that all bacterial species analyzed contain similar promoter-like motifs, and that, in most cases, these motifs follow the same genomic distribution observed in E. coli. Differential densities between regulatory and nonregulatory regions are detectable in most bacterial genomes, with the exception of those that have experienced evolutionary extreme genome reduction. Thus, the phylogenetic distribution of this pattern mirrors that of genes and other genomic features that require weak selection to be effective in order to persist. On this basis, we suggest that the loss of differential densities in the reduced genomes of host-restricted pathogens and symbionts is the outcome of a process of genome degradation resulting from the decreased efficiency of purifying selection in highly structured small populations. This implies that the differential

  16. Targeted gene addition into a specified location in the human genome using designed zinc finger nucleases

    PubMed Central

    Moehle, Erica A.; Rock, Jeremy M.; Lee, Ya-Li; Jouvenot, Yann; DeKelver, Russell C.; Gregory, Philip D.; Urnov, Fyodor D.; Holmes, Michael C.

    2007-01-01

    Efficient incorporation of novel DNA sequences into a specific site in the genome of living human cells remains a challenge despite its potential utility to genetic medicine, biotechnology, and basic research. We find that a precisely placed double-strand break induced by engineered zinc finger nucleases (ZFNs) can stimulate integration of long DNA stretches into a predetermined genomic location, resulting in high-efficiency site-specific gene addition. Using an extrachromosomal DNA donor carrying a 12-bp tag, a 900-bp ORF, or a 1.5-kb promoter-transcription unit flanked by locus-specific homology arms, we find targeted integration frequencies of 15%, 6%, and 5%, respectively, within 72 h of treatment, and with no selection for the desired event. Importantly, we find that the integration event occurs in a homology-directed manner and leads to the accurate reconstruction of the donor-specified genotype at the endogenous chromosomal locus, and hence presumably results from synthesis-dependent strand annealing repair of the break using the donor DNA as a template. This site-specific gene addition occurs with no measurable increase in the rate of random integration. Remarkably, we also find that ZFNs can drive the addition of an 8-kb sequence carrying three distinct promoter-transcription units into an endogenous locus at a frequency of 6%, also in the absence of any selection. These data reveal the surprising versatility of the specialized polymerase machinery involved in double-strand break repair, illuminate a powerful approach to mammalian cell engineering, and open the possibility of ZFN-driven gene addition therapy for human genetic disease. PMID:17360608

  17. Genomic Regions Associated with Feed Efficiency Indicator Traits in an Experimental Nellore Cattle Population

    PubMed Central

    Olivieri, Bianca Ferreira; Mercadante, Maria Eugênia Zerlotti; Cyrillo, Joslaine Noely dos Santos Gonçalves; Branco, Renata Helena; Bonilha, Sarah Figueiredo Martins; de Albuquerque, Lucia Galvão; Silva, Rafael Medeiros de Oliveira; Baldi, Fernando

    2016-01-01

    The objective of this study was to identify genomic regions and metabolic pathways associated with dry matter intake, average daily gain, feed efficiency and residual feed intake in an experimental Nellore cattle population. The high-density SNP chip (Illumina High-Density Bovine BeadChip, 777k) was used to genotype the animals. The SNP markers effects and their variances were estimated using the single-step genome wide association method. The (co)variance components were estimated by Bayesian inference. The chromosome segments that are responsible for more than 1.0% of additive genetic variance were selected to explore and determine possible quantitative trait loci. The bovine genome Map Viewer was used to identify genes. In total, 51 genomic regions were identified for all analyzed traits. The heritability estimated for feed efficiency was low magnitude (0.13±0.06). For average daily gain, dry matter intake and residual feed intake, heritability was moderate to high (0.43±0.05; 0.47±0.05, 0.18±0.05, respectively). A total of 8, 17, 14 and 12 windows that are responsible for more than 1% of the additive genetic variance for dry matter intake, average daily gain, feed efficiency and residual feed intake, respectively, were identified. Candidate genes GOLIM4, RFX6, CACNG7, CACNG6, CAPN8, CAPN2, AKT2, GPRC6A, and GPR45 were associated with feed efficiency traits. It was expected that the response to selection would be higher for residual feed intake than for feed efficiency. Genomic regions harboring possible QTL for feed efficiency indicator traits were identified. Candidate genes identified are involved in energy use, metabolism protein, ion transport, transmembrane transport, the olfactory system, the immune system, secretion and cellular activity. The identification of these regions and their respective candidate genes should contribute to the formation of a genetic basis in Nellore cattle for feed efficiency indicator traits, and these results would support

  18. Augmenting Chinese hamster genome assembly by identifying regions of high confidence.

    PubMed

    Vishwanathan, Nandita; Bandyopadhyay, Arpan A; Fu, Hsu-Yuan; Sharma, Mohit; Johnson, Kathryn C; Mudge, Joann; Ramaraj, Thiruvarangan; Onsongo, Getiria; Silverstein, Kevin A T; Jacob, Nitya M; Le, Huong; Karypis, George; Hu, Wei-Shou

    2016-09-01

    Chinese hamster Ovary (CHO) cell lines are the dominant industrial workhorses for therapeutic recombinant protein production. The availability of genome sequence of Chinese hamster and CHO cells will spur further genome and RNA sequencing of producing cell lines. However, the mammalian genomes assembled using shot-gun sequencing data still contain regions of uncertain quality due to assembly errors. Identifying high confidence regions in the assembled genome will facilitate its use for cell engineering and genome engineering. We assembled two independent drafts of Chinese hamster genome by de novo assembly from shotgun sequencing reads and by re-scaffolding and gap-filling the draft genome from NCBI for improved scaffold lengths and gap fractions. We then used the two independent assemblies to identify high confidence regions using two different approaches. First, the two independent assemblies were compared at the sequence level to identify their consensus regions as "high confidence regions" which accounts for at least 78 % of the assembled genome. Further, a genome wide comparison of the Chinese hamster scaffolds with mouse chromosomes revealed scaffolds with large blocks of collinearity, which were also compiled as high-quality scaffolds. Genome scale collinearity was complemented with EST based synteny which also revealed conserved gene order compared to mouse. As cell line sequencing becomes more commonly practiced, the approaches reported here are useful for assessing the quality of assembly and potentially facilitate the engineering of cell lines. PMID:27374913

  19. Addition of a breeding database in the Genome Database for Rosaceae.

    PubMed

    Evans, Kate; Jung, Sook; Lee, Taein; Brutcher, Lisa; Cho, Ilhyung; Peace, Cameron; Main, Dorrie

    2013-01-01

    Breeding programs produce large datasets that require efficient management systems to keep track of performance, pedigree, geographical and image-based data. With the development of DNA-based screening technologies, more breeding programs perform genotyping in addition to phenotyping for performance evaluation. The integration of breeding data with other genomic and genetic data is instrumental for the refinement of marker-assisted breeding tools, enhances genetic understanding of important crop traits and maximizes access and utility by crop breeders and allied scientists. Development of new infrastructure in the Genome Database for Rosaceae (GDR) was designed and implemented to enable secure and efficient storage, management and analysis of large datasets from the Washington State University apple breeding program and subsequently expanded to fit datasets from other Rosaceae breeders. The infrastructure was built using the software Chado and Drupal, making use of the Natural Diversity module to accommodate large-scale phenotypic and genotypic data. Breeders can search accessions within the GDR to identify individuals with specific trait combinations. Results from Search by Parentage lists individuals with parents in common and results from Individual Variety pages link to all data available on each chosen individual including pedigree, phenotypic and genotypic information. Genotypic data are searchable by markers and alleles; results are linked to other pages in the GDR to enable the user to access tools such as GBrowse and CMap. This breeding database provides users with the opportunity to search datasets in a fully targeted manner and retrieve and compare performance data from multiple selections, years and sites, and to output the data needed for variety release publications and patent applications. The breeding database facilitates efficient program management. Storing publicly available breeding data in a database together with genomic and genetic data will

  20. Addition of a breeding database in the Genome Database for Rosaceae.

    PubMed

    Evans, Kate; Jung, Sook; Lee, Taein; Brutcher, Lisa; Cho, Ilhyung; Peace, Cameron; Main, Dorrie

    2013-01-01

    Breeding programs produce large datasets that require efficient management systems to keep track of performance, pedigree, geographical and image-based data. With the development of DNA-based screening technologies, more breeding programs perform genotyping in addition to phenotyping for performance evaluation. The integration of breeding data with other genomic and genetic data is instrumental for the refinement of marker-assisted breeding tools, enhances genetic understanding of important crop traits and maximizes access and utility by crop breeders and allied scientists. Development of new infrastructure in the Genome Database for Rosaceae (GDR) was designed and implemented to enable secure and efficient storage, management and analysis of large datasets from the Washington State University apple breeding program and subsequently expanded to fit datasets from other Rosaceae breeders. The infrastructure was built using the software Chado and Drupal, making use of the Natural Diversity module to accommodate large-scale phenotypic and genotypic data. Breeders can search accessions within the GDR to identify individuals with specific trait combinations. Results from Search by Parentage lists individuals with parents in common and results from Individual Variety pages link to all data available on each chosen individual including pedigree, phenotypic and genotypic information. Genotypic data are searchable by markers and alleles; results are linked to other pages in the GDR to enable the user to access tools such as GBrowse and CMap. This breeding database provides users with the opportunity to search datasets in a fully targeted manner and retrieve and compare performance data from multiple selections, years and sites, and to output the data needed for variety release publications and patent applications. The breeding database facilitates efficient program management. Storing publicly available breeding data in a database together with genomic and genetic data will

  1. Comprehensive Repertoire of Foldable Regions within Whole Genomes

    PubMed Central

    Faure, Guilhem; Callebaut, Isabelle

    2013-01-01

    In order to get a comprehensive repertoire of foldable domains within whole proteomes, including orphan domains, we developed a novel procedure, called SEG-HCA. From only the information of a single amino acid sequence, SEG-HCA automatically delineates segments possessing high densities in hydrophobic clusters, as defined by Hydrophobic Cluster Analysis (HCA). These hydrophobic clusters mainly correspond to regular secondary structures, which together form structured or foldable regions. Genome-wide analyses revealed that SEG-HCA is opposite of disorder predictors, both addressing distinct structural states. Interestingly, there is however an overlap between the two predictions, including small segments of disordered sequences, which undergo coupled folding and binding. SEG-HCA thus gives access to these specific domains, which are generally poorly represented in domain databases. Comparison of the whole set of SEG-HCA predictions with the Conserved Domain Database (CDD) also highlighted a wide proportion of predicted large (length >50 amino acids) segments, which are CDD orphan. These orphan sequences may either correspond to highly divergent members of already known families or belong to new families of domains. Their comprehensive description thus opens new avenues to investigate new functional and/or structural features, which remained so far uncovered. Altogether, the data described here provide new insights into the protein architecture and organization throughout the three kingdoms of life. PMID:24204229

  2. Genome assemblies for 11 Yersinia pestis strains isolated in the Caucasus region

    SciTech Connect

    Zhgenti, Ekaterine; Johnson, Shannon L.; Davenport, Karen W.; Chanturia, Gvantsa; Daligault, Hajnalka E.; Chain, Patrick S.; Nikolich, Mikeljon P.

    2015-09-17

    Yersinia pestis, the causative agent of plague, is endemic to the Caucasus region but few reference strain genome sequences from that region are available. We present the improved draft or finished assembled genomes from 11 strains isolated in the nation of Georgia and surrounding countries.

  3. Genome assemblies for 11 Yersinia pestis strains isolated in the Caucasus region

    DOE PAGES

    Zhgenti, Ekaterine; Johnson, Shannon L.; Davenport, Karen W.; Chanturia, Gvantsa; Daligault, Hajnalka E.; Chain, Patrick S.; Nikolich, Mikeljon P.

    2015-09-17

    Yersinia pestis, the causative agent of plague, is endemic to the Caucasus region but few reference strain genome sequences from that region are available. We present the improved draft or finished assembled genomes from 11 strains isolated in the nation of Georgia and surrounding countries.

  4. Ecological effects of cell-level processes: genome size, functional traits and regional abundance of herbaceous plant species

    PubMed Central

    Herben, Tomáš; Suda, Jan; Klimešová, Jitka; Mihulka, Stanislav; Říha, Pavel; Šímová, Irena

    2012-01-01

    Background and Aims Genome size is known to be correlated with a number of phenotypic traits associated with cell sizes and cell-division rates. Genome size was therefore used as a proxy for them in order to assess how common plant traits such as height, specific leaf area and seed size/number predict species regional abundance. In this study it is hypothesized that if there is residual correlation between genome size and abundance after these traits are partialled out, there must be additional ecological effects of cell size and/or cell-division rate. Methods Variation in genome size, plant traits and regional abundance were examined in 436 herbaceous species of central European flora, and relationships were sought for among these variables by correlation and path analysis. Key Results Species regional abundance was weakly but significantly correlated with genome size; the relationship was stronger for annuals (R2 = 0·145) than for perennials (R2 = 0·027). In annuals, genome size was linked to abundance via its effect on seed size, which constrains seed number and hence population growth rate. In perennials, it weakly affected (via height and specific leaf area) competitive ability. These relationships did not change qualitatively after phylogenetic correction. In both annuals and perennials there was an unresolved effect of genome size on abundance. Conclusions The findings indicate that additional predictors of regional abundance should be sought among variables that are linked to cell size and cell-division rate. Signals of these cell-level processes remain identifiable even at the landscape scale, and show deep differences between perennials and annuals. Plant population biology could thus possibly benefit from more systematic use of indicators of cell-level processes. PMID:22628380

  5. Discovery and verification of functional single nucleotide polymorphisms in regulatory genomic regions: Current and developing technologies

    PubMed Central

    Chorley, Brian N.; Wang, Xuting; Campbell, Michelle R.; Pittman, Gary S.; Noureddine, Maher A.; Bell, Douglas A.

    2008-01-01

    The most common form of genetic variation, single nucleotide polymorphisms or SNPs, can affect the way an individual responds to the environment and modify disease risk. Although most of the millions of SNPs have little or no effect on gene regulation and protein activity, there are many circumstances where base changes can have deleterious effects. Non-synonymous SNPs that result in amino acid changes in proteins have been studied because of their obvious impact on protein activity. It is well known that SNPs within regulatory regions of the genome can result in disregulation of gene transcription. However, the impact of SNPs located in putative regulatory regions, or rSNPs, is harder to predict for two primary reasons. First, the mechanistic roles of non-coding genomic sequence remain poorly defined. Second, experimental validation of the functional consequences of rSNPs is often slow and laborious. In this review, we summarize traditional and novel methodologies for candidate rSNPs selection, in particular in silico techniques that aid in candidate rSNP selection. Additionally we will discuss molecular biological techniques that assess the impact of rSNPs on binding of regulatory machinery, as well as functional consequences on transcription. Standard techniques such as EMSA and luciferase reporter constructs are still widely used to assess effects of rSNPs on binding and gene transcription; however, these protocols are often bottlenecks in the discovery process. Therefore, we highlight novel and developing high-throughput protocols that promise to aid in shortening the process of rSNP validation. Given the large amount of genomic information generated from a multitude of re-sequencing and genome-wide SNP array efforts, future focus should be to develop validation techniques that will allow greater understanding of the impact these polymorphisms have on human health and disease. PMID:18565787

  6. Mixed model methods for genomic prediction and variance component estimation of additive and dominance effects using SNP markers.

    PubMed

    Da, Yang; Wang, Chunkao; Wang, Shengwen; Hu, Guo

    2014-01-01

    We established a genomic model of quantitative trait with genomic additive and dominance relationships that parallels the traditional quantitative genetics model, which partitions a genotypic value as breeding value plus dominance deviation and calculates additive and dominance relationships using pedigree information. Based on this genomic model, two sets of computationally complementary but mathematically identical mixed model methods were developed for genomic best linear unbiased prediction (GBLUP) and genomic restricted maximum likelihood estimation (GREML) of additive and dominance effects using SNP markers. These two sets are referred to as the CE and QM sets, where the CE set was designed for large numbers of markers and the QM set was designed for large numbers of individuals. GBLUP and associated accuracy formulations for individuals in training and validation data sets were derived for breeding values, dominance deviations and genotypic values. Simulation study showed that GREML and GBLUP generally were able to capture small additive and dominance effects that each accounted for 0.00005-0.0003 of the phenotypic variance and GREML was able to differentiate true additive and dominance heritability levels. GBLUP of the total genetic value as the summation of additive and dominance effects had higher prediction accuracy than either additive or dominance GBLUP, causal variants had the highest accuracy of GREML and GBLUP, and predicted accuracies were in agreement with observed accuracies. Genomic additive and dominance relationship matrices using SNP markers were consistent with theoretical expectations. The GREML and GBLUP methods can be an effective tool for assessing the type and magnitude of genetic effects affecting a phenotype and for predicting the total genetic value at the whole genome level.

  7. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep

    PubMed Central

    Mousel, Michelle R.; Reynolds, James O.; White, Stephen N.

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10-5) were identified including markers in or near PIK3CB (P = 2.22x10-6; additive model), KCNB1 (P = 2.93x10-6; dominance model), ZC3H12C (P = 3.25x10-6; genotypic model), JPH1 (P = 4.68x20-6; genotypic model), and MYO3B (P = 5.74x10-6; recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection. PMID:26098909

  8. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep.

    PubMed

    Mousel, Michelle R; Reynolds, James O; White, Stephen N

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10(-5)) were identified including markers in or near PIK3CB (P = 2.22x10(-6); additive model), KCNB1 (P = 2.93x10(-6); dominance model), ZC3H12C (P = 3.25x10(-6); genotypic model), JPH1 (P = 4.68x20(-6); genotypic model), and MYO3B (P = 5.74x10(-6); recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection.

  9. Identification of Low-Confidence Regions in the Pig Reference Genome (Sscrofa10.2).

    PubMed

    Warr, Amanda; Robert, Christelle; Hume, David; Archibald, Alan L; Deeb, Nader; Watson, Mick

    2015-01-01

    Many applications of high throughput sequencing rely on the availability of an accurate reference genome. Variant calling often produces large data sets that cannot be realistically validated and which may contain large numbers of false-positives. Errors in the reference assembly increase the number of false-positives. While resources are available to aid in the filtering of variants from human data, for other species these do not yet exist and strict filtering techniques must be employed which are more likely to exclude true-positives. This work assesses the accuracy of the pig reference genome (Sscrofa10.2) using whole genome sequencing reads from the Duroc sow whose genome the assembly was based on. Indicators of structural variation including high regional coverage, unexpected insert sizes, improper pairing and homozygous variants were used to identify low quality (LQ) regions of the assembly. Low coverage (LC) regions were also identified and analyzed separately. The LQ regions covered 13.85% of the genome, the LC regions covered 26.6% of the genome and combined (LQLC) they covered 33.07% of the genome. Over half of dbSNP variants were located in the LQLC regions. Of copy number variable regions identified in a previous study, 86.3% were located in the LQLC regions. The regions were also enriched for gene predictions from RNA-seq data with 42.98% falling in the LQLC regions. Excluding variants in the LQ, LC, or LQLC from future analyses will help reduce the number of false-positive variant calls. Researchers using WGS data should be aware that the current pig reference genome does not give an accurate representation of the copy number of alleles in the original Duroc sow's genome. PMID:26640477

  10. Identification of Low-Confidence Regions in the Pig Reference Genome (Sscrofa10.2)

    PubMed Central

    Warr, Amanda; Robert, Christelle; Hume, David; Archibald, Alan L.; Deeb, Nader; Watson, Mick

    2015-01-01

    Many applications of high throughput sequencing rely on the availability of an accurate reference genome. Variant calling often produces large data sets that cannot be realistically validated and which may contain large numbers of false-positives. Errors in the reference assembly increase the number of false-positives. While resources are available to aid in the filtering of variants from human data, for other species these do not yet exist and strict filtering techniques must be employed which are more likely to exclude true-positives. This work assesses the accuracy of the pig reference genome (Sscrofa10.2) using whole genome sequencing reads from the Duroc sow whose genome the assembly was based on. Indicators of structural variation including high regional coverage, unexpected insert sizes, improper pairing and homozygous variants were used to identify low quality (LQ) regions of the assembly. Low coverage (LC) regions were also identified and analyzed separately. The LQ regions covered 13.85% of the genome, the LC regions covered 26.6% of the genome and combined (LQLC) they covered 33.07% of the genome. Over half of dbSNP variants were located in the LQLC regions. Of copy number variable regions identified in a previous study, 86.3% were located in the LQLC regions. The regions were also enriched for gene predictions from RNA-seq data with 42.98% falling in the LQLC regions. Excluding variants in the LQ, LC, or LQLC from future analyses will help reduce the number of false-positive variant calls. Researchers using WGS data should be aware that the current pig reference genome does not give an accurate representation of the copy number of alleles in the original Duroc sow’s genome. PMID:26640477

  11. Functional annotation of HOT regions in the human genome: implications for human disease and cancer.

    PubMed

    Li, Hao; Chen, Hebing; Liu, Feng; Ren, Chao; Wang, Shengqi; Bo, Xiaochen; Shu, Wenjie

    2015-06-26

    Advances in genome-wide association studies (GWAS) and large-scale sequencing studies have resulted in an impressive and growing list of disease- and trait-associated genetic variants. Most studies have emphasised the discovery of genetic variation in coding sequences, however, the noncoding regulatory effects responsible for human disease and cancer biology have been substantially understudied. To better characterise the cis-regulatory effects of noncoding variation, we performed a comprehensive analysis of the genetic variants in HOT (high-occupancy target) regions, which are considered to be one of the most intriguing findings of recent large-scale sequencing studies. We observed that GWAS variants that map to HOT regions undergo a substantial net decrease and illustrate development-specific localisation during haematopoiesis. Additionally, genetic risk variants are disproportionally enriched in HOT regions compared with LOT (low-occupancy target) regions in both disease-relevant and cancer cells. Importantly, this enrichment is biased toward disease- or cancer-specific cell types. Furthermore, we observed that cancer cells generally acquire cancer-specific HOT regions at oncogenes through diverse mechanisms of cancer pathogenesis. Collectively, our findings demonstrate the key roles of HOT regions in human disease and cancer and represent a critical step toward further understanding disease biology, diagnosis, and therapy.

  12. In the QTL region surrounding porcine MHC, gene order is conserved with human genome.

    PubMed

    Genêt, C; Renard, C; Cabau, C; Rogel-Gaillard, C; Gellin, J; Milan, D

    2001-03-01

    On the porcine genome, the region surrounding the Major Histocompatibility Complex, also called Swine Leukocyte Antigens (SLA), is of particular interest not only owing to itq role in the control of immune response, but also because of its influence on many traits such as growth, fatness, and meat quality. To help in the identification of responsible genes, detailed comparative maps of the MHC region in mammalian species and powerful mapping tools allowing accurate ordering of genes and markers in this region are needed. In this report, we describe the use of the recently developed IMpRH radiation hybrid panel, to construct a higher density radiation hybrid map of swine Sscr 7p-q12, containing 23 additional loci. Our results show that the gene order is conserved between the two MHC-containing regions, even if an inversion is observed above the QTL region in the region containing DEK, SCA1, and EDN1 genes. The framework map produced shows that the IMpRH panel permits the ordering of genes and markers in the three MHC classes and would thus allow accurate localization of ESTs and candidate genes. PMID:11252175

  13. Breaking Good: Accounting for Fragility of Genomic Regions in Rearrangement Distance Estimation.

    PubMed

    Biller, Priscila; Guéguen, Laurent; Knibbe, Carole; Tannier, Eric

    2016-01-01

    Models of evolution by genome rearrangements are prone to two types of flaws: One is to ignore the diversity of susceptibility to breakage across genomic regions, and the other is to suppose that susceptibility values are given. Without necessarily supposing their precise localization, we call "solid" the regions that are improbably broken by rearrangements and "fragile" the regions outside solid ones. We propose a model of evolution by inversions where breakage probabilities vary across fragile regions and over time. It contains as a particular case the uniform breakage model on the nucleotidic sequence, where breakage probabilities are proportional to fragile region lengths. This is very different from the frequently used pseudouniform model where all fragile regions have the same probability to break. Estimations of rearrangement distances based on the pseudouniform model completely fail on simulations with the truly uniform model. On pairs of amniote genomes, we show that identifying coding genes with solid regions yields incoherent distance estimations, especially with the pseudouniform model, and to a lesser extent with the truly uniform model. This incoherence is solved when we coestimate the number of fragile regions with the rearrangement distance. The estimated number of fragile regions is surprisingly small, suggesting that a minority of regions are recurrently used by rearrangements. Estimations for several pairs of genomes at different divergence times are in agreement with a slowly evolvable colocalization of active genomic regions in the cell.

  14. Breaking Good: Accounting for Fragility of Genomic Regions in Rearrangement Distance Estimation

    PubMed Central

    Biller, Priscila; Guéguen, Laurent; Knibbe, Carole; Tannier, Eric

    2016-01-01

    Models of evolution by genome rearrangements are prone to two types of flaws: One is to ignore the diversity of susceptibility to breakage across genomic regions, and the other is to suppose that susceptibility values are given. Without necessarily supposing their precise localization, we call “solid” the regions that are improbably broken by rearrangements and “fragile” the regions outside solid ones. We propose a model of evolution by inversions where breakage probabilities vary across fragile regions and over time. It contains as a particular case the uniform breakage model on the nucleotidic sequence, where breakage probabilities are proportional to fragile region lengths. This is very different from the frequently used pseudouniform model where all fragile regions have the same probability to break. Estimations of rearrangement distances based on the pseudouniform model completely fail on simulations with the truly uniform model. On pairs of amniote genomes, we show that identifying coding genes with solid regions yields incoherent distance estimations, especially with the pseudouniform model, and to a lesser extent with the truly uniform model. This incoherence is solved when we coestimate the number of fragile regions with the rearrangement distance. The estimated number of fragile regions is surprisingly small, suggesting that a minority of regions are recurrently used by rearrangements. Estimations for several pairs of genomes at different divergence times are in agreement with a slowly evolvable colocalization of active genomic regions in the cell. PMID:27190002

  15. Identification and characterization of conserved and variable regions of lime witches' broom phytoplasma genome.

    PubMed

    Siampour, Majid; Izadpanah, Keramatollah; Marzachi, Cristina; Salehi Abarkoohi, Mohammad

    2015-09-01

    Several segments (∼20  kbp) of the lime witches' broom (LWB) phytoplasma genome (16SrII group) were sequenced and analysed. A 5.7  kbp segment (LWB-C) included conserved genes whose phylogenetic tree was consistent with that generated using 16S rRNA genes. Another 6.4  kbp LWB phytoplasma genome segment (LWB-NC) was structurally similar to the putative mobile unit or sequence variable mosaic genomic region of phytoplasmas, although it represented a new arrangement of genes or pseudogenes such as phage-related protein genes and tra5 insertion sequences. Sequence- and phylogenetic-based evidence suggested that LWB-NC is a genomic region which includes horizontally transferred genes and could be regarded as a hot region to incorporate more foreign genes into the genome of LWB phytoplasma. The presence of phylogenetically related fragments of retroelements was also verified in the LWB phytoplasma genome. Putative intragenomic retrotransposition or retrohoming of these elements might have been determinant in shaping and manipulating the LWB phytoplasma genome. Altogether, the results of this study suggested that the genome of LWB phytoplasma is colonized by a variety of genes that have been acquired through horizontal gene transfer events, which may have further affected the genome through intragenomic mobility and insertion at cognate or incognate sites. Some of these genes are expected to have been involved in the development of features specific to LWB phytoplasma.

  16. Comparative analyses of multi-species sequences from targeted genomic regions.

    PubMed

    Thomas, J W; Touchman, J W; Blakesley, R W; Bouffard, G G; Beckstrom-Sternberg, S M; Margulies, E H; Blanchette, M; Siepel, A C; Thomas, P J; McDowell, J C; Maskeri, B; Hansen, N F; Schwartz, M S; Weber, R J; Kent, W J; Karolchik, D; Bruen, T C; Bevan, R; Cutler, D J; Schwartz, S; Elnitski, L; Idol, J R; Prasad, A B; Lee-Lin, S-Q; Maduro, V V B; Summers, T J; Portnoy, M E; Dietrich, N L; Akhter, N; Ayele, K; Benjamin, B; Cariaga, K; Brinkley, C P; Brooks, S Y; Granite, S; Guan, X; Gupta, J; Haghighi, P; Ho, S-L; Huang, M C; Karlins, E; Laric, P L; Legaspi, R; Lim, M J; Maduro, Q L; Masiello, C A; Mastrian, S D; McCloskey, J C; Pearson, R; Stantripop, S; Tiongson, E E; Tran, J T; Tsurgeon, C; Vogt, J L; Walker, M A; Wetherby, K D; Wiggins, L S; Young, A C; Zhang, L-H; Osoegawa, K; Zhu, B; Zhao, B; Shu, C L; De Jong, P J; Lawrence, C E; Smit, A F; Chakravarti, A; Haussler, D; Green, P; Miller, W; Green, E D

    2003-08-14

    The systematic comparison of genomic sequences from different organisms represents a central focus of contemporary genome analysis. Comparative analyses of vertebrate sequences can identify coding and conserved non-coding regions, including regulatory elements, and provide insight into the forces that have rendered modern-day genomes. As a complement to whole-genome sequencing efforts, we are sequencing and comparing targeted genomic regions in multiple, evolutionarily diverse vertebrates. Here we report the generation and analysis of over 12 megabases (Mb) of sequence from 12 species, all derived from the genomic region orthologous to a segment of about 1.8 Mb on human chromosome 7 containing ten genes, including the gene mutated in cystic fibrosis. These sequences show conservation reflecting both functional constraints and the neutral mutational events that shaped this genomic region. In particular, we identify substantial numbers of conserved non-coding segments beyond those previously identified experimentally, most of which are not detectable by pair-wise sequence comparisons alone. Analysis of transposable element insertions highlights the variation in genome dynamics among these species and confirms the placement of rodents as a sister group to the primates.

  17. Detailed Analysis of a Contiguous 22-Mb Region of the Maize Genome

    PubMed Central

    Wei, Fusheng; Stein, Joshua C.; Liang, Chengzhi; Zhang, Jianwei; Fulton, Robert S.; Baucom, Regina S.; De Paoli, Emanuele; Zhou, Shiguo; Yang, Lixing; Han, Yujun; Pasternak, Shiran; Narechania, Apurva; Zhang, Lifang; Yeh, Cheng-Ting; Ying, Kai; Nagel, Dawn H.; Collura, Kristi; Kudrna, David; Currie, Jennifer; Lin, Jinke; Kim, HyeRan; Angelova, Angelina; Scara, Gabriel; Wissotski, Marina; Golser, Wolfgang; Courtney, Laura; Kruchowski, Scott; Graves, Tina A.; Rock, Susan M.; Adams, Stephanie; Fulton, Lucinda A.; Fronick, Catrina; Courtney, William; Kramer, Melissa; Spiegel, Lori; Nascimento, Lydia; Kalyanaraman, Ananth; Chaparro, Cristian; Deragon, Jean-Marc; Miguel, Phillip San; Jiang, Ning; Wessler, Susan R.; Green, Pamela J.; Yu, Yeisoo; Schwartz, David C.; Meyers, Blake C.; Bennetzen, Jeffrey L.; Martienssen, Robert A.; McCombie, W. Richard; Aluru, Srinivas; Clifton, Sandra W.; Schnable, Patrick S.; Ware, Doreen; Wilson, Richard K.; Wing, Rod A.

    2009-01-01

    Most of our understanding of plant genome structure and evolution has come from the careful annotation of small (e.g., 100 kb) sequenced genomic regions or from automated annotation of complete genome sequences. Here, we sequenced and carefully annotated a contiguous 22 Mb region of maize chromosome 4 using an improved pseudomolecule for annotation. The sequence segment was comprehensively ordered, oriented, and confirmed using the maize optical map. Nearly 84% of the sequence is composed of transposable elements (TEs) that are mostly nested within each other, of which most families are low-copy. We identified 544 gene models using multiple levels of evidence, as well as five miRNA genes. Gene fragments, many captured by TEs, are prevalent within this region. Elimination of gene redundancy from a tetraploid maize ancestor that originated a few million years ago is responsible in this region for most disruptions of synteny with sorghum and rice. Consistent with other sub-genomic analyses in maize, small RNA mapping showed that many small RNAs match TEs and that most TEs match small RNAs. These results, performed on ∼1% of the maize genome, demonstrate the feasibility of refining the B73 RefGen_v1 genome assembly by incorporating optical map, high-resolution genetic map, and comparative genomic data sets. Such improvements, along with those of gene and repeat annotation, will serve to promote future functional genomic and phylogenomic research in maize and other grasses. PMID:19936048

  18. Construction of a genomic library of the human cytomegalovirus genome and analysis of late transcription of its inverted internal repeat region

    SciTech Connect

    Silva, K.F.S.T.

    1989-01-01

    The investigations described in this dissertation were designed to determine the transcriptionally active DNA sequences of IIR region and to identify the viral mRNA transcribed from the transcriptionally most active DNA sequences of that region during late phase of HCMV Towne infection. Preliminary transcriptional studies which included the hybridization of a southern blot of XbaI digested entire HCMV genome to {sup 32}P-labelled late phase infected cell A{sup +} RNA, indicated that late viral transcripts homologous to XbaI Q fragment of IIR region were very highly abundant while XbaI Q fragment showed a very low transcriptional activity. To facilitate further analysis of late transcription of IIR region, the entire DNA sequences of IIR region were molecularly cloned as U, S, and H BamHI fragments in pACYC-184 plasmid vector. In addition, to be used in future studies on other regions of the genome, except for y and c{prime} smaller fragments the entire 240 kb HCMV genome was cloned as BamHI fragments in the same vector. Furthermore, the U, S, and H BamHI fragments were mapped with six other restriction enzymes in order to use that mapping data in subsequent transcriptional analysis of the IIR region. Further localization of transcriptionally active DNA sequences within IIR region was achieved by hybridization of southern blots of restricted U, S, and H BamHI fragments with 3{prime} {sup 32}P-labelled infected cell late A{sup +} RNA. The 1.5 kb EcooRI subfragments of S BamHI fragment and the adjoining 0.72 kb XhoI subfragment of H BamHI fragment revealed the highest level of transcription, although the remainder of the S fragment was also transcribed at a substantial level. The U fragment and the remainder of the H fragment was transcribed at a very low level.

  19. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    PubMed

    Wang, Yu; Li, Wei; Xia, Yingying; Wang, Chongzhi; Tang, Y Tom; Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2014-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information. PMID:25919136

  20. Representational difference analysis, high-resolution physical mapping, and transcript identification of the zebrafish genomic region for a motor behavior.

    PubMed

    Sato, Tomomi; Mishina, Masayoshi

    2003-08-01

    Zebrafish is one of the best model organisms for investigating gene functions in vertebrates. By 4,5',8-trimethylpsoralen mutagenesis, we isolated a zebrafish mutant, vibrato, with defects in the spontaneous contraction and touch response. Whole genome subtraction between the wild-type and the mutant genomes by representational difference analysis yielded polymorphic markers tightly linked to the vibrato locus. Using these markers, we constructed a high-resolution physical map and localized the vibrato locus within a genomic region of 720 kb. Direct cDNA selection with the contig led to the identification of a novel gene, solo, encoding a protein with SEC14 and spectrin repeat domains. These domains of Solo shared significant amino acid sequence identities with those of mammalian Trio and Karilin. In addition, we found the zebrafish orthologs for mammalian TTN, COL5A2, and CED-6 in the vibrato region. Mapping of these genes localized human chromosomal regions possibly involved in motor disorders. Our results suggest that representational difference analysis provides an efficient way to isolate mutated genomic regions in zebrafish. PMID:12837271

  1. Genome-wide meta-analysis of maize heterosis reveals the potential role of additive gene expression at pericentromeric loci

    PubMed Central

    2014-01-01

    Background The identification of QTL involved in heterosis formation is one approach to unravel the not yet fully understood genetic basis of heterosis - the improved agronomic performance of hybrid F1 plants compared to their inbred parents. The identification of candidate genes underlying a QTL is important both for developing markers and determining the molecular genetic basis of a trait, but remains difficult owing to the large number of genes often contained within individual QTL. To address this problem in heterosis analysis, we applied a meta-analysis strategy for grain yield (GY) of Zea mays L. as example, incorporating QTL-, hybrid field-, and parental gene expression data. Results For the identification of genes underlying known heterotic QTL, we made use of tight associations between gene expression pattern and the trait of interest, identified by correlation analyses. Using this approach genes strongly associated with heterosis for GY were discovered to be clustered in pericentromeric regions of the complex maize genome. This suggests that expression differences of sequences in recombination-suppressed regions are important in the establishment of heterosis for GY in F1 hybrids and also in the conservation of heterosis for GY across genotypes. Importantly functional analysis of heterosis-associated genes from these genomic regions revealed over-representation of a number of functional classes, identifying key processes contributing to heterosis for GY. Based on the finding that the majority of the analyzed heterosis-associated genes were addtitively expressed, we propose a model referring to the influence of cis-regulatory variation on heterosis for GY by the compensation of fixed detrimental expression levels in parents. Conclusions The study highlights the utility of a meta-analysis approach that integrates phenotypic and multi-level molecular data to unravel complex traits in plants. It provides prospects for the identification of genes relevant for

  2. Genomics of the hop psuedo-autosomal regions

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Hop is one of the few crop species with female and male plants with sex being determined by either XX or XY chromosomes. Hop cones are only produced in female hops with or without fertilization. This has lead to most genomic research being directed toward female plants. Very little work has been don...

  3. Characterization of the flamenco region of the Drosophila melanogaster genome.

    PubMed

    Robert, V; Prud'homme, N; Kim, A; Bucheton, A; Pélisson, A

    2001-06-01

    The flamenco gene, located at 20A1-3 in the beta-heterochromatin of the Drosophila X chromosome, is a major regulator of the gypsy/mdg4 endogenous retrovirus. As a first step to characterize this gene, approximately 100 kb of genomic DNA flanking a P-element-induced mutation of flamenco was isolated. This DNA is located in a sequencing gap of the Celera Genomics project, i.e., one of those parts of the genome in which the "shotgun" sequence could not be assembled, probably because it contains long stretches of repetitive DNA, especially on the proximal side of the P insertion point. Deficiency mapping indicated that sequences required for the normal flamenco function are located >130 kb proximal to the insertion site. The distal part of the cloned DNA does, nevertheless, contain several unique sequences, including at least four different transcription units. Dip1, the closest one to the P-element insertion point, might be a good candidate for a gypsy regulator, since it putatively encodes a nuclear protein containing two double-stranded RNA-binding domains. However, transgenes containing dip1 genomic DNA were not able to rescue flamenco mutant flies. The possible nature of the missing flamenco sequences is discussed. PMID:11404334

  4. Whole-genome sequencing reveals small genomic regions of introgression in an introduced crater lake population of threespine stickleback.

    PubMed

    Yoshida, Kohta; Miyagi, Ryutaro; Mori, Seiichi; Takahashi, Aya; Makino, Takashi; Toyoda, Atsushi; Fujiyama, Asao; Kitano, Jun

    2016-04-01

    Invasive species pose a major threat to biological diversity. Although introduced populations often experience population bottlenecks, some invasive species are thought to be originated from hybridization between multiple populations or species, which can contribute to the maintenance of high genetic diversity. Recent advances in genome sequencing enable us to trace the evolutionary history of invasive species even at whole-genome level and may help to identify the history of past hybridization that may be overlooked by traditional marker-based analysis. Here, we conducted whole-genome sequencing of eight threespine stickleback (Gasterosteus aculeatus) individuals, four from a recently introduced crater lake population and four of the putative source population. We found that both populations have several small genomic regions with high genetic diversity, which resulted from introgression from a closely related species (Gasterosteus nipponicus). The sizes of the regions were too small to be detected with traditional marker-based analysis or even some reduced-representation sequencing methods. Further amplicon sequencing revealed linkage disequilibrium around an introgression site, which suggests the possibility of selective sweep at the introgression site. Thus, interspecies introgression might predate introduction and increase genetic variation in the source population. Whole-genome sequencing of even a small number of individuals can therefore provide higher resolution inference of history of introduced populations. PMID:27069575

  5. Whole-genome sequencing reveals small genomic regions of introgression in an introduced crater lake population of threespine stickleback.

    PubMed

    Yoshida, Kohta; Miyagi, Ryutaro; Mori, Seiichi; Takahashi, Aya; Makino, Takashi; Toyoda, Atsushi; Fujiyama, Asao; Kitano, Jun

    2016-04-01

    Invasive species pose a major threat to biological diversity. Although introduced populations often experience population bottlenecks, some invasive species are thought to be originated from hybridization between multiple populations or species, which can contribute to the maintenance of high genetic diversity. Recent advances in genome sequencing enable us to trace the evolutionary history of invasive species even at whole-genome level and may help to identify the history of past hybridization that may be overlooked by traditional marker-based analysis. Here, we conducted whole-genome sequencing of eight threespine stickleback (Gasterosteus aculeatus) individuals, four from a recently introduced crater lake population and four of the putative source population. We found that both populations have several small genomic regions with high genetic diversity, which resulted from introgression from a closely related species (Gasterosteus nipponicus). The sizes of the regions were too small to be detected with traditional marker-based analysis or even some reduced-representation sequencing methods. Further amplicon sequencing revealed linkage disequilibrium around an introgression site, which suggests the possibility of selective sweep at the introgression site. Thus, interspecies introgression might predate introduction and increase genetic variation in the source population. Whole-genome sequencing of even a small number of individuals can therefore provide higher resolution inference of history of introduced populations.

  6. ECRbase: Database of Evolutionary Conserved Regions, Promoters, and Transcription Factor Binding Sites in Vertebrate Genomes

    DOE Data Explorer

    Loots, Gabriela G. [LLNL; Ovcharenko, I. [LLNL

    Evolutionary conservation of DNA sequences provides a tool for the identification of functional elements in genomes. This database of evolutionary conserved regions (ECRs) in vertebrate genomes features a database of syntenic blocks that recapitulate the evolution of rearrangements in vertebrates and a comprehensive collection of promoters in all vertebrate genomes generated using multiple sources of gene annotation. The database also contains a collection of annotated transcription factor binding sites (TFBSs) in evolutionary conserved and promoter elements. ECRbase currently includes human, rhesus macaque, dog, opossum, rat, mouse, chicken, frog, zebrafish, and fugu genomes. (taken from paper in Journal: Bioinformatics, November 7, 2006, pp. 122-124

  7. Whole-genome resequencing of Hanwoo (Korean cattle) and insight into regions of homozygosity

    PubMed Central

    2013-01-01

    Background Hanwoo (Korean cattle), which originated from natural crossbreeding between taurine and zebu cattle, migrated to the Korean peninsula through North China. Hanwoo were raised as draft animals until the 1970s without the introduction of foreign germplasm. Since 1979, Hanwoo has been bred as beef cattle. Genetic variation was analyzed by whole-genome deep resequencing of a Hanwoo bull. The Hanwoo genome was compared to that of two other breeds, Black Angus and Holstein, and genes within regions of homozygosity were investigated to elucidate the genetic and genomic characteristics of Hanwoo. Results The Hanwoo bull genome was sequenced to 45.6-fold coverage using the ABI SOLiD system. In total, 4.7 million single-nucleotide polymorphisms and 0.4 million small indels were identified by comparison with the Btau4.0 reference assembly. Of the total number of SNPs and indels, 58% and 87%, respectively, were novel. The overall genotype concordance between the SNPs and BovineSNP50 BeadChip data was 96.4%. Of 1.6 million genetic differences in Hanwoo, approximately 25,000 non-synonymous SNPs, splice-site variants, and coding indels (NS/SS/Is) were detected in 8,360 genes. Among 1,045 genes containing reliable specific NS/SS/Is in Hanwoo, 109 genes contained more than one novel damaging NS/SS/I. Of the genes containing NS/SS/Is, 610 genes were assigned as trait-associated genes. Moreover, 16, 78, and 51 regions of homozygosity (ROHs) were detected in Hanwoo, Black Angus, and Holstein, respectively. ‘Regulation of actin filament length’ was revealed as a significant gene ontology term and 25 trait-associated genes for meat quality and disease resistance were found in 753 genes that resided in the ROHs of Hanwoo. In Hanwoo, 43 genes were located in common ROHs between whole-genome resequencing and SNP chips in BTA2, 10, and 13 coincided with quantitative trait loci for meat fat traits. In addition, the common ROHs in BTA2 and 16 were in agreement between Hanwoo and

  8. Genome-Wide Profiling of PARP1 Reveals an Interplay with Gene Regulatory Regions and DNA Methylation

    PubMed Central

    Nalabothula, Narasimharao; Al-jumaily, Taha; Eteleeb, Abdallah M.; Flight, Robert M.; Xiaorong, Shao; Moseley, Hunter; Rouchka, Eric C.; Fondufe-Mittendorf, Yvonne N.

    2015-01-01

    Poly (ADP-ribose) polymerase-1 (PARP1) is a nuclear enzyme involved in DNA repair, chromatin remodeling and gene expression. PARP1 interactions with chromatin architectural multi-protein complexes (i.e. nucleosomes) alter chromatin structure resulting in changes in gene expression. Chromatin structure impacts gene regulatory processes including transcription, splicing, DNA repair, replication and recombination. It is important to delineate whether PARP1 randomly associates with nucleosomes or is present at specific nucleosome regions throughout the cell genome. We performed genome-wide association studies in breast cancer cell lines to address these questions. Our studies show that PARP1 associates with epigenetic regulatory elements genome-wide, such as active histone marks, CTCF and DNase hypersensitive sites. Additionally, the binding of PARP1 to chromatin genome-wide is mutually exclusive with DNA methylation pattern suggesting a functional interplay between PARP1 and DNA methylation. Indeed, inhibition of PARylation results in genome-wide changes in DNA methylation patterns. Our results suggest that PARP1 controls the fidelity of gene transcription and marks actively transcribed gene regions by selectively binding to transcriptionally active chromatin. These studies provide a platform for developing our understanding of PARP1’s role in gene regulation. PMID:26305327

  9. Self-Confirmation and Ascertainment of the Candidate Genomic Regions of Complex Trait Loci – A None-Experimental Solution

    PubMed Central

    Wang, Lishi; Jiao, Yan; Wang, Yongjun; Zhang, Mengchen; Gu, Weikuan

    2016-01-01

    Over the past half century, thousands of quantitative trait loci (QTL) have been identified by using animal models and plant populations. However, the none-reliability and imprecision of the genomic regions of these loci have remained the major hurdle for the identification of the causal genes for the correspondent traits. We used a none-experimental strategy of strain number reduction for testing accuracy and ascertainment of the candidate region for QTL. We tested the strategy in over 400 analyses with data from 47 studies. These studies include: 1) studies with recombinant inbred (RI) strains of mice. We first tested two previously mapped QTL with well-defined genomic regions; We then tested additional four studies with known QTL regions; and finally we examined the reliability of QTL in 38 sets of data which are produced from relatively large numbers of RI strains, derived from C57BL/6J (B6) X DBA/2J (D2), known as BXD RI mouse strains; 2) studies with RI strains of rats and plants; and 3) studies using F2 populations in mice, rats and plants. In these cases, our method identified the reliability of mapped QTL and localized the candidate genes into the defined genomic regions. Our data also suggests that LRS score produced by permutation tests does not necessarily confirm the reliability of the QTL. Number of strains are not the reliable indicators for the accuracy of QTL either. Our strategy determines the reliability and accuracy of the genomic region of a QTL without any additional experimental study such as congenic breeding. PMID:27203862

  10. Self-Confirmation and Ascertainment of the Candidate Genomic Regions of Complex Trait Loci - A None-Experimental Solution.

    PubMed

    Wang, Lishi; Jiao, Yan; Wang, Yongjun; Zhang, Mengchen; Gu, Weikuan

    2016-01-01

    Over the past half century, thousands of quantitative trait loci (QTL) have been identified by using animal models and plant populations. However, the none-reliability and imprecision of the genomic regions of these loci have remained the major hurdle for the identification of the causal genes for the correspondent traits. We used a none-experimental strategy of strain number reduction for testing accuracy and ascertainment of the candidate region for QTL. We tested the strategy in over 400 analyses with data from 47 studies. These studies include: 1) studies with recombinant inbred (RI) strains of mice. We first tested two previously mapped QTL with well-defined genomic regions; We then tested additional four studies with known QTL regions; and finally we examined the reliability of QTL in 38 sets of data which are produced from relatively large numbers of RI strains, derived from C57BL/6J (B6) X DBA/2J (D2), known as BXD RI mouse strains; 2) studies with RI strains of rats and plants; and 3) studies using F2 populations in mice, rats and plants. In these cases, our method identified the reliability of mapped QTL and localized the candidate genes into the defined genomic regions. Our data also suggests that LRS score produced by permutation tests does not necessarily confirm the reliability of the QTL. Number of strains are not the reliable indicators for the accuracy of QTL either. Our strategy determines the reliability and accuracy of the genomic region of a QTL without any additional experimental study such as congenic breeding. PMID:27203862

  11. Expressed Peptide Tags: An additional layer of data for genome annotation

    SciTech Connect

    Savidor, Alon; Donahoo, Ryan S; Hurtado-Gonzales, Oscar; Verberkmoes, Nathan C; Shah, Manesh B; Lamour, Kurt H; McDonald, W Hayes

    2006-01-01

    While genome sequencing is becoming ever more routine, genome annotation remains a challenging process. Identification of the coding sequences within the genomic milieu presents a tremendous challenge, especially for eukaryotes with their complex gene architectures. Here we present a method to assist the annotation process through the use of proteomic data and bioinformatics. Mass spectra of digested protein preparations of the organism of interest were acquired and searched against a protein database created by a six frame translation of the genome. The identified peptides were mapped back to the genome, compared to the current annotation, and then categorized as supporting or extending the current genome annotation. We named the classified peptides Expressed Peptide Tags (EPTs). The well annotated bacterium Rhodopseudomonas palustris was used as a control for the method and showed high degree of correlation between EPT mapping and the current annotation, with 86% of the EPTs confirming existing gene calls and less than 1% of the EPTs expanding on the current annotation. The eukaryotic plant pathogens Phytophthora ramorum and Phytophthora sojae, whose genomes have been recently sequenced and are much less well annotated, were also subjected to this method. A series of algorithmic steps were taken to increase the confidence of EPT identification for these organisms, including generation of smaller sub-databases to be searched against, and definition of EPT criteria that accommodates the more complex eukaryotic gene architecture. As expected, the analysis of the Phytophthora species showed less correlation between EPT mapping and their current annotation. While ~77% of Phytophthora EPTs supported the current annotation, a portion of them (7.2% and 12.6% for P. ramorum and P. sojae, respectively) suggested modification to current gene calls or identified novel genes that were missed by the current genome annotation of these organisms.

  12. Analysis of real time PCR amplification efficiencies from three genomic region of dengue virus.

    PubMed

    Odreman-Macchioli, María; Vielma, Silvana; Atchley, Daniel; Comach, Guillermo; Ramirez, Alvaro; Pérez, Saberio; Téllez, Luis; Quintero, Beatriz; Hernández, Erick; Muñoz, Maritza; Mendoza, José

    2013-03-01

    Early diagnosis of dengue virus (DENV) infection represents a key factor in preventing clinical complications attributed to the disease. The aim of this study was to evaluate the amplification efficiencies of an in-house quantitative real time-PCR (qPCR) assay of DENV, using the non-structural conserved genomic region protein-5 (NS5) versus two genomic regions usually employed for virus detection, the capsid/pre-membrane region (C-prM) and the 3'-noncoding region (3'NC). One-hundred sixty seven acute phase serum samples from febrile patients were used for validation purposes. Results showed that the three genomic regions had similar amplification profiles and correlation coefficients (0.987-0.999). When isolated viruses were used, the NS5 region had the highest qPCR efficiencies for the four serotypes (98-100%). Amplification from acute serum samples showed that 41.1% (67/167) were positive for the universal assay by at least two of the selected genomic regions. The agreement rates between NS5/C-prM and NS5/3'NC regions were 56.7% and 97%, respectively. Amplification concordance values between C-prM/NS5 and NS5/3'NC regions showed a weak (kappa = 0.109; CI 95%) and a moderate (kappa = 0.489; CI 95%) efficiencies in amplification, respectively. Serotyping assay using a singleplex NS5-TaqMan format was much more sensitive than the C-prM/SYBR Green I protocol (76%). External evaluation showed a high sensitivity (100%), specificity (78%) and high agreement between the assays. According to the results, the NS5 genomic region provides the best genomic region for optimal detection and typification of DENV in clinical samples. PMID:23781709

  13. Sequencing of 15 622 gene-bearing BACs clarifies the gene-dense regions of the barley genome.

    PubMed

    Muñoz-Amatriaín, María; Lonardi, Stefano; Luo, MingCheng; Madishetty, Kavitha; Svensson, Jan T; Moscou, Matthew J; Wanamaker, Steve; Jiang, Tao; Kleinhofs, Andris; Muehlbauer, Gary J; Wise, Roger P; Stein, Nils; Ma, Yaqin; Rodriguez, Edmundo; Kudrna, Dave; Bhat, Prasanna R; Chao, Shiaoman; Condamine, Pascal; Heinen, Shane; Resnik, Josh; Wing, Rod; Witt, Heather N; Alpert, Matthew; Beccuti, Marco; Bozdag, Serdar; Cordero, Francesca; Mirebrahim, Hamid; Ounit, Rachid; Wu, Yonghui; You, Frank; Zheng, Jie; Simková, Hana; Dolezel, Jaroslav; Grimwood, Jane; Schmutz, Jeremy; Duma, Denisa; Altschmied, Lothar; Blake, Tom; Bregitzer, Phil; Cooper, Laurel; Dilbirligi, Muharrem; Falk, Anders; Feiz, Leila; Graner, Andreas; Gustafson, Perry; Hayes, Patrick M; Lemaux, Peggy; Mammadov, Jafar; Close, Timothy J

    2015-10-01

    Barley (Hordeum vulgare L.) possesses a large and highly repetitive genome of 5.1 Gb that has hindered the development of a complete sequence. In 2012, the International Barley Sequencing Consortium released a resource integrating whole-genome shotgun sequences with a physical and genetic framework. However, because only 6278 bacterial artificial chromosome (BACs) in the physical map were sequenced, fine structure was limited. To gain access to the gene-containing portion of the barley genome at high resolution, we identified and sequenced 15 622 BACs representing the minimal tiling path of 72 052 physical-mapped gene-bearing BACs. This generated ~1.7 Gb of genomic sequence containing an estimated 2/3 of all Morex barley genes. Exploration of these sequenced BACs revealed that although distal ends of chromosomes contain most of the gene-enriched BACs and are characterized by high recombination rates, there are also gene-dense regions with suppressed recombination. We made use of published map-anchored sequence data from Aegilops tauschii to develop a synteny viewer between barley and the ancestor of the wheat D-genome. Except for some notable inversions, there is a high level of collinearity between the two species. The software HarvEST:Barley provides facile access to BAC sequences and their annotations, along with the barley-Ae. tauschii synteny viewer. These BAC sequences constitute a resource to improve the efficiency of marker development, map-based cloning, and comparative genomics in barley and related crops. Additional knowledge about regions of the barley genome that are gene-dense but low recombination is particularly relevant. PMID:26252423

  14. Sequencing of 15 622 gene-bearing BACs clarifies the gene-dense regions of the barley genome.

    PubMed

    Muñoz-Amatriaín, María; Lonardi, Stefano; Luo, MingCheng; Madishetty, Kavitha; Svensson, Jan T; Moscou, Matthew J; Wanamaker, Steve; Jiang, Tao; Kleinhofs, Andris; Muehlbauer, Gary J; Wise, Roger P; Stein, Nils; Ma, Yaqin; Rodriguez, Edmundo; Kudrna, Dave; Bhat, Prasanna R; Chao, Shiaoman; Condamine, Pascal; Heinen, Shane; Resnik, Josh; Wing, Rod; Witt, Heather N; Alpert, Matthew; Beccuti, Marco; Bozdag, Serdar; Cordero, Francesca; Mirebrahim, Hamid; Ounit, Rachid; Wu, Yonghui; You, Frank; Zheng, Jie; Simková, Hana; Dolezel, Jaroslav; Grimwood, Jane; Schmutz, Jeremy; Duma, Denisa; Altschmied, Lothar; Blake, Tom; Bregitzer, Phil; Cooper, Laurel; Dilbirligi, Muharrem; Falk, Anders; Feiz, Leila; Graner, Andreas; Gustafson, Perry; Hayes, Patrick M; Lemaux, Peggy; Mammadov, Jafar; Close, Timothy J

    2015-10-01

    Barley (Hordeum vulgare L.) possesses a large and highly repetitive genome of 5.1 Gb that has hindered the development of a complete sequence. In 2012, the International Barley Sequencing Consortium released a resource integrating whole-genome shotgun sequences with a physical and genetic framework. However, because only 6278 bacterial artificial chromosome (BACs) in the physical map were sequenced, fine structure was limited. To gain access to the gene-containing portion of the barley genome at high resolution, we identified and sequenced 15 622 BACs representing the minimal tiling path of 72 052 physical-mapped gene-bearing BACs. This generated ~1.7 Gb of genomic sequence containing an estimated 2/3 of all Morex barley genes. Exploration of these sequenced BACs revealed that although distal ends of chromosomes contain most of the gene-enriched BACs and are characterized by high recombination rates, there are also gene-dense regions with suppressed recombination. We made use of published map-anchored sequence data from Aegilops tauschii to develop a synteny viewer between barley and the ancestor of the wheat D-genome. Except for some notable inversions, there is a high level of collinearity between the two species. The software HarvEST:Barley provides facile access to BAC sequences and their annotations, along with the barley-Ae. tauschii synteny viewer. These BAC sequences constitute a resource to improve the efficiency of marker development, map-based cloning, and comparative genomics in barley and related crops. Additional knowledge about regions of the barley genome that are gene-dense but low recombination is particularly relevant.

  15. Regions identity between the genome of vertebrates and non-retroviral families of insect viruses

    PubMed Central

    2011-01-01

    Background The scope of our understanding of the evolutionary history between viruses and animals is limited. The fact that the recent availability of many complete insect virus genomes and vertebrate genomes as well as the ability to screen these sequences makes it possible to gain a new perspective insight into the evolutionary interaction between insect viruses and vertebrates. This study is to determine the possibility of existence of sequence identity between the genomes of insect viruses and vertebrates, attempt to explain this phenomenon in term of genetic mobile element, and try to investigate the evolutionary relationship between these short regions of identity among these species. Results Some of studied insect viruses contain variable numbers of short regions of sequence identity to the genomes of vertebrate with nucleotide sequence length from 28 bp to 124 bp. They are found to locate in multiple sites of the vertebrate genomes. The ontology of animal genes with identical regions involves in several processes including chromatin remodeling, regulation of apoptosis, signaling pathway, nerve system development and some enzyme-like catalysis. Phylogenetic analysis reveals that at least some short regions of sequence identity in the genomes of vertebrate are derived the ancestral of insect viruses. Conclusion Short regions of sequence identity were found in the vertebrates and insect viruses. These sequences played an important role not only in the long-term evolution of vertebrates, but also in promotion of insect virus. This typical win-win strategy may come from natural selection. PMID:22073942

  16. Identification of Genomic Regions Required for DNA Replication during Drosophila Embryogenesis

    PubMed Central

    Smith, A. V.; King, J. A.; Orr-Weaver, T. L.

    1993-01-01

    A collection of Drosophila deficiency stocks was examined by bromodeoxyuridine (BrdU) labeling of embryos to analyze the DNA replication patterns in late embryogenesis. This permitted us to screen 34% of the genome for genes that when absent in homozygous deficiencies affect the cell cycle or DNA replication. We found three genomic intervals that when deleted result in cessation of DNA replication in the embryo, 39D2-3;E2-F1, 51E and 75C5-7;F1. Embryos deleted for the 75C5-7;F1 region stop DNA replication at the time in embryogenesis when a G(1) phase is added to the mitotic cell cycle and the larval tissues begin to become polytene. Thus, this interval may contain a gene controlling these cell cycle transitions. DNA replication arrests earlier in embryos homozygous for deletions for the other two regions. Analysis of the effects of deletions in the 39D2-3;E2-F1 region on DNA replication showed that the block to DNA replication correlates with deletion of the histone genes. We were able to identify a single, lethal complementation group in 51E, l(2)51Ec, that is responsible for the cessation of replication observed in this interval. Deficiencies that removed one of the Drosophila cdc2 genes and the cyclin A gene had no effect on replication during embryogenesis. Additionally, our analysis identified a gene, pimples, that is required for the proper completion of mitosis in the post-blastoderm divisions of the embryo. PMID:8293981

  17. Genome-Wide Analyses in Bacteria Show Small-RNA Enrichment for Long and Conserved Intergenic Regions

    PubMed Central

    Tsai, Chen-Hsun; Liao, Rick; Chou, Brendan; Palumbo, Michael

    2014-01-01

    Interest in finding small RNAs (sRNAs) in bacteria has significantly increased in recent years due to their regulatory functions. Development of high-throughput methods and more sophisticated computational algorithms has allowed rapid identification of sRNA candidates in different species. However, given their various sizes (50 to 500 nucleotides [nt]) and their potential genomic locations in the 5′ and 3′ untranslated regions as well as in intergenic regions, identification and validation of true sRNAs have been challenging. In addition, the evolution of bacterial sRNAs across different species continues to be puzzling, given that they can exert similar functions with various sequences and structures. In this study, we analyzed the enrichment patterns of sRNAs in 13 well-annotated bacterial species using existing transcriptome and experimental data. All intergenic regions were analyzed by WU-BLAST to examine conservation levels relative to species within or outside their genus. In total, more than 900 validated bacterial sRNAs and 23,000 intergenic regions were analyzed. The results indicate that sRNAs are enriched in intergenic regions, which are longer and more conserved than the average intergenic regions in the corresponding bacterial genome. We also found that sRNA-coding regions have different conservation levels relative to their flanking regions. This work provides a way to analyze how noncoding RNAs are distributed in bacterial genomes and also shows conserved features of intergenic regions that encode sRNAs. These results also provide insight into the functions of regions surrounding sRNAs and into optimization of RNA search algorithms. PMID:25313390

  18. Identification of Whole Mitochondrial Genomes from Venezuela and Implications on Regional Phylogenies in South America.

    PubMed

    Lee, Esther J; Merriwether, D Andrew

    2015-01-01

    Recent studies have expanded and refined the founding haplogroups of the Americas using whole mitochondrial (mtDNA) genome analysis. In addition to pan-American lineages, specific variants have been identified in a number of studies that show higher frequencies in restricted geographical areas. To further characterize Native American maternal lineages and specifically examine local patterns within South America, we analyzed 12 maternally unrelated Yekuana whole mtDNA genomes from one village (Sharamaña) that include the four major Native American haplogroups A2, B2, C1, and D1. Based on our results, we propose a reconfiguration of one subhaplogroup A2 (A2aa) that is specific to South America and identify other singleton branches across the four haplogroups. Furthermore, we show nucleotide diversity values that increase from north to south for haplogroups C1 and D1. The results from our work add to the growing mitogenomic data that highlight local phylogenies and support the rapid genetic differentiation of South American populations, which has been correlated with the linguistic diversity in the region by previous studies. PMID:26416320

  19. HYBRIDCHECK: software for the rapid detection, visualization and dating of recombinant regions in genome sequence data.

    PubMed

    Ward, Ben J; van Oosterhout, Cock

    2016-03-01

    HYBRIDCHECK is a software package to visualize the recombination signal in large DNA sequence data set, and it can be used to analyse recombination, genetic introgression, hybridization and horizontal gene transfer. It can scan large (multiple kb) contigs and whole-genome sequences of three or more individuals. HYBRIDCHECK is written in the r software for OS X, Linux and Windows operating systems, and it has a simple graphical user interface. In addition, the r code can be readily incorporated in scripts and analysis pipelines. HYBRIDCHECK implements several ABBA-BABA tests and visualizes the effects of hybridization and the resulting mosaic-like genome structure in high-density graphics. The package also reports the following: (i) the breakpoint positions, (ii) the number of mutations in each introgressed block, (iii) the probability that the identified region is not caused by recombination and (iv) the estimated age of each recombination event. The divergence times between the donor and recombinant sequence are calculated using a JC, K80, F81, HKY or GTR correction, and the dating algorithm is exceedingly fast. By estimating the coalescence time of introgressed blocks, it is possible to distinguish between hybridization and incomplete lineage sorting. HYBRIDCHECK is libré software and it and its manual are free to download from http://ward9250.github.io/HybridCheck/. PMID:26394708

  20. Identification of Whole Mitochondrial Genomes from Venezuela and Implications on Regional Phylogenies in South America.

    PubMed

    Lee, Esther J; Merriwether, D Andrew

    2015-01-01

    Recent studies have expanded and refined the founding haplogroups of the Americas using whole mitochondrial (mtDNA) genome analysis. In addition to pan-American lineages, specific variants have been identified in a number of studies that show higher frequencies in restricted geographical areas. To further characterize Native American maternal lineages and specifically examine local patterns within South America, we analyzed 12 maternally unrelated Yekuana whole mtDNA genomes from one village (Sharamaña) that include the four major Native American haplogroups A2, B2, C1, and D1. Based on our results, we propose a reconfiguration of one subhaplogroup A2 (A2aa) that is specific to South America and identify other singleton branches across the four haplogroups. Furthermore, we show nucleotide diversity values that increase from north to south for haplogroups C1 and D1. The results from our work add to the growing mitogenomic data that highlight local phylogenies and support the rapid genetic differentiation of South American populations, which has been correlated with the linguistic diversity in the region by previous studies.

  1. HYBRIDCHECK: software for the rapid detection, visualization and dating of recombinant regions in genome sequence data.

    PubMed

    Ward, Ben J; van Oosterhout, Cock

    2016-03-01

    HYBRIDCHECK is a software package to visualize the recombination signal in large DNA sequence data set, and it can be used to analyse recombination, genetic introgression, hybridization and horizontal gene transfer. It can scan large (multiple kb) contigs and whole-genome sequences of three or more individuals. HYBRIDCHECK is written in the r software for OS X, Linux and Windows operating systems, and it has a simple graphical user interface. In addition, the r code can be readily incorporated in scripts and analysis pipelines. HYBRIDCHECK implements several ABBA-BABA tests and visualizes the effects of hybridization and the resulting mosaic-like genome structure in high-density graphics. The package also reports the following: (i) the breakpoint positions, (ii) the number of mutations in each introgressed block, (iii) the probability that the identified region is not caused by recombination and (iv) the estimated age of each recombination event. The divergence times between the donor and recombinant sequence are calculated using a JC, K80, F81, HKY or GTR correction, and the dating algorithm is exceedingly fast. By estimating the coalescence time of introgressed blocks, it is possible to distinguish between hybridization and incomplete lineage sorting. HYBRIDCHECK is libré software and it and its manual are free to download from http://ward9250.github.io/HybridCheck/.

  2. Genomic Regions Associated with Sheep Resistance to Gastrointestinal Nematodes.

    PubMed

    Benavides, Magda Vieira; Sonstegard, Tad S; Van Tassell, Curtis

    2016-06-01

    Genetic markers for sheep resistance to gastrointestinal parasites have long been sought by the livestock industry as a way to select more resistant individuals and to help farmers reduce parasite transmission by identifying and removing high egg shedders from the flock. Polymorphisms related to the major histocompatibility complex and interferon (IFN)-γ genes have been the most frequently reported markers associated with infection. Recently, a new picture is emerging from genome-wide studies, showing that not only immune mechanisms are important determinants of host resistance but that gastrointestinal mucus production and hemostasis pathways may also play a role. PMID:27183838

  3. Chromosome region-specific libraries for human genome analysis

    SciTech Connect

    Kao, Fa-Ten.

    1992-08-01

    During the grant period progress has been made in the successful demonstration of regional mapping of microclones derived from microdissection libraries; successful demonstration of the feasibility of converting microclones with short inserts into yeast artificial chromosome clones with very large inserts for high resolution physical mapping of the dissected region; Successful demonstration of the usefulness of region-specific microclones to isolate region-specific cDNA clones as candidate genes to facilitate search for the crucial genes underlying genetic diseases assigned to the dissected region; and the successful construction of four region-specific microdissection libraries for human chromosome 2, including 2q35-q37, 2q33-q35, 2p23-p25 and 2p2l-p23. The 2q35-q37 library has been characterized in detail. The characterization of the other three libraries is in progress. These region-specific microdissection libraries and the unique sequence microclones derived from the libraries will be valuable resources for investigators engaged in high resolution physical mapping and isolation of disease-related genes residing in these chromosomal regions.

  4. Isolation of Specific Genomic Regions and Identification of Associated Molecules by enChIP

    PubMed Central

    Fujita, Toshitsugu; Fujii, Hodaka

    2016-01-01

    The identification of molecules associated with specific genomic regions of interest is required to understand the mechanisms of regulation of the functions of these regions. To enable the non-biased identification of molecules interacting with a specific genomic region of interest, we recently developed the engineered DNA-binding molecule-mediated chromatin immunoprecipitation (enChIP) technique. Here, we describe how to use enChIP to isolate specific genomic regions and identify the associated proteins and RNAs. First, a genomic region of interest is tagged with a transcription activator-like (TAL) protein or a clustered regularly interspaced short palindromic repeats (CRISPR) complex consisting of a catalytically inactive form of Cas9 and a guide RNA. Subsequently, the chromatin is crosslinked and fragmented by sonication. The tagged locus is then immunoprecipitated and the crosslinking is reversed. Finally, the proteins or RNAs that are associated with the isolated chromatin are subjected to mass spectrometric or RNA sequencing analyses, respectively. This approach allows the successful identification of proteins and RNAs associated with a genomic region of interest. PMID:26862718

  5. Genome-wide association analysis to identify chromosomal regions determining components of earliness in wheat.

    PubMed

    Le Gouis, J; Bordes, J; Ravel, C; Heumez, E; Faure, S; Praud, S; Galic, N; Remoué, C; Balfourier, F; Allard, V; Rousset, M

    2012-02-01

    The modification of flowering date is considered an important way to escape the current or future climatic constraints that affect wheat crops. A better understanding of its genetic bases would enable a more efficient and rapid modification through breeding. The objective of this study was to identify chromosomal regions associated with earliness in wheat. A 227-wheat core collection chosen to be highly contrasted for earliness was characterized for heading date. Experiments were conducted in controlled conditions and in the field for 3 years to break down earliness in the component traits: photoperiod sensitivity, vernalization requirement and narrow-sense earliness. Whole-genome association mapping was carried out using 760 molecular markers and taking into account the five ancestral group structure. We identified 62 markers individually associated to earliness components corresponding to 33 chromosomal regions. In addition, we identified 15 other significant markers and seven more regions by testing marker pair interactions. Co-localizations were observed with the Ppd-1, Vrn-1 and Rht-1 candidate genes. Using an independent set of lines to validate the model built for heading date, we were able to explain 34% of the variation using the structure and the significant markers. Results were compared with already published data using bi-parental populations giving an insight into the genetic architecture of flowering time in wheat.

  6. Differentially Methylated Genomic Regions in Birth-Weight Discordant Twin Pairs.

    PubMed

    Chen, Mubo; Baumbach, Jan; Vandin, Fabio; Röttger, Richard; Barbosa, Eudes; Dong, Mingchui; Frost, Morten; Christiansen, Lene; Tan, Qihua

    2016-03-01

    Poor nutrition during critical growth phases may alter the structural and physiologic development of vital organs thus "programming" the susceptibility to adult-onset diseases and disease-related health conditions. Epigenome-wide association studies have been performed in birth-weight discordant twin pairs to find evidence for such "programming" effects, but no significant results emerged. We further investigated this issue using a new computational approach: Instead of probing single genomic sites for significant alterations in epigenetic marks, we scan for differentially methylated genomic regions. Whole genome DNA methylation levels were measured in whole blood from 150 pairs of adult identical twins discordant for birth-weight. Intrapair differential DNA methylation was associated with qualitative (large or small) and quantitative (percentage) birth-weight discordance at each genomic site using regression models adjusting for age and sex. Based on the regression results, genomic regions with consistent alteration patterns of DNA methylation were located and tested for significant robustness using computational permutation tests. This yielded an interesting genomic region on chromosome 1, which is significantly differentially methylated for quantitative birth-weight discordance. The region covers two genes (TYW3 and CRYZ) both reportedly associated with metabolism. We conclude that prenatal conditions for birth-weight discordance may result in persistent epigenetic modifications potentially affecting even adult health. PMID:26831219

  7. ECRbase: Database of Evolutionary Conserved Regions, Promoters, and Transcription Factor Binding Sites in Vertebrate Genomes

    SciTech Connect

    Loots, G; Ovcharenko, I

    2006-08-08

    Evolutionary conservation of DNA sequences provides a tool for the identification of functional elements in genomes. We have created a database of evolutionary conserved regions (ECRs) in vertebrate genomes entitled ECRbase that is constructed from a collection of pairwise vertebrate genome alignments produced by the ECR Browser database. ECRbase features a database of syntenic blocks that recapitulate the evolution of rearrangements in vertebrates and a collection of promoters in all vertebrate genomes presented in the database. The database also contains a collection of annotated transcription factor binding sites (TFBS) in all ECRs and promoter elements. ECRbase currently includes human, rhesus macaque, dog, opossum, rat, mouse, chicken, frog, zebrafish, and two pufferfish genomes. It is freely accessible at http://ECRbase.dcode.org.

  8. CNV-based genome wide association study reveals additional variants contributing to meat quality in swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality is important both to the meat processing industry and consumers’ purchasing attitudes. Copy number variation (CNV) is a burgeoning kind of variant that may influence meat quality. Herein, a genome-wide association study (GWAS) was performed between CNVs and meat quality traits in swine....

  9. Genome-Wide Association Study of Intelligence: Additive Effects of Novel Brain Expressed Genes

    ERIC Educational Resources Information Center

    Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.

    2012-01-01

    Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…

  10. De Novo Identification of Regulatory Regions in Intergenic Spaces of Prokaryotic Genomes

    SciTech Connect

    Chain, P; Garcia, E; Mcloughlin, K; Ovcharenko, I

    2007-02-20

    This project was begun to implement, test, and experimentally validate the results of a novel algorithm for genome-wide identification of candidate transcription-factor binding sites in prokaryotes. Most techniques used to identify regulatory regions rely on conservation between different genomes or have a predetermined sequence motif(s) to perform a genome-wide search. Therefore, such techniques cannot be used with new genome sequences, where information regarding such motifs has not yet been discovered. This project aimed to apply a de novo search algorithm to identify candidate binding-site motifs in intergenic regions of prokaryotic organisms, initially testing the available genomes of the Yersinia genus. We retrofitted existing nucleotide pattern-matching algorithms, analyzed the candidate sites identified by these algorithms as well as their target genes to screen for meaningful patterns. Using properly annotated prokaryotic genomes, this project aimed to develop a set of procedures to identify candidate intergenic sites important for gene regulation. We planned to demonstrate this in Yersinia pestis, a model biodefense, Category A Select Agent pathogen, and then follow up with experimental evidence that these regions are indeed involved in regulation. The ability to quickly characterize transcription-factor binding sites will help lead to a better understanding of how known virulence pathways are modulated in biodefense-related organisms, and will help our understanding and exploration of regulons--gene regulatory networks--and novel pathways for metabolic processes in environmental microbes.

  11. STaRRRT: a table of short tandem repeats in regulatory regions of the human genome

    PubMed Central

    2013-01-01

    Background Tandem repeats (TRs) are unstable regions commonly found within genomes that have consequences for evolution and disease. In humans, polymorphic TRs are known to cause neurodegenerative and neuromuscular disorders as well as being associated with complex diseases such as diabetes and cancer. If present in upstream regulatory regions, TRs can modify chromatin structure and affect transcription; resulting in altered gene expression and protein abundance. The most common TRs are short tandem repeats (STRs), or microsatellites. Promoter located STRs are considerably more polymorphic than coding region STRs. As such, they may be a common driver of phenotypic variation. To study STRs located in regulatory regions, we have performed genome-wide analysis to identify all STRs present in a region that is 2 kilobases upstream and 1 kilobase downstream of the transcription start sites of genes. Results The Short Tandem Repeats in Regulatory Regions Table, STaRRRT, contains the results of the genome-wide analysis, outlining the characteristics of 5,264 STRs present in the upstream regulatory region of 4,441 human genes. Gene set enrichment analysis has revealed significant enrichment for STRs in cellular, transcriptional and neurological system gene promoters and genes important in ion and calcium homeostasis. The set of enriched terms has broad similarity to that seen in coding regions, suggesting that regulatory region STRs are subject to similar evolutionary pressures as STRs in coding regions and may, like coding region STRs, have an important role in controlling gene expression. Conclusions STaRRRT is a readily-searchable resource for investigating potentially polymorphic STRs that could influence the expression of any gene of interest. The processes and genes enriched for regulatory region STRs provide potential novel targets for diagnosing and treating disease, and support a role for these STRs in the evolution of the human genome. PMID:24228761

  12. Demographically-Based Evaluation of Genomic Regions under Selection in Domestic Dogs

    PubMed Central

    Freedman, Adam H.; Schweizer, Rena M.; Ortega-Del Vecchyo, Diego; Han, Eunjung; Davis, Brian W.; Gronau, Ilan; Silva, Pedro M.; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R.; Parker, Heidi G.; Lee, Clarence; Tadigotla, Vasisht; Siepel, Adam; Bustamante, Carlos D.; Harkins, Timothy T.; Nelson, Stanley F.; Marques-Bonet, Tomas; Ostrander, Elaine A.; Wayne, Robert K.; Novembre, John

    2016-01-01

    Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in the domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR) and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The 3rd top hit, CCRN4L, plays a major role in lipid metabolism, that is supported by additional metabolism related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower-rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers. PMID:26943675

  13. Demographically-Based Evaluation of Genomic Regions under Selection in Domestic Dogs.

    PubMed

    Freedman, Adam H; Schweizer, Rena M; Ortega-Del Vecchyo, Diego; Han, Eunjung; Davis, Brian W; Gronau, Ilan; Silva, Pedro M; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R; Parker, Heidi G; Lee, Clarence; Tadigotla, Vasisht; Siepel, Adam; Bustamante, Carlos D; Harkins, Timothy T; Nelson, Stanley F; Marques-Bonet, Tomas; Ostrander, Elaine A; Wayne, Robert K; Novembre, John

    2016-03-01

    Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in the domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR) and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The 3rd top hit, CCRN4L, plays a major role in lipid metabolism, that is supported by additional metabolism related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower-rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers.

  14. Demographically-Based Evaluation of Genomic Regions under Selection in Domestic Dogs.

    PubMed

    Freedman, Adam H; Schweizer, Rena M; Ortega-Del Vecchyo, Diego; Han, Eunjung; Davis, Brian W; Gronau, Ilan; Silva, Pedro M; Galaverni, Marco; Fan, Zhenxin; Marx, Peter; Lorente-Galdos, Belen; Ramirez, Oscar; Hormozdiari, Farhad; Alkan, Can; Vilà, Carles; Squire, Kevin; Geffen, Eli; Kusak, Josip; Boyko, Adam R; Parker, Heidi G; Lee, Clarence; Tadigotla, Vasisht; Siepel, Adam; Bustamante, Carlos D; Harkins, Timothy T; Nelson, Stanley F; Marques-Bonet, Tomas; Ostrander, Elaine A; Wayne, Robert K; Novembre, John

    2016-03-01

    Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in the domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR) and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The 3rd top hit, CCRN4L, plays a major role in lipid metabolism, that is supported by additional metabolism related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower-rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers. PMID:26943675

  15. Comparative genomics provides insight into maize adaptation in temperate regions.

    PubMed

    Hufford, Matthew B

    2016-01-01

    A new study provides insights into the evolution of maize during its global spread into temperate regions from its origin in coastal Mexico.Please see related Research article: http://genomebiology.biomedcentral.com/articles/10.1186/s13059-016-1009-x. PMID:27411931

  16. Specific amplification by PCR of rearranged genomic variable regions of immunoglobulin genes from mouse hybridoma cells.

    PubMed

    Berdoz, J; Monath, T P; Kraehenbuhl, J P

    1995-04-01

    We have designed a novel strategy for the isolation of the rearranged genomic fragments encoding the L-VH-D-JH and L-V kappa/lambda-J kappa/lambda regions of mouse immunoglobulin genes. This strategy is based on the PCR amplification of genomic DNA from mouse hybridomas using multiple specific primers chosen in the 5'-untranslated region and in the intron downstream of the rearranged JH/J kappa/lambda sequences. Variable regions with intact coding sequences, including full-length leader peptides (L) can be obtained without previous DNA sequencing. Our strategy is based on a genomic template that produces fragments that do not need to be adapted for recombinant antibody expression, thus facilitating the generation of chimeric and isotype-switched immunoglobulins.

  17. A genome-wide association study identifies a genomic region for the polycerate phenotype in sheep (Ovis aries)

    PubMed Central

    Ren, Xue; Yang, Guang-Li; Peng, Wei-Feng; Zhao, Yong-Xin; Zhang, Min; Chen, Ze-Hui; Wu, Fu-An; Kantanen, Juha; Shen, Min; Li, Meng-Hua

    2016-01-01

    Horns are a cranial appendage found exclusively in Bovidae, and play important roles in accessing resources and mates. In sheep (Ovies aries), horns vary from polled to six-horned, and human have been selecting polled animals in farming and breeding. Here, we conducted a genome-wide association study on 24 two-horned versus 22 four-horned phenotypes in a native Chinese breed of Sishui Fur sheep. Together with linkage disequilibrium (LD) analyses and haplotype-based association tests, we identified a genomic region comprising 132.0–133.1 Mb on chromosome 2 that contained the top 10 SNPs (including 4 significant SNPs) and 5 most significant haplotypes associated with the polycerate phenotype. In humans and mice, this genomic region contains the HOXD gene cluster and adjacent functional genes EVX2 and KIAA1715, which have a close association with the formation of limbs and genital buds. Our results provide new insights into the genetic basis underlying variable numbers of horns and represent a new resource for use in sheep genetics and breeding. PMID:26883901

  18. Chromosome region-specific libraries for human genome analysis. Final progress report, 1 March 1991--28 February 1994

    SciTech Connect

    Kao, F.T.

    1994-04-01

    The objectives of this grant proposal include (1) development of a chromosome microdissection and PCR-mediated microcloning technology, (2) application of this microtechnology to the construction of region-specific libraries for human genome analysis. During this grant period, the authors have successfully developed this microtechnology and have applied it to the construction of microdissection libraries for the following chromosome regions: a whole chromosome 21 (21E), 2 region-specific libraries for the long arm of chromosome 2, 2q35-q37 (2Q1) and 2q33-q35 (2Q2), and 4 region-specific libraries for the entire short arm of chromosome 2, 2p23-p25 (2P1), 2p21-p23 (2P2), 2p14-p16 (wP3) and 2p11-p13 (2P4). In addition, 20--40 unique sequence microclones have been isolated and characterized for genomic studies. These region-specific libraries and the single-copy microclones from the library have been used as valuable resources for (1) isolating microsatellite probes in linkage analysis to further refine the disease locus; (2) isolating corresponding clones with large inserts, e.g. YAC, BAC, P1, cosmid and phage, to facilitate construction of contigs for high resolution physical mapping; and (3) isolating region-specific cDNA clones for use as candidate genes. These libraries are being deposited in the American Type Culture Collection (ATCC) for general distribution.

  19. SynFind: Compiling Syntenic Regions across Any Set of Genomes on Demand

    PubMed Central

    Tang, Haibao; Bomhoff, Matthew D.; Briones, Evan; Zhang, Liangsheng; Schnable, James C.; Lyons, Eric

    2015-01-01

    The identification of conserved syntenic regions enables discovery of predicted locations for orthologous and homeologous genes, even when no such gene is present. This capability means that synteny-based methods are far more effective than sequence similarity-based methods in identifying true-negatives, a necessity for studying gene loss and gene transposition. However, the identification of syntenic regions requires complex analyses which must be repeated for pairwise comparisons between any two species. Therefore, as the number of published genomes increases, there is a growing demand for scalable, simple-to-use applications to perform comparative genomic analyses that cater to both gene family studies and genome-scale studies. We implemented SynFind, a web-based tool that addresses this need. Given one query genome, SynFind is capable of identifying conserved syntenic regions in any set of target genomes. SynFind is capable of reporting per-gene information, useful for researchers studying specific gene families, as well as genome-wide data sets of syntenic gene and predicted gene locations, critical for researchers focused on large-scale genomic analyses. Inference of syntenic homologs provides the basis for correlation of functional changes around genes of interests between related organisms. Deployed on the CoGe online platform, SynFind is connected to the genomic data from over 15,000 organisms from all domains of life as well as supporting multiple releases of the same organism. SynFind makes use of a powerful job execution framework that promises scalability and reproducibility. SynFind can be accessed at http://genomevolution.org/CoGe/SynFind.pl. A video tutorial of SynFind using Phytophthrora as an example is available at http://www.youtube.com/watch?v=2Agczny9Nyc. PMID:26560340

  20. SynFind: Compiling Syntenic Regions across Any Set of Genomes on Demand.

    PubMed

    Tang, Haibao; Bomhoff, Matthew D; Briones, Evan; Zhang, Liangsheng; Schnable, James C; Lyons, Eric

    2015-12-01

    The identification of conserved syntenic regions enables discovery of predicted locations for orthologous and homeologous genes, even when no such gene is present. This capability means that synteny-based methods are far more effective than sequence similarity-based methods in identifying true-negatives, a necessity for studying gene loss and gene transposition. However, the identification of syntenic regions requires complex analyses which must be repeated for pairwise comparisons between any two species. Therefore, as the number of published genomes increases, there is a growing demand for scalable, simple-to-use applications to perform comparative genomic analyses that cater to both gene family studies and genome-scale studies. We implemented SynFind, a web-based tool that addresses this need. Given one query genome, SynFind is capable of identifying conserved syntenic regions in any set of target genomes. SynFind is capable of reporting per-gene information, useful for researchers studying specific gene families, as well as genome-wide data sets of syntenic gene and predicted gene locations, critical for researchers focused on large-scale genomic analyses. Inference of syntenic homologs provides the basis for correlation of functional changes around genes of interests between related organisms. Deployed on the CoGe online platform, SynFind is connected to the genomic data from over 15,000 organisms from all domains of life as well as supporting multiple releases of the same organism. SynFind makes use of a powerful job execution framework that promises scalability and reproducibility. SynFind can be accessed at http://genomevolution.org/CoGe/SynFind.pl. A video tutorial of SynFind using Phytophthrora as an example is available at http://www.youtube.com/watch?v=2Agczny9Nyc. PMID:26560340

  1. SynFind: Compiling Syntenic Regions across Any Set of Genomes on Demand.

    PubMed

    Tang, Haibao; Bomhoff, Matthew D; Briones, Evan; Zhang, Liangsheng; Schnable, James C; Lyons, Eric

    2015-12-01

    The identification of conserved syntenic regions enables discovery of predicted locations for orthologous and homeologous genes, even when no such gene is present. This capability means that synteny-based methods are far more effective than sequence similarity-based methods in identifying true-negatives, a necessity for studying gene loss and gene transposition. However, the identification of syntenic regions requires complex analyses which must be repeated for pairwise comparisons between any two species. Therefore, as the number of published genomes increases, there is a growing demand for scalable, simple-to-use applications to perform comparative genomic analyses that cater to both gene family studies and genome-scale studies. We implemented SynFind, a web-based tool that addresses this need. Given one query genome, SynFind is capable of identifying conserved syntenic regions in any set of target genomes. SynFind is capable of reporting per-gene information, useful for researchers studying specific gene families, as well as genome-wide data sets of syntenic gene and predicted gene locations, critical for researchers focused on large-scale genomic analyses. Inference of syntenic homologs provides the basis for correlation of functional changes around genes of interests between related organisms. Deployed on the CoGe online platform, SynFind is connected to the genomic data from over 15,000 organisms from all domains of life as well as supporting multiple releases of the same organism. SynFind makes use of a powerful job execution framework that promises scalability and reproducibility. SynFind can be accessed at http://genomevolution.org/CoGe/SynFind.pl. A video tutorial of SynFind using Phytophthrora as an example is available at http://www.youtube.com/watch?v=2Agczny9Nyc.

  2. OcculterCut: A Comprehensive Survey of AT-Rich Regions in Fungal Genomes

    PubMed Central

    Testa, Alison C.; Oliver, Richard P.; Hane, James K.

    2016-01-01

    We present a novel method to measure the local GC-content bias in genomes and a survey of published fungal species. The method, enacted as “OcculterCut” (https://sourceforge.net/projects/occultercut, last accessed April 30, 2016), identified species containing distinct AT-rich regions. In most fungal taxa, AT-rich regions are a signature of repeat-induced point mutation (RIP), which targets repetitive DNA and decreases GC-content though the conversion of cytosine to thymine bases. RIP has in turn been identified as a driver of fungal genome evolution, as RIP mutations can also occur in single-copy genes neighboring repeat-rich regions. Over time RIP perpetuates “two speeds” of gene evolution in the GC-equilibrated and AT-rich regions of fungal genomes. In this study, genomes showing evidence of this process are found to be common, particularly among the Pezizomycotina. Further analysis highlighted differences in amino acid composition and putative functions of genes from these regions, supporting the hypothesis that these regions play an important role in fungal evolution. OcculterCut can also be used to identify genes undergoing RIP-assisted diversifying selection, such as small, secreted effector proteins that mediate host-microbe disease interactions. PMID:27289099

  3. Molecular cloning and sequence determination of the genomic regions encoding protease and genome-linked protein of three picornaviruses.

    PubMed Central

    Werner, G; Rosenwirth, B; Bauer, E; Seifert, J M; Werner, F J; Besemer, J

    1986-01-01

    To investigate the degree of similarity between picornavirus proteases, we cloned the genomic cDNAs of an enterovirus, echovirus 9 (strain Barty), and two rhinoviruses, serotypes 1A and 14LP, and determined the nucleotide sequence of the region which, by analogy to poliovirus, encodes the protease. The nucleotide sequence of the region encoding the genome-linked protein VPg, immediately adjacent to the protease, was also determined. Comparison of nucleotide and deduced amino acid sequences with other available picornavirus sequences showed remarkable homology in proteases and among VPgs. Three highly conserved peptide regions were identified in the protease; one of these is specific for human picornaviruses and has no obvious counterpart in encephalomyocarditis virus, foot-and-mouth disease virus, or cowpea mosaic virus proteases. Within the other two peptide regions two conserved amino acids, Cys 147 and His 161, could be the reactive residues of the active site. We used a statistical method to predict certain features of the secondary structures, such as alpha helices, beta sheets, and turns, and found many of these conformations to be conserved. The hydropathy profiles of the compared proteases were also strikingly similar. Thus, the proteases of human picornaviruses very probably have a similar three-dimensional structure. Images PMID:3512851

  4. The catecholamine biosynthetic enzyme dopamine β-hydroxylase (DBH): first genome-wide search positions trait-determining variants acting additively in the proximal promoter

    PubMed Central

    Mustapic, Maja; Maihofer, Adam X.; Mahata, Manjula; Chen, Yuqing; Baker, Dewleen G.; O'Connor, Daniel T.; Nievergelt, Caroline M.

    2014-01-01

    Dopamine beta-hydroxylase (DBH) is the biosynthetic enzyme catalyzing formation of norepinephrine. Changes in DBH expression or activity have been implicated in the pathogenesis of cardiovascular and neuropsychiatric disorders. Genetic determination of DBH enzymatic activity and its secretion are only incompletely understood. We began with a genome-wide association search for loci contributing to DBH activity in human plasma. Initially, in a population sample of European ancestry, we identified the proximal DBH promoter as a region harboring three common trait-determining variants (top hit rs1611115, P = 7.2 × 10−51). We confirmed their effects on transcription and showed that the three variants each acted additively on gene expression. Results were replicated in a population sample of Native American descent (top hit rs1611115, P = 4.1 × 10−15). Jointly, DBH variants accounted for 57% of DBH trait variation. We further identified a genome-wide significant SNP at the LOC338797 locus on chromosome 12 as trans-quantitative trait locus (QTL) (rs4255618, P = 4.62 × 10−8). Conditional analyses on DBH identified a third genomic region contributing to DBH variation: a likely cis-QTL adjacent to DBH in SARDH (rs7040170, P = 1.31 × 10−14) on chromosome 9q. We conclude that three common SNPs in the DBH promoter act additively to control phenotypic variation in DBH levels, and that two additional novel loci (SARDH and LOC338797) may also contribute to the expression of this catecholamine biosynthetic trait. Identification of DBH variants with strong effects makes it possible to take advantage of Mendelian randomization approaches to test causal effects of this intermediate trait on disease. PMID:24986918

  5. Expression of transcribed ultraconserved regions of genome in rat cerebral cortex

    PubMed Central

    Mehta, Suresh L.; Dharap, Ashutosh; Vemuganti, Raghu

    2014-01-01

    Emerging evidence indicates that 481 regions of the genome (>200 bp) that actively transcribe noncoding RNAs shows 100% homology between humans, rats and mice. These transcribed ultraconserved regions (T-UCRs) are thought to control the essential regulatory functions basic for life in rodents and mammals. Using microarray analysis, we presently show that 107 T-UCRs are actively expressed in adult rat cerebral cortex. They are grouped into intragenic (61) and intergenic (46) based on their genic location. Interestingly, 10 T-UCRs are expressed at unusually high levels in cerebral cortex. Additionally, many T-UCRs also showed cogenic expression. We further analyzed the correlation of intragenic T-UCRs with their host protein coding genes. Surprisingly, most of the expressed intragenic T-UCRs (54 out of 61) displayed a negative correlation with their host gene expression. T-UCRs are thought to control the splicing and transcription of the protein-coding genes that host them and flank them. Bioinformatics analysis indicated that the protein products of majority of these genes are nuclear in localization, share protein domains and are involved in the regulation of diverse biological and molecular functions including metabolism, development, cell cycle, binding and transcription factor regulation. In conclusion, this is the first study to shows that many T-UCRs are expressed in rodent brain and they might play a role in physiological brain functions. PMID:24953281

  6. Genome-Enabled Estimates of Additive and Nonadditive Genetic Variances and Prediction of Apple Phenotypes Across Environments.

    PubMed

    Kumar, Satish; Molloy, Claire; Muñoz, Patricio; Daetwyler, Hans; Chagné, David; Volz, Richard

    2015-12-01

    The nonadditive genetic effects may have an important contribution to total genetic variation of phenotypes, so estimates of both the additive and nonadditive effects are desirable for breeding and selection purposes. Our main objectives were to: estimate additive, dominance and epistatic variances of apple (Malus × domestica Borkh.) phenotypes using relationship matrices constructed from genome-wide dense single nucleotide polymorphism (SNP) markers; and compare the accuracy of genomic predictions using genomic best linear unbiased prediction models with or without including nonadditive genetic effects. A set of 247 clonally replicated individuals was assessed for six fruit quality traits at two sites, and also genotyped using an Illumina 8K SNP array. Across several fruit quality traits, the additive, dominance, and epistatic effects contributed about 30%, 16%, and 19%, respectively, to the total phenotypic variance. Models ignoring nonadditive components yielded upwardly biased estimates of additive variance (heritability) for all traits in this study. The accuracy of genomic predicted genetic values (GEGV) varied from about 0.15 to 0.35 for various traits, and these were almost identical for models with or without including nonadditive effects. However, models including nonadditive genetic effects further reduced the bias of GEGV. Between-site genotypic correlations were high (>0.85) for all traits, and genotype-site interaction accounted for <10% of the phenotypic variability. The accuracy of prediction, when the validation set was present only at one site, was generally similar for both sites, and varied from about 0.50 to 0.85. The prediction accuracies were strongly influenced by trait heritability, and genetic relatedness between the training and validation families.

  7. Genome-Enabled Estimates of Additive and Nonadditive Genetic Variances and Prediction of Apple Phenotypes Across Environments

    PubMed Central

    Kumar, Satish; Molloy, Claire; Muñoz, Patricio; Daetwyler, Hans; Chagné, David; Volz, Richard

    2015-01-01

    The nonadditive genetic effects may have an important contribution to total genetic variation of phenotypes, so estimates of both the additive and nonadditive effects are desirable for breeding and selection purposes. Our main objectives were to: estimate additive, dominance and epistatic variances of apple (Malus × domestica Borkh.) phenotypes using relationship matrices constructed from genome-wide dense single nucleotide polymorphism (SNP) markers; and compare the accuracy of genomic predictions using genomic best linear unbiased prediction models with or without including nonadditive genetic effects. A set of 247 clonally replicated individuals was assessed for six fruit quality traits at two sites, and also genotyped using an Illumina 8K SNP array. Across several fruit quality traits, the additive, dominance, and epistatic effects contributed about 30%, 16%, and 19%, respectively, to the total phenotypic variance. Models ignoring nonadditive components yielded upwardly biased estimates of additive variance (heritability) for all traits in this study. The accuracy of genomic predicted genetic values (GEGV) varied from about 0.15 to 0.35 for various traits, and these were almost identical for models with or without including nonadditive effects. However, models including nonadditive genetic effects further reduced the bias of GEGV. Between-site genotypic correlations were high (>0.85) for all traits, and genotype-site interaction accounted for <10% of the phenotypic variability. The accuracy of prediction, when the validation set was present only at one site, was generally similar for both sites, and varied from about 0.50 to 0.85. The prediction accuracies were strongly influenced by trait heritability, and genetic relatedness between the training and validation families. PMID:26497141

  8. Acute hepatitis C in a chronically HIV-infected patient: Evolution of different viral genomic regions

    PubMed Central

    Flichman, Diego; Kott, Veronica; Sookoian, Silvia; Campos, Rodolfo

    2003-01-01

    AIM: To analyze the molecular evolution of different viral genomic regions of HCV in an acute HCV infected patient chronically infected with HIV through a 42-month follow-up. METHODS: Serum samples of a chronically HIV infected patient that seroconverted to anti HCV antibodies were sequenced, from the event of superinfection through a period of 17 mo and in a late sample (42nd month). Hypervariable genomic regions of HIV (V3 loop of the gp120) and HCV (HVR-1 on the E2 glycoprotein gene) were studied. In order to analyze genomic regions involved in different biological functions and with the cellular immune response, HCV core and NS5A were also chosen to be sequenced. Amplification of the different regions was done by RT-PCR and directly sequenced. Confirmation of sequences was done on reamplified material. Nucleotide sequences of the different time points were aligned with CLUSTAL W 1.5, and the corresponding amino acid ones were deduced. RESULTS: Hypervariable genomic regions of both viruses (HVR1 and gp120 V3 loop) presented several nonsynonymous changes but, while in the gp120 V3 loop mutations were detected in the sample obtained right after HCV superinfection and maintained throughout, they occurred following a sequential and cumulative pattern in the HVR1. In the NS5A region of HCV, two amino acid changes were detected during the follow-up period, whereas the core region presented several amino acid replacements, once the HCV chronic infection had been established. CONCLUSION: During the HIV-HCV superinfection, each genomic region analyzed shows a different evolutionary pattern. Most of the nucleotide substitutions observed are non-synonymous and clustered in previously described epitopes, thus suggesting an immune-driven evolutionary process. PMID:12854149

  9. Intrachromosomal rearrangements in avian genome evolution: evidence for regions prone to breakpoints

    PubMed Central

    Skinner, B M; Griffin, D K

    2012-01-01

    It is generally believed that the organization of avian genomes remains highly conserved in evolution as chromosome number is constant and comparative chromosome painting demonstrated there to be very few interchromosomal rearrangements. The recent sequencing of the zebra finch (Taeniopygia guttata) genome allowed an assessment of the number of intrachromosomal rearrangements between it and the chicken (Gallus gallus) genome, revealing a surprisingly high number of intrachromosomal rearrangements. With the publication of the turkey (Meleagris gallopavo) genome it has become possible to describe intrachromosomal rearrangements between these three important avian species, gain insight into the direction of evolutionary change and assess whether breakpoint regions are reused in birds. To this end, we aligned entire chromosomes between chicken, turkey and zebra finch, identifying syntenic blocks of at least 250 kb. Potential optimal pathways of rearrangements between each of the three genomes were determined, as was a potential Galliform ancestral organization. From this, our data suggest that around one-third of chromosomal breakpoint regions may recur during avian evolution, with 10% of breakpoints apparently recurring in different lineages. This agrees with our previous hypothesis that mechanisms of genome evolution are driven by hotspots of non-allelic homologous recombination. PMID:22045382

  10. A Genome-Wide Association Analysis Reveals Epistatic Cancellation of Additive Genetic Variance for Root Length in Arabidopsis thaliana.

    PubMed

    Lachowiec, Jennifer; Shen, Xia; Queitsch, Christine; Carlborg, Örjan

    2015-01-01

    Efforts to identify loci underlying complex traits generally assume that most genetic variance is additive. Here, we examined the genetics of Arabidopsis thaliana root length and found that the genomic narrow-sense heritability for this trait in the examined population was statistically zero. The low amount of additive genetic variance that could be captured by the genome-wide genotypes likely explains why no associations to root length could be found using standard additive-model-based genome-wide association (GWA) approaches. However, as the broad-sense heritability for root length was significantly larger, and primarily due to epistasis, we also performed an epistatic GWA analysis to map loci contributing to the epistatic genetic variance. Four interacting pairs of loci were revealed, involving seven chromosomal loci that passed a standard multiple-testing corrected significance threshold. The genotype-phenotype maps for these pairs revealed epistasis that cancelled out the additive genetic variance, explaining why these loci were not detected in the additive GWA analysis. Small population sizes, such as in our experiment, increase the risk of identifying false epistatic interactions due to testing for associations with very large numbers of multi-marker genotypes in few phenotyped individuals. Therefore, we estimated the false-positive risk using a new statistical approach that suggested half of the associated pairs to be true positive associations. Our experimental evaluation of candidate genes within the seven associated loci suggests that this estimate is conservative; we identified functional candidate genes that affected root development in four loci that were part of three of the pairs. The statistical epistatic analyses were thus indispensable for confirming known, and identifying new, candidate genes for root length in this population of wild-collected A. thaliana accessions. We also illustrate how epistatic cancellation of the additive genetic variance

  11. A Genome-Wide Association Analysis Reveals Epistatic Cancellation of Additive Genetic Variance for Root Length in Arabidopsis thaliana

    PubMed Central

    Lachowiec, Jennifer; Shen, Xia; Queitsch, Christine; Carlborg, Örjan

    2015-01-01

    Efforts to identify loci underlying complex traits generally assume that most genetic variance is additive. Here, we examined the genetics of Arabidopsis thaliana root length and found that the genomic narrow-sense heritability for this trait in the examined population was statistically zero. The low amount of additive genetic variance that could be captured by the genome-wide genotypes likely explains why no associations to root length could be found using standard additive-model-based genome-wide association (GWA) approaches. However, as the broad-sense heritability for root length was significantly larger, and primarily due to epistasis, we also performed an epistatic GWA analysis to map loci contributing to the epistatic genetic variance. Four interacting pairs of loci were revealed, involving seven chromosomal loci that passed a standard multiple-testing corrected significance threshold. The genotype-phenotype maps for these pairs revealed epistasis that cancelled out the additive genetic variance, explaining why these loci were not detected in the additive GWA analysis. Small population sizes, such as in our experiment, increase the risk of identifying false epistatic interactions due to testing for associations with very large numbers of multi-marker genotypes in few phenotyped individuals. Therefore, we estimated the false-positive risk using a new statistical approach that suggested half of the associated pairs to be true positive associations. Our experimental evaluation of candidate genes within the seven associated loci suggests that this estimate is conservative; we identified functional candidate genes that affected root development in four loci that were part of three of the pairs. The statistical epistatic analyses were thus indispensable for confirming known, and identifying new, candidate genes for root length in this population of wild-collected A. thaliana accessions. We also illustrate how epistatic cancellation of the additive genetic variance

  12. Genetics/Genomics Research in the Central Region

    USGS Publications Warehouse

    ,

    2006-01-01

    Genetics-based research within the Biological Resources Discipline (BRD) Science Centers in the Central Region incorporates many aspects of the field of genetics. Research activities range from documenting patterns of genetic variation in order to investigate relationships among species, populations and individuals to investigating the structure, function and expression of genes and their response to environmental stressors. Research in the broad areas of genetics requires multidisciplinary expertise and specialized equipment and instrumentation. Brief summaries of the capabilities of the five BRD Centers are given below.

  13. The Complete Chloroplast Genome Sequences of Three Veroniceae Species (Plantaginaceae): Comparative Analysis and Highly Divergent Regions

    PubMed Central

    Choi, Kyoung Su; Chung, Myong Gi; Park, SeonJoo

    2016-01-01

    Previous studies of Veronica and related genera were weakly supported by molecular and paraphyletic taxa. Here, we report the complete chloroplast genome sequence of Veronica nakaiana and the related species Veronica persica and Veronicastrum sibiricum. The chloroplast genome length of V. nakaiana, V. persica, and V. sibiricum ranged from 150,198 bp to 152,930 bp. A total of 112 genes comprising 79 protein coding genes, 29 tRNA genes, and 4 rRNA genes were observed in three chloroplast genomes. The total number of SSRs was 48, 51, and 53 in V. nakaiana, V. persica, and V. sibiricum, respectively. Two SSRs (10 bp of AT and 12 bp of AATA) were observed in the same regions (rpoC2 and ndhD) in three chloroplast genomes. A comparison of coding genes and non-coding regions between V. nakaiana and V. persica revealed divergent sites, with the greatest variation occurring petD-rpoA region. The complete chloroplast genome sequence information regarding the three Veroniceae will be helpful for elucidating Veroniceae phylogenetic relationships. PMID:27047524

  14. Isolation of Additional Bacteriophages with Genomes of Segmented Double-Stranded RNA

    PubMed Central

    Mindich, Leonard; Qiao, Xueying; Qiao, Jian; Onodera, Shiroh; Romantschuk, Martin; Hoogstraten, Deborah

    1999-01-01

    Eight different bacteriophages were isolated from leaves of Pisum sativum, Phaseolus vulgaris, Lycopersicon esculentum, Daucus carota sativum, Raphanus sativum, and Ocimum basilicum. All contain three segments of double-stranded RNA and have genomic-segment sizes that are similar but not identical to those of previously described bacteriophage φ6. All appear to have lipid-containing membranes. The base sequences of some of the viruses are very similar but not identical to those of φ6. Three of the viruses have little or no base sequence identity to φ6. Two of the viruses, φ8 and φ12, contain proteins with a size distribution very different from that of φ6 and do not package genomic segments of φ6. Whereas φ6 attaches to host cells by means of a pilus, several of the new isolates attach directly to the outer membrane. Although the normal hosts of these viruses seem to be pseudomonads, those viruses that attach directly to the outer membrane can establish carrier states in Escherichia coli or Salmonella typhimurium. One of the isolates, φ8, can form plaques on heptoseless strains of S. typhimurium. PMID:10419946

  15. Extensive Pyrosequencing Reveals Frequent Intra-Genomic Variations of Internal Transcribed Spacer Regions of Nuclear Ribosomal DNA

    PubMed Central

    Li, Dezhu; Sun, Yongzhen; Niu, Yunyun; Chen, Zhiduan; Luo, Hongmei; Pang, Xiaohui; Sun, Zhiying; Liu, Chang; Lv, Aiping; Deng, Youping; Larson-Rabin, Zachary; Wilkinson, Mike; Chen, Shilin

    2012-01-01

    Background Internal transcribed spacer of nuclear ribosomal DNA (nrDNA) is already one of the most popular phylogenetic and DNA barcoding markers. However, the existence of its multiple copies has complicated such usage and a detailed characterization of intra-genomic variations is critical to address such concerns. Methodology/Principal Findings In this study, we used sequence-tagged pyrosequencing and genome-wide analyses to characterize intra-genomic variations of internal transcribed spacer 2 (ITS2) regions from 178 plant species. We discovered that mutation of ITS2 is frequent, with a mean of 35 variants per species. And on average, three of the most abundant variants make up 91% of all ITS2 copies. Moreover, we found different congeneric species share identical variants in 13 genera. Interestingly, different species across different genera also share identical variants. In particular, one minor variant of ITS2 in Eleutherococcus giraldii was found identical to the ITS2 major variant of Panax ginseng, both from Araliaceae family. In addition, DNA barcoding gap analysis showed that the intra-genomic distances were markedly smaller than those of the intra-specific or inter-specific variants. When each of 5543 variants were examined for its species discrimination efficiency, a 97% success rate was obtained at the species level. Conclusions Identification of identical ITS2 variants across intra-generic or inter-generic species revealed complex species evolutionary history, possibly, horizontal gene transfer and ancestral hybridization. Although intra-genomic multiple variants are frequently found within each genome, the usage of the major variants alone is sufficient for phylogeny construction and species determination in most cases. Furthermore, the inclusion of minor variants further improves the resolution of species identification. PMID:22952830

  16. Genome-wide Association Study Identifies Five Susceptibility Loci for Follicular Lymphoma outside the HLA Region

    PubMed Central

    Skibola, Christine F.; Berndt, Sonja I.; Vijai, Joseph; Conde, Lucia; Wang, Zhaoming; Yeager, Meredith; de Bakker, Paul I.W.; Birmann, Brenda M.; Vajdic, Claire M.; Foo, Jia-Nee; Bracci, Paige M.; Vermeulen, Roel C.H.; Slager, Susan L.; de Sanjose, Silvia; Wang, Sophia S.; Linet, Martha S.; Salles, Gilles; Lan, Qing; Severi, Gianluca; Hjalgrim, Henrik; Lightfoot, Tracy; Melbye, Mads; Gu, Jian; Ghesquières, Hervé; Link, Brian K.; Morton, Lindsay M.; Holly, Elizabeth A.; Smith, Alex; Tinker, Lesley F.; Teras, Lauren R.; Kricker, Anne; Becker, Nikolaus; Purdue, Mark P.; Spinelli, John J.; Zhang, Yawei; Giles, Graham G.; Vineis, Paolo; Monnereau, Alain; Bertrand, Kimberly A.; Albanes, Demetrius; Zeleniuch-Jacquotte, Anne; Gabbas, Attilio; Chung, Charles C.; Burdett, Laurie; Hutchinson, Amy; Lawrence, Charles; Montalvan, Rebecca; Liang, Liming; Huang, Jinyan; Ma, Baoshan; Liu, Jianjun; Adami, Hans-Olov; Glimelius, Bengt; Ye, Yuanqing; Nowakowski, Grzegorz S.; Dogan, Ahmet; Thompson, Carrie A.; Habermann, Thomas M.; Novak, Anne J.; Liebow, Mark; Witzig, Thomas E.; Weiner, George J.; Schenk, Maryjean; Hartge, Patricia; De Roos, Anneclaire J.; Cozen, Wendy; Zhi, Degui; Akers, Nicholas K.; Riby, Jacques; Smith, Martyn T.; Lacher, Mortimer; Villano, Danylo J.; Maria, Ann; Roman, Eve; Kane, Eleanor; Jackson, Rebecca D.; North, Kari E.; Diver, W. Ryan; Turner, Jenny; Armstrong, Bruce K.; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; Staines, Anthony; McKay, James; Brooks-Wilson, Angela R.; Zheng, Tongzhang; Holford, Theodore R.; Chamosa, Saioa; Kaaks, Rudolph; Kelly, Rachel S.; Ohlsson, Bodil; Travis, Ruth C.; Weiderpass, Elisabete; Clavel, Jacqueline; Giovannucci, Edward; Kraft, Peter; Virtamo, Jarmo; Mazza, Patrizio; Cocco, Pierluigi; Ennas, Maria Grazia; Chiu, Brian C.H.; Fraumeni, Joseph F.; Nieters, Alexandra; Offit, Kenneth; Wu, Xifeng; Cerhan, James R.; Smedby, Karin E.; Chanock, Stephen J.; Rothman, Nathaniel

    2014-01-01

    Genome-wide association studies (GWASs) of follicular lymphoma (FL) have previously identified human leukocyte antigen (HLA) gene variants. To identify additional FL susceptibility loci, we conducted a large-scale two-stage GWAS in 4,523 case subjects and 13,344 control subjects of European ancestry. Five non-HLA loci were associated with FL risk: 11q23.3 (rs4938573, p = 5.79 × 10−20) near CXCR5; 11q24.3 (rs4937362, p = 6.76 × 10−11) near ETS1; 3q28 (rs6444305, p = 1.10 × 10−10) in LPP; 18q21.33 (rs17749561, p = 8.28 × 10−10) near BCL2; and 8q24.21 (rs13254990, p = 1.06 × 10−8) near PVT1. In an analysis of the HLA region, we identified four linked HLA-DRβ1 multiallelic amino acids at positions 11, 13, 28, and 30 that were associated with FL risk (pomnibus = 4.20 × 10−67 to 2.67 × 10−70). Additional independent signals included rs17203612 in HLA class II (odds ratio [ORper-allele] = 1.44; p = 4.59 × 10−16) and rs3130437 in HLA class I (ORper-allele = 1.23; p = 8.23 × 10−9). Our findings further expand the number of loci associated with FL and provide evidence that multiple common variants outside the HLA region make a significant contribution to FL risk. PMID:25279986

  17. The complete mitochondrial genome sequence of the tubeworm Lamellibrachia satsuma and structural conservation in the mitochondrial genome control regions of Order Sabellida.

    PubMed

    Patra, Ajit Kumar; Kwon, Yong Min; Kang, Sung Gyun; Fujiwara, Yoshihiro; Kim, Sang-Jin

    2016-04-01

    The control region of the mitochondrial genomes shows high variation in conserved sequence organizations, which follow distinct evolutionary patterns in different species or taxa. In this study, we sequenced the complete mitochondrial genome of Lamellibrachia satsuma from the cold-seep region of Kagoshima Bay, as a part of whole genome study and extensively studied the structural features and patterns of the control region sequences. We obtained 15,037 bp of mitochondrial genome using Illumina sequencing and identified the non-coding AT-rich region or control region (354 bp, AT=83.9%) located between trnH and trnR. We found 7 conserved sequence blocks (CSB), scattered throughout the control region of L. satsuma and other taxa of Annelida. The poly-TA stretches, which commonly form the stem of multiple stem-loop structures, are most conserved in the CSB-I and CSB-II regions. The mitochondrial genome of L. satsuma encodes a unique repetitive sequence in the control region, which forms a unique secondary structure in comparison to Lamellibrachia luymesi. Phylogenetic analyses of all protein-coding genes indicate that L. satsuma forms a monophyletic clade with L. luymesi along with other tubeworms found in cold-seep regions (genera: Lamellibrachia, Escarpia, and Seepiophila). In general, the control region sequences of Annelida could be aligned with certainty within each genus, and to some extent within the family, but with a higher rate of variation in conserved regions. PMID:26776396

  18. 5S rDNA genome regions of Lens species.

    PubMed

    Fernández, M; Ruiz, M L; Linares, C; Fominaya, A; Pérez de la Vega, M

    2005-10-01

    The length variability of the nontranscribed spacer (NTS) of the 5S rDNA repeats was analyzed in species of the genus Lens by means of PCR amplification. The NTS ranged from approximately 227 to approximately 952 bp. The polymorphism detected was higher than previous NTS polymorphisms described in this genus. Three NTS length variants from Lens culinaris subsp. culinaris and 2 from Lens culinaris subsp. orientalis were sequenced. The culinaris NTS fragment lengths were 239, 371, and 838 bp, whereas the orientalis ones were 472 bp and 506 bp, respectively. As a result of sequence similarities, 2 families of sequences were distinguished, 1 including the sequences of 838 and 506 bp, and others with the sequences of 239, 371, and 472 bp. The 1st family was characterized by the presence of a repeated sequence designated A, whereas the 2nd family showed a single A sequence and other repeated sequences designated B, C, and D. The presence of an (AT)n microsatellite was also observed in the 2nd family of sequences. The fragments, which included the 239-bp and 838-bp NTS sequences, as well as the intergenic spacer (IGS) of the 18S-5.8S-26S ribosomal DNA also from L. culinaris subsp. culinaris, were used to localize the nucleolar organizer region (NOR) and the 5S rDNA loci in the chromosomes of several species of the genus Lens by means of fluorescence in situ hybridization (FISH). The selective hybridization of the 2 NTS probes allowed us to distinguish between different 5S rDNA chromosomal loci.

  19. Genomic Characterization and Comparison of Multi-Regional and Pooled Tumor Biopsy Specimens

    PubMed Central

    Kim, Sang Cheol; Jung, HyunChul; Park, Woong-Yang; Song, Sang-Yong

    2016-01-01

    A single tumor biopsy specimen is typically used in cancer genome studies. However, it may represent incompletely the underlying mutational and transcriptional profiles of tumor biology. Multi-regional biopsies have the advantage of increased sensitivity for genomic profiling, but they are not cost-effective. The concept of an alternative method such as the pooling of multiple biopsies is a challenge. In order to determine if the pooling of distinct regions is representative at the genomic and transcriptome level, we performed sequencing of four regional samples and pooled samples for four cancer types including colon, stomach, kidney and liver cancer. Subsequently, a comparative analysis was conducted to explore differences in mutations and gene expression profiles between multiple regional biopsies and pooled biopsy for each tumor. Our analysis revealed a marginal level of regional difference in detected variants, but in those with low allele frequency, considerable discrepancies were observed. In conclusion, sequencing pooled samples has the benefit of detecting many variants with moderate allele frequency that occur in partial regions, but it is not applicable for detecting low-frequency mutations that require deep sequencing. PMID:27010638

  20. β-globin matrix attachment region improves stable genomic expression of the Sleeping Beauty transposon.

    PubMed

    Sjeklocha, Lucas; Chen, Yixin; Daly, Meghan C; Steer, Clifford J; Kren, Betsy T

    2011-09-01

    The liver is an attractive target for gene therapy due to its extensive capability for protein production and the numerous diseases resulting from a loss of gene function it normally provides. The Sleeping Beauty Transposon (SB-Tn)(1) system is a non-viral vector capable of delivering and mediating therapeutic transgene(s) insertion into the host genome for long-term expression. A current challenge for this system is the low efficiency of integration of the transgene. In this study we use a human hepatoma cell line (HuH-7) and primary human blood outgrowth endothelial cells (BOECs) to test vectors containing DNA elements to enhance transposition without integrating themselves. We employed the human β-globin matrix attachment region (MAR) and the Simian virus 40 (SV40) nuclear translocation signal to increase the percent of HuH-7 cells persistently expressing a GFP::Zeo reporter construct by ∼50% for each element; while combining both did not show an additive effect. Interestingly, both elements together displayed an additive effect on the number of insertion sites, and in BOECs the SV40 alone appeared to have an inhibitory effect on transposition. In long-term cultures the loss of plasmid DNA, transposase expression and mapping of insertion sites demonstrated bona fide transposition without episomal expression. These results show that addition of the β-globin MAR and potentially other elements to the backbone of SB-Tn system can enhance transposition and expression of therapeutic transgenes. These findings may have a significant influence on the use of SB transgene delivery to liver for the treatment of a wide variety of disorders. PMID:21520245

  1. An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: the Adh region.

    PubMed Central

    Ashburner, M; Misra, S; Roote, J; Lewis, S E; Blazej, R; Davis, T; Doyle, C; Galle, R; George, R; Harris, N; Hartzell, G; Harvey, D; Hong, L; Houston, K; Hoskins, R; Johnson, G; Martin, C; Moshrefi, A; Palazzolo, M; Reese, M G; Spradling, A; Tsang, G; Wan, K; Whitelaw, K; Celniker, S

    1999-01-01

    A contiguous sequence of nearly 3 Mb from the genome of Drosophila melanogaster has been sequenced from a series of overlapping P1 and BAC clones. This region covers 69 chromosome polytene bands on chromosome arm 2L, including the genetically well-characterized "Adh region." A computational analysis of the sequence predicts 218 protein-coding genes, 11 tRNAs, and 17 transposable element sequences. At least 38 of the protein-coding genes are arranged in clusters of from 2 to 6 closely related genes, suggesting extensive tandem duplication. The gene density is one protein-coding gene every 13 kb; the transposable element density is one element every 171 kb. Of 73 genes in this region identified by genetic analysis, 49 have been located on the sequence; P-element insertions have been mapped to 43 genes. Ninety-five (44%) of the known and predicted genes match a Drosophila EST, and 144 (66%) have clear similarities to proteins in other organisms. Genes known to have mutant phenotypes are more likely to be represented in cDNA libraries, and far more likely to have products similar to proteins of other organisms, than are genes with no known mutant phenotype. Over 650 chromosome aberration breakpoints map to this chromosome region, and their nonrandom distribution on the genetic map reflects variation in gene spacing on the DNA. This is the first large-scale analysis of the genome of D. melanogaster at the sequence level. In addition to the direct results obtained, this analysis has allowed us to develop and test methods that will be needed to interpret the complete sequence of the genome of this species.Before beginning a Hunt, it is wise to ask someone what you are looking for before you begin looking for it. Milne 1926 PMID:10471707

  2. Genomic complexity of the variable region-containing chitin-binding proteins in amphioxus

    PubMed Central

    Dishaw, Larry J; Mueller, M Gail; Gwatney, Natasha; Cannon, John P; Haire, Robert N; Litman, Ronda T; Amemiya, Chris T; Ota, Tatsuya; Rowen, Lee; Glusman, Gustavo; Litman, Gary W

    2008-01-01

    Background The variable region-containing chitin-binding proteins (VCBPs) are found in protochordates and consist of two tandem immunoglobulin variable (V)-type domains and a chitin-binding domain. We previously have shown that these polymorphic genes, which primarily are expressed in the gut, exhibit characteristics of immune genes. In this report, we describe VCBP genomic organization and characterize adjacent and intervening genetic features which may influence both their polymorphism and complex transcriptional repertoire. Results VCBP genes 1, 2, 4, and 5 are encoded in a single contiguous gene-rich chromosomal region and VCBP3 is encoded in a separate locus. The VCBPs exhibit extensive haplotype variation, including copy number variation (CNV), indel polymorphism and a markedly elevated variation in repeat type and density. In at least one haplotype, inverted repeats occur more frequently than elsewhere in the genome. Multi-animal cDNA screening, as well as transcriptional profilingusing a novel transfection system, suggests that haplotype-specific transcriptional variants may contribute to VCBP genetic diversity. Conclusion The availability of the Branchiostoma floridae genome (Joint Genome Institute, Brafl1), along with BAC and PAC screening and sequencing described here, reveal that the relatively limited number of VCBP genes present in the amphioxus genome exhibit exceptionally high haplotype variation. These VCBP haplotypes contribute a diverse pool of allelic variants, which includes gene copy number variation, pseudogenes, and other polymorphisms, while contributing secondary effects on gene transcription as well. PMID:19046437

  3. Genome analysis of Excretory/Secretory proteins in Taenia solium reveals their Abundance of Antigenic Regions (AAR)

    PubMed Central

    Gomez, Sandra; Adalid-Peralta, Laura; Palafox-Fonseca, Hector; Cantu-Robles, Vito Adrian; Soberón, Xavier; Sciutto, Edda; Fragoso, Gladis; Bobes, Raúl J.; Laclette, Juan P.; Yauner, Luis del Pozo; Ochoa-Leyva, Adrián

    2015-01-01

    Excretory/Secretory (ES) proteins play an important role in the host-parasite interactions. Experimental identification of ES proteins is time-consuming and expensive. Alternative bioinformatics approaches are cost-effective and can be used to prioritize the experimental analysis of therapeutic targets for parasitic diseases. Here we predicted and functionally annotated the ES proteins in T. solium genome using an integration of bioinformatics tools. Additionally, we developed a novel measurement to evaluate the potential antigenicity of T. solium secretome using sequence length and number of antigenic regions of ES proteins. This measurement was formalized as the Abundance of Antigenic Regions (AAR) value. AAR value for secretome showed a similar value to that obtained for a set of experimentally determined antigenic proteins and was different to the calculated value for the non-ES proteins of T. solium genome. Furthermore, we calculated the AAR values for known helminth secretomes and they were similar to that obtained for T. solium. The results reveal the utility of AAR value as a novel genomic measurement to evaluate the potential antigenicity of secretomes. This comprehensive analysis of T. solium secretome provides functional information for future experimental studies, including the identification of novel ES proteins of therapeutic, diagnosis and immunological interest. PMID:25989346

  4. Genome analysis of Excretory/Secretory proteins in Taenia solium reveals their Abundance of Antigenic Regions (AAR).

    PubMed

    Gomez, Sandra; Adalid-Peralta, Laura; Palafox-Fonseca, Hector; Cantu-Robles, Vito Adrian; Soberón, Xavier; Sciutto, Edda; Fragoso, Gladis; Bobes, Raúl J; Laclette, Juan P; Yauner, Luis del Pozo; Ochoa-Leyva, Adrián

    2015-05-19

    Excretory/Secretory (ES) proteins play an important role in the host-parasite interactions. Experimental identification of ES proteins is time-consuming and expensive. Alternative bioinformatics approaches are cost-effective and can be used to prioritize the experimental analysis of therapeutic targets for parasitic diseases. Here we predicted and functionally annotated the ES proteins in T. solium genome using an integration of bioinformatics tools. Additionally, we developed a novel measurement to evaluate the potential antigenicity of T. solium secretome using sequence length and number of antigenic regions of ES proteins. This measurement was formalized as the Abundance of Antigenic Regions (AAR) value. AAR value for secretome showed a similar value to that obtained for a set of experimentally determined antigenic proteins and was different to the calculated value for the non-ES proteins of T. solium genome. Furthermore, we calculated the AAR values for known helminth secretomes and they were similar to that obtained for T. solium. The results reveal the utility of AAR value as a novel genomic measurement to evaluate the potential antigenicity of secretomes. This comprehensive analysis of T. solium secretome provides functional information for future experimental studies, including the identification of novel ES proteins of therapeutic, diagnosis and immunological interest.

  5. Identification of Genomic Regions Associated with Phenotypic Variation between Dog Breeds using Selection Mapping

    PubMed Central

    Derrien, Thomas; Axelsson, Erik; Rosengren Pielberg, Gerli; Sigurdsson, Snaevar; Fall, Tove; Seppälä, Eija H.; Hansen, Mark S. T.; Lawley, Cindy T.; Karlsson, Elinor K.; Bannasch, Danika; Vilà, Carles; Lohi, Hannes; Galibert, Francis; Fredholm, Merete; Häggström, Jens; Hedhammar, Åke; André, Catherine; Lindblad-Toh, Kerstin; Hitte, Christophe; Webster, Matthew T.

    2011-01-01

    The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease. PMID:22022279

  6. Comparative annotation of functional regions in the human genome using epigenomic data.

    PubMed

    Won, Kyoung-Jae; Zhang, Xian; Wang, Tao; Ding, Bo; Raha, Debasish; Snyder, Michael; Ren, Bing; Wang, Wei

    2013-04-01

    Epigenetic regulation is dynamic and cell-type dependent. The recently available epigenomic data in multiple cell types provide an unprecedented opportunity for a comparative study of epigenetic landscape. We developed a machine-learning method called ChroModule to annotate the epigenetic states in eight ENCyclopedia Of DNA Elements cell types. The trained model successfully captured the characteristic histone-modification patterns associated with regulatory elements, such as promoters and enhancers, and showed superior performance on identifying enhancers compared with the state-of-art methods. In addition, given the fixed number of epigenetic states in the model, ChroModule allows straightforward illustration of epigenetic variability in multiple cell types. Using this feature, we found that invariable and variable epigenetic states across cell types correspond to housekeeping functions and stimulus response, respectively. Especially, we observed that enhancers, but not the other regulatory elements, dictate cell specificity, as similar cell types share common enhancers, and cell-type-specific enhancers are often bound by transcription factors playing critical roles in that cell type. More interestingly, we found some genomic regions are dormant in cell type but primed to become active in other cell types. These observations highlight the usefulness of ChroModule in comparative analysis and interpretation of multiple epigenomes.

  7. Evolutionary Genomics Suggests That CheV Is an Additional Adaptor for Accommodating Specific Chemoreceptors within the Chemotaxis Signaling Complex.

    PubMed

    Ortega, Davi R; Zhulin, Igor B

    2016-02-01

    Escherichia coli and Salmonella enterica are models for many experiments in molecular biology including chemotaxis, and most of the results obtained with one organism have been generalized to another. While most components of the chemotaxis pathway are strongly conserved between the two species, Salmonella genomes contain some chemoreceptors and an additional protein, CheV, that are not found in E. coli. The role of CheV was examined in distantly related species Bacillus subtilis and Helicobacter pylori, but its role in bacterial chemotaxis is still not well understood. We tested a hypothesis that in enterobacteria CheV functions as an additional adaptor linking the CheA kinase to certain types of chemoreceptors that cannot be effectively accommodated by the universal adaptor CheW. Phylogenetic profiling, genomic context and comparative protein sequence analyses suggested that CheV interacts with specific domains of CheA and chemoreceptors from an orthologous group exemplified by the Salmonella McpC protein. Structural consideration of the conservation patterns suggests that CheV and CheW share the same binding spot on the chemoreceptor structure, but have some affinity bias towards chemoreceptors from different orthologous groups. Finally, published experimental results and data newly obtained via comparative genomics support the idea that CheV functions as a "phosphate sink" possibly to off-set the over-stimulation of the kinase by certain types of chemoreceptors. Overall, our results strongly suggest that CheV is an additional adaptor for accommodating specific chemoreceptors within the chemotaxis signaling complex.

  8. Evolutionary genomics suggests that CheV is an additional adaptor for accommodating specific chemoreceptors within the chemotaxis signaling complex

    DOE PAGES

    Ortega, Davi R.; Zhulin, Igor B.; Punta, Marco

    2016-02-04

    Escherichia coli and Salmonella enterica are models for many experiments in molecular biology including chemotaxis, and most of the results obtained with one organism have been generalized to another. While most components of the chemotaxis pathway are strongly conserved between the two species, Salmonella genomes contain some chemoreceptors and an additional protein, CheV, that are not found in E. coli. The role of CheV was examined in distantly related species Bacillus subtilis and Helicobacter pylori, but its role in bacterial chemotaxis is still not well understood. We tested a hypothesis that in enterobacteria CheV functions as an additional adaptor linkingmore » the CheA kinase to certain types of chemoreceptors that cannot be effectively accommodated by the universal adaptor CheW. Phylogenetic profiling, genomic context and comparative protein sequence analyses suggested that CheV interacts with specific domains of CheA and chemoreceptors from an orthologous group exemplified by the Salmonella McpC protein. Structural consideration of the conservation patterns suggests that CheV and CheW share the same binding spot on the chemoreceptor structure, but have some affinity bias towards chemoreceptors from different orthologous groups. Finally, published experimental results and data newly obtained via comparative genomics support the idea that CheV functions as a "phosphate sink" possibly to off-set the over-stimulation of the kinase by certain types of chemoreceptors. Altogether, our results strongly suggest that CheV is an additional adaptor for accommodating specific chemoreceptors within the chemotaxis signaling complex.« less

  9. Analysis of genomic regions of Trichoderma harzianum IOC-3844 related to biomass degradation.

    PubMed

    Crucello, Aline; Sforça, Danilo Augusto; Horta, Maria Augusta Crivelente; dos Santos, Clelton Aparecido; Viana, Américo José Carvalho; Beloti, Lilian Luzia; de Toledo, Marcelo Augusto Szymanski; Vincentz, Michel; Kuroshu, Reginaldo Massanobu; de Souza, Anete Pereira

    2015-01-01

    Trichoderma harzianum IOC-3844 secretes high levels of cellulolytic-active enzymes and is therefore a promising strain for use in biotechnological applications in second-generation bioethanol production. However, the T. harzianum biomass degradation mechanism has not been well explored at the genetic level. The present work investigates six genomic regions (~150 kbp each) in this fungus that are enriched with genes related to biomass conversion. A BAC library consisting of 5,760 clones was constructed, with an average insert length of 90 kbp. The assembled BAC sequences revealed 232 predicted genes, 31.5% of which were related to catabolic pathways, including those involved in biomass degradation. An expression profile analysis based on RNA-Seq data demonstrated that putative regulatory elements, such as membrane transport proteins and transcription factors, are located in the same genomic regions as genes related to carbohydrate metabolism and exhibit similar expression profiles. Thus, we demonstrate a rapid and efficient tool that focuses on specific genomic regions by combining a BAC library with transcriptomic data. This is the first BAC-based structural genomic study of the cellulolytic fungus T. harzianum, and its findings provide new perspectives regarding the use of this species in biomass degradation processes. PMID:25836973

  10. Analysis of Genomic Regions of Trichoderma harzianum IOC-3844 Related to Biomass Degradation

    PubMed Central

    Crucello, Aline; Sforça, Danilo Augusto; Horta, Maria Augusta Crivelente; dos Santos, Clelton Aparecido; Viana, Américo José Carvalho; Beloti, Lilian Luzia; de Toledo, Marcelo Augusto Szymanski; Vincentz, Michel; Kuroshu, Reginaldo Massanobu; de Souza, Anete Pereira

    2015-01-01

    Trichoderma harzianum IOC-3844 secretes high levels of cellulolytic-active enzymes and is therefore a promising strain for use in biotechnological applications in second-generation bioethanol production. However, the T. harzianum biomass degradation mechanism has not been well explored at the genetic level. The present work investigates six genomic regions (~150 kbp each) in this fungus that are enriched with genes related to biomass conversion. A BAC library consisting of 5,760 clones was constructed, with an average insert length of 90 kbp. The assembled BAC sequences revealed 232 predicted genes, 31.5% of which were related to catabolic pathways, including those involved in biomass degradation. An expression profile analysis based on RNA-Seq data demonstrated that putative regulatory elements, such as membrane transport proteins and transcription factors, are located in the same genomic regions as genes related to carbohydrate metabolism and exhibit similar expression profiles. Thus, we demonstrate a rapid and efficient tool that focuses on specific genomic regions by combining a BAC library with transcriptomic data. This is the first BAC-based structural genomic study of the cellulolytic fungus T. harzianum, and its findings provide new perspectives regarding the use of this species in biomass degradation processes. PMID:25836973

  11. [Comparative analysis of variable regions in the genomes of variola virus].

    PubMed

    Babkin, I V; Nepomniashchikh, T S; Maksiutov, R A; Gutorov, V V; Babkina, I N; Shchelkunov, S N

    2008-01-01

    Nucleotide sequences of two extended segments of the terminal variable regions in variola virus genome were determined. The size of the left segment was 13.5 kbp and of the right, 10.5 kbp. Totally, over 540 kbp were sequenced for 22 variola virus strains. The conducted phylogenetic analysis and the data published earlier allowed us to find the interrelations between 70 variola virus isolates, the character of their clustering, and the degree of intergroup and intragroup variations of the clusters of variola virus strains. The most polymorphic loci of the genome segments studied were determined. It was demonstrated that that these loci are localized to either noncoding genome regions or to the regions of destroyed open reading frames, characteristic of the ancestor virus. These loci are promising for development of the strategy for genotyping variola virus strains. Analysis of recombination using various methods demonstrated that, with the only exception, no statistically significant recombinational events in the genomes of variola virus strains studied were detectable.

  12. Assessing the Genome-Wide Effect of Promoter Region Tandem Repeat Natural Variation on Gene Expression

    PubMed Central

    Elmore, Martha H.; Gibbons, John G.; Rokas, Antonis

    2012-01-01

    Copy number polymorphisms of nucleotide tandem repeat (TR) regions, such as microsatellites and minisatellites, are mutationally reversible and highly abundant in eukaryotic genomes. Studies linking TR polymorphism to phenotypic variation have led some to suggest that TR variation modulates and majorly contributes to phenotypic variation; however, studies in which the authors assess the genome-wide impact of TR variation on phenotype are lacking. To address this question, we quantified relationships between polymorphism levels in 143 genome-wide promoter region TRs across 16 isolates of the filamentous fungus Aspergillus flavus and its ecotype Aspergillus oryzae with expression levels of their downstream genes. We found that only 4.3% of relationships tested were significant; these findings were consistent with models in which TRs act as “tuning,” “volume,” or “optimality” “knobs” of phenotype but not with “switch” models. Furthermore, the promoter regions of differentially expressed genes between A. oryzae and A. flavus did not show TR enrichment, suggesting that genome-wide differences in molecular phenotype between the two species are not significantly associated with TRs. Although in some cases TR polymorphisms do contribute to transcript abundance variation, these results argue that at least in this case, TRs might not be major modulators of variation in phenotype. PMID:23275886

  13. Intra-Genomic Internal Transcribed Spacer Region Sequence Heterogeneity and Molecular Diagnosis in Clinical Microbiology

    PubMed Central

    Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K. P.; Woo, Patrick C. Y.

    2015-01-01

    Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10–49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n = 2), Pichia (Candida) norvegensis (n = 2), Candida tropicalis (n = 1) and Saccharomyces cerevisiae (n = 1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study. PMID:26506340

  14. Intra-Genomic Internal Transcribed Spacer Region Sequence Heterogeneity and Molecular Diagnosis in Clinical Microbiology.

    PubMed

    Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K P; Woo, Patrick C Y

    2015-10-22

    Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10-49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n=2), Pichia (Candida) norvegensis (n=2), Candida tropicalis (n=1) and Saccharomyces cerevisiae (n=1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study.

  15. Evaluation of Apis mellifera syriaca Levant region honeybee conservation using comparative genome hybridization.

    PubMed

    Haddad, Nizar Jamal; Batainh, Ahmed; Saini, Deepti; Migdadi, Osama; Aiyaz, Mohamed; Manchiganti, Rushiraj; Krishnamurthy, Venkatesh; Al-Shagour, Banan; Brake, Mohammad; Bourgeois, Lelania; De Guzman, Lilia; Rinderer, Thomas; Hamouri, Zayed Mahoud

    2016-06-01

    Apis mellifera syriaca is the native honeybee subspecies of Jordan and much of the Levant region. It expresses behavioral adaptations to a regional climate with very high temperatures, nectar dearth in summer, attacks of the Oriental wasp and is resistant to Varroa mites. The A. m. syriaca control reference sample (CRS) in this study was originally collected and stored since 2001 from "Wadi Ben Hammad", a remote valley in the southern region of Jordan. Morphometric and mitochondrial DNA markers of these honeybees had shown highest similarity to reference A. m. syriaca samples collected in 1952 by Brother Adam of samples collected from the Middle East. Samples 1-5 were collected from the National Center for Agricultural Research and Extension breeding apiary which was established for the conservation of A. m. syriaca. Our objective was to determine the success of an A. m. syriaca honey bee conservation program using genomic information from an array-based comparative genomic hybridization platform to evaluate genetic similarities to a historic reference collection (CRS). Our results had shown insignificant genomic differences between the current population in the conservation program and the CRS indicated that program is successfully conserving A. m. syriaca. Functional genomic variations were identified which are useful for conservation monitoring and may be useful for breeding programs designed to improve locally adapted strains of A. m. syriaca.

  16. Evaluation of Apis mellifera syriaca Levant region honeybee conservation using comparative genome hybridization.

    PubMed

    Haddad, Nizar Jamal; Batainh, Ahmed; Saini, Deepti; Migdadi, Osama; Aiyaz, Mohamed; Manchiganti, Rushiraj; Krishnamurthy, Venkatesh; Al-Shagour, Banan; Brake, Mohammad; Bourgeois, Lelania; De Guzman, Lilia; Rinderer, Thomas; Hamouri, Zayed Mahoud

    2016-06-01

    Apis mellifera syriaca is the native honeybee subspecies of Jordan and much of the Levant region. It expresses behavioral adaptations to a regional climate with very high temperatures, nectar dearth in summer, attacks of the Oriental wasp and is resistant to Varroa mites. The A. m. syriaca control reference sample (CRS) in this study was originally collected and stored since 2001 from "Wadi Ben Hammad", a remote valley in the southern region of Jordan. Morphometric and mitochondrial DNA markers of these honeybees had shown highest similarity to reference A. m. syriaca samples collected in 1952 by Brother Adam of samples collected from the Middle East. Samples 1-5 were collected from the National Center for Agricultural Research and Extension breeding apiary which was established for the conservation of A. m. syriaca. Our objective was to determine the success of an A. m. syriaca honey bee conservation program using genomic information from an array-based comparative genomic hybridization platform to evaluate genetic similarities to a historic reference collection (CRS). Our results had shown insignificant genomic differences between the current population in the conservation program and the CRS indicated that program is successfully conserving A. m. syriaca. Functional genomic variations were identified which are useful for conservation monitoring and may be useful for breeding programs designed to improve locally adapted strains of A. m. syriaca. PMID:27010806

  17. Analysis of genomic regions of Trichoderma harzianum IOC-3844 related to biomass degradation.

    PubMed

    Crucello, Aline; Sforça, Danilo Augusto; Horta, Maria Augusta Crivelente; dos Santos, Clelton Aparecido; Viana, Américo José Carvalho; Beloti, Lilian Luzia; de Toledo, Marcelo Augusto Szymanski; Vincentz, Michel; Kuroshu, Reginaldo Massanobu; de Souza, Anete Pereira

    2015-01-01

    Trichoderma harzianum IOC-3844 secretes high levels of cellulolytic-active enzymes and is therefore a promising strain for use in biotechnological applications in second-generation bioethanol production. However, the T. harzianum biomass degradation mechanism has not been well explored at the genetic level. The present work investigates six genomic regions (~150 kbp each) in this fungus that are enriched with genes related to biomass conversion. A BAC library consisting of 5,760 clones was constructed, with an average insert length of 90 kbp. The assembled BAC sequences revealed 232 predicted genes, 31.5% of which were related to catabolic pathways, including those involved in biomass degradation. An expression profile analysis based on RNA-Seq data demonstrated that putative regulatory elements, such as membrane transport proteins and transcription factors, are located in the same genomic regions as genes related to carbohydrate metabolism and exhibit similar expression profiles. Thus, we demonstrate a rapid and efficient tool that focuses on specific genomic regions by combining a BAC library with transcriptomic data. This is the first BAC-based structural genomic study of the cellulolytic fungus T. harzianum, and its findings provide new perspectives regarding the use of this species in biomass degradation processes.

  18. Genome wide signatures of positive selection: The comparison of independent samples and the identification of regions associated to traits

    PubMed Central

    Barendse, William; Harrison, Blair E; Bunch, Rowan J; Thomas, Merle B; Turner, Lex B

    2009-01-01

    Background The goal of genome wide analyses of polymorphisms is to achieve a better understanding of the link between genotype and phenotype. Part of that goal is to understand the selective forces that have operated on a population. Results In this study we compared the signals of selection, identified through population divergence in the Bovine HapMap project, to those found in an independent sample of cattle from Australia. Evidence for population differentiation across the genome, as measured by FST, was highly correlated in the two data sets. Nevertheless, 40% of the variance in FST between the two studies was attributed to the differences in breed composition. Seventy six percent of the variance in FST was attributed to differences in SNP composition and density when the same breeds were compared. The difference between FST of adjacent loci increased rapidly with the increase in distance between SNP, reaching an asymptote after 20 kb. Using 129 SNP that have highly divergent FST values in both data sets, we identified 12 regions that had additive effects on the traits residual feed intake, beef yield or intramuscular fatness measured in the Australian sample. Four of these regions had effects on more than one trait. One of these regions includes the R3HDM1 gene, which is under selection in European humans. Conclusion Firstly, many different populations will be necessary for a full description of selective signatures across the genome, not just a small set of highly divergent populations. Secondly, it is necessary to use the same SNP when comparing the signatures of selection from one study to another. Thirdly, useful signatures of selection can be obtained where many of the groups have only minor genetic differences and may not be clearly separated in a principal component analysis. Fourthly, combining analyses of genome wide selection signatures and genome wide associations to traits helps to define the trait under selection or the population group in which

  19. Characterization of transgenic mice containing adenovirus early region 3 genomic DNA.

    PubMed Central

    Fejer, G; Gyory, I; Tufariello, J; Horwitz, M S

    1994-01-01

    Human adenoviruses (Ad) contain a complex transcription region (E3) which codes for proteins that interact with several arms of the immune system. However, E3 genes are not essential for replication in tissue culture. An E3-encoded 19,000-molecular-weight (19K) glycoprotein (gp19K) binds to the class I major histocompatibility complex (MHC) in the endoplasmic reticulum and prevents MHC transport to the cell surface. Three other E3 proteins are involved in the inhibition of apoptosis by tumor necrosis factor alpha. The entire E3 genomic DNA was utilized to produce transgenic mice to study the effect of the E3 proteins on pathogenesis of various infectious agents and to investigate the in vivo synthesis and processing of the multiple E3 mRNAs and proteins. There was basal expression of the E3 promoter in the thymus, kidneys, uterus, and testes and at all levels of the gastrointestinal tract. In addition, the E3 promoter of the transgene could be activated in some other organs, including the liver, by infection of these animals with an E3-deficient Ad (Ad7001) which contains a functional E1A region. Transactivation in vivo could also be demonstrated by infusion of bacterial lipopolysaccharide. There appeared to be differential ratios of expression between several of the E3 mRNAs in transgenic lung fibroblasts and primary kidney cells cultured from the transgenic animals. This observation suggested that there was differential mRNA splicing that was organ specific. These transgenic animals should provide a useful model for studying the effects of the E3 proteins on the immune system and on diseases affected either by control of MHC or by selected functions of tumor necrosis factor that are inhibitable by Ad E3 proteins. Images PMID:8057467

  20. Estimation of Additive, Dominance, and Imprinting Genetic Variance Using Genomic Data

    PubMed Central

    Lopes, Marcos S.; Bastiaansen, John W. M.; Janss, Luc; Knol, Egbert F.; Bovenhuis, Henk

    2015-01-01

    Traditionally, exploration of genetic variance in humans, plants, and livestock species has been limited mostly to the use of additive effects estimated using pedigree data. However, with the development of dense panels of single-nucleotide polymorphisms (SNPs), the exploration of genetic variation of complex traits is moving from quantifying the resemblance between family members to the dissection of genetic variation at individual loci. With SNPs, we were able to quantify the contribution of additive, dominance, and imprinting variance to the total genetic variance by using a SNP regression method. The method was validated in simulated data and applied to three traits (number of teats, backfat, and lifetime daily gain) in three purebred pig populations. In simulated data, the estimates of additive, dominance, and imprinting variance were very close to the simulated values. In real data, dominance effects account for a substantial proportion of the total genetic variance (up to 44%) for these traits in these populations. The contribution of imprinting to the total phenotypic variance of the evaluated traits was relatively small (1–3%). Our results indicate a strong relationship between additive variance explained per chromosome and chromosome length, which has been described previously for other traits in other species. We also show that a similar linear relationship exists for dominance and imprinting variance. These novel results improve our understanding of the genetic architecture of the evaluated traits and shows promise to apply the SNP regression method to other traits and species, including human diseases. PMID:26438289

  1. Organic Matter and Water Addition Enhance Soil Respiration in an Arid Region

    PubMed Central

    Lai, Liming; Wang, Jianjian; Tian, Yuan; Zhao, Xuechun; Jiang, Lianhe; Chen, Xi; Gao, Yong; Wang, Shaoming; Zheng, Yuanrun

    2013-01-01

    Climate change is generally predicted to increase net primary production, which could lead to additional C input to soil. In arid central Asia, precipitation has increased and is predicted to increase further. To assess the combined effects of these changes on soil CO2 efflux in arid land, a two factorial manipulation experiment in the shrubland of an arid region in northwest China was conducted. The experiment used a nested design with fresh organic matter and water as the two controlled parameters. It was found that both fresh organic matter and water enhanced soil respiration, and there was a synergistic effect of these two treatments on soil respiration increase. Water addition not only enhanced soil C emission, but also regulated soil C sequestration by fresh organic matter addition. The results indicated that the soil CO2 flux of the shrubland is likely to increase with climate change, and precipitation played a dominant role in regulating soil C balance in the shrubland of an arid region. PMID:24204907

  2. Characterization of promoter region and genomic structure of the murine and human genes encoding Src like adapter protein.

    PubMed

    Kratchmarova, I; Sosinowski, T; Weiss, A; Witter, K; Vincenz, C; Pandey, A

    2001-01-10

    Src-like adapter protein (SLAP) was identified as a signaling molecule in a yeast two-hybrid system using the cytoplasmic domain of EphA2, a receptor protein tyrosine kinase (Pandey et al., 1995. Characterization of a novel Src-like adapter protein that associates with the Eck receptor tyrosine kinase. J. Biol. Chem. 270, 19201-19204). It is very similar to members of the Src family of cytoplasmic tyrosine kinases in that it contains very homologous SH3 and SH2 domains (Abram and Courtneidge, 2000. Src family tyrosine kinases and growth factor signaling. Exp. Cell. Res. 254, 1-13.). However, instead of a kinase domain at the C-terminus, it contains a unique C-terminal region. In order to exclude the possibility that an alternative form exists, we have isolated genomic clones containing the murine Slap gene as well as the human SLA gene. The coding regions of murine Slap and human SLA genes contain seven exons and six introns. Absence of any kinase domain in the genomic region confirm its designation as an adapter protein. Additionally, we have cloned and sequenced approximately 2.6 kb of the region 5' to the initiator methionine of the murine Slap gene. When subcloned upstream of a luciferase gene, this fragment increased the transcriptional activity about 6-fold in a human Jurkat T cell line and approximately 52-fold in a murine T cell line indicating that this region contains promoter elements that dictate SLAP expression. We have also cloned the promoter region of the human SLA gene. Since SLAP is transcriptionally regulated by retinoic acid and by activation of B cells, the cloning of its promoter region will permit a detailed analysis of the elements required for its transcriptional regulation.

  3. Information compression exploits patterns of genome composition to discriminate populations and highlight regions of evolutionary interest

    PubMed Central

    2014-01-01

    Background Genomic information allows population relatedness to be inferred and selected genes to be identified. Single nucleotide polymorphism microarray (SNP-chip) data, a proxy for genome composition, contains patterns in allele order and proportion. These patterns can be quantified by compression efficiency (CE). In principle, the composition of an entire genome can be represented by a CE number quantifying allele representation and order. Results We applied a compression algorithm (DEFLATE) to genome-wide high-density SNP data from 4,155 human, 1,800 cattle, 1,222 sheep, 81 dogs and 49 mice samples. All human ethnic groups can be clustered by CE and the clusters recover phylogeography based on traditional fixation index (FST) analyses. CE analysis of other mammals results in segregation by breed or species, and is sensitive to admixture and past effective population size. This clustering is a consequence of individual patterns such as runs of homozygosity. Intriguingly, a related approach can also be used to identify genomic loci that show population-specific CE segregation. A high resolution CE ‘sliding window’ scan across the human genome, organised at the population level, revealed genes known to be under evolutionary pressure. These include SLC24A5 (European and Gujarati Indian skin pigmentation), HERC2 (European eye color), LCT (European and Maasai milk digestion) and EDAR (Asian hair thickness). We also identified a set of previously unidentified loci with high population-specific CE scores including the chromatin remodeler SCMH1 in Africans and EDA2R in Asians. Closer inspection reveals that these prioritised genomic regions do not correspond to simple runs of homozygosity but rather compositionally complex regions that are shared by many individuals of a given population. Unlike FST, CE analyses do not require ab initio population comparisons and are amenable to the hemizygous X chromosome. Conclusions We conclude with a discussion of the

  4. A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data.

    PubMed

    Lu, Qiongshi; Hu, Yiming; Sun, Jiehuan; Cheng, Yuwei; Cheung, Kei-Hoi; Zhao, Hongyu

    2015-05-27

    Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu.

  5. Transferability of regional permafrost disturbance susceptibility modelling using generalized linear and generalized additive models

    NASA Astrophysics Data System (ADS)

    Rudy, Ashley C. A.; Lamoureux, Scott F.; Treitz, Paul; van Ewijk, Karin Y.

    2016-07-01

    To effectively assess and mitigate risk of permafrost disturbance, disturbance-prone areas can be predicted through the application of susceptibility models. In this study we developed regional susceptibility models for permafrost disturbances using a field disturbance inventory to test the transferability of the model to a broader region in the Canadian High Arctic. Resulting maps of susceptibility were then used to explore the effect of terrain variables on the occurrence of disturbances within this region. To account for a large range of landscape characteristics, the model was calibrated using two locations: Sabine Peninsula, Melville Island, NU, and Fosheim Peninsula, Ellesmere Island, NU. Spatial patterns of disturbance were predicted with a generalized linear model (GLM) and generalized additive model (GAM), each calibrated using disturbed and randomized undisturbed locations from both locations and GIS-derived terrain predictor variables including slope, potential incoming solar radiation, wetness index, topographic position index, elevation, and distance to water. Each model was validated for the Sabine and Fosheim Peninsulas using independent data sets while the transferability of the model to an independent site was assessed at Cape Bounty, Melville Island, NU. The regional GLM and GAM validated well for both calibration sites (Sabine and Fosheim) with the area under the receiver operating curves (AUROC) > 0.79. Both models were applied directly to Cape Bounty without calibration and validated equally with AUROC's of 0.76; however, each model predicted disturbed and undisturbed samples differently. Additionally, the sensitivity of the transferred model was assessed using data sets with different sample sizes. Results indicated that models based on larger sample sizes transferred more consistently and captured the variability within the terrain attributes in the respective study areas. Terrain attributes associated with the initiation of disturbances were

  6. Tumorigenic poxviruses: genomic organization and DNA sequence of the telomeric region of the Shope fibroma virus genome.

    PubMed

    Upton, C; DeLange, A M; McFadden, G

    1987-09-01

    Shope fibroma virus (SFV), a tumorigenic poxvirus, has a 160-kb linear double-stranded DNA genome and possesses terminal inverted repeats (TIRs) of 12.4 kb. The DNA sequence of the terminal 5.5 kb of the viral genome is presented and together with previously published sequences completes the entire sequence of the SFV TIR. The terminal 400-bp region contains no major open reading frames (ORFs) but does possess five related imperfect palindromes. The remaining 5.1 kb of the sequence contains seven tightly clustered and tandemly oriented ORFs, four larger than 100 amino acids in length (T1, T2, T4, and T5) and three smaller ORFs (T3A, T3B, and T3C). All are transcribed toward the viral hairpin and almost all possess the consensus sequence TTTTTNT near their 3' ends which has been implicated for the transcription termination of vaccinia virus early genes. Searches of the published DNA database revealed no sequences with significant homology with this region of the SFV genome but when the protein database was searched with the translation products of ORFs T1-T5 it was found that the N-terminus of the putative T4 polypeptide is closely related to the signal sequence of the hemagglutinin precursor from influenza A virus, suggesting that the T4 polypeptide may be secreted from SFV-infected cells. Examination of other SFV ORFs shows that T1 and T2 also possess signal-like hydrophobic amino acid stretches close to their N-termini. The protein database search also revealed that the putative T2 protein has significant homology to the insulin family of polypeptides. In terms of sequence repetitions, seven tandemly repeated copies of the hexanucleotide ATTGTT and three flanking regions of dyad symmetry were detected, all in ORF T3C. A search for palindromic sequences also revealed two clusters, one in ORF T3A/B and a second in ORF T2. ORF T2 harbors five short sequence domains, each of which consists of a 6-bp short palindrome and a 10- to 18-bp larger palindrome. The

  7. Parametric and Nonparametric Statistical Methods for Genomic Selection of Traits with Additive and Epistatic Genetic Architectures

    PubMed Central

    Howard, Réka; Carriquiry, Alicia L.; Beavis, William D.

    2014-01-01

    Parametric and nonparametric methods have been developed for purposes of predicting phenotypes. These methods are based on retrospective analyses of empirical data consisting of genotypic and phenotypic scores. Recent reports have indicated that parametric methods are unable to predict phenotypes of traits with known epistatic genetic architectures. Herein, we review parametric methods including least squares regression, ridge regression, Bayesian ridge regression, least absolute shrinkage and selection operator (LASSO), Bayesian LASSO, best linear unbiased prediction (BLUP), Bayes A, Bayes B, Bayes C, and Bayes Cπ. We also review nonparametric methods including Nadaraya-Watson estimator, reproducing kernel Hilbert space, support vector machine regression, and neural networks. We assess the relative merits of these 14 methods in terms of accuracy and mean squared error (MSE) using simulated genetic architectures consisting of completely additive or two-way epistatic interactions in an F2 population derived from crosses of inbred lines. Each simulated genetic architecture explained either 30% or 70% of the phenotypic variability. The greatest impact on estimates of accuracy and MSE was due to genetic architecture. Parametric methods were unable to predict phenotypic values when the underlying genetic architecture was based entirely on epistasis. Parametric methods were slightly better than nonparametric methods for additive genetic architectures. Distinctions among parametric methods for additive genetic architectures were incremental. Heritability, i.e., proportion of phenotypic variability, had the second greatest impact on estimates of accuracy and MSE. PMID:24727289

  8. The Evolution of Sex Ratio Distorter Suppression Affects a 25 cM Genomic Region in the Butterfly Hypolimnas bolina

    PubMed Central

    Hornett, Emily A.; Moran, Bruce; Reynolds, Louise A.; Charlat, Sylvain; Tazzyman, Samuel; Wedell, Nina; Jiggins, Chris D.; Hurst, Greg D. D.

    2014-01-01

    Symbionts that distort their host's sex ratio by favouring the production and survival of females are common in arthropods. Their presence produces intense Fisherian selection to return the sex ratio to parity, typified by the rapid spread of host ‘suppressor’ loci that restore male survival/development. In this study, we investigated the genomic impact of a selective event of this kind in the butterfly Hypolimnas bolina. Through linkage mapping, we first identified a genomic region that was necessary for males to survive Wolbachia-induced male-killing. We then investigated the genomic impact of the rapid spread of suppression, which converted the Samoan population of this butterfly from a 100∶1 female-biased sex ratio in 2001 to a 1∶1 sex ratio by 2006. Models of this process revealed the potential for a chromosome-wide effect. To measure the impact of this episode of selection directly, the pattern of genetic variation before and after the spread of suppression was compared. Changes in allele frequencies were observed over a 25 cM region surrounding the suppressor locus, with a reduction in overall diversity observed at loci that co-segregate with the suppressor. These changes exceeded those expected from drift and occurred alongside the generation of linkage disequilibrium. The presence of novel allelic variants in 2006 suggests that the suppressor was likely to have been introduced via immigration rather than through de novo mutation. In addition, further sampling in 2010 indicated that many of the introduced variants were lost or had declined in frequency since 2006. We hypothesize that this loss may have resulted from a period of purifying selection, removing deleterious material that introgressed during the initial sweep. Our observations of the impact of suppression of sex ratio distorting activity reveal a very wide genomic imprint, reflecting its status as one of the strongest selective forces in nature. PMID:25474676

  9. The evolution of sex ratio distorter suppression affects a 25 cM genomic region in the butterfly Hypolimnas bolina.

    PubMed

    Hornett, Emily A; Moran, Bruce; Reynolds, Louise A; Charlat, Sylvain; Tazzyman, Samuel; Wedell, Nina; Jiggins, Chris D; Hurst, Greg D D

    2014-12-01

    Symbionts that distort their host's sex ratio by favouring the production and survival of females are common in arthropods. Their presence produces intense Fisherian selection to return the sex ratio to parity, typified by the rapid spread of host 'suppressor' loci that restore male survival/development. In this study, we investigated the genomic impact of a selective event of this kind in the butterfly Hypolimnas bolina. Through linkage mapping, we first identified a genomic region that was necessary for males to survive Wolbachia-induced male-killing. We then investigated the genomic impact of the rapid spread of suppression, which converted the Samoan population of this butterfly from a 100:1 female-biased sex ratio in 2001 to a 1:1 sex ratio by 2006. Models of this process revealed the potential for a chromosome-wide effect. To measure the impact of this episode of selection directly, the pattern of genetic variation before and after the spread of suppression was compared. Changes in allele frequencies were observed over a 25 cM region surrounding the suppressor locus, with a reduction in overall diversity observed at loci that co-segregate with the suppressor. These changes exceeded those expected from drift and occurred alongside the generation of linkage disequilibrium. The presence of novel allelic variants in 2006 suggests that the suppressor was likely to have been introduced via immigration rather than through de novo mutation. In addition, further sampling in 2010 indicated that many of the introduced variants were lost or had declined in frequency since 2006. We hypothesize that this loss may have resulted from a period of purifying selection, removing deleterious material that introgressed during the initial sweep. Our observations of the impact of suppression of sex ratio distorting activity reveal a very wide genomic imprint, reflecting its status as one of the strongest selective forces in nature. PMID:25474676

  10. Matrix attachment regions and structural colinearity in the genomes of two grass species.

    PubMed Central

    Avramova, Z; Tikhonov, A; Chen, M; Bennetzen, J L

    1998-01-01

    In order to gain insights into the relationship between spatial organization of the genome and genome function we have initiated studies of the co-linear Sh2/A1- homologous regions of rice (30 kb) and sorghum (50 kb). We have identified the locations of matrix attachment regions (MARs) in these homologous chromosome segments, which could serve as anchors for individual structural units or loops. Despite the fact that the nucleotide sequences serving as MARs were not detectably conserved, the general organizational patterns of MARs relative to the neighboring genes were preserved. All identified genes were placed in individual loops that were of comparable size for homologous genes. Hence, gene composition, gene orientation, gene order and the placement of genes into structural units has been evolutionarily conserved in this region. Our analysis demonstrated that the occurrence of various 'MAR motifs' is not indicative of MAR location. However, most of the MARs discovered in the two genomic regions were found to co-localize with miniature inverted repeat transposable elements (MITEs), suggesting that MITEs preferentially insert near MARs and/or that they can serve as MARs. PMID:9443968

  11. The Rhodomonas salina mitochondrial genome: bacteria-like operons, compact gene arrangement and complex repeat region.

    PubMed

    Hauth, Amy M; Maier, Uwe G; Lang, B Franz; Burger, Gertraud

    2005-01-01

    To gain insight into the mitochondrial genome structure and gene content of a putatively ancestral group of eukaryotes, the cryptophytes, we sequenced the complete mitochondrial DNA of Rhodomonas salina. The 48 063 bp circular-mapping molecule codes for 2 rRNAs, 27 tRNAs and 40 proteins including 23 components of oxidative phosphorylation, 15 ribosomal proteins and two subunits of tat translocase. One potential protein (ORF161) is without assigned function. Only two introns occur in the genome; both are present within cox1 belong to group II and contain RT open reading frames. Primitive genome features include bacteria-like rRNAs and tRNAs, ribosomal protein genes organized in large clusters resembling bacterial operons and the presence of the otherwise rare genes such as rps1 and tatA. The highly compact gene organization contrasts with the presence of a 4.7 kb long, repeat-containing intergenic region. Repeat motifs approximately 40-700 bp long occur up to 31 times, forming a complex repeat structure. Tandem repeats are the major arrangement but the region also includes a large, approximately 3 kb, inverted repeat and several potentially stable approximately 40-80 bp long hairpin structures. We provide evidence that the large repeat region is involved in replication and transcription initiation, predict a promoter motif that occurs in three locations and discuss two likely scenarios of how this highly structured repeat region might have evolved.

  12. Read clouds uncover variation in complex regions of the human genome

    PubMed Central

    Bishara, Alex; Liu, Yuling; Weng, Ziming; Kashef-Haghighi, Dorna; Newburger, Daniel E.; West, Robert; Sidow, Arend; Batzoglou, Serafim

    2015-01-01

    Although an increasing amount of human genetic variation is being identified and recorded, determining variants within repeated sequences of the human genome remains a challenge. Most population and genome-wide association studies have therefore been unable to consider variation in these regions. Core to the problem is the lack of a sequencing technology that produces reads with sufficient length and accuracy to enable unique mapping. Here, we present a novel methodology of using read clouds, obtained by accurate short-read sequencing of DNA derived from long fragment libraries, to confidently align short reads within repeat regions and enable accurate variant discovery. Our novel algorithm, Random Field Aligner (RFA), captures the relationships among the short reads governed by the long read process via a Markov Random Field. We utilized a modified version of the Illumina TruSeq synthetic long-read protocol, which yielded shallow-sequenced read clouds. We test RFA through extensive simulations and apply it to discover variants on the NA12878 human sample, for which shallow TruSeq read cloud sequencing data are available, and on an invasive breast carcinoma genome that we sequenced using the same method. We demonstrate that RFA facilitates accurate recovery of variation in 155 Mb of the human genome, including 94% of 67 Mb of segmental duplication sequence and 96% of 11 Mb of transcribed sequence, that are currently hidden from short-read technologies. PMID:26286554

  13. Read clouds uncover variation in complex regions of the human genome.

    PubMed

    Bishara, Alex; Liu, Yuling; Weng, Ziming; Kashef-Haghighi, Dorna; Newburger, Daniel E; West, Robert; Sidow, Arend; Batzoglou, Serafim

    2015-10-01

    Although an increasing amount of human genetic variation is being identified and recorded, determining variants within repeated sequences of the human genome remains a challenge. Most population and genome-wide association studies have therefore been unable to consider variation in these regions. Core to the problem is the lack of a sequencing technology that produces reads with sufficient length and accuracy to enable unique mapping. Here, we present a novel methodology of using read clouds, obtained by accurate short-read sequencing of DNA derived from long fragment libraries, to confidently align short reads within repeat regions and enable accurate variant discovery. Our novel algorithm, Random Field Aligner (RFA), captures the relationships among the short reads governed by the long read process via a Markov Random Field. We utilized a modified version of the Illumina TruSeq synthetic long-read protocol, which yielded shallow-sequenced read clouds. We test RFA through extensive simulations and apply it to discover variants on the NA12878 human sample, for which shallow TruSeq read cloud sequencing data are available, and on an invasive breast carcinoma genome that we sequenced using the same method. We demonstrate that RFA facilitates accurate recovery of variation in 155 Mb of the human genome, including 94% of 67 Mb of segmental duplication sequence and 96% of 11 Mb of transcribed sequence, that are currently hidden from short-read technologies.

  14. Small tumor virus genomes are integrated near nuclear matrix attachment regions in transformed cells.

    PubMed

    Shera, K A; Shera, C A; McDougall, J K

    2001-12-01

    More than 15% of human cancers have a viral etiology. In benign lesions induced by the small DNA tumor viruses, viral genomes are typically maintained extrachromosomally. Malignant progression is often associated with viral integration into host cell chromatin. To study the role of viral integration in tumorigenesis, we analyzed the positions of integrated viral genomes in tumors and tumor cell lines induced by the small oncogenic viruses, including the high-risk human papillomaviruses, hepatitis B virus, simian virus 40, and human T-cell leukemia virus type 1. We show that viral integrations in tumor cells lie near cellular sequences identified as nuclear matrix attachment regions (MARs), while integrations in nonneoplastic cells show no significant correlation with these regions. In mammalian cells, the nuclear matrix functions in gene expression and DNA replication. MARs play varied but poorly understood roles in eukaryotic gene expression. Our results suggest that integrated tumor virus genomes are subject to MAR-mediated transcriptional regulation, providing insight into mechanisms of viral carcinogenesis. Furthermore, the viral oncoproteins serve as invaluable tools for the study of mechanisms controlling cellular growth. Similarly, our demonstration that integrated viral genomes may be subject to MAR-mediated transcriptional effects should facilitate elucidation of fundamental mechanisms regulating eukaryotic gene expression.

  15. A 5'-proximal region of the Citrus tristeza virus genome encoding two leader proteases is involved in virus superinfection exclusion.

    PubMed

    Atallah, Osama O; Kang, Sung-Hwan; El-Mohtar, Choaa A; Shilts, Turksen; Bergua, María; Folimonova, Svetlana Y

    2016-02-01

    Superinfection exclusion (SIE), a phenomenon in which a primary virus infection prevents a secondary infection with the same or closely related virus, has been observed with various viruses. Earlier we demonstrated that SIE by Citrus tristeza virus (CTV) requires viral p33 protein. In this work we show that p33 alone is not sufficient for virus exclusion. To define the additional viral components that are involved in this phenomenon, we engineered a hybrid virus in which a 5'-proximal region in the genome of the T36 isolate containing coding sequences for the two leader proteases L1 and L2 has been substituted with a corresponding region from the genome of a heterologous T68-1 isolate. Sequential inoculation of plants pre-infected with the CTV L1L2T68 hybrid with T36 CTV resulted in superinfection with the challenge virus, which indicated that the substitution of the L1-L2 coding region affected SIE ability of the virus.

  16. A 5'-proximal region of the Citrus tristeza virus genome encoding two leader proteases is involved in virus superinfection exclusion.

    PubMed

    Atallah, Osama O; Kang, Sung-Hwan; El-Mohtar, Choaa A; Shilts, Turksen; Bergua, María; Folimonova, Svetlana Y

    2016-02-01

    Superinfection exclusion (SIE), a phenomenon in which a primary virus infection prevents a secondary infection with the same or closely related virus, has been observed with various viruses. Earlier we demonstrated that SIE by Citrus tristeza virus (CTV) requires viral p33 protein. In this work we show that p33 alone is not sufficient for virus exclusion. To define the additional viral components that are involved in this phenomenon, we engineered a hybrid virus in which a 5'-proximal region in the genome of the T36 isolate containing coding sequences for the two leader proteases L1 and L2 has been substituted with a corresponding region from the genome of a heterologous T68-1 isolate. Sequential inoculation of plants pre-infected with the CTV L1L2T68 hybrid with T36 CTV resulted in superinfection with the challenge virus, which indicated that the substitution of the L1-L2 coding region affected SIE ability of the virus. PMID:26748332

  17. cDNA sequence, genomic organization, and evolutionary conservation of a novel gene from the WAGR region

    SciTech Connect

    Schwartz, F.; Eisenman, R.; Knoll, J.; Bruns, G.

    1995-09-20

    A new gene (239FB) with predominant and differential expression in fetal brain has recently been isolated from a chromosome 11p13-p14 boundary area near FSHB. The corresponding mRNA has an open reading frame of 294 amino acids, a 3` untranslated region of 1247 nucleotides, and a highly GC-rich 5` untranslated region. The coding and 3` UT sequence is specified by 6 exons within nearly 87 kb of isolated genomic locus. The 5` end region of the transcript maps adjacent to the only genomically defined CpG island in a chromosomal subregion that may be associated with part of the mental retardation of some WAGR (Wilms tumor, aniridia, genitourinary anomalies, and mental retardation) syndrome patients. In addition to nucleotide and amino acid similarity to an EST from a normalized infant brain cDNA library, the predicted protein has extensive similarity to Caenorhbditis elegans polypeptides of, as yet, unknown function. The 239FB locus is, therefore, likely part of a family of genes with two members expressed in human brain. The extensive conservation of the predicted protein suggests a fundamental function of the gene product and will enable evaluation of the role of the 239FB gene in neurogenesis in model organisms. 48 refs., 4 figs., 1 tab.

  18. Two Genomic Regions Contribute Disproportionately to Geographic Differentiation in Wild Barley

    PubMed Central

    Fang, Zhou; Gonzales, Ana M.; Clegg, Michael T.; Smith, Kevin P.; Muehlbauer, Gary J.; Steffenson, Brian J.; Morrell, Peter L.

    2014-01-01

    Genetic differentiation in natural populations is driven by geographic distance and by ecological or physical features within and between natural habitats that reduce migration. The primary population structure in wild barley differentiates populations east and west of the Zagros Mountains. Genetic differentiation between eastern and western populations is uneven across the genome and is greatest on linkage groups 2H and 5H. Genetic markers in these two regions demonstrate the largest difference in frequency between the primary populations and have the highest informativeness for assignment to each population. Previous cytological and genetic studies suggest there are chromosomal structural rearrangements (inversions or translocations) in these genomic regions. Environmental association analyses identified an association with both temperature and precipitation variables on 2H and with precipitation variables on 5H. PMID:24760390

  19. Two genomic regions contribute disproportionately to geographic differentiation in wild barley.

    PubMed

    Fang, Zhou; Gonzales, Ana M; Clegg, Michael T; Smith, Kevin P; Muehlbauer, Gary J; Steffenson, Brian J; Morrell, Peter L

    2014-07-01

    Genetic differentiation in natural populations is driven by geographic distance and by ecological or physical features within and between natural habitats that reduce migration. The primary population structure in wild barley differentiates populations east and west of the Zagros Mountains. Genetic differentiation between eastern and western populations is uneven across the genome and is greatest on linkage groups 2H and 5H. Genetic markers in these two regions demonstrate the largest difference in frequency between the primary populations and have the highest informativeness for assignment to each population. Previous cytological and genetic studies suggest there are chromosomal structural rearrangements (inversions or translocations) in these genomic regions. Environmental association analyses identified an association with both temperature and precipitation variables on 2H and with precipitation variables on 5H.

  20. New insights into the origin of the B genome of hexaploid wheat: Evolutionary relationships at the SPA genomic region with the S genome of the diploid relative Aegilops speltoides

    PubMed Central

    Salse, Jérome; Chagué, Véronique; Bolot, Stéphanie; Magdelenat, Ghislaine; Huneau, Cécile; Pont, Caroline; Belcram, Harry; Couloux, Arnaud; Gardais, Soazic; Evrard, Aurélie; Segurens, Béatrice; Charles, Mathieu; Ravel, Catherine; Samain, Sylvie; Charmet, Gilles; Boudet, Nathalie; Chalhoub, Boulos

    2008-01-01

    Background Several studies suggested that the diploid ancestor of the B genome of tetraploid and hexaploid wheat species belongs to the Sitopsis section, having Aegilops speltoides (SS, 2n = 14) as the closest identified relative. However molecular relationships based on genomic sequence comparison, including both coding and non-coding DNA, have never been investigated. In an attempt to clarify these relationships, we compared, in this study, sequences of the Storage Protein Activator (SPA) locus region of the S genome of Ae. speltoides (2n = 14) to that of the A, B and D genomes co-resident in the hexaploid wheat species (Triticum aestivum, AABBDD, 2n = 42). Results Four BAC clones, spanning the SPA locus of respectively the A, B, D and S genomes, were isolated and sequenced. Orthologous genomic regions were identified as delimited by shared non-transposable elements and non-coding sequences surrounding the SPA gene and correspond to 35 268, 22 739, 43 397 and 53 919 bp for the A, B, D and S genomes, respectively. Sequence length discrepancies within and outside the SPA orthologous regions are the result of non-shared transposable elements (TE) insertions, all of which inserted after the progenitors of the four genomes divergence. Conclusion On the basis of conserved sequence length as well as identity of the shared non-TE regions and the SPA coding sequence, Ae speltoides appears to be more evolutionary related to the B genome of T. aestivum than the A and D genomes. However, the differential insertions of TEs, none of which are conserved between the two genomes led to the conclusion that the S genome of Ae. speltoides has diverged very early from the progenitor of the B genome which remains to be identified. PMID:19032732

  1. Genome-wide methylation analysis identified sexually dimorphic methylated regions in hybrid tilapia

    PubMed Central

    Wan, Zi Yi; Xia, Jun Hong; Lin, Grace; Wang, Le; Lin, Valerie C. L.; Yue, Gen Hua

    2016-01-01

    Sexual dimorphism is an interesting biological phenomenon. Previous studies showed that DNA methylation might play a role in sexual dimorphism. However, the overall picture of the genome-wide methylation landscape in sexually dimorphic species remains unclear. We analyzed the DNA methylation landscape and transcriptome in hybrid tilapia (Oreochromis spp.) using whole genome bisulfite sequencing (WGBS) and RNA-sequencing (RNA-seq). We found 4,757 sexually dimorphic differentially methylated regions (DMRs), with significant clusters of DMRs located on chromosomal regions associated with sex determination. CpG methylation in promoter regions was negatively correlated with the gene expression level. MAPK/ERK pathway was upregulated in male tilapia. We also inferred active cis-regulatory regions (ACRs) in skeletal muscle tissues from WGBS datasets, revealing sexually dimorphic cis-regulatory regions. These results suggest that DNA methylation contribute to sex-specific phenotypes and serve as resources for further investigation to analyze the functions of these regions and their contributions towards sexual dimorphisms. PMID:27782217

  2. The structure of the Morganella morganii lipopolysaccharide core region and identification of its genomic loci.

    PubMed

    Vinogradov, Evgeny; Nash, John H E; Foote, Simon; Young, N Martin

    2015-01-30

    The core region of the lipopolysaccharide of Morganella morganii serotype O:1ab was obtained by hydrolysis of the LPS and studied by 2D NMR, ESI MS, and chemical methods. Its structure was highly homologous to those from the two major members of the same Proteeae tribe, Proteus mirabilis and Providencia alcalifaciens, and analysis of the M. morganii genome disclosed that the loci for its outer core, lipid A and Ara4N moieties are similarly conserved.

  3. Generation of Recombinant Polioviruses Harboring RNA Affinity Tags in the 5′ and 3′ Noncoding Regions of Genomic RNAs

    PubMed Central

    Flather, Dylan; Cathcart, Andrea L.; Cruz, Casey; Baggs, Eric; Ngo, Tuan; Gershon, Paul D.; Semler, Bert L.

    2016-01-01

    Despite being intensely studied for more than 50 years, a complete understanding of the enterovirus replication cycle remains elusive. Specifically, only a handful of cellular proteins have been shown to be involved in the RNA replication cycle of these viruses. In an effort to isolate and identify additional cellular proteins that function in enteroviral RNA replication, we have generated multiple recombinant polioviruses containing RNA affinity tags within the 3′ or 5′ noncoding region of the genome. These recombinant viruses retained RNA affinity sequences within the genome while remaining viable and infectious over multiple passages in cell culture. Further characterization of these viruses demonstrated that viral protein production and growth kinetics were unchanged or only slightly altered relative to wild type poliovirus. However, attempts to isolate these genetically-tagged viral genomes from infected cells have been hindered by high levels of co-purification of nonspecific proteins and the limited matrix-binding efficiency of RNA affinity sequences. Regardless, these recombinant viruses represent a step toward more thorough characterization of enterovirus ribonucleoprotein complexes involved in RNA replication. PMID:26861382

  4. Genome-Wide Association of Bipolar Disorder Suggests an Enrichment of Replicable Associations in Regions near Genes

    PubMed Central

    Smith, Erin N.; Koller, Daniel L.; Panganiban, Corrie; Szelinger, Szabolcs; Zhang, Peng; Badner, Judith A.; Barrett, Thomas B.; Berrettini, Wade H.; Bloss, Cinnamon S.; Byerley, William; Coryell, William; Edenberg, Howard J.; Foroud, Tatiana; Gershon, Elliot S.; Greenwood, Tiffany A.; Guo, Yiran; Hipolito, Maria; Keating, Brendan J.; Lawson, William B.; Liu, Chunyu; Mahon, Pamela B.; McInnis, Melvin G.; McMahon, Francis J.; McKinney, Rebecca; Murray, Sarah S.; Nievergelt, Caroline M.; Nurnberger, John I.; Nwulia, Evaristus A.; Potash, James B.; Rice, John; Schulze, Thomas G.; Scheftner, William A.; Shilling, Paul D.; Zandi, Peter P.; Zöllner, Sebastian; Craig, David W.; Schork, Nicholas J.; Kelsoe, John R.

    2011-01-01

    Although a highly heritable and disabling disease, bipolar disorder's (BD) genetic variants have been challenging to identify. We present new genotype data for 1,190 cases and 401 controls and perform a genome-wide association study including additional samples for a total of 2,191 cases and 1,434 controls. We do not detect genome-wide significant associations for individual loci; however, across all SNPs, we show an association between the power to detect effects calculated from a previous genome-wide association study and evidence for replication (P = 1.5×10−7). To demonstrate that this result is not likely to be a false positive, we analyze replication rates in a large meta-analysis of height and show that, in a large enough study, associations replicate as a function of power, approaching a linear relationship. Within BD, SNPs near exons exhibit a greater probability of replication, supporting an enrichment of reproducible associations near functional regions of genes. These results indicate that there is likely common genetic variation associated with BD near exons (±10 kb) that could be identified in larger studies and, further, provide a framework for assessing the potential for replication when combining results from multiple studies. PMID:21738484

  5. Comparison of Exome and Genome Sequencing Technologies for the Complete Capture of Protein‐Coding Regions

    PubMed Central

    Lelieveld, Stefan H.; Spielmann, Malte; Mundlos, Stefan; Veltman, Joris A.

    2015-01-01

    ABSTRACT For next‐generation sequencing technologies, sufficient base‐pair coverage is the foremost requirement for the reliable detection of genomic variants. We investigated whether whole‐genome sequencing (WGS) platforms offer improved coverage of coding regions compared with whole‐exome sequencing (WES) platforms, and compared single‐base coverage for a large set of exome and genome samples. We find that WES platforms have improved considerably in the last years, but at comparable sequencing depth, WGS outperforms WES in terms of covered coding regions. At higher sequencing depth (95x–160x), WES successfully captures 95% of the coding regions with a minimal coverage of 20x, compared with 98% for WGS at 87‐fold coverage. Three different assessments of sequence coverage bias showed consistent biases for WES but not for WGS. We found no clear differences for the technologies concerning their ability to achieve complete coverage of 2,759 clinically relevant genes. We show that WES performs comparable to WGS in terms of covered bases if sequenced at two to three times higher coverage. This does, however, go at the cost of substantially more sequencing biases in WES approaches. Our findings will guide laboratories to make an informed decision on which sequencing platform and coverage to choose. PMID:25973577

  6. Genomic Analysis Reveals Novel Diversity among the 1976 Philadelphia Legionnaires’ Disease Outbreak Isolates and Additional ST36 Strains

    PubMed Central

    Mercante, Jeffrey W.; Morrison, Shatavia S.; Desai, Heta P.; Raphael, Brian H.; Winchell, Jonas M.

    2016-01-01

    Legionella pneumophila was first recognized as a cause of severe and potentially fatal pneumonia during a large-scale outbreak of Legionnaires’ disease (LD) at a Pennsylvania veterans’ convention in Philadelphia, 1976. The ensuing investigation and recovery of four clinical isolates launched the fields of Legionella epidemiology and scientific research. Only one of the original isolates, “Philadelphia-1”, has been widely distributed or extensively studied. Here we describe the whole-genome sequencing (WGS), complete assembly, and comparative analysis of all Philadelphia LD strains recovered from that investigation, along with L. pneumophila isolates sharing the Philadelphia sequence type (ST36). Analyses revealed that the 1976 outbreak was due to multiple serogroup 1 strains within the same genetic lineage, differentiated by an actively mobilized, self-replicating episome that is shared with L. pneumophila str. Paris, and two large, horizontally-transferred genomic loci, among other polymorphisms. We also found a completely unassociated ST36 strain that displayed remarkable genetic similarity to the historical Philadelphia isolates. This similar strain implies the presence of a potential clonal population, and suggests important implications may exist for considering epidemiological context when interpreting phylogenetic relationships among outbreak-associated isolates. Additional extensive archival research identified the Philadelphia isolate associated with a non-Legionnaire case of “Broad Street pneumonia”, and provided new historical and genetic insights into the 1976 epidemic. This retrospective analysis has underscored the utility of fully-assembled WGS data for Legionella outbreak investigations, highlighting the increased resolution that comes from long-read sequencing and a sequence type-matched genomic data set. PMID:27684472

  7. Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping

    PubMed Central

    U'Ren, Jana M; Schupp, James M; Pearson, Talima; Hornstra, Heidie; Friedman, Christine L Clark; Smith, Kimothy L; Daugherty, Rebecca R Leadem; Rhoton, Shane D; Leadem, Ben; Georgia, Shalamar; Cardon, Michelle; Huynh, Lynn Y; DeShazer, David; Harvey, Steven P; Robison, Richard; Gal, Daniel; Mayo, Mark J; Wagner, David; Currie, Bart J; Keim, Paul

    2007-01-01

    Background The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. Results B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B.mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 and 0.94. Mutation rates for these loci are comparable (>10-5 per locus per generation) to that of the most diverse tandemly repeated regions found in other less diverse bacteria. Conclusion The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were identical using previous

  8. Goldilocks: a tool for identifying genomic regions that are ‘just right’

    PubMed Central

    Nicholls, Samuel M.; Clare, Amanda; Randall, Joshua C.

    2016-01-01

    Summary: We present Goldilocks: a Python package providing functionality for collecting summary statistics, identifying shifts in variation, discovering outlier regions and locating and extracting interesting regions from one or more arbitrary genomes for further analysis, for a user-provided definition of interesting. Availability and implementation: Goldilocks is freely available open-source software distributed under the MIT licence. Source code is hosted publicly at https://github.com/SamStudio8/goldilocks and the package may also be installed using pip install goldilocks. Documentation can be found at https://goldilocks.readthedocs.org. Contact: msn@aber.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153673

  9. Application of Selection Mapping to Identify Genomic Regions Associated with Dairy Production in Sheep

    PubMed Central

    Gutiérrez-Gil, Beatriz; Arranz, Juan Jose; Pong-Wong, Ricardo; García-Gámez, Elsa; Kijas, James; Wiener, Pamela

    2014-01-01

    In Europe, especially in Mediterranean areas, the sheep has been traditionally exploited as a dual purpose species, with income from both meat and milk. Modernization of husbandry methods and the establishment of breeding schemes focused on milk production have led to the development of “dairy breeds.” This study investigated selective sweeps specifically related to dairy production in sheep by searching for regions commonly identified in different European dairy breeds. With this aim, genotypes from 44,545 SNP markers covering the sheep autosomes were analysed in both European dairy and non-dairy sheep breeds using two approaches: (i) identification of genomic regions showing extreme genetic differentiation between each dairy breed and a closely related non-dairy breed, and (ii) identification of regions with reduced variation (heterozygosity) in the dairy breeds using two methods. Regions detected in at least two breeds (breed pairs) by the two approaches (genetic differentiation and at least one of the heterozygosity-based analyses) were labeled as core candidate convergence regions and further investigated for candidate genes. Following this approach six regions were detected. For some of them, strong candidate genes have been proposed (e.g. ABCG2, SPP1), whereas some other genes designated as candidates based on their association with sheep and cattle dairy traits (e.g. LALBA, DGAT1A) were not associated with a detectable sweep signal. Few of the identified regions were coincident with QTL previously reported in sheep, although many of them corresponded to orthologous regions in cattle where QTL for dairy traits have been identified. Due to the limited number of QTL studies reported in sheep compared with cattle, the results illustrate the potential value of selection mapping to identify genomic regions associated with dairy traits in sheep. PMID:24788864

  10. Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes

    PubMed Central

    Zeggini, Eleftheria; Scott, Laura J.; Saxena, Richa; Voight, Benjamin F.; Marchini, Jonathan L; Hu, Tainle; de Bakker, Paul IW; Abecasis, Gonçalo R; Almgren, Peter; Andersen, Gitte; Ardlie, Kristin; Boström, Kristina Bengtsson; Bergman, Richard N; Bonnycastle, Lori L; Borch-Johnsen, Knut; Burtt, Noël P; Chen, Hong; Chines, Peter S; Daly, Mark J; Deodhar, Parimal; Ding, Charles; Doney, Alex S F; Duren, William L; Elliott, Katherine S; Erdos, Michael R; Frayling, Timothy M; Freathy, Rachel M; Gianniny, Lauren; Grallert, Harald; Grarup, Niels; Groves, Christopher J; Guiducci, Candace; Hansen, Torben; Herder, Christian; Hitman, Graham A; Hughes, Thomas E; Isomaa, Bo; Jackson, Anne U; Jørgensen, Torben; Kong, Augustine; Kubalanza, Kari; Kuruvilla, Finny G; Kuusisto, Johanna; Langenberg, Claudia; Lango, Hana; Lauritzen, Torsten; Li, Yun; Lindgren, Cecilia M; Lyssenko, Valeriya; Marvelle, Amanda F; Meisinger, Christa; Midthjell, Kristian; Mohlke, Karen L; Morken, Mario A; Morris, Andrew D; Narisu, Narisu; Nilsson, Peter; Owen, Katharine R; Palmer, Colin NA; Payne, Felicity; Perry, John RB; Pettersen, Elin; Platou, Carl; Prokopenko, Inga; Qi, Lu; Qin, Li; Rayner, Nigel W; Rees, Matthew; Roix, Jeffrey J; Sandbæk, Anelli; Shields, Beverley; Sjögren, Marketa; Steinthorsdottir, Valgerdur; Stringham, Heather M; Swift, Amy J; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Timpson, Nicholas J; Tuomi, Tiinamaija; Tuomilehto, Jaakko; Walker, Mark; Watanabe, Richard M; Weedon, Michael N; Willer, Cristen J; Illig, Thomas; Hveem, Kristian; Hu, Frank B; Laakso, Markku; Stefansson, Kari; Pedersen, Oluf; Wareham, Nicholas J; Barroso, Inês; Hattersley, Andrew T; Collins, Francis S; Groop, Leif; McCarthy, Mark I; Boehnke, Michael; Altshuler, David

    2009-01-01

    Genome-wide association (GWA) studies have identified multiple new genomic loci at which common variants modestly but reproducibly influence risk of type 2 diabetes (T2D)1-11. Established associations to common and rare variants explain only a small proportion of the heritability of T2D. As previously published analyses had limited power to discover loci at which common alleles have modest effects, we performed meta-analysis of three T2D GWA scans encompassing 10,128 individuals of European-descent and ~2.2 million SNPs (directly genotyped and imputed). Replication testing was performed in an independent sample with an effective sample size of up to 53,975. At least six new loci with robust evidence for association were detected, including the JAZF1 (p=5.0×10−14), CDC123/CAMK1D (p=1.2×10−10), TSPAN8/LGR5 (p=1.1×10−9), THADA (p=1.1×10−9), ADAMTS9 (p=1.2×10−8), and NOTCH2 (p=4.1×10−8) gene regions. The large number of loci with relatively small effects indicates the value of large discovery and follow-up samples in identifying additional clues about the inherited basis of T2D. PMID:18372903

  11. Genomic-scale comparison of sequence- and structure-based methods of function prediction: Does structure provide additional insight?

    PubMed Central

    Fetrow, Jacquelyn S.; Siew, Naomi; Di Gennaro, Jeannine A.; Martinez-Yamout, Maria; Dyson, H. Jane; Skolnick, Jeffrey

    2001-01-01

    A function annotation method using the sequence-to-structure-to-function paradigm is applied to the identification of all disulfide oxidoreductases in the Saccharomyces cerevisiae genome. The method identifies 27 sequences as potential disulfide oxidoreductases. All previously known thioredoxins, glutaredoxins, and disulfide isomerases are correctly identified. Three of the 27 predictions are probable false-positives. Three novel predictions, which subsequently have been experimentally validated, are presented. Two additional novel predictions suggest a disulfide oxidoreductase regulatory mechanism for two subunits (OST3 and OST6) of the yeast oligosaccharyltransferase complex. Based on homology, this prediction can be extended to a potential tumor suppressor gene, N33, in humans, whose biochemical function was not previously known. Attempts to obtain a folded, active N33 construct to test the prediction were unsuccessful. The results show that structure prediction coupled with biochemically relevant structural motifs is a powerful method for the function annotation of genome sequences and can provide more detailed, robust predictions than function prediction methods that rely on sequence comparison alone. PMID:11316881

  12. Genome Regions Associated with Functional Performance of Soybean Stem Fibers in Polypropylene Thermoplastic Composites.

    PubMed

    Reinprecht, Yarmilla; Arif, Muhammad; Simon, Leonardo C; Pauls, K Peter

    2015-01-01

    Plant fibers can be used to produce composite materials for automobile parts, thus reducing plastic used in their manufacture, overall vehicle weight and fuel consumption when they replace mineral fillers and glass fibers. Soybean stem residues are, potentially, significant sources of inexpensive, renewable and biodegradable natural fibers, but are not curretly used for biocomposite production due to the functional properties of their fibers in composites being unknown. The current study was initiated to investigate the effects of plant genotype on the performance characteristics of soybean stem fibers when incorporated into a polypropylene (PP) matrix using a selective phenotyping approach. Fibers from 50 lines of a recombinant inbred line population (169 RILs) grown in different environments were incorporated into PP at 20% (wt/wt) by extrusion. Test samples were injection molded and characterized for their mechanical properties. The performance of stem fibers in the composites was significantly affected by genotype and environment. Fibers from different genotypes had significantly different chemical compositions, thus composites prepared with these fibers displayed different physical properties. This study demonstrates that thermoplastic composites with soybean stem-derived fibers have mechanical properties that are equivalent or better than wheat straw fiber composites currently being used for manufacturing interior automotive parts. The addition of soybean stem residues improved flexural, tensile and impact properties of the composites. Furthermore, by linkage and in silico mapping we identified genomic regions to which quantitative trait loci (QTL) for compositional and functional properties of soybean stem fibers in thermoplastic composites, as well as genes for cell wall synthesis, were co-localized. These results may lead to the development of high value uses for soybean stem residue. PMID:26167917

  13. Genome Regions Associated with Functional Performance of Soybean Stem Fibers in Polypropylene Thermoplastic Composites

    PubMed Central

    Reinprecht, Yarmilla; Arif, Muhammad; Simon, Leonardo C.; Pauls, K. Peter

    2015-01-01

    Plant fibers can be used to produce composite materials for automobile parts, thus reducing plastic used in their manufacture, overall vehicle weight and fuel consumption when they replace mineral fillers and glass fibers. Soybean stem residues are, potentially, significant sources of inexpensive, renewable and biodegradable natural fibers, but are not curretly used for biocomposite production due to the functional properties of their fibers in composites being unknown. The current study was initiated to investigate the effects of plant genotype on the performance characteristics of soybean stem fibers when incorporated into a polypropylene (PP) matrix using a selective phenotyping approach. Fibers from 50 lines of a recombinant inbred line population (169 RILs) grown in different environments were incorporated into PP at 20% (wt/wt) by extrusion. Test samples were injection molded and characterized for their mechanical properties. The performance of stem fibers in the composites was significantly affected by genotype and environment. Fibers from different genotypes had significantly different chemical compositions, thus composites prepared with these fibers displayed different physical properties. This study demonstrates that thermoplastic composites with soybean stem-derived fibers have mechanical properties that are equivalent or better than wheat straw fiber composites currently being used for manufacturing interior automotive parts. The addition of soybean stem residues improved flexural, tensile and impact properties of the composites. Furthermore, by linkage and in silico mapping we identified genomic regions to which quantitative trait loci (QTL) for compositional and functional properties of soybean stem fibers in thermoplastic composites, as well as genes for cell wall synthesis, were co-localized. These results may lead to the development of high value uses for soybean stem residue. PMID:26167917

  14. Distribution of Activator (Ac) Throughout the Maize Genome for Use in Regional Mutagenesis

    PubMed Central

    Kolkman, Judith M.; Conrad, Liza J.; Farmer, Phyllis R.; Hardeman, Kristine; Ahern, Kevin R.; Lewis, Paul E.; Sawers, Ruairidh J. H.; Lebejko, Sara; Chomet, Paul; Brutnell, Thomas P.

    2005-01-01

    A collection of Activator (Ac)-containing, near-isogenic W22 inbred lines has been generated for use in regional mutagenesis experiments. Each line is homozygous for a single, precisely positioned Ac element and the Ds reporter, r1-sc:m3. Through classical and molecular genetic techniques, 158 transposed Ac elements (tr-Acs) were distributed throughout the maize genome and 41 were precisely placed on the linkage map utilizing multiple recombinant inbred populations. Several PCR techniques were utilized to amplify DNA fragments flanking tr-Ac insertions up to 8 kb in length. Sequencing and database searches of flanking DNA revealed that the majority of insertions are in hypomethylated, low- or single-copy sequences, indicating an insertion site preference for genic sequences in the genome. However, a number of Ac transposition events were to highly repetitive sequences in the genome. We present evidence that suggests Ac expression is regulated by genomic context resulting in subtle variations in Ac-mediated excision patterns. These tr-Ac lines can be utilized to isolate genes with unknown function, to conduct fine-scale genetic mapping experiments, and to generate novel allelic diversity in applied breeding programs. PMID:15520264

  15. Variability among the Most Rapidly Evolving Plastid Genomic Regions is Lineage-Specific: Implications of Pairwise Genome Comparisons in Pyrus (Rosaceae) and Other Angiosperms for Marker Choice

    PubMed Central

    Ter-Voskanyan, Hasmik; Allgaier, Martin; Borsch, Thomas

    2014-01-01

    Plastid genomes exhibit different levels of variability in their sequences, depending on the respective kinds of genomic regions. Genes are usually more conserved while noncoding introns and spacers evolve at a faster pace. While a set of about thirty maximum variable noncoding genomic regions has been suggested to provide universally promising phylogenetic markers throughout angiosperms, applications often require several regions to be sequenced for many individuals. Our project aims to illuminate evolutionary relationships and species-limits in the genus Pyrus (Rosaceae)—a typical case with very low genetic distances between taxa. In this study, we have sequenced the plastid genome of Pyrus spinosa and aligned it to the already available P. pyrifolia sequence. The overall p-distance of the two Pyrus genomes was 0.00145. The intergenic spacers between ndhC–trnV, trnR–atpA, ndhF–rpl32, psbM–trnD, and trnQ–rps16 were the most variable regions, also comprising the highest total numbers of substitutions, indels and inversions (potentially informative characters). Our comparative analysis of further plastid genome pairs with similar low p-distances from Oenothera (representing another rosid), Olea (asterids) and Cymbidium (monocots) showed in each case a different ranking of genomic regions in terms of variability and potentially informative characters. Only two intergenic spacers (ndhF–rpl32 and trnK–rps16) were consistently found among the 30 top-ranked regions. We have mapped the occurrence of substitutions and microstructural mutations in the four genome pairs. High AT content in specific sequence elements seems to foster frequent mutations. We conclude that the variability among the fastest evolving plastid genomic regions is lineage-specific and thus cannot be precisely predicted across angiosperms. The often lineage-specific occurrence of stem-loop elements in the sequences of introns and spacers also governs lineage-specific mutations

  16. Genome-Centric Analysis of Microbial Populations Enriched by Hydraulic Fracture Fluid Additives in a Coal Bed Methane Production Well.

    PubMed

    Robbins, Steven J; Evans, Paul N; Parks, Donovan H; Golding, Suzanne D; Tyson, Gene W

    2016-01-01

    Coal bed methane (CBM) is generated primarily through the microbial degradation of coal. Despite a limited understanding of the microorganisms responsible for this process, there is significant interest in developing methods to stimulate additional methane production from CBM wells. Physical techniques including hydraulic fracture stimulation are commonly applied to CBM wells, however the effects of specific additives contained in hydraulic fracture fluids on native CBM microbial communities are poorly understood. Here, metagenomic sequencing was applied to the formation waters of a hydraulically fractured and several non-fractured CBM production wells to determine the effect of this stimulation technique on the in-situ microbial community. The hydraulically fractured well was dominated by two microbial populations belonging to the class Phycisphaerae (within phylum Planctomycetes) and candidate phylum Aminicenantes. Populations from these phyla were absent or present at extremely low abundance in non-fractured CBM wells. Detailed metabolic reconstruction of near-complete genomes from these populations showed that their high relative abundance in the hydraulically fractured CBM well could be explained by the introduction of additional carbon sources, electron acceptors, and biocides contained in the hydraulic fracture fluid. PMID:27375557

  17. Genome-Centric Analysis of Microbial Populations Enriched by Hydraulic Fracture Fluid Additives in a Coal Bed Methane Production Well.

    PubMed

    Robbins, Steven J; Evans, Paul N; Parks, Donovan H; Golding, Suzanne D; Tyson, Gene W

    2016-01-01

    Coal bed methane (CBM) is generated primarily through the microbial degradation of coal. Despite a limited understanding of the microorganisms responsible for this process, there is significant interest in developing methods to stimulate additional methane production from CBM wells. Physical techniques including hydraulic fracture stimulation are commonly applied to CBM wells, however the effects of specific additives contained in hydraulic fracture fluids on native CBM microbial communities are poorly understood. Here, metagenomic sequencing was applied to the formation waters of a hydraulically fractured and several non-fractured CBM production wells to determine the effect of this stimulation technique on the in-situ microbial community. The hydraulically fractured well was dominated by two microbial populations belonging to the class Phycisphaerae (within phylum Planctomycetes) and candidate phylum Aminicenantes. Populations from these phyla were absent or present at extremely low abundance in non-fractured CBM wells. Detailed metabolic reconstruction of near-complete genomes from these populations showed that their high relative abundance in the hydraulically fractured CBM well could be explained by the introduction of additional carbon sources, electron acceptors, and biocides contained in the hydraulic fracture fluid.

  18. Genome-Centric Analysis of Microbial Populations Enriched by Hydraulic Fracture Fluid Additives in a Coal Bed Methane Production Well

    PubMed Central

    Robbins, Steven J.; Evans, Paul N.; Parks, Donovan H.; Golding, Suzanne D.; Tyson, Gene W.

    2016-01-01

    Coal bed methane (CBM) is generated primarily through the microbial degradation of coal. Despite a limited understanding of the microorganisms responsible for this process, there is significant interest in developing methods to stimulate additional methane production from CBM wells. Physical techniques including hydraulic fracture stimulation are commonly applied to CBM wells, however the effects of specific additives contained in hydraulic fracture fluids on native CBM microbial communities are poorly understood. Here, metagenomic sequencing was applied to the formation waters of a hydraulically fractured and several non-fractured CBM production wells to determine the effect of this stimulation technique on the in-situ microbial community. The hydraulically fractured well was dominated by two microbial populations belonging to the class Phycisphaerae (within phylum Planctomycetes) and candidate phylum Aminicenantes. Populations from these phyla were absent or present at extremely low abundance in non-fractured CBM wells. Detailed metabolic reconstruction of near-complete genomes from these populations showed that their high relative abundance in the hydraulically fractured CBM well could be explained by the introduction of additional carbon sources, electron acceptors, and biocides contained in the hydraulic fracture fluid. PMID:27375557

  19. Genomic imprinting controls matrix attachment regions in the Igf2 gene.

    PubMed

    Weber, Michaël; Hagège, Hélène; Murrell, Adele; Brunel, Claude; Reik, Wolf; Cathala, Guy; Forné, Thierry

    2003-12-01

    Genomic imprinting at the Igf2/H19 locus originates from allele-specific DNA methylation, which modifies the affinity of some proteins for their target sequences. Here, we show that AT-rich DNA sequences located in the vicinity of previously characterized differentially methylated regions (DMRs) of the imprinted Igf2 gene are conserved between mouse and human. These sequences have all the characteristics of matrix attachment regions (MARs), which are known as versatile regulatory elements involved in chromatin structure and gene expression. Combining allele-specific nuclear matrix binding assays and real-time PCR quantification, we show that retention of two of these Igf2 MARs (MAR0 and MAR2) in the nuclear matrix fraction depends on the tissue and is specific to the paternal allele. Furthermore, on this allele, the Igf2 MAR2 is functionally linked to the neighboring DMR2 while, on the maternal allele, it is controlled by the imprinting-control region. Our work clearly demonstrates that genomic imprinting controls matrix attachment regions in the Igf2 gene.

  20. Molecular characterization of a genomic region associated with virulence in Dichelobacter nodosus.

    PubMed Central

    Katz, M E; Strugnell, R A; Rood, J I

    1992-01-01

    The major pathogen implicated in footrot, a highly contagious disease of sheep, is the strict anaerobe Dichelobacter nodosus (formerly Bacteroides nodosus). Sequence analysis of a 2,262-bp segment of the D. nodosus genome which is more prevalent in virulent isolates than in other isolates showed the presence of four open reading frames which appeared to have consensus transcriptional and translational start signals. These virulence-associated genes have been designated vapABCD. Two of the three copies of the vap region in the genome of the reference strain D. nodosus A198 were shown to carry all of the vap genes, whereas one copy contained only the vapD gene. The VapD protein was gel purified, shown to contain the predicted amino-terminal sequence, and used to raise rabbit antibodies. Western blots (immunoblots) showed that all of the D. nodosus strains tested that contained the vap region produced the VapD protein. The VapD protein had significant amino acid sequence identity with open reading frame 5 from the cryptic plasmid of Neisseria gonorrhoeae, and the vapBC operon had sequence similarity with the trbH region of the Escherichia coli F plasmid. It is proposed that these gene regions evolved from the integration of a conjugative plasmid from another bacterial species into the D. nodosus chromosome. Images PMID:1398971

  1. Sardinians Genetic Background Explained by Runs of Homozygosity and Genomic Regions under Positive Selection

    PubMed Central

    Di Gaetano, Cornelia; Fiorito, Giovanni; Ortu, Maria Francesca; Rosa, Fabio; Guarrera, Simonetta; Pardini, Barbara; Cusi, Daniele; Frau, Francesca; Barlassina, Cristina; Troffa, Chiara; Argiolas, Giuseppe; Zaninello, Roberta; Fresu, Giovanni; Glorioso, Nicola; Piazza, Alberto; Matullo, Giuseppe

    2014-01-01

    The peculiar position of Sardinia in the Mediterranean sea has rendered its population an interesting biogeographical isolate. The aim of this study was to investigate the genetic population structure, as well as to estimate Runs of Homozygosity and regions under positive selection, using about 1.2 million single nucleotide polymorphisms genotyped in 1077 Sardinian individuals. Using four different methods - fixation index, inflation factor, principal component analysis and ancestry estimation - we were able to highlight, as expected for a genetic isolate, the high internal homogeneity of the island. Sardinians showed a higher percentage of genome covered by RoHs>0.5 Mb (FRoH%0.5) when compared to peninsular Italians, with the only exception of the area surrounding Alghero. We furthermore identified 9 genomic regions showing signs of positive selection and, we re-captured many previously inferred signals. Other regions harbor novel candidate genes for positive selection, like TMEM252, or regions containing long non coding RNA. With the present study we confirmed the high genetic homogeneity of Sardinia that may be explained by the shared ancestry combined with the action of evolutionary forces. PMID:24651212

  2. Comparative genomic analysis of duplicated homoeologous regions involved in the resistance of Brassica napus to stem canker.

    PubMed

    Fopa Fomeju, Berline; Falentin, Cyril; Lassalle, Gilles; Manzanares-Dauleux, Maria J; Delourme, Régine

    2015-01-01

    All crop species are current or ancient polyploids. Following whole genome duplication, structural and functional modifications result in differential gene content or regulation in the duplicated regions, which can play a fundamental role in the diversification of genes underlying complex traits. We have investigated this issue in Brassica napus, a species with a highly duplicated genome, with the aim of studying the structural and functional organization of duplicated regions involved in quantitative resistance to stem canker, a disease caused by the fungal pathogen Leptosphaeria maculans. Genome-wide association analysis on two oilseed rape panels confirmed that duplicated regions of ancestral blocks E, J, R, U, and W were involved in resistance to stem canker. The structural analysis of the duplicated genomic regions showed a higher gene density on the A genome than on the C genome and a better collinearity between homoeologous regions than paralogous regions, as overall in the whole B. napus genome. The three ancestral sub-genomes were involved in the resistance to stem canker and the fractionation profile of the duplicated regions corresponded to what was expected from results on the B. napus progenitors. About 60% of the genes identified in these duplicated regions were single-copy genes while less than 5% were retained in all the duplicated copies of a given ancestral block. Genes retained in several copies were mainly involved in response to stress, signaling, or transcription regulation. Genes with resistance-associated markers were mainly retained in more than two copies. These results suggested that some genes underlying quantitative resistance to stem canker might be duplicated genes. Genes with a hydrolase activity that were retained in one copy or R-like genes might also account for resistance in some regions. Further analyses need to be conducted to indicate to what extent duplicated genes contribute to the expression of the resistance phenotype

  3. Comparative genomic analysis of duplicated homoeologous regions involved in the resistance of Brassica napus to stem canker

    PubMed Central

    Fopa Fomeju, Berline; Falentin, Cyril; Lassalle, Gilles; Manzanares-Dauleux, Maria J.; Delourme, Régine

    2015-01-01

    All crop species are current or ancient polyploids. Following whole genome duplication, structural and functional modifications result in differential gene content or regulation in the duplicated regions, which can play a fundamental role in the diversification of genes underlying complex traits. We have investigated this issue in Brassica napus, a species with a highly duplicated genome, with the aim of studying the structural and functional organization of duplicated regions involved in quantitative resistance to stem canker, a disease caused by the fungal pathogen Leptosphaeria maculans. Genome-wide association analysis on two oilseed rape panels confirmed that duplicated regions of ancestral blocks E, J, R, U, and W were involved in resistance to stem canker. The structural analysis of the duplicated genomic regions showed a higher gene density on the A genome than on the C genome and a better collinearity between homoeologous regions than paralogous regions, as overall in the whole B. napus genome. The three ancestral sub-genomes were involved in the resistance to stem canker and the fractionation profile of the duplicated regions corresponded to what was expected from results on the B. napus progenitors. About 60% of the genes identified in these duplicated regions were single-copy genes while less than 5% were retained in all the duplicated copies of a given ancestral block. Genes retained in several copies were mainly involved in response to stress, signaling, or transcription regulation. Genes with resistance-associated markers were mainly retained in more than two copies. These results suggested that some genes underlying quantitative resistance to stem canker might be duplicated genes. Genes with a hydrolase activity that were retained in one copy or R-like genes might also account for resistance in some regions. Further analyses need to be conducted to indicate to what extent duplicated genes contribute to the expression of the resistance phenotype

  4. High-density linkage mapping and distribution of segregation distortion regions in the oak genome

    PubMed Central

    Bodénès, Catherine; Chancerel, Emilie; Ehrenmann, François; Kremer, Antoine; Plomion, Christophe

    2016-01-01

    We developed the densest single-nucleotide polymorphism (SNP)-based linkage genetic map to date for the genus Quercus. An 8k gene-based SNP array was used to genotype more than 1,000 full-sibs from two intraspecific and two interspecific full-sib families of Quercus petraea and Quercus robur. A high degree of collinearity was observed between the eight parental maps of the two species. A composite map was then established with 4,261 SNP markers spanning 742 cM over the 12 linkage groups (LGs) of the oak genome. Nine genomic regions from six LGs displayed highly significant distortions of segregation. Two main hypotheses concerning the mechanisms underlying segregation distortion are discussed: genetic load vs. reproductive barriers. Our findings suggest a predominance of pre-zygotic to post-zygotic barriers. PMID:27013549

  5. Complete genome sequence of Deltapapillomavirus 4 (bovine papillomavirus 2) from a bovine papillomavirus lesion in Amazon Region, Brazil

    PubMed Central

    Daudt, Cíntia; da Silva, Flavio RC; Cibulski, Samuel P; Weber, Matheus N; Mayer, Fabiana Q; Varela, Ana Paula M; Roehe, Paulo M; Canal, Cláudio W

    2016-01-01

    The complete genome sequence of bovine papillomavirus 2 (BPV2) from Brazilian Amazon Region was determined using multiple-primed rolling circle amplification followed by Illumina sequencing. The genome is 7,947 bp long, with 45.9% GC content. It encodes seven early (E1, E2,E4, E5, E6,E7, and E8) and two late (L1 and L2) genes. The complete genome of a BPV2 can help in future studies since this BPV type is highly reported worldwide although the lack of complete genome sequences available. PMID:27074259

  6. Complete genome sequence of Deltapapillomavirus 4 (bovine papillomavirus 2) from a bovine papillomavirus lesion in Amazon Region, Brazil.

    PubMed

    Daudt, Cíntia; Silva, Flavio R C da; Cibulski, Samuel P; Weber, Matheus N; Mayer, Fabiana Q; Varela, Ana Paula M; Roehe, Paulo M; Canal, Cláudio W

    2016-04-01

    The complete genome sequence of bovine papillomavirus 2 (BPV2) from Brazilian Amazon Region was determined using multiple-primed rolling circle amplification followed by Illumina sequencing. The genome is 7,947 bp long, with 45.9% GC content. It encodes seven early (E1, E2,E4, E5, E6,E7, and E8) and two late (L1 and L2) genes. The complete genome of a BPV2 can help in future studies since this BPV type is highly reported worldwide although the lack of complete genome sequences available.

  7. Origins of the Xylella fastidiosa Prophage-Like Regions and Their Impact in Genome Differentiation

    PubMed Central

    de Mello Varani, Alessandro; Souza, Rangel Celso; Nakaya, Helder I.; de Lima, Wanessa Cristina; Paula de Almeida, Luiz Gonzaga; Kitajima, Elliot Watanabe; Chen, Jianchi; Civerolo, Edwin; Vasconcelos, Ana Tereza Ribeiro; Van Sluys, Marie-Anne

    2008-01-01

    Xylella fastidiosa is a Gram negative plant pathogen causing many economically important diseases, and analyses of completely sequenced X. fastidiosa genome strains allowed the identification of many prophage-like elements and possibly phage remnants, accounting for up to 15% of the genome composition. To better evaluate the recent evolution of the X. fastidiosa chromosome backbone among distinct pathovars, the number and location of prophage-like regions on two finished genomes (9a5c and Temecula1), and in two candidate molecules (Ann1 and Dixon) were assessed. Based on comparative best bidirectional hit analyses, the majority (51%) of the predicted genes in the X. fastidiosa prophage-like regions are related to structural phage genes belonging to the Siphoviridae family. Electron micrograph reveals the existence of putative viral particles with similar morphology to lambda phages in the bacterial cell in planta. Moreover, analysis of microarray data indicates that 9a5c strain cultivated under stress conditions presents enhanced expression of phage anti-repressor genes, suggesting switches from lysogenic to lytic cycle of phages under stress-induced situations. Furthermore, virulence-associated proteins and toxins are found within these prophage-like elements, thus suggesting an important role in host adaptation. Finally, clustering analyses of phage integrase genes based on multiple alignment patterns reveal they group in five lineages, all possessing a tyrosine recombinase catalytic domain, and phylogenetically close to other integrases found in phages that are genetic mosaics and able to perform generalized and specialized transduction. Integration sites and tRNA association is also evidenced. In summary, we present comparative and experimental evidence supporting the association and contribution of phage activity on the differentiation of Xylella genomes. PMID:19116666

  8. Nonhomologous recombination between the large unassigned region of the male and female mitochondrial genomes in the mussel, Mytilus trossulus.

    PubMed

    Rawson, Paul D

    2005-12-01

    Doubly uniparental inheritance of mtDNA (DUI) is commonly observed in several genera of bivalves. Under DUI, female offspring inherit mtDNA from their mothers, while male offspring inherit mtDNA from both parents but preferentially transmit the paternally inherited mtDNA to their sons. Several studies have shown that the female- and male-specific mtDNA lineages in blue mussels, Mytilus spp., vary by upward of 20% at the nucleotide level. In addition to high levels of nucleotide substitution, the present study observed substantial gender-based length polymorphism in the presumptive mitochondrial control region (=large unassigned region; LUR) of North American M. trossulus. In this species, female lineage LUR haplotypes are over 2 kb larger than male lineage LUR haplotypes. Analysis of sequence data for these length variants indicates that the F LUR haplotypes of North American M. trossulus contain sequences similar to the F lineage control region in the congeners M. edulis and M. galloprovincialis. Relative to the F LUR in the latter two species, however, the F lineage LUR haplotypes in M. trossulus contain two large sequence insertions, each nearly 1 kb in size. One of these insertions has high sequence similarity to the male lineage LUR of M. trossulus. The tandem arrangement of F and M control region sequences in the F lineage LUR of M. trossulus is most likely the result of nonhomologous recombination between the male and the female mitochondrial genomes in M. trossulus, a finding that has important implications regarding the transmission and evolution of blue mussel mitochondrial genomes.

  9. PCR primers for 30 novel gene regions in the nuclear genomes of Lepidoptera

    PubMed Central

    Wahlberg, Niklas; Peña, Carlos; Ahola, Milla; Wheat, Christopher W.; Rota, Jadranka

    2016-01-01

    Abstract We report primer pairs for 30 new gene regions in the nuclear genomes of Lepidoptera that can be amplified using a standard PCR protocol. The new primers were tested across diverse Lepidoptera, including nonditrysians and a wide selection of ditrysians. These new gene regions give a total of 11,043 bp of DNA sequence data and they show similar variability to traditionally used nuclear gene regions in studies of Lepidoptera. We feel that a PCR-based approach still has its place in molecular systematic studies of Lepidoptera, particularly at the intrafamilial level, and our new set of primers now provides a route to generating phylogenomic datasets using traditional methods. PMID:27408580

  10. Establishment of regions of genomic activity during the Drosophila maternal to zygotic transition.

    PubMed

    Li, Xiao-Yong; Harrison, Melissa M; Villalta, Jacqueline E; Kaplan, Tommy; Eisen, Michael B

    2014-01-01

    We describe the genome-wide distributions and temporal dynamics of nucleosomes and post-translational histone modifications throughout the maternal-to-zygotic transition in embryos of Drosophila melanogaster. At mitotic cycle 8, when few zygotic genes are being transcribed, embryonic chromatin is in a relatively simple state: there are few nucleosome free regions, undetectable levels of the histone methylation marks characteristic of mature chromatin, and low levels of histone acetylation at a relatively small number of loci. Histone acetylation increases by cycle 12, but it is not until cycle 14 that nucleosome free regions and domains of histone methylation become widespread. Early histone acetylation is strongly associated with regions that we have previously shown to be bound in early embryos by the maternally deposited transcription factor Zelda, suggesting that Zelda triggers a cascade of events, including the accumulation of specific histone modifications, that plays a role in the subsequent activation of these sequences. PMID:25313869

  11. Gene map of large yellow croaker (Larimichthys crocea) provides insights into teleost genome evolution and conserved regions associated with growth.

    PubMed

    Xiao, Shijun; Wang, Panpan; Zhang, Yan; Fang, Lujing; Liu, Yang; Li, Jiong-Tang; Wang, Zhi-Yong

    2015-12-22

    The genetic map of a species is essential for its whole genome assembly and can be applied to the mapping of important traits. In this study, we performed RNA-seq for a family of large yellow croakers (Larimichthys crocea) and constructed a high-density genetic map. In this map, 24 linkage groups comprised 3,448 polymorphic SNP markers. Approximately 72.4% (2,495) of the markers were located in protein-coding regions. Comparison of the croaker genome with those of five model fish species revealed that the croaker genome structure was closer to that of the medaka than to the remaining four genomes. Because the medaka genome preserves the teleost ancestral karyotype, this result indicated that the croaker genome might also maintain the teleost ancestral genome structure. The analysis also revealed different genome rearrangements across teleosts. QTL mapping and association analysis consistently identified growth-related QTL regions and associated genes. Orthologs of the associated genes in other species were demonstrated to regulate development, indicating that these genes might regulate development and growth in croaker. This gene map will enable us to construct the croaker genome for comparative studies and to provide an important resource for selective breeding of croaker.

  12. Gene map of large yellow croaker (Larimichthys crocea) provides insights into teleost genome evolution and conserved regions associated with growth

    PubMed Central

    Xiao, Shijun; Wang, Panpan; Zhang, Yan; Fang, Lujing; Liu, Yang; Li, Jiong-Tang; Wang, Zhi-Yong

    2015-01-01

    The genetic map of a species is essential for its whole genome assembly and can be applied to the mapping of important traits. In this study, we performed RNA-seq for a family of large yellow croakers (Larimichthys crocea) and constructed a high-density genetic map. In this map, 24 linkage groups comprised 3,448 polymorphic SNP markers. Approximately 72.4% (2,495) of the markers were located in protein-coding regions. Comparison of the croaker genome with those of five model fish species revealed that the croaker genome structure was closer to that of the medaka than to the remaining four genomes. Because the medaka genome preserves the teleost ancestral karyotype, this result indicated that the croaker genome might also maintain the teleost ancestral genome structure. The analysis also revealed different genome rearrangements across teleosts. QTL mapping and association analysis consistently identified growth-related QTL regions and associated genes. Orthologs of the associated genes in other species were demonstrated to regulate development, indicating that these genes might regulate development and growth in croaker. This gene map will enable us to construct the croaker genome for comparative studies and to provide an important resource for selective breeding of croaker. PMID:26689832

  13. Non-additive genome-wide association scan reveals a new gene associated with habitual coffee consumption

    PubMed Central

    Pirastu, Nicola; Kooyman, Maarten; Robino, Antonietta; van der Spek, Ashley; Navarini, Luciano; Amin, Najaf; Karssen, Lennart C.; Van Duijn, Cornelia M; Gasparini, Paolo

    2016-01-01

    Coffee is one of the most consumed beverages world-wide and one of the primary sources of caffeine intake. Given its important health and economic impact, the underlying genetics of its consumption has been widely studied. Despite these efforts, much has still to be uncovered. In particular, the use of non-additive genetic models may uncover new information about the genetic variants driving coffee consumption. We have conducted a genome-wide association study in two Italian populations using additive, recessive and dominant models for analysis. This has uncovered a significant association in the PDSS2 gene under the recessive model that has been replicated in an independent cohort from the Netherlands (ERF). The identified gene has been shown to negatively regulate the expression of the caffeine metabolism genes and can thus be linked to coffee consumption. Further bioinformatics analysis of eQTL and histone marks from Roadmap data has evidenced a possible role of the identified SNPs in regulating PDSS2 gene expression through enhancers present in its intron. Our results highlight a novel gene which regulates coffee consumption by regulating the expression of the genes linked to caffeine metabolism. Further studies will be needed to clarify the biological mechanism which links PDSS2 and coffee consumption. PMID:27561104

  14. Non-additive genome-wide association scan reveals a new gene associated with habitual coffee consumption.

    PubMed

    Pirastu, Nicola; Kooyman, Maarten; Robino, Antonietta; van der Spek, Ashley; Navarini, Luciano; Amin, Najaf; Karssen, Lennart C; Van Duijn, Cornelia M; Gasparini, Paolo

    2016-01-01

    Coffee is one of the most consumed beverages world-wide and one of the primary sources of caffeine intake. Given its important health and economic impact, the underlying genetics of its consumption has been widely studied. Despite these efforts, much has still to be uncovered. In particular, the use of non-additive genetic models may uncover new information about the genetic variants driving coffee consumption. We have conducted a genome-wide association study in two Italian populations using additive, recessive and dominant models for analysis. This has uncovered a significant association in the PDSS2 gene under the recessive model that has been replicated in an independent cohort from the Netherlands (ERF). The identified gene has been shown to negatively regulate the expression of the caffeine metabolism genes and can thus be linked to coffee consumption. Further bioinformatics analysis of eQTL and histone marks from Roadmap data has evidenced a possible role of the identified SNPs in regulating PDSS2 gene expression through enhancers present in its intron. Our results highlight a novel gene which regulates coffee consumption by regulating the expression of the genes linked to caffeine metabolism. Further studies will be needed to clarify the biological mechanism which links PDSS2 and coffee consumption. PMID:27561104

  15. A genome walking strategy for the identification of nucleotide sequences adjacent to known regions.

    PubMed

    Wang, Hailong; Yao, Ting; Cai, Mei; Xiao, Xiuqing; Ding, Xuezhi; Xia, Liqiu

    2013-02-01

    To identify the transposon insertion sites in a soil actinomycete, Saccharopolyspora spinosa, a genome walking approach, termed SPTA-PCR, was developed. In SPTA-PCR, a simple procedure consisting of TA cloning and a high stringency PCR, following the single primer-mediated, randomly-primed PCR, can eliminate non-target DNA fragments and obtain target fragments specifically. Using SPTA-PCR, the DNA sequence adjacent to the highly conserved region of lectin coding gene in onion plant, Allium chinense, was also cloned. PMID:23108875

  16. A Dual Color Southern Blot to Visualize Two Genomes or Genic Regions Simultaneously

    PubMed Central

    Zavala, Anamaria G.; Kulkarni, Amit S.; Fortunato, Elizabeth A.

    2014-01-01

    This report describes the development of a novel dual color Southern protocol to visualize two distinct genomes or genic regions simultaneously on a single Southern blot. The blot is developed with IRDye-conjugated antibody (Ab) and streptavidin that recognize Digoxigenin- (Dig) or biotin-labeled probes, respectively and visualized on an infrared imager. This protocol was validated by visualizing viral and host genomes of human cytomegalovirus (HCMV)-infected human fibroblasts. This technique utilizes extremely sensitive fluorescent imaging, allowing the detection of nanogram quantities of DNA, as opposed to microgram quantities needed in Southerns using radioactively labeled probes, and eliminates the inherent loss in signal after stripping and reprobing a Southern blot. The probes are labeled with non-radioactive Dig and biotin and can be stored for extended periods of time. This protocol will aid in studies of any system with two genomes, such as cells infected with numerous types of microorganisms (virus/parasites/bacteria), or studies of mitochondrial and nuclear DNA within the same cells. PMID:24389128

  17. [Topological Conflicts in Phylogenetic Analysis of Different Regions of the Sable (Martes zibellina L.) Mitochondrial Genome].

    PubMed

    Malyarchuk, B A; Derenko, M V; Denisova, G A; Litvinov, A N

    2015-08-01

    Phylogenetic analysis of different regions of the mitochondrial genome of the sable showed the presence of several topologies of phylogenetic trees, but the most statistically significant topology is A-BC, which was obtained as a result of the analysis of the mitochondrial genome as a whole, as well as of the individual CO1, ND4, and ND5 genes. Analysis of the intergroup divergence of the mtDNA haplotypes (Dxy) indicated that the maximum Dxy values between A and BC groups were accompanied by minimum differences between B and C groups only for six genes showing the A-BC topology (12S rRNA; CO1, CO2, ND4, ND5, and CYTB). It is assumed that the topological conflicts observed in the analysis of individual sable mtDNA genes are associated with the uneven distribution of mutations along the mitochondrial genome and the mitochondrial tree. This may be due to random causes, as well as the nonuniform effect of selection. PMID:26601491

  18. Identification and mapping of DNA binding proteins target sequences in long genomic regions by two-dimensional EMSA.

    PubMed

    Chernov, Igor P; Akopov, Sergey B; Nikolaev, Lev G; Sverdlov, Eugene D

    2006-07-01

    Specific binding of nuclear proteins, in particular transcription factors, to target DNA sequences is a major mechanism of genome functioning and gene expression regulation in eukaryotes. Therefore, identification and mapping specific protein target sites (PTS) is necessary for understanding genomic regulation. Here we used a novel two-dimensional electrophoretic mobility shift assay (2D-EMSA) procedure for identification and mapping of 52 PTS within a 563-kb human genome region located between the FXYD5 and TZFP genes. The PTS occurred with approximately equal frequency within unique and repetitive genomic regions. PTS belonging to unique sequences tended to group together within gene introns and close to their 5' and 3' ends, whereas PTS located within repeats were evenly distributed between transcribed and intragenic regions. PMID:16869519

  19. Composite selection signals can localize the trait specific genomic regions in multi-breed populations of cattle and sheep

    PubMed Central

    2014-01-01

    Background Discerning the traits evolving under neutral conditions from those traits evolving rapidly because of various selection pressures is a great challenge. We propose a new method, composite selection signals (CSS), which unifies the multiple pieces of selection evidence from the rank distribution of its diverse constituent tests. The extreme CSS scores capture highly differentiated loci and underlying common variants hauling excess haplotype homozygosity in the samples of a target population. Results The data on high-density genotypes were analyzed for evidence of an association with either polledness or double muscling in various cohorts of cattle and sheep. In cattle, extreme CSS scores were found in the candidate regions on autosome BTA-1 and BTA-2, flanking the POLL locus and MSTN gene, for polledness and double muscling, respectively. In sheep, the regions with extreme scores were localized on autosome OAR-2 harbouring the MSTN gene for double muscling and on OAR-10 harbouring the RXFP2 gene for polledness. In comparison to the constituent tests, there was a partial agreement between the signals at the four candidate loci; however, they consistently identified additional genomic regions harbouring no known genes. Persuasively, our list of all the additional significant CSS regions contains genes that have been successfully implicated to secondary phenotypic diversity among several subpopulations in our data. For example, the method identified a strong selection signature for stature in cattle capturing selective sweeps harbouring UQCC-GDF5 and PLAG1-CHCHD7 gene regions on BTA-13 and BTA-14, respectively. Both gene pairs have been previously associated with height in humans, while PLAG1-CHCHD7 has also been reported for stature in cattle. In the additional analysis, CSS identified significant regions harbouring multiple genes for various traits under selection in European cattle including polledness, adaptation, metabolism, growth rate, stature

  20. Genomic Regions Associated with Root Traits under Drought Stress in Tropical Maize (Zea mays L.)

    PubMed Central

    Zaidi, P. H.; Krishna, Girish; Krishnamurthy, L.; Gajanan, S.; Babu, Raman; Zerka, M.; Vinayan, M. T.; Vivek, B. S.

    2016-01-01

    An association mapping panel, named as CIMMYT Asia association mapping (CAAM) panel, involving 396 diverse tropical maize lines were phenotyped for various structural and functional traits of roots under drought and well-watered conditions. The experiment was conducted during Kharif (summer-rainy) season of 2012 and 2013 in root phenotyping facility at CIMMYT-Hyderabad, India. The CAAM panel was genotyped to generate 955, 690 SNPs through GBS v2.7 using Illumina Hi-seq 2000/2500 at Institute for Genomic Diversity, Cornell University, Ithaca, NY, USA. GWAS analysis was carried out using 331,390 SNPs filtered from the entire set of SNPs revealed a total of 50 and 67 SNPs significantly associated for root functional (transpiration efficiency, flowering period water use) and structural traits (rooting depth, root dry weight, root length, root volume, root surface area and root length density), respectively. In addition to this, 37 SNPs were identified for grain yield and shoot biomass under well-watered and drought stress. Though many SNPs were found to have significant association with the traits under study, SNPs that were common for more than one trait were discussed in detail. A total 18 SNPs were found to have common association with more than one trait, out of which 12 SNPs were found within or near the various gene functional regions. In this study we attempted to identify the trait specific maize lines based on the presence of favorable alleles for the SNPs associated with multiple traits. Two SNPs S3_128533512 and S7_151238865 were associated with transpiration efficiency, shoot biomass and grain yield under well-watered condition. Based on favorable allele for these SNPs seven inbred lines were identified. Similarly, four lines were identified for transpiration efficiency and shoot biomass under drought stress based on the presence of favorable allele for the common SNPs S1_211520521, S2_20017716, S3_57210184 and S7_130878458 and three lines were identified

  1. Microcollinearity in an ethylene receptor coding gene region of the Coffea canephora genome is extensively conserved with Vitis vinifera and other distant dicotyledonous sequenced genomes

    PubMed Central

    Guyot, Romain; de la Mare, Marion; Viader, Véronique; Hamon, Perla; Coriton, Olivier; Bustamante-Porras, José; Poncet, Valérie; Campa, Claudine; Hamon, Serge; de Kochko, Alexandre

    2009-01-01

    Background Coffea canephora, also called Robusta, belongs to the Rubiaceae, the fourth largest angiosperm family. This diploid species (2x = 2n = 22) has a fairly small genome size of ≈ 690 Mb and despite its extreme economic importance, particularly for developing countries, knowledge on the genome composition, structure and evolution remain very limited. Here, we report the 160 kb of the first C. canephora Bacterial Artificial Chromosome (BAC) clone ever sequenced and its fine analysis. Results This clone contains the CcEIN4 gene, encoding an ethylene receptor, and twenty other predicted genes showing a high gene density of one gene per 7.8 kb. Most of them display perfect matches with C. canephora expressed sequence tags or show transcriptional activities through PCR amplifications on cDNA libraries. Twenty-three transposable elements, mainly Class II transposon derivatives, were identified at this locus. Most of these Class II elements are Miniature Inverted-repeat Transposable Elements (MITE) known to be closely associated with plant genes. This BAC composition gives a pattern similar to those found in gene rich regions of Solanum lycopersicum and Medicago truncatula genomes indicating that the CcEIN4 regions may belong to a gene rich region in the C. canephora genome. Comparative sequence analysis indicated an extensive conservation between C. canephora and most of the reference dicotyledonous genomes studied in this work, such as tomato (S. lycopersicum), grapevine (V. vinifera), barrel medic M. truncatula, black cottonwood (Populus trichocarpa) and Arabidopsis thaliana. The higher degree of microcollinearity was found between C. canephora and V. vinifera, which belong respectively to the Asterids and Rosids, two clades that diverged more than 114 million years ago. Conclusion This study provides a first glimpse of C. canephora genome composition and evolution. Our data revealed a remarkable conservation of the microcollinearity between C. canephora and V

  2. A genomic region involved in the formation of adhesin fibers in Bacillus cereus biofilms

    PubMed Central

    Caro-Astorga, Joaquín; Pérez-García, Alejandro; de Vicente, Antonio; Romero, Diego

    2015-01-01

    Bacillus cereus is a bacterial pathogen that is responsible for many recurrent disease outbreaks due to food contamination. Spores and biofilms are considered the most important reservoirs of B. cereus in contaminated fresh vegetables and fruits. Biofilms are bacterial communities that are difficult to eradicate from biotic and abiotic surfaces because of their stable and extremely strong extracellular matrix. These extracellular matrixes contain exopolysaccharides, proteins, extracellular DNA, and other minor components. Although B. cereus can form biofilms, the bacterial features governing assembly of the protective extracellular matrix are not known. Using the well-studied bacterium B. subtilis as a model, we identified two genomic loci in B. cereus, which encodes two orthologs of the amyloid-like protein TasA of B. subtilis and a SipW signal peptidase. Deletion of this genomic region in B. cereus inhibited biofilm assembly; notably, mutation of the putative signal peptidase SipW caused the same phenotype. However, mutations in tasA or calY did not completely prevent biofilm formation; strains that were mutated for either of these genes formed phenotypically different surface attached biofilms. Electron microscopy studies revealed that TasA polymerizes to form long and abundant fibers on cell surfaces, whereas CalY does not aggregate similarly. Heterologous expression of this amyloid-like cassette in a B. subtilis strain lacking the factors required for the assembly of TasA amyloid-like fibers revealed (i) the involvement of this B. cereus genomic region in formation of the air-liquid interphase pellicles and (ii) the intrinsic ability of TasA to form fibers similar to the amyloid-like fibers produced by its B. subtilis ortholog. PMID:25628606

  3. Long regions of homologous DNA are incorporated into the tobacco plastid genome by transformation.

    PubMed Central

    Staub, J M; Maliga, P

    1992-01-01

    We investigated the size of flanking DNA incorporated into the tobacco plastid genome alongside a selectable antibiotic resistance mutation. The results showed that integration of a long uninterrupted region of homologous DNA, rather than of small fragments as previously thought, is the more likely event in plastid transformation of land plants. Transforming plasmid pJS75 contains a 6.2-kb DNA fragment from the inverted repeat region of the tobacco plastid genome. A spectinomycin resistance mutation is encoded in the gene of the 16S rRNA and, 3.2 kb away, a streptomycin resistance mutation is encoded in exon II of the ribosomal protein gene rps12. Transplastomic lines were obtained after introduction of pJS75 DNA into leaf cells by the biolistic process and selection for the spectinomycin resistance marker. Homologous replacement of resident wild-type sequences resulted in integration of all, or almost all, of the 6.2-kb plastid DNA sequence from pJS75. Plasmid pJS75, which contains engineered cloning sites between two selectable markers, can be used as a plastid insertion vector. PMID:1356049

  4. Two genomic regions together cause dark abdominal pigmentation in Drosophila tenebrosa.

    PubMed

    Bray, M J; Werner, T; Dyer, K A

    2014-04-01

    Pigmentation is a rapidly evolving trait that is under both natural and sexual selection in many organisms. In the quinaria group of Drosophila, nearly all of the 30 species have an abdomen that is light in color with distinct markings; D. tenebrosa is the exception in that it has a completely melanic abdomen with no visible markings. In this study, we use a combination of quantitative genetic and candidate gene approaches to investigate the genetic basis of abdominal pigmentation in D. tenebrosa. We find that abdominal pigmentation is invariant across wild-caught lines of D. tenebrosa and is not sexually dimorphic. Quantitative genetic mapping utilizing crosses between D. tenebrosa and the light-colored D. suboccidentalis indicates that two genomic regions together underlie abdominal pigmentation, including the X-chromosome and an autosome (Muller Element C/E). Further support for their central importance in pigmentation is that experimental introgression of one phenotype into the other species, in either direction, results in introgression of these two genomic regions. Finally, the expression of the X-linked gene yellow in the pupae exactly foreshadows the adult melanization pattern in the abdomen of both species, suggesting that changes in the regulation of yellow are important for the phenotypic divergence of D. tenebrosa from the rest of the quinaria group. These results contribute to a body of work that demonstrates how changes in expression of highly conserved genes can cause substantial phenotypic differences even between closely related species.

  5. Identification and characterization of two nonessential regions of the rabbitpox virus genome involved in virulence.

    PubMed

    Bloom, D C; Edwards, K M; Hager, C; Moyer, R W

    1991-03-01

    We have developed a means to identify genes associated with particular aspects of virulence. By beginning with an avirulent deletion mutant of rabbitpox virus and systematically reintroducing overlapping segments of the deleted region, we have identified two regions of the viral genome associated with increased virulence in mice. Evaluation of illness has been aided by the exploitation of weight loss as an indicator of pathogenesis. One of the regions identified by this method contains several open reading frames and includes two previously described genes. A third, as yet undescribed, gene within this region potentially encodes a product related to the C5 protein of human complement. The second region of DNA associated with increased virulence is the HindIII M fragment, which contains only one complete open reading frame. Analysis of this previously unreported gene shows coding potential for a polypeptide of 254 amino acids (approximately 25 kDa) which is related to the C4 component of human complement. The elucidation of two new viral genes related to complement components, taken together with the recent report of the biological activity of a poxvirus-encoded complement-binding protein, suggests the importance of interactions of the virus with the complement system during a normal infection.

  6. The polycystic kidney disease 1 gene lies in a duplicated genomic region

    SciTech Connect

    Ward, C.J.; Hughes, J.; Peral, B. |

    1994-09-01

    The polycystic kidney disease 1 (PKD1) gene is situated in chromosomal band 16p13.3 and encodes a 14 kb transcript. The 5{prime} region of the PKD1 gene is located within a 40-50 kb stretch of genomic DNA which is duplicated several times in the more proximal region, 16p13.1. This proximal area gives rise to at least three transcripts designated homologous gene A (HG-A; 21 kb), HG-B (17 kb) and HG-C (8.5 kb). These three transcripts share substantial homology with each other and the PKD1 transcript. However, the 3{prime} 3.8 kb section of the PKD1 transcript is unique because it is encoded by a region of the gene that lies outside the duplicated area. The presence of the duplicate transcripts in all tissues analyzed has hampered attempts to clone and sequence the bone fide PKD1 gene. Comparison of cDNAs known to arise from the PKD1 transcript to those from the HG transcripts reveals that divergence of 2-3% has occurred between these sequences. To overcome the problem of the duplication, a large 15 kb section of genomic DNA has been sequenced together with several large HG cDNAs. Utilizing a radiation hybrid which contains only the 16p13.3 region and expresses low levels of the PKD1 transcript, we are now attempting to clone the duplicated part of the PKD1 gene by exon linking.

  7. In silico screening of the chicken genome for overlaps between genomic regions: microRNA genes, coding and non-coding transcriptional units, QTL, and genetic variations.

    PubMed

    Zorc, Minja; Kunej, Tanja

    2016-05-01

    MicroRNAs (miRNAs) are a class of non-coding RNAs involved in posttranscriptional regulation of target genes. Regulation requires complementarity between target mRNA and the mature miRNA seed region, responsible for their recognition and binding. It has been estimated that each miRNA targets approximately 200 genes, and genetic variability of miRNA genes has been reported to affect phenotypic variability and disease susceptibility in humans, livestock species, and model organisms. Polymorphisms in miRNA genes could therefore represent biomarkers for phenotypic traits in livestock animals. In our previous study, we collected polymorphisms within miRNA genes in chicken. In the present study, we identified miRNA-related genomic overlaps to prioritize genomic regions of interest for further functional studies and biomarker discovery. Overlapping genomic regions in chicken were analyzed using the following bioinformatics tools and databases: miRNA SNiPer, Ensembl, miRBase, NCBI Blast, and QTLdb. Out of 740 known pre-miRNA genes, 263 (35.5 %) contain polymorphisms; among them, 35 contain more than three polymorphisms The most polymorphic miRNA genes in chicken are gga-miR-6662, containing 23 single nucleotide polymorphisms (SNPs) within the pre-miRNA region, including five consecutive SNPs, and gga-miR-6688, containing ten polymorphisms including three consecutive polymorphisms. Several miRNA-related genomic hotspots have been revealed in chicken genome; polymorphic miRNA genes are located within protein-coding and/or non-coding transcription units and quantitative trait loci (QTL) associated with production traits. The present study includes the first description of an exonic miRNA in a chicken genome, an overlap between the miRNA gene and the exon of the protein-coding gene (gga-miR-6578/HADHB), and the first report of a missense polymorphism located within a mature miRNA seed region. Identified miRNA-related genomic hotspots in chicken can serve researchers as a

  8. A novel method for discovering local spatial clusters of genomic regions with functional relationships from DNA contact maps

    PubMed Central

    Hu, Xihao; Shi, Christina Huan; Yip, Kevin Y.

    2016-01-01

    Motivation: The three-dimensional structure of genomes makes it possible for genomic regions not adjacent in the primary sequence to be spatially proximal. These DNA contacts have been found to be related to various molecular activities. Previous methods for analyzing DNA contact maps obtained from Hi-C experiments have largely focused on studying individual interactions, forming spatial clusters composed of contiguous blocks of genomic locations, or classifying these clusters into general categories based on some global properties of the contact maps. Results: Here, we describe a novel computational method that can flexibly identify small clusters of spatially proximal genomic regions based on their local contact patterns. Using simulated data that highly resemble Hi-C data obtained from real genome structures, we demonstrate that our method identifies spatial clusters that are more compact than methods previously used for clustering genomic regions based on DNA contact maps. The clusters identified by our method enable us to confirm functionally related genomic regions previously reported to be spatially proximal in different species. We further show that each genomic region can be assigned a numeric affinity value that indicates its degree of participation in each local cluster, and these affinity values correlate quantitatively with DNase I hypersensitivity, gene expression, super enhancer activities and replication timing in a cell type specific manner. We also show that these cluster affinity values can precisely define boundaries of reported topologically associating domains, and further define local sub-domains within each domain. Availability and implementation: The source code of BNMF and tutorials on how to use the software to extract local clusters from contact maps are available at http://yiplab.cse.cuhk.edu.hk/bnmf/. Contact: kevinyip@cse.cuhk.edu.hk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27307607

  9. 30 CFR 250.177 - What additional requirements may the Regional Supervisor order for a suspension?

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... Supervisor may require you to: (a) Conduct a site-specific study. (1) The Regional Supervisor must approve or prescribe the scope for any site-specific study that you perform. (2) The study must evaluate the cause of... study unless you request, and the Regional Supervisor agrees to arrange, payment by another party....

  10. 30 CFR 250.177 - What additional requirements may the Regional Supervisor order for a suspension?

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... Supervisor may require you to: (a) Conduct a site-specific study. (1) The Regional Supervisor must approve or prescribe the scope for any site-specific study that you perform. (2) The study must evaluate the cause of... study unless you request, and the Regional Supervisor agrees to arrange, payment by another party....

  11. 30 CFR 250.177 - What additional requirements may the Regional Supervisor order for a suspension?

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... Supervisor may require you to: (a) Conduct a site-specific study. (1) The Regional Supervisor must approve or prescribe the scope for any site-specific study that you perform. (2) The study must evaluate the cause of... study unless you request, and the Regional Supervisor agrees to arrange, payment by another party....

  12. Value addition of wild apricot fruits grown in North-West Himalayan regions-a review.

    PubMed

    Sharma, Rakesh; Gupta, Anil; Abrol, G S; Joshi, V K

    2014-11-01

    Wild apricot (Prunus armeniaca L.) commonly known as chulli is a potential fruit widely distributed in North-West Himalayan regions of the world. The fruits are good source of carbohydrates, vitamins, minerals besides having attractive colour and typical flavour. Unlike table purpose varieties of apricots like New Castle, the fruits of wild apricot are unsuitable for fresh consumption because of its high acid and low sugar content. However, the fruits are traditionally utilized for open sun drying, pulping to prepare different products such as jams, chutney and naturally fermented and distilled liquor. But, scientific literature on processing and value addition of wild apricot is scanty. Preparation of jam with 25 % wild apricot +75 % apple showed maximum score for organoleptic characteristics due to better taste and colour. Osmotic dehydration has been found as a suitable method for drying of wild type acidic apricots. A good quality sauce using wild apricot pulp and tomato pulp in the ratio of 1:1 has been prepared, while chutney of good acceptability prepared from wild apricot pulp (100 %) has also been documented. Preparation of apricot-soy protein enriched products like apricot-soya leather, toffee and fruit bars has been reported, which are reported to meet the protein requirements of adult and children as per the recommendations of ICMR. Besides these processed products, preparation of alcoholic beverages like wine, vermouth and brandy from wild apricot fruits has also been reported by various researchers. Further, after utilization of pulp for preparation of value added products, the stones left over have been successfully utilized for oil extraction which has medicinal and cosmetic value. The traditional method of oil extraction has been reported to be unhygienic and result in low oil yield with poor quality, whereas improved mechanical method of oil extraction has been found to produce good quality oil. The apricot kernel oil and press cake have

  13. Value addition of wild apricot fruits grown in North-West Himalayan regions-a review.

    PubMed

    Sharma, Rakesh; Gupta, Anil; Abrol, G S; Joshi, V K

    2014-11-01

    Wild apricot (Prunus armeniaca L.) commonly known as chulli is a potential fruit widely distributed in North-West Himalayan regions of the world. The fruits are good source of carbohydrates, vitamins, minerals besides having attractive colour and typical flavour. Unlike table purpose varieties of apricots like New Castle, the fruits of wild apricot are unsuitable for fresh consumption because of its high acid and low sugar content. However, the fruits are traditionally utilized for open sun drying, pulping to prepare different products such as jams, chutney and naturally fermented and distilled liquor. But, scientific literature on processing and value addition of wild apricot is scanty. Preparation of jam with 25 % wild apricot +75 % apple showed maximum score for organoleptic characteristics due to better taste and colour. Osmotic dehydration has been found as a suitable method for drying of wild type acidic apricots. A good quality sauce using wild apricot pulp and tomato pulp in the ratio of 1:1 has been prepared, while chutney of good acceptability prepared from wild apricot pulp (100 %) has also been documented. Preparation of apricot-soy protein enriched products like apricot-soya leather, toffee and fruit bars has been reported, which are reported to meet the protein requirements of adult and children as per the recommendations of ICMR. Besides these processed products, preparation of alcoholic beverages like wine, vermouth and brandy from wild apricot fruits has also been reported by various researchers. Further, after utilization of pulp for preparation of value added products, the stones left over have been successfully utilized for oil extraction which has medicinal and cosmetic value. The traditional method of oil extraction has been reported to be unhygienic and result in low oil yield with poor quality, whereas improved mechanical method of oil extraction has been found to produce good quality oil. The apricot kernel oil and press cake have

  14. TESTING FOR ADDITIVITY IN THE LOW DOSE REGION OF AN ENVIRONMENTALLY RELEVANT MIXTURE OF 18 OLYHALOGENATED AROMATIC HYDROCARBONS.

    EPA Science Inventory

    A common default assumption in risk assessment of chemical mixtures is that the chemicals combine additively in the low dose region. Under additivity, with information from single chemical dose-response data, the risk associated with the mixture can be estimated. The objective ...

  15. Canine parvovirus host range is determined by the specific conformation of an additional region of the capsid.

    PubMed Central

    Parker, J S; Parrish, C R

    1997-01-01

    We analyzed a region of the capsid of canine parvovirus (CPV) which determines the ability of the virus to infect canine cells. This region is distinct from those previously shown to determine the canine host range differences between CPV and feline panleukopenia virus. It lies on a ridge of the threefold spike of the capsid and is comprised of five interacting loops from three capsid protein monomers. We analyzed 12 mutants of CPV which contained amino acid changes in two adjacent loops exposed on the surface of this region. Nine mutants infected and grew in feline cells but were restricted in replication in one or the other of two canine cell lines tested. Three other mutants whose genomes contain mutations which affect one probable interchain bond were nonviable and could not be propagated in either canine or feline cells, although the VP1 and VP2 proteins from those mutants produced empty capsids when expressed from a plasmid vector. Although wild-type and mutant capsids bound to canine and feline cells in similar amounts, infection or viral DNA replication was greatly reduced after inoculation of canine cells with most of the mutants. The viral genomes of two host range-restricted mutants and two nonviable mutants replicated to wild-type levels in both feline and canine cells upon transfection with plasmid clones. The capsids of wild-type CPV and two mutants were similar in susceptibility to heat inactivation, but one of those mutants and one other were more stable against urea denaturation. Most mutations in this structural region altered the ability of monoclonal antibodies to recognize epitopes within a major neutralizing antigenic site, and that site could be subdivided into a number of distinct epitopes. These results argue that a specific structure of this region is required for CPV to retain its canine host range. PMID:9371580

  16. DNA copy number analysis of fresh and formalin-fixed specimens by shallow whole-genome sequencing with identification and exclusion of problematic regions in the genome assembly

    PubMed Central

    Scheinin, Ilari; Sie, Daoud; Bengtsson, Henrik; van de Wiel, Mark A.; Olshen, Adam B.; van Thuijl, Hinke F.; van Essen, Hendrik F.; Eijk, Paul P.; Rustenburg, François; Meijer, Gerrit A.; Reijneveld, Jaap C.; Wesseling, Pieter; Pinkel, Daniel; Albertson, Donna G.

    2014-01-01

    Detection of DNA copy number aberrations by shallow whole-genome sequencing (WGS) faces many challenges, including lack of completion and errors in the human reference genome, repetitive sequences, polymorphisms, variable sample quality, and biases in the sequencing procedures. Formalin-fixed paraffin-embedded (FFPE) archival material, the analysis of which is important for studies of cancer, presents particular analytical difficulties due to degradation of the DNA and frequent lack of matched reference samples. We present a robust, cost-effective WGS method for DNA copy number analysis that addresses these challenges more successfully than currently available procedures. In practice, very useful profiles can be obtained with ∼0.1× genome coverage. We improve on previous methods by first implementing a combined correction for sequence mappability and GC content, and second, by applying this procedure to sequence data from the 1000 Genomes Project in order to develop a blacklist of problematic genome regions. A small subset of these blacklisted regions was previously identified by ENCODE, but the vast majority are novel unappreciated problematic regions. Our procedures are implemented in a pipeline called QDNAseq. We have analyzed over 1000 samples, most of which were obtained from the fixed tissue archives of more than 25 institutions. We demonstrate that for most samples our sequencing and analysis procedures yield genome profiles with noise levels near the statistical limit imposed by read counting. The described procedures also provide better correction of artifacts introduced by low DNA quality than prior approaches and better copy number data than high-resolution microarrays at a substantially lower cost. PMID:25236618

  17. DNA copy number analysis of fresh and formalin-fixed specimens by shallow whole-genome sequencing with identification and exclusion of problematic regions in the genome assembly.

    PubMed

    Scheinin, Ilari; Sie, Daoud; Bengtsson, Henrik; van de Wiel, Mark A; Olshen, Adam B; van Thuijl, Hinke F; van Essen, Hendrik F; Eijk, Paul P; Rustenburg, François; Meijer, Gerrit A; Reijneveld, Jaap C; Wesseling, Pieter; Pinkel, Daniel; Albertson, Donna G; Ylstra, Bauke

    2014-12-01

    Detection of DNA copy number aberrations by shallow whole-genome sequencing (WGS) faces many challenges, including lack of completion and errors in the human reference genome, repetitive sequences, polymorphisms, variable sample quality, and biases in the sequencing procedures. Formalin-fixed paraffin-embedded (FFPE) archival material, the analysis of which is important for studies of cancer, presents particular analytical difficulties due to degradation of the DNA and frequent lack of matched reference samples. We present a robust, cost-effective WGS method for DNA copy number analysis that addresses these challenges more successfully than currently available procedures. In practice, very useful profiles can be obtained with ∼0.1× genome coverage. We improve on previous methods by first implementing a combined correction for sequence mappability and GC content, and second, by applying this procedure to sequence data from the 1000 Genomes Project in order to develop a blacklist of problematic genome regions. A small subset of these blacklisted regions was previously identified by ENCODE, but the vast majority are novel unappreciated problematic regions. Our procedures are implemented in a pipeline called QDNAseq. We have analyzed over 1000 samples, most of which were obtained from the fixed tissue archives of more than 25 institutions. We demonstrate that for most samples our sequencing and analysis procedures yield genome profiles with noise levels near the statistical limit imposed by read counting. The described procedures also provide better correction of artifacts introduced by low DNA quality than prior approaches and better copy number data than high-resolution microarrays at a substantially lower cost.

  18. Genomic regions of neurovirulence and attenuation in Theiler murine encephalomyelitis virus.

    PubMed Central

    Calenoff, M A; Faaberg, K S; Lipton, H L

    1990-01-01

    Full-length cDNA clones of two Theiler murine encephalomyelitis virus (TMEV) strains, one highly virulent and the other less virulent, were constructed in the bacterial plasmid pGEMR-3. Transfection of BHK-21 cells with RNA transcribed from these cDNAs yielded progeny viruses with the exact in vitro growth phenotype and mouse neurovirulence pattern of the respective parental virus strains. RNA transcripts derived from recombinant chimeras constructed by exchanging corresponding genomic regions [5' noncoding, leader/P1 (L/P1), P2, P3, and 3' noncoding] between the parental cDNAs were infectious and enabled analysis of the growth characteristics in vitro and mouse neurovirulence of the chimeras. A correlation was found between plaque size and temperature sensitivity and the origin of the L/P1 region. Neurovirulence mapped primarily to the L/P1 region encoding the leader and coat proteins. Depending on parental origin, the 5' noncoding region either influenced virus attenuation or augmented virulence. Images PMID:2153981

  19. Core and region-enriched networks of behaviorally regulated genes and the singing genome.

    PubMed

    Whitney, Osceola; Pfenning, Andreas R; Howard, Jason T; Blatti, Charles A; Liu, Fang; Ward, James M; Wang, Rui; Audet, Jean-Nicoles; Kellis, Manolis; Mukherjee, Sayan; Sinha, Saurabh; Hartemink, Alexander J; West, Anne E; Jarvis, Erich D

    2014-12-12

    Songbirds represent an important model organism for elucidating molecular mechanisms that link genes with complex behaviors, in part because they have discrete vocal learning circuits that have parallels with those that mediate human speech. We found that ~10% of the genes in the avian genome were regulated by singing, and we found a striking regional diversity of both basal and singing-induced programs in the four key song nuclei of the zebra finch, a vocal learning songbird. The region-enriched patterns were a result of distinct combinations of region-enriched transcription factors (TFs), their binding motifs, and presinging acetylation of histone 3 at lysine 27 (H3K27ac) enhancer activity in the regulatory regions of the associated genes. RNA interference manipulations validated the role of the calcium-response transcription factor (CaRF) in regulating genes preferentially expressed in specific song nuclei in response to singing. Thus, differential combinatorial binding of a small group of activity-regulated TFs and predefined epigenetic enhancer activity influences the anatomical diversity of behaviorally regulated gene networks.

  20. Mitochondrial Genome Analyses Suggest Multiple Trichuris Species in Humans, Baboons, and Pigs from Different Geographical Regions

    PubMed Central

    Hawash, Mohamed B. F.; Andersen, Lee O.; Gasser, Robin B.; Stensvold, Christen Rune; Nejsum, Peter

    2015-01-01

    Background The whipworms Trichuris trichiura and Trichuris suis are two parasitic nematodes of humans and pigs, respectively. Although whipworms in human and non-human primates historically have been referred to as T. trichiura, recent reports suggest that several Trichuris spp. are found in primates. Methods and Findings We sequenced and annotated complete mitochondrial genomes of Trichuris recovered from a human in Uganda, an olive baboon in the US, a hamadryas baboon in Denmark, and two pigs from Denmark and Uganda. Comparative analyses using other published mitochondrial genomes of Trichuris recovered from a human and a porcine host in China and from a françois’ leaf-monkey (China) were performed, including phylogenetic analyses and pairwise genetic and amino acid distances. Genetic and protein distances between human Trichuris in Uganda and China were high (~19% and 15%, respectively) suggesting that they represented different species. Trichuris from the olive baboon in US was genetically related to human Trichuris in China, while the other from the hamadryas baboon in Denmark was nearly identical to human Trichuris from Uganda. Baboon-derived Trichuris was genetically distinct from Trichuris from françois’ leaf monkey, suggesting multiple whipworm species circulating among non-human primates. The genetic and protein distances between pig Trichuris from Denmark and other regions were roughly 9% and 6%, respectively, while Chinese and Ugandan whipworms were more closely related. Conclusion and Significance Our results indicate that Trichuris species infecting humans and pigs are phylogenetically distinct across geographical regions, which might have important implications for the implementation of suitable and effective control strategies in different regions. Moreover, we provide support for the hypothesis that Trichuris infecting primates represents a complex of cryptic species with some species being able to infect both humans and non-human primates

  1. Narrowing down the apricot Plum pox virus resistance locus and comparative analysis with the peach genome syntenic region.

    PubMed

    Vera Ruiz, Elsa María; Soriano, José Miguel; Romero, Carlos; Zhebentyayeva, Tetyana; Terol, Javier; Zuriaga, Elena; Llácer, Gerardo; Abbott, Albert Glenn; Badenes, María Luisa

    2011-08-01

    Sharka disease, caused by the Plum pox virus (PPV), is one of the main limiting factors for stone fruit crops worldwide. Only a few resistance sources have been found in apricot (Prunus armeniaca L.), and most studies have located a major PPV resistance locus (PPVres) on linkage group 1 (LG1). However, the mapping accuracy was not sufficiently reliable and PPVres was predicted within a low confidence interval. In this study, we have constructed two high-density simple sequence repeat (SSR) improved maps with 0.70 and 0.68 markers/cm, corresponding to LG1 of 'Lito' and 'Goldrich' PPV-resistant cultivars, respectively. Using these maps, and excluding genotype-phenotype incongruent individuals, a new binary trait locus (BTL) analysis for PPV resistance was performed, narrowing down the PPVres support intervals to 7.3 and 5.9 cm in 'Lito' and 'Goldrich', respectively. Subsequently, 71 overlapping oligonucleotides (overgo) probes were hybridized against an apricot bacterial artificial chromosome (BAC) library, identifying 870 single BACs from which 340 were anchored onto a map region of approximately 30-40 cm encompassing PPVres. Partial BAC contigs assigned to the two allelic haplotypes (resistant/susceptible) of the PPVres locus were built by high-information content fingerprinting (HICF). In addition, a total of 300 BAC-derived sequences were obtained, and 257 showed significant homology with the peach genome scaffold_1 corresponding to LG1. According to the peach syntenic genome sequence, PPVres was predicted within a region of 2.16 Mb in which a few candidate resistance genes were identified.

  2. A Spontaneous Deletion of the US1.67/US2 Genomic Region on the Bovine Herpesvirus 1 Strain Cooper

    PubMed Central

    Campos, F. S.; Paim, W. P.; Silva, A. G.; Santos, R. N.; Firpo, R. M.; Scheffer, C. M.; Finoketti, F.; Franco, A. C.

    2016-01-01

    Bovine herpesvirus 1 (BoHV-1) is an alphaherpesvirus with a genome of 135 kb. Some BoHV-1 genes are nonessential and may be deleted from the viral genome. Here, a spontaneous gene deletion was identified in the BoHV-1 strain Cooper. Genes of the US1.67/US2 region were absent, as determined by next-generation sequencing. PMID:26847888

  3. Infection of mice by Salmonella enterica serovar Enteritidis involves additional genes that are absent in the genome of serovar Typhimurium.

    PubMed

    Silva, Cecilia A; Blondel, Carlos J; Quezada, Carolina P; Porwollik, Steffen; Andrews-Polymenis, Helene L; Toro, Cecilia S; Zaldívar, Mercedes; Contreras, Inés; McClelland, Michael; Santiviago, Carlos A

    2012-02-01

    Salmonella enterica serovar Enteritidis causes a systemic, typhoid-like infection in newly hatched poultry and mice. In the present study, a library of 54,000 transposon mutants of S. Enteritidis phage type 4 (PT4) strain P125109 was screened for mutants deficient in the in vivo colonization of the BALB/c mouse model using a microarray-based negative-selection screening. Mutants in genes known to contribute to systemic infection (e.g., Salmonella pathogenicity island 2 [SPI-2], aro, rfa, rfb, phoP, and phoQ) and enteric infection (e.g., SPI-1 and SPI-5) in this and other Salmonella serovars displayed colonization defects in our assay. In addition, a strong attenuation was observed for mutants in genes and genomic islands that are not present in S. Typhimurium or in most other Salmonella serovars. These genes include a type I restriction/modification system (SEN4290 to SEN4292), the peg fimbrial operon (SEN2144A to SEN2145B), a putative pathogenicity island (SEN1970 to SEN1999), and a type VI secretion system remnant SEN1001, encoding a hypothetical protein containing a lysin motif (LysM) domain associated with peptidoglycan binding. Proliferation defects for mutants in these individual genes and in exemplar genes for each of these clusters were confirmed in competitive infections with wild-type S. Enteritidis. A ΔSEN1001 mutant was defective for survival within RAW264.7 murine macrophages in vitro. Complementation assays directly linked the SEN1001 gene to phenotypes observed in vivo and in vitro. The genes identified here may perform novel virulence functions not characterized in previous Salmonella models.

  4. Dynamic chromatin environment of key lytic cycle regulatory regions of the Epstein-Barr virus genome.

    PubMed

    Ramasubramanyan, Sharada; Osborn, Kay; Flower, Kirsty; Sinclair, Alison J

    2012-02-01

    The ability of Epstein-Barr virus (EBV) to establish latency allows it to evade the immune system and to persist for the lifetime of its host; one distinguishing characteristic is the lack of transcription of the majority of viral genes. Entry into the lytic cycle is coordinated by the viral transcription factor, Zta (BZLF1, ZEBRA, and EB1), and downstream effectors, while viral genome replication requires the concerted action of Zta and six other viral proteins at the origins of lytic replication. We explored the chromatin context at key EBV lytic cycle promoters (BZLF1, BRLF1, BMRF1, and BALF5) and the origins of lytic replication during latency and lytic replication. We show that a repressive heterochromatin-like environment (trimethylation of histone H3 at lysine 9 [H3K9me3] and lysine 27 [H3K27me3]), which blocks the interaction of some transcription factors with DNA, encompasses the key early lytic regulatory regions. Epigenetic silencing of the EBV genome is also imposed by DNA methylation during latency. The chromatin environment changes during the lytic cycle with activation of histones H3, H4, and H2AX occurring at both the origins of replication and at the key lytic regulatory elements. We propose that Zta is able to reverse the effects of latency-associated repressive chromatin at EBV early lytic promoters by interacting with Zta response elements within the H3K9me3-associated chromatin and demonstrate that these interactions occur in vivo. Since the interaction of Zta with DNA is not inhibited by DNA methylation, it is clear that Zta uses two routes to overcome epigenetic silencing of its genome. PMID:22090141

  5. Dynamic chromatin environment of key lytic cycle regulatory regions of the Epstein-Barr virus genome.

    PubMed

    Ramasubramanyan, Sharada; Osborn, Kay; Flower, Kirsty; Sinclair, Alison J

    2012-02-01

    The ability of Epstein-Barr virus (EBV) to establish latency allows it to evade the immune system and to persist for the lifetime of its host; one distinguishing characteristic is the lack of transcription of the majority of viral genes. Entry into the lytic cycle is coordinated by the viral transcription factor, Zta (BZLF1, ZEBRA, and EB1), and downstream effectors, while viral genome replication requires the concerted action of Zta and six other viral proteins at the origins of lytic replication. We explored the chromatin context at key EBV lytic cycle promoters (BZLF1, BRLF1, BMRF1, and BALF5) and the origins of lytic replication during latency and lytic replication. We show that a repressive heterochromatin-like environment (trimethylation of histone H3 at lysine 9 [H3K9me3] and lysine 27 [H3K27me3]), which blocks the interaction of some transcription factors with DNA, encompasses the key early lytic regulatory regions. Epigenetic silencing of the EBV genome is also imposed by DNA methylation during latency. The chromatin environment changes during the lytic cycle with activation of histones H3, H4, and H2AX occurring at both the origins of replication and at the key lytic regulatory elements. We propose that Zta is able to reverse the effects of latency-associated repressive chromatin at EBV early lytic promoters by interacting with Zta response elements within the H3K9me3-associated chromatin and demonstrate that these interactions occur in vivo. Since the interaction of Zta with DNA is not inhibited by DNA methylation, it is clear that Zta uses two routes to overcome epigenetic silencing of its genome.

  6. Structure and transcription of an immediate-early region in the human herpesvirus 6 genome.

    PubMed Central

    Schiewe, U; Neipel, F; Schreiner, D; Fleckenstein, B

    1994-01-01

    The unique segment of the human herpesvirus 6 (HHV-6) genome is essentially collinear to the unique long DNA segment of another betaherpesvirus, the human cytomegalovirus (HCMV). However, the HHV-6 genomic section that is analogous in position to the major immediate-early (IE) locus of HCMV does not exhibit recognizable sequence homologies. The respective HHV-6 region of 5.5 kbp is flanked on one side by 25 to 28 incomplete tandem repeats of 105 to 110 bp that contain, with one exception, a single KpnI restriction site (KpnI repeats). About 250 reiterations of the sequence motif CACATA are located on the other end. We identified two open reading frames of 375 and 2,595 nucleotides, respectively, on one strand. Strand-specific Northern blot analyses with RNA harvested from HHV-6 (strain U1102)-infected HSB-2 cells or cord blood lymphocytes revealed two transcripts of about 3.5 and 4.7 kb in the corresponding orientation. Sequence analyses of the respective cDNA clones and primer extension experiments were used to map the mRNAs. The two transcripts are coterminal and multiply spliced and code for the same putative 104.6-kDa protein, but they are initiated from different promoters. Characterization of smaller cDNA clones and Northern blotting with other strand-specific probes showed that singly spliced mRNAs of 1.0 and 1.5 kb are transcribed from the opposite strand; they could code for a 17.2-kDa polypeptide. Blocking experiments with cycloheximide led to the conclusion that only the 3.5-kb mRNA is synthesized in the absence of protein biosynthesis upon infection with cell-free virus. This identifies a single IE gene of HHV-6 at the genomic position corresponding to the major IE region of HCMV, although the coding content and transcriptional regulation are quite different for these two herpesvirus IE regions. Images PMID:8151768

  7. Characterizing regions in the human genome unmappable by next-generation-sequencing at the read length of 1000 bases.

    PubMed

    Li, Wentian; Freudenberg, Jan

    2014-12-01

    Repetitive and redundant regions of a genome are particularly problematic for mapping sequencing reads. In the present paper, we compile a list of the unmappable regions in the human genome based on the following definition: hypothetical reads with length 1 kb which cannot be uniquely mapped with zero-mismatch alignment for the described regions, considering both the forward and reverse strand. The respective collection of unmappable regions covers 0.77% of the sequence of human autosomes and 8.25% of the sex chromosomes in the reference genome GRCh37/hg19 (overall 1.23%). Not surprisingly, our unmappable regions overlap greatly with segmental duplication, transposable elements, and structural variants. About 99.8% of bases in our unmappable regions are part of either segmental duplication or transposable elements and 98.3% overlap structural variant annotations. Notably, some of these regions overlap units with important biological functions, including 4% of protein-coding genes. In contrast, these regions have zero intersection with the ultraconserved elements, very low overlap with microRNAs, tRNAs, pseudogenes, CpG islands, tandem repeats, microsatellites, sensitive non-coding regions, and the mapping blacklist regions from the ENCODE project.

  8. Structure-infectivity analysis of the human rhinovirus genomic RNA 3' non-coding region.

    PubMed Central

    Todd, S; Semler, B L

    1996-01-01

    The specific recognition of genomic positive strand RNAS as templates for the synthesis of intermediate negative strands by the picornavirus replication machinery is presumably mediated by cis-acting sequences within the genomic RNA 3' non-coding region (NCR). A structure-infectivity analysis was conducted on the 44 nt human rhinovirus 14 (HRV14) 3' NCR to identify the primary sequence and/or secondary structure determinants required for viral replication. Using biochemical RNA secondary structure probing techniques, we have demonstrated the existence of a single stem-loop structure contained entirely within the 3' NCR, which appears to be phylogenetically conserved within the rhinovirus genus. We also report the in vivo analysis of a number of 3' NCR deletion mutations engineered into infectious cDNA clones which were designed to disrupt the stem-loop secondary structure to varying degrees. Large deletions (up to 37 nt) resulted in defective growth phenotypes, although they were not lethal. We propose that the absolute requirements for initiation of negative strand synthesis are less stringent than previously postulated, even though defined RNA secondary structure determinants may have evolved to facilitate and/or regulate the process of viral RNA replication. PMID:8668546

  9. A comparison of the molecular organization of genomic regions associated with resistance to common bacterial blight in two Phaseolus vulgaris genotypes.

    PubMed

    Perry, Gregory; Dinatale, Claudia; Xie, Weilong; Navabi, Alireza; Reinprecht, Yarmilla; Crosby, William; Yu, Kangfu; Shi, Chun; Pauls, K Peter

    2013-01-01

    Resistance to common bacterial blight, caused by Xanthomonas axonopodis pv. phaseoli, in Phaseolus vulgaris is conditioned by several loci on different chromosomes. Previous studies with OAC-Rex, a CBB-resistant, white bean variety of Mesoamerican origin, identified two resistance loci associated with the molecular markers Pv-CTT001 and SU91, on chromosome 4 and 8, respectively. Resistance to CBB is assumed to be derived from an interspecific cross with Phaseolus acutifolius in the pedigree of OAC-Rex. Our current whole genome sequencing effort with OAC-Rex provided the opportunity to compare its genome in the regions associated with CBB resistance with the v1.0 release of the P. vulgaris line G19833, which is a large seeded bean of Andean origin, and (assumed to be) CBB susceptible. In addition, the genomic regions containing SAP6, a marker associated with P. vulgaris-derived CBB-resistance on chromosome 10, were compared. These analyses indicated that gene content was highly conserved between G19833 and OAC-Rex across the regions examined (>80%). However, fifty-nine genes unique to OAC Rex were identified, with resistance gene homologues making up the largest category (10 genes identified). Two unique genes in OAC-Rex located within the SU91 resistance QTL have homology to P. acutifolius ESTs and may be potential sources of CBB resistance. As the genomic sequence assembly of OAC-Rex is completed, we expect that further comparisons between it and the G19833 genome will lead to a greater understanding of CBB resistance in bean. PMID:24009615

  10. Assigning Brassica microsatellite markers to the nine C-genome chromosomes using Brassica rapa var. trilocularis-B. oleracea var. alboglabra monosomic alien addition lines.

    PubMed

    Geleta, Mulatu; Heneen, Waheeb K; Stoute, Andrew I; Muttucumaru, Nira; Scott, Roderick J; King, Graham J; Kurup, Smita; Bryngelsson, Tomas

    2012-08-01

    Brassica rapa var. trilocularis-B. oleracea var. alboglabra monosomic alien addition lines (MAALs) were used to assign simple sequence repeat (SSR) markers to the nine C-genome chromosomes. A total of 64 SSR markers specific to single C-chromosomes were identified. The number of specific markers for each chromosome varied from two (C3) to ten (C4, C7 and C9), where the designation of the chromosomes was according to Cheng et al. (Genome 38:313-319, 1995). Seventeen additional SSRs, which were duplicated on 2-5 C-chromosomes, were also identified. Using the SSR markers assigned to the previously developed eight MAALs and recently obtained aneuploid plants, a new Brassica rapa-B. oleracea var. alboglabra MAAL carrying the alien chromosome C7 was identified and developed. The application of reported genetically mapped SSR markers on the nine MAALs contributed to the determination of the correspondence between numerical C-genome cytological (Cheng et al. in Genome 38:313-319, 1995) and linkage group designations. This correspondence facilitates the integration of C-genome genetic information that has been generated based on the two designation systems and accordingly increases our knowledge about each chromosome. The present study is a significant contribution to genetic linkage analysis of SSR markers and important agronomic traits in B. oleracea and to the potential use of the MAALs in plant breeding.

  11. Genomic regions associated with the nitrogen limitation response revealed in a global wheat core collection.

    PubMed

    Bordes, Jacques; Ravel, C; Jaubertie, J P; Duperrier, B; Gardet, O; Heumez, E; Pissavy, A L; Charmet, G; Le Gouis, J; Balfourier, F

    2013-03-01

    Modern wheat (Triticum aestivum L.) varieties in Western Europe have mainly been bred, and selected in conditions where high levels of nitrogen-rich fertilizer are applied. However, high input crop management has greatly increased the risk of nitrates leaching into groundwater with negative impacts on the environment. To investigate wheat nitrogen tolerance characteristics that could be adapted to low input crop management, we supplied 196 accessions of a wheat core collection of old and modern cultivars with high or moderate amounts of nitrogen fertilizer in an experimental network consisting of three sites and 2 years. The main breeding traits were assessed including grain yield and grain protein content. The response to nitrogen level was estimated for grain yield and grain number per m(2) using both the difference and the ratio between performance at the two input levels and the slope of joint regression. A large variability was observed for all the traits studied and the response to nitrogen level. Whole genome association mapping was carried out using 899 molecular markers taking into account the five ancestral group structure of the collection. We identified 54 main regions involving almost all chromosomes that influence yield and its components, plant height, heading date and grain protein concentration. Twenty-three regions, including several genes, spread over 16 chromosomes were involved in the response to nitrogen level. These chromosomal regions may be good candidates to be used in breeding programs to improve the performance of wheat varieties at moderate nitrogen input levels.

  12. An essential secondary structure in the 3' untranslated region of the mouse hepatitis virus genome.

    PubMed

    Hsue, B; Masters, P S

    1998-01-01

    The 3' untranslated regions (3' UTRs) of coronaviruses contain the signals necessary for negative strand RNA synthesis and may also harbor elements essential for positive strand replication and subgenomic RNA transcription. The 3' UTRs of mouse hepatitis virus (MHV) and bovine coronavirus (BCV) are more than 30% divergent. In an effort to learn what parts of these regions might be functionally interchangeable, we attempted to replace the 3' UTR of MHV with its BCV counterpart by targeted RNA recombination. Initially, we tried to substitute the 3' 267 nucleotides (nt) of the 301 nt MHV 3' UTR with the corresponding region of the BCV 3' UTR. This exchange did not yield viable recombinant viruses, and the donor DI RNA was shown to be unable to replicate with MHV as a helper virus. Subsequent analysis revealed that the entire BCV 3' UTR could be inserted into the MHV genome in place of the entire MHV 3' UTR. It resulted that the failure of the initial attempted substitution was due to the inadvertent disruption of an essential conserved bulged stem-loop secondary structure in the MHV and BCV 3' URTs immediately downstream of the N gene stop codon.

  13. siRNA Targeting the 2Apro Genomic Region Prevents Enterovirus 71 Replication In Vitro.

    PubMed

    Liu, Haibing; Qin, Yanyan; Kong, Zhenzhen; Shao, Qixiang; Su, Zhaoliang; Wang, Shengjun; Chen, Jianguo

    2016-01-01

    Enterovirus 71 (EV71) is the most important etiological agent of hand, foot, and mouth disease (HFMD) in young children, which is associated with severe neurological complications and has caused significant mortalities in recent HFMD outbreaks in Asia. However, there is no effective antiviral therapy against EV71. In this study, RNA interference (RNAi) was used as an antiviral strategy to inhibit EV71 replication. Three small interfering RNAs (siRNAs) targeting the 2Apro region of the EV71 genome were designed and synthesized. All the siRNAs were transfected individually into rhabdomyosarcoma (RD) cells, which were then infected with strain EV71-2006-52-9. The cytopathic effects (CPEs) in the infected RD cells, cell viability, viral titer, and viral RNA and protein expression were examined to evaluate the specific viral inhibition by the siRNAs. The results of cytopathogenicity and MTT tests indicated that the RD cells transfected with the three siRNAs showed slight CPEs and significantly high viability. The 50% tissue culture infective dose (TCID50) values demonstrated that the viral titer of the groups treated with three siRNAs were lower than those of the control groups. qRT-PCR and western blotting revealed that the levels of viral RNA and protein in the RD cells treated with the three siRNAs were lower than those in the controls. When RD cells transfected with siRNAs were also infected with strain EV71-2008-43-16, the expression of the VP1 protein was significantly inhibited. The levels of interferon α (IFN-α) and IFN-β did not differ significantly in any group. These results suggest that siRNAs targeting the 2Apro region of the EV71 genome exerted antiviral effects in vitro.

  14. siRNA Targeting the 2Apro Genomic Region Prevents Enterovirus 71 Replication In Vitro

    PubMed Central

    Kong, Zhenzhen; Shao, Qixiang; Su, Zhaoliang; Wang, Shengjun; Chen, Jianguo

    2016-01-01

    Enterovirus 71 (EV71) is the most important etiological agent of hand, foot, and mouth disease (HFMD) in young children, which is associated with severe neurological complications and has caused significant mortalities in recent HFMD outbreaks in Asia. However, there is no effective antiviral therapy against EV71. In this study, RNA interference (RNAi) was used as an antiviral strategy to inhibit EV71 replication. Three small interfering RNAs (siRNAs) targeting the 2Apro region of the EV71 genome were designed and synthesized. All the siRNAs were transfected individually into rhabdomyosarcoma (RD) cells, which were then infected with strain EV71-2006-52-9. The cytopathic effects (CPEs) in the infected RD cells, cell viability, viral titer, and viral RNA and protein expression were examined to evaluate the specific viral inhibition by the siRNAs. The results of cytopathogenicity and MTT tests indicated that the RD cells transfected with the three siRNAs showed slight CPEs and significantly high viability. The 50% tissue culture infective dose (TCID50) values demonstrated that the viral titer of the groups treated with three siRNAs were lower than those of the control groups. qRT–PCR and western blotting revealed that the levels of viral RNA and protein in the RD cells treated with the three siRNAs were lower than those in the controls. When RD cells transfected with siRNAs were also infected with strain EV71-2008-43-16, the expression of the VP1 protein was significantly inhibited. The levels of interferon α (IFN-α) and IFN-β did not differ significantly in any group. These results suggest that siRNAs targeting the 2Apro region of the EV71 genome exerted antiviral effects in vitro. PMID:26886455

  15. Genome-Wide Association Study Identified a Narrow Chromosome 1 Region Associated with Chicken Growth Traits

    PubMed Central

    Zhang, Chengguang; Zhang, Rong; Tang, Jun; Nie, Qinghua; Ma, Li; Hu, Xiaoxiang; Li, Ning; Da, Yang; Zhang, Xiquan

    2012-01-01

    Chicken growth traits are important economic traits in broilers. A large number of studies are available on finding genetic factors affecting chicken growth. However, most of these studies identified chromosome regions containing putative quantitative trait loci and finding causal mutations is still a challenge. In this genome-wide association study (GWAS), we identified a narrow 1.5 Mb region (173.5–175 Mb) of chicken (Gallus gallus) chromosome (GGA) 1 to be strongly associated with chicken growth using 47,678 SNPs and 489 F2 chickens. The growth traits included aggregate body weight (BW) at 0–90 d of age measured weekly, biweekly average daily gains (ADG) derived from weekly body weight, and breast muscle weight (BMW), leg muscle weight (LMW) and wing weight (WW) at 90 d of age. Five SNPs in the 1.5 Mb KPNA3-FOXO1A region at GGA1 had the highest significant effects for all growth traits in this study, including a SNP at 8.9 Kb upstream of FOXO1A for BW at 22–48 d and 70 d, a SNP at 1.9 Kb downstream of FOXO1A for WW, a SNP at 20.9 Kb downstream of ENSGALG00000022732 for ADG at 29–42 d, a SNP in INTS6 for BW at 90 d, and a SNP in KPNA3 for BMW and LMW. The 1.5 Mb KPNA3-FOXO1A region contained two microRNA genes that could bind to messenger ribonucleic acid (mRNA) of IGF1, FOXO1A and KPNA3. It was further indicated that the 1.5 Mb GGA1 region had the strongest effects on chicken growth during 22–42 d. PMID:22359555

  16. Differential DNA Methylation Regions in Cytokine and Transcription Factor Genomic Loci Associate with Childhood Physical Aggression

    PubMed Central

    Provençal, Nadine; Suderman, Matthew J.; Caramaschi, Doretta; Wang, Dongsha; Hallett, Michael; Vitaro, Frank

    2013-01-01

    Background Animal and human studies suggest that inflammation is associated with behavioral disorders including aggression. We have recently shown that physical aggression of boys during childhood is strongly associated with reduced plasma levels of cytokines IL-1α, IL-4, IL-6, IL-8 and IL-10, later in early adulthood. This study tests the hypothesis that there is an association between differential DNA methylation regions in cytokine genes in T cells and monocytes DNA in adult subjects and a trajectory of physical aggression from childhood to adolescence. Methodology/Principal Findings We compared the methylation profiles of the entire genomic loci encompassing the IL-1α, IL-6, IL-4, IL-10 and IL-8 and three of their regulatory transcription factors (TF) NFkB1, NFAT5 and STAT6 genes in adult males on a chronic physical aggression trajectory (CPA) and males with the same background who followed a normal physical aggression trajectory (control group) from childhood to adolescence. We used the method of methylated DNA immunoprecipitation with comprehensive cytokine gene loci and TF loci microarray hybridization, statistical analysis and false discovery rate correction. We found differentially methylated regions to associate with CPA in both the cytokine loci as well as in their transcription factors loci analyzed. Some of these differentially methylated regions were located in known regulatory regions whereas others, to our knowledge, were previously unknown as regulatory areas. However, using the ENCODE database, we were able to identify key regulatory elements in many of these regions that indicate that they might be involved in the regulation of cytokine expression. Conclusions We provide here the first evidence for an association between differential DNA methylation in cytokines and their regulators in T cells and monocytes and male physical aggression. PMID:23977113

  17. Genome-Based Identification of Active Prophage Regions by Next Generation Sequencing in Bacillus licheniformis DSM13

    PubMed Central

    Hertel, Robert; Rodríguez, David Pintor; Hollensteiner, Jacqueline; Dietrich, Sascha; Leimbach, Andreas; Hoppert, Michael; Liesegang, Heiko; Volland, Sonja

    2015-01-01

    Prophages are viruses, which have integrated their genomes into the genome of a bacterial host. The status of the prophage genome can vary from fully intact with the potential to form infective particles to a remnant state where only a few phage genes persist. Prophages have impact on the properties of their host and are therefore of great interest for genomic research and strain design. Here we present a genome- and next generation sequencing (NGS)-based approach for identification and activity evaluation of prophage regions. Seven prophage or prophage-like regions were identified in the genome of Bacillus licheniformis DSM13. Six of these regions show similarity to members of the Siphoviridae phage family. The remaining region encodes the B. licheniformis orthologue of the PBSX prophage from Bacillus subtilis. Analysis of isolated phage particles (induced by mitomycin C) from the wild-type strain and prophage deletion mutant strains revealed activity of the prophage regions BLi_Pp2 (PBSX-like), BLi_Pp3 and BLi_Pp6. In contrast to BLi_Pp2 and BLi_Pp3, neither phage DNA nor phage particles of BLi_Pp6 could be visualized. However, the ability of prophage BLi_Pp6 to generate particles could be confirmed by sequencing of particle-protected DNA mapping to prophage locus BLi_Pp6. The introduced NGS-based approach allows the investigation of prophage regions and their ability to form particles. Our results show that this approach increases the sensitivity of prophage activity analysis and can complement more conventional approaches such as transmission electron microscopy (TEM). PMID:25811873

  18. Recombination within the apospory specific genomic region leads to the uncoupling of apomixis components in Cenchrus ciliaris.

    PubMed

    Conner, Joann A; Gunawan, Gunawati; Ozias-Akins, Peggy

    2013-07-01

    Apomixis enables the clonal propagation of maternal genotypes through seed. If apomixis could be harnessed via genetic engineering or introgression, it would have a major economic impact for agricultural crops. In the grass species Pennisetum squamulatum and Cenchrus ciliaris (syn. P. ciliare), apomixis is controlled by a single dominant "locus", the apospory-specific genomic region (ASGR). For P. squamulatum, 18 published sequenced characterized amplified region (SCAR) markers have been identified which always co-segregate with apospory. Six of these markers are conserved SCARs in the closely related species, C. ciliaris and co-segregate with the trait. A screen of progeny from a cross of sexual × apomictic C. ciliaris genotypes identified a plant, A8, retaining two of the six ASGR-linked SCAR markers. Additional and newly identified ASGR-linked markers were generated to help identify the extent of recombination within the ASGR. Based on analysis of missing markers, the A8 recombinant plant has lost a significant portion of the ASGR but continues to form aposporous embryo sacs. Seedlings produced from aposporous embryo sacs are 6× in ploidy level and hence the A8 recombinant does not express parthenogenesis. The recombinant A8 plant represents a step forward in reducing the complexity of the ASGR locus to determine the factor(s) required for aposporous embryo sac formation and documents the separation of expression of the two components of apomixis in C. ciliaris.

  19. Additional Routes to Staphylococcus aureus Daptomycin Resistance as Revealed by Comparative Genome Sequencing, Transcriptional Profiling, and Phenotypic Studies

    PubMed Central

    Song, Yang; Rubio, Aileen; Jayaswal, Radheshyam K.; Silverman, Jared A.; Wilkinson, Brian J.

    2013-01-01

    Daptomycin is an extensively used anti-staphylococcal agent due to the rise in methicillin-resistant Staphylococcus aureus, but the mechanism(s) of resistance is poorly understood. Comparative genome sequencing, transcriptomics, ultrastructure, and cell envelope studies were carried out on two relatively higher level (4 and 8 µg/ml−1) laboratory-derived daptomycin-resistant strains (strains CB1541 and CB1540 respectively) compared to their parent strain (CB1118; MW2). Several mutations were found in the strains. Both strains had the same mutations in the two-component system genes walK and agrA. In strain CB1540 mutations were also detected in the ribose phosphate pyrophosphokinase (prs) and polyribonucleotide nucleotidyltransferase genes (pnpA), a hypothetical protein gene, and in an intergenic region. In strain CB1541 there were mutations in clpP, an ATP-dependent protease, and two different hypothetical protein genes. The strain CB1540 transcriptome was characterized by upregulation of cap (capsule) operon genes, genes involved in the accumulation of the compatible solute glycine betaine, ure genes of the urease operon, and mscL encoding a mechanosensitive chanel. Downregulated genes included smpB, femAB and femH involved in the formation of the pentaglycine interpeptide bridge, genes involved in protein synthesis and fermentation, and spa encoding protein A. Genes altered in their expression common to both transcriptomes included some involved in glycine betaine accumulation, mscL, ure genes, femH, spa and smpB. However, the CB1541 transcriptome was further characterized by upregulation of various heat shock chaperone and protease genes, consistent with a mutation in clpP, and lytM and sceD. Both strains showed slow growth, and strongly decreased autolytic activity that appeared to be mainly due to decreased autolysin production. In contrast to previous common findings, we did not find any mutations in phospholipid biosynthesis genes, and it appears there

  20. Conserved microstructure of the Brassica B Genome of Brassica nigra in relation to homologous regions of Arabidopsis thaliana, B. rapa and B. oleracea

    PubMed Central

    2013-01-01

    Background The Brassica B genome is known to carry several important traits, yet there has been limited analyses of its underlying genome structure, especially in comparison to the closely related A and C genomes. A bacterial artificial chromosome (BAC) library of Brassica nigra was developed and screened with 17 genes from a 222 kb region of A. thaliana that had been well characterised in both the Brassica A and C genomes. Results Fingerprinting of 483 apparently non-redundant clones defined physical contigs for the corresponding regions in B. nigra. The target region is duplicated in A. thaliana and six homologous contigs were found in B. nigra resulting from the whole genome triplication event shared by the Brassiceae tribe. BACs representative of each region were sequenced to elucidate the level of microscale rearrangements across the Brassica species divide. Conclusions Although the B genome species separated from the A/C lineage some 6 Mya, comparisons between the three paleopolyploid Brassica genomes revealed extensive conservation of gene content and sequence identity. The level of fractionation or gene loss varied across genomes and genomic regions; however, the greatest loss of genes was observed to be common to all three genomes. One large-scale chromosomal rearrangement differentiated the B genome suggesting such events could contribute to the lack of recombination observed between B genome species and those of the closely related A/C lineage. PMID:23586706

  1. Novel Bioinformatics Method for Identification of Genome-Wide Non-Canonical Spliced Regions Using RNA-Seq Data

    PubMed Central

    Ziyar, Ahdad; Li, Philip; Wright, Zachary; Menon, Rajasree; Omenn, Gilbert S.; Cavalcoli, James D.; Kaufman, Randal J.; Sartor, Maureen A.

    2014-01-01

    Setting During endoplasmic reticulum (ER) stress, the endoribonuclease (RNase) Ire1α initiates removal of a 26 nt region from the mRNA encoding the transcription factor Xbp1 via an unconventional mechanism (atypically within the cytosol). This causes an open reading frame-shift that leads to altered transcriptional regulation of numerous downstream genes in response to ER stress as part of the unfolded protein response (UPR). Strikingly, other examples of targeted, unconventional splicing of short mRNA regions have yet to be reported. Objective Our goal was to develop an approach to identify non-canonical, possibly very short, splicing regions using RNA-Seq data and apply it to ER stress-induced Ire1α heterozygous and knockout mouse embryonic fibroblast (MEF) cell lines to identify additional Ire1α targets. Results We developed a bioinformatics approach called the Read-Split-Walk (RSW) pipeline, and evaluated it using two Ire1α heterozygous and two Ire1α-null samples. The 26 nt non-canonical splice site in Xbp1 was detected as the top hit by our RSW pipeline in heterozygous samples but not in the negative control Ire1α knockout samples. We compared the Xbp1 results from our approach with results using the alignment program BWA, Bowtie2, STAR, Exonerate and the Unix “grep” command. We then applied our RSW pipeline to RNA-Seq data from the SKBR3 human breast cancer cell line. RSW reported a large number of non-canonical spliced regions for 108 genes in chromosome 17, which were identified by an independent study. Conclusions We conclude that our RSW pipeline is a practical approach for identifying non-canonical splice junction sites on a genome-wide level. We demonstrate that our pipeline can detect novel splice sites in RNA-Seq data generated under similar conditions for multiple species, in our case mouse and human. PMID:24991935

  2. An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: The Adh region

    SciTech Connect

    Ashburner, M.; Misra, S.; Roote, J.; Lewis, S.E.; Blazej, R.; Davis, T.; Doyle, C.; Galle, R.; George, R.; Harris, N.; Hartzell, G.; Harvey, D.; Hong, L.; Houston, K.; Hoskins, R.; Johnson, G.; Martin, C.; Moshrefi, A.; Palazzolo, M.; Reese, M.G.; Spradling, A.; Tsang, G.; Wan, K.; Whitelaw, K.; Kimmel, B.; Celniker, S.; Rubin, G.M.

    1999-03-24

    A contiguous sequence of nearly 3 Mb from the genome of Drosophila melanogaster has been sequenced from a series of overlapping P1 and BAC clones. This region covers 69 chromosome polytene bands on chromosome arm 2L, including the genetically well-characterized

  3. Additional Support for Individuals with Intellectual Disabilities and Challenging Behaviors in Regions of Northwest Europe

    ERIC Educational Resources Information Center

    Lunenborg, Carolien B.; Nakken, H.; van der Meulen, B. F.; Ruijssenaars, A. J. J. M.

    2011-01-01

    One in 10 individuals with intellectual disability (ID) exhibits behaviors that represent a significant challenge to the people who support them. Parents and staff (such as direct support professionals) often are challenged in trying to find a solution to overcome or reduce these behaviors. A form of additional professional support (i.e., external…

  4. Additive and epistatic genome-wide association for growth and ultrasound scan measures of carcass-related traits in Brahman cattle.

    PubMed

    Ali, A A; Khatkar, M S; Kadarmideen, H N; Thomson, P C

    2015-04-01

    Genome-wide association studies are routinely used to identify genomic regions associated with traits of interest. However, this ignores an important class of genomic associations, that of epistatic interactions. A genome-wide interaction analysis between single nucleotide polymorphisms (SNPs) using highly dense markers can detect epistatic interactions, but is a difficult task due to multiple testing and computational demand. However, It is important for revealing complex trait heredity. This study considers analytical methods that detect statistical interactions between pairs of loci. We investigated a three-stage modelling procedure: (i) a model without the SNP to estimate the variance components; (ii) a model with the SNP using variance component estimates from (i), thus avoiding iteration; and (iii) using the significant SNPs from (ii) for genome-wide epistasis analysis. We fitted these three-stage models to field data for growth and ultrasound measures for subcutaneous fat thickness in Brahman cattle. The study demonstrated the usefulness of modelling epistasis in the analysis of complex traits as it revealed extra sources of genetic variation and identified potential candidate genes affecting the concentration of insulin-like growth factor-1 and ultrasound scan measure of fat depth traits. Information about epistasis can add to our understanding of the complex genetic networks that form the fundamental basis of biological systems. PMID:25754883

  5. Additive and epistatic genome-wide association for growth and ultrasound scan measures of carcass-related traits in Brahman cattle.

    PubMed

    Ali, A A; Khatkar, M S; Kadarmideen, H N; Thomson, P C

    2015-04-01

    Genome-wide association studies are routinely used to identify genomic regions associated with traits of interest. However, this ignores an important class of genomic associations, that of epistatic interactions. A genome-wide interaction analysis between single nucleotide polymorphisms (SNPs) using highly dense markers can detect epistatic interactions, but is a difficult task due to multiple testing and computational demand. However, It is important for revealing complex trait heredity. This study considers analytical methods that detect statistical interactions between pairs of loci. We investigated a three-stage modelling procedure: (i) a model without the SNP to estimate the variance components; (ii) a model with the SNP using variance component estimates from (i), thus avoiding iteration; and (iii) using the significant SNPs from (ii) for genome-wide epistasis analysis. We fitted these three-stage models to field data for growth and ultrasound measures for subcutaneous fat thickness in Brahman cattle. The study demonstrated the usefulness of modelling epistasis in the analysis of complex traits as it revealed extra sources of genetic variation and identified potential candidate genes affecting the concentration of insulin-like growth factor-1 and ultrasound scan measure of fat depth traits. Information about epistasis can add to our understanding of the complex genetic networks that form the fundamental basis of biological systems.

  6. c-Fos: an AP-1 transcription factor with an additional cytoplasmic, non-genomic lipid synthesis activation capacity.

    PubMed

    Caputto, Beatriz L; Cardozo Gizzi, Andrés M; Gil, Germán A

    2014-09-01

    The mechanisms that co-ordinately activate lipid synthesis when high rates of membrane biogenesis are needed to support cell growth are largely unknown. c-Fos, a well known AP-1 transcription factor, has emerged as a unique protein with the capacity to associate to specific enzymes of the pathway of synthesis of phospholipids at the endoplasmic reticulum and activate their synthesis to accompany genomic decisions of growth. Herein, we discuss this cytoplasmic, non-genomic effect of c-Fos in the context of other mechanisms that have been proposed to regulate lipid synthesis.

  7. Interpreting Mammalian Evolution using Fugu Genome Comparisons

    SciTech Connect

    Stubbs, L; Ovcharenko, I; Loots, G G

    2004-04-02

    Comparative sequence analysis of the human and the pufferfish Fugu rubripes (fugu) genomes has revealed several novel functional coding and noncoding regions in the human genome. In particular, the fugu genome has been extremely valuable for identifying transcriptional regulatory elements in human loci harboring unusually high levels of evolutionary conservation to rodent genomes. In such regions, the large evolutionary distance between human and fishes provides an additional filter through which functional noncoding elements can be detected with high efficiency.

  8. Mapping Association between Long-Range Cis-Regulatory Regions and Their Target Genes Using Comparative Genomics

    NASA Astrophysics Data System (ADS)

    Mongin, Emmanuel; Dewar, Ken; Blanchette, Mathieu

    In chordates, long-range cis-regulatory regions are involved in the control of transcription initiation (either as repressors or enhancers). They can be located as far as 1 Mb from the transcription start site of the target gene and can regulate more than one gene. Therefore, proper characterization of functional interactions between long-range cis-regulatory regions and their target genes remains problematic. We present a novel method to predict such interactions based on the analysis of rearrangements between the human and 16 other vertebrate genomes. Our method is based on the assumption that genome rearrangements that would disrupt the functional interaction between a cis-regulatory region and its target gene are likely to be deleterious. Therefore, conservation of synteny through evolution would be an indication of a functional interaction. We use our algorithm to classify a set of 1,406,084 putative associations from the human genome. This genome-wide map of interactions has many potential applications, including the selection of candidate regions prior to in vivo experimental characterization, a better characterization of regulatory regions involved in position effect diseases, and an improved understanding of the mechanisms and importance of long-range regulation.

  9. Sequence analysis of bacterial artificial chromosome clones from the apospory-specific genomic region of Pennisetum and Cenchrus.

    PubMed

    Conner, Joann A; Goel, Shailendra; Gunawan, Gunawati; Cordonnier-Pratt, Marie-Michele; Johnson, Virgil Ed; Liang, Chun; Wang, Haiming; Pratt, Lee H; Mullet, John E; DeBarry, Jeremy; Yang, Lixing; Bennetzen, Jeffrey L; Klein, Patricia E; Ozias-Akins, Peggy

    2008-07-01

    Apomixis, asexual reproduction through seed, is widespread among angiosperm families. Gametophytic apomixis in Pennisetum squamulatum and Cenchrus ciliaris is controlled by the apospory-specific genomic region (ASGR), which is highly conserved and macrosyntenic between these species. Thirty-two ASGR bacterial artificial chromosomes (BACs) isolated from both species and one ASGR-recombining BAC from P. squamulatum, which together cover approximately 2.7 Mb of DNA, were used to investigate the genomic structure of this region. Phrap assembly of 4,521 high-quality reads generated 1,341 contiguous sequences (contigs; 730 from the ASGR and 30 from the ASGR-recombining BAC in P. squamulatum, plus 580 from the C. ciliaris ASGR). Contigs containing putative protein-coding regions unrelated to transposable elements were identified based on protein similarity after Basic Local Alignment Search Tool X analysis. These putative coding regions were further analyzed in silico with reference to the rice (Oryza sativa) and sorghum (Sorghum bicolor) genomes using the resources at Gramene (www.gramene.org) and Phytozome (www.phytozome.net) and by hybridization against sorghum BAC filters. The ASGR sequences reveal that the ASGR (1) contains both gene-rich and gene-poor segments, (2) contains several genes that may play a role in apomictic development, (3) has many classes of transposable elements, and (4) does not exhibit large-scale synteny with either rice or sorghum genomes but does contain multiple regions of microsynteny with these species. PMID:18508959

  10. Genomic organization of duplicated major histocompatibility complex class I regions in Atlantic salmon (Salmo salar)

    PubMed Central

    Lukacs, Morten F; Harstad, Håvard; Grimholt, Unni; Beetz-Sargent, Marianne; Cooper, Glenn A; Reid, Linda; Bakke, Hege G; Phillips, Ruth B; Miller, Kristina M; Davidson, William S; Koop, Ben F

    2007-01-01

    Background We have previously identified associations between major histocompatibility complex (MHC) class I and resistance towards bacterial and viral pathogens in Atlantic salmon. To evaluate if only MHC or also closely linked genes contributed to the observed resistance we ventured into sequencing of the duplicated MHC class I regions of Atlantic salmon. Results Nine BACs covering more than 500 kb of the two duplicated MHC class I regions of Atlantic salmon were sequenced and the gene organizations characterized. Both regions contained the proteasome components PSMB8, PSMB9, PSMB9-like and PSMB10 in addition to the transporter for antigen processing TAP2, as well as genes for KIFC1, ZBTB22, DAXX, TAPBP, BRD2, COL11A2, RXRB and SLC39A7. The IA region contained the recently reported MHC class I Sasa-ULA locus residing approximately 50 kb upstream of the major Sasa-UBA locus. The duplicated class IB region contained an MHC class I locus resembling the rainbow trout UCA locus, but although transcribed it was a pseudogene. No other MHC class I-like genes were detected in the two duplicated regions. Two allelic BACs spanning the UBA locus had 99.2% identity over 125 kb, while the IA region showed 82.5% identity over 136 kb to the IB region. The Atlantic salmon IB region had an insert of 220 kb in comparison to the IA region containing three chitin synthase genes. Conclusion We have characterized the gene organization of more than 500 kb of the two duplicated MHC class I regions in Atlantic salmon. Although Atlantic salmon and rainbow trout are closely related, the gene organization of their IB region has undergone extensive gene rearrangements. The Atlantic salmon has only one class I UCA pseudogene in the IB region while trout contains the four MHC UCA, UDA, UEA and UFA class I loci. The large differences in gene content and most likely function of the salmon and trout class IB region clearly argues that sequencing of salmon will not necessarily provide information

  11. Assembling the Setaria italica L. Beauv. genome into nine chromosomes and insights into regions affecting growth and drought tolerance

    PubMed Central

    Tsai, Kevin J.; Lu, Mei-Yeh Jade; Yang, Kai-Jung; Li, Mengyun; Teng, Yuchuan; Chen, Shihmay; Ku, Maurice S. B.; Li, Wen-Hsiung

    2016-01-01

    The diploid C4 plant foxtail millet (Setaria italica L. Beauv.) is an important crop in many parts of Africa and Asia for the vast consumption of its grain and ability to grow in harsh environments, but remains understudied in terms of complete genomic architecture. To date, there have been only two genome assembly and annotation efforts with neither assembly reaching over 86% of the estimated genome size. We have combined de novo assembly with custom reference-guided improvements on a popular cultivar of foxtail millet and have achieved a genome assembly of 477 Mbp in length, which represents over 97% of the estimated 490 Mbp. The assembly anchors over 98% of the predicted genes to the nine assembled nuclear chromosomes and contains more functional annotation gene models than previous assemblies. Our annotation has identified a large number of unique gene ontology terms related to metabolic activities, a region of chromosome 9 with several growth factor proteins, and regions syntenic with pearl millet or maize genomic regions that have been previously shown to affect growth. The new assembly and annotation for this important species can be used for detailed investigation and future innovations in growth for millet and other grains. PMID:27734962

  12. [Mutation frequencies in HIV-1 subtype-A genome in regions containing efficient RNAi targets].

    PubMed

    Kravatsky, Y V; Chechetkin, V R; Fedoseeva, D M; Gorbacheva, M A; Kretova, O V; Tchurikov, N A

    2016-01-01

    The development of gene-therapy technology using RNAi for AIDS/HIV-1 treatment is a prospective alternative to traditional anti-retroviral therapy. RNAi targets could be selected in HIV-1 transcripts and in CCR5 mRNA. Previously, we experimentally selected a number of efficient siRNAs that target HIV-1 RNAs. The viral genome mutates frequently, and RNAi strength is very sensitive, even for a single mismatches. That is why it is important to study nucleotide sequences of targets in clinical isolates of HIV-1. In the present study, we analyzed mutations in 6 of about 300-bp regions containing RNAi targets from HIV-1 subtype A isolates in Russia. Estimates of the mean frequencies of mutations in the targets were obtained and the frequencies of mutations in the different codon positions were compared. The frequencies of mutations in the vicinity of the targets and directly within the targets were also compared and have been shown to be approximately the same. The frequencies of indels in the chosen regions have been assessed. Their frequencies have proved to be two to three orders of magnitude less compared to that for mutations. PMID:27414786

  13. Selection Under Domestication: Evidence for a Sweep in the Rice Waxy Genomic Region

    PubMed Central

    Olsen, Kenneth M.; Caicedo, Ana L.; Polato, Nicholas; McClung, Anna; McCouch, Susan; Purugganan, Michael D.

    2006-01-01

    Rice (Oryza sativa) was cultivated by Asian Neolithic farmers >11,000 years ago, and different cultures have selected for divergent starch qualities in the rice grain during and after the domestication process. An intron 1 splice donor site mutation of the Waxy gene is responsible for the absence of amylose in glutinous rice varieties. This mutation appears to have also played an important role in the origin of low amylose, nonglutinous temperate japonica rice varieties, which form a primary component of Northeast Asian cuisines. Waxy DNA sequence analyses indicate that the splice donor mutation is prevalent in temperate japonica rice varieties, but rare or absent in tropical japonica, indica, aus, and aromatic varieties. Sequence analysis across a 500-kb genomic region centered on Waxy reveals patterns consistent with a selective sweep in the temperate japonicas associated with the mutation. The size of the selective sweep (>250 kb) indicates very strong selection in this region, with an inferred selection coefficient that is higher than similar estimates from maize domestication genes or wild species. These findings demonstrate that selection pressures associated with crop domestication regimes can exceed by one to two orders of magnitude those observed for genes under even strong selection in natural systems. PMID:16547098

  14. In silico comparison of genomic regions containing genes coding for enzymes and transcription factors for the phenylpropanoid pathway in Phaseolus vulgaris L. and Glycine max L. Merr

    PubMed Central

    Reinprecht, Yarmilla; Yadegari, Zeinab; Perry, Gregory E.; Siddiqua, Mahbuba; Wright, Lori C.; McClean, Phillip E.; Pauls, K. Peter

    2013-01-01

    Legumes contain a variety of phytochemicals derived from the phenylpropanoid pathway that have important effects on human health as well as seed coat color, plant disease resistance and nodulation. However, the information about the genes involved in this important pathway is fragmentary in common bean (Phaseolus vulgaris L.). The objectives of this research were to isolate genes that function in and control the phenylpropanoid pathway in common bean, determine their genomic locations in silico in common bean and soybean, and analyze sequences of the 4CL gene family in two common bean genotypes. Sequences of phenylpropanoid pathway genes available for common bean or other plant species were aligned, and the conserved regions were used to design sequence-specific primers. The PCR products were cloned and sequenced and the gene sequences along with common bean gene-based (g) markers were BLASTed against the Glycine max v.1.0 genome and the P. vulgaris v.1.0 (Andean) early release genome. In addition, gene sequences were BLASTed against the OAC Rex (Mesoamerican) genome sequence assembly. In total, fragments of 46 structural and regulatory phenylpropanoid pathway genes were characterized in this way and placed in silico on common bean and soybean sequence maps. The maps contain over 250 common bean g and SSR (simple sequence repeat) markers and identify the positions of more than 60 additional phenylpropanoid pathway gene sequences, plus the putative locations of seed coat color genes. The majority of cloned phenylpropanoid pathway gene sequences were mapped to one location in the common bean genome but had two positions in soybean. The comparison of the genomic maps confirmed previous studies, which show that common bean and soybean share genomic regions, including those containing phenylpropanoid pathway gene sequences, with conserved synteny. Indels identified in the comparison of Andean and Mesoamerican common bean 4CL gene sequences might be used to develop inter

  15. Additional UBVRI and JHKL photometry of T Tauri stars in the Taurus region

    NASA Astrophysics Data System (ADS)

    Rydgren, A. E.; Vrba, F. J.

    1983-07-01

    We present nearly simultaneous UB VRI and JHKL photometry for 21 T Tauri stars in the Taurus region and three stars in northern Orion. Some nonsimultaneous UBVRJ and JHKL photometry of T Tauri stars in the Taurus and NGC 2264 regions is also given. These new data reinforce the conclusion from Rydgren, Schmelz, and Vrba (1982) that the T Tauri loci in the (J - H, H - K) and (H - K, K - L) diagrams are relatively narrow, implying a limited range in maximum temperature for the associated circumstellar dust. From an analysis of the available data for five actively variable T Tauri stars we find that (1) the slopes d (B - V )/d V and d (V - I)/dV differ significantly between stars, (2) the j - H, H - K, and K - L colors tend to become larger when the star is fainter at V, and (3) the amplitude of variations at L is much smaller than the amplitude at V. The resulting constraints on models for T Tauri variability are considered.

  16. Robust physical methods that enrich genomic regions identical by descent for linkage studies: confirmation of a locus for osteogenesis imperfecta

    PubMed Central

    Brooks, Peter; Marcaillou, Charles; Vanpeene, Maud; Saraiva, Jean-Paul; Stockholm, Daniel; Francke, Stephan; Favis, Reyna; Cohen, Nadine; Rousseau, Francis; Tores, Frédéric; Lindenbaum, Pierre; Hager, Jörg; Philippi, Anne

    2009-01-01

    Background The monogenic disease osteogenesis imperfecta (OI) is due to single mutations in either of the collagen genes ColA1 or ColA2, but within the same family a given mutation is accompanied by a wide range of disease severity. Although this phenotypic variability implies the existence of modifier gene variants, genome wide scanning of DNA from OI patients has not been reported. Promising genome wide marker-independent physical methods for identifying disease-related loci have lacked robustness for widespread applicability. Therefore we sought to improve these methods and demonstrate their performance to identify known and novel loci relevant to OI. Results We have improved methods for enriching regions of identity-by-descent (IBD) shared between related, afflicted individuals. The extent of enrichment exceeds 10- to 50-fold for some loci. The efficiency of the new process is shown by confirmation of the identification of the Col1A2 locus in osteogenesis imperfecta patients from Amish families. Moreover the analysis revealed additional candidate linkage loci that may harbour modifier genes for OI; a locus on chromosome 1q includes COX-2, a gene implicated in osteogenesis. Conclusion Technology for physical enrichment of IBD loci is now robust and applicable for finding genes for monogenic diseases and genes for complex diseases. The data support the further investigation of genetic loci other than collagen gene loci to identify genes affecting the clinical expression of osteogenesis imperfecta. The discrimination of IBD mapping will be enhanced when the IBD enrichment procedure is coupled with deep resequencing. PMID:19331686

  17. Regional mosaic genomic heterogeneity in the elderly and in Alzheimer's disease as a correlate of neuronal vulnerability.

    PubMed

    Arendt, Thomas; Brückner, Martina K; Lösche, Andreas

    2015-10-01

    . We were able to demonstrate, moreover, that the neuronal DCV in the cerebral cortex of the normal elderly as well as the increased neuronal DCV in AD patients are not randomly distributed but instead show systematic regional differences which correspond to differences in vulnerability. These findings provide additional evidence that mosaic genomic heterogeneity may play a key role in AD pathology. PMID:26298468

  18. Regional mosaic genomic heterogeneity in the elderly and in Alzheimer's disease as a correlate of neuronal vulnerability.

    PubMed

    Arendt, Thomas; Brückner, Martina K; Lösche, Andreas

    2015-10-01

    . We were able to demonstrate, moreover, that the neuronal DCV in the cerebral cortex of the normal elderly as well as the increased neuronal DCV in AD patients are not randomly distributed but instead show systematic regional differences which correspond to differences in vulnerability. These findings provide additional evidence that mosaic genomic heterogeneity may play a key role in AD pathology.

  19. 30 CFR 250.1166 - What additional reporting is required for developments in the Alaska OCS Region?

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... developments in the Alaska OCS Region? 250.1166 Section 250.1166 Mineral Resources MINERALS MANAGEMENT SERVICE, DEPARTMENT OF THE INTERIOR OFFSHORE OIL AND GAS AND SULPHUR OPERATIONS IN THE OUTER CONTINENTAL SHELF Oil and Gas Production Requirements Other Requirements § 250.1166 What additional reporting is required...

  20. Genome-scale prediction of proteins with long intrinsically disordered regions.

    PubMed

    Peng, Zhenling; Mizianty, Marcin J; Kurgan, Lukasz

    2014-01-01

    Proteins with long disordered regions (LDRs), defined as having 30 or more consecutive disordered residues, are abundant in eukaryotes, and these regions are recognized as a distinct class of biologically functional domains. LDRs facilitate various cellular functions and are important for target selection in structural genomics. Motivated by the lack of methods that directly predict proteins with LDRs, we designed Super-fast predictor of proteins with Long Intrinsically DisordERed regions (SLIDER). SLIDER utilizes logistic regression that takes an empirically chosen set of numerical features, which consider selected physicochemical properties of amino acids, sequence complexity, and amino acid composition, as its inputs. Empirical tests show that SLIDER offers competitive predictive performance combined with low computational cost. It outperforms, by at least a modest margin, a comprehensive set of modern disorder predictors (that can indirectly predict LDRs) and is 16 times faster compared to the best currently available disorder predictor. Utilizing our time-efficient predictor, we characterized abundance and functional roles of proteins with LDRs over 110 eukaryotic proteomes. Similar to related studies, we found that eukaryotes have many (on average 30.3%) proteins with LDRs with majority of proteomes having between 25 and 40%, where higher abundance is characteristic to proteomes that have larger proteins. Our first-of-its-kind large-scale functional analysis shows that these proteins are enriched in a number of cellular functions and processes including certain binding events, regulation of catalytic activities, cellular component organization, biogenesis, biological regulation, and some metabolic and developmental processes. A webserver that implements SLIDER is available at http://biomine.ece.ualberta.ca/SLIDER/.

  1. Genomic characterization of Sinorhizobium meliloti AK21, a wild isolate from the Aral Sea Region.

    PubMed

    Molina-Sánchez, María Dolores; López-Contreras, José Antonio; Toro, Nicolás; Fernández-López, Manuel

    2015-01-01

    The symbiotic, nitrogen-fixing bacterium Sinorhizobium meliloti has been widely studied due to its ability to improve crop yields through direct interactions with leguminous plants. S. meliloti AK21 is a wild type strain that forms nodules on Medicago plants in saline and drought conditions in the Aral Sea Region. The aim of this work was to establish the genetic similarities and differences between S. meliloti AK21 and the reference strain S. meliloti 1021. Comparative genome hybridization with the model reference strain S. meliloti 1021 yielded 365 variable genes, grouped into 11 regions in the three main replicons in S. meliloti AK21. The most extensive regions of variability were found in the symbiotic plasmid pSymA, which also contained the largest number of orthologous and polymorphic sequences identified by suppression subtractive hybridization. This procedure identified a large number of divergent sequences and others without homology in the databases, the further investigation of which could provide new insight into the alternative metabolic pathways present in S. meliloti AK21. We identified a plasmid replication module from the repABC replicon family, together with plasmid mobilization-related genes (traG and a VirB9-like protein), which suggest that this indigenous isolate harbors an accessory plasmid. Furthermore, the transcriptomic profiles reflected differences in gene content and regulation between S. meliloti AK21 and S. meliloti 1021 (ExpR and PhoB regulons), but provided evidence for an as yet unknown, alternative mechanism involving activation of the cbb3 terminal oxidase. Finally, phenotypic microarrays characterization revealed a greater versatility of substrate use and chemical degradation than for S. meliloti 1021.

  2. Genomic characterization of Sinorhizobium meliloti AK21, a wild isolate from the Aral Sea Region.

    PubMed

    Molina-Sánchez, María Dolores; López-Contreras, José Antonio; Toro, Nicolás; Fernández-López, Manuel

    2015-01-01

    The symbiotic, nitrogen-fixing bacterium Sinorhizobium meliloti has been widely studied due to its ability to improve crop yields through direct interactions with leguminous plants. S. meliloti AK21 is a wild type strain that forms nodules on Medicago plants in saline and drought conditions in the Aral Sea Region. The aim of this work was to establish the genetic similarities and differences between S. meliloti AK21 and the reference strain S. meliloti 1021. Comparative genome hybridization with the model reference strain S. meliloti 1021 yielded 365 variable genes, grouped into 11 regions in the three main replicons in S. meliloti AK21. The most extensive regions of variability were found in the symbiotic plasmid pSymA, which also contained the largest number of orthologous and polymorphic sequences identified by suppression subtractive hybridization. This procedure identified a large number of divergent sequences and others without homology in the databases, the further investigation of which could provide new insight into the alternative metabolic pathways present in S. meliloti AK21. We identified a plasmid replication module from the repABC replicon family, together with plasmid mobilization-related genes (traG and a VirB9-like protein), which suggest that this indigenous isolate harbors an accessory plasmid. Furthermore, the transcriptomic profiles reflected differences in gene content and regulation between S. meliloti AK21 and S. meliloti 1021 (ExpR and PhoB regulons), but provided evidence for an as yet unknown, alternative mechanism involving activation of the cbb3 terminal oxidase. Finally, phenotypic microarrays characterization revealed a greater versatility of substrate use and chemical degradation than for S. meliloti 1021. PMID:26090306

  3. Variable Genome Sequences of the Murine Pneumotropic Virus (Polyomaviridae) Regulatory Region Isolated from an Infected Mouse Tissue Viral Suspension

    PubMed Central

    Libbey, Jane E.

    2016-01-01

    The murine pneumotropic virus genome, isolated from an infected murine tissue homogenate, was sequenced to completion. The lungs, liver, spleen, and kidneys were the source of the tissue homogenate in order to mirror the heterogeneity of the virus population in vivo. The regulatory region sequence was found to be highly variable. PMID:27231357

  4. High-resolution physical mapping in Pennisetum squamulatum reveals extensive chromosomal heteromorphism of the genomic region associated with apomixis.

    PubMed

    Akiyama, Yukio; Conner, Joann A; Goel, Shailendra; Morishige, Daryl T; Mullet, John E; Hanna, Wayne W; Ozias-Akins, Peggy

    2004-04-01

    Gametophytic apomixis is asexual reproduction as a consequence of parthenogenetic development of a chromosomally unreduced egg. The trait leads to the production of embryos with a maternal genotype, i.e. progeny are clones of the maternal plant. The application of the trait in agriculture could be a tremendous tool for crop improvement through conventional and nonconventional breeding methods. Unfortunately, there are no major crops that reproduce by apomixis, and interspecific hybridization with wild relatives has not yet resulted in commercially viable germplasm. Pennisetum squamulatum is an aposporous apomict from which the gene(s) for apomixis has been transferred to sexual pearl millet by backcrossing. Twelve molecular markers that are linked with apomixis coexist in a tight linkage block called the apospory-specific genomic region (ASGR), and several of these markers have been shown to be hemizygous in the polyploid genome of P. squamulatum. High resolution genetic mapping of these markers has not been possible because of low recombination in this region of the genome. We now show the physical arrangement of bacterial artificial chromosomes containing apomixis-linked molecular markers by high resolution fluorescence in situ hybridization on pachytene chromosomes. The size of the ASGR, currently defined as the entire hemizygous region that hybridizes with apomixis-linked bacterial artificial chromosomes, was estimated on pachytene and mitotic chromosomes to be approximately 50 Mbp (a quarter of the chromosome). The ASGR includes highly repetitive sequences from an Opie-2-like retrotransposon family that are particularly abundant in this region of the genome.

  5. 12q14 Microdeletions: Additional Case Series with Confirmation of a Macrocephaly Region

    PubMed Central

    Mc Cormack, Adrian; Sharpe, Cynthia; Gregersen, Nerine; Smith, Warwick; Hayes, Ian; George, Alice M.; Love, Donald R.

    2015-01-01

    To date, there have been only a few reports of patients carrying a microdeletion in chromosome 12q14. These patients usually present with pre- and postnatal growth retardation, and developmental delay. Here we report on two additional patients with both genotype and phenotype differences. Similar to previously published cases, one patient has haploinsufficiency of the HMGA2 gene and shows severe short stature and developmental delay. The second patient is only one of a handful without the loss of the HMGA2 gene and shows a much better growth profile, but with absolute macrocephaly. This patient's deletion is unique and hence defines a likely macrocephaly locus that contributes to the general phenotype characterising the 12q14 syndrome. PMID:26266063

  6. Deficiency screening for genomic regions with effects on environmental sensitivity of the sensory bristles of Drosophila melanogaster.

    PubMed

    Takahashi, Kazuo H; Okada, Yasukazu; Teramura, Kouhei

    2012-09-01

    Environmental canalization is defined as a reduction in the effect of external environmental perturbations on a phenotype, while phenotypic plasticity is defined as the production of different phenotypes in alternative environments. These terms describe different aspects of the same phenomenon, that is, the sensitivity of the phenotype to the environment. Genetic regulation of the environmental sensitivity has been a central topic in the field of evolutionary biology. In this study, we performed deficiency screening to detect genomic regions with effects on the environmental sensitivity of Drosophila melanogaster sensory bristles. We used a collection of isogenic deficiency strains established by the DrosDel Project for screening. We screened 423 genomic deficiencies that encompassed approximately 63.6% of the entire D. melanogaster genome. We identified 29 genomic deficiencies showing significant effects on environmental sensitivity, suggesting that multiple genomic regions may influence phenotypic variation. We also found significant correlations among the effects of deficiencies on environmental sensitivity for different bristle traits, suggesting that the same genetic mechanism can regulate environmental sensitivity of multiple traits. Current high-resolution mapping will facilitate the examination of individual candidate genes using mutations or RNAi approaches in future studies.

  7. Adaptation of Maize to Temperate Climates: Mid-Density Genome-Wide Association Genetics and Diversity Patterns Reveal Key Genomic Regions, with a Major Contribution of the Vgt2 (ZCN8) Locus

    PubMed Central

    Bouchet, Sophie; Servin, Bertrand; Bertin, Pascal; Madur, Delphine; Combes, Valérie; Dumas, Fabrice; Brunel, Dominique; Laborde, Jacques; Charcosset, Alain; Nicolas, Stéphane

    2013-01-01

    The migration of maize from tropical to temperate climates was accompanied by a dramatic evolution in flowering time. To gain insight into the genetic architecture of this adaptive trait, we conducted a 50K SNP-based genome-wide association and diversity investigation on a panel of tropical and temperate American and European representatives. Eighteen genomic regions were associated with flowering time. The number of early alleles cumulated along these regions was highly correlated with flowering time. Polymorphism in the vicinity of the ZCN8 gene, which is the closest maize homologue to Arabidopsis major flowering time (FT) gene, had the strongest effect. This polymorphism is in the vicinity of the causal factor of Vgt2 QTL. Diversity was lower, whereas differentiation and LD were higher for associated loci compared to the rest of the genome, which is consistent with selection acting on flowering time during maize migration. Selection tests also revealed supplementary loci that were highly differentiated among groups and not associated with flowering time in our panel, whereas they were in other linkage-based studies. This suggests that allele fixation led to a lack of statistical power when structure and relatedness were taken into account in a linear mixed model. Complementary designs and analysis methods are necessary to unravel the architecture of complex traits. Based on linkage disequilibrium (LD) estimates corrected for population structure, we concluded that the number of SNPs genotyped should be at least doubled to capture all QTLs contributing to the genetic architecture of polygenic traits in this panel. These results show that maize flowering time is controlled by numerous QTLs of small additive effect and that strong polygenic selection occurred under cool climatic conditions. They should contribute to more efficient genomic predictions of flowering time and facilitate the dissemination of diverse maize genetic resources under a wide range of

  8. The Variable Regions of Lactobacillus rhamnosus Genomes Reveal the Dynamic Evolution of Metabolic and Host-Adaptation Repertoires

    PubMed Central

    Ceapa, Corina; Davids, Mark; Ritari, Jarmo; Lambert, Jolanda; Wels, Michiel; Douillard, François P.; Smokvina, Tamara; de Vos, Willem M.; Knol, Jan; Kleerebezem, Michiel

    2016-01-01

    Lactobacillus rhamnosus is a diverse Gram-positive species with strains isolated from different ecological niches. Here, we report the genome sequence analysis of 40 diverse strains of L. rhamnosus and their genomic comparison, with a focus on the variable genome. Genomic comparison of 40 L. rhamnosus strains discriminated the conserved genes (core genome) and regions of plasticity involving frequent rearrangements and horizontal transfer (variome). The L. rhamnosus core genome encompasses 2,164 genes, out of 4,711 genes in total (the pan-genome). The accessory genome is dominated by genes encoding carbohydrate transport and metabolism, extracellular polysaccharides (EPS) biosynthesis, bacteriocin production, pili production, the cas system, and the associated clustered regularly interspaced short palindromic repeat (CRISPR) loci, and more than 100 transporter functions and mobile genetic elements like phages, plasmid genes, and transposons. A clade distribution based on amino acid differences between core (shared) proteins matched with the clade distribution obtained from the presence–absence of variable genes. The phylogenetic and variome tree overlap indicated that frequent events of gene acquisition and loss dominated the evolutionary segregation of the strains within this species, which is paralleled by evolutionary diversification of core gene functions. The CRISPR-Cas system could have contributed to this evolutionary segregation. Lactobacillus rhamnosus strains contain the genetic and metabolic machinery with strain-specific gene functions required to adapt to a large range of environments. A remarkable congruency of the evolutionary relatedness of the strains’ core and variome functions, possibly favoring interspecies genetic exchanges, underlines the importance of gene-acquisition and loss within the L. rhamnosus strain diversification. PMID:27358423

  9. Polymorphisms and genomic organization of repetitive DNA from centromeric regions of Arabidopsis chromosomes.

    PubMed Central

    Heslop-Harrison, J S; Murata, M; Ogura, Y; Schwarzacher, T; Motoyoshi, F

    1999-01-01

    A highly abundant repetitive DNA sequence family of Arabidopsis, AtCon, is composed of 178-bp tandemly repeated units and is located at the centromeres of all five chromosome pairs. Analysis of multiple copies of AtCon showed 95% conservation of nucleotides, with some alternative bases, and revealed two boxes, 30 and 24 bp long, that are 99% conserved. Sequences at the 3' end of these boxes showed similarity to yeast CDEI and human CENP-B DNA-protein binding motifs. When oligonucleotides from less conserved regions of AtCon were hybridized in situ and visualized by using primer extension, they were detected on specific chromosomes. When used for polymerase chain reaction with genomic DNA, single primers or primer pairs oriented in the same direction showed negligible amplification, indicating a head-to-tail repeat unit organization. Most primer pairs facing in opposite directions gave several strong bands corresponding to their positions within AtCon. However, consistent with the primer extension results, some primer pairs showed no amplification, indicating that there are chromosome-specific variants of AtCon. The results are significant because they elucidate the organization, mode of amplification, dispersion, and evolution of one of the major repeated sequence families of Arabidopsis. The evidence presented here suggests that AtCon, like human alpha satellites, plays a role in Arabidopsis centromere organization and function. PMID:9878630

  10. Genomic and network patterns of schizophrenia genetic variation in human evolutionary accelerated regions.

    PubMed

    Xu, Ke; Schadt, Eric E; Pollard, Katherine S; Roussos, Panos; Dudley, Joel T

    2015-05-01

    The population persistence of schizophrenia despite associated reductions in fitness and fecundity suggests that the genetic basis of schizophrenia has a complex evolutionary history. A recent meta-analysis of schizophrenia genome-wide association studies offers novel opportunities for assessment of the evolutionary trajectories of schizophrenia-associated loci. In this study, we hypothesize that components of the genetic architecture of schizophrenia are attributable to human lineage-specific evolution. Our results suggest that schizophrenia-associated loci enrich in genes near previously identified human accelerated regions (HARs). Specifically, we find that genes near HARs conserved in nonhuman primates (pHARs) are enriched for schizophrenia-associated loci, and that pHAR-associated schizophrenia genes are under stronger selective pressure than other schizophrenia genes and other pHAR-associated genes. We further evaluate pHAR-associated schizophrenia genes in regulatory network contexts to investigate associated molecular functions and mechanisms. We find that pHAR-associated schizophrenia genes significantly enrich in a GABA-related coexpression module that was previously found to be differentially regulated in schizophrenia affected individuals versus healthy controls. In another two independent networks constructed from gene expression profiles from prefrontal cortex samples, we find that pHAR-associated schizophrenia genes are located in more central positions and their average path lengths to the other nodes are significantly shorter than those of other schizophrenia genes. Together, our results suggest that HARs are associated with potentially important functional roles in the genetic architecture of schizophrenia.

  11. Narrowing and genomic annotation of the commonly deleted region of the 5q- syndrome

    SciTech Connect

    Boultwood, Jacqueline; Fidler, Carrie; Strickson, Amanda J.; Watkins, Fiona; Gama, Susana; Kearney, Lyndal; Tosi, Sabrina; Kasprzyk, Arek; Cheng, Jan-Fang; Jaju, Rina J.; Wainscoat, James S.

    2002-01-15

    The 5q syndrome is the most distinct of the myelodysplastic syndromes, and the molecular basis for this disorder remains unknown. We describe the narrowing of the common deleted region (CDR) of the 5q syndrome to the approximately 1.5-megabases interval at 5q32 flanked by D5S413 and the GLRA1 gene. The Ensemblgene prediction program has been used for the complete genomic annotation of the CDR. The CDR is gene rich and contains 24 known genes and 16 novel (predicted) genes. Of 40 genes in the CDR, 33 are expressed in CD34 cells and, therefore, represent candidate genes since they are expressed within the hematopoietic stem/progenitor cell compartment. A number of the genes assigned to the CDR represent good candidates for the 5q syndrome, including MEGF1, G3BP, and several of the novel gene predictions. These data now afford a comprehensive mutational/expression analysis of all candidate genes assigned to the CDR.

  12. Structural classification and prediction of reentrant regions in alpha-helical transmembrane proteins: application to complete genomes.

    PubMed

    Viklund, Håkan; Granseth, Erik; Elofsson, Arne

    2006-08-18

    Alongside the well-studied membrane spanning helices, alpha-helical transmembrane (TM) proteins contain several functionally and structurally important types of substructures. Here, existing 3D structures of transmembrane proteins have been used to define and study the concept of reentrant regions, i.e. membrane penetrating regions that enter and exit the membrane on the same side. We find that these regions can be divided into three distinct categories based on secondary structure motifs, namely long regions with a helix-coil-helix motif, regions of medium length with the structure helix-coil or coil-helix and regions of short to medium length consisting entirely of irregular secondary structure. The residues situated in reentrant regions are significantly smaller on average compared to other regions and reentrant regions can be detected in the inter-transmembrane loops with an accuracy of approximately 70% based on their amino acid composition. Using TOP-MOD, a novel method for predicting reentrant regions, we have scanned the genomes of Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The results suggest that more than 10% of transmembrane proteins contain reentrant regions and that the occurrence of reentrant regions increases linearly with the number of transmembrane regions. Reentrant regions seem to be most commonly found in channel proteins and least commonly in signal receptors.

  13. Identification of genomic regions involved in resistance against Sclerotinia sclerotiorum from wild Brassica oleracea.

    PubMed

    Mei, Jiaqin; Ding, Yijuan; Lu, Kun; Wei, Dayong; Liu, Yao; Disi, Joseph Onwusemu; Li, Jiana; Liu, Liezhao; Liu, Shengyi; McKay, John; Qian, Wei

    2013-02-01

    The lack of resistant source has greatly restrained resistance breeding of rapeseed (Brassica napus, AACC) against Sclerotinia sclerotiorum which causes severe yield losses in rapeseed production all over the world. Recently, several wild Brassica oleracea accessions (CC) with high level of resistance have been identified (Mei et al. in Euphytica 177:393-400, 2011), bringing a new hope to improve Sclerotinia resistance of rapeseed. To map quantitative trait loci (QTL) for Sclerotinia resistance from wild B. oleracea, an F2 population consisting of 149 genotypes, with several clones of each genotypes, was developed from one F1 individual derived from the cross between a resistant accession of wild B. oleracea (B. incana) and a susceptible accession of cultivated B. oleracea var. alboglabra. The F2 population was evaluated for Sclerotinia reaction in 2009 and 2010 under controlled condition. Significant differences among genotypes and high heritability for leaf and stem reaction indicated that genetic components accounted for a large portion of the phenotypic variance. A total of 12 QTL for leaf resistance and six QTL for stem resistance were identified in 2 years, each explaining 2.2-28.4 % of the phenotypic variation. The combined effect of alleles from wild B. oleracea reduced the relative susceptibility by 22.5 % in leaves and 15 % in stems on average over 2 years. A 12.8-cM genetic region on chromosome C09 of B. oleracea consisting of two major QTL intervals for both leaf and stem resistance was assigned into a 2.7-Mb genomic region on chromosome A09 of B. rapa, harboring about 30 putative resistance-related genes. Significant negative corrections were found between flowering time and relative susceptibility of leaf and stem. The association of flowering time with Sclerotinia resistance is discussed.

  14. Molecular analysis of the Adh region of the genome of Drosophila melanogaster.

    PubMed

    Chia, W; Karp, R; McGill, S; Ashburner, M

    1985-12-20

    A small region of the genome of Drosophila melanogaster has been cloned in a series of overlapping phage. A length of 165 X 10(3) base-pairs of contiguous DNA that spans polytene chromosome region 35A4 to 35B1 and includes the structural gene for alcohol dehydrogenase (Adh) as well as at least two other genes, outspread (osp) and no-ocelli (noc), has been characterized by mapping chromosome aberrations to the DNA. The relationship between osp and Adh is surprising: of nine osp alleles associated with chromosome breakpoints, five map distal (i.e. 5') to Adh and four map proximal (i.e. 3') to this gene. None affects the expression of Adh. As defined by these and other breakpoints, the osp gene spans at least 52 X 10(3) base-pairs and overlaps the Adh gene. The noc gene, as defined by the mapping of nearly 30 breakpoints, is at least 50 X 10(3) base-pairs in size. Alleles of noc and noc- deletions show either of two kinds of interaction with the recessive lethality of l(2)br29ScoR+1, a lethal that maps immediately distal to noc. One class of noc allele is viable when heterozygous with ScoR+1, while the other class is lethal or semi-lethal. Both classes, however, are homozygous or hemizygous viable. The locations of these two classes of noc allele on the DNA fall into two clusters, with those that are viable with ScoR+1 located proximal to those that are not. The physical boundary between these classes lies at a site just distal to that of the breakpoint of the inversion associated with ScoR+1 itself.

  15. High Level of Structural Polymorphism Driven by Mobile Elements in the Hox Genomic Region of the Chaetognath Spadella cephaloptera

    PubMed Central

    Marlétaz, Ferdinand; Gyapay, Gabor; Le Parco, Yannick

    2010-01-01

    Little is known about the relationships between genome polymorphism, mobile element dynamics, and population size among animal populations. The chaetognath species Spadella cephaloptera offers a unique perspective to examine this issue because they display a high level of genetic polymorphism at the population level. Here, we have investigated in detail the extent of nucleotide and structural polymorphism in a region harboring Hox1 and several coding genes and presumptive functional elements. Sequencing of several bacterial artificial chromosome inserts representative of this nuclear region uncovered a high level of structural heterogeneity, which is mainly caused by the polymorphic insertion of a diversity of genetic mobile elements. By anchoring this variation through individual genotyping, we demonstrated that sequence diversity could be attributed to the allelic pool of a single population, which was confirmed by detection of extensive recombination within the genomic region studied. The high average level of nucleotide heterozygosity provides clues of selection in both coding and noncoding domains. This pattern stresses how selective processes remarkably cope with intense sequence turnover due to substitutions, mobile element insertions, and recombination to preserve the integrity of functional landscape. These findings suggest that genome polymorphism could provide pivotal information for future functional annotation of genomes. PMID:20829282

  16. The complete mitochondrial genome of spittlebug Paphnutius ruficeps (Insecta: Hemiptera: Cercopidae) with a fairly short putative control region.

    PubMed

    Liu, Jie; Liang, Aiping

    2013-04-01

    The mitochondrial genome of the spittlebug Paphnutius ruficeps is a double-strand DNA circular molecule of 14,841 bp with a total A and T content of 73.8%. It is one of the shortest genomes among published hemipteran mitogenomes and encodes 13 protein-coding genes, 2 ribosome RNA genes and 22 transfer RNA (tRNA) genes. The gene order is consistent with the hypothesized ancestral arthropod genome arrangement. Most of the protein-coding genes use ATG as start and TAA as stop codon. The codons show an evident bias toward the nucleotides T and A at the third codon position and the most commonly used codons contain more A and T than their synonymous ones. The anticodons of the 22 tRNA genes are identical to those of the mitogenome of Philaenus spumarius, another studied spittlebug. All the tRNAs could be folded into traditional clover leaf secondary structures. The putative control region (traditionally called A + T-rich region) is the main non-coding part of the mitogenome. The AT content of this region (74.5%) is not significantly higher than that of the total mitogenome (73.8%) and slightly lower than that of the N-chain protein-coding genes (75.3%). The absence of repeat sequences as well as its short length is the most obvious characteristics of the mitochondrial genome of Paphnutius ruficeps compared with those of other published hemipteran species.

  17. Natural Variation in a Subtelomeric Region of Arabidopsis: Implications for the Genomic Dynamics of a Chromosome End

    PubMed Central

    Kuo, Hui-Fen; Olsen, Kenneth M.; Richards, Eric J.

    2006-01-01

    We investigated genome dynamics at a chromosome end in the model plant Arabidopsis thaliana through a study of natural variation in 35 wild accessions. We focused on the single-copy subtelomeric region of chromosome 1 north (∼3.5 kb), which represents the relatively simple organization of subtelomeric regions in this species. PCR fragment-length variation across the subtelomeric region indicated that the 1.4-kb distal region showed elevated structural variation relative to the centromere-proximal region. Examination of nucleotide sequences from this 1.4-kb region revealed diverse DNA rearrangements, including an inversion, several deletions, and an insertion of a retrotransposon LTR. The structures at the deletion and inversion breakpoints are characteristic of simple deletion-associated nonhomologous end-joining (NHEJ) events. There was strong linkage disequilibrium between the distal subtelomeric region and the proximal telomere, which contains degenerate and variant telomeric repeats. Variation in the proximal telomere was characterized by the expansion and deletion of blocks of repeats. Our sample of accessions documented two independent chromosome-healing events associated with terminal deletions of the subtelomeric region as well as the capture of a scrambled mitochondrial DNA segment in the proximal telomeric array. This natural variation study highlights the variety of genomic events that drive the fluidity of chromosome termini. PMID:16547105

  18. Amplification of GB virus-C/hepatitis G virus RNA with primers from different regions of the viral genome.

    PubMed

    Kao, J H; Chen, P J; Chen, W; Hsiang, S C; Lai, M Y; Chen, D S

    1997-04-01

    GB virus-C/hepatitis G virus (GBV-C/HGV) is a newly identified RNA virus. The aim of the study was to compare three primer pairs from the 5' untranslated region (5'UTR), envelope region 2 (E 2) and nonstructural region 3 (NS 3) of GBV-C/HGV genome for their ability to detect GBV-C/HGV RNA by polymerase chain reaction (PCR) assays. By using PCR with primers from different regions of the viral genome, serum GBV-C/HGV RNA was assayed in 200 at-risk individuals. The sensitivity of this assay was assessed by a titration experiment, and nucleotide sequences of the amplified products were determined directly. Of 200 serum samples, 43 (21.5%) were positive for GBV-C/HGV RNA with at least one of the primer pairs. The positive rates by 5'UTR, NS 3, and E 2 primers were 100%, 98%, and 84%, respectively, and the sensitivity of PCR assays using 5'UTR primers was 10 to 100 times more likely to detect GBV-C/HGV RNA than that of NS 3 and E 2 primers. The average homology of amplified targets to the prototype HGV genome was 89%, 80%, and 85% and the similarity between each amplified target was up to 100%, 90%, and 92% in the 5'UTR, E 2, and NS 3 regions, respectively. Therefore, the 5'UTR of GBV-C/HGV genome is highly conserved and primers deduced from this region can provideva sensitive and specific PCR assay for GBV-C/HGV RNA.

  19. Coding DNA repeated throughout intergenic regions of the Arabidopsis thaliana genome: Evolutionary footprints of RNA silencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pyknons are non-random sequence patterns significantly repeated throughout non-coding genomic DNA that also appear at least once among genes. They are interesting because they portend an unforeseen connection between coding and non-coding DNA. Pyknons have only been discovered in the human genome,...

  20. Implementation of the Realized Genomic Relationship Matrix to Open-Pollinated White Spruce Family Testing for Disentangling Additive from Nonadditive Genetic Effects.

    PubMed

    Gamal El-Dien, Omnia; Ratcliffe, Blaise; Klápště, Jaroslav; Porth, Ilga; Chen, Charles; El-Kassaby, Yousry A

    2016-03-01

    The open-pollinated (OP) family testing combines the simplest known progeny evaluation and quantitative genetics analyses as candidates' offspring are assumed to represent independent half-sib families. The accuracy of genetic parameter estimates is often questioned as the assumption of "half-sibling" in OP families may often be violated. We compared the pedigree- vs. marker-based genetic models by analysing 22-yr height and 30-yr wood density for 214 white spruce [Picea glauca (Moench) Voss] OP families represented by 1694 individuals growing on one site in Quebec, Canada. Assuming half-sibling, the pedigree-based model was limited to estimating the additive genetic variances which, in turn, were grossly overestimated as they were confounded by very minor dominance and major additive-by-additive epistatic genetic variances. In contrast, the implemented genomic pairwise realized relationship models allowed the disentanglement of additive from all nonadditive factors through genetic variance decomposition. The marker-based models produced more realistic narrow-sense heritability estimates and, for the first time, allowed estimating the dominance and epistatic genetic variances from OP testing. In addition, the genomic models showed better prediction accuracies compared to pedigree models and were able to predict individual breeding values for new individuals from untested families, which was not possible using the pedigree-based model. Clearly, the use of marker-based relationship approach is effective in estimating the quantitative genetic parameters of complex traits even under simple and shallow pedigree structure.

  1. Implementation of the Realized Genomic Relationship Matrix to Open-Pollinated White Spruce Family Testing for Disentangling Additive from Nonadditive Genetic Effects

    PubMed Central

    Gamal El-Dien, Omnia; Ratcliffe, Blaise; Klápště, Jaroslav; Porth, Ilga; Chen, Charles; El-Kassaby, Yousry A.

    2016-01-01

    The open-pollinated (OP) family testing combines the simplest known progeny evaluation and quantitative genetics analyses as candidates’ offspring are assumed to represent independent half-sib families. The accuracy of genetic parameter estimates is often questioned as the assumption of “half-sibling” in OP families may often be violated. We compared the pedigree- vs. marker-based genetic models by analysing 22-yr height and 30-yr wood density for 214 white spruce [Picea glauca (Moench) Voss] OP families represented by 1694 individuals growing on one site in Quebec, Canada. Assuming half-sibling, the pedigree-based model was limited to estimating the additive genetic variances which, in turn, were grossly overestimated as they were confounded by very minor dominance and major additive-by-additive epistatic genetic variances. In contrast, the implemented genomic pairwise realized relationship models allowed the disentanglement of additive from all nonadditive factors through genetic variance decomposition. The marker-based models produced more realistic narrow-sense heritability estimates and, for the first time, allowed estimating the dominance and epistatic genetic variances from OP testing. In addition, the genomic models showed better prediction accuracies compared to pedigree models and were able to predict individual breeding values for new individuals from untested families, which was not possible using the pedigree-based model. Clearly, the use of marker-based relationship approach is effective in estimating the quantitative genetic parameters of complex traits even under simple and shallow pedigree structure. PMID:26801647

  2. Implementation of the Realized Genomic Relationship Matrix to Open-Pollinated White Spruce Family Testing for Disentangling Additive from Nonadditive Genetic Effects.

    PubMed

    Gamal El-Dien, Omnia; Ratcliffe, Blaise; Klápště, Jaroslav; Porth, Ilga; Chen, Charles; El-Kassaby, Yousry A

    2016-03-01

    The open-pollinated (OP) family testing combines the simplest known progeny evaluation and quantitative genetics analyses as candidates' offspring are assumed to represent independent half-sib families. The accuracy of genetic parameter estimates is often questioned as the assumption of "half-sibling" in OP families may often be violated. We compared the pedigree- vs. marker-based genetic models by analysing 22-yr height and 30-yr wood density for 214 white spruce [Picea glauca (Moench) Voss] OP families represented by 1694 individuals growing on one site in Quebec, Canada. Assuming half-sibling, the pedigree-based model was limited to estimating the additive genetic variances which, in turn, were grossly overestimated as they were confounded by very minor dominance and major additive-by-additive epistatic genetic variances. In contrast, the implemented genomic pairwise realized relationship models allowed the disentanglement of additive from all nonadditive factors through genetic variance decomposition. The marker-based models produced more realistic narrow-sense heritability estimates and, for the first time, allowed estimating the dominance and epistatic genetic variances from OP testing. In addition, the genomic models showed better prediction accuracies compared to pedigree models and were able to predict individual breeding values for new individuals from untested families, which was not possible using the pedigree-based model. Clearly, the use of marker-based relationship approach is effective in estimating the quantitative genetic parameters of complex traits even under simple and shallow pedigree structure. PMID:26801647

  3. Mitochondrial genome sequence and gene order of Sipunculus nudus give additional support for an inclusion of Sipuncula into Annelida

    PubMed Central

    Mwinyi, Adina; Meyer, Achim; Bleidorn, Christoph; Lieb, Bernhard; Bartolomaeus, Thomas; Podsiadlowski, Lars

    2009-01-01

    Background Mitochondrial genomes are a valuable source of data for analysing phylogenetic relationships. Besides sequence information, mitochondrial gene order may add phylogenetically useful information, too. Sipuncula are unsegmented marine worms, traditionally placed in their own phylum. Recent molecular and morphological findings suggest a close affinity to the segmented Annelida. Results The first complete mitochondrial genome of a member of Sipuncula, Sipunculus nudus, is presented. All 37 genes characteristic for metazoan mtDNA were detected and are encoded on the same strand. The mitochondrial gene order (protein-coding and ribosomal RNA genes) resembles that of annelids, but shows several derivations so far found only in Sipuncula. Sequence based phylogenetic analysis of mitochondrial protein-coding genes results in significant bootstrap support for Annelida sensu lato, combining Annelida together with Sipuncula, Echiura, Pogonophora and Myzostomida. Conclusion The mitochondrial sequence data support a close relationship of Annelida and Sipuncula. Also the most parsimonious explanation of changes in gene order favours a derivation from the annelid gene order. These results complement findings from recent phylogenetic analyses of nuclear encoded genes as well as a report of a segmental neural patterning in Sipuncula. PMID:19149868

  4. Genome-Wide Anaplasma phagocytophilum AnkA-DNA Interactions Are Enriched in Intergenic Regions and Gene Promoters and Correlate with Infection-Induced Differential Gene Expression

    PubMed Central

    Dumler, J. Stephen; Sinclair, Sara H.; Pappas-Brown, Valeria; Shetty, Amol C.

    2016-01-01

    Anaplasma phagocytophilum, an obligate intracellular prokaryote, infects neutrophils, and alters cardinal functions via reprogrammed transcription. Large contiguous regions of neutrophil chromosomes are differentially expressed during infection. Secreted A. phagocytophilum effector AnkA transits into the neutrophil or granulocyte nucleus to complex with DNA in heterochromatin across all chromosomes. AnkA binds to gene promoters to dampen cis-transcription and also has features of matrix attachment region (MAR)-binding proteins that regulate three-dimensional chromatin architecture and coordinate transcriptional programs encoded in topologically-associated chromatin domains. We hypothesize that identification of additional AnkA binding sites will better delineate how A. phagocytophilum infection results in reprogramming of the neutrophil genome. Using AnkA-binding ChIP-seq, we showed that AnkA binds broadly throughout all chromosomes in a reproducible pattern, especially at: (i) intergenic regions predicted to be MARs; (ii) within predicted lamina-associated domains; and (iii) at promoters ≤ 3000 bp upstream of transcriptional start sites. These findings provide genome-wide support for AnkA as a regulator of cis-gene transcription. Moreover, the dominant mark of AnkA in distal intergenic regions known to be AT-enriched, coupled with frequent enrichment in the nuclear lamina, provides strong support for its role as a MAR-binding protein and genome “re-organizer.” AnkA must be considered a prime candidate to promote neutrophil reprogramming and subsequent functional changes that belie improved microbial fitness and pathogenicity. PMID:27703927

  5. Impacts of Additional HONO Sources on Concentrations and Deposition of NOy in the Beijing-Tianjin-Hebei Region of China

    NASA Astrophysics Data System (ADS)

    Li, Ying; An, Junling; Kajino, Mizuo; Li, Jian; Qu, Yu

    2015-04-01

    Reactive nitrogen-containing compounds (NOy) are involved in many important chemical processes in the atmosphere, including aerosol formation as well as ozone (O3) production and destruction. As NOy deposition was increasing rapidly in China during 1980s ~ 2000s, great effort is urgently needed to reduce N deposition. HONO, an important component of NOy, is a significant precursor of the hydroxyl radical (OH) that drives the formation of O3 and fine particles (PM2.5). Nevertheless, the detailed formation mechanisms of HONO and strength of its sources remain unclear. Unknown HONO sources and their potential impacts on air quality have gained extensive interests but to our current knowledge, the impact of HONO sources on regional-scale deposition of NOy has not been quantified up to date. The goal of this work is to evaluate the effects of the additional HONO sources on concentrations and deposition of individual NOy species as well as the NOy budget in the northern Chinese regions being affected by heavy pollution. Simulations of HONO contributions over Beijing-Tianjin-Hebei region (BTH) during summer and winter periods of 2007 using the fully coupled Weather Research and Forecasting /Chemistry (WRF/Chem) model are performed by including three additional HONO sources: 1) the reaction of photo-excited nitrogen dioxide (NO2*) with water vapor, 2) NO2 heterogeneous reaction at the aerosol surfaces, and 3) HONO emissions. The model results show that the three additional HONO sources produce a 20%~40% (> 100%) increase in monthly-mean OH concentrations in many urban areas in August (February), leading to a 10%~40% (10%~100%) variation in monthly-mean concentrations of NOx, nitrate and PAN, a 5%~10% (10%~40%) increase in the total dry deposition of NOy, and an enhancement of 1.4 Gg N (1.5 Gg N) in the total of dry and wet deposition of NOy over this region in August (February). These results suggest that the additional HONO sources aggravate regional-scale acid deposition

  6. Comparative genomics reveals a functional thyroid-specific element in the far upstream region of the PAX8 gene

    PubMed Central

    2010-01-01

    Background The molecular mechanisms leading to a fully differentiated thyrocite are still object of intense study even if it is well known that thyroglobulin, thyroperoxidase, NIS and TSHr are the marker genes of thyroid differentiation. It is also well known that Pax8, TTF-1, Foxe1 and Hhex are the thyroid-enriched transcription factors responsible for the expression of the above genes, thus are responsible for the differentiated thyroid phenotype. In particular, the role of Pax8 in the fully developed thyroid gland was studied in depth and it was established that it plays a key role in thyroid development and differentiation. However, to date the bases for the thyroid-enriched expression of this transcription factor have not been unraveled yet. Here, we report the identification and characterization of a functional thyroid-specific enhancer element located far upstream of the Pax8 gene. Results We hypothesized that regulatory cis-acting elements are conserved among mammalian genes. Comparison of a genomic region extending for about 100 kb at the 5'-flanking region of the mouse and human Pax8 gene revealed several conserved regions that were tested for enhancer activity in thyroid and non-thyroid cells. Using this approach we identified one putative thyroid-specific regulatory element located 84.6 kb upstream of the Pax8 transcription start site. The in silico data were verified by promoter-reporter assays in thyroid and non-thyroid cells. Interestingly, the identified far upstream element manifested a very high transcriptional activity in the thyroid cell line PC Cl3, but showed no activity in HeLa cells. In addition, the data here reported indicate that the thyroid-enriched transcription factor TTF-1 is able to bind in vitro and in vivo the Pax8 far upstream element, and is capable to activate transcription from it. Conclusions Results of this study reveal the presence of a thyroid-specific regulatory element in the 5' upstream region of the Pax8 gene. The

  7. Genome-wide association and regional heritability mapping to identify loci underlying variation in nematode resistance and body weight in Scottish Blackface lambs.

    PubMed

    Riggio, V; Matika, O; Pong-Wong, R; Stear, M J; Bishop, S C

    2013-05-01

    The genetic architecture underlying nematode resistance and body weight in Blackface lambs was evaluated comparing genome-wide association (GWA) and regional heritability mapping (RHM) approaches. The traits analysed were faecal egg count (FEC) and immunoglobulin A activity against third-stage larvae from Teladorsagia circumcincta, as indicators of nematode resistance, and body weight in a population of 752 Scottish Blackface lambs, genotyped with the 50k single-nucleotide polymorphism (SNP) chip. FEC for both Nematodirus and Strongyles nematodes (excluding Nematodirus), as well as body weight were collected at approximately 16, 20 and 24 weeks of age. In addition, a weighted average animal effect was estimated for both FEC and body weight traits. After quality control, 44 388 SNPs were available for the GWA analysis and 42 841 for the RHM, which utilises only mapped SNPs. The same fixed effects were used in both analyses: sex, year, management group, litter size and age of dam, with day of birth as covariate. Some genomic regions of interest for both nematode resistance and body weight traits were identified, using both GWA and RHM approaches. For both methods, strong evidence for association was found on chromosome 14 for Nematodirus average animal effect, chromosome 6 for Strongyles FEC at 16 weeks and chromosome 6 for body weight at 16 weeks. Across the entire data set, RHM identified more regions reaching the suggestive level than GWA, suggesting that RHM is capable of capturing some of the variation not detected by GWA analyses.

  8. Identification of Salmonella enterica species- and subgroup-specific genomic regions using Panseq 2.0.

    PubMed

    Laing, Chad; Villegas, Andre; Taboada, Eduardo N; Kropinski, Andrew; Thomas, James E; Gannon, Victor P J

    2011-12-01

    The pan-genome of a taxonomic group consists of evolutionarily conserved core genes shared by all members and accessory genes that are present only in some members of the group. Group- and subgroup-specific core genes are thought to contribute to shared phenotypes such as virulence and niche specificity. In this study we analyzed 39 Salmonella enterica genomes (16 closed, 23 draft), a species that contains two human-specific serovars that cause typhoid fever, as well as a large number of zoonotic serovars that cause gastroenteritis in humans. Panseq 2.0 was used to define the pan-genome by adjusting the threshold at which group-specific "core" loci are defined. We found the pan-genome to be 9.03 Mbp in size, and that the core genome size decreased, while the number of SNPs/100 bp increased, as the number of strains used to define the core genome increased, suggesting substantial divergence among S. enterica subgroups. Subgroup-specific "core" genes, in contrast, had fewer SNPs/100 bp, likely reflecting their more recent acquisition. Phylogenetic trees were created from the concatenated and aligned pan-genome, the core genome, and multi-locus-sequence typing (MLST) loci. Branch support increased among the trees, and strains of the same serovar grouped closer together as the number of loci used to create the tree increased. Further, high levels of discrimination were achieved even amongst the most closely related strains of S. enterica Typhi, suggesting that the data generated by Panseq may also be of value in short-term epidemiological studies. Panseq provides an easy and fast way of performing pan-genomic analyses, which can include the identification of group-dominant as well as group-specific loci and is available as a web-server and a standalone version at http://lfz.corefacility.ca/panseq/.

  9. Gap Closing/Finishing by Targeted Genomic Region Enrichment and Sequencing

    SciTech Connect

    Singh, Kanwar; Froula, Jeff; Trice, Hope; Pennacchio, Len A.; Chen, Feng

    2010-05-27

    Gap Closing/Finishing of draft genome assemblies is a labor and cost intensive process where several rounds of repetitious amplification and sequencing are required. Here we demonstrate a high throughput procedure where custom primers flanking gaps in draft genomes are designed. Primer libraries containing up to 4,000 unique pairs in independent droplets are merged with a fragmented genomic template. From this millions of picoliter scale droplets are formed, each one being the functional equivalent of an individual PCR reaction. The PCR products are concatenated and sequenced by Illumina which is then assembled and used for gap closure. Here we present an overall experimental strategy, primer design algorithm and initial results.

  10. Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions

    SciTech Connect

    MacArthur, Stewart; Li, Xiao-Yong; Li, Jingyi; Brown, James B.; Chu, Hou Cheng; Zeng, Lucy; Grondona, Brandi P.; Hechmer, Aaron; Simirenko, Lisa; Keranen, Soile V.E.; Knowles, David W.; Stapleton, Mark; Bickel, Peter; Biggin, Mark D.; Eisen, Michael B.

    2009-05-15

    BACKGROUND: We previously established that six sequence-specific transcription factors that initiate anterior/posterior patterning in Drosophila bind to overlapping sets of thousands of genomic regions in blastoderm embryos. While regions bound at high levels include known and probable functional targets, more poorly bound regions are preferentially associated with housekeeping genes and/or genes not transcribed in the blastoderm, and are frequently found in protein coding sequences or in less conserved non-coding DNA, suggesting that many are likely non-functional. RESULTS: Here we show that an additional 15 transcription factors that regulate other aspects of embryo patterning show a similar quantitative continuum of function and binding to thousands of genomic regions in vivo. Collectively, the 21 regulators show a surprisingly high overlap in the regions they bind given that they belong to 11 DNA binding domain families, specify distinct developmental fates, and can act via different cis-regulatory modules. We demonstrate, however, that quantitative differences in relative levels of binding to shared targets correlate with the known biological and transcriptional regulatory specificities of these factors. CONCLUSIONS: It is likely that the overlap in binding of biochemically and functionally unrelated transcription factors arises from the high concentrations of these proteins in nuclei, which, coupled with their broad DNA binding specificities, directs them to regions of open chromatin. We suggest that most animal transcription factors will be found to show a similar broad overlapping pattern of binding in vivo, with specificity achieved by modulating the amount, rather than the identity, of bound factor.

  11. Long non-coding RNA containing ultraconserved genomic region 8 promotes bladder cancer tumorigenesis

    PubMed Central

    Durso, Montano; Romanelli, Alessandra; Avitabile, Concetta; De Cobelli, Ottavio; Messere, Anna; Bruzzese, Dario; Vannini, Ivan; Marinelli, Luciana; Novellino, Ettore; Zhang, Wei; Incoronato, Mariarosaria; Ilardi, Gennaro; Staibano, Stefania; Marra, Laura; Franco, Renato; Perdonà, Sisto; Terracciano, Daniela; Czerniak, Bogdan; Liguori, Giovanna L.; Colonna, Vincenza; Fabbri, Muller; Febbraio, Ferdinando

    2016-01-01

    Ultraconserved regions (UCRs) have been shown to originate non-coding RNA transcripts (T-UCRs) that have different expression profiles and play functional roles in the pathophysiology of multiple cancers. The relevance of these functions to the pathogenesis of bladder cancer (BlCa) is speculative. To elucidate this relevance, we first used genome-wide profiling to evaluate the expression of T-UCRs in BlCa tissues. Analysis of two datasets comprising normal bladder tissues and BlCa specimens with a custom T-UCR microarray identified ultraconserved RNA (uc.) 8+ as the most upregulated T-UCR in BlCa tissues, although its expression was lower than in pericancerous bladder tissues. These results were confirmed on BlCa tissues by real-time PCR and by in situ hybridization. Although uc.8+ is located within intron 1 of CASZ1, a zinc-finger transcription factor, the transcribed non-coding RNA encoding uc.8+ is expressed independently of CASZ1. In vitro experiments evaluating the effects of uc.8+ silencing, showed significantly decreased capacities for cancer cell invasion, migration, and proliferation. From this, we proposed and validated a model of interaction in which uc.8+ shuttles from the nucleus to the cytoplasm of BlCa cells, interacts with microRNA (miR)-596, and cooperates in the promotion and development of BlCa. Using computational analysis, we investigated the miR-binding domain accessibility, as determined by base-pairing interactions within the uc.8+ predicted secondary structure, RNA binding affinity, and RNA species abundance in bladder tissues and showed that uc.8+ is a natural decoy for miR-596. Thus uc.8+ upregulation results in increased expression of MMP9, increasing the invasive potential of BlCa cells. These interactions between evolutionarily conserved regions of DNA suggest that natural selection has preserved this potentially regulatory layer that uses RNA to modulate miR levels, opening up the possibility for development of useful markers for

  12. Long non-coding RNA containing ultraconserved genomic region 8 promotes bladder cancer tumorigenesis.

    PubMed

    Olivieri, Michele; Ferro, Matteo; Terreri, Sara; Durso, Montano; Romanelli, Alessandra; Avitabile, Concetta; De Cobelli, Ottavio; Messere, Anna; Bruzzese, Dario; Vannini, Ivan; Marinelli, Luciana; Novellino, Ettore; Zhang, Wei; Incoronato, Mariarosaria; Ilardi, Gennaro; Staibano, Stefania; Marra, Laura; Franco, Renato; Perdonà, Sisto; Terracciano, Daniela; Czerniak, Bogdan; Liguori, Giovanna L; Colonna, Vincenza; Fabbri, Muller; Febbraio, Ferdinando; Calin, George A; Cimmino, Amelia

    2016-04-12

    Ultraconserved regions (UCRs) have been shown to originate non-coding RNA transcripts (T-UCRs) that have different expression profiles and play functional roles in the pathophysiology of multiple cancers. The relevance of these functions to the pathogenesis of bladder cancer (BlCa) is speculative. To elucidate this relevance, we first used genome-wide profiling to evaluate the expression of T-UCRs in BlCa tissues. Analysis of two datasets comprising normal bladder tissues and BlCa specimens with a custom T-UCR microarray identified ultraconserved RNA (uc.) 8+ as the most upregulated T-UCR in BlCa tissues, although its expression was lower than in pericancerous bladder tissues. These results were confirmed on BlCa tissues by real-time PCR and by in situ hybridization. Although uc.8+ is located within intron 1 of CASZ1, a zinc-finger transcription factor, the transcribed non-coding RNA encoding uc.8+ is expressed independently of CASZ1. In vitro experiments evaluating the effects of uc.8+ silencing, showed significantly decreased capacities for cancer cell invasion, migration, and proliferation. From this, we proposed and validated a model of interaction in which uc.8+ shuttles from the nucleus to the cytoplasm of BlCa cells, interacts with microRNA (miR)-596, and cooperates in the promotion and development of BlCa. Using computational analysis, we investigated the miR-binding domain accessibility, as determined by base-pairing interactions within the uc.8+ predicted secondary structure, RNA binding affinity, and RNA species abundance in bladder tissues and showed that uc.8+ is a natural decoy for miR-596. Thus uc.8+ upregulation results in increased expression of MMP9, increasing the invasive potential of BlCa cells. These interactions between evolutionarily conserved regions of DNA suggest that natural selection has preserved this potentially regulatory layer that uses RNA to modulate miR levels, opening up the possibility for development of useful markers for

  13. Identification of conserved genomic regions and variation therein amongst Cetartiodactyla species using next generation sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background Next Generation Sequencing has created an opportunity to genetically characterize an individual both inexpensively and comprehensively. In earlier work produced in our collaboration [1], it was demonstrated that, for animals without a reference genome, their Next Generation Sequence data ...

  14. Repetitive genomic insertion of gene-sized dsDNAs by targeting the promoter region of a counter-selectable marker

    PubMed Central

    Jeong, Jaehwan; Seo, Han Na; Jung, Yu Kyung; Lee, Jeewon; Ryu, Gyuri; Lee, Wookjae; Kwon, Euijin; Ryoo, Keunsoo; Kim, Jungyeon; Cho, Hwa-Young; Cho, Kwang Myung; Park, Jin Hwan; Bang, Duhee

    2015-01-01

    Genome engineering can be used to produce bacterial strains with a wide range of desired phenotypes. However, the incorporation of gene-sized DNA fragments is often challenging due to the intricacy of the procedure, off-target effects, and low insertion efficiency. Here we report a genome engineering method enabling the continuous incorporation of gene-sized double-stranded DNAs (dsDNAs) into the Escherichia coli genome. DNA substrates are inserted without introducing additional marker genes, by synchronously turning an endogenous counter-selectable marker gene ON and OFF. To accomplish this, we utilized λ Red protein-mediated recombination to insert dsDNAs within the promoter region of a counter-selectable marker gene, tolC. By repeatedly switching the marker gene ON and OFF, a number of desired gene-sized dsDNAs can be inserted consecutively. With this method, we successfully inserted approximately 13 kb gene clusters to generate engineered E. coli strains producing 1,4-butanediol (1,4-BDO). PMID:25736821

  15. Structural complexity of Dengue virus untranslated regions: cis-acting RNA motifs and pseudoknot interactions modulating functionality of the viral genome

    PubMed Central

    Sztuba-Solinska, Joanna; Teramoto, Tadahisa; Rausch, Jason W.; Shapiro, Bruce A.; Padmanabhan, Radhakrishnan; Le Grice, Stuart F. J.

    2013-01-01

    The Dengue virus (DENV) genome contains multiple cis-acting elements required for translation and replication. Previous studies indicated that a 719-nt subgenomic minigenome (DENV-MINI) is an efficient template for translation and (−) strand RNA synthesis in vitro. We performed a detailed structural analysis of DENV-MINI RNA, combining chemical acylation techniques, Pb2+ ion-induced hydrolysis and site-directed mutagenesis. Our results highlight protein-independent 5′–3′ terminal interactions involving hybridization between recognized cis-acting motifs. Probing analyses identified tandem dumbbell structures (DBs) within the 3′ terminus spaced by single-stranded regions, internal loops and hairpins with embedded GNRA-like motifs. Analysis of conserved motifs and top loops (TLs) of these dumbbells, and their proposed interactions with downstream pseudoknot (PK) regions, predicted an H-type pseudoknot involving TL1 of the 5′ DB and the complementary region, PK2. As disrupting the TL1/PK2 interaction, via ‘flipping’ mutations of PK2, previously attenuated DENV replication, this pseudoknot may participate in regulation of RNA synthesis. Computer modeling implied that this motif might function as autonomous structural/regulatory element. In addition, our studies targeting elements of the 3′ DB and its complementary region PK1 indicated that communication between 5′–3′ terminal regions strongly depends on structure and sequence composition of the 5′ cyclization region. PMID:23531545

  16. Copy number variation-based genome wide association study reveals additional variants contributing to meat quality in Swine

    PubMed Central

    Wang, Ligang; Xu, Lingyang; Liu, Xin; Zhang, Tian; Li, Na; Hay, El Hamidi; Zhang, Yuebo; Yan, Hua; Zhao, Kebin; Liu, George E; Zhang, Longchao; Wang, Lixian

    2015-01-01

    Pork quality is important both to the meat processing industry and consumers’ purchasing attitude. Copy number variation (CNV) is a burgeoning kind of variants that may influence meat quality. In this study, a genome-wide association study (GWAS) was performed between CNVs and meat quality traits in swine. After false discovery rate (FDR) correction, a total of 8 CNVs on 6 chromosomes were identified to be significantly associated with at least one meat quality trait. All of the 8 CNVs were verified by next generation sequencing and six of them were verified by qPCR. Only the haplotype block containing CNV12 is adjacent to significant SNPs associated with meat quality, suggesting the effects of those CNVs were not likely captured by tag SNPs. The DNA dosage and EST expression of CNV12, which overlap with an obesity related gene Netrin-1 (Ntn1), were consistent with Ntn1 RNA expression, suggesting the CNV12 might be involved in the expression regulation of Ntn1 and finally influence meat quality. We concluded that CNVs may contribute to the genetic variations of meat quality beyond SNPs, and several candidate CNVs were worth further exploration. PMID:26234186

  17. First complete genome sequence of a capsicum chlorosis tospovirus isolate from Australia with an unusually large S RNA intergenic region.

    PubMed

    Widana Gamage, Shirani; Persley, Denis M; Higgins, Colleen M; Dietzgen, Ralf G

    2015-03-01

    The first complete genome sequence of capsicum chlorosis virus (CaCV) from Australia was determined using a combination of Illumina HiSeq RNA and Sanger sequencing technologies. Australian CaCV had a tripartite genome structure like other CaCV isolates. The large (L) RNA was 8913 nucleotides (nt) in length and contained a single open reading frame (ORF) of 8634 nt encoding a predicted RNA-dependent RNA polymerase (RdRp) in the viral-complementary (vc) sense. The medium (M) and small (S) RNA segments were 4846 and 3944 nt in length, respectively, each containing two non-overlapping ORFs in ambisense orientation, separated by intergenic regions (IGR). The M segment contained ORFs encoding the predicted non-structural movement protein (NSm; 927 nt) and precursor of glycoproteins (GP; 3366 nt) in the viral sense (v) and vc strand, respectively, separated by a 449-nt IGR. The S segment coded for the predicted nucleocapsid (N) protein (828 nt) and non-structural suppressor of silencing protein (NSs; 1320 nt) in the vc and v strand, respectively. The S RNA contained an IGR of 1663 nt, being the largest IGR of all CaCV isolates sequenced so far. Comparison of the Australian CaCV genome with complete CaCV genome sequences from other geographic regions showed highest sequence identity with a Taiwanese isolate. Genome sequence comparisons and phylogeny of all available CaCV isolates provided evidence for at least two highly diverged groups of CaCV isolates that may warrant re-classification of AIT-Thailand and CP-China isolates as unique tospoviruses, separate from CaCV.

  18. Evolution of the vertebrate genome as reflected in paralogous chromosomal regions in man and the house mouse

    SciTech Connect

    Lundin, L.G. )

    1993-04-01

    Gene constellations on several human chromosomes are interpreted as indications of large regional duplications that took place during evolution of the vertebrate genome. Four groups of paralogous chromosomal regions in man and the house mouse are suggested and are believed to be conserved remnants of the two or three rounds of tetraploidization that are likely to have occurred during evolution of the vertebrates. The phenomenon of differential silencing of genes is described. The importance of conservation of linkage of particular genes is discussed in relation to genetic regulation and cell differentiation. 120 refs., 5 tabs.

  19. Draft Genome Sequence of Bacillus sp. GZT, a 2,4,6-Tribromophenol-Degrading Strain Isolated from the River Sludge of an Electronic Waste-Dismantling Region

    PubMed Central

    Liang, Zhishu; Li, Guiying; Das, Ranjit

    2016-01-01

    Here, we report the draft genome sequence of Bacillus sp. strain GZT, a 2,4,6-tribromophenol (TBP)-degrading bacterium previously isolated from an electronic waste-dismantling region. The draft genome sequence is 5.18 Mb and has a G+C content of 35.1%. This is the first genome report of a brominated flame retardant-degrading strain. PMID:27257197

  20. Comparative genome mapping of the ataxia-telangiectasia region in mouse, rat, and Syrian hamster

    SciTech Connect

    Matsuda, Yoichi; Imai, Takashi; Shiomi, Tadahiro

    1996-06-15

    Chromosomal locations of the Atm (ataxia-telangiectasia (AT)-mutated) and Acat1 (mitochondrial acetoacetyl-CoA thiolase) genes in mouse, rat, and Syrian hamster were determined by direct R-banding FISH. Both genes were colocalized to the C-D band of mouse chromosome 9, the proximal end of q24.1 of ra chromosome 8, and qa4-aq5 of Syrian hamster chromosome 12. The regions in the mouse and rat were homologous to human chromosome 11q. Fine genetic linkage mapping of the mouse AT region was performed using the interspecific backcross mice. Atm, Acat1, and Npat, which is a new gene isolated from the AT region, and 12 flanking microsatellite DNA markers were examined. No recombinations were found among the Atm, Npat, Acat1, and D9Mit6 loci, and these loci were mapped 2.0 cM distal to D9Mit99 and 1.3 cM proximal to D9Mit102. Comparison of the linkage map of mouse chromosome 9 (MMU9) and that of human chromosome 11 (HSA11) indicates that there is a chromosomal rearrangement due to an inversion between Ets1 and Atm-Npat-Acat1 and that the inversion of MMu9 originated from the chromosomal breakage at the boundary between Gria4 and Atm-Npat-Acat1 on HSA11. This type of inversion appeared to be conserved in the three rodent species, mouse, rat, and Syrian hamster, using additional comparative mapping data with the Rck gene. 27 refs., 3 figs.

  1. Genome-wide genetic diversity and differentially selected regions among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep.

    PubMed

    Zhang, Lifan; Mousel, Michelle R; Wu, Xiaolin; Michal, Jennifer J; Zhou, Xiang; Ding, Bo; Dodson, Michael V; El-Halawany, Nermin K; Lewis, Gregory S; Jiang, Zhihua

    2013-01-01

    Sheep are among the major economically important livestock species worldwide because the animals produce milk, wool, skin, and meat. In the present study, the Illumina OvineSNP50 BeadChip was used to investigate genetic diversity and genome selection among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds from the United States. After quality-control filtering of SNPs (single nucleotide polymorphisms), we used 48,026 SNPs, including 46,850 SNPs on autosomes that were in Hardy-Weinberg equilibrium and 1,176 SNPs on chromosome × for analysis. Phylogenetic analysis based on all 46,850 SNPs clearly separated Suffolk from Rambouillet, Columbia, Polypay, and Targhee, which was not surprising as Rambouillet contributed to the synthesis of the later three breeds. Based on pair-wise estimates of F(ST), significant genetic differentiation appeared between Suffolk and Rambouillet (F(ST) = 0.1621), while Rambouillet and Targhee had the closest relationship (F(ST) = 0.0681). A scan of the genome revealed 45 and 41 differentially selected regions (DSRs) between Suffolk and Rambouillet and among Rambouillet-related breed populations, respectively. Our data indicated that regions 13 and 24 between Suffolk and Rambouillet might be good candidates for evaluating breed differences. Furthermore, ovine genome v3.1 assembly was used as reference to link functionally known homologous genes to economically important traits covered by these differentially selected regions. In brief, our present study provides a comprehensive genome-wide view on within- and between-breed genetic differentiation, biodiversity, and evolution among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds. These results may provide new guidance for the synthesis of new breeds with different breeding objectives.

  2. Genome-Wide Genetic Diversity and Differentially Selected Regions among Suffolk, Rambouillet, Columbia, Polypay, and Targhee Sheep

    PubMed Central

    Zhang, Lifan; Mousel, Michelle R.; Wu, Xiaolin; Michal, Jennifer J.; Zhou, Xiang; Ding, Bo; Dodson, Michael V.; El-Halawany, Nermin K.; Lewis, Gregory S.; Jiang, Zhihua

    2013-01-01

    Sheep are among the major economically important livestock species worldwide because the animals produce milk, wool, skin, and meat. In the present study, the Illumina OvineSNP50 BeadChip was used to investigate genetic diversity and genome selection among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds from the United States. After quality-control filtering of SNPs (single nucleotide polymorphisms), we used 48,026 SNPs, including 46,850 SNPs on autosomes that were in Hardy-Weinberg equilibrium and 1,176 SNPs on chromosome × for analysis. Phylogenetic analysis based on all 46,850 SNPs clearly separated Suffolk from Rambouillet, Columbia, Polypay, and Targhee, which was not surprising as Rambouillet contributed to the synthesis of the later three breeds. Based on pair-wise estimates of FST, significant genetic differentiation appeared between Suffolk and Rambouillet (FST = 0.1621), while Rambouillet and Targhee had the closest relationship (FST = 0.0681). A scan of the genome revealed 45 and 41 differentially selected regions (DSRs) between Suffolk and Rambouillet and among Rambouillet-related breed populations, respectively. Our data indicated that regions 13 and 24 between Suffolk and Rambouillet might be good candidates for evaluating breed differences. Furthermore, ovine genome v3.1 assembly was used as reference to link functionally known homologous genes to economically important traits covered by these differentially selected regions. In brief, our present study provides a comprehensive genome-wide view on within- and between-breed genetic differentiation, biodiversity, and evolution among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds. These results may provide new guidance for the synthesis of new breeds with different breeding objectives. PMID:23762451

  3. Genetic variation between Schistosoma japonicum lineages from lake and mountainous regions in China revealed by resequencing whole genomes.

    PubMed

    Yin, Mingbo; Liu, Xiao; Xu, Bin; Huang, Jian; Zheng, Qi; Yang, Zhong; Feng, Zheng; Han, Ze-Guang; Hu, Wei

    2016-09-01

    Schistosoma infection is a major cause of morbidity and mortality worldwide. Schistosomiasis japonica is endemic in mainland China along the Yangtze River, typically distributed in two geographical categories of lake and mountainous regions. Study on schistosome genetic diversity is of interest in respect of understanding parasite biology and transmission, and formulating control strategy. Certain genetic variations may be associated with adaptations to different ecological habitats. The aim of this study is to gain insight into Schistosoma japonicum genetic variation, evolutionary origin and associated causes of different geographic lineages through examining homozygous Single Nucleotide Polymorphisms (SNPs) based on resequenced genome data. We collected S. japonicum samples from four sites, three in the lake regions (LR) of mid-east (Guichi and Tonglin in Anhui province, Laogang in Hunan province) and one in mountainous region (MR) (Xichang in Sichuan province) of south-west of China, resequenced their genomes using Next Generation Sequencing (NGS) technology, and made use of the available database of S. japonicum draft genomic sequence as a reference in genome mapping. A total of 14,575 SNPs from 2059 genes were identified in the four lineages. Phylogenetic analysis confirmed significant genetic variation exhibited between the different geographical lineages, and further revealed that the MR Xichang lineage is phylogenetically closer to LR Guich lineage than to other two LR lineages, and the MR lineage might be evolved from LR lineages. More than two thirds of detected SNPs were nonsynonymous; functional annotation of the SNP-containing genes showed that they are involved mainly in biological processes such as signaling and response to stimuli. Notably, unique nonsynonymous SNP variations were detected in 66 genes of MR lineage, inferring possible genetic adaption to mountainous ecological condition. PMID:27207135

  4. “Replicated” genome wide association for dependence on illegal substances: genomic regions identified by overlapping clusters of nominally positive SNPs

    PubMed Central

    Drgon, Tomas; Johnson, Catherine; Nino, Michelle; Drgonova, Jana; Walther, Donna; Uhl, George R

    2010-01-01

    Declaring “replication” from results of genome wide association (GWA) studies is straightforward when major gene effects provide genome-wide significance for association of the same allele of the same SNP in each of multiple independent samples. However, such unambiguous replication may be unlikely when phenotypes display polygenic genetic architecture, allelic heterogeneity, locus heterogeneity and when different samples display linkage disequilibria with different fine structures. We seek chromosomal regions that are tagged by clustered SNPs that display nominally-significant association in each of several independent samples. This approach provides one “nontemplate” approach to identifying overall replication of groups of GWA results in the face of difficult genetic architectures. We apply this strategy to 1M SNP Affymetrix and Illumina GWA results for dependence on illegal substances. This approach provides high confidence in rejecting the null hypothesis that chance alone accounts for the extent to which clustered, nominally-significant SNPs from samples of the same racial/ethnic background identify the same chromosomal regions. There is more modest confidence in: a) identification of individual chromosomal regions and genes and b) overlap between results from samples of different racial/ethnic backgrounds. The strong overlap identified among the samples with similar racial/ethnic backgrounds, together with prior work that identified overlapping results in samples of different racial/ethnic backgrounds, support contributions to individual differences in vulnerability to addictions that come from both relatively older allelic variants that are common in many current human populations and newer allelic variants that are common in fewer current human populations. PMID:21302341

  5. Genomic Regions Identified by Overlapping Clusters of Nominally-Positive SNPs from Genome-Wide Studies of Alcohol and Illegal Substance Dependence

    PubMed Central

    Johnson, Catherine; Drgon, Tomas; Walther, Donna; Uhl, George R.

    2011-01-01

    Declaring “replication” from results of genome wide association (GWA) studies is straightforward when major gene effects provide genome-wide significance for association of the same allele of the same SNP in each of multiple independent samples. However, such unambiguous replication is unlikely when phenotypes display polygenic genetic architecture, allelic heterogeneity, locus heterogeneity and when different samples display linkage disequilibria with different fine structures. We seek chromosomal regions that are tagged by clustered SNPs that display nominally-significant association in each of several independent samples. This approach provides one “nontemplate” approach to identifying overall replication of groups of GWA results in the face of difficult genetic architectures. We apply this strategy to 1 M SNP GWA results for dependence on: a) alcohol (including many individuals with dependence on other addictive substances) and b) at least one illegal substance (including many individuals dependent on alcohol). This approach provides high confidence in rejecting the null hypothesis that chance alone accounts for the extent to which clustered, nominally-significant SNPs from samples of the same racial/ethnic background identify the same sets of chromosomal regions. It identifies several genes that are also reported in other independent alcohol-dependence GWA datasets. There is more modest confidence in: a) identification of individual chromosomal regions and genes that are not also identified by data from other independent samples, b) the more modest overlap between results from samples of different racial/ethnic backgrounds and c) the extent to which any gene not identified herein is excluded, since the power of each of these individual samples is modest. Nevertheless, the strong overlap identified among the samples with similar racial/ethnic backgrounds supports contributions to individual differences in vulnerability to addictions that come from newer

  6. Comparison of false-discovery rate for genome-wide and fine mapping regions.

    PubMed

    Tabangin, Meredith E; Woo, Jessica G; Liu, Chunyan; Nick, Todd G; Martin, Lisa J

    2007-01-01

    With technological advances in high-throughput genotyping, it is not unusual to perform hundreds of thousands of tests for each phenotype. Thus, correction to control type I error is essential. The false-discovery rate (FDR) has been successfully used in genome-wide expression data. However, its performance has not been evaluated for association analysis. Our objective was to analyze the Genetic Analysis Workshop 15 simulated data set, with answers, to evaluate FDR for genome-wide association and fine mapping. In genome-wide analysis, FDR performed well, with good localization of positive results. However, in fine mapping, all tested methods performed poorly, producing a high proportion of significant results. Thus, caution should be used when employing FDR for fine mapping.

  7. Organization, structure and evolution of 41kb of genomic DNA spanning the D-J-C region of the sheep TRB locus.

    PubMed

    Antonacci, R; Di Tommaso, S; Lanave, C; Cribiu, E P; Ciccarese, S; Massari, S

    2008-01-01

    A genomic region of 41,045 bp encompassing the 3'-end of the sheep T cell receptor beta chain was sequenced. Extensive molecular analysis has revealed that this region retains a unique structural feature for the presence of a third D-J-C cluster, never detected in any other mammalian species examined so far. A total of 3 TRBD, 18 TRBJ and 3 substantially identical TRBC genes were identified in about 28kb. At 13kb, downstream from the last TRBC gene, in an inverted transcriptional orientation, lies a TRBV gene. Sequence comparison and phylogenetic analyses have demonstrated that the extra D-J-C cluster originated from an unequal crossing over between the two ancestral TRBC genes. Interspersed repeats spanning 22.2% of the sequence, contribute to the wider size of the sheep TRB locus with respect to the other mammalian counterparts, without modifying the general genomic architecture. The nucleotide and predicted amino acid sequences from peripheral T cells cDNA clones indicated that the genes from cluster 3 are fully implicated in the beta chain recombination machinery. Closer inspections of the transcripts have also shown that inter-cluster rearrangements and splice variants, involving the additional cluster, increase the functional diversity of the sheep beta chain repertoire.

  8. Analysis of Genomic Regions Associated With Coronary Artery Disease Reveals Continent-Specific Single Nucleotide Polymorphisms in North African Populations

    PubMed Central

    Zanetti, Daniela; Via, Marc; Carreras-Torres, Robert; Esteban, Esther; Chaabani, Hassen; Anaibar, Fatima; Harich, Nourdin; Habbal, Rachida; Ghalim, Noreddine; Moral, Pedro

    2016-01-01

    Background In recent years, several genomic regions have been robustly associated with coronary artery disease (CAD) in different genome-wide association studies (GWASs) conducted mainly in people of European descent. These kinds of data are lacking in African populations, even though heart diseases are a major cause of premature death and disability. Methods Here, 384 single nucleotide polymorphisms (SNPs) in the top four CAD risk regions (1p13, 1q41, 9p21, and 10q11) were genotyped in 274 case-control samples from Morocco and Tunisia, with the aim of analyzing for the first time if the associations found in European populations were transferable to North Africans. Results The results indicate that, as in Europe, these four genetic regions are also important for CAD risk in North Africa. However, the individual SNPs associated with CAD in Africa are different from those identified in Europe in most cases (1p13, 1q41, and 9p21). Moreover, the seven risk variants identified in North Africans are efficient in discriminating between cases and controls in North African populations, but not in European populations. Conclusions This study indicates a disparity in markers associated to CAD susceptibility between North Africans and Europeans that may be related to population differences in the chromosomal architecture of these risk regions. PMID:26780859

  9. A genetic analysis of the 63E-64A genomic region of Drosophila melanogaster: Identification of mutations in a replication factor C subunit

    SciTech Connect

    Harrison, S.D.; Solomon, N.; Rubin, G.M.

    1995-04-01

    We have performed an F{sub 2} genetic screen to identify lethal mutations within the 63E-64A genomic region. We have isolated 122 mutations in 20 different complementation groups. Of these groups, 16 are represented by multiple alleles. We have also established that the Rop and Ras2 genes are located within the 63E-64A genomic domain at 64A10,11. We have sequenced 10.2 kb of DNA surrounding this gene pair and find that in addition to Rop and Ras2 there is another gene located within this DNA sequence. The gene product, which we have named Rfc40, shows 68% identity to the 40-kDa subunit of replication factor C. We find that the members of one complementation group (13 alleles) derived from our screen correspond to mutations in the Rop gene, whereas the members of another (five alleles) correspond to mutations in the Rfc40 gene. In addition we have isolated 11 new mutant alleles of the disembodied gene. 40 refs., 5 figs., 1 tab.

  10. Integration of multiethnic fine-mapping and genomic annotation to prioritize candidate functional SNPs at prostate cancer susceptibility regions

    PubMed Central

    Han, Ying; Hazelett, Dennis J.; Wiklund, Fredrik; Schumacher, Fredrick R.; Stram, Daniel O.; Berndt, Sonja I.; Wang, Zhaoming; Rand, Kristin A.; Hoover, Robert N.; Machiela, Mitchell J.; Yeager, Merideth; Burdette, Laurie; Chung, Charles C.; Hutchinson, Amy; Yu, Kai; Xu, Jianfeng; Travis, Ruth C.; Key, Timothy J.; Siddiq, Afshan; Canzian, Federico; Takahashi, Atsushi; Kubo, Michiaki; Stanford, Janet L.; Kolb, Suzanne; Gapstur, Susan M.; Diver, W. Ryan; Stevens, Victoria L.; Strom, Sara S.; Pettaway, Curtis A.; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Eeles, Rosalind A.; Yeboah, Edward D.; Tettey, Yao; Biritwum, Richard B.; Adjei, Andrew A.; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P.; Isaacs, William B.; Chen, Constance; Lindstrom, Sara; Le Marchand, Loic; Giovannucci, Edward L.; Pomerantz, Mark; Long, Henry; Li, Fugen; Ma, Jing; Stampfer, Meir; John, Esther M.; Ingles, Sue A.; Kittles, Rick A.; Murphy, Adam B.; Blot, William J.; Signorello, Lisa B.; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, M. Cristina; Wu, Suh-Yuh; Hennis, Anselm J. M.; Rybicki, Benjamin A.; Neslund-Dudas, Christine; Hsing, Ann W.; Chu, Lisa; Goodman, Phyllis J.; Klein, Eric A.; Zheng, S. Lilly; Witte, John S.; Casey, Graham; Riboli, Elio; Li, Qiyuan; Freedman, Matthew L.; Hunter, David J.; Gronberg, Henrik; Cook, Michael B.; Nakagawa, Hidewaki; Kraft, Peter; Chanock, Stephen J.; Easton, Douglas F.; Henderson, Brian E.; Coetzee, Gerhard A.; Conti, David V.; Haiman, Christopher A.

    2015-01-01

    Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10−4–5.6 × 10−3) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P < 5.0 × 10−6) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation. PMID:26162851

  11. Integration of multiethnic fine-mapping and genomic annotation to prioritize candidate functional SNPs at prostate cancer susceptibility regions.

    PubMed

    Han, Ying; Hazelett, Dennis J; Wiklund, Fredrik; Schumacher, Fredrick R; Stram, Daniel O; Berndt, Sonja I; Wang, Zhaoming; Rand, Kristin A; Hoover, Robert N; Machiela, Mitchell J; Yeager, Merideth; Burdette, Laurie; Chung, Charles C; Hutchinson, Amy; Yu, Kai; Xu, Jianfeng; Travis, Ruth C; Key, Timothy J; Siddiq, Afshan; Canzian, Federico; Takahashi, Atsushi; Kubo, Michiaki; Stanford, Janet L; Kolb, Suzanne; Gapstur, Susan M; Diver, W Ryan; Stevens, Victoria L; Strom, Sara S; Pettaway, Curtis A; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Eeles, Rosalind A; Yeboah, Edward D; Tettey, Yao; Biritwum, Richard B; Adjei, Andrew A; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P; Isaacs, William B; Chen, Constance; Lindstrom, Sara; Le Marchand, Loic; Giovannucci, Edward L; Pomerantz, Mark; Long, Henry; Li, Fugen; Ma, Jing; Stampfer, Meir; John, Esther M; Ingles, Sue A; Kittles, Rick A; Murphy, Adam B; Blot, William J; Signorello, Lisa B; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, M Cristina; Wu, Suh-Yuh; Hennis, Anselm J M; Rybicki, Benjamin A; Neslund-Dudas, Christine; Hsing, Ann W; Chu, Lisa; Goodman, Phyllis J; Klein, Eric A; Zheng, S Lilly; Witte, John S; Casey, Graham; Riboli, Elio; Li, Qiyuan; Freedman, Matthew L; Hunter, David J; Gronberg, Henrik; Cook, Michael B; Nakagawa, Hidewaki; Kraft, Peter; Chanock, Stephen J; Easton, Douglas F; Henderson, Brian E; Coetzee, Gerhard A; Conti, David V; Haiman, Christopher A

    2015-10-01

    Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10(-4)-5.6 × 10(-3)) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P < 5.0 × 10(-6)) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation.

  12. [Analysis of DNA polymorphism in populations of the Volga-Ural region using genome fingerprinting with phage M13 DNA].

    PubMed

    Khusnutdinova, E K; Khidiiatova, I M; Viktorova, T V; Fatkhlislamova, R I; Limborskaia, S A

    1999-04-01

    The hyperpolymorphism of minisatellite DNA hybridizing with DNA of bacteriophage M13 was analyzed in seven Turkic and Finno-Ugric populations from the Volga-Urals region. In total, hybridization revealed 80 BspRI genomic DNA fragments ranging in size from 1.7 to 10 kb; the average frequency of an individual fragment was 0.299 +/- 0.020. The average number of hybridization fragments per pattern (varying from 14 to 20 in different populations) and frequencies of individual fragments showed significant interpopulation differences. Parameters of this polymorphic system were assumed to reflect phenotypic diversity of populations. Genome fingerprinting with the use of phage M13 can be employed in the studies of population genetic structure and differentiation and in forensic medicine, for more accurate personal identification.

  13. Whole genome sequence analyses of Xylella fastidiosa PD strains from different geographical regions

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome sequences were determined for two Pierce’s disease (PD) causing Xylella fastidiosa (Xf) strains, one from Florida and one from Taiwan. The Florida strain was ATCC 35879, the type of strain used as a standard reference for related taxonomy research. By contrast, the Taiwan strain used was only...

  14. Comparative genomics of Campylobacter iguaniorum to unravel genetic regions associated with reptilian hosts

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Campylobacter iguaniorum is genetically related to the species C. fetus, C. hyointestinalis, and C. lanienae. Reptiles, chelonians and lizards in particular, appear to be the primary reservoir of this Campylobacter species. Here we report the genome comparison of C. iguaniorum strain 1485E, isolated...

  15. Recent artificial selection in U.S. Jersey cattle impacts autozygosity levels of specific genomic regions

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome signatures of artificial selection in U.S. Jersey cattle were identified by examining changes in haplotype homozygosity for a resource population of animals born between 1962 and 2005. Genetic merit of this population changed dramatically during this period for a number of traits, especially ...

  16. Development and validation of new SSR markers from expressed regions in the garlic genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Limited number of simple sequence repeat (SSR) markers is available for the genome of garlic (Allium sativum L.) although SSR markers have become one of the most preferred marker systems because they are typically co-dominant, reproducible, cross species transferable and highly polymorphic. In this ...

  17. A pathogenicity determinant maps to the N-terminal coat protein region of the Pepino mosaic virus genome.

    PubMed

    Duff-Farrier, Celia R A; Bailey, Andy M; Boonham, Neil; Foster, Gary D

    2015-04-01

    Pepino mosaic virus (PepMV) poses a worldwide threat to the tomato industry. Considerable differences at the genetic level allow for the distinction of four main genotypic clusters; however, the basis of the phenotypic outcome is difficult to elucidate. This work reports the generation of wild-type PepMV infectious clones of both EU (mild) and CH2 (aggressive) genotypes, from which chimeric infectious clones were created. Phenotypic analysis in three solanaceous hosts, Nicotiana benthamiana, Datura stramonium and Solanum lycopersicum, indicated that a PepMV pathogenicity determinant mapped to the 3'-terminal region of the genome. Increased aggression was only observed in N. benthamiana, showing that this factor is host specific. The determinant was localized to amino acids 11-26 of the N-terminal coat protein (CP) region; this is the first report of this region functioning as a virulence factor in PepMV. PMID:25131553

  18. Isolation of a Genomic Region Affecting Most Components of Metabolic Syndrome in a Chromosome-16 Congenic Rat Model

    PubMed Central

    Šedová, Lucie; Pravenec, Michal; Křenová, Drahomíra; Kazdová, Ludmila; Zídek, Václav; Krupková, Michaela; Liška, František; Křen, Vladimír; Šeda, Ondřej

    2016-01-01

    Metabolic syndrome is a highly prevalent human disease with substantial genomic and environmental components. Previous studies indicate the presence of significant genetic determinants of several features of metabolic syndrome on rat chromosome 16 (RNO16) and the syntenic regions of human genome. We derived the SHR.BN16 congenic strain by introgression of a limited RNO16 region from the Brown Norway congenic strain (BN-Lx) into the genomic background of the spontaneously hypertensive rat (SHR) strain. We compared the morphometric, metabolic, and hemodynamic profiles of adult male SHR and SHR.BN16 rats. We also compared in silico the DNA sequences for the differential segment in the BN-Lx and SHR parental strains. SHR.BN16 congenic rats had significantly lower weight, decreased concentrations of total triglycerides and cholesterol, and improved glucose tolerance compared with SHR rats. The concentrations of insulin, free fatty acids, and adiponectin were comparable between the two strains. SHR.BN16 rats had significantly lower systolic (18–28 mmHg difference) and diastolic (10–15 mmHg difference) blood pressure throughout the experiment (repeated-measures ANOVA, P < 0.001). The differential segment spans approximately 22 Mb of the telomeric part of the short arm of RNO16. The in silico analyses revealed over 1200 DNA variants between the BN-Lx and SHR genomes in the SHR.BN16 differential segment, 44 of which lead to missense mutations, and only eight of which (in Asb14, Il17rd, Itih1, Syt15, Ercc6, RGD1564958, Tmem161a, and Gatad2a genes) are predicted to be damaging to the protein product. Furthermore, a number of genes within the RNO16 differential segment associated with metabolic syndrome components in human studies showed polymorphisms between SHR and BN-Lx (including Lpl, Nrg3, Pbx4, Cilp2, and Stab1). Our novel congenic rat model demonstrates that a limited genomic region on RNO16 in the SHR significantly affects many of the features of metabolic syndrome

  19. Isolation of a Genomic Region Affecting Most Components of Metabolic Syndrome in a Chromosome-16 Congenic Rat Model.

    PubMed

    Šedová, Lucie; Pravenec, Michal; Křenová, Drahomíra; Kazdová, Ludmila; Zídek, Václav; Krupková, Michaela; Liška, František; Křen, Vladimír; Šeda, Ondřej

    2016-01-01

    Metabolic syndrome is a highly prevalent human disease with substantial genomic and environmental components. Previous studies indicate the presence of significant genetic determinants of several features of metabolic syndrome on rat chromosome 16 (RNO16) and the syntenic regions of human genome. We derived the SHR.BN16 congenic strain by introgression of a limited RNO16 region from the Brown Norway congenic strain (BN-Lx) into the genomic background of the spontaneously hypertensive rat (SHR) strain. We compared the morphometric, metabolic, and hemodynamic profiles of adult male SHR and SHR.BN16 rats. We also compared in silico the DNA sequences for the differential segment in the BN-Lx and SHR parental strains. SHR.BN16 congenic rats had significantly lower weight, decreased concentrations of total triglycerides and cholesterol, and improved glucose tolerance compared with SHR rats. The concentrations of insulin, free fatty acids, and adiponectin were comparable between the two strains. SHR.BN16 rats had significantly lower systolic (18-28 mmHg difference) and diastolic (10-15 mmHg difference) blood pressure throughout the experiment (repeated-measures ANOVA, P < 0.001). The differential segment spans approximately 22 Mb of the telomeric part of the short arm of RNO16. The in silico analyses revealed over 1200 DNA variants between the BN-Lx and SHR genomes in the SHR.BN16 differential segment, 44 of which lead to missense mutations, and only eight of which (in Asb14, Il17rd, Itih1, Syt15, Ercc6, RGD1564958, Tmem161a, and Gatad2a genes) are predicted to be damaging to the protein product. Furthermore, a number of genes within the RNO16 differential segment associated with metabolic syndrome components in human studies showed polymorphisms between SHR and BN-Lx (including Lpl, Nrg3, Pbx4, Cilp2, and Stab1). Our novel congenic rat model demonstrates that a limited genomic region on RNO16 in the SHR significantly affects many of the features of metabolic syndrome.

  20. Variability in Ozone in the Tropical Tropopause Region from the 1998-2000 SHADOZ (Southern Hemisphere ADditional OZonesondes) Data

    NASA Technical Reports Server (NTRS)

    Thompson, Anne M.; Witte, J. C.; Oltmans, S. J.; Schmidlin, F. J.

    2002-01-01

    The first view of stratospheric and tropospheric ozone variability in the southern hemisphere tropics is provided by a 3-year, 10-site record of ozone soundings from the Southern Hemisphere ADditional OZonesondes (SHADOZ) network. Observations covering 1998-2000 were made over Ascension Island; Nairobi, Kenya; Irene, South Africa; Reunion Island; Watukosek, Java; Fiji; Tahiti; American Samoa; San Cristobal, Galapagos; Natal, Brazil. Taking the TTL (tropical tropopause layer) as the region between 12 and 17 km, we examine ozone variability in this region on a week-to-week and seasonal basis. The TTL layer is lower in September-October-November than in March-April-May, when ozone is a minimum at most SHADOZ stations. A zonal wave-one pattern is apparent in column-integrated TTL ozone because ozone mixing ratios are greater over the Atlantic and adjacent continents than over the Pacific and eastern Indian Ocean. The wave-one persists all year with varying magnitude and appears to be due to general circulation - with subsidence over the Atlantic and frequent deep convection over the Pacific and Indian Ocean. The variability of deep convection - most prominent at Java, Fiji, Samoa and Natal - is explored in time-vs-altitude ozone curtains. Stratospheric incursions into the troposphere are most prominent in soundings at Irene and Reunion Island.

  1. Variability in Ozone in the Tropical Tropopause Region from the 1998-2000 SHADOZ (Southern Hemisphere ADditional OZonesondes) Data

    NASA Astrophysics Data System (ADS)

    Thompson, A. M.; Witte, J. C.; Oltmans, S. J.; Schmidlin, F. J.

    2002-05-01

    The first view of stratospheric and tropospheric ozone variability in the southern hemisphere tropics is provided by a 3-year, 10-site record of ozone soundings from the Southern Hemisphere ADditional OZonesondes (SHADOZ) network: (http://code916.gsfc.nasa.gov/Data_services/shadoz). Observations covering 1998-2000 were made over Ascension Island; Nairobi, Kenya; Irene, South Africa; Réunion Island; Watukosek, Java; Fiji; Tahiti; American Samoa; San Cristóbal, Galapagos; Natal, Brazil. Taking the TTL (tropical tropopause layer) as the region between 12 and 17 km, we examine ozone variability in this region on a week-to-week and seasonal basis. The TTL layer is lower in September-October-November than in March-April-May, when ozone is a minimum at most SHADOZ stations. A zonal wave-one pattern is apparent in column-integrated TTL ozone because ozone mixing ratios are greater over the Atlantic and adjacent continents than over the Pacific and eastern Indian Ocean. The wave-one persists all year with varying magnitude and appears to be due to general circulation - with subsidence over the Atlantic and frequent deep convection over the Pacific and Indian Ocean. The variability of deep convection - most prominent at Java, Fiji, Samoa and Natal - is explored in time-vs-altitude ozone curtains. Stratospheric incursions into the troposphere are most prominent in soundings at Irene and Réunion Island.

  2. Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes

    PubMed Central

    2011-01-01

    Background BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. Results This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Conclusions Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed. PMID:21794110

  3. Temperature and CO(2) additively regulate physiology, morphology and genomic responses of larval sea urchins, Strongylocentrotus purpuratus.

    PubMed

    Padilla-Gamiño, Jacqueline L; Kelly, Morgan W; Evans, Tyler G; Hofmann, Gretchen E

    2013-05-22

    Ocean warming and ocean acidification, both consequences of anthropogenic production of CO2, will combine to influence the physiological performance of many species in the marine environment. In this study, we used an integrative approach to forecast the impact of future ocean conditions on larval purple sea urchins (Strongylocentrotus purpuratus) from the northeast Pacific Ocean. In laboratory experiments that simulated ocean warming and ocean acidification, we examined larval development, skeletal growth, metabolism and patterns of gene expression using an orthogonal comparison of two temperature (13°C and 18°C) and pCO2 (400 and 1100 μatm) conditions. Simultaneous exposure to increased temperature and pCO2 significantly reduced larval metabolism and triggered a widespread downregulation of histone encoding genes. pCO2 but not temperature impaired skeletal growth and reduced the expression of a major spicule matrix protein, suggesting that skeletal growth will not be further inhibited by ocean warming. Importantly, shifts in skeletal growth were not associated with developmental delay. Collectively, our results indicate that global change variables will have additive effects that exceed thresholds for optimized physiological performance in this keystone marine species.

  4. Temperature and CO2 additively regulate physiology, morphology and genomic responses of larval sea urchins, Strongylocentrotus purpuratus

    PubMed Central

    Padilla-Gamiño, Jacqueline L.; Kelly, Morgan W.; Evans, Tyler G.; Hofmann, Gretchen E.

    2013-01-01

    Ocean warming and ocean acidification, both consequences of anthropogenic production of CO2, will combine to influence the physiological performance of many species in the marine environment. In this study, we used an integrative approach to forecast the impact of future ocean conditions on larval purple sea urchins (Strongylocentrotus purpuratus) from the northeast Pacific Ocean. In laboratory experiments that simulated ocean warming and ocean acidification, we examined larval development, skeletal growth, metabolism and patterns of gene expression using an orthogonal comparison of two temperature (13°C and 18°C) and pCO2 (400 and 1100 μatm) conditions. Simultaneous exposure to increased temperature and pCO2 significantly reduced larval metabolism and triggered a widespread downregulation of histone encoding genes. pCO2 but not temperature impaired skeletal growth and reduced the expression of a major spicule matrix protein, suggesting that skeletal growth will not be further inhibited by ocean warming. Importantly, shifts in skeletal growth were not associated with developmental delay. Collectively, our results indicate that global change variables will have additive effects that exceed thresholds for optimized physiological performance in this keystone marine species. PMID:23536595

  5. Filarial and Wolbachia genomics.

    PubMed

    Scott, A L; Ghedin, E; Nutman, T B; McReynolds, L A; Poole, C B; Slatko, B E; Foster, J M

    2012-01-01

    Filarial nematode parasites, the causative agents for a spectrum of acute and chronic diseases including lymphatic filariasis and river blindness, threaten the well-being and livelihood of hundreds of millions of people in the developing regions of the world. The 2007 publication on a draft assembly of the 95-Mb genome of the human filarial parasite Brugia malayi- representing the first helminth parasite genome to be sequenced - has been followed in rapid succession by projects that have resulted in the genome sequencing of six additional filarial species, seven nonfilarial nematode parasites of animals and nearly 30 plant parasitic and free-living species. Parallel to the genomic sequencing, transcriptomic and proteomic projects have facilitated genome annotation, expanded our understanding of stage-associated gene expression and provided a first look at the role of epigenetic regulation of filarial genomes through microRNAs. The expansion in filarial genomics will also provide a significant enrichment in our knowledge of the diversity and variability in the genomes of the endosymbiotic bacterium Wolbachia leading to a better understanding of the genetic principles that govern filarial-Wolbachia mutualism. The goal here is to provide an overview of the trends and advances in filarial and Wolbachia genomics. PMID:22098559

  6. Further delineation of chromosomal consensus regions in primary mediastinal B-cell lymphomas: an analysis of 37 tumor samples using high-resolution genomic profiling (array-CGH).

    PubMed

    Wessendorf, S; Barth, T F E; Viardot, A; Mueller, A; Kestler, H A; Kohlhammer, H; Lichter, P; Bentz, M; Döhner, H; Möller, P; Schwaenen, C

    2007-12-01

    Primary mediastinal B-cell lymphoma (PMBL) is an aggressive extranodal B-cell non-Hodgkin's lymphoma with specific clinical, histopathological and genomic features. To characterize further the genotype of PMBL, we analyzed 37 tumor samples and PMBL cell lines Med-B1 and Karpas1106P using array-based comparative genomic hybridization (matrix- or array-CGH) to a 2.8k genomic microarray. Due to a higher genomic resolution, we identified altered chromosomal regions in much higher frequencies compared with standard CGH: for example, +9p24 (68%), +2p15 (51%), +7q22 (32%), +9q34 (32%), +11q23 (18%), +12q (30%) and +18q21 (24%). Moreover, previously unknown small interstitial chromosomal low copy number alterations (for example, -6p21, -11q13.3) and a total of 19 DNA amplifications were identified by array-CGH. For 17 chromosomal localizations (10 gains and 7 losses), which were altered in more than 10% of the analyzed cases, we delineated minimal consensus regions based on genomic base pair positions. These regions and selected immunohistochemistries point to candidate genes that are discussed in the context of NF-kappaB transcription activation, human leukocyte antigen class I/II defects, impaired apoptosis and Janus kinase/signal transducer and activator of transcription (JAK/STAT) activation. Our data confirm the genomic uniqueness of this tumor and provide physically mapped genomic regions of interest for focused candidate gene analysis. PMID:17728785

  7. Multiple recent horizontal transfers of a large genomic region in cheese making fungi

    PubMed Central

    Cheeseman, Kevin; Ropars, Jeanne; Renault, Pierre; Dupont, Joëlle; Gouzy, Jérôme; Branca, Antoine; Abraham, Anne-Laure; Ceppi, Maurizio; Conseiller, Emmanuel; Debuchy, Robert; Malagnac, Fabienne; Goarin, Anne; Silar, Philippe; Lacoste, Sandrine; Sallet, Erika; Bensimon, Aaron; Giraud, Tatiana; Brygoo, Yves

    2014-01-01

    While the extent and impact of horizontal transfers in prokaryotes are widely acknowledged, their importance to the eukaryotic kingdom is unclear and thought by many to be anecdotal. Here we report multiple recent transfers of a huge genomic island between Penicillium spp. found in the food environment. Sequencing of the two leading filamentous fungi used in cheese making, P. roqueforti and P. camemberti, and comparison with the penicillin producer P. rubens reveals a 575 kb long genomic island in P. roqueforti—called Wallaby—present as identical fragments at non-homologous loci in P. camemberti and P. rubens. Wallaby is detected in Penicillium collections exclusively in strains from food environments. Wallaby encompasses about 250 predicted genes, some of which are probably involved in competition with microorganisms. The occurrence of multiple recent eukaryotic transfers in the food environment provides strong evidence for the importance of this understudied and probably underestimated phenomenon in eukaryotes. PMID:24407037

  8. Multiple recent horizontal transfers of a large genomic region in cheese making fungi.

    PubMed

    Cheeseman, Kevin; Ropars, Jeanne; Renault, Pierre; Dupont, Joëlle; Gouzy, Jérôme; Branca, Antoine; Abraham, Anne-Laure; Ceppi, Maurizio; Conseiller, Emmanuel; Debuchy, Robert; Malagnac, Fabienne; Goarin, Anne; Silar, Philippe; Lacoste, Sandrine; Sallet, Erika; Bensimon, Aaron; Giraud, Tatiana; Brygoo, Yves

    2014-01-01

    While the extent and impact of horizontal transfers in prokaryotes are widely acknowledged, their importance to the eukaryotic kingdom is unclear and thought by many to be anecdotal. Here we report multiple recent transfers of a huge genomic island between Penicillium spp. found in the food environment. Sequencing of the two leading filamentous fungi used in cheese making, P. roqueforti and P. camemberti, and comparison with the penicillin producer P. rubens reveals a 575 kb long genomic island in P. roqueforti--called Wallaby--present as identical fragments at non-homologous loci in P. camemberti and P. rubens. Wallaby is detected in Penicillium collections exclusively in strains from food environments. Wallaby encompasses about 250 predicted genes, some of which are probably involved in competition with microorganisms. The occurrence of multiple recent eukaryotic transfers in the food environment provides strong evidence for the importance of this understudied and probably underestimated phenomenon in eukaryotes.

  9. Genome Sequences of 11 Brucella abortus Isolates from Persistently Infected Italian Regions.

    PubMed

    Garofolo, Giuliano; Foster, Jeffrey T; Drees, Kevin; Zilli, Katiuscia; Platone, Ilenia; Ancora, Massimo; Cammà, Cesare; De Massis, Fabrizio; Calistri, Paolo; Di Giannatale, Elisabetta

    2015-01-01

    Bovine brucellosis, typically caused by Brucella abortus, has been eradicated from much of the developed world. However, the disease remains prevalent in southern Italy, persisting as a public and livestock health concern. We report here the whole-genome sequences of 11 isolates from cattle (Bos taurus) and water buffalo (Bubalus bubalis) that are representative of the current genetic diversity of B. abortus lineages circulating in Italy. PMID:26679575

  10. Genome Sequences of 11 Brucella abortus Isolates from Persistently Infected Italian Regions

    PubMed Central

    Foster, Jeffrey T.; Drees, Kevin; Zilli, Katiuscia; Platone, Ilenia; Ancora, Massimo; Cammà, Cesare; De Massis, Fabrizio; Calistri, Paolo; Di Giannatale, Elisabetta

    2015-01-01

    Bovine brucellosis, typically caused by Brucella abortus, has been eradicated from much of the developed world. However, the disease remains prevalent in southern Italy, persisting as a public and livestock health concern. We report here the whole-genome sequences of 11 isolates from cattle (Bos taurus) and water buffalo (Bubalus bubalis) that are representative of the current genetic diversity of B. abortus lineages circulating in Italy. PMID:26679575

  11. Genome Sequences of 11 Brucella abortus Isolates from Persistently Infected Italian Regions.

    PubMed

    Garofolo, Giuliano; Foster, Jeffrey T; Drees, Kevin; Zilli, Katiuscia; Platone, Ilenia; Ancora, Massimo; Cammà, Cesare; De Massis, Fabrizio; Calistri, Paolo; Di Giannatale, Elisabetta

    2015-12-17

    Bovine brucellosis, typically caused by Brucella abortus, has been eradicated from much of the developed world. However, the disease remains prevalent in southern Italy, persisting as a public and livestock health concern. We report here the whole-genome sequences of 11 isolates from cattle (Bos taurus) and water buffalo (Bubalus bubalis) that are representative of the current genetic diversity of B. abortus lineages circulating in Italy.

  12. QTL Mapping in Three Rice Populations Uncovers Major Genomic Regions Associated with African Rice Gall Midge Resistance.

    PubMed

    Yao, Nasser; Lee, Cheng-Ruei; Semagn, Kassa; Sow, Mounirou; Nwilene, Francis; Kolade, Olufisayo; Bocco, Roland; Oyetunji, Olumoye; Mitchell-Olds, Thomas; Ndjiondjop, Marie-Noëlle

    2016-01-01

    African rice gall midge (AfRGM) is one of the most destructive pests of irrigated and lowland African ecologies. This study aimed to identify the quantitative trait loci (QTL) associated with AfRGM pest incidence and resistance in three independent bi-parental rice populations (ITA306xBW348-1, ITA306xTOG7106 and ITA306xTOS14519), and to conduct meta QTL (mQTL) analysis to explore whether any genomic regions are conserved across different genetic backgrounds. Composite interval mapping (CIM) conducted on the three populations independently uncovered a total of 28 QTLs associated with pest incidence (12) and pest severity (16). The number of QTLs per population associated with AfRGM resistance varied from three in the ITA306xBW348-1 population to eight in the ITA306xTOG7106 population. Each QTL individually explained 1.3 to 34.1% of the phenotypic variance. The major genomic region for AfRGM resistance had a LOD score and R2 of 60.0 and 34.1% respectively, and mapped at 111 cM on chromosome 4 (qAfrGM4) in the ITA306xTOS14519 population. The meta-analysis reduced the number of QTLs from 28 to 17 mQTLs, each explaining 1.3 to 24.5% of phenotypic variance, and narrowed the confidence intervals by 2.2 cM. There was only one minor effect mQTL on chromosome 1 that was common in the TOS14519 and TOG7106 genetic backgrounds; all other mQTLs were background specific. We are currently fine-mapping and validating the major effect genomic region on chromosome 4 (qAfRGM4). This is the first report in mapping the genomic regions associated with the AfRGM resistance, and will be highly useful for rice breeders. PMID:27508500

  13. QTL Mapping in Three Rice Populations Uncovers Major Genomic Regions Associated with African Rice Gall Midge Resistance

    PubMed Central

    Semagn, Kassa; Sow, Mounirou; Nwilene, Francis; Kolade, Olufisayo; Bocco, Roland; Oyetunji, Olumoye; Mitchell-Olds, Thomas; Ndjiondjop, Marie-Noëlle

    2016-01-01

    African rice gall midge (AfRGM) is one of the most destructive pests of irrigated and lowland African ecologies. This study aimed to identify the quantitative trait loci (QTL) associated with AfRGM pest incidence and resistance in three independent bi-parental rice populations (ITA306xBW348-1, ITA306xTOG7106 and ITA306xTOS14519), and to conduct meta QTL (mQTL) analysis to explore whether any genomic regions are conserved across different genetic backgrounds. Composite interval mapping (CIM) conducted on the three populations independently uncovered a total of 28 QTLs associated with pest incidence (12) and pest severity (16). The number of QTLs per population associated with AfRGM resistance varied from three in the ITA306xBW348-1 population to eight in the ITA306xTOG7106 population. Each QTL individually explained 1.3 to 34.1% of the phenotypic variance. The major genomic region for AfRGM resistance had a LOD score and R2 of 60.0 and 34.1% respectively, and mapped at 111 cM on chromosome 4 (qAfrGM4) in the ITA306xTOS14519 population. The meta-analysis reduced the number of QTLs from 28 to 17 mQTLs, each explaining 1.3 to 24.5% of phenotypic variance, and narrowed the confidence intervals by 2.2 cM. There was only one minor effect mQTL on chromosome 1 that was common in the TOS14519 and TOG7106 genetic backgrounds; all other mQTLs were background specific. We are currently fine-mapping and validating the major effect genomic region on chromosome 4 (qAfRGM4). This is the first report in mapping the genomic regions associated with the AfRGM resistance, and will be highly useful for rice breeders. PMID:27508500

  14. Genome-based identification of chromosomal regions specific for Salmonella spp.

    PubMed

    Hansen-Wester, Imke; Hensel, Michael

    2002-05-01

    Acquisition of genomic elements by horizontal gene transfer represents an important mechanism in the evolution of bacterial species. Pathogenicity islands are a subset of horizontally acquired elements present in various pathogens. These elements are frequently located adjacent to tRNA genes. We performed a comparative genome analysis of Salmonella enterica serovars Typhi and Typhimurium and Escherichia coli and scanned tRNA loci for the presence of species-specific, horizontally acquired genomic elements. A large number of species-specific elements were identified. Here, we describe the characteristics of four large chromosomal insertions at tRNA genes of Salmonella spp. The tRNA-associated elements harbor various genes previously identified as single virulence genes, indicating that these genes have been acquired with large chromosomal insertions. Southern blot analyses confirmed that the tRNA-associated elements are specific to Salmonella and also indicated a heterogeneous distribution within the salmonellae. Systematic scanning for insertions at tRNA genes thus represents a tool for the identification of novel pathogenicity islands.

  15. AnABlast: a new in silico strategy for the genome-wide search of novel genes and fossil regions

    PubMed Central

    Jimenez, Juan; Duncan, Caia D. S.; Gallardo, María; Mata, Juan; Perez-Pulido, Antonio J.

    2015-01-01

    Genome annotation, assisted by computer programs, is one of the great advances in modern biology. Nevertheless, the in silico identification of small and complex coding sequences is still challenging. We observed that amino acid sequences inferred from coding—but rarely from non-coding—DNA sequences accumulated alignments in low-stringency BLAST searches, suggesting that this alignments accumulation could be used to highlight coding regions in sequenced DNA. To investigate this possibility, we developed a computer program (AnABlast) that generates profiles of accumulated alignments in query amino acid sequences using a low-stringency BLAST strategy. To validate this approach, all six-frame translations of DNA sequences between every two annotated exons of the fission yeast genome were analysed with AnABlast. AnABlast-generated profiles identified three new copies of known genes, and four new genes supported by experimental evidence. New pseudogenes, ancestral carboxyl- and amino-terminal subtractions, complex gene rearrangements, and ancient fragments of mitDNA and of bacterial origin, were also inferred. Thus, this novel in silico approach provides a powerful tool to uncover new genes, as well as fossil-coding sequences, thus providing insight into the evolutionary history of annotated genomes. PMID:26494834

  16. AnABlast: a new in silico strategy for the genome-wide search of novel genes and fossil regions.

    PubMed

    Jimenez, Juan; Duncan, Caia D S; Gallardo, María; Mata, Juan; Perez-Pulido, Antonio J

    2015-12-01

    Genome annotation, assisted by computer programs, is one of the great advances in modern biology. Nevertheless, the in silico identification of small and complex coding sequences is still challenging. We observed that amino acid sequences inferred from coding-but rarely from non-coding-DNA sequences accumulated alignments in low-stringency BLAST searches, suggesting that this alignments accumulation could be used to highlight coding regions in sequenced DNA. To investigate this possibility, we developed a computer program (AnABlast) that generates profiles of accumulated alignments in query amino acid sequences using a low-stringency BLAST strategy. To validate this approach, all six-frame translations of DNA sequences between every two annotated exons of the fission yeast genome were analysed with AnABlast. AnABlast-generated profiles identified three new copies of known genes, and four new genes supported by experimental evidence. New pseudogenes, ancestral carboxyl- and amino-terminal subtractions, complex gene rearrangements, and ancient fragments of mitDNA and of bacterial origin, were also inferred. Thus, this novel in silico approach provides a powerful tool to uncover new genes, as well as fossil-coding sequences, thus providing insight into the evolutionary history of annotated genomes.

  17. Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes).

    PubMed

    Dessimoz, Christophe; Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

    2011-09-01

    Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.

  18. The vertebrate ancestral repertoire of visual opsins, transducin alpha subunits and oxytocin/vasopressin receptors was established by duplication of their shared genomic region in the two rounds of early vertebrate genome duplications

    PubMed Central

    2013-01-01

    Background Vertebrate color vision is dependent on four major color opsin subtypes: RH2 (green opsin), SWS1 (ultraviolet opsin), SWS2 (blue opsin), and LWS (red opsin). Together with the dim-light receptor rhodopsin (RH1), these form the family of vertebrate visual opsins. Vertebrate genomes contain many multi-membered gene families that can largely be explained by the two rounds of whole genome duplication (WGD) in the vertebrate ancestor (2R) followed by a third round in the teleost ancestor (3R). Related chromosome regions resulting from WGD or block duplications are said to form a paralogon. We describe here a paralogon containing the genes for visual opsins, the G-protein alpha subunit families for transducin (GNAT) and adenylyl cyclase inhibition (GNAI), the oxytocin and vasopressin receptors (OT/VP-R), and the L-type voltage-gated calcium channels (CACNA1-L). Results Sequence-based phylogenies and analyses of conserved synteny show that the above-mentioned gene families, and many neighboring gene families, expanded in the early vertebrate WGDs. This allows us to deduce the following evolutionary scenario: The vertebrate ancestor had a chromosome containing the genes for two visual opsins, one GNAT, one GNAI, two OT/VP-Rs and one CACNA1-L gene. This chromosome was quadrupled in 2R. Subsequent gene losses resulted in a set of five visual opsin genes, three GNAT and GNAI genes, six OT/VP-R genes and four CACNA1-L genes. These regions were duplicated again in 3R resulting in additional teleost genes for some of the families. Major chromosomal rearrangements have taken place in the teleost genomes. By comparison with the corresponding chromosomal regions in the spotted gar, which diverged prior to 3R, we could time these rearrangements to post-3R. Conclusions We present an extensive analysis of the paralogon housing the visual opsin, GNAT and GNAI, OT/VP-R, and CACNA1-L gene families. The combined data imply that the early vertebrate WGD events contributed to the

  19. A novel rearrangement in the mitochondrial genome of tongue sole, Cynoglossus semilaevis: control region translocation and a tRNA gene inversion.

    PubMed

    Kong, Xiaoyu; Dong, Xiaoli; Zhang, Yanchun; Shi, Wei; Wang, Zhongming; Yu, Ziniu

    2009-12-01

    The organization of fish mitochondrial genomes (mitogenomes) is quite conserved, usually with the heavy strand encoding 12 of 13 protein-coding genes and 14 of 22 tRNA genes, and the light strand encoding ND6 and the remaining 8 tRNA genes. Currently, there are only a few reports on gene reorganization of fish mitogenomes, with only two types of rearrangements (shuffling and translocation) observed. No gene inversion has been detected in approximately 420 complete fish mitogenomes available so far. Here we report a novel rearrangement in the mitogenome of Cynoglossus semilaevis (Cynoglossinae, Cynoglossidae, Pleuronectiformes). The genome is 16,371 bp in length and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes, and 2 main noncoding regions, the putative control region and the light-strand replication origin. A striking finding of this study is that the tRNA(Gln) gene is translocated from the light to the heavy strand (Q inversion). This is accompanied by shuffling of the tRNA(Ile) gene and long-range translocation of the putative control region downstream to a site between ND1 and the tRNA(Gln) gene. The remaining gene order is identical to that of typical fish mitogenomes. Additionally, unique characters of this mitogenome, including a high A+T content and length variations of 8 protein-coding genes, were found through comparison of the mitogenome sequence with those from other flatfishes. All the features detected and their relationships with the rearrangements, as well as a possible rearrangement pathway, are discussed. These data provide interesting information for better understanding the molecular mechanisms of gene reorganization in fish mitogenomes.

  20. Genomic analysis of a 1 Mb region near the telomere of Hessian fly chromosome X2 and avirulence gene vH13

    PubMed Central

    Lobo, Neil F; Behura, Susanta K; Aggarwal, Rajat; Chen, Ming-Shun; Collins, Frank H; Stuart, Jeff J

    2006-01-01

    Background To have an insight into the Mayetiola destructor (Hessian fly) genome, we performed an in silico comparative genomic analysis utilizing genetic mapping, genomic sequence and EST sequence data along with data available from public databases. Results Chromosome walking and FISH were utilized to identify a contig of 50 BAC clones near the telomere of the short arm of Hessian fly chromosome X2 and near the avirulence gene vH13. These clones enabled us to correlate physical and genetic distance in this region of the Hessian fly genome. Sequence data from these BAC ends encompassing a 760 kb region, and a fully sequenced and assembled 42.6 kb BAC clone, was utilized to perform a comparative genomic study. In silico gene prediction combined with BLAST analyses was used to determine putative orthology to the sequenced dipteran genomes of the fruit fly, Drosophila melanogaster, and the malaria mosquito, Anopheles gambiae, and to infer evolutionary relationships. Conclusion This initial effort enables us to advance our understanding of the structure, composition and evolution of the genome of this important agricultural pest and is an invaluable tool for a whole genome sequencing effort. PMID:16412254

  1. Segment-Wise Genome-Wide Association Analysis Identifies a Candidate Region Associated with Schizophrenia in Three Independent Samples

    PubMed Central

    Rietschel, Marcella; Mattheisen, Manuel; Breuer, René; Schulze, Thomas G.; Nöthen, Markus M.; Levinson, Douglas; Shi, Jianxin; Gejman, Pablo V.; Cichon, Sven; Ophoff, Roel A.

    2012-01-01

    Recent studies suggest that variation in complex disorders (e.g., schizophrenia) is explained by a large number of genetic variants with small effect size (Odds Ratio∼1.05–1.1). The statistical power to detect these genetic variants in Genome Wide Association (GWA) studies with large numbers of cases and controls (∼15,000) is still low. As it will be difficult to further increase sample size, we decided to explore an alternative method for analyzing GWA data in a study of schizophrenia, dramatically reducing the number of statistical tests. The underlying hypothesis was that at least some of the genetic variants related to a common outcome are collocated in segments of chromosomes at a wider scale than single genes. Our approach was therefore to study the association between relatively large segments of DNA and disease status. An association test was performed for each SNP and the number of nominally significant tests in a segment was counted. We then performed a permutation-based binomial test to determine whether this region contained significantly more nominally significant SNPs than expected under the null hypothesis of no association, taking linkage into account. Genome Wide Association data of three independent schizophrenia case/control cohorts with European ancestry (Dutch, German, and US) using segments of DNA with variable length (2 to 32 Mbp) was analyzed. Using this approach we identified a region at chromosome 5q23.3-q31.3 (128–160 Mbp) that was significantly enriched with nominally associated SNPs in three independent case-control samples. We conclude that considering relatively wide segments of chromosomes may reveal reliable relationships between the genome and schizophrenia, suggesting novel methodological possibilities as well as raising theoretical questions. PMID:22723893

  2. Genomic organization of the S core region and the S flanking regions of a class-II S haplotype in Brassica rapa.

    PubMed

    Fukai, E; Fujimoto, R; Nishio, T

    2003-06-01

    The nucleotide sequence of an 86.4-kb region that includes the SP11, SRK, and SLG genes of Brassica rapa S-60 (a class-II S haplotype) was determined. In the sequenced region, 13 putative genes were found besides SP11-60, SRK-60, and SLG-60. Five of these sequences were isolated as cDNAs, five were homologues of known genes, cDNAs, or ORFs, and three are hypothetical ORFs. Based on their nucleotide sequences, however, some of them are thought to be non-functional. Two regions of colinearity between the class-II S-60 and Brassica class-I S haplotypes were identified, i.e., S flanking region 1 which shows partial colinearity of non-genic sequences and S flanking region 2 which shows a high level of colinearity. The observed colinearity made it possible to compare the order of SP-11, SRK, and SLG genes in the S locus between the five sequenced S haplotypes. It emerged that the order of SRK and SLG in class-II S-60 is the reverse of that in the four class-I S haplotypes reported so far, and the order of SP11, SRK and SLG is the opposite of that in the class-I haplotype S-910. The possible gene designated as SAN1 (S locus Anther-expressed Non-coding RNA like-1), which is located in the region between SP11-60 and SRK-60, has features reminiscent of genes for non-coding RNAs (ncRNAs), but no homologous sequences were found in the databases. This sequence is transcribed in anthers but not in stigmas or leaves. These features of the genomic structure of S-60 are discussed with special reference to the characteristics of class-II S haplotypes.

  3. Quality control parameters on a large dataset of regionally dissected human control brains for whole genome expression studies

    PubMed Central

    Trabzuni, Daniah; Ryten, Mina; Walker, Robert; Smith, Colin; Imran, Sabaena; Ramasamy, Adaikalavan; Weale, Michael E; Hardy, John

    2011-01-01

    We are building an open-access database of regional human brain expression designed to allow the genome-wide assessment of genetic variability on expression. Array and RNA sequencing technologies make assessment of genome-wide expression possible. Human brain tissue is a challenging source for this work because it can only be obtained several and variable hours post-mortem and after varying agonal states. These variables alter RNA integrity in a complex manner. In this report, we assess the effect of post-mortem delay, agonal state and age on gene expression, and the utility of pH and RNA integrity number as predictors of gene expression as measured on 1266 Affymetrix Exon Arrays. We assessed the accuracy of the array data using QuantiGene, as an independent non-PCR-based method. These quality control parameters will allow database users to assess data accuracy. We report that within the parameters of this study post-mortem delay, agonal state and age have little impact on array quality, array data are robust to variable RNA integrity, and brain pH has only a small effect on array performance. QuantiGene gave very similar expression profiles as array data. This study is the first step in our initiative to make human, regional brain expression freely available. PMID:21848658

  4. Distinctive mitochondrial genome of Calanoid copepod Calanus sinicus with multiple large non-coding regions and reshuffled gene order: Useful molecular markers for phylogenetic and population studies

    PubMed Central

    2011-01-01

    Background Copepods are highly diverse and abundant, resulting in extensive ecological radiation in marine ecosystems. Calanus sinicus dominates continental shelf waters in the northwest Pacific Ocean and plays an important role in the local ecosystem by linking primary production to higher trophic levels. A lack of effective molecular markers has hindered phylogenetic and population genetic studies concerning copepods. As they are genome-level informative, mitochondrial DNA sequences can be used as markers for population genetic studies and phylogenetic studies. Results The mitochondrial genome of C. sinicus is distinct from other arthropods owing to the concurrence of multiple non-coding regions and a reshuffled gene arrangement. Further particularities in the mitogenome of C. sinicus include low A + T-content, symmetrical nucleotide composition between strands, abbreviated stop codons for several PCGs and extended lengths of the genes atp6 and atp8 relative to other copepods. The monophyletic Copepoda should be placed within the Vericrustacea. The close affinity between Cyclopoida and Poecilostomatoida suggests reassigning the latter as subordinate to the former. Monophyly of Maxillopoda is rejected. Within the alignment of 11 C. sinicus mitogenomes, there are 397 variable sites harbouring three 'hotspot' variable sites and three microsatellite loci. Conclusion The occurrence of the circular subgenomic fragment during laboratory assays suggests that special caution should be taken when sequencing mitogenomes using long PCR. Such a phenomenon may provide additional evidence of mitochondrial DNA recombination, which appears to have been a prerequisite for shaping the present mitochondrial profile of C. sinicus during its evolution. The lack of synapomorphic gene arrangements among copepods has cast doubt on the utility of gene order as a useful molecular marker for deep phylogenetic analysis. However, mitochondrial genomic sequences have been valuable markers for

  5. Apollo 16 regolith breccias and soils - Recorders of exotic component addition to the Descartes region of the moon

    NASA Technical Reports Server (NTRS)

    Simon, S. B.; Papike, J. J.; Laul, J. C.; Hughes, S. S.; Schmitt, R. A.

    1988-01-01

    Using the subdivision of Apollo 16 regolith breccias into ancient (about 4 Gyr) and younger samples (McKay et al., 1986), with the present-day soils as a third sample, a petrologic and chemical determination of regolith evolution and exotic component addition at the A-16 site was performed. The modal petrologies and mineral and chemical compositions of the regolith breccias in the region are presented. It is shown that the early regolith was composed of fragments of plutonic rocks, impact melt rocks, and minerals and impact glasses. It is found that KREEP lithologies and impact melts formed early in lunar history. The mare components, mainly orange high-TiO2 glass and green low-TiO2 glass, were added to the site after formation of the ancient breccias and prior to the formation of young breccias. The major change in the regolith since the formation of the young breccias is an increase in maturity represented by the formation of fused soil particles with prolonged exposure to micrometeorite impacts.

  6. Additional stratifications in the equatorial F region at dawn and dusk during geomagnetic storms: Role of electrodynamics

    NASA Astrophysics Data System (ADS)

    Sreeja, V.; Balan, N.; Ravindran, Sudha; Pant, Tarun Kumar; Sridharan, R.; Bailey, G. J.

    2009-08-01

    The role of electrodynamics in producing additional stratifications in the equatorial F region (F 3 layer) at dawn and dusk during geomagnetic storms is discussed. Two cases of F 3 layer at dawn (0600-0730 LT on 5 October 2000 and 8 December 2000) and one case of F 3 layer at dusk (1600-1730 LT on 5 October 2000) are observed, for the first time, by the digital ionosonde at the equatorial station Trivandrum (8.5°N 77°E dip ˜ 0.5°N) in India. The unusual F 3 layers occurred during the geomagnetic storms and are associated with southward turning of interplanetary magnetic field B z , suggesting that eastward prompt penetration electric field could be the main cause of the F 3 layers. The dawn F 3 layer on 5 October is modeled using the Sheffield University Plasmasphere-Ionosphere Model by using the E × B drift estimated from the real height variation of the ionospheric peak during the morning period. The model qualitatively reproduces the dawn F 3 layer. While the existing F 2 layer rapidly drifts upward and forms the F 3 layer and topside ledge, a new layer forming at lower heights develops into the normal F 2 layer.

  7. Whole Genome Comparisons Suggest Random Distribution of Mycobacterium ulcerans Genotypes in a Buruli Ulcer Endemic Region of Ghana

    PubMed Central

    Ablordey, Anthony S.; Vandelannoote, Koen; Frimpong, Isaac A.; Ahortor, Evans K.; Amissah, Nana Ama; Eddyani, Miriam; Durnez, Lies; Portaels, Françoise; de Jong, Bouke C.; Leirs, Herwig; Porter, Jessica L.; Mangas, Kirstie M.; Lam, Margaret M. C.; Buultjens, Andrew; Seemann, Torsten; Tobias, Nicholas J.; Stinear, Timothy P.

    2015-01-01

    Efforts to control the spread of Buruli ulcer – an emerging ulcerative skin infection caused by Mycobacterium ulcerans - have been hampered by our poor understanding of reservoirs and transmission. To help address this issue, we compared whole genomes from 18 clinical M. ulcerans isolates from a 30km2 region within the Asante Akim North District, Ashanti region, Ghana, with 15 other M. ulcerans isolates from elsewhere in Ghana and the surrounding countries of Ivory Coast, Togo, Benin and Nigeria. Contrary to our expectations of finding minor DNA sequence variations among isolates representing a single M. ulcerans circulating genotype, we found instead two distinct genotypes. One genotype was closely related to isolates from neighbouring regions of Amansie West and Densu, consistent with the predicted local endemic clone, but the second genotype (separated by 138 single nucleotide polymorphisms [SNPs] from other Ghanaian strains) most closely matched M. ulcerans from Nigeria, suggesting another introduction of M. ulcerans to Ghana, perhaps from that country. Both the exotic genotype and the local Ghanaian genotype displayed highly restricted intra-strain genetic variation, with less than 50 SNP differences across a 5.2Mbp core genome within each genotype. Interestingly, there was no discernible spatial clustering of genotypes at the local village scale. Interviews revealed no obvious epidemiological links among BU patients who had been infected with identical M. ulcerans genotypes but lived in geographically separate villages. We conclude that M. ulcerans is spread widely across the region, with multiple genotypes present in any one area. These data give us new perspectives on the behaviour of possible reservoirs and subsequent transmission mechanisms of M. ulcerans. These observations also show for the first time that M. ulcerans can be mobilized, introduced to a new area and then spread within a population. Potential reservoirs of M. ulcerans thus might include

  8. Evaluation of Apis mellifera syriaca Levant Region honeybee conservation using Comparative Genome Hybridization

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Apis mellifera syriaca is the native honeybee subspecies of Jordan and much of the Levant Region. It expresses behavioral adaptations to a regional climate with very high temperatures, nectar dearth in summer, attacks of the Oriental wasp and is resistant to Varroa mites. The A. m. syriaca control r...

  9. DNA-guided establishment of nucleosome patterns within coding regions of a eukaryotic genome

    PubMed Central

    Beh, Leslie Y.; Müller, Manuel M.; Muir, Tom W.; Kaplan, Noam; Landweber, Laura F.

    2015-01-01

    A conserved hallmark of eukaryotic chromatin architecture is the distinctive array of well-positioned nucleosomes downstream from transcription start sites (TSS). Recent studies indicate that trans-acting factors establish this stereotypical array. Here, we present the first genome-wide in vitro and in vivo nucleosome maps for the ciliate Tetrahymena thermophila. In contrast with previous studies in yeast, we find that the stereotypical nucleosome array is preserved in the in vitro reconstituted map, which is governed only by the DNA sequence preferences of nucleosomes. Remarkably, this average in vitro pattern arises from the presence of subsets of nucleosomes, rather than the whole array, in individual Tetrahymena genes. Variation in GC content contributes to the positioning of these sequence-directed nucleosomes and affects codon usage and amino acid composition in genes. Given that the AT-rich Tetrahymena genome is intrinsically unfavorable for nucleosome formation, we propose that these “seed” nucleosomes—together with trans-acting factors—may facilitate the establishment of nucleosome arrays within genes in vivo, while minimizing changes to the underlying coding sequences. PMID:26330564

  10. DNA-guided establishment of nucleosome patterns within coding regions of a eukaryotic genome.

    PubMed

    Beh, Leslie Y; Müller, Manuel M; Muir, Tom W; Kaplan, Noam; Landweber, Laura F

    2015-11-01

    A conserved hallmark of eukaryotic chromatin architecture is the distinctive array of well-positioned nucleosomes downstream from transcription start sites (TSS). Recent studies indicate that trans-acting factors establish this stereotypical array. Here, we present the first genome-wide in vitro and in vivo nucleosome maps for the ciliate Tetrahymena thermophila. In contrast with previous studies in yeast, we find that the stereotypical nucleosome array is preserved in the in vitro reconstituted map, which is governed only by the DNA sequence preferences of nucleosomes. Remarkably, this average in vitro pattern arises from the presence of subsets of nucleosomes, rather than the whole array, in individual Tetrahymena genes. Variation in GC content contributes to the positioning of these sequence-directed nucleosomes and affects codon usage and amino acid composition in genes. Given that the AT-rich Tetrahymena genome is intrinsically unfavorable for nucleosome formation, we propose that these "seed" nucleosomes--together with trans-acting factors--may facilitate the establishment of nucleosome arrays within genes in vivo, while minimizing changes to the underlying coding sequences.

  11. Genomic organization of the S locus: Identification and characterization of genes in SLG/SRK region of S(9) haplotype of Brassica campestris (syn. rapa).

    PubMed

    Suzuki, G; Kai, N; Hirose, T; Fukui, K; Nishio, T; Takayama, S; Isogai, A; Watanabe, M; Hinata, K

    1999-09-01

    In Brassica, two self-incompatibility genes, encoding SLG (S locus glycoprotein) and SRK (S-receptor kinase), are located at the S locus and expressed in the stigma. Recent molecular analysis has revealed that the S locus is highly polymorphic and contains several genes, i.e., SLG, SRK, the as-yet-unidentified pollen S gene(s), and other linked genes. In the present study, we searched for expressed sequences in a 76-kb SLG/SRK region of the S(9) haplotype of Brassica campestris (syn. rapa) and identified 10 genes in addition to the four previously identified (SLG(9), SRK(9), SAE1, and SLL2) in this haplotype. This gene density (1 gene/5.4 kb) suggests that the S locus is embedded in a gene-rich region of the genome. The average G + C content in this region is 32.6%. An En/Spm-type transposon-like element was found downstream of SLG(9). Among the genes we identified that had not previously been found to be linked to the S locus were genes encoding a small cysteine-rich protein, a J-domain protein, and an antisilencing protein (ASF1) homologue. The small cysteine-rich protein was similar to a pollen coat protein, named PCP-A1, which had previously been shown to bind SLG.

  12. Genomic organization of the S locus: Identification and characterization of genes in SLG/SRK region of S(9) haplotype of Brassica campestris (syn. rapa).

    PubMed Central

    Suzuki, G; Kai, N; Hirose, T; Fukui, K; Nishio, T; Takayama, S; Isogai, A; Watanabe, M; Hinata, K

    1999-01-01

    In Brassica, two self-incompatibility genes, encoding SLG (S locus glycoprotein) and SRK (S-receptor kinase), are located at the S locus and expressed in the stigma. Recent molecular analysis has revealed that the S locus is highly polymorphic and contains several genes, i.e., SLG, SRK, the as-yet-unidentified pollen S gene(s), and other linked genes. In the present study, we searched for expressed sequences in a 76-kb SLG/SRK region of the S(9) haplotype of Brassica campestris (syn. rapa) and identified 10 genes in addition to the four previously identified (SLG(9), SRK(9), SAE1, and SLL2) in this haplotype. This gene density (1 gene/5.4 kb) suggests that the S locus is embedded in a gene-rich region of the genome. The average G + C content in this region is 32.6%. An En/Spm-type transposon-like element was found downstream of SLG(9). Among the genes we identified that had not previously been found to be linked to the S locus were genes encoding a small cysteine-rich protein, a J-domain protein, and an antisilencing protein (ASF1) homologue. The small cysteine-rich protein was similar to a pollen coat protein, named PCP-A1, which had previously been shown to bind SLG. PMID:10471721

  13. Silencing of neurotropic flavivirus replication in the central nervous system by combining multiple microRNA target insertions in two distinct viral genome regions

    PubMed Central

    Teterina, Natalya L.; Liu, Guangping; Maximova, Olga A.; Pletnev, Alexander G.

    2014-01-01

    In recent years, microRNA-targeting has become an effective strategy for selective control of tissue-tropism and pathogenesis of both DNA and RNA viruses. Here, using a neurotropic flavivirus as a model, we demonstrate that simultaneous miRNA targeting of the viral genome in the open reading frame and 3′-noncoding regions for brain-expressed miRNAs had an additive effect and produced a more potent attenuation of the virus compared to separate targeting of those regions. Multiple miRNA co-targeting of these two distantly located regions completely abolished the virus neurotropism as no viral replication was detected in the developing brain of neonatal mice. Furthermore, no viral antigens were detected in neurons, and neuronal integrity in the brain of mice was well preserved. This miRNA co-targeting approach can be adapted for other viruses in order to minimize their replication in a cell- or tissue-type specific manner, but most importantly, to prevent virus escape from miRNA-mediated silencing. PMID:24889244

  14. The Genomic Ancestry of Individuals from Different Geographical Regions of Brazil Is More Uniform Than Expected

    PubMed Central

    Pena, Sérgio D. J.; Di Pietro, Giuliano; Fuchshuber-Moraes, Mateus; Genro, Julia Pasqualini; Hutz, Mara H.; Kehdy, Fernanda de Souza Gomes; Kohlrausch, Fabiana; Magno, Luiz Alexandre Viana; Montenegro, Raquel Carvalho; Moraes, Manoel Odorico; de Moraes, Maria Elisabete Amaral; de Moraes, Milene Raiol; Ojopi, Élida B.; Perini, Jamila A.; Racciopi, Clarice; Ribeiro-dos-Santos, Ândrea Kely Campos; Rios-Santos, Fabrício; Romano-Silva, Marco A.; Sortica, Vinicius A.; Suarez-Kurtz, Guilherme

    2011-01-01

    Based on pre-DNA racial/color methodology, clinical and pharmacological trials have traditionally considered the different geographical regions of Brazil as being very heterogeneous. We wished to ascertain how such diversity of regional color categories correlated with ancestry. Using a panel of 40 validated ancestry-informative insertion-deletion DNA polymorphisms we estimated individually the European, African and Amerindian ancestry components of 934 self-categorized White, Brown or Black Brazilians from the four most populous regions of the Country. We unraveled great ancestral diversity between and within the different regions. Especially, color categories in the northern part of Brazil diverged significantly in their ancestry proportions from their counterparts in the southern part of the Country, indicating that diverse regional semantics were being used in the self-classification as White, Brown or Black. To circumvent these regional subjective differences in color perception, we estimated the general ancestry proportions of each of the four regions in a form independent of color considerations. For that, we multiplied the proportions of a given ancestry in a given color category by the official census information about the proportion of that color category in the specific region, to arrive at a “total ancestry” estimate. Once such a calculation was performed, there emerged a much higher level of uniformity than previously expected. In all regions studied, the European ancestry was predominant, with proportions ranging from 60.6% in the Northeast to 77.7% in the South. We propose that the immigration of six million Europeans to Brazil in the 19th and 20th centuries - a phenomenon described and intended as the “whitening of Brazil” - is in large part responsible for dissipating previous ancestry dissimilarities that reflected region-specific population histories. These findings, of both clinical and sociological importance for Brazil, should also be

  15. Familiarity to a Feed Additive Modulates Its Effects on Brain Responses in Reward and Memory Regions in the Pig Model.

    PubMed

    Val-Laillet, David; Meurice, Paul; Clouard, Caroline

    2016-01-01

    Brain responses to feed flavors with or without a feed additive (FA) were investigated in piglets familiarized or not with this FA. Sixteen piglets were allocated to 2 dietary treatments from weaning until d 37: the naive group (NAI) received a standard control feed and the familiarized group (FAM) received the same feed added with a FA mainly made of orange extracts. Animals were subjected to a feed transition at d 16 post-weaning, and to 2-choice feeding tests at d 16 and d 23. Production traits of the piglets were assessed up to d 28 post-weaning. From d 26 onwards, animals underwent 2 brain imaging sessions (positron emission tomography of 18FDG) under anesthesia to investigate the brain activity triggered by the exposure to the flavors of the feed with (FA) or without (C) the FA. Images were analyzed with SPM8 and a region of interest (ROI)-based small volume correction (p < 0.05, k ≥ 25 voxels per cluster). The brain ROI were selected upon their role in sensory evaluation, cognition and reward, and included the prefrontal cortex, insular cortex, fusiform gyrus, limbic system and corpus striatum. The FAM animals showed a moderate preference for the novel post-transition FA feed compared to the C feed on d 16, i.e., day of the feed transition (67% of total feed intake). The presence or absence of the FA in the diet from weaning had no impact on body weight, average daily gain, and feed efficiency of the animals over the whole experimental period (p ≥ 0.10). Familiar feed flavors activated the prefrontal cortex. The amygdala, insular cortex, and prepyriform area were only activated in familiarized animals exposed to the FA feed flavor. The perception of FA feed flavor in the familiarized animals activated the dorsal striatum differently than the perception of the C feed flavor in naive animals. Our data demonstrated that the perception of FA in familiarized individuals induced different brain responses in regions involved in reward anticipation and

  16. Familiarity to a Feed Additive Modulates Its Effects on Brain Responses in Reward and Memory Regions in the Pig Model

    PubMed Central

    Val-Laillet, David; Meurice, Paul; Clouard, Caroline

    2016-01-01

    Brain responses to feed flavors with or without a feed additive (FA) were investigated in piglets familiarized or not with this FA. Sixteen piglets were allocated to 2 dietary treatments from weaning until d 37: the naive group (NAI) received a standard control feed and the familiarized group (FAM) received the same feed added with a FA mainly made of orange extracts. Animals were subjected to a feed transition at d 16 post-weaning, and to 2-choice feeding tests at d 16 and d 23. Production traits of the piglets were assessed up to d 28 post-weaning. From d 26 onwards, animals underwent 2 brain imaging sessions (positron emission tomography of 18FDG) under anesthesia to investigate the brain activity triggered by the exposure to the flavors of the feed with (FA) or without (C) the FA. Images were analyzed with SPM8 and a region of interest (ROI)-based small volume correction (p < 0.05, k ≥ 25 voxels per cluster). The brain ROI were selected upon their role in sensory evaluation, cognition and reward, and included the prefrontal cortex, insular cortex, fusiform gyrus, limbic system and corpus striatum. The FAM animals showed a moderate preference for the novel post-transition FA feed compared to the C feed on d 16, i.e., day of the feed transition (67% of total feed intake). The presence or absence of the FA in the diet from weaning had no impact on body weight, average daily gain, and feed efficiency of the animals over the whole experimental period (p ≥ 0.10). Familiar feed flavors activated the prefrontal cortex. The amygdala, insular cortex, and prepyriform area were only activated in familiarized animals exposed to the FA feed flavor. The perception of FA feed flavor in the familiarized animals activated the dorsal striatum differently than the perception of the C feed flavor in naive animals. Our data demonstrated that the perception of FA in familiarized individuals induced different brain responses in regions involved in reward anticipation and

  17. QTL mapping in white spruce: gene maps and genomic regions underlying adaptive traits across pedigrees, years and environments

    PubMed Central

    2011-01-01

    Background The genomic architecture of bud phenology and height growth remains poorly known in most forest trees. In non model species, QTL studies have shown limited application because most often QTL data could not be validated from one experiment to another. The aim of our study was to overcome this limitation by basing QTL detection on the construction of genetic maps highly-enriched in gene markers, and by assessing QTLs across pedigrees, years, and environments. Results Four saturated individual linkage maps representing two unrelated mapping populations of 260 and 500 clonally replicated progeny were assembled from 471 to 570 markers, including from 283 to 451 gene SNPs obtained using a multiplexed genotyping assay. Thence, a composite linkage map was assembled with 836 gene markers. For individual linkage maps, a total of 33 distinct quantitative trait loci (QTLs) were observed for bud flush, 52 for bud set, and 52 for height growth. For the composite map, the corresponding numbers of QTL clusters were 11, 13, and 10. About 20% of QTLs were replicated between the two mapping populations and nearly 50% revealed spatial and/or temporal stability. Three to four occurrences of overlapping QTLs between characters were noted, indicating regions with potential pleiotropic effects. Moreover, some of the genes involved in the QTLs were also underlined by recent genome scans or expression profile studies. Overall, the proportion of phenotypic variance explained by each QTL ranged from 3.0 to 16.4% for bud flush, from 2.7 to 22.2% for bud set, and from 2.5 to 10.5% for height growth. Up to 70% of the total character variance could be accounted for by QTLs for bud flush or bud set, and up to 59% for height growth. Conclusions This study provides a basic understanding of the genomic architecture related to bud flush, bud set, and height growth in a conifer species, and a useful indicator to compare with Angiosperms. It will serve as a basic reference to functional and

  18. Additional copies of CBX2 in the genomes of males of mammals lacking SRY, the Amami spiny rat (Tokudaia osimensis) and the Tokunoshima spiny rat (Tokudaia tokunoshimensis).

    PubMed

    Kuroiwa, Asato; Handa, Sanae; Nishiyama, Chigusa; Chiba, Eriko; Yamada, Fumio; Abe, Shintaro; Matsuda, Yoichi

    2011-07-01

    Tokudaia osimensis (the Amami spiny rat) and Tokudaia tokunoshimensis (the Tokunoshima spiny rat) have a sex chromosome composition of XO/XO, no Y chromosome. The mammalian sex-determining gene, SRY, is also absent in these species, which indicates that these spiny rats exhibit a novel sex-determining mechanism that is independent of SRY. To identify a candidate gene that controls this mechanism, the copy numbers and chromosomal locations of 10 genes with important functions in gonadal differentiation were determined: ATRX, CBX2 (M33), DMRT1, FGF9, NR0B1 (DAX1), NR5A1 (Ad4BP/SF1), RSPO1, SOX9, WNT4, and WT1. Multiple bands were detected for NR0B1 in Southern blot analysis, which suggested the presence of multiple copies of the gene in the genomes of these two species. CBX2 was localized to two loci in both sexes of the two species by fluorescence in situ hybridization mapping: 3q24 and 6p11.2 in T. osimensis and 10q25-q26 and 14q12-q13.1 in T. tokunoshimensis. Quantification of copy numbers in the two species by quantitative real-time PCR indicated that there were two or three more copies of CBX2 per haploid genome in males (T. osimensis, n = 3; T. tokunoshimensis, n = 2) than in females (T. osimensis, n = 4; T. tokunoshimensis, n = 2), whereas NR0B1 was present as a single copy in both. The results suggest that additional copies of CBX2 in males might be involved in a novel sex-determining mechanism in species that lack SRY. PMID:21656076

  19. Association and Haplotype Analyses of Positional Candidate Genes in Five Genomic Regions Linked to Scrotal Hernia in Commercial Pig Lines

    PubMed Central

    Du, Zhi-Qiang; Zhao, Xia; Vukasinovic, Natascha; Rodriguez, Fernanda; Clutter, Archie C.; Rothschild, Max F.

    2009-01-01

    Scrotal hernia in pigs is a complex trait likely affected by genetic and environmental factors. A large-scale association analysis of positional and functional candidate genes was conducted in four previously identified genomic regions linked to hernia susceptibility on Sus scrofa chromosomes 2 and 12, as well as the fifth region around 67 cM on chromosome 2, respectively. In total, 151 out of 416 SNPs discovered were genotyped successfully. Using a family-based analysis we found that four regions surrounding ELF5, KIF18A, COL23A1 on chromosome 2, and NPTX1 on chromosome 12, respectively, may contain the genetic variants important for the development of the scrotal hernia in pigs. These findings were replicated in another case-control dataset. The SNPs around the ELF5 region were in high linkage disequilibrium with each other, and a haplotype containing SNPs from ELF5 and CAT was highly significantly associated with hernia development. Extensive re-sequencing work focused on the KIF18A gene did not detect any further SNPs with extensive association signals. These genes may be involved in the estrogen receptor signaling pathway (KIF18A and NPTX1), the epithelial-mesenchymal transition (ELF5) and the collagen metabolism pathway (COL23A1), which are associated with the important molecular characteristics of hernia pathophysiology. Further investigation on the molecular mechanisms of these genes may provide more molecular clues on hernia development in pigs. PMID:19287495

  20. 3. 6-Mb genomic and YAC physical map of the Down syndrome chromosome region on chromosome 21

    SciTech Connect

    Dufresne-Zacharia, M.C.; Dahmane, N.; Theophile, D.; Orti, R.; Chettouh, Z.; Sinet, P.M.; Delabar, J.M. )

    1994-02-01

    The Down syndrome chromosome region (DCR) on chromosome 21 has been shown to contain a gene(s) important in the pathogenesis of Down syndrome. The authors constructed a long-range restriction map of the D21S55-D21S65 region covering the proximal part of the DCR. Pulsed-field gel electrophoresis of lymphocyte DNA digested with three rare cutting enzymes, NotI, NruI, and Mlu1, was used to establish two physical linkage groups of 5 and 7 markers, respectively, spanning 4.6 Mb on the NotI map. Mapping analysis of 40 YACs allowed the selection of 13 YACs covering 95% of the D21S55-D21S65 region and spanning 3.6 Mb. The restriction maps of these YACs and their positioning on the genomic map allowed 19 markers to be ordered, including 4 NotI linking clones, 9 polymorphic markers, the CBR gene, and the AML1 gene. The distances between markers could also be estimated. This physical map and the location of eight NotI sites between D21S55 and D21S17 should facilitate the isolation of previously unidentified genes in this region. 34 refs., 2 figs., 2 tabs.

  1. Potential virulence determinants in terminal regions of variola smallpox virus genome.

    PubMed

    Massung, R F; Esposito, J J; Liu, L I; Qi, J; Utterback, T R; Knight, J C; Aubin, L; Yuran, T E; Parsons, J M; Loparev, V N

    Smallpox eradication culminated the most successful antimicrobial campaign in medical history. To characterize further the linear double-stranded DNA genome of the aetiological agent of smallpox, we have determined the entire nucleotide sequence of the highly virulent variola major virus, strain Bangladesh-1975 (VAR-BSH; 186,102 base pairs, 33.7% G + C; Genbank accession number, L22579). Here we highlight features of the molecule and focus on a few of the 187 putative proteins that probably contribute to pathogenicity and virus host-range properties. One hundred and fifty proteins were markedly similar to those of vaccinia virus (smallpox vaccine), for which a complete sequence has been reported for strain Copenhagen (VAC-CPN; 191,636 base pairs, 33.3% G + C). The remaining 37 proteins reflected variola-specific sequences or open reading frame divergences for variant proteins, which are often truncated or elongated compared with their vaccinia counterparts.

  2. Mapping codon usage of the translation initiation region in porcine reproductive and respiratory syndrome virus genome

    PubMed Central

    2011-01-01

    Background Porcine reproductive and respitatory syndrome virus (PRRSV) is a recently emerged pathogen and severely affects swine populations worldwide. The replication of PRRSV is tightly controlled by viral gene expression and the codon usage of translation initiation region within each gene could potentially regulate the translation rate. Therefore, a better understanding of the codon usage pattern of the initiation translation region would shed light on the regulation of PRRSV gene expression. Results In this study, the codon usage in the translation initiation region and in the whole coding sequence was compared in PRRSV ORF1a and ORFs2-7. To investigate the potential role of codon usage in affecting the translation initiation rate, we established a codon usage model for PRRSV translation initiation region. We observed that some non-preferential codons are preferentially used in the translation initiation region in particular ORFs. Although some positions vary with codons, they intend to use codons with negative CUB. Furthermore, our model of codon usage showed that the conserved pattern of CUB is not directly consensus with the conserved sequence, but shaped under the translation selection. Conclusions The non-variation pattern with negative CUB in the PRRSV translation initiation region scanned by ribosomes is considered the rate-limiting step in the translation process. PMID:22014033

  3. HAPPY mapping in a plant genome: reconstruction and analysis of a high-resolution physical map of a 1.9 Mbp region of Arabidopsis thaliana chromosome 4.

    PubMed

    Thangavelu, Madan; James, Allan B; Bankier, Alan; Bryan, Glenn J; Dear, Paul H; Waugh, Robbie

    2003-01-01

    HAPPY mapping is an in vitro approach for defining the order and spacing of DNA markers directly on native genomic DNA. This cloning-free technique is based on analysing the segregation of markers amplified from high molecular weight genomic DNA which has been broken randomly and 'segregated' by limiting dilution into subhaploid samples. It is a uniquely versatile tool, allowing for the construction of genome maps with flexible ranges and resolutions. Moreover, it is applicable to plant genomes, for which many of the techniques pioneered in animal genomes are inapplicable or inappropriate. We report here its demonstration in a plant genome by reconstructing the physical map of a 1.9 Mbp region around the FCA locus of Arabidopsis thaliana. The resulting map, spanning around 10% of chromosome 4, is in excellent agreement with the DNA sequence and has a mean marker spacing of 16 kbp. We argue that HAPPY maps of any required resolution can be made immediately and with relatively little effort for most plant species and, furthermore, that such maps can greatly aid the construction of regional or genome-wide physical maps.

  4. The complete mitochondrial genome of the clam Mactra veneriformis (Bivalvia: Mactridae): has a unique non-coding region, missing atp8 and typical tRNA Ser.

    PubMed

    Meng, Xueping; Shen, Xin; Zhao, Nana; Tian, Mei; Liang, Meng; Hao, Jue; Cheng, Hanliang; Yan, Binlun; Dong, Zhiguo; Zhu, Xiaoling

    2013-12-01

    Mactra veneriformis (Bivalvia: Mactridae) is one commonly cultured bivalve species in the western Pacific Ocean. In the current study, the complete mitrochondrial DNA (mtDNA) of the clam M. veneriformis was determined. The M. veneriformis mt genome is 16,854 bp in length and encodes 34 genes on the same strand, including 12 protein-coding genes (PCGs), 2 ribosomal RNA genes and 20 transfer RNA genes. The length of 12 PCGs is 11,358 bp, which accounts for 67.4% in whole mt genome. The proportion is similar to other clams' mt genomes and within those of bivalves mt genomes. Gene order (which is the same as that of RZ C. antiquata) of M. veneriformis mt genome is compared with that of other veneroids. Compared with the typical gene content of animal mt genomes, atp8 and two tRNA(Ser) genes are missing in the mt genome. All non-coding regions are 1978 bp in length, among them the longest one is speculated as the control region, which is located between the tRNA(His) and tRNA(Arg). The secondary largest non-coding region (NCR(664)) between the tRNA(Gln) and tRNA(Thr) in the M. veneriformis mt genome contains one section of tandem repeats (125 nt × 5.2 or 249 nt × 2.6). The tandem repeats account for 97.89% (650/664) of the NCR(664), which is a unique characteristic of the M. veneriformis mt non-coding regions compared with those of other veneroids.

  5. Genome sequence of foot-and-mouth disease virus outside the 3A region is also responsible for virus replication in bovine cells.

    PubMed

    Ma, Xueqing; Li, Pinghua; Sun, Pu; Lu, Zengjun; Bao, Huifang; Bai, Xingwen; Fu, Yuanfang; Cao, Yimei; Li, Dong; Chen, Yingli; Qiao, Zilin; Liu, Zaixin

    2016-07-15

    The deletion of residues 93-102 in non-structure protein 3A of foot-and-mouth disease virus (FMDV) is associated with the inability of FMDV to grow in bovine cells and attenuated virulence in cattle.Whereas, a previously reported FMDV strain O/HKN/21/70 harboring 93-102 deletion in 3A protein grew equally well in bovine and swine cells. This suggests that changes inFMDV genome sequence, in addition to 93-102 deletion in 3A, may also affectthe viral growth phenotype in bovine cellsduring infection and replication.However, it is nuclear that changes in which region (inside or outside of 3A region) influences FMDV growth phenotype in bovine cells.In this study, to determine the region in FMDV genomeaffecting viral growth phenotype in bovine cells, we constructed chimeric FMDVs, rvGZSB-HKN3A and rvHN-HKN3A, by introducing the 3A coding region of O/HKN/21/70 into the context of O/SEA/Mya-98 strain O/GZSB/2011 and O Cathay topotype strain O/HN/CHA/93, respectively, since O/GZSB/2011 containing full-length 3A protein replicated well in bovine and swine cells, and O/HN/CHA/93 harboring 93-102 deletion in 3A protein grew poorly in bovine cells.The chimeric virusesrvGZSB-HKN3A and rvHN-HKN3A displayed growth properties and plaque phenotypes similar to those of the parental virus rvGZSB and rv-HN in BHK-21 and primary fetal porcine kidney (FPK) cells. However, rvHN-HKN3A and rv-HN replicated poorly in primary fetal bovine kidney (FBK) cells with no visible plaques, and rvGZSB-HKN3A exhibited lower growth rate and smaller plaque size phenotypes than those of the parental virus in FBK cells, but similar growth properties and plaque phenotypes to those of the recombinant viruses harboring 93-102 deletion in 3A. These results demonstrate that the difference present in FMDV genome sequence outside the 3A coding region also have influence on FMDV replication ability in bovine cells. PMID:27094491

  6. Reselection of a Genomic Upstream Open Reading Frame in Mouse Hepatitis Coronavirus 5′-Untranslated-Region Mutants

    PubMed Central

    Wu, Hung-Yi; Guan, Bo-Jhih; Su, Yu-Pin; Fan, Yi-Hsin

    2014-01-01

    An AUG-initiated upstream open reading frame (uORF) encoding a potential polypeptide of 3 to 13 amino acids (aa) is found within the 5′ untranslated region (UTR) of >75% of coronavirus genomes based on 38 reference strains. Potential CUG-initiated uORFs are also found in many strains. The AUG-initiated uORF is presumably translated following genomic 5′-end cap-dependent ribosomal scanning, but its function is unknown. Here, in a reverse-genetics study with mouse hepatitis coronavirus, the following were observed. (i) When the uORF AUG-initiating codon was replaced with a UAG stop codon along with a U112A mutation to maintain a uORF-harboring stem-loop 4 structure, an unimpaired virus with wild-type (WT) growth kinetics was recovered. However, reversion was found at all mutated sites within five virus passages. (ii) When the uORF was fused with genomic (main) ORF1 by converting three in-frame stop codons to nonstop codons, a uORF-ORF1 fusion protein was made, and virus replicated at WT levels. However, a frameshifting G insertion at virus passage 7 established a slightly 5′-extended original uORF. (iii) When uAUG-eliminating deletions of 20, 30, or 51 nucleotides (nt) were made within stem-loop 4, viable but debilitated virus was recovered. However, a C80U mutation in the first mutant and an A77G mutation in the second appeared by passage 10, which generated alternate uORFs that correlated with restored WT growth kinetics. In vitro, the uORF-disrupting nondeletion mutants showed enhanced translation of the downstream ORF1 compared with the WT. These results together suggest that the uORF represses ORF1 translation yet plays a beneficial but nonessential role in coronavirus replication in cell culture. PMID:24173235

  7. Isolation of Specific Genomic Regions and Identification of Their Associated Molecules by Engineered DNA-Binding Molecule-Mediated Chromatin Immunoprecipitation (enChIP) Using the CRISPR System and TAL Proteins

    PubMed Central

    Fujii, Hodaka; Fujita, Toshitsugu

    2015-01-01

    Comprehensive understanding of genome functions requires identification of molecules (proteins, RNAs, genomic regions, etc.) bound to specific genomic regions of interest in vivo. To perform biochemical and molecular biological analysis of specific genomic regions, we developed engineered DNA-binding molecule-mediated chromatin immunoprecipitation (enChIP) to purify genomic regions of interest. In enChIP, specific genomic regions are tagged for biochemical purification using engineered DNA-binding molecules, such as transcription activator-like (TAL) proteins and a catalytically inactive form of the clustered regularly interspaced short palindromic repeats (CRISPR) system. enChIP is a comprehensive approach that emphasizes non-biased search using next-generation sequencing (NGS), microarrays, mass spectrometry (MS), and other methods. Moreover, this approach is not restricted to cultured cell lines and can be easily extended to organisms. In this review, we discuss applications of enChIP to elucidating the molecular mechanisms underlying genome functions. PMID:26370991

  8. Isolation of Specific Genomic Regions and Identification of Their Associated Molecules by Engineered DNA-Binding Molecule-Mediated Chromatin Immunoprecipitation (enChIP) Using the CRISPR System and TAL Proteins.

    PubMed

    Fujii, Hodaka; Fujita, Toshitsugu

    2015-09-09

    Comprehensive understanding of genome functions requires identification of molecules (proteins, RNAs, genomic regions, etc.) bound to specific genomic regions of interest in vivo. To perform biochemical and molecular biological analysis of specific genomic regions, we developed engineered DNA-binding molecule-mediated chromatin immunoprecipitation (enChIP) to purify genomic regions of interest. In enChIP, specific genomic regions are tagged for biochemical purification using engineered DNA-binding molecules, such as transcription activator-like (TAL) proteins and a catalytically inactive form of the clustered regularly interspaced short palindromic repeats (CRISPR) system. enChIP is a comprehensive approach that emphasizes non-biased search using next-generation sequencing (NGS), microarrays, mass spectrometry (MS), and other methods. Moreover, this approach is not restricted to cultured cell lines and can be easily extended to organisms. In this review, we discuss applications of enChIP to elucidating the molecular mechanisms underlying genome functions.

  9. Disease-Associated SNPs From non-Coding Regions in Juvenile Idiopathic Arthritis Are Located Within or Adjacent to Functional Genomic Elements of Human Neutrophils and CD4+ T Cells

    PubMed Central

    Jiang, Kaiyu; Zhu, Lisha; Buck, Michael J.; Chen, Yanmin; Carrier, Bradley; Liu, Tao; Jarvis, James N.

    2015-01-01

    Background Juvenile idiopathic arthritis (JIA) is considered a complex trait in which the environment interacts with inherited genes to produce a phenotype that shows broad inter-individual variance. A recently completed genome-wide association study (GWAS) identified 24 regions of genetic risk for JIA, for example. However, as is typical for GWAS, most of the regions of genetic risk for JIA (22 of 24) were in non-coding regions of the genome. The studies reported here were undertaken to identify functional elements (other than genes) that might be located within the regions of genetic risk. Methods We used paired end RNA sequencing to identify non-coding RNAs located within 5 kb of the disease-associated SNPs. In addition, we used chromatin immunoprecipitation-sequencing (ChIP-Seq) to identify epigenetic marks associated with enhancer function (H3K4me1 and H3K27ac) in human neutrophils to determine whether there was enrichment of enhancer-associated histone marks in linkage disequilibrium (LD) blocks that encompassed the 22 GWAS SNPs from the non-coding genome. Results In human neutrophils, we identified H3K4me1 and/or H3K27ac marks in 15 of the 22 regions previously as identified as risk loci for JIA. In CD4+ T cells, 18 regions demonstrate H3K4me1 and/or H3K27ac marks. In addition, we identified non-coding RNA transcripts at the rs4705862 and rs6894249 loci in human neutrophils. Conclusion Much of the genetic risk for JIA lies within or adjacent to regions of neutrophil and CD4+ T cell genomes that carry epigenetic marks associated with enhancer function and/or ncRNA transcripts. These findings are consistent with the hypothesis that JIA is fundamentally a disorder of gene regulation that includes both the innate and adaptive immune system. Elucidating the specific roles of these non-coding elements within leukocyte genomes in JIA pathogenesis will be critical to our understanding disease pathogenesis. PMID:25833190

  10. Pairing of Homologous Regions in the Mouse Genome Is Associated with Transcription but Not Imprinting Status

    PubMed Central

    Krueger, Christel; King, Michelle R.; Krueger, Felix; Branco, Miguel R.; Osborne, Cameron S.; Niakan, Kathy K.; Higgins, Michael J.; Reik, Wolf

    2012-01-01

    Although somatic homologous pairing is common in Drosophila it is not generally observed in mammalian cells. However, a number of regions have recently been shown to come into close proximity with their homologous allele, and it has been proposed that pairing might be involved in the establishment or maintenance of monoallelic expression. Here, we investigate the pairing properties of various imprinted and non-imprinted regions in mouse tissues and ES cells. We find by allele-specific 4C-Seq and DNA FISH that the Kcnq1 imprinted region displays frequent pairing but that this is not dependent on monoallelic expression. We demonstrate that pairing involves larger chromosomal regions and that the two chromosome territories come close together. Frequent pairing is not associated with imprinted status or DNA repair, but is influenced by chromosomal location and transcription. We propose that homologous pairing is not exclusive to specialised regions or specific functional events, and speculate that it provides the cell with the opportunity of trans-allelic effects on gene regulation. PMID:22802932

  11. Regional signals in the planarian body guide stem cell fate in the presence of genomic instability.

    PubMed

    Peiris, T Harshani; Ramirez, Daniel; Barghouth, Paul G; Ofoha, Udokanma; Davidian, Devon; Weckerle, Frank; Oviedo, Néstor J

    2016-05-15

    Cellular fate decisions are influenced by their topographical location in the adult body. For instance, tissue repair and neoplastic growth are greater in anterior than in posterior regions of adult animals. However, the molecular underpinnings of these regional differences are unknown. We identified a regional switch in the adult planarian body upon systemic disruption of homologous recombination with RNA-interference of Rad51 Rad51 knockdown increases DNA double-strand breaks (DSBs) throughout the body, but stem cells react differently depending on their location along the anteroposterior axis. In the presence of extensive DSBs, cells in the anterior part of the body resist death, whereas cells in the posterior region undergo apoptosis. Furthermore, we found that proliferation of cells with DNA damage is induced in the presence of brain tissue and that the retinoblastoma pathway enables overproliferation of cells with DSBs while attending to the demands of tissue growth and repair. Our results implicate both autonomous and non-autonomous mechanisms as key mediators of regional cell behavior and cellular transformation in the adult body. PMID:27013241

  12. QTLs Regulating the Contents of Antioxidants, Phenolics, and Flavonoids in Soybean Seeds Share a Common Genomic Region.

    PubMed

    Li, Man-Wah; Muñoz, Nacira B; Wong, Chi-Fai; Wong, Fuk-Ling; Wong, Kwong-Sen; Wong, Johanna Wing-Hang; Qi, Xinpeng; Li, Kwan-Pok; Ng, Ming-Sin; Lam, Hon-Ming

    2016-01-01

    Soybean seeds are a rich source of phenolic compounds, especially isoflavonoids, which are important nutraceuticals. Our study using 14 wild- and 16 cultivated-soybean accessions shows that seeds from cultivated soybeans generally contain lower total antioxidants compared to their wild counterparts, likely an unintended consequence of domestication or human selection. Using a recombinant inbred population resulting from a wild and a cultivated soybean parent and a bin map approach, we have identified an overlapping genomic region containing major quantitative trait loci (QTLs) that regulate the seed contents of total antioxidants, phenolics, and flavonoids. The QTL for seed antioxidant content contains 14 annotated genes based on the Williams 82 reference genome (Gmax1.01). None of these genes encodes functions that are related to the phenylpropanoid pathway of soybean. However, we found three putative Multidrug And Toxic Compound Extrusion (MATE) transporter genes within this QTL and one adjacent to it (GmMATE1-4). Moreover, we have identified non-synonymous changes between GmMATE1 and GmMATE2, and that GmMATE3 encodes an antisense transcript that expresses in pods. Whether the polymorphisms in GmMATE proteins are major determinants of the antioxidant contents, or whether the antisense transcripts of GmMATE3 play important regulatory roles, awaits further functional investigations.

  13. Identification and characterization of regions of difference between the Salmonella Gallinarum biovar Gallinarum and the Salmonella Gallinarum biovar Pullorum genomes.

    PubMed

    Batista, Diego Felipe Alves; Freitas Neto, Oliveiro Caetano; Barrow, Paul Andrew; Oliveira, Marcos Túlio de; Almeida, Adriana Maria; Ferraudo, Antonio Sergio; Berchieri, Angelo

    2015-03-01

    Salmonella Gallinarum is the causative agent of fowl typhoid, a severe septicaemic disease that affects birds of all ages, whereas S. Pullorum causes pullorum disease, a systemic disorder affecting primarily young birds. A proportion of birds with pullorum disease become carriers and are thereby able to transmit S. Pullorum vertically. Although these two pathogens cause distinct diseases, they are otherwise phenotypically and genetically similar. Therefore, the small variations that lead to the differences in virulence must have a genetic basis which currently is unknown. In the present study, we compared the genome sequences of S. Gallinarum (strains: SG287/91 and SG9) and S. Pullorum (strains: SP_CDC, SP_RKS, SP_FCAV, SP_S06) and identified 223 regions of difference (RODs), characterized by indels which were detected by using the software Artemis Comparison Tool. Some of the RODs led to pseudogenes frequently formed by frameshifts and premature stop codons in genes primarily involved in virulence and metabolism. We further verified the presence of some conserved RODs by PCR in 26 isolates of S. Gallinarum and 17 of S. Pullorum in order to extrapolate data analyses from genome comparison to field strains. The variations observed in virulence-related genes of S. Gallinarum and S. Pullorum appear not to be sufficient to explain the differences between the distinct biology of infection of fowl typhoid and pullorum disease. Thus, we suggest that the identified pseudogenes affecting metabolism might play a greater role during infection than previously thought. PMID:25497350

  14. QTLs Regulating the Contents of Antioxidants, Phenolics, and Flavonoids in Soybean Seeds Share a Common Genomic Region.

    PubMed

    Li, Man-Wah; Muñoz, Nacira B; Wong, Chi-Fai; Wong, Fuk-Ling; Wong, Kwong-Sen; Wong, Johanna Wing-Hang; Qi, Xinpeng; Li, Kwan-Pok; Ng, Ming-Sin; Lam, Hon-Ming

    2016-01-01

    Soybean seeds are a rich source of phenolic compounds, especially isoflavonoids, which are important nutraceuticals. Our study using 14 wild- and 16 cultivated-soybean accessions shows that seeds from cultivated soybeans generally contain lower total antioxidants compared to their wild counterparts, likely an unintended consequence of domestication or human selection. Using a recombinant inbred population resulting from a wild and a cultivated soybean parent and a bin map approach, we have identified an overlapping genomic region containing major quantitative trait loci (QTLs) that regulate the seed contents of total antioxidants, phenolics, and flavonoids. The QTL for seed antioxidant content contains 14 annotated genes based on the Williams 82 reference genome (Gmax1.01). None of these genes encodes functions that are related to the phenylpropanoid pathway of soybean. However, we found three putative Multidrug And Toxic Compound Extrusion (MATE) transporter genes within this QTL and one adjacent to it (GmMATE1-4). Moreover, we have identified non-synonymous changes between GmMATE1 and GmMATE2, and that GmMATE3 encodes an antisense transcript that expresses in pods. Whether the polymorphisms in GmMATE proteins are major determinants of the antioxidant contents, or whether the antisense transcripts of GmMATE3 play important regulatory roles, awaits further functional investigations. PMID:27379137

  15. QTLs Regulating the Contents of Antioxidants, Phenolics, and Flavonoids in Soybean Seeds Share a Common Genomic Region

    PubMed Central

    Li, Man-Wah; Muñoz, Nacira B.; Wong, Chi-Fai; Wong, Fuk-Ling; Wong, Kwong-Sen; Wong, Johanna Wing-Hang; Qi, Xinpeng; Li, Kwan-Pok; Ng, Ming-Sin; Lam, Hon-Ming

    2016-01-01

    Soybean seeds are a rich source of phenolic compounds, especially isoflavonoids, which are important nutraceuticals. Our study using 14 wild- and 16 cultivated-soybean accessions shows that seeds from cultivated soybeans generally contain lower total antioxidants compared to their wild counterparts, likely an unintended consequence of domestication or human selection. Using a recombinant inbred population resulting from a wild and a cultivated soybean parent and a bin map approach, we have identified an overlapping genomic region containing major quantitative trait loci (QTLs) that regulate the seed contents of total antioxidants, phenolics, and flavonoids. The QTL for seed antioxidant content contains 14 annotated genes based on the Williams 82 reference genome (Gmax1.01). None of these genes encodes functions that are related to the phenylpropanoid pathway of soybean. However, we found three putative Multidrug And Toxic Compound Extrusion (MATE) transporter genes within this QTL and one adjacent to it (GmMATE1-4). Moreover, we have identified non-synonymous changes between GmMATE1 and GmMATE2, and that GmMATE3 encodes an antisense transcript that expresses in pods. Whether the polymorphisms in GmMATE proteins are major determinants of the antioxidant contents, or whether the antisense transcripts of GmMATE3 play important regulatory roles, awaits further functional investigations. PMID:27379137

  16. Relative effects of mutability and selection on single nucleotide polymorphisms in transcribed regions of the human genome

    PubMed Central

    Gorlov, Ivan P; Gorlova, Olga Y; Amos, Christopher I

    2008-01-01

    Motivation Single nucleotide polymorphisms (SNPs) are the most common type of genetic variation in humans. However, the factors that affect SNP density are poorly understood. The goal of this study was to estimate the relative effects of mutability and selection on SNP density in transcribed regions of human genes. It is important for prediction of the regions that harbor functional polymorphisms. Results We used frequency-validated SNPs resulting from single-nucleotide substitutions. SNPs were subdivided into five functional categories: (i) 5' untranslated region (UTR) SNPs, (ii) 3' UTR SNPs, (iii) synonymous SNPs, (iv) SNPs producing conservative missense mutations, and (v) SNPs producing radical missense mutations. Each of these categories was further subdivided into nine mutational categories on the basis of the single-nucleotide substitution type. Thus, 45 functional/mutational categories were analyzed. The relative mutation rate in each mutational category was estimated on the basis of published data. The proportion of segregating sites (PSSs) for each functional/mutational category was estimated by dividing the observed number of SNPs by the number of potential sites in the genome for a given functional/mutational category. By analyzing each functional group separately, we found significant positive correlations between PSSs and relative mutation rates (Spearman's correlation coefficient, at least r = 0.96, df = 9, P < 0.001). We adjusted the PSSs for the mutation rate and found that the functional category had a significant effect on SNP density (F = 5.9, df = 4, P = 0.001), suggesting that selection affects SNP density in transcribed regions of the genome. We used analyses of variance and covariance to estimate the relative effects of selection (functional category) and mutability (relative mutation rate) on the PSSs and found that approximately 87% of variation in PSS was due to variation in the mutation rate and approximately 13% was due to selection

  17. Genome-wide association study reveals regions associated with gestation length in two pig populations.

    PubMed

    Hidalgo, A M; Lopes, M S; Harlizius, B; Bastiaansen, J W M

    2016-04-01

    Reproduction traits, such as gestation length (GLE), play an important role in dam line breeding in pigs. The objective of our study was to identify single nucleotide polymorphisms (SNPs) that are associated with GLE in two pig populations. Genotypes and deregressed breeding values were available for 2081 Dutch Landrace-based (DL) and 2301 Large White-based (LW) pigs. We identified two QTL regions for GLE, one in each population. For DL, three associated SNPs were detected in one QTL region spanning 0.52 Mbp on Sus scrofa chromosome (SSC) 2. For LW, four associated SNPs were detected in one region of 0.14 Mbp on SSC5. The region on SSC2 contains the heparin-binding EGF-like growth factor (HBEGF) gene, which promotes embryo implantation and has been described to be involved in embryo survival throughout gestation. The associated SNP can be used for marker-assisted selection in the studied populations, and further studies of the HBEGF gene are warranted to investigate its role in GLE.

  18. The compact Brachypodium genome conserves centromeric regions of a common ancestor with wheat and rice

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The evolution of five chromosomes of Brachypodium distachyon from a 12-chromosome ancestor of all grasses by dysploidy raises an interesting question about the fate of redundant centromeres. Three independent but complementary approaches were pursued to study centromeric region homologies among the ...

  19. Cloning and genomic nucleotide sequence of the matrix attachment region binding protein from the halotolerant alga Dunaliella salina.

    PubMed

    Wang, Peng-Ju; Wang, Tian-Yun; Wang, Ya-Feng; Yang, Rui; Li, Zhao-Xi

    2013-07-01

    In our previous study, the sequence of a matrix attachment region binding protein (MBP) cDNA was cloned from the unicellular green alga Dunaliella salina. However, the nucleotide sequence of this gene has not been reported so far. In this paper, the nucleotide sequence of MBP was cloned and characterized, and its gene copy number was determined. The MBP nucleotide sequence is 5641 bp long, and interrupted by 12 introns ranging from 132 to 562 bp. All the introns in the D. salina MBP gene have orthodox splice sites, exhibiting GT at the 5' end and AG at the 3' end. Southern blot analysis showed that MBP only has one copy in the D. salina genome. PMID:22961592

  20. Cloning and genomic nucleotide sequence of the matrix attachment region binding protein from the halotolerant alga Dunaliella salina.

    PubMed

    Wang, Peng-Ju; Wang, Tian-Yun; Wang, Ya-Feng; Yang, Rui; Li, Zhao-Xi

    2013-07-01

    In our previous study, the sequence of a matrix attachment region binding protein (MBP) cDNA was cloned from the unicellular green alga Dunaliella salina. However, the nucleotide sequence of this gene has not been reported so far. In this paper, the nucleotide sequence of MBP was cloned and characterized, and its gene copy number was determined. The MBP nucleotide sequence is 5641 bp long, and interrupted by 12 introns ranging from 132 to 562 bp. All the introns in the D. salina MBP gene have orthodox splice sites, exhibiting GT at the 5' end and AG at the 3' end. Southern blot analysis showed that MBP only has one copy in the D. salina genome.

  1. Genome-wide association studies identifies seven major regions responsible for iron deficiency chlorosis in soybean (Glycine max).

    PubMed

    Mamidi, Sujan; Lee, Rian K; Goos, Jay R; McClean, Phillip E

    2014-01-01

    Iron deficiency chlorosis (IDC) is a yield limiting problem in soybean (Glycine max (L.) Merr) production regions with calcareous soils. Genome-wide association study (GWAS) was performed using a high density SNP map to discover significant markers, QTL and candidate genes associated with IDC trait variation. A stepwise regression model included eight markers after considering LD between markers, and identified seven major effect QTL on seven chromosomes. Twelve candidate genes known to be associated with iron metabolism mapped near these QTL supporting the polygenic nature of IDC. A non-synonymous substitution with the highest significance in a major QTL region suggests soybean orthologs of FRE1 on Gm03 is a major gene responsible for trait variation. NAS3, a gene that encodes the enzyme nicotianamine synthase which synthesizes the iron chelator nicotianamine also maps to the same QTL region. Disease resistant genes also map to the major QTL, supporting the hypothesis that pathogens compete with the plant for Fe and increase iron deficiency. The markers and the allelic combinations identified here can be further used for marker assisted selection.

  2. Genome-Wide Association Studies Identifies Seven Major Regions Responsible for Iron Deficiency Chlorosis in Soybean (Glycine max)

    PubMed Central

    Mamidi, Sujan; Lee, Rian K.; Goos, Jay R.; McClean, Phillip E.

    2014-01-01

    Iron deficiency chlorosis (IDC) is a yield limiting problem in soybean (Glycine max (L.) Merr) production regions with calcareous soils. Genome-wide association study (GWAS) was performed using a high density SNP map to discover significant markers, QTL and candidate genes associated with IDC trait variation. A stepwise regression model included eight markers after considering LD between markers, and identified seven major effect QTL on seven chromosomes. Twelve candidate genes known to be associated with iron metabolism mapped near these QTL supporting the polygenic nature of IDC. A non-synonymous substitution with the highest significance in a major QTL region suggests soybean orthologs of FRE1 on Gm03 is a major gene responsible for trait variation. NAS3, a gene that encodes the enzyme nicotianamine synthase which synthesizes the iron chelator nicotianamine also maps to the same QTL region. Disease resistant genes also map to the major QTL, supporting the hypothesis that pathogens compete with the plant for Fe and increase iron deficiency. The markers and the allelic combinations identified here can be further used for marker assisted selection. PMID:25225893

  3. Origin of the CMS gene locus in rapeseed cybrid mitochondria: active and inactive recombination produces the complex CMS gene region in the mitochondrial genomes of Brassicaceae.

    PubMed

    Oshima, Masao; Kikuchi, Rie; Imamura, Jun; Handa, Hirokazu

    2010-01-01

    CMS (cytoplasmic male sterile) rapeseed is produced by asymmetrical somatic cell fusion between the Brassica napus cv. Westar and the Raphanus sativus Kosena CMS line (Kosena radish). The CMS rapeseed contains a CMS gene, orf125, which is derived from Kosena radish. Our sequence analyses revealed that the orf125 region in CMS rapeseed originated from recombination between the orf125/orfB region and the nad1C/ccmFN1 region by way of a 63 bp repeat. A precise sequence comparison among the related sequences in CMS rapeseed, Kosena radish and normal rapeseed showed that the orf125 region in CMS rapeseed consisted of the Kosena orf125/orfB region and the rapeseed nad1C/ccmFN1 region, even though Kosena radish had both the orf125/orfB region and the nad1C/ccmFN1 region in its mitochondrial genome. We also identified three tandem repeat sequences in the regions surrounding orf125, including a 63 bp repeat, which were involved in several recombination events. Interestingly, differences in the recombination activity for each repeat sequence were observed, even though these sequences were located adjacent to each other in the mitochondrial genome. We report results indicating that recombination events within the mitochondrial genomes are regulated at the level of specific repeat sequences depending on the cellular environment.

  4. Integrated pathway-based approach identifies association between genomic regions at CTCF and CACNB2 and schizophrenia.

    PubMed

    Juraeva, Dilafruz; Haenisch, Britta; Zapatka, Marc; Frank, Josef; Witt, Stephanie H; Mühleisen, Thomas W; Treutlein, Jens; Strohmaier, Jana; Meier, Sandra; Degenhardt, Franziska; Giegling, Ina; Ripke, Stephan; Leber, Markus; Lange, Christoph; Schulze, Thomas G; Mössner, Rainald; Nenadic, Igor; Sauer, Heinrich; Rujescu, Dan; Maier, Wolfgang; Børglum, Anders; Ophoff, Roel; Cichon, Sven; Nöthen, Markus M; Rietschel, Marcella; Mattheisen, Manuel; Brors, Benedikt

    2014-06-01

    In the present study, an integrated hierarchical approach was applied to: (1) identify pathways associated with susceptibility to schizophrenia; (2) detect genes that may be potentially affected in these pathways since they contain an associated polymorphism; and (3) annotate the functional consequences of such single-nucleotide polymorphisms (SNPs) in the affected genes or their regulatory regions. The Global Test was applied to detect schizophrenia-associated pathways using discovery and replication datasets comprising 5,040 and 5,082 individuals of European ancestry, respectively. Information concerning functional gene-sets was retrieved from the Kyoto Encyclopedia of Genes and Genomes, Gene Ontology, and the Molecular Signatures Database. Fourteen of the gene-sets or pathways identified in the discovery dataset were confirmed in the replication dataset. These include functional processes involved in transcriptional regulation and gene expression, synapse organization, cell adhesion, and apoptosis. For two genes, i.e. CTCF and CACNB2, evidence for association with schizophrenia was available (at the gene-level) in both the discovery study and published data from the Psychiatric Genomics Consortium schizophrenia study. Furthermore, these genes mapped to four of the 14 presently identified pathways. Several of the SNPs assigned to CTCF and CACNB2 have potential functional consequences, and a gene in close proximity to CACNB2, i.e. ARL5B, was identified as a potential gene of interest. Application of the present hierarchical approach thus allowed: (1) identification of novel biological gene-sets or pathways with potential involvement in the etiology of schizophrenia, as well as replication of these findings in an independent cohort; (2) detection of genes of interest for future follow-up studies; and (3) the highlighting of novel genes in previously reported candidate regions for schizophrenia.

  5. Proteins encoded in genomic regions associated with immune-mediated disease physically interact and suggest underlying biology.

    PubMed

    Rossin, Elizabeth J; Lage, Kasper; Raychaudhuri, Soumya; Xavier, Ramnik J; Tatar, Diana; Benita, Yair; Cotsapas, Chris; Daly, Mark J

    2011-01-01

    Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in

  6. Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology

    PubMed Central

    Rossin, Elizabeth J.; Lage, Kasper; Raychaudhuri, Soumya; Xavier, Ramnik J.; Tatar, Diana; Benita, Yair

    2011-01-01

    Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein–protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in

  7. SLaP mapper: a webserver for identifying and quantifying spliced-leader addition and polyadenylation site usage in kinetoplastid genomes.

    PubMed

    Fiebig, Michael; Gluenz, Eva; Carrington, Mark; Kelly, Steven

    2014-09-01

    The Kinetoplastida are a diverse and globally distributed class of free-living and parasitic single-celled eukaryotes that collectively cause a significant burden on human health and welfare. In kinetoplastids individual genes do not have promoters, but rather all genes are arranged downstream of a small number of RNA polymerase II transcription initiation sites and are thus transcribed in polycistronic gene clusters. Production of individual mRNAs from this continuous transcript occurs co-transcriptionally by trans-splicing of a ∼39 nucleotide capped RNA and subsequent polyadenylation of the upstream mRNA. SLaP mapper (Spliced-Leader and Polyadenylation mapper) is a fully automated web-service for identification, quantitation and gene-assignment of both spliced-leader and polyadenylation addition sites in Kinetoplastid genomes. SLaP mapper only requires raw read data from paired-end Illumina RNAseq and performs all read processing, mapping, quality control, quantification, and analysis in a fully automated pipeline. To provide usage examples and estimates of the quantity of sequence data required we use RNAseq obtained from two different library preparations from both Trypanosoma brucei and Leishmania mexicana to show the number of expected reads that are obtained from each preparation type. SLaP mapper is an easy to use, platform independent webserver that is freely available for use at http://www.stevekellylab.com/software/slap. Example files are provided on the website.

  8. Mapping of transforming region of the Harvey murine sarcoma virus genome by using insertion-deletion mutants constructed in vitro.

    PubMed Central

    Wei, C M; Lowy, D R; Scolnick, E M

    1980-01-01

    Circular DNA intermediates of Harvey murine sarcoma virus (Ha-MuSV) have been cloned in lambda gtWES . lambda B and shown to be capable of transforming mouse NIH 3T3 cells [Hager, G. L., Chang, E. H., Chan, H. W., Garon, C. F., Israel, M. A., Martin, M. A., Scolnick, E. M. & Lowy, D. R. (1979) J. Virol. 31, 795-809]. By using the cloned Ha-MuSV DNA insert as a parental genome, we have constructed a series of insertion-deletion mutants by inserting an octomer containing the Sal I linker sequence (G-G-T-C-G-A-C-C) into various regions of the Ha-MuSV genome after partial digestion with Hae III. After ligation into lambda gtWES . lambda B-Sal I vector molecules, the mutant Ha-MuSV DNAs were cloned. Fourteen insertion-deletion mutants have been mapped by restriction enzyme digestion, and their biological activities have been correlated with the locations of mutations. The mutants whose lesion mapped within 3.0 kilobases (kb) frm the 3'-end of the Ha-MuSV genome retained full transforming ability. The mutants containing the Sal I linker insertion at 0.4 or 1.5 kb from the 5'-end also retained transforming ability, but the number of foci induced by the DNAs in transfection assays was greatly reduced. However, a mutant containing a deletion of 1.5 kb at the 5'-end and a mutant with a deletion of the sequences between 1.0 and 1.5 kb from the 5'-end completely lost their transforming potential. A model for the transforming region of Ha-MuSV is discussed. Furthermore, because Ha-MuSV sequences can be rescued from the mouse cells transformed by these mutants using Moloney murine leukemia virus as a helper virus, it implies that the in vitro modified DNAs may be converted into genuine mutant viruses. Images PMID:6254037

  9. Constructing chromosome- and region-specific cosmid maps of the human genome.

    PubMed

    Carrano, A V; de Jong, P J; Branscomb, E; Slezak, T; Watkins, B W

    1989-01-01

    A chromosome-specific ordered set of cosmids would be a significant contribution toward understanding human chromosome structure and function. We are developing two parallel approaches for creating an ordered cosmid library of human chromosome 19 and other selected subregions of the human genome. The "bottom up" approach is used to establish sets of overlapping cosmids as islands or "contigs" along the chromosome, while the "top down" approach, using pulsed-field gel electrophoresis and yeast cloning, will establish a large-fragment map and close the inevitable gaps remaining from the "bottom up" approach. Source DNA consists of a single homolog of chromosome 19 from a hamster--human hybrid cell and human fragments cloned in yeast artificial chromosomes. We have constructed cosmid libraries in a vector that facilitates cloning small amounts of DNA, allows transcription of the insert termini, and contains unique sites for partial-digest mapping. Computer simulations of cosmid contig building suggest that near-optimal efficiency can be achieved with high-density restriction fragment digest schemes that can detect 20-30% overlap between cosmids. We developed the chemistry and data analysis tools to compare the ordering efficiencies of several cosmid restriction digest fingerprinting strategies. Restriction fragments from a four-cutter digest are labeled with a fluorochrome, separated by polyacrylamide gel electrophoresis, and detected after laser excitation as they traverse a fixed point in the gel. We have also developed the software to rapidly process the output signal to define and analyze the fragment peaks. Up to three cosmids (or three different digests of the same cosmid) plus a size standard are analyzed simultaneously in a single gel lane.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:2698823

  10. Supershear rupture in the 24 May 2013 Mw 6.7 Okhotsk deep earthquake: Additional evidence from regional seismic stations

    NASA Astrophysics Data System (ADS)

    Zhan, Zhongwen; Shearer, Peter M.; Kanamori, Hiroo

    2015-10-01

    Zhan et al. (2014a) reported supershear rupture during the Mw 6.7 aftershock of the 2013 Mw 8.3 Sea of Okhotsk deep earthquake, relying heavily on the regional station PET, which played a critical role in constraining the vertical rupture dimension and rupture speed. Here we include five more regional stations and find that the durations of the source time functions derived from these stations are consistent with Zhan et al.'s supershear rupture model. Furthermore, to reduce the nonuniqueness of deconvolution and combine the bandwidths of different stations, we conduct a joint inversion of the six regional stations for a single broadband moment-rate function (MRF). The best fitting MRF, which explains all the regional waveforms well, has a smooth shape without any temporal gaps. The Mw 6.7 Okhotsk deep earthquake is more likely a continuous supershear rupture than a dynamically triggered doublet.

  11. Predicting the Effects of Nano-Scale Cerium Additives in Diesel Fuel on Regional-Scale Air Quality

    EPA Science Inventory

    Diesel vehicles are a major source of air pollutant emissions. Fuel additives containing nanoparticulate cerium (nCe) are currently being used in some diesel vehicles to improve fuel efficiency. These fuel additives also reduce fine particulate matter (PM2.5) emissio...

  12. Avian papillomaviruses: the parrot Psittacus erithacus papillomavirus (PePV) genome has a unique organization of the early protein region and is phylogenetically related to the chaffinch papillomavirus

    PubMed Central

    Tachezy, Ruth; Rector, Annabel; Havelkova, Marta; Wollants, Elke; Fiten, Pierre; Opdenakker, Ghislain; Jenson, A Bennett; Sundberg, John P; Van Ranst, Marc

    2002-01-01

    Background An avian papillomavirus genome has been cloned from a cutaneous exophytic papilloma from an African grey parrot (Psittacus erithacus). The nucleotide sequence, genome organization, and phylogenetic position of the Psittacus erithacus papillomavirus (PePV) were determined. This PePV sequence represents the first complete avian papillomavirus genome defined. Results The PePV genome (7304 basepairs) differs from other papillomaviruses, in that it has a unique organization of the early protein region lacking classical E6 and E7 open reading frames. Phylogenetic comparison of the PePV sequence with partial E1 and L1 sequences of the chaffinch (Fringilla coelebs) papillomavirus (FPV) reveals that these two avian papillomaviruses form a monophyletic cluster with a common branch that originates near the unresolved center of the papillomavirus evolutionary tree. Conclusions The PePV genome has a unique layout of the early protein region which represents a novel prototypic genomic organization for avian papillomaviruses. The close relationship between PePV and FPV, and between their Psittaciformes and Passeriformes hosts, supports the hypothesis that papillomaviruses have co-evolved and speciated together with their host species throughout evolution. PMID:12110158

  13. Multiple Sex-Associated Regions and a Putative Sex Chromosome in Zebrafish Revealed by RAD Mapping and Population Genomics

    PubMed Central

    Anderson, Jennifer L.; Rodríguez Marí, Adriana; Braasch, Ingo; Amores, Angel; Hohenlohe, Paul; Batzel, Peter; Postlethwait, John H.

    2012-01-01

    Within vertebrates, major sex determining genes can differ among taxa and even within species. In zebrafish (Danio rerio), neither heteromorphic sex chromosomes nor single sex determination genes of large effect, like Sry in mammals, have yet been identified. Furthermore, environmental factors can influence zebrafish sex determination. Although progress has been made in understanding zebrafish gonad differentiation (e.g. the influence of germ cells on gonad fate), the primary genetic basis of zebrafish sex determination remains poorly understood. To identify genetic loci associated with sex, we analyzed F2 offspring of reciprocal crosses between Oregon *AB and Nadia (NA) wild-type zebrafish stocks. Genome-wide linkage analysis, using more than 5,000 sequence-based polymorphic restriction site associated (RAD-tag) markers and population genomic analysis of more than 30,000 single nucleotide polymorphisms in our *ABxNA crosses revealed a sex-associated locus on the end of the long arm of chr-4 for both cross families, and an additional locus in the middle of chr-3 in one cross family. Additional sequencing showed that two SNPs in dmrt1 previously suggested to be functional candidates for sex determination in a cross of ABxIndia wild-type zebrafish, are not associated with sex in our AB fish. Our data show that sex determination in zebrafish is polygenic and that different genes may influence sex determination in different strains or that different genes become more important under different environmental conditions. The association of the end of chr-4 with sex is remarkable because, unique in the karyotype, this chromosome arm shares features with known sex chromosomes: it is highly heterochromatic, repetitive, late replicating, and has reduced recombination. Our results reveal that chr-4 has functional and structural properties expected of a sex chromosome. PMID:22792396

  14. Genome-Wide Identification of Chromatin Transitional Regions Reveals Diverse Mechanisms Defining the Boundary of Facultative Heterochromatin

    PubMed Central

    Li, Guangyao; Zhou, Lei

    2013-01-01

    Due to the self-propagating nature of the heterochromatic modification H3K27me3, chromatin barrier activities are required to demarcate the boundary and prevent it from encroaching into euchromatic regions. Studies in Drosophila and vertebrate systems have revealed several important chromatin barrier elements and their respective binding factors. However, epigenomic data indicate that the binding of these factors are not exclusive to chromatin boundaries. To gain a comprehensive understanding of facultative heterochromatin boundaries, we developed a two-tiered method to identify the Chromatin Transitional Region (CTR), i.e. the nucleosomal region that shows the greatest transition rate of the H3K27me3 modification as revealed by ChIP-Seq. This approach was applied to identify CTRs in Drosophila S2 cells and human HeLa cells. Although many insulator proteins have been characterized in Drosophila, less than half of the CTRs in S2 cells are associated with known insulator proteins, indicating unknown mechanisms remain to be characterized. Our analysis also revealed that the peak binding of insulator proteins are usually 1–2 nucleosomes away from the CTR. Comparison of CTR-associated insulator protein binding sites vs. those in heterochromatic region revealed that boundary-associated binding sites are distinctively flanked by nucleosome destabilizing sequences, which correlates with significant decreased nucleosome density and increased binding intensities of co-factors. Interestingly, several subgroups of boundaries have enhanced H3.3 incorporation but reduced nucleosome turnover rate. Our genome-wide study reveals that diverse mechanisms are employed to define the boundaries of facultative heterochromatin. In both Drosophila and mammalian systems, only a small fraction of insulator protein binding sites co-localize with H3K27me3 boundaries. However, boundary-associated insulator binding sites are distinctively flanked by nucleosome destabilizing sequences, which

  15. Characterization of biological pathways associated with a 1.37 Mbp genomic region protective of hypertension in Dahl S rats.

    PubMed

    Cowley, Allen W; Moreno, Carol; Jacob, Howard J; Peterson, Christine B; Stingo, Francesco C; Ahn, Kwang Woo; Liu, Pengyuan; Vannucci, Marina; Laud, Purushottam W; Reddy, Prajwal; Lazar, Jozef; Evans, Louise; Yang, Chun; Kurth, Theresa; Liang, Mingyu

    2014-06-01

    The goal of the present study was to narrow a region of chromosome 13 to only several genes and then apply unbiased statistical approaches to identify molecular networks and biological pathways relevant to blood-pressure salt sensitivity in Dahl salt-sensitive (SS) rats. The analysis of 13 overlapping subcongenic strains identified a 1.37 Mbp region on chromosome 13 that influenced the mean arterial blood pressure by at least 25 mmHg in SS rats fed a high-salt diet. DNA sequencing and analysis filled genomic gaps and provided identification of five genes in this region, Rfwd2, Fam5b, Astn1, Pappa2, and Tnr. A cross-platform normalization of transcriptome data sets obtained from our previously published Affymetrix GeneChip dataset and newly acquired RNA-seq data from renal outer medullary tissue provided 90 observations for each gene. Two Bayesian methods were used to analyze the data: 1) a linear model analysis to assess 243 biological pathways for their likelihood to discriminate blood pressure levels across experimental groups and 2) a Bayesian graphical modeling of pathways to discover genes with potential relationships to the candidate genes in this region. As none of these five genes are known to be involved in hypertension, this unbiased approach has provided useful clues to be experimentally explored. Of these five genes, Rfwd2, the gene most strongly expressed in the renal outer medulla, was notably associated with pathways that can affect blood pressure via renal transcellular Na(+) and K(+) electrochemical gradients and tubular Na(+) transport, mitochondrial TCA cycle and cell energetics, and circadian rhythms.

  16. Positive selection in the chromosome 16 VKORC1 genomic region has contributed to the variability of anticoagulant response in humans.

    PubMed

    Patillon, Blandine; Luisi, Pierre; Blanché, Hélène; Patin, Etienne; Cann, Howard M; Génin, Emmanuelle; Sabbagh, Audrey

    2012-01-01

    VKORC1 (vitamin K epoxide reductase complex subunit 1, 16p11.2) is the main genetic determinant of human response to oral anticoagulants of antivitamin K type (AVK). This gene was recently suggested to be a putative target of positive selection in East Asian populations. In this study, we genotyped the HGDP-CEPH Panel for six VKORC1 SNPs and downloaded chromosome 16 genotypes from the HGDP-CEPH database in order to characterize the geographic distribution of footprints of positive selection within and around this locus. A unique VKORC1 haplotype carrying the promoter mutation associated with AVK sensitivity showed especially high frequencies in all the 17 HGDP-CEPH East Asian population samples. VKORC1 and 24 neighboring genes were found to lie in a 505 kb region of strong linkage disequilibrium in these populations. Patterns of allele frequency differentiation and haplotype structure suggest that this genomic region has been submitted to a near complete selective sweep in all East Asian populations and only in this geographic area. The most extreme scores of the different selection tests are found within a smaller 45 kb region that contains VKORC1 and three other genes (BCKDK, MYST1 (KAT8), and PRSS8) with different functions. Because of the strong linkage disequilibrium, it is not possible to determine if VKORC1 or one of the three other genes is the target of this strong positive selection that could explain present-day differences among human populations in AVK dose requirement. Our results show that the extended region surrounding a presumable single target of positive selection should be analyzed for genetic variation in a wide range of genetically diverse populations in order to account for other neighboring and confounding selective events and the hitchhiking effect.

  17. Complete genome sequence of mandarin decline Citrus tristeza virus of the Northeastern Himalayan hill region of India: comparative analyses determine recombinant.

    PubMed

    Biswas, Kajal K; Tarafdar, Avijit; Sharma, Susheel K

    2012-03-01

    The complete genome sequence of a mandarin (Citrus reticulata) decline CTV isolate, Kpg3, of the Darjeeling hills of the Northeastern Himalayan region of India is reported for the first time. The complete Kpg3 genome has 19253 nt, and its nucleotide sequence identity ranged from 79% with the Florida CTV isolate T36 to 94% with the Israel isolate VT, whereas its identity to B165, the other Indian isolate, was 89%. Phylogenetic analysis indicated that the Kpg3 genome is closely related to isolate VT and distantly to T36 and B165. Recombination analysis indicated that Kpg3 is recombinant and originated through multiple recombination events in which parts of the genome were exchanged between divergent CTV sequences.

  18. Rapid parallel evolution of standing variation in a single, complex, genomic region is associated with life history in steelhead/rainbow trout.

    PubMed

    Pearse, Devon E; Miller, Michael R; Abadía-Cardoso, Alicia; Garza, John Carlos

    2014-05-22

    Rapid adaptation to novel environments may drive changes in genomic regions through natural selection. Such changes may be population-specific or, alternatively, may involve parallel evolution of the same genomic region in multiple populations, if that region contains genes or co-adapted gene complexes affecting the selected trait(s). Both quantitative and population genetic approaches have identified associations between specific genomic regions and the anadromous (steelhead) and resident (rainbow trout) life-history strategies of Oncorhynchus mykiss. Here, we use genotype data from 95 single nucleotide polymorphisms and show that the distribution of variation in a large region of one chromosome, Omy5, is strongly associated with life-history differentiation in multiple above-barrier populations of rainbow trout and their anadromous steelhead ancestors. The associated loci are in strong linkage disequilibrium, suggesting the presence of a chromosomal inversion or other rearrangement limiting recombination. These results provide the first evidence of a common genomic basis for life-history variation in O. mykiss in a geographically diverse set of populations and extend our knowledge of the heritable basis of rapid adaptation of complex traits in novel habitats.

  19. In situ optical sequencing and structure analysis of a trinucleotide repeat genome region by localization microscopy after specific COMBO-FISH nano-probing

    NASA Astrophysics Data System (ADS)

    Stuhlmüller, M.; Schwarz-Finsterle, J.; Fey, E.; Lux, J.; Bach, M.; Cremer, C.; Hinderhofer, K.; Hausmann, M.; Hildenbrand, G.

    2015-10-01

    Trinucleotide repeat expansions (like (CGG)n) of chromatin in the genome of cell nuclei can cause neurological disorders such as for example the Fragile-X syndrome. Until now the mechanisms are not clearly understood as to how these expansions develop during cell proliferation. Therefore in situ investigations of chromatin structures on the nanoscale are required to better understand supra-molecular mechanisms on the single cell level. By super-resolution localization microscopy (Spectral Position Determination Microscopy; SPDM) in combination with nano-probing using COMBO-FISH (COMBinatorial Oligonucleotide FISH), novel insights into the nano-architecture of the genome will become possible. The native spatial structure of trinucleotide repeat expansion genome regions was analysed and optical sequencing of repetitive units was performed within 3D-conserved nuclei using SPDM after COMBO-FISH. We analysed a (CGG)n-expansion region inside the 5' untranslated region of the FMR1 gene. The number of CGG repeats for a full mutation causing the Fragile-X syndrome was found and also verified by Southern blot. The FMR1 promotor region was similarly condensed like a centromeric region whereas the arrangement of the probes labelling the expansion region seemed to indicate a loop-like nano-structure. These results for the first time demonstrate that in situ chromatin structure measurements on the nanoscale are feasible. Due to further methodological progress it will become possible to estimate the state of trinucleotide repeat mutations in detail and to determine the associated chromatin strand structural changes on the single cell level. In general, the application of the described approach to any genome region will lead to new insights into genome nano-architecture and open new avenues for understanding mechanisms and their relevance in the development of heredity diseases.

  20. A novel mitochondrial genome architecture in thrips (Insecta: Thysanoptera): extreme size asymmetry among chromosomes and possible recent control region duplication

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Multi-partite mitochondrial genomes are very rare in animals but have been found previously in two insect orders with highly rearranged genomes, the Phthiraptera (parasitic lice), and the Psocoptera (booklice/barklice). We provide the first report of a multi-partite mitochondrial genome architecture...

  1. The mitochondrial genome of Pocillopora (Cnidaria: Scleractinia) contains two variable regions: the putative D-loop and a novel ORF of unknown function.

    PubMed

    Flot, Jean-François; Tillier, Simon

    2007-10-15

    The complete mitochondrial genomes of two individuals attributed to different morphospecies of the scleractinian coral genus Pocillopora have been sequenced. Both genomes, respectively 17,415 and 17,422 nt long, share the presence of a previously undescribed ORF encoding a putative protein made up of 302 amino acids and of unknown function. Surprisingly, this ORF turns out to be the second most variable region of the mitochondrial genome (1% nucleotide sequence difference between the two individuals) after the putative control region (1.5% sequence difference). Except for the presence of this ORF and for the location of the putative control region, the mitochondrial genome of Pocillopora is organized in a fashion similar to the other scleractinian coral genomes published to date. For the first time in a cnidarian, a putative second origin of replication is described based on its secondary structure similar to the stem-loop structure of O(L), the origin of L-strand replication in vertebrates.

  2. Sequence analysis of the E3 region and fiber gene of human adenovirus genome type 7h.

    PubMed

    Kajon, A E; Wadell, G

    1996-01-15

    Adenovirus type 7h is currently the predominant virulent genome type of serotype 7 isolated in Argentina, Chile, and Uruguay in association with severe infantile pneumonia. In order to characterize possible molecular determinants of pathogenicity, the nucleotide sequence of a 5904-bp fragment (76 to 93 mu) containing the entire E3 region and the fiber gene of Ad7h was established. The organization of the ORFs within the E3 region was similar to that reported for the prototype strains of Ad7 and Ad3. A comparison of the nucleotide and amino acid sequences of all ORFs revealed a higher homology between Ad7h and Ad7p than between Ad7h and Ad3 for 12.0K and 16.1K, whereas the 15.3K ORF and the adjacent fiber gene were strikingly more homologous to those of Ad3 (99.5 vs 81.1% and 98.2 vs 66.6%, respectively). The equivalent to ORF 7.7K in Ad7p was missing in Ad7h due to a deletion and a mutation affecting the start codon (ATG-->ATT). Although the hemagglutinin of the Ad7h fiber could not be characterized due to its lack of activity on monkey erythrocytes, our results indicate that Ad7h is an intermediate strain 7-3.

  3. Genomic analysis of a region encompassing QRfs1 and QRfs2: genes that underlie soybean resistance to sudden death syndrome.

    PubMed

    Triwitayakorn, K; Njiti, V N; Iqbal, M J; Yaegashi, S; Town, C; Lightfoot, D A

    2005-02-01

    Candidate genes were identified for two loci, QRfs2 providing resistance to the leaf scorch called soybean (Glycine max (L.) Merr.) sudden death syndrome (SDS) and QRfs1 providing resistance to root infection by the causal pathogen Fusarium solani f.sp. glycines. The 7.5 +/- 0.5 cM region of chromosome 18 (linkage group G) was shown to encompass a cluster of resistance loci using recombination events from 4 near-isogenic line populations and 9 DNA markers. The DNA markers anchored 9 physical map contigs (7 are shown on the soybean Gbrowse, 2 are unpublished), 45 BAC end sequences (41 in Gbrowse), and contiguous DNA sequences of 315, 127, and 110 kbp. Gene density was high at 1 gene per 7 kbp only around the already sequenced regions. Three to 4 gene-rich islands were inferred to be distributed across the entire 7.5 cM or 3.5 Mbp showing that genes are clustered in the soybean genome. Candidate resistance genes were identified and a molecular basis for interactions among the disease resistance genes in the cluster inferred. PMID:15729404

  4. A fine structure genomic map of the region of 12q13 containing SAS and CDK4

    SciTech Connect

    Linder, C.Y.; Elkahloun, A.G.; Su, Y.A.

    1994-09-01

    We have recently adapted a method, originally described by Rackwitz, to the rapid restriction mapping of multiple cosmid DNA samples. Linearization of the cosmids at the lambda cohesive site using lambda terminase is followed by partial digestion with selected restriction enzymes and hybridization to oligonucleotides specific for the right or left hand termini. Partial digestions are performed in a microtiter plate thus allowing up to 12 cosmid clones to be digested with one restriction enzyme. We have applied this rapid restriction mapping method to cosmids derived from a region of chromosome 12q13 that has recently been shown to be amplified in a variety of cancers including malignant fibrous histiocytoma, fibrosarcoma, liposarcoma, osteosarcoma and brain tumors. A small segment of this amplification unit containing three genes, SAS (a membrane protein), CDK4 (a cyclin dependent kinase) and OS-9 (a recently described cDNA) has been analyzed with the system described above. This fine structure genomic map will be useful for completing the expression map of this region as well as characterizing its pattern of amplification in tumor specimens.

  5. 30 CFR 250.1166 - What additional reporting is required for developments in the Alaska OCS Region?

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... MANAGEMENT, REGULATION, AND ENFORCEMENT, DEPARTMENT OF THE INTERIOR OFFSHORE OIL AND GAS AND SULPHUR OPERATIONS IN THE OUTER CONTINENTAL SHELF Oil and Gas Production Requirements Other Requirements § 250.1166... development in the Alaska OCS Region, you must submit an annual reservoir management report to the...

  6. Systematic analysis of enhancer and critical cis-acting RNA elements in the protein-encoding region of the hepatitis C virus genome.

    PubMed

    Chu, Derrick; Ren, Songyang; Hu, Stacy; Wang, Wei Gang; Subramanian, Aparna; Contreras, Deisy; Kanagavel, Vidhya; Chung, Eric; Ko, Justine; Amirtham Jacob Appadorai, Ranjit Singh; Sinha, Sanjeev; Jalali, Ziba; Hardy, David W; French, Samuel W; Arumugaswami, Vaithilingaraja

    2013-05-01

    Hepatitis C virus (HCV) causes chronic hepatitis, cirrhosis, and liver cancer. cis-acting RNA elements of the HCV genome are critical for translation initiation and replication of the viral genome. We hypothesized that the coding regions of nonstructural proteins harbor enhancer and essential cis-acting replication elements (CRE). In order to experimentally identify new cis RNA elements, we utilized an unbiased approach to introduce synonymous substitutions. The HCV genome coding for nonstructural proteins (nucleotide positions 3872 to 9097) was divided into 17 contiguous segments. The wobble nucleotide positions of each codon were replaced, resulting in 33% to 41% nucleotide changes. The HCV genome containing one of each of 17 mutant segments (S1 to S17) was tested for genome replication and infectivity. We observed that silent mutations in segment 13 (S13) (nucleotides [nt] 7457 to 7786), S14 (nt 7787 to 8113), S15 (nt 8114 to 8440), S16 (nt 8441 to 8767), and S17 (nt 8768 to 9097) resulted in impaired genome replication, suggesting CRE structures are enriched in the NS5B region. Subsequent high-resolution mutational analysis of NS5B (nt 7787 to 9289) using approximately 51-nucleotide contiguous subsegment mutant viruses having synonymous mutations revealed that subsegments SS8195-8245, SS8654-8704, and SS9011-9061 were required for efficient viral growth, suggesting that these regions act as enhancer elements. Covariant nucleotide substitution analysis of a stem-loop, JFH-SL9098, revealed the formation of an extended stem structure, which we designated JFH-SL9074. We have identified new enhancer RNA elements and an extended stem-loop in the NS5B coding region. Genetic modification of enhancer RNA elements can be utilized for designing attenuated HCV vaccine candidates.

  7. The compact Brachypodium genome conserves centromeric regions of a common ancestor with wheat and rice.

    PubMed

    Qi, Lili; Friebe, Bernd; Wu, Jiajie; Gu, Yongqiang; Qian, Chen; Gill, Bikram S

    2010-11-01

    The evolution of five chromosomes of Brachypodium distachyon from a 12-chromosome ancestor of all grasses by dysploidy raises an interesting question about the fate of redundant centromeres. Three independent but complementary approaches were pursued to study centromeric region homologies among the chromosomes of Brachypodium, wheat, and rice. The genes present in pericentromeres of the basic set of seven chromosomes of wheat and the Triticeae, and the 80 rice centromeric genes spanning the CENH3 binding domain of centromeres 3, 4, 5, 7, and 8 were used as "anchor" markers to identify centromere locations in the B. distachyon chromosomes. A total of 53 B. distachyon bacterial artificial chromosome (BAC) clones anchored by wheat pericentromeric expressed sequence tags (ESTs) were used as probes for BAC-fluorescence in situ hybridization (FISH) analysis of B. distachyon mitotic chromosomes. Integrated sequence alignment and BAC-FISH data were used to determine the approximate positions of active and inactive centromeres in the five B. distachyon chromosomes. The following syntenic relationships of the centromeres for Brachypodium (Bd), rice (R), and wheat (W) were evident: Bd1-R6, Bd2-R5-W1, Bd3-R10, Bd4-R11-W4, and Bd5-R4. Six rice centromeres syntenic to five wheat centromeres were inactive in Brachypodium chromosomes. The conservation of centromere gene synteny among several sets of homologous centromeres of three species indicates that active genes can persist in ancient centromeres with more than 40 million years of shared evolutionary history. Annotation of a BAC contig spanning an inactive centromere in chromosome Bd3 which is syntenic to rice Cen8 and W7 pericentromeres, along with BAC FISH data from inactive centromeres revealed that the centromere inactivation was accompanied by the loss of centromeric retrotransposons and turnover of centromere-specific satellites during Bd chromosome evolution.

  8. The 3' untranslated region of picornavirus RNA: features required for efficient genome replication.

    PubMed Central

    Rohll, J B; Moon, D H; Evans, D J; Almond, J W

    1995-01-01

    The role of the 3' untranslated region (3'UTR) in the replication of enteroviruses has been studied with a series of mutants derived from either poliovirus type 3 (PV3) or a PV3 replicon containing the reporter gene chloramphenicol acetyltransferase. Replication was observed when the PV3 3'UTR was replaced with that of either coxsackie B4 virus, human rhinovirus 14 (HRV14), bovine enterovirus, or hepatitis A virus, despite the lack of sequence and secondary structure homology of the 3'UTRs of these viruses. The levels of replication observed for recombinants containing the 3'UTRs of hepatitis A virus and bovine enterovirus were lower than those for PV3 and the other recombinants. Extensive site-directed mutagenesis of the single stem-loop structure formed by the HRV14 3'UTR indicated the importance of (i) the loop sequence, (ii) the stability of the stem, and (iii) the location of the stem immediately upstream of the poly(A) tail. The role of a 4-bp motif at the base of the HRV14 stem, highly conserved among rhinoviruses, was examined by site-directed mutagenesis of individual base pairs. This analysis did not pinpoint a particular base pair as crucial for function. The requirement for immediate adjacent positioning of the open reading frame and the 3'UTR was examined by insertion of a 1.1-kb heterologous sequence. A replicon containing this insert replicated to about 30% of the level observed for the wild type. However, the corresponding virus consistently deleted most of the inserted fragment, suggesting that its presence was incompatible with a full replication cycle. PMID:7494295

  9. QTL mapping in three tropical maize populations reveals a set of constitutive and adaptive genomic regions for drought tolerance.

    PubMed

    Almeida, Gustavo Dias; Makumbi, Dan; Magorokosho, Cosmos; Nair, Sudha; Borém, Aluízio; Ribaut, Jean-Marcel; Bänziger, Marianne; Prasanna, Boddupalli M; Crossa, Jose; Babu, Raman

    2013-03-01

    Despite numerous published reports of quantitative trait loci (QTL) for drought-related traits, practical applications of such QTL in maize improvement are scarce. Identifying QTL of sizeable effects that express more or less uniformly in diverse genetic backgrounds across contrasting water regimes could significantly complement conventional breeding efforts to improve drought tolerance. We evaluated three tropical bi-parental populations under water-stress (WS) and well-watered (WW) regimes in Mexico, Kenya and Zimbabwe to identify genomic regions responsible for grain yield (GY) and anthesis-silking interval (ASI) across multiple environments and diverse genetic backgrounds. Across the three populations, on average, drought stress reduced GY by more than 50 % and increased ASI by 3.2 days. We identified a total of 83 and 62 QTL through individual environment analyses for GY and ASI, respectively. In each population, most QTL consistently showed up in each water regime. Across the three populations, the phenotypic variance explained by various individual QTL ranged from 2.6 to 17.8 % for GY and 1.7 to 17.8 % for ASI under WS environments and from 5 to 19.5 % for GY under WW environments. Meta-QTL (mQTL) analysis across the three populations and multiple environments identified seven genomic regions for GY and one for ASI, of which six mQTL on chr.1, 4, 5 and 10 for GY were constitutively expressed across WS and WW environments. One mQTL on chr.7 for GY and one on chr.3 for ASI were found to be 'adaptive' to WS conditions. High throughput assays were developed for SNPs that delimit the physical intervals of these mQTL. At most of the QTL, almost equal number of favorable alleles was donated by either of the parents within each cross, thereby demonstrating the potential of drought tolerant × drought tolerant crosses to identify QTL under contrasting water regimes.

  10. The bonobo genome compared with the chimpanzee and human genomes.

    PubMed

    Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R; Mullikin, James C; Meader, Stephen J; Ponting, Chris P; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M; Fischer, Anne; Ptak, Susan E; Lachmann, Michael; Symer, David E; Mailund, Thomas; Schierup, Mikkel H; Andrés, Aida M; Kelso, Janet; Pääbo, Svante

    2012-06-28

    Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other.

  11. Extensive duplication events account for multiple control regions and pseudo-genes in the mitochondrial genome of the velvet worm Metaperipatus inae (Onychophora, Peripatopsidae).

    PubMed

    Braband, Anke; Podsiadlowski, Lars; Cameron, Stephen L; Daniels, Savel; Mayer, Georg

    2010-10-01

    The phylogeny of Onychophora (velvet worms) is unresolved and even the monophyly of the two major onychophoran subgroups, Peripatidae and Peripatopsidae, is uncertain. Previous studies of complete mitochondrial genomes from two onychophoran species revealed two strikingly different gene arrangement patterns from highly conserved in a representative of Peripatopsidae to highly derived in a species of Peripatidae, suggesting that these data might be informative for clarifying the onychophoran phylogeny. In order to assess the diversity of mitochondrial genomes among onychophorans, we analyzed the complete mitochondrial genome of Metaperipatus inae, a second representative of Peripatopsidae from Chile. Compared to the proposed ancestral gene order in Onychophora, the mitochondrial genome of M. inae shows dramatic rearrangements, although all protein-coding and ribosomal RNA genes are encoded on the same strands as in the ancestral peripatopsid genome. The retained strand affiliation of all protein-coding and ribosomal RNA genes and the occurrence of three control regions and several pseudo-genes suggest that the derived mitochondrial gene arrangement pattern in M. inae evolved by partial genome duplications, followed by a subsequent loss of redundant genes. Our findings, thus, confirm the diversity of the mitochondrial gene arrangement patterns among onychophorans and support their utility for clarifying the phylogeography of Onychophora, in particular of the Peripatopsidae species from South Africa and Chile. PMID:20510379

  12. Extensive duplication events account for multiple control regions and pseudo-genes in the mitochondrial genome of the velvet worm Metaperipatus inae (Onychophora, Peripatopsidae).

    PubMed

    Braband, Anke; Podsiadlowski, Lars; Cameron, Stephen L; Daniels, Savel; Mayer, Georg

    2010-10-01

    The phylogeny of Onychophora (velvet worms) is unresolved and even the monophyly of the two major onychophoran subgroups, Peripatidae and Peripatopsidae, is uncertain. Previous studies of complete mitochondrial genomes from two onychophoran species revealed two strikingly different gene arrangement patterns from highly conserved in a representative of Peripatopsidae to highly derived in a species of Peripatidae, suggesting that these data might be informative for clarifying the onychophoran phylogeny. In order to assess the diversity of mitochondrial genomes among onychophorans, we analyzed the complete mitochondrial genome of Metaperipatus inae, a second representative of Peripatopsidae from Chile. Compared to the proposed ancestral gene order in Onychophora, the mitochondrial genome of M. inae shows dramatic rearrangements, although all protein-coding and ribosomal RNA genes are encoded on the same strands as in the ancestral peripatopsid genome. The retained strand affiliation of all protein-coding and ribosomal RNA genes and the occurrence of three control regions and several pseudo-genes suggest that the derived mitochondrial gene arrangement pattern in M. inae evolved by partial genome duplications, followed by a subsequent loss of redundant genes. Our findings, thus, confirm the diversity of the mitochondrial gene arrangement patterns among onychophorans and support their utility for clarifying the phylogeography of Onychophora, in particular of the Peripatopsidae species from South Africa and Chile.

  13. VCGDB: a dynamic genome database of the Chinese population

    PubMed Central

    2014-01-01

    Background The data released by the 1000 Genomes Project contain an increasing number of genome sequences from different nations and populations with a large number of genetic variations. As a result, the focus of human genome studies is changing from single and static to complex and dynamic. The currently available human reference genome (GRCh37) is based on sequencing data from 13 anonymous Caucasian volunteers, which might limit the scope of genomics, transcriptomics, epigenetics, and genome wide association studies. Description We used the massive amount of sequencing data published by the 1000 Genomes Project Consortium to construct the Virtual Chinese Genome Database (VCGDB), a dynamic genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. VCGDB provides dynamic genomic information, which contains 35 million single nucleotide variations (SNVs), 0.5 million insertions/deletions (indels), and 29 million rare variations, together with genomic annotation information. VCGDB also provides a highly interactive user-friendly virtual Chinese genome browser (VCGBrowser) with functions like seamless zooming and real-time searching. In addition, we have established three population-specific consensus Chinese reference genomes that are compatible with mainstream alignment software. Conclusions VCGDB offers a feasible strategy for processing big data to keep pace with the biological data explosion by providing a robust resource for genomics studies; in particular, studies aimed at finding regions of the genome associated with diseases. PMID:24708222

  14. Functional annotations in bacterial genomes based on small RNA signatures.

    PubMed

    Sridhar, Jayavel; Rafi, Ziauddin Ahamed

    2008-04-04

    One of the key challenges in computational genomics is annotating coding genes and identification of regulatory RNAs in complete genomes. An attempt is made in this study which uses the regulatory RNA locations and their conserved flanking genes identified within the genomic backbone of template genome to search for similar RNA locations in query genomes. The search is based on recently reported coexistence of small RNAs and their conserved flanking genes in related genomes. Based on our study, 54 additional sRNA locations and functions of 96 uncharacterized genes are predicted in two draft genomes viz., Serratia marcesens Db1 and Yersinia enterocolitica 8081. Although most of the identified additional small RNA regions and their corresponding flanking genes are homologous in nature, the proposed anchoring technique could successfully identify four non-homologous small RNA regions in Y. enterocolitica genome also. The KEGG Orthology (KO) based automated functional predictions confirms the predicted functions of 65 flanking genes having defined KO numbers, out of the total 96 predictions made by this method. This coexistence based method shows more sensitivity than controlled vocabularies in locating orthologous gene pairs even in the absence of defined Orthology numbers. All functional predictions made by this study in Y. enterocolitica 8081 were confirmed by the recently published complete genome sequence and annotations. This study also reports the possible regions of gene rearrangements in these two genomes and further characterization of such RNA regions could shed more light on their possible role in genome evolution.

  15. Functional annotations in bacterial genomes based on small RNA signatures

    PubMed Central

    Sridhar, Jayavel; Rafi, Ziauddin Ahamed

    2008-01-01

    One of the key challenges in computational genomics is annotating coding genes and identification of regulatory RNAs in complete genomes. An attempt is made in this study which uses the regulatory RNA locations and their conserved flanking genes identified within the genomic backbone of template genome to search for similar RNA locations in query genomes. The search is based on recently reported coexistence of small RNAs and their conserved flanking genes in related genomes. Based on our study, 54 additional sRNA locations and functions of 96 uncharacterized genes are predicted in two draft genomes viz., Serratia marcesens Db1 and Yersinia enterocolitica 8081. Although most of the identified additional small RNA regions and their corresponding flanking genes are homologous in nature, the proposed anchoring technique could successfully identify four non-homologous small RNA regions in Y. enterocolitica genome also. The KEGG Orthology (KO) based automated functional predictions confirms the predicted functions of 65 flanking genes having defined KO numbers, out of the total 96 predictions made by this method. This coexistence based method shows more sensitivity than controlled vocabularies in locating orthologous gene pairs even in the absence of defined Orthology numbers. All functional predictions made by this study in Y. enterocolitica 8081 were confirmed by the recently published complete genome sequence and annotations. This study also reports the possible regions of gene rearrangements in these two genomes and further characterization of such RNA regions could shed more light on their possible role in genome evolution. PMID:18478081

  16. High-throughput engineering of a mammalian genome reveals building principles of methylation states at CG rich regions

    PubMed Central

    Krebs, Arnaud R; Dessus-Babus, Sophie; Burger, Lukas; Schübeler, Dirk

    2014-01-01

    The majority of mammalian promoters are CpG islands; regions of high CG density that require protection from DNA methylation to be functional. Importantly, how sequence architecture mediates this unmethylated state remains unclear. To address this question in a comprehensive manner, we developed a method to interrogate methylation states of hundreds of sequence variants inserted at the same genomic site in mouse embryonic stem cells. Using this assay, we were able to quantify the contribution of various sequence motifs towards the resulting DNA methylation state. Modeling of this comprehensive dataset revealed that CG density alone is a minor determinant of their unmethylated state. Instead, these data argue for a principal role for transcription factor binding sites, a prediction confirmed by testing synthetic mutant libraries. Taken together, these findings establish the hierarchy between the two cis-encoded mechanisms that define the DNA methylation state and thus the transcriptional compet