Bain, Peter A; Papanicolaou, Alexie; Kumar, Anupama
2015-01-01
Murray-Darling rainbowfish (Melanotaenia fluviatilis [Castelnau, 1878]; Atheriniformes: Melanotaeniidae) is a small-bodied teleost currently under development in Australasia as a test species for aquatic toxicological studies. To date, efforts towards the development of molecular biomarkers of contaminant exposure have been hindered by the lack of available sequence data. To address this, we sequenced messenger RNA from brain, liver and gonads of mature male and female fish and generated a high-quality draft transcriptome using a de novo assembly approach. 149,742 clusters of putative transcripts were obtained, encompassing 43,841 non-redundant protein-coding regions. Deduced amino acid sequences were annotated by functional inference based on similarity with sequences from manually curated protein sequence databases. The draft assembly contained protein-coding regions homologous to 95.7% of the complete cohort of predicted proteins from the taxonomically related species, Oryzias latipes (Japanese medaka). The mean length of rainbowfish protein-coding sequences relative to their medaka homologues was 92.1%, indicating that despite the limited number of tissues sampled a large proportion of the total expected number of protein-coding genes was captured in the study. Because of our interest in the effects of environmental contaminants on endocrine pathways, we manually curated subsets of coding regions for putative nuclear receptors and steroidogenic enzymes in the rainbowfish transcriptome, revealing 61 candidate nuclear receptors encompassing all known subfamilies, and 41 putative steroidogenic enzymes representing all major steroidogenic enzymes occurring in teleosts. The transcriptome presented here will be a valuable resource for researchers interested in biomarker development, protein structure and function, and contaminant-response genomics in Murray-Darling rainbowfish.
Alpert, Carl-Alfred; Crutz-Le Coq, Anne-Marie; Malleret, Christine; Zagorec, Monique
2003-01-01
The complete nucleotide sequence of the 13-kb plasmid pRV500, isolated from Lactobacillus sakei RV332, was determined. Sequence analysis enabled the identification of genes coding for a putative type I restriction-modification system, two genes coding for putative recombinases of the integrase family, and a region likely involved in replication. The structural features of this region, comprising a putative ori segment containing 11- and 22-bp repeats and a repA gene coding for a putative initiator protein, indicated that pRV500 belongs to the pUCL287 subfamily of theta-type replicons. A 3.7-kb fragment encompassing this region was fused to an Escherichia coli replicon to produce the shuttle vector pRV566 and was observed to be functional in L. sakei for plasmid replication. The L. sakei replicon alone could not support replication in E. coli. Plasmid pRV500 and its derivative pRV566 were determined to be at very low copy numbers in L. sakei. pRV566 was maintained at a reasonable rate over 20 generations in several lactobacilli, such as Lactobacillus curvatus, Lactobacillus casei, and Lactobacillus plantarum, in addition to L. sakei, making it an interesting basis for developing vectors. Sequence relationships with other plasmids are described and discussed. PMID:12957947
Correlation approach to identify coding regions in DNA sequences
NASA Technical Reports Server (NTRS)
Ossadnik, S. M.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.
1994-01-01
Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.
He, Hongjuan; Xiu, Youcheng; Guo, Jing; Liu, Hui; Liu, Qi; Zeng, Tiebo; Chen, Yan; Zhang, Yan; Wu, Qiong
2013-01-01
Long non-coding RNAs (lncRNAs) as a key group of non-coding RNAs have gained widely attention. Though lncRNAs have been functionally annotated and systematic explored in higher mammals, few are under systematical identification and annotation. Owing to the expression specificity, known lncRNAs expressed in embryonic brain tissues remain still limited. Considering a large number of lncRNAs are only transcribed in brain tissues, studies of lncRNAs in developmental brain are therefore of special interest. Here, publicly available RNA-sequencing (RNA-seq) data in embryonic brain are integrated to identify thousands of embryonic brain lncRNAs by a customized pipeline. A significant proportion of novel transcripts have not been annotated by available genomic resources. The putative embryonic brain lncRNAs are shorter in length, less spliced and show less conservation than known genes. The expression of putative lncRNAs is in one tenth on average of known coding genes, while comparable with known lncRNAs. From chromatin data, putative embryonic brain lncRNAs are associated with active chromatin marks, comparable with known lncRNAs. Embryonic brain expressed lncRNAs are also indicated to have expression though not evident in adult brain. Gene Ontology analysis of putative embryonic brain lncRNAs suggests that they are associated with brain development. The putative lncRNAs are shown to be related to possible cis-regulatory roles in imprinting even themselves are deemed to be imprinted lncRNAs. Re-analysis of one knockdown data suggests that four regulators are associated with lncRNAs. Taken together, the identification and systematic analysis of putative lncRNAs would provide novel insights into uncharacterized mouse non-coding regions and the relationships with mammalian embryonic brain development. PMID:23967161
Hyndman, Timothy H; Marschang, Rachel E; Wellehan, James F X; Nicholls, Philip K
2012-10-01
This paper describes the isolation and molecular identification of a novel paramyxovirus found during an investigation of an outbreak of neurorespiratory disease in a collection of Australian pythons. Using Illumina® high-throughput sequencing, a 17,187 nucleotide sequence was assembled from RNA extracts from infected viper heart cells (VH2) displaying widespread cytopathic effects in the form of multinucleate giant cells. The sequence appears to contain all the coding regions of the genome, including the following predicted paramyxoviral open reading frames (ORFs): 3'--Nucleocapsid (N)--putative Phosphoprotein (P)--Matrix (M)--Fusion (F)--putative attachment protein--Polymerase (L)--5'. There is also a 540 nucleotide ORF between the N and putative P genes that may be an additional coding region. Phylogenetic analyses of the complete N, M, F and L genes support the clustering of this virus within the family Paramyxoviridae but outside both of the current subfamilies: Paramyxovirinae and Pneumovirinae. We propose to name this new virus, Sunshine virus, after the geographic origin of the first isolate--the Sunshine Coast of Queensland, Australia. Copyright © 2012 Elsevier B.V. All rights reserved.
Tenebrio molitor antifreeze protein gene identification and regulation.
Qin, Wensheng; Walker, Virginia K
2006-02-15
The yellow mealworm, Tenebrio molitor, is a freeze susceptible, stored product pest. Its winter survival is facilitated by the accumulation of antifreeze proteins (AFPs), encoded by a small gene family. We have now isolated 11 different AFP genomic clones from 3 genomic libraries. All the clones had a single coding sequence, with no evidence of intervening sequences. Three genomic clones were further characterized. All have putative TATA box sequences upstream of the coding regions and multiple potential poly(A) signal sequences downstream of the coding regions. A TmAFP regulatory region, B1037, conferred transcriptional activity when ligated to a luciferase reporter sequence and after transfection into an insect cell line. A 143 bp core promoter including a TATA box sequence was identified. Its promoter activity was increased 4.4 times by inserting an exotic 245 bp intron into the construct, similar to the enhancement of transgenic expression seen in several other systems. The addition of a duplication of the first 120 bp sequence from the 143 bp core promoter decreased promoter activity by half. Although putative hormonal response sequences were identified, none of the five hormones tested enhanced reporter activity. These studies on the mechanisms of AFP transcriptional control are important for the consideration of any transfer of freeze-resistance phenotypes to beneficial hosts.
Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript
Rose, Dominic; Stadler, Peter F.
2011-01-01
Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Mu-Like Prophage in Serogroup B Neisseria meningitidis Coding for Surface-Exposed Antigens
Masignani, Vega; Giuliani, Marzia Monica; Tettelin, Hervé; Comanducci, Maurizio; Rappuoli, Rino; Scarlato, Vincenzo
2001-01-01
Sequence analysis of the genome of Neisseria meningititdis serogroup B revealed the presence of an ∼35-kb region inserted within a putative gene coding for an ABC-type transporter. The region contains 46 open reading frames, 29 of which are colinear and homologous to the genes of Escherichia coli Mu phage. Two prophages with similar organizations were also found in serogroup A meningococcus, and one was found in Haemophilus influenzae. Early and late phage functions are well preserved in this family of Mu-like prophages. Several regions of atypical nucleotide content were identified. These likely represent genes acquired by horizontal transfer. Three of the acquired genes are shown to code for surface-associated antigens, and the encoded proteins are able to induce bactericidal antibodies. PMID:11254622
Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.
Borodovsky, M; Rudd, K E; Koonin, E V
1994-01-01
The unannotated regions of the Escherichia coli genome DNA sequence from the EcoSeq6 database, totaling 1,278 'intergenic' sequences of the combined length of 359,279 basepairs, were analyzed using computer-assisted methods with the aim of identifying putative unknown genes. The proposed strategy for finding new genes includes two key elements: i) prediction of expressed open reading frames (ORFs) using the GeneMark method based on Markov chain models for coding and non-coding regions of Escherichia coli DNA, and ii) search for protein sequence similarities using programs based on the BLAST algorithm and programs for motif identification. A total of 354 putative expressed ORFs were predicted by GeneMark. Using the BLASTX and TBLASTN programs, it was shown that 208 ORFs located in the unannotated regions of the E. coli chromosome are significantly similar to other protein sequences. Identification of 182 ORFs as probable genes was supported by GeneMark and BLAST, comprising 51.4% of the GeneMark 'hits' and 87.5% of the BLAST 'hits'. 73 putative new genes, comprising 20.6% of the GeneMark predictions, belong to ancient conserved protein families that include both eubacterial and eukaryotic members. This value is close to the overall proportion of highly conserved sequences among eubacterial proteins, indicating that the majority of the putative expressed ORFs that are predicted by GeneMark, but have no significant BLAST hits, nevertheless are likely to be real genes. The majority of the putative genes identified by BLAST search have been described since the release of the EcoSeq6 database, but about 70 genes have not been detected so far. Among these new identifications are genes encoding proteins with a variety of predicted functions including dehydrogenases, kinases, several other metabolic enzymes, ATPases, rRNA methyltransferases, membrane proteins, and different types of regulatory proteins. Images PMID:7984428
Arita, Minetaro; Zhu, Shuang-Li; Yoshida, Hiromu; Yoneyama, Tetsuo; Miyamura, Tatsuo; Shimizu, Hiroyuki
2005-01-01
Outbreaks of poliomyelitis caused by circulating vaccine-derived polioviruses (cVDPVs) have been reported in areas where indigenous wild polioviruses (PVs) were eliminated by vaccination. Most of these cVDPVs contained unidentified sequences in the nonstructural protein coding region which were considered to be derived from human enterovirus species C (HEV-C) by recombination. In this study, we report isolation of a Sabin 3-derived PV recombinant (Cambodia-02) from an acute flaccid paralysis (AFP) case in Cambodia in 2002. We attempted to identify the putative recombination counterpart of Cambodia-02 by sequence analysis of nonpolio enterovirus isolates from AFP cases in Cambodia from 1999 to 2003. Based on the previously estimated evolution rates of PVs, the recombination event resulting in Cambodia-02 was estimated to have occurred within 6 months after the administration of oral PV vaccine (99.3% nucleotide identity in VP1 region). The 2BC and the 3Dpol coding regions of Cambodia-02 were grouped into the genetic cluster of indigenous coxsackie A virus type 17 (CAV17) (the highest [87.1%] nucleotide identity) and the cluster of indigenous CAV13-CAV18 (the highest [94.9%] nucleotide identity) by the phylogenic analysis of the HEV-C isolates in 2002, respectively. CAV13-CAV18 and CAV17 were the dominant HEV-C serotypes in 2002 but not in 2001 and in 2003. We found a putative recombination between CAV13-CAV18 and CAV17 in the 3CDpro coding region of a CAV17 isolate. These results suggested that a part of the 3Dpol coding region of PV3(Cambodia-02) was derived from a HEV-C strain genetically related to indigenous CAV13-CAV18 strains in 2002 in Cambodia. PMID:16188967
Lyssavirus in Japanese Pipistrelle, Taiwan.
Hu, Shu-Chia; Hsu, Chao-Lung; Lee, Ming-Shiuh; Tu, Yang-Chang; Chang, Jen-Chieh; Wu, Chieh-Hao; Lee, Shu-Hwae; Ting, Lu-Jen; Tsai, Kwok-Rong; Cheng, Ming-Chu; Tu, Wen-Jane; Hsu, Wei-Cheng
2018-04-01
A putative new lyssavirus was found in 2 Japanese pipistrelles (Pipistrellus abramus) in Taiwan in 2016 and 2017. The concatenated coding regions of the virus showed 62.9%-75.1% nucleotide identities to the other 16 species of lyssavirus, suggesting that it may be representative of a new species of this virus.
Network perturbation by recurrent regulatory variants in cancer
Cho, Ara; Lee, Insuk; Choi, Jung Kyoon
2017-01-01
Cancer driving genes have been identified as recurrently affected by variants that alter protein-coding sequences. However, a majority of cancer variants arise in noncoding regions, and some of them are thought to play a critical role through transcriptional perturbation. Here we identified putative transcriptional driver genes based on combinatorial variant recurrence in cis-regulatory regions. The identified genes showed high connectivity in the cancer type-specific transcription regulatory network, with high outdegree and many downstream genes, highlighting their causative role during tumorigenesis. In the protein interactome, the identified transcriptional drivers were not as highly connected as coding driver genes but appeared to form a network module centered on the coding drivers. The coding and regulatory variants associated via these interactions between the coding and transcriptional drivers showed exclusive and complementary occurrence patterns across tumor samples. Transcriptional cancer drivers may act through an extensive perturbation of the regulatory network and by altering protein network modules through interactions with coding driver genes. PMID:28333928
Lyssavirus in Japanese Pipistrelle, Taiwan
Hu, Shu-Chia; Hsu, Chao-Lung; Lee, Ming-Shiuh; Tu, Yang-Chang; Chang, Jen-Chieh; Wu, Chieh-Hao; Lee, Shu-Hwae; Ting, Lu-Jen; Tsai, Kwok-Rong; Cheng, Ming-Chu; Tu, Wen-Jane
2018-01-01
A putative new lyssavirus was found in 2 Japanese pipistrelles (Pipistrellus abramus) in Taiwan in 2016 and 2017. The concatenated coding regions of the virus showed 62.9%–75.1% nucleotide identities to the other 16 species of lyssavirus, suggesting that it may be representative of a new species of this virus. PMID:29553328
Neuhaus, H; Link, G
1987-01-01
The trnK gene endocing the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stemloop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.
Barbosa, M S; Wettstein, F O
1987-01-01
Cottontail rabbit papillomavirus (CRPV) early proteins are present at very low levels in virus-induced tumors and cannot be detected by immunological methods. Furthermore, cells in culture are not readily transformed by the virus. To overcome these difficulties in identifying and characterizing the putative transforming protein(s) coded by the E6 open reading frame, the early cottontail rabbit papillomavirus region was expressed under the control of the late simian virus 40 promoter. Mapping of the transcripts in transiently transfected COS-7 cells indicated that transcription was initiated in the late region of simian virus 40. Two E6-coded polypeptides were identified, representing translation products initiated at the first and second AUG codons. Images PMID:3039182
Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro
2008-01-03
The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Bhattacharya, D; Steinkötter, J; Melkonian, M
1993-12-01
Centrin (= caltractin) is a ubiquitous, cytoskeletal protein which is a member of the EF-hand superfamily of calcium-binding proteins. A centrin-coding cDNA was isolated and characterized from the prasinophyte green alga Scherffelia dubia. Centrin PCR amplification primers were used to isolate partial, homologous cDNA sequences from the green algae Tetraselmis striata and Spermatozopsis similis. Annealing analyses suggested that centrin is a single-copy-coding region in T. striata and S. similis and other green algae studied. Centrin-coding regions from S. dubia, S. similis and T. striata encode four colinear EF-hand domains which putatively bind calcium. Phylogenetic analyses, including homologous sequences from Chlamydomonas reinhardtii and the land plant Atriplex nummularia, demonstrate that the domains of centrins are congruent and arose from the two-fold duplication of an ancestral EF hand with Domains 1+3 and Domains 2+4 clustering. The domains of centrins are also congruent with those of calmodulins demonstrating that, like calmodulin, centrin is an ancient protein which arose within the ancestor of all eukaryotes via gene duplication. Phylogenetic relationships inferred from centrin-coding region comparisons mirror results of small subunit ribosomal RNA sequence analyses suggesting that centrin-coding regions are useful evolutionary markers within the green algae.
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.
Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel
2013-09-01
RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.
Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K
1991-09-15
We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.
Austin, Christopher M; Tan, Mun Hua; Lee, Yin Peng; Croft, Laurence J; Meekan, Mark G; Pierce, Simon J; Gan, Han Ming
2016-01-01
The complete mitochondrial genome of the parasitic copepod Pandarus rhincodonicus was obtained from a partial genome scan using the HiSeq sequencing system. The Pandarus rhincodonicus mitogenome has 14,480 base pairs (62% A+T content) made up of 12 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs, and a putative 384 bp non-coding AT-rich region. This Pandarus mitogenome sequence is the first for the family Pandaridae, the second for the order Siphonostomatoida and the sixth for the Copepoda.
Wise, C A; Chiang, L C; Paznekas, W A; Sharma, M; Musy, M M; Ashley, J A; Lovett, M; Jabs, E W
1997-04-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development.
Zhuo, L; Reed, K M; Phillips, R B
1995-06-01
Variation in the intergenic spacer (IGS) of the ribosomal DNA (rDNA) of lake trout (Salvelinus namaycush) was examined. Digestion of genomic DNA with restriction enzymes showed that almost every individual had a unique combination of length variants with most of this variation occurring within rather than between populations. Sequence analysis of a 2.3 kilobase (kb) EcoRI-DraI fragment spanning the 3' end of the 28S coding region and approximately 1.8 kb of the IGS revealed two blocks of repetitive DNA. Putative transcriptional termination sites were found approximately 220 bases (b) downstream from the end of the 28S coding region. Comparison of the 2.3-kb fragments with two longer (3.1 kb) fragments showed that the major difference in length resulted from variation in the number of short (89 b) repeats located 3' to the putative terminator. Repeat units within a single nucleolus organizer region (NOR) appeared relatively homogeneous and genetic analysis found variants to be stably inherited. A comparison of the number of spacer-length variants with the number of NORs found that the number of length variants per individual was always less than the number of NORs. Examination of spacer variants in five populations showed that populations with more NORs had more spacer variants, indicating that variants are present at different rDNA sites on nonhomologous chromosomes.
A Putative Multiple-Demand System in the Macaque Brain.
Mitchell, Daniel J; Bell, Andrew H; Buckley, Mark J; Mitchell, Anna S; Sallet, Jerome; Duncan, John
2016-08-17
In humans, cognitively demanding tasks of many types recruit common frontoparietal brain areas. Pervasive activation of this "multiple-demand" (MD) network suggests a core function in supporting goal-oriented behavior. A similar network might therefore be predicted in nonhuman primates that readily perform similar tasks after training. However, an MD network in nonhuman primates has not been described. Single-cell recordings from macaque frontal and parietal cortex show some similar properties to human MD fMRI responses (e.g., adaptive coding of task-relevant information). Invasive recordings, however, come from limited prespecified locations, so they do not delineate a macaque homolog of the MD system and their positioning could benefit from knowledge of where MD foci lie. Challenges of scanning behaving animals mean that few macaque fMRI studies specifically contrast levels of cognitive demand, so we sought to identify a macaque counterpart to the human MD system using fMRI connectivity in 35 rhesus macaques. Putative macaque MD regions, mapped from frontoparietal MD regions defined in humans, were found to be functionally connected under anesthesia. To further refine these regions, an iterative process was used to maximize their connectivity cross-validated across animals. Finally, whole-brain connectivity analyses identified voxels that were robustly connected to MD regions, revealing seven clusters across frontoparietal and insular cortex comparable to human MD regions and one unexpected cluster in the lateral fissure. The proposed macaque MD regions can be used to guide future electrophysiological investigation of MD neural coding and in task-based fMRI to test predictions of similar functional properties to human MD cortex. In humans, a frontoparietal "multiple-demand" (MD) brain network is recruited during a wide range of cognitively demanding tasks. Because this suggests a fundamental function, one might expect a similar network to exist in nonhuman primates, but this remains controversial. Here, we sought to identify a macaque counterpart to the human MD system using fMRI connectivity. Putative macaque MD regions were functionally connected under anesthesia and were further refined by iterative optimization. The result is a network including lateral frontal, dorsomedial frontal, and insular and inferior parietal regions closely similar to the human counterpart. The proposed macaque MD regions can be useful in guiding electrophysiological recordings or in task-based fMRI to test predictions of similar functional properties to human MD cortex. Copyright © 2016 Mitchell et al.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Umans, L.; Serneels, L.; Hilliker, C.
1994-08-01
The authors have cloned the mouse gene coding for {alpha}{sub 2}-macroglobulin in overlapping {lambda} clones and have analyzed its structure. The gene contains 36 exons, coding for the 4.8-kb cDNA that we cloned previously. Including putative control elements in the 5{prime} flanking region, the gene covers about 45 kb. A region of 3.8 kb, stretching from 835 bases upstream of the cDNA start site to exon 4, including all intervening sequences, was sequenced completely. The analysis demonstrated that the putative promoter region of the mouse A2M gene differed considerably from the known promoter sequences of the human A2M gene andmore » of the rat acute-phas A2M gene. Comparison of the exon-intron structure of all known genes of the A2M family confirmed that the rat acute phase A2M gene is more closely related to the human gene than to the mouse A2M gene. To generate mice with the A2M gene inactivated, an insertion type of construct containing 7.5 kb of genomic DNA of the mouse strain 129/J, encompassing exons 16 to 19, was synthesized. A hygromycin marker gene was embedded in intron 17. After electroporation, 198 hygromycin-resistant ES cell lines were isolated and analyzed by Southern blotting. Five ES cell lines were obtained with one allele of the mouse A2M gene targeted by this insertion construct, demonstrating that the position and the characteristics of the vector served the intended goal.« less
Decoding sORF translation - from small proteins to gene regulation.
Cabrera-Quio, Luis Enrique; Herberg, Sarah; Pauli, Andrea
2016-11-01
Translation is best known as the fundamental mechanism by which the ribosome converts a sequence of nucleotides into a string of amino acids. Extensive research over many years has elucidated the key principles of translation, and the majority of translated regions were thought to be known. The recent discovery of wide-spread translation outside of annotated protein-coding open reading frames (ORFs) came therefore as a surprise, raising the intriguing possibility that these newly discovered translated regions might have unrecognized protein-coding or gene-regulatory functions. Here, we highlight recent findings that provide evidence that some of these newly discovered translated short ORFs (sORFs) encode functional, previously missed small proteins, while others have regulatory roles. Based on known examples we will also speculate about putative additional roles and the potentially much wider impact that these translated regions might have on cellular homeostasis and gene regulation.
Complete mitochondrial DNA sequence of the Eastern keelback mullet Liza affinis.
Gong, Xiaoling; Zhu, Wenjia; Bao, Baolong
2016-05-01
Eastern keelback mullet (Liza affinis) inhabits inlet waters and estuaries of rivers. In this paper, we initially determined the complete mitochondrial genome of Liza affinis. The entire mtDNA sequence is 16,831 bp in length, including 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes and 1 putative control region. Its order and numbers of genes are similar to most bony fishes.
Liu, Lijun; Ramsay, Trevor; Zinkgraf, Matthew; Sundell, David; Street, Nathaniel Robert; Filkov, Vladimir; Groover, Andrew
2015-06-01
Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors expressed during secondary growth and wood formation. Software code (programs and scripts) for processing the Populus ChIP-seq data are provided within a publically available iPlant image, including tools for ChIP-seq data quality control and evaluation adapted from the human Encyclopedia of DNA Elements (ENCODE) project. Basic information for each transcription factor (including members of Class I KNOX, Class III HD ZIP, BEL1-like families) binding are summarized, including the number and location of binding regions, distribution of binding regions relative to gene features, associated putative target genes, and enriched functional categories of putative target genes. These ChIP-seq data have been integrated within the Populus Genome Integrative Explorer (PopGenIE) where they can be analyzed using a variety of web-based tools. We present an example analysis that shows preferential binding of transcription factor ARBORKNOX1 to the nearest neighbor genes in a pre-calculated co-expression network module, and enrichment for meristem-related genes within this module including multiple orthologs of Arabidopsis KNOTTED-like Arabidopsis 2/6. © 2015 Society for Experimental Biology and John Wiley & Sons Ltd This article has been contributed to by US Government employees and their work is in the public domain in the USA.
Transcriptional landscapes of Axolotl (Ambystoma mexicanum).
Caballero-Pérez, Juan; Espinal-Centeno, Annie; Falcon, Francisco; García-Ortega, Luis F; Curiel-Quesada, Everardo; Cruz-Hernández, Andrés; Bako, Laszlo; Chen, Xuemei; Martínez, Octavio; Alberto Arteaga-Vázquez, Mario; Herrera-Estrella, Luis; Cruz-Ramírez, Alfredo
2018-01-15
The axolotl (Ambystoma mexicanum) is the vertebrate model system with the highest regeneration capacity. Experimental tools established over the past 100 years have been fundamental to start unraveling the cellular and molecular basis of tissue and limb regeneration. In the absence of a reference genome for the Axolotl, transcriptomic analysis become fundamental to understand the genetic basis of regeneration. Here we present one of the most diverse transcriptomic data sets for Axolotl by profiling coding and non-coding RNAs from diverse tissues. We reconstructed a population of 115,906 putative protein coding mRNAs as full ORFs (including isoforms). We also identified 352 conserved miRNAs and 297 novel putative mature miRNAs. Systematic enrichment analysis of gene expression allowed us to identify tissue-specific protein-coding transcripts. We also found putative novel and conserved microRNAs which potentially target mRNAs which are reported as important disease candidates in heart and liver. Copyright © 2017 Elsevier Inc. All rights reserved.
Shi, Wan; Quan, Mingyang; Du, Qingzhang; Zhang, Deqiang
2017-01-01
Long non-coding RNAs (lncRNAs) are important regulatory factors for plant growth and development, but little is known about the allelic interactions of lncRNAs with mRNA in perennial plants. Here, we analyzed the interaction of the NERD (Needed for RDR2-independent DNA methylation) Populus tomentosa gene PtoNERD with its putative regulator, the lncRNA NERDL (NERD-related lncRNA), which partially overlaps with the promoter region of this gene. Expression analysis in eight tissues showed a positive correlation between NERDL and PtoNERD (r = 0.62), suggesting that the interaction of NERDL with its putative target might be involved in wood formation. We conducted association mapping in a natural population of P. tomentosa (435 unrelated individuals) to evaluate genetic variation and the interaction of the lncRNA NERDL with PtoNERD. Using additive and dominant models, we identified 30 SNPs (P < 0.01) associated with five tree growth and wood property traits. Each SNP explained 3.90–8.57% of phenotypic variance, suggesting that NERDL and its putative target play a common role in wood formation. Epistasis analysis uncovered nine SNP-SNP association pairs between NERDL and PtoNERD, with an information gain of -7.55 to 2.16%, reflecting the strong interactions between NERDL and its putative target. This analysis provides a powerful method for deciphering the genetic interactions of lncRNAs with mRNA and dissecting the complex genetic network of quantitative traits in trees. PMID:28674544
Kress, W John; Erickson, David L
2007-06-06
A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.
Liu, Chen; Shen, He Ding; Zhou, Na
2016-01-01
The complete mitochondrial genome sequence of Platevindex sp. is firstly described in the article. The mitogenome (13,908 bp) contains 22 tRNA genes, 2 ribosomal RNA genes and 13 protein-coding genes, and 1 putative control region (CR). CR is not well characterized due to lack of discrete conserved sequence blocks. This characteristic is similar with CRs of other invertebrate mitochondrial genomes. The characteristic is the typical bivalvia mitochondrial gene composition.
DNA methylation of miRNA coding sequences putatively associated with childhood obesity.
Mansego, M L; Garcia-Lacarte, M; Milagro, F I; Marti, A; Martinez, J A
2017-02-01
Epigenetic mechanisms may be involved in obesity onset and its consequences. The aim of the present study was to evaluate whether DNA methylation status in microRNA (miRNA) coding regions is associated with childhood obesity. DNA isolated from white blood cells of 24 children (identification sample: 12 obese and 12 non-obese) from the Grupo Navarro de Obesidad Infantil study was hybridized in a 450 K methylation microarray. Several CpGs whose DNA methylation levels were statistically different between obese and non-obese were validated by MassArray® in 95 children (validation sample) from the same study. Microarray analysis identified 16 differentially methylated CpGs between both groups (6 hypermethylated and 10 hypomethylated). DNA methylation levels in miR-1203, miR-412 and miR-216A coding regions significantly correlated with body mass index standard deviation score (BMI-SDS) and explained up to 40% of the variation of BMI-SDS. The network analysis identified 19 well-defined obesity-relevant biological pathways from the KEGG database. MassArray® validation identified three regions located in or near miR-1203, miR-412 and miR-216A coding regions differentially methylated between obese and non-obese children. The current work identified three CpG sites located in coding regions of three miRNAs (miR-1203, miR-412 and miR-216A) that were differentially methylated between obese and non-obese children, suggesting a role of miRNA epigenetic regulation in childhood obesity. © 2016 World Obesity Federation.
Venieraki, Anastasia; Dimou, Maria; Vezyri, Eleni; Vamvakas, Alexandros; Katinaki, Pagona-Artemis; Chatzipavlidis, Iordanis; Tampakaki, Anastasia; Katinakis, Panagiotis
2014-01-01
The presence of nitrogen fixers within the genus Pseudomonas has been established and so far most isolated strains are phylogenetically affiliated to Pseudomonas stutzeri. A gene ortholog neighborhood analysis of the nitrogen fixation island (NFI) in four diazotrophic P. stutzeri strains and Pseudomonas azotifigens revealed that all are flanked by genes coding for cobalamin synthase (cobS) and glutathione peroxidise (gshP). The putative NFIs lack all the features characterizing a mobilizable genomic island. Nevertheless, bioinformatic analysis P. stutzeri DSM 4166 NFI demonstrated the presence of short inverted and/or direct repeats within both flanking regions. The other P. stutzeri strains carry only one set of repeats. The genetic diversity of eleven diazotrophic Pseudomonas isolates was also investigated. Multilocus sequence typing grouped nine isolates along with P. stutzeri and two isolates are grouped in a separate clade. A Rep-PCR fingerprinting analysis grouped the eleven isolates into four distinct genotypes. We also provided evidence that the putative NFI in our diazotrophic Pseudomonas isolates is flanked by cobS and gshP genes. Furthermore, we demonstrated that the putative NFI of Pseudomonas sp. Gr65 is flanked by inverted repeats identical to those found in P. stutzeri DSM 4166 and while the other P. stutzeri isolates harbor the repeats located in the intergenic region between cobS and glutaredoxin genes as in the case of P. stutzeri A1501. Taken together these data suggest that all putative NFIs of diazotrophic Pseudomonas isolates are anchored in an intergenic region between cobS and gshP genes and their flanking regions are designated by distinct repeats patterns. Moreover, the presence of almost identical NFIs in diazotrophic Pseudomonas strains isolated from distal geographical locations around the world suggested that this horizontal gene transfer event may have taken place early in the evolution. PMID:25251496
Venieraki, Anastasia; Dimou, Maria; Vezyri, Eleni; Vamvakas, Alexandros; Katinaki, Pagona-Artemis; Chatzipavlidis, Iordanis; Tampakaki, Anastasia; Katinakis, Panagiotis
2014-01-01
The presence of nitrogen fixers within the genus Pseudomonas has been established and so far most isolated strains are phylogenetically affiliated to Pseudomonas stutzeri. A gene ortholog neighborhood analysis of the nitrogen fixation island (NFI) in four diazotrophic P. stutzeri strains and Pseudomonas azotifigens revealed that all are flanked by genes coding for cobalamin synthase (cobS) and glutathione peroxidise (gshP). The putative NFIs lack all the features characterizing a mobilizable genomic island. Nevertheless, bioinformatic analysis P. stutzeri DSM 4166 NFI demonstrated the presence of short inverted and/or direct repeats within both flanking regions. The other P. stutzeri strains carry only one set of repeats. The genetic diversity of eleven diazotrophic Pseudomonas isolates was also investigated. Multilocus sequence typing grouped nine isolates along with P. stutzeri and two isolates are grouped in a separate clade. A Rep-PCR fingerprinting analysis grouped the eleven isolates into four distinct genotypes. We also provided evidence that the putative NFI in our diazotrophic Pseudomonas isolates is flanked by cobS and gshP genes. Furthermore, we demonstrated that the putative NFI of Pseudomonas sp. Gr65 is flanked by inverted repeats identical to those found in P. stutzeri DSM 4166 and while the other P. stutzeri isolates harbor the repeats located in the intergenic region between cobS and glutaredoxin genes as in the case of P. stutzeri A1501. Taken together these data suggest that all putative NFIs of diazotrophic Pseudomonas isolates are anchored in an intergenic region between cobS and gshP genes and their flanking regions are designated by distinct repeats patterns. Moreover, the presence of almost identical NFIs in diazotrophic Pseudomonas strains isolated from distal geographical locations around the world suggested that this horizontal gene transfer event may have taken place early in the evolution.
Tsuchiya, Takayoshi; Shibata, Minoru; Numabe, Hironao; Jinno, Tomoko; Nakabayashi, Kazuhiko; Nishimura, Gen; Nagai, Toshiro; Ogata, Tsutomu; Fukami, Maki
2014-02-01
Haploinsufficiency of SHOX on the short arm pseudoautosomal region (PAR1) leads to Leri-Weill dyschondrosteosis (LWD), and nullizygosity of SHOX results in Langer mesomelic dysplasia (LMD). Molecular defects of LWD/LMD include various microdeletions in PAR1 that involve exons and/or the putative upstream or downstream enhancer regions of SHOX, as well as several intragenic mutations. Here, we report on a Japanese male infant with mild manifestations of LMD and hitherto unreported microdeletions in PAR1. Clinical analysis revealed mesomelic short stature with various radiological findings indicative of LMD. Molecular analyses identified compound heterozygous deletions, that is, a maternally inherited ∼46 kb deletion involving the upstream region and exons 1-5 of SHOX, and a paternally inherited ∼500 kb deletion started from a position ∼300 kb downstream from SHOX. In silico analysis revealed that the downstream deletion did not affect the known putative enhancer regions of SHOX, although it encompassed several non-coding elements which were well conserved among various species with SHOX orthologs. These results provide the possibility of the presence of a novel enhancer for SHOX in the genomic region ∼300 to ∼800 kb downstream of the start codon. © 2013 Wiley Periodicals, Inc.
Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E
1996-10-03
We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Pyrococcus furiosus (Pf), a hyperthermophillic archeon. Sequence analysis of the Pf gene indicated an open reading frame specifying a protein of 485 amino acids (aa) with a calculated M(r) of 52900. Canonical Archaea promoter elements, Box A and Box B, are located -49 and -17 nucleotides (nt), respectively, upstream of the putative start codon. The sequence of the putative active-site region conforms to the IMPDH signature motif and contains a putative active-site cysteine. Phylogenetic relationships derived by using all available IMPDH sequences are consistent with trees developed for other molecules; they do not precisely resolve the history of Pf IMPDH but indicate a close similarity to bacterial IMPDH proteins. The phylogenetic analysis indicates that a gene duplication occurred prior to the division between rodents and humans, accounting for the Type I and II isoforms identified in mice and humans.
Regulation of the alpha-glucuronidase-encoding gene ( aguA) from Aspergillus niger.
de Vries, R P; van de Vondervoort, P J I; Hendriks, L; van de Belt, M; Visser, J
2002-09-01
The alpha-glucuronidase gene aguA from Aspergillus niger was cloned and characterised. Analysis of the promoter region of aguA revealed the presence of four putative binding sites for the major carbon catabolite repressor protein CREA and one putative binding site for the transcriptional activator XLNR. In addition, a sequence motif was detected which differed only in the last nucleotide from the XLNR consensus site. A construct in which part of the aguA coding region was deleted still resulted in production of a stable mRNA upon transformation of A. niger. The putative XLNR binding sites and two of the putative CREA binding sites were mutated individually in this construct and the effects on expression were examined in A. niger transformants. Northern analysis of the transformants revealed that the consensus XLNR site is not actually functional in the aguA promoter, whereas the sequence that diverges from the consensus at a single position is functional. This indicates that XLNR is also able to bind to the sequence GGCTAG, and the XLNR binding site consensus should therefore be changed to GGCTAR. Both CREA sites are functional, indicating that CREA has a strong influence on aguA expression. A detailed expression analysis of aguA in four genetic backgrounds revealed a second regulatory system involved in activation of aguA gene expression. This system responds to the presence of glucuronic and galacturonic acids, and is not dependent on XLNR.
Zhao, J.; Chen, Y. H.; Kwan, H. S.
2000-01-01
The complete nucleotide sequence of putative glucoamylase gene gla1 from the basidiomycetous fungus Lentinula edodes strain L54 is reported. The coding region of the genomic glucoamylase sequence, which is preceded by eukaryotic promoter elements CAAT and TATA, spans 2,076 bp. The gla1 gene sequence codes for a putative polypeptide of 571 amino acids and is interrupted by seven introns. The open reading frame sequence of the gla1 gene shows strong homology with those of other fungal glucoamylase genes and encodes a protein with an N-terminal catalytic domain and a C-terminal starch-binding domain. The similarity between the Gla1 protein and other fungal glucoamylases is from 45 to 61%, with the region of highest conservation found in catalytic domains and starch-binding domains. We compared the kinetics of glucoamylase activity and levels of gene expression in L. edodes strain L54 grown on different carbon sources (glucose, starch, cellulose, and potato extract) and in various developmental stages (mycelium growth, primordium appearance, and fruiting body formation). Quantitative reverse transcription PCR utilizing pairs of primers specific for gla1 gene expression shows that expression of gla1 was induced by starch and increased during the process of fruiting body formation, which indicates that glucoamylases may play an important role in the morphogenesis of the basidiomycetous fungus. PMID:10831434
A new polymorphic and multicopy MHC gene family related to nonmammalian class I
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leelayuwat, C.; Degli-Esposti, M.A.; Abraham, L.J.
1994-12-31
The authors have used genomic analysis to characterize a region of the central major histocompatibility complex (MHC) spanning {approximately} 300 kilobases (kb) between TNF and HLA-B. This region has been suggested to carry genetic factors relevant to the development of autoimmune diseases such as myasthenia gravis (MG) and insulin dependent diabetes mellitus (IDDM). Genomic sequence was analyzed for coding potential, using two neural network programs, GRAIL and GeneParser. A genomic probe, JAB, containing putative coding sequences (PERB11) located 60 kb centromeric of HLA-B, was used for northern analysis of human tissues. Multiple transcripts were detected. Southern analysis of genomic DNAmore » and overlapping YAC clones, covering the region from BAT1 to HLA-F, indicated that there are at least five copies of PERB11, four of which are located within this region of the MHC. The partial cDNA sequence of PERB11 was obtained from poly-A RNA derived from skeletal muscle. The putative amino acid sequence of PERB11 shares {approximately} 30% identity to MHC class I molecules from various species, including reptiles, chickens, and frogs, as well as to other MHC class I-like molecules, such as the IgG FcR of the mouse and rat and the human Zn-{alpha}2-glycoprotein. From direct comparison of amino acid sequences, it is concluded that PERB11 is a distinct molecule more closely related to nonmammalian than known mammalian MHC class I molecules. Genomic sequence analysis of PERB11 from five MHC ancestral haplotypes (AH) indicated that the gene is polymorphic at both DNA and protein level. The results suggest that the authors have identified a novel polymorphic gene family with multiple copies within the MHC. 48 refs., 10 figs., 2 tabs.« less
Kress, W. John; Erickson, David L.
2007-01-01
Background A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Methodology/Principal Findings Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. Conclusions/Significance A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination. PMID:17551588
Giardina, P; Cannio, R; Martirani, L; Marzullo, L; Palmieri, G; Sannia, G
1995-01-01
The gene (pox1) encoding a phenol oxidase from Pleurotus ostreatus, a lignin-degrading basidiomycete, was cloned and sequenced, and the corresponding pox1 cDNA was also synthesized and sequenced. The isolated gene consists of 2,592 bp, with the coding sequence being interrupted by 19 introns and flanked by an upstream region in which putative CAAT and TATA consensus sequences could be identified at positions -174 and -84, respectively. The isolation of a second cDNA (pox2 cDNA), showing 84% similarity, and of the corresponding truncated genomic clones demonstrated the existence of a multigene family coding for isoforms of laccase in P. ostreatus. PCR amplifications of specific regions on the DNA of isolated monokaryons proved that the two genes are not allelic forms. The POX1 amino acid sequence deduced was compared with those of other known laccases from different fungi. PMID:7793961
From Genomes to Protein Models and Back
NASA Astrophysics Data System (ADS)
Tramontano, Anna; Giorgetti, Alejandro; Orsini, Massimiliano; Raimondo, Domenico
2007-12-01
The alternative splicing mechanism allows genes to generate more than one product. When the splicing events occur within protein coding regions they can modify the biological function of the protein. Alternative splicing has been suggested as one way for explaining the discrepancy between the number of human genes and functional complexity. We analysed the putative structure of the alternatively spliced gene products annotated in the ENCODE pilot project and discovered that many of the potential alternative gene products will be unlikely to produce stable functional proteins.
Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci
Brorsson, Caroline A.; Pociot, Flemming
2014-01-01
Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs. PMID:25144376
Evidence for regulation of columnar habit in apple by a putative 2OG-Fe(II) oxygenase.
Wolters, Pieter J; Schouten, Henk J; Velasco, Riccardo; Si-Ammour, Azeddine; Baldi, Paolo
2013-12-01
Understanding the genetic mechanisms controlling columnar-type growth in the apple mutant 'Wijcik' will provide insights on how tree architecture and growth are regulated in fruit trees. In apple, columnar-type growth is controlled by a single major gene at the Columnar (Co) locus. By comparing the genomic sequence of the Co region of 'Wijcik' with its wild-type 'McIntosh', a novel non-coding DNA element of 1956 bp specific to Pyreae was found to be inserted in an intergenic region of 'Wijcik'. Expression analysis of selected genes located in the vicinity of the insertion revealed the upregulation of the MdCo31 gene encoding a putative 2OG-Fe(II) oxygenase in axillary buds of 'Wijcik'. Constitutive expression of MdCo31 in Arabidopsis thaliana resulted in compact plants with shortened floral internodes, a phenotype reminiscent of the one observed in columnar apple trees. We conclude that MdCo31 is a strong candidate gene for the control of columnar growth in 'Wijcik'. No claim to original European Union works. New Phytologist © 2013 New Phytologist Trust.
2013-01-01
Background Polycomb Repressive Complex 2 (PRC2) is an essential regulator of gene expression that maintains genes in a repressed state by marking chromatin with trimethylated Histone H3 lysine 27 (H3K27me3). In Arabidopsis, loss of PRC2 function leads to pleiotropic effects on growth and development thought to be due to ectopic expression of seed and embryo-specific genes. While there is some understanding of the mechanisms by which specific genes are targeted by PRC2 in animal systems, it is still not clear how PRC2 is recruited to specific regions of plant genomes. Results We used ChIP-seq to determine the genome-wide distribution of hemagglutinin (HA)-tagged FERTLIZATION INDEPENDENT ENDOSPERM (FIE-HA), the Extra Sex Combs homolog protein present in all Arabidopsis PRC2 complexes. We found that the FIE-HA binding sites co-locate with a subset of the H3K27me3 sites in the genome and that the associated genes were more likely to be de-repressed in mutants of PRC2 components. The FIE-HA binding sites are enriched for three sequence motifs including a putative GAGA factor binding site that is also found in Drosophila Polycomb Response Elements (PREs). Conclusions Our results suggest that PRC2 binding sites in plant genomes share some sequence features with Drosophila PREs. However, unlike Drosophila PREs which are located in promoters and devoid of H3K27me3, Arabidopsis FIE binding sites tend to be in gene coding regions and co-localize with H3K27me3. PMID:24001316
The complete mitochondrial genome sequence of Aesopia cornuta (Pleuronectiformes: Soleidae).
Wang, Shu-Ying; Shi, Wei; Wang, Zhong-Ming; Gong, Li; Kong, Xiao-Yu
2015-02-01
Aesopia cornuta belongs to the family Soleidae of Pleuronectiformes, and the morphological characters are much similar to those of Zebrias. In this article, we sequenced, characterized, and compared the complete mitogenome of A. cornuta for the first time. The genome is 16,737 base pairs in length, and is typically consist of 37 genes, including 13 protein-coding genes, two ribosomal RNA, 22 transfer RNA, as well as a putative L-strand replication origin and a putative control region. The gene organization is identical to that of typical bony fishes. The overall base composition is 29.1, 28.3, 26.8 and 15.8% for C, A, T and G, respectively, with a slight AT bias of 55.1%. This result is expected to contribute to understanding the systematic evolution of the genus Aesopia and further taxonomic and phylogenetic studies of Soleidae and Pleuronectiformes.
Tobin, M B; Kovacevic, S; Madduri, K; Hoskins, J A; Skatrud, P L; Vining, L C; Stuttard, C; Miller, J R
1991-01-01
Lysine epsilon-aminotransferase (LAT) in the beta-lactam-producing actinomycetes is considered to be the first step in the antibiotic biosynthetic pathway. Cloning of restriction fragments from Streptomyces clavuligerus, a beta-lactam producer, into Streptomyces lividans, a nonproducer that lacks LAT activity, led to the production of LAT in the host. DNA sequencing of restriction fragments containing the putative lat gene revealed a single open reading frame encoding a polypeptide with an approximately Mr 49,000. Expression of this coding sequence in Escherichia coli led to the production of LAT activity. Hence, LAT activity in S. clavuligerus is derived from a single polypeptide. A second open reading frame began immediately downstream from lat. Comparison of this partial sequence with the sequences of delta-(L-alpha-aminoadipyl)-L-cysteinyl-D valine (ACV) synthetases from Penicillium chrysogenum and Cephalosporium acremonium and with nonribosomal peptide synthetases (gramicidin S and tyrocidine synthetases) found similarities among the open reading frames. Since mapping of the putative N and C termini of S. clavuligerus pcbAB suggests that the coding region occupies approximately 12 kbp and codes for a polypeptide related in size to the fungal ACV synthetases, the molecular characterization of the beta-lactam biosynthetic cluster between pcbC and cefE (approximately 25 kbp) is nearly complete. Images PMID:1917855
Javierre, Biola M; Burren, Oliver S; Wilder, Steven P; Kreuzhuber, Roman; Hill, Steven M; Sewitz, Sven; Cairns, Jonathan; Wingett, Steven W; Várnai, Csilla; Thiecke, Michiel J; Burden, Frances; Farrow, Samantha; Cutler, Antony J; Rehnström, Karola; Downes, Kate; Grassi, Luigi; Kostadima, Myrto; Freire-Pritchett, Paula; Wang, Fan; Stunnenberg, Hendrik G; Todd, John A; Zerbino, Daniel R; Stegle, Oliver; Ouwehand, Willem H; Frontini, Mattia; Wallace, Chris; Spivakov, Mikhail; Fraser, Peter
2016-11-17
Long-range interactions between regulatory elements and gene promoters play key roles in transcriptional regulation. The vast majority of interactions are uncharted, constituting a major missing link in understanding genome control. Here, we use promoter capture Hi-C to identify interacting regions of 31,253 promoters in 17 human primary hematopoietic cell types. We show that promoter interactions are highly cell type specific and enriched for links between active promoters and epigenetically marked enhancers. Promoter interactomes reflect lineage relationships of the hematopoietic tree, consistent with dynamic remodeling of nuclear architecture during differentiation. Interacting regions are enriched in genetic variants linked with altered expression of genes they contact, highlighting their functional role. We exploit this rich resource to connect non-coding disease variants to putative target promoters, prioritizing thousands of disease-candidate genes and implicating disease pathways. Our results demonstrate the power of primary cell promoter interactomes to reveal insights into genomic regulatory mechanisms underlying common diseases. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Gardner, Elliot M.; Johnson, Matthew G.; Ragone, Diane; Wickett, Norman J.; Zerega, Nyree J. C.
2016-01-01
Premise of the study: We used moderately low-coverage (17×) whole-genome sequencing of Artocarpus camansi (Moraceae) to develop genomic resources for Artocarpus and Moraceae. Methods and Results: A de novo assembly of Illumina short reads (251,378,536 pairs, 2 × 100 bp) accounted for 93% of the predicted genome size. Predicted coding regions were used in a three-way orthology search with published genomes of Morus notabilis and Cannabis sativa. Phylogenetic markers for Moraceae were developed from 333 inferred single-copy exons. Ninety-eight putative MADS-box genes were identified. Analysis of all predicted coding regions resulted in preliminary annotation of 49,089 genes. An analysis of synonymous substitutions for pairs of orthologs (Ks analysis) in M. notabilis and A. camansi strongly suggested a lineage-specific whole-genome duplication in Artocarpus. Conclusions: This study substantially increases the genomic resources available for Artocarpus and Moraceae and demonstrates the value of low-coverage de novo assemblies for nonmodel organisms with moderately large genomes. PMID:27437173
Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).
Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang
2016-07-01
The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.
Basu, Swaraj; Larsson, Erik
2018-05-31
Antisense transcripts and other long non-coding RNAs are pervasive in mammalian cells, and some of these molecules have been proposed to regulate proximal protein-coding genes in cis For example, non-coding transcription can contribute to inactivation of tumor suppressor genes in cancer, and antisense transcripts have been implicated in the epigenetic inactivation of imprinted genes. However, our knowledge is still limited and more such regulatory interactions likely await discovery. Here, we make use of available gene expression data from a large compendium of human tumors to generate hypotheses regarding non-coding-to-coding cis -regulatory relationships with emphasis on negative associations, as these are less likely to arise for reasons other than cis -regulation. We document a large number of possible regulatory interactions, including 193 coding/non-coding pairs that show expression patterns compatible with negative cis -regulation. Importantly, by this approach we capture several known cases, and many of the involved coding genes have known roles in cancer. Our study provides a large catalog of putative non-coding/coding cis -regulatory pairs that may serve as a basis for further experimental validation and characterization. Copyright © 2018 Basu and Larsson.
The LacI family protein GlyR3 co-regulates the celC operon and manB in Clostridium thermocellum
Choi, Jinlyung; Klingeman, Dawn M.; Brown, Steven D.; ...
2017-06-24
In this paper, we demonstrate that the GlyR3 protein mediates the regulation of manB. We first identify putative GlyR3 binding sites within or just upstream of the coding regions of manB and celT. Using an electrophoretic mobility shift assay (EMSA), we determined that a higher concentration of GlyR3 is required to effectively bind to the putative manB site in comparison to the celC site. Neither the putative celT site nor random DNA significantly binds GlyR3. While laminaribiose interfered with GlyR3 binding to the celC binding site, binding to the manB site was unaffected. In the presence of laminaribiose, in vivomore » transcription of the celC–glyR3–licA gene cluster increases, while manB expression is repressed, compared to in the absence of laminaribiose, consistent with the results from the EMSA. An in vitro transcription assay demonstrated that GlyR3 and laminaribiose interactions were responsible for the observed patters of in vivo transcription.« less
García Guerreiro, M P; Fontdevila, A
2007-01-01
A new transposable element, Isis, is identified as a LTR retrotransposon in Drosophila buzzatii. DNA sequence analysis shows that Isis contains three long ORFs similar to gag, pol and env genes of retroviruses. The ORF1 exhibits sequence homology to matrix, capsid and nucleocapsid gag proteins and ORF2 encodes a putative protease (PR), a reverse transcriptase (RT), an Rnase H (RH) and an integrase (IN) region. The analysis of a putative env product, encoded by the env ORF3, shows a degenerated protein containing several stop codons. The molecular study of the putative proteins coded by this new element shows striking similarities to both Ulysses and Osvaldo elements, two LTR retrotransposons, present in D. virilis and D. buzzatii, respectively. Comparisons of the predicted Isis RT to several known retrotransposons show strong phylogenetic relationships to gypsy-like elements, particulary to Ulysses retrotransposon. Studies of Isis chromosomal distribution show a strong hybridization signal in centromeric and pericentromeric regions, and a scattered distribution along all chromosomal arms. The existence of insertional polymorphisms between different strains and high molecular weight bands by Southern blot suggests the existence of full-sized copies that have been active recently. The presence of euchromatic insertion sites coincident between Isis and Osvaldo could indicate preferential insertion sites of Osvaldo element into Isis sequence or vice versa. Moreover, the presence of Isis in different species of the buzzatii complex indicates the ancient origin of this element.
Boyd, David A.; Thevenot, Tracy; Gumbmann, Markus; Honeyman, Allen L.; Hamilton, Ian R.
2000-01-01
Transposon mutagenesis and marker rescue were used to isolate and identify an 8.5-kb contiguous region containing six open reading frames constituting the operon for the sorbitol P-enolpyruvate phosphotransferase transport system (PTS) of Streptococcus mutans LT11. The first gene, srlD, codes for sorbitol-6-phosphate dehydrogenase, followed downstream by srlR, coding for a transcriptional regulator; srlM, coding for a putative activator; and the srlA, srlE, and srlB genes, coding for the EIIC, EIIBC, and EIIA components of the sorbitol PTS, respectively. Among all sorbitol PTS operons characterized to date, the srlD gene is found after the genes coding for the EII components; thus, the location of the gene in S. mutans is unique. The SrlR protein is similar to several transcriptional regulators found in Bacillus spp. that contain PTS regulator domains (J. Stülke, M. Arnaud, G. Rapoport, and I. Martin-Verstraete, Mol. Microbiol. 28:865–874, 1998), and its gene overlaps the srlM gene by 1 bp. The arrangement of these two regulatory genes is unique, having not been reported for other bacteria. PMID:10639465
Jakubowska, Agata K; Peters, Sander A; Ziemnicka, Jadwiga; Vlak, Just M; van Oers, Monique M
2006-03-01
The genome sequence of a Polish isolate of Agrotis segetum nucleopolyhedrovirus (AgseNPV-A) was determined and analysed. The circular genome is composed of 147,544 bp and has a G+C content of 45.7 mol%. It contains 153 putative, non-overlapping open reading frames (ORFs) encoding predicted proteins of more than 50 aa, together making up 89.8 % of the genome. The remaining 10.2 % of the DNA constitutes non-coding regions and homologous-repeat regions. One hundred and forty-three AgseNPV-A ORFs are homologues of previously reported baculovirus gene sequences. There are ten unique ORFs and they account for 3 % of the genome in total. All 62 lepidopteran baculovirus genes, including the 29 core baculovirus genes, were found in the AgseNPV-A genome. The gene content and gene order of AgseNPV-A are most similar to those of Spodoptera exigua (Se) multiple NPV and their shared homologous genes are 100 % collinear. Three putative enhancin genes were identified in the AgseNPV-A genome. In phylogenetic analysis, the AgseNPV-A enhancins form a cluster separated from enhancins of the Mamestra species NPVs.
Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome
Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing
2007-01-01
Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza. sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628
Suetomi, Yuta; Matsuda, Fuko; Uenoyama, Yoshihisa; Maeda, Kei-ichiro; Tsukamura, Hiroko; Ohkura, Satoshi
2013-10-01
Neurokinin B (NKB), encoded by TAC3, is thought to be an important accelerator of pulsatile gonadotropin-releasing hormone release. This study aimed to clarify the transcriptional regulatory mechanism of goat TAC3. First, we determined the full-length mRNA sequence of goat TAC3 from the hypothalamus to be 820 b, including a 381 b coding region, with the putative transcription start site located 143-b upstream of the start codon. The deduced amino acid sequence of NKB, which is produced from preproNKB, was completely conserved among goat, cattle, and human. Next, we cloned 5'-upstream region of goat TAC3 up to 3400 b from the translation initiation site, and this region was highly homologous with cattle TAC3 (89%). We used this goat TAC3 5'-upstream region to perform luciferase assays. We created a luciferase reporter vector containing DNA constructs from -2706, -1837, -834, -335, or -197 to +166 bp (the putative transcription start site was designated as +1) of goat TAC3 and these were transiently transfected into mouse hypothalamus-derived N7 cells and human neuroblastoma-derived SK-N-AS cells. The luciferase activity gradually increased with the deletion of the 5'-upstream region, suggesting that the transcriptional suppressive region is located between -2706 and -336 bp and that the core promoter exists downstream of -197 bp. Estradiol treatment did not lead to significant suppression of luciferase activity of any constructs, suggesting the existence of other factor(s) that regulate goat TAC3 transcription.
Ito, M; Mori, Y; Oiso, Y; Saito, H
1991-01-01
To elucidate the molecular mechanism of familial central diabetes insipidus (FDI), we sequenced the arginine vasopressin-neurophysin II (AVP-NPII) gene in 2 patients belonging to a pedigree that is consistent with an autosomal dominant mode of inheritance. 10 patients with idiopathic central diabetes insipidus (IDI) and 5 normals were also studied. The AVP-NPII gene, locating on chromosome 20, consists of three exons that encode putative signal peptide, AVP, NPII, and glycoprotein. Using polymerase chain reaction, fragments including the promoter region and all coding regions were amplified from genomic DNA and subjected to direct sequencing. Sequences of 10 patients with IDI were identical with those of normals, while in 2 patients with FDI, a single base substitution was detected in one of two alleles of the AVP-NPII gene, indicating they were heterozygotes for this mutation. It was a G----A transition at nucleotide position 1859 in the second exon, resulting in a substitution of Gly for Ser at amino acid position 57 in the NPII moiety. It was speculated that the mutated AVP-NPII precursor or the mutated NPII molecule, through their conformational changes, might be responsible for AVP deficiency. Images PMID:1840604
Genomic Structure of the Luciferase Gene from the Bioluminescent Beetle, Nyctophila cf. Caucasica
Day, John C.; Chaichi, Mohammad J.; Najafil, Iraj; Whiteley, Andrew S.
2006-01-01
The gene coding for beetle luciferase, the enzyme responsible for bioluminescence in over two thousand coleopteran species has, to date, only been characterized from one Palearctic species of Lampyridae. Here we report the characterization of the luciferase gene from a female beetle of an Iranian lampyrid species, Nyctophila cf. caucasica (Coleoptera:Lampyridae). The luciferase gene was composed of seven exons, coding for 547 amino acids, separated by six introns spanning 1976 bp of genomic DNA. The deduced amino acid sequences of the luciferase gene of N. caucasica showed 98.9% homology to that of the Palearctic species Lampyris noctiluca. Analysis of the 810 bp upstream region of the luciferase gene revealed three TATA boxes and several other consensus transcriptional factor recognition sequences presenting evidence for a putative core promoter region conserved in Lampyrinae from -190 through to -155 upstream of the luciferase start codon. Along with the core promoter region the luciferase gene was compared with orthologous sequences from other lampyrid species and found to have greatest identity to Lampyris turkistanicus and Lampyris noctiluca. The significant sequence identity to the former is discussed in relation to taxonomic issues of Iranian lampyrids. PMID:20298115
Wise, Carol A.; Chiang, Lydia C.; Paznekas, William A.; Sharma, Mridula; Musy, Maurice M.; Ashley, Jennifer A.; Lovett, Michael; Jabs, Ethylin W.
1997-01-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development. PMID:9096354
Phylogenetic and Molecular Variability Studies Reveal a New Genetic Clade of Citrus leprosis virus C
Ramos-González, Pedro Luis; Chabi-Jesus, Camila; Guerra-Peraza, Orlene; Breton, Michèle Claire; Arena, Gabriella Dias; Nunes, Maria Andreia; Kitajima, Elliot Watanabe; Machado, Marcos Antonio; Freitas-Astúa, Juliana
2016-01-01
Citrus leprosis virus C (CiLV-C) causes a severe disease affecting citrus orchards in the Western hemisphere. This study reveals the molecular variability of the virus by analyzing four genomic regions (p29, p15, MP and RNA2-intergenic region) distributed over its two RNAs. Nucleotide diversity (π) values were relatively low but statistically different over the analyzed genes and subpopulations, indicating their distinct evolutionary history. Values of πp29 and πMP were higher than those of πp15 and πRNA2–IR, whereas πMP was increased due to novel discovered isolates phylogenetically clustered in a divergent clade that we called SJP. Isolate BR_SP_SJP_01 RNA1 and RNA2 sequences, clade SJP, showed an identity of 85.6% and 88.4%, respectively, with those corresponding to CiLV-C, the type member of the genus Cilevirus, and its RNA2 5′-proximal region was revealed as a minor donor in a putative inter-clade recombination event. In addition to citrus, BR_SP_SJP_01 naturally infects the weed Commelina benghalensis and is efficiently transmitted by Brevipalpus yothersi mites. Our data demonstrated that negative selection was the major force operating in the evaluated viral coding regions and defined amino acids putatively relevant for the biological function of cilevirus proteins. This work provides molecular tools and sets up a framework for further epidemiological studies. PMID:27275832
Parvari, R; Shen, J; Hershkovitz, E; Chen, Y T; Moses, S W
1998-04-01
Glycogen storage disease type III (GSD III) is an autosomal recessive disease caused by the deficiency of glycogen debranching enzyme (AGL). We report the finding of two new mutations in a GSD IIIa Ashkenazi Jewish patient. Both mutations are insertion of an adenine into a stretch of 8 adenines towards the 3' end of the coding region, one at position 3904 (3904insA) in exon 30, the second at position 4214 (4214insA) in exon 32. The mutations cause frameshifts and premature terminations of the glycogen debranching enzyme, the first causing a frameshift at amino acid 1304, the second causing a frameshift at amino acid 1408 of the total of 1532. These mutations demonstrate the importance of the 125 amino acids at the carboxy-terminus of the debrancher enzyme for its activity and support the suggestion that the putative glycogen binding domain is located in the carboxy-terminus of the AGL. The mutations cause distinctive single-strand conformation polymorphism (SSCP) patterns enabling easy detection.
Smith, David Roy; Hua, Jimeng; Archibald, John M.; Lee, Robert W.
2013-01-01
Organelle DNA is no stranger to palindromic repeats. But never has a mitochondrial or plastid genome been described in which every coding region is part of a distinct palindromic unit. While sequencing the mitochondrial DNA of the nonphotosynthetic green alga Polytomella magna, we uncovered precisely this type of genic arrangement. The P. magna mitochondrial genome is linear and made up entirely of palindromes, each containing 1–7 unique coding regions. Consequently, every gene in the genome is duplicated and in an inverted orientation relative to its partner. And when these palindromic genes are folded into putative stem-loops, their predicted translational start sites are often positioned in the apex of the loop. Gel electrophoresis results support the linear, 28-kb monomeric conformation of the P. magna mitochondrial genome. Analyses of other Polytomella taxa suggest that palindromic mitochondrial genes were present in the ancestor of the Polytomella lineage and lost or retained to various degrees in extant species. The possible origins and consequences of this bizarre genomic architecture are discussed. PMID:23940100
Marino, John A.; Perfecto, Ivette; Vandermeer, John
2015-01-01
The interaction of crop pests with their natural enemies is a fundament to their control. Natural enemies of fungal pathogens of crops are poorly known relative to those of insect pests, despite the diversity of fungal pathogens and their economic importance. Currently, many regions across Latin America are experiencing unprecedented epidemics of coffee rust (Hemileia vastatrix). Identification of natural enemies of coffee rust could aid in developing management strategies or in pinpointing species that could be used for biocontrol. In the present study, we characterized fungal communities associated with coffee rust lesions by single-molecule DNA sequencing of fungal rRNA gene bar codes from leaf discs (≈28 mm2) containing rust lesions and control discs with no rust lesions. The leaf disc communities were hyperdiverse in terms of fungi, with up to 69 operational taxonomic units (putative species) per control disc, and the diversity was only slightly reduced in rust-infected discs, with up to 63 putative species. However, geography had a greater influence on the fungal community than whether the disc was infected by coffee rust. Through comparisons between control and rust-infected leaf discs, as well as taxonomic criteria, we identified 15 putative mycoparasitic fungi. These fungi are concentrated in the fungal family Cordycipitaceae and the order Tremellales. These data emphasize the complexity of diverse fungi of unknown ecological function within a leaf that might influence plant disease epidemics or lead to the development of species for biocontrol of fungal disease. PMID:26567299
Feng, X; Happ, G M
1996-11-14
The cDNA for Sp23, a structural protein of the spermatophore of Tenebrio molitor, had been previously cloned and characterized (Paesen, G.C., Schwartz, M.B., Peferoen, M., Weyda, F. and Happ, G.M. (1992a) Amino acid sequence of Sp23, a structure protein of the spermatophore of the mealworm beetle, Tenebrio molitor. J. Biol. Chem. 257, 18852-18857). Using the labeled cDNA for Sp23 as a probe to screen a library of genomic DNA from Tenebrio molitor, we isolated a genomic clone for Sp23. A 5373-base pair (bp) restriction fragment containing the Sp23 gene was sequenced. The coding region is separated by a 55-bp intron which is located close to the translation start site. Three putative ecdysone response elements (EcRE) are identified in the 5' flanking region of the Sp23 gene. Comparison of the flanking regions of the Sp23 gene with those of the D-protein gene expressed in the accessory glands of Tenebrio reveals similar sequences present in the flanking regions of the two genes. The genomic organization of the coding region of the Sp23 gene shares similarities with that of the D-protein gene, three Drosophila accessory gland genes and two Drosophila 20-OH ecdysone-responsive genes.
Non-coding RNA generated following lariat-debranching mediates targeting of AID to DNA
Zheng, Simin; Vuong, Bao Q.; Vaidyanathan, Bharat; Lin, Jia-Yu; Huang, Feng-Ting; Chaudhuri, Jayanta
2015-01-01
SUMMARY Transcription through immunoglobulin switch (S) regions is essential for class switch recombination (CSR) but no molecular function of the transcripts has been described. Likewise, recruitment of activation-induced cytidine deaminase (AID) to S regions is critical for CSR; however, the underlying mechanism has not been fully elucidated. Here, we demonstrate that intronic switch RNA acts in trans to target AID to S region DNA. AID binds directly to switch RNA through G-quadruplexes formed by the RNA molecules. Disruption of this interaction by mutation of a key residue in the putative RNA-binding domain of AID impairs recruitment of AID to S region DNA, thereby abolishing CSR. Additionally, inhibition of RNA lariat processing leads to loss of AID localization to S regions and compromises CSR; both defects can be rescued by exogenous expression of switch transcripts in a sequence-specific manner. These studies uncover an RNA-mediated mechanism of targeting AID to DNA. PMID:25957684
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gault, J.; Zonana, J.; Zeltinger, J.
A conserved mouse genomic clone was used to identify a homologous human genomic clone (the DXS732E locus), which was subsequently employed to isolate cDNAs from a human fetal brain library. Nine unique overlapping cDNAs were isolated, and sequences analysis of 3.9 kb identified a putative 1 kb ORF. GRAIL analysis of the sequence supported the hypothesis that the putative ORF was coding sequence, and Prosite analysis of the putative ORF identified potential glycosylation and phosphorylation sites. The 5{prime} end of the gene maps within a CpG island, and comparison of cDNA sequences indicate the gene is alternatively spliced at itsmore » 3{prime} end. Northern analysis and RT-PCR indicate that two different sized messages appear to be expressed with the gene expressed in human fetal kidney, intestine, brain, and muscle. The gene is expressed in 77 day human skin, a time when hair follicle formation occurs. Anhidrotic ectodermal dysplasia (EDA) results in the abnormal morphogenesis of hair, teeth and eccrine sweat glands. A positional cloning strategy towards cloning the EDA gene had been used, and deletion and X-autosome translocation patients have been useful in further delimiting the EDA region. The present gene at the DXS732E locus is partially deleted in one EDA patient who does not have other apparent abnormalities. No rearrangements of the gene have been detected in two female X-autosome translocation EDA patients, nor in four additional male patients with submicroscopic molecular deletions.« less
Komatsu, Ken; Hirata, Hisae; Fukagawa, Takako; Yamaji, Yasuyuki; Okano, Yukari; Ishikawa, Kazuya; Adachi, Tatsushi; Maejima, Kensaku; Hashimoto, Masayoshi; Namba, Shigetou
2012-07-01
The first open-reading frame (ORF) of apple stem grooving virus (ASGV), of the genus Capillovirus, encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP). However, our previous study revealed that ASGV mutants with distinct and discontinuous Rep- and CP-coding regions successfully infect plants, indicating that CP expressed via a subgenomic RNA (sgRNA) is sufficient for viability of the virus. Here we identified a transcription start site of the CP sgRNA and revealed that CP translated from the sgRNA is essential for ASGV infection. We mapped the transcription start sites of both the CP and the movement protein (MP) sgRNAs of ASGV and found a hexanucleotide motif, UUAGGU, conserved upstream from both sgRNA transcription start sites. Mutational analysis of the putative CP initiation codon and of the UUAGGU sequence upstream from the transcription start site of CP sgRNA demonstrated their importance for ASGV accumulation. Our results also demonstrated that potato virus T (PVT), an unassigned species closely related to ASGV, produces two sgRNAs putatively deployed for the CP and MP expression and that the same hexanucleotide motif as found in ASGV is located upstream from the transcription start sites of both sgRNAs. This motif, which constituted putative core elements of the sgRNA promoter, is broadly conserved among viruses in the families Alphaflexiviridae and Betaflexiviridae, suggesting that the gene expression strategy of the viruses in both families has been conserved throughout evolution. Copyright © 2012 Elsevier B.V. All rights reserved.
In Silico Pattern-Based Analysis of the Human Cytomegalovirus Genome
Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T.; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas
2003-01-01
More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/). PMID:12634390
In silico pattern-based analysis of the human cytomegalovirus genome.
Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas
2003-04-01
More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/).
Fan, Qiuyun; Anderson, Adam W; Davis, Nicole; Cutting, Laurie E
2014-10-24
With the advent of neuroimaging techniques, especially functional MRI (fMRI), studies have mapped brain regions that are associated with good and poor reading, most centrally a region within the left occipito-temporal/fusiform region (L-OT/F) often referred to as the visual word form area (VWFA). Despite an abundance of fMRI studies of the putative VWFA, research about its structural connectivity has just started. Provided that the putative VWFA may be connected to distributed regions in the brain, it remains unclear how this network is engaged in constituting a well-tuned reading circuitry in the brain. Here we used diffusion MRI to study the structural connectivity patterns of the putative VWFA and surrounding areas within the L-OT/F in children with typically developing (TD) reading ability and with word recognition deficits (WRD; sometimes referred to as dyslexia). We found that L-OT/F connectivity varied along a posterior-anterior gradient, with specific structural connectivity patterns related to reading ability in the ROIs centered upon the putative VWFA. Findings suggest that the architecture of the putative VWFA connectivity is fundamentally different between TD and WRD, with TD showing greater connectivity to linguistic regions than WRD, and WRD showing greater connectivity to visual and parahippocampal regions than TD. Findings thus reveal clear structural abnormalities underlying the functional abnormalities in the putative VWFA in WRD. Copyright © 2014 Elsevier B.V. All rights reserved.
Catalano, Sarah R; Whittington, Ian D; Donnellan, Stephen C; Bertozzi, Terry; Gillanders, Bronwyn M
2015-07-01
Dicyemids, poorly known parasites of benthic cephalopods, are one of the few phyla in which mitochondrial (mt) genome architecture departs from the typical ~16 kb circular metazoan genome. In addition to a putative circular genome, a series of mt minicircles that each comprises the mt encoded units (I-III) of the cytochrome c oxidase complex have been reported. Whether the structure of the mt minicircles is a consistent feature among dicyemid species is unknown. Here we analyse the complete cytochrome c oxidase subunit I (COI) minicircle molecule, containing the COI gene and an associated non-coding region (NCR), for ten dicyemid species, allowing for first time comparisons between species of minicircle architecture, NCR function and inferences of minicircle replication. Divergence in COI nucleotide sequences between dicyemid species was high (average net divergence = 31.6%) while within species diversity was lower (average net divergence = 0.2%). The NCR and putative 5' section of the COI gene were highly divergent between dicyemid species (average net nucleotide divergence of putative 5' COI section = 61.1%). No tRNA genes were found in the NCR, although palindrome sequences with the potential to form stem-loop structures were identified in some species, which may play a role in transcription or other biological processes.
Lery, Letícia M S; Bitar, Mainá; Costa, Mauricio G S; Rössle, Shaila C S; Bisch, Paulo M
2010-12-22
G. diazotrophicus and A. vinelandii are aerobic nitrogen-fixing bacteria. Although oxygen is essential for the survival of these organisms, it irreversibly inhibits nitrogenase, the complex responsible for nitrogen fixation. Both microorganisms deal with this paradox through compensatory mechanisms. In A. vinelandii a conformational protection mechanism occurs through the interaction between the nitrogenase complex and the FeSII protein. Previous studies suggested the existence of a similar system in G. diazotrophicus, but the putative protein involved was not yet described. This study intends to identify the protein coding gene in the recently sequenced genome of G. diazotrophicus and also provide detailed structural information of nitrogenase conformational protection in both organisms. Genomic analysis of G. diazotrophicus sequences revealed a protein coding ORF (Gdia0615) enclosing a conserved "fer2" domain, typical of the ferredoxin family and found in A. vinelandii FeSII. Comparative models of both FeSII and Gdia0615 disclosed a conserved beta-grasp fold. Cysteine residues that coordinate the 2[Fe-S] cluster are in conserved positions towards the metallocluster. Analysis of solvent accessible residues and electrostatic surfaces unveiled an hydrophobic dimerization interface. Dimers assembled by molecular docking presented a stable behaviour and a proper accommodation of regions possibly involved in binding of FeSII to nitrogenase throughout molecular dynamics simulations in aqueous solution. Molecular modeling of the nitrogenase complex of G. diazotrophicus was performed and models were compared to the crystal structure of A. vinelandii nitrogenase. Docking experiments of FeSII and Gdia0615 with its corresponding nitrogenase complex pointed out in both systems a putative binding site presenting shape and charge complementarities at the Fe-protein/MoFe-protein complex interface. The identification of the putative FeSII coding gene in G. diazotrophicus genome represents a large step towards the understanding of the conformational protection mechanism of nitrogenase against oxygen. In addition, this is the first study regarding the structural complementarities of FeSII-nitrogenase interactions in diazotrophic bacteria. The combination of bioinformatic tools for genome analysis, comparative protein modeling, docking calculations and molecular dynamics provided a powerful strategy for the elucidation of molecular mechanisms and structural features of FeSII-nitrogenase interaction.
Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.
Mayer, K; Schüller, C; Wambutt, R; Murphy, G; Volckaert, G; Pohl, T; Düsterhöft, A; Stiekema, W; Entian, K D; Terryn, N; Harris, B; Ansorge, W; Brandt, P; Grivell, L; Rieger, M; Weichselgartner, M; de Simone, V; Obermaier, B; Mache, R; Müller, M; Kreis, M; Delseny, M; Puigdomenech, P; Watson, M; Schmidtheini, T; Reichert, B; Portatelle, D; Perez-Alonso, M; Boutry, M; Bancroft, I; Vos, P; Hoheisel, J; Zimmermann, W; Wedler, H; Ridley, P; Langham, S A; McCullagh, B; Bilham, L; Robben, J; Van der Schueren, J; Grymonprez, B; Chuang, Y J; Vandenbussche, F; Braeken, M; Weltjens, I; Voet, M; Bastiaens, I; Aert, R; Defoor, E; Weitzenegger, T; Bothe, G; Ramsperger, U; Hilbert, H; Braun, M; Holzer, E; Brandt, A; Peters, S; van Staveren, M; Dirske, W; Mooijman, P; Klein Lankhorst, R; Rose, M; Hauf, J; Kötter, P; Berneiser, S; Hempel, S; Feldpausch, M; Lamberth, S; Van den Daele, H; De Keyser, A; Buysshaert, C; Gielen, J; Villarroel, R; De Clercq, R; Van Montagu, M; Rogers, J; Cronin, A; Quail, M; Bray-Allen, S; Clark, L; Doggett, J; Hall, S; Kay, M; Lennard, N; McLay, K; Mayes, R; Pettett, A; Rajandream, M A; Lyne, M; Benes, V; Rechmann, S; Borkova, D; Blöcker, H; Scharfe, M; Grimm, M; Löhnert, T H; Dose, S; de Haan, M; Maarse, A; Schäfer, M; Müller-Auer, S; Gabel, C; Fuchs, M; Fartmann, B; Granderath, K; Dauner, D; Herzl, A; Neumann, S; Argiriou, A; Vitale, D; Liguori, R; Piravandi, E; Massenet, O; Quigley, F; Clabauld, G; Mündlein, A; Felber, R; Schnabl, S; Hiller, R; Schmidt, W; Lecharny, A; Aubourg, S; Chefdor, F; Cooke, R; Berger, C; Montfort, A; Casacuberta, E; Gibbons, T; Weber, N; Vandenbol, M; Bargues, M; Terol, J; Torres, A; Perez-Perez, A; Purnelle, B; Bent, E; Johnson, S; Tacon, D; Jesse, T; Heijnen, L; Schwarz, S; Scholler, P; Heber, S; Francs, P; Bielke, C; Frishman, D; Haase, D; Lemcke, K; Mewes, H W; Stocker, S; Zaccaria, P; Bevan, M; Wilson, R K; de la Bastide, M; Habermann, K; Parnell, L; Dedhia, N; Gnoj, L; Schutz, K; Huang, E; Spiegel, L; Sehkon, M; Murray, J; Sheet, P; Cordes, M; Abu-Threideh, J; Stoneking, T; Kalicki, J; Graves, T; Harmon, G; Edwards, J; Latreille, P; Courtney, L; Cloud, J; Abbott, A; Scott, K; Johnson, D; Minx, P; Bentley, D; Fulton, B; Miller, N; Greco, T; Kemp, K; Kramer, J; Fulton, L; Mardis, E; Dante, M; Pepin, K; Hillier, L; Nelson, J; Spieth, J; Ryan, E; Andrews, S; Geisel, C; Layman, D; Du, H; Ali, J; Berghoff, A; Jones, K; Drone, K; Cotton, M; Joshu, C; Antonoiu, B; Zidanic, M; Strong, C; Sun, H; Lamar, B; Yordan, C; Ma, P; Zhong, J; Preston, R; Vil, D; Shekher, M; Matero, A; Shah, R; Swaby, I K; O'Shaughnessy, A; Rodriguez, M; Hoffmann, J; Till, S; Granat, S; Shohdy, N; Hasegawa, A; Hameed, A; Lodhi, M; Johnson, A; Chen, E; Marra, M; Martienssen, R; McCombie, W R
1999-12-16
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
Varmanen, P; Rantanen, T; Palva, A
1996-12-01
A proline iminopeptidase gene (pepI) of an industrial Lactobacillus helveticus strain was cloned and found to be organized in an operon-like structure of three open reading frames (ORF1, ORF2 and ORF3). ORF1 was preceded by a typical prokaryotic promoter region, and a putative transcription terminator was found downstream of ORF3, identified as the pepI gene. Using primer-extension analyses, only one transcription start site, upstream of ORF1, was identifiable in the predicted operon. Although the size of mRNA could not be judged by Northern analysis either with ORF1-, ORF2- or pepI-specific probes, reverse transcription-PCR analyses further supported the operon structure of the three genes. ORF1, ORF2 and ORF3 had coding capacities for 50.7, 24.5 and 33.8 kDa proteins, respectively. The ORF3-encoded PepI protein showed 65% identity with the PepI proteins from Lactobacillus delbrueckii subsp. bulgaricus and Lactobacillus delbrueckii subsp. lactis. The ORF1-encoded protein had significant homology with several members of the ABC transporter family but, with two distinct putative ATP-binding sites, it would represent an unusual type among the bacterial ABC transporters. ORF2 encoded a putative integral membrane protein also characteristic of the ABC transporter family. The pepI gene was overexpressed in Escherichia coli. Purified PepI hydrolysed only di and tripeptides with proline in the first position. Optimum PepI activity was observed at pH 7.5 and 40 degrees C. A gel filtration analysis indicated that PepI is a dimer of M(r) 53,000. PepI was shown to be a metal-independent serine peptidase having thiol groups at or near the active site. Kinetic studies with proline-p-nitroanilide as substrate revealed Km and Vmax values of 0.8 mM and 350 mmol min-1 mg-1, respectively, and a very high turnover number of 135,000 s-1.
Identification of G-quadruplex forming sequences in three manatee papillomaviruses
Zahin, Maryam; Dean, William L.; Ghim, Shin-je; Joh, Joongho; Gray, Robert D.; Khanal, Sujita; Bossart, Gregory D.; Mignucci-Giannoni, Antonio A.; Rouchka, Eric C.; Jenson, Alfred B.; Trent, John O.; Chaires, Jonathan B.
2018-01-01
The Florida manatee (Trichechus manatus latirotris) is a threatened aquatic mammal in United States coastal waters. Over the past decade, the appearance of papillomavirus-induced lesions and viral papillomatosis in manatees has been a concern for those involved in the management and rehabilitation of this species. To date, three manatee papillomaviruses (TmPVs) have been identified in Florida manatees, one forming cutaneous lesions (TmPV1) and two forming genital lesions (TmPV3 and TmPV4). We identified DNA sequences with the potential to form G-quadruplex structures (G4) across the three genomes. G4 were located on both DNA strands and across coding and non-coding regions on all TmPVs, offering multiple targets for viral control. Although G4 have been identified in several viral genomes, including human PVs, most research has focused on canonical structures comprised of three G-tetrads. In contrast, the vast majority of sequences we identified would allow the formation of non-canonical structures with only two G-tetrads. Our biophysical analysis confirmed the formation of G4 with parallel topology in three such sequences from the E2 region. Two of the structures appear comprised of multiple stacked two G-tetrad structures, perhaps serving to increase structural stability. Computational analysis demonstrated enrichment of G4 sequences on all TmPVs on the reverse strand in the E2/E4 region and on both strands in the L2 region. Several G4 sequences occurred at similar regional locations on all PVs, most notably on the reverse strand in the E2 region. In other cases, G4 were identified at similar regional locations only on PVs forming genital lesions. On all TmPVs, G4 sequences were located in the non-coding region near putative E2 binding sites. Together, these findings suggest that G4 are possible regulatory elements in TmPVs. PMID:29630682
The organization of the posterior parietal cortex devoted to upper limb actions: An fMRI study
Ferri, Stefania; Rizzolatti, Giacomo
2015-01-01
Abstract The present fMRI study examined whether upper‐limb action classes differing in their motor goal are encoded by different PPC sectors. Action observation was used as a proxy for action execution. Subjects viewed actors performing object‐related (e.g., grasping), skin‐displacing (e.g., rubbing the skin), and interpersonal upper limb actions (e.g., pushing someone). Observation of the three action classes activated a three‐level network including occipito‐temporal, parietal, and premotor cortex. The parietal region common to observing all three action classes was located dorsally to the left intraparietal sulcus (DIPSM/DIPSA border). Regions specific for observing an action class were obtained by combining the interaction between observing action classes and stimulus types with exclusive masking for observing the other classes, while for regions considered preferentially active for a class the interaction was exclusively masked with the regions common to all observed actions. Left putative human anterior intraparietal was specific for observing manipulative actions, and left parietal operculum including putative human SII region, specific for observing skin‐displacing actions. Control experiments demonstrated that this latter activation depended on seeing the skin being moved and not simply on seeing touch. Psychophysiological interactions showed that the two specific parietal regions had similar connectivities. Finally, observing interpersonal actions preferentially activated a dorsal sector of left DIPSA, possibly the homologue of ventral intraparietal coding the impingement of the target person's body into the peripersonal space of the actor. These results support the importance of segregation according to the action class as principle of posterior parietal cortex organization for action observation and by implication for action execution. Hum Brain Mapp 36:3845–3866, 2015. © 2015 The Authors Human Brain Mapping Published by Wiley Periodicals, Inc. PMID:26129732
Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P
1988-02-01
Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P
1988-01-01
Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators. Images PMID:3257578
Bes, M T; Hernández, J A; Peleato, M L; Fillat, M F
2001-01-15
A gene coding for a Fur (ferric uptake regulation) protein from the cyanobacterium Anabaena PCC 7119 has been cloned and overexpressed in Escherichia coli. DNA sequence analysis confirmed the presence of a 151-amino-acid open reading frame that showed homology with the Fur proteins reported for the unicellular cyanobacteria Synechococcus 7942 and Synechocystis PCC 6803. Two putative Fur-binding sites were detected in the promoter regions of the fur gene from Anabaena. Partially purified recombinant Fur binds to the flavodoxin promoter as well as its own promoter. This suggests that the Fur gene is autoregulated in Anabaena.
Complete plastid genome of Astragalus mongholicus var. nakaianus (Fabaceae).
Choi, In-Su; Kim, Joo-Hwan; Choi, Byoung-Hee
2016-07-01
The first complete plastid genome (plastome) of the largest angiosperm genus, Astragalus, was sequenced for the Korean endangered endemic species A. mongholicus var. nakaianus. Its genome is relatively short (123,633 bp) because it lacks an Inverted Repeat (IR) region. It comprises 110 genes, including four unique rRNAs, 30 tRNAs, and 76 protein-coding genes. Similar to other closely related plastomes, rpl22 and rps16 are absent. The putative pseudogene with abnormal stop codons is atpE. This plastome has no additional inversions when compared with highly variable plastomes from IRLC tribes Fabeae and Trifolieae. Our phylogenetic analysis confirms the non-monophyly of Galegeae.
A novel polyomavirus from the nasal cavity of a giant panda (Ailuropoda melanoleuca).
Qi, Dunwu; Shan, Tongling; Liu, Zhijian; Deng, Xutao; Zhang, Zhihe; Bi, Wenlei; Owens, Jacob Robert; Feng, Feifei; Zheng, Lisong; Huang, Feng; Delwart, Eric; Hou, Rong; Zhang, Wen
2017-10-27
Polyomaviruses infect a wide variety of mammalian and avian hosts with a broad spectrum of outcomes including asymptomatic infection, acute systemic disease, and tumor induction. Viral metagenomics and general PCR methods were used to detected viral nucleic acid in the samples from a diseased and healthy giant pandas. A novel polyomavirus, the giant panda polyomavirus 1 (GPPyV1) from the nasal cavity of a dead giant panda (Ailuropoda melanoleuca) was characterized. The GPPyV1 genome is 5144 bp in size and reveals five putative open-reading frames coding for the classic small and large T antigens in the early region, and the VP1, VP2 and VP3 capsid proteins in the late region. Phylogenetic analyses of the large T antigen of the GPPyV1 indicated GPPyV1 belonged to a putative new species within genus Deltapolyomavirus, clustering with four human polyomavirus species. The GPPyV1 VP1 and VP2 clustered with genus Alphapolyomavirus. Our epidemiologic study indicated that this novel polyomavirus was also detected in nasal swabs and fecal samples collected from captive healthy giant pandas. A novel polyomavirus was detected in giant pandas and its complete genome was characterized, which may cause latency infection in giant pandas.
2014-01-01
Background Protein coding genes account for only about 2% of the human genome, whereas the vast majority of transcripts are non-coding RNAs including long non-coding RNAs. A growing volume of literature has proposed that lncRNAs are important players in cancer. HOTAIR was previously shown to be an oncogene and negative prognostic factor in a variety of cancers. However, the factors that contribute to its upregulation and the interaction between HOTAIR and miRNAs are largely unknown. Methods A computational screen of HOTAIR promoter was conducted to search for transcription-factor-binding sites. HOTAIR promoter activities were examined by luciferase reporter assay. The function of the c-Myc binding site in the HOTAIR promoter region was tested by a promoter assay with nucleotide substitutions in the putative E-box. The association of c-Myc with the HOTAIR promoter in vivo was confirmed by chromatin immunoprecipitation assay and Electrophoretic mobility shift assay. A search for miRNAs with complementary base paring with HOTAIR was performed utilizing online software program. Gain and loss of function approaches were employed to investigate the expression changes of HOTAIR or miRNA-130a. The expression levels of HOTAIR, c-Myc and miRNA-130a were examined in 65 matched pairs of gallbladder cancer tissues. The effects of HOTAIR and miRNA-130a on gallbladder cancer cell invasion and proliferation was tested using in vitro cell invasion and flow cytometric assays. Results We demonstrate that HOTAIR is a direct target of c-Myc through interaction with putative c-Myc target response element (RE) in the upstream region of HOTAIR in gallbladder cancer cells. A positive correlation between c-Myc and HOTAIR mRNA levels was observed in gallbladder cancer tissues. We predicted that HOTAIR harbors a miRNA-130a binding site. Our data showed that this binding site is vital for the regulation of miRNA-130a by HOTAIR. Moreover, a negative correlation between HOTAIR and miRNA-130a was observed in gallbladder cancer tissues. Finally, we demonstrate that the oncogenic activity of HOTAIR is in part through its negative regulation of miRNA-130a. Conclusion Together, these results suggest that HOTAIR is a c-Myc-activated driver of malignancy, which acts in part through repression of miRNA-130a. PMID:24953832
Foox, Jonathan; Brugler, Mercer; Siddall, Mark Edward; Rodríguez, Estefanía
2016-07-01
Six complete and three partial actiniarian mitochondrial genomes were amplified in two semi-circles using long-range PCR and pyrosequenced in a single run on a 454 GS Junior, doubling the number of complete mitogenomes available within the order. Typical metazoan mtDNA features included circularity, 13 protein-coding genes, 2 ribosomal RNA genes, and length ranging from 17,498 to 19,727 bp. Several typical anthozoan mitochondrial genome features were also observed including the presence of only two transfer RNA genes, elevated A + T richness ranging from 54.9 to 62.4%, large intergenic regions, and group 1 introns interrupting NADH dehydrogenase subunit 5 and cytochrome c oxidase subunit I, the latter of which possesses a homing endonuclease gene. Within the sea anemone Alicia sansibarensis, we report the first mitochondrial gene order rearrangement within the Actiniaria, as well as putative novel non-canonical protein-coding genes. Phylogenetic analyses of all 13 protein-coding and 2 ribosomal genes largely corroborated current hypotheses of sea anemone interrelatedness, with a few lower-level differences.
Flot, Jean-François; Tillier, Simon
2007-10-15
The complete mitochondrial genomes of two individuals attributed to different morphospecies of the scleractinian coral genus Pocillopora have been sequenced. Both genomes, respectively 17,415 and 17,422 nt long, share the presence of a previously undescribed ORF encoding a putative protein made up of 302 amino acids and of unknown function. Surprisingly, this ORF turns out to be the second most variable region of the mitochondrial genome (1% nucleotide sequence difference between the two individuals) after the putative control region (1.5% sequence difference). Except for the presence of this ORF and for the location of the putative control region, the mitochondrial genome of Pocillopora is organized in a fashion similar to the other scleractinian coral genomes published to date. For the first time in a cnidarian, a putative second origin of replication is described based on its secondary structure similar to the stem-loop structure of O(L), the origin of L-strand replication in vertebrates.
Yerrapragada, Shaila; Shukla, Animesh; Hallsworth-Pepin, Kymberlie; Choi, Kwangmin; Wollam, Aye; Clifton, Sandra; Qin, Xiang; Muzny, Donna; Raghuraman, Sriram; Ashki, Haleh; Uzman, Akif; Highlander, Sarah K.; Fryszczyn, Bartlomiej G.; Fox, George E.; Tirumalai, Madhan R.; Liu, Yamei; Kim, Sun
2015-01-01
Tolypothrix sp. PCC 7601 is a freshwater filamentous cyanobacterium with complex responses to environmental conditions. Here, we present its 9.96-Mbp draft genome sequence, containing 10,065 putative protein-coding sequences, including 305 predicted two-component system proteins and 27 putative phytochrome-class photoreceptors, the most such proteins in any sequenced genome. PMID:25953173
James, Timothy Y; Marino, John A; Perfecto, Ivette; Vandermeer, John
2016-01-15
The interaction of crop pests with their natural enemies is a fundament to their control. Natural enemies of fungal pathogens of crops are poorly known relative to those of insect pests, despite the diversity of fungal pathogens and their economic importance. Currently, many regions across Latin America are experiencing unprecedented epidemics of coffee rust (Hemileia vastatrix). Identification of natural enemies of coffee rust could aid in developing management strategies or in pinpointing species that could be used for biocontrol. In the present study, we characterized fungal communities associated with coffee rust lesions by single-molecule DNA sequencing of fungal rRNA gene bar codes from leaf discs (≈28 mm(2)) containing rust lesions and control discs with no rust lesions. The leaf disc communities were hyperdiverse in terms of fungi, with up to 69 operational taxonomic units (putative species) per control disc, and the diversity was only slightly reduced in rust-infected discs, with up to 63 putative species. However, geography had a greater influence on the fungal community than whether the disc was infected by coffee rust. Through comparisons between control and rust-infected leaf discs, as well as taxonomic criteria, we identified 15 putative mycoparasitic fungi. These fungi are concentrated in the fungal family Cordycipitaceae and the order Tremellales. These data emphasize the complexity of diverse fungi of unknown ecological function within a leaf that might influence plant disease epidemics or lead to the development of species for biocontrol of fungal disease. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Fan, SiGang; Hu, ChaoQun; Wen, Jing; Zhang, LvPing
2011-05-01
The complete mitochondrial DNA sequence contains useful information for phylogenetic analyses of metazoa. In this study, the complete mitochondrial DNA sequence of sea cucumber Stichopus horrens (Holothuroidea: Stichopodidae: Stichopus) is presented. The complete sequence was determined using normal and long PCRs. The mitochondrial genome of Stichopus horrens is a circular molecule 16257 bps long, composed of 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes. Most of these genes are coded on the heavy strand except for one protein-coding gene (nad6) and five tRNA genes (tRNA ( Ser(UCN) ), tRNA ( Gln ), tRNA ( Ala ), tRNA ( Val ), tRNA ( Asp )) which are coded on the light strand. The composition of the heavy strand is 30.8% A, 23.7% C, 16.2% G, and 29.3% T bases (AT skew=0.025; GC skew=-0.188). A non-coding region of 675 bp was identified as a putative control region because of its location and AT richness. The intergenic spacers range from 1 to 50 bp in size, totaling 227 bp. A total of 25 overlapping nucleotides, ranging from 1 to 10 bp in size, exist among 11 genes. All 13 protein-coding genes are initiated with an ATG. The TAA codon is used as the stop codon in all the protein coding genes except nad3 and nad4 that use TAG as their termination codon. The most frequently used amino acids are Leu (16.29%), Ser (10.34%) and Phe (8.37%). All of the tRNA genes have the potential to fold into typical cloverleaf secondary structures. We also compared the order of the genes in the mitochondrial DNA from the five holothurians that are now available and found a novel gene arrangement in the mitochondrial DNA of Stichopus horrens.
Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian
2016-12-01
Despite the high prevalence and impact to Chilean salmon aquaculture of the intracellular bacterium Piscirickettsia salmonis, the molecular underpinnings of host-pathogen interactions remain unclear. Herein, the interplay of coding and non-coding transcripts has been proposed as a key mechanism involved in immune response. Therefore, the aim of this study was to evidence how coding and non-coding transcripts are modulated during the infection process of Atlantic salmon with P. salmonis. For this, RNA-seq was conducted in brain, spleen, and head kidney samples, revealing different transcriptional profiles according to bacterial load. Additionally, while most of the regulated genes annotated for diverse biological processes during infection, a common response associated with clathrin-mediated endocytosis and iron homeostasis was present in all tissues. Interestingly, while endocytosis-promoting factors and clathrin inductions were upregulated, endocytic receptors were mainly downregulated. Furthermore, the regulation of genes related to iron homeostasis suggested an intracellular accumulation of iron, a process in which heme biosynthesis/degradation pathways might play an important role. Regarding the non-coding response, 918 putative long non-coding RNAs were identified, where 425 were newly characterized for S. salar. Finally, co-localization and co-expression analyses revealed a strong correlation between the modulations of long non-coding RNAs and genes associated with endocytosis and iron homeostasis. These results represent the first comprehensive study of putative interplaying mechanisms of coding and non-coding RNAs during bacterial infection in salmonids. Copyright © 2016 Elsevier Ltd. All rights reserved.
Yerrapragada, Shaila; Shukla, Animesh; Hallsworth-Pepin, Kymberlie; Choi, Kwangmin; Wollam, Aye; Clifton, Sandra; Qin, Xiang; Muzny, Donna; Raghuraman, Sriram; Ashki, Haleh; Uzman, Akif; Highlander, Sarah K; Fryszczyn, Bartlomiej G; Fox, George E; Tirumalai, Madhan R; Liu, Yamei; Kim, Sun; Kehoe, David M; Weinstock, George M
2015-05-07
Tolypothrix sp. PCC 7601 is a freshwater filamentous cyanobacterium with complex responses to environmental conditions. Here, we present its 9.96-Mbp draft genome sequence, containing 10,065 putative protein-coding sequences, including 305 predicted two-component system proteins and 27 putative phytochrome-class photoreceptors, the most such proteins in any sequenced genome. Copyright © 2015 Yerrapragada et al.
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.
Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E
1982-01-01
We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673
Ferreira, Dalila Souza Santos; Kato, Rodrigo Bentes; Miranda, Fábio Malcher; da Costa Pinheiro, Kenny; Fonseca, Paula Luize Camargos; Tomé, Luiz Marcelo Ribeiro; Vaz, Aline Bruna Martins; Badotti, Fernanda; Ramos, Rommel Thiago Jucá; Brenig, Bertram; Azevedo, Vasco Ariston de Carvalho; Benevides, Raquel Guimarães; Góes-Neto, Aristóteles
2018-06-01
Herein, we present the draft genome of Trametes villosa isolate CCMB561, a wood-decaying Basidiomycota commonly found in tropical semiarid climate. The genome assembly was 57.98 Mb in size with an L50 of 691. A total of 16,711 putative protein-encoding genes was predicted, including 590 genes coding for carbohydrate-active enzymes (CAZy), directly involved in the decomposition of lignocellulosic materials. This is the first genome of this species of high interest in bioenergy research. The draft genome of Trametes villosa isolate CCMB561 will provide an important resource for future investigations in biofuel production, bioremediation and other green technologies.
Biodegradation of the Organic Disulfide 4,4′-Dithiodibutyric Acid by Rhodococcus spp.
Khairy, Heba; Wübbeler, Jan Hendrik
2015-01-01
Four Rhodococcus spp. exhibited the ability to use 4,4′-dithiodibutyric acid (DTDB) as a sole carbon source for growth. The most important step for the production of a novel polythioester (PTE) using DTDB as a precursor substrate is the initial cleavage of DTDB. Thus, identification of the enzyme responsible for this step was mandatory. Because Rhodococcus erythropolis strain MI2 serves as a model organism for elucidation of the biodegradation of DTDB, it was used to identify the genes encoding the enzymes involved in DTDB utilization. To identify these genes, transposon mutagenesis of R. erythropolis MI2 was carried out using transposon pTNR-TA. Among 3,261 mutants screened, 8 showed no growth with DTDB as the sole carbon source. In five mutants, the insertion locus was mapped either within a gene coding for a polysaccharide deacetyltransferase, a putative ATPase, or an acetyl coenzyme A transferase, 1 bp upstream of a gene coding for a putative methylase, or 176 bp downstream of a gene coding for a putative kinase. In another mutant, the insertion was localized between genes encoding a putative transcriptional regulator of the TetR family (noxR) and an NADH:flavin oxidoreductase (nox). Moreover, in two other mutants, the insertion loci were mapped within a gene encoding a hypothetical protein in the vicinity of noxR and nox. The interruption mutant generated, R. erythropolis MI2 noxΩtsr, was unable to grow with DTDB as the sole carbon source. Subsequently, nox was overexpressed and purified, and its activity with DTDB was measured. The specific enzyme activity of Nox amounted to 1.2 ± 0.15 U/mg. Therefore, we propose that Nox is responsible for the initial cleavage of DTDB into 2 molecules of 4-mercaptobutyric acid (4MB). PMID:26407888
Multiple copies of a bile acid-inducible gene in Eubacterium sp. strain VPI 12708.
Gopal-Srivastava, R; Mallonee, D H; White, W B; Hylemon, P B
1990-01-01
Eubacterium sp. strain VPI 12708 is an anaerobic intestinal bacterium which possesses inducible bile acid 7-dehydroxylation activity. Several new polypeptides are produced in this strain following induction with cholic acid. Genes coding for two copies of a bile acid-inducible 27,000-dalton polypeptide (baiA1 and baiA2) have been previously cloned and sequenced. We now report on a gene coding for a third copy of this 27,000-dalton polypeptide (baiA3). The baiA3 gene has been cloned in lambda DASH on an 11.2-kilobase DNA fragment from a partial Sau3A digest of the Eubacterium DNA. DNA sequence analysis of the baiA3 gene revealed 100% homology with the baiA1 gene within the coding region of the 27,000-dalton polypeptides. The baiA2 gene shares 81% sequence identity with the other two genes at the nucleotide level. The flanking nucleotide sequences associated with the baiA1 and baiA3 genes are identical for 930 bases in the 5' direction from the initiation codon and for at least 325 bases in the 3' direction from the stop codon, including the putative promoter regions for the genes. An additional open reading frame (occupying from 621 to 648 bases, depending on the correct start codon) was found in the identical 5' regions associated with the baiA1 and baiA3 clones. The 5' sequence 930 bases upstream from the baiA1 and baiA3 genes was totally divergent. The baiA2 gene, which is part of a large bile acid-inducible operon, showed no homology with the other two genes either in the 5' or 3' direction from the polypeptide coding region, except for a 15-base-pair presumed ribosome-binding site in the 5' region. These studies strongly suggest that a gene duplication (baiA1 and baiA3) has occurred and is stably maintained in this bacterium. Images PMID:2376563
Fanning, T; Singer, M
1987-01-01
Recent work suggests that one or more members of the highly repeated LINE-1 (L1) DNA family found in all mammals may encode one or more proteins. Here we report the sequence of a portion of an L1 cloned from the domestic cat (Felis catus). These data permit comparison of the L1 sequences in four mammalian orders (Carnivore, Lagomorph, Rodent and Primate) and the comparison supports the suggested coding potential. In two separate, noncontiguous regions in the carboxy terminal half of the proteins predicted from the DNA sequences, there are several strongly conserved segments. In one region, these share homology with known or suspected reverse transcriptases, as described by others in rodents and primates. In the second region, closer to the carboxy terminus, the strongly conserved segments are over 90% homologous among the four orders. One of the latter segments is cysteine rich and resembles the putative metal binding domains of nucleic acid binding proteins, including those of TFIIIA and retroviruses. PMID:3562227
Raju, Hemalatha B.; Tsinoremas, Nicholas F.; Capobianco, Enrico
2016-01-01
Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein–protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches. PMID:27803687
Raju, Hemalatha B; Tsinoremas, Nicholas F; Capobianco, Enrico
2016-01-01
Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein-protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches.
The putative protein methyltransferase LAE1 controls cellulase gene expression in Trichoderma reesei
Seiboth, Bernhard; Karimi, Razieh Aghcheh; Phatale, Pallavi A; Linke, Rita; Hartl, Lukas; Sauer, Dominik G; Smith, Kristina M; Baker, Scott E; Freitag, Michael; Kubicek, Christian P
2012-01-01
Summary Trichoderma reesei is an industrial producer of enzymes that degrade lignocellulosic polysaccharides to soluble monomers, which can be fermented to biofuels. Here we show that the expression of genes for lignocellulose degradation are controlled by the orthologous T. reesei protein methyltransferase LAE1. In a lae1 deletion mutant we observed a complete loss of expression of all seven cellulases, auxiliary factors for cellulose degradation, β-glucosidases and xylanases were no longer expressed. Conversely, enhanced expression of lae1 resulted in significantly increased cellulase gene transcription. Lae1-modulated cellulase gene expression was dependent on the function of the general cellulase regulator XYR1, but also xyr1 expression was LAE1-dependent. LAE1 was also essential for conidiation of T. reesei. Chromatin immunoprecipitation followed by high-throughput sequencing (‘ChIP-seq’) showed that lae1 expression was not obviously correlated with H3K4 di- or trimethylation (indicative of active transcription) or H3K9 trimethylation (typical for heterochromatin regions) in CAZyme coding regions, suggesting that LAE1 does not affect CAZyme gene expression by directly modulating H3K4 or H3K9 methylation. Our data demonstrate that the putative protein methyltransferase LAE1 is essential for cellulase gene expression in T. reesei through mechanisms that remain to be identified. PMID:22554051
A long natural-antisense RNA is accumulated in the conidia of Aspergillus oryzae.
Tsujii, Masaru; Okuda, Satoshi; Ishi, Kazutomo; Madokoro, Kana; Takeuchi, Michio; Yamagata, Youhei
2016-01-01
Analysis of expressed sequence tag libraries from various culture conditions revealed the existence of conidia-specific transcripts assembled to putative conidiation-specific reductase gene (csrA) in Aspergillus oryzae. However, the all transcripts were transcribed with opposite direction to the gene csrA. The sequence analysis of the transcript revealed that the RNA overlapped mRNA of csrA with 3'-end, and did not code protein longer than 60 amino acid residues. We designated the transcript Conidia Specific Long Natural-antisense RNA (CSLNR). The real-time PCR analysis demonstrated that the CSLNR is conidia-specific transcript, which cannot be transcribed in the absence of brlA, and the amount of CSLNR was much more than that of the transcript from csrA in conidia. Furthermore, the csrA deletion, also lacking coding region of CSLNR in A. oryzae reduced the number of conidia. Overexpression of CsrA demonstrated the inhibition of growth and conidiation, while CSLNR did not affect conidiation.
Liu, Yan-Hua; Liu, Xin-Xin; Zhang, Ming-Hai
2016-07-01
Sika deer (Cervus nippon Temminck 1836) are classified in the order Artiodactyla, family Cervidae, subfamily Cervinae. At present, the phylogenetic studies of C. nippon are problematic. In this study, we first determined and described the complete mitochondrial sequence of the wild C. nippon hortulorum. The complete mitogenome sequence is 16 566 bp in length, including 13 protein-coding genes, two rRNA genes, 22 tRNA genes, a putative control region (CR) and a light-strand replication origin (OL). The overall base composition was 33.4% A, 28.6% T, 24.5% C, 13.5% G, with a 62.0% AT bias. The 13 protein-coding genes encode 3782 amino acids in total. To further validate the new determined sequences and phylogeny of Sika deer, phylogenetic trees involving 15 most closely related species available in GenBank database were constructed. These results are expected to provide useful molecular data for deer species identification and further phylogenetic studies of Artiodactyla.
MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity
Wang, Yupeng; Tang, Haibao; DeBarry, Jeremy D.; Tan, Xu; Li, Jingping; Wang, Xiyin; Lee, Tae-ho; Jin, Huizhe; Marler, Barry; Guo, Hui; Kissinger, Jessica C.; Paterson, Andrew H.
2012-01-01
MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/. PMID:22217600
Zhao, Guangyu; Li, Hu; Zhao, Ping; Cai, Wanzhi
2015-01-01
In this study, we sequenced four new mitochondrial genomes and presented comparative mitogenomic analyses of five species in the genus Peirates (Hemiptera: Reduviidae). Mitochondrial genomes of these five assassin bugs had a typical set of 37 genes and retained the ancestral gene arrangement of insects. The A+T content, AT- and GC-skews were similar to the common base composition biases of insect mtDNA. Genomic size ranges from 15,702 bp to 16,314 bp and most of the size variation was due to length and copy number of the repeat unit in the putative control region. All of the control region sequences included large tandem repeats present in two or more copies. Our result revealed similarity in mitochondrial genomes of P. atromaculatus, P. fulvescens and P. turpis, as well as the highly conserved genomic-level characteristics of these three species, e.g., the same start and stop codons of protein-coding genes, conserved secondary structure of tRNAs, identical location and length of non-coding and overlapping regions, and conservation of structural elements and tandem repeat unit in control region. Phylogenetic analyses also supported a close relationship between P. atromaculatus, P. fulvescens and P. turpis, which might be recently diverged species. The present study indicates that mitochondrial genome has important implications on phylogenetics, population genetics and speciation in the genus Peirates. PMID:25689825
Li, S.-F.; Xu, J.-W.; Yang, Q.-L.; Wang, C.H.; Chen, Q.; Chapman, D.C.; Lu, G.
2009-01-01
Based upon morphological characters, Silver carp Hypophthalmichthys molitrix and bighead carp Hypophthalmichthys nobilis (or Aristichthys nobilis) have been classified into either the same genus or two distinct genera. Consequently, the taxonomic relationship of the two species at the generic level remains equivocal. This issue is addressed by sequencing complete mitochondrial genomes of H. molitrix and H. nobilis, comparing their mitogenome organization, structure and sequence similarity, and conducting a comprehensive phylogenetic analysis of cyprinid species. As with other cyprinid fishes, the mitogenomes of the two species were structurally conserved, containing 37 genes including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA (tRNAs) genes and a putative control region (D-loop). Sequence similarity between the two mitogenomes varied in different genes or regions, being highest in the tRNA genes (98??8%), lowest in the control region (89??4%) and intermediate in the protein-coding genes (94??2%). Analyses of the sequence comparison and phylogeny using concatenated protein sequences support the view that the two species belong to the genus Hypophthalmichthys. Further studies using nuclear markers and involving more closely related species, and the systematic combination of traditional biology and molecular biology are needed in order to confirm this conclusion. ?? 2009 The Fisheries Society of the British Isles.
Shitara, M; Tsuboi, Y; Sekizuka, T; Tazumi, A; Moorei, J E; Millar, B C; Taneike, I; Matsuda, M
2008-01-01
Nucleotide sequences of approximately 3.1 kbp consisting of the full-length open reading frame (ORF) for grpE, a non-coding (NC) region and a putative ORF for the full-length dnaK gene (1860 bp) were identified from a urease-positive thermophilic Campylobacter (UPTC) CF89-12 isolate. Then, following the construction of a new degenerate polymerase chain reaction (PCR) primer pair for amplification of the dnaK structural gene, including the transcription terminator region of C. lari isolates, the dnaK region was amplified successfully, TA-cloned and sequenced in nine C. lari isolates. The dnaK gene sequences commenced with an ATG and terminated with a TAA in all 10 isolates, including CF89-12. In addition, the putative ORFs for the dnaK gene locus from seven UPTC isolates consisted of 1860 bases, and the four urease-negative (UN) C. lari isolates included C. lari RM2100 reference strain 1866. Interestingly, different probable ribosome binding sites and hypothetically intrinsic p-independent terminator structures were identified between the seven UPTC and four UN C. lari isolates, respectively. Moreover, it is interesting to note that 20 out of a total of 28 polymorphic sites occurred among amino acid sequences of the dnaK ORF from 11 C. lari isolates, identified to be alternatively UPTC-specific or UN C. lari-specific. In the neighbour-joining tree based on the nucleotide sequence information of the dnaK gene, C. lari forms two major distinct clusters consisting of UPTC and UN C. lari isolates, respectively, with UN C. lari being more closely related to other thermophilic campylobacters than to UPTC.
Dalla Valle, Luisa; Nardi, Alessia; Belvedere, Paola; Toni, Mattia; Alibardi, Lorenzo
2007-07-01
Beta-keratins of reptilian scales have been recently cloned and characterized in some lizards. Here we report for the first time the sequence of some beta-keratins from the snake Elaphe guttata. Five different cDNAs were obtained using 5'- and 3'-RACE analyses. Four sequences differ by only few nucleotides in the coding region, whereas the last cDNA shows, in this region, only 84% of identity. The gene corresponding to one of the cDNA sequences has a single intron present in the 5'-untranslated region. This genomic organization is similar to that of birds' beta-keratins. Cloning and Southern blotting analysis suggest that snake beta-keratins belong to a family of high-related genes as for geckos. PCR analysis suggests a head-to-tail orientation of genes in the same chromosome. In situ hybridization detected beta-keratin transcripts almost exclusively in differentiating oberhautchen and beta-cells of the snake epidermis in renewal phase. This is confirmed by Northern blotting that showed, in this phase, a high expression of two different transcripts whereas only the longer transcript is expressed at a much lower level in resting skin. The cDNA coding sequences encoded putative glycine-proline-serine rich proteins containing 137-139 amino acids, with apparent isoelectric point at 7.5 and 8.2. A central region, rich in proline, shows over 50% homology with avian scale, claw, and feather keratins. The prediction of secondary structure shows mainly a random coil conformation and few beta-strand regions in the central region, likely involved in the formation of a fibrous framework of beta-keratins. This region was possibly present in basic reptiles that originated reptiles and birds. Copyright 2007 Wiley-Liss, Inc.
Dweep, Harsh; Sticht, Carsten; Pandey, Priyanka; Gretz, Norbert
2011-10-01
MicroRNAs are small, non-coding RNA molecules that can complementarily bind to the mRNA 3'-UTR region to regulate the gene expression by transcriptional repression or induction of mRNA degradation. Increasing evidence suggests a new mechanism by which miRNAs may regulate target gene expression by binding in promoter and amino acid coding regions. Most of the existing databases on miRNAs are restricted to mRNA 3'-UTR region. To address this issue, we present miRWalk, a comprehensive database on miRNAs, which hosts predicted as well as validated miRNA binding sites, information on all known genes of human, mouse and rat. All mRNAs, mitochondrial genes and 10 kb upstream flanking regions of all known genes of human, mouse and rat were analyzed by using a newly developed algorithm named 'miRWalk' as well as with eight already established programs for putative miRNA binding sites. An automated and extensive text-mining search was performed on PubMed database to extract validated information on miRNAs. Combined information was put into a MySQL database. miRWalk presents predicted and validated information on miRNA-target interaction. Such a resource enables researchers to validate new targets of miRNA not only on 3'-UTR, but also on the other regions of all known genes. The 'Validated Target module' is updated every month and the 'Predicted Target module' is updated every 6 months. miRWalk is freely available at http://mirwalk.uni-hd.de/. Copyright © 2011 Elsevier Inc. All rights reserved.
Hirata, Hisae; Yamaji, Yasuyuki; Komatsu, Ken; Kagiwada, Satoshi; Oshima, Kenro; Okano, Yukari; Takahashi, Shuichiro; Ugaki, Masashi; Namba, Shigetou
2010-09-01
The first open-reading frame (ORF) of the genus Capillovirus encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP), while other viruses in the family Flexiviridae have separate ORFs encoding these proteins. To investigate the role of the full-length ORF1 polyprotein of capillovirus, we generated truncation mutants of ORF1 of apple stem grooving virus by inserting a termination codon into the variable region located between the putative Rep- and CP-coding regions. These mutants were capable of systemic infection, although their pathogenicity was attenuated. In vitro translation of ORF1 produced both the full-length polyprotein and the smaller Rep protein. The results of in vivo reporter assays suggested that the mechanism of this early termination is a ribosomal -1 frame-shift occurring downstream from the conserved Rep domains. The mechanism of capillovirus gene expression and the very close evolutionary relationship between the genera Capillovirus and Trichovirus are discussed. Copyright (c) 2010. Published by Elsevier B.V.
Gallardo-Escárate, Cristian; Valenzuela-Muñoz, Valentina; Nuñez-Acuña, Gustavo; Chávez-Mardones, Jacqueline; Maldonado-Aguayo, Waleska
2014-02-15
The couch potato (CPO) protein is a key biomolecule involved in regulating diapause through the RNA-binding process of the peripheral and central nervous systems in insects and also recently discovered in a few crustacean species. As such, ectoparasitic copepods are interesting model species that have no evidence of developmental arrest. The present study is the first to report on the cloning of a putative CPO gene from the salmon louse Caligus rogercresseyi (CrCPO), as identified by high-throughput transcriptome sequencing. In addition, the transcription expression in larvae and adults was evaluated using quantitative real-time PCR. The CrCPO cDNA sequence showed 3261 base pairs (bp), consisting of 713bp of 5' UTR, 1741bp of 3' UTR, and an open reading frame of 807bp encoding for 268 amino acids. The highly conserved RNA binding regions RNP2 (LFVSGL) and RNP1 (SPVGFVTF), as well the dimerization site (LEF), were also found. Furthermore, eight single nucleotide polymorphisms located in the untranslated regions and one located in the coding region were detected. Gene transcription analysis revealed that CrCPO has ubiquitous expression across larval stages and in adult individuals, with the highest expression from nauplius to copepodid stages. The present study suggests a putative biological function of CrCPO associated with the development of the nervous system in salmon lice and contributes molecular evidence for candidate genes related to host-parasite interactions. Copyright © 2013 Elsevier B.V. All rights reserved.
Jeukens, Julie; Bernatchez, Louis
2012-01-01
While gene expression divergence is known to be involved in adaptive phenotypic divergence and speciation, the relative importance of regulatory and structural evolution of genes is poorly understood. A recent next-generation sequencing experiment allowed identifying candidate genes potentially involved in the ongoing speciation of sympatric dwarf and normal lake whitefish (Coregonus clupeaformis), such as cytosolic malate dehydrogenase (MDH1), which showed both significant expression and sequence divergence. The main goal of this study was to investigate into more details the signatures of natural selection in the regulatory and coding sequences of MDH1 in lake whitefish and test for parallelism of these signatures with other coregonine species. Sequencing of the two regions in 118 fish from four sympatric pairs of whitefish and two cisco species revealed a total of 35 single nucleotide polymorphisms (SNPs), with more genetic diversity in European compared to North American coregonine species. While the coding region was found to be under purifying selection, an SNP in the proximal promoter exhibited significant allele frequency divergence in a parallel manner among independent sympatric pairs of North American lake whitefish and European whitefish (C. lavaretus). According to transcription factor binding simulation for 22 regulatory haplotypes of MDH1, putative binding profiles were fairly conserved among species, except for the region around this SNP. Moreover, we found evidence for the role of this SNP in the regulation of MDH1 expression level. Overall, these results provide further evidence for the role of natural selection in gene regulation evolution among whitefish species pairs and suggest its possible link with patterns of phenotypic diversity observed in coregonine species. PMID:22408741
Jeukens, Julie; Bernatchez, Louis
2012-01-01
While gene expression divergence is known to be involved in adaptive phenotypic divergence and speciation, the relative importance of regulatory and structural evolution of genes is poorly understood. A recent next-generation sequencing experiment allowed identifying candidate genes potentially involved in the ongoing speciation of sympatric dwarf and normal lake whitefish (Coregonus clupeaformis), such as cytosolic malate dehydrogenase (MDH1), which showed both significant expression and sequence divergence. The main goal of this study was to investigate into more details the signatures of natural selection in the regulatory and coding sequences of MDH1 in lake whitefish and test for parallelism of these signatures with other coregonine species. Sequencing of the two regions in 118 fish from four sympatric pairs of whitefish and two cisco species revealed a total of 35 single nucleotide polymorphisms (SNPs), with more genetic diversity in European compared to North American coregonine species. While the coding region was found to be under purifying selection, an SNP in the proximal promoter exhibited significant allele frequency divergence in a parallel manner among independent sympatric pairs of North American lake whitefish and European whitefish (C. lavaretus). According to transcription factor binding simulation for 22 regulatory haplotypes of MDH1, putative binding profiles were fairly conserved among species, except for the region around this SNP. Moreover, we found evidence for the role of this SNP in the regulation of MDH1 expression level. Overall, these results provide further evidence for the role of natural selection in gene regulation evolution among whitefish species pairs and suggest its possible link with patterns of phenotypic diversity observed in coregonine species.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lin, Biaoyang; Nasir, J.; Kalchman, M.A.
1995-02-10
We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5{prime} genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphismmore » in the HD gene were identified. Comparison of 940-bp sequence 5{prime} to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5{prime} to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.« less
Gillot, Guillaume; Jany, Jean-Luc; Dominguez-Santos, Rebeca; Poirier, Elisabeth; Debaets, Stella; Hidalgo, Pedro I; Ullán, Ricardo V; Coton, Emmanuel; Coton, Monika
2017-04-01
Mycophenolic acid (MPA) is a secondary metabolite produced by various Penicillium species including Penicillium roqueforti. The MPA biosynthetic pathway was recently described in Penicillium brevicompactum. In this study, an in silico analysis of the P. roqueforti FM164 genome sequence localized a 23.5-kb putative MPA gene cluster. The cluster contains seven genes putatively coding seven proteins (MpaA, MpaB, MpaC, MpaDE, MpaF, MpaG, MpaH) and is highly similar (i.e. gene synteny, sequence homology) to the P. brevicompactum cluster. To confirm the involvement of this gene cluster in MPA biosynthesis, gene silencing using RNA interference targeting mpaC, encoding a putative polyketide synthase, was performed in a high MPA-producing P. roqueforti strain (F43-1). In the obtained transformants, decreased MPA production (measured by LC-Q-TOF/MS) was correlated to reduced mpaC gene expression by Q-RT-PCR. In parallel, mycotoxin quantification on multiple P. roqueforti strains suggested strain-dependent MPA-production. Thus, the entire MPA cluster was sequenced for P. roqueforti strains with contrasted MPA production and a 174bp deletion in mpaC was observed in low MPA-producers. PCRs directed towards the deleted region among 55 strains showed an excellent correlation with MPA quantification. Our results indicated the clear involvement of mpaC gene as well as surrounding cluster in P. roqueforti MPA biosynthesis. Copyright © 2016 Elsevier Ltd. All rights reserved.
He, Zhang-Ping; Dai, Xia-Bin; Zhang, Shuai; Zhi, Ting-Ting; Lun, Zhao-Rong; Wu, Zhong-Dao; Yang, Ting-Bao
2016-01-01
The whole sequence (15,057 bp) of the mitochondrial DNA (mtDNA) of the terrestrial snail Achatina fulica (order Stylommatophora) was determined. The mitogenome, as the typical metazoan mtDNA, contains 13 protein-coding genes (PCG), 2 ribosomal RNA genes (rRNA) and 22 transfer RNA genes (tRNA). The tRNA genes include two trnS without standard secondary structure. Interestingly, among the known mitogenomes of Pulmonata species, we firstly characterized an unassigned lengthy sequence (551 bp) between the cox1 and the trnV which may be the CR for the sake of its AT bases usage bias (65.70%) and potential hairpin structure.
Spliced DNA Sequences in the Paramecium Germline: Their Properties and Evolutionary Potential
Catania, Francesco; McGrath, Casey L.; Doak, Thomas G.; Lynch, Michael
2013-01-01
Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps, best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic—often coding—DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation. PMID:23737328
Molecular cloning and characterization of a gene encoding glutaminase from Aspergillus oryzae.
Koibuchi, K; Nagasaki, H; Yuasa, A; Kataoka, J; Kitamoto, K
2000-07-01
A glutaminase from Aspergillus oryzae was purified and its molecular weight was determined to be 82,091 by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Purified glutaminase catalysed the hydrolysis not only of L-glutamine but also of D-glutamine. Both the molecular weight and the substrate specificity of this glutaminase were different from those reported previously [Yano et al. (1998) J Ferment Technol 66: 137-143]. On the basis of its internal amino acid sequences, we have isolated and characterized the glutaminase gene (gtaA) from A. oryzae. The gtaA gene had an open reading frame coding for 690 amino acid residues, including a signal peptide of 20 amino acid residues and a mature protein of 670 amino acid residues. In the 5'-flanking region of the gene, there were three putative CreAp binding sequences and one putative AreAp binding sequence. The gtaA structural gene was introduced into A. oryzae NS4 and a marked increase in activity was detected in comparison with the control strain. The gtaA gene was also isolated from Aspergillus nidulans on the basis of the determined nucleotide sequence of the gtaA gene from A. oryzae.
Margam, Venu M.; Coates, Brad S.; Bayles, Darrell O.; Hellmich, Richard L.; Agunbiade, Tolulope; Seufferheld, Manfredo J.; Sun, Weilin; Kroemer, Jeremy A.; Ba, Malick N.; Binso-Dabire, Clementine L.; Baoua, Ibrahim; Ishiyaku, Mohammad F.; Covas, Fernando G.; Srinivasan, Ramasamy; Armstrong, Joel; Murdock, Larry L.; Pittendrigh, Barry R.
2011-01-01
The legume pod borer, Maruca vitrata (Lepidoptera: Crambidae), is an insect pest species of crops grown by subsistence farmers in tropical regions of Africa. We present the de novo assembly of 3729 contigs from 454- and Sanger-derived sequencing reads for midgut, salivary, and whole adult tissues of this non-model species. Functional annotation predicted that 1320 M. vitrata protein coding genes are present, of which 631 have orthologs within the Bombyx mori gene model. A homology-based analysis assigned M. vitrata genes into a group of paralogs, but these were subsequently partitioned into putative orthologs following phylogenetic analyses. Following sequence quality filtering, a total of 1542 putative single nucleotide polymorphisms (SNPs) were predicted within M. vitrata contig assemblies. Seventy one of 1078 designed molecular genetic markers were used to screen M. vitrata samples from five collection sites in West Africa. Population substructure may be present with significant implications in the insect resistance management recommendations pertaining to the release of biological control agents or transgenic cowpea that express Bacillus thuringiensis crystal toxins. Mutation data derived from transcriptome sequencing is an expeditious and economical source for genetic markers that allow evaluation of ecological differentiation. PMID:21754987
Aboussekhra, A; Chanet, R; Zgaga, Z; Cassier-Chauvat, C; Heude, M; Fabre, F
1989-09-25
A new type of radiation-sensitive mutant of S. cerevisiae is described. The recessive radH mutation sensitizes to the lethal effect of UV radiations haploids in the G1 but not in the G2 mitotic phase. Homozygous diploids are as sensitive as G1 haploids. The UV-induced mutagenesis is depressed, while the induction of gene conversion is increased. The mutation is believed to channel the repair of lesions engaged in the mutagenic pathway into a recombination process, successful if the events involve sister-chromatids but lethal if they involve homologous chromosomes. The sequence of the RADH gene reveals that it may code for a DNA helicase, with a Mr of 134 kDa. All the consensus domains of known DNA helicases are present. Besides these consensus regions, strong homologies with the Rep and UvrD helicases of E. coli were found. The RadH putative helicase appears to belong to the set of proteins involved in the error-prone repair mechanism, at least for UV-induced lesions, and could act in coordination with the Rev3 error-prone DNA polymerase.
Seligmann, Hervé
2013-03-01
Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Stotz, Henrik U; Harvey, Pascoe J; Haddadi, Parham; Mashanova, Alla; Kukol, Andreas; Larkan, Nicholas J; Borhan, M Hossein; Fitt, Bruce D L
2018-01-01
Genes coding for nucleotide-binding leucine-rich repeat (LRR) receptors (NLRs) control resistance against intracellular (cell-penetrating) pathogens. However, evidence for a role of genes coding for proteins with LRR domains in resistance against extracellular (apoplastic) fungal pathogens is limited. Here, the distribution of genes coding for proteins with eLRR domains but lacking kinase domains was determined for the Brassica napus genome. Predictions of signal peptide and transmembrane regions divided these genes into 184 coding for receptor-like proteins (RLPs) and 121 coding for secreted proteins (SPs). Together with previously annotated NLRs, a total of 720 LRR genes were found. Leptosphaeria maculans-induced expression during a compatible interaction with cultivar Topas differed between RLP, SP and NLR gene families; NLR genes were induced relatively late, during the necrotrophic phase of pathogen colonization. Seven RLP, one SP and two NLR genes were found in Rlm1 and Rlm3/Rlm4/Rlm7/Rlm9 loci for resistance against L. maculans on chromosome A07 of B. napus. One NLR gene at the Rlm9 locus was positively selected, as was the RLP gene on chromosome A10 with LepR3 and Rlm2 alleles conferring resistance against L. maculans races with corresponding effectors AvrLm1 and AvrLm2, respectively. Known loci for resistance against L. maculans (extracellular hemi-biotrophic fungus), Sclerotinia sclerotiorum (necrotrophic fungus) and Plasmodiophora brassicae (intracellular, obligate biotrophic protist) were examined for presence of RLPs, SPs and NLRs in these regions. Whereas loci for resistance against P. brassicae were enriched for NLRs, no such signature was observed for the other pathogens. These findings demonstrate involvement of (i) NLR genes in resistance against the intracellular pathogen P. brassicae and a putative NLR gene in Rlm9-mediated resistance against the extracellular pathogen L. maculans.
Hara, Yasushi; Hayashi, Kyohei; Nakajima, Takuya; Kagawa, Shizuko; Tazumi, Akihiro; Moore, John E; Matsuda, Motoo
2013-09-01
Clustered regularly interspaced short palindromic repeats (CRISPRs), of approximately 10,000 base pairs (bp) in length, were shown to occur in the Japanese Taylorella equigenitalis strain, EQ59. The locus was composed of the putative CRISPRs-associated with 5 (cas5), RAMP csd1, csd2, recB, cas1, a leader region, 13 CRISPR consensus sequence repeats (each 32 bp; 5'-TCAGCCACGTTCGCGTGGCTGTGTGTTTAAAG-3'). These were in turn separated by 12 non repetitive unique spacer regions of similar length. In addition, a leader region, a transposase/IS protein, a leader region, and cas3 were also seen. All seven putative open reading frames carry their ribosome binding sites. Promoter consensus sequences at the -35 and -10 regions and putative intrinsic ρ-independent transcription terminator regions also occurred. A possible long overlap of 170 bp in length occurred between the recB and cas1 loci. Positive reverse transcription PCR signals of cas5, RAMP csd1, csd2-recB/cas1, and cas3 were generated. A putative secondary structure of the CRISPR consensus repeats was constructed. Following this, CRISPR results of the T. equigenitalis EQ59 isolate were subsequently compared with those from the Taylorella asinigenitalis MCE3 isolate.
Biodegradation of the organic disulfide 4,4'-dithiodibutyric acid by Rhodococcus spp.
Khairy, Heba; Wübbeler, Jan Hendrik; Steinbüchel, Alexander
2015-12-01
Four Rhodococcus spp. exhibited the ability to use 4,4'-dithiodibutyric acid (DTDB) as a sole carbon source for growth. The most important step for the production of a novel polythioester (PTE) using DTDB as a precursor substrate is the initial cleavage of DTDB. Thus, identification of the enzyme responsible for this step was mandatory. Because Rhodococcus erythropolis strain MI2 serves as a model organism for elucidation of the biodegradation of DTDB, it was used to identify the genes encoding the enzymes involved in DTDB utilization. To identify these genes, transposon mutagenesis of R. erythropolis MI2 was carried out using transposon pTNR-TA. Among 3,261 mutants screened, 8 showed no growth with DTDB as the sole carbon source. In five mutants, the insertion locus was mapped either within a gene coding for a polysaccharide deacetyltransferase, a putative ATPase, or an acetyl coenzyme A transferase, 1 bp upstream of a gene coding for a putative methylase, or 176 bp downstream of a gene coding for a putative kinase. In another mutant, the insertion was localized between genes encoding a putative transcriptional regulator of the TetR family (noxR) and an NADH:flavin oxidoreductase (nox). Moreover, in two other mutants, the insertion loci were mapped within a gene encoding a hypothetical protein in the vicinity of noxR and nox. The interruption mutant generated, R. erythropolis MI2 noxΩtsr, was unable to grow with DTDB as the sole carbon source. Subsequently, nox was overexpressed and purified, and its activity with DTDB was measured. The specific enzyme activity of Nox amounted to 1.2 ± 0.15 U/mg. Therefore, we propose that Nox is responsible for the initial cleavage of DTDB into 2 molecules of 4-mercaptobutyric acid (4MB). Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.
Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D
2017-12-03
A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae . Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae . We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae , but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first evidence for a significant enrichment of X motifs in the genes of an extant organism. They raise two hypotheses: the X motifs may be evolutionary relics of the primitive codes used for translation, or they may continue to play a functional role in the complex processes of genome decoding and protein synthesis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Banerjee, Poulabi; Bahlo, Melanie; Schwartz, Jody R.
2002-01-01
Genome wide disease association analysis using SNPs is being explored as a method for dissecting complex genetic traits and a vast number of SNPs have been generated for this purpose. As there are cost and throughput limitations of genotyping large numbers of SNPs and statistical issues regarding the large number of dependent tests on the same data set, to make association analysis practical it has been proposed that SNPs should be prioritized based on likely functional importance. The most easily identifiable functional SNPs are coding SNPs (cSNPs) and accordingly cSNPs have been screened in a number of studies. SNPs inmore » gene regulatory sequences embedded in noncoding DNA are another class of SNPs suggested for prioritization due to their predicted quantitative impact on gene expression. The main challenge in evaluating these SNPs, in contrast to cSNPs is a lack of robust algorithms and databases for recognizing regulatory sequences in noncoding DNA. Approaches that have been previously used to delineate noncoding sequences with gene regulatory activity include cross-species sequence comparisons and the search for sequences recognized by transcription factors. We combined these two methods to sift through mouse human genomic sequences to identify putative gene regulatory elements and subsequently localized SNPs within these sequences in a 1 Megabase (Mb) region of human chromosome 5q31, orthologous to mouse chromosome 11 containing the Interleukin cluster.« less
Haplotypes and Sequence Variation in the Ovine Adiponectin Gene (ADIPOQ)
An, Qing-Ming; Zhou, Hui-Tong; Hu, Jiang; Luo, Yu-Zhu; Hickford, Jon G. H.
2015-01-01
The adiponectin gene (ADIPOQ) plays an important role in energy homeostasis. In this study five separate regions (regions 1 to 5) of ovine ADIPOQ were analysed using PCR-SSCP. Four different PCR-SSCP patterns (A1-D1, A2-D2) were detected in region-1 and region-2, respectively, with seven and six SNPs being revealed. In region-3, three different patterns (A3-C3) and three SNPs were observed. Two patterns (A4-B4, A5-B5) and two and one SNPs were observed in region-4 and region-5, respectively. In total, nineteen SNPs were detected, with five of them in the coding region and two (c.46T/C and c.515G/A) putatively resulting in amino acid changes (p.Tyr16His and p.Lys172Arg). In region-1, -2 and -3 of 316 sheep from eight New Zealand breeds, variants A1, A2 and A3 were the most common, although variant frequencies differed in the eight breeds. Across region-1 and region-3, nine haplotypes were identified and haplotypes A1-A3, A1-C3, B1-A3 and B1-C3 were most common. These results indicate that the ADIPOQ gene is polymorphic and suggest that further analysis is required to see if the variation in the gene is associated with animal production traits. PMID:26610572
Li, Shan; Dong, Xia; Su, Zhengchang
2013-07-30
Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E coli K12, thus it is unknown whether or not the bacterium possesses similar complex transcriptomes. Furthermore, although RNA-seq becomes the major method for analyzing the complexity of prokaryotic transcriptome, it is still a challenging task to accurately assemble full length transcripts using short RNA-seq reads. To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full length transcripts. We found that 46.9 ~ 63.4% of expressed operons were utilized in their putative alternative forms, 72.23 ~ 89.54% genes had putative asRNA transcripts and 51.37 ~ 72.74% intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. As has been demonstrated in many other prokaryotes, E. coli K12 also has a highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads.
2013-01-01
Background Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E coli K12, thus it is unknown whether or not the bacterium possesses similar complex transcriptomes. Furthermore, although RNA-seq becomes the major method for analyzing the complexity of prokaryotic transcriptome, it is still a challenging task to accurately assemble full length transcripts using short RNA-seq reads. Results To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full length transcripts. We found that 46.9 ~ 63.4% of expressed operons were utilized in their putative alternative forms, 72.23 ~ 89.54% genes had putative asRNA transcripts and 51.37 ~ 72.74% intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. Conclusions As has been demonstrated in many other prokaryotes, E. coli K12 also has a highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads. PMID:23899370
Tsoi, Tamara V.; Plotnikova, Elena G.; Cole, James R.; Guerin, William F.; Bagdasarian, Michael; Tiedje, James M.
1999-01-01
We have cloned and characterized novel oxygenolytic ortho-dehalogenation (ohb) genes from 2-chlorobenzoate (2-CBA)- and 2,4-dichlorobenzoate (2,4-dCBA)-degrading Pseudomonas aeruginosa 142. Among 3,700 Escherichia coli recombinants, two clones, DH5αF′(pOD22) and DH5αF′(pOD33), converted 2-CBA to catechol and 2,4-dCBA and 2,5-dCBA to 4-chlorocatechol. A subclone of pOD33, plasmid pE43, containing the 3,687-bp minimized ohb DNA region conferred to P. putida PB2440 the ability to grow on 2-CBA as a sole carbon source. Strain PB2440(pE43) also oxidized but did not grow on 2,4-dCBA, 2,5-dCBA, or 2,6-dCBA. Terminal oxidoreductase ISPOHB structural genes ohbA and ohbB, which encode polypeptides with molecular masses of 20,253 Da (β-ISP) and 48,243 Da (α-ISP), respectively, were identified; these proteins are in accord with the 22- and 48-kDa (as determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis) polypeptides synthesized in E. coli and P. aeruginosa parental strain 142. The ortho-halobenzoate 1,2-dioxygenase activity was manifested in the absence of ferredoxin and reductase genes, suggesting that the ISPOHB utilized electron transfer components provided by the heterologous hosts. ISPOHB formed a new phylogenetic cluster that includes aromatic oxygenases featuring atypical structural-functional organization and is distant from the other members of the family of primary aromatic oxygenases. A putative IclR-type regulatory gene (ohbR) was located upstream of the ohbAB genes. An open reading frame (ohbC) of unknown function that overlaps lengthwise with ohbB but is transcribed in the opposite direction was found. The ohbC gene codes for a 48,969-Da polypeptide, in accord with the 49-kDa protein detected in E. coli. The ohb genes are flanked by an IS1396-like sequence containing a putative gene for a 39,715-Da transposase A (tnpA) at positions 4731 to 5747 and a putative gene for a 45,247-Da DNA topoisomerase I/III (top) at positions 346 to 1563. The ohb DNA region is bordered by 14-bp imperfect inverted repeats at positions 56 to 69 and 5984 to 5997. PMID:10224014
Structure of the coding region and mRNA variants of the apyrase gene from pea (Pisum sativum)
NASA Technical Reports Server (NTRS)
Shibata, K.; Abe, S.; Davies, E.
2001-01-01
Partial amino acid sequences of a 49 kDa apyrase (ATP diphosphohydrolase, EC 3.6.1.5) from the cytoskeletal fraction of etiolated pea stems were used to derive oligonucleotide DNA primers to generate a cDNA fragment of pea apyrase mRNA by RT-PCR and these primers were used to screen a pea stem cDNA library. Two almost identical cDNAs differing in just 6 nucleotides within the coding regions were found, and these cDNA sequences were used to clone genomic fragments by PCR. Two nearly identical gene fragments containing 8 exons and 7 introns were obtained. One of them (H-type) encoded the mRNA sequence described by Hsieh et al. (1996) (DDBJ/EMBL/GenBank Z32743), while the other (S-type) differed by the same 6 nucleotides as the mRNAs, suggesting that these genes may be alleles. The six nucleotide differences between these two alleles were found solely in the first exon, and these mutation sites had two types of consensus sequences. These mRNAs were found with varying lengths of 3' untranslated regions (3'-UTR). There are some similarities between the 3'-UTR of these mRNAs and those of actin and actin binding proteins in plants. The putative roles of the 3'-UTR and alternative polyadenylation sites are discussed in relation to their possible role in targeting the mRNAs to different subcellular compartments.
Yakhnin, Helen; Baker, Carol S.; Berezin, Igor; Evangelista, Michael A.; Rassin, Alisa; Romeo, Tony; Babitzke, Paul
2011-01-01
The RNA binding protein CsrA is the central component of a conserved global regulatory system that activates or represses gene expression posttranscriptionally. In every known example of CsrA-mediated translational control, CsrA binds to the 5′ untranslated region of target transcripts, thereby repressing translation initiation and/or altering the stability of the RNA. Furthermore, with few exceptions, repression by CsrA involves binding directly to the Shine-Dalgarno sequence and blocking ribosome binding. sdiA encodes the quorum-sensing receptor for N-acyl-l-homoserine lactone in Escherichia coli. Because sdiA indirectly stimulates transcription of csrB, which encodes a small RNA (sRNA) antagonist of CsrA, we further explored the relationship between sdiA and the Csr system. Primer extension analysis revealed four putative transcription start sites within 85 nucleotides of the sdiA initiation codon. Potential σ70-dependent promoters were identified for each of these primer extension products. In addition, two CsrA binding sites were predicted in the initially translated region of sdiA. Expression of chromosomally integrated sdiA′-′lacZ translational fusions containing the entire promoter and CsrA binding site regions indicates that CsrA represses sdiA expression. The results from gel shift and footprint studies demonstrate that tight binding of CsrA requires both of these sites. Furthermore, the results from toeprint and in vitro translation experiments indicate that CsrA represses translation of sdiA by directly competing with 30S ribosomal subunit binding. Thus, this represents the first example of CsrA preventing translation by interacting solely within the coding region of an mRNA target. PMID:21908661
Understanding Neurodevelopmental Disorders: The Promise of Regulatory Variation in the 3'UTRome.
Wanke, Kai A; Devanna, Paolo; Vernes, Sonja C
2018-04-01
Neurodevelopmental disorders have a strong genetic component, but despite widespread efforts, the specific genetic factors underlying these disorders remain undefined for a large proportion of affected individuals. Given the accessibility of exome sequencing, this problem has thus far been addressed from a protein-centric standpoint; however, protein-coding regions only make up ∼1% to 2% of the human genome. With the advent of whole genome sequencing we are in the midst of a paradigm shift as it is now possible to interrogate the entire sequence of the human genome (coding and noncoding) to fill in the missing heritability of complex disorders. These new technologies bring new challenges, as the number of noncoding variants identified per individual can be overwhelming, making it prudent to focus on noncoding regions of known function, for which the effects of variation can be predicted and directly tested to assess pathogenicity. The 3'UTRome is a region of the noncoding genome that perfectly fulfills these criteria and is of high interest when searching for pathogenic variation related to complex neurodevelopmental disorders. Herein, we review the regulatory roles of the 3'UTRome as binding sites for microRNAs or RNA binding proteins, or during alternative polyadenylation. We detail existing evidence that these regions contribute to neurodevelopmental disorders and outline strategies for identification and validation of novel putatively pathogenic variation in these regions. This evidence suggests that studying the 3'UTRome will lead to the identification of new risk factors, new candidate disease genes, and a better understanding of the molecular mechanisms contributing to neurodevelopmental disorders. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Tran, Ngoc Tuan; Liu, Han; Jakovlić, Ivan; Wang, Wei-Min
2015-01-01
MyD88 and TRAF6 play an essential role in the innate immune response in most animals. This study reports the full-length MaMyD88 and MaTRAF6 genes identified from the blunt snout bream (Megalobrama amblycephala) transcriptome profile. MaMyD88 is 2501 base pairs (bp) long, encoding a putative protein of 284 amino acids (aa), including the N-terminal DEATH domain of 78 aa and the C-terminal TIR domain of 138 aa. MaTRAF6 is 2252 bp long, encoding a putative protein of 542 aa, including the N-terminal low-complexity region, RING domain (40 aa), a coiled-coil region (64 aa) and C-terminal MATH domain (147 aa). Coding regions of MaMyD88 and MaTRAF6 genomic sequences consisted of five and six exons, respectively. Physicochemical and functional characteristics of the proteins were analysed. Alpha helices were dominant in the secondary structure of the proteins. Homology models of the MaMyD88 and MaTRAF6 domains were constructed applying the comparative modelling method. RT-qPCR was used to analyse the expression of MaMyD88 and MaTRAF6 mRNA transcripts in response to Aeromonas hydrophila challenge. Both genes were highly upregulated in the liver, spleen and kidney during the first 24 h after the challenge. While MyD88 and TRAF6 have been reported in various aquatic species, this is the first report and characterisation of these genes in blunt snout bream. This research also provides evidence of the important roles of these two genes in the blunt snout bream innate immune system. PMID:25830478
Silar, Philippe; Barreau, Christian; Debuchy, Robert; Kicka, Sébastien; Turcq, Béatrice; Sainsard-Chanet, Annie; Sellem, Carole H; Billault, Alain; Cattolico, Laurence; Duprat, Simone; Weissenbach, Jean
2003-08-01
A Podospora anserina BAC library of 4800 clones has been constructed in the vector pBHYG allowing direct selection in fungi. Screening of the BAC collection for centromeric sequences of chromosome V allowed the recovery of clones localized on either sides of the centromere, but no BAC clone was found to contain the centromere. Seven BAC clones containing 322,195 and 156,244bp from either sides of the centromeric region were sequenced and annotated. One 5S rRNA gene, 5 tRNA genes, and 163 putative coding sequences (CDS) were identified. Among these, only six CDS seem specific to P. anserina. The gene density in the centromeric region is approximately one gene every 2.8kb. Extrapolation of this gene density to the whole genome of P. anserina suggests that the genome contains about 11,000 genes. Synteny analyses between P. anserina and Neurospora crassa show that co-linearity extends at the most to a few genes, suggesting rapid genome rearrangements between these two species.
Paznekas, W A; Zhang, N; Gridley, T; Jabs, E W
1997-09-08
Mutations in the human TCOF1 gene have been identified in patients with Treacher Collins Syndrome (Mandibulofacial Dysostosis), an autosomal dominant condition affecting the craniofacial region. We report the isolation of the entire mouse Tcof1 coding sequence (3960 bp) by performing a computer-based search for mouse cDNA clones homologous to TCOF1 and generating overlapping RT-PCR products from mouse RNA. Tcof1 is a 1320 amino acid protein of 135 kd with 61.4% identity to TCOF1 and displays repeating motifs enriched for serine- and acidic amino acid-rich regions with potential phosphorylation sites and putative nuclear localization signals. Tcof1 maps to the mouse chromosome 18 region syntenic with human chromosome 5q32-->q33 which contains the TCOF1 locus. Northern blot hybridization indicates Tcof1 expression is ubiquitous in adult tissues and in the embryonic stage, is elevated at 11 dpc when the branchial arches and facial swellings are present in mouse. Our results are consistent with TCOF1 mutations leading to the Treacher Collins syndrome phenotype.
Sost, independent of the non-coding enhancer ECR5, is required for bone mechanoadaptation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Robling, Alexander G.; Kang, Kyung Shin; Bullock, Whitney A.
Here, sclerostin ( Sost) is a negative regulator of bone formation that acts upon the Wnt signaling pathway. Sost is mechanically regulated at both mRNA and protein level such that loading represses and unloading enhances Sost expression, in osteocytes and in circulation. The non-coding evolutionarily conserved enhancer ECR5 has been previously reported as a transcriptional regulatory element required for modulating Sost expression in osteocytes. Here we explored the mechanisms by which ECR5, or several other putative transcriptional enhancers regulate Sost expression, in response to mechanical stimulation. We found that in vivo ulna loading is equally osteoanabolic in wildtype and Sostmore » –/– mice, although Sost is required for proper distribution of load-induced bone formation to regions of high strain. Using Luciferase reporters carrying the ECR5 non-coding enhancer and heterologous or homologous h SOST promoters, we found that ECR5 is mechanosensitive in vitro and that ECR5-driven Luciferase activity decreases in osteoblasts exposed to oscillatory fluid flow. Yet, ECR5–/– mice showed similar magnitude of load-induced bone formation and similar periosteal distribution of bone formation to high-strain regions compared to wildtype mice. Further, we found that in contrast to Sost–/– mice, which are resistant to disuse-induced bone loss, ECR5–/– mice lose bone upon unloading to a degree similar to wildtype control mice. ECR5 deletion did not abrogate positive effects of unloading on Sost, suggesting that additional transcriptional regulators and regulatory elements contribute to load-induced regulation of Sost.« less
Sost, independent of the non-coding enhancer ECR5, is required for bone mechanoadaptation
Robling, Alexander G.; Kang, Kyung Shin; Bullock, Whitney A.; ...
2016-09-04
Here, sclerostin ( Sost) is a negative regulator of bone formation that acts upon the Wnt signaling pathway. Sost is mechanically regulated at both mRNA and protein level such that loading represses and unloading enhances Sost expression, in osteocytes and in circulation. The non-coding evolutionarily conserved enhancer ECR5 has been previously reported as a transcriptional regulatory element required for modulating Sost expression in osteocytes. Here we explored the mechanisms by which ECR5, or several other putative transcriptional enhancers regulate Sost expression, in response to mechanical stimulation. We found that in vivo ulna loading is equally osteoanabolic in wildtype and Sostmore » –/– mice, although Sost is required for proper distribution of load-induced bone formation to regions of high strain. Using Luciferase reporters carrying the ECR5 non-coding enhancer and heterologous or homologous h SOST promoters, we found that ECR5 is mechanosensitive in vitro and that ECR5-driven Luciferase activity decreases in osteoblasts exposed to oscillatory fluid flow. Yet, ECR5–/– mice showed similar magnitude of load-induced bone formation and similar periosteal distribution of bone formation to high-strain regions compared to wildtype mice. Further, we found that in contrast to Sost–/– mice, which are resistant to disuse-induced bone loss, ECR5–/– mice lose bone upon unloading to a degree similar to wildtype control mice. ECR5 deletion did not abrogate positive effects of unloading on Sost, suggesting that additional transcriptional regulators and regulatory elements contribute to load-induced regulation of Sost.« less
Decoding the genome with an integrative analysis tool: combinatorial CRM Decoder.
Kang, Keunsoo; Kim, Joomyeong; Chung, Jae Hoon; Lee, Daeyoup
2011-09-01
The identification of genome-wide cis-regulatory modules (CRMs) and characterization of their associated epigenetic features are fundamental steps toward the understanding of gene regulatory networks. Although integrative analysis of available genome-wide information can provide new biological insights, the lack of novel methodologies has become a major bottleneck. Here, we present a comprehensive analysis tool called combinatorial CRM decoder (CCD), which utilizes the publicly available information to identify and characterize genome-wide CRMs in a species of interest. CCD first defines a set of the epigenetic features which is significantly associated with a set of known CRMs as a code called 'trace code', and subsequently uses the trace code to pinpoint putative CRMs throughout the genome. Using 61 genome-wide data sets obtained from 17 independent mouse studies, CCD successfully catalogued ∼12 600 CRMs (five distinct classes) including polycomb repressive complex 2 target sites as well as imprinting control regions. Interestingly, we discovered that ∼4% of the identified CRMs belong to at least two different classes named 'multi-functional CRM', suggesting their functional importance for regulating spatiotemporal gene expression. From these examples, we show that CCD can be applied to any potential genome-wide datasets and therefore will shed light on unveiling genome-wide CRMs in various species.
del Val, Coral; Rivas, Elena; Torres-Quesada, Omar; Toro, Nicolás; Jiménez-Zurdo, José I
2007-01-01
Bacterial small non-coding RNAs (sRNAs) are being recognized as novel widespread regulators of gene expression in response to environmental signals. Here, we present the first search for sRNA-encoding genes in the nitrogen-fixing endosymbiont Sinorhizobium meliloti, performed by a genome-wide computational analysis of its intergenic regions. Comparative sequence data from eight related α-proteobacteria were obtained, and the interspecies pairwise alignments were scored with the programs eQRNA and RNAz as complementary predictive tools to identify conserved and stable secondary structures corresponding to putative non-coding RNAs. Northern experiments confirmed that eight of the predicted loci, selected among the original 32 candidates as most probable sRNA genes, expressed small transcripts. This result supports the combined use of eQRNA and RNAz as a robust strategy to identify novel sRNAs in bacteria. Furthermore, seven of the transcripts accumulated differentially in free-living and symbiotic conditions. Experimental mapping of the 5′-ends of the detected transcripts revealed that their encoding genes are organized in autonomous transcription units with recognizable promoter and, in most cases, termination signatures. These findings suggest novel regulatory functions for sRNAs related to the interactions of α-proteobacteria with their eukaryotic hosts. PMID:17971083
Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R
1997-04-28
We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.
Comparative Genetic Analyses of Human Rhinovirus C (HRV-C) Complete Genome from Malaysia.
Khaw, Yam Sim; Chan, Yoke Fun; Jafar, Faizatul Lela; Othman, Norlijah; Chee, Hui Yee
2016-01-01
Human rhinovirus-C (HRV-C) has been implicated in more severe illnesses than HRV-A and HRV-B, however, the limited number of HRV-C complete genomes (complete 5' and 3' non-coding region and open reading frame sequences) has hindered the in-depth genetic study of this virus. This study aimed to sequence seven complete HRV-C genomes from Malaysia and compare their genetic characteristics with the 18 published HRV-Cs. Seven Malaysian HRV-C complete genomes were obtained with newly redesigned primers. The seven genomes were classified as HRV-C6, C12, C22, C23, C26, C42, and pat16 based on the VP4/VP2 and VP1 pairwise distance threshold classification. Five of the seven Malaysian isolates, namely, 3430-MY-10/C22, 8713-MY-10/C23, 8097-MY-11/C26, 1570-MY-10/C42, and 7383-MY-10/pat16 are the first newly sequenced complete HRV-C genomes. All seven Malaysian isolates genomes displayed nucleotide similarity of 63-81% among themselves and 63-96% with other HRV-Cs. Malaysian HRV-Cs had similar putative immunogenic sites, putative receptor utilization and potential antiviral sites as other HRV-Cs. The genomic features of Malaysian isolates were similar to those of other HRV-Cs. Negative selections were frequently detected in HRV-Cs complete coding sequences indicating that these sequences were under functional constraint. The present study showed that HRV-Cs from Malaysia have diverse genetic sequences but share conserved genomic features with other HRV-Cs. This genetic information could provide further aid in the understanding of HRV-C infection.
Hücker, Sarah M.; Ardern, Zachary; Goldberg, Tatyana; Schafferhans, Andrea; Bernhofer, Michael; Vestergaard, Gisle; Nelson, Chase W.; Schloter, Michael; Rost, Burkhard; Scherer, Siegfried
2017-01-01
In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set. PMID:28902868
Comparative Genetic Analyses of Human Rhinovirus C (HRV-C) Complete Genome from Malaysia
Khaw, Yam Sim; Chan, Yoke Fun; Jafar, Faizatul Lela; Othman, Norlijah; Chee, Hui Yee
2016-01-01
Human rhinovirus-C (HRV-C) has been implicated in more severe illnesses than HRV-A and HRV-B, however, the limited number of HRV-C complete genomes (complete 5′ and 3′ non-coding region and open reading frame sequences) has hindered the in-depth genetic study of this virus. This study aimed to sequence seven complete HRV-C genomes from Malaysia and compare their genetic characteristics with the 18 published HRV-Cs. Seven Malaysian HRV-C complete genomes were obtained with newly redesigned primers. The seven genomes were classified as HRV-C6, C12, C22, C23, C26, C42, and pat16 based on the VP4/VP2 and VP1 pairwise distance threshold classification. Five of the seven Malaysian isolates, namely, 3430-MY-10/C22, 8713-MY-10/C23, 8097-MY-11/C26, 1570-MY-10/C42, and 7383-MY-10/pat16 are the first newly sequenced complete HRV-C genomes. All seven Malaysian isolates genomes displayed nucleotide similarity of 63–81% among themselves and 63–96% with other HRV-Cs. Malaysian HRV-Cs had similar putative immunogenic sites, putative receptor utilization and potential antiviral sites as other HRV-Cs. The genomic features of Malaysian isolates were similar to those of other HRV-Cs. Negative selections were frequently detected in HRV-Cs complete coding sequences indicating that these sequences were under functional constraint. The present study showed that HRV-Cs from Malaysia have diverse genetic sequences but share conserved genomic features with other HRV-Cs. This genetic information could provide further aid in the understanding of HRV-C infection. PMID:27199901
Chen, Ying; Dai, Hongzheng; Chen, Sidi; Zhang, Luoying; Long, Manyuan
2011-04-26
Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5' flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes.
Chen, Sidi; Zhang, Luoying; Long, Manyuan
2011-01-01
Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5′ flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes. PMID:21541324
Farreyrol, K; Pearson, M N; Grisoni, M; Cohen, D; Beck, D
2006-05-01
Sequence was determined for the coat protein (CP) gene and 3' non-translated region (3'NTR) of two vanilla mosaic virus (VanMV) isolates from Vanilla tahitensis, respectively from the Cook Islands (VanMV-CI) and French Polynesia (VanMV-FP). Both viruses displayed distinctive features in the N-terminal region of their CPs; for VanMV-CI, a 16-amino-acid deletion including the aphid transmission-related DAG motif, and for VanMV-FP, a stretch of GTN repeats that putatively belongs to the class of natively unfolded proteins. VanMV-FP CP also has a novel DVG motif in place of the DAG motif, and an uncommon Q//V protease cleavage site. The sequences were compared to a range of Dasheen mosaic virus (DsMV) strains and to potyviruses infecting orchids. Identity was low to DsMV strains across the entire CP coding region and across the 3'NTR, but high across the CP core and the CI-6K2-NIa region. In accordance with current ICTV criteria for species demarcation within the family Potyviridae, VanMV-CI and VanMV-FP are strains of DsMV that exclusively infect vanilla.
Itoh, S; Yanagimoto, T; Tagawa, S; Hashimoto, H; Kitamura, R; Nakajima, Y; Okochi, T; Fujimoto, S; Uchino, J; Kamataki, T
1992-03-24
P-450IIIA7 is a form of cytochrome P-450 which was isolated from human fetal livers and termed P-450HFLa. This form has been clarified to be expressed during fetal life specifically (Komori, M., Nishio, K., Kitada, M., Shiramatsu, K., Muroya, K., Soma, M., Nagashima, K. and Kamataki, T. (1990) Biochemistry 29, 4430-4433). In the present study, we isolated five independent clones which probably corresponded to the human P-450IIIA7 gene. These clones were completely sequenced, all exons, exon-intron junctions and the 5' flanking region from the cap site to-869. Although the sequences in the coding region were completely identical to P-450IIIA7, it is possible that genomic fragments sequenced in this study encode portions of other P-450IIIA7-related genes since we could not obtain a complete overlapping set of genomic clones. Within its 5' flanking sequence, the putative binding sites of several transcriptional regulatory factors existed. Among them, it was shown that a basic transcription element binding factor (BTEB) actually interacted with the 5' flanking region of this gene.
Neville, P J; Thomas, N; Campbell, I G
2001-02-01
Many tumor types including that of the ovary show loss of heterozygosity (LOH) on chromosome arm 7q, which suggests the existence of at least one tumor suppressor gene (TSG) on this chromosome arm. We have studied the region surrounding the putative tumor suppressor gene CUTL1 at 7q22 in 127 epithelial ovarian tumors. LOH was found across 7q22 in 31% of malignant and 14% of benign ovarian tumors. In 16% of the tumors the LOH appeared to be centered on the CUTL1 gene. This gene has been implicated previously as a TSG in both uterine leiomyomas and breast carcinoma. However, mutation analysis of the CUTL1 gene in 47 tumors with 7q22 LOH failed to identify any somatic alterations in the coding regions. This finding suggests that CUTL1 may not be the target of the 7q22 LOH in ovarian cancers.
Cheewachaiwit, S; Warin, N; Phuangrat, B; Rukpratanporn, S; Gajanandana, O; Balatero, C H; Chatchawankanphanich, O
2017-07-01
Overall, 244 samples of cucurbit crops with yellowing symptoms and selected weed species, from 15 provinces in Thailand, were screened by RT-PCR using primers Polero-CP-F and Polero-CP-R. A total of 160 samples (~66%) were infected by poleroviruses. Analysis of a 1.4 kb region covering the 3' RNA-dependent RNA polymerase (RdRp) gene, the intergenic non-coding region (iNCR), and the coat protein (CP), showed that four poleroviruses, namely, cucurbit aphid-borne yellows virus (CABYV), luffa aphid-borne yellows virus (LABYV), melon aphid-borne yellows virus (MABYV) and suakwa aphid-borne yellows virus (SABYV) were associated with the yellowing symptoms in cucurbit crops. Further analyses indicated presence of putative recombinant viruses referred to as CABYV-R and SABYV-R. CABYV-R was derived from the recombination between MABYV and the common strain of CABYV (CABYV-C). SABYV-R was derived from the recombination of MABYV and SABYV.
Zhu, Hu; Urban, Daniel J.; Blashka, Jared; McPheeters, Matthew T.; Kroeze, Wesley K.; Mieczkowski, Piotr; Overholser, James C.; Jurjus, George J.; Dieter, Lesa; Mahajan, Gouri J.; Rajkowska, Grazyna; Wang, Zefeng; Sullivan, Patrick F.; Stockmeier, Craig A.; Roth, Bryan L.
2012-01-01
A-to-I RNA editing is a post-transcriptional modification of single nucleotides in RNA by adenosine deamination, which thereby diversifies the gene products encoded in the genome. Thousands of potential RNA editing sites have been identified by recent studies (e.g. see Li et al, Science 2009); however, only a handful of these sites have been independently confirmed. Here, we systematically and quantitatively examined 109 putative coding region A-to-I RNA editing sites in three sets of normal human brain samples by ultra-high-throughput sequencing (uHTS). Forty of 109 putative sites, including 25 previously confirmed sites, were validated as truly edited in our brain samples, suggesting an overestimation of A-to-I RNA editing in these putative sites by Li et al (2009). To evaluate RNA editing in human disease, we analyzed 29 of the confirmed sites in subjects with major depressive disorder and schizophrenia using uHTS. In striking contrast to many prior studies, we did not find significant alterations in the frequency of RNA editing at any of the editing sites in samples from these patients, including within the 5HT2C serotonin receptor (HTR2C). Our results indicate that uHTS is a fast, quantitative and high-throughput method to assess RNA editing in human physiology and disease and that many prior studies of RNA editing may overestimate both the extent and disease-related variability of RNA editing at the sites we examined in the human brain. PMID:22912834
Current Research on Non-Coding Ribonucleic Acid (RNA).
Wang, Jing; Samuels, David C; Zhao, Shilin; Xiang, Yu; Zhao, Ying-Yong; Guo, Yan
2017-12-05
Non-coding ribonucleic acid (RNA) has without a doubt captured the interest of biomedical researchers. The ability to screen the entire human genome with high-throughput sequencing technology has greatly enhanced the identification, annotation and prediction of the functionality of non-coding RNAs. In this review, we discuss the current landscape of non-coding RNA research and quantitative analysis. Non-coding RNA will be categorized into two major groups by size: long non-coding RNAs and small RNAs. In long non-coding RNA, we discuss regular long non-coding RNA, pseudogenes and circular RNA. In small RNA, we discuss miRNA, transfer RNA, piwi-interacting RNA, small nucleolar RNA, small nuclear RNA, Y RNA, single recognition particle RNA, and 7SK RNA. We elaborate on the origin, detection method, and potential association with disease, putative functional mechanisms, and public resources for these non-coding RNAs. We aim to provide readers with a complete overview of non-coding RNAs and incite additional interest in non-coding RNA research.
Santibáñez-López, Carlos E; Cid-Uribe, Jimena I; Zamudio, Fernando Z; Batista, Cesar V F; Ortiz, Ernesto; Possani, Lourival D
2017-07-01
The soluble venom from the Mexican scorpion Megacormus gertschi of the family Euscorpiidae was obtained and its biological effects were tested in several animal models. This venom is not toxic to mice at doses of 100 μg per 20 g of mouse weight, while being lethal to arthropods (insects and crustaceans), at doses of 20 μg (for crickets) and 100 μg (for shrimps) per animal. Samples of the venom were separated by high performance liquid chromatography and circa 80 distinct chromatographic fractions were obtained from which 67 components have had their molecular weights determined by mass spectrometry analysis. The N-terminal amino acid sequence of seven protein/peptides were obtained by Edman degradation and are reported. Among the high molecular weight components there are enzymes with experimentally-confirmed phospholipase activity. A pair of telsons from this scorpion species was dissected, from which total RNA was extracted and used for cDNA library construction. Massive sequencing by the Illumina protocol, followed by de novo assembly, resulted in a total of 110,528 transcripts. From those, we were able to annotate 182, which putatively code for peptides/proteins with sequence similarity to previously-reported venom components available from different protein databases. Transcripts seemingly coding for enzymes showed the richest diversity, with 52 sequences putatively coding for proteases, 20 for phospholipases, 8 for lipases and 5 for hyaluronidases. The number of different transcripts potentially coding for peptides with sequence similarity to those that affect ion channels was 19, for putative antimicrobial peptides 19, and for protease inhibitor-like peptides, 18. Transcripts seemingly coding for other venom components were identified and described. The LC/MS analysis of a trypsin-digested venom aliquot resulted in 23 matches with the translated transcriptome database, which validates the transcriptome. The proteomic and transcriptomic analyses reported here constitute the first approach to study the venom components from a scorpion species belonging to the family Euscorpiidae. The data certainly show that this venom is different from all the ones described thus far in the literature. Copyright © 2017 Elsevier Ltd. All rights reserved.
A deep learning method for lincRNA detection using auto-encoder algorithm.
Yu, Ning; Yu, Zeng; Pan, Yi
2017-12-06
RNA sequencing technique (RNA-seq) enables scientists to develop novel data-driven methods for discovering more unidentified lincRNAs. Meantime, knowledge-based technologies are experiencing a potential revolution ignited by the new deep learning methods. By scanning the newly found data set from RNA-seq, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, the relevance exists along the DNA sequences; (2) lincRNAs contain some conversed patterns/motifs tethered together by non-conserved regions. The two evidences give the reasoning for adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than message RNAs are generated. That is, the transcribed RNAs participate the biological process as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of predictive deep neural network on the lincRNA data sets compared with support vector machine and traditional neural network. In addition, it is validated through the newly discovered lincRNA data set and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that deep learning method has the extensive ability for lincRNA prediction. The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for the lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data. Driven by those newly annotated lincRNA data, deep learning methods based on auto-encoder algorithm can exert their capability in knowledge learning in order to capture the useful features and the information correlation along DNA genome sequences for lincRNA detection. As our knowledge, this is the first application to adopt the deep learning techniques for identifying lincRNA transcription sequences.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xie, Enzhong; Zhu, Lingyu; Zhao, Lingyun
1996-08-01
The complete 4775-nt cDNA encoding the human serotonin 5-HT{sub 2C} receptor (5-HT{sub 2C}R), a G-protein-coupled receptor, has been isolated. It contains a 1377-nt coding region flanked by a 728-nt 5{prime}-untranslated region and a 2670-nt 3{prime}-untranslated region. By using the cloned 5-HT{sub 2C}R cDNA probe, the complete human gene for this receptor has been isolated and shown to contain six exons and five introns spanning at least 230 kb of DNA. The coding region of the human 5-HT{sub 2C}R gene is interrupted by three introns, and the positions of the intron/exon junctions are conserved between the human and the rodent genes.more » In addition, an alternatively spliced 5-HT{sub 2C}R RNA that contains a 95-nt deletion in the region coding for the second intracellular loop and the fourth transmembrane domain of the receptor has been identified. This deletion leads to a frameshift and premature termination so that the short isoform RNA encodes a putative protein of 248 amino acids. The ratio for the short isoform over the 5-HT{sub 2C}R RNA was found to be higher in choroid plexus tumor than in normal brain tissue, suggesting the possibility of differential regulation of the 5-HT{sub 2C}R gene in different neural tissues or during tumorigenesis. Transcription of the human 5-HT{sub 2C}R gene was found to be initiated at multiple sites. No classical TATA-box sequence was found at the appropriate location, and the 5{prime}-flanking sequence contains many potential transcription factor-binding sites. A 7.3-kb 5{prime}-flanking 5-HT{sub 2C}R DNA directed the efficient expression of a luciferase reported gene in SK-N-SH and IMR32 neuroblastoma cells, indicating that is contains a functional promoter. 69 refs., 8 figs., 1 tab.« less
Parallel evolution of chordate cis-regulatory code for development.
Doglio, Laura; Goode, Debbie K; Pelleri, Maria C; Pauls, Stefan; Frabetti, Flavia; Shimeld, Sebastian M; Vavouri, Tanya; Elgar, Greg
2013-11-01
Urochordates are the closest relatives of vertebrates and at the larval stage, possess a characteristic bilateral chordate body plan. In vertebrates, the genes that orchestrate embryonic patterning are in part regulated by highly conserved non-coding elements (CNEs), yet these elements have not been identified in urochordate genomes. Consequently the evolution of the cis-regulatory code for urochordate development remains largely uncharacterised. Here, we use genome-wide comparisons between C. intestinalis and C. savignyi to identify putative urochordate cis-regulatory sequences. Ciona conserved non-coding elements (ciCNEs) are associated with largely the same key regulatory genes as vertebrate CNEs. Furthermore, some of the tested ciCNEs are able to activate reporter gene expression in both zebrafish and Ciona embryos, in a pattern that at least partially overlaps that of the gene they associate with, despite the absence of sequence identity. We also show that the ability of a ciCNE to up-regulate gene expression in vertebrate embryos can in some cases be localised to short sub-sequences, suggesting that functional cross-talk may be defined by small regions of ancestral regulatory logic, although functional sub-sequences may also be dispersed across the whole element. We conclude that the structure and organisation of cis-regulatory modules is very different between vertebrates and urochordates, reflecting their separate evolutionary histories. However, functional cross-talk still exists because the same repertoire of transcription factors has likely guided their parallel evolution, exploiting similar sets of binding sites but in different combinations.
Analysis of the Genome of the Sexually Transmitted Insect Virus Helicoverpa zea Nudivirus 2
Burand, John P.; Kim, Woojin; Afonso, Claudio L.; Tulman, Edan R.; Kutish, Gerald F.; Lu, Zhiqiang; Rock, Daniel L.
2012-01-01
The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea. PMID:22355451
Kato, Hirotomo; Jochim, Ryan C.; Gomez, Eduardo A.; Sakoda, Ryo; Iwata, Hiroyuki; Valenzuela, Jesus G.; Hashiguchi, Yoshihisa
2010-01-01
Triatoma (T.) dimidiata is a hematophagous Hemiptera and a main vector of Chagas disease. The saliva of this and other blood-sucking insects contains potent pharmacologically active components that assist them in counteracting the host hemostatic and inflammatory systems during blood feeding. To describe the repertoire of potential bioactive salivary molecules from this insect, a number of randomly selected transcripts from the salivary gland cDNA library of T. dimidiata were sequenced and analyzed. This analysis showed that 77.5% of the isolated transcripts coded for putative secreted proteins, and 89.9% of these coded for variants of the lipocalin family proteins. The most abundant transcript was a homologue of procalin, the major allergen of T. protracta saliva, and contributed more than 50% of the transcripts coding for putative secreted proteins, suggesting that it may play an important role in the blood-feeding process. Other salivary transcripts encoding lipocalin family proteins had homology to triabin (a thrombin inhibitor), triafestin (an inhibitor of kallikrein–kinin system), pallidipin (an inhibitor of collagen-induced platelet aggregation) and others with unknown function. PMID:19900580
Analysis of the genome of the sexually transmitted insect virus Helicoverpa zea nudivirus 2.
Burand, John P; Kim, Woojin; Afonso, Claudio L; Tulman, Edan R; Kutish, Gerald F; Lu, Zhiqiang; Rock, Daniel L
2012-01-01
The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea.
Identification of functional domains in Arabidopsis thaliana mRNA decapping enzyme (AtDcp2)
Gunawardana, Dilantha; Cheng, Heung-Chin; Gayler, Kenwyn R.
2008-01-01
The Arabidopsis thaliana decapping enzyme (AtDcp2) was characterized by bioinformatics analysis and by biochemical studies of the enzyme and mutants produced by recombinant expression. Three functionally significant regions were detected: (i) a highly disordered C-terminal region with a putative PSD-95, Discs-large, ZO-1 (PDZ) domain-binding motif, (ii) a conserved Nudix box constituting the putative active site and (iii) a putative RNA binding domain consisting of the conserved Box B and a preceding loop region. Mutation of the putative PDZ domain-binding motif improved the stability of recombinant AtDcp2 and secondary mutants expressed in Escherichia coli. Such recombinant AtDcp2 specifically hydrolysed capped mRNA to produce 7-methyl GDP and decapped RNA. AtDcp2 activity was Mn2+- or Mg2+-dependent and was inhibited by the product 7-methyl GDP. Mutation of the conserved glutamate-154 and glutamate-158 in the Nudix box reduced AtDcp2 activity up to 400-fold and showed that AtDcp2 employs the catalytic mechanism conserved amongst Nudix hydrolases. Unlike many Nudix hydrolases, AtDcp2 is refractory to inhibition by fluoride ions. Decapping was dependent on binding to the mRNA moiety rather than to the 7-methyl diguanosine triphosphate cap of the substrate. Mutational analysis of the putative RNA-binding domain confirmed the functional significance of an 11-residue loop region and the conserved Box B. PMID:18025047
Hao, Jiasheng; Sun, Qianqian; Zhao, Huabin; Sun, Xiaoyan; Gai, Yonghua; Yang, Qun
2012-01-01
We here report the first complete mitochondrial (mt) genome of a skipper, Ctenoptilum vasava Moore, 1865 (Lepidoptera: Hesperiidae: Pyrginae). The mt genome of the skipper is a circular molecule of 15,468 bp, containing 2 ribosomal RNA genes, 24 putative transfer RNA (tRNA), genes including an extra copy of trnS (AGN) and a tRNA-like insertion trnL (UUR), 13 protein-coding genes and an AT-rich region. All protein-coding genes (PCGs) are initiated by ATN codons and terminated by the typical stop codon TAA or TAG, except for COII which ends with a single T. The intergenic spacer sequence between trnS (AGN) and ND1 genes also contains the ATACTAA motif. The AT-rich region of 429 bp is comprised of nonrepetitive sequences, including the motif ATAGA followed by an 19 bp poly-T stretch, a microsatellite-like (AT)3 (TA)9 element next to the ATTTA motif, an 11 bp poly-A adjacent to tRNAs. Phylogenetic analyses (ML and BI methods) showed that Papilionoidea is not a natural group, and Hesperioidea is placed within the Papilionoidea as a sister to ((Pieridae + Lycaenidae) + Nymphalidae) while Papilionoidae is paraphyletic to Hesperioidea. This result is remarkably different from the traditional view where Papilionoidea and Hesperioidea are considered as two distinct superfamilies. PMID:22577351
Mehdizadeh Gohari, Iman; Kropinski, Andrew M; Weese, Scott J; Parreira, Valeria R; Whitehead, Ashley E; Boerlin, Patrick; Prescott, John F
2016-01-01
The recent discovery of a novel beta-pore-forming toxin, NetF, which is strongly associated with canine and foal necrotizing enteritis should improve our understanding of the role of type A Clostridium perfringens associated disease in these animals. The current study presents the complete genome sequence of two netF-positive strains, JFP55 and JFP838, which were recovered from cases of foal necrotizing enteritis and canine hemorrhagic gastroenteritis, respectively. Genome sequencing was done using Single Molecule, Real-Time (SMRT) technology-PacBio and Illumina Hiseq2000. The JFP55 and JFP838 genomes include a single 3.34 Mb and 3.53 Mb chromosome, respectively, and both genomes include five circular plasmids. Plasmid annotation revealed that three plasmids were shared by the two newly sequenced genomes, including a NetF/NetE toxins-encoding tcp-conjugative plasmid, a CPE/CPB2 toxins-encoding tcp-conjugative plasmid and a putative bacteriocin-encoding plasmid. The putative beta-pore-forming toxin genes, netF, netE and netG, were located in unique pathogenicity loci on tcp-conjugative plasmids. The C. perfringens JFP55 chromosome carries 2,825 protein-coding genes whereas the chromosome of JFP838 contains 3,014 protein-encoding genes. Comparison of these two chromosomes with three available reference C. perfringens chromosome sequences identified 48 (~247 kb) and 81 (~430 kb) regions unique to JFP55 and JFP838, respectively. Some of these divergent genomic regions in both chromosomes are phage- and plasmid-related segments. Sixteen of these unique chromosomal regions (~69 kb) were shared between the two isolates. Five of these shared regions formed a mosaic of plasmid-integrated segments, suggesting that these elements were acquired early in a clonal lineage of netF-positive C. perfringens strains. These results provide significant insight into the basis of canine and foal necrotizing enteritis and are the first to demonstrate that netF resides on a large and unique plasmid-encoded locus.
Transcription Factors Bind Thousands of Active and InactiveRegions in the Drosophila Blastoderm
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Xiao-Yong; MacArthur, Stewart; Bourgon, Richard
2008-01-10
Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. Here, we use whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched inmore » bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over forty well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. Together these observations suggest that many of these poorly-bound regions are not involved in early-embryonic transcriptional regulation, and a significant proportion may be nonfunctional. Surprisingly, for five of the six factors, their recognition sites are not unambiguously more constrained evolutionarily than the immediate flanking DNA, even in more highly bound and presumably functional regions, indicating that comparative DNA sequence analysis is limited in its ability to identify functional transcription factor targets.« less
Ordóñez-Baquera, Perla Lucía; González-Rodríguez, Everardo; Aguado-Santacruz, Gerardo Armando; Rascón-Cruz, Quintín; Conesa, Ana; Moreno-Brito, Verónica; Echavarria, Raquel; Dominguez-Viveros, Joel
2017-02-01
MicroRNAs (miRNAs) are small non-coding RNA molecules that regulate signal transduction, development, metabolism, and stress responses in plants through post-transcriptional degradation and/or translational repression of target mRNAs. Several studies have addressed the role of miRNAs in model plant species, but miRNA expression and function in economically important forage crops, such as Bouteloua gracilis (Poaceae), a high-quality and drought-resistant grass distributed in semiarid regions of the United States and northern Mexico remain unknown. We applied high-throughput sequencing technology and bioinformatics analysis and identified 31 conserved miRNA families and 53 novel putative miRNAs with different abundance of reads in chlorophyllic cell cultures derived from B. gracilis. Some conserved miRNA families were highly abundant and possessed predicted targets involved in metabolism, plant growth and development, and stress responses. We also predicted additional identified novel miRNAs with specific targets, including B. gracilis ESTs, which were detected under drought stress conditions. Here we report 31 conserved miRNA families and 53 putative novel miRNAs in B. gracilis. Our results suggested the presence of regulatory miRNAs involved in modulating physiological and stress responses in this grass species. Copyright © 2016 Elsevier Ltd. All rights reserved.
Positional cloning of a gene responsible for the cts mutation of the silkworm, Bombyx mori.
Ito, Katsuhiko; Kidokoro, Kurako; Katsuma, Susumu; Shimada, Toru; Yamamoto, Kimiko; Mita, Kazuei; Kadono-Okuda, Keiko
2012-07-01
The larval head cuticle and anal plates of the silkworm mutant cheek and tail spot (cts) have chocolate-colored spots, unlike the entirely white appearance of the wild-type (WT) strain. We report the identification and characterization of the gene responsible for the cts mutation. Positional cloning revealed a cts candidate on chromosome 16, designated BmMFS, based on the high similarity of the deduced amino acid sequence between the candidate gene from the WT strain and the major facilitator superfamily (MFS) protein. BmMFS likely encodes a membrane protein with 11 putative transmembrane domains, while the putative structure deduced from the cts-type allele possesses only 10-pass transmembrane domains owing to a deletion in its coding region. Quantitative RT-PCR analysis showed that BmMFS mRNA was strongly expressed in the integument of the head and tail, where the cts phenotype is observed; expression markedly increased at the molting and newly ecdysed stages. These results indicate that the novel BmMFS gene is cts and the membrane structure of its protein accounts for the cts phenotype. These expression profiles and the cts phenotype are quite similar to those of melanin-related genes, such as Bmyellow-e and Bm-iAANAT, suggesting that BmMFS is involved in the melanin synthesis pathway.
Krishnamurthi, Revathy; Ghosh, Swagatha; Khedkar, Supriya; Seshasayee, Aswin Sai Narain
2017-01-01
Horizontal gene transfer is a major driving force behind the genomic diversity seen in prokaryotes. The cryptic rac prophage in Escherichia coli K-12 carries the gene for a putative transcription factor RacR, whose deletion is lethal. We have shown that the essentiality of racR in E. coli K-12 is attributed to its role in transcriptionally repressing toxin gene(s) called ydaS and ydaT , which are adjacent to and coded divergently to racR . IMPORTANCE Transcription factors in the bacterium E. coli are rarely essential, and when they are essential, they are largely toxin-antitoxin systems. While studying transcription factors encoded in horizontally acquired regions in E. coli , we realized that the protein RacR, a putative transcription factor encoded by a gene on the rac prophage, is an essential protein. Here, using genetics, biochemistry, and bioinformatics, we show that its essentiality derives from its role as a transcriptional repressor of the ydaS and ydaT genes, whose products are toxic to the cell. Unlike type II toxin-antitoxin systems in which transcriptional regulation involves complexes of the toxin and antitoxin, repression by RacR is sufficient to keep ydaS transcriptionally silent.
Insel, Nathan; Barnes, Carol A.
2015-01-01
The medial prefrontal cortex is thought to be important for guiding behavior according to an animal's expectations. Efforts to decode the region have focused not only on the question of what information it computes, but also how distinct circuit components become engaged during behavior. We find that the activity of regular-firing, putative projection neurons contains rich information about behavioral context and firing fields cluster around reward sites, while activity among putative inhibitory and fast-spiking neurons is most associated with movement and accompanying sensory stimulation. These dissociations were observed even between adjacent neurons with apparently reciprocal, inhibitory–excitatory connections. A smaller population of projection neurons with burst-firing patterns did not show clustered firing fields around rewards; these neurons, although heterogeneous, were generally less selective for behavioral context than regular-firing cells. The data suggest a network that tracks an animal's behavioral situation while, at the same time, regulating excitation levels to emphasize high valued positions. In this scenario, the function of fast-spiking inhibitory neurons is to constrain network output relative to incoming sensory flow. This scheme could serve as a bridge between abstract sensorimotor information and single-dimensional codes for value, providing a neural framework to generate expectations from behavioral state. PMID:24700585
Genomewide identification and expression analysis of the ARF gene family in apple.
Luo, Xiao-Cui; Sun, Mei-Hong; Xu, Rui-Rui; Shu, Huai-Rui; Wang, Jia-Wei; Zhang, Shi-Zhong
2014-12-01
Auxin response factors (ARF) are transcription factors that regulate auxin responses in plants. Although the genomewide analysis of this family has been performed in some species, little is known regarding ARF genes in apple (Malus domestica). In this study, 31 putative apple ARF genes have been identified and located within the apple genome. The phylogenetic analysis revealed that MdARFs could be divided into three subfamilies (groups I, II and III). The predicted MdARFs were distributed across 15 of 17 chromosomes with different densities. In addition, the analysis of exon-intron junctions and of the intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Expression profile analyses of MdARF genes were performed in different tissues (root, stem, leaf, flower and fruit), and all the selected genes were expressed in at least one of the tissues that were tested, which indicated that MdARFs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this report is the first to provide a genomewide analysis of the apple ARF gene family. This study provides valuable information for understanding the classification and putative functions of the ARF signal in apple.
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-01-01
Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis. PMID:18954468
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-10-28
The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.
Johnson, Timothy J; Siek, Kylie E; Johnson, Sara J; Nolan, Lisa K
2006-01-01
ColV plasmids have long been associated with the virulence of Escherichia coli, despite the fact that their namesake trait, ColV production, does not appear to contribute to virulence. Such plasmids or their associated sequences appear to be quite common among avian pathogenic E. coli (APEC) and are strongly linked to the virulence of these organisms. In the present study, a 180-kb ColV plasmid was sequenced and analyzed. This plasmid, pAPEC-O2-ColV, possesses a 93-kb region containing several putative virulence traits, including iss, tsh, and four putative iron acquisition and transport systems. The iron acquisition and transport systems include those encoding aerobactin and salmochelin, the sit ABC iron transport system, and a putative iron transport system novel to APEC, eit. In order to determine the prevalence of the virulence-associated genes within this region among avian E. coli strains, 595 APEC and 199 avian commensal E. coli isolates were examined for genes of this region using PCR. Results indicate that genes contained within a portion of this putative virulence region are highly conserved among APEC and that the genes of this region occur significantly more often in APEC than in avian commensal E. coli. The region of pAPEC-O2-ColV containing genes that are highly prevalent among APEC appears to be a distinguishing trait of APEC strains.
Johnson, Timothy J.; Siek, Kylie E.; Johnson, Sara J.; Nolan, Lisa K.
2006-01-01
ColV plasmids have long been associated with the virulence of Escherichia coli, despite the fact that their namesake trait, ColV production, does not appear to contribute to virulence. Such plasmids or their associated sequences appear to be quite common among avian pathogenic E. coli (APEC) and are strongly linked to the virulence of these organisms. In the present study, a 180-kb ColV plasmid was sequenced and analyzed. This plasmid, pAPEC-O2-ColV, possesses a 93-kb region containing several putative virulence traits, including iss, tsh, and four putative iron acquisition and transport systems. The iron acquisition and transport systems include those encoding aerobactin and salmochelin, the sit ABC iron transport system, and a putative iron transport system novel to APEC, eit. In order to determine the prevalence of the virulence-associated genes within this region among avian E. coli strains, 595 APEC and 199 avian commensal E. coli isolates were examined for genes of this region using PCR. Results indicate that genes contained within a portion of this putative virulence region are highly conserved among APEC and that the genes of this region occur significantly more often in APEC than in avian commensal E. coli. The region of pAPEC-O2-ColV containing genes that are highly prevalent among APEC appears to be a distinguishing trait of APEC strains. PMID:16385064
Dreyer, Hermann; Steiner, Gerhard
2006-01-01
Background Mitochondrial (mt) gene arrangement is highly variable among molluscs and especially among bivalves. Of the 30 complete molluscan mt-genomes published to date, only one is of a heterodont bivalve, although this is the most diverse taxon in terms of species numbers. We determined the complete sequence of the mitochondrial genomes of Acanthocardia tuberculata and Hiatella arctica, (Mollusca, Bivalvia, Heterodonta) and describe their gene contents and genome organisations to assess the variability of these features among the Bivalvia and their value for phylogenetic inference. Results The size of the mt-genome in Acanthocardia tuberculata is 16.104 basepairs (bp), and in Hiatella arctica 18.244 bp. The Acanthocardia mt-genome contains 12 of the typical protein coding genes, lacking the Atpase subunit 8 (atp8) gene, as all published marine bivalves. In contrast, a complete atp8 gene is present in Hiatella arctica. In addition, we found a putative truncated atp8 gene when re-annotating the mt-genome of Venerupis philippinarum. Both mt-genomes reported here encode all genes on the same strand and have an additional trnM. In Acanthocardia several large non-coding regions are present. One of these contains 3.5 nearly identical copies of a 167 bp motive. In Hiatella, the 3' end of the NADH dehydrogenase subunit (nad)6 gene is duplicated together with the adjacent non-coding region. The gene arrangement of Hiatella is markedly different from all other known molluscan mt-genomes, that of Acanthocardia shows few identities with the Venerupis philippinarum. Phylogenetic analyses on amino acid and nucleotide levels robustly support the Heterodonta and the sister group relationship of Acanthocardia and Venerupis. Monophyletic Bivalvia are resolved only by a Bayesian inference of the nucleotide data set. In all other analyses the two unionid species, being to only ones with genes located on both strands, do not group with the remaining bivalves. Conclusion The two mt-genomes reported here add to and underline the high variability of gene order and presence of duplications in bivalve and molluscan taxa. Some genomic traits like the loss of the atp8 gene or the encoding of all genes on the same strand are homoplastic among the Bivalvia. These characters, gene order, and the nucleotide sequence data show considerable potential of resolving phylogenetic patterns at lower taxonomic levels. PMID:16948842
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
Davies, Kalina T J; Tsagkogeorga, Georgia; Rossiter, Stephen J
2014-12-19
The majority of DNA contained within vertebrate genomes is non-coding, with a certain proportion of this thought to play regulatory roles during development. Conserved Non-coding Elements (CNEs) are an abundant group of putative regulatory sequences that are highly conserved across divergent groups and thus assumed to be under strong selective constraint. Many CNEs may contain regulatory factor binding sites, and their frequent spatial association with key developmental genes - such as those regulating sensory system development - suggests crucial roles in regulating gene expression and cellular patterning. Yet surprisingly little is known about the molecular evolution of CNEs across diverse mammalian taxa or their role in specific phenotypic adaptations. We examined 3,110 vertebrate-specific and ~82,000 mammalian-specific CNEs across 19 and 9 mammalian orders respectively, and tested for changes in the rate of evolution of CNEs located in the proximity of genes underlying the development or functioning of auditory systems. As we focused on CNEs putatively associated with genes underlying the development/functioning of auditory systems, we incorporated echolocating taxa in our dataset because of their highly specialised and derived auditory systems. Phylogenetic reconstructions of concatenated CNEs broadly recovered accepted mammal relationships despite high levels of sequence conservation. We found that CNE substitution rates were highest in rodents and lowest in primates, consistent with previous findings. Comparisons of CNE substitution rates from several genomic regions containing genes linked to auditory system development and hearing revealed differences between echolocating and non-echolocating taxa. Wider taxonomic sampling of four CNEs associated with the homeobox genes Hmx2 and Hmx3 - which are required for inner ear development - revealed family-wise variation across diverse bat species. Specifically within one family of echolocating bats that utilise frequency-modulated echolocation calls varying widely in frequency and intensity high levels of sequence divergence were found. Levels of selective constraint acting on CNEs differed both across genomic locations and taxa, with observed variation in substitution rates of CNEs among bat species. More work is needed to determine whether this variation can be linked to echolocation, and wider taxonomic sampling is necessary to fully document levels of conservation in CNEs across diverse taxa.
Sahoo, Prabhati Kumari; Goel, Chirag; Kumar, Rohit; Dhama, Nisha; Ali, Shahnawaz; Sarma, Dandadhar; Nanda, Prasanta; Barat, Ashoktaru
2015-10-10
The chocolate mahseer (Neolissochilus hexagonolepis) is an important food and game fish of North Eastern India. To study the phylogenetic status we sequenced the complete mitochondrial genome of N. hexagonolepis. The mitogenome is 16,563 bp in length and composed of 13 protein coding genes, 22 tRNAs, 2 rRNAs and one putative control region. The overall base composition was A 31.8%, T 25.0%, G 15.8%, C 27.4% and A+T content 56.9%, G+C content 43.1%. The phylogenetic analysis using the complete mitochondrial genome revealed that the chocolate mahseer belonged to same clade of mahseer group of fishes but different from genera Barbus and Acrossocheilus. The present study will be helpful for the evolution and conservation genetic studies of N. hexagonolepis. Copyright © 2015 Elsevier B.V. All rights reserved.
Dostie, Josée; Lemire, Edmond; Bouchard, Philippe; Field, Michael; Jones, Kristie; Lorenz, Birgit; Menten, Björn; Buysse, Karen; Pattyn, Filip; Friedli, Marc; Ucla, Catherine; Rossier, Colette; Wyss, Carine; Speleman, Frank; De Paepe, Anne; Dekker, Job; Antonarakis, Stylianos E.; De Baere, Elfride
2009-01-01
To date, the contribution of disrupted potentially cis-regulatory conserved non-coding sequences (CNCs) to human disease is most likely underestimated, as no systematic screens for putative deleterious variations in CNCs have been conducted. As a model for monogenic disease we studied the involvement of genetic changes of CNCs in the cis-regulatory domain of FOXL2 in blepharophimosis syndrome (BPES). Fifty-seven molecularly unsolved BPES patients underwent high-resolution copy number screening and targeted sequencing of CNCs. Apart from three larger distant deletions, a de novo deletion as small as 7.4 kb was found at 283 kb 5′ to FOXL2. The deletion appeared to be triggered by an H-DNA-induced double-stranded break (DSB). In addition, it disrupts a novel long non-coding RNA (ncRNA) PISRT1 and 8 CNCs. The regulatory potential of the deleted CNCs was substantiated by in vitro luciferase assays. Interestingly, Chromosome Conformation Capture (3C) of a 625 kb region surrounding FOXL2 in expressing cellular systems revealed physical interactions of three upstream fragments and the FOXL2 core promoter. Importantly, one of these contains the 7.4 kb deleted fragment. Overall, this study revealed the smallest distant deletion causing monogenic disease and impacts upon the concept of mutation screening in human disease and developmental disorders in particular. PMID:19543368
Li, Hu; Liu, Hui; Shi, Aimin; Štys, Pavel; Zhou, Xuguo; Cai, Wanzhi
2012-01-01
Many of true bugs are important insect pests to cultivated crops and some are important vectors of human diseases, but few cladistic analyses have addressed relationships among the seven infraorders of Heteroptera. The Enicocephalomorpha and Nepomorpha are consider the basal groups of Heteroptera, but the basal-most lineage remains unresolved. Here we report the mitochondrial genome of the unique-headed bug Stenopirates sp., the first mitochondrial genome sequenced from Enicocephalomorpha. The Stenopirates sp. mitochondrial genome is a typical circular DNA molecule of 15, 384 bp in length, and contains 37 genes and a large non-coding fragment. The gene order differs substantially from other known insect mitochondrial genomes, with rearrangements of both tRNA genes and protein-coding genes. The overall AT content (82.5%) of Stenopirates sp. is the highest among all the known heteropteran mitochondrial genomes. The strand bias is consistent with other true bugs with negative GC-skew and positive AT-skew for the J-strand. The heteropteran mitochondrial atp8 exhibits the highest evolutionary rate, whereas cox1 appears to have the lowest rate. Furthermore, a negative correlation was observed between the variation of nucleotide substitutions and the GC content of each protein-coding gene. A microsatellite was identified in the putative control region. Finally, phylogenetic reconstruction suggests that Enicocephalomorpha is the sister group to all the remaining Heteroptera. PMID:22235294
Beissinger, Timothy M.; Hirsch, Candice N.; Vaillancourt, Brieanne; Deshpande, Shweta; Barry, Kerrie; Buell, C. Robin; Kaeppler, Shawn M.; Gianola, Daniel; de Leon, Natalia
2014-01-01
A genome-wide scan to detect evidence of selection was conducted in the Golden Glow maize long-term selection population. The population had been subjected to selection for increased number of ears per plant for 30 generations, with an empirically estimated effective population size ranging from 384 to 667 individuals and an increase of more than threefold in the number of ears per plant. Allele frequencies at >1.2 million single-nucleotide polymorphism loci were estimated from pooled whole-genome resequencing data, and FST values across sliding windows were employed to assess divergence between the population preselection and the population postselection. Twenty-eight highly divergent regions were identified, with half of these regions providing gene-level resolution on potentially selected variants. Approximately 93% of the divergent regions do not demonstrate a significant decrease in heterozygosity, which suggests that they are not approaching fixation. Also, most regions display a pattern consistent with a soft-sweep model as opposed to a hard-sweep model, suggesting that selection mostly operated on standing genetic variation. For at least 25% of the regions, results suggest that selection operated on variants located outside of currently annotated coding regions. These results provide insights into the underlying genetic effects of long-term artificial selection and identification of putative genetic elements underlying number of ears per plant in maize. PMID:24381334
Kapanadze, B; Kashuba, V; Baranova, A; Rasool, O; van Everdink, W; Liu, Y; Syomov, A; Corcoran, M; Poltaraus, A; Brodyansky, V; Syomova, N; Kazakov, A; Ibbotson, R; van den Berg, A; Gizatullin, R; Fedorova, L; Sulimova, G; Zelenin, A; Deaven, L; Lehrach, H; Grander, D; Buys, C; Oscier, D; Zabarovsky, E R; Einhorn, S; Yankovsky, N
1998-04-17
B-cell chronic lymphocytic leukemia (B-CLL) is a human hematological neoplastic disease often associated with the loss of a chromosome 13 region between RB1 gene and locus D13S25. A new tumor suppressor gene (TSG) may be located in the region. A cosmid contig has been constructed between the loci D13S1168 (WI9598) and D13S25 (H2-42), which corresponds to the minimal region shared by B-CLL associated deletions. The contig includes more than 200 LANL and ICRF cosmid clones covering 620 kb. Three cDNAs likely corresponding to three different genes have been found in the minimally deleted region, sequenced and mapped against the contigged cosmids. cDNA clone 10k4 as well as a chimeric clone 13g3, codes for a zinc-finger domain of the RING type and shares homology to some known genes involved in tumorigenesis (RET finger protein, BRCA1) and embryogenesis (MID1). We have termed the gene corresponding to 10k4/13g3 clones LEU5. This is the first gene with homology to known TSGs which has been found in the region of B-CLL rearrangements.
2012-01-01
Background Epinotia aporema (Lepidoptera: Tortricidae) is an important pest of legume crops in South America. Epinotia aporema granulovirus (EpapGV) is a baculovirus that causes a polyorganotropic infection in the host larva. Its high pathogenicity and host specificity make EpapGV an excellent candidate to be used as a biological control agent. Results The genome of Epinotia aporema granulovirus (EpapGV) was sequenced and analyzed. Its circular double-stranded DNA genome is 119,082 bp in length and codes for 133 putative genes. It contains the 31 baculovirus core genes and a set of 19 genes that are GV exclusive. Seventeen ORFs were unique to EpapGV in comparison with other baculoviruses. Of these, 16 found no homologues in GenBank, and one encoded a thymidylate kinase. Analysis of nucleotide sequence repeats revealed the presence of 16 homologous regions (hrs) interspersed throughout the genome. Each hr was characterized by the presence of 1 to 3 clustered imperfect palindromes which are similar to previously described palindromes of tortricid-specific GVs. Also, one of the hrs (hr4) has flanking sequences suggestive of a putative non-hr ori. Interestingly, two more complex hrs were found in opposite loci, dividing the circular dsDNA genome in two halves. Gene synteny maps showed the great colinearity of sequenced GVs, being EpapGV the most dissimilar as it has a 20 kb-long gene block inversion. Phylogenetic study performed with 31 core genes of 58 baculoviral genomes suggests that EpapGV is the baculovirus isolate closest to the putative common ancestor of tortricid specific betabaculoviruses. Conclusions This study, along with previous characterization of EpapGV infection, is useful for the better understanding of the pathology caused by this virus and its potential utilization as a bioinsecticide. PMID:23051685
Jonckheere, Wim; Dermauw, Wannes; Zhurov, Vladimir; Wybouw, Nicky; Van den Bulcke, Jan; Villarroel, Carlos A; Greenhalgh, Robert; Grbić, Mike; Schuurink, Rob C; Tirry, Luc; Baggerman, Geert; Clark, Richard M; Kant, Merijn R; Vanholme, Bartel; Menschaert, Gerben; Van Leeuwen, Thomas
2016-12-01
The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants-bean, maize, soy, and tomato-was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belonging to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Jonckheere, Wim; Zhurov, Vladimir; Villarroel, Carlos A.; Greenhalgh, Robert; Grbić, Mike; Schuurink, Rob C.; Tirry, Luc; Kant, Merijn R.; Vanholme, Bartel
2016-01-01
The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants—bean, maize, soy, and tomato—was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belonging to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily. PMID:27703040
Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.
Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui
2013-12-01
MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and rootstock-scion interaction process. Moreover, MAPK and MAPKK genes were performed expression profile analyses in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple. © 2013.
Schwientek, Patrick; Neshat, Armin; Kalinowski, Jörn; Klein, Andreas; Rückert, Christian; Schneiker-Bekel, Susanne; Wendler, Sergej; Stoye, Jens; Pühler, Alfred
2014-11-20
Actinoplanes sp. SE50/110 is the producer of the alpha-glucosidase inhibitor acarbose, which is an economically relevant and potent drug in the treatment of type-2 diabetes mellitus. In this study, we present the detection of transcription start sites on this genome by sequencing enriched 5'-ends of primary transcripts. Altogether, 1427 putative transcription start sites were initially identified. With help of the annotated genome sequence, 661 transcription start sites were found to belong to the leader region of protein-coding genes with the surprising result that roughly 20% of these genes rank among the class of leaderless transcripts. Next, conserved promoter motifs were identified for protein-coding genes with and without leader sequences. The mapped transcription start sites were finally used to improve the annotation of the Actinoplanes sp. SE50/110 genome sequence. Concerning protein-coding genes, 41 translation start sites were corrected and 9 novel protein-coding genes could be identified. In addition to this, 122 previously undetermined non-coding RNA (ncRNA) genes of Actinoplanes sp. SE50/110 were defined. Focusing on antisense transcription start sites located within coding genes or their leader sequences, it was discovered that 96 of those ncRNA genes belong to the class of antisense RNA (asRNA) genes. The remaining 26 ncRNA genes were found outside of known protein-coding genes. Four chosen examples of prominent ncRNA genes, namely the transfer messenger RNA gene ssrA, the ribonuclease P class A RNA gene rnpB, the cobalamin riboswitch RNA gene cobRS, and the selenocysteine-specific tRNA gene selC, are presented in more detail. This study demonstrates that sequencing of enriched 5'-ends of primary transcripts and the identification of transcription start sites are valuable tools for advanced genome annotation of Actinoplanes sp. SE50/110 and most probably also for other bacteria. Copyright © 2014 Elsevier B.V. All rights reserved.
de Souza, C R; Aragão, F J; Moreira, E C O; Costa, C N M; Nascimento, S B; Carvalho, L J
2009-03-24
Cassava is one of the most important tropical food crops for more than 600 million people worldwide. Transgenic technologies can be useful for increasing its nutritional value and its resistance to viral diseases and insect pests. However, tissue-specific promoters that guarantee correct expression of transgenes would be necessary. We used inverse polymerase chain reaction to isolate a promoter sequence of the Mec1 gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in cassava storage roots. In silico analysis revealed putative cis-acting regulatory elements within this promoter sequence, including root-specific elements that may be required for its expression in vascular tissues. Transient expression experiments showed that the Mec1 promoter is functional, since this sequence was able to drive GUS expression in bean embryonic axes. Results from our computational analysis can serve as a guide for functional experiments to identify regions with tissue-specific Mec1 promoter activity. The DNA sequence that we identified is a new promoter that could be a candidate for genetic engineering of cassava roots.
Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Sinha, Pallavi; Kale, Sandip M; Parupalli, Swathi; Kumar, Vinay; Chitikineni, Annapurna; Vechalapu, Suryanarayana; Sameer Kumar, Chanda Venkata; Sharma, Mamta; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Muniswamy, Sonnappa; Varshney, Rajeev K
2017-07-01
Identification of candidate genomic regions associated with target traits using conventional mapping methods is challenging and time-consuming. In recent years, a number of single nucleotide polymorphism (SNP)-based mapping approaches have been developed and used for identification of candidate/putative genomic regions. However, in the majority of these studies, insertion-deletion (Indel) were largely ignored. For efficient use of Indels in mapping target traits, we propose Indel-seq approach, which is a combination of whole-genome resequencing (WGRS) and bulked segregant analysis (BSA) and relies on the Indel frequencies in extreme bulks. Deployment of Indel-seq approach for identification of candidate genomic regions associated with fusarium wilt (FW) and sterility mosaic disease (SMD) resistance in pigeonpea has identified 16 Indels affecting 26 putative candidate genes. Of these 26 affected putative candidate genes, 24 genes showed effect in the upstream/downstream of the genic region and two genes showed effect in the genes. Validation of these 16 candidate Indels in other FW- and SMD-resistant and FW- and SMD-susceptible genotypes revealed a significant association of five Indels (three for FW and two for SMD resistance). Comparative analysis of Indel-seq with other genetic mapping approaches highlighted the importance of the approach in identification of significant genomic regions associated with target traits. Therefore, the Indel-seq approach can be used for quick and precise identification of candidate genomic regions for any target traits in any crop species. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Bai, Wen L; Zhao, Su J; Wang, Ze Y; Zhu, Yu B; Dang, Yun L; Cong, Yu Y; Xue, Hui L; Wang, Wei; Deng, Liang; Guo, Dan; Wang, Shi Q; Zhu, Yan X; Yin, Rong H
2018-07-03
Long noncoding RNAs (lncRNAs) are a novel class of eukaryotic transcripts. They are thought to act as a critical regulator of protein-coding gene expression. Herein, we identified and characterized 13 putative lncRNAs from the expressed sequence tags from secondary hair follicle of Cashmere goat. Furthermore, we investigated their transcriptional pattern in secondary hair follicle of Liaoning Cashmere goat during telogen and anagen phases. Also, we generated intracellular regulatory networks of upregulated lncRNAs at anagen in Wnt signaling pathway based on bioinformatics analysis. The relative expression of six putative lncRNAs (lncRNA-599618, -599556, -599554, -599547, -599531, and -599509) at the anagen phase is significantly higher than that at telogen. Compared with anagen, the relative expression of four putative lncRNAs (lncRNA-599528, -599518, -599511, and -599497) was found to be significantly upregulated at telogen phase. The network generated showed that a rich and complex regulatory relationship of the putative lncRNAs and related miRNAs with their target genes in Wnt signaling pathway. Our results from the present study provided a foundation for further elucidating the functional and regulatory mechanisms of these putative lncRNAs in the development of secondary hair follicle and cashmere fiber growth of Cashmere goat.
Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma
2015-01-01
Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. PMID:25838486
Permuth-Wey, Jennifer; Lawrenson, Kate; Shen, Howard C.; Velkova, Aneliya; Tyrer, Jonathan P.; Chen, Zhihua; Lin, Hui-Yi; Chen, Y. Ann; Tsai, Ya-Yu; Qu, Xiaotao; Ramus, Susan J.; Karevan, Rod; Lee, Janet; Lee, Nathan; Larson, Melissa C.; Aben, Katja K.; Anton-Culver, Hoda; Antonenkova, Natalia; Antoniou, Antonis; Armasu, Sebastian M.; Bacot, François; Baglietto, Laura; Bandera, Elisa V.; Barnholtz-Sloan, Jill; Beckmann, Matthias W.; Birrer, Michael J.; Bloom, Greg; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Brown, Robert; Butzow, Ralf; Cai, Qiuyin; Campbell, Ian; Chang-Claude, Jenny; Chanock, Stephen; Chenevix-Trench, Georgia; Cheng, Jin Q.; Cicek, Mine S.; Coetzee, Gerhard A.; Cook, Linda S.; Couch, Fergus J.; Cramer, Daniel W.; Cunningham, Julie M.; Dansonka-Mieszkowska, Agnieszka; Despierre, Evelyn; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Easton, Douglas F; Eccles, Diana; Edwards, Robert; Ekici, Arif B.; Fasching, Peter A.; Fenstermacher, David A.; Flanagan, James M.; Garcia-Closas, Montserrat; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind M.; Gonzalez-Bosquet, Jesus; Goodman, Marc T.; Gore, Martin; Górski, Bohdan; Gronwald, Jacek; Hall, Per; Halle, Mari K.; Harter, Philipp; Heitz, Florian; Hillemanns, Peter; Hoatlin, Maureen; Høgdall, Claus K.; Høgdall, Estrid; Hosono, Satoyo; Jakubowska, Anna; Jensen, Allan; Jim, Heather; Kalli, Kimberly R.; Karlan, Beth Y.; Kaye, Stanley B.; Kelemen, Linda E.; Kiemeney, Lambertus A.; Kikkawa, Fumitaka; Konecny, Gottfried E.; Krakstad, Camilla; Kjaer, Susanne Krüger; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Lancaster, Johnathan M.; Le, Nhu D.; Leminen, Arto; Levine, Douglas A.; Liang, Dong; Lim, Boon Kiong; Lin, Jie; Lissowska, Jolanta; Lu, Karen H.; Lubiński, Jan; Lurie, Galina; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B.; Nakanishi, Toru; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Nickels, Stefan; Noushmehr, Houtan; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Paul, James; Pearce, Celeste L; Pejovic, Tanja; Pelttari, Liisa M.; Pike, Malcolm C.; Poole, Elizabeth M.; Raska, Paola; Renner, Stefan P.; Risch, Harvey A.; Rodriguez-Rodriguez, Lorna; Rossing, Mary Anne; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schwaab, Ira; Severi, Gianluca; Shridhar, Vijayalakshmi; Shu, Xiao-Ou; Shvetsov, Yurii B.; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Spiewankiewicz, Beata; Stram, Daniel; Sutphen, Rebecca; Teo, Soo-Hwang; Terry, Kathryn L.; Tessier, Daniel C.; Thompson, Pamela J.; Tworoger, Shelley S.; van Altena, Anne M.; Vergote, Ignace; Vierkant, Robert A.; Vincent, Daniel; Vitonis, Allison F.; Wang-Gohrke, Shan; Weber, Rachel Palmieri; Wentzensen, Nicolas; Whittemore, Alice S.; Wik, Elisabeth; Wilkens, Lynne R.; Winterhoff, Boris; Woo, Yin Ling; Wu, Anna H.; Xiang, Yong-Bing; Yang, Hannah P.; Zheng, Wei; Ziogas, Argyrios; Zulkifli, Famida; Phelan, Catherine M.; Iversen, Edwin; Schildkraut, Joellen M.; Berchuck, Andrew; Fridley, Brooke L.; Goode, Ellen L.; Pharoah, Paul D. P.; Monteiro, Alvaro N.A.; Sellers, Thomas A.; Gayther, Simon A.
2013-01-01
Epithelial ovarian cancer (EOC) has a heritable component that remains to be fully characterized. Most identified common susceptibility variants lie in non-protein-coding sequences. We hypothesized that variants in the 3′ untranslated region at putative microRNA (miRNA) binding sites represent functional targets that influence EOC susceptibility. Here, we evaluate the association between 767 miRNA binding site single nucleotide polymorphisms (miRSNPs) and EOC risk in 18,174 EOC cases and 26,134 controls from 43 studies genotyped through the Collaborative Oncological Gene-environment Study. We identify several miRSNPs associated with invasive serous EOC risk (OR=1.12, P=10−8) mapping to an inversion polymorphism at 17q21.31. Additional genotyping of non-miRSNPs at 17q21.31 reveals stronger signals outside the inversion (P=10−10). Variation at 17q21.31 associates with neurological diseases, and our collaboration is the first to report an association with EOC susceptibility. An integrated molecular analysis in this region provides evidence for ARHGAP27 and PLEKHM1 as candidate EOC susceptibility genes. PMID:23535648
Signatures of selection in tilapia revealed by whole genome resequencing.
Xia, Jun Hong; Bai, Zhiyi; Meng, Zining; Zhang, Yong; Wang, Le; Liu, Feng; Jing, Wu; Wan, Zi Yi; Li, Jiale; Lin, Haoran; Yue, Gen Hua
2015-09-16
Natural selection and selective breeding for genetic improvement have left detectable signatures within the genome of a species. Identification of selection signatures is important in evolutionary biology and for detecting genes that facilitate to accelerate genetic improvement. However, selection signatures, including artificial selection and natural selection, have only been identified at the whole genome level in several genetically improved fish species. Tilapia is one of the most important genetically improved fish species in the world. Using next-generation sequencing, we sequenced the genomes of 47 tilapia individuals. We identified a total of 1.43 million high-quality SNPs and found that the LD block sizes ranged from 10-100 kb in tilapia. We detected over a hundred putative selective sweep regions in each line of tilapia. Most selection signatures were located in non-coding regions of the tilapia genome. The Wnt signaling, gonadotropin-releasing hormone receptor and integrin signaling pathways were under positive selection in all improved tilapia lines. Our study provides a genome-wide map of genetic variation and selection footprints in tilapia, which could be important for genetic studies and accelerating genetic improvement of tilapia.
LHD1, an allele of DTH8/Ghd8, controls late heading date in common wild rice (Oryza rufipogon).
Dai, Xiaodong; Ding, Younian; Tan, Lubin; Fu, Yongcai; Liu, Fengxia; Zhu, Zuofeng; Sun, Xianyou; Sun, Xuewen; Gu, Ping; Cai, Hongwei; Sun, Chuanqing
2012-10-01
Flowering at suitable time is very important for plants to adapt to complicated environments and produce their seeds successfully for reproduction. In rice (Oryza rufipogon Griff.) photoperiod regulation is one of the important factors for controlling heading date. Common wild rice, the ancestor of cultivated rice, exhibits a late heading date and a more sensitive photoperiodic response than cultivated rice. Here, through map-based cloning, we identified a major quantitative trait loci (QTL) LHD1 (Late Heading Date 1), an allele of DTH8/Ghd8, which controls the late heading date of wild rice and encodes a putative HAP3/NF-YB/CBF-A subunit of the CCAAT-box-binding transcription factor. Sequence analysis revealed that several variants in the coding region of LHD1 were correlated with a late heading date, and a further complementary study successfully rescued the phenotype. These results suggest that a functional site for LHD1 could be among those variants present in the coding region. We also found that LHD1 could down-regulate the expression of several floral transition activators such as Ehd1, Hd3a and RFT1 under long-day conditions, but not under short-day conditions. This indicates that LHD1 may delay flowering by repressing the expression of Ehd1, Hd3a and RFT1 under long-day conditions. © 2012 Institute of Botany, Chinese Academy of Sciences.
dos Reis, Sávio Pinho; Tavares, Liliane de Souza Conceição; Costa, Carinne de Nazaré Monteiro; Brígida, Aílton Borges Santa; de Souza, Cláudia Regina Batista
2012-06-01
Cassava (Manihot esculenta Crantz) is one of the world's most important food crops. It is cultivated mainly in developing countries of tropics, since its root is a major source of calories for low-income people due to its high productivity and resistance to many abiotic and biotic factors. A previous study has identified a partial cDNA sequence coding for a putative RING zinc finger in cassava storage root. The RING zinc finger protein is a specialized type of zinc finger protein found in many organisms. Here, we isolated the full-length cDNA sequence coding for M. esculenta RZF (MeRZF) protein by a combination of 5' and 3' RACE assays. BLAST analysis showed that its deduced amino acid sequence has a high level of similarity to plant proteins of RZF family. MeRZF protein contains a signature sequence motif for a RING zinc finger at its C-terminal region. In addition, this protein showed a histidine residue at the fifth coordination site, likely belonging to the RING-H2 subgroup, as confirmed by our phylogenetic analysis. There is also a transmembrane domain in its N-terminal region. Finally, semi-quantitative RT-PCR assays showed that MeRZF expression is increased in detached leaves treated with sodium chloride. Here, we report the first evidence of a RING zinc finger gene of cassava showing potential role in response to salt stress.
Takahata, Satoshi; Yago, Takumi; Iwabuchi, Keisuke; Hirakawa, Hideki; Suzuki, Yutaka; Onodera, Yasuyuki
2016-01-01
Spinach (Spinacia oleracea, 2n = 12) and sugar beet (Beta vulgaris, 2n = 18) are important crop members of the family Chenopodiaceae ss Sugar beet has a basic chromosome number of 9 and a cosexual breeding system, as do most members of the Chenopodiaceae ss. family. By contrast, spinach has a basic chromosome number of 6 and, although certain cultivars and genotypes produce monoecious plants, is considered to be a dioecious species. The loci determining male and monoecious sexual expression were mapped to different loci on the spinach sex chromosomes. In this study, a linkage map with 46 mapped protein-coding sequences was constructed for the spinach sex chromosomes. Comparison of the linkage map with a reference genome sequence of sugar beet revealed that the spinach sex chromosomes exhibited extensive synteny with sugar beet chromosomes 4 and 9. Tightly linked protein-coding genes linked to the male-determining locus in spinach corresponded to genes located in or around the putative pericentromeric and centromeric regions of sugar beet chromosomes 4 and 9, supporting the observation that recombination rates were low in the vicinity of the male-determining locus. The locus for monoecism was confined to a chromosomal segment corresponding to a region of approximately 1.7Mb on sugar beet chromosome 9, which may facilitate future positional cloning of the locus. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Tramontano, A; Macchiato, M F
1986-01-01
An algorithm to determine the probability that a reading frame codifies for a protein is presented. It is based on the results of our previous studies on the thermodynamic characteristics of a translated reading frame. We also develop a prediction procedure to distinguish between coding and non-coding reading frames. The procedure is based on the characteristics of the putative product of the DNA sequence and not on periodicity characteristics of the sequence, so the prediction is not biased by the presence of overlapping translated reading frames or by the presence of translated reading frames on the complementary DNA strand. PMID:3753761
Pauciullo, Alfredo; Erhardt, Georg
2015-01-01
In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at transcriptomic and genetic level. A total of 288 casein clones transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%) and they encoded for mature proteins of 215, 217, 187 and 162 amino acids respectively for the CSN1S1, CSN2, CSN1S2 and CSN3 genes. The exonic subdivision evidenced a structure made of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes respectively. Exon skipping and duplication events were evidenced. Two variants A and B were identified in the αs1-casein gene as result of the alternative out-splicing of the exon 18. An additional exon coding for a novel esapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by the comparison with the Camelus dromedaries sequence. A total of 28 putative phosphorylated motifs highlighted a complex heterogeneity and a potential variable degree of post-translational modifications. Ninety-six polymorphic sites were found through the comparison of the lama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5’- and 3’-regulatory regions allowed to identify the main putative consensus sequences involved in the casein genes expression, thus opening the way to new investigations -so far- never achieved in this species. PMID:25923814
Pauciullo, Alfredo; Erhardt, Georg
2015-01-01
In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at transcriptomic and genetic level. A total of 288 casein clones transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%) and they encoded for mature proteins of 215, 217, 187 and 162 amino acids respectively for the CSN1S1, CSN2, CSN1S2 and CSN3 genes. The exonic subdivision evidenced a structure made of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes respectively. Exon skipping and duplication events were evidenced. Two variants A and B were identified in the αs1-casein gene as result of the alternative out-splicing of the exon 18. An additional exon coding for a novel esapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by the comparison with the Camelus dromedaries sequence. A total of 28 putative phosphorylated motifs highlighted a complex heterogeneity and a potential variable degree of post-translational modifications. Ninety-six polymorphic sites were found through the comparison of the lama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5'- and 3'-regulatory regions allowed to identify the main putative consensus sequences involved in the casein genes expression, thus opening the way to new investigations -so far- never achieved in this species.
Systematic analysis and evolution of 5S ribosomal DNA in metazoans.
Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M
2013-11-01
Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12,766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades.
Systematic analysis and evolution of 5S ribosomal DNA in metazoans
Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M
2013-01-01
Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12 766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades. PMID:23838690
Paal, Jürgen; Henselewski, Heike; Muth, Jost; Meksem, Khalid; Menéndez, Cristina M; Salamini, Francesco; Ballvora, Agim; Gebhardt, Christiane
2004-04-01
The endoparasitic root cyst nematode Globodera rostochiensis causes considerable damage in potato cultivation. In the past, major genes for nematode resistance have been introgressed from related potato species into cultivars. Elucidating the molecular basis of resistance will contribute to the understanding of nematode-plant interactions and assist in breeding nematode-resistant cultivars. The Gro1 resistance locus to G. rostochiensis on potato chromosome VII co-localized with a resistance-gene-like (RGL) DNA marker. This marker was used to isolate from genomic libraries 15 members of a closely related candidate gene family. Analysis of inheritance, linkage mapping, and sequencing reduced the number of candidate genes to three. Complementation analysis by stable potato transformation showed that the gene Gro1-4 conferred resistance to G. rostochiensis pathotype Ro1. Gro1-4 encodes a protein of 1136 amino acids that contains Toll-interleukin 1 receptor (TIR), nucleotide-binding (NB), leucine-rich repeat (LRR) homology domains and a C-terminal domain with unknown function. The deduced Gro1-4 protein differed by 29 amino acid changes from susceptible members of the Gro1 gene family. Sequence characterization of 13 members of the Gro1 gene family revealed putative regulatory elements and a variable microsatellite in the promoter region, insertion of a retrotransposon-like element in the first intron, and a stop codon in the NB coding region of some genes. Sequence analysis of RT-PCR products showed that Gro1-4 is expressed, among other members of the family including putative pseudogenes, in non-infected roots of nematode-resistant plants. RT-PCR also demonstrated that members of the Gro1 gene family are expressed in most potato tissues.
Almeida, Tânia; Menéndez, Esther; Capote, Tiago; Ribeiro, Teresa; Santos, Conceição; Gonçalves, Sónia
2013-01-15
The molecular processes associated with cork development in Quercus suber L. are poorly understood. A previous molecular approach identified a list of genes potentially important for cork formation and differentiation, providing a new basis for further molecular studies. This report is the first molecular characterization of one of these candidate genes, QsMYB1, coding for an R2R3-MYB transcription factor. The R2R3-MYB gene sub-family has been described as being involved in the phenylpropanoid and lignin pathways, both involved in cork biosynthesis. The results showed that the expression of QsMYB1 is putatively mediated by an alternative splicing (AS) mechanism that originates two different transcripts (QsMYB1.1 and QsMYB1.2), differing only in the 5'-untranslated region, due to retention of the first intron in one of the variants. Moreover, within the retained intron, a simple sequence repeat (SSR) was identified. The upstream regulatory region of QsMYB1 was extended by a genome walking approach, which allowed the identification of the putative gene promoter region. The relative expression pattern of QsMYB1 transcripts determined by reverse transcription quantitative polymerase chain reaction (RT-qPCR) revealed that both transcripts were up-regulated in cork tissues; the detected expression was several times higher in newly formed cork harvested from trees producing virgin, second or reproduction cork when compared with wood. Moreover, the expression analysis of QsMYB1 in several Q. suber organs showed very low expression in young branches and roots, whereas in leaves, immature acorns or male flowers, no expression was detected. These preliminary results suggest that QsMYB1 may be related to secondary growth and, in particular, with the cork biosynthesis process with a possible alternative splicing mechanism associated with its regulatory function. Copyright © 2012 Elsevier GmbH. All rights reserved.
CisMiner: Genome-Wide In-Silico Cis-Regulatory Module Prediction by Fuzzy Itemset Mining
Navarro, Carmen; Lopez, Francisco J.; Cano, Carlos; Garcia-Alcalde, Fernando; Blanco, Armando
2014-01-01
Eukaryotic gene control regions are known to be spread throughout non-coding DNA sequences which may appear distant from the gene promoter. Transcription factors are proteins that coordinately bind to these regions at transcription factor binding sites to regulate gene expression. Several tools allow to detect significant co-occurrences of closely located binding sites (cis-regulatory modules, CRMs). However, these tools present at least one of the following limitations: 1) scope limited to promoter or conserved regions of the genome; 2) do not allow to identify combinations involving more than two motifs; 3) require prior information about target motifs. In this work we present CisMiner, a novel methodology to detect putative CRMs by means of a fuzzy itemset mining approach able to operate at genome-wide scale. CisMiner allows to perform a blind search of CRMs without any prior information about target CRMs nor limitation in the number of motifs. CisMiner tackles the combinatorial complexity of genome-wide cis-regulatory module extraction using a natural representation of motif combinations as itemsets and applying the Top-Down Fuzzy Frequent- Pattern Tree algorithm to identify significant itemsets. Fuzzy technology allows CisMiner to better handle the imprecision and noise inherent to regulatory processes. Results obtained for a set of well-known binding sites in the S. cerevisiae genome show that our method yields highly reliable predictions. Furthermore, CisMiner was also applied to putative in-silico predicted transcription factor binding sites to identify significant combinations in S. cerevisiae and D. melanogaster, proving that our approach can be further applied genome-wide to more complex genomes. CisMiner is freely accesible at: http://genome2.ugr.es/cisminer. CisMiner can be queried for the results presented in this work and can also perform a customized cis-regulatory module prediction on a query set of transcription factor binding sites provided by the user. PMID:25268582
Liu, Li; Venkatesh, Jelli; Jo, Yeong Deuk; Koeda, Sota; Hosokawa, Munetaka; Kang, Jin-Ho; Goritschnig, Sandra; Kang, Byoung-Cheorl
2016-08-01
The sy - 2 temperature-sensitive gene from Capsicum chinense was fine mapped to a 138.8-kb region at the distal portion of pepper chromosome 1. Based on expression analyses, two putative F-box genes were identified as sy - 2 candidate genes. Seychelles-2 ('sy-2') is a temperature-sensitive natural mutant of Capsicum chinense, which exhibits an abnormal leaf phenotype when grown at temperatures below 24 °C. We previously showed that the sy-2 phenotype is controlled by a single recessive gene, sy-2, located on pepper chromosome 1. In this study, a high-resolution genetic and physical map for the sy-2 locus was constructed using two individual F2 mapping populations derived from a cross between C. chinense mutant 'sy-2' and wild-type 'No. 3341'. The sy-2 gene was fine mapped to a 138.8-kb region between markers SNP 5-5 and SNP 3-8 at the distal portion of chromosome 1, based on comparative genomic analysis and genomic information from pepper. The sy-2 target region was predicted to contain 27 genes. Expression analysis of these predicted genes showed a differential expression pattern for ORF10 and ORF20 between mutant and wild-type plants; with both having significantly lower expression in 'sy-2' than in wild-type plants. In addition, the coding sequences of both ORF10 and ORF20 contained single nucleotide polymorphisms (SNPs) causing amino acid changes, which may have important functional consequences. ORF10 and ORF20 are predicted to encode F-box proteins, which are components of the SCF complex. Based on the differential expression pattern and the presence of nonsynonymous SNPs, we suggest that these two putative F-box genes are most likely responsible for the temperature-sensitive phenotypes in pepper. Further investigation of these genes may enable a better understanding of the molecular mechanisms of low temperature sensitivity in plants.
Doyle, Jacqueline M.; Katzner, Todd E.; Roemer, Gary; Cain, James W.; Millsap, Brian; McIntyre, Carol; Sonsthagen, Sarah A.; Fernandez, Nadia B.; Wheeler, Maria; Bulut, Zafer; Bloom, Peter; DeWoody, J. Andrew
2016-01-01
Molecular markers can reveal interesting aspects of organismal ecology and evolution, especially when surveyed in rare or elusive species. Herein, we provide a preliminary assessment of golden eagle (Aquila chrysaetos) population structure in North America using novel single nucleotide polymorphisms (SNPs). These SNPs included one molecular sexing marker, two mitochondrial markers, 85 putatively neutral markers that were derived from noncoding regions within large intergenic intervals, and 74 putatively nonneutral markers found in or very near protein-coding genes. We genotyped 523 eagle samples at these 162 SNPs and quantified genotyping error rates and variability at each marker. Our samples corresponded to 344 individual golden eagles as assessed by unique multilocus genotypes. Observed heterozygosity of known adults was significantly higher than of chicks, as was the number of heterozygous loci, indicating that mean zygosity measured across all 159 autosomal markers was an indicator of fitness as it is associated with eagle survival to adulthood. Finally, we used chick samples of known provenance to test for population differentiation across portions of North America and found pronounced structure among geographic sampling sites. These data indicate that cryptic genetic population structure is likely widespread in the golden eagle gene pool, and that extensive field sampling and genotyping will be required to more clearly delineate management units within North America and elsewhere.
Song, Xuhao; Shen, Fujun; Huang, Jie; Huang, Yan; Du, Lianming; Wang, Chengdong; Fan, Zhenxin; Hou, Rong; Yue, Bisong; Zhang, Xiuyue
2016-09-01
Recently, an increasing number of microsatellites or simple sequence repeats (SSRs) have been found and characterized from transcriptomes. Such SSRs can be employed as putative functional markers to easily tag corresponding genes, which play an important role in biomedical studies and genetic analysis. However, the transcriptome-derived SSRs for giant panda (Ailuropoda melanoleuca) are not yet available. In this work, we identified and characterized 20 tetranucleotide microsatellite loci from a transcript database generated from the blood of giant panda. Furthermore, we assigned their predicted transcriptome locations: 16 loci were assigned to untranslated regions (UTRs) and 4 loci were assigned to coding regions (CDSs). Gene identities of 14 transcripts contained corresponding microsatellites were determined, which provide useful information to study the potential contribution of SSRs to gene regulation in giant panda. The polymorphic information content (PIC) values ranged from 0.293 to 0.789 with an average of 0.603 for the 16 UTRs-derived SSRs. Interestingly, 4 CDS-derived microsatellites developed in our study were also polymorphic, and the instability of these 4 CDS-derived SSRs was further validated by re-genotyping and sequencing. The genes containing these 4 CDS-derived SSRs were embedded with various types of repeat motifs. The interaction of all the length-changing SSRs might provide a way against coding region frameshift caused by microsatellite instability. We hope these newly gene-associated biomarkers will pave the way for genetic and biomedical studies for giant panda in the future. In sum, this set of transcriptome-derived markers complements the genetic resources available for giant panda. © The American Genetic Association. 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Evidence for multiple, distinct representations of the human body.
Schwoebel, John; Coslett, H Branch
2005-04-01
Previous data from single-case and small group studies have suggested distinctions among structural, conceptual, and online sensorimotor representations of the human body. We developed a battery of tasks to further examine the prevalence and anatomic substrates of these body representations. The battery was administered to 70 stroke patients. Fifty-one percent of the patients were impaired relative to controls on at least one body representation measure. Further, principal components analysis of the patient data as well as direct comparisons of patient and control performance suggested a triple dissociation between measures of the 3 putative body representations. Consistent with previous distinctions between the "what" and "how" pathways, lesions of the left temporal lobe were most consistently associated with impaired performance on tasks assessing knowledge of the shape or lexical-semantic information about the body, whereas lesions of the dorsolateral frontal and parietal regions resulted in impaired performance on tasks requiring on-line coding of body posture.
The complete mitogenome of brown trout (Salmo trutta fario) and its phylogeny.
Sahoo, Prabhati K; Singh, Lalit; Sharma, Lata; Kumar, Rohit; Singh, Vijay K; Ali, S; Singh, Atul K; Barat, Ashoktaru
2016-11-01
The complete mitochondrial genome of Salmo trutta fario, commonly known as brown trout, was sequenced using NGS technology. The mitochondrial genome size was determined to be 16 677 bp and composed of 13 protein-coding gene (PCG), 22 tRNAs, 2 rRNA genes, and 1 putative control region. The overall mitogenome composition of S. trutta fario is A: 28.13%, G: 16.44%, C: 29.47%, and T: 25.96% with A + T content of 54.09% and G + C content of 45.91%. The gene arrangement and the order are similar to other vertebrates. The phylogenetic tree constructed using 42 complete mitogenomes of Salmonidae fishes confirmed the position of the present species under the genus Salmo of subfamily Salmoninae. NGS platform was proved to be a rapid and time-saving technology to reveal complete mitogenomes.
Picornavirus Modification of a Host mRNA Decay Protein
Rozovics, Janet M.; Chase, Amanda J.; Cathcart, Andrea L.; Chou, Wayne; Gershon, Paul D.; Palusa, Saiprasad; Wilusz, Jeffrey; Semler, Bert L.
2012-01-01
ABSTRACT Due to the limited coding capacity of picornavirus genomic RNAs, host RNA binding proteins play essential roles during viral translation and RNA replication. Here we describe experiments suggesting that AUF1, a host RNA binding protein involved in mRNA decay, plays a role in the infectious cycle of picornaviruses such as poliovirus and human rhinovirus. We observed cleavage of AUF1 during poliovirus or human rhinovirus infection, as well as interaction of this protein with the 5′ noncoding regions of these viral genomes. Additionally, the picornavirus proteinase 3CD, encoded by poliovirus or human rhinovirus genomic RNAs, was shown to cleave all four isoforms of recombinant AUF1 at a specific N-terminal site in vitro. Finally, endogenous AUF1 was found to relocalize from the nucleus to the cytoplasm in poliovirus-infected HeLa cells to sites adjacent to (but distinct from) putative viral RNA replication complexes. PMID:23131833
Habenicht, A; Quesada, A; Cerff, R
1997-10-01
A cDNA-library has been constructed from Nicotiana plumbaginifolia seedlings, and the non-phosphorylating glyceraldehyde-3-phosphate dehydrogenase (GapN, EC 1.2.1.9) was isolated by plaque hybridization using the cDNA from pea as a heterologous probe. The cDNA comprises the entire GapN coding region. A putative polyadenylation signal is identified. Phylogenetic analysis based on the deduced amino acid sequences revealed that the GapN gene family represents a separate ancient branch within the aldehyde dehydrogenase superfamily. It can be shown that the GapN gene family and other distinct branches of the superfamily have its phylogenetic origin before the separation of primary life-forms. This further demonstrates that already very early in evolution, a broad diversification of the aldehyde dehydrogenases led to the formation of the superfamily.
PUTATIVE GENE PROMOTER SEQUENCES IN THE CHLORELLA VIRUSES
Fitzgerald, Lisa A.; Boucher, Philip T.; Yanai-Balser, Giane; Suhre, Karsten; Graves, Michael V.; Van Etten, James L.
2008-01-01
Three short (7 to 9 nucleotides) highly conserved nucleotide sequences were identified in the putative promoter regions (150 bp upstream and 50 bp downstream of the ATG translation start site) of three members of the genus Chlorovirus, family Phycodnaviridae. Most of these sequences occurred in similar locations within the defined promoter regions. The sequence and location of the motifs were often conserved among homologous ORFs within the Chlorovirus family. One of these conserved sequences (AATGACA) is predominately associated with genes expressed early in virus replication. PMID:18768195
Purfield, Deirdre C.; McParland, Sinead; Wall, Eamon; Berry, Donagh P.
2017-01-01
Domestication and the subsequent selection of animals for either economic or morphological features can leave a variety of imprints on the genome of a population. Genomic regions subjected to high selective pressures often show reduced genetic diversity and frequent runs of homozygosity (ROH). Therefore, the objective of the present study was to use 42,182 autosomal SNPs to identify genomic regions in 3,191 sheep from six commercial breeds subjected to selection pressure and to quantify the genetic diversity within each breed using ROH. In addition, the historical effective population size of each breed was also estimated and, in conjunction with ROH, was used to elucidate the demographic history of the six breeds. ROH were common in the autosomes of animals in the present study, but the observed breed differences in patterns of ROH length and burden suggested differences in breed effective population size and recent management. ROH provided a sufficient predictor of the pedigree inbreeding coefficient, with an estimated correlation between both measures of 0.62. Genomic regions under putative selection were identified using two complementary algorithms; the fixation index and hapFLK. The identified regions under putative selection included candidate genes associated with skin pigmentation, body size and muscle formation; such characteristics are often sought after in modern-day breeding programs. These regions of selection frequently overlapped with high ROH regions both within and across breeds. Multiple yet uncharacterised genes also resided within putative regions of selection. This further substantiates the need for a more comprehensive annotation of the sheep genome as these uncharacterised genes may contribute to traits of interest in the animal sciences. Despite this, the regions identified as under putative selection in the current study provide an insight into the mechanisms leading to breed differentiation and genetic variation in meat production. PMID:28463982
Molin, William T; Wright, Alice A; Lawton-Rauh, Amy; Saski, Christopher A
2017-01-17
The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplification and insertion of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene across the genome. Increased EPSPS gene copy numbers results in higher titers of the EPSPS enzyme, the target of glyphosate, and confers resistance to glyphosate treatment. To understand the genomic unit and mechanism of EPSPS gene copy number proliferation, we developed and used a bacterial artificial chromosome (BAC) library from a highly resistant biotype to sequence the local genomic landscape flanking the EPSPS gene. By sequencing overlapping BACs, a 297 kb sequence was generated, hereafter referred to as the "EPSPS cassette." This region included several putative genes, dense clusters of tandem and inverted repeats, putative helitron and autonomous replication sequences, and regulatory elements. Whole genome shotgun sequencing (WGS) of two biotypes exhibiting high and no resistance to glyphosate was performed to compare genomic representation across the EPSPS cassette. Mapping of sequences for both biotypes to the reference EPSPS cassette revealed significant differences in upstream and downstream sequences relative to EPSPS with regard to both repetitive units and coding content between these biotypes. The differences in sequence may have resulted from a compounded-building mechanism such as repetitive transpositional events. The association of putative helitron sequences with the cassette suggests a possible amplification and distribution mechanism. Flow cytometry revealed that the EPSPS cassette added measurable genomic content. The adoption of glyphosate resistant cropping systems in major crops such as corn, soybean, cotton and canola coupled with excessive use of glyphosate herbicide has led to evolved glyphosate resistance in several important weeds. In Amaranthus palmeri, the amplification of the EPSPS cassette, characterized by a complex array of repetitive elements and putative helitron sequences, suggests an adaptive structural genomic mechanism that drives amplification and distribution around the genome. The added genomic content not found in glyphosate sensitive plants may be driving evolution through genome expansion.
Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma; Adhikary, Siba Prasad; Tripathy, Sucheta
2015-04-02
Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. Copyright © 2015 Das et al.
The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)
G.A. Tuskan; S. DiFazio; S. Jansson; J. Bohlmann; I. Grigoriev; U. Hellsten; N. Putnam; S. Ralph; S. Rombauts; A. Salamov; J. Schein; L. Sterck; A. Aerts; R.R. Bhalerao; R.P. Bhalerao; D. Blaudez; W. Boerjan; A. Brun; A. Brunner; V. Busov; M. Campbell; J. Carlson; M. Chalot; J. Chapman; G.-L. Chen; D. Cooper; P.M. Coutinho; J. Couturier; S. Covert; Q. Cronk; R. Cunningham; J. Davis; S. Degroeve; A. Dejardin; C. dePamphilis; J. Detter; B. Dirks; U. Dubchak; S. Duplessis; J. Ehlting; B. Ellis; K. Gendler; D. Goodstein; M. Gribskov; J. Grimwood; A. Groover; L. Gunter; B. Hamberger; B. Heinze; Y. Helariutta; B. Henrissat; D. Holligan; R. Holt; W. Huang; N. Islam-Faridi; S. Jones; M. Jones-Rhoades; R. Jorgensen; C. Joshi; J. Kangasjarvi; J. Karlsson; C. Kelleher; R. Kirkpatrick; M. Kirst; A. Kohler; U. Kalluri; F. Larimer; J. Leebens-Mack; J.-C. Leple; P. Locascio; Y. Lou; S. Lucas; F. Martin; B. Montanini; C. Napoli; D.R. Nelson; C. Nelson; K. Nieminen; O. Nilsson; V. Pereda; G. Peter; R. Philippe; G. Pilate; A. Poliakov; J. Razumovskaya; P. Richardson; C. Rinaldi; K. Ritland; P. Rouze; D. Ryaboy; J. Schumtz; J. Schrader; B. Segerman; H. Shin; A. Siddiqui; F. Sterky; A. Terry; C.-J. Tsai; E. Uberbacher; P. Unneberg; J. Vahala; K. Wall; S. Wessler; G. Yang; T. Yin; C. Douglas; M. Marra; G. Sandberg; Y. Van de Peer; D. Rokhsar
2006-01-01
We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs...
Wang, Yue; Xu, Tingting; He, Weiyi; Shen, Xiujing; Zhao, Qian; Bai, Jianlin; You, Minsheng
2018-01-01
Long non-coding RNAs (lncRNAs) are of particular interest because of their contributions to many biological processes. Here, we present the genome-wide identification and characterization of putative lncRNAs in a global insect pest, Plutella xylostella. A total of 8096 lncRNAs were identified and classified into three groups. The average length of exons in lncRNAs was longer than that in coding genes and the GC content was lower than that in mRNAs. Most lncRNAs were flanked by canonical splice sites, similar to mRNAs. Expression profiling identified 114 differentially expressed lncRNAs during the DBM development and found that majority were temporally specific. While the biological functions of lncRNAs remain uncharacterized, many are microRNA precursors or competing endogenous RNAs involved in micro-RNA regulatory pathways. This work provides a valuable resource for further studies on molecular bases for development of DBM and lay the foundation for discovery of lncRNA functions in P. xylostella. Copyright © 2017 Elsevier Inc. All rights reserved.
Stone, David M; Kerr, Rose C; Hughes, Margaret; Radford, Alan D; Darby, Alistair C
2013-11-01
The complete coding sequences were determined for four putative vesiculoviruses isolated from fish. Sequence alignment and phylogenetic analysis based on the predicted amino acid sequences of the five main proteins assigned tench rhabdovirus and grass carp rhabdovirus together with spring viraemia of carp and pike fry rhabdovirus to a lineage that was distinct from the mammalian vesiculoviruses. Perch rhabdovirus, eel virus European X, lake trout rhabdovirus 903/87 and sea trout virus were placed in a second lineage that was also distinct from the recognised genera in the family Rhabdoviridae. Establishment of two new rhabdovirus genera, "Perhabdovirus" and "Sprivivirus", is discussed.
Seligmann, Hervé
2013-05-07
GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Détrée, Camille; Núñez-Acuña, Gustavo; Tapia, Fabian; Gallardo-Escárate, Cristian
2017-06-01
Increasing evidence suggests that long non-coding RNAs (lncRNAs) play diverse roles in cellular processes, including in the regulation of embryogenesis and growth. However, little is known about the role of lncRNAs in marine invertebrates inhabiting changing environments. Therefore, the aim of this study was to present the first characterization of lncRNAs in an intertidal marine gastropod. Specifically, Tegula atra individuals were sampled in four sites of the central-northern Chilean coastline (28-31°) during summer and winter. A pipeline was constructed, and 3524 putative lncRNAs were identified from transcriptome databases specific to T. atra. These lncRNAs exhibited characteristics common to known lncRNAs, including a length shorter than coding sequences, low GC-content, and low sequence conservation. Expression analyses revealed that lncRNAs varied more in the summer. Furthermore, a majority of the differentially expressed lncRNAs were found in the southernmost population, the seasonal temperatures of which varied the greatest among all groups. Additionally, co-expression analysis found some lncRNAs strongly correlated with coding genes involved in the environmental stress response, such as heat shock proteins and metalloproteins. In contrast, other lncRNA expressions were strongly uncorrelated with genes involved in lipid/carbohydrates metabolism and cell-cell communication. This study provides the first large-scale characterization of lncRNAs in a marine gastropod, with results suggesting a putative role of lncRNAs in thermal tolerance, as well as an association with molecular mechanisms involved in the local adaptations of marine invertebrate populations. Copyright © 2017 Elsevier B.V. All rights reserved.
Fernández, Cecilia S; Bruque, Carlos D; Taboas, Melisa; Buzzalino, Noemí D; Espeche, Lucia D; Pasqualini, Titania; Charreau, Eduardo H; Alba, Liliana G; Ghiringhelli, Pablo D; Dain, Liliana
2015-09-01
The aim of the current study was to search for the presence of genetic variants in the CYP21A2 Z promoter regulatory region in patients with congenital adrenal hyperplasia due to 21-hydroxylase deficiency. Screening of the 10 most frequent pseudogene-derived mutations was followed by direct sequencing of the entire coding sequence, the proximal promoter, and a distal regulatory region in DNA samples from patients with at least one non-determined allele. We report three non-classical patients that presented a novel genetic variant-g.15626A>G-within the Z promoter regulatory region. In all the patients, the novel variant was found in cis with the mild, less frequent, p.P482S mutation located in the exon 10 of the CYP21A2 gene. The putative pathogenic implication of the novel variant was assessed by in silico analyses and in vitro assays. Topological analyses showed differences in the curvature and bendability of the DNA region bearing the novel variant. By performing functional studies, a significantly decreased activity of a reporter gene placed downstream from the regulatory region was found by the G transition. Our results may suggest that the activity of an allele bearing the p.P482S mutation may be influenced by the misregulated CYP21A2 transcriptional activity exerted by the Z promoter A>G variation.
Mehdizadeh Gohari, Iman; Kropinski, Andrew M.; Weese, Scott J.; Parreira, Valeria R.; Whitehead, Ashley E.; Boerlin, Patrick; Prescott, John F.
2016-01-01
The recent discovery of a novel beta-pore-forming toxin, NetF, which is strongly associated with canine and foal necrotizing enteritis should improve our understanding of the role of type A Clostridium perfringens associated disease in these animals. The current study presents the complete genome sequence of two netF-positive strains, JFP55 and JFP838, which were recovered from cases of foal necrotizing enteritis and canine hemorrhagic gastroenteritis, respectively. Genome sequencing was done using Single Molecule, Real-Time (SMRT) technology-PacBio and Illumina Hiseq2000. The JFP55 and JFP838 genomes include a single 3.34 Mb and 3.53 Mb chromosome, respectively, and both genomes include five circular plasmids. Plasmid annotation revealed that three plasmids were shared by the two newly sequenced genomes, including a NetF/NetE toxins-encoding tcp-conjugative plasmid, a CPE/CPB2 toxins-encoding tcp-conjugative plasmid and a putative bacteriocin-encoding plasmid. The putative beta-pore-forming toxin genes, netF, netE and netG, were located in unique pathogenicity loci on tcp-conjugative plasmids. The C. perfringens JFP55 chromosome carries 2,825 protein-coding genes whereas the chromosome of JFP838 contains 3,014 protein-encoding genes. Comparison of these two chromosomes with three available reference C. perfringens chromosome sequences identified 48 (~247 kb) and 81 (~430 kb) regions unique to JFP55 and JFP838, respectively. Some of these divergent genomic regions in both chromosomes are phage- and plasmid-related segments. Sixteen of these unique chromosomal regions (~69 kb) were shared between the two isolates. Five of these shared regions formed a mosaic of plasmid-integrated segments, suggesting that these elements were acquired early in a clonal lineage of netF-positive C. perfringens strains. These results provide significant insight into the basis of canine and foal necrotizing enteritis and are the first to demonstrate that netF resides on a large and unique plasmid-encoded locus. PMID:26859667
Influence of putative exopolysaccharide genes on Pseudomonas putida KT2440 biofilm stability.
Nilsson, Martin; Chiang, Wen-Chi; Fazli, Mustafa; Gjermansen, Morten; Givskov, Michael; Tolker-Nielsen, Tim
2011-05-01
We report a study of the role of putative exopolysaccharide gene clusters in the formation and stability of Pseudomonas putida KT2440 biofilm. Two novel putative exopolysaccharide gene clusters, pea and peb, were identified, and evidence is provided that they encode products that stabilize P. putida KT2440 biofilm. The gene clusters alg and bcs, which code for proteins mediating alginate and cellulose biosynthesis, were found to play minor roles in P. putida KT2440 biofilm formation and stability under the conditions tested. A P. putida KT2440 derivative devoid of any identifiable exopolysaccharide genes was found to form biofilm with a structure similar to wild-type biofilm, but with a stability lower than that of wild-type biofilm. Based on our data, we suggest that the formation of structured P. putida KT2440 biofilm can occur in the absence of exopolysaccharides; however, exopolysaccharides play a role as structural stabilizers. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
Zade, Amrutraj; Sengupta, Malavi; Kondabagil, Kiran
2015-01-01
Rab GTPases are the key regulators of intracellular membrane trafficking in eukaryotes. Many viruses and intracellular bacterial pathogens have evolved to hijack the host Rab GTPase functions, mainly through activators and effector proteins, for their benefit. Acanthamoeba polyphaga mimivirus (APMV) is one of the largest viruses and belongs to the monophyletic clade of nucleo-cytoplasmic large DNA viruses (NCLDV). The inner membrane lining is integral to the APMV virion structure. APMV assembly involves extensive host membrane modifications, like vesicle budding and fusion, leading to the formation of a membrane sheet that is incorporated into the virion. Intriguingly, APMV and all group I members of the Mimiviridae family code for a putative Rab GTPase protein. APMV is the first reported virus to code for a Rab GTPase (encoded by R214 gene). Our thorough in silico analysis of the subfamily specific (SF) region of Mimiviridae Rab GTPase sequences suggests that they are related to Rab5, a member of the group II Rab GTPases, of lower eukaryotes. Because of their high divergence from the existing three isoforms, A, B, and C of the Rab5-family, we suggest that Mimiviridae Rabs constitute a new isoform, Rab5D. Phylogenetic analysis indicated probable horizontal acquisition from a lower eukaryotic ancestor followed by selection and divergence. Furthermore, interaction network analysis suggests that vps34 (a Class III PI3K homolog, coded by APMV L615), Atg-8 and dynamin (host proteins) are recruited by APMV Rab GTPase during capsid assembly. Based on these observations, we hypothesize that APMV Rab plays a role in the acquisition of inner membrane during virion assembly.
Evidence for an ergot alkaloid gene cluster in Claviceps purpurea.
Tudzynski, P; Hölter, K; Correia, T; Arntz, C; Grammel, N; Keller, U
1999-02-01
A gene (cpd1) coding for the dimethylallyltryptophan synthase (DMATS) that catalyzes the first specific step in the biosynthesis of ergot alkaloids, was cloned from a strain of Claviceps purpurea that produces alkaloids in axenic culture. The derived gene product (CPD1) shows only 70% similarity to the corresponding gene previously isolated from Claviceps strain ATCC 26245, which is likely to be an isolate of C. fusiformis. Therefore, the related cpd1 most probably represents the first C. purpurea gene coding for an enzymatic step of the alkaloid biosynthetic pathway to be cloned. Analysis of the 3'-flanking region of cpd1 revealed a second, closely linked ergot alkaloid biosynthetic gene named cpps1, which codes for a 356-kDa polypeptide showing significant similarity to fungal modular peptide synthetases. The protein contains three amino acid-activating modules, and in the second module a sequence is found which matches that of an internal peptide (17 amino acids in length) obtained from a tryptic digest of lysergyl peptide synthetase 1 (LPS1) of C. purpurea, thus confirming that cpps1 encodes LPS1. LPS1 activates the three amino acids of the peptide portion of ergot peptide alkaloids during D-lysergyl peptide assembly. Chromosome walking revealed the presence of additional genes upstream of cpd1 which are probably also involved in ergot alkaloid biosynthesis: cpox1 probably codes for an FAD-dependent oxidoreductase (which could represent the chanoclavine cyclase), and a second putative oxidoreductase gene, cpox2, is closely linked to it in inverse orientation. RT-PCR experiments confirm that all four genes are expressed under conditions of peptide alkaloid biosynthesis. These results strongly suggest that at least some genes of ergot alkaloid biosynthesis in C. purpurea are clustered, opening the way for a detailed molecular genetic analysis of the pathway.
Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei
2014-09-10
Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.
Liu, Jie; Bu, Cuiping; Wipfler, Benjamin; Liang, Aiping
2014-01-01
The present study compares the mitochondrial genomes of five species of the spittlebug tribe Callitettixini (Hemiptera: Cercopoidea: Cercopidae) from eastern Asia. All genomes of the five species sequenced are circular double-stranded DNA molecules and range from 15,222 to 15,637 bp in length. They contain 22 tRNA genes, 13 protein coding genes (PCGs) and 2 rRNA genes and share the putative ancestral gene arrangement of insects. The PCGs show an extreme bias of nucleotide and amino acid composition. Significant differences of the substitution rates among the different genes as well as the different codon position of each PCG are revealed by the comparative evolutionary analyses. The substitution speeds of the first and second codon position of different PCGs are negatively correlated with their GC content. Among the five species, the AT-rich region features great differences in length and pattern and generally shows a 2–5 times higher substitution rate than the fastest PCG in the mitochondrial genome, atp8. Despite the significant variability in length, short conservative segments were identified in the AT-rich region within Callitettixini, although absent from the other groups of the spittlebug superfamily Cercopoidea. PMID:25285442
NASA Technical Reports Server (NTRS)
Larimer, James; Piantanida, Thomas
1990-01-01
The optics of the eye form an image on a surface at the back of the eyeball called the retina. The retina contains the photoreceptors that sample the image and convert it into a neural signal. The spacing of the photoreceptors in the retina is not uniform and varies with retinal locus. The central retinal field, called the macula, is densely packed with photoreceptors. The packing density falls off rapidly as a function of retinal eccentricity with respect to the macular region and there are regions in which there are no photoreceptors at all. The retinal regions without photoreceptors are called blind spots or scotomas. The neural transformations which convert retinal image signals into percepts fills in the gaps and regularizes the inhomogeneities of the retinal photoreceptor sampling mosaic. The filling-in mechamism plays an important role in understanding visual performance. The filling-in mechanism is not well understood. A systematic collaborative research program at the Ames Research Center and SRI in Menlo Park, California, was designed to explore this mechanism. It was shown that the perceived fields which are in fact different from the image on the retina due to filling-in, control some aspects of performance and not others. Researchers have linked these mechanisms to putative mechanisms of color coding and color constancy.
RNA Sequencing of the Exercise Transcriptome in Equine Athletes
Verini-Supplizi, Andrea; Barcaccia, Gianni; Albiero, Alessandro; D'Angelo, Michela; Campagna, Davide; Valle, Giorgio; Felicetti, Michela; Silvestrelli, Maurizio; Cappelli, Katia
2013-01-01
The horse is an optimal model organism for studying the genomic response to exercise-induced stress, due to its natural aptitude for athletic performance and the relative homogeneity of its genetic and environmental backgrounds. Here, we applied RNA-sequencing analysis through the use of SOLiD technology in an experimental framework centered on exercise-induced stress during endurance races in equine athletes. We monitored the transcriptional landscape by comparing gene expression levels between animals at rest and after competition. Overall, we observed a shift from coding to non-coding regions, suggesting that the stress response involves the differential expression of not annotated regions. Notably, we observed significant post-race increases of reads that correspond to repeats, especially the intergenic and intronic L1 and L2 transposable elements. We also observed increased expression of the antisense strands compared to the sense strands in intronic and regulatory regions (1 kb up- and downstream) of the genes, suggesting that antisense transcription could be one of the main mechanisms for transposon regulation in the horse under stress conditions. We identified a large number of transcripts corresponding to intergenic and intronic regions putatively associated with new transcriptional elements. Gene expression and pathway analysis allowed us to identify several biological processes and molecular functions that may be involved with exercise-induced stress. Ontology clustering reflected mechanisms that are already known to be stress activated (e.g., chemokine-type cytokines, Toll-like receptors, and kinases), as well as “nucleic acid binding” and “signal transduction activity” functions. There was also a general and transient decrease in the global rates of protein synthesis, which would be expected after strenuous global stress. In sum, our network analysis points toward the involvement of specific gene clusters in equine exercise-induced stress, including those involved in inflammation, cell signaling, and immune interactions. PMID:24391776
PuTmiR: A database for extracting neighboring transcription factors of human microRNAs
2010-01-01
Background Some of the recent investigations in systems biology have revealed the existence of a complex regulatory network between genes, microRNAs (miRNAs) and transcription factors (TFs). In this paper, we focus on TF to miRNA regulation and provide a novel interface for extracting the list of putative TFs for human miRNAs. A putative TF of an miRNA is considered here as those binding within the close genomic locality of that miRNA with respect to its starting or ending base pair on the chromosome. Recent studies suggest that these putative TFs are possible regulators of those miRNAs. Description The interface is built around two datasets that consist of the exhaustive lists of putative TFs binding respectively in the 10 kb upstream region (USR) and downstream region (DSR) of human miRNAs. A web server, named as PuTmiR, is designed. It provides an option for extracting the putative TFs for human miRNAs, as per the requirement of a user, based on genomic locality, i.e., any upstream or downstream region of interest less than 10 kb. The degree distributions of the number of putative TFs and miRNAs against each other for the 10 kb USR and DSR are analyzed from the data and they explore some interesting results. We also report about the finding of a significant regulatory activity of the YY1 protein over a set of oncomiRNAs related to the colon cancer. Conclusion The interface provided by the PuTmiR web server provides an important resource for analyzing the direct and indirect regulation of human miRNAs. While it is already an established fact that miRNAs are regulated by TFs binding to their USR, this database might possibly help to study whether an miRNA can also be regulated by the TFs binding to their DSR. PMID:20398296
Remarkable sequence conservation of the last intron in the PKD1 gene.
Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P
2003-10-01
The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.
Dinant, S; Lot, H; Albouy, J; Kuziak, C; Meyer, M; Astier-Manifacier, S
1991-01-01
DNA complementary to the 3' terminal 1651 nucleotides of the genome of the common strain of lettuce mosaic virus (LMV-O) has been cloned and sequenced. Microsequencing of the N-terminus enabled localization of the coat protein gene in this sequence. It showed also that the LMV coat protein coding region is at the 3' end of the genome, and that the coat protein is processed from a larger protein by cleavage at an unusual Q/V dipeptide between the polymerase and the coat protein. This is the first report of such a site for cleavage of a potyvirus polyprotein, where only Q/A, Q/S, and Q/G cleavage sites have been reported. The LMV coat protein gene encodes a 278 amino acid polypeptide with a calculated Mr of 31,171 and is flanked by a region which has a high degree of homology with the putative polymerase and a 3' untranslated region of 211 nucleotides in length. Percentage of homology with the coat protein of other potyviruses confirms that LMV is a distinct member of this group. Moreover, amino acid homologies noticed with the coat protein of potexvirus, bymovirus, and carlavirus elongated plant viruses suggest a functional significance for the conserved domains.
Shao, Yuan-jun; Hu, Xian-qiong; Peng, Guang-da; Wang, Rui-xian; Gao, Rui-na; Lin, Chao; Shen, Wei-de; Li, Rui; Li, Bing
2012-12-01
The first complete mitochondrial genome (mitogenome) of Tachinidae Exorista sorbillans (Diptera) is sequenced by PCR-based approach. The circular mitogenome is 14,960 bp long and has the representative mitochondrial gene (mt gene) organization and order of Diptera. All protein-coding sequences are initiated with ATN codon; however, the only exception is Cox I gene, which has a 4-bp ATCG putative start codon. Ten of the thirteen protein-coding genes have a complete termination codon (TAA), but the rest are seated on the H strand with incomplete codons. The mitogenome of E. sorbillans is biased toward A+T content at 78.4 %, and the strand-specific bias is in reflection of the third codon positions of mt genes, and their T/C ratios as strand indictor are higher on the H strand more than those on the L strand pointing at any strain of seven Diptera flies. The length of the A+T-rich region of E. sorbillans is 106 bp, including a tandem triple copies of a13-bp fragment. Compared to Haematobia irritans, E. sorbillans holds distant relationship with Drosophila. Phylogenetic topologies based on the amino acid sequences, supporting that E. sorbillans (Tachinidae) is clustered with strains of Calliphoridae and Oestridae, and superfamily Oestroidea are polyphyletic groups with Muscidae in a clade.
Liu, Yuan; Cui, Zhaoxia
2010-06-01
Given the commercial and ecological importance of the Asian paddle crab, Charybdis japonica, there is a clearly need for genetic and molecular research on this species. Here, we present the complete mitochondrial genome sequence of C. japonica, determined by the long-polymerase chain reaction and primer walking sequencing method. The entire genome is 15,738 bp in length, encoding a standard set of 13 protein-coding genes, two ribosomal RNA genes, and 22 transfer RNA genes, plus the putative control region, which is typical for metazoans. The total A+T content of the genome is 69.2%, lower than the other brachyuran crabs except for Callinectes sapidus. The gene order is identical to the published marine brachyurans and differs from the ancestral pancrustacean order by only the position of the tRNA ( His ) gene. Phylogenetic analyses using the concatenated nucleotide and amino acid sequences of 13 protein-coding genes strongly support the monophyly of Dendrobranchiata and Pleocyemata, which is consistent with the previous taxonomic classification. However, the systematic status of Charybdis within subfamily Thalamitinae of family Portunidae is not supported. C. japonica, as the first species of Charybdis with complete mitochondrial genome available, will provide important information on both genomics and molecular ecology of the group.
Guo, D; Maiss, E; Adam, G; Casper, R
1995-05-01
The RNA3 of prunus necrotic ringspot ilarvirus (PNRSV) has been cloned and its entire sequence determined. The RNA3 consists of 1943 nucleotides (nt) and possesses two large open reading frames (ORFs) separated by an intergenic region of 74 nt. The 5' proximal ORF is 855 nt in length and codes for a protein of molecular mass 31.4 kDa which has homologies with the putative movement protein of other members of the Bromoviridae. The 3' proximal ORF of 675 nt is the cistron for the coat protein (CP) and has a predicted molecular mass of 24.9 kDa. The sequence of the 3' non-coding region (NCR) of PNRSV RNA3 showed a high degree of similarity with those of tobacco streak virus (TSV), prune dwarf virus (PDV), apple mosaic virus (ApMV) and also alfalfa mosaic virus (AIMV). In addition it contained potential stem-loop structures with interspersed AUGC motifs characteristic for ilar- and alfamoviruses. This conserved primary and secondary structure in all 3' NCRs may be responsible for the interaction with homologous and heterologous CPs and subsequent activation of genome replication. The CP gene of an ApMV isolate (ApMV-G) of 657 nt has also been cloned and sequenced. Although ApMV and PNRSV have a distant serological relationship, the deduced amino acid sequences of their CPs have an identity of only 51.8%. The N termini of PNRSV and ApMV CPs have in common a zinc-finger motif and the potential to form an amphipathic helix.
Liu, Jianping; Hayashi, Kyoko; Matsuoka, Ken
2015-01-01
S-adenosylmethionine (SAM)-dependent methyltransferases (MTases) transfer methyl groups to substrates. In this study, a novel putative tobacco SAM-MTase termed Golgi-localized methyl transferase 1 (GLMT1) has been characterized. GLMT1 is comprised of 611 amino acids with short N-terminal region, putative transmembrane region, and C-terminal SAM-MTase domain. Expression of monomeric red fluorescence protein (mRFP)-tagged protein in tobacco BY-2 cell indicated that GLMT1 is a Golgi-localized protein. Analysis of the membrane topology by protease digestion suggested that both C-terminal catalytic region and N-terminal region seem to be located to the cytosolic side of the Golgi apparatus. Therefore, GLMT1 might have a different function than the previously studied SAM-MTases in plants.
Lijun Liu; Trevor Ramsay; Matthew S. Zinkgraf; David Sundell; Nathaniel Robert Street; Vladimir Filkov; Andrew Groover
2015-01-01
Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors...
Zhang, Hong-Li; Ye, Fei
2017-01-01
Praying mantises are a diverse group of predatory insects. Although some Mantodea mitogenomes have been reported, a comprehensive comparative and evolutionary genomic study is lacking for this group. In the present study, four new mitogenomes were sequenced, annotated, and compared to the previously published mitogenomes of other Mantodea species. Most Mantodea mitogenomes share a typical set of mitochondrial genes and a putative control region (CR). Additionally, and most intriguingly, another large non-coding region (LNC) was detected between trnM and ND2 in all six Paramantini mitogenomes examined. The main section in this common region of Paramantini may have initially originated from the corresponding control region for each species, whereas sequence differences between the LNCs and CRs and phylogenetic analyses indicate that LNC and CR are largely independently evolving. Namely, the LNC (the duplicated CR) may have subsequently degenerated during evolution. Furthermore, evidence suggests that special intergenic gaps have been introduced in some species through gene rearrangement and duplication. These gaps are actually the original abutting sequences of migrated or duplicated genes. Some gaps (G5 and G6) are homologous to the 5' and 3' surrounding regions of the duplicated gene in the original gene order, and another specific gap (G7) has tandem repeats. We analysed the phylogenetic relationships of fifteen Mantodea species using 37 concatenated mitochondrial genes and detected several synapomorphies unique to species in some clades. PMID:28367101
2011-01-01
Background Mounting evidence suggests a major role for epigenetic feedback in Plasmodium falciparum transcriptional regulation. Long non-coding RNAs (lncRNAs) have recently emerged as a new paradigm in epigenetic remodeling. We therefore set out to investigate putative roles for lncRNAs in P. falciparum transcriptional regulation. Results We used a high-resolution DNA tiling microarray to survey transcriptional activity across 22.6% of the P. falciparum strain 3D7 genome. We identified 872 protein-coding genes and 60 putative P. falciparum lncRNAs under developmental regulation during the parasite's pathogenic human blood stage. Further characterization of lncRNA candidates led to the discovery of an intriguing family of lncRNA telomere-associated repetitive element transcripts, termed lncRNA-TARE. We have quantified lncRNA-TARE expression at 15 distinct chromosome ends and mapped putative transcriptional start and termination sites of lncRNA-TARE loci. Remarkably, we observed coordinated and stage-specific expression of lncRNA-TARE on all chromosome ends tested, and two dominant transcripts of approximately 1.5 kb and 3.1 kb transcribed towards the telomere. Conclusions We have characterized a family of 22 telomere-associated lncRNAs in P. falciparum. Homologous lncRNA-TARE loci are coordinately expressed after parasite DNA replication, and are poised to play an important role in P. falciparum telomere maintenance, virulence gene regulation, and potentially other processes of parasite chromosome end biology. Further study of lncRNA-TARE and other promising lncRNA candidates may provide mechanistic insight into P. falciparum transcriptional regulation. PMID:21689454
Population Coding of Forelimb Joint Kinematics by Peripheral Afferents in Monkeys
Umeda, Tatsuya; Seki, Kazuhiko; Sato, Masa-aki; Nishimura, Yukio; Kawato, Mitsuo; Isa, Tadashi
2012-01-01
Various peripheral receptors provide information concerning position and movement to the central nervous system to achieve complex and dexterous movements of forelimbs in primates. The response properties of single afferent receptors to movements at a single joint have been examined in detail, but the population coding of peripheral afferents remains poorly defined. In this study, we obtained multichannel recordings from dorsal root ganglion (DRG) neurons in cervical segments of monkeys. We applied the sparse linear regression (SLiR) algorithm to the recordings, which selects useful input signals to reconstruct movement kinematics. Multichannel recordings of peripheral afferents were performed by inserting multi-electrode arrays into the DRGs of lower cervical segments in two anesthetized monkeys. A total of 112 and 92 units were responsive to the passive joint movements or the skin stimulation with a painting brush in Monkey 1 and Monkey 2, respectively. Using the SLiR algorithm, we reconstructed the temporal changes of joint angle, angular velocity, and acceleration at the elbow, wrist, and finger joints from temporal firing patterns of the DRG neurons. By automatically selecting a subset of recorded units, the SLiR achieved superior generalization performance compared with a regularized linear regression algorithm. The SLiR selected not only putative muscle units that were responsive to only the passive movements, but also a number of putative cutaneous units responsive to the skin stimulation. These results suggested that an ensemble of peripheral primary afferents that contains both putative muscle and cutaneous units encode forelimb joint kinematics of non-human primates. PMID:23112841
Methodology for fast detection of false sharing in threaded scientific codes
Chung, I-Hsin; Cong, Guojing; Murata, Hiroki; Negishi, Yasushi; Wen, Hui-Fang
2014-11-25
A profiling tool identifies a code region with a false sharing potential. A static analysis tool classifies variables and arrays in the identified code region. A mapping detection library correlates memory access instructions in the identified code region with variables and arrays in the identified code region while a processor is running the identified code region. The mapping detection library identifies one or more instructions at risk, in the identified code region, which are subject to an analysis by a false sharing detection library. A false sharing detection library performs a run-time analysis of the one or more instructions at risk while the processor is re-running the identified code region. The false sharing detection library determines, based on the performed run-time analysis, whether two different portions of the cache memory line are accessed by the generated binary code.
Lünse, Christina E.; Corbino, Keith A.; Ames, Tyler D.; Nelson, James W.; Roth, Adam; Perkins, Kevin R.; Sherlock, Madeline E.
2017-01-01
Abstract The discovery of structured non-coding RNAs (ncRNAs) in bacteria can reveal new facets of biology and biochemistry. Comparative genomics analyses executed by powerful computer algorithms have successfully been used to uncover many novel bacterial ncRNA classes in recent years. However, this general search strategy favors the discovery of more common ncRNA classes, whereas progressively rarer classes are correspondingly more difficult to identify. In the current study, we confront this problem by devising several methods to select subsets of intergenic regions that can concentrate these rare RNA classes, thereby increasing the probability that comparative sequence analysis approaches will reveal their existence. By implementing these methods, we discovered 224 novel ncRNA classes, which include ROOL RNA, an RNA class averaging 581 nt and present in multiple phyla, several highly conserved and widespread ncRNA classes with properties that suggest sophisticated biochemical functions and a multitude of putative cis-regulatory RNA classes involved in a variety of biological processes. We expect that further research on these newly found RNA classes will reveal additional aspects of novel biology, and allow for greater insights into the biochemistry performed by ncRNAs. PMID:28977401
Signatures of selection in tilapia revealed by whole genome resequencing
Hong Xia, Jun; Bai, Zhiyi; Meng, Zining; Zhang, Yong; Wang, Le; Liu, Feng; Jing, Wu; Yi Wan, Zi; Li, Jiale; Lin, Haoran; Hua Yue, Gen
2015-01-01
Natural selection and selective breeding for genetic improvement have left detectable signatures within the genome of a species. Identification of selection signatures is important in evolutionary biology and for detecting genes that facilitate to accelerate genetic improvement. However, selection signatures, including artificial selection and natural selection, have only been identified at the whole genome level in several genetically improved fish species. Tilapia is one of the most important genetically improved fish species in the world. Using next-generation sequencing, we sequenced the genomes of 47 tilapia individuals. We identified a total of 1.43 million high-quality SNPs and found that the LD block sizes ranged from 10–100 kb in tilapia. We detected over a hundred putative selective sweep regions in each line of tilapia. Most selection signatures were located in non-coding regions of the tilapia genome. The Wnt signaling, gonadotropin-releasing hormone receptor and integrin signaling pathways were under positive selection in all improved tilapia lines. Our study provides a genome-wide map of genetic variation and selection footprints in tilapia, which could be important for genetic studies and accelerating genetic improvement of tilapia. PMID:26373374
Sakai, Yoriko; Ogawa, Naoto; Shimomura, Yumi; Fujii, Takeshi
2014-03-01
Analysis of the complete nucleotide sequence of plasmid pM7012 from 2,4-dichlorophenoxyacetic-acid (2,4-D)-degrading bacterium Burkholderia sp. M701 revealed that the plasmid had 582 142 bp, with 541 putative protein-coding sequences and 39 putative tRNA genes for the transport of the standard 20 aa. pM7012 contains sequences homologous to the regions involved in conjugal transfer and plasmid maintenance found in plasmids byi_2p from Burkholderia sp. YI23 and pBVIE01 from Burkholderia sp. G4. No relaxase gene was found in any of these plasmids, although genes for a type IV secretion system and type IV coupling proteins were identified. Plasmids with no relaxase gene have been classified as non-mobile plasmids. However, nucleotide sequences with a high level of similarity to the genes for plasmid transfer, plasmid maintenance, 2,4-D degradation and arsenic resistance contained on pM7012 were also detected in eight other megaplasmids (~600 or 900 kb) found in seven Burkholderia strains and a strain of Cupriavidus, which were isolated as 2,4-D-degrading bacteria in Japan and the United States. These results suggested that the 2,4-D degradation megaplasmids related to pM7012 are mobile and distributed across various bacterial species worldwide, and that the plasmid group could be distinguished from known mobile plasmid groups.
Jin, Weiyue; Xu, Xian; Jiang, Ling; Zhang, Zhidong; Li, Shuang; Huang, He
2015-11-01
Putative genes crtE, crtB, and crtI from Deinococcus wulumiqiensis R12, a novel species, were identified by genome mining and were co-expressed using the optimized Shine-Dalgarno (SD) regions to improve lycopene yield. A lycopene biosynthesis pathway was constructed by co-expressing these three genes in Escherichia coli. After optimizing the upstream SD regions and the culture medium, the recombinant strain EDW11 produced 88 mg lycopene g(-1) dry cell wt (780 mg lycopene l(-1)) after 40 h fermentation without IPTG induction, while the strain EDW without optimized SD regions only produced 49 mg lycopene g(-1) dry cell wt (417 mg lycopene l(-1)). Based on the optimization of the upstream SD regions and culture medium, the yield of the strain EDW11 reached a high level during microbial lycopene production until now.
Ramamoorthy, Vellaisamy; Dhingra, Sourabh; Kincaid, Alexander; Shantappa, Sourabha; Feng, Xuehuan; Calvo, Ana M.
2013-01-01
Secondary metabolism in the model fungus Aspergillus nidulans is controlled by the conserved global regulator VeA, which also governs morphological differentiation. Among the secondary metabolites regulated by VeA is the mycotoxin sterigmatocystin (ST). The presence of VeA is necessary for the biosynthesis of this carcinogenic compound. We identified a revertant mutant able to synthesize ST intermediates in the absence of VeA. The point mutation occurred at the coding region of a gene encoding a novel putative C2H2 zinc finger domain transcription factor that we denominated mtfA. The A. nidulans mtfA gene product localizes at nuclei independently of the illumination regime. Deletion of the mtfA gene restores mycotoxin biosynthesis in the absence of veA, but drastically reduced mycotoxin production when mtfA gene expression was altered, by deletion or overexpression, in A. nidulans strains with a veA wild-type allele. Our study revealed that mtfA regulates ST production by affecting the expression of the specific ST gene cluster activator aflR. Importantly, mtfA is also a regulator of other secondary metabolism gene clusters, such as genes responsible for the synthesis of terrequinone and penicillin. As in the case of ST, deletion or overexpression of mtfA was also detrimental for the expression of terrequinone genes. Deletion of mtfA also decreased the expression of the genes in the penicillin gene cluster, reducing penicillin production. However, in this case, over-expression of mtfA enhanced the transcription of penicillin genes, increasing penicillin production more than 5 fold with respect to the control. Importantly, in addition to its effect on secondary metabolism, mtfA also affects asexual and sexual development in A. nidulans. Deletion of mtfA results in a reduction of conidiation and sexual stage. We found mtfA putative orthologs conserved in other fungal species. PMID:24066102
Graentzdoerffer, Andrea; Rauh, David; Pich, Andreas; Andreesen, Jan R
2003-01-01
Two gene clusters encoding similar formate dehydrogenases (FDH) were identified in Eubacterium acidaminophilum. Each cluster is composed of one gene coding for a catalytic subunit ( fdhA-I, fdhA-II) and one for an electron-transferring subunit ( fdhB-I, fdhB-II). Both fdhA genes contain a TGA codon for selenocysteine incorporation and the encoded proteins harbor five putative iron-sulfur clusters in their N-terminal region. Both FdhB subunits resemble the N-terminal region of FdhA on the amino acid level and contain five putative iron-sulfur clusters. Four genes thought to encode the subunits of an iron-only hydrogenase are located upstream of the FDH gene cluster I. By sequence comparison, HymA and HymB are predicted to contain one and four iron-sulfur clusters, respectively, the latter protein also binding sites for FMN and NAD(P). Thus, HymA and HymB seem to represent electron-transferring subunits, and HymC the putative catalytic subunit containing motifs for four iron-sulfur clusters and one H-cluster specific for Fe-only hydrogenases. HymD has six predicted transmembrane helices and might be an integral membrane protein. Viologen-dependent FDH activity was purified from serine-grown cells of E. acidaminophilum and the purified protein complex contained four subunits, FdhA and FdhB, encoded by FDH gene cluster II, and HymA and HymB, identified after determination of their N-terminal sequences. Thus, this complex might represent the most simple type of a formate hydrogen lyase. The purified formate dehydrogenase fraction contained iron, tungsten, a pterin cofactor, and zinc, but no molybdenum. FDH-II had a two-fold higher K(m) for formate (0.37 mM) than FDH-I and also catalyzed CO(2) reduction to formate. Reverse transcription (RT)-PCR pointed to increased expression of FDH-II in serine-grown cells, supporting the isolation of this FDH isoform. The fdhA-I gene was expressed as inactive protein in Escherichia coli. The in-frame UGA codon for selenocysteine incorporation was read in the heterologous system only as stop codon, although its potential SECIS element exhibited a quite high similarity to that of E. coli FDH.
Mohanty, Sujit Kumar; Yu, Chi-Li; Das, Shuvendu; Louie, Tai Man; Gakhar, Lokesh
2012-01-01
The molecular basis of the ability of bacteria to live on caffeine via the C-8 oxidation pathway is unknown. The first step of this pathway, caffeine to trimethyluric acid (TMU), has been attributed to poorly characterized caffeine oxidases and a novel quinone-dependent caffeine dehydrogenase. Here, we report the detailed characterization of the second enzyme, a novel NADH-dependent trimethyluric acid monooxygenase (TmuM), a flavoprotein that catalyzes the conversion of TMU to 1,3,7-trimethyl-5-hydroxyisourate (TM-HIU). This product spontaneously decomposes to racemic 3,6,8-trimethylallantoin (TMA). TmuM prefers trimethyluric acids and, to a lesser extent, dimethyluric acids as substrates, but it exhibits no activity on uric acid. Homology models of TmuM against uric acid oxidase HpxO (which catalyzes uric acid to 5-hydroxyisourate) reveal a much bigger and hydrophobic cavity to accommodate the larger substrates. Genes involved in the caffeine C-8 oxidation pathway are located in a 25.2-kb genomic DNA fragment of CBB1, including cdhABC (coding for caffeine dehydrogenase) and tmuM (coding for TmuM). Comparison of this gene cluster to the uric acid-metabolizing gene cluster and pathway of Klebsiella pneumoniae revealed two major open reading frames coding for the conversion of TM-HIU to S-(+)-trimethylallantoin [S-(+)-TMA]. The first one, designated tmuH, codes for a putative TM-HIU hydrolase, which catalyzes the conversion of TM-HIU to 3,6,8-trimethyl-2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (TM-OHCU). The second one, designated tmuD, codes for a putative TM-OHCU decarboxylase which catalyzes the conversion of TM-OHCU to S-(+)-TMA. Based on a combination of enzymology and gene-analysis, a new degradative pathway for caffeine has been proposed via TMU, TM-HIU, TM-OHCU to S-(+)-TMA. PMID:22609920
González, Leonardo Galindo; Deyholos, Michael K
2012-11-21
Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated.
2012-01-01
Background Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Results Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. Conclusions The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated. PMID:23171245
Antonini, S R; N'Diaye, N; Baldacchino, V; Hamet, P; Tremblay, J; Lacroix, A
2004-07-01
Gastric inhibitory polypeptide (GIP)-dependent Cushing's syndrome (CS) results from the ectopic expression of non-mutated GIP receptor (hGIPR) in the adrenal cortex. We evaluated whether mutations or polymorphisms in the regulatory region of the GIPR gene could lead to this aberrant expression. We studied 9.0kb upstream and 1.3kb downstream of the GIPR gene putative promoter (pProm) by sequencing leukocyte DNA from controls and from adrenal tissues of GIP- and non-GIP-dependent CS patients. The putative proximal promoter region (800 bp) and the first exon and intron of the hGIPR gene were sequenced on adrenal DNA from nine GIP-dependent CS, as well as on leukocyte DNA of nine normal controls. Three variations found in this region were found in all patients and controls; at position -4/-5, an insertion of a T was seen in four out of nine patients and in five out of nine controls. Transient transfection studies conducted in rat GC and mouse Y1 cells showed that the TT allele confers loss of 40% in the promoter activity. The analysis of the 8-kb distal pProm region revealed eight distal single nucleotide polymorphisms (SNPs) without probable association with the disease, since frequencies in patients and controls were very similar. In conclusion, mutations or SNPs in the regulatory region of the GIPR gene are unlikely to underlie GIP-dependent CS. Copyright 2004 Elsevier Ltd.
Mutational analysis in a patient with a variant form of Gaucher disease caused by SAP-2 deficiency
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rafi, M.A.; Gala, G. de; Xunling Zhang
1993-01-01
It is now clear that the lysosomal hydrolysis of sphingolipids requires both lysosomal enzymes and so-called sphingolipid activator proteins (SAPs). One gene, called prosaposin, codes for a precursor protein that is proteolytically cut into four putative SAPs. These four SAPs, of about 80 amino acids, share some structural features but differ somewhat in their specificity. Domain 3 of prosaposin mRNA contains the coding region for SAP-2, an activator of glucocerebrosidase. While most patients with Gaucher disease store glucosylceramide due to defects in glucocerebrosidase, a few patients store this lipid in the presence of normal enzyme levels. In this paper themore » authors describe the identification of a point mutation in domain 3 of a patient who died with this variant form of Gaucher disease. Polymerase chain reaction amplification was performed in the small amount of genomic DNA available using primers generated from the intronic sequence surrounding domain 3. The patient was found to have a T-to-G substitution at position 1144 (counting from the A of ATG initiation codon) in half of the M13 recombinant clones. This changes the codon for cysteine[sub 382] to glycine. His father and unaffected brother also had this mutation, but his mother did not. She was found to have half of the normal amount of mRNA for prosaposin in her cultured skin fibroblasts. Therefore, this child inherited a point mutation in domain 3 from his father and a deficiency of all four SAPs coded for by prosaposin from his mother. 29 refs., 3 figs., 1 tab.« less
Singh, Vineet K; Ring, Robert P; Aswani, Vijay; Stemper, Mary E; Kislow, Jennifer; Ye, Zhan; Shukla, Sanjay K
2017-12-01
Staphylococcus aureus is an opportunistic human pathogen that can cause serious infections in humans. A plethora of known and putative virulence factors are produced by staphylococci that collectively orchestrate pathogenesis. Ear protein (Escherichia coli ampicillin resistance) in S. aureus is an exoprotein in COL strain, predicted to be a superantigen, and speculated to play roles in antibiotic resistance and virulence. The goal of this study was to determine if expression of ear is modulated by single nucleotide polymorphisms in its promoter and coding sequences and whether this gene plays roles in antibiotic resistance and virulence. Promoter, coding sequences and expression of the ear gene in clinical and carriage S. aureus strains with distinct genetic backgrounds were analysed. The JE2 strain and its isogenic ear mutant were used in a systemic infection mouse model to determine the competiveness of the ear mutant.Results/Key findings. The ear gene showed a variable expression, with USA300FPR3757 showing a high-level expression compared to many of the other strains tested including some showing negligible expression. Higher expression was associated with agr type 1 but not correlated with phylogenetic relatedness of the ear gene based upon single nucleotide polymorphisms in the promoter or coding regions suggesting a complex regulation. An isogenic JE2 (USA300 background) ear mutant showed no significant difference in its growth, antibiotic susceptibility or virulence in a mouse model. Our data suggests that despite being highly expressed in a USA300 genetic background, Ear is not a significant contributor to virulence in that strain.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fernández-Sainz, I.J.; Largo, E.; Gladue, D.P.
E2, along with E{sup rns} and E1, is an envelope glycoprotein of Classical Swine Fever Virus (CSFV). E2 is involved in several virus functions: cell attachment, host range susceptibility and virulence in natural hosts. Here we evaluate the role of a specific E2 region, {sup 818}CPIGWTGVIEC{sup 828}, containing a putative fusion peptide (FP) sequence. Reverse genetics utilizing a full-length infectious clone of the highly virulent CSFV strain Brescia (BICv) was used to evaluate how individual amino acid substitutions within this region of E2 may affect replication of BICv. A synthetic peptide representing the complete E2 FP amino acid sequence adoptedmore » a β-type extended conformation in membrane mimetics, penetrated into model membranes, and perturbed lipid bilayer integrity in vitro. Similar peptides harboring amino acid substitutions adopted comparable conformations but exhibited different membrane activities. Therefore, a preliminary characterization of the putative FP {sup 818}CPIGWTGVIEC{sup 828} indicates a membrane fusion activity and a critical role in virus replication. - Highlights: • A putative fusion peptide (FP) region in CSFV E2 protein was shown to be critical for virus growth. • Synthetic FPs were shown to efficiently penetrate into lipid membranes using an in vitro model. • Individual residues in the FP affecting virus replication were identified by reverse genetics. • The same FP residues are also responsible for mediating membrane fusion.« less
Characterization of carotenoid hydroxylase gene promoter in Haematococcus pluvialis.
Meng, C X; Wei, W; Su, Z- L; Qin, S
2006-10-01
Astaxanthin, a high-value ketocarotenoid is mainly used in fish aquaculture. It also has potential in human health due to its higher antioxidant capacity than beta-carotene and vitamin E. The unicellular green alga Haematococcus pluvialis is known to accumulate astaxanthin in response to environmental stresses, such as high light intensity and salt stress. Carotenoid hydroxylase plays a key role in astaxanthin biosynthesis in H. pluvialis. In this paper, we report the characterization of a promoter-like region (-378 to -22 bp) of carotenoid hydroxylase gene by cloning, sequence analysis and functional verification of its 919 bp 5'-flanking region in H. pluvialis. The 5'-flanking region was characterized using micro-particle bombardment method and transient expression of LacZ reporter gene. Results of sequence analysis showed that the 5'-flanking region might have putative cis-acting elements, such as ABA (abscisic acid)-responsive element (ABRE), C-repeat/dehydration responsive element (C-repeat/DRE), ethylene-responsive element (ERE), heat-shock element (HSE), wound-responsive element (WUN-motif), gibberellin-responsive element (P-box), MYB-binding site (MBS) etc., except for typical TATA and CCAAT boxes. Results of 5' deletions construct and beta-galactosidase assays revealed that a highest promoter-like region might exist from -378 to -22 bp and some negative regulatory elements might lie in the region from -919 to -378 bp. Results of site-directed mutagenesis of a putative C-repeat/DRE and an ABRE-like motif in the promoter-like region (-378 to -22 bp) indicated that the putative C-repeat/DRE and ABRE-like motif might be important for expression of carotenoid hydroxylase gene.
Are plant formins integral membrane proteins?
Cvrcková, F
2000-01-01
The formin family of proteins has been implicated in signaling pathways of cellular morphogenesis in both animals and fungi; in the latter case, at least, they participate in communication between the actin cytoskeleton and the cell surface. Nevertheless, they appear to be cytoplasmic or nuclear proteins, and it is not clear whether they communicate with the plasma membrane, and if so, how. Because nothing is known about formin function in plants, I performed a systematic search for putative Arabidopsis thaliana formin homologs. I found eight putative formin-coding genes in the publicly available part of the Arabidopsis genome sequence and analyzed their predicted protein sequences. Surprisingly, some of them lack parts of the conserved formin-homology 2 (FH2) domain and the majority of them seem to have signal sequences and putative transmembrane segments that are not found in yeast or animals formins. Plant formins define a distinct subfamily. The presence in most Arabidopsis formins of sequence motifs typical or transmembrane proteins suggests a mechanism of membrane attachment that may be specific to plant formins, and indicates an unexpected evolutionary flexibility of the conserved formin domain.
Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism
Yin, Ling; An, Yunhe; Qu, Junjie; Li, Xinlong; Zhang, Yali; Dry, Ian; Wu, Huijuan; Lu, Jiang
2017-01-01
Plasmopara viticola causes downy mildew disease of grapevine which is one of the most devastating diseases of viticulture worldwide. Here we report a 101.3 Mb whole genome sequence of P. viticola isolate ‘JL-7-2’ obtained by a combination of Illumina and PacBio sequencing technologies. The P. viticola genome contains 17,014 putative protein-coding genes and has ~26% repetitive sequences. A total of 1,301 putative secreted proteins, including 100 putative RXLR effectors and 90 CRN effectors were identified in this genome. In the secretome, 261 potential pathogenicity genes and 95 carbohydrate-active enzymes were predicted. Transcriptional analysis revealed that most of the RXLR effectors, pathogenicity genes and carbohydrate-active enzymes were significantly up-regulated during infection. Comparative genomic analysis revealed that P. viticola evolved independently from the Arabidopsis downy mildew pathogen Hyaloperonospora arabidopsidis. The availability of the P. viticola genome provides a valuable resource not only for comparative genomic analysis and evolutionary studies among oomycetes, but also enhance our knowledge on the mechanism of interactions between this biotrophic pathogen and its host. PMID:28417959
Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins.
Hu, Pingzhao; Janga, Sarath Chandra; Babu, Mohan; Díaz-Mejía, J Javier; Butland, Gareth; Yang, Wenhong; Pogoutse, Oxana; Guo, Xinghua; Phanse, Sadhna; Wong, Peter; Chandran, Shamanta; Christopoulos, Constantine; Nazarians-Armavil, Anaies; Nasseri, Negin Karimi; Musso, Gabriel; Ali, Mehrab; Nazemof, Nazila; Eroukova, Veronika; Golshani, Ashkan; Paccanaro, Alberto; Greenblatt, Jack F; Moreno-Hagelsieb, Gabriel; Emili, Andrew
2009-04-28
One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans' biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a "systems-wide" functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins.
RhoA Regulation of Cardiomyocyte Differentiation
Kaarbø, Mari; Crane, Denis I.; Murrell, Wayne G.
2013-01-01
Earlier findings from our laboratory implicated RhoA in heart developmental processes. To investigate factors that potentially regulate RhoA expression, RhoA gene organisation and promoter activity were analysed. Comparative analysis indicated strict conservation of both gene organisation and coding sequence of the chick, mouse, and human RhoA genes. Bioinformatics analysis of the derived promoter region of mouse RhoA identified putative consensus sequence binding sites for several transcription factors involved in heart formation and organogenesis generally. Using luciferase reporter assays, RhoA promoter activity was shown to increase in mouse-derived P19CL6 cells that were induced to differentiate into cardiomyocytes. Overexpression of a dominant negative mutant of mouse RhoA (mRhoAN19) blocked this cardiomyocyte differentiation of P19CL6 cells and led to the accumulation of the cardiac transcription factors SRF and GATA4 and the early cardiac marker cardiac α-actin. Taken together, these findings indicate a fundamental role for RhoA in the differentiation of cardiomyocytes. PMID:23935420
Functional domains of the poliovirus receptor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Koike, Satoshi; Ise, Iku; Nomoto, Akio
1991-05-15
A number of mutant cDNAs of the human poliovirus receptor were constructed to identify essential regions of the molecule as the receptor. All mutant cDNAs carrying the sequence coding for the entire N-terminal immunoglobulin-like domain (domain I) confer permissiveness for poliovirus to mouse L cells, but a mutant cDNA lacking the sequence for domain I does not. The transformants permissive for poliovirus were able to bind the virus and were also recognized by monoclonal antibody D171, which competes with poliovirus for the cellular receptor. These results strongly suggest that the poliovirus binding site resides in domain I of the receptor.more » Mutant cDNAs for the sequence encoding the intracellular peptide were also constructed and expressed in mouse L cells. Susceptibility of these cells to poliovirus revealed that the entire putative cytoplasmic domain is not essential for virus infection. Thus, the cytoplasmic domain of the molecule appears not to play a role in the penetration of poliovirus.« less
Hermann, Andreas; Kitzler, Hagen H; Pollack, Tobias; Biskup, Saskia; Krüger, Stefanie; Funke, Claudia; Terrile, Caterina; Haack, Tobias B
2017-01-01
Static encephalopathy of childhood with neurodegeneration in adulthood is a phenotypically distinctive, X-linked dominant subtype of neurodegeneration with brain iron accumulation (NBIA). WDR45 mutations were recently identified as causal. WDR45 encodes a beta-propeller scaffold protein with a putative role in autophagy, and the disease has been renamed beta-propeller protein-associated neurodegeneration (BPAN). Here we describe a female patient suffering from a classical BPAN phenotype due to a novel heterozygous deletion of WDR45 . An initial gene panel and Sanger sequencing approach failed to uncover the molecular defect. Based on the typical clinical and neuroimaging phenotype, quantitative polymerase chain reaction of the WDR45 coding regions was undertaken, and this showed a reduction of the gene dosage by 50% compared with controls. An extended search for deletions should be performed in apparently WDR45- negative cases presenting with features of NBIA and should also be considered in young patients with predominant intellectual disabilities and hypertonia/parkinsonism/dystonia.
ERIC Educational Resources Information Center
Takeuchi, Hikaru; Taki, Yasuyuki; Sassa, Yuko; Hashizume, Hiroshi; Sekiguchi, Atsushi; Fukushima, Ai; Kawashima, Ryuta
2011-01-01
Working memory is the limited capacity storage system involved in the maintenance and manipulation of information over short periods of time. Previous imaging studies have suggested that the frontoparietal regions are activated during working memory tasks; a putative association between the structure of the frontoparietal regions and working…
Diallinas, G; Gorfinkiel, L; Arst, H N; Cecchetto, G; Scazzocchio, C
1995-04-14
In Aspergillus nidulans, loss-of-function mutations in the uapA and azgA genes, encoding the major uric acid-xanthine and hypoxanthine-adenine-guanine permeases, respectively, result in impaired utilization of these purines as sole nitrogen sources. The residual growth of the mutant strains is due to the activity of a broad specificity purine permease. We have identified uapC, the gene coding for this third permease through the isolation of both gain-of-function and loss-of-function mutations. Uptake studies with wild-type and mutant strains confirmed the genetic analysis and showed that the UapC protein contributes 30% and 8-10% to uric acid and hypoxanthine transport rates, respectively. The uapC gene was cloned, its expression studied, its sequence and transcript map established, and the sequence of its putative product analyzed. uapC message accumulation is: (i) weakly induced by 2-thiouric acid; (ii) repressed by ammonium; (iii) dependent on functional uaY and areA regulatory gene products (mediating uric acid induction and nitrogen metabolite repression, respectively); (iv) increased by uapC gain-of-function mutations which specifically, but partially, suppress a leucine to valine mutation in the zinc finger of the protein coded by the areA gene. The putative uapC gene product is a highly hydrophobic protein of 580 amino acids (M(r) = 61,251) including 12-14 putative transmembrane segments. The UapC protein is highly similar (58% identity) to the UapA permease and significantly similar (23-34% identity) to a number of bacterial transporters. Comparisons of the sequences and hydropathy profiles of members of this novel family of transporters yield insights into their structure, functionally important residues, and possible evolutionary relationships.
Ares, Miguel A; Rios-Sarabia, Nora; De la Cruz, Miguel A; Rivera-Gutiérrez, Sandra; García-Morales, Lázaro; León-Solís, Lizbel; Espitia, Clara; Pacheco, Sabino; Cerna-Cortés, Jorge F; Helguera-Repetto, Cecilia A; García, María Jesús; González-Y-Merchand, Jorge A
2017-07-01
This work examined the expression of the septum site determining gene (ssd) of Mycobacterium tuberculosis CDC1551 and its ∆sigD mutant under different growing conditions. The results showed an up-regulation of ssd during stationary phase and starvation conditions, but not during in vitro dormancy, suggesting a putative role for SigD in the control of ssd expression mainly under lack-of-nutrients environments. Furthermore, we elucidated a putative link between ssd expression and cell elongation of bacilli at stationary phase. In addition, a -35 sigD consensus sequence was found for the ssd promoter region, reinforcing the putative regulation of ssd by SigD, and in turn, supporting this protein role during the adaptation of M. tuberculosis to some stressful environments.
Dohm, J.M.; Ferris, J.C.; Baker, V.R.; Anderson, R.C.; Hare, T.M.; Strom, R.G.; Barlow, N.G.; Tanaka, K.L.; Klemaszewski, J.E.; Scott, D.H.
2001-01-01
Paleotopographic reconstructions based on a synthesis of published geologic information and high-resolution topography, including topographic profiles, reveal the potential existence of an enormous drainage basin/aquifer system in the eastern part of the Tharsis region during the Noachian Period. Large topographic highs formed the margin of the gigantic drainage basin. Subsequently, lavas, sediments, and volatiles partly infilled the basin, resulting in an enormous and productive regional aquifer. The stacked sequences of water-bearing strata were then deformed locally and, in places, exposed by magmatic-driven uplifts, tectonic deformation, and erosion. This basin model provides a potential source of water necessary to carve the large outflow channel systems of the Tharsis and surrounding regions and to contribute to the formation of putative northern-plains ocean(s) and/or paleolakes. Copyright 2001 by the American Geophysical Union.
Pelsy, F.; Merdinoglu, D.
2002-09-01
A chromosome-walking strategy was used to sequence and characterize retrotransposons in the grapevine genome. The reconstitution of a family of retroelements, named Tvv1, was achieved by six successive steps. These elements share a single, highly conserved open reading frame 4,153 nucleotides-long, putatively encoding the gag, pro, int, rt and rh proteins. Comparison of the Tvv1 open reading frame coding potential with those of drosophila copia and tobacco Tnt1, revealed that Tvv1 is closely related to Ty 1 copia-like retrotransposons. A highly variable untranslated leader region, upstream of the open reading frame, allowed us to differentiate Tvv1 variants, which represent a family of at least 28 copies, in varying sizes. This internal region is flanked by two long terminal repeats in direct orientation, sized between 149 and 157 bp. Among elements theoretically sized from 4,970 to 5,550 bp, we describe the full-length sequence of a reference element Tvv1-1, 5,343 nucleotides-long. The full-length sequence of Tvv1-1 compared to pea PDR1 shows a 53.3% identity. In addition, both elements contain long terminal repeats of nearly the same size in which the U5 region could be entirely absent. Therefore, we assume that Tvv1 and PDR1 could constitute a particular class of short LTRs retroelements.
Perera, N C N; Godahewa, G I; Lee, Jehee
2016-12-01
Mitogen-activated protein kinase (MAPK) is involved in the regulation of cellular events by mediating signal transduction pathways. MAPK1 is a member of the extracellular-signal regulated kinases (ERKs), playing roles in cell proliferation, differentiation, and development. This is mainly in response to growth factors, mitogens, and many environmental stresses. In the current study, we have characterized the structural features of a homolog of MAPK1 from disk abalone (AbMAPK1). Further, we have unraveled its expressional kinetics against different experimental pathogenic infections or related chemical stimulants. AbMAPK1 harbors a 5' untranslated region (UTR) of 23 bps, a coding sequence of 1104 bps, and a 3' UTR of 448 bp. The putative peptide comprises a predicted molecular mass of 42.2 kDa, with a theoretical pI of 6.28. Based on the in silico analysis, AbMAPK1 possesses two N-glycosylation sites, one S_TK catalytic domain, and a conserved His-Arg-Asp domain (HRD). In addition, a conservative glycine rich ATP-phosphate-binding loop and a threonine-x-tyrosine motif (TEY) important for the autophosphorylation were also identified in the protein. Homology assessment of AbMAPK1 showed several conserved regions, and ark clam (Aplysia californica) showed the highest sequence identity (87.9%). The phylogenetic analysis supported close evolutionary kinship with molluscan orthologs. Constitutive expression of AbMAPK1 was observed in six different tissues of disk abalone, with the highest expression in the digestive tract, followed by the gills and hemocytes. Highest AbMAPK1 mRNA expression level was detected at the trochophore developmental stage, suggesting its role in abalone cell differentiation and proliferation. Significant modulation of AbMAPK1 expression under pathogenic stress suggested its putative involvement in the immune defense mechanism. Copyright © 2016 Elsevier Ltd. All rights reserved.
Pramono, Ajeng K.; Kuwahara, Hirokazu; Itoh, Takehiko; Toyoda, Atsushi; Yamada, Akinori; Hongoh, Yuichi
2017-01-01
Termites depend nutritionally on their gut microbes, and protistan, bacterial, and archaeal gut communities have been extensively studied. However, limited information is available on viruses in the termite gut. We herein report the complete genome sequence (99,517 bp) of a phage obtained during a genome analysis of “Candidatus Azobacteroides pseudotrichonymphae” phylotype ProJPt-1, which is an obligate intracellular symbiont of the cellulolytic protist Pseudotrichonympha sp. in the gut of the termite Prorhinotermes japonicus. The genome of the phage, designated ProJPt-Bp1, was circular or circularly permuted, and was not integrated into the two circular chromosomes or five circular plasmids composing the host ProJPt-1 genome. The phage was putatively affiliated with the order Caudovirales based on sequence similarities with several phage-related genes; however, most of the 52 protein-coding sequences had no significant homology to sequences in the databases. The phage genome contained a tRNA-Gln (CAG) gene, which showed the highest sequence similarity to the tRNA-Gln (CAA) gene of the host “Ca. A. pseudotrichonymphae” phylotype ProJPt-1. Since the host genome lacked a tRNA-Gln (CAG) gene, the phage tRNA gene may compensate for differences in codon usage bias between the phage and host genomes. The phage genome also contained a non-coding region with high nucleotide sequence similarity to a region in one of the host plasmids. No other phage-related sequences were found in the host ProJPt-1 genome. To the best of our knowledge, this is the first report of a phage from an obligate, mutualistic endosymbiont permanently associated with eukaryotic cells. PMID:28321010
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.
Rehm, Charlotte; Wurmthaler, Lena A; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S
2015-01-01
In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.
Rehm, Charlotte; Wurmthaler, Lena A.; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S.
2015-01-01
In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1–5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6–9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria. PMID:26695179
The pine Pschi4 promoter directs wound-induced transcription
Haiguo Wu; Charles H. Michler; Liborio LaRussa; John M. Davis
1999-01-01
Mechanical wounding stimulates the accumulation of Pschi4 transcripts (encoding a putative extracellular chitinase) in pine trees. To gain insight into the transcriptional regulatory region(s) in this gymnosperm defense gene, the 5'-flanking region of Pschi4 was fused to the uidA reporter gene encoding -...
Carapelli, Antonio; Comandi, Sara; Convey, Peter; Nardi, Francesco; Frati, Francesco
2008-01-01
Background Mitogenomics data, i.e. complete mitochondrial genome sequences, are popular molecular markers used for phylogenetic, phylogeographic and ecological studies in different animal lineages. Their comparative analysis has been used to shed light on the evolutionary history of given taxa and on the molecular processes that regulate the evolution of the mitochondrial genome. A considerable literature is available in the fields of invertebrate biochemical and ecophysiological adaptation to extreme environmental conditions, exemplified by those of the Antarctic. Nevertheless, limited molecular data are available from terrestrial Antarctic species, and this study represents the first attempt towards the description of a mitochondrial genome from one of the most widespread and common collembolan species of Antarctica. Results In this study we describe the mitochondrial genome of the Antarctic collembolan Cryptopygus antarcticus Willem, 1901. The genome contains the standard set of 37 genes usually present in animal mtDNAs and a large non-coding fragment putatively corresponding to the region (A+T-rich) responsible for the control of replication and transcription. All genes are arranged in the gene order typical of Pancrustacea. Three additional short non-coding regions are present at gene junctions. Two of these are located in positions of abrupt shift of the coding polarity of genes oriented on opposite strands suggesting a role in the attenuation of the polycistronic mRNA transcription(s). In addition, remnants of an additional copy of trnL(uag) are present between trnS(uga) and nad1. Nucleotide composition is biased towards a high A% and T% (A+T = 70.9%), as typically found in hexapod mtDNAs. There is also a significant strand asymmetry, with the J-strand being more abundant in A and C. Within the A+T-rich region, some short sequence fragments appear to be similar (in position and primary sequence) to those involved in the origin of the N-strand replication of the Drosophila mtDNA. Conclusion The mitochondrial genome of C. antarcticus shares several features with other pancrustacean genomes, although the presence of unusual non-coding regions is also suggestive of molecular rearrangements that probably occurred before the differentiation of major collembolan families. Closer examination of gene boundaries also confirms previous observations on the presence of unusual start and stop codons, and suggests a role for tRNA secondary structures as potential cleavage signals involved in the maturation of the primary transcript. Sequences potentially involved in the regulation of replication/transcription are present both in the A+T-rich region and in other areas of the genome. Their position is similar to that observed in a limited number of insect species, suggesting unique replication/transcription mechanisms for basal and derived hexapod lineages. This initial description and characterization of the mitochondrial genome of C. antarcticus will constitute the essential foundation prerequisite for investigations of the evolutionary history of one of the most speciose collembolan genera present in Antarctica and other localities of the Southern Hemisphere. PMID:18593463
Iiyama, Kazuhiro; Otao, Masahiro; Mori, Kazuki; Mon, Hiroaki; Lee, Jae Man; Kusakabe, Takahiro; Tashiro, Kousuke; Asano, Shin-Ichiro; Yasunaga-Aoki, Chisa
2014-01-01
To determine the phylogenetic relationship among Paenibacillus species, putative replication origin regions were compared. In the rsmG-gyrA region, gene arrangements in Paenibacillus species were identical to those of Bacillus species, with the exception of an open reading frame (orf14) positioned between gyrB and gyrA, which was observed only in Paenibacillus species. The orf14 product was homologous to the endospore-associated proteins YheC and YheD of Bacillus subtilis. Phylogenetic analysis based on the YheCD proteins suggested that Orf14 could be categorized into the YheC group. In the Paenibacillus genome, DnaA box clusters were found in rpmH-dnaA and dnaA-dnaN intergenic regions, known as box regions C and R, respectively; this localization was similar to that observed in B. halodurans. A phylogenetic tree based on the nucleotide sequences of the whole replication origin regions suggested that P. popilliae, P. thiaminolyticus, and P. dendritiformis are closely related species.
Kawaguchi, Fuki; Kigoshi, Hiroto; Nakajima, Ayaka; Matsumoto, Yuta; Uemoto, Yoshinobu; Fukushima, Moriyuki; Yoshida, Emi; Iwamoto, Eiji; Akiyama, Takayuki; Kohama, Namiko; Kobayashi, Eiji; Honda, Takeshi; Oyama, Kenji; Mannen, Hideyuki; Sasazaki, Shinji
2018-05-17
Fatty acid composition is an important indicator of beef quality. The objective of this study was to search the potential candidate region for fatty acid composition. We performed pool-based genome-wide association studies (GWAS) for oleic acid percentage (C18:1) in a Japanese Black cattle population from the Hyogo prefecture. GWAS analysis revealed two novel candidate regions on BTA9 and BTA14. The most significant single nucleotide polymorphisms (SNPs) in each region were genotyped in a population (n = 899) to verify their effect on C18:1. Statistical analysis revealed that both SNPs were significantly associated with C18:1 (p = .0080 and .0003), validating the quantitative trait loci (QTLs) detected in GWAS. We subsequently selected VNN1 and LYPLA1 genes as candidate genes from each region on BTA9 and BTA14, respectively. We sequenced full-length coding sequence (CDS) of these genes in eight individuals and identified a nonsynonymous SNP T66M on VNN1 gene as a putative candidate polymorphism. The polymorphism was also significantly associated with C18:1, but the p value (p = .0162) was higher than the most significant SNP on BTA9, suggesting that it would not be responsible for the QTL. Although further investigation will be needed to determine the responsible gene and polymorphism, our findings would contribute to development of selective markers for fatty acid composition in the Japanese Black cattle of Hyogo. © 2018 Japanese Society of Animal Science.
Bröker, Daniel; Arenskötter, Matthias; Legatzki, Antje; Nies, Dietrich H.; Steinbüchel, Alexander
2004-01-01
The complete sequence of the circular 101,016-bp megaplasmid pKB1 from the cis-1,4-polyisoprene-degrading bacterium Gordonia westfalica Kb1, which represents the first described extrachromosomal DNA of a member of this genus, was determined. Plasmid pKB1 harbors 105 open reading frames. The predicted products of 46 of these are significantly related to proteins of known function. Plasmid pKB1 is organized into three functional regions that are flanked by insertion sequence (IS) elements: (i) a replication and putative partitioning region, (ii) a putative metabolic region, and (iii) a large putative conjugative transfer region, which is interrupted by an additional IS element. Southern hybridization experiments revealed the presence of another copy of this conjugational transfer region on the bacterial chromosome. The origin of replication (oriV) of pKB1 was identified and used for construction of Escherichia coli-Gordonia shuttle vectors, which was also suitable for several other Gordonia species and related genera. The metabolic region included the heavy-metal resistance gene cadA, encoding a P-type ATPase. Expression of cadA in E. coli mediated resistance to cadmium, but not to zinc, and decreased the cellular content of cadmium in this host. When G. westfalica strain Kb1 was cured of plasmid pKB1, the resulting derivative strains exhibited slightly decreased cadmium resistance. Furthermore, they had lost the ability to use isoprene rubber as a sole source of carbon and energy, suggesting that genes essential for rubber degradation are encoded by pKB1. PMID:14679241
Molecular and functional characterization of the promoter of ETS2, the human c-ets-2 gene.
Mavrothalassitis, G J; Watson, D K; Papas, T S
1990-01-01
The 5' end of the human c-ets-2 gene, ETS2, was cloned and characterized. The major transcription initiation start sites were identified, and the pertinent sequences surrounding the ETS2 promoter were determined. The promoter region of ETS2 does not possess typical "TATA" and "CAAT" elements. However, this promoter contains several repeat regions, as well as two consensus AP2 binding sites and three putative Sp1 sites. There is also a palindromic region similar to the serum response element of the c-fos gene, located 1400 base pairs (bp) upstream from the first major transcription initiation site. A G + C-rich sequence (GC element) with dyad symmetry can be seen in the ETS2 promoter, immediately following an unusually long (approximately 250-bp) polypurine-polypyrimidine tract. A series of deletion fragments from the putative promoter region were ligated in front of the bacterial chloramphenicol acetyltransferase gene and tested for activity following transfection into HeLa cells. The 5' boundary of the region needed for maximum promoter activity was found to be 159 bp upstream of the major initiation site. This region of 159 bp contains putative binding sites for transcription factors Sp1 and AP2 (one for each), the GC element, one small forward repeat, one inverted repeat, and half of the polypurine-pyrimidine tract. The promoter of ETS2 (within the polypyrimidine tract) serves to illustrate an alternative structure that may be present in genes with "TATA-less" promoters. Images PMID:2405393
Duret, Laurent; Cohen, Jean; Jubin, Claire; Dessen, Philippe; Goût, Jean-François; Mousset, Sylvain; Aury, Jean-Marc; Jaillon, Olivier; Noël, Benjamin; Arnaiz, Olivier; Bétermier, Mireille; Wincker, Patrick; Meyer, Eric; Sperling, Linda
2008-01-01
Ciliates are the only unicellular eukaryotes known to separate germinal and somatic functions. Diploid but silent micronuclei transmit the genetic information to the next sexual generation. Polyploid macronuclei express the genetic information from a streamlined version of the genome but are replaced at each sexual generation. The macronuclear genome of Paramecium tetraurelia was recently sequenced by a shotgun approach, providing access to the gene repertoire. The 72-Mb assembly represents a consensus sequence for the somatic DNA, which is produced after sexual events by reproducible rearrangements of the zygotic genome involving elimination of repeated sequences, precise excision of unique-copy internal eliminated sequences (IES), and amplification of the cellular genes to high copy number. We report use of the shotgun sequencing data (>106 reads representing 13× coverage of a completely homozygous clone) to evaluate variability in the somatic DNA produced by these developmental genome rearrangements. Although DNA amplification appears uniform, both of the DNA elimination processes produce sequence heterogeneity. The variability that arises from IES excision allowed identification of hundreds of putative new IESs, compared to 42 that were previously known, and revealed cases of erroneous excision of segments of coding sequences. We demonstrate that IESs in coding regions are under selective pressure to introduce premature termination of translation in case of excision failure. PMID:18256234
Genome sequencing of the sweetpotato whitefly Bemisia tabaci MED/Q.
Xie, Wen; Chen, Chunhai; Yang, Zezhong; Guo, Litao; Yang, Xin; Wang, Dan; Chen, Ming; Huang, Jinqun; Wen, Yanan; Zeng, Yang; Liu, Yating; Xia, Jixing; Tian, Lixia; Cui, Hongying; Wu, Qingjun; Wang, Shaoli; Xu, Baoyun; Li, Xianchun; Tan, Xinqiu; Ghanim, Murad; Qiu, Baoli; Pan, Huipeng; Chu, Dong; Delatte, Helene; Maruthi, M N; Ge, Feng; Zhou, Xueping; Wang, Xiaowei; Wan, Fanghao; Du, Yuzhou; Luo, Chen; Yan, Fengming; Preisser, Evan L; Jiao, Xiaoguo; Coates, Brad S; Zhao, Jinyang; Gao, Qiang; Xia, Jinquan; Yin, Ye; Liu, Yong; Brown, Judith K; Zhou, Xuguo Joe; Zhang, Youjun
2017-05-01
The sweetpotato whitefly Bemisia tabaci is a highly destructive agricultural and ornamental crop pest. It damages host plants through both phloem feeding and vectoring plant pathogens. Introductions of B. tabaci are difficult to quarantine and eradicate because of its high reproductive rates, broad host plant range, and insecticide resistance. A total of 791 Gb of raw DNA sequence from whole genome shotgun sequencing, and 13 BAC pooling libraries were generated by Illumina sequencing using different combinations of mate-pair and pair-end libraries. Assembly gave a final genome with a scaffold N50 of 437 kb, and a total length of 658 Mb. Annotation of repetitive elements and coding regions resulted in 265.0 Mb TEs (40.3%) and 20 786 protein-coding genes with putative gene family expansions, respectively. Phylogenetic analysis based on orthologs across 14 arthropod taxa suggested that MED/Q is clustered into a hemipteran clade containing A. pisum and is a sister lineage to a clade containing both R. prolixus and N. lugens. Genome completeness, as estimated using the CEGMA and Benchmarking Universal Single-Copy Orthologs pipelines, reached 96% and 79%. These MED/Q genomic resources lay a foundation for future 'pan-genomic' comparisons of invasive vs. noninvasive, invasive vs. invasive, and native vs. exotic Bemisia, which, in return, will open up new avenues of investigation into whitefly biology, evolution, and management. © The Author 2017. Published by Oxford University Press.
Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay
2013-09-23
Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.
Montgomery, H J; Romanov, V; Guillemette, J G
2000-02-18
Neuronal nitric-oxide synthase (NOS) and endothelial NOS are constitutive NOS isoforms that are activated by binding calmodulin in response to elevated intracellular calcium. In contrast, the inducible NOS isoform binds calmodulin at low basal levels of calcium in resting cells. Primary sequence comparisons show that each constitutive NOS isozyme contains a polypeptide segment within its reductase domain, which is absent in the inducible NOS enzyme. To study a possible link between the presence of these additional polypeptide segments in constitutive NOS enzymes and their calcium-dependent calmodulin activation, three deletion mutants were created. The putative inhibitory insert was removed from the FMN binding regions of the neuronal NOS holoenzyme and from two truncated neuronal NOS reductase enzymes in which the calmodulin binding region was either included or deleted. All three mutant enzymes showed reduced incorporation of FMN and required reconstitution with exogenous FMN for activity. The combined removal of both the calmodulin binding domain and the putative inhibitory insert did not result in a calmodulin-independent neuronal NOS reductase. Thus, although the putative inhibitory element has an effect on the calcium-dependent calmodulin activation of neuronal NOS, it does not have the properties of the typical autoinhibitory domain found in calmodulin-activated enzymes.
Audit, Benjamin; Zaghloul, Lamia; Vaillant, Cédric; Chevereau, Guillaume; d'Aubenton-Carafa, Yves; Thermes, Claude; Arneodo, Alain
2009-01-01
For years, progress in elucidating the mechanisms underlying replication initiation and its coupling to transcriptional activities and to local chromatin structure has been hampered by the small number (approximately 30) of well-established origins in the human genome and more generally in mammalian genomes. Recent in silico studies of compositional strand asymmetries revealed a high level of organization of human genes around 1000 putative replication origins. Here, by comparing with recently experimentally identified replication origins, we provide further support that these putative origins are active in vivo. We show that regions ∼300-kb wide surrounding most of these putative replication origins that replicate early in the S phase are hypersensitive to DNase I cleavage, hypomethylated and present a significant enrichment in genomic energy barriers that impair nucleosome formation (nucleosome-free regions). This suggests that these putative replication origins are specified by an open chromatin structure favored by the DNA sequence. We discuss how this distinctive attribute makes these origins, further qualified as ‘master’ replication origins, priviledged loci for future research to decipher the human spatio-temporal replication program. Finally, we argue that these ‘master’ origins are likely to play a key role in genome dynamics during evolution and in pathological situations. PMID:19671527
Kilpert, Fabian; Podsiadlowski, Lars
2006-01-01
Background Sequence data and other characters from mitochondrial genomes (gene translocations, secondary structure of RNA molecules) are useful in phylogenetic studies among metazoan animals from population to phylum level. Moreover, the comparison of complete mitochondrial sequences gives valuable information about the evolution of small genomes, e.g. about different mechanisms of gene translocation, gene duplication and gene loss, or concerning nucleotide frequency biases. The Peracarida (gammarids, isopods, etc.) comprise about 21,000 species of crustaceans, living in many environments from deep sea floor to arid terrestrial habitats. Ligia oceanica is a terrestrial isopod living at rocky seashores of the european North Sea and Atlantic coastlines. Results The study reveals the first complete mitochondrial DNA sequence from a peracarid crustacean. The mitochondrial genome of Ligia oceanica is a circular double-stranded DNA molecule, with a size of 15,289 bp. It shows several changes in mitochondrial gene order compared to other crustacean species. An overview about mitochondrial gene order of all crustacean taxa yet sequenced is also presented. The largest non-coding part (the putative mitochondrial control region) of the mitochondrial genome of Ligia oceanica is unexpectedly not AT-rich compared to the remainder of the genome. It bears two repeat regions (4× 10 bp and 3× 64 bp), and a GC-rich hairpin-like secondary structure. Some of the transfer RNAs show secondary structures which derive from the usual cloverleaf pattern. While some tRNA genes are putative targets for RNA editing, trnR could not be localized at all. Conclusion Gene order is not conserved among Peracarida, not even among isopods. The two isopod species Ligia oceanica and Idotea baltica show a similarly derived gene order, compared to the arthropod ground pattern and to the amphipod Parhyale hawaiiensis, suggesting that most of the translocation events were already present the last common ancestor of these isopods. Beyond that, the positions of three tRNA genes differ in the two isopod species. Strand bias in nucleotide frequency is reversed in both isopod species compared to other Malacostraca. This is probably due to a reversal of the replication origin, which is further supported by the fact that the hairpin structure typically found in the control region shows a reversed orientation in the isopod species, compared to other crustaceans. PMID:16987408
Ataya, Farid S.; Fouad, Dalia; Al-Olayan, Ebtsam; Malik, Ajamaluddin
2012-01-01
Superoxide dismutase (SOD) is the first line of defense against oxidative stress induced by endogenous and/or exogenous factors and thus helps in maintaining the cellular integrity. Its activity is related to many diseases; so, it is of importance to study the structure and expression of SOD gene in an animal naturally exposed most of its life to the direct sunlight as a cause of oxidative stress. Arabian camel (one humped camel, Camelus dromedarius) is adapted to the widely varying desert climatic conditions that extremely changes during daily life in the Arabian Gulf. Studying the cSOD1 in C. dromedarius could help understand the impact of exposure to direct sunlight and desert life on the health status of such mammal. The full coding region of a putative CuZnSOD gene of C. dromedarius (cSOD1) was amplified by reverse transcription PCR and cloned for the first time (gene bank accession number for nucleotides and amino acids are JF758876 and AEF32527, respectively). The cDNA sequencing revealed an open reading frame of 459 nucleotides encoding a protein of 153 amino acids which is equal to the coding region of SOD1 gene and protein from many organisms. The calculated molecular weight and isoelectric point of cSOD1 was 15.7 kDa and 6.2, respectively. The level of expression of cSOD1 in different camel tissues (liver, kidney, spleen, lung and testis) was examined using Real Time-PCR. The highest level of cSOD1 transcript was found in the camel liver (represented as 100%) followed by testis (45%), kidney (13%), lung (11%) and spleen (10%), using 18S ribosomal subunit as endogenous control. The deduced amino acid sequence exhibited high similarity with Cebus apella (90%), Sus scrofa (88%), Cavia porcellus (88%), Mus musculus (88%), Macaca mulatta (87%), Pan troglodytes (87%), Homo sapiens (87%), Canis familiaris (86%), Bos taurus (86%), Pongo abelii (85%) and Equus caballus (82%). Phylogenetic analysis revealed that cSOD1 is grouped together with S. scrofa. The predicted 3D structure of cSOD1 showed high similarity with the human and bovine CuZnSOD homologues. The Root-mean-square deviation (rmsd) between cSOD1/hSOD1 and cSOD1/bSOD1 superimposed structure pairs were 0.557 and 0.425 A. The Q-score of cSOD1-hSOD1 and cSOD1-bSOD1 were 0.948 and 0.961, respectively. PMID:22312292
Genome-wide identification and characterization of the SBP-box gene family in Petunia.
Zhou, Qin; Zhang, Sisi; Chen, Feng; Liu, Baojun; Wu, Lan; Li, Fei; Zhang, Jiaqi; Bao, Manzhu; Liu, Guofeng
2018-03-12
SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box genes encode a family of plant-specific transcription factors (TFs) that play important roles in many growth and development processes including phase transition, leaf initiation, shoot and inflorescence branching, fruit development and ripening etc. The SBP-box gene family has been identified and characterized in many species, but has not been well studied in Petunia, an important ornamental genus. We identified 21 putative SPL genes of Petunia axillaris and P. inflata from the reference genome of P. axillaris N and P. inflata S6, respectively, which were supported by the transcriptome data. For further confirmation, all the 21 genes were also cloned from P. hybrida line W115 (Mitchel diploid). Phylogenetic analysis based on the highly conserved SBP domains arranged PhSPLs in eight groups, analogous to those from Arabidopsis and tomato. Furthermore, the Petunia SPL genes had similar exon-intron structure and the deduced proteins contained very similar conserved motifs within the same subgroup. Out of 21 PhSPL genes, fourteen were predicted to be potential targets of PhmiR156/157, and the putative miR156/157 response elements (MREs) were located in the coding region of group IV, V, VII and VIII genes, but in the 3'-UTR regions of group VI genes. SPL genes were also identified from another two wild Petunia species, P. integrifolia and P. exserta, based on their transcriptome databases to investigate the origin of PhSPLs. Phylogenetic analysis and multiple alignments of the coding sequences of PhSPLs and their orthologs from wild species indicated that PhSPLs were originated mainly from P. axillaris. qRT-PCR analysis demonstrated differential spatiotemperal expression patterns of PhSPL genes in petunia and many were expressed predominantly in the axillary buds and/or inflorescences. In addition, overexpression of PhSPL9a and PhSPL9b in Arabidopsis suggested that these genes play a conserved role in promoting the vegetative-to-reproductive phase transition. Petunia genome contains at least 21 SPL genes, and most of the genes are expressed in different tissues. The PhSPL genes may play conserved and diverse roles in plant growth and development, including flowering regulation, leaf initiation, axillary bud and inflorescence development. This work provides a comprehensive understanding of the SBP-box gene family in Petunia and lays a significant foundation for future studies on the function and evolution of SPL genes in petunia.
2011-01-01
Background Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae. Results The chromosome of strain 2336 (2,263,857 bp) contained 1,980 protein coding genes, whereas the chromosome of strain 129Pt (2,007,700 bp) contained only 1,792 protein coding genes. Although the chromosomes of the two strains differ in size, their average GC content, gene density (total number of genes predicted on the chromosome), and percentage of sequence (number of genes) that encodes proteins were similar. The chromosomes of these strains also contained a number of discrete prophage regions and genomic islands. One of the genomic islands in strain 2336 contained genes putatively involved in copper, zinc, and tetracycline resistance. Using the genome sequence data and comparative analyses with other members of the Pasteurellaceae, several H. somni genes that may encode proteins involved in virulence (e.g., filamentous haemaggutinins, adhesins, and polysaccharide biosynthesis/modification enzymes) were identified. The two strains contained a total of 17 ORFs that encode putative glycosyltransferases and some of these ORFs had characteristic simple sequence repeats within them. Most of the genes/loci common to both the strains were located in different regions of the two chromosomes and occurred in opposite orientations, indicating genome rearrangement since their divergence from a common ancestor. Conclusions Since the genome of strain 129Pt was ~256,000 bp smaller than that of strain 2336, these genomes provide yet another paradigm for studying evolutionary gene loss and/or gain in regard to virulence repertoire and pathogenic ability. Analyses of the complete genome sequences revealed that bacteriophage- and transposon-mediated horizontal gene transfer had occurred at several loci in the chromosomes of strains 2336 and 129Pt. It appears that these mobile genetic elements have played a major role in creating genomic diversity and phenotypic variability among the two H. somni strains. PMID:22111657
Siddaramappa, Shivakumara; Challacombe, Jean F; Duncan, Alison J; Gillaspy, Allison F; Carson, Matthew; Gipson, Jenny; Orvis, Joshua; Zaitshik, Jeremy; Barnes, Gentry; Bruce, David; Chertkov, Olga; Detter, J Chris; Han, Cliff S; Tapia, Roxanne; Thompson, Linda S; Dyer, David W; Inzana, Thomas J
2011-11-23
Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae. The chromosome of strain 2336 (2,263,857 bp) contained 1,980 protein coding genes, whereas the chromosome of strain 129Pt (2,007,700 bp) contained only 1,792 protein coding genes. Although the chromosomes of the two strains differ in size, their average GC content, gene density (total number of genes predicted on the chromosome), and percentage of sequence (number of genes) that encodes proteins were similar. The chromosomes of these strains also contained a number of discrete prophage regions and genomic islands. One of the genomic islands in strain 2336 contained genes putatively involved in copper, zinc, and tetracycline resistance. Using the genome sequence data and comparative analyses with other members of the Pasteurellaceae, several H. somni genes that may encode proteins involved in virulence (e.g., filamentous haemaggutinins, adhesins, and polysaccharide biosynthesis/modification enzymes) were identified. The two strains contained a total of 17 ORFs that encode putative glycosyltransferases and some of these ORFs had characteristic simple sequence repeats within them. Most of the genes/loci common to both the strains were located in different regions of the two chromosomes and occurred in opposite orientations, indicating genome rearrangement since their divergence from a common ancestor. Since the genome of strain 129Pt was ~256,000 bp smaller than that of strain 2336, these genomes provide yet another paradigm for studying evolutionary gene loss and/or gain in regard to virulence repertoire and pathogenic ability. Analyses of the complete genome sequences revealed that bacteriophage- and transposon-mediated horizontal gene transfer had occurred at several loci in the chromosomes of strains 2336 and 129Pt. It appears that these mobile genetic elements have played a major role in creating genomic diversity and phenotypic variability among the two H. somni strains.
Mikhailov, Alexander T; Torrado, Mario
2018-05-12
There is growing evidence that putative gene regulatory networks including cardio-enriched transcription factors, such as PITX2, TBX5, ZFHX3, and SHOX2, and their effector/target genes along with downstream non-coding RNAs can play a potentially important role in the process of adaptive and maladaptive atrial rhythm remodeling. In turn, expression of atrial fibrillation-associated transcription factors is under the control of upstream regulatory non-coding RNAs. This review broadly explores gene regulatory mechanisms associated with susceptibility to atrial fibrillation-with key examples from both animal models and patients-within the context of both cardiac transcription factors and non-coding RNAs. These two systems appear to have multiple levels of cross-regulation and act coordinately to achieve effective control of atrial rhythm effector gene expression. Perturbations of a dynamic expression balance between transcription factors and corresponding non-coding RNAs can provoke the development or promote the progression of atrial fibrillation. We also outline deficiencies in current models and discuss ongoing studies to clarify remaining mechanistic questions. An understanding of the function of transcription factors and non-coding RNAs in gene regulatory networks associated with atrial fibrillation risk will enable the development of innovative therapeutic strategies.
Bowers, Jeffrey S
2009-01-01
A fundamental claim associated with parallel distributed processing (PDP) theories of cognition is that knowledge is coded in a distributed manner in mind and brain. This approach rejects the claim that knowledge is coded in a localist fashion, with words, objects, and simple concepts (e.g. "dog"), that is, coded with their own dedicated representations. One of the putative advantages of this approach is that the theories are biologically plausible. Indeed, advocates of the PDP approach often highlight the close parallels between distributed representations learned in connectionist models and neural coding in brain and often dismiss localist (grandmother cell) theories as biologically implausible. The author reviews a range a data that strongly challenge this claim and shows that localist models provide a better account of single-cell recording studies. The author also contrast local and alternative distributed coding schemes (sparse and coarse coding) and argues that common rejection of grandmother cell theories in neuroscience is due to a misunderstanding about how localist models behave. The author concludes that the localist representations embedded in theories of perception and cognition are consistent with neuroscience; biology only calls into question the distributed representations often learned in PDP models.
SNP discovery by high-throughput sequencing in soybean
2010-01-01
Background With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD), have been widely applied to identify genome-wide sequence variations. However, it is still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology. Results A total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty four SNP markers resulted from the validation of putative SNPs and have been selectively chosen to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 K bp. Conclusions We have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in a same genetic population, which can be applied to other crops. PMID:20701770
Tasaki, E; Hirayama, J; Tazumi, A; Hayashi, K; Hara, Y; Ueno, H; Moore, J E; Millar, B C; Matsuda, M
2012-02-01
Novel clustered regularly-interspaced short palindromic repeats (CRISPRs) locus [7,500 base pairs (bp) in length] occurred in the urease-positive thermophilic Campylobacter (UPTC) Japanese isolate, CF89-12. The 7,500 bp gene loci consisted of the 5'-methylaminomethyl-2-thiouridylate methyltransferase gene, putative (P) CRISPR associated (p-Cas), putative open reading frames, Cas1 and Cas2, leader sequence region (146 bp), 12 CRISPRs consensus sequence repeats (each 36 bp) separated by a non-repetitive unique spacer region of similar length (26-31 bp) and the phosphatidyl glycerophosphatase A gene. When the CRISPRs loci in the UPTC CF89-12 and five C. jejuni isolates were compared with one another, these six isolates contained p-Cas, Cas1 and Cas2 within the loci. Four to 12 CRISPRs consensus sequence repeats separated by a non-repetitive unique spacer region occurred in six isolates and the nucleotide sequences of those repeats gave approximately 92-100% similarity with each other. However, no sequence similarity occurred in the unique spacer regions among these isolates. The putative σ(70) transcriptional promoter and the hypothetical ρ-independent terminator structures for the CRISPRs and Cas were detected. No in vivo transcription of p-Cas, Cas1 and Cas2 was confirmed in the UPTC cells.
Quantized phase coding and connected region labeling for absolute phase retrieval.
Chen, Xiangcheng; Wang, Yuwei; Wang, Yajun; Ma, Mengchao; Zeng, Chunnian
2016-12-12
This paper proposes an absolute phase retrieval method for complex object measurement based on quantized phase-coding and connected region labeling. A specific code sequence is embedded into quantized phase of three coded fringes. Connected regions of different codes are labeled and assigned with 3-digit-codes combining the current period and its neighbors. Wrapped phase, more than 36 periods, can be restored with reference to the code sequence. Experimental results verify the capability of the proposed method to measure multiple isolated objects.
Wüthrich, Daniel; Bruggmann, Rémy; Berthoud, Hélène; Arias-Roth, Emmanuelle
2015-01-01
Clostridium tyrobutyricum is the main microorganism responsible for late blowing defect in cheeses. Here, we present the draft genome sequences of two C. tyrobutyricum strains isolated from a Swiss semihard red-smear cheese. The two draft genomes comprise 3.05 and 3.08 Mbp and contain 3,030 and 3,089 putative coding sequences, respectively. PMID:25767226
Rogel, Marco A.; Zúñiga-Dávila, Doris; Martínez-Romero, Esperanza
2018-01-01
ABSTRACT The complete genome sequence of Bradyrhizobium icense LMTR 13T, a root nodule bacterium isolated from the legume Phaseolus lunatus, is reported here. The genome consists of a circular 8,322,773-bp chromosome which codes for a large and novel symbiotic island as well as genes putatively involved in soil and root colonization. PMID:29519840
A Deeper Examination of Thorellius atrox Scorpion Venom Components with Omic Techonologies.
Romero-Gutierrez, Teresa; Peguero-Sanchez, Esteban; Cevallos, Miguel A; Batista, Cesar V F; Ortiz, Ernesto; Possani, Lourival D
2017-12-12
This communication reports a further examination of venom gland transcripts and venom composition of the Mexican scorpion Thorellius atrox using RNA-seq and tandem mass spectrometry. The RNA-seq, which was performed with the Illumina protocol, yielded more than 20,000 assembled transcripts. Following a database search and annotation strategy, 160 transcripts were identified, potentially coding for venom components. A novel sequence was identified that potentially codes for a peptide with similarity to spider ω-agatoxins, which act on voltage-gated calcium channels, not known before to exist in scorpion venoms. Analogous transcripts were found in other scorpion species. They could represent members of a new scorpion toxin family, here named omegascorpins. The mass fingerprint by LC-MS identified 135 individual venom components, five of which matched with the theoretical masses of putative peptides translated from the transcriptome. The LC-MS/MS de novo sequencing allowed to reconstruct and identify 42 proteins encoded by assembled transcripts, thus validating the transcriptome analysis. Earlier studies conducted with this scorpion venom permitted the identification of only twenty putative venom components. The present work performed with more powerful and modern omic technologies demonstrates the capacity of accomplishing a deeper characterization of scorpion venom components and the identification of novel molecules with potential applications in biomedicine and the study of ion channel physiology.
Protein and gene structure of a blue laccase from Pleurotus ostreatus1.
Giardina, P; Palmieri, G; Scaloni, A; Fontanella, B; Faraco, V; Cennamo, G; Sannia, G
1999-01-01
A new laccase isoenzyme (POXA1b, where POX is phenol oxidase), produced by Pleurotus ostreatus in cultures supplemented with copper sulphate, has been purified and fully characterized. The main characteristics of this protein (molecular mass in native and denaturing conditions, pI and catalytic properties) are almost identical to the previously studied laccase POXA1w. However, POXA1b contains four copper atoms per molecule instead of one copper, two zinc and one iron atom per molecule of POXA1w. Furthermore, POXA1b shows an unusually high stability at alkaline pH. The gene and cDNA coding for POXA1b have been cloned and sequenced. The gene coding sequence contains 1599 bp, interrupted by 15 introns. Comparison of the structure of the poxa1b gene with the two previously studied P. ostreatus laccase genes (pox1 and poxc) suggests that these genes belong to two different subfamilies. The amino acid sequence of POXA1b deduced from the cDNA sequence has been almost completely verified by means of matrix-assisted laser desorption ionization MS. It has been demonstrated that three out of six putative glycosylation sites are post-translationally modified and the structure of the bound glycosidic moieties has been determined, whereas two other putative glycosylation sites are unmodified. PMID:10417329
Gudhka, Reema K; Neilan, Brett A; Burns, Brendan P
2015-01-01
Halococcus hamelinensis was the first archaeon isolated from stromatolites. These geomicrobial ecosystems are thought to be some of the earliest known on Earth, yet, despite their evolutionary significance, the role of Archaea in these systems is still not well understood. Detailed here is the genome sequencing and analysis of an archaeon isolated from stromatolites. The genome of H. hamelinensis consisted of 3,133,046 base pairs with an average G+C content of 60.08% and contained 3,150 predicted coding sequences or ORFs, 2,196 (68.67%) of which were protein-coding genes with functional assignments and 954 (29.83%) of which were of unknown function. Codon usage of the H. hamelinensis genome was consistent with a highly acidic proteome, a major adaptive mechanism towards high salinity. Amino acid transport and metabolism, inorganic ion transport and metabolism, energy production and conversion, ribosomal structure, and unknown function COG genes were overrepresented. The genome of H. hamelinensis also revealed characteristics reflecting its survival in its extreme environment, including putative genes/pathways involved in osmoprotection, oxidative stress response, and UV damage repair. Finally, genome analyses indicated the presence of putative transposases as well as positive matches of genes of H. hamelinensis against various genomes of Bacteria, Archaea, and viruses, suggesting the potential for horizontal gene transfer.
Zhang, Lin-Lin; Tan, Mei-Juan; Liu, Guang-Lei; Chi, Zhe; Wang, Guang-Yuan; Chi, Zhen-Ming
2015-04-01
The INU1 gene encoding an exo-inulinase from the marine-derived yeast Candida membranifaciens subsp. flavinogenie W14-3 was cloned and characterized. It had an open reading frame of 1,536 bp long encoding an inulinase. The coding region of it was not interrupted by any intron. The cloned gene encoded 512 amino acid residues of a protein with a putative signal peptide of 23 amino acids and a calculated molecular mass of 57.8 kDa. The protein sequence deduced from the inulinase gene contained the inulinase consensus sequences (WMNDPNGL), (RDP), ECP FS and Q. The protein also had six conserved putative N-glycosylation sites. The deduced inulinase from the yeast strain W14-3 was found to be closely related to that from Candida kutaonensis sp. nov. KRF1, Kluyveromyces marxianus, and Cryptococcus aureus G7a. The inulinase gene with its signal peptide encoding sequence was subcloned into the pMIRSC11 expression vector and expressed in Saccharomyces sp. W0. The recombinant yeast strain W14-3-INU-112 obtained could produce 16.8 U/ml of inulinase activity and 12.5 % (v/v) ethanol from 250 g/l of inulin within 168 h. The monosaccharides were detected after the hydrolysis of inulin with the crude inulinase (the yeast culture). All the results indicated that the cloned gene and the recombinant yeast strain W14-3-INU-112 had potential applications in biotechnology.
BLSSpeller: exhaustive comparative discovery of conserved cis-regulatory elements.
De Witte, Dieter; Van de Velde, Jan; Decap, Dries; Van Bel, Michiel; Audenaert, Pieter; Demeester, Piet; Dhoedt, Bart; Vandepoele, Klaas; Fostier, Jan
2015-12-01
The accurate discovery and annotation of regulatory elements remains a challenging problem. The growing number of sequenced genomes creates new opportunities for comparative approaches to motif discovery. Putative binding sites are then considered to be functional if they are conserved in orthologous promoter sequences of multiple related species. Existing methods for comparative motif discovery usually rely on pregenerated multiple sequence alignments, which are difficult to obtain for more diverged species such as plants. As a consequence, misaligned regulatory elements often remain undetected. We present a novel algorithm that supports both alignment-free and alignment-based motif discovery in the promoter sequences of related species. Putative motifs are exhaustively enumerated as words over the IUPAC alphabet and screened for conservation using the branch length score. Additionally, a confidence score is established in a genome-wide fashion. In order to take advantage of a cloud computing infrastructure, the MapReduce programming model is adopted. The method is applied to four monocotyledon plant species and it is shown that high-scoring motifs are significantly enriched for open chromatin regions in Oryza sativa and for transcription factor binding sites inferred through protein-binding microarrays in O.sativa and Zea mays. Furthermore, the method is shown to recover experimentally profiled ga2ox1-like KN1 binding sites in Z.mays. BLSSpeller was written in Java. Source code and manual are available at http://bioinformatics.intec.ugent.be/blsspeller Klaas.Vandepoele@psb.vib-ugent.be or jan.fostier@intec.ugent.be. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
BLSSpeller: exhaustive comparative discovery of conserved cis-regulatory elements
De Witte, Dieter; Van de Velde, Jan; Decap, Dries; Van Bel, Michiel; Audenaert, Pieter; Demeester, Piet; Dhoedt, Bart; Vandepoele, Klaas; Fostier, Jan
2015-01-01
Motivation: The accurate discovery and annotation of regulatory elements remains a challenging problem. The growing number of sequenced genomes creates new opportunities for comparative approaches to motif discovery. Putative binding sites are then considered to be functional if they are conserved in orthologous promoter sequences of multiple related species. Existing methods for comparative motif discovery usually rely on pregenerated multiple sequence alignments, which are difficult to obtain for more diverged species such as plants. As a consequence, misaligned regulatory elements often remain undetected. Results: We present a novel algorithm that supports both alignment-free and alignment-based motif discovery in the promoter sequences of related species. Putative motifs are exhaustively enumerated as words over the IUPAC alphabet and screened for conservation using the branch length score. Additionally, a confidence score is established in a genome-wide fashion. In order to take advantage of a cloud computing infrastructure, the MapReduce programming model is adopted. The method is applied to four monocotyledon plant species and it is shown that high-scoring motifs are significantly enriched for open chromatin regions in Oryza sativa and for transcription factor binding sites inferred through protein-binding microarrays in O.sativa and Zea mays. Furthermore, the method is shown to recover experimentally profiled ga2ox1-like KN1 binding sites in Z.mays. Availability and implementation: BLSSpeller was written in Java. Source code and manual are available at http://bioinformatics.intec.ugent.be/blsspeller Contact: Klaas.Vandepoele@psb.vib-ugent.be or jan.fostier@intec.ugent.be Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26254488
Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi
2014-01-01
MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus. PMID:25375352
Tucker, Matthew R.; Ma, Chao; Phan, Jana; Neumann, Kylie; Shirley, Neil J.; Hahn, Michael G.; Cozzolino, Daniel; Burton, Rachel A.
2017-01-01
Seeds from the myxospermous species Plantago ovata release a polysaccharide-rich mucilage upon contact with water. This seed coat derived mucilage is composed predominantly of heteroxylan (HX) and is utilized as a gluten-free dietary fiber supplement to promote human colorectal health. In this study, a gamma-irradiated P. ovata population was generated and screened using histological stains and Fourier Transform Mid Infrared (FTMIR) spectroscopy to identify putative mutants showing defects in seed coat mucilage HX composition and/or structure. FTMIR analysis of dry seed revealed variation in regions of the IR spectra previously linked to xylan structure in Secale cereale (rye). Subsequent absorbance ratio and PCA multivariate analysis identified 22 putative mutant families with differences in the HX IR fingerprint region. Many of these showed distinct changes in the amount and subtle changes in structure of HX after mucilage extrusion, while 20% of the putative HX mutants identified by FTMIR showed no difference in staining patterns of extruded mucilage compared to wild-type. Transcriptional screening analysis of two putative reduced xylan in mucilage (rxm) mutants, rxm1 and rxm3, revealed that changes in HX levels in rxm1 correlate with reduced transcription of known and novel genes associated with xylan synthesis, possibly indicative of specific co-regulatory units within the xylan biosynthetic pathway. These results confirm that FTMIR is a suitable method for identifying putative mutants with altered mucilage HX composition in P. ovata, and therefore forms a resource to identify novel genes involved in xylan biosynthesis. PMID:28377777
Disruption of a -35kb enhancer impairs CTCF binding and MLH1 expression in colorectal cells.
Liu, Qing; Thoms, Julie A; Nunez, Andrea C; Huang, Yizhou; Knezevic, Kathy; Packham, Deborah; Poulos, Rebecca C; Williams, Rachel; Beck, Dominik; Hawkins, Nicholas J; Ward, Robyn L; Wong, Jason W H; Hesson, Luke B; Sloane, Mathew A; Pimanda, John
2018-06-13
MLH1 is a major tumour suppressor gene involved in the pathogenesis of Lynch syndrome and various sporadic cancers. Despite their potential pathogenic importance, genomic regions capable of regulating MLH1 expression over long distances have yet to be identified. Here we use chromosome conformation capture (3C) to screen a 650-kb region flanking the MLH1 locus to identify interactions between the MLH1 promoter and distal regions in MLH1 expressing and non-expressing cells. Putative enhancers were functionally validated using luciferase reporter assays, chromatin immunoprecipitation and CRISPR-Cas9 mediated deletion of endogenous regions. To evaluate whether germline variants in the enhancer might contribute to impaired MLH1 expression in patients with suspected Lynch syndrome, we also screened germline DNA from a cohort of 74 patients with no known coding mutations or epimutations at the MLH1 promoter. A 1.8kb DNA fragment, 35kb upstream of the MLH1 transcription start site enhances MLH1 gene expression in colorectal cells. The enhancer was bound by CTCF and CRISPR-Cas9 mediated deletion of a core binding region impairs endogenous MLH1 expression. 5.4% of suspected Lynch syndrome patients have a rare single nucleotide variant (G>A; rs143969848; 2.5% in gnomAD European, non-Finnish) within a highly conserved CTCF binding motif, which disrupts enhancer activity in SW620 colorectal carcinoma cells. A CTCF bound region within the MLH1 -35 enhancer regulates MLH1 expression in colorectal cells and is worthy of scrutiny in future genetic screening strategies for suspected Lynch syndrome associated with loss of MLH1 expression. Copyright ©2018, American Association for Cancer Research.
A strategy to discover new organizers identifies a putative heart organizer
Anderson, Claire; Khan, Mohsin A. F.; Wong, Frances; Solovieva, Tatiana; Oliveira, Nidia M. M.; Baldock, Richard A.; Tickle, Cheryll; Burt, Dave W.; Stern, Claudio D.
2016-01-01
Organizers are regions of the embryo that can both induce new fates and impart pattern on other regions. So far, surprisingly few organizers have been discovered, considering the number of patterned tissue types generated during development. This may be because their discovery has relied on transplantation and ablation experiments. Here we describe a new approach, using chick embryos, to discover organizers based on a common gene expression signature, and use it to uncover the anterior intestinal portal (AIP) endoderm as a putative heart organizer. We show that the AIP can induce cardiac identity from non-cardiac mesoderm and that it can pattern this by specifying ventricular and suppressing atrial regional identity. We also uncover some of the signals responsible. The method holds promise as a tool to discover other novel organizers acting during development. PMID:27557800
Structure and regulation of KGD1, the structural gene for yeast alpha-ketoglutarate dehydrogenase.
Repetto, B; Tzagoloff, A
1989-06-01
Nuclear respiratory-defective mutants of Saccharomyces cerevisiae have been screened for lesions in the mitochondrial alpha-ketoglutarate dehydrogenase complex. Strains assigned to complementation group G70 were ascertained to be deficient in enzyme activity due to mutations in the KGD1 gene coding for the alpha-ketoglutarate dehydrogenase component of the complex. The KGD1 gene has been cloned by transformation of a representative kgd1 mutant, C225/U1, with a recombinant plasmid library of wild-type yeast nuclear DNA. Transformants containing the gene on a multicopy plasmid had three- to four-times-higher alpha-ketoglutarate dehydrogenase activity than did wild-type S. cerevisiae. Substitution of the chromosomal copy of KGD1 with a disrupted allele (kgd1::URA3) induced a deficiency in alpha-ketoglutarate dehydrogenase. The sequence of the cloned region of DNA which complements kgd1 mutants was found to have an open reading frame of 3,042 nucleotides capable of coding for a protein of Mw 114,470. The encoded protein had 38% identical residues with the reported sequence of alpha-ketoglutarate dehydrogenase from Escherichia coli. Two lines of evidence indicated that transcription of KGD1 is catabolite repressed. Higher steady-state levels of KGD1 mRNA were detected in wild-type yeast grown on the nonrepressible sugar galactose than in yeast grown on high glucose. Regulation of KGD1 was also studied by fusing different 5'-flanking regions of KGD1 to the lacZ gene of E. coli and measuring the expression of beta-galactosidase in yeast. Transformants harboring a fusion of 693 nucleotides of the 5'-flanking sequence expressed 10 times more beta-galactosidase activity when grown under derepressed conditions. The response to the carbon source was reduced dramatically when the same lacZ fusion was present in a hap2 or hap3 mutant. The promoter element(s) responsible for the regulated expression of KGD1 has been mapped to the -354 to -143 region. This region contained several putative activation sites with sequences matching the core element proposed to be essential for binding of the HAP2 and HAP3 regulatory proteins.
Variant discovery in the sheep milk transcriptome using RNA sequencing.
Suárez-Vega, Aroa; Gutiérrez-Gil, Beatriz; Klopp, Christophe; Tosser-Klopp, Gwenola; Arranz, Juan José
2017-02-15
The identification of genetic variation underlying desired phenotypes is one of the main challenges of current livestock genetic research. High-throughput transcriptome sequencing (RNA-Seq) offers new opportunities for the detection of transcriptome variants (SNPs and short indels) in different tissues and species. In this study, we used RNA-Seq on Milk Sheep Somatic Cells (MSCs) with the goal of characterizing the genetic variation within the coding regions of the milk transcriptome in Churra and Assaf sheep, two common dairy sheep breeds farmed in Spain. A total of 216,637 variants were detected in the MSCs transcriptome of the eight ewes analyzed. Among them, a total of 57,795 variants were detected in the regions harboring Quantitative Trait Loci (QTL) for milk yield, protein percentage and fat percentage, of which 21.44% were novel variants. Among the total variants detected, 561 (2.52%) and 1,649 (7.42%) were predicted to produce high or moderate impact changes in the corresponding transcriptional unit, respectively. In the functional enrichment analysis of the genes positioned within selected QTL regions harboring novel relevant functional variants (high and moderate impact), the KEGG pathway with the highest enrichment was "protein processing in endoplasmic reticulum". Additionally, a total of 504 and 1,063 variants were identified in the genes encoding principal milk proteins and molecules involved in the lipid metabolism, respectively. Of these variants, 20 mutations were found to have putative relevant effects on the encoded proteins. We present herein the first transcriptomic approach aimed at identifying genetic variants of the genes expressed in the lactating mammary gland of sheep. Through the transcriptome analysis of variability within regions harboring QTL for milk yield, protein percentage and fat percentage, we have found several pathways and genes that harbor mutations that could affect dairy production traits. Moreover, remarkable variants were also found in candidate genes coding for major milk proteins and proteins related to milk fat metabolism. Several of the SNPs found in this study could be included as suitable markers in genotyping platforms or custom SNP arrays to perform association analyses in commercial populations and apply genomic selection protocols in the dairy production industry.
Complete nucleotide sequence and annotation of the temperate corynephage ϕ16 genome.
Lobanova, Juliya S; Gak, Evgueni R; Andreeva, Irina G; Rybak, Konstantin V; Krylov, Alexander A; Mashko, Sergey V
2017-08-01
The complete genome of ϕ16, a temperate corynephage from Corynebacterium glutamicum ATCC 21792, was sequenced and annotated (GenBank: KY250482). The electron microscopy study of ϕ16 virion confirmed that it belongs to the family Siphoviridae. The ϕ16 genome consists of a linear double-stranded DNA molecule of 58,200 bp (G+C = 52.2%) with protruding cohesive 3'-ends of 14 nt. Four major structural proteins were separated by SDS-PAGE and identified by peptide mass fingerprinting technique. Using bioinformatics analysis, 101 putative ORFs and 5 tRNA genes were predicted. Only 27 putative gene products could be assigned to known biological functions. The ϕ16 genome was divided into functional modules. Seven putative promoters and eight putative unidirectional intrinsic terminators were predicted. One site of putative «-1» programmed ribosomal frameshifting was proposed in the phage tail assembly genome region. C. glutamicum genetic tools could be broadened by exploiting the known integrase gene (gp33) and the newly identified excisionase gene (gp47), participating in site-specific recombination between ϕ16-attP/attB.
The putative drug efflux systems of the Bacillus cereus group
Elbourne, Liam D. H.; Vörös, Aniko; Kroeger, Jasmin K.; Simm, Roger; Tourasse, Nicolas J.; Finke, Sarah; Henderson, Peter J. F.; Økstad, Ole Andreas; Paulsen, Ian T.; Kolstø, Anne-Brit
2017-01-01
The Bacillus cereus group of bacteria includes seven closely related species, three of which, B. anthracis, B. cereus and B. thuringiensis, are pathogens of humans, animals and/or insects. Preliminary investigations into the transport capabilities of different bacterial lineages suggested that genes encoding putative efflux systems were unusually abundant in the B. cereus group compared to other bacteria. To explore the drug efflux potential of the B. cereus group all putative efflux systems were identified in the genomes of prototypical strains of B. cereus, B. anthracis and B. thuringiensis using our Transporter Automated Annotation Pipeline. More than 90 putative drug efflux systems were found within each of these strains, accounting for up to 2.7% of their protein coding potential. Comparative analyses demonstrated that the efflux systems are highly conserved between these species; 70–80% of the putative efflux pumps were shared between all three strains studied. Furthermore, 82% of the putative efflux system proteins encoded by the prototypical B. cereus strain ATCC 14579 (type strain) were found to be conserved in at least 80% of 169 B. cereus group strains that have high quality genome sequences available. However, only a handful of these efflux pumps have been functionally characterized. Deletion of individual efflux pump genes from B. cereus typically had little impact to drug resistance phenotypes or the general fitness of the strains, possibly because of the large numbers of alternative efflux systems that may have overlapping substrate specificities. Therefore, to gain insight into the possible transport functions of efflux systems in B. cereus, we undertook large-scale qRT-PCR analyses of efflux pump gene expression following drug shocks and other stress treatments. Clustering of gene expression changes identified several groups of similarly regulated systems that may have overlapping drug resistance functions. In this article we review current knowledge of the small molecule efflux pumps encoded by the B. cereus group and suggest the likely functions of numerous uncharacterised pumps. PMID:28472044
Ryan, Joseph F.; Mazza, Maureen E.; Pang, Kevin; Matus, David Q.; Baxevanis, Andreas D.; Martindale, Mark Q.; Finnerty, John R.
2007-01-01
Background Hox genes were critical to many morphological innovations of bilaterian animals. However, early Hox evolution remains obscure. Phylogenetic, developmental, and genomic analyses on the cnidarian sea anemone Nematostella vectensis challenge recent claims that the Hox code is a bilaterian invention and that no “true” Hox genes exist in the phylum Cnidaria. Methodology/Principal Findings Phylogenetic analyses of 18 Hox-related genes from Nematostella identify putative Hox1, Hox2, and Hox9+ genes. Statistical comparisons among competing hypotheses bolster these findings, including an explicit consideration of the gene losses implied by alternate topologies. In situ hybridization studies of 20 Hox-related genes reveal that multiple Hox genes are expressed in distinct regions along the primary body axis, supporting the existence of a pre-bilaterian Hox code. Additionally, several Hox genes are expressed in nested domains along the secondary body axis, suggesting a role in “dorsoventral” patterning. Conclusions/Significance A cluster of anterior and posterior Hox genes, as well as ParaHox cluster of genes evolved prior to the cnidarian-bilaterian split. There is evidence to suggest that these clusters were formed from a series of tandem gene duplication events and played a role in patterning both the primary and secondary body axes in a bilaterally symmetrical common ancestor. Cnidarians and bilaterians shared a common ancestor some 570 to 700 million years ago, and as such, are derived from a common body plan. Our work reveals several conserved genetic components that are found in both of these diverse lineages. This finding is consistent with the hypothesis that a set of developmental rules established in the common ancestor of cnidarians and bilaterians is still at work today. PMID:17252055
Zhu, Changfu; Kauder, Friedrich; Römer, Susanne; Sandmann, Gerhard
2007-02-01
Two 9-cis-epoxycarotenoid dioxygenase (NCED) cDNAs have been cloned from a petal library of Gentiana lutea. Both cDNAs carry a putative transit sequence for chloroplast import and differ mainly in their length and the 5'-flanking regions. GlNCED1 was evolutionary closely related to Arabidopsis thaliana NCED6 whereas GlNCED2 showed highest homology to tomato NCED1 and A. thaliana NCED3. The amounts of GlNCED2 transcript were below Northern detection in G. lutea. In contrast, GlNCED1 was specifically expressed at higher levels in developing flowers when petals start appearing. By genetic engineering of tobacco with coding regions of either gene under a constitutive promoter, their function was further analyzed. Although mRNA of both genes was detectable in the corresponding transgenic plants, a physiological effect was only found for GlNCED1 but not for GlNCED2. In germination experiments of GlNCED1 transgenic lines, delayed radicle formation and cotyledon appearance were observed. However, the transformants exhibited no improved tolerance against desiccation stress. In contrast to other plants with over-expressed NCEDs, prolonged delay of seed germination is the only abscisic-acid-related phenotypic effect in the GlNCED1 transgenic lines.
Catania, Francesco; Lynch, Michael
2010-05-04
In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.
Urasaki, Naoya; Takagi, Hiroki; Natsume, Satoshi; Uemura, Aiko; Taniai, Naoki; Miyagi, Norimichi; Fukushima, Mai; Suzuki, Shouta; Tarora, Kazuhiko; Tamaki, Moritoshi; Sakamoto, Moriaki; Terauchi, Ryohei; Matsumura, Hideo
2017-02-01
Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Tay, W T; Elfekih, S; Court, L; Gordon, K H; De Barro, P J
2016-01-01
The complete length of the Asia I member of the Bemisia tabaci species complex mitochondrial DNA genome (mitogenome) is 15,210 bp (GenBank accession no. KJ778614) with an A-T biased nucleotide composition (A: 32.7%; T: 42.4%; G: 14.0%; C: 10.8%). The mitogenome consists of 13 protein-coding genes (PCGs), 22 transfer RNAs (tRNAs), 2 ribosomal RNA (rRNAs) and a 467 bp putative control region which also includes the A+T rich repeat region. All PCGs have an ATA (n = 8) or ATG (n = 5) start codon. Gene synteny of Asia I is overall similar to B. afer and two other members of the B. tabaci species complex Mediterranean and New World 1, and contains the tRNA-Ser2 located between the Cytb and ND1 genes found in Mediterranean and New World 1, but which is absent in B. afer. The orientation of the tRNA-Arg in Asia I is on the "plus" strand and differed from Mediterranean which is found on the "minus" strand. The Asia I mitogenome size is currently ranked the second smallest after B. afer (14,968 bp) followed by New World 1 (15,322 bp) and Mediterranean (15,632 bp).
Niedermaier, Michael; Schwabe, Georg C; Fees, Stephan; Helmrich, Anne; Brieske, Norbert; Seemann, Petra; Hecht, Jochen; Seitz, Volkhard; Stricker, Sigmar; Leschik, Gundula; Schrock, Evelin; Selby, Paul B; Mundlos, Stefan
2005-04-01
Short digits (Dsh) is a radiation-induced mouse mutant. Homozygous mice are characterized by multiple defects strongly resembling those resulting from Sonic hedgehog (Shh) inactivation. Heterozygous mice show a limb reduction phenotype with fusion and shortening of the proximal and middle phalanges in all digits, similar to human brachydactyly type A1, a condition caused by mutations in Indian hedgehog (IHH). We mapped Dsh to chromosome 5 in a region containing Shh and were able to demonstrate an inversion comprising 11.7 Mb. The distal breakpoint is 13.298 kb upstream of Shh, separating the coding sequence from several putative regulatory elements identified by interspecies comparison. The inversion results in almost complete downregulation of Shh expression during E9.5-E12.5, explaining the homozygous phenotype. At E13.5 and E14.5, however, Shh is upregulated in the phalangeal anlagen of Dsh/+ mice, at a time point and in a region where WT Shh is never expressed. The dysregulation of Shh expression causes the local upregulation of hedgehog target genes such as Gli1-3, patched, and Pthlh, as well as the downregulation of Ihh and Gdf5. This results in shortening of the digits through an arrest of chondrocyte differentiation and the disruption of joint development.
Niedermaier, Michael; Schwabe, Georg C.; Fees, Stephan; Helmrich, Anne; Brieske, Norbert; Seemann, Petra; Hecht, Jochen; Seitz, Volkhard; Stricker, Sigmar; Leschik, Gundula; Schrock, Evelin; Selby, Paul B.; Mundlos, Stefan
2005-01-01
Short digits (Dsh) is a radiation-induced mouse mutant. Homozygous mice are characterized by multiple defects strongly resembling those resulting from Sonic hedgehog (Shh) inactivation. Heterozygous mice show a limb reduction phenotype with fusion and shortening of the proximal and middle phalanges in all digits, similar to human brachydactyly type A1, a condition caused by mutations in Indian hedgehog (IHH). We mapped Dsh to chromosome 5 in a region containing Shh and were able to demonstrate an inversion comprising 11.7 Mb. The distal breakpoint is 13.298 kb upstream of Shh, separating the coding sequence from several putative regulatory elements identified by interspecies comparison. The inversion results in almost complete downregulation of Shh expression during E9.5–E12.5, explaining the homozygous phenotype. At E13.5 and E14.5, however, Shh is upregulated in the phalangeal anlagen of Dsh/+ mice, at a time point and in a region where WT Shh is never expressed. The dysregulation of Shh expression causes the local upregulation of hedgehog target genes such as Gli1-3, patched, and Pthlh, as well as the downregulation of Ihh and Gdf5. This results in shortening of the digits through an arrest of chondrocyte differentiation and the disruption of joint development. PMID:15841179
Genomic interval engineering of mice identified a novel modulator of triglyceride production
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhu, Y.; Jong, M.C.; Frazer, K.A.
1999-10-01
To accelerate the biological annotation of novel genes discovered in sequenced of mammalian genomes, we are creating large deletions in the mouse genome targeted to include clusters of such genes. Here we describe the targeted deletion of a 450 kb region on mouse chromosome 11 which, based on computational analysis of the deleted murine sequences and human 5q orthologous sequences, codes for nine putative genes. Mice homozygous for the deletion had a variety of abnormalities including severe hypertriglyceridemia, hepatic and cardiac enlargement, growth retardation and premature mortality. Analysis of triglyceride metabolism in these animals demonstrated a several-fold increase in hepaticmore » very-low density lipoprotein (VLDL) triglyceride secretion, the most prevalent mechanism responsible for hypertriglyceridemia in humans. A series of mouse BAC and human YAC transgenes covering different intervals of the 450 kb deleted region were assessed for their ability to complement the deletion induced abnormalities. These studies revealed that OCTN2, a gene recently shown to play a role in carnitine transport, was able to correct the triglyceride abnormalities. The discovery of this previously unappreciated relationship between OCTN2, carnitine and hepatic triglyceride production is of particular importance due to the clinical consequence of hypertriglyceridemia and the paucity of genes known to modulate triglyceride secretion.« less
Seabream ghrelin: cDNA cloning, genomic organization and promoter studies.
Yeung, Chung-Man; Chan, Chi-Bun; Woo, Norman Y S; Cheng, Christopher H K
2006-05-01
Recent studies have indicated that ghrelin stimulates growth hormone release from the pituitary via the growth hormone secretagogue receptor (GHSR). We have previously isolated two GHSR subtypes from the pituitary of the black seabream Acanthopagrus schlegeli. In the present study, we have cloned and characterized ghrelin from the same fish species at both the cDNA and gene levels. The full-length seabream ghrelin cDNA, isolated from sea-bream stomach using a novel approach by exploiting a single conserved region in the coding region, was found to encode a prepropeptide of 107 amino acids, with the predicted mature ghrelin peptide consisting of 20 amino acids (GSSFLSPSQKPQNRGKSSRV). Embedded in this full-length cDNA is a putative fish orthologue of the recently reported mammalian obestatin peptide. The ghrelin gene in black seabream, obtained by genomic PCR, was found to encompass four exons and three introns, possessing the same structural organization as in tilapia and goldfish, but different from that in rainbow trout. In addition, a 2230-bp 5'-flanking region of the seabream ghrelin gene was obtained by genome walking. Sequence analysis revealed that, as in the case of the human ghrelin gene, there is neither a GC box nor a CAAT box present in the isolated 5'-flanking region. However, a number of putative transcription factor-binding sites different from the human counterpart were found in the 5'-flanking region of the seabream ghrelin gene, suggesting that different cis- and trans-acting elements are involved in controlling their gene expression. Functional activity of this 5'-flanking region was examined by cloning it into the pGL3-Basic vector upstream of the luciferase reporter gene and transfected into various cell lines. Positive promoter activity could only be recorded in the colon-derived Caco-2 cells, suggesting that the cloned 5'-flanking region represents the functional promoter of the seabream ghrelin gene, which exhibits tissue-specific promoter activity. Using reverse transcriptase PCR analysis, expression of ghrelin was detected only in the seabream stomach, but not in the other tissues examined, including the brain, gill, intestine, kidney, liver and spleen. This stomach-specific expression of ghrelin in seabream is subject to regulation, as administration of growth hormone or ipamorelin to the fish in vivo was demonstrated to enhance its expression. Reminiscent of the homologous upregulation found in the transcriptional control of the seabream GHSR gene, a similar homologous regulatory mechanism might also exist in controlling the expression of seabream ghrelin. The identification of both GHSR and ghrelin from a single fish species would facilitate our subsequent studies on the elucidation of the physiological functions of the ghrelin/GHSR system in teleost. The possible existence of obestatin in teleost opens up new research avenues on the somatotropic axis in fish.
Subramanian, Devika; Natarajan, Jeyakumar
2015-12-10
Staphylococcus aureus is a major human pathogen and ramoplanin is an antimicrobial attributed for effective treatment. The goal of this study was to examine the transcriptomic profiles of ramoplanin sensitive and resistant S. aureus to identify putative modules responsible for virulence and resistance-mechanisms and its characteristic novel genes. The dysregulated genes were used to reconstruct protein functional association networks for virulence-factors and resistance-mechanisms individually. Strong link between metabolic-pathways and development of virulence/resistance is suggested. We identified 15 putative modules of virulence factors. Six hypothetical genes were annotated with novel virulence activity among which SACOL0281 was discovered to be an essential virulence factor EsaD. The roles of MazEF toxin-antitoxin system, SACOL0202/SACOL0201 two-component system and that of amino-sugar and nucleotide-sugar metabolism in virulence are also suggested. In addition, 14 putative modules of resistance mechanisms including modules of ribosomal protein-coding genes and metabolic pathways such as biotin-synthesis, TCA-cycle, riboflavin-biosynthesis, peptidoglycan-biosynthesis etc. are also indicated. Copyright © 2015 Elsevier B.V. All rights reserved.
Virues-Ortega, Javier; Montaño-Fidalgo, Montserrat; Froján-Parga, María Xesús; Calero-Elvira, Ana
2011-12-01
This study analyzes the interobserver agreement and hypothesis-based known-group validity of the Therapist's Verbal Behavior Category System (SISC-INTER). The SISC-INTER is a behavioral observation protocol comprised of a set of verbal categories representing putative behavioral functions of the in-session verbal behavior of a therapist (e.g., discriminative, reinforcing, punishing, and motivational operations). The complete therapeutic process of a clinical case of an individual with marital problems was recorded (10 sessions, 8 hours), and data were arranged in a temporal sequence using 10-min periods. Hypotheses based on the expected performance of the putative behavioral functions portrayed by the SISC-INTER codes across prevalent clinical activities (i.e., assessing, explaining, Socratic method, providing clinical guidance) were tested using autoregressive integrated moving average (ARIMA) models. Known-group validity analyses provided support to all hypotheses. The SISC-INTER may be a useful tool to describe therapist-client interaction in operant terms. The utility of reliable and valid protocols for the descriptive analysis of clinical practice in terms of verbal behavior is discussed. Copyright © 2011. Published by Elsevier Ltd.
Karreth, Florian A.; Tay, Yvonne; Perna, Daniele; Ala, Ugo; Tan, Shen Mynn; Rust, Alistair G.; DeNicola, Gina; Webster, Kaitlyn A.; Weiss, Dror; Perez-Mancera, Pedro A.; Krauthammer, Michael; Halaban, Ruth; Provero, Paolo; Adams, David J.; Tuveson, David A.; Pandolfi, Pier Paolo
2011-01-01
Summary We recently proposed that competitive endogenous RNAs (ceRNAs) sequester microRNAs to regulate mRNA transcripts containing common microRNA recognition elements (MREs). However, the functional role of ceRNAs in cancer remains unknown. Loss of PTEN, a tumor suppressor regulated by ceRNA activity, frequently occurs in melanoma. Here, we report the discovery of significant enrichment of putative PTEN ceRNAs among genes whose loss accelerates tumorigenesis following Sleeping Beauty insertional mutagenesis in a mouse model of melanoma. We validated several putative PTEN ceRNAs and further characterized one, the ZEB2 transcript. We show that ZEB2 modulates PTEN protein levels in a microRNA-dependent, protein coding-independent manner. Attenuation of ZEB2 expression activates the PI3K/AKT pathway, enhances cell transformation, and commonly occurs in human melanomas and other cancers expressing low PTEN levels. Our study genetically identifies multiple putative microRNA decoys for PTEN, validates ZEB2 mRNA as a bona fide PTEN ceRNA, and demonstrates that abrogated ZEB2 expression cooperates with BRAFV600E to promote melanomagenesis. PMID:22000016
Kapanadze, B; Makeeva, N; Corcoran, M; Jareborg, N; Hammarsund, M; Baranova, A; Zabarovsky, E; Vorontsova, O; Merup, M; Gahrton, G; Jansson, M; Yankovsky, N; Einhorn, S; Oscier, D; Grandér, D; Sangfelt, O
2000-12-15
Previous studies have indicated the presence of a putative tumor suppressor gene on human chromosome 13q14, commonly deleted in patients with B-cell chronic lymphocytic leukemia (B-CLL). We have recently identified a minimally deleted region encompassing parts of two adjacent genes, termed LEU1 and LEU2 (leukemia-associated genes 1 and 2), and several additional transcripts. In addition, 50 kb centromeric to this region we have identified another gene, LEU5/RFP2. To elucidate further the complex genomic organization of this region, we have identified, mapped, and sequenced the homologous region in the mouse. Fluorescence in situ hybridization analysis demonstrated that the region maps to mouse chromosome 14. The overall organization and gene order in this region were found to be highly conserved in the mouse. Sequence comparison between the human deletion hotspot region and its homologous mouse region revealed a high degree of sequence conservation with an overall score of 74%. However, our data also show that in terms of transcribed sequences, only two of those, human LEU2 and LEU5/RFP2, are clearly conserved, strengthening the case for these genes as putative candidate B-CLL tumor suppressor genes.
Darris, Maxwell
2017-01-01
ABSTRACT Most of the 24 known Chitinophaga species were originally isolated from soils. We report the draft genome sequence of a putatively novel Chitinophaga sp. from a biofilm in an air conditioner condensate pipe. The genome comprises 7,661,303 bp in one scaffold, 5,694 predicted protein-coding sequences, and a G+C content of 47.6%. PMID:29051259
Storari, Michelangelo; Wüthrich, Daniel; Bruggmann, Rémy; Berthoud, Hélène; Arias-Roth, Emmanuelle
2015-03-12
Clostridium tyrobutyricum is the main microorganism responsible for late blowing defect in cheeses. Here, we present the draft genome sequences of two C. tyrobutyricum strains isolated from a Swiss semihard red-smear cheese. The two draft genomes comprise 3.05 and 3.08 Mbp and contain 3,030 and 3,089 putative coding sequences, respectively. Copyright © 2015 Storari et al.
Chiriac, Cecilia; Baricz, Andreea
2018-01-01
ABSTRACT The draft genome assembly of Janthinobacterium sp. strain ROICE36 has 207 contigs, with a total genome size of 5,977,006 bp and a G+C content of 62%. Preliminary genome analysis identified 5,363 protein-coding genes and a total of 7 secondary metabolic gene clusters (encoding bacteriocins, nonribosomal peptide-synthetase [NRPS], terpene, hserlactone, and other ketide synthases). PMID:29650588
Ormeño-Orrillo, Ernesto; Rogel, Marco A; Zúñiga-Dávila, Doris; Martínez-Romero, Esperanza
2018-03-08
The complete genome sequence of Bradyrhizobium icense LMTR 13 T , a root nodule bacterium isolated from the legume Phaseolus lunatus , is reported here. The genome consists of a circular 8,322,773-bp chromosome which codes for a large and novel symbiotic island as well as genes putatively involved in soil and root colonization. Copyright © 2018 Ormeño-Orrillo et al.
SORL1 variants across Alzheimer's disease European American cohorts.
Fernández, Maria Victoria; Black, Kathleen; Carrell, David; Saef, Ben; Budde, John; Deming, Yuetiva; Howells, Bill; Del-Aguila, Jorge L; Ma, Shengmei; Bi, Catherine; Norton, Joanne; Chasse, Rachel; Morris, John; Goate, Alison; Cruchaga, Carlos
2016-12-01
The accumulation of the toxic Aβ peptide in Alzheimer's disease (AD) largely relies upon an efficient recycling of amyloid precursor protein (APP). Recent genetic association studies have described rare variants in SORL1 with putative pathogenic consequences in the recycling of APP. In this work, we examine the presence of rare coding variants in SORL1 in three different European American cohorts: early-onset, late-onset AD (LOAD) and familial LOAD.
Wu, Yueh-Lung; Wu, Carol-P; Huang, Yu-Hui; Huang, Sheng-Ping; Lo, Huei-Ru; Chang, Hao-Shuo; Lin, Pi-Hsiu; Wu, Ming-Cheng; Chang, Chia-Jung; Chao, Yu-Chan
2014-11-01
The p143 gene from Autographa californica multinucleocapsid nucleopolyhedrovirus (AcMNPV) has been found to increase the expression of luciferase, which is driven by the polyhedrin gene promoter, in a plasmid with virus coinfection. Further study indicated that this is due to the presence of a replication origin (ori) in the coding region of this gene. Transient DNA replication assays showed that a specific fragment of the p143 coding sequence, p143-3, underwent virus-dependent DNA replication in Spodoptera frugiperda IPLB-Sf-21 (Sf-21) cells. Deletion analysis of the p143-3 fragment showed that subfragment p143-3.2a contained the essential sequence of this putative ori. Sequence analysis of this region revealed a unique distribution of imperfect palindromes with high AT contents. No sequence homology or similarity between p143-3.2a and any other known ori was detected, suggesting that it is a novel baculovirus ori. Further study showed that the p143-3.2a ori can replicate more efficiently in infected Sf-21 cells than baculovirus homologous regions (hrs), the major baculovirus ori, or non-hr oris during virus replication. Previously, hr on its own was unable to replicate in mammalian cells, and for mammalian viral oris, viral proteins are generally required for their proper replication in host cells. However, the p143-3.2a ori was, surprisingly, found to function as an efficient ori in mammalian cells without the need for any viral proteins. We conclude that p143 contains a unique sequence that can function as an ori to enhance gene expression in not only insect cells but also mammalian cells. Baculovirus DNA replication relies on both hr and non-hr oris; however, so far very little is known about the latter oris. Here we have identified a new non-hr ori, the p143 ori, which resides in the coding region of p143. By developing a novel DNA replication-enhanced reporter system, we have identified and located the core region required for the p143 ori. This ori contains a large number of imperfect inverted repeats and is the most active ori in the viral genome during virus infection in insect cells. We also found that it is a unique ori that can replicate in mammalian cells without the assistance of baculovirus gene products. The identification of this ori should contribute to a better understanding of baculovirus DNA replication. Also, this ori is very useful in assisting with gene expression in mammalian cells. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Petrova, L P; Prilipov, A G; Katsy, E I
2017-01-01
It is known that in Azospirillum brasilense strains Sp245 and SR75 included in serogroup I, the repeat units of their O-polysaccharides consist of five residues of D-rhamnose, and in strain SR15, of four; and the heteropolymeric O-polysaccharide of A. brasilense type strain Sp7 from serogroup II contains not less than five types of repeat units. In the present work, a complex of nondegenerate primers to the genes of A. brasilense Sp245 plasmids AZOBR_p6, AZOBR_p3, and AZOBR_p2, which encode putative enzymes for the biosynthesis of core oligosaccharide and O-polysaccharide of lipopolysaccharide, capsular polysaccharides, and exopolysaccharides, was proposed. By using the designed primers, products of the expected sizes were synthesized in polymerase chain reactions on genomic DNA of A. brasilense Sp245, SR75, SR15, and Sp7 in 36, 29, 23, and 12 cases, respectively. As a result of sequencing of a number of amplicons, a high (86–99%) level of identity of the corresponding putative polysaccharide biosynthesis genes in three A. brasilense strains from serogroup I was detected. In a blotting-hybridization reaction with the biotin-labeled DNA of the A. brasilense gene AZOBR_p60122 coding for putative permease of the ABC transporter of polysaccharides, localization of the homologous gene in ~120-MDa plasmids of the bacteria A. brasilense SR15 and SR75 was revealed.
Lazzarato, F; Franceschinis, G; Botta, M; Cordero, F; Calogero, R A
2004-11-01
RRE allows the extraction of non-coding regions surrounding a coding sequence [i.e. gene upstream region, 5'-untranslated region (5'-UTR), introns, 3'-UTR, downstream region] from annotated genomic datasets available at NCBI. RRE parser and web-based interface are accessible at http://www.bioinformatica.unito.it/bioinformatics/rre/rre.html
Using a Euclid distance discriminant method to find protein coding genes in the yeast genome.
Zhang, Chun-Ting; Wang, Ju; Zhang, Ren
2002-02-01
The Euclid distance discriminant method is used to find protein coding genes in the yeast genome, based on the single nucleotide frequencies at three codon positions in the ORFs. The method is extremely simple and may be extended to find genes in prokaryotic genomes or eukaryotic genomes with less introns. Six-fold cross-validation tests have demonstrated that the accuracy of the algorithm is better than 93%. Based on this, it is found that the total number of protein coding genes in the yeast genome is less than or equal to 5579 only, about 3.8-7.0% less than 5800-6000, which is currently widely accepted. The base compositions at three codon positions are analyzed in details using a graphic method. The result shows that the preference codons adopted by yeast genes are of the RGW type, where R, G and W indicate the bases of purine, non-G and A/T, whereas the 'codons' in the intergenic sequences are of the form NNN, where N denotes any base. This fact constitutes the basis of the algorithm to distinguish between coding and non-coding ORFs in the yeast genome. The names of putative non-coding ORFs are listed here in detail.
Memory Accumulation Mechanisms in Human Cortex Are Independent of Motor Intentions
Tosoni, Annalisa; Mignogna, Valeria; McAvoy, Mark P.; Shulman, Gordon L.; Corbetta, Maurizio; Romani, Gian Luca
2014-01-01
Previous studies on perceptual decision-making have often emphasized a tight link between decisions and motor intentions. Human decisions, however, also depend on memories or experiences that are not closely tied to specific motor responses. Recent neuroimaging findings have suggested that, during episodic retrieval, parietal activity reflects the accumulation of evidence for memory decisions. It is currently unknown, however, whether these evidence accumulation signals are functionally linked to signals for motor intentions coded in frontoparietal regions and whether activity in the putative memory accumulator tracks the amount of evidence for only previous experience, as reflected in “old” reports, or for both old and new decisions, as reflected in the accuracy of memory judgments. Here, human participants used saccadic-eye and hand-pointing movements to report recognition judgments on pictures defined by different degrees of evidence for old or new decisions. A set of cortical regions, including the middle intraparietal sulcus, showed a monotonic variation of the fMRI BOLD signal that scaled with perceived memory strength (older > newer), compatible with an asymmetrical memory accumulator. Another set, including the hippocampus and the angular gyrus, showed a nonmonotonic response profile tracking memory accuracy (higher > lower evidence), compatible with a symmetrical accumulator. In contrast, eye and hand effector-specific regions in frontoparietal cortex tracked motor intentions but were not modulated by the amount of evidence for the effector outcome. We conclude that item recognition decisions are supported by a combination of symmetrical and asymmetrical accumulation signals largely segregated from motor intentions. PMID:24828652
Conceição, Inês C; Rama, Maria M; Oliveira, Bárbara; Café, Cátia; Almeida, Joana; Mouga, Susana; Duque, Frederico; Oliveira, Guiomar; Vicente, Astrid M
2017-04-01
The PARK2 gene encodes Parkin, a component of a multiprotein E3 ubiquitin ligase complex that targets substrate proteins for proteasomal degradation. PARK2 mutations are frequently associated with Parkinson's disease, but structural alterations have also been described in patients with neurodevelopmental disorders (NDD), suggesting a pathological effect ubiquitous to neurodevelopmental and neurodegenerative brain processes. The present study aimed to define the critical regions for NDD within PARK2. To clarify PARK2 involvement in NDDs, we examined the frequency and location of copy number variants (CNVs) identified in patients from our sample and reported in the literature and relevant databases, and compared with control populations. Overall, the frequency of PARK2 CNVs was higher in controls than in NDD cases. However, closer inspection of the CNV location in PARK2 showed that the frequency of CNVs targeting the Parkin C-terminal, corresponding to the ring-between-ring (RBR) domain responsible for Parkin activity, is significantly higher in NDD cases than in controls. In contrast, CNVs targeting the N-terminal of Parkin, including domains that regulate ubiquitination activity, are very common both in cases and in controls. Although PARK2 may be a pathological factor for NDDs, likely not all variants are pathogenic, and a conclusive assessment of PARK2 variant pathogenicity requires an accurate analysis of their location within the coding region and encoded functional domains.
Lee, Chien-Yueh; Hsieh, Ping-Han; Chiang, Li-Mei; Chattopadhyay, Amrita; Li, Kuan-Yi; Lee, Yi-Fang; Lu, Tzu-Pin; Lai, Liang-Chuan; Lin, En-Chung; Lee, Hsinyu; Ding, Shih-Torng; Tsai, Mong-Hsun; Chen, Chien-Yu; Chuang, Eric Y
2018-05-01
The Mikado pheasant (Syrmaticus mikado) is a nearly endangered species indigenous to high-altitude regions of Taiwan. This pheasant provides an opportunity to investigate evolutionary processes following geographic isolation. Currently, the genetic background and adaptive evolution of the Mikado pheasant remain unclear. We present the draft genome of the Mikado pheasant, which consists of 1.04 Gb of DNA and 15,972 annotated protein-coding genes. The Mikado pheasant displays expansion and positive selection of genes related to features that contribute to its adaptive evolution, such as energy metabolism, oxygen transport, hemoglobin binding, radiation response, immune response, and DNA repair. To investigate the molecular evolution of the major histocompatibility complex (MHC) across several avian species, 39 putative genes spanning 227 kb on a contiguous region were annotated and manually curated. The MHC loci of the pheasant revealed a high level of synteny, several rapidly evolving genes, and inverse regions compared to the same loci in the chicken. The complete mitochondrial genome was also sequenced, assembled, and compared against four long-tailed pheasants. The results from molecular clock analysis suggest that ancestors of the Mikado pheasant migrated from the north to Taiwan about 3.47 million years ago. This study provides a valuable genomic resource for the Mikado pheasant, insights into its adaptation to high altitude, and the evolutionary history of the genus Syrmaticus, which could potentially be useful for future studies that investigate molecular evolution, genomics, ecology, and immunogenetics.
2011-01-01
Background Stenospermocarpy is a mechanism through which certain genotypes of Vitis vinifera L. such as Sultanina produce berries with seeds reduced in size. Stenospermocarpy has not yet been characterized at the molecular level. Results Genetic and physical maps were integrated with the public genomic sequence of Vitis vinifera L. to improve QTL analysis for seedlessness and berry size in experimental progeny derived from a cross of two seedless genotypes. Major QTLs co-positioning for both traits on chromosome 18 defined a 92-kb confidence interval. Functional information from model species including Vitis suggested that VvAGL11, included in this confidence interval, might be the main positional candidate gene responsible for seed and berry development. Characterization of VvAGL11 at the sequence level in the experimental progeny identified several SNPs and INDELs in both regulatory and coding regions. In association analyses performed over three seasons, these SNPs and INDELs explained up to 78% and 44% of the phenotypic variation in seed and berry weight, respectively. Moreover, genetic experiments indicated that the regulatory region has a larger effect on the phenotype than the coding region. Transcriptional analysis lent additional support to the putative role of VvAGL11's regulatory region, as its expression is abolished in seedless genotypes at key stages of seed development. These results transform VvAGL11 into a functional candidate gene for further analyses based on genetic transformation. For breeding purposes, intragenic markers were tested individually for marker assisted selection, and the best markers were those closest to the transcription start site. Conclusion We propose that VvAGL11 is the major functional candidate gene for seedlessness, and we provide experimental evidence suggesting that the seedless phenotype might be caused by variations in its promoter region. Current knowledge of the function of its orthologous genes, its expression profile in Vitis varieties and the strong association between its sequence variation and the degree of seedlessness together indicate that the D-lineage MADS-box gene VvAGL11 corresponds to the Seed Development Inhibitor locus described earlier as a major locus for seedlessness. These results provide new hypotheses for further investigations of the molecular mechanisms involved in seed and berry development. PMID:21447172
Prevalence of transcription promoters within archaeal operons and coding sequences
Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S
2009-01-01
Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of ∼64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein–DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3′ ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes—events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements. PMID:19536208
Prevalence of transcription promoters within archaeal operons and coding sequences.
Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S
2009-01-01
Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of approximately 64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein-DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3' ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes-events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements.
Oh, Chang Seok; Lee, Soong Deok; Kim, Yi-Suk; Shin, Dong Hoon
2015-01-01
Previous study showed that East Asian mtDNA haplogroups, especially those of Koreans, could be successfully assigned by the coupled use of analyses on coding region SNP markers and control region mutation motifs. In this study, we tried to see if the same triple multiplex analysis for coding regions SNPs could be also applicable to ancient samples from East Asia as the complementation for sequence analysis of mtDNA control region. By the study on Joseon skeleton samples, we know that mtDNA haplogroup determined by coding region SNP markers successfully falls within the same haplogroup that sequence analysis on control region can assign. Considering that ancient samples in previous studies make no small number of errors in control region mtDNA sequencing, coding region SNP analysis can be used as good complimentary to the conventional haplogroup determination, especially of archaeological human bone samples buried underground over long periods. PMID:26345190
On fuzzy semantic similarity measure for DNA coding.
Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin
2016-02-01
A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons׳ clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides i.e. appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons have been exploited in FSSM. The nucleotides׳ fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification i.e. 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.
Revisiting the operational RNA code for amino acids: Ensemble attributes and their implications.
Shaul, Shaul; Berel, Dror; Benjamini, Yoav; Graur, Dan
2010-01-01
It has been suggested that tRNA acceptor stems specify an operational RNA code for amino acids. In the last 20 years several attributes of the putative code have been elucidated for a small number of model organisms. To gain insight about the ensemble attributes of the code, we analyzed 4925 tRNA sequences from 102 bacterial and 21 archaeal species. Here, we used a classification and regression tree (CART) methodology, and we found that the degrees of degeneracy or specificity of the RNA codes in both Archaea and Bacteria differ from those of the genetic code. We found instances of taxon-specific alternative codes, i.e., identical acceptor stem determinants encrypting different amino acids in different species, as well as instances of ambiguity, i.e., identical acceptor stem determinants encrypting two or more amino acids in the same species. When partitioning the data by class of synthetase, the degree of code ambiguity was significantly reduced. In cryptographic terms, a plausible interpretation of this result is that the class distinction in synthetases is an essential part of the decryption rules for resolving the subset of RNA code ambiguities enciphered by identical acceptor stem determinants of tRNAs acylated by enzymes belonging to the two classes. In evolutionary terms, our findings lend support to the notion that in the pre-DNA world, interactions between tRNA acceptor stems and synthetases formed the basis for the distinction between the two classes; hence, ambiguities in the ancient RNA code were pivotal for the fixation of these enzymes in the genomes of ancestral prokaryotes.
Revisiting the operational RNA code for amino acids: Ensemble attributes and their implications
Shaul, Shaul; Berel, Dror; Benjamini, Yoav; Graur, Dan
2010-01-01
It has been suggested that tRNA acceptor stems specify an operational RNA code for amino acids. In the last 20 years several attributes of the putative code have been elucidated for a small number of model organisms. To gain insight about the ensemble attributes of the code, we analyzed 4925 tRNA sequences from 102 bacterial and 21 archaeal species. Here, we used a classification and regression tree (CART) methodology, and we found that the degrees of degeneracy or specificity of the RNA codes in both Archaea and Bacteria differ from those of the genetic code. We found instances of taxon-specific alternative codes, i.e., identical acceptor stem determinants encrypting different amino acids in different species, as well as instances of ambiguity, i.e., identical acceptor stem determinants encrypting two or more amino acids in the same species. When partitioning the data by class of synthetase, the degree of code ambiguity was significantly reduced. In cryptographic terms, a plausible interpretation of this result is that the class distinction in synthetases is an essential part of the decryption rules for resolving the subset of RNA code ambiguities enciphered by identical acceptor stem determinants of tRNAs acylated by enzymes belonging to the two classes. In evolutionary terms, our findings lend support to the notion that in the pre-DNA world, interactions between tRNA acceptor stems and synthetases formed the basis for the distinction between the two classes; hence, ambiguities in the ancient RNA code were pivotal for the fixation of these enzymes in the genomes of ancestral prokaryotes. PMID:19952117
Facial asymmetry and clinical manifestations in patients with novel insertion of the TCOF1 gene.
Su, P-H; Liu, Y-F; Yu, J-S; Chen, J-Y; Chen, S-J; Lai, Y-J
2012-11-01
This study explored the role of TCOF1 insertion mutations in Taiwanese patients with craniofacial anomalies. Twelve patients with single or multiple, asymmetrical congenital craniofacial anomalies were enrolled. Genomic DNA was prepared from leukocytes; the coding regions of TCOF1 were analyzed by polymerase chain reaction and direct sequencing. Clinical manifestations were correlated to the TCOF1 mutation. Six of 12 patients diagnosed with hemifacial microsomia exhibited a novel insertion mutation 4127 ins G (frameshift) in exon 24 in the TCOF1 gene. All six patients were diagnosed with anomalies on the left side. In addition, four of these six patients had hearing impairment; three had other major anomalies; and two had developmental delay. The insertion caused a frameshift, an early truncation, the loss of two putative nuclear localization signals (residues 1404-1420 and 1424-1440), and the loss of coiled coil domain (1406-1426) in treacle protein. These findings support the existence of two regulators of growth of the mandibular condyles. © 2011 John Wiley & Sons A/S.
Ferro, Myriam; Tardif, Marianne; Reguer, Erwan; Cahuzac, Romain; Bruley, Christophe; Vermat, Thierry; Nugues, Estelle; Vigouroux, Marielle; Vandenbrouck, Yves; Garin, Jérôme; Viari, Alain
2008-05-01
PepLine is a fully automated software which maps MS/MS fragmentation spectra of trypsic peptides to genomic DNA sequences. The approach is based on Peptide Sequence Tags (PSTs) obtained from partial interpretation of QTOF MS/MS spectra (first module). PSTs are then mapped on the six-frame translations of genomic sequences (second module) giving hits. Hits are then clustered to detect potential coding regions (third module). Our work aimed at optimizing the algorithms of each component to allow the whole pipeline to proceed in a fully automated manner using raw nucleic acid sequences (i.e., genomes that have not been "reduced" to a database of ORFs or putative exons sequences). The whole pipeline was tested on controlled MS/MS spectra sets from standard proteins and from Arabidopsis thaliana envelope chloroplast samples. Our results demonstrate that PepLine competed with protein database searching softwares and was fast enough to potentially tackle large data sets and/or high size genomes. We also illustrate the potential of this approach for the detection of the intron/exon structure of genes.
Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.
1997-01-01
A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064
Evolution and Variation of Renin Genes in Mice
Dickinson, Douglas P.; Gross, Kenneth W.; Piccini, Nina; Wilson, Carol M.
1984-01-01
Inbred strains of mice carry Ren-1, a gene encoding the thermostable Renin-1 isozyme. Ren-1 is expressed at relatively low levels in mouse submandibular gland and kidney. Some strains also carry Ren-2, a gene encoding the thermolabile Renin-2 isozyme. Ren-2 is expressed at high levels in the mouse submandibular gland and at very low levels, if at all, in the kidney. Ren-1 and Ren-2 are closely linked on mouse chromosome 1, show extensive homology in coding and noncoding regions and provide a model for studying the regulation of gene expression. An investigation of renin genes and enzymatic activity in wild-derived mice identified several restriction site polymorphisms as well as putative variants in renin gene expression and protein structure. The number of renin genes carried by different subpopulations of wild-derived mice is consistent with the occurrence of a gene duplication event prior to the divergence of M. spretus (2.75–5.5 million yr ago). This conclusion is in agreement with a prior estimate based upon comparative sequence analysis of Ren-1 and Ren-2 from inbred laboratory mice. PMID:6389258
Zipper plot: visualizing transcriptional activity of genomic regions.
Avila Cobos, Francisco; Anckaert, Jasper; Volders, Pieter-Jan; Everaert, Celine; Rombaut, Dries; Vandesompele, Jo; De Preter, Katleen; Mestdagh, Pieter
2017-05-02
Reconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs. To tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot. Using the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5'-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool.
McBride, David J.; Buckle, Adam; van Heyningen, Veronica; Kleinjan, Dirk A.
2011-01-01
The PAX6 gene plays a crucial role in development of the eye, brain, olfactory system and endocrine pancreas. Consistent with its pleiotropic role the gene exhibits a complex developmental expression pattern which is subject to strict spatial, temporal and quantitative regulation. Control of expression depends on a large array of cis-elements residing in an extended genomic domain around the coding region of the gene. The minimal essential region required for proper regulation of this complex locus has been defined through analysis of human aniridia-associated breakpoints and YAC transgenic rescue studies of the mouse smalleye mutant. We have carried out a systematic DNase I hypersensitive site (HS) analysis across 200 kb of this critical region of mouse chromosome 2E3 to identify putative regulatory elements. Mapping the identified HSs onto a percent identity plot (PIP) shows many HSs correspond to recognisable genomic features such as evolutionarily conserved sequences, CpG islands and retrotransposon derived repeats. We then focussed on a region previously shown to contain essential long range cis-regulatory information, the Pax6 downstream regulatory region (DRR), allowing comparison of mouse HS data with previous human HS data for this region. Reporter transgenic mice for two of the HS sites, HS5 and HS6, show that they function as tissue specific regulatory elements. In addition we have characterised enhancer activity of an ultra-conserved cis-regulatory region located near Pax6, termed E60. All three cis-elements exhibit multiple spatio-temporal activities in the embryo that overlap between themselves and other elements in the locus. Using a deletion set of YAC reporter transgenic mice we demonstrate functional interdependence of the elements. Finally, we use the HS6 enhancer as a marker for the migration of precerebellar neuro-epithelium cells to the hindbrain precerebellar nuclei along the posterior and anterior extramural streams allowing visualisation of migratory defects in both pathways in Pax6Sey/Sey mice. PMID:22220192
Wan, Xuehua; Darris, Maxwell; Hou, Shaobin; Donachie, Stuart P
2017-10-19
Most of the 24 known Chitinophaga species were originally isolated from soils. We report the draft genome sequence of a putatively novel Chitinophaga sp. from a biofilm in an air conditioner condensate pipe. The genome comprises 7,661,303 bp in one scaffold, 5,694 predicted protein-coding sequences, and a G+C content of 47.6%. Copyright © 2017 Wan et al.
A Deeper Examination of Thorellius atrox Scorpion Venom Components with Omic Techonologies
Romero-Gutierrez, Teresa; Batista, Cesar V. F.
2017-01-01
This communication reports a further examination of venom gland transcripts and venom composition of the Mexican scorpion Thorellius atrox using RNA-seq and tandem mass spectrometry. The RNA-seq, which was performed with the Illumina protocol, yielded more than 20,000 assembled transcripts. Following a database search and annotation strategy, 160 transcripts were identified, potentially coding for venom components. A novel sequence was identified that potentially codes for a peptide with similarity to spider ω-agatoxins, which act on voltage-gated calcium channels, not known before to exist in scorpion venoms. Analogous transcripts were found in other scorpion species. They could represent members of a new scorpion toxin family, here named omegascorpins. The mass fingerprint by LC-MS identified 135 individual venom components, five of which matched with the theoretical masses of putative peptides translated from the transcriptome. The LC-MS/MS de novo sequencing allowed to reconstruct and identify 42 proteins encoded by assembled transcripts, thus validating the transcriptome analysis. Earlier studies conducted with this scorpion venom permitted the identification of only twenty putative venom components. The present work performed with more powerful and modern omic technologies demonstrates the capacity of accomplishing a deeper characterization of scorpion venom components and the identification of novel molecules with potential applications in biomedicine and the study of ion channel physiology. PMID:29231872
DOE Office of Scientific and Technical Information (OSTI.GOV)
Villard, L.; Lossi, A.M.; Fontes, M.
We have previously reported the isolation of a gene from Xq13 that codes for a putative regulator of transcription (XNP) and has now been shown to be the gene involved in the X-linked {alpha}-thalassemia with mental retardation (ATR-X) syndrome. The widespread expression and numerous domains present in the putative protein suggest that this gene could be involved in other phenotypes. The predominant expression of the gene in the developing brain, as well as its association with neuron differentiation, indicates that mutations of this gene might result in a mental retardation (MR) phenotype. In this paper we present a family withmore » a splice junction mutation in XNP that results in the skipping of an exon and in the introduction of a stop codon in the middle of the XNP-coding sequence. Only the abnormal transcript is expressed in two first cousins presenting the classic ATR-X phenotype (with {alpha}-thalassemia and HbH inclusions). In a distant cousin presenting a similar dysmorphic MR phenotype but not having thalassemia, {approximately}30% of the XNP transcripts are normal. These data demonstrate that the mode of action of the XNP gene product on globin expression is distinct from its mode of action in brain development and facial morphogenesis and suggest that other dysmorphic mental retardation phenotypes, such as Juberg-Marsidi or some sporadic cases of Coffin-Lowry, could be due to mutations in XNP. 20 refs., 5 figs., 2 tabs.« less
Regulatory single nucleotide polymorphisms (rSNPs) at the promoters 1A and 1B of the human APC gene.
Matveeva, Marina Yu; Kashina, Elena V; Reshetnikov, Vasily V; Bryzgalov, Leonid O; Antontseva, Elena V; Bondar, Natalia P; Merkulova, Tatiana I
2016-12-22
Germline mutations in the coding sequence of the tumour suppressor APC gene give rise to familial adenomatous polyposis (which leads to colorectal cancer) and are associated with many other oncopathologies. The loss of APC function because of deletion of putative promoter 1A or 1B also results in the development of colorectal cancer. Since the regions of promoters 1A and 1B contain many single nucleotide polymorphisms (SNPs), the aim of this study was to perform functional analysis of some of these SNPs by means of an electrophoretic mobility shift assay (EMSA) and a luciferase reporter assay. First, it was shown that both putative promoters of APC (1A and 1B) drive transcription in an in vitro reporter experiment. From eleven randomly selected SNPs of promoter 1A and four SNPs of promoter 1B, nine and two respectively showed differential patterns of binding of nuclear proteins to oligonucleotide probes corresponding to alternative alleles. The luciferase reporter assay showed that among the six SNPs tested, the rs75612255 C allele and rs113017087 C allele in promoter 1A as well as the rs138386816 T allele and rs115658307 T allele in promoter 1B significantly increased luciferase activity in the human erythromyeloblastoid leukaemia cell line K562. In human colorectal cancer HCT-116 cells, none of the substitutions under study had any effect, with the exception of minor allele G of rs79896135 in promoter 1B. This allele significantly decreased the luciferase reporter's activity CONCLUSION: Our results indicate that many SNPs in APC promoters 1A and 1B are functionally relevant and that allele G of rs79896135 may be associated with the predisposition to colorectal cancer.
Characterization of uncultivable bat influenza virus using a replicative synthetic virus.
Zhou, Bin; Ma, Jingjiao; Liu, Qinfang; Bawa, Bhupinder; Wang, Wei; Shabman, Reed S; Duff, Michael; Lee, Jinhwa; Lang, Yuekun; Cao, Nan; Nagy, Abdou; Lin, Xudong; Stockwell, Timothy B; Richt, Juergen A; Wentworth, David E; Ma, Wenjun
2014-10-01
Bats harbor many viruses, which are periodically transmitted to humans resulting in outbreaks of disease (e.g., Ebola, SARS-CoV). Recently, influenza virus-like sequences were identified in bats; however, the viruses could not be cultured. This discovery aroused great interest in understanding the evolutionary history and pandemic potential of bat-influenza. Using synthetic genomics, we were unable to rescue the wild type bat virus, but could rescue a modified bat-influenza virus that had the HA and NA coding regions replaced with those of A/PR/8/1934 (H1N1). This modified bat-influenza virus replicated efficiently in vitro and in mice, resulting in severe disease. Additional studies using a bat-influenza virus that had the HA and NA of A/swine/Texas/4199-2/1998 (H3N2) showed that the PR8 HA and NA contributed to the pathogenicity in mice. Unlike other influenza viruses, engineering truncations hypothesized to reduce interferon antagonism into the NS1 protein didn't attenuate bat-influenza. In contrast, substitution of a putative virulence mutation from the bat-influenza PB2 significantly attenuated the virus in mice and introduction of a putative virulence mutation increased its pathogenicity. Mini-genome replication studies and virus reassortment experiments demonstrated that bat-influenza has very limited genetic and protein compatibility with Type A or Type B influenza viruses, yet it readily reassorts with another divergent bat-influenza virus, suggesting that the bat-influenza lineage may represent a new Genus/Species within the Orthomyxoviridae family. Collectively, our data indicate that the bat-influenza viruses recently identified are authentic viruses that pose little, if any, pandemic threat to humans; however, they provide new insights into the evolution and basic biology of influenza viruses.
Characterization of Uncultivable Bat Influenza Virus Using a Replicative Synthetic Virus
Bawa, Bhupinder; Wang, Wei; Shabman, Reed S.; Duff, Michael; Lee, Jinhwa; Lang, Yuekun; Cao, Nan; Nagy, Abdou; Lin, Xudong; Stockwell, Timothy B.; Richt, Juergen A.; Wentworth, David E.; Ma, Wenjun
2014-01-01
Bats harbor many viruses, which are periodically transmitted to humans resulting in outbreaks of disease (e.g., Ebola, SARS-CoV). Recently, influenza virus-like sequences were identified in bats; however, the viruses could not be cultured. This discovery aroused great interest in understanding the evolutionary history and pandemic potential of bat-influenza. Using synthetic genomics, we were unable to rescue the wild type bat virus, but could rescue a modified bat-influenza virus that had the HA and NA coding regions replaced with those of A/PR/8/1934 (H1N1). This modified bat-influenza virus replicated efficiently in vitro and in mice, resulting in severe disease. Additional studies using a bat-influenza virus that had the HA and NA of A/swine/Texas/4199-2/1998 (H3N2) showed that the PR8 HA and NA contributed to the pathogenicity in mice. Unlike other influenza viruses, engineering truncations hypothesized to reduce interferon antagonism into the NS1 protein didn't attenuate bat-influenza. In contrast, substitution of a putative virulence mutation from the bat-influenza PB2 significantly attenuated the virus in mice and introduction of a putative virulence mutation increased its pathogenicity. Mini-genome replication studies and virus reassortment experiments demonstrated that bat-influenza has very limited genetic and protein compatibility with Type A or Type B influenza viruses, yet it readily reassorts with another divergent bat-influenza virus, suggesting that the bat-influenza lineage may represent a new Genus/Species within the Orthomyxoviridae family. Collectively, our data indicate that the bat-influenza viruses recently identified are authentic viruses that pose little, if any, pandemic threat to humans; however, they provide new insights into the evolution and basic biology of influenza viruses. PMID:25275541
Gupta, Shefali; Garg, Vanika; Bhatia, Sabhyata
2015-01-01
Considering the economic importance of chickpea (C. arietinum L.) seeds, it is important to understand the mechanisms underlying seed development for which a cDNA library was constructed from 6 day old chickpea embryos. A total of 8,186 ESTs were obtained from which 4,048 high quality ESTs were assembled into 1,480 unigenes that majorly encoded genes involved in various metabolic and regulatory pathways. Of these, 95 ESTs were found to be involved in ubiquitination related protein degradation pathways and 12 ESTs coded specifically for putative F-box proteins. Differential transcript accumulation of these putative F-box genes was observed in chickpea tissues as evidenced by quantitative real-time PCR. Further, to explore the role of F-box proteins in chickpea seed development, two F-box genes were selected for molecular characterization. These were named as CarF-box_PP2 and CarF-box_LysM depending on their C-terminal domains, PP2 and LysM, respectively. Their highly conserved structures led us to predict their target substrates. Subcellular localization experiment revealed that CarF-box_PP2 was localized in the cytoplasm and CarF-box_LysM was localized in the nucleus. We demonstrated their physical interactions with SKP1 protein, which validated that they function as F-box proteins in the formation of SCF complexes. Sequence analysis of their promoter regions revealed certain seed specific cis-acting elements that may be regulating their preferential transcript accumulation in the seed. Overall, the study helped in expanding the EST database of chickpea, which was further used to identify two novel F-box genes having a potential role in seed development. PMID:25803812
D’Addabbo, Pietro; Caizzi, Ruggiero
2016-01-01
Bari elements are members of the Tc1-mariner superfamily of DNA transposons, originally discovered in Drosophila melanogaster, and subsequently identified in silico in 11 sequenced Drosophila genomes and as experimentally isolated in four non-sequenced Drosophila species. Bari-like elements have been also studied for their mobility both in vivo and in vitro. We analyzed 23 Drosophila genomes and carried out a detailed characterization of the Bari elements identified, including those from the heterochromatic Bari1 cluster in D. melanogaster. We have annotated 401 copies of Bari elements classified either as putatively autonomous or inactive according to the structure of the terminal sequences and the presence of a complete transposase-coding region. Analyses of the integration sites revealed that Bari transposase prefers AT-rich sequences in which the TA target is cleaved and duplicated. Furthermore evaluation of transposon’s co-occurrence near the integration sites of Bari elements showed a non-random distribution of other transposable elements. We also unveil the existence of a putatively autonomous Bari1 variant characterized by two identical long Terminal Inverted Repeats, in D. rhopaloa. In addition, we detected MITEs related to Bari transposons in 9 species. Phylogenetic analyses based on transposase gene and the terminal sequences confirmed that Bari-like elements are distributed into three subfamilies. A few inconsistencies in Bari phylogenetic tree with respect to the Drosophila species tree could be explained by the occurrence of horizontal transfer events as also suggested by the results of dS analyses. This study further clarifies the Bari transposon’s evolutionary dynamics and increases our understanding on the Tc1-mariner elements’ biology. PMID:27213270
Palazzo, Antonio; Lovero, Domenica; D'Addabbo, Pietro; Caizzi, Ruggiero; Marsano, René Massimiliano
2016-01-01
Bari elements are members of the Tc1-mariner superfamily of DNA transposons, originally discovered in Drosophila melanogaster, and subsequently identified in silico in 11 sequenced Drosophila genomes and as experimentally isolated in four non-sequenced Drosophila species. Bari-like elements have been also studied for their mobility both in vivo and in vitro. We analyzed 23 Drosophila genomes and carried out a detailed characterization of the Bari elements identified, including those from the heterochromatic Bari1 cluster in D. melanogaster. We have annotated 401 copies of Bari elements classified either as putatively autonomous or inactive according to the structure of the terminal sequences and the presence of a complete transposase-coding region. Analyses of the integration sites revealed that Bari transposase prefers AT-rich sequences in which the TA target is cleaved and duplicated. Furthermore evaluation of transposon's co-occurrence near the integration sites of Bari elements showed a non-random distribution of other transposable elements. We also unveil the existence of a putatively autonomous Bari1 variant characterized by two identical long Terminal Inverted Repeats, in D. rhopaloa. In addition, we detected MITEs related to Bari transposons in 9 species. Phylogenetic analyses based on transposase gene and the terminal sequences confirmed that Bari-like elements are distributed into three subfamilies. A few inconsistencies in Bari phylogenetic tree with respect to the Drosophila species tree could be explained by the occurrence of horizontal transfer events as also suggested by the results of dS analyses. This study further clarifies the Bari transposon's evolutionary dynamics and increases our understanding on the Tc1-mariner elements' biology.
Bosch, Jason; Noubiap, Jean Jacques N; Dandara, Collet; Makubalo, Nomlindo; Wright, Galen; Entfellner, Jean-Baka Domelevo; Tiffin, Nicki; Wonkam, Ambroise
2014-11-01
Mutations in the GJB2 gene, encoding connexin 26, could account for 50% of congenital, nonsyndromic, recessive deafness cases in some Caucasian/Asian populations. There is a scarcity of published data in sub-Saharan Africans. We Sanger sequenced the coding region of the GJB2 gene in 205 Cameroonian and Xhosa South Africans with congenital, nonsyndromic deafness; and performed bioinformatic analysis of variations in the GJB2 gene, incorporating data from the 1000 Genomes Project. Amongst Cameroonian patients, 26.1% were familial. The majority of patients (70%) suffered from sensorineural hearing loss. Ten GJB2 genetic variants were detected by sequencing. A previously reported pathogenic mutation, g.3741_3743delTTC (p.F142del), and a putative pathogenic mutation, g.3816G>A (p.V167M), were identified in single heterozygous samples. Amongst eight the remaining variants, two novel variants, g.3318-41G>A and g.3332G>A, were reported. There were no statistically significant differences in allele frequencies between cases and controls. Principal Components Analyses differentiated between Africans, Asians, and Europeans, but only explained 40% of the variation. The present study is the first to compare African GJB2 sequences with the data from the 1000 Genomes Project and have revealed the low variation between population groups. This finding has emphasized the hypothesis that the prevalence of mutations in GJB2 in nonsyndromic deafness amongst European and Asian populations is due to founder effects arising after these individuals migrated out of Africa, and not to a putative "protective" variant in the genomic structure of GJB2 in Africans. Our results confirm that mutations in GJB2 are not associated with nonsyndromic deafness in Africans.
Choe, Keith P; Kato, Akira; Hirose, Shigehisa; Plata, Consuelo; Sindic, Aleksandra; Romero, Michael F; Claiborne, J B; Evans, David H
2005-11-01
In mammals, the Na+/H+ exchanger 3 (NHE3) is expressed with Na+/K+-ATPase in renal proximal tubules, where it secretes H+ and absorbs Na+ to maintain blood pH and volume. In elasmobranchs (sharks, skates, and stingrays), the gills are the dominant site of pH and osmoregulation. This study was conducted to determine whether epithelial NHE homologs exist in elasmobranchs and, if so, to localize their expression in gills and determine whether their expression is altered by environmental salinity or hypercapnia. Degenerate primers and RT-PCR were used to deduce partial sequences of mammalian NHE2 and NHE3 homologs from the gills of the euryhaline Atlantic stingray (Dasyatis sabina). Real-time PCR was then used to demonstrate that mRNA expression of the NHE3 homolog increased when stingrays were transferred to low salinities but not during hypercapnia. Expression of the NHE2 homolog did not change with either treatment. Rapid amplification of cDNA was then used to deduce the complete sequence of a putative NHE3. The 2,744-base pair cDNA includes a coding region for a 2,511-amino acid protein that is 70% identical to human NHE3 (SLC9A3). Antisera generated against the carboxyl tail of the putative stingray NHE3 labeled the apical membranes of Na+/K+-ATPase-rich epithelial cells, and acclimation to freshwater caused a redistribution of labeling in the gills. This study provides the first NHE3 cloned from an elasmobranch and is the first to demonstrate an increase in gill NHE3 expression during acclimation to low salinities, suggesting that NHE3 can absorb Na+ from ion-poor environments.
Dallery, Jean-Félix; Lapalu, Nicolas; Zampounis, Antonios; Pigné, Sandrine; Luyten, Isabelle; Amselem, Joëlle; Wittenberg, Alexander H J; Zhou, Shiguo; de Queiroz, Marisa V; Robin, Guillaume P; Auger, Annie; Hainaut, Matthieu; Henrissat, Bernard; Kim, Ki-Tae; Lee, Yong-Hwan; Lespinet, Olivier; Schwartz, David C; Thon, Michael R; O'Connell, Richard J
2017-08-29
The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.
Identification of coding and non-coding mutational hotspots in cancer genomes.
Piraino, Scott W; Furney, Simon J
2017-01-05
The identification of mutations that play a causal role in tumour development, so called "driver" mutations, is of critical importance for understanding how cancers form and how they might be treated. Several large cancer sequencing projects have identified genes that are recurrently mutated in cancer patients, suggesting a role in tumourigenesis. While the landscape of coding drivers has been extensively studied and many of the most prominent driver genes are well characterised, comparatively less is known about the role of mutations in the non-coding regions of the genome in cancer development. The continuing fall in genome sequencing costs has resulted in a concomitant increase in the number of cancer whole genome sequences being produced, facilitating systematic interrogation of both the coding and non-coding regions of cancer genomes. To examine the mutational landscapes of tumour genomes we have developed a novel method to identify mutational hotspots in tumour genomes using both mutational data and information on evolutionary conservation. We have applied our methodology to over 1300 whole cancer genomes and show that it identifies prominent coding and non-coding regions that are known or highly suspected to play a role in cancer. Importantly, we applied our method to the entire genome, rather than relying on predefined annotations (e.g. promoter regions) and we highlight recurrently mutated regions that may have resulted from increased exposure to mutational processes rather than selection, some of which have been identified previously as targets of selection. Finally, we implicate several pan-cancer and cancer-specific candidate non-coding regions, which could be involved in tumourigenesis. We have developed a framework to identify mutational hotspots in cancer genomes, which is applicable to the entire genome. This framework identifies known and novel coding and non-coding mutional hotspots and can be used to differentiate candidate driver regions from likely passenger regions susceptible to somatic mutation.
van der Ploeg, Jan R.
2005-01-01
In Streptococcus mutans, competence for genetic transformation and biofilm formation are dependent on the two-component signal transduction system ComDE together with the inducer peptide pheromone competence-stimulating peptide (CSP) (encoded by comC). Here, it is shown that the same system is also required for expression of the nlmAB genes, which encode a two-peptide nonlantibiotic bacteriocin. Expression from a transcriptional nlmAB′-lacZ fusion was highest at high cell density and was increased up to 60-fold following addition of CSP, but it was abolished when the comDE genes were interrupted. Two more genes, encoding another putative bacteriocin and a putative bacteriocin immunity protein, were also regulated by this system. The regions upstream of these genes and of two further putative bacteriocin-encoding genes and a gene encoding a putative bacteriocin immunity protein contained a conserved 9-bp repeat element just upstream of the transcription start, which suggests that expression of these genes is also dependent on the ComCDE regulatory system. Mutations in the repeat element of the nlmAB promoter region led to a decrease in CSP-dependent expression of nlmAB′-lacZ. In agreement with these results, a comDE mutant and mutants unable to synthesize or export CSP did not produce bacteriocins. It is speculated that, at high cell density, bacteriocin production is induced to liberate DNA from competing streptococci. PMID:15937160
SITEHOUND-web: a server for ligand binding site identification in protein structures.
Hernandez, Marylens; Ghersi, Dario; Sanchez, Roberto
2009-07-01
SITEHOUND-web (http://sitehound.sanchezlab.org) is a binding-site identification server powered by the SITEHOUND program. Given a protein structure in PDB format SITEHOUND-web will identify regions of the protein characterized by favorable interactions with a probe molecule. These regions correspond to putative ligand binding sites. Depending on the probe used in the calculation, sites with preference for different ligands will be identified. Currently, a carbon probe for identification of binding sites for drug-like molecules, and a phosphate probe for phosphorylated ligands (ATP, phoshopeptides, etc.) have been implemented. SITEHOUND-web will display the results in HTML pages including an interactive 3D representation of the protein structure and the putative sites using the Jmol java applet. Various downloadable data files are also provided for offline data analysis.
Statistical properties of DNA sequences
NASA Technical Reports Server (NTRS)
Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Simons, M.; Stanley, H. E.
1995-01-01
We review evidence supporting the idea that the DNA sequence in genes containing non-coding regions is correlated, and that the correlation is remarkably long range--indeed, nucleotides thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationarity" feature of the sequence of base pairs by applying a new algorithm called detrended fluctuation analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and non-coding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to every DNA sequence (33301 coding and 29453 non-coding) in the entire GenBank database. Finally, we describe briefly some recent work showing that the non-coding sequences have certain statistical features in common with natural and artificial languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts. These statistical properties of non-coding sequences support the possibility that non-coding regions of DNA may carry biological information.
Frébortová, Jitka; Greplová, Marta; Seidl, Michael F; Heyl, Alexander; Frébort, Ivo
2015-01-01
Cytokinins, a class of phytohormones, are adenine derivatives common to many different organisms. In plants, these play a crucial role as regulators of plant development and the reaction to abiotic and biotic stress. Key enzymes in the cytokinin synthesis and degradation in modern land plants are the isopentyl transferases and the cytokinin dehydrogenases, respectively. Their encoding genes have been probably introduced into the plant lineage during the primary endosymbiosis. To shed light on the evolution of these proteins, the genes homologous to plant adenylate isopentenyl transferase and cytokinin dehydrogenase were amplified from the genomic DNA of cyanobacterium Nostoc sp. PCC 7120 and expressed in Escherichia coli. The putative isopentenyl transferase was shown to be functional in a biochemical assay. In contrast, no enzymatic activity was detected for the putative cytokinin dehydrogenase, even though the principal domains necessary for its function are present. Several mutant variants, in which conserved amino acids in land plant cytokinin dehydrogenases had been restored, were inactive. A combination of experimental data with phylogenetic analysis indicates that adenylate-type isopentenyl transferases might have evolved several times independently. While the Nostoc genome contains a gene coding for protein with characteristics of cytokinin dehydrogenase, the organism is not able to break down cytokinins in the way shown for land plants.
Frébortová, Jitka; Greplová, Marta; Seidl, Michael F.; Heyl, Alexander; Frébort, Ivo
2015-01-01
Cytokinins, a class of phytohormones, are adenine derivatives common to many different organisms. In plants, these play a crucial role as regulators of plant development and the reaction to abiotic and biotic stress. Key enzymes in the cytokinin synthesis and degradation in modern land plants are the isopentyl transferases and the cytokinin dehydrogenases, respectively. Their encoding genes have been probably introduced into the plant lineage during the primary endosymbiosis. To shed light on the evolution of these proteins, the genes homologous to plant adenylate isopentenyl transferase and cytokinin dehydrogenase were amplified from the genomic DNA of cyanobacterium Nostoc sp. PCC 7120 and expressed in Escherichia coli. The putative isopentenyl transferase was shown to be functional in a biochemical assay. In contrast, no enzymatic activity was detected for the putative cytokinin dehydrogenase, even though the principal domains necessary for its function are present. Several mutant variants, in which conserved amino acids in land plant cytokinin dehydrogenases had been restored, were inactive. A combination of experimental data with phylogenetic analysis indicates that adenylate-type isopentenyl transferases might have evolved several times independently. While the Nostoc genome contains a gene coding for protein with characteristics of cytokinin dehydrogenase, the organism is not able to break down cytokinins in the way shown for land plants. PMID:26376297
Delcourt, Vivian; Lucier, Jean-François; Gagnon, Jules; Beaudoin, Maxime C; Vanderperre, Benoît; Breton, Marc-André; Motard, Julie; Jacques, Jean-François; Brunelle, Mylène; Gagnon-Arsenault, Isabelle; Fournier, Isabelle; Ouangraoua, Aida; Hunting, Darel J; Cohen, Alan A; Landry, Christian R; Scott, Michelle S
2017-01-01
Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. This is illustrated with specific examples, including altMiD51, a 70 amino acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins. PMID:29083303
Filip'echeva, Yulia A; Shelud'ko, Andrei V; Prilipov, Alexei G; Burygin, Gennady L; Telesheva, Elizaveta M; Yevstigneyeva, Stella S; Chernyshova, Marina P; Petrova, Lilia P; Katsy, Elena I
2018-02-01
Azospirillum brasilense can swim and swarm owing to the activity of a constitutive polar flagellum (Fla) and inducible lateral flagella (Laf), respectively. Experimental data on the regulation of the Fla and Laf assembly in azospirilla are scarce. Here, the coding sequence (CDS) AZOBR_p1160043 (fabG1) for a putative 3-oxoacyl-[acyl-carrier protein (ACP)] reductase was found essential for the construction of both types of flagella. In an immotile leaky Fla - Laf - fabG1::Omegon-Km mutant, Sp245.1610, defects in flagellation and motility were fully complemented by expressing the CDS AZOBR_p1160043 from plasmid pRK415. When pRK415 with the cloned CDS AZOBR_p1160045 (fliC) for a putative 65.2 kDa Sp245 Fla flagellin was transferred into the Sp245.1610 cells, the bacteria also became able to assemble a motile single flagellum. Some cells, however, had unusual swimming behavior, probably because of the side location of the organelle. Although the assembly of Laf was not restored in Sp245.1610 (pRK415-p1160045), this strain was somewhat capable of swarming motility. We propose that the putative 3-oxoacyl-[ACP] reductase encoded by the CDS AZOBR_p1160043 plays a role in correct flagellar location in the cell envelope and (or) in flagellar modification(s), which are also required for the inducible construction of Laf and for proper swimming and swarming motility of A. brasilense Sp245.
Alam, Syed Imteyaz; Dwivedi, Pratistha
2016-10-01
The whole genome sequencing and annotation of Clostridium perfringens strains revealed several genes coding for proteins of unknown function with no significant similarities to genes in other organisms. Our previous studies clearly demonstrated that hypothetical proteins CPF_2500, CPF_1441, CPF_0876, CPF_0093, CPF_2002, CPF_2314, CPF_1179, CPF_1132, CPF_2853, CPF_0552, CPF_2032, CPF_0438, CPF_1440, CPF_2918, CPF_0656, and CPF_2364 are genuine proteins of C. perfringens expressed in high abundance. This study explored the putative role of these hypothetical proteins using bioinformatic tools and evaluated their potential as putative candidates for prophylaxis. Apart from a group of eight hypothetical proteins (HPs), a putative function was predicted for the rest of the hypothetical proteins using one or more of the algorithms used. The phylogenetic analysis did not suggest an evidence of a horizontal gene transfer event except for HP CPF_0876. HP CPF_2918 is an abundant extracellular protein, unique to C. perfringens species with maximum strain coverage and did not show any significant match in the database. CPF_2918 was cloned, recombinant protein was purified to near homogeneity, and probing with mouse anti-CPF_2918 serum revealed surface localization of the protein in C. perfringens ATCC13124 cultures. The purified recombinant CPF_2918 protein induced antibody production, a mixed Th1 and Th2 kind of response, and provided partial protection to immunized mice in direct C. perfringens challenge. Copyright © 2016 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fyfe, Cameron D.; Grinter, Rhys; Josts, Inokentijs
The X-ray structure of protease-cleaved E. coli α-2-macroglobulin is described, which reveals a putative mechanism of activation and conformational change essential for protease inhibition. Bacterial α-2-macroglobulins have been suggested to function in defence as broad-spectrum inhibitors of host proteases that breach the outer membrane. Here, the X-ray structure of protease-cleaved Escherichia coli α-2-macroglobulin is described, which reveals a putative mechanism of activation and conformational change essential for protease inhibition. In this competitive mechanism, protease cleavage of the bait-region domain results in the untethering of an intrinsically disordered region of this domain which disrupts native interdomain interactions that maintain E. colimore » α-2-macroglobulin in the inactivated form. The resulting global conformational change results in entrapment of the protease and activation of the thioester bond that covalently links to the attacking protease. Owing to the similarity in structure and domain architecture of Escherichia coli α-2-macroglobulin and human α-2-macroglobulin, this protease-activation mechanism is likely to operate across the diverse members of this group.« less
Goris, Tobias; Schiffmann, Christian L.; Gadkari, Jennifer; Schubert, Torsten; Seifert, Jana; Jehmlich, Nico; von Bergen, Martin; Diekert, Gabriele
2015-01-01
Organohalide respiration is an environmentally important but poorly characterized type of anaerobic respiration. We compared the global proteome of the versatile organohalide-respiring Epsilonproteobacterium Sulfurospirillum multivorans grown with different electron acceptors (fumarate, nitrate, or tetrachloroethene [PCE]). The most significant differences in protein abundance were found for gene products of the organohalide respiration region. This genomic region encodes the corrinoid and FeS cluster containing PCE reductive dehalogenase PceA and other proteins putatively involved in PCE metabolism such as those involved in corrinoid biosynthesis. The latter gene products as well as PceA and a putative quinol dehydrogenase were almost exclusively detected in cells grown with PCE. This finding suggests an electron flow from the electron donor such as formate or pyruvate via the quinone pool and a quinol dehydrogenase to PceA and the terminal electron acceptor PCE. Two putative accessory proteins, an IscU-like protein and a peroxidase-like protein, were detected with PCE only and might be involved in PceA maturation. The proteome of cells grown with pyruvate instead of formate as electron donor indicates a route of electrons from reduced ferredoxin via an Epsilonproteobacterial complex I and the quinone pool to PCE. PMID:26387727
Integrated Post-GWAS Analysis Sheds New Light on the Disease Mechanisms of Schizophrenia
Lin, Jhih-Rong; Cai, Ying; Zhang, Quanwei; Zhang, Wen; Nogales-Cadenas, Rubén; Zhang, Zhengdong D.
2016-01-01
Schizophrenia is a severe mental disorder with a large genetic component. Recent genome-wide association studies (GWAS) have identified many schizophrenia-associated common variants. For most of the reported associations, however, the underlying biological mechanisms are not clear. The critical first step for their elucidation is to identify the most likely disease genes as the source of the association signals. Here, we describe a general computational framework of post-GWAS analysis for complex disease gene prioritization. We identify 132 putative schizophrenia risk genes in 76 risk regions spanning 120 schizophrenia-associated common variants, 78 of which have not been recognized as schizophrenia disease genes by previous GWAS. Even more significantly, 29 of them are outside the risk regions, likely under regulation of transcriptional regulatory elements contained therein. These putative schizophrenia risk genes are transcriptionally active in both brain and the immune system, and highly enriched among cellular pathways, consistent with leading pathophysiological hypotheses about the pathogenesis of schizophrenia. With their involvement in distinct biological processes, these putative schizophrenia risk genes, with different association strengths, show distinctive temporal expression patterns, and play specific biological roles during brain development. PMID:27754856
Lipinska, B; Rao, A S; Bolten, B M; Balakrishnan, R; Goldberg, E B
1989-01-01
We sequenced bacteriophage T4 genes 2 and 3 and the putative C-terminal portion of gene 50. They were found to have appropriate open reading frames directed counterclockwise on the T4 map. Mutations in genes 2 and 64 were shown to be in the same open reading frame, which we now call gene 2. This gene codes for a protein of 27,068 daltons. The open reading frame corresponding to gene 3 codes for a protein of 20,634 daltons. Appropriate bands on polyacrylamide gels were identified at 30 and 20 kilodaltons, respectively. We found that the product of the cloned gene 2 can protect T4 DNA double-stranded ends from exonuclease V action. Images PMID:2644202
Molecular cloning of chitinase 33 (chit33) gene from Trichoderma atroviride
Matroudi, S.; Zamani, M.R.; Motallebi, M.
2008-01-01
In this study Trichoderma atroviride was selected as over producer of chitinase enzyme among 30 different isolates of Trichoderma sp. on the basis of chitinase specific activity. From this isolate the genomic and cDNA clones encoding chit33 have been isolated and sequenced. Comparison of genomic and cDNA sequences for defining gene structure indicates that this gene contains three short introns and also an open reading frame coding for a protein of 321 amino acids. The deduced amino acid sequence includes a 19 aa putative signal peptide. Homology between this sequence and other reported Trichoderma Chit33 proteins are discussed. The coding sequence of chit33 gene was cloned in pEt26b(+) expression vector and expressed in E. coli. PMID:24031242
Flavivirus RNAi suppression: decoding non-coding RNA.
Pijlman, Gorben P
2014-08-01
Flaviviruses are important human pathogens that are transmitted by invertebrate vectors, mostly mosquitoes and ticks. During replication in their vector, flaviviruses are subject to a potent innate immune response known as antiviral RNA interference (RNAi). This defense mechanism is associated with the production of small interfering (si)RNA that lead to degradation of viral RNA. To what extent flaviviruses would benefit from counteracting antiviral RNAi is subject of debate. Here, the experimental evidence to suggest the existence of flavivirus RNAi suppressors is discussed. I will highlight the putative role of non-coding, subgenomic flavivirus RNA in suppression of RNAi in insect and mammalian cells. Novel insights from ongoing research will reveal how arthropod-borne viruses modulate innate immunity including antiviral RNAi. Copyright © 2014 Elsevier B.V. All rights reserved.
PRRSV strain VR-2332 Nsp2 deletion mutants attenuate clinical symptoms in swine
USDA-ARS?s Scientific Manuscript database
PRRSV nonstructural protein 2 (nsp2) contains a N-terminal cysteine proteinase (PL2) domain, a middle hypervariable region and C-terminal putative transmembrane domain. Prior studies had shown that as much as 403 amino acids could be removed from the hypervariable region without losing virus viabil...
José, Marco V.; Govezensky, Tzipe; García, José A.; Bobadilla, Juan R.
2009-01-01
Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10−10 per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC. PMID:19183813
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kalchman, M.; Lin, B.; Nasir, J.
1994-09-01
The mouse homologue of the Huntington disease gene (Hdh) has recently been cloned and mapped to a region of synteny with the human, on mouse chromosome 5. The two genes share a high degree of both coding (90% amino acid) and nucleotide (86.2%) identity. We have subsequently performed a detailed comparison of the genomic organization of the 5{prime} region of the two genes encompassing the promoter region and first five exons of both the human and mouse genes. The comparative sequence analysis of the promoter region between HD and Hdh reveals two highly conserved regions. One region (-56 to -118)more » (+1 is the ATG start codon), shared 84% nucleotide identity and another region (-130 to -206) had 81% nucleotide identity. Nine putative Sp1 sites appear in the human promoter region contrasted with only 3 in a similar region in the mouse. Furthermore, 17 and 20 base pair direct repeats present in the HD 5{prime} region are absent in the similar Hdh region. Although both the mouse and human intron/exon boundaries conform to the GT/AG rule, the intron sizes between HD and Hdh are markedly different. The first four introns in Hdh are 15, 7, 5 and 0.5 kb compared to sizes of 10, 15, 7 and 0.5 kb, respectively. Comparison between the mouse and human intronic sequences immediately adjacent to the first five exons (excluding exon 1) reveals only about 46 to 50% identity within the first 60 bp of intronic sequence. Furthermore, we have identified novel polymorphic di-, tri- and tetra-nucleotide repeats in Hdh introns of various mouse strains that are not present in the human. For example, polymorphic CT repeats are present in introns 2 and 4 of Hdh and a novel mouse 56 AAG trinucleotide repeat (interrupted by an AAGG) is also located within intron 2. This information concerning the promoter and genomic organization of both HD and Hdh is critical for designing appropriate gene targetting vectors for studying the normal function of the HD and Hdh genes in model systems.« less
Visual-Vestibular Responses During Space Flight
NASA Technical Reports Server (NTRS)
Reschke, M. F.; Kozlovskaya, I. B.; Paloski, W. H.
1999-01-01
Given the documented disruptions that occur in spatial orientation during space flight and the putative sensory-motor information underlying eye and head spatial coding, the primary purpose of this paper is to examine components of the target acquisition system in subjects free to make head and eye movements in three dimensional space both during and following adaptation to long duration space flight. It is also our intention to suggest a simple model of adaptation that has components in common with cerebellar disorders whose neurobiological substrate has been identified.
Genetic heterogeneity of the hepatitis C virus.
Bukh, J; Miller, R H; Purcell, R H
1995-01-01
Hepatitis C virus (HCV) is an important etiological agent in the development of chronic liver diseases such as chronic hepatitis, cirrhosis, and hepatocellular carcinoma (HCC). The virus, identified only recently, contains a single-stranded RNA genome of positive polarity, is distantly related to pestiviruses and flaviviruses, and has been classified as the first member of a third genus within the family Flaviviridae. Extensive analysis of HCV genomic sequences demonstrated that this virus possesses significant genetic heterogeneity. Different regions of the viral genome demonstrate a varying degree of heterogeneity; the regions coding for the putative envelope proteins are the most variable sites between different isolates. Furthermore, HCV circulates as a quasispecies in the host. During the course of acute and chronic infection, the sequence composition of the HCV population in one patient has been found to change sequentially with an extremely high rate of nonconserved nucleotide changes in the hypervariable region I (HVR1) of HCV. Such sequence changes alter the antigenicity of the epitopes coded within HVR1 so that these are not always recognized by preexisting antibodies. It has been suggested that this could represent one mechanism by which HCV evades host immune surveillance and may account for the high rate of chronicity observed in such infections. Continuous viral replication may, in turn, lead to the development of chronic liver disease, including HCC, in infected individuals. To date, at least nine major genetic groups (genotypes 1-9) and more than 30 subgroups of HCV have been recognized based on genetic differences. A distinct difference has been observed in the genotype distribution in Africa compared with other continents. Recent data have suggested a difference in pathogenesis and in the outcome of interferon therapy in individuals infected with HCV of certain genotypes. For example, genotype 1b (II) seems to be associated with more severe liver disease, including HCC, and with a poorer response to interferon therapy. The extensive genetic heterogeneity of HCV may have serious implications for the diagnosis, treatment and prevention of hepatitis C as well as in understanding the biology of infection by this important human pathogen.
Behl, Jyotsna Dhingra; Mishra, Priyanka; Verma, N K; Niranjan, S K; Dangi, P S; Sharma, Rekha; Behl, Rahul
2016-03-15
The present study was undertaken to characterize the genetic variation present in lymphoxin A gene (LTA gene) encoding for the lymphotoxin A protein also known as tumor necrosis factor beta, a cytokine produced by lymphocytes, known to be cytotoxic for a wide range of tumor cells both in vitro and in vivo, and, which is essential for normal immunological development; in 40 animals of 5 diverse Bos indicus Indian zebu cattle breeds. These breeds survive under the harsh and tough tropical climatic conditions of various parts of the Indian subcontinent. The LTA gene in the present study was observed to contain 33 SNPs and 3 small insertion/deletion polymorphisms. Four SNPs occurred in the coding regions of the gene viz. g.1327A>G and g.1400C>T in exon 2 and g.1840C>T and g.1942C>T in exon 3, of which the SNP g.1327A>G in exon 2 resulted in a non-synonymous amino acid change G38D. This amino acid change was however predicted not be affecting the protein function in any manner. The gene contained putative transcription factor binding sites for the c-Re1 and for Pax-4 transcription factors. A putative promoter region was also predicted on the reverse DNA strand from position 894 to 644. Several repeat elements and microsatellite repeats were detected to be occurring across the 3.2kb LTA gene sequence. The study showed the occurrence of 40 genotypes and 48 most probable haplotypes. The genotypes at the observed SNP positions in the LTA gene were in near Hardy-Weinberg equilibrium. A negative Tajima's D value that was not significant statistically at P>0.10 indicated that the neutral mutation hypothesis could not be excluded. The genetic variations observed in the LTA gene in the present study have not been reported earlier and these could possibly be used as molecular markers for further studies involving association of the gene variability with disease resistance/tolerance traits. Copyright © 2015 Elsevier B.V. All rights reserved.
Nicosia, Aldo; Maggio, Teresa; Mazzola, Salvatore; Cuttitta, Angela
2013-10-30
Anemonia viridis is a widespread and extensively studied Mediterranean species of sea anemone from which a large number of polypeptide toxins, such as blood depressing substances (BDS) peptides, have been isolated. The first members of this class, BDS-1 and BDS-2, are polypeptides belonging to the β-defensin fold family and were initially described for their antihypertensive and antiviral activities. BDS-1 and BDS-2 are 43 amino acid peptides characterised by three disulfide bonds that act as neurotoxins affecting Kv3.1, Kv3.2 and Kv3.4 channel gating kinetics. In addition, BDS-1 inactivates the Nav1.7 and Nav1.3 channels. The development of a large dataset of A. viridis expressed sequence tags (ESTs) and the identification of 13 putative BDS-like cDNA sequences has attracted interest, especially as scientific and diagnostic tools. A comparison of BDS cDNA sequences showed that the untranslated regions are more conserved than the protein-coding regions. Moreover, the KA/KS ratios calculated for all pairwise comparisons showed values greater than 1, suggesting mechanisms of accelerated evolution. The structures of the BDS homologs were predicted by molecular modelling. All toxins possess similar 3D structures that consist of a triple-stranded antiparallel β-sheet and an additional small antiparallel β-sheet located downstream of the cleavage/maturation site; however, the orientation of the triple-stranded β-sheet appears to differ among the toxins. To characterise the spatial expression profile of the putative BDS cDNA sequences, tissue-specific cDNA libraries, enriched for BDS transcripts, were constructed. In addition, the proper amplification of ectodermal or endodermal markers ensured the tissue specificity of each library. Sequencing randomly selected clones from each library revealed ectodermal-specific expression of ten BDS transcripts, while transcripts of BDS-8, BDS-13, BDS-14 and BDS-15 failed to be retrieved, likely due to under-representation in our cDNA libraries. The calculation of the relative abundance of BDS transcripts in the cDNA libraries revealed that BDS-1, BDS-3, BDS-4, BDS-5 and BDS-6 are the most represented transcripts.
Tumour suppressor protein p53 regulates the stress activated bilirubin oxidase cytochrome P450 2A6
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, Hao, E-mail: hao.hu1@uqconnect.edu.au; Yu, Ting, E-mail: t.yu2@uq.edu.au; Arpiainen, Satu, E-mail: Satu.Juhila@orion.fi
2015-11-15
Human cytochrome P450 (CYP) 2A6 enzyme has been proposed to play a role in cellular defence against chemical-induced oxidative stress. The encoding gene is regulated by various stress activated transcription factors. This paper demonstrates that p53 is a novel transcriptional regulator of the gene. Sequence analysis of the CYP2A6 promoter revealed six putative p53 binding sites in a 3 kb proximate promoter region. The site closest to transcription start site (TSS) is highly homologous with the p53 consensus sequence. Transfection with various stepwise deletions of CYP2A6-5′-Luc constructs – down to − 160 bp from the TSS – showed p53 responsivenessmore » in p53 overexpressed C3A cells. However, a further deletion from − 160 to − 74 bp, including the putative p53 binding site, totally abolished the p53 responsiveness. Electrophoretic mobility shift assay with a probe containing the putative binding site showed specific binding of p53. A point mutation at the binding site abolished both the binding and responsiveness of the recombinant gene to p53. Up-regulation of the endogenous p53 with benzo[α]pyrene – a well-known p53 activator – increased the expression of the p53 responsive positive control and the CYP2A6-5′-Luc construct containing the intact p53 binding site but not the mutated CYP2A6-5′-Luc construct. Finally, inducibility of the native CYP2A6 gene by benzo[α]pyrene was demonstrated by dose-dependent increases in CYP2A6 mRNA and protein levels along with increased p53 levels in the nucleus. Collectively, the results indicate that p53 protein is a regulator of the CYP2A6 gene in C3A cells and further support the putative cytoprotective role of CYP2A6. - Highlights: • CYP2A6 is an immediate target gene of p53. • Six putative p53REs located on 3 kb proximate CYP2A6 promoter region. • The region − 160 bp from TSS is highly homologous with the p53 consensus sequence. • P53 specifically bind to the p53RE on the − 160 bp region. • HNF4α may interact with p53 in regulating CYP2A6 expression.« less
Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium
2010-01-01
Background In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. Results By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes. PMID:20441586
Li, Xiaobin; Xie, Yingzhou; Liu, Meng; Tai, Cui; Sun, Jingyong; Deng, Zixin; Ou, Hong-Yu
2018-05-04
oriTfinder is a web server that facilitates the rapid identification of the origin of transfer site (oriT) of a conjugative plasmid or chromosome-borne integrative and conjugative element. The utilized back-end database oriTDB was built upon more than one thousand known oriT regions of bacterial mobile genetic elements (MGEs) as well as the known MGE-encoding relaxases and type IV coupling proteins (T4CP). With a combination of similarity searches for the oriTDB-archived oriT nucleotide sequences and the co-localization of the flanking relaxase homologous genes, the oriTfinder can predict the oriT region with high accuracy in the DNA sequence of a bacterial plasmid or chromosome in minutes. The server also detects the other transfer-related modules, including the potential relaxase gene, T4CP gene and the type IV secretion system gene cluster, and the putative genes coding for virulence factors and acquired antibiotic resistance determinants. oriTfinder may contribute to meeting the increasing demands of re-annotations for bacterial conjugative, mobilizable or non-transferable elements and aid in the rapid risk accession of disease-relevant trait dissemination in pathogenic bacteria of interest. oriTfinder is freely available to all users without any login requirement at http://bioinfo-mml.sjtu.edu.cn/oriTfinder.
Functional analysis of the ComK protein of Bacillus coagulans.
Kovács, Ákos T; Eckhardt, Tom H; van Hartskamp, Mariska; van Kranenburg, Richard; Kuipers, Oscar P
2013-01-01
The genes for DNA uptake and recombination in Bacilli are commonly regulated by the transcriptional factor ComK. We have identified a ComK homologue in Bacillus coagulans, an industrial relevant organism that is recalcitrant for transformation. Introduction of B. coagulans comK gene under its own promoter region into Bacillus subtilis comK strain results in low transcriptional induction of the late competence gene comGA, but lacking bistable expression. The promoter regions of B. coagulans comK and the comGA genes are recognized in B. subtilis and expression from these promoters is activated by B. subtilis ComK. Purified ComK protein of B. coagulans showed DNA-binding ability in gel retardation assays with B. subtilis- and B. coagulans-derived probes. These experiments suggest that the function of B. coagulans ComK is similar to that of ComK of B. subtilis. When its own comK is overexpressed in B. coagulans the comGA gene expression increases 40-fold, while the expression of another late competence gene, comC is not elevated and no reproducible DNA-uptake could be observed under these conditions. Our results demonstrate that B. coagulans ComK can recognize several B.subtilis comK-responsive elements, and vice versa, but indicate that the activation of the transcription of complete sets of genes coding for a putative DNA uptake apparatus in B. coagulans might differ from that of B. subtilis.
Allen, Michael S.; Hurst, Gregory B.; Lu, Tse-Yuan S.; ...
2015-04-08
Rhodopseudomonas palustris encodes 16 extracytoplasmic function (ECF) σ factors. In this paper, to begin to investigate the regulatory network of one of these ECF σ factors, the whole proteome of R. palustris CGA010 was quantitatively analyzed by tandem mass spectrometry from cultures episomally expressing the ECF σ RPA4225 (ecfT) versus a WT control. Among the proteins with the greatest increase in abundance were catalase KatE, trehalose synthase, a DPS-like protein, and several regulatory proteins. Alignment of the cognate promoter regions driving expression of several upregulated proteins suggested a conserved binding motif in the -35 and -10 regions with the consensusmore » sequence GGAAC-18N-TT. Additionally, the putative anti-σ factor RPA4224, whose gene is contained in the same predicted operon as RPA4225, was identified as interacting directly with the predicted response regulator RPA4223 by mass spectrometry of affinity-isolated protein complexes. Furthermore, another gene (RPA4226) coding for a protein that contains a cytoplasmic histidine kinase domain is located immediately upstream of RPA4225. The genomic organization of orthologs for these four genes is conserved in several other strains of R. palustris as well as in closely related α-Proteobacteria. Finally, taken together, these data suggest that ECF σ RPA4225 and the three additional genes make up a sigma factor mimicry system in R. palustris.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Allen, Michael S.; Hurst, Gregory B.; Lu, Tse-Yuan S.
Rhodopseudomonas palustris encodes 16 extracytoplasmic function (ECF) σ factors. In this paper, to begin to investigate the regulatory network of one of these ECF σ factors, the whole proteome of R. palustris CGA010 was quantitatively analyzed by tandem mass spectrometry from cultures episomally expressing the ECF σ RPA4225 (ecfT) versus a WT control. Among the proteins with the greatest increase in abundance were catalase KatE, trehalose synthase, a DPS-like protein, and several regulatory proteins. Alignment of the cognate promoter regions driving expression of several upregulated proteins suggested a conserved binding motif in the -35 and -10 regions with the consensusmore » sequence GGAAC-18N-TT. Additionally, the putative anti-σ factor RPA4224, whose gene is contained in the same predicted operon as RPA4225, was identified as interacting directly with the predicted response regulator RPA4223 by mass spectrometry of affinity-isolated protein complexes. Furthermore, another gene (RPA4226) coding for a protein that contains a cytoplasmic histidine kinase domain is located immediately upstream of RPA4225. The genomic organization of orthologs for these four genes is conserved in several other strains of R. palustris as well as in closely related α-Proteobacteria. Finally, taken together, these data suggest that ECF σ RPA4225 and the three additional genes make up a sigma factor mimicry system in R. palustris.« less
Mycobacterium ahvazicum sp. nov., the nineteenth species of the Mycobacterium simiae complex.
Bouam, Amar; Heidarieh, Parvin; Shahraki, Abodolrazagh Hashemi; Pourahmad, Fazel; Mirsaeidi, Mehdi; Hashemzadeh, Mohamad; Baptiste, Emeline; Armstrong, Nicholas; Levasseur, Anthony; Robert, Catherine; Drancourt, Michel
2018-03-07
Four slowly growing mycobacteria isolates were isolated from the respiratory tract and soft tissue biopsies collected in four unrelated patients in Iran. Conventional phenotypic tests indicated that these four isolates were identical to Mycobacterium lentiflavum while 16S rRNA gene sequencing yielded a unique sequence separated from that of M. lentiflavum. One representative strain AFP-003 T was characterized as comprising a 6,121,237-bp chromosome (66.24% guanosine-cytosine content) encoding for 5,758 protein-coding genes, 50 tRNA and one complete rRNA operon. A total of 2,876 proteins were found to be associated with the mobilome, including 195 phage proteins. A total of 1,235 proteins were found to be associated with virulence and 96 with toxin/antitoxin systems. The genome of AFP-003 T has the genetic potential to produce secondary metabolites, with 39 genes found to be associated with polyketide synthases and non-ribosomal peptide syntases and 11 genes encoding for bacteriocins. Two regions encoding putative prophages and three OriC regions separated by the dnaA gene were predicted. Strain AFP-003 T genome exhibits 86% average nucleotide identity with Mycobacterium genavense genome. Genetic and genomic data indicate that strain AFP-003 T is representative of a novel Mycobacterium species that we named Mycobacterium ahvazicum, the nineteenth species of the expanding Mycobacterium simiae complex.
Willkomm, Dagmar K.; Minnerup, Jens; Hüttenhofer, Alexander; Hartmann, Roland K.
2005-01-01
By an experimental RNomics approach, we have generated a cDNA library from small RNAs expressed from the genome of the hyperthermophilic bacterium Aquifex aeolicus. The library included RNAs that were antisense to mRNAs and tRNAs as well as RNAs encoded in intergenic regions. Substantial steady-state levels in A.aeolicus cells were confirmed for several of the cloned RNAs by northern blot analysis. The most abundant intergenic RNA of the library was identified as the 6S RNA homolog of A.aeolicus. Although shorter in size (150 nt) than its γ-proteobacterial homologs (∼185 nt), it is predicted to have the most stable structure among known 6S RNAs. As in the γ-proteobacteria, the A.aeolicus 6S RNA gene (ssrS) is located immediately upstream of the ygfA gene encoding a widely conserved 5-formyltetrahydrofolate cyclo-ligase. We identifed novel 6S RNA candidates within the γ-proteobacteria but were unable to identify reasonable 6S RNA candidates in other bacterial branches, utilizing mfold analyses of the region immediately upstream of ygfA combined with 6S RNA blastn searches. By RACE experiments, we mapped the major transcription initiation site of A.aeolicus 6S RNA primary transcripts, located within the pheT gene preceding ygfA, as well as three processing sites. PMID:15814812
Diversity and structure of PIF/Harbinger-like elements in the genome of Medicago truncatula
Grzebelus, Dariusz; Lasota, Slawomir; Gambin, Tomasz; Kucherov, Gregory; Gambin, Anna
2007-01-01
Background Transposable elements constitute a significant fraction of plant genomes. The PIF/Harbinger superfamily includes DNA transposons (class II elements) carrying terminal inverted repeats and producing a 3 bp target site duplication upon insertion. The presence of an ORF coding for the DDE/DDD transposase, required for transposition, is characteristic for the autonomous PIF/Harbinger-like elements. Based on the above features, PIF/Harbinger-like elements were identified in several plant genomes and divided into several evolutionary lineages. Availability of a significant portion of Medicago truncatula genomic sequence allowed for mining PIF/Harbinger-like elements, starting from a single previously described element MtMaster. Results Twenty two putative autonomous, i.e. carrying an ORF coding for TPase and complete terminal inverted repeats, and 67 non-autonomous PIF/Harbinger-like elements were found in the genome of M. truncatula. They were divided into five families, MtPH-A5, MtPH-A6, MtPH-D,MtPH-E, and MtPH-M, corresponding to three previously identified and two new lineages. The largest families, MtPH-A6 and MtPH-M were further divided into four and three subfamilies, respectively. Non-autonomous elements were usually direct deletion derivatives of the putative autonomous element, however other types of rearrangements, including inversions and nested insertions were also observed. An interesting structural characteristic – the presence of 60 bp tandem repeats – was observed in a group of elements of subfamily MtPH-A6-4. Some families could be related to miniature inverted repeat elements (MITEs). The presence of empty loci (RESites), paralogous to those flanking the identified transposable elements, both autonomous and non-autonomous, as well as the presence of transposon insertion related size polymorphisms, confirmed that some of the mined elements were capable for transposition. Conclusion The population of PIF/Harbinger-like elements in the genome of M. truncatula is diverse. A detailed intra-family comparison of the elements' structure proved that they proliferated in the genome generally following the model of abortive gap repair. However, the presence of tandem repeats facilitated more pronounced rearrangements of the element internal regions. The insertion polymorphism of the MtPH elements and related MITE families in different populations of M. truncatula, if further confirmed experimentally, could be used as a source of molecular markers complementary to other marker systems. PMID:17996080
Weisberg, Jill; McCullough, Stephen; Emmorey, Karen
2018-01-01
Code-blends (simultaneous words and signs) are a unique characteristic of bimodal bilingual communication. Using fMRI, we investigated code-blend comprehension in hearing native ASL-English bilinguals who made a semantic decision (edible?) about signs, audiovisual words, and semantically equivalent code-blends. English and ASL recruited a similar fronto-temporal network with expected modality differences: stronger activation for English in auditory regions of bilateral superior temporal cortex, and stronger activation for ASL in bilateral occipitotemporal visual regions and left parietal cortex. Code-blend comprehension elicited activity in a combination of these regions, and no cognitive control regions were additionally recruited. Furthermore, code-blends elicited reduced activation relative to ASL presented alone in bilateral prefrontal and visual extrastriate cortices, and relative to English alone in auditory association cortex. Consistent with behavioral facilitation observed during semantic decisions, the findings suggest that redundant semantic content induces more efficient neural processing in language and sensory regions during bimodal language integration. PMID:26177161
A genome-wide survey of maternal and embryonic transcripts during Xenopus tropicalis development.
Paranjpe, Sarita S; Jacobi, Ulrike G; van Heeringen, Simon J; Veenstra, Gert Jan C
2013-11-06
Dynamics of polyadenylation vs. deadenylation determine the fate of several developmentally regulated genes. Decay of a subset of maternal mRNAs and new transcription define the maternal-to-zygotic transition, but the full complement of polyadenylated and deadenylated coding and non-coding transcripts has not yet been assessed in Xenopus embryos. To analyze the dynamics and diversity of coding and non-coding transcripts during development, both polyadenylated mRNA and ribosomal RNA-depleted total RNA were harvested across six developmental stages and subjected to high throughput sequencing. The maternally loaded transcriptome is highly diverse and consists of both polyadenylated and deadenylated transcripts. Many maternal genes show peak expression in the oocyte and include genes which are known to be the key regulators of events like oocyte maturation and fertilization. Of all the transcripts that increase in abundance between early blastula and larval stages, about 30% of the embryonic genes are induced by fourfold or more by the late blastula stage and another 35% by late gastrulation. Using a gene model validation and discovery pipeline, we identified novel transcripts and putative long non-coding RNAs (lncRNA). These lncRNA transcripts were stringently selected as spliced transcripts generated from independent promoters, with limited coding potential and a codon bias characteristic of noncoding sequences. Many lncRNAs are conserved and expressed in a developmental stage-specific fashion. These data reveal dynamics of transcriptome polyadenylation and abundance and provides a high-confidence catalogue of novel and long non-coding RNAs.
Jawaharlal, Jeya Prita Parasurama; Madhumathi, Jayaprakasam; Prince, Rajaiah Prabhu; Kaliraj, Perumal
2014-09-01
Transmission of lymphatic filariasis is mediated through microfilariae (L1 stage of the parasite) which is encased in an eggshell called sheath. The sheath protein Shp-1 stabilizes the structure due to the unique repeat region with Met-Pro-Pro-Gln-Gly sequences. Microfilarial proteins could be used as transmission blocking vaccines. Since the repeat region of Shp-1 was predicted to carry putative B epitopes, this region was used to analyze its reactivity with clinical samples towards construction of peptide vaccine. In silico analysis of Shp-1 showed the presence of B epitopes in the region 49-107. The polypeptide epitopic region Shp-149-107 was cloned and expressed in Escherichia coli. Antibody reactivity of the Shp-149-107 construct was evaluated in filarial endemic population by ELISA. Putatively immune endemic normals (EN) showed significantly high reactivity (P < 0.05) when compared to all the other categories. Antibody reactivity of Shp-1 repeat region was similar to that of whole protein proving that this region carries B epitopes responsible for its humoral response in humans. Thus this can be employed for inducing anti-microfilarial immunity in the infected population that may lead to reduction in transmission intensity and also it could be used along with other epitopes from different stages of the parasite in order to manage the disease effectively.
Lévesque, Céline; Duplessis, Martin; Labonté, Jessica; Labrie, Steve; Fremaux, Christophe; Tremblay, Denise; Moineau, Sylvain
2005-01-01
The Streptococcus thermophilus virulent pac-type phage 2972 was isolated from a yogurt made in France in 1999. It is a representative of several phages that have emerged with the industrial use of the exopolysaccharide-producing S. thermophilus strain RD534. The genome of phage 2972 has 34,704 bp with an overall G+C content of 40.15%, making it the shortest S. thermophilus phage genome analyzed so far. Forty-four open reading frames (ORFs) encoding putative proteins of 40 or more amino acids were identified, and bioinformatic analyses led to the assignment of putative functions to 23 ORFs. Comparative genomic analysis of phage 2972 with the six other sequenced S. thermophilus phage genomes confirmed that the replication module is conserved and that cos- and pac-type phages have distinct structural and packaging genes. Two group I introns were identified in the genome of 2972. They interrupted the genes coding for the putative endolysin and the terminase large subunit. Phage mRNA splicing was demonstrated for both introns, and the secondary structures were predicted. Eight structural proteins were also identified by N-terminal sequencing and/or matrix-assisted laser desorption ionization—time-of-flight mass spectrometry. Detailed analysis of the putative minor tail proteins ORF19 and ORF21 as well as the putative receptor-binding protein ORF20 showed the following interesting features: (i) ORF19 is a hybrid protein, because it displays significant identity with both pac- and cos-type phages; (ii) ORF20 is unique; and (iii) a protein similar to ORF21 of 2972 was also found in the structure of the cos-type phage DT1, indicating that this structural protein is present in both S. thermophilus phage groups. The implications of these findings for phage classification are discussed. PMID:16000821
Junghare, Madan; Spiteller, Dieter; Schink, Bernhard
2016-09-01
The pathway of anaerobic degradation of o-phthalate was studied in the nitrate-reducing bacterium Azoarcus sp. strain PA01. Differential two-dimensional protein gel profiling allowed the identification of specifically induced proteins in o-phthalate-grown compared to benzoate-grown cells. The genes encoding o-phthalate-induced proteins were found in a 9.9 kb gene cluster in the genome of Azoarcus sp. strain PA01. The o-phthalate-induced gene cluster codes for proteins homologous to a dicarboxylic acid transporter, putative CoA-transferases and a UbiD-like decarboxylase that were assigned to be specifically involved in the initial steps of anaerobic o-phthalate degradation. We propose that o-phthalate is first activated to o-phthalyl-CoA by a putative succinyl-CoA-dependent succinyl-CoA:o-phthalate CoA-transferase, and o-phthalyl-CoA is subsequently decarboxylated to benzoyl-CoA by a putative o-phthalyl-CoA decarboxylase. Results from in vitro enzyme assays with cell-free extracts of o-phthalate-grown cells demonstrated the formation of o-phthalyl-CoA from o-phthalate and succinyl-CoA as CoA donor, and its subsequent decarboxylation to benzoyl-CoA. The putative succinyl-CoA:o-phthalate CoA-transferase showed high substrate specificity for o-phthalate and did not accept isophthalate, terephthalate or 3-fluoro-o-phthalate whereas the putative o-phthalyl-CoA decarboxylase converted fluoro-o-phthalyl-CoA to fluoro-benzoyl-CoA. No decarboxylase activity was observed with isophthalyl-CoA or terephthalyl-CoA. Both enzyme activities were oxygen-insensitive and inducible only after growth with o-phthalate. Further degradation of benzoyl-CoA proceeds analogous to the well-established anaerobic benzoyl-CoA degradation pathway of nitrate-reducing bacteria. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
Celestino, Michele; Calistri, Arianna; Del Vecchio, Claudia; Salata, Cristiano; Chiuppesi, Flavia; Pistello, Mauro; Borsetti, Alessandra; Palù, Giorgio; Parolin, Cristina
2012-06-01
Tetherin (BST2) is the host cell factor that blocks the particle release of some enveloped viruses. Two putative feline tetherin proteins differing at the level of the N-terminal coding region have recently been described and tested for their antiviral activity. By cloning and comparing the two reported feline tetherins (called here cBST2(504) and cBST2*) and generating specific derivative mutants, this study provides evidence that feline tetherin has a shorter intracytoplasmic domain than those of other known homologues. The minimal tetherin promoter was identified and assayed for its ability to drive tetherin expression in an alpha interferon-inducible manner. We also demonstrated that cBST2(504) is able to dimerize, is localized at the cellular membrane, and impairs human immunodeficiency virus type 1 (HIV-1) particle release, regardless of the presence of the Vpu antagonist accessory protein. While cBST2(504) failed to restrict wild-type feline immunodeficiency virus (FIV) egress, FIV mutants, bearing a frameshift at the level of the envelope-encoding region, were potently blocked. The transient expression of the FIV envelope glycoprotein was able to rescue mutant particle release from feline tetherin-positive cells but did not antagonize human BST2 activity. Moreover, cBST2(504) was capable of specifically immunoprecipitating the FIV envelope glycoprotein. Finally, cBST2(504) also exerted its function on HIV-2 ROD10 and on the simian immunodeficiency virus SIVmac239. Taken together, these results show that feline tetherin does indeed have a short N-terminal region and that the FIV envelope glycoprotein is the predominant factor counteracting tetherin restriction.
Li, Ling; Li, Dan; Liu, Li; Li, Shijun; Feng, Yanping; Peng, Xiuli; Gong, Yanzhang
2015-01-01
Endothelin receptor B subtype 2 (EDNRB2) is a seven-transmembrane G-protein coupled receptor. In this study, we investigated EDNRB2 gene as a candidate gene for duck spot plumage pattern according to studies of chicken and Japanese quail. The entire coding region was cloned by the reverse transcription polymerase chain reaction (RT-PCR). Sequence analysis showed that duck EDNRB2 cDNA contained a 1311 bp open reading frame and encoded a putative protein of 436 amino acids residues. The transcript shared 89%-90% identity with the counterparts in other avian species. A phylogenetic tree based on amino acid sequences showed that duck EDNRB2 was evolutionary conserved in avian clade. The entire coding region of EDNRB2 were sequenced in 20 spot and 20 non-spot ducks, and 13 SNPs were identified. Two of them (c.940G>A and c.995G>A) were non-synonymous substitutions, and were genotyped in 647 ducks representing non-spot and spot phenotypes. The c.995G>A mutation, which results in the amino acid substitution of Arg332His, was completely associated with the spot phenotype: all 152 spot ducks were carriers of the AA genotype and the other 495 individuals with non-spot phenotype were carriers of GA or GG genotype, respectively. Segregation in 17 GA×GG and 22 GA×GA testing combinations confirmed this association since the segregation ratios and genotypes of the offspring were in agreement with the hypothesis. In order to investigate the underlying mechanism of the spot phenotype, MITF gene was used as cell type marker of melanocyte progenitor cells while TYR and TYRP1 gene were used as cell type markers of mature melanocytes. Transcripts of MITF, TYR and TYRP1 gene with expected size were identified in all pigmented skin tissues while PCR products were not obtained from non-pigmented skin tissues. It was inferred that melanocytes are absent in non-pigmented skin tissues of spot ducks.
Human brain factor 1, a new member of the fork head gene family
DOE Office of Scientific and Technical Information (OSTI.GOV)
Murphy, D.B.; Wiese, S.; Burfeind, P.
1994-06-01
Analysis of cDNA clones that cross-hybridized with the fork head domain of the rat HNF-3 gene family revealed 10 cDNAs from human fetal brain and human testis cDNA libraries containing this highly conserved DNA-binding domain. Three of these cDNAs (HFK1, HFK2, and HFK3) were further analyzed. The cDNA HFK1 has a length of 2557 nucleotides and shows strong homology at the nucleotide level (91.2%) to brain factor 1 (BF-1) from rat. The HFK1 cDNA codes for a putative 476 amino acid protein. The homology to BF-1 from rat in the coding region at the amino acid level is 87.5%. Themore » fork head homologous region includes 111 amino acids starting at amino acid 160 and has a 97.5% homology to BF-1. Southern hybridization revealed that HFK1 is highly conserved among mammalian species and possibly birds. Northern analysis with total RNA from human tissues and poly(A)-rich RNA from mouse revealed a 3.2-kb transcript that is present in human and mouse fetal brain and in adult mouse brain. In situ hybridization with sections of mouse embryo and human fetal brain reveals that HFK1 expression is restricted to the neuronal cells in the telencepthalon, with strong expression being observed in the developing dentate gyrus and hippocampus. HFK1 was chromosomally localized by in situ hybridization to 14q12. The cDNA clones HFK2 and HFK3 were analyzed by restriction analysis and sequencing. HFK2 and HFK3 were found to be closely related but different from HFK1. Therefore, it would appear that HFK1, HFK2, HFK3, and BF-1 form a new fork head related subfamily. 33 refs., 6 figs.« less
Adefenwa, Mufliat A; Peters, Sunday O; Agaviezor, Brilliant O; Wheto, Matthew; Adekoya, Khalid O; Okpeku, Moses; Oboh, Bola; Williams, Gabriel O; Adebambo, Olufunmilayo A; Singh, Mahipal; Thomas, Bolaji; De Donato, Marcos; Imumorin, Ikhide G
2013-07-01
The agouti-signaling protein (ASIP) plays a major role in mammalian pigmentation as an antagonist to melanocortin-1 receptor gene to stimulate pheomelanin synthesis, a major pigment conferring mammalian coat color. We sequenced a 352 bp fragment of ASIP gene spanning part of exon 2 and part of intron 2 in 215 animals representing six goat breeds from Nigeria and the United States: West African Dwarf, predominantly black; Red Sokoto, mostly red; and Sahel, mostly white from Nigeria; black and white Alpine, brown and white Spanish and white Saanen from the US. Twenty haplotypes from nine mutations representing three intronic, one silent and five missense (p.S19R, p.N35K, p.L36V, p.M42L and p.L45W) mutations were identified in Nigerian goats. Approximately 89 % of Nigerian goats carry haplotype 1 (TGCCATCCG) which seems to be the wild type configuration of mutations in this region of the gene. Although we found no association between these polymorphisms in the ASIP gene and coat color in Nigerian goats, in-silico functional analysis predicts putative deleterious functional impact of the p.L45W mutation on the basic amino-terminal domain of ASIP. In the American goats, two intronic mutations, g.293G>A and g.327C>A, were identified in the Alpine breed, although the g.293G>A mutation is common to American and Nigerian goat populations. All Sannen and Sahel goats in this study belong to haplotypes 1 of both populations which seem to be the wild-type composite ASIP haplotype. Overall, there was no clear association of this portion of the ASIP gene interrogated in this study with coat color variation. Therefore, additional genomic analyses of promoter sequence, the entire coding and non-coding regions of the ASIP gene will be required to obtain a definite conclusion.
Szeleczky, Zsófia; Dán, Adám; Ursu, Krisztina; Ivanics, Eva; Kiss, István; Erdélyi, Károly; Belák, Sándor; Muller, Claude P; Brown, Ian H; Bálint, Adám
2009-10-20
Highly pathogenic avian influenza (HPAI) H5N1 viruses were introduced to Hungary during 2006-2007 in three separate waves. This study aimed at determining the full-length genomic coding regions of the index strains from these epizootics in order to: (i) understand the phylogenetic relationship to other European H5N1 isolates, (ii) elucidate the possible connection between the different outbreaks and (iii) determine the putative origin and way of introduction of the different virus variants. Molecular analysis of the HA gene of Hungarian HPAI isolates obtained from wild birds during the first introduction revealed two groups designated Hungarian1 (HUN1) and Hungarian2 (HUN2) within sublineage 2.2B and clade 2.2.1, respectively. Sequencing the whole coding region of the two index viruses A/mute swan/Hungary/3472/2006 and A/mute swan/4571/Hungary/2006 suggests the role of wild birds in the introduction of HUN1 and HUN2 viruses: the most similar isolates to HUN1 and HUN2 group were found in wild avian species in Croatia and Slovakia, respectively. The second introduction of HPAI H5N1 led to the largest epizootic in domestic waterfowl in Europe. The index strain of the epizootic A/goose/Hungary/14756/2006 clustered to sublineage 2.2.A1 forming the Hungarian3 (HUN3) group. A common ancestry of HUN3 isolates with Bavarian strains is suggested as the most likely scenario of origin. Hungarian4 (HUN4) viruses isolated from the third introduction clustered with isolate A/turkey/United Kingdom/750/2007 forming a sublineage 2.2.A2. The origin and way of introduction of HUN4 viruses is still obscure, thus further genetic, phylogenetic, ecological and epidemiological data are required in order to elucidate it.
Ferreira, Pedro Eduardo; Veiga, Maria Isabel; Cavaco, Isa; Martins, J Paulo; Andersson, Björn; Mushin, Shaliya; Ali, Abullah S; Bhattarai, Achuyt; Ribeiro, Vera; Björkman, Anders; Gil, José Pedro
2008-02-01
Artemisinin-based combination therapy is a main strategy for malaria control in Africa. Zanzibar introduced this new treatment policy in 2003. The authors have studied the prevalence of a number of functional single nucleotide polymorphisms (SNPs) in genes associated with the elimination of the artemisinin-based combination therapy compounds in use in Zanzibar to investigate the frequencies of subgroups potentially at higher drug exposure and therefore possible higher risk of toxicity. One hundred three unrelated children with uncomplicated malaria from the Unguja and Pemba islands of Zanzibar were enrolled. With use of polymerase chain reaction (PCR)-restriction fragment length polymorphism and real-time PCR-based allele discrimination methods, the CYP2B6 (G15631T), CYP3A4 (A-392G), CYP3A5 (A6986G, G14690A, 27131-132 insT, C3699T) SNPs and MDR1 SNPs C3435T, G2677T/A, and T-129C were analyzed. PCR product sequencing was applied to regulatory regions of MDR1, the CYP3A4 proximal promoter, and to exons 2 and 5 of PXR, a gene coding for a nuclear factor activated by artemisinin antimalarials and associated with the transcription induction of most of the studied genes. Homozygous subjects for alleles coding for low activity proteins were found at the following frequencies: 1) MDR1: 2.9%; 2) CYP2B6: 9.7%; 3) CYP3A5: 14.1%; and 4) CYP3A4: 49.5%. No functionally relevant allele was found in the analyzed regions of PXR. A new MDR1 SNP was found (T-158C), located in a putative antigen recognition element. Ten (10.1%) subjects were predicted to be low metabolizers simultaneously for CYP3A4 and CYP3A5. This fraction of the population is suggested to be under higher exposure to certain antimalarials, including lumefantrine and quinine.
Probing the reaching-grasping network in humans through multivoxel pattern decoding.
Di Bono, Maria Grazia; Begliomini, Chiara; Castiello, Umberto; Zorzi, Marco
2015-11-01
The quest for a putative human homolog of the reaching-grasping network identified in monkeys has been the focus of many neuropsychological and neuroimaging studies in recent years. These studies have shown that the network underlying reaching-only and reach-to-grasp movements includes the superior parieto-occipital cortex (SPOC), the anterior part of the human intraparietal sulcus (hAIP), the ventral and the dorsal portion of the premotor cortex, and the primary motor cortex (M1). Recent evidence for a wider frontoparietal network coding for different aspects of reaching-only and reach-to-grasp actions calls for a more fine-grained assessment of the reaching-grasping network in humans by exploiting pattern decoding methods (multivoxel pattern analysis--MVPA). Here, we used MPVA on functional magnetic resonance imaging (fMRI) data to assess whether regions of the frontoparietal network discriminate between reaching-only and reach-to-grasp actions, natural and constrained grasping, different grasp types, and object sizes. Participants were required to perform either reaching-only movements or two reach-to-grasp types (precision or whole hand grasp) upon spherical objects of different sizes. Multivoxel pattern analysis highlighted that, independently from the object size, all the selected regions of both hemispheres contribute in coding for grasp type, with the exception of SPOC and the right hAIP. Consistent with recent neurophysiological findings on monkeys, there was no evidence for a clear-cut distinction between a dorsomedial and a dorsolateral pathway that would be specialized for reaching-only and reach-to-grasp actions, respectively. Nevertheless, the comparison of decoding accuracy across brain areas highlighted their different contributions to reaching-only and grasping actions. Altogether, our findings enrich the current knowledge regarding the functional role of key brain areas involved in the cortical control of reaching-only and reach-to-grasp actions in humans, by revealing novel fine-grained distinctions among action types within a wide frontoparietal network.
Dworschak, Wolfgang; Ratz, Christoph; Wagner, Michael
2016-11-01
Numerous studies have reported a high prevalence of challenging behavior among students with intellectual disabilities (ID). They discuss different putative risk markers as well as their influence on the occurrence of challenging behavior. The study investigates the prevalence of challenging behavior and evaluates in terms of a replication study well-known putative risk markers among a representative sample of students with ID (N=1629) in Bavaria, one of the largest regions in Germany. The research is based on a modified version of the Developmental Behavior Checklist (DBC). Findings indicate a prevalence rate of 52% for challenging behavior. The following putative risk markers are associated with challenging behavior: intense need for care, male gender, lack of communication skills, and residential setting. These risk markers explain 8.4% of the variance concerning challenging behavior. These results reveal that challenging behavior either is to a large extent determined by situations and interactions between individuals and environment and cannot be explained by the measured individual and social risk markers alone, or it is determined by further risk markers that were not measured. Copyright © 2016 Elsevier Ltd. All rights reserved.
Dynamics of Intersubject Brain Networks during Anxious Anticipation
Najafi, Mahshid; Kinnison, Joshua; Pessoa, Luiz
2017-01-01
How do large-scale brain networks reorganize during the waxing and waning of anxious anticipation? Here, threat was dynamically modulated during human functional MRI as two circles slowly meandered on the screen; if they touched, an unpleasant shock was delivered. We employed intersubject correlation analysis, which allowed the investigation of network-level functional connectivity across brains, and sought to determine how network connectivity changed during periods of approach (circles moving closer) and periods of retreat (circles moving apart). Analysis of positive connection weights revealed that dynamic threat altered connectivity within and between the salience, executive, and task-negative networks. For example, dynamic functional connectivity increased within the salience network during approach and decreased during retreat. The opposite pattern was found for the functional connectivity between the salience and task-negative networks: decreases during approach and increases during approach. Functional connections between subcortical regions and the salience network also changed dynamically during approach and retreat periods. Subcortical regions exhibiting such changes included the putative periaqueductal gray, putative habenula, and putative bed nucleus of the stria terminalis. Additional analysis of negative functional connections revealed dynamic changes, too. For example, negative weights within the salience network decreased during approach and increased during retreat, opposite what was found for positive weights. Together, our findings unraveled dynamic features of functional connectivity of large-scale networks and subcortical regions across participants while threat levels varied continuously, and demonstrate the potential of characterizing emotional processing at the level of dynamic networks. PMID:29209184
Becker, Sara J; Squires, Daniel D; Strong, David R; Barnett, Nancy P; Monti, Peter M; Petry, Nancy M
2016-01-01
Few prospective studies have evaluated theory-driven approaches to the implementation of evidence-based opioid treatment. This study compared the effectiveness of an implementation model (Science to Service Laboratory; SSL) to training as usual (TAU) in promoting the adoption of contingency management across a multisite opioid addiction treatment program. We also examined whether the SSL affected putative mediators of contingency management adoption (perceived innovation characteristics and organizational readiness to change). Sixty treatment providers (39 SSL, 21 TAU) from 15 geographically diverse satellite clinics (7 SSL, 8 TAU) participated in the 12-month study. Both conditions received didactic contingency management training and those in the predetermined experimental region received 9 months of SSL-enhanced training. Contingency management adoption was monitored biweekly, whereas putative mediators were measured at baseline, 3 months, and 12 months. Relative to providers in the TAU region, treatment providers in the SSL region had comparable likelihood of contingency management adoption in the first 20 weeks of the study, and then significantly higher likelihood of adoption (odds ratios = 2.4-13.5) for the remainder of the study. SSL providers also reported higher levels of one perceived innovation characteristic (Observability) and one aspect of organizational readiness to change (Adequacy of Training Resources), although there was no evidence that the SSL affected these putative mediators over time. Results of this study indicate that a fully powered randomized trial of the SSL is warranted. Considerations for a future evaluation are discussed.
Kamalakaran, Sitharthan; Radhakrishnan, Senthil K; Beck, William T
2005-06-03
We developed a pipeline to identify novel genes regulated by the steroid hormone-dependent transcription factor, estrogen receptor, through a systematic analysis of upstream regions of all human and mouse genes. We built a data base of putative promoter regions for 23,077 human and 19,984 mouse transcripts from National Center for Biotechnology Information annotation and 8793 human and 6785 mouse promoters from the Data Base of Transcriptional Start Sites. We used this data base of putative promoters to identify potential targets of estrogen receptor by identifying estrogen response elements (EREs) in their promoters. Our program correctly identified EREs in genes known to be regulated by estrogen in addition to several new genes whose putative promoters contained EREs. We validated six genes (KIAA1243, NRIP1, MADH9, NME3, TPD52L, and ABCG2) to be estrogen-responsive in MCF7 cells using reverse transcription PCR. To allow for extensibility of our program in identifying targets of other transcription factors, we have built a Web interface to access our data base and programs. Our Web-based program for Promoter Analysis of Genome, PAGen@UIC, allows a user to identify putative target genes for vertebrate transcription factors through the analysis of their upstream sequences. The interface allows the user to search the human and mouse promoter data bases for potential target genes containing one or more listed transcription factor binding sites (TFBSs) in their upstream elements, using either regular expression-based consensus or position weight matrices. The data base can also be searched for promoters harboring user-defined TFBSs given as a consensus or a position weight matrix. Furthermore, the user can retrieve putative promoter sequences for any given gene together with identified TFBSs located on its promoter. Orthologous promoters are also analyzed to determine conserved elements.
King, Lanikea B.; Walum, Hasse; Inoue, Kiyoshi; Eyrich, Nicholas W.; Young, Larry J.
2015-01-01
Background Oxytocin (OXT) modulates several aspects of social behavior. Intranasal OXT is a leading candidate for treating social deficits in autism spectrum disorder (ASD) and common genetic variants in the human oxytocin receptor (OXTR) are associated with emotion recognition, relationship quality and ASD. Animal models have revealed that individual differences in Oxtr expression in the brain drive social behavior variation. Our understanding of how genetic variation contributes to brain OXTR expression is very limited. Methods We investigated Oxtr expression in monogamous prairie voles, which have a well characterized OXT system. We quantified brain region-specific levels of Oxtr mRNA and OXTR protein with established neuroanatomical methods. We used pyrosequencing to investigate allelic imbalance of Oxtr mRNA, a molecular signature of polymorphic genetic regulatory elements. We performed next-generation sequencing to discover variants in and near the Oxtr gene. We investigated social attachment using the partner preference test. Results Our allelic imbalance data demonstrates that genetic variants contribute to individual differences in Oxtr expression, but only in particular brain regions, including the nucleus accumbens (NAcc), where OXTR signaling facilitates social attachment. Next-generation sequencing identified one polymorphism in the Oxtr intron, near a putative cis-regulatory element, explaining 74% of the variance in striatal Oxtr expression specifically. Males homozygous for the high expressing allele display enhanced social attachment. Discussion Taken together, these findings provide convincing evidence for robust genetic influence on Oxtr expression and provide novel insights into how non-coding polymorphisms in the OXTR might influence individual differences in human social cognition and behavior PMID:26893121
Modulation of hepatocyte growth factor gene expression by estrogen in mouse ovary.
Liu, Y; Lin, L; Zarnegar, R
1994-09-01
Hepatocyte growth factor (HGF) is expressed in a variety of tissues and cell types under normal conditions and in response to various stimuli such as tissue injury. In the present study, we demonstrate that the transcription of the HGF gene is stimulated by estrogen in mouse ovary. A single injection of 17 beta-estradiol results in a dramatic and transient elevation of the levels of mouse HGF mRNA. Sequence analysis has found that two putative estrogen responsive elements (ERE) reside at -872 in the 5'-flanking region and at +511 in the first intron, respectively, of the mouse HGF gene. To test whether these ERE elements are responsible for estrogen induction of HGF gene expression, chimeric plasmids containing variable regions of the 5'-flanking sequence of HGF gene and the coding region for chloramphenicol acetyltransferase (CAT) gene were transiently transfected into both human endometrial carcinoma RL 95-2 cells and mouse fibroblast NIH 3T3 cells to assess hormone responsiveness. Transfection results indicate that the ERE elements of the mouse HGF gene can confer estrogen action to either homologous or heterologous promoters. Nuclear protein extracts either from RL95-2 cells transfected with the estrogen receptor expression vector or from mouse liver bound in vitro to ERE elements specifically, as shown by band shift assay. Therefore, our results demonstrate that the HGF gene is transcriptionally regulated by estrogen in mouse ovary; and such regulation is mediated via a direct interaction of the estrogen receptor complex with cis-acting ERE elements identified in the mouse HGF gene.
Mutation in Pyrroline-5-Carboxylate Reductase 1 Gene in Families with Cutis Laxa Type 2
Guernsey, Duane L.; Jiang, Haiyan; Evans, Susan C.; Ferguson, Meghan; Matsuoka, Makoto; Nightingale, Mathew; Rideout, Andrea L.; Provost, Sylvie; Bedard, Karen; Orr, Andrew; Dubé, Marie-Pierre; Ludman, Mark; Samuels, Mark E.
2009-01-01
Autosomal-recessive cutis laxa type 2 (ARCL2) is a multisystem disorder characterized by the appearance of premature aging, wrinkled and lax skin, joint laxity, and a general developmental delay. Cutis laxa includes a family of clinically overlapping conditions with confusing nomenclature, generally requiring molecular analyses for definitive diagnosis. Six genes are currently known to mutate to yield one of these related conditions. We ascertained a cohort of typical ARCL2 patients from a subpopulation isolate within eastern Canada. Homozygosity mapping with high-density SNP genotyping excluded all six known genes, and instead identified a single homozygous region near the telomere of chromosome 17, shared identically by state by all genotyped affected individuals from the families. A putative pathogenic variant was identified by direct DNA sequencing of genes within the region. The single nucleotide change leads to a missense mutation adjacent to a splice junction in the gene encoding pyrroline-5-carboxylate reductase 1 (PYCR1). Bioinformatic analysis predicted a pathogenic effect of the variant on splice donor site function. Skipping of the associated exon was confirmed in RNA from blood lymphocytes of affected homozygotes and heterozygous mutation carriers. Exon skipping leads to deletion of the reductase functional domain-coding region and an obligatory downstream frameshift. PYCR1 plays a critical role in proline biosynthesis. Pathogenicity of the genetic variant in PYCR1 is likely, given that a similar clinical phenotype has been documented for mutation carriers of another proline biosynthetic enzyme, pyrroline-5-carboxylate synthase. Our results support a significant role for proline in normal development. PMID:19576563
Sequence variability of Campylobacter temperate bacteriophages
Clark, Clifford G; Ng, Lai-King
2008-01-01
Background Prophages integrated within the chromosomes of Campylobacter jejuni isolates have been demonstrated very recently. Prior work with Campylobacter temperate bacteriophages, as well as evidence from prophages in other enteric bacteria, suggests these prophages might have a role in the biology and virulence of the organism. However, very little is known about the genetic variability of Campylobacter prophages which, if present, could lead to differential phenotypes in isolates carrying the phages versus those that do not. As a first step in the characterization of C. jejuni prophages, we investigated the distribution of prophage DNA within a C. jejuni population assessed the DNA and protein sequence variability within a subset of the putative prophages found. Results Southern blotting of C. jejuni DNA using probes from genes within the three putative prophages of the C. jejuni sequenced strain RM 1221 demonstrated the presence of at least one prophage gene in a large proportion (27/35) of isolates tested. Of these, 15 were positive for 5 or more of the 7 Campylobacter Mu-like phage 1 (CMLP 1, also designated Campylobacter jejuni integrated element 1, or CJIE 1) genes tested. Twelve of these putative prophages were chosen for further analysis. DNA sequencing of a 9,000 to 11,000 nucleotide region of each prophage demonstrated a close homology with CMLP 1 in both gene order and nucleotide sequence. Structural and sequence variability, including short insertions, deletions, and allele replacements, were found within the prophage genomes, some of which would alter the protein products of the ORFs involved. No insertions of novel genes were detected within the sequenced regions. The 12 prophages and RM 1221 had a % G+C very similar to C. jejuni sequenced strains, as well as promoter regions characteristic of C. jejuni. None of the putative prophages were successfully induced and propagated, so it is not known if they were functional or if they represented remnant prophage DNA in the bacterial chromosomes. Conclusion These putative prophages form a family of phages with conserved sequences, and appear to be adapted to Campylobacter. There was evidence for recombination among groups of prophages, suggesting that the prophages had a mosaic structure. In many of these properties, the Mu-like CMLP 1 homologs characterized in this study resemble temperate bacteriophages of enteric bacteria that are responsible for contributions to virulence and host adaptation. PMID:18366706
Edge orientation signals in tactile afferents of macaques
Suresh, Aneesha K.
2016-01-01
The orientation of edges indented into the skin has been shown to be encoded in the responses of neurons in primary somatosensory cortex in a manner that draws remarkable analogies to their counterparts in primary visual cortex. According to the classical view, orientation tuning arises from the integration of untuned input from thalamic neurons with aligned but spatially displaced receptive fields (RFs). In a recent microneurography study with human subjects, the precise temporal structure of the responses of individual mechanoreceptive afferents to scanned edges was found to carry information about their orientation. This putative mechanism could in principle contribute to or complement the classical rate-based code for orientation. In the present study, we further examine orientation information carried by mechanoreceptive afferents of Rhesus monkeys. To this end, we record the activity evoked in cutaneous mechanoreceptive afferents when edges are indented into or scanned across the skin. First, we confirm that information about the edge orientation can be extracted from the temporal patterning in afferent responses of monkeys, as is the case in humans. Second, we find that while the coarse temporal profile of the response can be predicted linearly from the layout of the RF, the fine temporal profile cannot. Finally, we show that orientation signals in tactile afferents are often highly dependent on stimulus features other than orientation, which complicates putative decoding strategies. We discuss the challenges associated with establishing a neural code at the somatosensory periphery, where afferents are exquisitely sensitive and nearly deterministic. PMID:27655968
Fraisier, V; Gojon, A; Tillard, P; Daniel-Vedele, F
2000-08-01
The NpNRT2.1 gene encodes a putative inducible component of the high-affinity nitrate (NO3-) uptake system in Nicotiana plumbaginifolia. Here we report functional and physiological analyses of transgenic plants expressing the NpNRT2.1 coding sequence fused to the CaMV 35S or rolD promoters. Irrespective of the level of NO3- supplied, NO3- contents were found to be remarkably similar in wild-type and transgenic plants. Under specific conditions (growth on 10 mM NO3-), the steady-state NpNRT2. 1 mRNA level resulting from the deregulated transgene expression was accompanied by an increase in 15NO3- influx measured in the low concentration range. This demonstrates for the first time that the NRT2.1 sequence codes a limiting element of the inducible high-affinity transport system. Both 15NO3- influx and mRNA levels decreased in the wild type after exposure to ammonium, in agreement with previous results from many species. Surprisingly, however, influx was also markedly decreased in transgenic plants, despite stable levels of transgene expression in independent transformants after ammonium addition. We conclude that the conditions associated with the supply of a reduced nitrogen source such as ammonium, or with the generation of a further downstream metabolite, probably exert a repressive effect on NO3- influx at both transcriptional and post-transcriptional levels.
Khanna, Namita; Ghosh, Ananta Kumar; Huntemann, Marcel; Deshpande, Shweta; Han, James; Chen, Amy; Kyrpides, Nikos; Mavrommatis, Kostas; Szeto, Ernest; Markowitz, Victor; Ivanova, Natalia; Pagani, Ioanna; Pati, Amrita; Pitluck, Sam; Nolan, Matt; Woyke, Tanja; Teshima, Hazuki; Chertkov, Olga; Daligault, Hajnalka; Davenport, Karen; Gu, Wei; Munk, Christine; Zhang, Xiaojing; Bruce, David; Detter, Chris; Xu, Yan; Quintana, Beverly; Reitenga, Krista; Kunde, Yulia; Green, Lance; Erkkila, Tracy; Han, Cliff; Brambilla, Evelyne-Marie; Lang, Elke; Klenk, Hans-Peter; Goodwin, Lynne; Chain, Patrick; Das, Debabrata
2013-12-20
Enterobacter sp. IIT-BT 08 belongs to Phylum: Proteobacteria, Class: Gammaproteobacteria, Order: Enterobacteriales, Family: Enterobacteriaceae. The organism was isolated from the leaves of a local plant near the Kharagpur railway station, Kharagpur, West Bengal, India. It has been extensively studied for fermentative hydrogen production because of its high hydrogen yield. For further enhancement of hydrogen production by strain development, complete genome sequence analysis was carried out. Sequence analysis revealed that the genome was linear, 4.67 Mbp long and had a GC content of 56.01%. The genome properties encode 4,393 protein-coding and 179 RNA genes. Additionally, a putative pathway of hydrogen production was suggested based on the presence of formate hydrogen lyase complex and other related genes identified in the genome. Thus, in the present study we describe the specific properties of the organism and the generation, annotation and analysis of its genome sequence as well as discuss the putative pathway of hydrogen production by this organism.
Egan, Sharon A.; Ward, Philip N.; Watson, Michael; Field, Terence R.
2012-01-01
The regulation and control of gene expression in response to differing environmental stimuli is crucial for successful pathogen adaptation and persistence. The regulatory gene vru of Streptococcus uberis encodes a stand-alone response regulator with similarity to the Mga of group A Streptococcus. Mga controls expression of a number of important virulence determinants. Experimental intramammary challenge of dairy cattle with a mutant of S. uberis carrying an inactivating lesion in vru showed reduced ability to colonize the mammary gland and an inability to induce clinical signs of mastitis compared with the wild-type strain. Analysis of transcriptional differences of gene expression in the mutant, determined by microarray analysis, identified a number of coding sequences with altered expression in the absence of Vru. These consisted of known and putative virulence determinants, including Lbp (Sub0145), SclB (Sub1095), PauA (Sub1785) and hasA (Sub1696). PMID:22383474
The Human Retrosplenial Cortex and Thalamus Code Head Direction in a Global Reference Frame.
Shine, Jonathan P; Valdés-Herrera, José P; Hegarty, Mary; Wolbers, Thomas
2016-06-15
Spatial navigation is a multisensory process involving integration of visual and body-based cues. In rodents, head direction (HD) cells, which are most abundant in the thalamus, integrate these cues to code facing direction. Human fMRI studies examining HD coding in virtual environments (VE) have reported effects in retrosplenial complex and (pre-)subiculum, but not the thalamus. Furthermore, HD coding appeared insensitive to global landmarks. These tasks, however, provided only visual cues for orientation, and attending to global landmarks did not benefit task performance. In the present study, participants explored a VE comprising four separate locales, surrounded by four global landmarks. To provide body-based cues, participants wore a head-mounted display so that physical rotations changed facing direction in the VE. During subsequent MRI scanning, subjects saw stationary views of the environment and judged whether their orientation was the same as in the preceding trial. Parameter estimates extracted from retrosplenial cortex and the thalamus revealed significantly reduced BOLD responses when HD was repeated. Moreover, consistent with rodent findings, the signal did not continue to adapt over repetitions of the same HD. These results were supported by a whole-brain analysis showing additional repetition suppression in the precuneus. Together, our findings suggest that: (1) consistent with the rodent literature, the human thalamus may integrate visual and body-based, orientation cues; (2) global reference frame cues can be used to integrate HD across separate individual locales; and (3) immersive training procedures providing full body-based cues may help to elucidate the neural mechanisms supporting spatial navigation. In rodents, head direction (HD) cells signal facing direction in the environment via increased firing when the animal assumes a certain orientation. Distinct brain regions, the retrosplenial cortex (RSC) and thalamus, code for visual and vestibular cues of orientation, respectively. Putative HD signals have been observed in human RSC but not the thalamus, potentially because body-based cues were not provided. Here, participants encoded HD in a novel virtual environment while wearing a head-mounted display to provide body-based cues for orientation. In subsequent fMRI scanning, we found evidence of an HD signal in RSC, thalamus, and precuneus. These findings harmonize rodent and human data, and suggest that immersive training procedures provide a viable way to examine the neural basis of navigation. Copyright © 2016 the authors 0270-6474/16/366371-11$15.00/0.
Langner, Ingo; Mikolajczyk, Rafael; Garbe, Edeltraut
2011-08-17
Health insurance claims data are increasingly used for health services research in Germany. Hospital diagnoses in these data are coded according to the International Classification of Diseases, German modification (ICD-10-GM). Due to the historical division into West and East Germany, different coding practices might persist in both former parts. Additionally, the introduction of Diagnosis Related Groups (DRGs) in Germany in 2003/2004 might have changed the coding. The aim of this study was to investigate regional and temporal variations in coding of hospitalisation diagnoses in Germany. We analysed hospitalisation diagnoses for oesophageal bleeding (OB) and upper gastrointestinal bleeding (UGIB) from the official German Hospital Statistics provided by the Federal Statistical Office. Bleeding diagnoses were classified as "specific" (origin of bleeding provided) or "unspecific" (origin of bleeding not provided) coding. We studied regional (former East versus West Germany) differences in incidence of hospitalisations with specific or unspecific coding for OB and UGIB and temporal variations between 2000 and 2005. For each year, incidence ratios of hospitalisations for former East versus West Germany were estimated with log-linear regression models adjusting for age, gender and population density. Significant differences in specific and unspecific coding between East and West Germany and over time were found for both, OB and UGIB hospitalisation diagnoses, respectively. For example in 2002, incidence ratios of hospitalisations for East versus West Germany were 1.24 (95% CI 1.16-1.32) for specific and 0.67 (95% CI 0.60-0.74) for unspecific OB diagnoses and 1.43 (95% CI 1.36-1.51) for specific and 0.83 (95% CI 0.80-0.87) for unspecific UGIB. Regional differences nearly disappeared and time trends were less marked when using combined specific and unspecific diagnoses of OB or UGIB, respectively. During the study period, there were substantial regional and temporal variations in the coding of OB and UGIB diagnoses in hospitalised patients. Possible explanations for the observed regional variations are different coding preferences, further influenced by changes in coding and reimbursement rules. Analysing groups of diagnoses including specific and unspecific codes reduces the influence of varying coding practices.
mTOR referees memory and disease through mRNA repression and competition.
Raab-Graham, Kimberly F; Niere, Farr
2017-06-01
Mammalian target of rapamycin (mTOR) activity is required for memory and is dysregulated in disease. Activation of mTOR promotes protein synthesis; however, new studies are demonstrating that mTOR activity also represses the translation of mRNAs. Almost three decades ago, Kandel and colleagues hypothesised that memory was due to the induction of positive regulators and removal of negative constraints. Are these negative constraints repressed mRNAs that code for proteins that block memory formation? Herein, we will discuss the mRNAs coded by putative memory suppressors, how activation/inactivation of mTOR repress protein expression at the synapse, how mTOR activity regulates RNA binding proteins, mRNA stability, and translation, and what the possible implications of mRNA repression are to memory and neurodegenerative disorders. © 2017 Federation of European Biochemical Societies.
Paul, Sujay; Zhang, Angel; Ludeña, Yvette; Villena, Gretty K; Yu, Fengan; Sherman, David H; Gutiérrez-Correa, Marcel
2017-06-10
Here, we report the complete genome sequence of a high alkaline cellulase producing Aspergillus fumigatus strain LMB-35Aa isolated from soil of Peruvian Amazon rainforest. The genome is ∼27.5mb in size, comprises of 228 scaffolds with an average GC content of 50%, and is predicted to contain a total of 8660 protein-coding genes. Of which, 6156 are with known function; it codes for 607 putative CAZymes families potentially involved in carbohydrate metabolism. Several important cellulose degrading genes, such as endoglucanase A, endoglucanase B, endoglucanase D and beta-glucosidase, are also identified. The genome of A. fumigatus strain LMB-35Aa represents the first whole sequenced genome of non-clinical, high cellulase producing A. fumigatus strain isolated from forest soil. Copyright © 2017 Elsevier B.V. All rights reserved.
Cavanagh, Jorunn Pauline; Hjerde, Erik; Holden, Matthew T G; Kahlke, Tim; Klingenberg, Claus; Flægstad, Trond; Parkhill, Julian; Bentley, Stephen D; Sollid, Johanna U Ericson
2014-11-01
Staphylococcus haemolyticus is an emerging cause of nosocomial infections, primarily affecting immunocompromised patients. A comparative genomic analysis was performed on clinical S. haemolyticus isolates to investigate their genetic relationship and explore the coding sequences with respect to antimicrobial resistance determinants and putative hospital adaptation. Whole-genome sequencing was performed on 134 isolates of S. haemolyticus from geographically diverse origins (Belgium, 2; Germany, 10; Japan, 13; Norway, 54; Spain, 2; Switzerland, 43; UK, 9; USA, 1). Each genome was individually assembled. Protein coding sequences (CDSs) were predicted and homologous genes were categorized into three types: Type I, core genes, homologues present in all strains; Type II, unique core genes, homologues shared by only a subgroup of strains; and Type III, unique genes, strain-specific CDSs. The phylogenetic relationship between the isolates was built from variable sites in the form of single nucleotide polymorphisms (SNPs) in the core genome and used to construct a maximum likelihood phylogeny. SNPs in the genome core regions divided the isolates into one major group of 126 isolates and one minor group of isolates with highly diverse genomes. The major group was further subdivided into seven clades (A-G), of which four (A-D) encompassed isolates only from Europe. Antimicrobial multiresistance was observed in 77.7% of the collection. High levels of homologous recombination were detected in genes involved in adherence, staphylococcal host adaptation and bacterial cell communication. The presence of several successful and highly resistant clones underlines the adaptive potential of this opportunistic pathogen. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy.
Cavanagh, Jorunn Pauline; Hjerde, Erik; Holden, Matthew T. G.; Kahlke, Tim; Klingenberg, Claus; Flægstad, Trond; Parkhill, Julian; Bentley, Stephen D.; Sollid, Johanna U. Ericson
2014-01-01
Objectives Staphylococcus haemolyticus is an emerging cause of nosocomial infections, primarily affecting immunocompromised patients. A comparative genomic analysis was performed on clinical S. haemolyticus isolates to investigate their genetic relationship and explore the coding sequences with respect to antimicrobial resistance determinants and putative hospital adaptation. Methods Whole-genome sequencing was performed on 134 isolates of S. haemolyticus from geographically diverse origins (Belgium, 2; Germany, 10; Japan, 13; Norway, 54; Spain, 2; Switzerland, 43; UK, 9; USA, 1). Each genome was individually assembled. Protein coding sequences (CDSs) were predicted and homologous genes were categorized into three types: Type I, core genes, homologues present in all strains; Type II, unique core genes, homologues shared by only a subgroup of strains; and Type III, unique genes, strain-specific CDSs. The phylogenetic relationship between the isolates was built from variable sites in the form of single nucleotide polymorphisms (SNPs) in the core genome and used to construct a maximum likelihood phylogeny. Results SNPs in the genome core regions divided the isolates into one major group of 126 isolates and one minor group of isolates with highly diverse genomes. The major group was further subdivided into seven clades (A–G), of which four (A–D) encompassed isolates only from Europe. Antimicrobial multiresistance was observed in 77.7% of the collection. High levels of homologous recombination were detected in genes involved in adherence, staphylococcal host adaptation and bacterial cell communication. Conclusions The presence of several successful and highly resistant clones underlines the adaptive potential of this opportunistic pathogen. PMID:25038069
Voß, Björn; Bolhuis, Henk; Fewer, David P.; Kopf, Matthias; Möke, Fred; Haas, Fabian; El-Shehawy, Rehab; Hayes, Paul; Bergman, Birgitta; Sivonen, Kaarina; Dittmann, Elke; Scanlan, Dave J.; Hagemann, Martin; Stal, Lucas J.; Hess, Wolfgang R.
2013-01-01
Nodularia spumigena is a filamentous diazotrophic cyanobacterium that dominates the annual late summer cyanobacterial blooms in the Baltic Sea. But N. spumigena also is common in brackish water bodies worldwide, suggesting special adaptation allowing it to thrive at moderate salinities. A draft genome analysis of N. spumigena sp. CCY9414 yielded a single scaffold of 5,462,271 nucleotides in length on which genes for 5,294 proteins were annotated. A subsequent strand-specific transcriptome analysis identified more than 6,000 putative transcriptional start sites (TSS). Orphan TSSs located in intergenic regions led us to predict 764 non-coding RNAs, among them 70 copies of a possible retrotransposon and several potential RNA regulators, some of which are also present in other N2-fixing cyanobacteria. Approximately 4% of the total coding capacity is devoted to the production of secondary metabolites, among them the potent hepatotoxin nodularin, the linear spumigin and the cyclic nodulapeptin. The transcriptional complexity associated with genes involved in nitrogen fixation and heterocyst differentiation is considerably smaller compared to other Nostocales. In contrast, sophisticated systems exist for the uptake and assimilation of iron and phosphorus compounds, for the synthesis of compatible solutes, and for the formation of gas vesicles, required for the active control of buoyancy. Hence, the annotation and interpretation of this sequence provides a vast array of clues into the genomic underpinnings of the physiology of this cyanobacterium and indicates in particular a competitive edge of N. spumigena in nutrient-limited brackish water ecosystems. PMID:23555932
Yinda, Claude Kwe; Ghogomu, Stephen Mbigha; Conceição-Neto, Nádia; Beller, Leen; Deboutte, Ward; Vanhulle, Emiel; Maes, Piet; Van Ranst, Marc; Matthijnssens, Jelle
2018-01-01
Most human emerging infectious diseases originate from wildlife and bats are a major reservoir of viruses, a few of which have been highly pathogenic to humans. In some regions of Cameroon, bats are hunted and eaten as a delicacy. This close proximity between human and bats provides ample opportunity for zoonotic events. To elucidate the viral diversity of Cameroonian fruit bats, we collected and metagenomically screened eighty-seven fecal samples of Eidolon helvum and Epomophorus gambianus fruit bats. The results showed a plethora of known and novel viruses. Phylogenetic analyses of the eleven gene segments of the first complete bat rotavirus H genome, showed clearly separated clusters of human, porcine, and bat rotavirus H strains, not indicating any recent interspecies transmission events. Additionally, we identified and analyzed a bat bastrovirus genome (a novel group of recently described viruses, related to astroviruses and hepatitis E viruses), confirming their recombinant nature, and provide further evidence of additional recombination events among bat bastroviruses. Interestingly, picobirnavirus-like RNA-dependent RNA polymerase gene segments were identified using an alternative mitochondrial genetic code, and further principal component analyses suggested that they may have a similar lifestyle to mitoviruses, a group of virus-like elements known to infect the mitochondria of fungi. Although identified bat coronavirus, parvovirus, and cyclovirus strains belong to established genera, most of the identified partitiviruses and densoviruses constitute putative novel genera in their respective families. Finally, the results of the phage community analyses of these bats indicate a very diverse geographically distinct bat phage population, probably reflecting different diets and gut bacterial ecosystems.
Autism-like behavioral phenotypes in BTBR T+tf/J mice.
McFarlane, H G; Kusek, G K; Yang, M; Phoenix, J L; Bolivar, V J; Crawley, J N
2008-03-01
Autism is a behaviorally defined neurodevelopmental disorder of unknown etiology. Mouse models with face validity to the core symptoms offer an experimental approach to test hypotheses about the causes of autism and translational tools to evaluate potential treatments. We discovered that the inbred mouse strain BTBR T+tf/J (BTBR) incorporates multiple behavioral phenotypes relevant to all three diagnostic symptoms of autism. BTBR displayed selectively reduced social approach, low reciprocal social interactions and impaired juvenile play, as compared with C57BL/6J (B6) controls. Impaired social transmission of food preference in BTBR suggests communication deficits. Repetitive behaviors appeared as high levels of self-grooming by juvenile and adult BTBR mice. Comprehensive analyses of procedural abilities confirmed that social recognition and olfactory abilities were normal in BTBR, with no evidence for high anxiety-like traits or motor impairments, supporting an interpretation of highly specific social deficits. Database comparisons between BTBR and B6 on 124 putative autism candidate genes showed several interesting single nucleotide polymorphisms (SNPs) in the BTBR genetic background, including a nonsynonymous coding region polymorphism in Kmo. The Kmo gene encodes kynurenine 3-hydroxylase, an enzyme-regulating metabolism of kynurenic acid, a glutamate antagonist with neuroprotective actions. Sequencing confirmed this coding SNP in Kmo, supporting further investigation into the contribution of this polymorphism to autism-like behavioral phenotypes. Robust and selective social deficits, repetitive self-grooming, genetic stability and commercial availability of the BTBR inbred strain encourage its use as a research tool to search for background genes relevant to the etiology of autism, and to explore therapeutics to treat the core symptoms.
Evolution of Transcription Activator-Like Effectors in Xanthomonas oryzae
Erkes, Annett; Reschke, Maik; Boch, Jens
2017-01-01
Abstract Transcription activator-like effectors (TALEs) are secreted by plant–pathogenic Xanthomonas bacteria into plant cells where they act as transcriptional activators and, hence, are major drivers in reprogramming the plant for the benefit of the pathogen. TALEs possess a highly repetitive DNA-binding domain of typically 34 amino acid (AA) tandem repeats, where AA 12 and 13, termed repeat variable di-residue (RVD), determine target specificity. Different Xanthomonas strains possess different repertoires of TALEs. Here, we study the evolution of TALEs from the level of RVDs determining target specificity down to the level of DNA sequence with focus on rice-pathogenic Xanthomonas oryzae pv. oryzae (Xoo) and Xanthomonas oryzae pv. oryzicola (Xoc) strains. We observe that codon pairs coding for individual RVDs are conserved to a similar degree as the flanking repeat sequence. We find strong indications that TALEs may evolve 1) by base substitutions in codon pairs coding for RVDs, 2) by recombination of N-terminal or C-terminal regions of existing TALEs, or 3) by deletion of individual TALE repeats, and we propose possible mechanisms. We find indications that the reassortment of TALE genes in clusters is mediated by an integron-like mechanism in Xoc. We finally study the effect of the presence/absence and evolutionary modifications of TALEs on transcriptional activation of putative target genes in rice, and find that even single RVD swaps may lead to considerable differences in activation. This correlation allowed a refined prediction of TALE targets, which is the crucial step to decipher their virulence activity. PMID:28637323
Al Jawaldeh, Ayoub; Sayed, Ghada
2018-04-05
Optimal breastfeeding practices and appropriate complementary feeding improve child health, survival and development. The countries of the Eastern Mediterranean Region have made significant strides in formulation and implementation of legislation to protect and promote breastfeeding based on The International Code of Marketing of Breast-milk Substitutes (the Code) and subsequent relevant World Health Assembly resolutions. To assess the implementation of the Code in the Region. Assessment was conducted by the World Health Organization (WHO) Regional Office for the Eastern Mediterranean using a WHO standard questionnaire. Seventeen countries in the Region have enacted legislation to protect breastfeeding. Only 6 countries have comprehensive legislation or other legal measures reflecting all or most provisions of the Code; 4 countries have legal measures incorporating many provisions of the Code; 7 countries have legal measures that contain a few provisions of the Code; 4 countries are currently studying the issue; and only 1 country has no measures in place. Further analysis of the legislation found that the text of articles in the laws fully reflected the Code articles in only 6 countries. Most countries need to revisit and amend existing national legislation to implement fully the Code and relevant World Health Assembly resolutions, supported by systematic monitoring and reporting. Copyright © World Health Organization (WHO) 2018. Some rights reserved. This work is available under the CC BY-NC-SA 3.0 IGO license (https://creativecommons.org/licenses/by-nc-sa/3.0/igo).
Improving the sensitivity and specificity of the abbreviated injury scale coding system.
Kramer, C F; Barancik, J I; Thode, H C
1990-01-01
The Abbreviated Injury Scale with Epidemiologic Modifications (AIS 85-EM) was developed to make it possible to code information about anatomic injury types and locations that, although generally available from medical records, is not codable under the standard Abbreviated Injury Scale, published by the American Association for Automotive Medicine in 1985 (AIS 85). In a population-based sample of 3,223 motor vehicle trauma cases, 68 percent of the patients had one or more injuries that were coded to the AIS 85 body region nonspecific category external. When the same patients' injuries were coded using the AIS 85-EM coding procedure, only 15 percent of the patients had injuries that could not be coded to a specific body region. With AIS 85-EM, the proportion of codable head injury cases increased from 16 percent to 37 percent, thereby improving the potential for identifying cases with head and threshold brain injury. The data suggest that body region coding of all injuries is necessary to draw valid and reliable conclusions about changes in injury patterns and their sequelae. The increased specificity of body region coding improves assessments of the efficacy of injury intervention strategies and countermeasure programs using epidemiologic methodology. PMID:2116633
25 CFR 900.125 - What shall a construction contract proposal contain?
Code of Federal Regulations, 2012 CFR
2012-04-01
... tribal building codes and engineering standards; (4) Structural integrity; (5) Accountability of funds..., standards and methods (including national, regional, state, or tribal building codes or construction... methods (including national, regional, state, or tribal building codes or construction industry standards...
25 CFR 900.125 - What shall a construction contract proposal contain?
Code of Federal Regulations, 2014 CFR
2014-04-01
... tribal building codes and engineering standards; (4) Structural integrity; (5) Accountability of funds..., standards and methods (including national, regional, state, or tribal building codes or construction... methods (including national, regional, state, or tribal building codes or construction industry standards...
25 CFR 900.125 - What shall a construction contract proposal contain?
Code of Federal Regulations, 2013 CFR
2013-04-01
... tribal building codes and engineering standards; (4) Structural integrity; (5) Accountability of funds..., standards and methods (including national, regional, state, or tribal building codes or construction... methods (including national, regional, state, or tribal building codes or construction industry standards...
25 CFR 900.125 - What shall a construction contract proposal contain?
Code of Federal Regulations, 2011 CFR
2011-04-01
... tribal building codes and engineering standards; (4) Structural integrity; (5) Accountability of funds..., standards and methods (including national, regional, state, or tribal building codes or construction... methods (including national, regional, state, or tribal building codes or construction industry standards...
25 CFR 900.125 - What shall a construction contract proposal contain?
Code of Federal Regulations, 2010 CFR
2010-04-01
... tribal building codes and engineering standards; (4) Structural integrity; (5) Accountability of funds..., standards and methods (including national, regional, state, or tribal building codes or construction... methods (including national, regional, state, or tribal building codes or construction industry standards...
Bioinformatic analysis suggests that the Orbivirus VP6 cistron encodes an overlapping gene
Firth, Andrew E
2008-01-01
Background The genus Orbivirus includes several species that infect livestock – including Bluetongue virus (BTV) and African horse sickness virus (AHSV). These viruses have linear dsRNA genomes divided into ten segments, all of which have previously been assumed to be monocistronic. Results Bioinformatic evidence is presented for a short overlapping coding sequence (CDS) in the Orbivirus genome segment 9, overlapping the VP6 cistron in the +1 reading frame. In BTV, a 77–79 codon AUG-initiated open reading frame (hereafter ORFX) is present in all 48 segment 9 sequences analysed. The pattern of base variations across the 48-sequence alignment indicates that ORFX is subject to functional constraints at the amino acid level (even when the constraints due to coding in the overlapping VP6 reading frame are taken into account; MLOGD software). In fact the translated ORFX shows greater amino acid conservation than the overlapping region of VP6. The ORFX AUG codon has a strong Kozak context in all 48 sequences. Each has only one or two upstream AUG codons, always in the VP6 reading frame, and (with a single exception) always with weak or medium Kozak context. Thus, in BTV, ORFX may be translated via leaky scanning. A long (83–169 codon) ORF is present in a corresponding location and reading frame in all other Orbivirus species analysed except Saint Croix River virus (SCRV; the most divergent). Again, the pattern of base variations across sequence alignments indicates multiple coding in the VP6 and ORFX reading frames. Conclusion At ~9.5 kDa, the putative ORFX product in BTV is too small to appear on most published protein gels. Nonetheless, a review of past literature reveals a number of possible detections. We hope that presentation of this bioinformatic analysis will stimulate an attempt to experimentally verify the expression and functional role of ORFX, and hence lead to a greater understanding of the molecular biology of these important pathogens. PMID:18489030
Frisso, Giulia; Detta, Nicola; Coppola, Pamela; Mazzaccara, Cristina; Pricolo, Maria Rosaria; D'Onofrio, Antonio; Limongelli, Giuseppe; Calabrò, Raffaele; Salvatore, Francesco
2016-11-10
Point mutations are the most common cause of inherited diseases. Bioinformatics tools can help to predict the pathogenicity of mutations found during genetic screening, but they may work less well in determining the effect of point mutations in non-coding regions. In silico analysis of intronic variants can reveal their impact on the splicing process, but the consequence of a given substitution is generally not predictable. The aim of this study was to functionally test five intronic variants ( MYBPC3 -c.506-2A>C, MYBPC3 -c.906-7G>T, MYBPC3 -c.2308+3G>C, SCN5A -c.393-5C>A, and ACTC1 -c.617-7T>C) found in five patients affected by inherited cardiomyopathies in the attempt to verify their pathogenic role. Analysis of the MYBPC3 -c.506-2A>C mutation in mRNA from the peripheral blood of one of the patients affected by hypertrophic cardiac myopathy revealed the loss of the canonical splice site and the use of an alternative splicing site, which caused the loss of the first seven nucleotides of exon 5 ( MYBPC3 -G169AfsX14). In the other four patients, we generated minigene constructs and transfected them in HEK-293 cells. This minigene approach showed that MYBPC3 -c.2308+3G>C and SCN5A -c.393-5C>A altered pre-mRNA processing, thus resulting in the skipping of one exon. No alterations were found in either MYBPC3 -c.906-7G>T or ACTC1 -c.617-7T>C. In conclusion, functional in vitro analysis of the effects of potential splicing mutations can confirm or otherwise the putative pathogenicity of non-coding mutations, and thus help to guide the patient's clinical management and improve genetic counseling in affected families.
Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P
2017-03-01
Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior
2011-09-23
Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz
2016-11-28
Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas .
Putative melatonin receptors in a human biological clock
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reppert, S.M.; Weaver, D.R.; Rivkees, S.A.
In vitro autoradiography with /sup 125/I-labeled melatonin was used to examine melatonin binding sites in human hypothalamus. Specific /sup 125/I-labeled melatonin binding was localized to the suprachiasmatic nuclei, the site of a putative biological clock, and was not apparent in other hypothalamic regions. Specific /sup 125/I-labeled melatonin binding was consistently found in the suprachiasmatic nuclei of hypothalami from adults and fetuses. Densitometric analysis of competition experiments with varying concentrations of melatonin showed monophasic competition curves, with comparable half-maximal inhibition values for the suprachiasmatic nuclei of adults (150 picomolar) and fetuses (110 picomolar). Micromolar concentrations of the melatonin agonist 6-chloromelatonin completelymore » inhibited specific /sup 125/I-labeled melatonin binding, whereas the same concentrations of serotonin and norepinephrine caused only a partial reduction in specific binding. The results suggest that putative melatonin receptors are located in a human biological clock.« less
2011-01-01
Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS) of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii-the diploid source of the wheat D genome, and with a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 was sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. To assess the false positive SNP discovery rate, DNA containing putative SNPs was amplified by PCR from AL8/78 and AS75 and resequenced with the ABI 3730 xl. In a sample of 302 randomly selected putative SNPs, 84.0% in gene regions, 88.0% in repeat junctions, and 81.3% in uncharacterized regions were validated. Conclusion An annotation-based genome-wide SNP discovery pipeline for NGS platforms was developed. The pipeline is suitable for SNP discovery in genomic libraries of complex genomes and does not require a reference genome sequence. The pipeline is applicable to all current NGS platforms, provided that at least one such platform generates relatively long reads. The pipeline package, AGSNP, and the discovered 497,118 Ae. tauschii SNPs can be accessed at (http://avena.pw.usda.gov/wheatD/agsnp.shtml). PMID:21266061
Glubb, Dylan M.; Johnatty, Sharon E.; Quinn, Michael C.J.; O’Mara, Tracy A.; Tyrer, Jonathan P.; Gao, Bo; Fasching, Peter A.; Beckmann, Matthias W.; Lambrechts, Diether; Vergote, Ignace; Velez Edwards, Digna R.; Beeghly-Fadiel, Alicia; Benitez, Javier; Garcia, Maria J.; Goodman, Marc T.; Thompson, Pamela J.; Dörk, Thilo; Dürst, Matthias; Modungo, Francesmary; Moysich, Kirsten; Heitz, Florian; du Bois, Andreas; Pfisterer, Jacobus; Hillemanns, Peter; Karlan, Beth Y.; Lester, Jenny; Goode, Ellen L.; Cunningham, Julie M.; Winham, Stacey J.; Larson, Melissa C.; McCauley, Bryan M.; Kjær, Susanne Krüger; Jensen, Allan; Schildkraut, Joellen M.; Berchuck, Andrew; Cramer, Daniel W.; Terry, Kathryn L.; Salvesen, Helga B.; Bjorge, Line; Webb, Penny M.; Grant, Peter; Pejovic, Tanja; Moffitt, Melissa; Hogdall, Claus K.; Hogdall, Estrid; Paul, James; Glasspool, Rosalind; Bernardini, Marcus; Tone, Alicia; Huntsman, David; Woo, Michelle; Group, AOCS; deFazio, Anna; Kennedy, Catherine J.; Pharoah, Paul D.P.; MacGregor, Stuart; Chenevix-Trench, Georgia
2017-01-01
We previously identified associations with ovarian cancer outcome at five genetic loci. To identify putatively causal genetic variants and target genes, we prioritized two ovarian outcome loci (1q22 and 19p12) for further study. Bioinformatic and functional genetic analyses indicated that MEF2D and ZNF100 are targets of candidate outcome variants at 1q22 and 19p12, respectively. At 19p12, the chromatin interaction of a putative regulatory element with the ZNF100 promoter region correlated with candidate outcome variants. At 1q22, putative regulatory elements enhanced MEF2D promoter activity and haplotypes containing candidate outcome variants modulated these effects. In a public dataset, MEF2D and ZNF100 expression were both associated with ovarian cancer progression-free or overall survival time. In an extended set of 6,162 epithelial ovarian cancer patients, we found that functional candidates at the 1q22 and 19p12 loci, as well as other regional variants, were nominally associated with patient outcome; however, no associations reached our threshold for statistical significance (p<1×10-5). Larger patient numbers will be needed to convincingly identify any true associations at these loci. PMID:29029385
Luis, Luis; Serrano, María Luisa; Hidalgo, Mariana; Mendoza-León, Alexis
2013-01-01
Differential susceptibility to microtubule agents has been demonstrated between mammalian cells and kinetoplastid organisms such as Leishmania spp. and Trypanosoma spp. The aims of this study were to identify and characterize the architecture of the putative colchicine binding site of Leishmania spp. and investigate the molecular basis of colchicine resistance. We cloned and sequenced the β-tubulin gene of Leishmania (Viannia) guyanensis and established the theoretical 3D model of the protein, using the crystallographic structure of the bovine protein as template. We identified mutations on the Leishmania β-tubulin gene sequences on regions related to the putative colchicine-binding pocket, which generate amino acid substitutions and changes in the topology of this region, blocking the access of colchicine. The same mutations were found in the β-tubulin sequence of kinetoplastid organisms such as Trypanosoma cruzi, T. brucei, and T. evansi. Using molecular modelling approaches, we demonstrated that conformational changes include an elongation and torsion of an α-helix structure and displacement to the inside of the pocket of one β-sheet that hinders access of colchicine. We propose that kinetoplastid organisms show resistance to colchicine due to amino acids substitutions that generate structural changes in the putative colchicine-binding domain, which prevent colchicine access. PMID:24083244
Schübbe, Sabrina; Kube, Michael; Scheffel, André; Wawer, Cathrin; Heyen, Udo; Meyerdierks, Anke; Madkour, Mohamed H.; Mayer, Frank; Reinhardt, Richard; Schüler, Dirk
2003-01-01
Frequent spontaneous loss of the magnetic phenotype was observed in stationary-phase cultures of the magnetotactic bacterium Magnetospirillum gryphiswaldense MSR-1. A nonmagnetic mutant, designated strain MSR-1B, was isolated and characterized. The mutant lacked any structures resembling magnetosome crystals as well as internal membrane vesicles. The growth of strain MSR-1B was impaired under all growth conditions tested, and the uptake and accumulation of iron were drastically reduced under iron-replete conditions. A large chromosomal deletion of approximately 80 kb was identified in strain MSR-1B, which comprised both the entire mamAB and mamDC clusters as well as further putative operons encoding a number of magnetosome-associated proteins. A bacterial artificial chromosome clone partially covering the deleted region was isolated from the genomic library of wild-type M. gryphiswaldense. Sequence analysis of this fragment revealed that all previously identified mam genes were closely linked with genes encoding other magnetosome-associated proteins within less than 35 kb. In addition, this region was remarkably rich in insertion elements and harbored a considerable number of unknown gene families which appeared to be specific for magnetotactic bacteria. Overall, these findings suggest the existence of a putative large magnetosome island in M. gryphiswaldense and other magnetotactic bacteria. PMID:13129949
Quarta, Angela; Mita, Giovanni; Durante, Miriana; Arlorio, Marco; De Paolis, Angelo
2013-07-01
The polyphenol oxidase (PPO) enzyme, which can catalyze the oxidation of phenolics to quinones, has been reported to be involved in undesirable browning in many plant foods. This phenomenon is particularly severe in artichoke heads wounded during the manufacturing process. A full-length cDNA encoding for a putative polyphenol oxidase (designated as CsPPO) along with a 1432 bp sequence upstream of the starting ATG codon was characterized for the first time from [Cynara cardunculus var. scolymus (L.) Fiori]. The 1764 bp CsPPO sequence encodes a putative protein of 587 amino acids with a calculated molecular mass of 65,327 Da and an isoelectric point of 5.50. Analysis of the promoter region revealed the presence of cis-acting elements, some of which are putatively involved in the response to light and wounds. Expression analysis of the gene in wounded capitula indicated that CsPPO was significantly induced after 48 h, even though the browning process had started earlier. This suggests that the early browning event observed in artichoke heads was not directly related to de novo mRNA synthesis. Finally, we provide the complete gene sequence encoding for polyphenol oxidase and the upstream regulative region in artichoke. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Jordão, Rita; Campos, Bruno; Lemos, Marco F L; Soares, Amadeu M V M; Tauler, Romà; Barata, Carlos
2016-06-01
Multixenobiotic resistance mechanisms (MXR) were recently identified in Daphnia magna. Previous results characterized gene transcripts of genes encoding and efflux activities of four putative ABCB1 and ABCC transporters that were chemically induced but showed low specificity against model transporter substrates and inhibitors, thus preventing us from distinguishing between activities of different efflux transporter types. In this study we report on the specificity of induction of ABC transporters and of the stress protein hsp70 in clones selected to be genetically resistant to ABCB1 chemical substrates. Clones resistant to mitoxantrone, ivermectin and pentachlorophenol showed distinctive transcriptional responses of transporter protein coding genes and of putative transporter dye activities. Expression of hsp70 proteins also varied across resistant clones. Clones resistant to mitoxantrone and pentachlorophenol showed high constitutive levels of hsp70. Transcriptional levels of the abcb1 gene transporter and of putative dye transporter activity were also induced to a greater extent in the pentachlorophenol resistant clone. Observed higher dye transporter activities in individuals from clones resistant to mitoxantrone and ivermectin were unrelated with transcriptional levels of the studied four abcc and abcb1 transporter genes. These findings suggest that Abcb1 induction in D. magna may be a part of a general cellular stress response. Copyright © 2016 Elsevier B.V. All rights reserved.
Li, You-Hai; Han, Wen-Jin; Gui, Xi-Wu; Wei, Tao; Tang, Shuang-Yan; Jin, Jian-Ming
2016-08-02
Tentoxin, a cyclic tetrapeptide produced by several Alternaria species, inhibits the F₁-ATPase activity of chloroplasts, resulting in chlorosis in sensitive plants. In this study, we report two clustered genes, encoding a putative non-ribosome peptide synthetase (NRPS) TES and a cytochrome P450 protein TES1, that are required for tentoxin biosynthesis in Alternaria alternata strain ZJ33, which was isolated from blighted leaves of Eupatorium adenophorum. Using a pair of primers designed according to the consensus sequences of the adenylation domain of NRPSs, two fragments containing putative adenylation domains were amplified from A. alternata ZJ33, and subsequent PCR analyses demonstrated that these fragments belonged to the same NRPS coding sequence. With no introns, TES consists of a single 15,486 base pair open reading frame encoding a predicted 5161 amino acid protein. Meanwhile, the TES1 gene is predicted to contain five introns and encode a 506 amino acid protein. The TES protein is predicted to be comprised of four peptide synthase modules with two additional N-methylation domains, and the number and arrangement of the modules in TES were consistent with the number and arrangement of the amino acid residues of tentoxin, respectively. Notably, both TES and TES1 null mutants generated via homologous recombination failed to produce tentoxin. This study provides the first evidence concerning the biosynthesis of tentoxin in A. alternata.
Bruque, Carlos D; Delea, Marisol; Fernández, Cecilia S; Orza, Juan V; Taboas, Melisa; Buzzalino, Noemí; Espeche, Lucía D; Solari, Andrea; Luccerini, Verónica; Alba, Liliana; Nadra, Alejandro D; Dain, Liliana
2016-12-14
Congenital adrenal hyperplasia due to 21-hydroxylase deficiency accounts for 90-95% of CAH cases. In this work we performed an extensive survey of mutations and SNPs modifying the coding sequence of the CYP21A2 gene. Using bioinformatic tools and two plausible CYP21A2 structures as templates, we initially classified all known mutants (n = 343) according to their putative functional impacts, which were either reported in the literature or inferred from structural models. We then performed a detailed analysis on the subset of mutations believed to exclusively impact protein stability. For those mutants, the predicted stability was calculated and correlated with the variant's expected activity. A high concordance was obtained when comparing our predictions with available in vitro residual activities and/or the patient's phenotype. The predicted stability and derived activity of all reported mutations and SNPs lacking functional assays (n = 108) were assessed. As expected, most of the SNPs (52/76) showed no biological implications. Moreover, this approach was applied to evaluate the putative synergy that could emerge when two mutations occurred in cis. In addition, we propose a putative pathogenic effect of five novel mutations, p.L107Q, p.L122R, p.R132H, p.P335L and p.H466fs, found in 21-hydroxylase deficient patients of our cohort.
Bruque, Carlos D.; Delea, Marisol; Fernández, Cecilia S.; Orza, Juan V.; Taboas, Melisa; Buzzalino, Noemí; Espeche, Lucía D.; Solari, Andrea; Luccerini, Verónica; Alba, Liliana; Nadra, Alejandro D.; Dain, Liliana
2016-01-01
Congenital adrenal hyperplasia due to 21-hydroxylase deficiency accounts for 90–95% of CAH cases. In this work we performed an extensive survey of mutations and SNPs modifying the coding sequence of the CYP21A2 gene. Using bioinformatic tools and two plausible CYP21A2 structures as templates, we initially classified all known mutants (n = 343) according to their putative functional impacts, which were either reported in the literature or inferred from structural models. We then performed a detailed analysis on the subset of mutations believed to exclusively impact protein stability. For those mutants, the predicted stability was calculated and correlated with the variant’s expected activity. A high concordance was obtained when comparing our predictions with available in vitro residual activities and/or the patient’s phenotype. The predicted stability and derived activity of all reported mutations and SNPs lacking functional assays (n = 108) were assessed. As expected, most of the SNPs (52/76) showed no biological implications. Moreover, this approach was applied to evaluate the putative synergy that could emerge when two mutations occurred in cis. In addition, we propose a putative pathogenic effect of five novel mutations, p.L107Q, p.L122R, p.R132H, p.P335L and p.H466fs, found in 21-hydroxylase deficient patients of our cohort. PMID:27966633
The putative visual word form area is functionally connected to the dorsal attention network.
Vogel, Alecia C; Miezin, Fran M; Petersen, Steven E; Schlaggar, Bradley L
2012-03-01
The putative visual word form area (pVWFA) is the most consistently activated region in single word reading studies (i.e., Vigneau et al. 2006), yet its function remains a matter of debate. The pVWFA may be predominantly used in reading or it could be a more general visual processor used in reading but also in other visual tasks. Here, resting-state functional connectivity magnetic resonance imaging (rs-fcMRI) is used to characterize the functional relationships of the pVWFA to help adjudicate between these possibilities. rs-fcMRI defines relationships based on correlations in slow fluctuations of blood oxygen level-dependent activity occurring at rest. In this study, rs-fcMRI correlations show little relationship between the pVWFA and reading-related regions but a strong relationship between the pVWFA and dorsal attention regions thought to be related to spatial and feature attention. The rs-fcMRI correlations between the pVWFA and regions of the dorsal attention network increase with age and reading skill, while the correlations between the pVWFA and reading-related regions do not. These results argue the pVWFA is not used predominantly in reading but is a more general visual processor used in other visual tasks, as well as reading.
The Putative Visual Word Form Area Is Functionally Connected to the Dorsal Attention Network
Miezin, Fran M.; Petersen, Steven E.; Schlaggar, Bradley L.
2012-01-01
The putative visual word form area (pVWFA) is the most consistently activated region in single word reading studies (i.e., Vigneau et al. 2006), yet its function remains a matter of debate. The pVWFA may be predominantly used in reading or it could be a more general visual processor used in reading but also in other visual tasks. Here, resting-state functional connectivity magnetic resonance imaging (rs-fcMRI) is used to characterize the functional relationships of the pVWFA to help adjudicate between these possibilities. rs-fcMRI defines relationships based on correlations in slow fluctuations of blood oxygen level–dependent activity occurring at rest. In this study, rs-fcMRI correlations show little relationship between the pVWFA and reading-related regions but a strong relationship between the pVWFA and dorsal attention regions thought to be related to spatial and feature attention. The rs-fcMRI correlations between the pVWFA and regions of the dorsal attention network increase with age and reading skill, while the correlations between the pVWFA and reading-related regions do not. These results argue the pVWFA is not used predominantly in reading but is a more general visual processor used in other visual tasks, as well as reading. PMID:21690259
Reiman, Eric M.; Chen, Kewei; Caselli, Richard J.; Alexander, Gene E.; Bandy, Daniel; Adamson, Jennifer L.; Lee, Wendy; Cannon, Ashley; Stephan, Elizabeth A.; Stephan, Dietrich A.; Papassotiropoulos, Andreas
2008-01-01
We recently implicated a cluster of nine single nucleotide polymorphisms from seven cholesterol-related genes in the risk of Alzheimer’s disease (AD) in a European cohort, and we proposed calculating an aggregate cholesterol-related genetic score (CREGS) to characterize a person’s risk. In a separate study, we found that apolipoprotein E (APOE) ε4 gene dose, an established AD risk factor, was correlated with fluorodeoxyglucose (FDG) positron emission tomography (PET) measurements of hypometabolism in AD-affected brain regions in a cognitively normal American cohort, and we proposed using PET as a presymptomatic endophenotype to help assess putative modifiers of AD risk. Thus, the objective in the present study is to determine whether CREGS is related to PET measurements of hypometabolism in AD-affected brain regions. DNA and PET data from 141 cognitively normal late middle-aged APOE ε4 homozygotes, heterozygotes and non-carriers were analyzed to evaluate the relationship between CREGS and regional PET measurements. Cholesterol-related genetic risk scores were associated with hypometabolism in AD-affected brain regions, even when controlling for the effects of APOE ε4 gene dose. The results support the role of cholesterol-related genes in the predisposition to AD, and support the value of neuroimaging in the presymptomatic assessment of putative modifiers of AD risk. PMID:18280754
A motion compensation technique using sliced blocks and its application to hybrid video coding
NASA Astrophysics Data System (ADS)
Kondo, Satoshi; Sasai, Hisao
2005-07-01
This paper proposes a new motion compensation method using "sliced blocks" in DCT-based hybrid video coding. In H.264 ? MPEG-4 Advance Video Coding, a brand-new international video coding standard, motion compensation can be performed by splitting macroblocks into multiple square or rectangular regions. In the proposed method, on the other hand, macroblocks or sub-macroblocks are divided into two regions (sliced blocks) by an arbitrary line segment. The result is that the shapes of the segmented regions are not limited to squares or rectangles, allowing the shapes of the segmented regions to better match the boundaries between moving objects. Thus, the proposed method can improve the performance of the motion compensation. In addition, adaptive prediction of the shape according to the region shape of the surrounding macroblocks can reduce overheads to describe shape information in the bitstream. The proposed method also has the advantage that conventional coding techniques such as mode decision using rate-distortion optimization can be utilized, since coding processes such as frequency transform and quantization are performed on a macroblock basis, similar to the conventional coding methods. The proposed method is implemented in an H.264-based P-picture codec and an improvement in bit rate of 5% is confirmed in comparison with H.264.
2011-01-01
Background Human Rhinoviruses (HRVs) are well recognized viral pathogens associated with acute respiratory tract illnesses (RTIs) abundant worldwide. Although recent studies have phylogenetically identified the new HRV species (HRV-C), data on molecular epidemiology, genetic diversity, and clinical manifestation have been limited. Result To gain new insight into HRV genetic diversity, we determined the complete coding sequences of putative new members of HRV species C (HRV-CU072 with 1% prevalence) and HRV-B (HRV-CU211) identified from clinical specimens collected from pediatric patients diagnosed with a symptom of acute lower RTI. Complete coding sequence and phylogenetic analysis revealed that the HRV-CU072 strain shared a recent common ancestor with most closely related Chinese strain (N4). Comparative analysis at the protein level showed that HRV-CU072 might accumulate substitutional mutations in structural proteins, as well as nonstructural proteins 3C and 3 D. Comparative analysis of all available HRVs and HEVs indicated that HRV-C contains a relatively high G+C content and is more closely related to HEV-D. This might be correlated to their replication and capability to adapt to the high temperature environment of the human lower respiratory tract. We herein report an infrequently occurring intra-species recombination event in HRV-B species (HRV-CU211) with a crossing over having taken place at the boundary of VP2 and VP3 genes. Moreover, we observed phylogenetic compatibility in all HRV species and suggest that dynamic mechanisms for HRV evolution seem to be related to recombination events. These findings indicated that the elementary units shaping the genetic diversity of HRV-C could be found in the nonstructural 2A and 3D genes. Conclusion This study provides information for understanding HRV genetic diversity and insight into the role of selection pressure and recombination mechanisms influencing HRV evolution. PMID:21214911
Linsuwanon, Piyada; Payungporn, Sunchai; Suwannakarn, Kamol; Chieochansin, Thaweesak; Theamboonlers, Apiradee; Poovorawan, Yong
2011-01-07
Human Rhinoviruses (HRVs) are well recognized viral pathogens associated with acute respiratory tract illnesses (RTIs) abundant worldwide. Although recent studies have phylogenetically identified the new HRV species (HRV-C), data on molecular epidemiology, genetic diversity, and clinical manifestation have been limited. To gain new insight into HRV genetic diversity, we determined the complete coding sequences of putative new members of HRV species C (HRV-CU072 with 1% prevalence) and HRV-B (HRV-CU211) identified from clinical specimens collected from pediatric patients diagnosed with a symptom of acute lower RTI. Complete coding sequence and phylogenetic analysis revealed that the HRV-CU072 strain shared a recent common ancestor with most closely related Chinese strain (N4). Comparative analysis at the protein level showed that HRV-CU072 might accumulate substitutional mutations in structural proteins, as well as nonstructural proteins 3C and 3 D. Comparative analysis of all available HRVs and HEVs indicated that HRV-C contains a relatively high G+C content and is more closely related to HEV-D. This might be correlated to their replication and capability to adapt to the high temperature environment of the human lower respiratory tract. We herein report an infrequently occurring intra-species recombination event in HRV-B species (HRV-CU211) with a crossing over having taken place at the boundary of VP2 and VP3 genes. Moreover, we observed phylogenetic compatibility in all HRV species and suggest that dynamic mechanisms for HRV evolution seem to be related to recombination events. These findings indicated that the elementary units shaping the genetic diversity of HRV-C could be found in the nonstructural 2A and 3D genes. This study provides information for understanding HRV genetic diversity and insight into the role of selection pressure and recombination mechanisms influencing HRV evolution.
Al-Tobasei, Rafet; Ali, Ali; Leeds, Timothy D; Liu, Sixin; Palti, Yniv; Kenney, Brett; Salem, Mohamed
2017-08-07
Coding/functional SNPs change the biological function of a gene and, therefore, could serve as "large-effect" genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7-93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout.
Valenzuela-Muñoz, Valentina; Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian
2018-05-24
The increasing capacity of transcriptomic analysis by high throughput sequencing has highlighted the presence of a large proportion of transcripts that do not encode proteins. In particular, long non-coding RNAs (lncRNAs) are sequences with low coding potential and conservation among species. Moreover, cumulative evidence has revealed important roles in post-transcriptional gene modulation in several taxa. In fish, the role of lncRNAs has been scarcely studied and even less so during the immune response against sea lice. In the present study we mined for lncRNAs in Atlantic salmon (Salmo salar) and Coho salmon (Oncorhynkus kisutch), which are affected by the sea louse Caligus rogercresseyi, evaluating the degree of sequence conservation between these two fish species and their putative roles during the infection process. Herein, Atlantic and Coho salmon were infected with 35 lice/fish and evaluated after 7 and 14 days post-infestation (dpi). For RNA sequencing, samples from skin and head kidney were collected. A total of 5658/4140 and 3678/2123 lncRNAs were identified in uninfected/infected Atlantic and Coho salmon transcriptomes, respectively. Species-specific transcription patterns were observed in exclusive lncRNAs according to the tissue analyzed. Furthermore, neighbor gene GO enrichment analysis of the top 100 highly regulated lncRNAs in Atlantic salmon showed that lncRNAs were localized near genes related to the immune response. On the other hand, in Coho salmon the highly regulated lncRNAs were localized near genes involved in tissue repair processes. This study revealed high regulation of lncRNAs closely localized to immune and tissue repair-related genes in Atlantic and Coho salmon, respectively, suggesting putative roles for lncRNAs in salmon against sea lice infestation. Copyright © 2018 Elsevier Ltd. All rights reserved.
The Glucuronic Acid Utilization Gene Cluster from Bacillus stearothermophilus T-6
Shulami, Smadar; Gat, Orit; Sonenshein, Abraham L.; Shoham, Yuval
1999-01-01
A λ-EMBL3 genomic library of Bacillus stearothermophilus T-6 was screened for hemicellulolytic activities, and five independent clones exhibiting β-xylosidase activity were isolated. The clones overlap each other and together represent a 23.5-kb chromosomal segment. The segment contains a cluster of xylan utilization genes, which are organized in at least three transcriptional units. These include the gene for the extracellular xylanase, xylanase T-6; part of an operon coding for an intracellular xylanase and a β-xylosidase; and a putative 15.5-kb-long transcriptional unit, consisting of 12 genes involved in the utilization of α-d-glucuronic acid (GlcUA). The first four genes in the potential GlcUA operon (orf1, -2, -3, and -4) code for a putative sugar transport system with characteristic components of the binding-protein-dependent transport systems. The most likely natural substrate for this transport system is aldotetraouronic acid [2-O-α-(4-O-methyl-α-d-glucuronosyl)-xylotriose] (MeGlcUAXyl3). The following two genes code for an intracellular α-glucuronidase (aguA) and a β-xylosidase (xynB). Five more genes (kdgK, kdgA, uxaC, uxuA, and uxuB) encode proteins that are homologous to enzymes involved in galacturonate and glucuronate catabolism. The gene cluster also includes a potential regulatory gene, uxuR, the product of which resembles repressors of the GntR family. The apparent transcriptional start point of the cluster was determined by primer extension analysis and is located 349 bp from the initial ATG codon. The potential operator site is a perfect 12-bp inverted repeat located downstream from the promoter between nucleotides +170 and +181. Gel retardation assays indicated that UxuR binds specifically to this sequence and that this binding is efficiently prevented in vitro by MeGlcUAXyl3, the most likely molecular inducer. PMID:10368143
Scaling features of noncoding DNA
NASA Technical Reports Server (NTRS)
Stanley, H. E.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.
1999-01-01
We review evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene, and utilize this fact to build a Coding Sequence Finder Algorithm, which uses statistical ideas to locate the coding regions of an unknown DNA sequence. Finally, we describe briefly some recent work adapting to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function, and reporting that noncoding regions in eukaryotes display a larger redundancy than coding regions. Specifically, we consider the possibility that this result is solely a consequence of nucleotide concentration differences as first noted by Bonhoeffer and his collaborators. We find that cytosine-guanine (CG) concentration does have a strong "background" effect on redundancy. However, we find that for the purine-pyrimidine binary mapping rule, which is not affected by the difference in CG concentration, the Shannon redundancy for the set of analyzed sequences is larger for noncoding regions compared to coding regions.
Biolistic transformation of Carrizo citrange (Citrus sinensis Osb. × Poncirus trifoliata L. Raf.).
Wu, Hao; Acanda, Yosvanis; Jia, Hongge; Wang, Nian; Zale, Janice
2016-09-01
The development of transgenic citrus plants by the biolistic method. A protocol for the biolistic transformation of epicotyl explants and transgenic shoot regeneration of immature citrange rootstock, cv. Carrizo (Citrus sinensis Osb. × Poncirus trifoliata L. Raf.) and plant regeneration is described. Immature epicotyl explants were bombarded with a vector containing the nptII selectable marker and the gfp reporter. The number of independent, stably transformed tissues/total number of explants, recorded by monitoring GFP fluorescence 4 weeks after bombardment was substantial at 18.4 %, and some fluorescing tissues regenerated into shoots. Fluorescing GFP, putative transgenic shoots were micro-grafted onto immature Carrizo rootstocks in vitro, confirmed by PCR amplification of nptII and gfp coding regions, followed by secondary grafting onto older rootstocks grown in soil. Southern blot analysis indicated that all the fluorescing shoots were transgenic. Multiple and single copies of nptII integrations were confirmed in five regenerated transgenic lines. There is potential to develop a higher throughput biolistics transformation system by optimizing the tissue culture medium to improve shoot regeneration and narrowing the window for plant sampling. This system will be appropriate for transformation with minimal cassettes.
Biotype-specific tcpA genes in Vibrio cholerae.
Iredell, J R; Manning, P A
1994-08-01
The tcpA gene, encoding the structural subunit of the toxin-coregulated pilus, has been isolated from a variety of clinical isolates of Vibrio cholerae, and the nucleotide sequence determined. Strict biotype-specific conservation within both the coding and putative regulatory regions was observed, with important differences between the El Tor and classical biotypes. V. cholerae O139 Bengal strains appear to have El Tor-type tcpA genes. Environmental O1 and non-O1 isolates have sequences that bind an El Tor-specific tcpA DNA probe and that are weakly and variably amplified by tcpA-specific polymerase chain reaction primers, under conditions of reduced stringency. The data presented allow the selection of primer pairs to help distinguish between clinical and environmental isolates, and to distinguish El Tor (and Bengal) biotypes from classical biotypes of V. cholerae. While the role of TcpA in cholera vaccine preparations remains unclear, the data strongly suggest that TcpA-containing vaccines directed at O1 strains need include only the two forms of TcpA, and that such vaccines directed at (O139) Bengal strains should include the TcpA of El Tor biotype.
CID-miRNA: A web server for prediction of novel miRNA precursors in human genome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tyagi, Sonika; Vaz, Candida; Gupta, Vipin
2008-08-08
microRNAs (miRNA) are a class of non-protein coding functional RNAs that are thought to regulate expression of target genes by direct interaction with mRNAs. miRNAs have been identified through both experimental and computational methods in a variety of eukaryotic organisms. Though these approaches have been partially successful, there is a need to develop more tools for detection of these RNAs as they are also thought to be present in abundance in many genomes. In this report we describe a tool and a web server, named CID-miRNA, for identification of miRNA precursors in a given DNA sequence, utilising secondary structure-based filteringmore » systems and an algorithm based on stochastic context free grammar trained on human miRNAs. CID-miRNA analyses a given sequence using a web interface, for presence of putative miRNA precursors and the generated output lists all the potential regions that can form miRNA-like structures. It can also scan large genomic sequences for the presence of potential miRNA precursors in its stand-alone form. The web server can be accessed at (http://mirna.jnu.ac.in/cidmirna/)« less
Alterations in CDH15 and KIRREL3 in Patients with Mild to Severe Intellectual Disability
Bhalla, Kavita; Luo, Yue; Buchan, Tim; Beachem, Michael A.; Guzauskas, Gregory F.; Ladd, Sydney; Bratcher, Shelly J.; Schroer, Richard J.; Balsamo, Janne; DuPont, Barbara R.; Lilien, Jack; Srivastava, Anand K.
2008-01-01
Cell-adhesion molecules play critical roles in brain development, as well as maintaining synaptic structure, function, and plasticity. Here we have found the disruption of two genes encoding putative cell-adhesion molecules, CDH15 (cadherin superfamily) and KIRREL3 (immunoglobulin superfamily), by a chromosomal translocation t(11;16) in a female patient with intellectual disability (ID). We screened coding regions of these two genes in a cohort of patients with ID and controls and identified four nonsynonymous CDH15 variants and three nonsynonymous KIRREL3 variants that appear rare and unique to ID. These variations altered highly conserved residues and were absent in more than 600 unrelated patients with ID and 800 control individuals. Furthermore, in vivo expression studies showed that three of the CDH15 variations adversely altered its ability to mediate cell-cell adhesion. We also show that in neuronal cells, human KIRREL3 colocalizes and interacts with the synaptic scaffolding protein, CASK, recently implicated in X-linked brain malformation and ID. Taken together, our data suggest that alterations in CDH15 and KIRREL3, either alone or in combination with other factors, could play a role in phenotypic expression of ID in some patients. PMID:19012874
A global assembly of cotton ESTs
Udall, Joshua A.; Swanson, Jordan M.; Haller, Karl; Rapp, Ryan A.; Sparks, Michael E.; Hatfield, Jamie; Yu, Yeisoo; Wu, Yingru; Dowd, Caitriona; Arpat, Aladdin B.; Sickler, Brad A.; Wilkins, Thea A.; Guo, Jin Ying; Chen, Xiao Ya; Scheffler, Jodi; Taliercio, Earl; Turley, Ricky; McFadden, Helen; Payton, Paxton; Klueva, Natalya; Allen, Randell; Zhang, Deshui; Haigler, Candace; Wilkerson, Curtis; Suo, Jinfeng; Schulze, Stefan R.; Pierce, Margaret L.; Essenberg, Margaret; Kim, HyeRan; Llewellyn, Danny J.; Dennis, Elizabeth S.; Kudrna, David; Wing, Rod; Paterson, Andrew H.; Soderlund, Cari; Wendel, Jonathan F.
2006-01-01
Approximately 185,000 Gossypium EST sequences comprising >94,800,000 nucleotides were amassed from 30 cDNA libraries constructed from a variety of tissues and organs under a range of conditions, including drought stress and pathogen challenges. These libraries were derived from allopolyploid cotton (Gossypium hirsutum; AT and DT genomes) as well as its two diploid progenitors, Gossypium arboreum (A genome) and Gossypium raimondii (D genome). ESTs were assembled using the Program for Assembling and Viewing ESTs (PAVE), resulting in 22,030 contigs and 29,077 singletons (51,107 unigenes). Further comparisons among the singletons and contigs led to recognition of 33,665 exemplar sequences that represent a nonredundant set of putative Gossypium genes containing partial or full-length coding regions and usually one or two UTRs. The assembly, along with their UniProt BLASTX hits, GO annotation, and Pfam analysis results, are freely accessible as a public resource for cotton genomics. Because ESTs from diploid and allotetraploid Gossypium were combined in a single assembly, we were in many cases able to bioinformatically distinguish duplicated genes in allotetraploid cotton and assign them to either the A or D genome. The assembly and associated information provide a framework for future investigation of cotton functional and evolutionary genomics. PMID:16478941
Spectrum of mutations in leiomyosarcomas identified by clinical targeted next-generation sequencing.
Lee, Paul J; Yoo, Naomi S; Hagemann, Ian S; Pfeifer, John D; Cottrell, Catherine E; Abel, Haley J; Duncavage, Eric J
2017-02-01
Recurrent genomic mutations in uterine and non-uterine leiomyosarcomas have not been well established. Using a next generation sequencing (NGS) panel of common cancer-associated genes, 25 leiomyosarcomas arising from multiple sites were examined to explore genetic alterations, including single nucleotide variants (SNV), small insertions/deletions (indels), and copy number alterations (CNA). Sequencing showed 86 non-synonymous, coding region somatic variants within 151 gene targets in 21 cases, with a mean of 4.1 variants per case; 4 cases had no putative mutations in the panel of genes assayed. The most frequently altered genes were TP53 (36%), ATM and ATRX (16%), and EGFR and RB1 (12%). CNA were identified in 85% of cases, with the most frequent copy number losses observed in chromosomes 10 and 13 including PTEN and RB1; the most frequent gains were seen in chromosomes 7 and 17. Our data show that deletions in canonical cancer-related genes are common in leiomyosarcomas. Further, the spectrum of gene mutations observed shows that defects in DNA repair and chromosomal maintenance are central to the biology of leiomyosarcomas, and that activating mutations observed in other common cancer types are rare in leiomyosarcomas. Copyright © 2017 Elsevier Inc. All rights reserved.
Comprehensive evolutionary and phylogenetic analysis of Hepacivirus N (HNV).
da Silva, M S; Junqueira, D M; Baumbach, L F; Cibulski, S P; Mósena, A C S; Weber, M N; Silveira, S; de Moraes, G M; Maia, R D; Coimbra, V C S; Canal, C W
2018-05-24
Hepaciviruses (HVs) have been detected in several domestic and wild animals and present high genetic diversity. The actual classification divides the genus Hepacivirus into 14 species (A-N), according to their phylogenetic relationships, including the bovine hepacivirus [Hepacivirus N (HNV)]. In this study, we confirmed HNV circulation in Brazil and sequenced the whole genome of two strains. Based on the current classification of HCV, which is divided into genotypes and subtypes, we analysed all available bovine hepacivirus sequences in the GenBank database and proposed an HNV classification. All of the sequences were grouped into a single genotype, putatively named 'genotype 1'. This genotype can be clearly divided into four subtypes: A and D containing sequences from Germany and Brazil, respectively, and B and C containing Ghanaian sequences. In addition, the NS3-coding region was used to estimate the time to the most recent common ancestor (TMRCA) of each subtype, using a Bayesian approach and a relaxed molecular clock model. The analyses indicated a common origin of the virus circulating in Germany and Brazil. Ghanaian sequences seemed to have an older TMRCA, indicating a long time of circulation of these viruses in the African continent.
Simpson, A J; Reinach, F C; Arruda, P; Abreu, F A; Acencio, M; Alvarenga, R; Alves, L M; Araya, J E; Baia, G S; Baptista, C S; Barros, M H; Bonaccorsi, E D; Bordin, S; Bové, J M; Briones, M R; Bueno, M R; Camargo, A A; Camargo, L E; Carraro, D M; Carrer, H; Colauto, N B; Colombo, C; Costa, F F; Costa, M C; Costa-Neto, C M; Coutinho, L L; Cristofani, M; Dias-Neto, E; Docena, C; El-Dorry, H; Facincani, A P; Ferreira, A J; Ferreira, V C; Ferro, J A; Fraga, J S; França, S C; Franco, M C; Frohme, M; Furlan, L R; Garnier, M; Goldman, G H; Goldman, M H; Gomes, S L; Gruber, A; Ho, P L; Hoheisel, J D; Junqueira, M L; Kemper, E L; Kitajima, J P; Krieger, J E; Kuramae, E E; Laigret, F; Lambais, M R; Leite, L C; Lemos, E G; Lemos, M V; Lopes, S A; Lopes, C R; Machado, J A; Machado, M A; Madeira, A M; Madeira, H M; Marino, C L; Marques, M V; Martins, E A; Martins, E M; Matsukuma, A Y; Menck, C F; Miracca, E C; Miyaki, C Y; Monteriro-Vitorello, C B; Moon, D H; Nagai, M A; Nascimento, A L; Netto, L E; Nhani, A; Nobrega, F G; Nunes, L R; Oliveira, M A; de Oliveira, M C; de Oliveira, R C; Palmieri, D A; Paris, A; Peixoto, B R; Pereira, G A; Pereira, H A; Pesquero, J B; Quaggio, R B; Roberto, P G; Rodrigues, V; de M Rosa, A J; de Rosa, V E; de Sá, R G; Santelli, R V; Sawasaki, H E; da Silva, A C; da Silva, A M; da Silva, F R; da Silva, W A; da Silveira, J F; Silvestri, M L; Siqueira, W J; de Souza, A A; de Souza, A P; Terenzi, M F; Truffi, D; Tsai, S M; Tsuhako, M H; Vallada, H; Van Sluys, M A; Verjovski-Almeida, S; Vettore, A L; Zago, M A; Zatz, M; Meidanis, J; Setubal, J C
2000-07-13
Xylella fastidiosa is a fastidious, xylem-limited bacterium that causes a range of economically important plant diseases. Here we report the complete genome sequence of X. fastidiosa clone 9a5c, which causes citrus variegated chlorosis--a serious disease of orange trees. The genome comprises a 52.7% GC-rich 2,679,305-base-pair (bp) circular chromosome and two plasmids of 51,158 bp and 1,285 bp. We can assign putative functions to 47% of the 2,904 predicted coding regions. Efficient metabolic functions are predicted, with sugars as the principal energy and carbon source, supporting existence in the nutrient-poor xylem sap. The mechanisms associated with pathogenicity and virulence involve toxins, antibiotics and ion sequestration systems, as well as bacterium-bacterium and bacterium-host interactions mediated by a range of proteins. Orthologues of some of these proteins have only been identified in animal and human pathogens; their presence in X. fastidiosa indicates that the molecular basis for bacterial pathogenicity is both conserved and independent of host. At least 83 genes are bacteriophage-derived and include virulence-associated genes from other bacteria, providing direct evidence of phage-mediated horizontal gene transfer.
Bhatia, Shipra; Gordon, Christopher T.; Foster, Robert G.; Melin, Lucie; Abadie, Véronique; Baujat, Geneviève; Vazquez, Marie-Paule; Amiel, Jeanne; Lyonnet, Stanislas; van Heyningen, Veronica; Kleinjan, Dirk A.
2015-01-01
Disruption of gene regulation by sequence variation in non-coding regions of the genome is now recognised as a significant cause of human disease and disease susceptibility. Sequence variants in cis-regulatory elements (CREs), the primary determinants of spatio-temporal gene regulation, can alter transcription factor binding sites. While technological advances have led to easy identification of disease-associated CRE variants, robust methods for discerning functional CRE variants from background variation are lacking. Here we describe an efficient dual-colour reporter transgenesis approach in zebrafish, simultaneously allowing detailed in vivo comparison of spatio-temporal differences in regulatory activity between putative CRE variants and assessment of altered transcription factor binding potential of the variant. We validate the method on known disease-associated elements regulating SHH, PAX6 and IRF6 and subsequently characterise novel, ultra-long-range SOX9 enhancers implicated in the craniofacial abnormality Pierre Robin Sequence. The method provides a highly cost-effective, fast and robust approach for simultaneously unravelling in a single assay whether, where and when in embryonic development a disease-associated CRE-variant is affecting its regulatory function. PMID:26030420
Rabara, Roel C; Tripathi, Prateek; Lin, Jun; Rushton, Paul J
2013-02-15
Drought is one of the important environmental factors affecting crop production worldwide and therefore understanding the molecular response of plant to stress is an important step in crop improvement. WRKY transcription factors are one of the 10 largest transcription factor families across the green lineage. In this study, highly upregulated dehydration-induced WRKY and enzyme-coding genes from tobacco and soybean were selected from microarray data for promoter analyses. Putative stress-related cis-regulatory elements such as TGACG motif, ABRE-like elements; W and G-like sequences were identified by an in silico analyses of promoter region of the selected genes. GFP quantification of transgenic BY-2 cell culture showed these promoters direct higher expression in-response to 100 μM JA treatment compared to 100 μM ABA, 10% PEG and 85 mM NaCl treatments. Thus promoter activity upon JA treatment and enrichment of MeJA-responsive elements in the promoter of the selected genes provides insights for these genes to be jasmonic acid responsive with potential of mediating cross-talk during dehydration responses. Copyright © 2013 Elsevier Inc. All rights reserved.
A Third Approach to Gene Prediction Suggests Thousands of Additional Human Transcribed Regions
Glusman, Gustavo; Qin, Shizhen; El-Gewely, M. Raafat; Siegel, Andrew F; Roach, Jared C; Hood, Leroy; Smit, Arian F. A
2006-01-01
The identification and characterization of the complete ensemble of genes is a main goal of deciphering the digital information stored in the human genome. Many algorithms for computational gene prediction have been described, ultimately derived from two basic concepts: (1) modeling gene structure and (2) recognizing sequence similarity. Successful hybrid methods combining these two concepts have also been developed. We present a third orthogonal approach to gene prediction, based on detecting the genomic signatures of transcription, accumulated over evolutionary time. We discuss four algorithms based on this third concept: Greens and CHOWDER, which quantify mutational strand biases caused by transcription-coupled DNA repair, and ROAST and PASTA, which are based on strand-specific selection against polyadenylation signals. We combined these algorithms into an integrated method called FEAST, which we used to predict the location and orientation of thousands of putative transcription units not overlapping known genes. Many of the newly predicted transcriptional units do not appear to code for proteins. The new algorithms are particularly apt at detecting genes with long introns and lacking sequence conservation. They therefore complement existing gene prediction methods and will help identify functional transcripts within many apparent “genomic deserts.” PMID:16543943
Naum-Onganía, Gabriela; Gago-Zachert, Selma; Peña, Eduardo; Grau, Oscar; Garcia, Maria Laura
2003-10-01
Citrus psorosis virus (CPsV), the type member of genus Ophiovirus, has three genomic RNAs. Complete sequencing of CPsV RNA 1 revealed a size of 8184 nucleotides and Northern blot hybridization with chain specific probes showed that its non-coding strand is preferentially encapsidated. The complementary strand of RNA 1 contains two open reading frames (ORFs) separated by a 109-nt intergenic region, one located near the 5'-end potentially encoding a 24K protein of unknown function, and another of 280K containing the core polymerase motifs characteristic of viral RNA-dependent RNA polymerases (RdRp). Comparison of the core RdRp motifs of negative-stranded RNA viruses, supports grouping CPsV, Ranunculus white mottle virus (RWMV) and Mirafiori lettuce virus (MiLV) within the same genus (Ophiovirus), constituting a monophyletic group separated from all other negative-stranded RNA viruses. Furthermore, RNAs 1 of MiLV, CPsV and RWMV are similar in size and those of MiLV and CPsV also in genomic organization and sequence.
Avian sarcoma virus 17 carries the jun oncogene.
Maki, Y; Bos, T J; Davis, C; Starbuck, M; Vogt, P K
1987-01-01
Biologically active molecular clones of avian sarcoma virus 17 (ASV 17) contain a replication-defective proviral genome of 3.5 kilobases (kb). The genome retains partial gag and env sequences, which flank a cell-derived putative oncogene of 0.93 kb, termed jun. The jun gene lacks preserved coding domains of tyrosine-specific protein kinases. It also shows no significant nucleic acid homology with other known oncogenes. The probable transformation-specific protein in ASV 17-transformed cells is a 55-kDa gag-jun fusion product. Images PMID:3033666
Bachman, Peter; Reichenberg, Abraham; Rice, Patrick; Woolsey, Mary; Chaves, Olga; Martinez, David; Maples, Natalie; Velligan, Dawn I; Glahn, David C
2010-05-01
Cognitive processing inefficiency, often measured using digit symbol coding tasks, is a putative vulnerability marker for schizophrenia and a reliable indicator of illness severity and functional outcome. Indeed, performance on the digit symbol coding task may be the most severe neuropsychological deficit patients with schizophrenia display at the group level. Yet, little is known about the contributions of simpler cognitive processes to coding performance in schizophrenia (e.g. decision making, visual scanning, relational memory, motor ability). We developed an experimental behavioral task, based on a computerized digit symbol coding task, which allows the manipulation of demands placed on visual scanning efficiency and relational memory while holding decisional and motor requirements constant. Although patients (n=85) were impaired on all aspects of the task when compared to demographically matched healthy comparison subjects (n=30), they showed a particularly striking failure to benefit from the presence of predictable target information. These findings are consistent with predicted impairments in cognitive processing speed due to schizophrenia patients' well-known memory impairment, suggesting that this mnemonic deficit may have consequences for critical aspects of information processing that are traditionally considered quite separate from the memory domain. Future investigation into the mechanisms underlying the wide-ranging consequences of mnemonic deficits in schizophrenia should provide additional insight. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Task-Based Core-Periphery Organization of Human Brain Dynamics
Bassett, Danielle S.; Wymbs, Nicholas F.; Rombach, M. Puck; Porter, Mason A.; Mucha, Peter J.; Grafton, Scott T.
2013-01-01
As a person learns a new skill, distinct synapses, brain regions, and circuits are engaged and change over time. In this paper, we develop methods to examine patterns of correlated activity across a large set of brain regions. Our goal is to identify properties that enable robust learning of a motor skill. We measure brain activity during motor sequencing and characterize network properties based on coherent activity between brain regions. Using recently developed algorithms to detect time-evolving communities, we find that the complex reconfiguration patterns of the brain's putative functional modules that control learning can be described parsimoniously by the combined presence of a relatively stiff temporal core that is composed primarily of sensorimotor and visual regions whose connectivity changes little in time and a flexible temporal periphery that is composed primarily of multimodal association regions whose connectivity changes frequently. The separation between temporal core and periphery changes over the course of training and, importantly, is a good predictor of individual differences in learning success. The core of dynamically stiff regions exhibits dense connectivity, which is consistent with notions of core-periphery organization established previously in social networks. Our results demonstrate that core-periphery organization provides an insightful way to understand how putative functional modules are linked. This, in turn, enables the prediction of fundamental human capacities, including the production of complex goal-directed behavior. PMID:24086116
Dutta, Usha R; Hansmann, Ingo; Schlote, Dietmar
2015-03-01
Short stature refers to the height of an individual which is below expected. The causes are heterogenous and influenced by several genetic and environmental factors. Chromosomal abnormalities are a major cause of diseases and cytogenetic mapping is one of the powerful tools for the identification of novel disease genes. Here we report a three generation family with a heterozygous pericentric inversion of 46, XX, inv(3) (p24.1q26.1) associated with Short stature. Positional cloning strategy was used to physically map the breakpoint regions by Fluorescence in situ hybridization (FISH). Fine mapping was performed with Bacterial Artificial Chromosome (BAC) clones spanning the breakpoint regions. In order to further characterize the breakpoint regions extensive molecular mapping was carried out with the breakpoint spanning BACs which narrowed down the breakpoint region to 2.9 kb and 5.3 kb regions on p and q arm respectively. Although these breakpoints did not disrupt any validated genes, we had identified a novel putative gene in the vicinity of 3q26.1 breakpoint region by in silico analysis. Trying to find the presence of any transcripts of this putative gene we analyzed human total RNA by RT-PCR and identified transcripts containing three new exons confirming the existence of a so far unknown gene close to the 3q breakpoint. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...
2014-10-02
Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui
Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less
Competitive region orientation code for palmprint verification and identification
NASA Astrophysics Data System (ADS)
Tang, Wenliang
2015-11-01
Orientation features of the palmprint have been widely investigated in coding-based palmprint-recognition methods. Conventional orientation-based coding methods usually used discrete filters to extract the orientation feature of palmprint. However, in real operations, the orientations of the filter usually are not consistent with the lines of the palmprint. We thus propose a competitive region orientation-based coding method. Furthermore, an effective weighted balance scheme is proposed to improve the accuracy of the extracted region orientation. Compared with conventional methods, the region orientation of the palmprint extracted using the proposed method can precisely and robustly describe the orientation feature of the palmprint. Extensive experiments on the baseline PolyU and multispectral palmprint databases are performed and the results show that the proposed method achieves a promising performance in comparison to conventional state-of-the-art orientation-based coding methods in both palmprint verification and identification.
Hall, L; Laird, J E; Craig, R K
1984-01-01
Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375
Futagami, Taiki; Kadooka, Chihiro; Ando, Yoshinori; Okutsu, Kayu; Yoshizaki, Yumiko; Setoguchi, Shinji; Takamine, Kazunori; Kawai, Mikihiko; Tamaki, Hisanori
2017-10-01
Shochu is a traditional Japanese distilled spirit. The formation of the distinguishing flavour of shochu produced in individual distilleries is attributed to putative indigenous yeast strains. In this study, we performed the first (to our knowledge) phylogenetic classification of shochu strains based on nucleotide gene sequences. We performed phylogenetic classification of 21 putative indigenous shochu yeast strains isolated from 11 distilleries. All of these strains were shown or confirmed to be Saccharomyces cerevisiae, sharing species identification with 34 known S. cerevisiae strains (including commonly used shochu, sake, ale, whisky, bakery, bioethanol and laboratory yeast strains and clinical isolate) that were tested in parallel. Our analysis used five genes that reflect genome-level phylogeny for the strain-level classification. In a first step, we demonstrated that partial regions of the ZAP1, THI7, PXL1, YRR1 and GLG1 genes were sufficient to reproduce previous sub-species classifications. In a second step, these five analysed regions from each of 25 strains (four commonly used shochu strains and the 21 putative indigenous shochu strains) were concatenated and used to generate a phylogenetic tree. Further analysis revealed that the putative indigenous shochu yeast strains form a monophyletic group that includes both the shochu yeasts and a subset of the sake group strains; this cluster is a sister group to other sake yeast strains, together comprising a sake-shochu group. Differences among shochu strains were small, suggesting that it may be possible to correlate subtle phenotypic differences among shochu flavours with specific differences in genome sequences. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Theophilou, Georgios; Morais, Camilo L M; Halliwell, Diane E; Lima, Kássio M G; Drury, Josephine; Martin-Hirsch, Pierre L; Stringfellow, Helen F; Hapangama, Dharani K; Martin, Francis L
2018-05-09
The cyclical process of regeneration of the endometrium suggests that it may contain a cell population that can provide daughter cells with high proliferative potential. These cell lineages are clinically significant as they may represent clonogenic cells that may also be involved in tumourigenesis as well as endometriotic lesion development. To determine whether the putative stem cell location within human uterine tissue can be derived using vibrational spectroscopy techniques, normal endometrial tissue was interrogated by two spectroscopic techniques. Paraffin-embedded uterine tissues containing endometrial glands were sectioned to 10-μm-thick parallel tissue sections and were floated onto BaF 2 slides for synchrotron radiation-based Fourier-transform infrared (SR-FTIR) microspectroscopy and globar focal plane array-based FTIR spectroscopy. Different spectral characteristics were identified depending on the location of the glands examined. The resulting infrared spectra were subjected to multivariate analysis to determine associated biophysical differences along the length of longitudinal and crosscut gland sections. Comparison of the epithelial cellular layer of transverse gland sections revealed alterations indicating the presence of putative transient-amplifying-like cells in the basalis and mitotic cells in the functionalis. SR-FTIR microspectroscopy of the base of the endometrial glands identified the location where putative stem cells may reside at the same time pointing towards ν s PO 2 - in DNA and RNA, nucleic acids and amide I and II vibrations as major discriminating factors. This study supports the view that vibration spectroscopy technologies are a powerful adjunct to our understanding of the stem cell biology of endometrial tissue. Graphical abstract ᅟ.
Koizumi, Takahiko; Nara, Kazuhide
2017-06-24
Dwarf shrubs of the family Ericaceae are common in arctic and alpine regions. Many of these plants are associated with ericoid mycorrhizal (ERM) fungi, which allow them to take nutrients and water from the soil under harsh environmental conditions and, thus, affect host plant survival. Despite the importance of ERM fungi to alpine plant communities, limited information is available on the effects of microhabitat and host identity on ERM fungal communities. We investigated the communities of putative ERM fungi isolated from five dwarf shrub species (Arcterica nana, Diapensia lapponica, Empetrum nigrum, Loiseleuria procumbens, and Vaccinium vitis-idaea) that co-occur in an alpine region of Japan, with reference to distinct microhabitats provided by large stone pine (Pinus pumila) shrubs (i.e. bare ground, the edge of stone pine shrubs, and the inside of stone pine shrubs). We obtained 703 fungal isolates from 222 individual plants. These isolates were classified into 55 operational taxonomic units (OTUs) based on the sequencing of internal transcribed spacer regions in ribosomal DNA. These putative ERM fungal communities were dominated by Helotiales fungi for all host species. Cistella and Trimmatostroma species, which have rarely been detected in ERM roots in previous studies, were abundant. ERM fungal communities were significantly different among microhabitats (R 2 =0.28), while the host effect explained less variance in the fungal communities after excluding the microhabitat effect (R 2 =0.17). Our results suggest that the host effect on ERM fungal communities is minor and the distributions of hosts and fungal communities may be assessed based on microhabitat conditions.
Koizumi, Takahiko; Nara, Kazuhide
2017-01-01
Dwarf shrubs of the family Ericaceae are common in arctic and alpine regions. Many of these plants are associated with ericoid mycorrhizal (ERM) fungi, which allow them to take nutrients and water from the soil under harsh environmental conditions and, thus, affect host plant survival. Despite the importance of ERM fungi to alpine plant communities, limited information is available on the effects of microhabitat and host identity on ERM fungal communities. We investigated the communities of putative ERM fungi isolated from five dwarf shrub species (Arcterica nana, Diapensia lapponica, Empetrum nigrum, Loiseleuria procumbens, and Vaccinium vitis-idaea) that co-occur in an alpine region of Japan, with reference to distinct microhabitats provided by large stone pine (Pinus pumila) shrubs (i.e. bare ground, the edge of stone pine shrubs, and the inside of stone pine shrubs). We obtained 703 fungal isolates from 222 individual plants. These isolates were classified into 55 operational taxonomic units (OTUs) based on the sequencing of internal transcribed spacer regions in ribosomal DNA. These putative ERM fungal communities were dominated by Helotiales fungi for all host species. Cistella and Trimmatostroma species, which have rarely been detected in ERM roots in previous studies, were abundant. ERM fungal communities were significantly different among microhabitats (R2=0.28), while the host effect explained less variance in the fungal communities after excluding the microhabitat effect (R2=0.17). Our results suggest that the host effect on ERM fungal communities is minor and the distributions of hosts and fungal communities may be assessed based on microhabitat conditions. PMID:28529264
Becker, Sara J.; Squires, Daniel D.; Strong, David R.; Barnett, Nancy P.; Monti, Peter M.; Petry, Nancy M.
2016-01-01
Background Few prospective studies have evaluated theory-driven approaches to the implementation of evidence-based opioid treatment. This study compared the effectiveness of an implementation model (Science to Service Laboratory; SSL) to training as usual (TAU) in promoting the adoption of contingency management across a multi-site opiate addiction treatment program. We also examined whether the SSL affected putative mediators of contingency management adoption (perceived innovation characteristics and organizational readiness to change). Methods Sixty treatment providers (39 SSL, 21 TAU) from 15 geographically diverse satellite clinics (7 SSL, 8 TAU) participated in the 12-month study. Both conditions received didactic contingency management training and those in the pre-determined experimental region received 9 months of SSL-enhanced training. Contingency management adoption was monitored biweekly, while putative mediators were measured at baseline, 3-, and 12-months. Results Relative to providers in the TAU region, treatment providers in the SSL region had comparable likelihood of contingency management adoption in the first 20 weeks of the study, and then significantly higher likelihood of adoption (odds ratios = 2.4-13.5) for the remainder of the study. SSL providers also reported higher levels of one perceived innovation characteristic (Observability) and one aspect of organizational readiness to change (Adequacy of Training Resources), although there was no evidence that the SSL affected these putative mediators over time. Conclusions Results of this study indicate that a fully powered randomized trial of the SSL is warranted. Considerations for a future evaluation are discussed. PMID:26682582
DLEU2 encodes an antisense RNA for the putative bicistronic RFP2/LEU5 gene in humans and mouse.
Corcoran, Martin M; Hammarsund, Marianne; Zhu, Chaoyong; Lerner, Mikael; Kapanadze, Bagrat; Wilson, Bill; Larsson, Catharina; Forsberg, Lars; Ibbotson, Rachel E; Einhorn, Stefan; Oscier, David G; Grandér, Dan; Sangfelt, Olle
2004-08-01
Our group previously identified two novel genes, RFP2/LEU5 and DLEU2, within a 13q14.3 genomic region of loss seen in various malignancies. However, no specific inactivating mutations were found in these or other genes in the vicinity of the deletion, suggesting that a nonclassical tumor-suppressor mechanism may be involved. Here, we present data showing that the DLEU2 gene encodes a putative noncoding antisense RNA, with one exon directly overlapping the first exon of the RFP2/LEU5 gene in the opposite orientation. In addition, the RFP2/LEU5 transcript can be alternatively spliced to produce either several monocistronic transcripts or a putative bicistronic transcript encoding two separate open-reading frames, adding to the complexity of the locus. The finding that these gene structures are conserved in the mouse, including the putative bicistronic RFP2/LEU5 transcript as well as the antisense relationship with DLEU2, further underlines the significance of this unusual organization and suggests a biological function for DLEU2 in the regulation of RFP2/LEU5. Copyright 2004 Wiley-Liss, Inc.
The search for the number form area: A functional neuroimaging meta-analysis.
Yeo, Darren J; Wilkey, Eric D; Price, Gavin R
2017-07-01
Recent studies report a putative "number form area" (NFA) in the inferior temporal gyrus (ITG) suggested to be specialized for Arabic numeral processing. However, a number of earlier studies report no such NFA. The reasons for such discrepancies across studies are unclear. To examine evidence for a convergent NFA across studies, we conducted two activation likelihood estimation meta-analyses on 31 and a subset of 20 neuroimaging studies that have contrasted digits with other meaningful symbols. Results suggest the potential existence of an NFA in the right ITG, in addition to a 'symbolic number processing network' comprising bilateral parietal regions, and right-lateralized superior and inferior frontal regions. Critically, convergent localization for the NFA was only evident when contrasts were appropriately controlled for task demands, and does not appear to depend on employing methods designed to overcome fMRI signal dropout in the ITG. Importantly, only five studies had foci within the identified ITG NFA cluster boundary, indicating that more empirical evidence is necessary to determine the true functional specialization and regional specificity of the putative NFA. Copyright © 2017 Elsevier Ltd. All rights reserved.
Complete genome sequence of lymphocystis disease virus isolated from China.
Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang
2004-07-01
Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1.
Complete Genome Sequence of Lymphocystis Disease Virus Isolated from China
Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang
2004-01-01
Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1. PMID:15194775
2012-01-01
Background Detecting the borders between coding and non-coding regions is an essential step in the genome annotation. And information entropy measures are useful for describing the signals in genome sequence. However, the accuracies of previous methods of finding borders based on entropy segmentation method still need to be improved. Methods In this study, we first applied a new recursive entropic segmentation method on DNA sequences to get preliminary significant cuts. A 22-symbol alphabet is used to capture the differential composition of nucleotide doublets and stop codon patterns along three phases in both DNA strands. This process requires no prior training datasets. Results Comparing with the previous segmentation methods, the experimental results on three bacteria genomes, Rickettsia prowazekii, Borrelia burgdorferi and E.coli, show that our approach improves the accuracy for finding the borders between coding and non-coding regions in DNA sequences. Conclusions This paper presents a new segmentation method in prokaryotes based on Jensen-Rényi divergence with a 22-symbol alphabet. For three bacteria genomes, comparing to A12_JR method, our method raised the accuracy of finding the borders between protein coding and non-coding regions in DNA sequences. PMID:23282225
Moyo, Lindani; Ramesh, Shunmugiah V; Kappagantu, Madhu; Mitter, Neena; Sathuvalli, Vidyasagar; Pappu, Hanu R
2017-07-17
Potato virus Y (PVY) is one of the most economically important pathogen of potato that is present as biologically distinct strains. The virus-derived small interfering RNAs (vsiRNAs) from potato cv. Russet Burbank individually infected with PVY-N, PVY-NTN and PVY-O strains were recently characterized. Plant defense RNA-silencing mechanisms deployed against viruses produce vsiRNAs to degrade homologous viral transcripts. Based on sequence complementarity, the vsiRNAs can potentially degrade host RNA transcripts raising the prospect of vsiRNAs as pathogenicity determinants in virus-host interactions. This study investigated the global effects of PVY vsiRNAs on the host potato transcriptome. The strain-specific vsiRNAs of PVY, expressed in high copy number, were analyzed in silico for their proclivity to target potato coding and non-coding RNAs using psRobot and psRNATarget algorithms. Functional annotation of target coding transcripts was carried out to predict physiological effects of the vsiRNAs on the potato cv. Russet Burbank. The downregulation of selected target coding transcripts was further validated using qRT-PCR. The vsiRNAs derived from biologically distinct strains of PVY displayed diversity in terms of absolute number, copy number and hotspots for siRNAs on their respective genomes. The vsiRNAs populations were derived with a high frequency from 6 K1, P1 and Hc-Pro for PVY-N, P1, Hc-Pro and P3 for PVY-NTN, and P1, 3' UTR and NIa for PVY-O genomic regions. The number of vsiRNAs that displayed interaction with potato coding transcripts and number of putative coding target transcripts were comparable between PVY-N and PVY-O, and were relatively higher for PVY-NTN. The most abundant target non-coding RNA transcripts for the strain specific PVY-derived vsiRNAs were found to be MIR821, 28S rRNA,18S rRNA, snoR71, tRNA-Met and U5. Functional annotation and qRT-PCR validation suggested that the vsiRNAs target genes involved in plant hormone signaling, genetic information processing, plant-pathogen interactions, plant defense and stress response processes in potato. The findings suggested that the PVY-derived vsiRNAs could act as a pathogenicity determinant and as a counter-defense strategy to host RNA silencing in PVY-potato interactions. The broad range of host genes targeted by PVY vsiRNAs in infected potato suggests a diverse role for vsiRNAs that includes suppression of host stress responses and developmental processes. The interactome scenario is the first report on the interaction between one of the most important Potyvirus genome-derived siRNAs and the potato transcripts.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eipers, P.G.
1992-01-01
The gene for the human p58[sup clk[minus]1] protein kinase, a cell division control-related gene, has been mapped by somatic cell hybrid analyses, in situ localization with the chromosomal gene, and nested polymerase chain reaction amplification of microdissected chromosomes. These studies indicate that the expressed p58[sup clk[minus]1] chromosomal gene maps to 1p36, while a highly related p58[sup clk[minus]1] sequence of unknown nature maps to chromosome 15. Assignment of a p34[sup cdc2]-related gene to 1p36 region, including neuroblastoma, ductal carcinoma of the breast, malignant melanoma, Merkel cell carcinoma and endocrine neoplasia among others. Aberrant expression of this protein kinase negatively regulates normalmore » cellular growth. The p58[sup clk[minus]1] protein contains a central domain of 299 amino acids that is 46% identical to human p34[sup cdc2], the master mitotic protein kinase. This dissertation details the complete structure of the p58[sup clk[minus]1] chromosomal gene, including its putative promoter region, transcriptional start sites, exonic sequences, and intron/exon boundary sequences. The gene is 10 kb in size and contains 12 exons and 11 introns. Interestingly, the rather large 2.0 kb 3[prime] untranslated region is interrupted by an intron that separates a region containing numerous AUUUA destabilization motifs from the coding region. Furthermore, the expression of this gene in normal human tissues, as well as several human tumor cell samples and lines, is examined. The origin of multiple human transcripts from the same chromosomal gene, and the possible differential stability of these various transcripts, is discussed with regard to the transcriptional and post-transcriptional regulation of this gene. This is the first report of the chromosomal gene structure of a member of the p34[sup cdc2] supergene family.« less
Liu, Charles; Kayima, Peter; Riesel, Johanna; Situma, Martin; Chang, David; Firth, Paul
2017-11-01
The lack of a classification system for surgical procedures in resource-limited settings hinders outcomes measurement and reporting. Existing procedure coding systems are prohibitively large and expensive to implement. We describe the creation and prospective validation of 3 brief procedure code lists applicable in low-resource settings, based on analysis of surgical procedures performed at Mbarara Regional Referral Hospital, Uganda's second largest public hospital. We reviewed operating room logbooks to identify all surgical operations performed at Mbarara Regional Referral Hospital during 2014. Based on the documented indication for surgery and procedure(s) performed, we assigned each operation up to 4 procedure codes from the International Classification of Diseases, 9th Revision, Clinical Modification. Coding of procedures was performed by 2 investigators, and a random 20% of procedures were coded by both investigators. These codes were aggregated to generate procedure code lists. During 2014, 6,464 surgical procedures were performed at Mbarara Regional Referral Hospital, to which we assigned 435 unique procedure codes. Substantial inter-rater reliability was achieved (κ = 0.7037). The 111 most common procedure codes accounted for 90% of all codes assigned, 180 accounted for 95%, and 278 accounted for 98%. We considered these sets of codes as 3 procedure code lists. In a prospective validation, we found that these lists described 83.2%, 89.2%, and 92.6% of surgical procedures performed at Mbarara Regional Referral Hospital during August to September of 2015, respectively. Empirically generated brief procedure code lists based on International Classification of Diseases, 9th Revision, Clinical Modification can be used to classify almost all surgical procedures performed at a Ugandan referral hospital. Such a standardized procedure coding system may enable better surgical data collection for administration, research, and quality improvement in resource-limited settings. Copyright © 2017 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kwon, Deug-Nam; Park, Mi-Ryung; Park, Jong-Yi
Highlights: {yields} The sequences of -604 to -84 bp of the pUPII promoter contained the region of a putative negative cis-regulatory element. {yields} The core promoter was located in the 5F-1. {yields} Transcription factor HNF4 can directly bind in the pUPII core promoter region, which plays a critical role in controlling promoter activity. {yields} These features of the pUPII promoter are fundamental to development of a target-specific vector. -- Abstract: Uroplakin II (UPII) is a one of the integral membrane proteins synthesized as a major differentiation product of mammalian urothelium. UPII gene expression is bladder specific and differentiation dependent, butmore » little is known about its transcription response elements and molecular mechanism. To identify the cis-regulatory elements in the pig UPII (pUPII) gene promoter region, we constructed pUPII 5' upstream region deletion mutants and demonstrated that each of the deletion mutants participates in controlling the expression of the pUPII gene in human bladder carcinoma RT4 cells. We also identified a new core promoter region and putative negative cis-regulatory element within a minimal promoter region. In addition, we showed that hepatocyte nuclear factor 4 (HNF4) can directly bind in the pUPII core promoter (5F-1) region, which plays a critical role in controlling promoter activity. Transient cotransfection experiments showed that HNF4 positively regulates pUPII gene promoter activity. Thus, the binding element and its binding protein, HNF4 transcription factor, may be involved in the mechanism that specifically regulates pUPII gene transcription.« less
A genome-wide scan for signatures of selection in Azeri and Khuzestani buffalo breeds.
Mokhber, Mahdi; Moradi-Shahrbabak, Mohammad; Sadeghi, Mostafa; Moradi-Shahrbabak, Hossein; Stella, Alessandra; Nicolzzi, Ezequiel; Rahmaninia, Javad; Williams, John L
2018-06-11
Identification of genomic regions that have been targets of selection may shed light on the genetic history of livestock populations and help to identify variation controlling commercially important phenotypes. The Azeri and Kuzestani buffalos are the most common indigenous Iranian breeds which have been subjected to divergent selection and are well adapted to completely different regions. Examining the genetic structure of these populations may identify genomic regions associated with adaptation to the different environments and production goals. A set of 385 water buffalo samples from Azeri (N = 262) and Khuzestani (N = 123) breeds were genotyped using the Axiom® Buffalo Genotyping 90 K Array. The unbiased fixation index method (F ST ) was used to detect signatures of selection. In total, 13 regions with outlier F ST values (0.1%) were identified. Annotation of these regions using the UMD3.1 Bos taurus Genome Assembly was performed to find putative candidate genes and QTLs within the selected regions. Putative candidate genes identified include FBXO9, NDFIP1, ACTR3, ARHGAP26, SERPINF2, BOLA-DRB3, BOLA-DQB, CLN8, and MYOM2. Candidate genes identified in regions potentially under selection were associated with physiological pathways including milk production, cytoskeleton organization, growth, metabolic function, apoptosis and domestication-related changes include immune and nervous system development. The QTL identified are involved in economically important traits in buffalo related to milk composition, udder structure, somatic cell count, meat quality, and carcass and body weight.
BNDF methylation in mothers and newborns is associated with maternal exposure to war trauma.
Kertes, Darlene A; Bhatt, Samarth S; Kamin, Hayley S; Hughes, David A; Rodney, Nicole C; Mulligan, Connie J
2017-01-01
The BDNF gene codes for brain-derived neurotrophic factor, a growth factor involved in neural development, cell differentiation, and synaptic plasticity. Present in both the brain and periphery, BDNF plays critical roles throughout the body and is essential for placental and fetal development. Rodent studies show that early life stress, including prenatal stress, broadly alters BDNF methylation, with presumed changes in gene expression. No studies have assessed prenatal exposure to maternal traumatic stress and BDNF methylation in humans. This study examined associations of prenatal exposure to maternal stress and BDNF methylation at CpG sites across the BDNF gene. Among 24 mothers and newborns in the eastern Democratic Republic of Congo, a region with extreme conflict and violence to women, maternal experiences of war trauma and chronic stress were associated with BDNF methylation in umbilical cord blood, placental tissue, and maternal venous blood. Associations of maternal stress and BDNF methylation showed high tissue specificity. The majority of significant associations were observed in putative transcription factor binding regions. This is the first study in humans to examine BDNF methylation in relation to prenatal exposure to maternal stress in three tissues simultaneously and the first in any mammalian species to report associations of prenatal stress and BDNF methylation in placental tissue. The findings add to the growing body of evidence highlighting the importance of considering epigenetic effects when examining the impacts of trauma and stress, not only for adults but also for offspring exposed via effects transmitted before birth.
Sandbaken, M. G.; Culbertson, M. R.
1988-01-01
A mutational analysis of the eukaryotic elongation factor EF-1α indicates that this protein functions to limit the frequency of errors during genetic code translation. We found that both amino acid misincorporation and reading frame errors are controlled by EF-1α. In order to examine the function of this protein, the TEF2 gene, which encodes EF-1α in Saccharomyces cerevisiae, was mutagenized in vitro with hydroxylamine. Sixteen independent TEF2 alleles were isolated by their ability to suppress frameshift mutations. DNA sequence analysis identified eight different sites in the EF-1α protein that elevate the frequency of mistranslation when mutated. These sites are located in two different regions of the protein. Amino acid substitutions located in or near the GTP-binding and hydrolysis domain of the protein cause suppression of frameshift and nonsense mutations. These mutations may effect mistranslation by altering the binding or hydrolysis of GTP. Amino acid substitutions located adjacent to a putative aminoacyl-tRNA binding region also suppress frameshift and nonsense mutations. These mutations may alter the binding of aminoacyl-tRNA by EF-1α. The identification of frameshift and nonsense suppressor mutations in EF-1α indicates a role for this protein in limiting amino acid misincorporation and reading frame errors. We suggest that these types of errors are controlled by a common mechanism or closely related mechanisms. PMID:3066688
Functional Analysis of the ComK Protein of Bacillus coagulans
Kovács, Ákos T.; Eckhardt, Tom H.; van Kranenburg, Richard; Kuipers, Oscar P.
2013-01-01
The genes for DNA uptake and recombination in Bacilli are commonly regulated by the transcriptional factor ComK. We have identified a ComK homologue in Bacillus coagulans, an industrial relevant organism that is recalcitrant for transformation. Introduction of B. coagulans comK gene under its own promoter region into Bacillus subtilis comK strain results in low transcriptional induction of the late competence gene comGA, but lacking bistable expression. The promoter regions of B. coagulans comK and the comGA genes are recognized in B. subtilis and expression from these promoters is activated by B. subtilis ComK. Purified ComK protein of B. coagulans showed DNA-binding ability in gel retardation assays with B. subtilis- and B. coagulans-derived probes. These experiments suggest that the function of B. coagulans ComK is similar to that of ComK of B. subtilis. When its own comK is overexpressed in B. coagulans the comGA gene expression increases 40-fold, while the expression of another late competence gene, comC is not elevated and no reproducible DNA-uptake could be observed under these conditions. Our results demonstrate that B. coagulans ComK can recognize several B. subtilis comK-responsive elements, and vice versa, but indicate that the activation of the transcription of complete sets of genes coding for a putative DNA uptake apparatus in B. coagulans might differ from that of B. subtilis. PMID:23301076
Genomic characterization and regulation of CYP3a13: role of xenobiotics and nuclear receptors.
Anakk, Sayeepriyadarshini; Kalsotra, Auinash; Shen, Qi; Vu, Mary T; Staudinger, Jeffrey L; Davies, Peter J A; Strobel, Henry W
2003-09-01
We report that CYP3a13 gene, located on mouse chromosome 5, spans 27.5 Kb and contains 13 exons. The transcription start site is 35 bp upstream of the coding region and results in a 109 bp 5' untranslated region. CYP3a13 promoter shows putative binding sites for retinoid X receptor, pregnane X receptor, and estrogen receptor. CYP3a13 shows a broad tissue distribution with predominant expression in liver. Although CYP3a13 shares 92% nucleotide identity with the female-specific rat CYP3A9, its expression does not exhibit sexual dimorphism. Ligand activation of peroxisomal proliferator-activated receptor-gamma and retinoid X receptor inhibit expression of CYP3a13 at the transcription level in a tissue-specific manner. Another novel finding is hepatic induction of CYP3a13 by dexamethasone occurring only in pregnane X receptor null mice. We also report that pregnane X receptor is essential to maintain robust in vivo basal levels of CYP3a13 in contrast to CYP3a11. CYP3a13 protein expressed in vitro can metabolize clinically active drugs ethylmorphine and erythromycin, as well as benzphetamine. We conclude that CYP3a13 is regulated differentially by various nuclear receptors. In humans this may lead to altered drug metabolism, as many of the newly synthesized ligands/drugs targeted toward these nuclear receptors could influence CYP3A gene expression.
Cooper, Wendy N.; Dickinson, Rachel E.; Dallol, Ashraf; Grigorieva, Elvira V.; Pavlova, Tatiana V.; Hesson, Luke B.; Bieche, Ivan; Broggini, Massimo; Maher, Eamonn R; Zabarovsky, Eugene R.; Clark, Geoffrey J; Latif, Farida
2010-01-01
RASSF2 is a recently identified member of a class of novel tumour suppressor genes, all containing a ras association domain. We previously demonstrated that the A isoform of RASSF2, is frequently inactivated by promoter region hypermethylation in colorectal tumours and adenomas, methylation was tumour specific and that expression in methylated tumour lines could be reactivated by treatment with 5-aza-2dc. RASSF2 resides at 20p13, this region has been demonstrated to be frequently lost in human cancers. In this report we investigated methylation status of the RASSF2A promoter CpG island in a series of breast, ovarian and non-small cell lung cancers (NSCLC). RASSF2A was frequently methylated in breast tumour cell lines 65% (13/20) and in primary breast tumours 38% (15/40). RASSF2A gene expression could be switched back on in methylated breast tumour cell lines after treatment with 5-aza-2dC, whilst unmethylated lines showed no difference in level of expression before and after 5-aza-2dC treatment. RASSF2A was also frequently methylated in NSCLC tumours 44% (22/50). Methylation in breast tumours and NSCLC was tumour specific. We did not detect RASSF2A methylation in ovarian tumours (0/17). Furthermore no mutations were found in the coding region of RASSF2A in these ovarian tumours. RASSF2A suppressed breast tumour cell growth in vitro (through colony formation and soft agar assays) and in vivo. We identified a highly conserved putative bipartite nuclear localisation signal (NLS) between amino acids 151 and 167 in the RASSF2A sequence and demonstrated that endogenous RASSF2A localised to the nucleus. Mutation of the putative nuclear localisation signal abolished the nuclear localisation so RASSF2A became predominantly cytoplasmic. Our data indicates that RASSF2A is frequently methylated in colorectal, breast and NSCLC tumours, furthermore, the methylation is tumour specific. Hence we have identified RASSF2A as a novel methylation marker for multiple malignancies and it has the potential to be developed into a valuable marker for screening several cancers in parallel using promoter hypermethylation profiles. We also demonstrate that RASSF2 has a functional NLS signal. Furthermore this is the first report demonstrating that RASSF2 suppresses growth of cancer cells in vivo. Hence providing further evidence for its role as a tumour suppressor gene located at 20p13. PMID:17891178
Perera, N C N; Godahewa, G I; Lee, Jehee
2016-10-01
Copper-zinc-superoxide dismutase (CuZnSOD) from Hippocampus abdominalis (HaCuZnSOD) is a metalloenzyme which belongs to the ubiquitous family of SODs. Here, we determined the characteristic structural features of HaCuZnSOD, analyzed its evolutionary relationships, and identified its potential immune responses and biological functions in relation to antioxidant defense mechanisms in the seahorse. The gene had a 5' untranslated region (UTR) of 67 bp, a coding sequence of 465 bp and a 3' UTR of 313 bp. The putative peptide consists of 154 amino acids. HaCuZnSOD had a predicted molecular mass of 15.94 kDa and a theoretical pI value of 5.73, which is favorable for copper binding activity. In silico analysis revealed that HaCuZnSOD had a prominent Cu-Zn_superoxide_dismutase domain, two Cu/Zn signature sequences, a putative N-glycosylation site, and several active sites including Cu(2+) and Zn(2+) binding sites. The three dimensional structure indicated a β-sheet barrel with 8 β-sheets and two short α-helical regions. Multiple alignment analyses revealed many conserved regions and active sites among its orthologs. The highest amino acid identity to HaCuZnSOD was found in Siniperca chuatsi (87.4%), while Maylandia zebra shared a close relationship in the phylogenetic analysis. Functional assays were performed to assess the antioxidant, biophysical and biochemical properties of overexpressed recombinant (r) HaCuZnSOD. A xanthine/XOD assay gave optimum results at pH 9 and 25 °C indicating these may be the best conditions for its antioxidant action in the seahorse. An MTT assay and flow cytometry confirmed that rHaCuZnSOD showed peroxidase activity in the presence of HCO3(-). In all the functional assays, the level of antioxidant activity of rHaCuZnSOD was concentration dependent; metal ion supplementation also increased its activity. The highest mRNA expressional level of HaCuZnSOD was found in blood. Temporal assessment under pathological stress showed a delay response by HaCuZnSOD. Our findings demonstrated that HaCuZnSOD is an important antioxidant, which might be involved in the host antioxidant defense mechanism against oxidative stress. Copyright © 2016 Elsevier Ltd. All rights reserved.
Phylogenetic Network for European mtDNA
Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari
2001-01-01
The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229
Dynamics of cortical dendritic membrane potential and spikes in freely behaving rats.
Moore, Jason J; Ravassard, Pascal M; Ho, David; Acharya, Lavanya; Kees, Ashley L; Vuong, Cliff; Mehta, Mayank R
2017-03-24
Neural activity in vivo is primarily measured using extracellular somatic spikes, which provide limited information about neural computation. Hence, it is necessary to record from neuronal dendrites, which can generate dendritic action potentials (DAPs) in vitro, which can profoundly influence neural computation and plasticity. We measured neocortical sub- and suprathreshold dendritic membrane potential (DMP) from putative distal-most dendrites using tetrodes in freely behaving rats over multiple days with a high degree of stability and submillisecond temporal resolution. DAP firing rates were several-fold larger than somatic rates. DAP rates were also modulated by subthreshold DMP fluctuations, which were far larger than DAP amplitude, indicating hybrid, analog-digital coding in the dendrites. Parietal DAP and DMP exhibited egocentric spatial maps comparable to pyramidal neurons. These results have important implications for neural coding and plasticity. Copyright © 2017, American Association for the Advancement of Science.
BTKbase, mutation database for X-linked agammaglobulinemia (XLA).
Vihinen, M; Brandau, O; Brandén, L J; Kwan, S P; Lappalainen, I; Lester, T; Noordzij, J G; Ochs, H D; Ollila, J; Pienaar, S M; Riikonen, P; Saha, B K; Smith, C I
1998-01-01
X-linked agammaglobulinemia (XLA) is an immunodeficiency caused by mutations in the gene coding for Bruton's agammaglobulinemia tyrosine kinase (BTK). A database (BTKbase) of BTK mutations has been compiled and the recent update lists 463 mutation entries from 406 unrelated families showing 303 unique molecular events. In addition to mutations, the database also lists variants or polymorphisms. Each patient is given a unique patient identity number (PIN). Information is included regarding the phenotype including symptoms. Mutations in all the five domains of BTK have been noticed to cause the disease, the most common event being missense mutations. The mutations appear almost uniformly throughout the molecule and frequently affect CpG sites that code for arginine residues. The putative structural implications of all the missense mutations are given in the database. The improved version of the registry having a number of new features is available at http://www. helsinki.fi/science/signal/btkbase.html PMID:9399844
NASA Astrophysics Data System (ADS)
Sowerby, Stephen J.; Petersen, George B.
2002-08-01
The hypothesis that life originated and evolved from linear informational molecules capable of facilitating their own catalytic replication is deeply entrenched. However, widespread acceptance of this paradigm seems oblivious to a lack of direct experimental support. Here, we outline the fundamental objections to the de novo appearance of linear, self-replicating polymers and examine an alternative hypothesis of template-directed coding of peptide catalysts by adsorbed purine bases. The bases (which encode biological information in modern nucleic acids) spontaneously self-organize into two-dimensional molecular solids adsorbed to the uncharged surfaces of crystalline minerals; their molecular arrangement is specified by hydrogen bonding rules between adjacent molecules and can possess the aperiodic complexity to encode putative protobiological information. The persistence of such information through self-reproduction, together with the capacity of adsorbed bases to exhibit enantiomorphism and effect amino acid discrimination, would seem to provide the necessary machinery for a primitive genetic coding mechanism.
Systematic analysis of coding and noncoding DNA sequences using methods of statistical linguistics
NASA Technical Reports Server (NTRS)
Mantegna, R. N.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.; Stanley, H. E.
1995-01-01
We compare the statistical properties of coding and noncoding regions in eukaryotic and viral DNA sequences by adapting two tests developed for the analysis of natural languages and symbolic sequences. The data set comprises all 30 sequences of length above 50 000 base pairs in GenBank Release No. 81.0, as well as the recently published sequences of C. elegans chromosome III (2.2 Mbp) and yeast chromosome XI (661 Kbp). We find that for the three chromosomes we studied the statistical properties of noncoding regions appear to be closer to those observed in natural languages than those of coding regions. In particular, (i) a n-tuple Zipf analysis of noncoding regions reveals a regime close to power-law behavior while the coding regions show logarithmic behavior over a wide interval, while (ii) an n-gram entropy measurement shows that the noncoding regions have a lower n-gram entropy (and hence a larger "n-gram redundancy") than the coding regions. In contrast to the three chromosomes, we find that for vertebrates such as primates and rodents and for viral DNA, the difference between the statistical properties of coding and noncoding regions is not pronounced and therefore the results of the analyses of the investigated sequences are less conclusive. After noting the intrinsic limitations of the n-gram redundancy analysis, we also briefly discuss the failure of the zeroth- and first-order Markovian models or simple nucleotide repeats to account fully for these "linguistic" features of DNA. Finally, we emphasize that our results by no means prove the existence of a "language" in noncoding DNA.
Osato, Naoki
2018-01-19
Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional enrichments were related to the cellular functions. The normalized number of functional enrichments of human putative transcriptional target genes changed according to the criteria of enhancer-promoter assignments and correlated with the median expression level of the target genes. These analyses and characters of human putative transcriptional target genes would be useful to examine the criteria of enhancer-promoter assignments and to predict the novel mechanisms and factors such as DNA binding proteins and DNA sequences of enhancer-promoter interactions.
Constant time worker thread allocation via configuration caching
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eichenberger, Alexandre E; O'Brien, John K. P.
Mechanisms are provided for allocating threads for execution of a parallel region of code. A request for allocation of worker threads to execute the parallel region of code is received from a master thread. Cached thread allocation information identifying prior thread allocations that have been performed for the master thread are accessed. Worker threads are allocated to the master thread based on the cached thread allocation information. The parallel region of code is executed using the allocated worker threads.
Mix, Heiko; Lobanov, Alexey V.; Gladyshev, Vadim N.
2007-01-01
Expression of selenocysteine (Sec)-containing proteins requires the presence of a cis-acting mRNA structure, called selenocysteine insertion sequence (SECIS) element. In bacteria, this structure is located in the coding region immediately downstream of the Sec-encoding UGA codon, whereas in eukaryotes a completely different SECIS element has evolved in the 3′-untranslated region. Here, we report that SECIS elements in the coding regions of selenoprotein mRNAs support Sec insertion in higher eukaryotes. Comprehensive computational analysis of all available viral genomes revealed a SECIS element within the ORF of a naturally occurring selenoprotein homolog of glutathione peroxidase 4 in fowlpox virus. The fowlpox SECIS element supported Sec insertion when expressed in mammalian cells as part of the coding region of viral or mammalian selenoproteins. In addition, readthrough at UGA was observed when the viral SECIS element was located upstream of the Sec codon. We also demonstrate successful de novo design of a functional SECIS element in the coding region of a mammalian selenoprotein. Our data provide evidence that the location of the SECIS element in the untranslated region is not a functional necessity but rather is an evolutionary adaptation to enable a more efficient synthesis of selenoproteins. PMID:17169995
Coded Cooperation for Multiway Relaying in Wireless Sensor Networks †
Si, Zhongwei; Ma, Junyang; Thobaben, Ragnar
2015-01-01
Wireless sensor networks have been considered as an enabling technology for constructing smart cities. One important feature of wireless sensor networks is that the sensor nodes collaborate in some manner for communications. In this manuscript, we focus on the model of multiway relaying with full data exchange where each user wants to transmit and receive data to and from all other users in the network. We derive the capacity region for this specific model and propose a coding strategy through coset encoding. To obtain good performance with practical codes, we choose spatially-coupled LDPC (SC-LDPC) codes for the coded cooperation. In particular, for the message broadcasting from the relay, we construct multi-edge-type (MET) SC-LDPC codes by repeatedly applying coset encoding. Due to the capacity-achieving property of the SC-LDPC codes, we prove that the capacity region can theoretically be achieved by the proposed MET SC-LDPC codes. Numerical results with finite node degrees are provided, which show that the achievable rates approach the boundary of the capacity region in both binary erasure channels and additive white Gaussian channels. PMID:26131675
Coded Cooperation for Multiway Relaying in Wireless Sensor Networks.
Si, Zhongwei; Ma, Junyang; Thobaben, Ragnar
2015-06-29
Wireless sensor networks have been considered as an enabling technology for constructing smart cities. One important feature of wireless sensor networks is that the sensor nodes collaborate in some manner for communications. In this manuscript, we focus on the model of multiway relaying with full data exchange where each user wants to transmit and receive data to and from all other users in the network. We derive the capacity region for this specific model and propose a coding strategy through coset encoding. To obtain good performance with practical codes, we choose spatially-coupled LDPC (SC-LDPC) codes for the coded cooperation. In particular, for the message broadcasting from the relay, we construct multi-edge-type (MET) SC-LDPC codes by repeatedly applying coset encoding. Due to the capacity-achieving property of the SC-LDPC codes, we prove that the capacity region can theoretically be achieved by the proposed MET SC-LDPC codes. Numerical results with finite node degrees are provided, which show that the achievable rates approach the boundary of the capacity region in both binary erasure channels and additive white Gaussian channels.