gene order compared: Topics by Science.gov

Sample records for gene order compared

Assessment of gene order computing methods for Alzheimer's disease

PubMed Central

2013-01-01

Background Computational genomics of Alzheimer disease (AD), the most common form of senile dementia, is a nascent field in AD research. The field includes AD gene clustering by computing gene order which generates higher quality gene clustering patterns than most other clustering methods. However, there are few available gene order computing methods such as Genetic Algorithm (GA) and Ant Colony Optimization (ACO). Further, their performance in gene order computation using AD microarray data is not known. We thus set forth to evaluate the performances of current gene order computing methods with different distance formulas, and to identify additional features associated with gene order computation. Methods Using different distance formulas- Pearson distance and Euclidean distance, the squared Euclidean distance, and other conditions, gene orders were calculated by ACO and GA (including standard GA and improved GA) methods, respectively. The qualities of the gene orders were compared, and new features from the calculated gene orders were identified. Results Compared to the GA methods tested in this study, ACO fits the AD microarray data the best when calculating gene order. In addition, the following features were revealed: different distance formulas generated a different quality of gene order, and the commonly used Pearson distance was not the best distance formula when used with both GA and ACO methods for AD microarray data. Conclusion Compared with Pearson distance and Euclidean distance, the squared Euclidean distance generated the best quality gene order computed by GA and ACO methods. PMID:23369541
PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

PubMed Central

Fong, Christine; Rohmer, Laurence; Radey, Matthew; Wasnick, Michael; Brittnacher, Mitchell J

2008-01-01

Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any web browser with no client side software setup or installation required. Source code is freely available to researchers interested in setting up a local version of PSAT for analysis of genomes not available through the public server. Access to the public web server and instructions for obtaining source code can be found at . PMID:18366802
YouGenMap: a web platform for dynamic multi-comparative mapping and visualization of genetic maps

Treesearch

Keith Batesole; Kokulapalan Wimalanathan; Lin Liu; Fan Zhang; Craig S. Echt; Chun Liang

2014-01-01

Comparative genetic maps are used in examination of genome organization, detection of conserved gene order, and exploration of marker order variations. YouGenMap is an open-source web tool that offers dynamic comparative mapping capability of users' own genetic mapping between 2 or more map sets. Users' genetic map data and optional gene annotations are...
Conserved Gene Order and Expanded Inverted Repeats Characterize Plastid Genomes of Thalassiosirales

PubMed Central

Ashworth, Matt P.; Baeshen, Nabih A.; Baeshen, Mohammad N.; Bahieldin, Ahmed; Theriot, Edward C.; Jansen, Robert K.

2014-01-01

Diatoms are mostly photosynthetic eukaryotes within the heterokont lineage. Variable plastid genome sizes and extensive genome rearrangements have been observed across the diatom phylogeny, but little is known about plastid genome evolution within order- or family-level clades. The Thalassiosirales is one of the more comprehensively studied orders in terms of both genetics and morphology. Seven complete diatom plastid genomes are reported here including four Thalassiosirales: Thalassiosira weissflogii, Roundia cardiophora, Cyclotella sp. WC03_2, Cyclotella sp. L04_2, and three additional non-Thalassiosirales species Chaetoceros simplex, Cerataulina daemon, and Rhizosolenia imbricata. The sizes of the seven genomes vary from 116,459 to 129,498 bp, and their genomes are compact and lack introns. The larger size of the plastid genomes of Thalassiosirales compared to other diatoms is due primarily to expansion of the inverted repeat. Gene content within Thalassiosirales is more conserved compared to other diatom lineages. Gene order within Thalassiosirales is highly conserved except for the extensive genome rearrangement in Thalassiosira oceanica. Cyclotella nana, Thalassiosira weissflogii and Roundia cardiophora share an identical gene order, which is inferred to be the ancestral order for the Thalassiosirales, differing from that of the other two Cyclotella species by a single inversion. The genes ilvB and ilvH are missing in all six diatom plastid genomes except for Cerataulina daemon, suggesting an independent gain of these genes in this species. The acpP1 gene is missing in all Thalassiosirales, suggesting that its loss may be a synapomorphy for the order and this gene may have been functionally transferred to the nucleus. Three genes involved in photosynthesis, psaE, psaI, psaM, are missing in Rhizosolenia imbricata, which represents the first documented instance of the loss of photosynthetic genes in diatom plastid genomes. PMID:25233465
The complete mitochondrial genome of the common sea slater, Ligia oceanica (Crustacea, Isopoda) bears a novel gene order and unusual control region features

PubMed Central

Kilpert, Fabian; Podsiadlowski, Lars

2006-01-01

Background Sequence data and other characters from mitochondrial genomes (gene translocations, secondary structure of RNA molecules) are useful in phylogenetic studies among metazoan animals from population to phylum level. Moreover, the comparison of complete mitochondrial sequences gives valuable information about the evolution of small genomes, e.g. about different mechanisms of gene translocation, gene duplication and gene loss, or concerning nucleotide frequency biases. The Peracarida (gammarids, isopods, etc.) comprise about 21,000 species of crustaceans, living in many environments from deep sea floor to arid terrestrial habitats. Ligia oceanica is a terrestrial isopod living at rocky seashores of the european North Sea and Atlantic coastlines. Results The study reveals the first complete mitochondrial DNA sequence from a peracarid crustacean. The mitochondrial genome of Ligia oceanica is a circular double-stranded DNA molecule, with a size of 15,289 bp. It shows several changes in mitochondrial gene order compared to other crustacean species. An overview about mitochondrial gene order of all crustacean taxa yet sequenced is also presented. The largest non-coding part (the putative mitochondrial control region) of the mitochondrial genome of Ligia oceanica is unexpectedly not AT-rich compared to the remainder of the genome. It bears two repeat regions (4× 10 bp and 3× 64 bp), and a GC-rich hairpin-like secondary structure. Some of the transfer RNAs show secondary structures which derive from the usual cloverleaf pattern. While some tRNA genes are putative targets for RNA editing, trnR could not be localized at all. Conclusion Gene order is not conserved among Peracarida, not even among isopods. The two isopod species Ligia oceanica and Idotea baltica show a similarly derived gene order, compared to the arthropod ground pattern and to the amphipod Parhyale hawaiiensis, suggesting that most of the translocation events were already present the last common ancestor of these isopods. Beyond that, the positions of three tRNA genes differ in the two isopod species. Strand bias in nucleotide frequency is reversed in both isopod species compared to other Malacostraca. This is probably due to a reversal of the replication origin, which is further supported by the fact that the hairpin structure typically found in the control region shows a reversed orientation in the isopod species, compared to other crustaceans. PMID:16987408
Gene context conservation of a higher order than operons.

PubMed

Lathe, W C; Snel, B; Bork, P

2000-10-01

Operons, co-transcribed and co-regulated contiguous sets of genes, are poorly conserved over short periods of evolutionary time. The gene order, gene content and regulatory mechanisms of operons can be very different, even in closely related species. Here, we present several lines of evidence which suggest that, although an operon and its individual genes and regulatory structures are rearranged when comparing the genomes of different species, this rearrangement is a conservative process. Genomic rearrangements invariably maintain individual genes in very specific functional and regulatory contexts. We call this conserved context an uber-operon.
Inversions and Gene Order Shuffling in Anopheles gambiae and A. funestus

NASA Astrophysics Data System (ADS)

Sharakhov, Igor V.; Serazin, Andrew C.; Grushko, Olga G.; Dana, Ali; Lobo, Neil; Hillenmeyer, Maureen E.; Westerman, Richard; Romero-Severson, Jeanne; Costantini, Carlo; Sagnon, N'Fale; Collins, Frank H.; Besansky, Nora J.

2002-10-01

In tropical Africa, Anopheles funestus is one of the three most important malaria vectors. We physically mapped 157 A. funestus complementary DNAs (cDNAs) to the polytene chromosomes of this species. Sequences of the cDNAs were mapped in silico to the A. gambiae genome as part of a comparative genomic study of synteny, gene order, and sequence conservation between A. funestus and A. gambiae. These species are in the same subgenus and diverged about as recently as humans and chimpanzees. Despite nearly perfect preservation of synteny, we found substantial shuffling of gene order along corresponding chromosome arms. Since the divergence of these species, at least 70 chromosomal inversions have been fixed, the highest rate of rearrangement of any eukaryote studied to date. The high incidence of paracentric inversions and limited colinearity suggests that locating genes in one anopheline species based on gene order in another may be limited to closely related taxa.
The invasive MED/Q Bemisia tabaci genome: a tale of gene loss and gene gain

USDA-ARS?s Scientific Manuscript database

Whiteflies are a group of invasive crop pests that impact global agriculture. An analysis was conducted to compare draft genomes of two whitefly strains, which demonstrated the relative conserved gene order, but a number of genes were either novel (added) or omitted (deleted) between genomes. This...
Genome-wide gene order distances support clustering the gram-positive bacteria

PubMed Central

House, Christopher H.; Pellegrini, Matteo; Fitz-Gibbon, Sorel T.

2015-01-01

Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes and determining if the chosen orthologs were in the same order. The raw distances were then corrected for gene order convergence using an adaptation of the Jukes-Cantor model, as well as using the common distance correction D′ = −ln(1-D). First, we compared the distances found via the order of six orthologs to distances found based on ortholog gene content and small subunit rRNA sequences. The Jukes-Cantor gene order distances are reasonably well correlated with the divergence of rRNA (R2 = 0.24), especially at rRNA Jukes-Cantor distances of less than 0.2 (R2 = 0.52). Gene content is only weakly correlated with rRNA divergence (R2 = 0.04) over all distances, however, it is especially strongly correlated at rRNA Jukes-Cantor distances of less than 0.1 (R2 = 0.67). This initial work suggests that gene order may be useful in conjunction with other methods to help understand the relatedness of genomes. Using the gene order distances in 143 genomes, the relations of prokaryotes were studied using neighbor joining and agreement subtrees. We then repeated our study of the relations of prokaryotes using gene order in 172 complete genomes better representing a wider-diversity of prokaryotes. Consistently, our trees show the Actinobacteria as a sister group to the bulk of the Firmicutes. In fact, the robustness of gene order support was found to be considerably greater for uniting these two phyla than for uniting any of the proteobacterial classes together. The results are supportive of the idea that Actinobacteria and Firmicutes are closely related, which in turn implies a single origin for the gram-positive cell. PMID:25653643
Predicting Protein Function by Genomic Context: Quantitative Evaluation and Qualitative Inferences

PubMed Central

Huynen, Martijn; Snel, Berend; Lathe, Warren; Bork, Peer

2000-01-01

Various new methods have been proposed to predict functional interactions between proteins based on the genomic context of their genes. The types of genomic context that they use are Type I: the fusion of genes; Type II: the conservation of gene-order or co-occurrence of genes in potential operons; and Type III: the co-occurrence of genes across genomes (phylogenetic profiles). Here we compare these types for their coverage, their correlations with various types of functional interaction, and their overlap with homology-based function assignment. We apply the methods to Mycoplasma genitalium, the standard benchmarking genome in computational and experimental genomics. Quantitatively, conservation of gene order is the technique with the highest coverage, applying to 37% of the genes. By combining gene order conservation with gene fusion (6%), the co-occurrence of genes in operons in absence of gene order conservation (8%), and the co-occurrence of genes across genomes (11%), significant context information can be obtained for 50% of the genes (the categories overlap). Qualitatively, we observe that the functional interactions between genes are stronger as the requirements for physical neighborhood on the genome are more stringent, while the fraction of potential false positives decreases. Moreover, only in cases in which gene order is conserved in a substantial fraction of the genomes, in this case six out of twenty-five, does a single type of functional interaction (physical interaction) clearly dominate (>80%). In other cases, complementary function information from homology searches, which is available for most of the genes with significant genomic context, is essential to predict the type of interaction. Using a combination of genomic context and homology searches, new functional features can be predicted for 10% of M. genitalium genes. PMID:10958638
Comparing Pearson, Spearman and Hoeffding's D measure for gene expression association analysis.

PubMed

Fujita, André; Sato, João Ricardo; Demasi, Marcos Angelo Almeida; Sogayar, Mari Cleide; Ferreira, Carlos Eduardo; Miyano, Satoru

2009-08-01

DNA microarrays have become a powerful tool to describe gene expression profiles associated with different cellular states, various phenotypes and responses to drugs and other extra- or intra-cellular perturbations. In order to cluster co-expressed genes and/or to construct regulatory networks, definition of distance or similarity between measured gene expression data is usually required, the most common choices being Pearson's and Spearman's correlations. Here, we evaluate these two methods and also compare them with a third one, namely Hoeffding's D measure, which is used to infer nonlinear and non-monotonic associations, i.e. independence in a general sense. By comparing three different variable association approaches, namely Pearson's correlation, Spearman's correlation and Hoeffding's D measure, we aimed at assessing the most appropriate one for each purpose. Using simulations, we demonstrate that the Hoeffding's D measure outperforms Pearson's and Spearman's approaches in identifying nonlinear associations. Our results demonstrate that Hoeffding's D measure is less sensitive to outliers and is a more powerful tool to identify nonlinear and non-monotonic associations. We have also applied Hoeffding's D measure in order to identify new putative genes associated with tp53. Therefore, we propose the Hoeffding's D measure to identify nonlinear associations between gene expression profiles.
Probabilistic modeling of the evolution of gene synteny within reconciled phylogenies

PubMed Central

2015-01-01

Background Most models of genome evolution concern either genetic sequences, gene content or gene order. They sometimes integrate two of the three levels, but rarely the three of them. Probabilistic models of gene order evolution usually have to assume constant gene content or adopt a presence/absence coding of gene neighborhoods which is blind to complex events modifying gene content. Results We propose a probabilistic evolutionary model for gene neighborhoods, allowing genes to be inserted, duplicated or lost. It uses reconciled phylogenies, which integrate sequence and gene content evolution. We are then able to optimize parameters such as phylogeny branch lengths, or probabilistic laws depicting the diversity of susceptibility of syntenic regions to rearrangements. We reconstruct a structure for ancestral genomes by optimizing a likelihood, keeping track of all evolutionary events at the level of gene content and gene synteny. Ancestral syntenies are associated with a probability of presence. We implemented the model with the restriction that at most one gene duplication separates two gene speciations in reconciled gene trees. We reconstruct ancestral syntenies on a set of 12 drosophila genomes, and compare the evolutionary rates along the branches and along the sites. We compare with a parsimony method and find a significant number of results not supported by the posterior probability. The model is implemented in the Bio++ library. It thus benefits from and enriches the classical models and methods for molecular evolution. PMID:26452018
Multiple genome alignment for identifying the core structure among moderately related microbial genomes.

PubMed

Uchiyama, Ikuo

2008-10-31

Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.
Comparative linkage mapping of genes on sheep chromosome 3 provides evidence of chromosomal rearrangements in the evolution of the Bovidae.

PubMed

Jenkins, Z A; Henry, H M; Galloway, S M; Dodds, K G; Montgomery, G W

1997-01-01

Three genes--parathyroid hormone-like hormone (PTHLH), insulin-like growth factor 1 (IGF 1), and retinoic acid receptor gamma (RARG)--have been mapped to sheep (Ovis aries) chromosome 3 (OAR 3). The order and genetic distances between loci on OAR 3 are similar to those on cattle (Bos taurus) chromosome 5, as expected from their close evolutionary relationship. The OAR 3 linkage map shows conserved synteny with human chromosome 12, but there are at least two rearrangements in gene order between the species.
Two fundamentally different classes of microbial genes.

PubMed

Wolf, Yuri I; Makarova, Kira S; Lobkovsky, Alexander E; Koonin, Eugene V

2016-11-07

The evolution of bacterial and archaeal genomes is highly dynamic and involves extensive horizontal gene transfer and gene loss 1-4 . Furthermore, many microbial species appear to have open pangenomes, where each newly sequenced genome contains more than 10% ORFans, that is, genes without detectable homologues in other species 5,6 . Here, we report a quantitative analysis of microbial genome evolution by fitting the parameters of a simple, steady-state evolutionary model to the comparative genomic data on the gene content and gene order similarity between archaeal genomes. The results reveal two sharply distinct classes of microbial genes, one of which is characterized by effectively instantaneous gene replacement, and the other consists of genes with finite, distributed replacement rates. These findings imply a conservative estimate of the size of the prokaryotic genomic universe, which appears to consist of at least a billion distinct genes. Furthermore, the same distribution of constraints is shown to govern the evolution of gene complement and gene order, without the need to invoke long-range conservation or the selfish operon concept 7 .
Plastomes of the green algae Hydrodictyon reticulatum and Pediastrum duplex (Sphaeropleales, Chlorophyceae).

PubMed

McManus, Hilary A; Sanchez, Daniel J; Karol, Kenneth G

2017-01-01

Comparative studies of chloroplast genomes (plastomes) across the Chlorophyceae are revealing dynamic patterns of size variation, gene content, and genome rearrangements. Phylogenomic analyses are improving resolution of relationships, and uncovering novel lineages as new plastomes continue to be characterized. To gain further insight into the evolution of the chlorophyte plastome and increase the number of representative plastomes for the Sphaeropleales, this study presents two fully sequenced plastomes from the green algal family Hydrodictyaceae (Sphaeropleales, Chlorophyceae), one from Hydrodictyon reticulatum and the other from Pediastrum duplex . Genomic DNA from Hydrodictyon reticulatum and Pediastrum duplex was subjected to Illumina paired-end sequencing and the complete plastomes were assembled for each. Plastome size and gene content were characterized and compared with other plastomes from the Sphaeropleales. Homology searches using BLASTX were used to characterize introns and open reading frames (orfs) ≥ 300 bp. A phylogenetic analysis of gene order across the Sphaeropleales was performed. The plastome of Hydrodictyon reticulatum is 225,641 bp and Pediastrum duplex is 232,554 bp. The plastome structure and gene order of H. reticulatum and P. duplex are more similar to each other than to other members of the Sphaeropleales. Numerous unique open reading frames are found in both plastomes and the plastome of P. duplex contains putative viral protein genes, not found in other Sphaeropleales plastomes. Gene order analyses support the monophyly of the Hydrodictyaceae and their sister relationship to the Neochloridaceae. The complete plastomes of Hydrodictyon reticulatum and Pediastrum duplex , representing the largest of the Sphaeropleales sequenced thus far, once again highlight the variability in size, architecture, gene order and content across the Chlorophyceae. Novel intron insertion sites and unique orfs indicate recent, independent invasions into each plastome, a hypothesis testable with an expanded plastome investigation within the Hydrodictyaceae.
Genomicus update 2015: KaryoView and MatrixView provide a genome-wide perspective to multispecies comparative genomics

PubMed Central

Louis, Alexandra; Nguyen, Nga Thi Thuy; Muffato, Matthieu; Roest Crollius, Hugues

2015-01-01

The Genomicus web server (http://www.genomicus.biologie.ens.fr/genomicus) is a visualization tool allowing comparative genomics in four different phyla (Vertebrate, Fungi, Metazoan and Plants). It provides access to genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants. Here we present the new features available for vertebrate genome with a focus on new graphical tools. The interface to enter the database has been improved, two pairwise genome comparison tools are now available (KaryoView and MatrixView) and the multiple genome comparison tools (PhyloView and AlignView) propose three new kinds of representation and a more intuitive menu. These new developments have been implemented for Genomicus portal dedicated to vertebrates. This allows the analysis of 68 extant animal genomes, as well as 58 ancestral reconstructed genomes. The Genomicus server also provides access to ancestral gene orders, to facilitate evolutionary and comparative genomics studies, as well as computationally predicted regulatory interactions, thanks to the representation of conserved non-coding elements with their putative gene targets. PMID:25378326
Comparative Analysis of Four Calypogeia Species Revealed Unexpected Change in Evolutionarily-Stable Liverwort Mitogenomes

PubMed Central

Ślipiko, Monika; Buczkowska-Chmielewska, Katarzyna; Bączkiewicz, Alina; Szczecińska, Monika; Sawicki, Jakub

2017-01-01

Liverwort mitogenomes are considered to be evolutionarily stable. A comparative analysis of four Calypogeia species revealed differences compared to previously sequenced liverwort mitogenomes. Such differences involve unexpected structural changes in the two genes, cox1 and atp1, which have lost three and two introns, respectively. The group I introns in the cox1 gene are proposed to have been lost by two-step localized retroprocessing, whereas one-step retroprocessing could be responsible for the disappearance of the group II introns in the atp1 gene. These cases represent the first identified losses of introns in mitogenomes of leafy liverworts (Jungermanniopsida) contrasting the stability of mitochondrial gene order with certain changes in the gene content and intron set in liverworts. PMID:29257096
GreenPhylDB v2.0: comparative and functional genomics in plants.

PubMed

Rouard, Mathieu; Guignon, Valentin; Aluome, Christelle; Laporte, Marie-Angélique; Droc, Gaëtan; Walde, Christian; Zmasek, Christian M; Périn, Christophe; Conte, Matthieu G

2011-01-01

GreenPhylDB is a database designed for comparative and functional genomics based on complete genomes. Version 2 now contains sixteen full genomes of members of the plantae kingdom, ranging from algae to angiosperms, automatically clustered into gene families. Gene families are manually annotated and then analyzed phylogenetically in order to elucidate orthologous and paralogous relationships. The database offers various lists of gene families including plant, phylum and species specific gene families. For each gene cluster or gene family, easy access to gene composition, protein domains, publications, external links and orthologous gene predictions is provided. Web interfaces have been further developed to improve the navigation through information related to gene families. New analysis tools are also available, such as a gene family ontology browser that facilitates exploration. GreenPhylDB is a component of the South Green Bioinformatics Platform (http://southgreen.cirad.fr/) and is accessible at http://greenphyl.cirad.fr. It enables comparative genomics in a broad taxonomy context to enhance the understanding of evolutionary processes and thus tends to speed up gene discovery.
Comparative genomics of ParaHox clusters of teleost fishes: gene cluster breakup and the retention of gene sets following whole genome duplications

PubMed Central

Siegel, Nicol; Hoegg, Simone; Salzburger, Walter; Braasch, Ingo; Meyer, Axel

2007-01-01

Background The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that ParaHox genes do not differ in number in extant teleost fishes despite their additional genome duplication from the genomic situation in mammals, but they are distributed over twice as many paralogous regions in fish genomes. Results We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as it had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seems to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motives of the genes belonging to the ParaHox paralogons. Conclusion There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. The high degree of evolutionary conservation of this gene cluster's architecture in particular – but possibly clusters of genes more generally – might be linked to the presence of promoter, enhancer or inhibitor motifs that serve to regulate more than just one gene. Therefore, deletions, inversions or relocations of individual genes could destroy the regulation of the clustered genes in this region. The existence of such a regulation network might explain the evolutionary conservation of gene order and orientation over the course of hundreds of millions of years of vertebrate evolution. Another possible explanation for the highly conserved gene order might be the existence of a regulator not located immediately next to its corresponding gene but further away since a relocation or inversion would possibly interrupt this interaction. Different ParaHox clusters were found to have experienced differential gene loss in teleosts. Yet the complete set of these homeobox genes was maintained, albeit distributed over almost twice the number of chromosomes. Selection due to dosage effects and/or stoichiometric disturbance might act more strongly to maintain a modal number of homeobox genes (and possibly transcription factors more generally) per genome, yet permit the accumulation of other (non regulatory) genes associated with these homeobox gene clusters. PMID:17822543

Genomicus update 2015: KaryoView and MatrixView provide a genome-wide perspective to multispecies comparative genomics.

PubMed

Louis, Alexandra; Nguyen, Nga Thi Thuy; Muffato, Matthieu; Roest Crollius, Hugues

2015-01-01

The Genomicus web server (http://www.genomicus.biologie.ens.fr/genomicus) is a visualization tool allowing comparative genomics in four different phyla (Vertebrate, Fungi, Metazoan and Plants). It provides access to genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants. Here we present the new features available for vertebrate genome with a focus on new graphical tools. The interface to enter the database has been improved, two pairwise genome comparison tools are now available (KaryoView and MatrixView) and the multiple genome comparison tools (PhyloView and AlignView) propose three new kinds of representation and a more intuitive menu. These new developments have been implemented for Genomicus portal dedicated to vertebrates. This allows the analysis of 68 extant animal genomes, as well as 58 ancestral reconstructed genomes. The Genomicus server also provides access to ancestral gene orders, to facilitate evolutionary and comparative genomics studies, as well as computationally predicted regulatory interactions, thanks to the representation of conserved non-coding elements with their putative gene targets. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
The mitochondrial genome of the ascalaphid owlfly Libelloides macaronius and comparative evolutionary mitochondriomics of neuropterid insects

PubMed Central

2011-01-01

Background The insect order Neuroptera encompasses more than 5,700 described species. To date, only three neuropteran mitochondrial genomes have been fully and one partly sequenced. Current knowledge on neuropteran mitochondrial genomes is limited, and new data are strongly required. In the present work, the mitochondrial genome of the ascalaphid owlfly Libelloides macaronius is described and compared with the known neuropterid mitochondrial genomes: Megaloptera, Neuroptera and Raphidioptera. These analyses are further extended to other endopterygotan orders. Results The mitochondrial genome of L. macaronius is a circular molecule 15,890 bp long. It includes the entire set of 37 genes usually present in animal mitochondrial genomes. The gene order of this newly sequenced genome is unique among Neuroptera and differs from the ancestral type of insects in the translocation of trnC. The L. macaronius genome shows the lowest A+T content (74.50%) among known neuropterid genomes. Protein-coding genes possess the typical mitochondrial start codons, except for cox1, which has an unusual ACG. Comparisons among endopterygotan mitochondrial genomes showed that A+T content and AT/GC-skews exhibit a broad range of variation among 84 analyzed taxa. Comparative analyses showed that neuropterid mitochondrial protein-coding genes experienced complex evolutionary histories, involving features ranging from codon usage to rate of substitution, that make them potential markers for population genetics/phylogenetics studies at different taxonomic ranks. The 22 tRNAs show variable substitution patterns in Neuropterida, with higher sequence conservation in genes located on the α strand. Inferred secondary structures for neuropterid rrnS and rrnL genes largely agree with those known for other insects. For the first time, a model is provided for domain I of an insect rrnL. The control region in Neuropterida, as in other insects, is fast-evolving genomic region, characterized by AT-rich motifs. Conclusions The new genome shares many features with known neuropteran genomes but differs in its low A+T content. Comparative analysis of neuropterid mitochondrial genes showed that they experienced distinct evolutionary patterns. Both tRNA families and ribosomal RNAs show composite substitution pathways. The neuropterid mitochondrial genome is characterized by a complex evolutionary history. PMID:21569260
Uterine responses to early pre-attachment embryos in the domestic dog and comparisons with other domestic animal species†

PubMed Central

Graubner, Felix R.; Gram, Aykut; Kautz, Ewa; Bauersachs, Stefan; Aslan, Selim; Agaoglu, Ali R.; Boos, Alois

2017-01-01

Abstract In the dog, there is no luteolysis in the absence of pregnancy. Thus, this species lacks any anti-luteolytic endocrine signal as found in other species that modulate uterine function during the critical period of pregnancy establishment. Nevertheless, in the dog an embryo-maternal communication must occur in order to prevent rejection of embryos. Based on this hypothesis, we performed microarray analysis of canine uterine samples collected during pre-attachment phase (days 10-12) and in corresponding non-pregnant controls, in order to elucidate the embryo attachment signal. An additional goal was to identify differences in uterine responses to pre-attachment embryos between dogs and other mammalian species exhibiting different reproductive patterns with regard to luteolysis, implantation, and preparation for placentation. Therefore, the canine microarray data were compared with gene sets from pigs, cattle, horses, and humans. We found 412 genes differentially regulated between the two experimental groups. The functional terms most strongly enriched in response to pre-attachment embryos related to extracellular matrix function and remodeling, and to immune and inflammatory responses. Several candidate genes were validated by semi-quantitative PCR. When compared with other species, best matches were found with human and equine counterparts. Especially for the pig, the majority of overlapping genes showed opposite expression patterns. Interestingly, 1926 genes did not pair with any of the other gene sets. Using a microarray approach, we report the uterine changes in the dog driven by the presence of embryos and compare these results with datasets from other mammalian species, finding common-, contrary-, and exclusively canine-regulated genes. PMID:28651344
Conservation of regulatory sequences and gene expression patterns in the disintegrating Drosophila Hox gene complex

PubMed Central

Negre, Bárbara; Casillas, Sònia; Suzanne, Magali; Sánchez-Herrero, Ernesto; Akam, Michael; Nefedov, Michael; Barbadilla, Antonio; de Jong, Pieter; Ruiz, Alfredo

2005-01-01

Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been rearranged differently in several Drosophila species, producing a striking diversity of Hox gene organizations. We investigated the genomic and functional consequences of the two HOM-C splits present in Drosophila buzzatii. Firstly, we sequenced two regions of the D. buzzatii genome, one containing the genes labial and abdominal A, and another one including proboscipedia, and compared their organization with that of D. melanogaster and D. pseudoobscura in order to map precisely the two splits. Then, a plethora of conserved noncoding sequences, which are putative enhancers, were identified around the three Hox genes closer to the splits. The position and order of these enhancers are conserved, with minor exceptions, between the three Drosophila species. Finally, we analyzed the expression patterns of the same three genes in embryos and imaginal discs of four Drosophila species with different Hox-gene organizations. The results show that their expression patterns are conserved despite the HOM-C splits. We conclude that, in Drosophila, Hox-gene clustering is not an absolute requirement for proper function. Rather, the organization of Hox genes is modular, and their clustering seems the result of phylogenetic inertia more than functional necessity. PMID:15867430
Plastome Evolution in Hemiparasitic Mistletoes

PubMed Central

Petersen, Gitte; Cuenca, Argelia; Seberg, Ole

2015-01-01

Santalales is an order of plants consisting almost entirely of parasites. Some, such as Osyris, are facultative root parasites whereas others, such as Viscum, are obligate stem parasitic mistletoes. Here, we report the complete plastome sequences of one species of Osyris and three species of Viscum, and we investigate the evolutionary aspects of structural changes and changes in gene content in relation to parasitism. Compared with typical angiosperms plastomes, the four Santalales plastomes are all reduced in size (10–22% compared with Vitis), and they have experienced rearrangements, mostly but not exclusively in the border areas of the inverted repeats. Additionally, a number of protein-coding genes (matK, infA, ccsA, rpl33, and all 11 ndh genes) as well as two transfer RNA genes (trnG-UCC and trnV-UAC) have been pseudogenized or completely lost. Most of the remaining plastid genes have a significantly changed selection pattern compared with other dicots, and the relaxed selection of photosynthesis genes is noteworthy. Although gene loss obviously reduces plastome size, intergenic regions were also shortened. As plastome modifications are generally most prominent in Viscum, they are most likely correlated with the increased nutritional dependence on the host compared with Osyris. PMID:26319577
Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

PubMed Central

Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Hubisz, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Zhang, Peili; Liu, Jing; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catharine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenée; Verduzco, Daniel; Clerc-Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

2005-01-01

We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25–55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species—but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila. PMID:15632085
Estimation of gene induction enables a relevance-based ranking of gene sets.

PubMed

Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens

2009-07-01

In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshhold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
Arm-specific dynamics of chromosome evolution in malaria mosquitoes

PubMed Central

2011-01-01

Background The malaria mosquito species of subgenus Cellia have rich inversion polymorphisms that correlate with environmental variables. Polymorphic inversions tend to cluster on the chromosomal arms 2R and 2L but not on X, 3R and 3L in Anopheles gambiae and homologous arms in other species. However, it is unknown whether polymorphic inversions on homologous chromosomal arms of distantly related species from subgenus Cellia nonrandomly share similar sets of genes. It is also unclear if the evolutionary breakage of inversion-poor chromosomal arms is under constraints. Results To gain a better understanding of the arm-specific differences in the rates of genome rearrangements, we compared gene orders and established syntenic relationships among Anopheles gambiae, Anopheles funestus, and Anopheles stephensi. We provided evidence that polymorphic inversions on the 2R arms in these three species nonrandomly captured similar sets of genes. This nonrandom distribution of genes was not only a result of preservation of ancestral gene order but also an outcome of extensive reshuffling of gene orders that created new combinations of homologous genes within independently originated polymorphic inversions. The statistical analysis of distribution of conserved gene orders demonstrated that the autosomal arms differ in their tolerance to generating evolutionary breakpoints. The fastest evolving 2R autosomal arm was enriched with gene blocks conserved between only a pair of species. In contrast, all identified syntenic blocks were preserved on the slowly evolving 3R arm of An. gambiae and on the homologous arms of An. funestus and An. stephensi. Conclusions Our results suggest that natural selection favors specific gene combinations within polymorphic inversions when distant species are exposed to similar environmental pressures. This knowledge could be useful for the discovery of genes responsible for an association of inversion polymorphisms with phenotypic variations in multiple species. Our data support the chromosomal arm specificity in rates of gene order disruption during mosquito evolution. We conclude that the distribution of breakpoint regions is evolutionary conserved on slowly evolving arms and tends to be lineage-specific on rapidly evolving arms. PMID:21473772
Inferring genetic interactions from comparative fitness data

PubMed Central

2017-01-01

Darwinian fitness is a central concept in evolutionary biology. In practice, however, it is hardly possible to measure fitness for all genotypes in a natural population. Here, we present quantitative tools to make inferences about epistatic gene interactions when the fitness landscape is only incompletely determined due to imprecise measurements or missing observations. We demonstrate that genetic interactions can often be inferred from fitness rank orders, where all genotypes are ordered according to fitness, and even from partial fitness orders. We provide a complete characterization of rank orders that imply higher order epistasis. Our theory applies to all common types of gene interactions and facilitates comprehensive investigations of diverse genetic interactions. We analyzed various genetic systems comprising HIV-1, the malaria-causing parasite Plasmodium vivax, the fungus Aspergillus niger, and the TEM-family of β-lactamase associated with antibiotic resistance. For all systems, our approach revealed higher order interactions among mutations. PMID:29260711
Inferring genetic interactions from comparative fitness data.

PubMed

Crona, Kristina; Gavryushkin, Alex; Greene, Devin; Beerenwinkel, Niko

2017-12-20

Darwinian fitness is a central concept in evolutionary biology. In practice, however, it is hardly possible to measure fitness for all genotypes in a natural population. Here, we present quantitative tools to make inferences about epistatic gene interactions when the fitness landscape is only incompletely determined due to imprecise measurements or missing observations. We demonstrate that genetic interactions can often be inferred from fitness rank orders, where all genotypes are ordered according to fitness, and even from partial fitness orders. We provide a complete characterization of rank orders that imply higher order epistasis. Our theory applies to all common types of gene interactions and facilitates comprehensive investigations of diverse genetic interactions. We analyzed various genetic systems comprising HIV-1, the malaria-causing parasite Plasmodium vivax , the fungus Aspergillus niger , and the TEM-family of β-lactamase associated with antibiotic resistance. For all systems, our approach revealed higher order interactions among mutations.
Primary and Secondary Abscission in Pisum sativum and Euphorbia pulcherrima—How Do They Compare and How Do They Differ?

PubMed Central

Hvoslef-Eide, Anne K.; Munster, Cristel M.; Mathiesen, Cecilie A.; Ayeh, Kwadwo O.; Melby, Tone I.; Rasolomanana, Paoly; Lee, YeonKyeong

2016-01-01

Abscission is a highly regulated and coordinated developmental process in plants. It is important to understand the processes leading up to the event, in order to better control abscission in crop plants. This has the potential to reduce yield losses in the field and increase the ornamental value of flowers and potted plants. A reliable method of abscission induction in poinsettia (Euphorbia pulcherrima) flowers has been established to study the process in a comprehensive manner. By correctly decapitating buds of the third order, abscission can be induced in 1 week. AFLP differential display (DD) was used to search for genes regulating abscission. Through validation using qRT-PCR, more information of the genes involved during induced secondary abscission have been obtained. A study using two pea (Pisum sativum) mutants in the def (Developmental funiculus) gene, which was compared with wild type peas (tall and dwarf in both cases) was performed. The def mutant results in a deformed, abscission-less zone instead of normal primary abscission at the funiculus. RNA in situ hybridization studies using gene sequences from the poinsettia differential display, resulted in six genes differentially expressed for abscission specific genes in both poinsettia and pea. Two of these genes are associated with gene up- or down-regulation during the first 2 days after decapitation in poinsettia. Present and previous results in poinsettia (biochemically and gene expressions), enables a more detailed division of the secondary abscission phases in poinsettia than what has previously been described from primary abscission in Arabidopsis. This study compares the inducible secondary abscission in poinsettia and the non-abscising mutants/wild types in pea demonstrating primary abscission zones. The results may have wide implications on the understanding of abscission, since pea and poinsettia have been separated for 94–98 million years in evolution, hence any genes or processes in common are bound to be widespread in the plant kingdom. PMID:26858724
Entropy Based Genetic Association Tests and Gene-Gene Interaction Tests

PubMed Central

de Andrade, Mariza; Wang, Xin

2011-01-01

In the past few years, several entropy-based tests have been proposed for testing either single SNP association or gene-gene interaction. These tests are mainly based on Shannon entropy and have higher statistical power when compared to standard χ2 tests. In this paper, we extend some of these tests using a more generalized entropy definition, Rényi entropy, where Shannon entropy is a special case of order 1. The order λ (>0) of Rényi entropy weights the events (genotype/haplotype) according to their probabilities (frequencies). Higher λ places more emphasis on higher probability events while smaller λ (close to 0) tends to assign weights more equally. Thus, by properly choosing the λ, one can potentially increase the power of the tests or the p-value level of significance. We conducted simulation as well as real data analyses to assess the impact of the order λ and the performance of these generalized tests. The results showed that for dominant model the order 2 test was more powerful and for multiplicative model the order 1 or 2 had similar power. The analyses indicate that the choice of λ depends on the underlying genetic model and Shannon entropy is not necessarily the most powerful entropy measure for constructing genetic association or interaction tests. PMID:23089811
Comparative genomic analysis and expression of the APETALA2-like genes from barley, wheat, and barley-wheat amphiploids

PubMed Central

Gil-Humanes, Javier; Pistón, Fernando; Martín, Antonio; Barro, Francisco

2009-01-01

Background The APETALA2-like genes form a large multi-gene family of transcription factors which play an important role during the plant life cycle, being key regulators of many developmental processes. Many studies in Arabidopsis have revealed that the APETALA2 (AP2) gene is implicated in the establishment of floral meristem and floral organ identity as well as temporal and spatial regulation of flower homeotic gene expression. Results In this work, we have cloned and characterised the AP2-like gene from accessions of Hordeum chilense and Hordeum vulgare, wild and domesticated barley, respectively, and compared with other AP2 homoeologous genes, including the Q gene in wheat. The Hordeum AP2-like genes contain two plant-specific DNA binding motifs called AP2 domains, as does the Q gene of wheat. We confirm that the H. chilense AP2-like gene is located on chromosome 5Hch. Patterns of expression of the AP2-like genes were examined in floral organs and other tissues in barley, wheat and in tritordeum amphiploids (barley × wheat hybrids). In tritordeum amphiploids, the level of transcription of the barley AP2-like gene was lower than in its barley parental and the chromosome substitutions 1D/1Hch and 2D/2Hch were seen to modify AP2 gene expression levels. Conclusion The results are of interest in order to understand the role of the AP2-like gene in the spike morphology of barley and wheat, and to understand the regulation of this gene in the amphiploids obtained from barley-wheat crossing. This information may have application in cereal breeding programs to up- or down-regulate the expression of AP2-like genes in order to modify spike characteristics and to obtain free-threshing plants. PMID:19480686
Activation and comparative analysis of cryptic xiamycin gene cluster from marine-derived Streptomyces sp. FXJ 7.388.

PubMed

Uhong Lü, Yuhong; Liu, Xiaoli; Wang, Miao; Li, Yuanyuan; Liu, Ning; Bao, Yuxin; Liu, Minghao; Li, Xiaoqian; Wang, Yinyin; Qian, Shenyan; Yue, Changwu; Huang, Ying

2016-09-01

In order to obtain the natural products synthesized by the three putative xiamycin biosynthesis gene clusters which were predicted via antiSMASH during the genome mining of marine Streptomyces sp. FXJ 7.388, Streptomyces sp. FXJ 8.012, and Streptomyces olivaceus FXJ 7.023. Sixteen genes involved in xiamycin assembly, modification, and regulation with higher identity than the newest reported xiamycin biosynthetic gene cluster from marine Streptomyces sp. SCSIO 02999, Streptomyces sp. HKI0576, and Streptomyces sp. FXJ 7.388 were discovered via gene cluster comparative analysis. A ribosome engineering strategy was adopted to activate such cryptic gene clusters with different final concentrations antibiotics that act on the ribosome, and two indolosesquiterpenes were isolated from idlethaldose streptomycin-resistant Streptomyces sp. FXJ 7.388 strains. However, no such product was detected in Streptomyces sp. FXJ 8.012 and Streptomyces olivaceus FXJ 7.023 under the same treatment. This result suggested that these genes might hold the least gene content for xiamycin biosynthesis.
Complete mitochondrial genomes and nuclear ribosomal RNA operons of two species of Diplostomum (Platyhelminthes: Trematoda): a molecular resource for taxonomy and molecular epidemiology of important fish pathogens.

PubMed

Brabec, Jan; Kostadinova, Aneta; Scholz, Tomáš; Littlewood, D Timothy J

2015-06-19

The genus Diplostomum (Platyhelminthes: Trematoda: Diplostomidae) is a diverse group of freshwater parasites with complex life-cycles and global distribution. The larval stages are important pathogens causing eye fluke disease implicated in substantial impacts on natural fish populations and losses in aquaculture. However, the problematic species delimitation and difficulties in the identification of larval stages hamper the assessment of the distributional and host ranges of Diplostomum spp. and their transmission ecology. Total genomic DNA was isolated from adult worms and shotgun sequenced using Illumina MiSeq technology. Mitochondrial (mt) genomes and nuclear ribosomal RNA (rRNA) operons were assembled using established bioinformatic tools and fully annotated. Mt protein-coding genes and nuclear rRNA genes were subjected to phylogenetic analysis by maximum likelihood and the resulting topologies compared. We characterised novel complete mt genomes and nuclear rRNA operons of two closely related species, Diplostomum spathaceum and D. pseudospathaceum. Comparative mt genome assessment revealed that the cox1 gene and its 'barcode' region used for molecular identification are the most conserved regions; instead, nad4 and nad5 genes were identified as most promising molecular diagnostic markers. Using the novel data, we provide the first genome wide estimation of the phylogenetic relationships of the order Diplostomida, one of the two fundamental lineages of the Digenea. Analyses of the mitogenomic data invariably recovered the Diplostomidae as a sister lineage of the order Plagiorchiida rather than as a basal lineage of the Diplostomida as inferred in rDNA phylogenies; this was concordant with the mt gene order of Diplostomum spp. exhibiting closer match to the conserved gene order of the Plagiorchiida. Complete sequences of the mt genome and rRNA operon of two species of Diplostomum provide a valuable resource for novel genetic markers for species delineation and large-scale molecular epidemiology and disease ecology studies based on the most accessible life-cycle stages of eye flukes.
Uterine responses to early pre-attachment embryos in the domestic dog and comparisons with other domestic animal species.

PubMed

Graubner, Felix R; Gram, Aykut; Kautz, Ewa; Bauersachs, Stefan; Aslan, Selim; Agaoglu, Ali R; Boos, Alois; Kowalewski, Mariusz P

2017-08-01

In the dog, there is no luteolysis in the absence of pregnancy. Thus, this species lacks any anti-luteolytic endocrine signal as found in other species that modulate uterine function during the critical period of pregnancy establishment. Nevertheless, in the dog an embryo-maternal communication must occur in order to prevent rejection of embryos. Based on this hypothesis, we performed microarray analysis of canine uterine samples collected during pre-attachment phase (days 10-12) and in corresponding non-pregnant controls, in order to elucidate the embryo attachment signal. An additional goal was to identify differences in uterine responses to pre-attachment embryos between dogs and other mammalian species exhibiting different reproductive patterns with regard to luteolysis, implantation, and preparation for placentation. Therefore, the canine microarray data were compared with gene sets from pigs, cattle, horses, and humans. We found 412 genes differentially regulated between the two experimental groups. The functional terms most strongly enriched in response to pre-attachment embryos related to extracellular matrix function and remodeling, and to immune and inflammatory responses. Several candidate genes were validated by semi-quantitative PCR. When compared with other species, best matches were found with human and equine counterparts. Especially for the pig, the majority of overlapping genes showed opposite expression patterns. Interestingly, 1926 genes did not pair with any of the other gene sets. Using a microarray approach, we report the uterine changes in the dog driven by the presence of embryos and compare these results with datasets from other mammalian species, finding common-, contrary-, and exclusively canine-regulated genes. © The Authors 2017. Published by Oxford University Press on behalf of Society for the Study of Reproduction.
Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification.

PubMed

Alshamlan, Hala M; Badr, Ghada H; Alohali, Yousef A

2015-06-01

Naturally inspired evolutionary algorithms prove effectiveness when used for solving feature selection and classification problems. Artificial Bee Colony (ABC) is a relatively new swarm intelligence method. In this paper, we propose a new hybrid gene selection method, namely Genetic Bee Colony (GBC) algorithm. The proposed algorithm combines the used of a Genetic Algorithm (GA) along with Artificial Bee Colony (ABC) algorithm. The goal is to integrate the advantages of both algorithms. The proposed algorithm is applied to a microarray gene expression profile in order to select the most predictive and informative genes for cancer classification. In order to test the accuracy performance of the proposed algorithm, extensive experiments were conducted. Three binary microarray datasets are use, which include: colon, leukemia, and lung. In addition, another three multi-class microarray datasets are used, which are: SRBCT, lymphoma, and leukemia. Results of the GBC algorithm are compared with our recently proposed technique: mRMR when combined with the Artificial Bee Colony algorithm (mRMR-ABC). We also compared the combination of mRMR with GA (mRMR-GA) and Particle Swarm Optimization (mRMR-PSO) algorithms. In addition, we compared the GBC algorithm with other related algorithms that have been recently published in the literature, using all benchmark datasets. The GBC algorithm shows superior performance as it achieved the highest classification accuracy along with the lowest average number of selected genes. This proves that the GBC algorithm is a promising approach for solving the gene selection problem in both binary and multi-class cancer classification. Copyright © 2015 Elsevier Ltd. All rights reserved.
Comparative analysis of chloroplast genomes of the genus Citrus and its close relatives.

PubMed

Liu, Xiaogang; Wu, Hongkun; Luo, Yan; Xi, Wanpeng; Zhou, Zhiqin

2017-01-01

The genus Citrus and its close relatives are economically and nutritionally important fruit trees. However, the huge controversy over the phylogeny of key wild species, as well as the genetic relationship between the cultivated species and their putative wild progenitors, remains unresolved. Comparative analyses of chloroplast (cp) genomes have been useful in resolving various phylogenetic issues. Thus far, the cp genomes of only two Citrus species have been sequenced. In this study, we sequenced six complete cp genomes, four belonging to the genus Citrus, and two belonging to the genera Fortunella and Poncirus, respectively. These newly sequenced genomes together with the two publicly available were used for comparative analyses of the genus Citrus and its close relatives. All eight cp genomes share similar basic structure, gene order and gene content. Phylogenetic analyses supported the monophyly of the three genera in the order Sapindales within the major clade Malvidae.
Understanding the pharmacogenetics of selective serotonin reuptake inhibitors.

PubMed

Fabbri, Chiara; Minarini, Alessandro; Niitsu, Tomihisa; Serretti, Alessandro

2014-08-01

The genetic background of antidepressant response represents a unique opportunity to identify biological markers of treatment outcome. Encouraging results alternating with inconsistent findings made antidepressant pharmacogenetics a stimulating but often discouraging field that requires careful discussion about cumulative evidence and methodological issues. The present review discusses both known and less replicated genes that have been implicated in selective serotonin reuptake inhibitors (SSRIs) efficacy and side effects. Candidate genes studies and genome-wide association studies (GWAS) were collected through MEDLINE database search (articles published till January 2014). Further, GWAS signals localized in promising genetic regions according to candidate gene studies are reported in order to assess the general comparability of results obtained through these two types of pharmacogenetic studies. Finally, a pathway enrichment approach is applied to the top genes (those harboring SNPs with p < 0.0001) outlined by previous GWAS in order to identify possible molecular mechanisms involved in SSRI effect. In order to improve the understanding of SSRI pharmacogenetics, the present review discusses the proposal of moving from the analysis of individual polymorphisms to genes and molecular pathways, and from the separation across different methodological approaches to their combination. Efforts in this direction are justified by the recent evidence of a favorable cost-utility of gene-guided antidepressant treatment.
Evolution of multicopper oxidase genes in coprophilous and non-coprophilous members of the order sordariales.

PubMed

Pöggeler, Stefanie

2011-04-01

Multicopper oxidases (MCO) catalyze the biological oxidation of various aromatic substrates and have been identified in plants, insects, bacteria, and wood rotting fungi. In nature, they are involved in biodegradation of biopolymers such as lignin and humic compounds, but have also been tested for various industrial applications. In fungi, MCOs have been shown to play important roles during their life cycles, such as in fruiting body formation, pigment formation and pathogenicity. Coprophilous fungi, which grow on the dung of herbivores, appear to encode an unexpectedly high number of enzymes capable of at least partly degrading lignin. This study compared the MCO-coding capacity of the coprophilous filamentous ascomycetes Podospora anserina and Sordaria macrospora with closely related non-coprophilous members of the order Sordariales. An increase of MCO genes in coprophilic members of the Sordariales most probably occurred by gene duplication and horizontal gene transfer events.

Divergent and convergent modes of interaction between wheat and Puccinia graminis f. sp. tritici isolates revealed by the comparative gene co-expression network and genome analyses

USDA-ARS?s Scientific Manuscript database

Two opposing evolutionary constraints exert pressure on pathogens: one to diversify virulence factors in order to evade host defenses, and the other to retain virulence factors critical for maintaining a compatible interaction. To better understand how the diversified arsenals of fungal genes promot...
GenomicusPlants: a web resource to study genome evolution in flowering plants.

PubMed

Louis, Alexandra; Murat, Florent; Salse, Jérôme; Crollius, Hugues Roest

2015-01-01

Comparative genomics combined with phylogenetic reconstructions are powerful approaches to study the evolution of genes and genomes. However, the current rapid expansion of the volume of genomic information makes it increasingly difficult to interrogate, integrate and synthesize comparative genome data while taking into account the maximum breadth of information available. GenomicusPlants (http://www.genomicus.biologie.ens.fr/genomicus-plants) is an extension of the Genomicus webserver that addresses this issue by allowing users to explore flowering plant genomes in an intuitive way, across the broadest evolutionary scales. Extant genomes of 26 flowering plants can be analyzed, as well as 23 ancestral reconstructed genomes. Ancestral gene order provides a long-term chronological view of gene order evolution, greatly facilitating comparative genomics and evolutionary studies. Four main interfaces ('views') are available where: (i) PhyloView combines phylogenetic trees with comparisons of genomic loci across any number of genomes; (ii) AlignView projects loci of interest against all other genomes to visualize its topological conservation; (iii) MatrixView compares two genomes in a classical dotplot representation; and (iv) Karyoview visualizes chromosome karyotypes 'painted' with colours of another genome of interest. All four views are interconnected and benefit from many customizable features. © The Author 2014. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.
Evolution of gastropod mitochondrial genome arrangements

PubMed Central

2008-01-01

Background Gastropod mitochondrial genomes exhibit an unusually great variety of gene orders compared to other metazoan mitochondrial genome such as e.g those of vertebrates. Hence, gastropod mitochondrial genomes constitute a good model system to study patterns, rates, and mechanisms of mitochondrial genome rearrangement. However, this kind of evolutionary comparative analysis requires a robust phylogenetic framework of the group under study, which has been elusive so far for gastropods in spite of the efforts carried out during the last two decades. Here, we report the complete nucleotide sequence of five mitochondrial genomes of gastropods (Pyramidella dolabrata, Ascobulla fragilis, Siphonaria pectinata, Onchidella celtica, and Myosotella myosotis), and we analyze them together with another ten complete mitochondrial genomes of gastropods currently available in molecular databases in order to reconstruct the phylogenetic relationships among the main lineages of gastropods. Results Comparative analyses with other mollusk mitochondrial genomes allowed us to describe molecular features and general trends in the evolution of mitochondrial genome organization in gastropods. Phylogenetic reconstruction with commonly used methods of phylogenetic inference (ME, MP, ML, BI) arrived at a single topology, which was used to reconstruct the evolution of mitochondrial gene rearrangements in the group. Conclusion Four main lineages were identified within gastropods: Caenogastropoda, Vetigastropoda, Patellogastropoda, and Heterobranchia. Caenogastropoda and Vetigastropoda are sister taxa, as well as, Patellogastropoda and Heterobranchia. This result rejects the validity of the derived clade Apogastropoda (Caenogastropoda + Heterobranchia). The position of Patellogastropoda remains unclear likely due to long-branch attraction biases. Within Heterobranchia, the most heterogeneous group of gastropods, neither Euthyneura (because of the inclusion of P. dolabrata) nor Pulmonata (polyphyletic) nor Opisthobranchia (because of the inclusion S. pectinata) were recovered as monophyletic groups. The gene order of the Vetigastropoda might represent the ancestral mitochondrial gene order for Gastropoda and we propose that at least three major rearrangements have taken place in the evolution of gastropods: one in the ancestor of Caenogastropoda, another in the ancestor of Patellogastropoda, and one more in the ancestor of Heterobranchia. PMID:18302768
Methods of Combinatorial Optimization to Reveal Factors Affecting Gene Length

PubMed Central

Bolshoy, Alexander; Tatarinova, Tatiana

2012-01-01

In this paper we present a novel method for genome ranking according to gene lengths. The main outcomes described in this paper are the following: the formulation of the genome ranking problem, presentation of relevant approaches to solve it, and the demonstration of preliminary results from prokaryotic genomes ordering. Using a subset of prokaryotic genomes, we attempted to uncover factors affecting gene length. We have demonstrated that hyperthermophilic species have shorter genes as compared with mesophilic organisms, which probably means that environmental factors affect gene length. Moreover, these preliminary results show that environmental factors group together in ranking evolutionary distant species. PMID:23300345
Inferring gene regression networks with model trees

PubMed Central

2010-01-01

Background Novel strategies are required in order to handle the huge amount of data produced by microarray technologies. To infer gene regulatory networks, the first step is to find direct regulatory relationships between genes building the so-called gene co-expression networks. They are typically generated using correlation statistics as pairwise similarity measures. Correlation-based methods are very useful in order to determine whether two genes have a strong global similarity but do not detect local similarities. Results We propose model trees as a method to identify gene interaction networks. While correlation-based methods analyze each pair of genes, in our approach we generate a single regression tree for each gene from the remaining genes. Finally, a graph from all the relationships among output and input genes is built taking into account whether the pair of genes is statistically significant. For this reason we apply a statistical procedure to control the false discovery rate. The performance of our approach, named REGNET, is experimentally tested on two well-known data sets: Saccharomyces Cerevisiae and E.coli data set. First, the biological coherence of the results are tested. Second the E.coli transcriptional network (in the Regulon database) is used as control to compare the results to that of a correlation-based method. This experiment shows that REGNET performs more accurately at detecting true gene associations than the Pearson and Spearman zeroth and first-order correlation-based methods. Conclusions REGNET generates gene association networks from gene expression data, and differs from correlation-based methods in that the relationship between one gene and others is calculated simultaneously. Model trees are very useful techniques to estimate the numerical values for the target genes by linear regression functions. They are very often more precise than linear regression models because they can add just different linear regressions to separate areas of the search space favoring to infer localized similarities over a more global similarity. Furthermore, experimental results show the good performance of REGNET. PMID:20950452
The transcriptomic fingerprint of glucoamylase over-expression in Aspergillus niger

PubMed Central

2012-01-01

Background Filamentous fungi such as Aspergillus niger are well known for their exceptionally high capacity for secretion of proteins, organic acids, and secondary metabolites and they are therefore used in biotechnology as versatile microbial production platforms. However, system-wide insights into their metabolic and secretory capacities are sparse and rational strain improvement approaches are therefore limited. In order to gain a genome-wide view on the transcriptional regulation of the protein secretory pathway of A. niger, we investigated the transcriptome of A. niger when it was forced to overexpression the glaA gene (encoding glucoamylase, GlaA) and secrete GlaA to high level. Results An A. niger wild-type strain and a GlaA over-expressing strain, containing multiple copies of the glaA gene, were cultivated under maltose-limited chemostat conditions (specific growth rate 0.1 h-1). Elevated glaA mRNA and extracellular GlaA levels in the over-expressing strain were accompanied by elevated transcript levels from 772 genes and lowered transcript levels from 815 genes when compared to the wild-type strain. Using GO term enrichment analysis, four higher-order categories were identified in the up-regulated gene set: i) endoplasmic reticulum (ER) membrane translocation, ii) protein glycosylation, iii) vesicle transport, and iv) ion homeostasis. Among these, about 130 genes had predicted functions for the passage of proteins through the ER and those genes included target genes of the HacA transcription factor that mediates the unfolded protein response (UPR), e.g. bipA, clxA, prpA, tigA and pdiA. In order to identify those genes that are important for high-level secretion of proteins by A. niger, we compared the transcriptome of the GlaA overexpression strain of A. niger with six other relevant transcriptomes of A. niger. Overall, 40 genes were found to have either elevated (from 36 genes) or lowered (from 4 genes) transcript levels under all conditions that were examined, thus defining the core set of genes important for ensuring high protein traffic through the secretory pathway. Conclusion We have defined the A. niger genes that respond to elevated secretion of GlaA and, furthermore, we have defined a core set of genes that appear to be involved more generally in the intensified traffic of proteins through the secretory pathway of A. niger. The consistent up-regulation of a gene encoding the acetyl-coenzyme A transporter suggests a possible role for transient acetylation to ensure correct folding of secreted proteins. PMID:23237452
Complete mitochondrial genomes of three crickets (Orthoptera: Gryllidae) and comparative analyses within Ensifera mitogenomes.

PubMed

Yang, Jing; Ren, Qianli; Huang, Yuan

2016-03-17

The complete mitochondrial genomes (mitogenomes) of Velarifictorus hemelytrus, Loxoblemmus equestris and Teleogryllus emma are 16123 bp, 16314 bp and 15697 bp, in size, respectively. All three mitogenomes possess the same gene order of the inversion of the gene cluster trnE-trnS^(AGN)-trnN compared with the ancestral gene order of Orthoptera. The atypical initiation codon for the cox1 gene in three crickets is TTA. Pronounced A skew and T skew have been found in Grylloidea comparing with Gryllotalpoidea and Tettigonioidea. The T-stretch in the minority strand is interrupted by C to form (T)_n(C)₂(T)_n sequences in five species of Gryllinae (V. hemelytrus, L. equestris, T. emma, T. oceanicus, T. commodus). This T-stretch variant with its neighbouring A-stretch variant (A-stretch is interrupted by G), which were discovered in the A+T-rich regions of all taxa from infraorder Gryllidea, could form a conserved stem-loop structure (including 15 ~ 17 base pairs). This potential stem-loop structure is a favorable candidate that may participate in the replication origin of the minority strand of Gryllidea mitogenome. Phylogenetic analysis indicated that within the Gryllinae, genus Teleogryllus and Velarifictorus are closely related, sister to the genus Loxoblemmus. The relationships among the five superfamilies of Ensifera presented here were ((Grylloidea, Gryllotalpoidea) (Tettigonioidea, (Hagloidea, Rhaphidophoroidea))).
Dynamic Evolution of the Chloroplast Genome in the Green Algal Classes Pedinophyceae and Trebouxiophyceae.

PubMed

Turmel, Monique; Otis, Christian; Lemieux, Claude

2015-07-01

Previous studies of trebouxiophycean chloroplast genomes revealed little information regarding the evolutionary dynamics of this genome because taxon sampling was too sparse and the relationships between the sampled taxa were unknown. We recently sequenced the chloroplast genomes of 27 trebouxiophycean and 2 pedinophycean green algae to resolve the relationships among the main lineages recognized for the Trebouxiophyceae. These taxa and the previously sampled members of the Pedinophyceae and Trebouxiophyceae are included in the comparative chloroplast genome analysis we report here. The 38 genomes examined display considerable variability at all levels, except gene content. Our results highlight the high propensity of the rDNA-containing large inverted repeat (IR) to vary in size, gene content and gene order as well as the repeated losses it experienced during trebouxiophycean evolution. Of the seven predicted IR losses, one event demarcates a superclade of 11 taxa representing 5 late-diverging lineages. IR expansions/contractions account not only for changes in gene content in this region but also for changes in gene order and gene duplications. Inversions also led to gene rearrangements within the IR, including the reversal or disruption of the rDNA operon in some lineages. Most of the 20 IR-less genomes are more rearranged compared with their IR-containing homologs and tend to show an accelerated rate of sequence evolution. In the IR-less superclade, several ancestral operons were disrupted, a few genes were fragmented, and a subgroup of taxa features a G+C-biased nucleotide composition. Our analyses also unveiled putative cases of gene acquisitions through horizontal transfer. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Large Diversity of Nonstandard Genes and Dynamic Evolution of Chloroplast Genomes in Siphonous Green Algae (Bryopsidales, Chlorophyta)

PubMed Central

Leliaert, Frederik; Marcelino, Vanessa R

2018-01-01

Abstract Chloroplast genomes have undergone tremendous alterations through the evolutionary history of the green algae (Chloroplastida). This study focuses on the evolution of chloroplast genomes in the siphonous green algae (order Bryopsidales). We present five new chloroplast genomes, which along with existing sequences, yield a data set representing all but one families of the order. Using comparative phylogenetic methods, we investigated the evolutionary dynamics of genomic features in the order. Our results show extensive variation in chloroplast genome architecture and intron content. Variation in genome size is accounted for by the amount of intergenic space and freestanding open reading frames that do not show significant homology to standard plastid genes. We show the diversity of these nonstandard genes based on their conserved protein domains, which are often associated with mobile functions (reverse transcriptase/intron maturase, integrases, phage- or plasmid-DNA primases, transposases, integrases, ligases). Investigation of the introns showed proliferation of group II introns in the early evolution of the order and their subsequent loss in the core Halimedineae, possibly through RT-mediated intron loss. PMID:29635329
Finding approximate gene clusters with Gecko 3.

PubMed

Winter, Sascha; Jahn, Katharina; Wehner, Stefanie; Kuchenbecker, Leon; Marz, Manja; Stoye, Jens; Böcker, Sebastian

2016-11-16

Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Finding genes discriminating smokers from non-smokers by applying a growing self-organizing clustering method to large airway epithelium cell microarray data.

PubMed

Shahdoust, Maryam; Hajizadeh, Ebrahim; Mozdarani, Hossein; Chehrei, Ali

2013-01-01

Cigarette smoking is the major risk factor for development of lung cancer. Identification of effects of tobacco on airway gene expression may provide insight into the causes. This research aimed to compare gene expression of large airway epithelium cells in normal smokers (n=13) and non-smokers (n=9) in order to find genes which discriminate the two groups and assess cigarette smoking effects on large airway epithelium cells. Genes discriminating smokers from non-smokers were identified by applying a neural network clustering method, growing self-organizing maps (GSOM), to microarray data according to class discrimination scores. An index was computed based on differentiation between each mean of gene expression in the two groups. This clustering approach provided the possibility of comparing thousands of genes simultaneously. The applied approach compared the mean of 7,129 genes in smokers and non-smokers simultaneously and classified the genes of large airway epithelium cells which had differently expressed in smokers comparing with non-smokers. Seven genes were identified which had the highest different expression in smokers compared with the non-smokers group: NQO1, H19, ALDH3A1, AKR1C1, ABHD2, GPX2 and ADH7. Most (NQO1, ALDH3A1, AKR1C1, H19 and GPX2) are known to be clinically notable in lung cancer studies. Furthermore, statistical discriminate analysis showed that these genes could classify samples in smokers and non-smokers correctly with 100% accuracy. With the performed GSOM map, other nodes with high average discriminate scores included genes with alterations strongly related to the lung cancer such as AKR1C3, CYP1B1, UCHL1 and AKR1B10. This clustering by comparing expression of thousands of genes at the same time revealed alteration in normal smokers. Most of the identified genes were strongly relevant to lung cancer in the existing literature. The genes may be utilized to identify smokers with increased risk for lung cancer. A large sample study is now recommended to determine relations between the genes ABHD2 and ADH7 and smoking.
Comparative study on gene set and pathway topology-based enrichment methods.

PubMed

Bayerlová, Michaela; Jung, Klaus; Kramer, Frank; Klemm, Florian; Bleckmann, Annalen; Beißbarth, Tim

2015-10-22

Enrichment analysis is a popular approach to identify pathways or sets of genes which are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list disregarding any knowledge of gene or protein interactions. In contrast, the new group of so called pathway topology-based methods integrates the topological structure of a pathway into the analysis. We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods. In the benchmark data analysis both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods, however their sensitivity was lower. We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways, however, they were not conclusively better in the other scenarios. This suggests that simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. Both types of methods for enrichment analysis require further improvements in order to deal with the problem of pathway overlaps.
A large-scale benchmark of gene prioritization methods.

PubMed

Guala, Dimitri; Sonnhammer, Erik L L

2017-04-21

In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods.
Generation, annotation and analysis of ESTs from Trichoderma harzianum CECT 2413

PubMed Central

Vizcaíno, Juan Antonio; González, Francisco Javier; Suárez, M Belén; Redondo, José; Heinrich, Julian; Delgado-Jarana, Jesús; Hermosa, Rosa; Gutiérrez, Santiago; Monte, Enrique; Llobell, Antonio; Rey, Manuel

2006-01-01

Background The filamentous fungus Trichoderma harzianum is used as biological control agent of several plant-pathogenic fungi. In order to study the genome of this fungus, a functional genomics project called "TrichoEST" was developed to give insights into genes involved in biological control activities using an approach based on the generation of expressed sequence tags (ESTs). Results Eight different cDNA libraries from T. harzianum strain CECT 2413 were constructed. Different growth conditions involving mainly different nutrient conditions and/or stresses were used. We here present the analysis of the 8,710 ESTs generated. A total of 3,478 unique sequences were identified of which 81.4% had sequence similarity with GenBank entries, using the BLASTX algorithm. Using the Gene Ontology hierarchy, we performed the annotation of 51.1% of the unique sequences and compared its distribution among the gene libraries. Additionally, the InterProScan algorithm was used in order to further characterize the sequences. The identification of the putatively secreted proteins was also carried out. Later, based on the EST abundance, we examined the highly expressed genes and a hydrophobin was identified as the gene expressed at the highest level. We compared our collection of ESTs with the previous collections obtained from Trichoderma species and we also compared our sequence set with different complete eukaryotic genomes from several animals, plants and fungi. Accordingly, the presence of similar sequences in different kingdoms was also studied. Conclusion This EST collection and its annotation provide a significant resource for basic and applied research on T. harzianum, a fungus with a high biotechnological interest. PMID:16872539
The first complete chloroplast genome sequence of a lycophyte,Huperzia lucidula (Lycopodiaceae)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wolf, Paul G.; Karol, Kenneth G.; Mandoli, Dina F.

2005-02-01

We used a unique combination of techniques to sequence the first complete chloroplast genome of a lycophyte, Huperzia lucidula. This plant belongs to a significant clade hypothesized to represent the sister group to all other vascular plants. We used fluorescence-activated cell sorting (FACS) to isolate the organelles, rolling circle amplification (RCA) to amplify the genome, and shotgun sequencing to 8x depth coverage to obtain the complete chloroplast genome sequence. The genome is 154,373bp, containing inverted repeats of 15,314 bp each, a large single-copy region of 104,088 bp, and a small single-copy region of 19,671 bp. Gene order is more similarmore » to those of mosses, liverworts, and hornworts than to gene order for other vascular plants. For example, the Huperziachloroplast genome possesses the bryophyte gene order for a previously characterized 30 kb inversion, thus supporting the hypothesis that lycophytes are sister to all other extant vascular plants. The lycophytechloroplast genome data also enable a better reconstruction of the basaltracheophyte genome, which is useful for inferring relationships among bryophyte lineages. Several unique characters are observed in Huperzia, such as movement of the gene ndhF from the small single copy region into the inverted repeat. We present several analyses of evolutionary relationships among land plants by using nucleotide data, amino acid sequences, and by comparing gene arrangements from chloroplast genomes. The results, while still tentative pending the large number of chloroplast genomes from other key lineages that are soon to be sequenced, are intriguing in themselves, and contribute to a growing comparative database of genomic and morphological data across the green plants.« less
Phylogenomic Analysis and Dynamic Evolution of Chloroplast Genomes in Salicaceae

PubMed Central

Huang, Yuan; Wang, Jun; Yang, Yongping; Fan, Chuanzhu; Chen, Jiahui

2017-01-01

Chloroplast genomes of plants are highly conserved in both gene order and gene content. Analysis of the whole chloroplast genome is known to provide much more informative DNA sites and thus generates high resolution for plant phylogenies. Here, we report the complete chloroplast genomes of three Salix species in family Salicaceae. Phylogeny of Salicaceae inferred from complete chloroplast genomes is generally consistent with previous studies but resolved with higher statistical support. Incongruences of phylogeny, however, are observed in genus Populus, which most likely results from homoplasy. By comparing three Salix chloroplast genomes with the published chloroplast genomes of other Salicaceae species, we demonstrate that the synteny and length of chloroplast genomes in Salicaceae are highly conserved but experienced dynamic evolution among species. We identify seven positively selected chloroplast genes in Salicaceae, which might be related to the adaptive evolution of Salicaceae species. Comparative chloroplast genome analysis within the family also indicates that some chloroplast genes are lost or became pseudogenes, infer that the chloroplast genes horizontally transferred to the nucleus genome. Based on the complete nucleus genome sequences from two Salicaceae species, we remarkably identify that the entire chloroplast genome is indeed transferred and integrated to the nucleus genome in the individual of the reference genome of P. trichocarpa at least once. This observation, along with presence of the large nuclear plastid DNA (NUPTs) and NUPTs-containing multiple chloroplast genes in their original order in the chloroplast genome, favors the DNA-mediated hypothesis of organelle to nucleus DNA transfer. Overall, the phylogenomic analysis using chloroplast complete genomes clearly elucidates the phylogeny of Salicaceae. The identification of positively selected chloroplast genes and dynamic chloroplast-to-nucleus gene transfers in Salicaceae provide resources to better understand the successful adaptation of Salicaceae species. PMID:28676809
Ex-situ conservaton of Holstein-Friesian cattle comparing the Dutch, French and USA germplasm collections

USDA-ARS?s Scientific Manuscript database

Holstein-Friesian (HF) gene bank collections were established in France, the Netherlands and USA in order to conserve as much genetic diversity as possible for this breed. Genetic variability of HF collections within and between countries was assessed and compared with active male HF populations in ...
Ex situ conservation of Holstein-Friesian cattle: Comparing the Dutch, French and USA germplasm collections

USDA-ARS?s Scientific Manuscript database

Holstein-Friesian (HF) gene bank collections were established in France, the Netherlands and USA in order to conserve genetic diversity for this breed. Genetic diversity of HF collections within and between countries was assessed and compared with active HF bulls in each country by using pedigree da...
Inheritance of gene density–related higher order chromatin arrangements in normal and tumor cell nuclei

PubMed Central

Cremer, Marion; Küpper, Katrin; Wagler, Babett; Wizelman, Leah; Hase, Johann v.; Weiland, Yanina; Kreja, Ludwika; Diebold, Joachim; Speicher, Michael R.; Cremer, Thomas

2003-01-01

A gene density–related difference in the radial arrangement of chromosome territories (CTs) was previously described for human lymphocyte nuclei with gene-poor CT #18 located toward the nuclear periphery and gene-dense CT #19 in the nuclear interior (Croft, J.A., J.M. Bridger, S. Boyle, P. Perry, P. Teague, and W.A. Bickmore. 1999. J. Cell Biol. 145:1119–1131). Here, we analyzed the radial distribution of chromosome 18 and 19 chromatin in six normal cell types and in eight tumor cell lines, some of them with imbalances and rearrangements of the two chromosomes. Our findings demonstrate that a significant difference in the radial distribution of #18 and #19 chromatin is a common feature of higher order chromatin architecture in both normal and malignant cell types. However, in seven of eight tumor cell lines, the difference was less pronounced compared with normal cell nuclei due to a higher fraction of nuclei showing an inverted CT position, i.e., a CT #18 located more internally than a CT #19. This observation emphasizes a partial loss of radial chromatin order in tumor cell nuclei. PMID:12952935
Partial mitochondrial gene arrangements support a close relationship between Tardigrada and Arthropoda.

PubMed

Ryu, Shi Hyun; Lee, Ji Min; Jang, Kuem-Hee; Choi, Eun Hwa; Park, Shin Ju; Chang, Cheon Young; Kim, Won; Hwang, Ui Wook

2007-12-31

Regions (about 3.7-3.8 kb) of the mitochondrial genomes (rrnL-cox1) of two tardigrades, a heterotardigrade, Batillipes pennaki, and a eutardigrade, Pseudobiotus spinifer, were sequenced and characterized. The gene order in Batillipes was rrnL-V-rrnS-Q-I-M-nad2-W-C-Y-cox1, and in Pseudobiotus it was rrnL-V-rrnS-Q-M-nad2-W-C-Y-cox1. With the exception of the trnI gene, the two tardigrade regions have the same gene content and order. Their gene orders are strikingly similar to that of the chelicerate Limulus polyphemus (rrnL-V-rrnS-CR-I-Q-M-nad2-W-C-Y-cox1), which is considered to be ancestral for arthropods. Although the tardigrades do not have a distinct control region (CR) within this segment, the trnI gene in Pseudobiotus is located between rrnL-trnL1 and trnL2-nad1, and the trnI gene in Batillipes is located between trnQ and trnM. In addition, the 106-bp region between trnQ and trnM in Batillipes not only contains two plausible trnI genes with opposite orientations, but also exhibits some CR-like characteristics. The mitochondrial gene arrangements of 183 other protostomes were compared. 60 (52.2%) of the 115 arthropods examined have the M-nad2-W-C-Y-cox1 arrangement, and 88 (76.5%) the M-nad2-W arrangement, as found in the tardigrades. In contrast, no such arrangement was seen in the 70 non-arthropod protostomes studied. These are the first non-sequence molecular data that support the close relationship of tardigrades and arthropods.

Global Landscape of a Co-Expressed Gene Network in Barley and its Application to Gene Discovery in Triticeae Crops

PubMed Central

Mochida, Keiichi; Uehara-Yamaguchi, Yukiko; Yoshida, Takuhiro; Sakurai, Tetsuya; Shinozaki, Kazuo

2011-01-01

Accumulated transcriptome data can be used to investigate regulatory networks of genes involved in various biological systems. Co-expression analysis data sets generated from comprehensively collected transcriptome data sets now represent efficient resources that are capable of facilitating the discovery of genes with closely correlated expression patterns. In order to construct a co-expression network for barley, we analyzed 45 publicly available experimental series, which are composed of 1,347 sets of GeneChip data for barley. On the basis of a gene-to-gene weighted correlation coefficient, we constructed a global barley co-expression network and classified it into clusters of subnetwork modules. The resulting clusters are candidates for functional regulatory modules in the barley transcriptome. To annotate each of the modules, we performed comparative annotation using genes in Arabidopsis and Brachypodium distachyon. On the basis of a comparative analysis between barley and two model species, we investigated functional properties from the representative distributions of the gene ontology (GO) terms. Modules putatively involved in drought stress response and cellulose biogenesis have been identified. These modules are discussed to demonstrate the effectiveness of the co-expression analysis. Furthermore, we applied the data set of co-expressed genes coupled with comparative analysis in attempts to discover potentially Triticeae-specific network modules. These results demonstrate that analysis of the co-expression network of the barley transcriptome together with comparative analysis should promote the process of gene discovery in barley. Furthermore, the insights obtained should be transferable to investigations of Triticeae plants. The associated data set generated in this analysis is publicly accessible at http://coexpression.psc.riken.jp/barley/. PMID:21441235
Multiconstrained gene clustering based on generalized projections

PubMed Central

2010-01-01

Background Gene clustering for annotating gene functions is one of the fundamental issues in bioinformatics. The best clustering solution is often regularized by multiple constraints such as gene expressions, Gene Ontology (GO) annotations and gene network structures. How to integrate multiple pieces of constraints for an optimal clustering solution still remains an unsolved problem. Results We propose a novel multiconstrained gene clustering (MGC) method within the generalized projection onto convex sets (POCS) framework used widely in image reconstruction. Each constraint is formulated as a corresponding set. The generalized projector iteratively projects the clustering solution onto these sets in order to find a consistent solution included in the intersection set that satisfies all constraints. Compared with previous MGC methods, POCS can integrate multiple constraints from different nature without distorting the original constraints. To evaluate the clustering solution, we also propose a new performance measure referred to as Gene Log Likelihood (GLL) that considers genes having more than one function and hence in more than one cluster. Comparative experimental results show that our POCS-based gene clustering method outperforms current state-of-the-art MGC methods. Conclusions The POCS-based MGC method can successfully combine multiple constraints from different nature for gene clustering. Also, the proposed GLL is an effective performance measure for the soft clustering solutions. PMID:20356386
Unusual Gene Order and Organization of the Sea Urchin Hox Cluster

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cameron, R A; Rowen, L; Nesbitt, R

2005-10-11

The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3 gene is Hox5. (The gene order is :more » 5-Hox1, 2, 3, 11/13c, 11/13b, 11/13a, 9/10, 8, 7, 6, 5 - 3). The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.« less
Unusual Gene Order and Organization of the Sea Urchin HoxCluster

DOE Office of Scientific and Technical Information (OSTI.GOV)

Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew

2005-05-10

The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is :more » 5'-Hox1,2, 3, 11/13c, 11/13b, '11/13a, 9/10, 8, 7, 6, 5 - 3)'. The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.« less
Ensembl comparative genomics resources.

PubMed

Herrero, Javier; Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J; Searle, Stephen M J; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

2016-01-01

Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. © The Author(s) 2016. Published by Oxford University Press.
Ensembl comparative genomics resources

PubMed Central

Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J.; Searle, Stephen M. J.; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

2016-01-01

Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. PMID:26896847
Motif-independent prediction of a secondary metabolism gene cluster using comparative genomics: application to sequenced genomes of Aspergillus and ten other filamentous fungal species.

PubMed

Takeda, Itaru; Umemura, Myco; Koike, Hideaki; Asai, Kiyoshi; Machida, Masayuki

2014-08-01

Despite their biological importance, a significant number of genes for secondary metabolite biosynthesis (SMB) remain undetected due largely to the fact that they are highly diverse and are not expressed under a variety of cultivation conditions. Several software tools including SMURF and antiSMASH have been developed to predict fungal SMB gene clusters by finding core genes encoding polyketide synthase, nonribosomal peptide synthetase and dimethylallyltryptophan synthase as well as several others typically present in the cluster. In this work, we have devised a novel comparative genomics method to identify SMB gene clusters that is independent of motif information of the known SMB genes. The method detects SMB gene clusters by searching for a similar order of genes and their presence in nonsyntenic blocks. With this method, we were able to identify many known SMB gene clusters with the core genes in the genomic sequences of 10 filamentous fungi. Furthermore, we have also detected SMB gene clusters without core genes, including the kojic acid biosynthesis gene cluster of Aspergillus oryzae. By varying the detection parameters of the method, a significant difference in the sequence characteristics was detected between the genes residing inside the clusters and those outside the clusters. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Comparative mitogenomic analyses of three North American stygobiont amphipods of the genus Stygobromus (Crustacea: Amphipoda)

USGS Publications Warehouse

Aunins, Aaron W.; Nelms, David L.; Hobson, Christopher S.; King, Timothy L.

2016-01-01

The mitochondrial genomes of three North American stygobiont amphipods Stygobromus tenuis potomacus, S. foliatus and S. indentatus collected from Caroline County, VA, were sequenced using a shotgun sequencing approach on an Illumina NextSeq500 (Illumina Inc., San Diego, CA). All three mitogenomes displayed 13 protein-coding genes, 22 tRNAs and two rRNAs typical of metazoans. While S. tenuis and S. indentatusdisplayed identical gene orders similar to the pancrustacean ground pattern, S. foliatus displayed a transposition of the trnL2-cox2 genes to after atp8-atp6. In addition, a short atp8 gene, longer rrnL gene and large inverted repeat within the Control Region distinguished S. foliatus from S. tenuis potomacus and S. indentatus. Overall, it appears that gene order varies considerably among amphipods, and the addition of these Stygobromus mitogenomes to the existing sequenced amphipod mitogenomes will prove useful for characterizing evolutionary relationships among various amphipod taxa, as well as investigations of the evolutionary dynamics of the mitogenome in general.
Outcrossed sex allows a selfish gene to invade yeast populations.

PubMed

Goddard, M R; Greig, D; Burt, A

2001-12-22

Homing endonuclease genes (HEGs) in eukaryotes are optional genes that have no obvious effect on host phenotype except for causing chromosomes not containing a copy of the gene to be cut, thus causing them to be inherited at a greater than Mendelian rate via gene conversion. These genes are therefore expected to increase in frequency in outcrossed populations, but not in obligately selfed populations. In order to test this idea, we compared the dynamics of the VDE HEG in six replicate outcrossed and inbred populations of yeast (Saccharomyces cerevisiae). VDE increased in frequency from 0.21 to 0.55 in four outcrossed generations, but showed no change in frequency in the inbred populations. The absence of change in the inbred populations indicates that any effect of VDE on mitotic replication rates is less than 1%. The data from the outcrossed populations best fit a model in which 82% of individuals are derived from outcrossing and VDE is inherited by 74% of the meiotic products from heterozygotes (as compared with 50% for Mendelian genes). These results empirically demonstrate how a host mating system plays a key role in determining the population dynamics of a selfish gene.
Outcrossed sex allows a selfish gene to invade yeast populations.

PubMed Central

Goddard, M. R.; Greig, D.; Burt, A.

2001-01-01

Homing endonuclease genes (HEGs) in eukaryotes are optional genes that have no obvious effect on host phenotype except for causing chromosomes not containing a copy of the gene to be cut, thus causing them to be inherited at a greater than Mendelian rate via gene conversion. These genes are therefore expected to increase in frequency in outcrossed populations, but not in obligately selfed populations. In order to test this idea, we compared the dynamics of the VDE HEG in six replicate outcrossed and inbred populations of yeast (Saccharomyces cerevisiae). VDE increased in frequency from 0.21 to 0.55 in four outcrossed generations, but showed no change in frequency in the inbred populations. The absence of change in the inbred populations indicates that any effect of VDE on mitotic replication rates is less than 1%. The data from the outcrossed populations best fit a model in which 82% of individuals are derived from outcrossing and VDE is inherited by 74% of the meiotic products from heterozygotes (as compared with 50% for Mendelian genes). These results empirically demonstrate how a host mating system plays a key role in determining the population dynamics of a selfish gene. PMID:11749707
Validation of reference genes for quantifying changes in gene expression in virus-infected tobacco.

PubMed

Baek, Eseul; Yoon, Ju-Yeon; Palukaitis, Peter

2017-10-01

To facilitate quantification of gene expression changes in virus-infected tobacco plants, eight housekeeping genes were evaluated for their stability of expression during infection by one of three systemically-infecting viruses (cucumber mosaic virus, potato virus X, potato virus Y) or a hypersensitive-response-inducing virus (tobacco mosaic virus; TMV) limited to the inoculated leaf. Five reference-gene validation programs were used to establish the order of the most stable genes for the systemically-infecting viruses as ribosomal protein L25 > β-Tubulin > Actin, and the least stable genes Ubiquitin-conjugating enzyme (UCE) < PP2A < GAPDH. For local infection by TMV, the most stable genes were EF1α > Cysteine protease > Actin, and the least stable genes were GAPDH < PP2A < UCE. Using two of the most stable and the two least stable validated reference genes, three defense responsive genes were examined to compare their relative changes in gene expression caused by each virus. Copyright © 2017 Elsevier Inc. All rights reserved.
Whole genome analyses of marine fish pathogenic isolate, Mycobacterium sp. 012931.

PubMed

Kurokawa, Satoru; Kabayama, Jun; Hwang, Seong Don; Nho, Seong Won; Hikima, Jun-ichi; Jung, Tae Sung; Kondo, Hidehiro; Hirono, Ikuo; Takeyama, Haruko; Mori, Tetsushi; Aoki, Takashi

2014-10-01

Mycobacterium is a genus within the order Actinomycetales that comprises of a large number of well-characterized species, several of which includes pathogens known to cause serious disease in human and animal. Here, we report the whole genome sequence of Mycobacterium sp. strain 012931 isolated from the marine fish, yellowtail (Seriola quinqueradiata). Mycobacterium sp. 012931 is a fish pathogen causing serious damage to aquaculture farms in Japan. DNA dot plot analysis showed that Mycobacterium sp. 012931 was more closely related to Mycobacterium marinum when compared across several Mycobacterium species. However, little conservation of the gene order was observed between Mycobacterium sp. 012931 and M. marinum genome. The annotated 5,464 genes of Mycobacterium sp. 012931 was classified into 26 subsystems. The insertion/deletion gene analysis shows Mycobacterium sp. 012931 had 643 unique genes that were not found in the M. marinum strains. In the virulence, disease, and defense subsystem, both insertion and deletion genes of Mycobacterium sp. 012931 were associated with the PPE gene cluster of Mycobacteria. Of seven plcB genes in Mycobacterium sp. 012931, plcB_2 and plcB_3 showed low identities with those of M. marinum strains. Therefore, Mycobacterium sp. 012931 has differences on genetic and virulence from M. marinum and may induce different interaction mechanisms between host and pathogen.
Comparative architecture of silks, fibrous proteins and their encoding genes in insects and spiders.

PubMed

Craig, Catherine L; Riekel, Christian

2002-12-01

The known silk fibroins and fibrous glues are thought to be encoded by members of the same gene family. All silk fibroins sequenced to date contain regions of long-range order (crystalline regions) and/or short-range order (non-crystalline regions). All of the sequenced fibroin silks (Flag or silk from flagelliform gland in spiders; Fhc or heavy chain fibroin silks produced by Lepidoptera larvae) are made up of hierarchically organized, repetitive arrays of amino acids. Fhc fibroin genes are characterized by a similar molecular genetic architecture of two exons and one intron, but the organization and size of these units differs. The Flag, Ser (sericin gene) and BR (Balbiani ring genes; both fibrous proteins) genes are made up of multiple exons and introns. Sequences coding for crystalline and non-crystalline protein domains are integrated in the repetitive regions of Fhc and MA exons, but not in the protein glues Ser1 and BR-1. Genetic 'hot-spots' promote recombination errors in Fhc, MA, and Flag. Codon bias, structural constraint, point mutations, and shortened coding arrays may be alternative means of stabilizing precursor mRNA transcripts. Differential regulation of gene expression and selective splicing of the mRNA transcript may allow rapid adaptation of silk functional properties to different physical environments.
Conservation of synteny between the genome of the pufferfish (Fugu rubripes) and the region on human chromosome 14 (14q24.3) associated with familial Alzheimer disease (AD3 locus)

PubMed

Trower, M K; Orton, S M; Purvis, I J; Sanseau, P; Riley, J; Christodoulou, C; Burt, D; See, C G; Elgar, G; Sherrington, R; Rogaev, E I; St George-Hyslop, P; Brenner, S; Dykes, C W

1996-02-20

The genome of the pufferfish (Fugu rubripes) (400 Mb) is approximately 7.5 times smaller than the human genome, but it has a similar gene repertoire to that of man. If regions of the two genomes exhibited conservation of gene order (i.e., were syntenic), it should be possible to reduce dramatically the effort required for identification of candidate genes in human disease loci by sequencing syntenic regions of the compact Fugu genome. We have demonstrated that three genes (dihydrolipoamide succinyltransferase, S31iii125, and S20i15), which are linked to FOS in the familial Alzheimer disease focus (AD3) on human chromosome 14, have homologues in the Fugu genome adjacent to Fugu cFOS. The relative gene order of cFOS, S31iii125, and S20i15 was the same in both genomes, but in Fugu these three genes lay within a 12.4-kb region, compared to >600 kb in the human AD3 locus. These results demonstrate the conservation of synteny between the genomes of Fugu and man and highlight the utility of this approach for sequence-based identification of genes in human disease loci.
A highly polymorphic dinucleotide repeat on the proximal short arm of the human X chromosome: linkage mapping of the synapsin I/A-raf-1 genes.

PubMed Central

Kirchgessner, C U; Trofatter, J A; Mahtani, M M; Willard, H F; DeGennaro, L J

1991-01-01

A compound (AC)n repeat located 1,000 bp downstream from the human synapsin I gene and within the last intron of the A-raf-1 gene has been identified. DNA data-base comparisons of the sequences surrounding the repeat indicate that the synapsin I gene and the A-raf-1 gene lie immediately adjacent to each other, in opposite orientation. PCR amplification of this synapsin I/A-raf-1 associated repeat by using total genomic DNA from members of the 40 reference pedigree families of the Centre d'Etude du Polymorphisme Humaine showed it to be highly polymorphic, with a PIC value of .84 and a minimum of eight alleles. Because the synapsin I gene has been mapped previously to the short arm of the human X chromosome at Xp11.2, linkage analysis was performed with markers on the proximal short arm of the X chromosome. The most likely gene order is DXS7SYN/ARAF1TIMPDXS255DXS146, with a relative probability of 5 x 10(8) as compared with the next most likely order. This highly informative repeat should serve as a valuable marker for disease loci mapped to the Xp11 region. Images Figure 2 PMID:1905878
The complete mitochondrial genome of the invasive Africanized Honey Bee, Apis mellifera scutellata (Insecta: Hymenoptera: Apidae).

PubMed

Gibson, Joshua D; Hunt, Greg J

2016-01-01

The complete mitochondrial genome from an Africanized honey bee population (AHB, derived from Apis mellifera scutellata) was assembled and analyzed. The mitogenome is 16,411 bp long and contains the same gene repertoire and gene order as the European honey bee (13 protein coding genes, 22 tRNA genes and 2 rRNA genes). ND4 appears to use an alternate start codon and the long rRNA gene is 48 bp shorter in AHB due to a deletion in a terminal AT dinucleotide repeat. The dihydrouracil arm is missing from tRNA-Ser (AGN) and tRNA-Glu is missing the TV loop. The A + T content is comparable to the European honey bee (84.7%), which increases to 95% for the 3rd position in the protein coding genes.
Complete mitochondrial DNA sequence of oyster Crassostrea hongkongensis-a case of "Tandem duplication-random loss" for genome rearrangement in Crassostrea?

PubMed Central

Yu, Ziniu; Wei, Zhengpeng; Kong, Xiaoyu; Shi, Wei

2008-01-01

Background Mitochondrial DNA sequences are extensively used as genetic markers not only for studies of population or ecological genetics, but also for phylogenetic and evolutionary analyses. Complete mt-sequences can reveal information about gene order and its variation, as well as gene and genome evolution when sequences from multiple phyla are compared. Mitochondrial gene order is highly variable among mollusks, with bivalves exhibiting the most variability. Of the 41 complete mt genomes sequenced so far, 12 are from bivalves. We determined, in the current study, the complete mitochondrial DNA sequence of Crassostrea hongkongensis. We present here an analysis of features of its gene content and genome organization in comparison with two other Crassostrea species to assess the variation within bivalves and among main groups of mollusks. Results The complete mitochondrial genome of C. hongkongensis was determined using long PCR and a primer walking sequencing strategy with genus-specific primers. The genome is 16,475 bp in length and contains 12 protein-coding genes (the atp8 gene is missing, as in most bivalves), 22 transfer tRNA genes (including a suppressor tRNA gene), and 2 ribosomal RNA genes, all of which appear to be transcribed from the same strand. A striking finding of this study is that a DNA segment containing four tRNA genes (trnk1, trnC, trnQ1 and trnN) and two duplicated or split rRNA gene (rrnL5' and rrnS) are absent from the genome, when compared with that of two other extant Crassostrea species, which is very likely a consequence of loss of a single genomic region present in ancestor of C. hongkongensis. It indicates this region seem to be a "hot spot" of genomic rearrangements over the Crassostrea mt-genomes. The arrangement of protein-coding genes in C. hongkongensis is identical to that of Crassostrea gigas and Crassostrea virginica, but higher amino acid sequence identities are shared between C. hongkongensis and C. gigas than between other pairs. There exists significant codon bias, favoring codons ending in A or T and against those ending with C. Pair analysis of genome rearrangements showed that the rearrangement distance is great between C. gigas-C. hongkongensis and C. virginica, indicating a high degree of rearrangements within Crassostrea. The determination of complete mt-genome of C. hongkongensis has yielded useful insight into features of gene order, variation, and evolution of Crassostrea and bivalve mt-genomes. Conclusion The mt-genome of C. hongkongensis shares some similarity with, and interesting differences to, other Crassostrea species and bivalves. The absence of trnC and trnN genes and duplicated or split rRNA genes from the C. hongkongensis genome is a completely novel feature not previously reported in Crassostrea species. The phenomenon is likely due to the loss of a segment that is present in other Crassostrea species and was present in ancestor of C. hongkongensis, thus a case of "tandem duplication-random loss (TDRL)". The mt-genome and new feature presented here reveal and underline the high level variation of gene order and gene content in Crassostrea and bivalves, inspiring more research to gain understanding to mechanisms underlying gene and genome evolution in bivalves and mollusks. PMID:18847502
Translational Advances of Hydrofection by Hydrodynamic Injection

PubMed Central

Herrero, María José; Aliño, Salvador F.

2018-01-01

Hydrodynamic gene delivery has proven to be a safe and efficient procedure for gene transfer, able to mediate, in murine model, therapeutic levels of proteins encoded by the transfected gene. In different disease models and targeting distinct organs, it has been demonstrated to revert the pathologic symptoms and signs. The therapeutic potential of hydrofection led different groups to work on the clinical translation of the procedure. In order to prevent the hemodynamic side effects derived from the rapid injection of a large volume, the conditions had to be moderated to make them compatible with its use in mid-size animal models such as rat, hamster and rabbit and large animals as dog, pig and primates. Despite the different approaches performed to adapt the conditions of gene delivery, the results obtained in any of these mid-size and large animals have been poorer than those obtained in murine model. Among these different strategies to reduce the volume employed, the most effective one has been to exclude the vasculature of the target organ and inject the solution directly. This procedure has permitted, by catheterization and surgical procedures in large animals, achieving protein expression levels in tissue close to those achieved in gold standard models. These promising results and the possibility of employing these strategies to transfer gene constructs able to edit genes, such as CRISPR, have renewed the clinical interest of this procedure of gene transfer. In order to translate the hydrodynamic gene delivery to human use, it is demanding the standardization of the procedure conditions and the molecular parameters of evaluation in order to be able to compare the results and establish a homogeneous manner of expressing the data obtained, as ‘classic’ drugs. PMID:29494564
Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm.

PubMed

Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder; Murphy, Denis J

2018-01-01

Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops.
Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm

PubMed Central

Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E.; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder

2018-01-01

Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops. PMID:29672525

Mammalian Comparative Genomics Reveals Genetic and Epigenetic Features Associated with Genome Reshuffling in Rodentia

PubMed Central

Capilla, Laia; Sánchez-Guillén, Rosa Ana; Farré, Marta; Paytuví-Gallart, Andreu; Malinverni, Roberto; Ventura, Jacint; Larkin, Denis M.

2016-01-01

Abstract Understanding how mammalian genomes have been reshuffled through structural changes is fundamental to the dynamics of its composition, evolutionary relationships between species and, in the long run, speciation. In this work, we reveal the evolutionary genomic landscape in Rodentia, the most diverse and speciose mammalian order, by whole-genome comparisons of six rodent species and six representative outgroup mammalian species. The reconstruction of the evolutionary breakpoint regions across rodent phylogeny shows an increased rate of genome reshuffling that is approximately two orders of magnitude greater than in other mammalian species here considered. We identified novel lineage and clade-specific breakpoint regions within Rodentia and analyzed their gene content, recombination rates and their relationship with constitutive lamina genomic associated domains, DNase I hypersensitivity sites and chromatin modifications. We detected an accumulation of protein-coding genes in evolutionary breakpoint regions, especially genes implicated in reproduction and pheromone detection and mating. Moreover, we found an association of the evolutionary breakpoint regions with active chromatin state landscapes, most probably related to gene enrichment. Our results have two important implications for understanding the mechanisms that govern and constrain mammalian genome evolution. The first is that the presence of genes related to species-specific phenotypes in evolutionary breakpoint regions reinforces the adaptive value of genome reshuffling. Second, that chromatin conformation, an aspect that has been often overlooked in comparative genomic studies, might play a role in modeling the genomic distribution of evolutionary breakpoints. PMID:28175287
Mammalian Comparative Genomics Reveals Genetic and Epigenetic Features Associated with Genome Reshuffling in Rodentia.

PubMed

Capilla, Laia; Sánchez-Guillén, Rosa Ana; Farré, Marta; Paytuví-Gallart, Andreu; Malinverni, Roberto; Ventura, Jacint; Larkin, Denis M; Ruiz-Herrera, Aurora

2016-12-01

Understanding how mammalian genomes have been reshuffled through structural changes is fundamental to the dynamics of its composition, evolutionary relationships between species and, in the long run, speciation. In this work, we reveal the evolutionary genomic landscape in Rodentia, the most diverse and speciose mammalian order, by whole-genome comparisons of six rodent species and six representative outgroup mammalian species. The reconstruction of the evolutionary breakpoint regions across rodent phylogeny shows an increased rate of genome reshuffling that is approximately two orders of magnitude greater than in other mammalian species here considered. We identified novel lineage and clade-specific breakpoint regions within Rodentia and analyzed their gene content, recombination rates and their relationship with constitutive lamina genomic associated domains, DNase I hypersensitivity sites and chromatin modifications. We detected an accumulation of protein-coding genes in evolutionary breakpoint regions, especially genes implicated in reproduction and pheromone detection and mating. Moreover, we found an association of the evolutionary breakpoint regions with active chromatin state landscapes, most probably related to gene enrichment. Our results have two important implications for understanding the mechanisms that govern and constrain mammalian genome evolution. The first is that the presence of genes related to species-specific phenotypes in evolutionary breakpoint regions reinforces the adaptive value of genome reshuffling. Second, that chromatin conformation, an aspect that has been often overlooked in comparative genomic studies, might play a role in modeling the genomic distribution of evolutionary breakpoints.
Fast gene ontology based clustering for microarray experiments.

PubMed

Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

2008-11-21

Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.
Leveraging Comparative Genomics to Identify and Functionally Characterize Genes Associated with Sperm Phenotypes in Python bivittatus (Burmese Python)

PubMed Central

Rutllant, Josep

2016-01-01

Comparative genomics approaches provide a means of leveraging functional genomics information from a highly annotated model organism's genome (such as the mouse genome) in order to make physiological inferences about the role of genes and proteins in a less characterized organism's genome (such as the Burmese python). We employed a comparative genomics approach to produce the functional annotation of Python bivittatus genes encoding proteins associated with sperm phenotypes. We identify 129 gene-phenotype relationships in the python which are implicated in 10 specific sperm phenotypes. Results obtained through our systematic analysis identified subsets of python genes exhibiting associations with gene ontology annotation terms. Functional annotation data was represented in a semantic scatter plot. Together, these newly annotated Python bivittatus genome resources provide a high resolution framework from which the biology relating to reptile spermatogenesis, fertility, and reproduction can be further investigated. Applications of our research include (1) production of genetic diagnostics for assessing fertility in domestic and wild reptiles; (2) enhanced assisted reproduction technology for endangered and captive reptiles; and (3) novel molecular targets for biotechnology-based approaches aimed at reducing fertility and reproduction of invasive reptiles. Additional enhancements to reptile genomic resources will further enhance their value. PMID:27200191
Leveraging Comparative Genomics to Identify and Functionally Characterize Genes Associated with Sperm Phenotypes in Python bivittatus (Burmese Python).

PubMed

Irizarry, Kristopher J L; Rutllant, Josep

2016-01-01

Comparative genomics approaches provide a means of leveraging functional genomics information from a highly annotated model organism's genome (such as the mouse genome) in order to make physiological inferences about the role of genes and proteins in a less characterized organism's genome (such as the Burmese python). We employed a comparative genomics approach to produce the functional annotation of Python bivittatus genes encoding proteins associated with sperm phenotypes. We identify 129 gene-phenotype relationships in the python which are implicated in 10 specific sperm phenotypes. Results obtained through our systematic analysis identified subsets of python genes exhibiting associations with gene ontology annotation terms. Functional annotation data was represented in a semantic scatter plot. Together, these newly annotated Python bivittatus genome resources provide a high resolution framework from which the biology relating to reptile spermatogenesis, fertility, and reproduction can be further investigated. Applications of our research include (1) production of genetic diagnostics for assessing fertility in domestic and wild reptiles; (2) enhanced assisted reproduction technology for endangered and captive reptiles; and (3) novel molecular targets for biotechnology-based approaches aimed at reducing fertility and reproduction of invasive reptiles. Additional enhancements to reptile genomic resources will further enhance their value.
Bioluminescent symbionts of the Caribbean flashlight fish (Kryptophanaron alfredi) have a single rRNA operon.

PubMed

Wolfe, C J; Haygood, M G

1993-08-01

Ribosomal RNA (rRNA) operon copy number and gene order were determined for the luminous bacterial symbiont of Kryptophanaron alfredi, an anomalopid (flashlight) fish, and estimated for the luminous symbionts of 3 other fish families and of 3 luminous seawater isolates. Compared with the seawater isolates and other fish symbionts, the copy number of rRNA genes in the K. alfredi symbiont was radically reduced, although gene order appeared conserved among all the strains. The K. alfredi symbiont possesses only a single rRNA operon, whereas the other strains examined have minimum copy numbers ranging from 8 to 11. No difference in copy number was observed between light organ and seawater isolates of the same species, or between isolates of the same species from the light organs of 2 different host families. Thus, the anomalopid symbiosis appears unique among characterized light organ symbioses.
The complete mitochondrial genome of Setaria digitata (Nematoda: Filarioidea): Mitochondrial gene content, arrangement and composition compared with other nematodes.

PubMed

Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi

2010-09-01

In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839bp) of parasitic nematode Setaria digitata and its structure and organization compared with Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome contains a high A+T (75.1%) content and low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi but it is distinctly different from other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to S. digitata mt genome and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematic of parasitic nematodes of socio-economic importance. 2010 Elsevier B.V. All rights reserved.
Comparative Sequence and X-Inactivation Analyses of a Domain of Escape in Human Xp11.2 and the Conserved Segment in Mouse

PubMed Central

Tsuchiya, Karen D.; Greally, John M.; Yi, Yajun; Noel, Kevin P.; Truong, Jean-Pierre; Disteche, Christine M.

2004-01-01

We have performed X-inactivation and sequence analyses on 350 kb of sequence from human Xp11.2, a region shown previously to contain a cluster of genes that escape X inactivation, and we compared this region with the region of conserved synteny in mouse. We identified several new transcripts from this region in human and in mouse, which defined the full extent of the domain escaping X inactivation in both species. In human, escape from X inactivation involves an uninterrupted 235-kb domain of multiple genes. Despite highly conserved gene content and order between the two species, Smcx is the only mouse gene from the conserved segment that escapes inactivation. As repetitive sequences are believed to facilitate spreading of X inactivation along the chromosome, we compared the repetitive sequence composition of this region between the two species. We found that long terminal repeats (LTRs) were decreased in the human domain of escape, but not in the majority of the conserved mouse region adjacent to Smcx in which genes were subject to X inactivation, suggesting that these repeats might be excluded from escape domains to prevent spreading of silencing. Our findings indicate that genomic context, as well as gene-specific regulatory elements, interact to determine expression of a gene from the inactive X-chromosome. PMID:15197169
Diversity and community composition of methanogenic archaea in the rumen of Scottish upland sheep assessed by different methods.

PubMed

Snelling, Timothy J; Genç, Buğra; McKain, Nest; Watson, Mick; Waters, Sinéad M; Creevey, Christopher J; Wallace, R John

2014-01-01

Ruminal archaeomes of two mature sheep grazing in the Scottish uplands were analysed by different sequencing and analysis methods in order to compare the apparent archaeal communities. All methods revealed that the majority of methanogens belonged to the Methanobacteriales order containing the Methanobrevibacter, Methanosphaera and Methanobacteria genera. Sanger sequenced 1.3 kb 16S rRNA gene amplicons identified the main species of Methanobrevibacter present to be a SGMT Clade member Mbb. millerae (≥ 91% of OTUs); Methanosphaera comprised the remainder of the OTUs. The primers did not amplify ruminal Thermoplasmatales-related 16S rRNA genes. Illumina sequenced V6-V8 16S rRNA gene amplicons identified similar Methanobrevibacter spp. and Methanosphaera clades and also identified the Thermoplasmatales-related order as 13% of total archaea. Unusually, both methods concluded that Mbb. ruminantium and relatives from the same clade (RO) were almost absent. Sequences mapping to rumen 16S rRNA and mcrA gene references were extracted from Illumina metagenome data. Mapping of the metagenome data to 16S rRNA gene references produced taxonomic identification to Order level including 2-3% Thermoplasmatales, but was unable to discriminate to species level. Mapping of the metagenome data to mcrA gene references resolved 69% to unclassified Methanobacteriales. Only 30% of sequences were assigned to species level clades: of the sequences assigned to Methanobrevibacter, most mapped to SGMT (16%) and RO (10%) clades. The Sanger 16S amplicon and Illumina metagenome mcrA analyses showed similar species richness (Chao1 Index 19-35), while Illumina metagenome and amplicon 16S rRNA analysis gave lower richness estimates (10-18). The values of the Shannon Index were low in all methods, indicating low richness and uneven species distribution. Thus, although much information may be extracted from the other methods, Illumina amplicon sequencing of the V6-V8 16S rRNA gene would be the method of choice for studying rumen archaeal communities.
Insights into origin and evolution of α-proteobacterial gene transfer agents

PubMed Central

Shakya, Migun; Soucy, Shannon M

2017-01-01

Abstract Several bacterial and archaeal lineages produce nanostructures that morphologically resemble small tailed viruses, but, unlike most viruses, contain apparently random pieces of the host genome. Since these elements can deliver the packaged DNA to other cells, they were dubbed gene transfer agents (GTAs). Because many genes involved in GTA production have viral homologs, it has been hypothesized that the GTA ancestor was a virus. Whether GTAs represent an atypical virus, a defective virus, or a virus co-opted by the prokaryotes for some function, remains to be elucidated. To evaluate these possibilities, we examined the distribution and evolutionary histories of genes that encode a GTA in the α-proteobacterium Rhodobacter capsulatus (RcGTA). We report that although homologs of many individual RcGTA genes are abundant across bacteria and their viruses, RcGTA-like genomes are mainly found in one subclade of α-proteobacteria. When compared with the viral homologs, genes of the RcGTA-like genomes evolve significantly slower, and do not have higher %A+T nucleotides than their host chromosomes. Moreover, they appear to reside in stable regions of the bacterial chromosomes that are generally conserved across taxonomic orders. These findings argue against RcGTA being an atypical or a defective virus. Our phylogenetic analyses suggest that RcGTA ancestor likely originated in the lineage that gave rise to contemporary α-proteobacterial orders Rhizobiales, Rhodobacterales, Caulobacterales, Parvularculales, and Sphingomonadales, and since that time the RcGTA-like element has co-evolved with its host chromosomes. Such evolutionary history is compatible with maintenance of these elements by bacteria due to some selective advantage. As for many other prokaryotic traits, horizontal gene transfer played a substantial role in the evolution of RcGTA-like elements, not only in shaping its genome components within the orders, but also in occasional dissemination of RcGTA-like regions across the orders and even to different bacterial phyla. PMID:29250433
Insights into origin and evolution of α-proteobacterial gene transfer agents.

PubMed

Shakya, Migun; Soucy, Shannon M; Zhaxybayeva, Olga

2017-07-01

Several bacterial and archaeal lineages produce nanostructures that morphologically resemble small tailed viruses, but, unlike most viruses, contain apparently random pieces of the host genome. Since these elements can deliver the packaged DNA to other cells, they were dubbed gene transfer agents (GTAs). Because many genes involved in GTA production have viral homologs, it has been hypothesized that the GTA ancestor was a virus. Whether GTAs represent an atypical virus, a defective virus, or a virus co-opted by the prokaryotes for some function, remains to be elucidated. To evaluate these possibilities, we examined the distribution and evolutionary histories of genes that encode a GTA in the α-proteobacterium Rhodobacter capsulatus (RcGTA). We report that although homologs of many individual RcGTA genes are abundant across bacteria and their viruses, RcGTA-like genomes are mainly found in one subclade of α-proteobacteria. When compared with the viral homologs, genes of the RcGTA-like genomes evolve significantly slower, and do not have higher %A+T nucleotides than their host chromosomes. Moreover, they appear to reside in stable regions of the bacterial chromosomes that are generally conserved across taxonomic orders. These findings argue against RcGTA being an atypical or a defective virus. Our phylogenetic analyses suggest that RcGTA ancestor likely originated in the lineage that gave rise to contemporary α-proteobacterial orders Rhizobiales , Rhodobacterales , Caulobacterales , Parvularculales , and Sphingomonadales , and since that time the RcGTA-like element has co-evolved with its host chromosomes. Such evolutionary history is compatible with maintenance of these elements by bacteria due to some selective advantage. As for many other prokaryotic traits, horizontal gene transfer played a substantial role in the evolution of RcGTA-like elements, not only in shaping its genome components within the orders, but also in occasional dissemination of RcGTA-like regions across the orders and even to different bacterial phyla.
The complete mitogenome of the Australian tadpole shrimp Triops australiensis (Spencer & Hall, 1895) (Crustacea: Branchiopoda: Notostraca).

PubMed

Gan, Han Ming; Tan, Mun Hua; Lee, Yin Peng; Austin, Christopher M

2016-05-01

The mitochondrial genome sequence of the Australian tadpole shrimp, Triops australiensis is presented (GenBank Accession Number: NC_024439) and compared with other Triops species. Triops australiensis has a mitochondrial genome of 15,125 base pairs consisting of 13 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs, and a non-coding AT-rich region. The T. australiensis mitogenome is composed of 36.4% A, 16.1% C, 12.3% G and 35.1% T. The mitogenome gene order conforms to the primitive arrangement for Branchiopod crustaceans, which is also conserved within the Pancrustacean.
A double-strand break can trigger immunoglobulin gene conversion

PubMed Central

Bastianello, Giulia; Arakawa, Hiroshi

2017-01-01

All three B cell-specific activities of the immunoglobulin (Ig) gene re-modeling system—gene conversion, somatic hypermutation and class switch recombination—require activation-induced deaminase (AID). AID-induced DNA lesions must be further processed and dissected into different DNA recombination pathways. In order to characterize potential intermediates for Ig gene conversion, we inserted an I-SceI recognition site into the complementarity determining region 1 (CDR1) of the Ig light chain locus of the AID knockout DT40 cell line, and conditionally expressed I-SceI endonuclease. Here, we show that a double-strand break (DSB) in CDR1 is sufficient to trigger Ig gene conversion in the absence of AID. The pattern and pseudogene usage of DSB-induced gene conversion were comparable to those of AID-induced gene conversion; surprisingly, sometimes a single DSB induced multiple gene conversion events. These constitute direct evidence that a DSB in the V region can be an intermediate for gene conversion. The fate of the DNA lesion downstream of a DSB had more flexibility than that of AID, suggesting two alternative models: (i) DSBs during the physiological gene conversion are in the minority compared to single-strand breaks (SSBs), which are frequently generated following DNA deamination, or (ii) the physiological gene conversion is mediated by a tightly regulated DSB that is locally protected from non-homologous end joining (NHEJ) or other non-homologous DNA recombination machineries. PMID:27701075
TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis

PubMed Central

Ji, Zhicheng; Ji, Hongkai

2016-01-01

When analyzing single-cell RNA-seq data, constructing a pseudo-temporal path to order cells based on the gradual transition of their transcriptomes is a useful way to study gene expression dynamics in a heterogeneous cell population. Currently, a limited number of computational tools are available for this task, and quantitative methods for comparing different tools are lacking. Tools for Single Cell Analysis (TSCAN) is a software tool developed to better support in silico pseudo-Time reconstruction in Single-Cell RNA-seq ANalysis. TSCAN uses a cluster-based minimum spanning tree (MST) approach to order cells. Cells are first grouped into clusters and an MST is then constructed to connect cluster centers. Pseudo-time is obtained by projecting each cell onto the tree, and the ordered sequence of cells can be used to study dynamic changes of gene expression along the pseudo-time. Clustering cells before MST construction reduces the complexity of the tree space. This often leads to improved cell ordering. It also allows users to conveniently adjust the ordering based on prior knowledge. TSCAN has a graphical user interface (GUI) to support data visualization and user interaction. Furthermore, quantitative measures are developed to objectively evaluate and compare different pseudo-time reconstruction methods. TSCAN is available at https://github.com/zji90/TSCAN and as a Bioconductor package. PMID:27179027
TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis.

PubMed

Ji, Zhicheng; Ji, Hongkai

2016-07-27

When analyzing single-cell RNA-seq data, constructing a pseudo-temporal path to order cells based on the gradual transition of their transcriptomes is a useful way to study gene expression dynamics in a heterogeneous cell population. Currently, a limited number of computational tools are available for this task, and quantitative methods for comparing different tools are lacking. Tools for Single Cell Analysis (TSCAN) is a software tool developed to better support in silico pseudo-Time reconstruction in Single-Cell RNA-seq ANalysis. TSCAN uses a cluster-based minimum spanning tree (MST) approach to order cells. Cells are first grouped into clusters and an MST is then constructed to connect cluster centers. Pseudo-time is obtained by projecting each cell onto the tree, and the ordered sequence of cells can be used to study dynamic changes of gene expression along the pseudo-time. Clustering cells before MST construction reduces the complexity of the tree space. This often leads to improved cell ordering. It also allows users to conveniently adjust the ordering based on prior knowledge. TSCAN has a graphical user interface (GUI) to support data visualization and user interaction. Furthermore, quantitative measures are developed to objectively evaluate and compare different pseudo-time reconstruction methods. TSCAN is available at https://github.com/zji90/TSCAN and as a Bioconductor package. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Comparative Genome Analyses Reveal Distinct Structure in the Saltwater Crocodile MHC

PubMed Central

Jaratlerdsiri, Weerachai; Deakin, Janine; Godinez, Ricardo M.; Shan, Xueyan; Peterson, Daniel G.; Marthey, Sylvain; Lyons, Eric; McCarthy, Fiona M.; Isberg, Sally R.; Higgins, Damien P.; Chong, Amanda Y.; John, John St; Glenn, Travis C.; Ray, David A.; Gongora, Jaime

2014-01-01

The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionary distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus) and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2–6 times longer) than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity) with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled MHC in eutherians compared, but absent in avian MHC, suggesting that the saltwater crocodile MHC appears to have gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs. PMID:25503521
Genome-wide comparative analysis of codon usage bias and codon context patterns among cyanobacterial genomes.

PubMed

Prabha, Ratna; Singh, Dhananjaya P; Sinha, Swati; Ahmad, Khurshid; Rai, Anil

2017-04-01

With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis reflects that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that majority of genes are associated with low codon bias. Codon selection pattern in cyanobacterial genomes reflected compositional constraints as major influencing factor. It is also identified that although, mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition coupled with environmental factors affected codon selection pattern in cyanobacterial genomes. Copyright © 2016 Elsevier B.V. All rights reserved.
Genomic composition and dynamics among Methanomicrobiales predict adaptation to contrasting environments.

PubMed

Browne, Patrick; Tamaki, Hideyuki; Kyrpides, Nikos; Woyke, Tanja; Goodwin, Lynne; Imachi, Hiroyuki; Bräuer, Suzanna; Yavitt, Joseph B; Liu, Wen-Tso; Zinder, Stephen; Cadillo-Quiroz, Hinsby

2017-01-01

Members of the order Methanomicrobiales are abundant, and sometimes dominant, hydrogenotrophic (H 2 -CO 2 utilizing) methanoarchaea in a broad range of anoxic habitats. Despite their key roles in greenhouse gas emissions and waste conversion to methane, little is known about the physiological and genomic bases for their widespread distribution and abundance. In this study, we compared the genomes of nine diverse Methanomicrobiales strains, examined their pangenomes, reconstructed gene flow and identified genes putatively mediating their success across different habitats. Most strains slowly increased gene content whereas one, Methanocorpusculum labreanum, evidenced genome downsizing. Peat-dwelling Methanomicrobiales showed adaptations centered on improved transport of scarce inorganic nutrients and likely use H + rather than Na + transmembrane chemiosmotic gradients during energy conservation. In contrast, other Methanomicrobiales show the potential to concurrently use Na + and H + chemiosmotic gradients. Analyses also revealed that the Methanomicrobiales lack a canonical electron bifurcation system (MvhABGD) known to produce low potential electrons in other orders of hydrogenotrophic methanogens. Additional putative differences in anabolic metabolism suggest that the dynamics of interspecies electron transfer from Methanomicrobiales syntrophic partners can also differ considerably. Altogether, these findings suggest profound differences in electron trafficking in the Methanomicrobiales compared with other hydrogenotrophs, and warrant further functional evaluations.
The complete mitochondrial genome of the citrus red mite Panonychus citri (Acari: Tetranychidae): high genome rearrangement and extremely truncated tRNAs

PubMed Central

2010-01-01

Background The family Tetranychidae (Chelicerata: Acari) includes ~1200 species, many of which are of agronomic importance. To date, mitochondrial genomes of only two Tetranychidae species have been sequenced, and it has been found that these two mitochondrial genomes are characterized by many unusual features in genome organization and structure such as gene order and nucleotide frequency. The scarcity of available sequence data has greatly impeded evolutionary studies in Acari (mites and ticks). Information on Tetranychidae mitochondrial genomes is quite important for phylogenetic evaluation and population genetics, as well as the molecular evolution of functional genes such as acaricide-resistance genes. In this study, we sequenced the complete mitochondrial genome of Panonychus citri (Family Tetranychidae), a worldwide citrus pest, and provide a comparison to other Acari. Results The mitochondrial genome of P. citri is a typical circular molecule of 13,077 bp, and contains the complete set of 37 genes that are usually found in metazoans. This is the smallest mitochondrial genome within all sequenced Acari and other Chelicerata, primarily due to the significant size reduction of protein coding genes (PCGs), a large rRNA gene, and the A + T-rich region. The mitochondrial gene order for P. citri is the same as those for P. ulmi and Tetranychus urticae, but distinctly different from other Acari by a series of gene translocations and/or inversions. The majority of the P. citri mitochondrial genome has a high A + T content (85.28%), which is also reflected by AT-rich codons being used more frequently, but exhibits a positive GC-skew (0.03). The Acari mitochondrial nad1 exhibits a faster amino acid substitution rate than other genes, and the variation of nucleotide substitution patterns of PCGs is significantly correlated with the G + C content. Most tRNA genes of P. citri are extremely truncated and atypical (44-65, 54.1 ± 4.1 bp), lacking either the T- or D-arm, as found in P. ulmi, T. urticae, and other Acariform mites. Conclusions The P. citri mitochondrial gene order is markedly different from those of other chelicerates, but is conserved within the family Tetranychidae indicating that high rearrangements have occurred after Tetranychidae diverged from other Acari. Comparative analyses suggest that the genome size, gene order, gene content, codon usage, and base composition are strongly variable among Acari mitochondrial genomes. While extremely small and unusual tRNA genes seem to be common for Acariform mites, further experimental evidence is needed. PMID:20969792
Second generation DNA sequencing of the mitogenome of the Chinstrap penguin and comparative genomics of Antarctic penguins.

PubMed

Subramanian, Sankar; Lingala, Syamala Gowri; Swaminathan, Siva; Huynen, Leon; Lambert, David

2014-08-01

The complete mitochondrial genome of the Chinstrap penguin (Pygoscelis antarcticus) was sequenced and compared with other penguin mitogenomes. The genome is 15,972 bp in length with the number and order of protein coding genes and RNAs being very similar to that of other known penguin mitogenomes. Comparative nucleotide analysis showed the Chinstrap mitogenome shares 94% homology with the mitogenome of its sister species, Pygoscelis adelie (Adélie penguin). Divergence at nonsynonymous nucleotide positions was found to be up to 23 times less than that observed in synonymous positions of protein coding genes, suggesting high selection constraints. The complete mitogenome data will be useful for genetic and evolutionary studies of penguins.

Modeling of Phenoxy Acid Herbicide Mineralization and Growth of Microbial Degraders in 15 Soils Monitored by Quantitative Real-Time PCR of the Functional tfdA Gene

PubMed Central

Bælum, Jacob; Prestat, Emmanuel; David, Maude M.; Strobel, Bjarne W.

2012-01-01

Mineralization potentials, rates, and kinetics of the three phenoxy acid (PA) herbicides, 2,4-dichlorophenoxyacetic acid (2,4-D), 4-chloro-2-methylphenoxyacetic acid (MCPA), and 2-(4-chloro-2-methylphenoxy)propanoic acid (MCPP), were investigated and compared in 15 soils collected from five continents. The mineralization patterns were fitted by zero/linear or exponential growth forms of the three-half-order models and by logarithmic (log), first-order, or zero-order kinetic models. Prior and subsequent to the mineralization event, tfdA genes were quantified using real-time PCR to estimate the genetic potential for degrading PA in the soils. In 25 of the 45 mineralization scenarios, ∼60% mineralization was observed within 118 days. Elevated concentrations of tfdA in the range 1 × 105 to 5 × 107 gene copies g−1 of soil were observed in soils where mineralization could be described by using growth-linked kinetic models. A clear trend was observed that the mineralization rates of the three PAs occurred in the order 2,4-D > MCPA > MCPP, and a correlation was observed between rapid mineralization and soils exposed to PA previously. Finally, for 2,4-D mineralization, all seven mineralization patterns which were best fitted by the exponential model yielded a higher tfdA gene potential after mineralization had occurred than the three mineralization patterns best fitted by the Lin model. PMID:22635998
Comparative Study of Regulatory Circuits in Two Sea Urchin Species Reveals Tight Control of Timing and High Conservation of Expression Dynamics

PubMed Central

Gildor, Tsvia; Ben-Tabou de-Leon, Smadar

2015-01-01

Accurate temporal control of gene expression is essential for normal development and must be robust to natural genetic and environmental variation. Studying gene expression variation within and between related species can delineate the level of expression variability that development can tolerate. Here we exploit the comprehensive model of sea urchin gene regulatory networks and generate high-density expression profiles of key regulatory genes of the Mediterranean sea urchin, Paracentrotus lividus (Pl). The high resolution of our studies reveals highly reproducible gene initiation times that have lower variation than those of maximal mRNA levels between different individuals of the same species. This observation supports a threshold behavior of gene activation that is less sensitive to input concentrations. We then compare Mediterranean sea urchin gene expression profiles to those of its Pacific Ocean relative, Strongylocentrotus purpuratus (Sp). These species shared a common ancestor about 40 million years ago and show highly similar embryonic morphologies. Our comparative analyses of five regulatory circuits operating in different embryonic territories reveal a high conservation of the temporal order of gene activation but also some cases of divergence. A linear ratio of 1.3-fold between gene initiation times in Pl and Sp is partially explained by scaling of the developmental rates with temperature. Scaling the developmental rates according to the estimated Sp-Pl ratio and normalizing the expression levels reveals a striking conservation of relative dynamics of gene expression between the species. Overall, our findings demonstrate the ability of biological developmental systems to tightly control the timing of gene activation and relative dynamics and overcome expression noise induced by genetic variation and growth conditions. PMID:26230518
The coffee genome provides insight into the convergent evolution of caffeine biosynthesis.

PubMed

Denoeud, France; Carretero-Paulet, Lorenzo; Dereeper, Alexis; Droc, Gaëtan; Guyot, Romain; Pietrella, Marco; Zheng, Chunfang; Alberti, Adriana; Anthony, François; Aprea, Giuseppe; Aury, Jean-Marc; Bento, Pascal; Bernard, Maria; Bocs, Stéphanie; Campa, Claudine; Cenci, Alberto; Combes, Marie-Christine; Crouzillat, Dominique; Da Silva, Corinne; Daddiego, Loretta; De Bellis, Fabien; Dussert, Stéphane; Garsmeur, Olivier; Gayraud, Thomas; Guignon, Valentin; Jahn, Katharina; Jamilloux, Véronique; Joët, Thierry; Labadie, Karine; Lan, Tianying; Leclercq, Julie; Lepelley, Maud; Leroy, Thierry; Li, Lei-Ting; Librado, Pablo; Lopez, Loredana; Muñoz, Adriana; Noel, Benjamin; Pallavicini, Alberto; Perrotta, Gaetano; Poncet, Valérie; Pot, David; Priyono; Rigoreau, Michel; Rouard, Mathieu; Rozas, Julio; Tranchant-Dubreuil, Christine; VanBuren, Robert; Zhang, Qiong; Andrade, Alan C; Argout, Xavier; Bertrand, Benoît; de Kochko, Alexandre; Graziosi, Giorgio; Henry, Robert J; Jayarama; Ming, Ray; Nagai, Chifumi; Rounsley, Steve; Sankoff, David; Giuliano, Giovanni; Albert, Victor A; Wincker, Patrick; Lashermes, Philippe

2014-09-05

Coffee is a valuable beverage crop due to its characteristic flavor, aroma, and the stimulating effects of caffeine. We generated a high-quality draft genome of the species Coffea canephora, which displays a conserved chromosomal gene order among asterid angiosperms. Although it shows no sign of the whole-genome triplication identified in Solanaceae species such as tomato, the genome includes several species-specific gene family expansions, among them N-methyltransferases (NMTs) involved in caffeine production, defense-related genes, and alkaloid and flavonoid enzymes involved in secondary compound synthesis. Comparative analyses of caffeine NMTs demonstrate that these genes expanded through sequential tandem duplications independently of genes from cacao and tea, suggesting that caffeine in eudicots is of polyphyletic origin. Copyright © 2014, American Association for the Advancement of Science.
Comparative transcriptomics among floral organs of the basal eudicot Eschscholzia californica as reference for floral evolutionary developmental studies

PubMed Central

2010-01-01

Background Molecular genetic studies of floral development have concentrated on several core eudicots and grasses (monocots), which have canalized floral forms. Basal eudicots possess a wider range of floral morphologies than the core eudicots and grasses and can serve as an evolutionary link between core eudicots and monocots, and provide a reference for studies of other basal angiosperms. Recent advances in genomics have enabled researchers to profile gene activities during floral development, primarily in the eudicot Arabidopsis thaliana and the monocots rice and maize. However, our understanding of floral developmental processes among the basal eudicots remains limited. Results Using a recently generated expressed sequence tag (EST) set, we have designed an oligonucleotide microarray for the basal eudicot Eschscholzia californica (California poppy). We performed microarray experiments with an interwoven-loop design in order to characterize the E. californica floral transcriptome and to identify differentially expressed genes in flower buds with pre-meiotic and meiotic cells, four floral organs at pre-anthesis stages (sepals, petals, stamens and carpels), developing fruits, and leaves. Conclusions Our results provide a foundation for comparative gene expression studies between eudicots and basal angiosperms. We identified whorl-specific gene expression patterns in E. californica and examined the floral expression of several gene families. Interestingly, most E. californica homologs of Arabidopsis genes important for flower development, except for genes encoding MADS-box transcription factors, show different expression patterns between the two species. Our comparative transcriptomics study highlights the unique evolutionary position of E. californica compared with basal angiosperms and core eudicots. PMID:20950453
Gene order in rosid phylogeny, inferred from pairwise syntenies among extant genomes

PubMed Central

2012-01-01

Background Ancestral gene order reconstruction for flowering plants has lagged behind developments in yeasts, insects and higher animals, because of the recency of widespread plant genome sequencing, sequencers' embargoes on public data use, paralogies due to whole genome duplication (WGD) and fractionation of undeleted duplicates, extensive paralogy from other sources, and the computational cost of existing methods. Results We address these problems, using the gene order of four core eudicot genomes (cacao, castor bean, papaya and grapevine) that have escaped any recent WGD events, and two others (poplar and cucumber) that descend from independent WGDs, in inferring the ancestral gene order of the rosid clade and those of its main subgroups, the fabids and malvids. We improve and adapt techniques including the OMG method for extracting large, paralogy-free, multiple orthologies from conflated pairwise synteny data among the six genomes and the PATHGROUPS approach for ancestral gene order reconstruction in a given phylogeny, where some genomes may be descendants of WGD events. We use the gene order evidence to evaluate the hypothesis that the order Malpighiales belongs to the malvids rather than as traditionally assigned to the fabids. Conclusions Gene orders of ancestral eudicot species, involving 10,000 or more genes can be reconstructed in an efficient, parsimonious and consistent way, despite paralogies due to WGD and other processes. Pairwise genomic syntenies provide appropriate input to a parameter-free procedure of multiple ortholog identification followed by gene-order reconstruction in solving instances of the "small phylogeny" problem. PMID:22759433
Structure and variation of the mitochondrial genome of fishes.

PubMed

Satoh, Takashi P; Miya, Masaki; Mabuchi, Kohji; Nishida, Mutsumi

2016-09-07

The mitochondrial (mt) genome has been used as an effective tool for phylogenetic and population genetic analyses in vertebrates. However, the structure and variability of the vertebrate mt genome are not well understood. A potential strategy for improving our understanding is to conduct a comprehensive comparative study of large mt genome data. The aim of this study was to characterize the structure and variability of the fish mt genome through comparative analysis of large datasets. An analysis of the secondary structure of proteins for 250 fish species (248 ray-finned and 2 cartilaginous fishes) illustrated that cytochrome c oxidase subunits (COI, COII, and COIII) and a cytochrome bc1 complex subunit (Cyt b) had substantial amino acid conservation. Among the four proteins, COI was the most conserved, as more than half of all amino acid sites were invariable among the 250 species. Our models identified 43 and 58 stems within 12S rRNA and 16S rRNA, respectively, with larger numbers than proposed previously for vertebrates. The models also identified 149 and 319 invariable sites in 12S rRNA and 16S rRNA, respectively, in all fishes. In particular, the present result verified that a region corresponding to the peptidyl transferase center in prokaryotic 23S rRNA, which is homologous to mt 16S rRNA, is also conserved in fish mt 16S rRNA. Concerning the gene order, we found 35 variations (in 32 families) that deviated from the common gene order in vertebrates. These gene rearrangements were mostly observed in the area spanning the ND5 gene to the control region as well as two tRNA gene cluster regions (IQM and WANCY regions). Although many of such gene rearrangements were unique to a specific taxon, some were shared polyphyletically between distantly related species. Through a large-scale comparative analysis of 250 fish species mt genomes, we elucidated various structural aspects of the fish mt genome and the encoded genes. The present results will be important for understanding functions of the mt genome and developing programs for nucleotide sequence analysis. This study demonstrated the significance of extensive comparisons for understanding the structure of the mt genome.
Complete chloroplast genome of Trachelium caeruleum: extensiverearrangements are associated with repeats and tRNAs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.

2006-01-09

Chloroplast genome structure, gene order and content arehighly conserved in land plants. We sequenced the complete chloroplastgenome sequence of Trachelium caeruleum (Campanulaceae) a member of anangiosperm family known for highly rearranged chloroplast genomes. Thetotal genome size is 162,321 bp with an IR of 27,273 bp, LSC of 100,113bp and SSC of 7,661 bp. The genome encodes 115 unique genes, with 19duplicated in the IR, a tRNA (trnI-CAU) duplicated once in the LSC and aprotein coding gene (psbJ) duplicated twice, for a total of 137 genes.Four genes (ycf15, rpl23, infA and accD) are truncated and likelynonfunctional; three others (clpP, ycf1 andmore » ycf2) are so highly divergedthat they may now be pseudogenes. The most conspicuous feature of theTrachelium genome is the presence of eighteen internally unrearrangedblocks of genes that have been inverted or relocated within the genome,relative to the typical gene order of most angiosperm chloroplastgenomes. Recombination between repeats or tRNAs has been suggested as twomeans of chloroplast genome rearrangements. We compared the relativenumber of repeats in Trachelium to eight other angiosperm chloroplastgenomes, and evaluated the location of repeats and tRNAs in relation torearrangements. Trachelium has the highest number and largest repeats,which are concentrated near inversion endpoints or other rearrangements.tRNAs occur at many but not all inversion endpoints. There is likely nosingle mechanism responsible for the remarkable number of alterations inthis genome, but both repeats and tRNAs are clearly associated with theserearrangements. Land plant chloroplast genomes are highly conserved instructure, gene order and content. The chloroplast genomes of ferns, thegymnosperm Ginkgo, and most angiosperms are nearly collinear, reflectingthe gene order in lineages that diverged from lycopsids and the ancestralchloroplast gene order over 350 million years ago (Raubeson and Jansen,1992). Although earlier mapping studies identified a number of taxa inwhich several rearrangements have occurred (reviewed in Raubeson andJansen, 2005), an extraordinary number of chloroplast genome alterationsare concentrated in several families in the angiosperm order Asterales(sensu APGII, Bremer et al., 2003). Gene mapping studies ofrepresentatives of the Campanulaceae (Cosner, 1993; Cosner et al.,1997,2004) and Lobeliaceae (Knox et al., 1993; Knox and Palmer, 1999)identified large inversions, contraction and expansion of the invertedrepeat regions, and several insertions and deletions in the cpDNAs ofthese closely related taxa. Detailed restriction site and gene mapping ofthe chloroplast genome of Trachelium caeruleum (Campanulaceae) identifiedseven to ten large inversions, families of repeats associated withrearrangements, possible transpositions, and even the disruption ofoperons (Cosner et al., 1997). Seventeen other members of theCampanulaceae were mapped and exhibit many additional rearrangements(Cosner et al., 2004). What happened in this lineage that made itsusceptible to so many chloroplast genome rearrangements? How do normallyvery conserved chloroplast genomes change? The cause of rearrangements inthis group is unclear based on the limited resolution available withmapping techniques. Several mechanisms have been proposed to explain howrearrangements occur: recombination between repeats, transposition, ortemporary instability due to loss of the inverted repeat (Raubeson andJansen, 2005). Sequencing whole chloroplast genomes within theCampanulaceae offers a unique opportunity to examine both the extent andmechanisms of rearrangements within a phylogenetic framework.We reporthere the first complete chloroplast genome sequence of a member of theCampanulaceae, Trachelium caeruleum. This work will serve as a benchmarkfor subsequent, comparative sequencing and analysis of other members ofthis family and close relatives, with the goal of further understandingchloroplast genome evolution. We confirmed features previously identifiedthrough mapping, and discovered many additional structural changes,including several partial to entire gene duplications, deterioration ofat least four normally conserved chloroplast genes into gene fragments,and the nature and position of numerous repeat elements at or nearinversion endpoints. The focus of this paper is on analyses of sequencesat or near these rearrangements in Trachelium caeruleum. Inversions arebelieved to occur due to the presence of repeat elements subject tohomologous recombination (Palmer, 1991; Knox et al., 1993). Repeats mayfacilitate inversions or other genome rearrangements (Achaz et al.,2003), and higher incidences of repeats have been correlated with greaternumbers of rearrangements (Rocha, 2003). Alternatively, repeats mayproliferate within a genome asa result of DNA strand repair mechanismsfollowing a rearrangement event such as an inversion. Gene« less
Optimization lighting layout based on gene density improved genetic algorithm for indoor visible light communications

NASA Astrophysics Data System (ADS)

Liu, Huanlin; Wang, Xin; Chen, Yong; Kong, Deqian; Xia, Peijie

2017-05-01

For indoor visible light communication system, the layout of LED lamps affects the uniformity of the received power on communication plane. In order to find an optimized lighting layout that meets both the lighting needs and communication needs, a gene density genetic algorithm (GDGA) is proposed. In GDGA, a gene indicates a pair of abscissa and ordinate of a LED, and an individual represents a LED layout in the room. The segmented crossover operation and gene mutation strategy based on gene density are put forward to make the received power on communication plane more uniform and increase the population's diversity. A weighted differences function between individuals is designed as the fitness function of GDGA for reserving the population having the useful LED layout genetic information and ensuring the global convergence of GDGA. Comparing square layout and circular layout, with the optimized layout achieved by the GDGA, the power uniformity increases by 83.3%, 83.1% and 55.4%, respectively. Furthermore, the convergence of GDGA is verified compared with evolutionary algorithm (EA). Experimental results show that GDGA can quickly find an approximation of optimal layout.
The vacuolar protein sorting genes in insects: A comparative genome view.

PubMed

Li, Zhaofei; Blissard, Gary

2015-07-01

In eukaryotic cells, regulated vesicular trafficking is critical for directing protein transport and for recycling and degradation of membrane lipids and proteins. Through carefully regulated transport vesicles, the endomembrane system performs a large and important array of dynamic cellular functions while maintaining the integrity of the cellular membrane system. Genetic studies in yeast Saccharomyces cerevisiae have identified approximately 50 vacuolar protein sorting (VPS) genes involved in vesicle trafficking, and most of these genes are also characterized in mammals. The VPS proteins form distinct functional complexes, which include complexes known as ESCRT, retromer, CORVET, HOPS, GARP, and PI3K-III. Little is known about the orthologs of VPS proteins in insects. Here, with the newly annotated Manduca sexta genome, we carried out genomic comparative analysis of VPS proteins in yeast, humans, and 13 sequenced insect genomes representing the Orders Hymenoptera, Diptera, Hemiptera, Phthiraptera, Lepidoptera, and Coleoptera. Amino acid sequence alignments and domain/motif structure analyses reveal that most of the components of ESCRT, retromer, CORVET, HOPS, GARP, and PI3K-III are evolutionarily conserved across yeast, insects, and humans. However, in contrast to the VPS gene expansions observed in the human genome, only four VPS genes (VPS13, VPS16, VPS33, and VPS37) were expanded in the six insect Orders. Additionally, VPS2 was expanded only in species from Phthiraptera, Lepidoptera, and Coleoptera. These studies provide a baseline for understanding the evolution of vesicular trafficking across yeast, insect, and human genomes, and also provide a basis for further addressing specific functional roles of VPS proteins in insects. Copyright © 2014 Elsevier Ltd. All rights reserved.
Uniform standards for genome databases in forest and fruit trees

USDA-ARS?s Scientific Manuscript database

TreeGenes and tfGDR serve the international forestry and fruit tree genomics research communities, respectively. These databases hold similar sequence data and provide resources for the submission and recovery of this information in order to enable comparative genomics research. Large-scale genotype...
The History of Bordetella pertussis Genome Evolution Includes Structural Rearrangement

PubMed Central

Peng, Yanhui; Loparev, Vladimir; Batra, Dhwani; Bowden, Katherine E.; Burroughs, Mark; Cassiday, Pamela K.; Davis, Jamie K.; Johnson, Taccara; Juieng, Phalasy; Knipe, Kristen; Mathis, Marsenia H.; Pruitt, Andrea M.; Rowe, Lori; Sheth, Mili; Tondella, M. Lucia; Williams, Margaret M.

2017-01-01

ABSTRACT Despite high pertussis vaccine coverage, reported cases of whooping cough (pertussis) have increased over the last decade in the United States and other developed countries. Although Bordetella pertussis is well known for its limited gene sequence variation, recent advances in long-read sequencing technology have begun to reveal genomic structural heterogeneity among otherwise indistinguishable isolates, even within geographically or temporally defined epidemics. We have compared rearrangements among complete genome assemblies from 257 B. pertussis isolates to examine the potential evolution of the chromosomal structure in a pathogen with minimal gene nucleotide sequence diversity. Discrete changes in gene order were identified that differentiated genomes from vaccine reference strains and clinical isolates of various genotypes, frequently along phylogenetic boundaries defined by single nucleotide polymorphisms. The observed rearrangements were primarily large inversions centered on the replication origin or terminus and flanked by IS481, a mobile genetic element with >240 copies per genome and previously suspected to mediate rearrangements and deletions by homologous recombination. These data illustrate that structural genome evolution in B. pertussis is not limited to reduction but also includes rearrangement. Therefore, although genomes of clinical isolates are structurally diverse, specific changes in gene order are conserved, perhaps due to positive selection, providing novel information for investigating disease resurgence and molecular epidemiology. IMPORTANCE Whooping cough, primarily caused by Bordetella pertussis, has resurged in the United States even though the coverage with pertussis-containing vaccines remains high. The rise in reported cases has included increased disease rates among all vaccinated age groups, provoking questions about the pathogen's evolution. The chromosome of B. pertussis includes a large number of repetitive mobile genetic elements that obstruct genome analysis. However, these mobile elements facilitate large rearrangements that alter the order and orientation of essential protein-encoding genes, which otherwise exhibit little nucleotide sequence diversity. By comparing the complete genome assemblies from 257 isolates, we show that specific rearrangements have been conserved throughout recent evolutionary history, perhaps by eliciting changes in gene expression, which may also provide useful information for molecular epidemiology. PMID:28167525
Comparative genomic analysis by microbial COGs self-attraction rate.

PubMed

Santoni, Daniele; Romano-Spica, Vincenzo

2009-06-21

Whole genome analysis provides new perspectives to determine phylogenetic relationships among microorganisms. The availability of whole nucleotide sequences allows different levels of comparison among genomes by several approaches. In this work, self-attraction rates were considered for each cluster of orthologous groups of proteins (COGs) class in order to analyse gene aggregation levels in physical maps. Phylogenetic relationships among microorganisms were obtained by comparing self-attraction coefficients. Eighteen-dimensional vectors were computed for a set of 168 completely sequenced microbial genomes (19 archea, 149 bacteria). The components of the vector represent the aggregation rate of the genes belonging to each of 18 COGs classes. Genes involved in nonessential functions or related to environmental conditions showed the highest aggregation rates. On the contrary genes involved in basic cellular tasks showed a more uniform distribution along the genome, except for translation genes. Self-attraction clustering approach allowed classification of Proteobacteria, Bacilli and other species belonging to Firmicutes. Rearrangement and Lateral Gene Transfer events may influence divergences from classical taxonomy. Each set of COG classes' aggregation values represents an intrinsic property of the microbial genome. This novel approach provides a new point of view for whole genome analysis and bacterial characterization.
The Complete Mitogenome of the Wood-Feeding Cockroach Cryptocercus meridianus (Blattodea: Cryptocercidae) and Its Phylogenetic Relationship among Cockroach Families.

PubMed

Li, Weijun; Wang, Zongqing; Che, Yanli

2017-11-12

In this study, the complete mitochondrial genome of Cryptocercus meridianus was sequenced. The circular mitochondrial genome is 15,322 bp in size and contains 13 protein-coding genes, two ribosomal RNA genes (12S rRNA and 16S rRNA), 22 transfer RNA genes, and one D-loop region. We compare the mitogenome of C. meridianus with that of C. relictus and C. kyebangensis . The base composition of the whole genome was 45.20%, 9.74%, 16.06%, and 29.00% for A, G, C, and T, respectively; it shows a high AT content (74.2%), similar to the mitogenomes of C. relictus and C. kyebangensis . The protein-coding genes are initiated with typical mitochondrial start codons except for cox1 with TTG. The gene order of the C. meridianus mitogenome differs from the typical insect pattern for the translocation of tRNA-Ser AGN , while the mitogenomes of the other two Cryptocercus species, C. relictus and C. kyebangensis , are consistent with the typical insect pattern. There are two very long non-coding intergenic regions lying on both sides of the rearranged gene tRNA-Ser AGN . The phylogenetic relationships were constructed based on the nucleotide sequence of 13 protein-coding genes and two ribosomal RNA genes. The mitogenome of C. meridianus is the first representative of the order Blattodea that demonstrates rearrangement, and it will contribute to the further study of the phylogeny and evolution of the genus Cryptocercus and related taxa.
Expression profile of genes associated with mastitis in dairy cattle

PubMed Central

2009-01-01

In order to characterize the expression of genes associated with immune response mechanisms to mastitis, we quantified the relative expression of the IL-2, IL-4, IL-6, IL-8, IL-10, IFN-γ and TNF- α genes in milk cells of healthy cows and cows with clinical mastitis. Total RNA was extracted from milk cells of six Black and White Holstein (BW) cows and six Gyr cows, including three animals with and three without mastitis per breed. Gene expression was analyzed by real-time PCR. IL-10 gene expression was higher in the group of BW and Gyr cows with mastitis compared to animals free of infection from both breeds (p < 0.05). It was also higher in BW Holstein animals with clinical mastitis (p < 0.001), but it was not significant when Gyr cows with and without mastitis were compared (0.05 < p < 0.10). Among healthy cows, BW Holstein animals tended to present a higher expression of all genes studied, with a significant difference for the IL-2 and IFN- γ genes (p < 0.001). For animals with mastitis no significant difference in gene expression was observed between the two breeds. These findings suggest that animals with mastitis develop a preferentially cell-mediated immune response. Further studies including larger samples are necessary to better characterize the gene expression profile in cows with mastitis. PMID:21637453
Selection of higher order regression models in the analysis of multi-factorial transcription data.

PubMed

Prazeres da Costa, Olivia; Hoffman, Arthur; Rey, Johannes W; Mansmann, Ulrich; Buch, Thorsten; Tresch, Achim

2014-01-01

Many studies examine gene expression data that has been obtained under the influence of multiple factors, such as genetic background, environmental conditions, or exposure to diseases. The interplay of multiple factors may lead to effect modification and confounding. Higher order linear regression models can account for these effects. We present a new methodology for linear model selection and apply it to microarray data of bone marrow-derived macrophages. This experiment investigates the influence of three variable factors: the genetic background of the mice from which the macrophages were obtained, Yersinia enterocolitica infection (two strains, and a mock control), and treatment/non-treatment with interferon-γ. We set up four different linear regression models in a hierarchical order. We introduce the eruption plot as a new practical tool for model selection complementary to global testing. It visually compares the size and significance of effect estimates between two nested models. Using this methodology we were able to select the most appropriate model by keeping only relevant factors showing additional explanatory power. Application to experimental data allowed us to qualify the interaction of factors as either neutral (no interaction), alleviating (co-occurring effects are weaker than expected from the single effects), or aggravating (stronger than expected). We find a biologically meaningful gene cluster of putative C2TA target genes that appear to be co-regulated with MHC class II genes. We introduced the eruption plot as a tool for visual model comparison to identify relevant higher order interactions in the analysis of expression data obtained under the influence of multiple factors. We conclude that model selection in higher order linear regression models should generally be performed for the analysis of multi-factorial microarray data.
Transcriptional over-expression of chloride intracellular channels 3 and 4 in malignant pleural mesothelioma.

PubMed

Tasiopoulou, Vasiliki; Magouliotis, Dimitrios; Solenov, Evgeniy I; Vavougios, Georgios; Molyvdas, Paschalis-Adam; Gourgoulianis, Konstantinos I; Hatzoglou, Chrissi; Zarogiannis, Sotirios G

2015-12-01

Chloride Intracellular Channels (CLICs) are contributing to the regulation of multiple cellular functions. CLICs have been found over-expressed in several malignancies, and therefore they are currently considered as potential drug targets. The goal of our study was to assess the gene expression levels of the CLIC's 1-6 in malignant pleural mesothelioma (MPM) as compared to controls. We used gene expression data from a publicly available microarray dataset comparing MPM versus healthy tissue in order to investigate the differential expression profile of CLIC 1-6. False discovery rates were calculated and the interactome of the significantly differentially expressed CLICs was constructed and Functional Enrichment Analysis for Gene Ontologies (FEAGO) was performed. In MPM, the gene expressions of CLIC3 and CLIC4 were significantly increased compared to controls (p=0.001 and p<0.001 respectively). A significant positive correlation between the gene expressions of CLIC3 and CLIC4 (p=0.0008 and Pearson's r=0.51) was found. Deming regression analysis provided an association equation between the CLIC3 and CLIC4 gene expressions: CLIC3=4.42CLIC4-10.07. Our results indicate that CLIC3 and CLIC4 are over-expressed in human MPM. Moreover, their expressions correlate suggesting that they either share common gene expression inducers or that their products act synergistically. FAEGO showed that CLIC interactome might contribute to TGF beta signaling and water transport. Copyright © 2015 Elsevier Ltd. All rights reserved.
Selecting a set of housekeeping genes for quantitative real-time PCR in normal and tetraploid haemocytes of soft-shell clams, Mya arenaria.

PubMed

Siah, A; Dohoo, C; McKenna, P; Delaporte, M; Berthe, F C J

2008-09-01

The transcripts involved in the molecular mechanisms of haemic neoplasia in relation to the haemocyte ploidy status of the soft-shell clam, Mya arenaria, have yet to be identified. For this purpose, real-time quantitative RT-PCR constitutes a sensitive and efficient technique, which can help determine the gene expression involved in haemocyte tetraploid status in clams affected by haemic neoplasia. One of the critical steps in comparing transcription profiles is the stability of selected housekeeping genes, as well as an accurate normalization. In this study, we selected five reference genes, S18, L37, EF1, EF2 and actin, generally used as single control genes. Their expression was analyzed by real-time quantitative RT-PCR at different levels of haemocyte ploidy status in order to select the most stable genes. Using the geNorm software, our results showed that L37, EF1 and S18 represent the most stable gene expressions related to various ploidy status ranging from 0 to 78% of tetraploid haemocytes in clams sampled in North River (Prince Edward Island, Canada). However, actin gene expression appeared to be highly regulated. Hence, using it as a housekeeping gene in tetraploid haemocytes can result in inaccurate data. To compare gene expression levels related to haemocyte ploidy status in Mya arenaria, using L37, EF1 and S18 as housekeeping genes for accurate normalization is therefore recommended.
Speciation in the Derrida-Higgs model with finite genomes and spatial populations

NASA Astrophysics Data System (ADS)

de Aguiar, Marcus A. M.

2017-02-01

The speciation model proposed by Derrida and Higgs demonstrated that a sexually reproducing population can split into different species in the absence of natural selection or any type of geographic isolation, provided that mating is assortative and the number of genes involved in the process is infinite. Here we revisit this model and simulate it for finite genomes, focusing on the question of how many genes it actually takes to trigger neutral sympatric speciation. We find that, for typical parameters used in the original model, it takes the order of 105 genes. We compare the results with a similar spatially explicit model where about 100 genes suffice for speciation. We show that when the number of genes is small the species that emerge are strongly segregated in space. For a larger number of genes, on the other hand, the spatial structure of the population is less important and the species distribution overlap considerably.
Sampling strategies for improving tree accuracy and phylogenetic analyses: a case study in ciliate protists, with notes on the genus Paramecium.

PubMed

Yi, Zhenzhen; Strüder-Kypke, Michaela; Hu, Xiaozhong; Lin, Xiaofeng; Song, Weibo

2014-02-01

In order to assess how dataset-selection for multi-gene analyses affects the accuracy of inferred phylogenetic trees in ciliates, we chose five genes and the genus Paramecium, one of the most widely used model protist genera, and compared tree topologies of the single- and multi-gene analyses. Our empirical study shows that: (1) Using multiple genes improves phylogenetic accuracy, even when their one-gene topologies are in conflict with each other. (2) The impact of missing data on phylogenetic accuracy is ambiguous: resolution power and topological similarity, but not number of represented taxa, are the most important criteria of a dataset for inclusion in concatenated analyses. (3) As an example, we tested the three classification models of the genus Paramecium with a multi-gene based approach, and only the monophyly of the subgenus Paramecium is supported. Copyright © 2013 Elsevier Inc. All rights reserved.
Complete mitochondrial genome of the invasive brown alga Sargassum muticum (Sargassaceae, Phaeophyceae).

PubMed

Liu, Feng; Pang, Shaojun

2016-01-01

Sargassum muticum (Yendo) Fensholt is an invasive canopy-forming brown alga, expanding its presence from Northeast Asia to North America and Europe. The complete mitochondrial genome of S. muticum is characterized as a circular molecule of 34,720 bp. The overall AT content of S. muticum mitogenome is 63.41%. This mitogenome contains 65 genes typically found in brown algae, including 3 ribosomal RNA genes, 25 transfer RNA genes, 35 protein-coding genes, and 2 conserved open reading frames (ORFs). The gene order of mitogenome for S. muticum is identical to that for Sargassum horneri, Fucus vesiculosus and Desmarestia viridis. Phylogenetic analyses based on 35 protein-coding genes reveal that S. muticum has a close evolutionary relationship with S. horneri and a distant relationship with Dictyota dichotoma, supporting current taxonomic systems. The present investigation provides new molecular data for studies of S. muticum population diversity as well as comparative genomics in the Phaeophyceae.

Reconsideration of systematic relationships within the order Euplotida (Protista, Ciliophora) using new sequences of the gene coding for small-subunit rRNA and testing the use of combined data sets to construct phylogenies of the Diophrys-complex.

PubMed

Yi, Zhenzhen; Song, Weibo; Clamp, John C; Chen, Zigui; Gao, Shan; Zhang, Qianqian

2009-03-01

Comprehensive molecular analyses of phylogenetic relationships within euplotid ciliates are relatively rare, and the relationships among some families remain questionable. We performed phylogenetic analyses of the order Euplotida based on new sequences of the gene coding for small-subunit RNA (SSrRNA) from a variety of taxa across the entire order as well as sequences from some of these taxa of other genes (ITS1-5.8S-ITS2 region and histone H4) that have not been included in previous analyses. Phylogenetic trees based on SSrRNA gene sequences constructed with four different methods had a consistent branching pattern that included the following features: (1) the "typical" euplotids comprised a paraphyletic assemblage composed of two divergent clades (family Uronychiidae and families Euplotidae-Certesiidae-Aspidiscidae-Gastrocirrhidae), (2) in the family Uronychiidae, the genera Uronychia and Paradiophrys formed a clearly outlined, well-supported clade that seemed to be rather divergent from Diophrys and Diophryopsis, suggesting that the Diophrys-complex may have had a longer and more separate evolutionary history than previously supposed, (3) inclusion of 12 new SSrRNA sequences in analyses of Euplotidae revealed two new clades of species within the family and cast additional doubt on the present classification of genera within the family, and (4) the intraspecific divergence among five species of Aspidisca was far greater than those of closely related genera. The ITS1-5.8S-ITS2 coding regions and partial histone H4 genes of six morphospecies in the Diophrys-complex were sequenced along with their SSrRNA genes and used to compare phylogenies constructed from single data sets to those constructed from combined sets. Results indicated that combined analyses could be used to construct more reliable, less ambiguous phylogenies of complex groups like the order Euplotida, because they provide a greater amount and diversity of information.
Exploiting the full power of temporal gene expression profiling through a new statistical test: application to the analysis of muscular dystrophy data.

PubMed

Vinciotti, Veronica; Liu, Xiaohui; Turk, Rolf; de Meijer, Emile J; 't Hoen, Peter A C

2006-04-03

The identification of biologically interesting genes in a temporal expression profiling dataset is challenging and complicated by high levels of experimental noise. Most statistical methods used in the literature do not fully exploit the temporal ordering in the dataset and are not suited to the case where temporal profiles are measured for a number of different biological conditions. We present a statistical test that makes explicit use of the temporal order in the data by fitting polynomial functions to the temporal profile of each gene and for each biological condition. A Hotelling T2-statistic is derived to detect the genes for which the parameters of these polynomials are significantly different from each other. We validate the temporal Hotelling T2-test on muscular gene expression data from four mouse strains which were profiled at different ages: dystrophin-, beta-sarcoglycan and gamma-sarcoglycan deficient mice, and wild-type mice. The first three are animal models for different muscular dystrophies. Extensive biological validation shows that the method is capable of finding genes with temporal profiles significantly different across the four strains, as well as identifying potential biomarkers for each form of the disease. The added value of the temporal test compared to an identical test which does not make use of temporal ordering is demonstrated via a simulation study, and through confirmation of the expression profiles from selected genes by quantitative PCR experiments. The proposed method maximises the detection of the biologically interesting genes, whilst minimising false detections. The temporal Hotelling T2-test is capable of finding relatively small and robust sets of genes that display different temporal profiles between the conditions of interest. The test is simple, it can be used on gene expression data generated from any experimental design and for any number of conditions, and it allows fast interpretation of the temporal behaviour of genes. The R code is available from V.V. The microarray data have been submitted to GEO under series GSE1574 and GSE3523.
Exploiting the full power of temporal gene expression profiling through a new statistical test: Application to the analysis of muscular dystrophy data

PubMed Central

Vinciotti, Veronica; Liu, Xiaohui; Turk, Rolf; de Meijer, Emile J; 't Hoen, Peter AC

2006-01-01

Background The identification of biologically interesting genes in a temporal expression profiling dataset is challenging and complicated by high levels of experimental noise. Most statistical methods used in the literature do not fully exploit the temporal ordering in the dataset and are not suited to the case where temporal profiles are measured for a number of different biological conditions. We present a statistical test that makes explicit use of the temporal order in the data by fitting polynomial functions to the temporal profile of each gene and for each biological condition. A Hotelling T2-statistic is derived to detect the genes for which the parameters of these polynomials are significantly different from each other. Results We validate the temporal Hotelling T2-test on muscular gene expression data from four mouse strains which were profiled at different ages: dystrophin-, beta-sarcoglycan and gamma-sarcoglycan deficient mice, and wild-type mice. The first three are animal models for different muscular dystrophies. Extensive biological validation shows that the method is capable of finding genes with temporal profiles significantly different across the four strains, as well as identifying potential biomarkers for each form of the disease. The added value of the temporal test compared to an identical test which does not make use of temporal ordering is demonstrated via a simulation study, and through confirmation of the expression profiles from selected genes by quantitative PCR experiments. The proposed method maximises the detection of the biologically interesting genes, whilst minimising false detections. Conclusion The temporal Hotelling T2-test is capable of finding relatively small and robust sets of genes that display different temporal profiles between the conditions of interest. The test is simple, it can be used on gene expression data generated from any experimental design and for any number of conditions, and it allows fast interpretation of the temporal behaviour of genes. The R code is available from V.V. The microarray data have been submitted to GEO under series GSE1574 and GSE3523. PMID:16584545
Comparative genomics of the mimicry switch in Papilio dardanus.

PubMed

Timmermans, Martijn J T N; Baxter, Simon W; Clark, Rebecca; Heckel, David G; Vogel, Heiko; Collins, Steve; Papanicolaou, Alexie; Fukova, Iva; Joron, Mathieu; Thompson, Martin J; Jiggins, Chris D; ffrench-Constant, Richard H; Vogler, Alfried P

2014-07-22

The African Mocker Swallowtail, Papilio dardanus, is a textbook example in evolutionary genetics. Classical breeding experiments have shown that wing pattern variation in this polymorphic Batesian mimic is determined by the polyallelic H locus that controls a set of distinct mimetic phenotypes. Using bacterial artificial chromosome (BAC) sequencing, recombination analyses and comparative genomics, we show that H co-segregates with an interval of less than 500 kb that is collinear with two other Lepidoptera genomes and contains 24 genes, including the transcription factor genes engrailed (en) and invected (inv). H is located in a region of conserved gene order, which argues against any role for genomic translocations in the evolution of a hypothesized multi-gene mimicry locus. Natural populations of P. dardanus show significant associations of specific morphs with single nucleotide polymorphisms (SNPs), centred on en. In addition, SNP variation in the H region reveals evidence of non-neutral molecular evolution in the en gene alone. We find evidence for a duplication potentially driving physical constraints on recombination in the lamborni morph. Absence of perfect linkage disequilibrium between different genes in the other morphs suggests that H is limited to nucleotide positions in the regulatory and coding regions of en. Our results therefore support the hypothesis that a single gene underlies wing pattern variation in P. dardanus.
Genomicus 2018: karyotype evolutionary trees and on-the-fly synteny computing

PubMed Central

Nguyen, Nga Thi Thuy; Vincens, Pierre

2018-01-01

Abstract Since 2010, the Genomicus web server is available online at http://genomicus.biologie.ens.fr/genomicus. This graphical browser provides access to comparative genomic analyses in four different phyla (Vertebrate, Plants, Fungi, and non vertebrate Metazoans). Users can analyse genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants, in an integrated evolutionary context. New analyses and visualization tools have recently been implemented in Genomicus Vertebrate. Karyotype structures from several genomes can now be compared along an evolutionary pathway (Multi-KaryotypeView), and synteny blocks can be computed and visualized between any two genomes (PhylDiagView). PMID:29087490
The complete mitochondrial genome of the medicinal fungus Ganoderma applanatum (Polyporales, Basidiomycota).

PubMed

Wang, Xin-Cun; Shao, Junjie; Liu, Chang

2016-07-01

We have determined the complete nucleotide sequence of the mitochondrial genome of the medicinal fungus Ganoderma applanatum (Pers.) Pat. using the next-generation sequencing technology. The circular molecule is 119,803 bp long with a GC content of 26.66%. Gene prediction revealed genes encoding 15 conserved proteins, 25 tRNAs, the large and small ribosomal RNAs, all genes are located on the same strand except trnW-CCA. Compared with previously sequenced genomes of G. lucidum, G. meredithiae and G. sinense, the order of the protein and rRNA genes is highly conserved; however, the types of tRNA genes are slightly different. The mitochondrial genome of G. applanatum will contribute to the understanding of the phylogeny and evolution of Ganoderma and Ganodermataceae, the group containing many species with high medicinal values.
Comparative analysis of grapevine whole-genome gene predictions, functional annotation, categorization and integration of the predicted gene sequences

PubMed Central

2012-01-01

Background The first draft assembly and gene prediction of the grapevine genome (8X base coverage) was made available to the scientific community in 2007, and functional annotation was developed on this gene prediction. Since then additional Sanger sequences were added to the 8X sequences pool and a new version of the genomic sequence with superior base coverage (12X) was produced. Results In order to more efficiently annotate the function of the genes predicted in the new assembly, it is important to build on as much of the previous work as possible, by transferring 8X annotation of the genome to the 12X version. The 8X and 12X assemblies and gene predictions of the grapevine genome were compared to answer the question, “Can we uniquely map 8X predicted genes to 12X predicted genes?” The results show that while the assemblies and gene structure predictions are too different to make a complete mapping between them, most genes (18,725) showed a one-to-one relationship between 8X predicted genes and the last version of 12X predicted genes. In addition, reshuffled genomic sequence structures appeared. These highlight regions of the genome where the gene predictions need to be taken with caution. Based on the new grapevine gene functional annotation and in-depth functional categorization, twenty eight new molecular networks have been created for VitisNet while the existing networks were updated. Conclusions The outcomes of this study provide a functional annotation of the 12X genes, an update of VitisNet, the system of the grapevine molecular networks, and a new functional categorization of genes. Data are available at the VitisNet website (http://www.sdstate.edu/ps/research/vitis/pathways.cfm). PMID:22554261
Complete genome sequence and comparative genome analysis of Klebsiella oxytoca HKOPL1 isolated from giant panda feces.

PubMed

Jiang, Jingwei; Tun, Hein Min; Mauroo, Nathalie France; Ma, Angel Po Yee; Chan, San Yuen; Leung, Frederick C

2014-11-23

The giant panda (Ailuropoda melanoleuca) is an endangered species well-known for ingesting bamboo as a major part of their diet despite the fact that it belongs to order Carnivora. However, the giant panda's draft genome shows no direct evidence of enzymatic genes responsible for cellulose digestion. To explore this phenomenon, we study the giant panda's gut microbiota using genomic approaches in order to better understand their physiological processes as well as any potential microbial cellulose digestion processes. A complete genome of isolated Klebsiella oxytoca HKOPL1 of 5.9 Mb has been successfully sequenced, closed and comprehensively annotated against various databases. Genome comparisons within the Klebsiella genus and K. oxytoca species have also been performed. A total of 5,772 genes were predicted, and among them, 211 potential virulence genes, 35 pathogenicity island-like regions, 1,615 potential horizontal transferring genes, 23 potential antibiotics resistant genes, a potential prophage integrated region, 8 genes in 2,3-Butanediol production pathway and 3 genes in the cellulose degradation pathway could be identified and discussed based on the comparative genomic studies between the complete genome sequence of K. oxytoca HKOPL1 and other Klebsiella strains. A functional study shows that K. oxytoca HKOPL1 can degrade cellulose within 72 hours. Phylogenomic studies indicate that K. oxytoca HKOPL1 is clustered with K. oxytoca strains 1686 and E718. K. oxytoca HKOPL1 is a gram-negative bacterium able to degrade cellulose. We report here the first complete genome sequence of K. oxytoca isolated from giant panda feces. These studies have provided further insight into the role of gut microbiota in giant panda digestive physiology. In addition, K. oxytoca HKOPL1 has the potential for biofuel application in terms of cellulose degradation and potential for the production of 2,3-Butanediol (an important industrial raw material).
Genetic relatedness of orbiviruses by RNA-RNA blot hybridization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bodkin, D.K.

1985-01-01

RNA-RNA blot hybridization was developed in order to identify type-specific genes among double-stranded (ds) RNA viruses, to assess the genetic relatedness of dsRNA viruses and to classify new strains. Viral dsRNA segments were electrophoresed through 10% polyacrylamide gels, transferred to membranes, and hybridized to (5'/sup 32/P)-pCp labeled genomic RNA from a related strain. Hybridization was performed at 52/sup 0/C, 50% formamide, 5X SSC. Under these conditions heterologous RNA species must share greater than or equal to 74% sequence homology in order to form stable dsRNA hybrids. Cognate genes of nine members of the Palyam serogroup of orbiviruses were identified andmore » their sequence relatedness to the prototype. Palyam virus, was determined. Reciprocal blot hybridizations were performed using radiolabeled genomic RNA of all members of the Palyam serogroup. Unique and variant genes were identified by lack of cross-homology or by weak homology between segments. Since genes 2 and 6 exhibited the highest degree of sequence variability, response to the vertebrate immune system may be a major cause of sequence divergence among members of a single serogroup. Changuinola serogroup isolates were compared by dot-blot hybridization, while Colorado tick fever (CTF) serogroup isolates were compared by the RNA-RNA blot hybridization procedure described for reovirus and Palyam serogroup isolates. Preliminary blot hybridization data were also obtained on the relatedness of members of different Orbivirus serogroups.« less
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery

PubMed Central

Feng, Chunlai; Araki, Michihiro; Kunimoto, Ryo; Tamon, Akiko; Makiguchi, Hiroki; Niijima, Satoshi; Tsujimoto, Gozoh; Okuno, Yasushi

2009-01-01

Background DNA microarray technology provides us with a first step toward the goal of uncovering gene functions on a genomic scale. In recent years, vast amounts of gene expression data have been collected, much of which are available in public databases, such as the Gene Expression Omnibus (GEO). To date, most researchers have been manually retrieving data from databases through web browsers using accession numbers (IDs) or keywords, but gene-expression patterns are not considered when retrieving such data. The Connectivity Map was recently introduced to compare gene expression data by introducing gene-expression signatures (represented by a set of genes with up- or down-regulated labels according to their biological states) and is available as a web tool for detecting similar gene-expression signatures from a limited data set (approximately 7,000 expression profiles representing 1,309 compounds). In order to support researchers to utilize the public gene expression data more effectively, we developed a web tool for finding similar gene expression data and generating its co-expression networks from a publicly available database. Results GEM-TREND, a web tool for searching gene expression data, allows users to search data from GEO using gene-expression signatures or gene expression ratio data as a query and retrieve gene expression data by comparing gene-expression pattern between the query and GEO gene expression data. The comparison methods are based on the nonparametric, rank-based pattern matching approach of Lamb et al. (Science 2006) with the additional calculation of statistical significance. The web tool was tested using gene expression ratio data randomly extracted from the GEO and with in-house microarray data, respectively. The results validated the ability of GEM-TREND to retrieve gene expression entries biologically related to a query from GEO. For further analysis, a network visualization interface is also provided, whereby genes and gene annotations are dynamically linked to external data repositories. Conclusion GEM-TREND was developed to retrieve gene expression data by comparing query gene-expression pattern with those of GEO gene expression data. It could be a very useful resource for finding similar gene expression profiles and constructing its gene co-expression networks from a publicly available database. GEM-TREND was designed to be user-friendly and is expected to support knowledge discovery. GEM-TREND is freely available at . PMID:19728865
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery.

PubMed

Feng, Chunlai; Araki, Michihiro; Kunimoto, Ryo; Tamon, Akiko; Makiguchi, Hiroki; Niijima, Satoshi; Tsujimoto, Gozoh; Okuno, Yasushi

2009-09-03

DNA microarray technology provides us with a first step toward the goal of uncovering gene functions on a genomic scale. In recent years, vast amounts of gene expression data have been collected, much of which are available in public databases, such as the Gene Expression Omnibus (GEO). To date, most researchers have been manually retrieving data from databases through web browsers using accession numbers (IDs) or keywords, but gene-expression patterns are not considered when retrieving such data. The Connectivity Map was recently introduced to compare gene expression data by introducing gene-expression signatures (represented by a set of genes with up- or down-regulated labels according to their biological states) and is available as a web tool for detecting similar gene-expression signatures from a limited data set (approximately 7,000 expression profiles representing 1,309 compounds). In order to support researchers to utilize the public gene expression data more effectively, we developed a web tool for finding similar gene expression data and generating its co-expression networks from a publicly available database. GEM-TREND, a web tool for searching gene expression data, allows users to search data from GEO using gene-expression signatures or gene expression ratio data as a query and retrieve gene expression data by comparing gene-expression pattern between the query and GEO gene expression data. The comparison methods are based on the nonparametric, rank-based pattern matching approach of Lamb et al. (Science 2006) with the additional calculation of statistical significance. The web tool was tested using gene expression ratio data randomly extracted from the GEO and with in-house microarray data, respectively. The results validated the ability of GEM-TREND to retrieve gene expression entries biologically related to a query from GEO. For further analysis, a network visualization interface is also provided, whereby genes and gene annotations are dynamically linked to external data repositories. GEM-TREND was developed to retrieve gene expression data by comparing query gene-expression pattern with those of GEO gene expression data. It could be a very useful resource for finding similar gene expression profiles and constructing its gene co-expression networks from a publicly available database. GEM-TREND was designed to be user-friendly and is expected to support knowledge discovery. GEM-TREND is freely available at http://cgs.pharm.kyoto-u.ac.jp/services/network.
Development of PCR primers specific for the amplification and direct sequencing of gyrB genes from microbacteria, order Actinomycetales.

PubMed

Richert, Kathrin; Brambilla, Evelyne; Stackebrandt, Erko

2005-01-01

PCR primer sets were developed for the specific amplification and sequence analyses encoding the gyrase subunit B (gyrB) of members of the family Microbacteriaceae, class Actinobacteria. The family contains species highly related by 16S rRNA gene sequence analyses. In order to test if the gene sequence analysis of gyrB is appropriate to discriminate between closely related species, we evaluate the 16S rRNA gene phylogeny of its members. As the published universal primer set for gyrB failed to amplify the responding gene of the majority of the 80 type strains of the family, three new primer sets were identified that generated fragments with a composite sequence length of about 900 nt. However, the amplification of all three fragments was successful only in 25% of the 80 type strains. In this study, the substitution frequencies in genes encoding gyrase and 16S rDNA were compared for 10 strains of nine genera. The frequency of gyrB nucleotide substitution is significantly higher than that of the 16S rDNA, and no linear correlation exists between the similarities of both molecules among members of the Microbacteriaceae. The phylogenetic analyses using the gyrB sequences provide higher resolution than using 16S rDNA sequences and seem able to discriminate between closely related species.
Gene network biological validity based on gene-gene interaction relevance.

PubMed

Gómez-Vela, Francisco; Díaz-Díaz, Norberto

2014-01-01

In recent years, gene networks have become one of the most useful tools for modeling biological processes. Many inference gene network algorithms have been developed as techniques for extracting knowledge from gene expression data. Ensuring the reliability of the inferred gene relationships is a crucial task in any study in order to prove that the algorithms used are precise. Usually, this validation process can be carried out using prior biological knowledge. The metabolic pathways stored in KEGG are one of the most widely used knowledgeable sources for analyzing relationships between genes. This paper introduces a new methodology, GeneNetVal, to assess the biological validity of gene networks based on the relevance of the gene-gene interactions stored in KEGG metabolic pathways. Hence, a complete KEGG pathway conversion into a gene association network and a new matching distance based on gene-gene interaction relevance are proposed. The performance of GeneNetVal was established with three different experiments. Firstly, our proposal is tested in a comparative ROC analysis. Secondly, a randomness study is presented to show the behavior of GeneNetVal when the noise is increased in the input network. Finally, the ability of GeneNetVal to detect biological functionality of the network is shown.
LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes.

PubMed

Wang, Dapeng; Zhang, Yubin; Fan, Zhonghua; Liu, Guiming; Yu, Jun

2012-01-01

Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database-LCGbase (a comprehensive database for lineage-based co-regulated genes)-hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three comprehensible advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warm-blooded, and reptiles) onto human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes that flanking the cluster across species. We also provide useful tools for sequence alignment, gene ontology (GO) annotation, promoter identification, gene expression (co-expression), and evolutionary analysis. This database not only provides a way to define lineage-specific and species-specific gene clusters but also facilitates future studies on gene co-regulation, epigenetic control of gene expression (DNA methylation and histone marks), and chromosomal structures in a context of gene clusters and species evolution. LCGbase is freely available at http://lcgbase.big.ac.cn/LCGbase.
Genome-wide analysis reveals class and gene specific codon usage adaptation in avian paramyxoviruses 1

USDA-ARS?s Scientific Manuscript database

In order to characterize the evolutionary adaptations of avian paramyxovirus 1 (APMV-1) genomes, we have compared codon usage and codon adaptation indexes among groups of Newcastle disease viruses that differ in biological, ecological, and genetic characteristics. We have used available GenBank com...
The complete chloroplast DNA sequence of the green alga Oltmannsiellopsis viridis reveals a distinctive quadripartite architecture in the chloroplast genome of early diverging ulvophytes

PubMed Central

Pombert, Jean-François; Lemieux, Claude; Turmel, Monique

2006-01-01

Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. The basal position of the Prasinophyceae has been well documented, but the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae is currently debated. The four complete chloroplast DNA (cpDNA) sequences presently available for representatives of these classes have revealed extensive variability in overall structure, gene content, intron composition and gene order. The chloroplast genome of Pseudendoclonium (Ulvophyceae), in particular, is characterized by an atypical quadripartite architecture that deviates from the ancestral type by a large inverted repeat (IR) featuring an inverted rRNA operon and a small single-copy (SSC) region containing 14 genes normally found in the large single-copy (LSC) region. To gain insights into the nature of the events that led to the reorganization of the chloroplast genome in the Ulvophyceae, we have determined the complete cpDNA sequence of Oltmannsiellopsis viridis, a representative of a distinct, early diverging lineage. Results The 151,933 bp IR-containing genome of Oltmannsiellopsis differs considerably from Pseudendoclonium and other chlorophyte cpDNAs in intron content and gene order, but shares close similarities with its ulvophyte homologue at the levels of quadripartite architecture, gene content and gene density. Oltmannsiellopsis cpDNA encodes 105 genes, contains five group I introns, and features many short dispersed repeats. As in Pseudendoclonium cpDNA, the rRNA genes in the IR are transcribed toward the single copy region featuring the genes typically found in the ancestral LSC region, and the opposite single copy region harbours genes characteristic of both the ancestral SSC and LSC regions. The 52 genes that were transferred from the ancestral LSC to SSC region include 12 of those observed in Pseudendoclonium cpDNA. Surprisingly, the overall gene organization of Oltmannsiellopsis cpDNA more closely resembles that of Chlorella (Trebouxiophyceae) cpDNA. Conclusion The chloroplast genome of the last common ancestor of Oltmannsiellopsis and Pseudendoclonium contained a minimum of 108 genes, carried only a few group I introns, and featured a distinctive quadripartite architecture. Numerous changes were experienced by the chloroplast genome in the lineages leading to Oltmannsiellopsis and Pseudendoclonium. Our comparative analyses of chlorophyte cpDNAs support the notion that the Ulvophyceae is sister to the Chlorophyceae. PMID:16472375
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation

PubMed Central

Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; Taylor, Ronald C.; Weisenhorn, Pamela; Olson, Robert D.; Stevens, Rick L.; Rocha, Miguel; Rocha, Isabel; Best, Aaron A.; DeJongh, Matthew; Tintle, Nathan L.; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.

2016-01-01

Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain. PMID:27933038
Comparative Genomics of Bacteriophage of the Genus Seuratvirus

PubMed Central

Sazinas, Pavelas; Redgwell, Tamsin; Rihtman, Branko; Grigonyte, Aurelija; Michniewski, Slawomir; Scanlan, David J; Hobman, Jon

2018-01-01

Abstract Despite being more abundant and having smaller genomes than their bacterial host, relatively few bacteriophages have had their genomes sequenced. Here, we isolated 14 bacteriophages from cattle slurry and performed de novo genome sequencing, assembly, and annotation. The commonly used marker genes polB and terL showed these bacteriophages to be closely related to members of the genus Seuratvirus. We performed a core-gene analysis using the 14 new and four closely related genomes. A total of 58 core genes were identified, the majority of which has no known function. These genes were used to construct a core-gene phylogeny, the results of which confirmed the new isolates to be part of the genus Seuratvirus and expanded the number of species within this genus to four. All bacteriophages within the genus contained the genes queCDE encoding enzymes involved in queuosine biosynthesis. We suggest these genes are carried as a mechanism to modify DNA in order to protect these bacteriophages against host endonucleases. PMID:29272407
Differentially expressed genes in healthy and plum pox virus-infected Nicotiana benthamiana plants.

PubMed

Vozárová, Z; Žilová, M; Šubr, Z

2015-12-01

Viruses use both material and energy sources of their hosts and redirect the production of disposable compounds in order to make viral replication more efficient. Metabolism of infected organisms is modified by these enhanced requirements as well by their own defense response. Resulting complex story consists of many regulation events on various gene expression levels. Elucidating these processes may contribute to the knowledge on virus-host interactions and to evolving new antiviral strategies. In our work we applied a subtractive cloning technique to compare the transcriptomes of healthy and plum pox virus (PPV)-infected Nicotiana benthamiana plants. Several genes were found to be induced or repressed by the PPV infection. The induced genes were mainly related to general stress response or photosynthesis, several repressed genes could be connected with growth defects evoked by the infection. Interestingly, some genes usually up-regulated by fungal or bacterial infection were found repressed in PPV-infected plants. Potential involvement of particular differently expressed genes in the process of PPV infection is discussed.
Phylogenetic analysis of two Plectus mitochondrial genomes (Nematoda: Plectida) supports a sister group relationship between Plectida and Rhabditida within Chromadorea.

PubMed

Kim, Jiyeon; Kern, Elizabeth; Kim, Taeho; Sim, Mikang; Kim, Jaebum; Kim, Yuseob; Park, Chungoo; Nadler, Steven A; Park, Joong-Ki

2017-02-01

Plectida is an important nematode order with species that occupy many different biological niches. The order includes free-living aquatic and soil-dwelling species, but its phylogenetic position has remained uncertain. We sequenced the complete mitochondrial genomes of two members of this order, Plectus acuminatus and Plectus aquatilis and compared them with those of other major nematode clades. The genome size and base composition of these species are similar to other nematodes; 14,831 and 14,372bp, respectively, with AT contents of 71.0% and 70.1%. Gene content was also similar to other nematodes, but gene order and coding direction of Plectus mtDNAs were dissimilar from other chromadorean species. P. acuminatus and P. aquatilis are the first chromadorean species found to contain a gene inversion. We reconstructed mitochondrial genome phylogenetic trees using nucleotide and amino acid datasets from 87 nematodes that represent major nematode clades, including the Plectus sequences. Trees from phylogenetic analyses using maximum likelihood and Bayesian methods depicted Plectida as the sister group to other sequenced chromadorean nematodes. This finding is consistent with several phylogenetic results based on SSU rDNA, but disagrees with a classification based on morphology. Mitogenomes representing other basal chromadorean groups (Araeolaimida, Monhysterida, Desmodorida, Chromadorida) are needed to confirm their phylogenetic relationships. Copyright © 2016 Elsevier Inc. All rights reserved.

The complete mitochondrial genome of parasitic nematode Camallanus cotti: extreme discontinuity in the rate of mitogenomic architecture evolution within the Chromadorea class.

PubMed

Zou, Hong; Jakovlić, Ivan; Chen, Rong; Zhang, Dong; Zhang, Jin; Li, Wen-Xiang; Wang, Gui-Tang

2017-11-02

Complete mitochondrial genomes are much better suited for the taxonomic identification and phylogenetic studies of nematodes than morphology or traditionally-used molecular markers, but they remain unavailable for the entire Camallanidae family (Chromadorea). As the only published mitogenome in the Camallanina suborder (Dracunculoidea superfamily) exhibited a unique gene order, the other objective of this research was to study the evolution of mitochondrial architecture in the Spirurida order. Thus, we sequenced the complete mitogenome of the Camallanus cotti fish parasite and conducted structural and phylogenomic comparative analyses with all available Spirurida mitogenomes. The mitogenome is exceptionally large (17,901 bp) among the Chromadorea and, with 46 (pseudo-) genes, exhibits a unique architecture among nematodes. Six protein-coding genes (PCGs) and six tRNAs are duplicated. An additional (seventh) tRNA (Trp) was probably duplicated by the remolding of tRNA-Ser2 (missing). Two pairs of these duplicated PCGs might be functional; three were incomplete and one contained stop codons. Apart from Ala and Asp, all other duplicated tRNAs are conserved and probably functional. Only 19 unique tRNAs were found. Phylogenomic analysis included Gnathostomatidae (Spirurina) in the Camallanina suborder. Within the Nematoda, comparable PCG duplications were observed only in the enoplean Mermithidae family, but those result from mitochondrial recombination, whereas characteristics of the studied mitogenome suggest that likely rearrangement mechanisms are either a series of duplications, transpositions and random loss events, or duplication, fragmentation and subsequent reassembly of the mitogenome. We put forward a hypothesis that the evolution of mitogenomic architecture is extremely discontinuous, and that once a long period of stasis in gene order and content has been punctuated by a rearrangement event, such a destabilised mitogenome is much more likely to undergo subsequent rearrangement events, resulting in an exponentially accelerated evolutionary rate of mitogenomic rearrangements. Implications of this model are particularly important for the application of gene order similarity as an additive source of phylogenetic information. Chromadorean nematodes, and particularly Camallanina clade (with C. cotti as an example of extremely accelerated rate of rearrangements), might be a good model to further study this discontinuity in the dynamics of mitogenomic evolution.
Global transcriptome analysis of the C57BL/6J mouse testis by SAGE: evidence for nonrandom gene order.

PubMed

Divina, Petr; Vlcek, Cestmír; Strnad, Petr; Paces, Václav; Forejt, Jirí

2005-03-05

We generated the gene expression profile of the total testis from the adult C57BL/6J male mice using serial analysis of gene expression (SAGE). Two high-quality SAGE libraries containing a total of 76 854 tags were constructed. An extensive bioinformatic analysis and comparison of SAGE transcriptomes of the total testis, testicular somatic cells and other mouse tissues was performed and the theory of male-biased gene accumulation on the X chromosome was tested. We sorted out 829 genes predominantly expressed from the germinal part and 944 genes from the somatic part of the testis. The genes preferentially and specifically expressed in total testis and testicular somatic cells were identified by comparing the testis SAGE transcriptomes to the available transcriptomes of seven non-testis tissues. We uncovered chromosomal clusters of adjacent genes with preferential expression in total testis and testicular somatic cells by a genome-wide search and found that the clusters encompassed a significantly higher number of genes than expected by chance. We observed a significant 3.2-fold enrichment of the proportion of X-linked genes specific for testicular somatic cells, while the proportions of X-linked genes specific for total testis and for other tissues were comparable. In contrast to the tissue-specific genes, an under-representation of X-linked genes in the total testis transcriptome but not in the transcriptomes of testicular somatic cells and other tissues was detected. Our results provide new evidence in favor of the theory of male-biased genes accumulation on the X chromosome in testicular somatic cells and indicate the opposite action of the meiotic X-inactivation in testicular germ cells.
Global transcriptome analysis of the C57BL/6J mouse testis by SAGE: evidence for nonrandom gene order

PubMed Central

Divina, Petr; Vlček, Čestmír; Strnad, Petr; Pačes, Václav; Forejt, Jiří

2005-01-01

Background We generated the gene expression profile of the total testis from the adult C57BL/6J male mice using serial analysis of gene expression (SAGE). Two high-quality SAGE libraries containing a total of 76 854 tags were constructed. An extensive bioinformatic analysis and comparison of SAGE transcriptomes of the total testis, testicular somatic cells and other mouse tissues was performed and the theory of male-biased gene accumulation on the X chromosome was tested. Results We sorted out 829 genes predominantly expressed from the germinal part and 944 genes from the somatic part of the testis. The genes preferentially and specifically expressed in total testis and testicular somatic cells were identified by comparing the testis SAGE transcriptomes to the available transcriptomes of seven non-testis tissues. We uncovered chromosomal clusters of adjacent genes with preferential expression in total testis and testicular somatic cells by a genome-wide search and found that the clusters encompassed a significantly higher number of genes than expected by chance. We observed a significant 3.2-fold enrichment of the proportion of X-linked genes specific for testicular somatic cells, while the proportions of X-linked genes specific for total testis and for other tissues were comparable. In contrast to the tissue-specific genes, an under-representation of X-linked genes in the total testis transcriptome but not in the transcriptomes of testicular somatic cells and other tissues was detected. Conclusion Our results provide new evidence in favor of the theory of male-biased genes accumulation on the X chromosome in testicular somatic cells and indicate the opposite action of the meiotic X-inactivation in testicular germ cells. PMID:15748293
Distribution and quantification of antibiotic resistant genes and bacteria across agricultural and non-agricultural metagenomes.

PubMed

Durso, Lisa M; Miller, Daniel N; Wienhold, Brian J

2012-01-01

There is concern that antibiotic resistance can potentially be transferred from animals to humans through the food chain. The relationship between specific antibiotic resistant bacteria and the genes they carry remains to be described. Few details are known about the ecology of antibiotic resistant genes and bacteria in food production systems, or how antibiotic resistance genes in food animals compare to antibiotic resistance genes in other ecosystems. Here we report the distribution of antibiotic resistant genes in publicly available agricultural and non-agricultural metagenomic samples and identify which bacteria are likely to be carrying those genes. Antibiotic resistance, as coded for in the genes used in this study, is a process that was associated with all natural, agricultural, and human-impacted ecosystems examined, with between 0.7 to 4.4% of all classified genes in each habitat coding for resistance to antibiotic and toxic compounds (RATC). Agricultural, human, and coastal-marine metagenomes have characteristic distributions of antibiotic resistance genes, and different bacteria that carry the genes. There is a larger percentage of the total genome associated with antibiotic resistance in gastrointestinal-associated and agricultural metagenomes compared to marine and Antarctic samples. Since antibiotic resistance genes are a natural part of both human-impacted and pristine habitats, presence of these resistance genes in any specific habitat is therefore not sufficient to indicate or determine impact of anthropogenic antibiotic use. We recommend that baseline studies and control samples be taken in order to determine natural background levels of antibiotic resistant bacteria and/or antibiotic resistance genes when investigating the impacts of veterinary use of antibiotics on human health. We raise questions regarding whether the underlying biology of each type of bacteria contributes to the likelihood of transfer via the food chain.
Validation of reference genes for real-time quantitative PCR normalization in soybean developmental and germinating seeds.

PubMed

Li, Qing; Fan, Cheng-Ming; Zhang, Xiao-Mei; Fu, Yong-Fu

2012-10-01

Most of traditional reference genes chosen for real-time quantitative PCR normalization were assumed to be ubiquitously and constitutively expressed in vegetative tissues. However, seeds show distinct transcriptomes compared with the vegetative tissues. Therefore, there is a need for re-validation of reference genes in samples of seed development and germination, especially for soybean seeds. In this study, we aimed at identifying reference genes suitable for the quantification of gene expression level in soybean seeds. In order to identify the best reference genes for soybean seeds, 18 putative reference genes were tested with various methods in different seed samples. We combined the outputs of both geNorm and NormFinder to assess the expression stability of these genes. The reference genes identified as optimums for seed development were TUA5 and UKN2, whereas for seed germination they were novel reference genes Glyma05g37470 and Glyma08g28550. Furthermore, for total seed samples it was necessary to combine four genes of Glyma05g37470, Glyma08g28550, Glyma18g04130 and UKN2 [corrected] for normalization. Key message We identified several reference genes that stably expressed in soybean seed developmental and germinating processes.
The complete mitochondrial genomes of two rice planthoppers, Nilaparvata lugens and Laodelphax striatellus: conserved genome rearrangement in Delphacidae and discovery of new characteristics of atp8 and tRNA genes.

PubMed

Zhang, Kai-Jun; Zhu, Wen-Chao; Rong, Xia; Zhang, Yan-Kai; Ding, Xiu-Lei; Liu, Jing; Chen, Da-Song; Du, Yu; Hong, Xiao-Yue

2013-06-22

Nilaparvata lugens (the brown planthopper, BPH) and Laodelphax striatellus (the small brown planthopper, SBPH) are two of the most important pests of rice. Up to now, there was only one mitochondrial genome of rice planthopper has been sequenced and very few dependable information of mitochondria could be used for research on population genetics, phylogeographics and phylogenetic evolution of these pests. To get more valuable information from the mitochondria, we sequenced the complete mitochondrial genomes of BPH and SBPH. These two planthoppers were infected with two different functional Wolbachia (intracellular endosymbiont) strains (wLug and wStri). Since both mitochondria and Wolbachia are transmitted by cytoplasmic inheritance and it was difficult to separate them when purified the Wolbachia particles, concomitantly sequencing the genome of Wolbachia using next generation sequencing method, we also got nearly complete mitochondrial genome sequences of these two rice planthoppers. After gap closing, we present high quality and reliable complete mitochondrial genomes of these two planthoppers. The mitogenomes of N. lugens (BPH) and L. striatellus (SBPH) are 17, 619 bp and 16, 431 bp long with A + T contents of 76.95% and 77.17%, respectively. Both species have typical circular mitochondrial genomes that encode the complete set of 37 genes which are usually found in metazoans. However, the BPH mitogenome also possesses two additional copies of the trnC gene. In both mitochondrial genomes, the lengths of the atp8 gene were conspicuously shorter than that of all other known insect mitochondrial genomes (99 bp for BPH, 102 bp for SBPH). That two rearrangement regions (trnC-trnW and nad6-trnP-trnT) of mitochondrial genomes differing from other known insect were found in these two distantly related planthoppers revealed that the gene order of mitochondria might be conservative in Delphacidae. The large non-coding fragment (the A+T-rich region) putatively corresponding responsible for the control of replication and transcription of mitochondria contained a variable number of tandem repeats (VNTRs) block in different natural individuals of these two planthoppers. Comparison with a previously sequenced individual of SBPH revealed that the mitochondrial genetic variation within a species exists not only in the sequence and secondary structure of genes, but also in the gene order (the different location of trnH gene). The mitochondrial genome arrangement pattern found in planthoppers was involved in rearrangements of both tRNA genes and protein-coding genes (PCGs). Different species from different genera of Delphacidae possessing the same mitochondrial gene rearrangement suggests that gene rearrangements of mitochondrial genome probably occurred before the differentiation of this family. After comparatively analyzing the gene order of different species of Hemiptera, we propose that except for some specific taxonomical group (e.g. the whiteflies) the gene order might have diversified in family level of this order. The VNTRs detected in the control region might provide additional genetic markers for studying population genetics, individual difference and phylogeographics of planthoppers.
The complete mitochondrial genomes of two rice planthoppers, Nilaparvata lugens and Laodelphax striatellus: conserved genome rearrangement in Delphacidae and discovery of new characteristics of atp8 and tRNA genes

PubMed Central

2013-01-01

Background Nilaparvata lugens (the brown planthopper, BPH) and Laodelphax striatellus (the small brown planthopper, SBPH) are two of the most important pests of rice. Up to now, there was only one mitochondrial genome of rice planthopper has been sequenced and very few dependable information of mitochondria could be used for research on population genetics, phylogeographics and phylogenetic evolution of these pests. To get more valuable information from the mitochondria, we sequenced the complete mitochondrial genomes of BPH and SBPH. These two planthoppers were infected with two different functional Wolbachia (intracellular endosymbiont) strains (wLug and wStri). Since both mitochondria and Wolbachia are transmitted by cytoplasmic inheritance and it was difficult to separate them when purified the Wolbachia particles, concomitantly sequencing the genome of Wolbachia using next generation sequencing method, we also got nearly complete mitochondrial genome sequences of these two rice planthoppers. After gap closing, we present high quality and reliable complete mitochondrial genomes of these two planthoppers. Results The mitogenomes of N. lugens (BPH) and L. striatellus (SBPH) are 17, 619 bp and 16, 431 bp long with A + T contents of 76.95% and 77.17%, respectively. Both species have typical circular mitochondrial genomes that encode the complete set of 37 genes which are usually found in metazoans. However, the BPH mitogenome also possesses two additional copies of the trnC gene. In both mitochondrial genomes, the lengths of the atp8 gene were conspicuously shorter than that of all other known insect mitochondrial genomes (99 bp for BPH, 102 bp for SBPH). That two rearrangement regions (trnC-trnW and nad6-trnP-trnT) of mitochondrial genomes differing from other known insect were found in these two distantly related planthoppers revealed that the gene order of mitochondria might be conservative in Delphacidae. The large non-coding fragment (the A+T-rich region) putatively corresponding responsible for the control of replication and transcription of mitochondria contained a variable number of tandem repeats (VNTRs) block in different natural individuals of these two planthoppers. Comparison with a previously sequenced individual of SBPH revealed that the mitochondrial genetic variation within a species exists not only in the sequence and secondary structure of genes, but also in the gene order (the different location of trnH gene). Conclusion The mitochondrial genome arrangement pattern found in planthoppers was involved in rearrangements of both tRNA genes and protein-coding genes (PCGs). Different species from different genera of Delphacidae possessing the same mitochondrial gene rearrangement suggests that gene rearrangements of mitochondrial genome probably occurred before the differentiation of this family. After comparatively analyzing the gene order of different species of Hemiptera, we propose that except for some specific taxonomical group (e.g. the whiteflies) the gene order might have diversified in family level of this order. The VNTRs detected in the control region might provide additional genetic markers for studying population genetics, individual difference and phylogeographics of planthoppers. PMID:23799924
DLGP: A database for lineage-conserved and lineage-specific gene pairs in animal and plant genomes.

PubMed

Wang, Dapeng

2016-01-15

The conservation of gene organization in the genome with lineage-specificity is an invaluable resource to decipher their potential functionality with diverse selective constraints, especially in higher animals and plants. Gene pairs appear to be the minimal structure for such kind of gene clusters that tend to reside in their preferred locations, representing the distinctive genomic characteristics in single species or a given lineage. Despite gene families having been investigated in a widespread manner, the definition of gene pair families in various taxa still lacks adequate attention. To address this issue, we report DLGP (http://lcgbase.big.ac.cn/DLGP/) that stores the pre-calculated lineage-based gene pairs in currently available 134 animal and plant genomes and inspect them under the same analytical framework, bringing out a set of innovational features. First, the taxonomy or lineage has been classified into four levels such as Kingdom, Phylum, Class and Order. It adopts all-to-all comparison strategy to identify the possible conserved gene pairs in all species for each gene pair in certain species and reckon those that are conserved in over a significant proportion of species in a given lineage (e.g. Primates, Diptera or Poales) as the lineage-conserved gene pairs. Furthermore, it predicts the lineage-specific gene pairs by retaining the above-mentioned lineage-conserved gene pairs that are not conserved in any other lineages. Second, it carries out pairwise comparison for the gene pairs between two compared species and creates the table including all the conserved gene pairs and the image elucidating the conservation degree of gene pairs in chromosomal level. Third, it supplies gene order browser to extend gene pairs to gene clusters, allowing users to view the evolution dynamics in the gene context in an intuitive manner. This database will be able to facilitate the particular comparison between animals and plants, between vertebrates and arthropods, and between monocots and eudicots, accounting for the significant contribution of gene pairs to speciation and diversification in specific lineages. Copyright © 2015 Elsevier Inc. All rights reserved.
Prevalence and Prognostic Impact of Wilms' Tumor 1 (WT1) Gene, Including SNP rs16754 in Cytogenetically Normal Acute Myeloblastic Leukemia (CN-AML): An Iranian Experience.

PubMed

Toogeh, Gholamreza; Ramzi, Mani; Faranoush, Mohammad; Amirizadeh, Naser; Haghpanah, Sezaneh; Moghadam, Mohammad; Cohan, Nader

2016-03-01

The aim of this study was to evaluate the effect of Wilms' tumor 1 (WT1) gene mutations in adult cytogenetically normal acute myeloblastic leukemia (CN-AML) patients on survival and clinical outcome. A total of 88 untreated Iranian adult patients with CN-AML were selected as a study group. Exons 7 (including the SNP rs16754), 8, and 9 as a WT1 gene hotspot region were evaluated by polymerase chain reaction and direct sequencing for detection of mutations. Response to treatment and clinical outcome including overall survival (OS) and disease-free survival (DFS) were evaluated according to WT1 gene mutational status. WT1 gene mutations were found in 12.5% of patients, most of which were found in exon 7. Complete remission was lower and relapse was higher in patients with WT1 gene mutation compared with WT1 gene wild type patients. OS and DFS was significantly lower in patients with WT1 gene mutation compared with patients with WT1 gene wild type (P < .001). Also, we did not find any significant effects of SNP rs16754 in exon 7 on clinical outcome and survival in patients with CN-AML. WT1 gene mutations are a predictor indicator of a poor prognosis factor in CN-AML patients. It is recommended that WT1 gene mutations be included in the molecular testing panel in order to better diagnose and confirm their prognostic significance for better management and treatment strategy. Copyright © 2016 Elsevier Inc. All rights reserved.
Origins of De Novo Genes in Human and Chimpanzee.

PubMed

Ruiz-Orera, Jorge; Hernandez-Rodriguez, Jessica; Chiva, Cristina; Sabidó, Eduard; Kondova, Ivanela; Bontrop, Ronald; Marqués-Bonet, Tomàs; Albà, M Mar

2015-12-01

The birth of new genes is an important motor of evolutionary innovation. Whereas many new genes arise by gene duplication, others originate at genomic regions that did not contain any genes or gene copies. Some of these newly expressed genes may acquire coding or non-coding functions and be preserved by natural selection. However, it is yet unclear which is the prevalence and underlying mechanisms of de novo gene emergence. In order to obtain a comprehensive view of this process, we have performed in-depth sequencing of the transcriptomes of four mammalian species--human, chimpanzee, macaque, and mouse--and subsequently compared the assembled transcripts and the corresponding syntenic genomic regions. This has resulted in the identification of over five thousand new multiexonic transcriptional events in human and/or chimpanzee that are not observed in the rest of species. Using comparative genomics, we show that the expression of these transcripts is associated with the gain of regulatory motifs upstream of the transcription start site (TSS) and of U1 snRNP sites downstream of the TSS. In general, these transcripts show little evidence of purifying selection, suggesting that many of them are not functional. However, we find signatures of selection in a subset of de novo genes which have evidence of protein translation. Taken together, the data support a model in which frequently-occurring new transcriptional events in the genome provide the raw material for the evolution of new proteins.
Origins of De Novo Genes in Human and Chimpanzee

PubMed Central

Ruiz-Orera, Jorge; Hernandez-Rodriguez, Jessica; Chiva, Cristina; Sabidó, Eduard; Kondova, Ivanela; Bontrop, Ronald; Marqués-Bonet, Tomàs; Albà, M.Mar

2015-01-01

The birth of new genes is an important motor of evolutionary innovation. Whereas many new genes arise by gene duplication, others originate at genomic regions that did not contain any genes or gene copies. Some of these newly expressed genes may acquire coding or non-coding functions and be preserved by natural selection. However, it is yet unclear which is the prevalence and underlying mechanisms of de novo gene emergence. In order to obtain a comprehensive view of this process, we have performed in-depth sequencing of the transcriptomes of four mammalian species—human, chimpanzee, macaque, and mouse—and subsequently compared the assembled transcripts and the corresponding syntenic genomic regions. This has resulted in the identification of over five thousand new multiexonic transcriptional events in human and/or chimpanzee that are not observed in the rest of species. Using comparative genomics, we show that the expression of these transcripts is associated with the gain of regulatory motifs upstream of the transcription start site (TSS) and of U1 snRNP sites downstream of the TSS. In general, these transcripts show little evidence of purifying selection, suggesting that many of them are not functional. However, we find signatures of selection in a subset of de novo genes which have evidence of protein translation. Taken together, the data support a model in which frequently-occurring new transcriptional events in the genome provide the raw material for the evolution of new proteins. PMID:26720152
Comparative Mitogenomic Analyses of Praying Mantises (Dictyoptera, Mantodea): Origin and Evolution of Unusual Intergenic Gaps

PubMed Central

Zhang, Hong-Li; Ye, Fei

2017-01-01

Praying mantises are a diverse group of predatory insects. Although some Mantodea mitogenomes have been reported, a comprehensive comparative and evolutionary genomic study is lacking for this group. In the present study, four new mitogenomes were sequenced, annotated, and compared to the previously published mitogenomes of other Mantodea species. Most Mantodea mitogenomes share a typical set of mitochondrial genes and a putative control region (CR). Additionally, and most intriguingly, another large non-coding region (LNC) was detected between trnM and ND2 in all six Paramantini mitogenomes examined. The main section in this common region of Paramantini may have initially originated from the corresponding control region for each species, whereas sequence differences between the LNCs and CRs and phylogenetic analyses indicate that LNC and CR are largely independently evolving. Namely, the LNC (the duplicated CR) may have subsequently degenerated during evolution. Furthermore, evidence suggests that special intergenic gaps have been introduced in some species through gene rearrangement and duplication. These gaps are actually the original abutting sequences of migrated or duplicated genes. Some gaps (G5 and G6) are homologous to the 5' and 3' surrounding regions of the duplicated gene in the original gene order, and another specific gap (G7) has tandem repeats. We analysed the phylogenetic relationships of fifteen Mantodea species using 37 concatenated mitochondrial genes and detected several synapomorphies unique to species in some clades. PMID:28367101
Order or chaos in Boolean gene networks depends on the mean fraction of canalizing functions

NASA Astrophysics Data System (ADS)

Karlsson, Fredrik; Hörnquist, Michael

2007-10-01

We explore the connection between order/chaos in Boolean networks and the naturally occurring fraction of canalizing functions in such systems. This fraction turns out to give a very clear indication of whether the system possesses ordered or chaotic dynamics, as measured by Derrida plots, and also the degree of order when we compare different networks with the same number of vertices and edges. By studying also a wide distribution of indegrees in a network, we show that the mean probability of canalizing functions is a more reliable indicator of the type of dynamics for a finite network than the classical result on stability relating the bias to the mean indegree. Finally, we compare by direct simulations two biologically derived networks with networks of similar sizes but with power-law and Poisson distributions of indegrees, respectively. The biologically motivated networks are not more ordered than the latter, and in one case the biological network is even chaotic while the others are not.
Biased expression, under the control of single promoter, of human interferon α-2b and Escherichia coli methionine amino peptidase genes in E. coli, irrespective of their distance from the promoter.

PubMed

Arif, Amina; Rashid, Naeem; Aslam, Farheen; Mahmood, Nasir; Akhtar, Muhammad

2016-03-01

Human interferon α-2b and Escherichia coli methionine amino peptidase genes were cloned independently as well as bicistronically in expression plasmid pET-21a (+). Production of human interferon α-2b was comparable to that of E. coli methionine amino peptidase when these genes were expressed independently in E. coli BL21-CodonPlus (DE3)-RIL. However, human interferon α-2b was produced in a much less amount whereas there was no difference in the production of methionine amino peptidase when the encoding genes were expressed bicistronically. It is important to note that human interferon α-2b was the first gene in order, after the promoter and E. coli methionine amino peptidase was the next with a linker sequence of 27 nucleotides between them.
Bayesian median regression for temporal gene expression data

NASA Astrophysics Data System (ADS)

Yu, Keming; Vinciotti, Veronica; Liu, Xiaohui; 't Hoen, Peter A. C.

2007-09-01

Most of the existing methods for the identification of biologically interesting genes in a temporal expression profiling dataset do not fully exploit the temporal ordering in the dataset and are based on normality assumptions for the gene expression. In this paper, we introduce a Bayesian median regression model to detect genes whose temporal profile is significantly different across a number of biological conditions. The regression model is defined by a polynomial function where both time and condition effects as well as interactions between the two are included. MCMC-based inference returns the posterior distribution of the polynomial coefficients. From this a simple Bayes factor test is proposed to test for significance. The estimation of the median rather than the mean, and within a Bayesian framework, increases the robustness of the method compared to a Hotelling T2-test previously suggested. This is shown on simulated data and on muscular dystrophy gene expression data.
Effect of castration on carcass quality and differential gene expression of longissimus muscle between steer and bull.

PubMed

Zhou, Zheng-Kui; Gao, Xue; Li, Jun-Ya; Chen, Jin-Bao; Xu, Shang-Zhong

2011-11-01

The effect of castration on carcass quality was investigated by ten Chinese Simmental calves. Five calves were castrated randomly at 2 months old and the others were retained as normal intact bulls. All animals were slaughtered at 22 months old. The results showed that bulls carcass had higher weight (P < 0.05), dressing percentages and bigger longissimus muscle areas (P < 0.05) than steers. But steer meat had lower shear force values and was fatter (P < 0.05) than bull. Furthermore, in order to discover genes that were involved in determining steer meat quality, we compared related candidate gene expression in longissimus muscle between steer (tester) and bull (driver) using suppressive subtractive hybridization. Ten genes were identified as preferentially expressed in longissimus muscle of steer. The expression of four selected differentially expressed genes was confirmed by quantitative real-time PCR. Overall, a 1.96, 2.41, 2.89, 2.41-fold increase in expression level was observed in steer compared with bull for actin, gamma 2, smooth muscle, tropomyosin-2, insulin like growth factor 1 and hormone-sensitive lipase, respectively. These results implied that these differentially expressed genes could play an important role in the regulation of steer meat quality.
Octocoral Mitochondrial Genomes Provide Insights into the Phylogenetic History of Gene Order Rearrangements, Order Reversals, and Cnidarian Phylogenetics

PubMed Central

Figueroa, Diego F.; Baco, Amy R.

2015-01-01

We use full mitochondrial genomes to test the robustness of the phylogeny of the Octocorallia, to determine the evolutionary pathway for the five known mitochondrial gene rearrangements in octocorals, and to test the suitability of using mitochondrial genomes for higher taxonomic-level phylogenetic reconstructions. Our phylogeny supports three major divisions within the Octocorallia and show that Paragorgiidae is paraphyletic, with Sibogagorgia forming a sister branch to the Coralliidae. Furthermore, Sibogagorgia cauliflora has what is presumed to be the ancestral gene order in octocorals, but the presence of a pair of inverted repeat sequences suggest that this gene order was not conserved but rather evolved back to this apparent ancestral state. Based on this we recommend the resurrection of the family Sibogagorgiidae to fix the paraphyly of the Paragorgiidae. This is the first study to show that in the Octocorallia, mitochondrial gene orders have evolved back to an ancestral state after going through a gene rearrangement, with at least one of the gene orders evolving independently in different lineages. A number of studies have used gene boundaries to determine the type of mitochondrial gene arrangement present. However, our findings suggest that this method known as gene junction screening may miss evolutionary reversals. Additionally, substitution saturation analysis demonstrates that while whole mitochondrial genomes can be used effectively for phylogenetic analyses within Octocorallia, their utility at higher taxonomic levels within Cnidaria is inadequate. Therefore for phylogenetic reconstruction at taxonomic levels higher than subclass within the Cnidaria, nuclear genes will be required, even when whole mitochondrial genomes are available. PMID:25539723
Expression analysis of some genes regulated by retinoic acid in controls and triadimefon-exposed embryos: is the amphibian Xenopus laevis a suitable model for gene-based comparative teratology?

PubMed

Di Renzo, Francesca; Rossi, Federica; Bacchetta, Renato; Prati, Mariangela; Giavini, Erminio; Menegola, Elena

2011-06-01

The use of nonmammal models in teratological studies is a matter of debate and seems to be justified if the embryotoxic mechanism involves conserved processes. Published data on mammals and Xenopus laevis suggest that azoles are teratogenic by altering the endogenous concentration of retinoic acid (RA). The expression of some genes (Shh, Ptch-1, Gsc, and Msx2) controlled by retinoic acid is downregulated in rat embryos exposed at the phylotypic stage to the triazole triadimefon (FON). In order to propose X. laevis as a model for gene-based comparative teratology, this work evaluates the expression of Shh, Ptch-1, Gsc, and Msx2 in FON-exposed X. laevis embryos. Embryos, exposed to a high concentration level (500 µM) of FON from stage 13 till 17, were examined at stages 17, 27, and 47. Stage 17 and 27 embryos were processed to perform quantitative RT-PCR. The developmental rate was never affected by FON at any considered stage. FON-exposed stage 47 larvae showed the typical craniofacial malformations. A significant downregulation of Gsc was observed in FON-exposed stage 17 embryos. Shh, Ptch-1, Msx2 showed a high fluctuation of expression both in control and in FON-exposed samples both at stages 17 and 27. The downregulation of Gsc mimics the effects of FON on rat embryos, showing for this gene a common effect of FON in the two vertebrate classes. The high fluctuation observed in the gene expression of the other genes, however, suggests that X. laevis at this stage has limited utility for gene-based comparative teratology. © 2011 Wiley-Liss, Inc.
Selection of reliable reference genes for gene expression studies in Trichoderma afroharzianum LTR-2 under oxalic acid stress.

PubMed

Lyu, Yuping; Wu, Xiaoqing; Ren, He; Zhou, Fangyuan; Zhou, Hongzi; Zhang, Xinjian; Yang, Hetong

2017-10-01

An appropriate reference gene is required to get reliable results from gene expression analysis by quantitative real-time reverse transcription PCR (qRT-PCR). In order to identify stable and reliable reference genes in Trichoderma afroharzianum under oxalic acid (OA) stress, six commonly used housekeeping genes, i.e., elongation factor 1, ubiquitin, ubiquitin-conjugating enzyme, glyceraldehyde-3-phosphate dehydrogenase, α-tubulin, actin, from the effective biocontrol isolate T. afroharzianum strain LTR-2 were tested for their expression during growth in liquid culture amended with OA. Four in silico programs (comparative ΔCt, NormFinder, geNorm and BestKeeper) were used to evaluate the expression stabilities of six candidate reference genes. The elongation factor 1 gene EF-1 was identified as the most stably expressed reference gene, and was used as the normalizer to quantify the expression level of the oxalate decarboxylase coding gene OXDC in T. afroharzianum strain LTR-2 under OA stress. The result showed that the expression of OXDC was significantly up-regulated as expected. This study provides an effective method to quantify expression changes of target genes in T. afroharzianum under OA stress. Copyright © 2017 Elsevier B.V. All rights reserved.
The complete chloroplast DNA sequences of the charophycean green algae Staurastrum and Zygnema reveal that the chloroplast genome underwent extensive changes during the evolution of the Zygnematales

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2005-01-01

Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178

The physical map of wheat chromosome 1BS provides insights into its gene space organization and evolution

PubMed Central

2013-01-01

Background The wheat genome sequence is an essential tool for advanced genomic research and improvements. The generation of a high-quality wheat genome sequence is challenging due to its complex 17 Gb polyploid genome. To overcome these difficulties, sequencing through the construction of BAC-based physical maps of individual chromosomes is employed by the wheat genomics community. Here, we present the construction of the first comprehensive physical map of chromosome 1BS, and illustrate its unique gene space organization and evolution. Results Fingerprinted BAC clones were assembled into 57 long scaffolds, anchored and ordered with 2,438 markers, covering 83% of chromosome 1BS. The BAC-based chromosome 1BS physical map and gene order of the orthologous regions of model grass species were consistent, providing strong support for the reliability of the chromosome 1BS assembly. The gene space for chromosome 1BS spans the entire length of the chromosome arm, with 76% of the genes organized in small gene islands, accompanied by a two-fold increase in gene density from the centromere to the telomere. Conclusions This study provides new evidence on common and chromosome-specific features in the organization and evolution of the wheat genome, including a non-uniform distribution of gene density along the centromere-telomere axis, abundance of non-syntenic genes, the degree of colinearity with other grass genomes and a non-uniform size expansion along the centromere-telomere axis compared with other model cereal genomes. The high-quality physical map constructed in this study provides a solid basis for the assembly of a reference sequence of chromosome 1BS and for breeding applications. PMID:24359668
Genomicus 2018: karyotype evolutionary trees and on-the-fly synteny computing.

PubMed

Nguyen, Nga Thi Thuy; Vincens, Pierre; Roest Crollius, Hugues; Louis, Alexandra

2018-01-04

Since 2010, the Genomicus web server is available online at http://genomicus.biologie.ens.fr/genomicus. This graphical browser provides access to comparative genomic analyses in four different phyla (Vertebrate, Plants, Fungi, and non vertebrate Metazoans). Users can analyse genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants, in an integrated evolutionary context. New analyses and visualization tools have recently been implemented in Genomicus Vertebrate. Karyotype structures from several genomes can now be compared along an evolutionary pathway (Multi-KaryotypeView), and synteny blocks can be computed and visualized between any two genomes (PhylDiagView). © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Why does the giant panda eat bamboo? A comparative analysis of appetite-reward-related genes among mammals.

PubMed

Jin, Ke; Xue, Chenyi; Wu, Xiaoli; Qian, Jinyi; Zhu, Yong; Yang, Zhen; Yonezawa, Takahiro; Crabbe, M James C; Cao, Ying; Hasegawa, Masami; Zhong, Yang; Zheng, Yufang

2011-01-01

The giant panda has an interesting bamboo diet unlike the other species in the order of Carnivora. The umami taste receptor gene T1R1 has been identified as a pseudogene during its genome sequencing project and confirmed using a different giant panda sample. The estimated mutation time for this gene is about 4.2 Myr. Such mutation coincided with the giant panda's dietary change and also reinforced its herbivorous life style. However, as this gene is preserved in herbivores such as cow and horse, we need to look for other reasons behind the giant panda's diet switch. Since taste is part of the reward properties of food related to its energy and nutrition contents, we did a systematic analysis on those genes involved in the appetite-reward system for the giant panda. We extracted the giant panda sequence information for those genes and compared with the human sequence first and then with seven other species including chimpanzee, mouse, rat, dog, cat, horse, and cow. Orthologs in panda were further analyzed based on the coding region, Kozak consensus sequence, and potential microRNA binding of those genes. Our results revealed an interesting dopamine metabolic involvement in the panda's food choice. This finding suggests a new direction for molecular evolution studies behind the panda's dietary switch.
Why Does the Giant Panda Eat Bamboo? A Comparative Analysis of Appetite-Reward-Related Genes among Mammals

PubMed Central

Jin, Ke; Xue, Chenyi; Wu, Xiaoli; Qian, Jinyi; Zhu, Yong; Yang, Zhen; Yonezawa, Takahiro; Crabbe, M. James C.; Cao, Ying; Hasegawa, Masami; Zhong, Yang; Zheng, Yufang

2011-01-01

Background The giant panda has an interesting bamboo diet unlike the other species in the order of Carnivora. The umami taste receptor gene T1R1 has been identified as a pseudogene during its genome sequencing project and confirmed using a different giant panda sample. The estimated mutation time for this gene is about 4.2 Myr. Such mutation coincided with the giant panda's dietary change and also reinforced its herbivorous life style. However, as this gene is preserved in herbivores such as cow and horse, we need to look for other reasons behind the giant panda's diet switch. Methodology/Principal Findings Since taste is part of the reward properties of food related to its energy and nutrition contents, we did a systematic analysis on those genes involved in the appetite-reward system for the giant panda. We extracted the giant panda sequence information for those genes and compared with the human sequence first and then with seven other species including chimpanzee, mouse, rat, dog, cat, horse, and cow. Orthologs in panda were further analyzed based on the coding region, Kozak consensus sequence, and potential microRNA binding of those genes. Conclusions/Significance Our results revealed an interesting dopamine metabolic involvement in the panda's food choice. This finding suggests a new direction for molecular evolution studies behind the panda's dietary switch. PMID:21818345
From Biophysics to Evolutionary Genetics: Statistical Aspects of Gene Regulation

NASA Astrophysics Data System (ADS)

Lässig, Michael

Genomic functions often cannot be understood at the level of single genes but require the study of gene networks. This systems biology credo is nearly commonplace by now. Evidence comes from the comparative analysis of entire genomes: current estimates put, for example, the number of human genes at around 22,000, hardly more than the 14,000 of the fruit fly, and not even an order of magnitude higher than the 6,000 of baker's yeast. The complexity and diversity of higher animals, therefore, cannot be explained in terms of their gene numbers. If, however, a biological function requires the concerted action of several genes, and conversely, a gene takes part in several functional contexts, an organism may be defined less by its individual genes but by their interactions. The emerging picture of the genome as a strongly interacting system with many degrees of freedom brings new challenges for experiment and theory, many of which are of a statistical nature. And indeed, this picture continues to make the subject attractive to a growing number of statistical physicists.
Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization.

PubMed

Jung, Sang-Kyu; McDonald, Karen

2011-08-16

Direct gene synthesis is becoming more popular owing to decreases in gene synthesis pricing. Compared with using natural genes, gene synthesis provides a good opportunity to optimize gene sequence for specific applications. In order to facilitate gene optimization, we have developed a stand-alone software called Visual Gene Developer. The software not only provides general functions for gene analysis and optimization along with an interactive user-friendly interface, but also includes unique features such as programming capability, dedicated mRNA secondary structure prediction, artificial neural network modeling, network & multi-threaded computing, and user-accessible programming modules. The software allows a user to analyze and optimize a sequence using main menu functions or specialized module windows. Alternatively, gene optimization can be initiated by designing a gene construct and configuring an optimization strategy. A user can choose several predefined or user-defined algorithms to design a complicated strategy. The software provides expandable functionality as platform software supporting module development using popular script languages such as VBScript and JScript in the software programming environment. Visual Gene Developer is useful for both researchers who want to quickly analyze and optimize genes, and those who are interested in developing and testing new algorithms in bioinformatics. The software is available for free download at http://www.visualgenedeveloper.net.
Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization

PubMed Central

2011-01-01

Background Direct gene synthesis is becoming more popular owing to decreases in gene synthesis pricing. Compared with using natural genes, gene synthesis provides a good opportunity to optimize gene sequence for specific applications. In order to facilitate gene optimization, we have developed a stand-alone software called Visual Gene Developer. Results The software not only provides general functions for gene analysis and optimization along with an interactive user-friendly interface, but also includes unique features such as programming capability, dedicated mRNA secondary structure prediction, artificial neural network modeling, network & multi-threaded computing, and user-accessible programming modules. The software allows a user to analyze and optimize a sequence using main menu functions or specialized module windows. Alternatively, gene optimization can be initiated by designing a gene construct and configuring an optimization strategy. A user can choose several predefined or user-defined algorithms to design a complicated strategy. The software provides expandable functionality as platform software supporting module development using popular script languages such as VBScript and JScript in the software programming environment. Conclusion Visual Gene Developer is useful for both researchers who want to quickly analyze and optimize genes, and those who are interested in developing and testing new algorithms in bioinformatics. The software is available for free download at http://www.visualgenedeveloper.net. PMID:21846353
Different Transcriptional Response to Xanthomonas citri subsp. citri between Kumquat and Sweet Orange with Contrasting Canker Tolerance

PubMed Central

Fu, Xing-Zheng; Gong, Xiao-Qing; Zhang, Yue-Xin; Wang, Yin; Liu, Ji-Hong

2012-01-01

Citrus canker disease caused by Xanthomonas citri subsp. citri (Xcc) is one of the most devastating biotic stresses affecting the citrus industry. Meiwa kumquat (Fortunella crassifolia) is canker-resistant, while Newhall navel orange (Citrus sinensis Osbeck) is canker-sensitive. To understand the molecular mechanisms underlying the differences in responses to Xcc, transcriptomic profiles of these two genotypes following Xcc attack were compared by using the Affymetrix citrus genome GeneChip. A total of 794 and 1324 differentially expressed genes (DEGs) were identified as canker-responsive genes in Meiwa and Newhall, respectively. Of these, 230 genes were expressed in common between both genotypes, while 564 and 1094 genes were only significantly expressed in either Meiwa or Newhall. Gene ontology (GO) annotation and Singular Enrichment Analysis (SEA) of the DEGs showed that genes related to the cell wall and polysaccharide metabolism were induced for basic defense in both Meiwa and Newhall, such as chitinase, glucanase and thaumatin-like protein. Moreover, apart from inducing basic defense, Meiwa showed specially upregulated expression of several genes involved in the response to biotic stimulus, defense response, and cation binding as comparing with Newhall. And in Newhall, abundant photosynthesis-related genes were significantly down-regulated, which may be in order to ensure the basic defense. This study revealed different molecular responses to canker disease in Meiwa and Newhall, affording insight into the response to canker and providing valuable information for the identification of potential genes for engineering canker tolerance in the future. PMID:22848606
Major Histocompatibility Complex Genes Map to Two Chromosomes in an Evolutionarily Ancient Reptile, the Tuatara Sphenodon punctatus

PubMed Central

Miller, Hilary C.; O’Meally, Denis; Ezaz, Tariq; Amemiya, Chris; Marshall-Graves, Jennifer A.; Edwards, Scott

2015-01-01

Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general. PMID:25953959
Effects of advanced treatment systems on the removal of antibiotic resistance genes in wastewater treatment plants from Hangzhou, China.

PubMed

Chen, Hong; Zhang, Mingmei

2013-08-06

This study aimed at quantifying the concentration and removal of antibiotic resistance genes (ARGs) in three municipal wastewater treatment plants (WWTPs) employing different advanced treatment systems [biological aerated filter, constructed wetland, and ultraviolet (UV) disinfection]. The concentrations of tetM, tetO, tetQ, tetW, sulI, sulII, intI1, and 16S rDNA genes were examined in wastewater and biosolid samples. In municipal WWTPs, ARG reductions of 1-3 orders of magnitude were observed, and no difference was found among the three municipal WWTPs with different treatment processes (p > 0.05). In advanced treatment systems, 1-3 orders of magnitude of reductions in ARGs were observed in constructed wetlands, 0.6-1.2 orders of magnitude of reductions in ARGs were observed in the biological aerated filter, but no apparent decrease by UV disinfection was observed. A significant difference was found between constructed wetlands and biological filter (p < 0.05) and between constructed wetlands and UV disinfection (p < 0.05). In the constructed wetlands, significant correlations were observed in the removal of ARGs and 16S rDNA genes (R(2) = 0.391-0.866; p < 0.05). Constructed wetlands not only have the comparable ARG removal values with WWTP (p > 0.05) but also have the advantage in ARG relative abundance removal, and it should be given priority to be an advanced treatment system for further ARG attenuation from WWTP.
Mitogenomes of two neotropical bird species and the multiple independent origin of mitochondrial gene orders in Passeriformes.

PubMed

Caparroz, Renato; Rocha, Amanda V; Cabanne, Gustavo S; Tubaro, Pablo; Aleixo, Alexandre; Lemmon, Emily M; Lemmon, Alan R

2018-06-01

At least four mitogenome arrangements occur in Passeriformes and differences among them are derived from an initial tandem duplication involving a segment containing the control region (CR), followed by loss or reduction of some parts of this segment. However, it is still unclear how often duplication events have occurred in this bird order. In this study, the mitogenomes from two species of Neotropical passerines (Sicalis olivascens and Lepidocolaptes angustirostris) with different gene arrangements were first determined. We also estimated how often duplication events occurred in Passeriformes and if the two CR copies demonstrate a pattern of concerted evolution in Sylvioidea. One tissue sample for each species was used to obtain the mitogenomes as a byproduct using next generation sequencing. The evolutionary history of mitogenome rearrangements was reconstructed mapping these characters onto a mitogenome Bayesian phylogenetic tree of Passeriformes. Finally, we performed a Bayesian analysis for both CRs from some Sylvioidea species in order to evaluate the evolutionary process involving these two copies. Both mitogenomes described comprise 2 rRNAs, 22 tRNAs, 13 protein-codon genes and the CR. However, S. olivascens has 16,768 bp showing the ancestral avian arrangement, while L. angustirostris has 16,973 bp and the remnant CR2 arrangement. Both species showed the expected gene order compared to their closest relatives. The ancestral state reconstruction suggesting at least six independent duplication events followed by partial deletions or loss of one copy in some lineages. Our results also provide evidence that both CRs in some Sylvioidea species seem to be maintained in an apparently functional state, perhaps by concerted evolution, and that this mechanism may be important for the evolution of the bird mitogenome.
Comparative chloroplast genomics and phylogenetics of Fagopyrum esculentum ssp. ancestrale – A wild ancestor of cultivated buckwheat

PubMed Central

Logacheva, Maria D; Samigullin, Tahir H; Dhingra, Amit; Penin, Aleksey A

2008-01-01

Background Chloroplast genome sequences are extremely informative about species-interrelationships owing to its non-meiotic and often uniparental inheritance over generations. The subject of our study, Fagopyrum esculentum, is a member of the family Polygonaceae belonging to the order Caryophyllales. An uncertainty remains regarding the affinity of Caryophyllales and the asterids that could be due to undersampling of the taxa. With that background, having access to the complete chloroplast genome sequence for Fagopyrum becomes quite pertinent. Results We report the complete chloroplast genome sequence of a wild ancestor of cultivated buckwheat, Fagopyrum esculentum ssp. ancestrale. The sequence was rapidly determined using a previously described approach that utilized a PCR-based method and employed universal primers, designed on the scaffold of multiple sequence alignment of chloroplast genomes. The gene content and order in buckwheat chloroplast genome is similar to Spinacia oleracea. However, some unique structural differences exist: the presence of an intron in the rpl2 gene, a frameshift mutation in the rpl23 gene and extension of the inverted repeat region to include the ycf1 gene. Phylogenetic analysis of 61 protein-coding gene sequences from 44 complete plastid genomes provided strong support for the sister relationships of Caryophyllales (including Polygonaceae) to asterids. Further, our analysis also provided support for Amborella as sister to all other angiosperms, but interestingly, in the bayesian phylogeny inference based on first two codon positions Amborella united with Nymphaeales. Conclusion Comparative genomics analyses revealed that the Fagopyrum chloroplast genome harbors the characteristic gene content and organization as has been described for several other chloroplast genomes. However, it has some unique structural features distinct from previously reported complete chloroplast genome sequences. Phylogenetic analysis of the dataset, including this new sequence from non-core Caryophyllales supports the sister relationship between Caryophyllales and asterids. PMID:18492277
Evidence of Molecular Adaptation to Extreme Environments and Applicability to Space Environments

NASA Astrophysics Data System (ADS)

Filipovic, M. D.; Ognjanovic, S.; Ognjanovic, M.

2008-06-01

This is initial investigation of gene signatures responsible for adapting microscopic life to the extreme Earth environments. We present preliminary results on identification of the clusters of orthologous groups (COGs) common to several hyperthermophiles and exclusion of those common to a mesophile (non-hyperthermophile): Escherichia coli (E. coli K12), will yield a group of proteins possibly involved in adaptation to life under extreme temperatures. Comparative genome analyses represent a powerful tool in discovery of novel genes responsible for adaptation to specific extreme environments. Methanogens stand out as the only group of organisms that have species capable of growth at 0° C (Metarhizium frigidum (M.~frigidum) and Methanococcoides burtonii (M.~burtonii)) and 110° C (Methanopyrus kandleri (M.~kandleri)). Although not all the components of heat adaptation can be attributed to novel genes, the chaperones known as heat shock proteins stabilize the enzymes under elevated temperature. However, highly conserved chaperons found in bacteria and eukaryots are not present in hyperthermophilic Archea, rather, they have a unique chaperone TF55. Our aim was to use software which we specifically developed for extremophile genome comparative analyses in order to search for additional novel genes involved in hyperthermophile adaptation. The following hyperthermophile genomes incorporated in this software were used for these studies: Methanocaldococcus jannaschii (M.~jannaschii), M.~kandleri, Archaeoglobus fulgidus (A.~fulgidus) and three species of Pyrococcus. Common genes were annotated and grouped according to their roles in cellular processes where such information was available and proteins not previously implicated in the heat-adaptation of hyperthermophiles were identified. Additional experimental data are needed in order to learn more about these proteins. To address non-gene based components of thermal adaptation, all sequenced extremophiles were analysed for their GC contents and aminoacid hydrophobicity. Finally, we develop a prediction model for optimal growth temperature.
The Sorghum bicolor genome and the diversification of grasses

DOE Office of Scientific and Technical Information (OSTI.GOV)

Paterson, Andrew H.; Bowers, John E.; Bruggmann, Remy

2008-08-20

Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approx730-megabase Sorghum bicolor (L.) Moench genome, placing approx98percent of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approx75percent larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidizationmore » approx70 million years ago, most duplicated gene sets lost one member before the sorghum rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24percent of genes are grass-specific and 7percent are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.« less
The Sorghum bicolor genome and the diversification of grasses.

PubMed

Paterson, Andrew H; Bowers, John E; Bruggmann, Rémy; Dubchak, Inna; Grimwood, Jane; Gundlach, Heidrun; Haberer, Georg; Hellsten, Uffe; Mitros, Therese; Poliakov, Alexander; Schmutz, Jeremy; Spannagl, Manuel; Tang, Haibao; Wang, Xiyin; Wicker, Thomas; Bharti, Arvind K; Chapman, Jarrod; Feltus, F Alex; Gowik, Udo; Grigoriev, Igor V; Lyons, Eric; Maher, Christopher A; Martis, Mihaela; Narechania, Apurva; Otillar, Robert P; Penning, Bryan W; Salamov, Asaf A; Wang, Yu; Zhang, Lifang; Carpita, Nicholas C; Freeling, Michael; Gingle, Alan R; Hash, C Thomas; Keller, Beat; Klein, Patricia; Kresovich, Stephen; McCann, Maureen C; Ming, Ray; Peterson, Daniel G; Mehboob-ur-Rahman; Ware, Doreen; Westhoff, Peter; Mayer, Klaus F X; Messing, Joachim; Rokhsar, Daniel S

2009-01-29

Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.
Biobibliometrics (UGDH-TP53-BRCA1) Genes Connections in the Possible Relationship Between Breast Cancer and EEG.

PubMed

Martzoukos, Yannis; Papavlasopoulos, Sozon; Poulos, Marios; Syrrou, Maria

2017-01-01

In recent years there has been an increasingly amount of data stored in biomedical Databases due to the breakthroughs in biology and bioinformatics, biomedical information is growing exponentially making efficient information retrieval from scientist more and more challenging. New Scientific fields as Bioinformatics seem to be the tool needed to extract scientifically important data based on experimental results and information provided by papers and journals. In this paper we are going to implement a custom made IT system in order to find connections between genes in the breast cancer pathways such the BRCA1 with the electrical energy in the human brain with UGDH gene via the TP53 tumor gene. The proposed system will be able to identify the appearance of each gene ID and compare the coexistence of two genes in PubMed articles/papers. The final system could become a useful tool against the struggle of scientists and medical professionals in the near future.
Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

PubMed

Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

2010-10-07

PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out to dissect the PHB gene function. The conserved gene evolution indicated that the study in the model species can be translated to human and mammalian studies.
Pangenome Analysis of Burkholderia pseudomallei: Genome Evolution Preserves Gene Order despite High Recombination Rates.

PubMed

Spring-Pearson, Senanu M; Stone, Joshua K; Doyle, Adina; Allender, Christopher J; Okinaka, Richard T; Mayo, Mark; Broomall, Stacey M; Hill, Jessica M; Karavis, Mark A; Hubbard, Kyle S; Insalaco, Joseph M; McNew, Lauren A; Rosenzweig, C Nicole; Gibbons, Henry S; Currie, Bart J; Wagner, David M; Keim, Paul; Tuanyok, Apichai

2015-01-01

The pangenomic diversity in Burkholderia pseudomallei is high, with approximately 5.8% of the genome consisting of genomic islands. Genomic islands are known hotspots for recombination driven primarily by site-specific recombination associated with tRNAs. However, recombination rates in other portions of the genome are also high, a feature we expected to disrupt gene order. We analyzed the pangenome of 37 isolates of B. pseudomallei and demonstrate that the pangenome is 'open', with approximately 136 new genes identified with each new genome sequenced, and that the global core genome consists of 4568±16 homologs. Genes associated with metabolism were statistically overrepresented in the core genome, and genes associated with mobile elements, disease, and motility were primarily associated with accessory portions of the pangenome. The frequency distribution of genes present in between 1 and 37 of the genomes analyzed matches well with a model of genome evolution in which 96% of the genome has very low recombination rates but 4% of the genome recombines readily. Using homologous genes among pairs of genomes, we found that gene order was highly conserved among strains, despite the high recombination rates previously observed. High rates of gene transfer and recombination are incompatible with retaining gene order unless these processes are either highly localized to specific sites within the genome, or are characterized by symmetrical gene gain and loss. Our results demonstrate that both processes occur: localized recombination introduces many new genes at relatively few sites, and recombination throughout the genome generates the novel multi-locus sequence types previously observed while preserving gene order.
The complete mitochondrial genome of the house dust mite Dermatophagoides pteronyssinus (Trouessart): a novel gene arrangement among arthropods

PubMed Central

Dermauw, Wannes; Van Leeuwen, Thomas; Vanholme, Bartel; Tirry, Luc

2009-01-01

Background The apparent scarcity of available sequence data has greatly impeded evolutionary studies in Acari (mites and ticks). This subclass encompasses over 48,000 species and forms the largest group within the Arachnida. Although mitochondrial genomes are widely utilised for phylogenetic and population genetic studies, only 20 mitochondrial genomes of Acari have been determined, of which only one belongs to the diverse order of the Sarcoptiformes. In this study, we describe the mitochondrial genome of the European house dust mite Dermatophagoides pteronyssinus, the most important member of this largely neglected group. Results The mitochondrial genome of D. pteronyssinus is a circular DNA molecule of 14,203 bp. It contains the complete set of 37 genes (13 protein coding genes, 2 rRNA genes and 22 tRNA genes), usually present in metazoan mitochondrial genomes. The mitochondrial gene order differs considerably from that of other Acari mitochondrial genomes. Compared to the mitochondrial genome of Limulus polyphemus, considered as the ancestral arthropod pattern, only 11 of the 38 gene boundaries are conserved. The majority strand has a 72.6% AT-content but a GC-skew of 0.194. This skew is the reverse of that normally observed for typical animal mitochondrial genomes. A microsatellite was detected in a large non-coding region (286 bp), which probably functions as the control region. Almost all tRNA genes lack a T-arm, provoking the formation of canonical cloverleaf tRNA-structures, and both rRNA genes are considerably reduced in size. Finally, the genomic sequence was used to perform a phylogenetic study. Both maximum likelihood and Bayesian inference analysis clustered D. pteronyssinus with Steganacarus magnus, forming a sistergroup of the Trombidiformes. Conclusion Although the mitochondrial genome of D. pteronyssinus shares different features with previously characterised Acari mitochondrial genomes, it is unique in many ways. Gene order is extremely rearranged and represents a new pattern within the Acari. Both tRNAs and rRNAs are truncated, corroborating the theory of the functional co-evolution of these molecules. Furthermore, the strong and reversed GC- and AT-skews suggest the inversion of the control region as an evolutionary event. Finally, phylogenetic analysis using concatenated mt gene sequences succeeded in recovering Acari relationships concordant with traditional views of phylogeny of Acari. PMID:19284646
Hybrid Binary Imperialist Competition Algorithm and Tabu Search Approach for Feature Selection Using Gene Expression Data.

PubMed

Wang, Shuaiqun; Aorigele; Kong, Wei; Zeng, Weiming; Hong, Xiaomin

2016-01-01

Gene expression data composed of thousands of genes play an important role in classification platforms and disease diagnosis. Hence, it is vital to select a small subset of salient features over a large number of gene expression data. Lately, many researchers devote themselves to feature selection using diverse computational intelligence methods. However, in the progress of selecting informative genes, many computational methods face difficulties in selecting small subsets for cancer classification due to the huge number of genes (high dimension) compared to the small number of samples, noisy genes, and irrelevant genes. In this paper, we propose a new hybrid algorithm HICATS incorporating imperialist competition algorithm (ICA) which performs global search and tabu search (TS) that conducts fine-tuned search. In order to verify the performance of the proposed algorithm HICATS, we have tested it on 10 well-known benchmark gene expression classification datasets with dimensions varying from 2308 to 12600. The performance of our proposed method proved to be superior to other related works including the conventional version of binary optimization algorithm in terms of classification accuracy and the number of selected genes.

Hybrid Binary Imperialist Competition Algorithm and Tabu Search Approach for Feature Selection Using Gene Expression Data

PubMed Central

Aorigele; Zeng, Weiming; Hong, Xiaomin

2016-01-01

Gene expression data composed of thousands of genes play an important role in classification platforms and disease diagnosis. Hence, it is vital to select a small subset of salient features over a large number of gene expression data. Lately, many researchers devote themselves to feature selection using diverse computational intelligence methods. However, in the progress of selecting informative genes, many computational methods face difficulties in selecting small subsets for cancer classification due to the huge number of genes (high dimension) compared to the small number of samples, noisy genes, and irrelevant genes. In this paper, we propose a new hybrid algorithm HICATS incorporating imperialist competition algorithm (ICA) which performs global search and tabu search (TS) that conducts fine-tuned search. In order to verify the performance of the proposed algorithm HICATS, we have tested it on 10 well-known benchmark gene expression classification datasets with dimensions varying from 2308 to 12600. The performance of our proposed method proved to be superior to other related works including the conventional version of binary optimization algorithm in terms of classification accuracy and the number of selected genes. PMID:27579323
A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research.

PubMed

Weidner, Christopher; Steinfath, Matthias; Wistorf, Elisa; Oelgeschläger, Michael; Schneider, Marlon R; Schönfelder, Gilbert

2017-08-16

Recent studies that compared transcriptomic datasets of human diseases with datasets from mouse models using traditional gene-to-gene comparison techniques resulted in contradictory conclusions regarding the relevance of animal models for translational research. A major reason for the discrepancies between different gene expression analyses is the arbitrary filtering of differentially expressed genes. Furthermore, the comparison of single genes between different species and platforms often is limited by technical variance, leading to misinterpretation of the con/discordance between data from human and animal models. Thus, standardized approaches for systematic data analysis are needed. To overcome subjective gene filtering and ineffective gene-to-gene comparisons, we recently demonstrated that gene set enrichment analysis (GSEA) has the potential to avoid these problems. Therefore, we developed a standardized protocol for the use of GSEA to distinguish between appropriate and inappropriate animal models for translational research. This protocol is not suitable to predict how to design new model systems a-priori, as it requires existing experimental omics data. However, the protocol describes how to interpret existing data in a standardized manner in order to select the most suitable animal model, thus avoiding unnecessary animal experiments and misleading translational studies.
[Effect of gene optimization on the expression and purification of HDV small antigen produced by genetic engineering].

PubMed

Ding, Jun-Ying; Meng, Qing-Ling; Guo, Min-Zhuo; Yi, Yao; Su, Qiu-Dong; Lu, Xue-Xin; Qiu, Feng; Bi, Sheng-Li

2012-10-01

To study the effect of gene optimization on the expression and purification of HDV small antigen produced by genetic engineering. Based on the colon preference of E. coli, the HDV small antigen original gene from GenBank was optimized. Both the original gene and the optimized gene expressed in prokaryotic cells, SDS-PAGE was made to analyze the protein expression yield and to decide which protein expression style was more proportion than the other. Furthermore, two antigens were purified by chromatography in order to compare the purity by SDS-PAGE and Image Lab software. SDS-PAGE indicated that the molecular weight of target proteins from two groups were the same as we expected. Gene optimization resulted in the higher yield and it could make the product more soluble. After chromatography, the purity of target protein from optimized gene was up to 96.3%, obviously purer than that from original gene. Gene optimization could increase the protein expression yield and solubility of genetic engineering HDV small antigen. In addition, the product from the optimized gene group was easier to be purified for diagnosis usage.
DNA sequence analysis of the photosynthesis region of Rhodobacter sphaeroides 2.4.1.

PubMed

Choudhary, M; Kaplan, S

2000-02-15

This paper describes the DNA sequence of the photosynthesis region of Rhodobacter sphaeroides 2.4.1 (T). The photosynthesis gene cluster is located within a approximately 73 kb Ase I genomic DNA fragment containing the puf, puhA, cycA and puc operons. A total of 65 open reading frames (ORFs) have been identified, of which 61 showed significant similarity to genes/proteins of other organisms while only four did not reveal any significant sequence similarity to any gene/protein sequences in the database. The data were compared with the corresponding genes/ORFs from a different strain of R.sphaeroides and Rhodobacter capsulatus, a close relative of R. sphaeroides. A detailed analysis of the gene organization in the photosynthesis region revealed a similar gene order in both species with some notable differences located to the pucBAC = cycA region. In addition, photosynthesis gene regulatory protein (PpsR, FNR, IHF) binding motifs in upstream sequences of a number of photosynthesis genes have been identified and shown to differ between these two species. The difference in gene organization relative to pucBAC and cycA suggests that this region originated independently of the photosynthesis gene cluster of R.sphaeroides.
[Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

PubMed

Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

2012-07-01

In order to screen out important genes from large gene data of gene microarray after nerve injury, we combine gene ontology (GO) method and computer pattern recognition technology to find key genes responding to nerve injury, and then verify one of these screened-out genes. Data mining and gene ontology analysis of gene chip data GSE26350 was carried out through MATLAB software. Cd44 was selected from screened-out key gene molecular spectrum by comparing genes' different GO terms and positions on score map of principal component. Function interferences were employed to influence the normal binding of Cd44 and one of its ligands, chondroitin sulfate C (CSC), to observe neurite extension. Gene ontology analysis showed that the first genes on score map (marked by red *) mainly distributed in molecular transducer activity, receptor activity, protein binding et al molecular function GO terms. Cd44 is one of six effector protein genes, and attracted us with its function diversity. After adding different reagents into the medium to interfere the normal binding of CSC and Cd44, varying-degree remissions of CSC's inhibition on neurite extension were observed. CSC can inhibit neurite extension through binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.
Integrative gene network construction to analyze cancer recurrence using semi-supervised learning.

PubMed

Park, Chihyun; Ahn, Jaegyoon; Kim, Hyunjin; Park, Sanghyun

2014-01-01

The prognosis of cancer recurrence is an important research area in bioinformatics and is challenging due to the small sample sizes compared to the vast number of genes. There have been several attempts to predict cancer recurrence. Most studies employed a supervised approach, which uses only a few labeled samples. Semi-supervised learning can be a great alternative to solve this problem. There have been few attempts based on manifold assumptions to reveal the detailed roles of identified cancer genes in recurrence. In order to predict cancer recurrence, we proposed a novel semi-supervised learning algorithm based on a graph regularization approach. We transformed the gene expression data into a graph structure for semi-supervised learning and integrated protein interaction data with the gene expression data to select functionally-related gene pairs. Then, we predicted the recurrence of cancer by applying a regularization approach to the constructed graph containing both labeled and unlabeled nodes. The average improvement rate of accuracy for three different cancer datasets was 24.9% compared to existing supervised and semi-supervised methods. We performed functional enrichment on the gene networks used for learning. We identified that those gene networks are significantly associated with cancer-recurrence-related biological functions. Our algorithm was developed with standard C++ and is available in Linux and MS Windows formats in the STL library. The executable program is freely available at: http://embio.yonsei.ac.kr/~Park/ssl.php.
Octocoral mitochondrial genomes provide insights into the phylogenetic history of gene order rearrangements, order reversals, and cnidarian phylogenetics.

PubMed

Figueroa, Diego F; Baco, Amy R

2014-12-24

We use full mitochondrial genomes to test the robustness of the phylogeny of the Octocorallia, to determine the evolutionary pathway for the five known mitochondrial gene rearrangements in octocorals, and to test the suitability of using mitochondrial genomes for higher taxonomic-level phylogenetic reconstructions. Our phylogeny supports three major divisions within the Octocorallia and show that Paragorgiidae is paraphyletic, with Sibogagorgia forming a sister branch to the Coralliidae. Furthermore, Sibogagorgia cauliflora has what is presumed to be the ancestral gene order in octocorals, but the presence of a pair of inverted repeat sequences suggest that this gene order was not conserved but rather evolved back to this apparent ancestral state. Based on this we recommend the resurrection of the family Sibogagorgiidae to fix the paraphyly of the Paragorgiidae. This is the first study to show that in the Octocorallia, mitochondrial gene orders have evolved back to an ancestral state after going through a gene rearrangement, with at least one of the gene orders evolving independently in different lineages. A number of studies have used gene boundaries to determine the type of mitochondrial gene arrangement present. However, our findings suggest that this method known as gene junction screening may miss evolutionary reversals. Additionally, substitution saturation analysis demonstrates that while whole mitochondrial genomes can be used effectively for phylogenetic analyses within Octocorallia, their utility at higher taxonomic levels within Cnidaria is inadequate. Therefore for phylogenetic reconstruction at taxonomic levels higher than subclass within the Cnidaria, nuclear genes will be required, even when whole mitochondrial genomes are available. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The complete mitochondrial genome of eastern lowland gorilla, Gorilla beringei graueri, and comparative mitochondrial genomics of Gorilla species.

PubMed

Hu, Xiao-di; Gao, Li-zhi

2016-01-01

In this study, we determined the complete mitochondrial (mt) genome of eastern lowland gorilla, Gorilla beringei graueri for the first time. The total genome was 16,416 bp in length. It contained a total of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 1 control region (D-loop region). The base composition was A (30.88%), G (13.10%), C (30.89%) and T (25.13%), indicating that the percentage of A+T (56.01%) was higher than G+C (43.99%). Comparisons with the other publicly available Gorilla mitogenome showed the conservation of gene order and base compositions but a bunch of nucleotide diversity. This complete mitochondrial genome sequence will provide valuable genetic information for further studies on conservation genetics of eastern lowland gorilla.
The complete chloroplast genome sequence of Aconitum coreanum and Aconitum carmichaelii and comparative analysis with other Aconitum species

PubMed Central

Park, Inkyu; Kim, Wook-jin; Yang, Sungyu; Yeo, Sang-Min; Li, Hulin

2017-01-01

Aconitum species (belonging to the Ranunculaceae) are well known herbaceous medicinal ingredients and have great economic value in Asian countries. However, there are still limited genomic resources available for Aconitum species. In this study, we sequenced the chloroplast (cp) genomes of two Aconitum species, A. coreanum and A. carmichaelii, using the MiSeq platform. The two Aconitum chloroplast genomes were 155,880 and 157,040 bp in length, respectively, and exhibited LSC and SSC regions separated by a pair of inverted repeat regions. Both cp genomes had 38% GC content and contained 131 unique functional genes including 86 protein-coding genes, eight ribosomal RNA genes, and 37 transfer RNA genes. The gene order, content, and orientation of the two Aconitum cp genomes exhibited the general structure of angiosperms, and were similar to those of other Aconitum species. Comparison of the cp genome structure and gene order with that of other Aconitum species revealed general contraction and expansion of the inverted repeat regions and single copy boundary regions. Divergent regions were also identified. In phylogenetic analysis, Aconitum species positon among the Ranunculaceae was determined with other family cp genomes in the Ranunculales. We obtained a barcoding target sequence in a divergent region, ndhC–trnV, and successfully developed a SCAR (sequence characterized amplified region) marker for discrimination of A. coreanum. Our results provide useful genetic information and a specific barcode for discrimination of Aconitum species. PMID:28863163
The complete chloroplast genome sequence of Aconitum coreanum and Aconitum carmichaelii and comparative analysis with other Aconitum species.

PubMed

Park, Inkyu; Kim, Wook-Jin; Yang, Sungyu; Yeo, Sang-Min; Li, Hulin; Moon, Byeong Cheol

2017-01-01

Aconitum species (belonging to the Ranunculaceae) are well known herbaceous medicinal ingredients and have great economic value in Asian countries. However, there are still limited genomic resources available for Aconitum species. In this study, we sequenced the chloroplast (cp) genomes of two Aconitum species, A. coreanum and A. carmichaelii, using the MiSeq platform. The two Aconitum chloroplast genomes were 155,880 and 157,040 bp in length, respectively, and exhibited LSC and SSC regions separated by a pair of inverted repeat regions. Both cp genomes had 38% GC content and contained 131 unique functional genes including 86 protein-coding genes, eight ribosomal RNA genes, and 37 transfer RNA genes. The gene order, content, and orientation of the two Aconitum cp genomes exhibited the general structure of angiosperms, and were similar to those of other Aconitum species. Comparison of the cp genome structure and gene order with that of other Aconitum species revealed general contraction and expansion of the inverted repeat regions and single copy boundary regions. Divergent regions were also identified. In phylogenetic analysis, Aconitum species positon among the Ranunculaceae was determined with other family cp genomes in the Ranunculales. We obtained a barcoding target sequence in a divergent region, ndhC-trnV, and successfully developed a SCAR (sequence characterized amplified region) marker for discrimination of A. coreanum. Our results provide useful genetic information and a specific barcode for discrimination of Aconitum species.
[Identification of new conserved and variable regions in the 16S rRNA gene of acetic acid bacteria and acetobacteraceae family].

PubMed

Chakravorty, S; Sarkar, S; Gachhui, R

2015-01-01

The Acetobacteraceae family of the class Alpha Proteobacteria is comprised of high sugar and acid tolerant bacteria. The Acetic Acid Bacteria are the economically most significant group of this family because of its association with food products like vinegar, wine etc. Acetobacteraceae are often hard to culture in laboratory conditions and they also maintain very low abundances in their natural habitats. Thus identification of the organisms in such environments is greatly dependent on modern tools of molecular biology which require a thorough knowledge of specific conserved gene sequences that may act as primers and or probes. Moreover unconserved domains in genes also become markers for differentiating closely related genera. In bacteria, the 16S rRNA gene is an ideal candidate for such conserved and variable domains. In order to study the conserved and variable domains of the 16S rRNA gene of Acetic Acid Bacteria and the Acetobacteraceae family, sequences from publicly available databases were aligned and compared. Near complete sequences of the gene were also obtained from Kombucha tea biofilm, a known Acetobacteraceae family habitat, in order to corroborate the domains obtained from the alignment studies. The study indicated that the degree of conservation in the gene is significantly higher among the Acetic Acid Bacteria than the whole Acetobacteraceae family. Moreover it was also observed that the previously described hypervariable regions V1, V3, V5, V6 and V7 were more or less conserved in the family and the spans of the variable regions are quite distinct as well.
Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi

PubMed Central

Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabian; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; de Wit, Pierre J. G. M.; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

2012-01-01

The class Dothideomycetes is one of the largest groups of fungi with a high level of ecological diversity including many plant pathogens infecting a broad range of hosts. Here, we compare genome features of 18 members of this class, including 6 necrotrophs, 9 (hemi)biotrophs and 3 saprotrophs, to analyze genome structure, evolution, and the diverse strategies of pathogenesis. The Dothideomycetes most likely evolved from a common ancestor more than 280 million years ago. The 18 genome sequences differ dramatically in size due to variation in repetitive content, but show much less variation in number of (core) genes. Gene order appears to have been rearranged mostly within chromosomal boundaries by multiple inversions, in extant genomes frequently demarcated by adjacent simple repeats. Several Dothideomycetes contain one or more gene-poor, transposable element (TE)-rich putatively dispensable chromosomes of unknown function. The 18 Dothideomycetes offer an extensive catalogue of genes involved in cellulose degradation, proteolysis, secondary metabolism, and cysteine-rich small secreted proteins. Ancestors of the two major orders of plant pathogens in the Dothideomycetes, the Capnodiales and Pleosporales, may have had different modes of pathogenesis, with the former having fewer of these genes than the latter. Many of these genes are enriched in proximity to transposable elements, suggesting faster evolution because of the effects of repeat induced point (RIP) mutations. A syntenic block of genes, including oxidoreductases, is conserved in most Dothideomycetes and upregulated during infection in L. maculans, suggesting a possible function in response to oxidative stress. PMID:23236275
Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard

The class Dothideomycetes is one of the largest groups of fungi with a high level of ecological diversity including many plant pathogens infecting a broad range of hosts. Here, we compare genome features of 18 members of this class, including 6 necrotrophs, 9 (hemi)biotrophs and 3 saprotrophs, to analyze genome structure, evolution, and the diverse strategies of pathogenesis. The Dothideomycetes most likely evolved from a common ancestor more than 280 million years ago. The 18 genome sequences differ dramatically in size due to variation in repetitive content, but show much less variation in number of (core) genes. Gene order appearsmore » to have been rearranged mostly within chromosomal boundaries by multiple inversions, in extant genomes frequently demarcated by adjacent simple repeats. Several Dothideomycetes contain one or more gene-poor, transposable element (TE)-rich putatively dispensable chromosomes of unknown function. The 18 Dothideomycetes offer an extensive catalogue of genes involved in cellulose degradation, proteolysis, secondary metabolism, and cysteine-rich small secreted proteins. Ancestors of the two major orders of plant pathogens in the Dothideomycetes, the Capnodiales and Pleosporales, may have had different modes of pathogenesis, with the former having fewer of these genes than the latter. Many of these genes are enriched in proximity to transposable elements, suggesting faster evolution because of the effects of repeat induced point (RIP) mutations. A syntenic block of genes, including oxidoreductases, is conserved in most Dothideomycetes and upregulated during infection in L. maculans, suggesting a possible function in response to oxidative stress.« less
Acclimation of microorganisms to harsh soil crust conditions: Experimental and genomic approaches

NASA Astrophysics Data System (ADS)

Raanan, Hagai; Kaplan, Aaron

2015-04-01

Biological soil crusts (BSC) are formed by the adhesion of sand particles to cyanobacterial exo- polysaccharides and play an important role in stabilizing sandy desert. Its destruction promotes desertification. These organisms cope with extreme temperatures, excess light and frequent hydration/dehydration cycles; the mechanisms involved are largely unknown. With the genome of newly sequenced Leptolyngbya, isolated from Nizzana BSC, we conduct comparative genomics of three desiccation tolerant cyanobacteria. This yield 46 unique genes, some of them similar to genes involve in sporulation of the gram positive bacteria Bacillus. In order to understand the molecular mechanisms taking place during desiccation we built an environmental chamber capable of simulating dynamic changes of environmental conditions in the crust. This chamber allows us to perform repetitive and accurate desiccation/rehydration experiments and follow cyanobacterial physiological and molecular response to such environmental changes. When we compared fast desiccation (less than 5 min) of isolated cyanobacteria to simulation of natural desiccation, we observed a 60% lower fluorescence recovery rate. The extent of damage from desiccation depended on the stress conditions during the dry period. These results suggest that cyanobacteria activated protection mechanisms in response to desiccation stress but which were not activated in 5 min desiccation tests. Gene expression patterns during desiccation are being analyzed in order to provide a better understanding of desiccation stress protection mechanisms.
Minimising Immunohistochemical False Negative ER Classification Using a Complementary 23 Gene Expression Signature of ER Status

PubMed Central

Li, Qiyuan; Eklund, Aron C.; Juul, Nicolai; Haibe-Kains, Benjamin; Workman, Christopher T.; Richardson, Andrea L.; Szallasi, Zoltan; Swanton, Charles

2010-01-01

Background Expression of the oestrogen receptor (ER) in breast cancer predicts benefit from endocrine therapy. Minimising the frequency of false negative ER status classification is essential to identify all patients with ER positive breast cancers who should be offered endocrine therapies in order to improve clinical outcome. In routine oncological practice ER status is determined by semi-quantitative methods such as immunohistochemistry (IHC) or other immunoassays in which the ER expression level is compared to an empirical threshold[1], [2]. The clinical relevance of gene expression-based ER subtypes as compared to IHC-based determination has not been systematically evaluated. Here we attempt to reduce the frequency of false negative ER status classification using two gene expression approaches and compare these methods to IHC based ER status in terms of predictive and prognostic concordance with clinical outcome. Methodology/Principal Findings Firstly, ER status was discriminated by fitting the bimodal expression of ESR1 to a mixed Gaussian model. The discriminative power of ESR1 suggested bimodal expression as an efficient way to stratify breast cancer; therefore we identified a set of genes whose expression was both strongly bimodal, mimicking ESR expression status, and highly expressed in breast epithelial cell lines, to derive a 23-gene ER expression signature-based classifier. We assessed our classifiers in seven published breast cancer cohorts by comparing the gene expression-based ER status to IHC-based ER status as a predictor of clinical outcome in both untreated and tamoxifen treated cohorts. In untreated breast cancer cohorts, the 23 gene signature-based ER status provided significantly improved prognostic power compared to IHC-based ER status (P = 0.006). In tamoxifen-treated cohorts, the 23 gene ER expression signature predicted clinical outcome (HR = 2.20, P = 0.00035). These complementary ER signature-based strategies estimated that between 15.1% and 21.8% patients of IHC-based negative ER status would be classified with ER positive breast cancer. Conclusion/Significance Expression-based ER status classification may complement IHC to minimise false negative ER status classification and optimise patient stratification for endocrine therapies. PMID:21152022
Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution

PubMed Central

Clarke, Thomas H.; Garb, Jessica E.; Hayashi, Cheryl Y.; Arensburger, Peter; Ayoub, Nadia A.

2015-01-01

The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). PMID:26058392
A comprehensive whole-genome integrated cytogenetic map for the alpaca (Lama pacos).

PubMed

Avila, Felipe; Baily, Malorie P; Perelman, Polina; Das, Pranab J; Pontius, Joan; Chowdhary, Renuka; Owens, Elaine; Johnson, Warren E; Merriwether, David A; Raudsepp, Terje

2014-01-01

Genome analysis of the alpaca (Lama pacos, LPA) has progressed slowly compared to other domestic species. Here, we report the development of the first comprehensive whole-genome integrated cytogenetic map for the alpaca using fluorescence in situ hybridization (FISH) and CHORI-246 BAC library clones. The map is comprised of 230 linearly ordered markers distributed among all 36 alpaca autosomes and the sex chromosomes. For the first time, markers were assigned to LPA14, 21, 22, 28, and 36. Additionally, 86 genes from 15 alpaca chromosomes were mapped in the dromedary camel (Camelus dromedarius, CDR), demonstrating exceptional synteny and linkage conservation between the 2 camelid genomes. Cytogenetic mapping of 191 protein-coding genes improved and refined the known Zoo-FISH homologies between camelids and humans: we discovered new homologous synteny blocks (HSBs) corresponding to HSA1-LPA/CDR11, HSA4-LPA/CDR31 and HSA7-LPA/CDR36, and revised the location of breakpoints for others. Overall, gene mapping was in good agreement with the Zoo-FISH and revealed remarkable evolutionary conservation of gene order within many human-camelid HSBs. Most importantly, 91 FISH-mapped markers effectively integrated the alpaca whole-genome sequence and the radiation hybrid maps with physical chromosomes, thus facilitating the improvement of the sequence assembly and the discovery of genes of biological importance. © 2015 S. Karger AG, Basel.
The Genome of Tolypocladium inflatum: Evolution, Organization, and Expression of the Cyclosporin Biosynthetic Gene Cluster

PubMed Central

Bushley, Kathryn E.; Raja, Rajani; Jaiswal, Pankaj; Cumbie, Jason S.; Nonogaki, Mariko; Boyd, Alexander E.; Owensby, C. Alisha; Knaus, Brian J.; Elser, Justin; Miller, Daniel; Di, Yanming; McPhail, Kerry L.; Spatafora, Joseph W.

2013-01-01

The ascomycete fungus Tolypocladium inflatum, a pathogen of beetle larvae, is best known as the producer of the immunosuppressant drug cyclosporin. The draft genome of T. inflatum strain NRRL 8044 (ATCC 34921), the isolate from which cyclosporin was first isolated, is presented along with comparative analyses of the biosynthesis of cyclosporin and other secondary metabolites in T. inflatum and related taxa. Phylogenomic analyses reveal previously undetected and complex patterns of homology between the nonribosomal peptide synthetase (NRPS) that encodes for cyclosporin synthetase (simA) and those of other secondary metabolites with activities against insects (e.g., beauvericin, destruxins, etc.), and demonstrate the roles of module duplication and gene fusion in diversification of NRPSs. The secondary metabolite gene cluster responsible for cyclosporin biosynthesis is described. In addition to genes necessary for cyclosporin biosynthesis, it harbors a gene for a cyclophilin, which is a member of a family of immunophilins known to bind cyclosporin. Comparative analyses support a lineage specific origin of the cyclosporin gene cluster rather than horizontal gene transfer from bacteria or other fungi. RNA-Seq transcriptome analyses in a cyclosporin-inducing medium delineate the boundaries of the cyclosporin cluster and reveal high levels of expression of the gene cluster cyclophilin. In medium containing insect hemolymph, weaker but significant upregulation of several genes within the cyclosporin cluster, including the highly expressed cyclophilin gene, was observed. T. inflatum also represents the first reference draft genome of Ophiocordycipitaceae, a third family of insect pathogenic fungi within the fungal order Hypocreales, and supports parallel and qualitatively distinct radiations of insect pathogens. The T. inflatum genome provides additional insight into the evolution and biosynthesis of cyclosporin and lays a foundation for further investigations of the role of secondary metabolite gene clusters and their metabolites in fungal biology. PMID:23818858
Comparative Chloroplast Genomes of Photosynthetic Orchids: Insights into Evolution of the Orchidaceae and Development of Molecular Markers for Phylogenetic Applications

PubMed Central

Niu, Zhi-Tao; Liu, Wei; Xue, Qing-Yun; Ding, Xiao-Yu

2014-01-01

The orchid family Orchidaceae is one of the largest angiosperm families, including many species of important economic value. While chloroplast genomes are very informative for systematics and species identification, there is very limited information available on chloroplast genomes in the Orchidaceae. Here, we report the complete chloroplast genomes of the medicinal plant Dendrobium officinale and the ornamental orchid Cypripedium macranthos, demonstrating their gene content and order and potential RNA editing sites. The chloroplast genomes of the above two species and five known photosynthetic orchids showed similarities in structure as well as gene order and content, but differences in the organization of the inverted repeat/small single-copy junction and ndh genes. The organization of the inverted repeat/small single-copy junctions in the chloroplast genomes of these orchids was classified into four types; we propose that inverted repeats flanking the small single-copy region underwent expansion or contraction among Orchidaceae. The AT-rich regions of the ycf1 gene in orchids could be linked to the recombination of inverted repeat/small single-copy junctions. Relative species in orchids displayed similar patterns of variation in ndh gene contents. Furthermore, fifteen highly divergent protein-coding genes were identified, which are useful for phylogenetic analyses in orchids. To test the efficiency of these genes serving as markers in phylogenetic analyses, coding regions of four genes (accD, ccsA, matK, and ycf1) were used as a case study to construct phylogenetic trees in the subfamily Epidendroideae. High support was obtained for placement of previously unlocated subtribes Collabiinae and Dendrobiinae in the subfamily Epidendroideae. Our findings expand understanding of the diversity of orchid chloroplast genomes and provide a reference for study of the molecular systematics of this family. PMID:24911363
Comparative chloroplast genomes of photosynthetic orchids: insights into evolution of the Orchidaceae and development of molecular markers for phylogenetic applications.

PubMed

Luo, Jing; Hou, Bei-Wei; Niu, Zhi-Tao; Liu, Wei; Xue, Qing-Yun; Ding, Xiao-Yu

2014-01-01

The orchid family Orchidaceae is one of the largest angiosperm families, including many species of important economic value. While chloroplast genomes are very informative for systematics and species identification, there is very limited information available on chloroplast genomes in the Orchidaceae. Here, we report the complete chloroplast genomes of the medicinal plant Dendrobium officinale and the ornamental orchid Cypripedium macranthos, demonstrating their gene content and order and potential RNA editing sites. The chloroplast genomes of the above two species and five known photosynthetic orchids showed similarities in structure as well as gene order and content, but differences in the organization of the inverted repeat/small single-copy junction and ndh genes. The organization of the inverted repeat/small single-copy junctions in the chloroplast genomes of these orchids was classified into four types; we propose that inverted repeats flanking the small single-copy region underwent expansion or contraction among Orchidaceae. The AT-rich regions of the ycf1 gene in orchids could be linked to the recombination of inverted repeat/small single-copy junctions. Relative species in orchids displayed similar patterns of variation in ndh gene contents. Furthermore, fifteen highly divergent protein-coding genes were identified, which are useful for phylogenetic analyses in orchids. To test the efficiency of these genes serving as markers in phylogenetic analyses, coding regions of four genes (accD, ccsA, matK, and ycf1) were used as a case study to construct phylogenetic trees in the subfamily Epidendroideae. High support was obtained for placement of previously unlocated subtribes Collabiinae and Dendrobiinae in the subfamily Epidendroideae. Our findings expand understanding of the diversity of orchid chloroplast genomes and provide a reference for study of the molecular systematics of this family.

A radiation hybrid map of river buffalo (Bubalus bubalis) chromosome 7 and comparative mapping to the cattle and human genomes

PubMed Central

Goldammer, T.; Weikard, R.; Miziara, M.N.; Brunner, R.M.; Agarwala, R.; Schäffer, A.A.; Womack, J.E.; Amaral, M.E.J.

2013-01-01

A preliminary radiation hybrid (RH) map containing 50 loci on chromosome 7 of the domestic river buffalo Bubalus bubalis (BBU; 2n = 50) was constructed based on a comparative mapping approach. The RH map of BBU7 includes thirty-seven gene markers and thirteen microsatellites. All loci have been previously assigned to Bos taurus (BTA) chromosome BTA6, which is known for its association with several economically important milk production traits in cattle. The map consists of two linkage groups spanning a total length of 627.9 cR5,000. Comparative analysis of the BBU7 RH5,000 map with BTA6 in cattle gave new evidence for strong similarity between the two chromosomes over their entire length and exposed minor differences in locus order. Comparison of the BBU7 RH5,000 map with the Homo sapiens (HSA) genome revealed similarity with a large chromosome segment of HSA4. Comparative analysis of loci in both species revealed more variability than previously known in gene order and several chromosome rearrangements including centromere relocation. The data obtained in our study define the evolutionary conserved segment on BBU7 and HSA4 to be between 3.5 megabases (Mb) and 115.8 Mb in the HSA4 (genome build 36) DNA sequence. PMID:18253035
The genetic structure of the A mating-type locus of Lentinula edodes.

PubMed

Au, Chun Hang; Wong, Man Chun; Bao, Dapeng; Zhang, Meiyan; Song, Chunyan; Song, Wenhua; Law, Patrick Tik Wan; Kües, Ursula; Kwan, Hoi Shan

2014-02-10

The Shiitake mushroom, Lentinula edodes (Berk.) Pegler is a tetrapolar basidiomycete with two unlinked mating-type loci, commonly called the A and B loci. Identifying the mating-types in shiitake is important for enhancing the breeding and cultivation of this economically-important edible mushroom. Here, we identified the A mating-type locus from the first draft genome sequence of L. edodes and characterized multiple alleles from different monokaryotic strains. Two intron-length polymorphism markers were developed to facilitate rapid molecular determination of A mating-type. L. edodes sequences were compared with those of known tetrapolar and bipolar basidiomycete species. The A mating-type genes are conserved at the homeodomain region across the order Agaricales. However, we observed unique genomic organization of the locus in L. edodes which exhibits atypical gene order and multiple repetitive elements around its A locus. To our knowledge, this is the first known exception among Homobasidiomycetes, in which the mitochondrial intermediate peptidase (mip) gene is not closely linked to A locus. Copyright © 2013 Elsevier B.V. All rights reserved.
Accelerated Evolution of Developmentally Biased Genes in the Tetraphenic Ant Cardiocondyla obscurior.

PubMed

Schrader, Lukas; Helanterä, Heikki; Oettler, Jan

2017-03-01

Plastic gene expression underlies phenotypic plasticity and plastically expressed genes evolve under different selection regimes compared with ubiquitously expressed genes. Social insects are well-suited models to elucidate the evolutionary dynamics of plastic genes for their genetically and environmentally induced discrete polymorphisms. Here, we study the evolution of plastically expressed genes in the ant Cardiocondyla obscurior-a species that produces two discrete male morphs in addition to the typical female polymorphism of workers and queens. Based on individual-level gene expression data from 28 early third instar larvae, we test whether the same evolutionary dynamics that pertain to plastically expressed genes in adults also pertain to genes with plastic expression during development. In order to quantify plasticity of gene expression over multiple contrasts, we develop a novel geometric measure. For genes expressed during development, we show that plasticity of expression is positively correlated with evolutionary rates. We furthermore find a strong correlation between expression plasticity and expression variation within morphs, suggesting a close link between active and passive plasticity of gene expression. Our results support the notion of relaxed selection and neutral processes as important drivers in the evolution of adaptive plasticity. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Computational, Integrative, and Comparative Methods for the Elucidation of Genetic Coexpression Networks

DOE PAGES

Baldwin, Nicole E.; Chesler, Elissa J.; Kirov, Stefan; ...

2005-01-01

Gene expression microarray data can be used for the assembly of genetic coexpression network graphs. Using mRNA samples obtained from recombinant inbred Mus musculus strains, it is possible to integrate allelic variation with molecular and higher-order phenotypes. The depth of quantitative genetic analysis of microarray data can be vastly enhanced utilizing this mouse resource in combination with powerful computational algorithms, platforms, and data repositories. The resulting network graphs transect many levels of biological scale. This approach is illustrated with the extraction of cliques of putatively co-regulated genes and their annotation using gene ontology analysis and cis -regulatory element discovery. Themore » causal basis for co-regulation is detected through the use of quantitative trait locus mapping.« less
[Molecular cloning and expression of Nattokinase gene in Bacillus subtilis].

PubMed

Liu, B Y; Song, H Y

2002-05-01

In order to characterize biochemically the nattokinase,the nucleotide sequence of the nattokinase gene was amplified from the chromosomal DNA of B.subtilis (natto) by PCR. The expression plasmid pBL NK was constructed and was used to transform Bacillus subtilis containing a chromosomal deletion in its subtilisin gene. The supernatant of the culture was collected after 15 h culture. The target proteins were identified by SDS-PAGE. Nattokinase was purified by a method including ultrafiltration, Sephacryl S-100 gel filtration and S-Sepharose ion-exchange chromatography, and 100 mg of purified nattokinase was obtained from one liter of culture. The purity of the protein and the specific activity were 95% and 12 000 u/mg (compared to tPA), respectively.
Hypervariable and highly divergent intron-exon organizations in the chordate Oikopleura dioica.

PubMed

Edvardsen, Rolf B; Lerat, Emmanuelle; Maeland, Anne Dorthea; Flåt, Mette; Tewari, Rita; Jensen, Marit F; Lehrach, Hans; Reinhardt, Richard; Seo, Hee-Chan; Chourrout, Daniel

2004-10-01

Oikopleura dioica is a pelagic tunicate with a very small genome and a very short life cycle. In order to investigate the intron-exon organizations in Oikopleura, we have isolated and characterized ribosomal protein EF-1alpha, Hox, and alpha-tubulin genes. Their intron positions have been compared with those of the same genes from various invertebrates and vertebrates, including four species with entirely sequenced genomes. Oikopleura genes, like Caenorhabditis genes, have introns at a large number of nonconserved positions, which must originate from late insertions or intron sliding of ancient insertions. Both species exhibit hypervariable intron-exon organization within their alpha-tubulin gene family. This is due to localization of most nonconserved intron positions in single members of this gene family. The hypervariability and divergence of intron positions in Oikopleura and Caenorhabditis may be related to the predominance of short introns, the processing of which is not very dependent upon the exonic environment compared to large introns. Also, both species have an undermethylated genome, and the control of methylation-induced point mutations imposes a control on exon size, at least in vertebrate genes. That introns placed at such variable positions in Oikopleura or C. elegans may serve a specific purpose is not easy to infer from our current knowledge and hypotheses on intron functions. We propose that new introns are retained in species with very short life cycles, because illegitimate exchanges including gene conversion are repressed. We also speculate that introns placed at gene-specific positions may contribute to suppressing these exchanges and thereby favor their own persistence.
Identification of three homologous latex-clearing protein (lcp) genes from the genome of Streptomyces sp. strain CFMR 7.

PubMed

Nanthini, Jayaram; Ong, Su Yean; Sudesh, Kumar

2017-09-10

Rubber materials have greatly contributed to human civilization. However, being a polymeric material does not decompose easily, it has caused huge environmental problems. On the other hand, only few bacteria are known to degrade rubber, with studies pertaining them being intensively focusing on the mechanism involved in microbial rubber degradation. The Streptomyces sp. strain CFMR 7, which was previously confirmed to possess rubber-degrading ability, was subjected to whole genome sequencing using the single molecule sequencing technology of the PacBio® RS II system. The genome was further analyzed and compared with previously reported rubber-degrading bacteria in order to identify the potential genes involved in rubber degradation. This led to the interesting discovery of three homologues of latex-clearing protein (Lcp) on the chromosome of this strain, which are probably responsible for rubber degrading activities. Genes encoding oxidoreductase α-subunit (oxiA) and oxidoreductase β-subunit (oxiB) were also found downstream of two lcp genes which are located adjacent to each other. In silico analysis reveals genes that have been identified to be involved in the microbial degradation of rubber in the Streptomyces sp. strain CFMR 7. This is the first whole genome sequence of a clear-zone-forming natural rubber- degrading Streptomyces sp., which harbours three Lcp homologous genes with the presence of oxiA and oxiB genes compared to the previously reported Gordonia polyisoprenivorans strain VH2 (with two Lcp homologous genes) and Nocardia nova SH22a (with only one Lcp gene). Copyright © 2017 Elsevier B.V. All rights reserved.
Divergent and nonuniform gene expression patterns in mouse brain

PubMed Central

Morris, John A.; Royall, Joshua J.; Bertagnolli, Darren; Boe, Andrew F.; Burnell, Josh J.; Byrnes, Emi J.; Copeland, Cathy; Desta, Tsega; Fischer, Shanna R.; Goldy, Jeff; Glattfelder, Katie J.; Kidney, Jolene M.; Lemon, Tracy; Orta, Geralyn J.; Parry, Sheana E.; Pathak, Sayan D.; Pearson, Owen C.; Reding, Melissa; Shapouri, Sheila; Smith, Kimberly A.; Soden, Chad; Solan, Beth M.; Weller, John; Takahashi, Joseph S.; Overly, Caroline C.; Lein, Ed S.; Hawrylycz, Michael J.; Hohmann, John G.; Jones, Allan R.

2010-01-01

Considerable progress has been made in understanding variations in gene sequence and expression level associated with phenotype, yet how genetic diversity translates into complex phenotypic differences remains poorly understood. Here, we examine the relationship between genetic background and spatial patterns of gene expression across seven strains of mice, providing the most extensive cellular-resolution comparative analysis of gene expression in the mammalian brain to date. Using comprehensive brainwide anatomic coverage (more than 200 brain regions), we applied in situ hybridization to analyze the spatial expression patterns of 49 genes encoding well-known pharmaceutical drug targets. Remarkably, over 50% of the genes examined showed interstrain expression variation. In addition, the variability was nonuniformly distributed across strain and neuroanatomic region, suggesting certain organizing principles. First, the degree of expression variance among strains mirrors genealogic relationships. Second, expression pattern differences were concentrated in higher-order brain regions such as the cortex and hippocampus. Divergence in gene expression patterns across the brain could contribute significantly to variations in behavior and responses to neuroactive drugs in laboratory mouse strains and may help to explain individual differences in human responsiveness to neuroactive drugs. PMID:20956311
MARQ: an online tool to mine GEO for experiments with similar or opposite gene expression signatures.

PubMed

Vazquez, Miguel; Nogales-Cadenas, Ruben; Arroyo, Javier; Botías, Pedro; García, Raul; Carazo, Jose M; Tirado, Francisco; Pascual-Montano, Alberto; Carmona-Saez, Pedro

2010-07-01

The enormous amount of data available in public gene expression repositories such as Gene Expression Omnibus (GEO) offers an inestimable resource to explore gene expression programs across several organisms and conditions. This information can be used to discover experiments that induce similar or opposite gene expression patterns to a given query, which in turn may lead to the discovery of new relationships among diseases, drugs or pathways, as well as the generation of new hypotheses. In this work, we present MARQ, a web-based application that allows researchers to compare a query set of genes, e.g. a set of over- and under-expressed genes, against a signature database built from GEO datasets for different organisms and platforms. MARQ offers an easy-to-use and integrated environment to mine GEO, in order to identify conditions that induce similar or opposite gene expression patterns to a given experimental condition. MARQ also includes additional functionalities for the exploration of the results, including a meta-analysis pipeline to find genes that are differentially expressed across different experiments. The application is freely available at http://marq.dacya.ucm.es.
Genome Evolution in the Obligate but Environmentally Active Luminous Symbionts of Flashlight Fish

PubMed Central

Hendry, Tory A.; de Wet, Jeffrey R.; Dougan, Katherine E.; Dunlap, Paul V.

2016-01-01

The luminous bacterial symbionts of anomalopid flashlight fish are thought to be obligately dependent on their hosts for growth and share several aspects of genome evolution with unrelated obligate symbionts, including genome reduction. However, in contrast to most obligate bacteria, anomalopid symbionts have an active environmental phase that may be important for symbiont transmission. Here we investigated patterns of evolution between anomalopid symbionts compared with patterns in free-living relatives and unrelated obligate symbionts to determine if trends common to obligate symbionts are also found in anomalopid symbionts. Two symbionts, “Candidatus Photodesmus katoptron” and “Candidatus Photodesmus blepharus,” have genomes that are highly similar in gene content and order, suggesting genome stasis similar to ancient obligate symbionts present in insect lineages. This genome stasis exists in spite of the symbiont’s inferred ability to recombine, which is frequently lacking in obligate symbionts with stable genomes. Additionally, we used genome comparisons and tests of selection to infer which genes may be particularly important for the symbiont’s ecology compared with relatives. In keeping with obligate dependence, substitution patterns suggest that most symbiont genes are experiencing relaxed purifying selection compared with relatives. However, genes involved in motility and carbon storage, which are likely to be used outside the host, appear to be under increased purifying selection. Two chemoreceptor chemotaxis genes are retained by both species and show high conservation with amino acid sensing genes, suggesting that the bacteria may actively seek out hosts using chemotaxis toward amino acids, which the symbionts are not able to synthesize. PMID:27389687
GeneYenta: a phenotype-based rare disease case matching tool based on online dating algorithms for the acceleration of exome interpretation.

PubMed

Gottlieb, Michael M; Arenillas, David J; Maithripala, Savanie; Maurer, Zachary D; Tarailo Graovac, Maja; Armstrong, Linlea; Patel, Millan; van Karnebeek, Clara; Wasserman, Wyeth W

2015-04-01

Advances in next-generation sequencing (NGS) technologies have helped reveal causal variants for genetic diseases. In order to establish causality, it is often necessary to compare genomes of unrelated individuals with similar disease phenotypes to identify common disrupted genes. When working with cases of rare genetic disorders, finding similar individuals can be extremely difficult. We introduce a web tool, GeneYenta, which facilitates the matchmaking process, allowing clinicians to coordinate detailed comparisons for phenotypically similar cases. Importantly, the system is focused on phenotype annotation, with explicit limitations on highly confidential data that create barriers to participation. The procedure for matching of patient phenotypes, inspired by online dating services, uses an ontology-based semantic case matching algorithm with attribute weighting. We evaluate the capacity of the system using a curated reference data set and 19 clinician entered cases comparing four matching algorithms. We find that the inclusion of clinician weights can augment phenotype matching. © 2015 WILEY PERIODICALS, INC.
Final Report for LDRD Project 02-ERD-069: Discovering the Unknown Mechanism(s) of Virulence in a BW, Class A Select Agent

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chain, P; Garcia, E

2003-02-06

The goal of this proposed effort was to assess the difficulty in identifying and characterizing virulence candidate genes in an organism for which very limited data exists. This was accomplished by first addressing the finishing phase of draft-sequenced F. tularensis genomes and conducting comparative analyses to determine the coding potential of each genome; to discover the differences in genome structure and content, and to identify potential genes whose products may be involved in the F. tularensis virulence process. The project was divided into three parts: (1) Genome finishing: This part involves determining the order and orientation of the consensus sequencesmore » of contigs obtained from Phrap assemblies of random draft genomic sequences. This tedious process consists of linking contig ends using information embedded in each sequence file that relates the sequence to the original cloned insert. Since inserts are sequenced from both ends, we can establish a link between these paired-ends in different contigs and thus order and orient contigs. Since these genomes carry numerous copies of insertion sequences, these repeated elements ''confuse'' the Phrap assembly program. It is thus necessary to break these contigs apart at the repeated sequences and individually join the proper flanking regions using paired-end information, or using results of comparisons against a similar genome. Larger repeated elements such as the small subunit ribosomal RNA operon require verification with PCR. Tandem repeats require manual intervention and typically rely on single nucleotide polymorphisms to be resolved. Remaining gaps require PCR reactions and sequencing. Once the genomes have been ''closed'', low quality regions are addressed by resequencing reactions. (2) Genome analysis: The final consensus sequences are processed by combining the results of three gene modelers: Glimmer, Critica and Generation. The final gene models are submitted to a battery of homology searches and domain prediction programs in order to annotate them (e.g. BLAST, Pfam, TIGRfam, COG, KEGG, InterPro, TMhmm, SignalP). The genome structure is also assessed in terms of G+C content, GC bias (GC skew), and locations of repeated regions (e.g. IS elements) and phage-like genes. (3) Comparative genomics: The results of the various genome analyses are compared between the finished (or almost finished) genomes. Here, we have compared the F. tularensis genomes from the extremely lethal strain Schu4 (subsp. tularensis), the vaccine strain LVS (subsp. holartica), and strain UT01-4992 of the less virulent, opportunistic subsp. novicida. Regions present in the highly virulent strain that are absent from the other less virulent strains may provide insight into what factors are required for the high level of virulence.« less
A multigene phylogenetic synthesis for the class Lecanoromycetes (Ascomycota): 1307 fungi representing 1139 infrageneric taxa, 317 genera and 66 families

PubMed Central

Miadlikowska, Jolanta; Kauff, Frank; Högnabba, Filip; Oliver, Jeffrey C.; Molnár, Katalin; Fraker, Emily; Gaya, Ester; Hafellner, Josef; Hofstetter, Valérie; Gueidan, Cécile; Otálora, Mónica A.G.; Hodkinson, Brendan; Kukwa, Martin; Lücking, Robert; Björk, Curtis; Sipman, Harrie J.M.; Burgaz, Ana Rosa; Thell, Arne; Passo, Alfredo; Myllys, Leena; Goward, Trevor; Fernández-Brime, Samantha; Hestmark, Geir; Lendemer, James; Lumbsch, H. Thorsten; Schmull, Michaela; Schoch, Conrad; Sérusiaux, Emmanuël; Maddison, David R.; Arnold, A. Elizabeth; Lutzoni, François; Stenroos, Soili

2014-01-01

The Lecanoromycetes is the largest class of lichenized Fungi, and one of the most species-rich classes in the kingdom. Here we provide a multigene phylogenetic synthesis (using three ribosomal RNA-coding and two protein-coding genes) of the Lecanoromycetes based on 642 newly generated and 3329 publicly available sequences representing 1139 taxa, 317 genera, 66 families, 17 orders and five subclasses (four currently recognized: Acarosporomycetidae, Lecanoromycetidae, Ostropomycetidae, Umbilicariomycetidae; and one provisionarily recognized, ‘Candelariomycetidae’). Maximum likelihood phylogenetic analyses on four multigene datasets assembled using a cumulative supermatrix approach with a progressively higher number of species and missing data (5-gene, 5+4-gene, 5+4+3-gene and 5+4+3+2-gene datasets) show that the current classification includes non-monophyletic taxa at various ranks, which need to be recircumscribed and require revisionary treatments based on denser taxon sampling and more loci. Two newly circumscribed orders (Arctomiales and Hymeneliales in the Ostropomycetidae) and three families (Ramboldiaceae and Psilolechiaceae in the Lecanorales, and Strangosporaceae in the Lecanoromycetes inc. sed.) are introduced. The potential resurrection of the families Eigleraceae and Lopadiaceae is considered here to alleviate phylogenetic and classification disparities. An overview of the photobionts associated with the main fungal lineages in the Lecanoromycetes based on available published records is provided. A revised schematic classification at the family level in the phylogenetic context of widely accepted and newly revealed relationships across Lecanoromycetes is included. The cumulative addition of taxa with an increasing amount of missing data (i.e., a cumulative supermatrix approach, starting with taxa for which sequences were available for all five targeted genes and ending with the addition of taxa for which only two genes have been sequenced) revealed relatively stable relationships for many families and orders. However, the increasing number of taxa without the addition of more loci also resulted in an expected substantial loss of phylogenetic resolving power and support (especially for deep phylogenetic relationships), potentially including the misplacements of several taxa. Future phylogenetic analyses should include additional single copy protein-coding markers in order to improve the tree of the Lecanoromycetes. As part of this study, a new module (“Hypha”) of the freely available Mesquite software was developed to compare and display the internodal support values derived from this cumulative supermatrix approach. PMID:24747130
BactoGeNIE: A large-scale comparative genome visualization for big displays

DOE PAGES

Aurisano, Jillian; Reda, Khairi; Johnson, Andrew; ...

2015-08-13

The volume of complete bacterial genome sequence data available to comparative genomics researchers is rapidly increasing. However, visualizations in comparative genomics--which aim to enable analysis tasks across collections of genomes--suffer from visual scalability issues. While large, multi-tiled and high-resolution displays have the potential to address scalability issues, new approaches are needed to take advantage of such environments, in order to enable the effective visual analysis of large genomics datasets. In this paper, we present Bacterial Gene Neighborhood Investigation Environment, or BactoGeNIE, a novel and visually scalable design for comparative gene neighborhood analysis on large display environments. We evaluate BactoGeNIE throughmore » a case study on close to 700 draft Escherichia coli genomes, and present lessons learned from our design process. In conclusion, BactoGeNIE accommodates comparative tasks over substantially larger collections of neighborhoods than existing tools and explicitly addresses visual scalability. Given current trends in data generation, scalable designs of this type may inform visualization design for large-scale comparative research problems in genomics.« less
BactoGeNIE: a large-scale comparative genome visualization for big displays

PubMed Central

2015-01-01

Background The volume of complete bacterial genome sequence data available to comparative genomics researchers is rapidly increasing. However, visualizations in comparative genomics--which aim to enable analysis tasks across collections of genomes--suffer from visual scalability issues. While large, multi-tiled and high-resolution displays have the potential to address scalability issues, new approaches are needed to take advantage of such environments, in order to enable the effective visual analysis of large genomics datasets. Results In this paper, we present Bacterial Gene Neighborhood Investigation Environment, or BactoGeNIE, a novel and visually scalable design for comparative gene neighborhood analysis on large display environments. We evaluate BactoGeNIE through a case study on close to 700 draft Escherichia coli genomes, and present lessons learned from our design process. Conclusions BactoGeNIE accommodates comparative tasks over substantially larger collections of neighborhoods than existing tools and explicitly addresses visual scalability. Given current trends in data generation, scalable designs of this type may inform visualization design for large-scale comparative research problems in genomics. PMID:26329021
BactoGeNIE: A large-scale comparative genome visualization for big displays

DOE Office of Scientific and Technical Information (OSTI.GOV)

Aurisano, Jillian; Reda, Khairi; Johnson, Andrew

The volume of complete bacterial genome sequence data available to comparative genomics researchers is rapidly increasing. However, visualizations in comparative genomics--which aim to enable analysis tasks across collections of genomes--suffer from visual scalability issues. While large, multi-tiled and high-resolution displays have the potential to address scalability issues, new approaches are needed to take advantage of such environments, in order to enable the effective visual analysis of large genomics datasets. In this paper, we present Bacterial Gene Neighborhood Investigation Environment, or BactoGeNIE, a novel and visually scalable design for comparative gene neighborhood analysis on large display environments. We evaluate BactoGeNIE throughmore » a case study on close to 700 draft Escherichia coli genomes, and present lessons learned from our design process. In conclusion, BactoGeNIE accommodates comparative tasks over substantially larger collections of neighborhoods than existing tools and explicitly addresses visual scalability. Given current trends in data generation, scalable designs of this type may inform visualization design for large-scale comparative research problems in genomics.« less
Comparative transcriptome analysis of papilla and skin in the sea cucumber, Apostichopus japonicus.

PubMed

Zhou, Xiaoxu; Cui, Jun; Liu, Shikai; Kong, Derong; Sun, He; Gu, Chenlei; Wang, Hongdi; Qiu, Xuemei; Chang, Yaqing; Liu, Zhanjiang; Wang, Xiuli

2016-01-01

Papilla and skin are two important organs of the sea cucumber. Both tissues have ectodermic origin, but they are morphologically and functionally very different. In the present study, we performed comparative transcriptome analysis of the papilla and skin from the sea cucumber (Apostichopus japonicus) in order to identify and characterize gene expression profiles by using RNA-Seq technology. We generated 30.6 and 36.4 million clean reads from the papilla and skin and de novo assembled in 156,501 transcripts. The Gene Ontology (GO) analysis indicated that cell part, metabolic process and catalytic activity were the most abundant GO category in cell component, biological process and molecular funcation, respectively. Comparative transcriptome analysis between the papilla and skin allowed the identification of 1,059 differentially expressed genes, of which 739 genes were expressed at higher levels in papilla, while 320 were expressed at higher levels in skin. In addition, 236 differentially expressed unigenes were not annotated with any database, 160 of which were apparently expressed at higher levels in papilla, 76 were expressed at higher levels in skin. We identified a total of 288 papilla-specific genes, 171 skin-specific genes and 600 co-expressed genes. Also, 40 genes in papilla-specific were not annotated with any database, 2 in skin-specific. Development-related genes were also enriched, such as fibroblast growth factor, transforming growth factor-β, collagen-α2 and Integrin-α2, which may be related to the formation of the papilla and skin in sea cucumber. Further pathway analysis identified ten KEGG pathways that were differently enriched between the papilla and skin. The findings on expression profiles between two key organs of the sea cucumber should be valuable to reveal molecular mechanisms involved in the development of organs that are related but with morphological differences in the sea cucumber.
Comparative Genome Analysis of “Candidatus Phytoplasma australiense” (Subgroup tuf-Australia I; rp-A) and “Ca. Phytoplasma asteris” Strains OY-M and AY-WB▿ †

PubMed Central

Tran-Nguyen, L. T. T.; Kube, M.; Schneider, B.; Reinhardt, R.; Gibb, K. S.

2008-01-01

The chromosome sequence of “Candidatus Phytoplasma australiense” (subgroup tuf-Australia I; rp-A), associated with dieback in papaya, Australian grapevine yellows in grapevine, and several other important plant diseases, was determined. The circular chromosome is represented by 879,324 nucleotides, a GC content of 27%, and 839 protein-coding genes. Five hundred two of these protein-coding genes were functionally assigned, while 337 genes were hypothetical proteins with unknown function. Potential mobile units (PMUs) containing clusters of DNA repeats comprised 12.1% of the genome. These PMUs encoded genes involved in DNA replication, repair, and recombination; nucleotide transport and metabolism; translation; and ribosomal structure. Elements with similarities to phage integrases found in these mobile units were difficult to classify, as they were similar to both insertion sequences and bacteriophages. Comparative analysis of “Ca. Phytoplasma australiense” with “Ca. Phytoplasma asteris” strains OY-M and AY-WB showed that the gene order was more conserved between the closely related “Ca. Phytoplasma asteris” strains than to “Ca. Phytoplasma australiense.” Differences observed between “Ca. Phytoplasma australiense” and “Ca. Phytoplasma asteris” strains included the chromosome size (18,693 bp larger than OY-M), a larger number of genes with assigned function, and hypothetical proteins with unknown function. PMID:18359806
Horizontal gene transfer in silkworm, Bombyx mori.

PubMed

Zhu, Bo; Lou, Miao-Miao; Xie, Guan-Lin; Zhang, Guo-Qing; Zhou, Xue-Ping; Li, Bin; Jin, Gu-Lei

2011-05-19

The domesticated silkworm, Bombyx mori, is the model insect for the order Lepidoptera, has economically important values, and has gained some representative behavioral characteristics compared to its wild ancestor. The genome of B. mori has been fully sequenced while function analysis of BmChi-h and BmSuc1 genes revealed that horizontal gene transfer (HGT) maybe bestow a clear selective advantage to B. mori. However, the role of HGT in the evolutionary history of B. mori is largely unexplored. In this study, we compare the whole genome of B. mori with those of 382 prokaryotic and eukaryotic species to investigate the potential HGTs. Ten candidate HGT events were defined in B. mori by comprehensive sequence analysis using Maximum Likelihood and Bayesian method combining with EST checking. Phylogenetic analysis of the candidate HGT genes suggested that one HGT was plant-to- B. mori transfer while nine were bacteria-to- B. mori transfer. Furthermore, functional analysis based on expression, coexpression and related literature searching revealed that several HGT candidate genes have added important characters, such as resistance to pathogen, to B. mori. Results from this study clearly demonstrated that HGTs play an important role in the evolution of B. mori although the number of HGT events in B. mori is in general smaller than those of microbes and other insects. In particular, interdomain HGTs in B. mori may give rise to functional, persistent, and possibly evolutionarily significant new genes.
A Review of Gene Knockout Strategies for Microbial Cells.

PubMed

Tang, Phooi Wah; Chua, Pooi San; Chong, Shiue Kee; Mohamad, Mohd Saberi; Choon, Yee Wen; Deris, Safaai; Omatu, Sigeru; Corchado, Juan Manuel; Chan, Weng Howe; Rahim, Raha Abdul

2015-01-01

Predicting the effects of genetic modification is difficult due to the complexity of metabolic net- works. Various gene knockout strategies have been utilised to deactivate specific genes in order to determine the effects of these genes on the function of microbes. Deactivation of genes can lead to deletion of certain proteins and functions. Through these strategies, the associated function of a deleted gene can be identified from the metabolic networks. The main aim of this paper is to review the available techniques in gene knockout strategies for microbial cells. The review is done in terms of their methodology, recent applications in microbial cells. In addition, the advantages and disadvantages of the techniques are compared and discuss and the related patents are also listed as well. Traditionally, gene knockout is done through wet lab (in vivo) techniques, which were conducted through laboratory experiments. However, these techniques are costly and time consuming. Hence, various dry lab (in silico) techniques, where are conducted using computational approaches, have been developed to surmount these problem. The development of numerous techniques for gene knockout in microbial cells has brought many advancements in the study of gene functions. Based on the literatures, we found that the gene knockout strategies currently used are sensibly implemented with regard to their benefits.

Deciphering life history transcriptomes in different environments

PubMed Central

Etges, William J.; Trotter, Meredith V.; de Oliveira, Cássia C.; Rajpurohit, Subhash; Gibbs, Allen G.; Tuljapurkar, Shripad

2014-01-01

We compared whole transcriptome variation in six preadult stages and seven adult female ages in two populations of cactophilic Drosophila mojavensis reared on two host plants in order to understand how differences in gene expression influence standing life history variation. We used Singular Value Decomposition (SVD) to identify dominant trajectories of life cycle gene expression variation, performed pair-wise comparisons of stage and age differences in gene expression across the life cycle, identified when genes exhibited maximum levels of life cycle gene expression, and assessed population and host cactus effects on gene expression. Life cycle SVD analysis returned four significant components of transcriptional variation, revealing functional enrichment of genes responsible for growth, metabolic function, sensory perception, neural function, translation and aging. Host cactus effects on female gene expression revealed population and stage specific differences, including significant host plant effects on larval metabolism and development, as well as adult neurotransmitter binding and courtship behavior gene expression levels. In 3 - 6 day old virgin females, significant up-regulation of genes associated with meiosis and oogenesis was accompanied by down-regulation of genes associated with somatic maintenance, evidence for a life history tradeoff. The transcriptome of D. mojavensis reared in natural environments throughout its life cycle revealed core developmental transitions and genome wide influences on life history variation in natural populations. PMID:25442828
Characterization of the complete mitogenomes of two Neoscona spiders (Araneae: Araneidae) and its phylogenetic implications.

PubMed

Wang, Zheng-Liang; Li, Chao; Fang, Wen-Yuan; Yu, Xiao-Ping

2016-09-30

The complete mitogenomes of two orb-weaving spiders Neoscona doenitzi and Neoscona nautica were determined and a comparative mitogenomic analysis was performed to depict evolutionary trends of spider mitogenomes. The circular mitogenomes are 14,161bp with A+T content of 74.6% in N. doenitzi and 14,049bp with A+T content of 78.8% in N. nautica, respectively. Both mitogenomes contain a standard set of 37 genes typically presented in metazoans. Gene content and orientation are identical to all previously sequenced spider mitogenomes, while gene order is rearranged by tRNAs translocation when compared with the putative ancestral gene arrangement pattern presented by Limulus polyphemus. A comparative mitogenomic analysis reveals that the nucleotide composition bias is obviously divergent between spiders in suborder Opisthothelae and Mesothelae. The loss of D-arm in the trnS(UCN) among all of Opisthothelae spiders highly suggested that this common feature is a synapomorphy for entire suborder Opisthothelae. Moreover, the trnS(AGN) in araneoids preferred to use TCT as an anticodon rather than the typical anticodon GCT. Phylogenetic analysis based on the 13 protein-coding gene sequences consistently yields trees that nest the two Neoscona spiders within Araneidae and recover superfamily Araneoidea as a monophyletic group. The molecular information acquired from the results of this study should be very useful for future research on mitogenomic evolution and genetic diversities in spiders. Copyright © 2016 Elsevier B.V. All rights reserved.
Comparing effects of perfusion and hydrostatic pressure on gene profiles of human chondrocyte.

PubMed

Zhu, Ge; Mayer-Wagner, Susanne; Schröder, Christian; Woiczinski, Matthias; Blum, Helmut; Lavagi, Ilaria; Krebs, Stefan; Redeker, Julia I; Hölzer, Andreas; Jansson, Volkmar; Betz, Oliver; Müller, Peter E

2015-09-20

Hydrostatic pressure and perfusion have been shown to regulate the chondrogenic potential of articular chondrocytes. In order to compare the effects of hydrostatic pressure plus perfusion (HPP) and perfusion (P) we investigated the complete gene expression profiles of human chondrocytes under HPP and P. A simplified bioreactor was constructed to apply loading (0.1 MPa for 2 h) and perfusion (2 ml) through the same piping by pressurizing the medium directly. High-density monolayer cultures of human chondrocytes were exposed to HPP or P for 4 days. Controls (C) were maintained in static cultures. Gene expression was evaluated by sequencing (RNAseq) and quantitative real-time PCR analysis. Both treatments changed gene expression levels of human chondrocytes significantly. Specifically, HPP and P increased COL2A1 expression and decreased COL1A1 and MMP-13 expression. Despite of these similarities, RNAseq revealed a list of cartilage genes including ACAN, ITGA10 and TNC, which were differentially expressed by HPP and P. Of these candidates, adhesion related molecules were found to be upregulated in HPP. Both HPP and P treatment had beneficial effects on chondrocyte differentiation and decreased catabolic enzyme expression. The study provides new insight into how hydrostatic pressure and perfusion enhance cartilage differentiation and inhibit catabolic effects. Copyright © 2015 Elsevier B.V. All rights reserved.
Rates and Patterns of Chromosomal Evolution in Drosophila pseudoobscura and D. miranda

PubMed Central

Bartolomé, Carolina; Charlesworth, Brian

2006-01-01

Comparisons of gene orders between species permit estimation of the rate of chromosomal evolution since their divergence from a common ancestor. We have compared gene orders on three chromosomes of Drosophila pseudoobscura with its close relative, D. miranda, and the distant outgroup species, D. melanogaster, by using the public genome sequences of D. pseudoobscura and D. melanogaster and ∼50 in situ hybridizations of gene probes in D. miranda. We find no evidence for extensive transfer of genes among chromosomes in D. miranda. The rates of chromosomal rearrangements between D. miranda and D. pseudoobscura are far higher than those found before in Drosophila and approach those for nematodes, the fastest rates among higher eukaryotes. In addition, we find that the D. pseudoobscura chromosome with the highest level of inversion polymorphism (Muller's element C) does not show an unusually fast rate of evolution with respect to chromosome structure, suggesting that this classic case of inversion polymorphism reflects selection rather than mutational processes. On the basis of our results, we propose possible ancestral arrangements for the D. pseudoobscura C chromosome, which are different from those in the current literature. We also describe a new method for correcting for rearrangements that are not detected with a limited set of markers. PMID:16547107
Identification of differentially expressed genes from Trichoderma harzianum during growth on cell wall of Fusarium solani as a tool for biotechnological application

PubMed Central

2013-01-01

Background The species of T. harzianum are well known for their biocontrol activity against many plant pathogens. However, there is a lack of studies concerning its use as a biological control agent against F. solani, a pathogen involved in several crop diseases. In this study, we have used subtractive library hybridization (SSH) and quantitative real-time PCR (RT-qPCR) techniques in order to explore changes in T. harzianum genes expression during growth on cell wall of F. solani (FSCW) or glucose. RT-qPCR was also used to examine the regulation of 18 genes, potentially involved in biocontrol, during confrontation between T. harzianum and F. solani. Results Data obtained from two subtractive libraries were compared after annotation using the Blast2GO suite. A total of 417 and 78 readable EST sequence were annotated in the FSCW and glucose libraries, respectively. Functional annotation of these genes identified diverse biological processes and molecular functions required during T. harzianum growth on FSCW or glucose. We identified various genes of biotechnological value encoding to proteins which function such as transporters, hydrolytic activity, adherence, appressorium development and pathogenesis. Fifteen genes were up-regulated and sixteen were down-regulated at least at one-time point during growth of T. harzianum in FSCW. During the confrontation assay most of the genes were up-regulated, mainly after contact, when the interaction has been established. Conclusions This study demonstrates that T. harzianum expressed different genes when grown on FSCW compared to glucose. It provides insights into the mechanisms of gene expression involved in mycoparasitism of T. harzianum against F. solani. The identification and evaluation of these genes may contribute to the development of an efficient biological control agent. PMID:23497274
Identification of differentially expressed genes from Trichoderma harzianum during growth on cell wall of Fusarium solani as a tool for biotechnological application.

PubMed

Vieira, Pabline Marinho; Coelho, Alexandre Siqueira Guedes; Steindorff, Andrei Stecca; de Siqueira, Saulo José Linhares; Silva, Roberto do Nascimento; Ulhoa, Cirano José

2013-03-15

The species of T. harzianum are well known for their biocontrol activity against many plant pathogens. However, there is a lack of studies concerning its use as a biological control agent against F. solani, a pathogen involved in several crop diseases. In this study, we have used subtractive library hybridization (SSH) and quantitative real-time PCR (RT-qPCR) techniques in order to explore changes in T. harzianum genes expression during growth on cell wall of F. solani (FSCW) or glucose. RT-qPCR was also used to examine the regulation of 18 genes, potentially involved in biocontrol, during confrontation between T. harzianum and F. solani. Data obtained from two subtractive libraries were compared after annotation using the Blast2GO suite. A total of 417 and 78 readable EST sequence were annotated in the FSCW and glucose libraries, respectively. Functional annotation of these genes identified diverse biological processes and molecular functions required during T. harzianum growth on FSCW or glucose. We identified various genes of biotechnological value encoding to proteins which function such as transporters, hydrolytic activity, adherence, appressorium development and pathogenesis. Fifteen genes were up-regulated and sixteen were down-regulated at least at one-time point during growth of T. harzianum in FSCW. During the confrontation assay most of the genes were up-regulated, mainly after contact, when the interaction has been established. This study demonstrates that T. harzianum expressed different genes when grown on FSCW compared to glucose. It provides insights into the mechanisms of gene expression involved in mycoparasitism of T. harzianum against F. solani. The identification and evaluation of these genes may contribute to the development of an efficient biological control agent.
Major Histocompatibility Complex Genes Map to Two Chromosomes in an Evolutionarily Ancient Reptile, the Tuatara Sphenodon punctatus.

PubMed

Miller, Hilary C; O'Meally, Denis; Ezaz, Tariq; Amemiya, Chris; Marshall-Graves, Jennifer A; Edwards, Scott

2015-05-07

Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general. Copyright © 2015 Miller et al.
Increased vitamin D receptor gene expression and rs11568820 and rs4516035 promoter polymorphisms in autistic disorder.

PubMed

Balta, Burhan; Gumus, Hakan; Bayramov, Ruslan; Korkmaz Bayramov, Keziban; Erdogan, Murat; Oztop, Didem Behice; Dogan, Muhammet Ensar; Taheri, Serpil; Dundar, Munis

2018-05-18

Although there are a large number of sequence variants of different genes and copy number variations at various loci identified in autistic disorder (AD) patients, the pathogenesis of AD has not been elucidated completely. Recently, in AD patients, a large number of expression array and transcriptome studies have shown an increase in the expression of genes especially related to innate immune response. Antimicrobial effects of vitamin D and VDR are exerted through Toll-Like-Receptors (TLR) which have an important role in the innate immune response, are expressed by antigen presenting cells and recognize foreign microorganisms. In this study, age and gender matched 30 patients diagnosed with AD and 30 healthy controls were included in the study. Comparatively whole blood VDR gene expression and rs11568820 and rs4516035 SNP profile of the promoter region of the VDR gene were investigated by real time PCR. Whole blood VDR gene expression was significantly higher in the AD group compared to control subjects (p < 0.0001). There were no significant differences among allele and genotype distribution of rs11568820 and rs4516035 polymorphisms between AD patients and controls. The increase of VDR gene expression in patients with AD may be in accordance with an increase in the innate immune response in patients with AD. Furthermore, this study will stimulate new studies in order to clarify the relationship among AD, vitamin D, VDR, and innate immunity.
qPCR in gastrointestinal stromal tumors: Evaluation of reference genes and expression analysis of KIT and the alternative receptor tyrosine kinases FLT3, CSF1-R, PDGFRB, MET and AXL

PubMed Central

2010-01-01

Background Gastrointestinal stromal tumors (GIST) represent the most common mesenchymal tumors of the gastrointestinal tract. About 85% carry an activating mutation in the KIT or PDGFRA gene. Approximately 10% of GIST are so-called wild type GIST (wt-GIST) without mutations in the hot spots. In the present study we evaluated appropriate reference genes for the expression analysis of formalin-fixed, paraffin-embedded and fresh frozen samples from gastrointestinal stromal tumors. We evaluated the gene expression of KIT as well as of the alternative receptor tyrosine kinase genes FLT3, CSF1-R, PDGFRB, AXL and MET by qPCR. wt-GIST were compared to samples with mutations in KIT exon 9 and 11 and PDGFRA exon 18 in order to evaluate whether overexpression of these alternative RTK might contribute to the pathogenesis of wt-GIST. Results Gene expression variability of the pooled cDNA samples is much lower than the single reverse transcription cDNA synthesis. By combining the lowest variability values of fixed and fresh tissue, the genes POLR2A, PPIA, RPLPO and TFRC were chosen for further analysis of the GIST samples. Overexpression of KIT compared to the corresponding normal tissue was detected in each GIST subgroup except in GIST with PDGFRA exon 18 mutation. Comparing our sample groups, no significant differences in the gene expression levels of FLT3, CSF1R and AXL were determined. An exception was the sample group with KIT exon 9 mutation. A significantly reduced expression of CSF1R, FLT3 and PDGFRB compared to the normal tissue was detected. GIST with mutations in KIT exon 9 and 11 and in PDGFRA exon 18 showed a significant PDGFRB downregulation. Conclusions As the variability of expression levels for the reference genes is very high comparing fresh frozen and formalin-fixed tissue there is a strong need for validation in each tissue type. None of the alternative receptor tyrosine kinases analyzed is associated with the pathogenesis of wild-type or mutated GIST. It remains to be clarified whether an autocrine or paracrine mechanism by overexpression of receptor tyrosine kinase ligands is responsible for the tumorigenesis of wt-GIST. PMID:21171987
Update on Genomic Databases and Resources at the National Center for Biotechnology Information.

PubMed

Tatusova, Tatiana

2016-01-01

The National Center for Biotechnology Information (NCBI), as a primary public repository of genomic sequence data, collects and maintains enormous amounts of heterogeneous data. Data for genomes, genes, gene expressions, gene variation, gene families, proteins, and protein domains are integrated with the analytical, search, and retrieval resources through the NCBI website, text-based search and retrieval system, provides a fast and easy way to navigate across diverse biological databases.Comparative genome analysis tools lead to further understanding of evolution processes quickening the pace of discovery. Recent technological innovations have ignited an explosion in genome sequencing that has fundamentally changed our understanding of the biology of living organisms. This huge increase in DNA sequence data presents new challenges for the information management system and the visualization tools. New strategies have been designed to bring an order to this genome sequence shockwave and improve the usability of associated data.
RNA interference tools for the western flower thrips, Frankliniella occidentalis.

PubMed

Badillo-Vargas, Ismael E; Rotenberg, Dorith; Schneweis, Brandi A; Whitfield, Anna E

2015-05-01

The insect order Thysanoptera is exclusively comprised of small insects commonly known as thrips. The western flower thrips, Frankliniella occidentalis, is an economically important pest amongst thysanopterans due to extensive feeding damage and tospovirus transmission to hundreds of plant species worldwide. Geographically-distinct populations of F. occidentalis have developed resistance against many types of traditional chemical insecticides, and as such, management of thrips and tospoviruses are a persistent challenge in agriculture. Molecular methods for defining the role(s) of specific genes in thrips-tospovirus interactions and for assessing their potential as gene targets in thrips management strategies is currently lacking. The goal of this work was to develop an RNA interference (RNAi) tool that enables functional genomic assays and to evaluate RNAi for its potential as a biologically-based approach for controlling F. occidentalis. Using a microinjection system, we delivered double-stranded RNA (dsRNA) directly to the hemocoel of female thrips to target the vacuolar ATP synthase subunit B (V-ATPase-B) gene of F. occidentalis. Gene expression analysis using real-time quantitative reverse transcriptase-PCR (qRT-PCR) revealed significant reductions of V-ATPase-B transcripts at 2 and 3 days post-injection (dpi) with dsRNA of V-ATPase-B compared to injection with dsRNA of GFP. Furthermore, the effect of knockdown of the V-ATPase-B gene in females at these two time points was mirrored by the decreased abundance of V-ATPase-B protein as determined by quantitative analysis of Western blots. Reduction in V-ATPase-B expression in thrips resulted in increased female mortality and reduced fertility, i.e., number of viable offspring produced. Survivorship decreased significantly by six dpi compared to the dsRNA-GFP control group, which continued decreasing significantly until the end of the bioassay. Surviving female thrips injected with dsRNA-V-ATPase-B produced significantly fewer offspring compared to those in the dsRNA-GFP control group. Our findings indicate that an RNAi-based strategy to study gene function in thrips is feasible, can result in quantifiable phenotypes, and provides a much-needed tool for investigating the molecular mechanisms of thrips-tospovirus interactions. To our knowledge, this represents the first report of RNAi for any member of the insect order Thysanoptera and demonstrates the potential for translational research in the area of thrips pest control. Copyright © 2015 Elsevier Ltd. All rights reserved.
Statistical Analysis of Microarray Data with Replicated Spots: A Case Study with Synechococcus WH8102

PubMed Central

Thomas, E. V.; Phillippy, K. H.; Brahamsha, B.; Haaland, D. M.; Timlin, J. A.; Elbourne, L. D. H.; Palenik, B.; Paulsen, I. T.

2009-01-01

Until recently microarray experiments often involved relatively few arrays with only a single representation of each gene on each array. A complete genome microarray with multiple spots per gene (spread out spatially across the array) was developed in order to compare the gene expression of a marine cyanobacterium and a knockout mutant strain in a defined artificial seawater medium. Statistical methods were developed for analysis in the special situation of this case study where there is gene replication within an array and where relatively few arrays are used, which can be the case with current array technology. Due in part to the replication within an array, it was possible to detect very small changes in the levels of expression between the wild type and mutant strains. One interesting biological outcome of this experiment is the indication of the extent to which the phosphorus regulatory system of this cyanobacterium affects the expression of multiple genes beyond those strictly involved in phosphorus acquisition. PMID:19404483
Statistical Analysis of Microarray Data with Replicated Spots: A Case Study with Synechococcus WH8102

DOE PAGES

Thomas, E. V.; Phillippy, K. H.; Brahamsha, B.; ...

2009-01-01

Until recently microarray experiments often involved relatively few arrays with only a single representation of each gene on each array. A complete genome microarray with multiple spots per gene (spread out spatially across the array) was developed in order to compare the gene expression of a marine cyanobacterium and a knockout mutant strain in a defined artificial seawater medium. Statistical methods were developed for analysis in the special situation of this case study where there is gene replication within an array and where relatively few arrays are used, which can be the case with current array technology. Due in partmore » to the replication within an array, it was possible to detect very small changes in the levels of expression between the wild type and mutant strains. One interesting biological outcome of this experiment is the indication of the extent to which the phosphorus regulatory system of this cyanobacterium affects the expression of multiple genes beyond those strictly involved in phosphorus acquisition.« less
The multiple sex chromosomes of platypus and echidna are not completely identical and several share homology with the avian Z.

PubMed

Rens, Willem; O'Brien, Patricia C M; Grützner, Frank; Clarke, Oliver; Graphodatskaya, Daria; Tsend-Ayush, Enkhjargal; Trifonov, Vladimir A; Skelton, Helen; Wallis, Mary C; Johnston, Steve; Veyrunes, Frederic; Graves, Jennifer A M; Ferguson-Smith, Malcolm A

2007-01-01

Sex-determining systems have evolved independently in vertebrates. Placental mammals and marsupials have an XY system, birds have a ZW system. Reptiles and amphibians have different systems, including temperature-dependent sex determination, and XY and ZW systems that differ in origin from birds and placental mammals. Monotremes diverged early in mammalian evolution, just after the mammalian clade diverged from the sauropsid clade. Our previous studies showed that male platypus has five X and five Y chromosomes, no SRY, and DMRT1 on an X chromosome. In order to investigate monotreme sex chromosome evolution, we performed a comparative study of platypus and echidna by chromosome painting and comparative gene mapping. Chromosome painting reveals a meiotic chain of nine sex chromosomes in the male echidna and establishes their order in the chain. Two of those differ from those in the platypus, three of the platypus sex chromosomes differ from those of the echidna and the order of several chromosomes is rearranged. Comparative gene mapping shows that, in addition to bird autosome regions, regions of bird Z chromosomes are homologous to regions in four platypus X chromosomes, that is, X1, X2, X3, X5, and in chromosome Y1. Monotreme sex chromosomes are easiest to explain on the hypothesis that autosomes were added sequentially to the translocation chain, with the final additions after platypus and echidna divergence. Genome sequencing and contig anchoring show no homology yet between platypus and therian Xs; thus, monotremes have a unique XY sex chromosome system that shares some homology with the avian Z.
Is It an Ant or a Butterfly? Convergent Evolution in the Mitochondrial Gene Order of Hymenoptera and Lepidoptera

PubMed Central

Babbucci, Massimiliano; Basso, Andrea; Scupola, Antonio; Patarnello, Tomaso; Negrisolo, Enrico

2014-01-01

Insect mitochondrial genomes (mtDNA) are usually double helical and circular molecules containing 37 genes that are encoded on both strands. The arrangement of the genes is not constant for all species, and produces distinct gene orders (GOs) that have proven to be diagnostic in defining clades at different taxonomic levels. In general, it is believed that distinct taxa have a very low chance of sharing identically arranged GOs. However, examples of identical, homoplastic local rearrangements occurring in distinct taxa do exist. In this study, we sequenced the complete mtDNAs of the ants Formica fusca and Myrmica scabrinodis (Formicidae, Hymenoptera) and compared their GOs with those of other Insecta. The GO of F. fusca was found to be identical to the GO of Dytrisia (the largest clade of Lepidoptera). This finding is the first documented case of an identical GO shared by distinct groups of Insecta, and it is the oldest known event of GO convergent evolution in animals. Both Hymenoptera and Lepidoptera acquired this GO early in their evolution. Using a phylogenetic approach combined with new bioinformatic tools, the chronological order of the evolutionary events that produced the diversity of the hymenopteran GOs was determined. Additionally, new local homoplastic rearrangements shared by distinct groups of insects were identified. Our study showed that local and global homoplasies affecting the insect GOs are more widespread than previously thought. Homoplastic GOs can still be useful for characterizing the various clades, provided that they are appropriately considered in a phylogenetic and taxonomic context. PMID:25480682
Quantitative real-time PCR normalization for gene expression studies in the plant pathogenic fungi Lasiodiplodia theobromae.

PubMed

Paolinelli-Alfonso, Marcos; Galindo-Sánchez, Clara Elizabeth; Hernandez-Martinez, Rufina

2016-08-01

Lasiodiplodia theobromae is a highly virulent plant pathogen. It has been suggested that heat stress increases its virulence. The aim of this work was to evaluate, compare, and recommend normalization strategies for gene expression analysis of the fungus growing with grapevine wood under heat stress. Using RT-qPCR-derived data, reference gene stability was evaluated through geNorm, NormFinder and Bestkeeper applications. Based on the geometric mean using the ranking position obtained for each independent analysis, genes were ranked from least to most stable as follows: glyceraldehyde-3-phosphate dehydrogenase (GAPDH), actin (ACT), β-tubulin (TUB) and elongation factor-1α (EF1α). Using RNAseq-derived data based on the calculated tagwise dispersion these genes were ordered by increasing stability as follows: GAPDH, ACT, TUB, and EF1α. The correlation between RNAseq and RTqPCR results was used as criteria to identify the best RT-qPCR normalization approach. The gene TUB is recommended as the best option for normalization among the commonly used reference genes, but alternative fungal reference genes are also suggested. Copyright © 2016 Elsevier B.V. All rights reserved.
Systematic identification and validation of candidate genes for detection of circulating tumor cells in peripheral blood specimens of colorectal cancer patients.

PubMed

Findeisen, Peter; Röckel, Matthias; Nees, Matthias; Röder, Christian; Kienle, Peter; Von Knebel Doeberitz, Magnus; Kalthoff, Holger; Neumaier, Michael

2008-11-01

The presence of tumor cells in peripheral blood is being regarded increasingly as a clinically relevant prognostic factor for colorectal cancer patients. Current molecular methods are very sensitive but due to low specificity their diagnostic value is limited. This study was undertaken in order to systematically identify and validate new colorectal cancer (CRC) marker genes for improved detection of minimal residual disease in peripheral blood mononuclear cells of colorectal cancer patients. Marker genes with upregulated gene expression in colorectal cancer tissue and cell lines were identified using microarray experiments and publicly available gene expression data. A systematic iterative approach was used to reduce a set of 346 candidate genes, reportedly associated with CRC to a selection of candidate genes that were then further validated by relative quantitative real-time RT-PCR. Analytical sensitivity of RT-PCR assays was determined by spiking experiments with CRC cells. Diagnostic sensitivity as well as specificity was tested on a control group consisting of 18 CRC patients compared to 12 individuals without malignant disease. From a total of 346-screened genes only serine (or cysteine) proteinase inhibitor, clade B (ovalbumin), member 5 (SERPINB5) showed significantly elevated transcript levels in peripheral venous blood specimens of tumor patients when compared to the nonmalignant control group. These results were confirmed by analysis of an enlarged collective consisting of 63 CRC patients and 36 control individuals without malignant disease. In conclusion SERPINB5 seems to be a promising marker for detection of circulating tumor cells in peripheral blood of colorectal cancer patients.
The Sinocyclocheilus cavefish genome provides insights into cave adaptation.

PubMed

Yang, Junxing; Chen, Xiaoli; Bai, Jie; Fang, Dongming; Qiu, Ying; Jiang, Wansheng; Yuan, Hui; Bian, Chao; Lu, Jiang; He, Shiyang; Pan, Xiaofu; Zhang, Yaolei; Wang, Xiaoai; You, Xinxin; Wang, Yongsi; Sun, Ying; Mao, Danqing; Liu, Yong; Fan, Guangyi; Zhang, He; Chen, Xiaoyong; Zhang, Xinhui; Zheng, Lanping; Wang, Jintu; Cheng, Le; Chen, Jieming; Ruan, Zhiqiang; Li, Jia; Yu, Hui; Peng, Chao; Ma, Xingyu; Xu, Junmin; He, You; Xu, Zhengfeng; Xu, Pao; Wang, Jian; Yang, Huanming; Wang, Jun; Whitten, Tony; Xu, Xun; Shi, Qiong

2016-01-04

An emerging cavefish model, the cyprinid genus Sinocyclocheilus, is endemic to the massive southwestern karst area adjacent to the Qinghai-Tibetan Plateau of China. In order to understand whether orogeny influenced the evolution of these species, and how genomes change under isolation, especially in subterranean habitats, we performed whole-genome sequencing and comparative analyses of three species in this genus, S. grahami, S. rhinocerous and S. anshuiensis. These species are surface-dwelling, semi-cave-dwelling and cave-restricted, respectively. The assembled genome sizes of S. grahami, S. rhinocerous and S. anshuiensis are 1.75 Gb, 1.73 Gb and 1.68 Gb, respectively. Divergence time and population history analyses of these species reveal that their speciation and population dynamics are correlated with the different stages of uplifting of the Qinghai-Tibetan Plateau. We carried out comparative analyses of these genomes and found that many genetic changes, such as gene loss (e.g. opsin genes), pseudogenes (e.g. crystallin genes), mutations (e.g. melanogenesis-related genes), deletions (e.g. scale-related genes) and down-regulation (e.g. circadian rhythm pathway genes), are possibly associated with the regressive features (such as eye degeneration, albinism, rudimentary scales and lack of circadian rhythms), and that some gene expansion (e.g. taste-related transcription factor gene) may point to the constructive features (such as enhanced taste buds) which evolved in these cave fishes. As the first report on cavefish genomes among distinct species in Sinocyclocheilus, our work provides not only insights into genetic mechanisms of cave adaptation, but also represents a fundamental resource for a better understanding of cavefish biology.
EqualTDRL: illustrating equivalent tandem duplication random loss rearrangements.

PubMed

Hartmann, Tom; Bernt, Matthias; Middendorf, Martin

2018-05-30

To study the differences between two unichromosomal circular genomes, e.g., mitochondrial genomes, under the tandem duplication random loss (TDRL) rearrangement it is important to consider the whole set of potential TDRL rearrangement events that could have taken place. The reason is that for two given circular gene orders there can exist different TDRL rearrangements that transform one of the gene orders into the other. Hence, a TDRL event cannot always be reconstructed only from the knowledge of the circular gene order before a TDRL event and the circular gene order after it. We present the program EqualTDRL that computes and illustrates the complete set of TDRLs for pairs of circular gene orders that differ by only one TDRL. EqualTDRL considers the circularity of the given genomes and certain restrictions on the TDRL rearrangements. Examples for the latter are sequences of genes that have to be conserved during a TDRL or pairs of genes that frame intergenic regions which might represent remnants of duplicated genes. Additionally, EqualTDRL allows to determine the set of TDRLs that are minimum with respect to the number of duplicated genes. EqualTDRL supports scientists to study the complete set of TDRLs that possibly could have taken place in the evolution of mitochondrial genomes. EqualTDRL is implemented in C++ using the ggplot2 package of the open source programming language R and is freely available from http://pacosy.informatik.uni-leipzig.de/equaltdrl .
The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes.

PubMed

Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai

2017-01-01

The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.

Multi-gene fluorescence in situ hybridization to detect cell cycle gene copy number aberrations in young breast cancer patients

PubMed Central

Li, Chunyan; Bai, Jingchao; Hao, Xiaomeng; Zhang, Sheng; Hu, Yunhui; Zhang, Xiaobei; Yuan, Weiping; Hu, Linping; Cheng, Tao; Zetterberg, Anders; Lee, Mong-Hong; Zhang, J

2014-01-01

Breast cancer is a disease of cell cycle, and the dysfunction of cell cycle checkpoints plays a vital role in the occurrence and development of breast cancer. We employed multi-gene fluorescence in situ hybridization (M-FISH) to investigate gene copy number aberrations (CNAs) of 4 genes (Rb1, CHEK2, c-Myc, CCND1) that are involved in the regulation of cell cycle, in order to analyze the impact of gene aberrations on prognosis in the young breast cancer patients. Gene copy number aberrations of these 4 genes were more frequently observed in young breast cancer patients when compared with the older group. Further, these CNAs were more frequently seen in Luminal B type, Her2 overexpression, and tiple-negative breast cancer (TNBC) type in young breast cancer patients. The variations of CCND1, Rb1, and CHEK2 were significantly correlated with poor survival in the young breast cancer patient group, while the amplification of c-Myc was not obviously correlated with poor survival in young breast cancer patients. Thus, gene copy number aberrations (CNAs) of cell cycle-regulated genes can serve as an important tool for prognosis in young breast cancer patients. PMID:24621502
Alu distribution and mutation types of cancer genes

PubMed Central

2011-01-01

Background Alu elements are the most abundant retrotransposable elements comprising ~11% of the human genome. Many studies have highlighted the role that Alu elements have in genetic instability and how their contribution to the assortment of mutagenic events can lead to cancer. As of yet, little has been done to quantitatively assess the association between Alu distribution and genes that are causally implicated in oncogenesis. Results We have investigated the effect of various Alu densities on the mutation type based classifications of cancer genes. In order to establish the direct relationship between Alus and the cancer genes of interest, genome wide Alu-related densities were measured using genes rather than the sliding windows of fixed length as the units. Several novel genomic features, such as the density of the adjacent Alu pairs and the number of Alu-Exon-Alu triplets, were developed in order to extend the investigation via the multivariate statistical analysis toward more advanced biological insight. In addition, we characterized the genome-wide intron Alu distribution with a mixture model that distinguished genes containing Alu elements from those with no Alus, and evaluated the gene-level effect of the 5'-TTAAAA motif associated with Alu insertion sites using a two-step regression analysis method. Conclusions The study resulted in several novel findings worthy of further investigation. They include: (1) Recessive cancer genes (tumor suppressor genes) are enriched with Alu elements (p < 0.01) compared to dominant cancer genes (oncogenes) and the entire set of genes in the human genome; (2) Alu-related genomic features can be used to cluster cancer genes into biological meaningful groups; (3) The retention of exon Alus has been restricted in the human genome development, and an upper limit to the chromosome-level exon Alu densities is suggested by the distribution profile; (4) For the genes with at least one intron Alu repeat in individual chromosomes, the intron Alu densities can be well fitted by a Gamma distribution; (5) The effect of the 5'-TTAAAA motif on Alu densities varies across different chromosomes. PMID:21429208
An in vivo and in silico approach to study cis-antisense: a short cut to higher order response

NASA Astrophysics Data System (ADS)

Courtney, Colleen; Varanasi, Usha; Chatterjee, Anushree

2014-03-01

Antisense interactions are present in all domains of life. Typically sense, antisense RNA pairs originate from overlapping genes with convergent face to face promoters, and are speculated to be involved in gene regulation. Recent studies indicate the role of transcriptional interference (TI) in regulating expression of genes in convergent orientation. Modeling antisense, TI gene regulation mechanisms allows us to understand how organisms control gene expression. We present a modeling and experimental framework to understand convergent transcription that combines the effects of transcriptional interference and cis-antisense regulation. Our model shows that combining transcriptional interference and antisense RNA interaction adds multiple-levels of regulation which affords a highly tunable biological output, ranging from first order response to complex higher-order response. To study this system we created a library of experimental constructs with engineered TI and antisense interaction by using face-to-face inducible promoters separated by carefully tailored overlapping DNA sequences to control expression of a set of fluorescent reporter proteins. Studying this gene expression mechanism allows for an understanding of higher order behavior of gene expression networks.
Simultaneous determination of androgenic and estrogenic endpoints in the threespine stickleback (Gasterosteus aculeatus) using quantitative RT-PCR.

PubMed

Hogan, Natacha S; Wartman, Cheryl A; Finley, Megan A; van der Lee, Jennifer G; van den Heuvel, Michael R

2008-12-11

A method to evaluate the expression of three hormone responsive genes, vitellogenin (estrogens), spiggin (androgens), and an androgen receptor (ARbeta) using real-time PCR in threespine stickleback is presented. Primers were designed from previously characterised spiggin and ARbeta sequences, while a homology cloning strategy was used to isolate a partial gene sequence for stickleback vitellogenin (Vtg). Spiggin mRNA was significantly higher in kidneys of field-caught males compared to females by greater than five orders of magnitude while ARbeta levels were only 1.4-fold higher in males. Female fish had four order of magnitude higher liver Vtg expression than wild-captured males. To determine the sensitivity of these genes to induction by hormones, male and female sticklebacks were exposed to 1, 10 and 100 ng/L of methyltestosterone (MT) or estradiol (E2) in a flow-through exposure system for 7 days. Spiggin induction in females, and Vtg induction in males were both detectable at 10 ng/L of MT and E2, respectively. MT exposure did not induce ARbeta expression in the kidneys of female stickleback. In vitro gonadal steroid hormones production was measured in testes and ovaries of exposed stickleback to compare gene expression endpoints to an endpoint of hormonal reproductive alteration. Reduction in testosterone production in ovaries at all three MT exposure concentrations, and ovarian estradiol synthesis at the 100 ng/L exposure were the only effects observed in the in vitro steroidogenesis for either hormone exposure. Application of these methods to assess both androgenic, estrogenic, and anti-steroidogenic properties of environmental contaminants in a single fish species will be a valuable tool for identifying compounds causing reproductive dysfunction in fishes.
Comparative analyses of plastid genomes from fourteen Cornales species: inferences for phylogenetic relationships and genome evolution.

PubMed

Fu, Chao-Nan; Li, Hong-Tao; Milne, Richard; Zhang, Ting; Ma, Peng-Fei; Yang, Jing; Li, De-Zhu; Gao, Lian-Ming

2017-12-08

The Cornales is the basal lineage of the asterids, the largest angiosperm clade. Phylogenetic relationships within the order were previously not fully resolved. Fifteen plastid genomes representing 14 species, ten genera and seven families of Cornales were newly sequenced for comparative analyses of genome features, evolution, and phylogenomics based on different partitioning schemes and filtering strategies. All plastomes of the 14 Cornales species had the typical quadripartite structure with a genome size ranging from 156,567 bp to 158,715 bp, which included two inverted repeats (25,859-26,451 bp) separated by a large single-copy region (86,089-87,835 bp) and a small single-copy region (18,250-18,856 bp) region. These plastomes encoded the same set of 114 unique genes including 31 transfer RNA, 4 ribosomal RNA and 79 coding genes, with an identical gene order across all examined Cornales species. Two genes (rpl22 and ycf15) contained premature stop codons in seven and five species respectively. The phylogenetic relationships among all sampled species were fully resolved with maximum support. Different filtering strategies (none, light and strict) of sequence alignment did not have an effect on these relationships. The topology recovered from coding and noncoding data sets was the same as for the whole plastome, regardless of filtering strategy. Moreover, mutational hotspots and highly informative regions were identified. Phylogenetic relationships among families and intergeneric relationships within family of Cornales were well resolved. Different filtering strategies and partitioning schemes do not influence the relationships. Plastid genomes have great potential to resolve deep phylogenetic relationships of plants.
Genomic Features of the Damselfly Calopteryx splendens Representing a Sister Clade to Most Insect Orders

PubMed Central

Ioannidis, Panagiotis; Simao, Felipe A.; Waterhouse, Robert M.; Manni, Mosè; Seppey, Mathieu; Robertson, Hugh M.; Misof, Bernhard; Niehuis, Oliver

2017-01-01

Insects comprise the most diverse and successful animal group with over one million described species that are found in almost every terrestrial and limnic habitat, with many being used as important models in genetics, ecology, and evolutionary research. Genome sequencing projects have greatly expanded the sampling of species from many insect orders, but genomic resources for species of certain insect lineages have remained relatively limited to date. To address this paucity, we sequenced the genome of the banded demoiselle, Calopteryx splendens, a damselfly (Odonata: Zygoptera) belonging to Palaeoptera, the clade containing the first winged insects. The 1.6 Gbp C. splendens draft genome assembly is one of the largest insect genomes sequenced to date and encodes a predicted set of 22,523 protein-coding genes. Comparative genomic analyses with other sequenced insects identified a relatively small repertoire of C. splendens detoxification genes, which could explain its previously noted sensitivity to habitat pollution. Intriguingly, this repertoire includes a cytochrome P450 gene not previously described in any insect genome. The C. splendens immune gene repertoire appears relatively complete and features several genes encoding novel multi-domain peptidoglycan recognition proteins. Analysis of chemosensory genes revealed the presence of both gustatory and ionotropic receptors, as well as the insect odorant receptor coreceptor gene (OrCo) and at least four partner odorant receptors (ORs). This represents the oldest known instance of a complete OrCo/OR system in insects, and provides the molecular underpinning for odonate olfaction. The C. splendens genome improves the sampling of insect lineages that diverged before the radiation of Holometabola and offers new opportunities for molecular-level evolutionary, ecological, and behavioral studies. PMID:28137743
DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions.

PubMed

El-Mogharbel, Nisrine; Wakefield, Matthew; Deakin, Janine E; Tsend-Ayush, Enkhjargal; Grützner, Frank; Alsop, Amber; Ezaz, Tariq; Marshall Graves, Jennifer A

2007-01-01

We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.
Complete Chloroplast Genome Sequences of Four Meliaceae Species and Comparative Analyses

PubMed Central

Mader, Malte; Pakull, Birte; Blanc-Jolivet, Céline; Paulini-Drewes, Maike; Bouda, Zoéwindé Henri-Noël; Degen, Bernd; Small, Ian

2018-01-01

The Meliaceae family mainly consists of trees and shrubs with a pantropical distribution. In this study, the complete chloroplast genomes of four Meliaceae species were sequenced and compared with each other and with the previously published Azadirachta indica plastome. The five plastomes are circular and exhibit a quadripartite structure with high conservation of gene content and order. They include 130 genes encoding 85 proteins, 37 tRNAs and 8 rRNAs. Inverted repeat expansion resulted in a duplication of rps19 in the five Meliaceae species, which is consistent with that in many other Sapindales, but different from many other rosids. Compared to Azadirachta indica, the four newly sequenced Meliaceae individuals share several large deletions, which mainly contribute to the decreased genome sizes. A whole-plastome phylogeny supports previous findings that the four species form a monophyletic sister clade to Azadirachta indica within the Meliaceae. SNPs and indels identified in all complete Meliaceae plastomes might be suitable targets for the future development of genetic markers at different taxonomic levels. The extended analysis of SNPs in the matK gene led to the identification of four potential Meliaceae-specific SNPs as a basis for future validation and marker development. PMID:29494509
Careful Selection of Reference Genes Is Required for Reliable Performance of RT-qPCR in Human Normal and Cancer Cell Lines

PubMed Central

Jacob, Francis; Guertler, Rea; Naim, Stephanie; Nixdorf, Sheri; Fedier, André; Hacker, Neville F.; Heinzelmann-Schwarz, Viola

2013-01-01

Reverse Transcription - quantitative Polymerase Chain Reaction (RT-qPCR) is a standard technique in most laboratories. The selection of reference genes is essential for data normalization and the selection of suitable reference genes remains critical. Our aim was to 1) review the literature since implementation of the MIQE guidelines in order to identify the degree of acceptance; 2) compare various algorithms in their expression stability; 3) identify a set of suitable and most reliable reference genes for a variety of human cancer cell lines. A PubMed database review was performed and publications since 2009 were selected. Twelve putative reference genes were profiled in normal and various cancer cell lines (n = 25) using 2-step RT-qPCR. Investigated reference genes were ranked according to their expression stability by five algorithms (geNorm, Normfinder, BestKeeper, comparative ΔCt, and RefFinder). Our review revealed 37 publications, with two thirds patient samples and one third cell lines. qPCR efficiency was given in 68.4% of all publications, but only 28.9% of all studies provided RNA/cDNA amount and standard curves. GeNorm and Normfinder algorithms were used in 60.5% in combination. In our selection of 25 cancer cell lines, we identified HSPCB, RRN18S, and RPS13 as the most stable expressed reference genes. In the subset of ovarian cancer cell lines, the reference genes were PPIA, RPS13 and SDHA, clearly demonstrating the necessity to select genes depending on the research focus. Moreover, a cohort of at least three suitable reference genes needs to be established in advance to the experiments, according to the guidelines. For establishing a set of reference genes for gene normalization we recommend the use of ideally three reference genes selected by at least three stability algorithms. The unfortunate lack of compliance to the MIQE guidelines reflects that these need to be further established in the research community. PMID:23554992
Common themes and cell type specific variations of higher order chromatin arrangements in the mouse

PubMed Central

Mayer, Robert; Brero, Alessandro; von Hase, Johann; Schroeder, Timm; Cremer, Thomas; Dietzel, Steffen

2005-01-01

Background Similarities as well as differences in higher order chromatin arrangements of human cell types were previously reported. For an evolutionary comparison, we now studied the arrangements of chromosome territories and centromere regions in six mouse cell types (lymphocytes, embryonic stem cells, macrophages, fibroblasts, myoblasts and myotubes) with fluorescence in situ hybridization and confocal laser scanning microscopy. Both species evolved pronounced differences in karyotypes after their last common ancestors lived about 87 million years ago and thus seem particularly suited to elucidate common and cell type specific themes of higher order chromatin arrangements in mammals. Results All mouse cell types showed non-random correlations of radial chromosome territory positions with gene density as well as with chromosome size. The distribution of chromosome territories and pericentromeric heterochromatin changed during differentiation, leading to distinct cell type specific distribution patterns. We exclude a strict dependence of these differences on nuclear shape. Positional differences in mouse cell nuclei were less pronounced compared to human cell nuclei in agreement with smaller differences in chromosome size and gene density. Notably, the position of chromosome territories relative to each other was very variable. Conclusion Chromosome territory arrangements according to chromosome size and gene density provide common, evolutionary conserved themes in both, human and mouse cell types. Our findings are incompatible with a previously reported model of parental genome separation. PMID:16336643
[Comparative analysis of methylation profiles in tissues of oral leukoplakia and oral squamous cell carcinoma].

PubMed

Fu, J; Su, Y; Liu, Y; Zhang, X Y

2018-04-09

Objective: To compare the methylation profiles in tissues of oral leukoplakia (OLK) and oral squamous cell carcinoma (OSCC) with healthy tissues of oral mucosa, in order to identify the role of DNA methylation played in tumorigenesis. Methods: DNA samples extracted from tissues of 4 healthy oral mucosa, 4 OSCC and 4 OLK collected from patients of the Department of Oral Medicine, Capital Medical University School of Stomatology were examined and compared using Methylation 450 Bead Chip. The genes associated with differentially methylated CpG sites were selected for gene ontology (GO) analysis and Kyoto encyclopedia of genes and genomes (KEGG) pathway enrichment. Results: Multiple differentially methylated CpG sites were identified by using the above mentioned assay. Hypermethylation constitutes 86.18% (23 290/27 025) of methylation changes in OLK and hypomethylation accounts for 13.82% (3 734/27 025) of methylation changes. Both hypermethylated and hypomethylated CpG sites were markedly increased in OSCC tissue compared with OLK tissue. The majority of differentially methylated CpG sites were located outside CpG islands, with approximately one-fourth in CpG shores flanking the islands, which were considered highly important for gene regulation and tumorigenesis. Pathway analysis revealed that differentially methylated CpG sites in both OLK and OSCC patients shared the same pathway enrichments, most of which were correlated with carcinogenesis and cancer progression (e.g., DNA repair, cell cycle, and apoptosis). Conclusions: In the present study, methylation-associated alterations affect almost all pathways in the cellular network in both OLK and OSCC. OLK and OSCC shared similar methylation changes whether in pathways or genes, indicating that epigenetically they might have the same molecular basis for disease progression.
Molecular Characterization and Expression Analysis of Creatine Kinase Muscle (CK-M) Gene in Horse.

PubMed

Do, Kyong-Tak; Cho, Hyun-Woo; Badrinath, Narayanasamy; Park, Jeong-Woong; Choi, Jae-Young; Chung, Young-Hwa; Lee, Hak-Kyo; Song, Ki-Duk; Cho, Byung-Wook

2015-12-01

Since ancient days, domestic horses have been closely associated with human civilization. Today, horse racing is an important industry. Various genes involved in energy production and muscle contraction are differentially regulated during a race. Among them, creatine kinase (CK) is well known for its regulation of energy preservation in animal cells. CK is an iso-enzyme, encoded by different genes and expressed in skeletal muscle, heart, brain and leucocytes. We confirmed that the expression of CK-M significantly increased in the blood after a 30 minute exercise period, while no considerable change was observed in skeletal muscle. Analysis of various tissues showed an ubiquitous expression of the CK-M gene in the horse; CK-M mRNA expression was predominant in the skeletal muscle and the cardiac muscle compared to other tissues. An evolutionary study by synonymous and non-synonymous single nucleotide polymorphism ratio of CK-M gene revealed a positive selection that was conserved in the horse. More studies are warranted in order to develop the expression of CK-M gene as a biomarker in blood of thoroughbred horses.
Identification and Evaluation of Reliable Reference Genes for Quantitative Real-Time PCR Analysis in Tea Plant (Camellia sinensis (L.) O. Kuntze)

PubMed Central

Hao, Xinyuan; Horvath, David P.; Chao, Wun S.; Yang, Yajun; Wang, Xinchao; Xiao, Bin

2014-01-01

Reliable reference selection for the accurate quantification of gene expression under various experimental conditions is a crucial step in qRT-PCR normalization. To date, only a few housekeeping genes have been identified and used as reference genes in tea plant. The validity of those reference genes are not clear since their expression stabilities have not been rigorously examined. To identify more appropriate reference genes for qRT-PCR studies on tea plant, we examined the expression stability of 11 candidate reference genes from three different sources: the orthologs of Arabidopsis traditional reference genes and stably expressed genes identified from whole-genome GeneChip studies, together with three housekeeping gene commonly used in tea plant research. We evaluated the transcript levels of these genes in 94 experimental samples. The expression stabilities of these 11 genes were ranked using four different computation programs including geNorm, Normfinder, BestKeeper, and the comparative ∆CT method. Results showed that the three commonly used housekeeping genes of CsTUBULIN1, CsACINT1 and Cs18S rRNA1 together with CsUBQ1 were the most unstable genes in all sample ranking order. However, CsPTB1, CsEF1, CsSAND1, CsCLATHRIN1 and CsUBC1 were the top five appropriate reference genes for qRT-PCR analysis in complex experimental conditions. PMID:25474086
Immunome differences between porcine ileal and jejunal Peyer's patches revealed by global transcriptome sequencing of gut-associated lymphoid tissues.

PubMed

Maroilley, T; Berri, M; Lemonnier, G; Esquerré, D; Chevaleyre, C; Mélo, S; Meurens, F; Coville, J L; Leplat, J J; Rau, A; Bed'hom, B; Vincent-Naulleau, S; Mercat, M J; Billon, Y; Lepage, P; Rogel-Gaillard, C; Estellé, J

2018-06-13

The epithelium of the intestinal mucosa and the gut-associated lymphoid tissues (GALT) constitute an essential physical and immunological barrier against pathogens. In order to study the specificities of the GALT transcriptome in pigs, we compared the transcriptome profiles of jejunal and ileal Peyer's patches (PPs), mesenteric lymph nodes (MLNs) and peripheral blood (PB) of four male piglets by RNA-Seq. We identified 1,103 differentially expressed (DE) genes between ileal PPs (IPPs) and jejunal PPs (JPPs), and six times more DE genes between PPs and MLNs. The master regulator genes FOXP3, GATA3, STAT4, TBX21 and RORC were less expressed in IPPs compared to JPPs, whereas the transcription factor BCL6 was found more expressed in IPPs. In comparison between IPPs and JPPs, our analyses revealed predominant differential expression related to the differentiation of T cells into Th1, Th2, Th17 and iTreg in JPPs. Our results were consistent with previous reports regarding a higher T/B cells ratio in JPPs compared to IPPs. We found antisense transcription for respectively 24%, 22% and 14% of the transcripts detected in MLNs, PPs and PB, and significant positive correlations between PB and GALT transcriptomes. Allele-specific expression analyses revealed both shared and tissue-specific cis-genetic control of gene expression.
Identifying spatially similar gene expression patterns in early stage fruit fly embryo images: binary feature versus invariant moment digital representations

PubMed Central

Gurunathan, Rajalakshmi; Van Emden, Bernard; Panchanathan, Sethuraman; Kumar, Sudhir

2004-01-01

Background Modern developmental biology relies heavily on the analysis of embryonic gene expression patterns. Investigators manually inspect hundreds or thousands of expression patterns to identify those that are spatially similar and to ultimately infer potential gene interactions. However, the rapid accumulation of gene expression pattern data over the last two decades, facilitated by high-throughput techniques, has produced a need for the development of efficient approaches for direct comparison of images, rather than their textual descriptions, to identify spatially similar expression patterns. Results The effectiveness of the Binary Feature Vector (BFV) and Invariant Moment Vector (IMV) based digital representations of the gene expression patterns in finding biologically meaningful patterns was compared for a small (226 images) and a large (1819 images) dataset. For each dataset, an ordered list of images, with respect to a query image, was generated to identify overlapping and similar gene expression patterns, in a manner comparable to what a developmental biologist might do. The results showed that the BFV representation consistently outperforms the IMV representation in finding biologically meaningful matches when spatial overlap of the gene expression pattern and the genes involved are considered. Furthermore, we explored the value of conducting image-content based searches in a dataset where individual expression components (or domains) of multi-domain expression patterns were also included separately. We found that this technique improves performance of both IMV and BFV based searches. Conclusions We conclude that the BFV representation consistently produces a more extensive and better list of biologically useful patterns than the IMV representation. The high quality of results obtained scales well as the search database becomes larger, which encourages efforts to build automated image query and retrieval systems for spatial gene expression patterns. PMID:15603586
Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool

PubMed Central

2013-01-01

Background System-wide profiling of genes and proteins in mammalian cells produce lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set libraries databases have been developed, there is still room for improvement. Results Here, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library, Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of up regulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interlukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios. Conclusions Enrichr is an easy to use intuitive enrichment analysis web-based tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr. PMID:23586463
Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool.

PubMed

Chen, Edward Y; Tan, Christopher M; Kou, Yan; Duan, Qiaonan; Wang, Zichen; Meirelles, Gabriela Vaz; Clark, Neil R; Ma'ayan, Avi

2013-04-15

System-wide profiling of genes and proteins in mammalian cells produce lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set libraries databases have been developed, there is still room for improvement. Here, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library, Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of up regulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interlukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios. Enrichr is an easy to use intuitive enrichment analysis web-based tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr.
Venom-Related Transcripts from Bothrops jararaca Tissues Provide Novel Molecular Insights into the Production and Evolution of Snake Venom

PubMed Central

Junqueira-de-Azevedo, Inácio L.M.; Bastos, Carolina Mancini Val; Ho, Paulo Lee; Luna, Milene Schmidt; Yamanouye, Norma; Casewell, Nicholas R.

2015-01-01

Attempts to reconstruct the evolutionary history of snake toxins in the context of their co-option to the venom gland rarely account for nonvenom snake genes that are paralogous to toxins, and which therefore represent important connectors to ancestral genes. In order to reevaluate this process, we conducted a comparative transcriptomic survey on body tissues from a venomous snake. A nonredundant set of 33,000 unigenes (assembled transcripts of reference genes) was independently assembled from six organs of the medically important viperid snake Bothrops jararaca, providing a reference list of 82 full-length toxins from the venom gland and specific products from other tissues, such as pancreatic digestive enzymes. Unigenes were then screened for nontoxin transcripts paralogous to toxins revealing 1) low level coexpression of approximately 20% of toxin genes (e.g., bradykinin-potentiating peptide, C-type lectin, snake venom metalloproteinase, snake venom nerve growth factor) in body tissues, 2) the identity of the closest paralogs to toxin genes in eight classes of toxins, 3) the location and level of paralog expression, indicating that, in general, co-expression occurs in a higher number of tissues and at lower levels than observed for toxin genes, and 4) strong evidence of a toxin gene reverting back to selective expression in a body tissue. In addition, our differential gene expression analyses identify specific cellular processes that make the venom gland a highly specialized secretory tissue. Our results demonstrate that the evolution and production of venom in snakes is a complex process that can only be understood in the context of comparative data from other snake tissues, including the identification of genes paralogous to venom toxins. PMID:25502939
Evolution and phylogeny of the mud shrimps (Crustacea: Decapoda) revealed from complete mitochondrial genomes.

PubMed

Lin, Feng-Jiau; Liu, Yuan; Sha, Zhongli; Tsang, Ling Ming; Chu, Ka Hou; Chan, Tin-Yam; Liu, Ruiyu; Cui, Zhaoxia

2012-11-16

The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophiles) share unique gene order specific to their infraorders and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genome from more taxa, in particular the reptant infraorders Polychelida and Glypheidea is required in further analysis. Phylogenetic analyses on the mt genome sequences and the distinct gene orders provide further evidences for the divergence between the two mud shrimp infraorders, Gebiidea and Axiidea, corroborating previous molecular phylogeny and justifying their infraordinal status. Mitochondrial genome sequences appear to be promising markers for resolving phylogenetic issues concerning decapod crustaceans that warrant further investigations and our present study has also provided further information concerning the mt genome evolution of the Decapoda.
Evolution and phylogeny of the mud shrimps (Crustacea: Decapoda) revealed from complete mitochondrial genomes

PubMed Central

2012-01-01

Background The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. Results We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophiles) share unique gene order specific to their infraorders and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genome from more taxa, in particular the reptant infraorders Polychelida and Glypheidea is required in further analysis. Conclusions Phylogenetic analyses on the mt genome sequences and the distinct gene orders provide further evidences for the divergence between the two mud shrimp infraorders, Gebiidea and Axiidea, corroborating previous molecular phylogeny and justifying their infraordinal status. Mitochondrial genome sequences appear to be promising markers for resolving phylogenetic issues concerning decapod crustaceans that warrant further investigations and our present study has also provided further information concerning the mt genome evolution of the Decapoda. PMID:23153176

The complete mitochondrial genome of the stomatopod crustacean Squilla mantis

PubMed Central

Cook, Charles E

2005-01-01

Background Animal mitochondrial genomes are physically separate from the much larger nuclear genomes and have proven useful both for phylogenetic studies and for understanding genome evolution. Within the phylum Arthropoda the subphylum Crustacea includes over 50,000 named species with immense variation in body plans and habitats, yet only 23 complete mitochondrial genomes are available from this subphylum. Results I describe here the complete mitochondrial genome of the crustacean Squilla mantis (Crustacea: Malacostraca: Stomatopoda). This 15994-nucleotide genome, the first described from a hoplocarid, contains the standard complement of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a non-coding AT-rich region that is found in most other metazoans. The gene order is identical to that considered ancestral for hexapods and crustaceans. The 70% AT base composition is within the range described for other arthropods. A single unusual feature of the genome is a 230 nucleotide non-coding region between a serine transfer RNA and the nad1 gene, which has no apparent function. I also compare gene order, nucleotide composition, and codon usage of the S. mantis genome and eight other malacostracan crustaceans. A translocation of the histidine transfer RNA gene is shared by three taxa in the order Decapoda, infraorder Brachyura; Callinectes sapidus, Portunus trituberculatus and Pseudocarcinus gigas. This translocation may be diagnostic for the Brachyura. For all nine taxa nucleotide composition is biased towards AT-richness, as expected for arthropods, and is within the range reported for other arthropods. Codon usage is biased, and much of this bias is probably due to the skew in nucleotide composition towards AT-richness. Conclusion The mitochondrial genome of Squilla mantis contains one unusual feature, a 230 base pair non-coding region has so far not been described in any other malacostracan. Comparisons with other Malacostraca show that all nine genomes, like most other mitochondrial genomes, share a bias toward AT-richness and a related bias in codon usage. The nine malacostracans included in this analysis are not representative of the diversity of the class Malacostraca, and additional malacostracan sequences would surely reveal other unusual genomic features that could be useful in understanding mitochondrial evolution in this taxon. PMID:16091132
Transcriptome profiling of Pinus radiata juvenile wood with contrasting stiffness identifies putative candidate genes involved in microfibril orientation and cell wall mechanics

PubMed Central

2011-01-01

Background The mechanical properties of wood are largely determined by the orientation of cellulose microfibrils in secondary cell walls. Several genes and their allelic variants have previously been found to affect microfibril angle (MFA) and wood stiffness; however, the molecular mechanisms controlling microfibril orientation and mechanical strength are largely uncharacterised. In the present study, cDNA microarrays were used to compare gene expression in developing xylem with contrasting stiffness and MFA in juvenile Pinus radiata trees in order to gain further insights into the molecular mechanisms underlying microfibril orientation and cell wall mechanics. Results Juvenile radiata pine trees with higher stiffness (HS) had lower MFA in the earlywood and latewood of each ring compared to low stiffness (LS) trees. Approximately 3.4 to 14.5% out of 3, 320 xylem unigenes on cDNA microarrays were differentially regulated in juvenile wood with contrasting stiffness and MFA. Greater variation in MFA and stiffness was observed in earlywood compared to latewood, suggesting earlywood contributes most to differences in stiffness; however, 3-4 times more genes were differentially regulated in latewood than in earlywood. A total of 108 xylem unigenes were differentially regulated in juvenile wood with HS and LS in at least two seasons, including 43 unigenes with unknown functions. Many genes involved in cytoskeleton development and secondary wall formation (cellulose and lignin biosynthesis) were preferentially transcribed in wood with HS and low MFA. In contrast, several genes involved in cell division and primary wall synthesis were more abundantly transcribed in LS wood with high MFA. Conclusions Microarray expression profiles in Pinus radiata juvenile wood with contrasting stiffness has shed more light on the transcriptional control of microfibril orientation and the mechanical properties of wood. The identified candidate genes provide an invaluable resource for further gene function and association genetics studies aimed at deepening our understanding of cell wall biomechanics with a view to improving the mechanical properties of wood. PMID:21962175
Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution.

PubMed

Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A

2015-06-08

The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The Genetics of Symbiotic Nitrogen Fixation: Comparative Genomics of 14 Rhizobia Strains by Resolution of Protein Clusters

PubMed Central

Black, Michael; Moolhuijzen, Paula; Chapman, Brett; Barrero, Roberto; Howieson, John; Hungria, Mariangela; Bellgard, Matthew

2012-01-01

The symbiotic relationship between legumes and nitrogen fixing bacteria is critical for agriculture, as it may have profound impacts on lowering costs for farmers, on land sustainability, on soil quality, and on mitigation of greenhouse gas emissions. However, despite the importance of the symbioses to the global nitrogen cycling balance, very few rhizobial genomes have been sequenced so far, although there are some ongoing efforts in sequencing elite strains. In this study, the genomes of fourteen selected strains of the order Rhizobiales, all previously fully sequenced and annotated, were compared to assess differences between the strains and to investigate the feasibility of defining a core ‘symbiome’—the essential genes required by all rhizobia for nodulation and nitrogen fixation. Comparison of these whole genomes has revealed valuable information, such as several events of lateral gene transfer, particularly in the symbiotic plasmids and genomic islands that have contributed to a better understanding of the evolution of contrasting symbioses. Unique genes were also identified, as well as omissions of symbiotic genes that were expected to be found. Protein comparisons have also allowed the identification of a variety of similarities and differences in several groups of genes, including those involved in nodulation, nitrogen fixation, production of exopolysaccharides, Type I to Type VI secretion systems, among others, and identifying some key genes that could be related to host specificity and/or a better saprophytic ability. However, while several significant differences in the type and number of proteins were observed, the evidence presented suggests no simple core symbiome exists. A more abstract systems biology concept of nitrogen fixing symbiosis may be required. The results have also highlighted that comparative genomics represents a valuable tool for capturing specificities and generalities of each genome. PMID:24704847
Mitochondrial genomes of Meloidogyne chitwoodi and M. incognita (Nematoda: Tylenchina): comparative analysis, gene order and phylogenetic relationships with other nematodes.

PubMed

Humphreys-Pereira, Danny A; Elling, Axel A

2014-01-01

Root-knot nematodes (Meloidogyne spp.) are among the most important plant pathogens. In this study, the mitochondrial (mt) genomes of the root-knot nematodes, M. chitwoodi and M. incognita were sequenced. PCR analyses suggest that both mt genomes are circular, with an estimated size of 19.7 and 18.6-19.1kb, respectively. The mt genomes each contain a large non-coding region with tandem repeats and the control region. The mt gene arrangement of M. chitwoodi and M. incognita is unlike that of other nematodes. Sequence alignments of the two Meloidogyne mt genomes showed three translocations; two in transfer RNAs and one in cox2. Compared with other nematode mt genomes, the gene arrangement of M. chitwoodi and M. incognita was most similar to Pratylenchus vulnus. Phylogenetic analyses (Maximum Likelihood and Bayesian inference) were conducted using 78 complete mt genomes of diverse nematode species. Analyses based on nucleotides and amino acids of the 12 protein-coding mt genes showed strong support for the monophyly of class Chromadorea, but only amino acid-based analyses supported the monophyly of class Enoplea. The suborder Spirurina was not monophyletic in any of the phylogenetic analyses, contradicting the Clade III model, which groups Ascaridomorpha, Spiruromorpha and Oxyuridomorpha based on the small subunit ribosomal RNA gene. Importantly, comparisons of mt gene arrangement and tree-based methods placed Meloidogyne as sister taxa of Pratylenchus, a migratory plant endoparasitic nematode, and not with the sedentary endoparasitic Heterodera. Thus, comparative analyses of mt genomes suggest that sedentary endoparasitism in Meloidogyne and Heterodera is based on convergent evolution. Copyright © 2014 Elsevier B.V. All rights reserved.
Moraxella osloensis gene expression in the slug host Deroceras reticulatum.

PubMed

An, Ruisheng; Sreevatsan, Srinand; Grewal, Parwinder S

2008-01-28

The bacterium Moraxella osloensis is a mutualistic symbiont of the slug-parasitic nematode Phasmarhabditis hermaphrodita. In nature, P. hermaphrodita vectors M. osloensis into the shell cavity of the slug host Deroceras reticulatum in which the bacteria multiply and kill the slug. As M. osloensis is the main killing agent, genes expressed by M. osloensis in the slug are likely to play important roles in virulence. Studies on pathogenic interactions between bacteria and lower order hosts are few, but such studies have the potential to shed light on the evolution of bacterial virulence. Therefore, we investigated such an interaction by determining gene expression of M. osloensis in its slug host D. reticulatum by selectively capturing transcribed sequences. Thirteen M. osloensis genes were identified to be up-regulated post infection in D. reticulatum. Compared to the in vitro expressed genes in the stationary phase, we found that genes of ubiquinone synthetase (ubiS) and acyl-coA synthetase (acs) were up-regulated in both D. reticulatum and stationary phase in vitro cultures, but the remaining 11 genes were exclusively expressed in D. reticulatum and are hence infection specific. Mutational analysis on genes of protein-disulfide isomerase (dsbC) and ubiS showed that the virulence of both mutants to slugs was markedly reduced and could be complemented. Further, compared to the growth rate of wild-type M. osloensis, the dsbC and ubiS mutants showed normal and reduced growth rate in vitro, respectively. We conclude that 11 out of the 13 up-regulated M. osloensis genes are infection specific. Distribution of these identified genes in various bacterial pathogens indicates that the virulence genes are conserved among different pathogen-host interactions. Mutagenesis, growth rate and virulence bioassays further confirmed that ubiS and dsbC genes play important roles in M. osloensis survival and virulence, respectively in D. reticulatum.
Moraxella osloensis Gene Expression in the Slug Host Deroceras reticulatum

PubMed Central

An, Ruisheng; Sreevatsan, Srinand; Grewal, Parwinder S

2008-01-01

Background The bacterium Moraxella osloensis is a mutualistic symbiont of the slug-parasitic nematode Phasmarhabditis hermaphrodita. In nature, P. hermaphrodita vectors M. osloensis into the shell cavity of the slug host Deroceras reticulatum in which the bacteria multiply and kill the slug. As M. osloensis is the main killing agent, genes expressed by M. osloensis in the slug are likely to play important roles in virulence. Studies on pathogenic interactions between bacteria and lower order hosts are few, but such studies have the potential to shed light on the evolution of bacterial virulence. Therefore, we investigated such an interaction by determining gene expression of M. osloensis in its slug host D. reticulatum by selectively capturing transcribed sequences. Results Thirteen M. osloensis genes were identified to be up-regulated post infection in D. reticulatum. Compared to the in vitro expressed genes in the stationary phase, we found that genes of ubiquinone synthetase (ubiS) and acyl-coA synthetase (acs) were up-regulated in both D. reticulatum and stationary phase in vitro cultures, but the remaining 11 genes were exclusively expressed in D. reticulatum and are hence infection specific. Mutational analysis on genes of protein-disulfide isomerase (dsbC) and ubiS showed that the virulence of both mutants to slugs was markedly reduced and could be complemented. Further, compared to the growth rate of wild-type M. osloensis, the dsbC and ubiS mutants showed normal and reduced growth rate in vitro, respectively. Conclusion We conclude that 11 out of the 13 up-regulated M. osloensis genes are infection specific. Distribution of these identified genes in various bacterial pathogens indicates that the virulence genes are conserved among different pathogen-host interactions. Mutagenesis, growth rate and virulence bioassays further confirmed that ubiS and dsbC genes play important roles in M. osloensis survival and virulence, respectively in D. reticulatum. PMID:18226222
Constitutional downregulation of SEMA5A expression in autism.

PubMed

Melin, M; Carlsson, B; Anckarsater, H; Rastam, M; Betancur, C; Isaksson, A; Gillberg, C; Dahl, N

2006-01-01

There is strong evidence for the importance of genetic factors in idiopathic autism. The results from independent twin and family studies suggest that the disorder is caused by the action of several genes, possibly acting epistatically. We have used cDNA microarray technology for the identification of constitutional changes in the gene expression profile associated with idiopathic autism. Samples were obtained and analyzed from 6 affected subjects belonging to multiplex autism families and from 6 healthy controls. We assessed the expression levels for approximately 7,700 genes by cDNA microarrays using mRNA derived from Epstein-Barr virus-transformed B lymphocytes. The microarray data were analyzed in order to identify up- or downregulation of specific genes. A common pattern with nine downregulated genes was identified among samples derived from individuals with autism when compared to controls. Four of these nine genes encode proteins involved in biological processes associated with brain function or the immune system, and are consequently considered as candidates for genes associated with autism. Quantitative real-time PCR confirms the downregulation of the gene encoding SEMA5A, a protein involved in axonal guidance. Epstein-Barr virus should be considered as a possible source for altered expression, but our consistent results make us suggest SEMA5A as a candidate gene in the etiology of idiopathic autism.
Constitutional downregulation of SEMA5A expression in autism

PubMed Central

Melin, Malin; Carlsson, Birgit; Anckarsäter, Henrik; Rastam, Maria; Betancur, Catalina; Isaksson, Anders; Gillberg, Christopher; Dahl, Niklas

2006-01-01

There is strong evidence for the importance of genetic factors in idiopathic autism. The results from independent twin and family studies suggest that the disorder is caused by the action of several genes, possibly acting epistatically. We have used cDNA microarray technology for the identification of constitutional changes in the gene expression profile associated with idiopathic autism. Samples were obtained and analyzed from six affected subjects belonging to multiplex autism families and from six healthy controls. We assessed the expression levels for approximately 7,700 genes by cDNA microarrays using mRNA derived from Epstein Barr virus (EBV)-transformed B-lymphocytes. The microarray data was analyzed in order to identify up- or down-regulation of specific genes. A common pattern with nine down-regulated genes was identified among samples derived from individuals with autism when compared to controls. Four of these nine genes encode proteins involved in biological processes associated with brain function or the immune system, and are consequently considered as candidates for genes associated with autism. Quantitative realtime PCR confirms the down-regulation of the gene encoding SEMA5A, a protein involved in axonal guidance. EBV should be considered as a possible source for altered expression but our consistent results make us suggest SEMA5A a candidate gene in the etiology of idiopathic autism. PMID:17028446
Mitochondrial genome of Pteronotus personatus (Chiroptera: Mormoopidae): comparison with selected bats and phylogenetic considerations.

PubMed

López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel

2017-02-01

We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropopidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.
Differential translation efficiency of orthologous genes is involved in phenotypic divergence of yeast species.

PubMed

Man, Orna; Pilpel, Yitzhak

2007-03-01

A major challenge in comparative genomics is to understand how phenotypic differences between species are encoded in their genomes. Phenotypic divergence may result from differential transcription of orthologous genes, yet less is known about the involvement of differential translation regulation in species phenotypic divergence. In order to assess translation effects on divergence, we analyzed approximately 2,800 orthologous genes in nine yeast genomes. For each gene in each species, we predicted translation efficiency, using a measure of the adaptation of its codons to the organism's tRNA pool. Mining this data set, we found hundreds of genes and gene modules with correlated patterns of translational efficiency across the species. One signal encompassed entire modules that are either needed for oxidative respiration or fermentation and are efficiently translated in aerobic or anaerobic species, respectively. In addition, the efficiency of translation of the mRNA splicing machinery strongly correlates with the number of introns in the various genomes. Altogether, we found extensive selection on synonymous codon usage that modulates translation according to gene function and organism phenotype. We conclude that, like factors such as transcription regulation, translation efficiency affects and is affected by the process of species divergence.
A comparison of Agrobacterium-mediated transformation and protoplast-mediated transformation with CRISPR-Cas9 and bipartite gene targeting substrates, as effective gene targeting tools for Aspergillus carbonarius.

PubMed

Weyda, István; Yang, Lei; Vang, Jesper; Ahring, Birgitte K; Lübeck, Mette; Lübeck, Peter S

2017-04-01

In recent years, versatile genetic tools have been developed and applied to a number of filamentous fungi of industrial importance. However, the existing techniques have limitations when it comes to achieve the desired genetic modifications, especially for efficient gene targeting. In this study, we used Aspergillus carbonarius as a host strain due to its potential as a cell factory, and compared three gene targeting techniques by disrupting the ayg1 gene involved in the biosynthesis of conidial pigment in A. carbonarius. The absence of the ayg1 gene leads to phenotypic change in conidia color, which facilitated the analysis on the gene targeting frequency. The examined transformation techniques included Agrobacterium-mediated transformation (AMT) and protoplast-mediated transformation (PMT). Furthermore, the PMT for the disruption of the ayg1 gene was carried out with bipartite gene targeting fragments and the recently adapted CRISPR-Cas9 system. All three techniques were successful in generating Δayg1 mutants, but showed different efficiencies. The most efficient method for gene targeting was AMT, but further it was shown to be dependent on the choice of Agrobacterium strain. However, there are different advantages and disadvantages of all three gene targeting methods which are discussed, in order to facilitate future approaches for fungal strain improvements. Copyright © 2017 Elsevier B.V. All rights reserved.
Characterization of the complete mitochondrial genome of the storage mite pest Tyrophagus longior (Gervais) (Acari: Acaridae) and comparative mitogenomic analysis of four acarid mites.

PubMed

Yang, Banghe; Li, Chaopin

2016-02-01

Mites of the genus Tyrophagus are economically important polyphagous pest commonly living on stored products and also responsible for allergic reactions to humans. Complete mitochondrial genomes (mitogenomes) and the gene features therein are widely used as molecular markers in the study of population genetics, phylogenetics as well as molecular evolution. However, scarcity on the sequence data has greatly impeded the studies in these areas pertaining to the Acari (mites and ticks). Information on the Tyrophagus mitogenomes is quite critical for phylogenetic evaluation and molecular evolution of the mitogenomes within Acariformes. Herein, we reported the complete mitogenome of the allergenic acarid storage mite Tyrophagus longior (Astigmata: Acaridae), an important member of stored food pests, and compared with those of other three acarid mites. The complete mitogenome of T. longior was a circular molecule of 13,271 bp. Unexpectedly, only 19 transfer RNA genes (tRNAs) were present, lacking trnF, trnS1 and trnQ. Furthermore, it also contained 13 protein-coding genes (PCGs) and 2 genes for rRNA (rrnS and rrnL) commonly detected in metazoans. The four mitogenomes displayed similar characteristics with respect to the gene content, nucleotide comparison, and codon usages. Yet, the gene order of T. longior was different from that in other Acari. The J-strands of the four mitogenomes possessed high A+T content (67.4-70.0%), and exhibited positive GC-skews and negative AT-skews. Most inferred tRNAs of T. longior were extremely truncated, lacking either a D- or T-arm, as found in other acarid mites. In T. longior mitogenome the A+T-rich region was just 50 bp in length and can be folded as a stable stem-loop structure, whereas in the region some structures of microsatellite-like (AT)n and palindromic sequences was not present. Besides, reconstructing of the phylogenetic relationship based on concatenated amino acid sequences of 13 PCGs supported that monophyly of the family Acaridae and the order Astigmata, to which the former belongs. Our results were consistent with the traditional classifications. Copyright © 2015 Elsevier B.V. All rights reserved.
Horizontal gene transfer in silkworm, Bombyx mori

PubMed Central

2011-01-01

Background The domesticated silkworm, Bombyx mori, is the model insect for the order Lepidoptera, has economically important values, and has gained some representative behavioral characteristics compared to its wild ancestor. The genome of B. mori has been fully sequenced while function analysis of BmChi-h and BmSuc1 genes revealed that horizontal gene transfer (HGT) maybe bestow a clear selective advantage to B. mori. However, the role of HGT in the evolutionary history of B. mori is largely unexplored. In this study, we compare the whole genome of B. mori with those of 382 prokaryotic and eukaryotic species to investigate the potential HGTs. Results Ten candidate HGT events were defined in B. mori by comprehensive sequence analysis using Maximum Likelihood and Bayesian method combining with EST checking. Phylogenetic analysis of the candidate HGT genes suggested that one HGT was plant-to- B. mori transfer while nine were bacteria-to- B. mori transfer. Furthermore, functional analysis based on expression, coexpression and related literature searching revealed that several HGT candidate genes have added important characters, such as resistance to pathogen, to B. mori. Conclusions Results from this study clearly demonstrated that HGTs play an important role in the evolution of B. mori although the number of HGT events in B. mori is in general smaller than those of microbes and other insects. In particular, interdomain HGTs in B. mori may give rise to functional, persistent, and possibly evolutionarily significant new genes. PMID:21595916
Robust transcriptional tumor signatures applicable to both formalin-fixed paraffin-embedded and fresh-frozen samples

PubMed Central

Cheng, Jun; He, Jun; Liu, Huaping; Cai, Hao; Hong, Guini; Zhang, Jiahui; Li, Na; Ao, Lu; Guo, Zheng

2017-01-01

Formalin-fixed paraffin-embedded (FFPE) samples represent a valuable resource for clinical researches. However, FFPE samples are usually considered an unreliable source for gene expression analysis due to the partial RNA degradation. In this study, through comparing gene expression profiles between FFPE samples and paired fresh-frozen (FF) samples for three cancer types, we firstly showed that expression measurements of thousands of genes had at least two-fold change in FFPE samples compared with paired FF samples. Therefore, for a transcriptional signature based on risk scores summarized from the expression levels of the signature genes, the risk score thresholds trained from FFPE (or FF) samples could not be applied to FF (or FFPE) samples. On the other hand, we found that more than 90% of the relative expression orderings (REOs) of gene pairs in the FF samples were maintained in their paired FFPE samples and largely unaffected by the storage time. The result suggested that the REOs of gene pairs were highly robust against partial RNA degradation in FFPE samples. Finally, as a case study, we developed a REOs-based signature to distinguish liver cirrhosis from hepatocellular carcinoma (HCC) using FFPE samples. The signature was validated in four datasets of FFPE samples and eight datasets of FF samples. In conclusion, the valuable FFPE samples can be fully exploited to identify REOs-based diagnostic and prognostic signatures which could be robustly applicable to both FF samples and FFPE samples with degraded RNA. PMID:28036264
Genome wide analysis of the transition to pathogenic lifestyles in Magnaporthales fungi.

PubMed

Zhang, Ning; Cai, Guohong; Price, Dana C; Crouch, Jo Anne; Gladieux, Pierre; Hillman, Bradley; Khang, Chang Hyun; LeBrun, Marc-Henri; Lee, Yong-Hwan; Luo, Jing; Qiu, Huan; Veltri, Daniel; Wisecaver, Jennifer H; Zhu, Jie; Bhattacharya, Debashish

2018-04-12

The rice blast fungus Pyricularia oryzae (syn. Magnaporthe oryzae, Magnaporthe grisea), a member of the order Magnaporthales in the class Sordariomycetes, is an important plant pathogen and a model species for studying pathogen infection and plant-fungal interaction. In this study, we generated genome sequence data from five additional Magnaporthales fungi including non-pathogenic species, and performed comparative genome analysis of a total of 13 fungal species in the class Sordariomycetes to understand the evolutionary history of the Magnaporthales and of fungal pathogenesis. Our results suggest that the Magnaporthales diverged ca. 31 millon years ago from other Sordariomycetes, with the phytopathogenic blast clade diverging ca. 21 million years ago. Little evidence of inter-phylum horizontal gene transfer (HGT) was detected in Magnaporthales. In contrast, many genes underwent positive selection in this order and the majority of these sequences are clade-specific. The blast clade genomes contain more secretome and avirulence effector genes, which likely play key roles in the interaction between Pyricularia species and their plant hosts. Finally, analysis of transposable elements (TE) showed differing proportions of TE classes among Magnaporthales genomes, suggesting that species-specific patterns may hold clues to the history of host/environmental adaptation in these fungi.
Complete mitochondrial DNA genome of bonnethead shark, Sphyrna tiburo, and phylogenetic relationships among main superorders of modern elasmobranchs

PubMed Central

Díaz-Jaimes, Píndaro; Bayona-Vásquez, Natalia J.; Adams, Douglas H.; Uribe-Alcocer, Manuel

2015-01-01

Elasmobranchs are one of the most diverse groups in the marine realm represented by 18 orders, 55 families and about 1200 species reported, but also one of the most vulnerable to exploitation and to climate change. Phylogenetic relationships among main orders have been controversial since the emergence of the Hypnosqualean hypothesis by Shirai (1992) that considered batoids as a sister group of sharks. The use of the complete mitochondrial DNA (mtDNA) may shed light to further validate this hypothesis by increasing the number of informative characters. We report the mtDNA genome of the bonnethead shark Sphyrna tiburo, and compare it with mitogenomes of other 48 species to assess phylogenetic relationships. The mtDNA genome of S. tiburo, is quite similar in size to that of congeneric species but also similar to the reported mtDNA genome of other Carcharhinidae species. Like most vertebrate mitochondrial genomes, it contained 13 protein coding genes, two rRNA genes and 22 tRNA genes and the control region of 1086 bp (D-loop). The Bayesian analysis of the 49 mitogenomes supported the view that sharks and batoids are separate groups. PMID:27014583
De novo assembly and characterization of the transcriptome of the parasitic weed dodder identifies genes associated with plant parasitism.

PubMed

Ranjan, Aashish; Ichihashi, Yasunori; Farhi, Moran; Zumstein, Kristina; Townsley, Brad; David-Schwartz, Rakefet; Sinha, Neelima R

2014-11-01

Parasitic flowering plants are one of the most destructive agricultural pests and have major impact on crop yields throughout the world. Being dependent on finding a host plant for growth, parasitic plants penetrate their host using specialized organs called haustoria. Haustoria establish vascular connections with the host, which enable the parasite to steal nutrients and water. The underlying molecular and developmental basis of parasitism by plants is largely unknown. In order to investigate the process of parasitism, RNAs from different stages (i.e. seed, seedling, vegetative strand, prehaustoria, haustoria, and flower) were used to de novo assemble and annotate the transcriptome of the obligate plant stem parasite dodder (Cuscuta pentagona). The assembled transcriptome was used to dissect transcriptional dynamics during dodder development and parasitism and identified key gene categories involved in the process of plant parasitism. Host plant infection is accompanied by increased expression of parasite genes underlying transport and transporter categories, response to stress and stimuli, as well as genes encoding enzymes involved in cell wall modifications. By contrast, expression of photosynthetic genes is decreased in the dodder infective stages compared with normal stem. In addition, genes relating to biosynthesis, transport, and response of phytohormones, such as auxin, gibberellins, and strigolactone, were differentially expressed in the dodder infective stages compared with stems and seedlings. This analysis sheds light on the transcriptional changes that accompany plant parasitism and will aid in identifying potential gene targets for use in controlling the infestation of crops by parasitic weeds. © 2014 American Society of Plant Biologists. All Rights Reserved.
De Novo Assembly and Characterization of the Transcriptome of the Parasitic Weed Dodder Identifies Genes Associated with Plant Parasitism1[C][W][OPEN

PubMed Central

Ranjan, Aashish; Ichihashi, Yasunori; Farhi, Moran; Zumstein, Kristina; Townsley, Brad; David-Schwartz, Rakefet; Sinha, Neelima R.

2014-01-01

Parasitic flowering plants are one of the most destructive agricultural pests and have major impact on crop yields throughout the world. Being dependent on finding a host plant for growth, parasitic plants penetrate their host using specialized organs called haustoria. Haustoria establish vascular connections with the host, which enable the parasite to steal nutrients and water. The underlying molecular and developmental basis of parasitism by plants is largely unknown. In order to investigate the process of parasitism, RNAs from different stages (i.e. seed, seedling, vegetative strand, prehaustoria, haustoria, and flower) were used to de novo assemble and annotate the transcriptome of the obligate plant stem parasite dodder (Cuscuta pentagona). The assembled transcriptome was used to dissect transcriptional dynamics during dodder development and parasitism and identified key gene categories involved in the process of plant parasitism. Host plant infection is accompanied by increased expression of parasite genes underlying transport and transporter categories, response to stress and stimuli, as well as genes encoding enzymes involved in cell wall modifications. By contrast, expression of photosynthetic genes is decreased in the dodder infective stages compared with normal stem. In addition, genes relating to biosynthesis, transport, and response of phytohormones, such as auxin, gibberellins, and strigolactone, were differentially expressed in the dodder infective stages compared with stems and seedlings. This analysis sheds light on the transcriptional changes that accompany plant parasitism and will aid in identifying potential gene targets for use in controlling the infestation of crops by parasitic weeds. PMID:24399359
Taxonomic resolutions based on 18S rRNA genes: a case study of subclass copepoda.

PubMed

Wu, Shu; Xiong, Jie; Yu, Yuhe

2015-01-01

Biodiversity studies are commonly conducted using 18S rRNA genes. In this study, we compared the inter-species divergence of variable regions (V1-9) within the copepod 18S rRNA gene, and tested their taxonomic resolutions at different taxonomic levels. Our results indicate that the 18S rRNA gene is a good molecular marker for the study of copepod biodiversity, and our conclusions are as follows: 1) 18S rRNA genes are highly conserved intra-species (intra-species similarities are close to 100%); and could aid in species-level analyses, but with some limitations; 2) nearly-whole-length sequences and some partial regions (around V2, V4, and V9) of the 18S rRNA gene can be used to discriminate between samples at both the family and order levels (with a success rate of about 80%); 3) compared with other regions, V9 has a higher resolution at the genus level (with an identification success rate of about 80%); and 4) V7 is most divergent in length, and would be a good candidate marker for the phylogenetic study of Acartia species. This study also evaluated the correlation between similarity thresholds and the accuracy of using nuclear 18S rRNA genes for the classification of organisms in the subclass Copepoda. We suggest that sample identification accuracy should be considered when a molecular sequence divergence threshold is used for taxonomic identification, and that the lowest similarity threshold should be determined based on a pre-designated level of acceptable accuracy.

Taxonomic Resolutions Based on 18S rRNA Genes: A Case Study of Subclass Copepoda

PubMed Central

Wu, Shu; Xiong, Jie; Yu, Yuhe

2015-01-01

Biodiversity studies are commonly conducted using 18S rRNA genes. In this study, we compared the inter-species divergence of variable regions (V1–9) within the copepod 18S rRNA gene, and tested their taxonomic resolutions at different taxonomic levels. Our results indicate that the 18S rRNA gene is a good molecular marker for the study of copepod biodiversity, and our conclusions are as follows: 1) 18S rRNA genes are highly conserved intra-species (intra-species similarities are close to 100%); and could aid in species-level analyses, but with some limitations; 2) nearly-whole-length sequences and some partial regions (around V2, V4, and V9) of the 18S rRNA gene can be used to discriminate between samples at both the family and order levels (with a success rate of about 80%); 3) compared with other regions, V9 has a higher resolution at the genus level (with an identification success rate of about 80%); and 4) V7 is most divergent in length, and would be a good candidate marker for the phylogenetic study of Acartia species. This study also evaluated the correlation between similarity thresholds and the accuracy of using nuclear 18S rRNA genes for the classification of organisms in the subclass Copepoda. We suggest that sample identification accuracy should be considered when a molecular sequence divergence threshold is used for taxonomic identification, and that the lowest similarity threshold should be determined based on a pre-designated level of acceptable accuracy. PMID:26107258
The mitochondrial genome of Priapulus caudatus Lamarck (Priapulida: Priapulidae).

PubMed

Webster, Bonnie L; Mackenzie-Dodds, Jacqueline A; Telford, Maximilian J; Littlewood, D Timothy J

2007-03-01

We sequenced and annotated the complete mitochondrial (mt) genome of the priapulid Priapulus caudatus in order to provide a source of phylogenetic characters including an assessment of gene order arrangement. The genome was 14,919 bp in its entirety with few, short non-coding regions. A number of protein-coding and tRNA genes overlapped, making the genome relatively compact. The gene order was: cox1, cox2, trnK, trnD, atp8, atp6, cox3, trnG, nad3, trnA, trnR, trnN, rrnS, trnV, rrnL, trnL(yaa), trnL(nag), nad1, -trnS(nga), -cob, -nad6, trnP, -trnT, nad4L, nad4, trnH, nad5, trnF, -trnE, -trnS(nct), trnI, -trnQ, trnM, nad2, trnW, -trnC, -trnY; where '-' indicates genes transcribed on the opposite strand. The gene order, although unique amongst Metazoa, shared the greatest number of gene boundaries and the longest contiguous fragments with the chelicerate Limulus polyphemus. The mt genomes of these taxa differed only by a single inversion of 18 contiguous genes bounded by rrnS and trnS(nct). Other arthropods and nematodes shared fewer gene boundaries but considerably more than the most similar non-ecdysozoan.
Phylogeny of Syndermata (syn. Rotifera): Mitochondrial gene order verifies epizoic Seisonidea as sister to endoparasitic Acanthocephala within monophyletic Hemirotifera.

PubMed

Sielaff, Malte; Schmidt, Hanno; Struck, Torsten H; Rosenkranz, David; Mark Welch, David B; Hankeln, Thomas; Herlyn, Holger

2016-03-01

A monophyletic origin of endoparasitic thorny-headed worms (Acanthocephala) and wheel-animals (Rotifera) is widely accepted. However, the phylogeny inside the clade, be it called Syndermata or Rotifera, has lacked validation by mitochondrial (mt) data. Herein, we present the first mt genome of the key taxon Seison and report conflicting results of phylogenetic analyses: while mt sequence-based topologies showed monophyletic Lemniscea (Bdelloidea+Acanthocephala), gene order analyses supported monophyly of Pararotatoria (Seisonidea+Acanthocephala) and Hemirotifera (Bdelloidea+Pararotatoria). Sequence-based analyses obviously suffered from substitution saturation, compositional bias, and branch length heterogeneity; however, we observed no compromising effects in gene order analyses. Moreover, gene order-based topologies were robust to changes in coding (genes vs. gene pairs, two-state vs. multistate, aligned vs. non-aligned), tree reconstruction methods, and the treatment of the two monogonont mt genomes. Thus, mt gene order verifies seisonids as sister to acanthocephalans within monophyletic Hemirotifera, while deviating results of sequence-based analyses reflect artificial signal. This conclusion implies that the complex life cycle of extant acanthocephalans evolved from a free-living state, as retained by most monogononts and bdelloids, via an epizoic state with a simple life cycle, as shown by seisonids. Hence, Acanthocephala represent a rare example where ancestral transitional stages have counterparts amongst the closest relatives. Copyright © 2015 Elsevier Inc. All rights reserved.
Target research on tumor biology characteristics of mir-155-5p regulation on gastric cancer cell.

PubMed

Feng, Jun-an

2016-03-01

After the mir-155-5p over expressed in gastric cancer cells, the expression profile chip was adopted to screen its target genes. Some of the intersection of target genes were selected based on the bioinformatics prediction, in order to study the mechanism of its function and role of research. Affymetrix eukaryotic gene expression spectrum was conducted to screen mir-155-5p regulated genetic experiment. Western blot technique was employed to detect and screen the protein expression of target genes. Mimics was transfected in BGC-823 of gastric cancer cells. Compared with mimics-nc group and mock group, the mRNA expression quantities of SMAD1, STAT1, CAB39, CXCR4 and CA9 were significantly lower. After the gastric cancer cells BGC-823 and MKN-45 had been transfected by mimics, compared with mimics-nc (MNC) group and mock (MOCK) group, it was decreased for the protein expression of SMAD1, STAT1 and CAB39 in mimics (MIMICS) group. The verification of qRT-PCR demonstrated that SMAD1, STAT1, CAB39, CXCR4 and CA9 were the predicted target genes and target proteins of mir-155-5p, the over expression of mir-155-5p could enable the decreasing of its expression level in gastric cancer cells MKN-45 and BGC-823.
Structure and Evolution of Chlorate Reduction Composite Transposons

PubMed Central

Clark, Iain C.; Melnyk, Ryan A.; Engelbrektson, Anna; Coates, John D.

2013-01-01

ABSTRACT The genes for chlorate reduction in six bacterial strains were analyzed in order to gain insight into the metabolism. A newly isolated chlorate-reducing bacterium (Shewanella algae ACDC) and three previously isolated strains (Ideonella dechloratans, Pseudomonas sp. strain PK, and Dechloromarinus chlorophilus NSS) were genome sequenced and compared to published sequences (Alicycliphilus denitrificans BC plasmid pALIDE01 and Pseudomonas chloritidismutans AW-1). De novo assembly of genomes failed to join regions adjacent to genes involved in chlorate reduction, suggesting the presence of repeat regions. Using a bioinformatics approach and finishing PCRs to connect fragmented contigs, we discovered that chlorate reduction genes are flanked by insertion sequences, forming composite transposons in all four newly sequenced strains. These insertion sequences delineate regions with the potential to move horizontally and define a set of genes that may be important for chlorate reduction. In addition to core metabolic components, we have highlighted several such genes through comparative analysis and visualization. Phylogenetic analysis places chlorate reductase within a functionally diverse clade of type II dimethyl sulfoxide (DMSO) reductases, part of a larger family of enzymes with reactivity toward chlorate. Nucleotide-level forensics of regions surrounding chlorite dismutase (cld), as well as its phylogenetic clustering in a betaproteobacterial Cld clade, indicate that cld has been mobilized at least once from a perchlorate reducer to build chlorate respiration. PMID:23919996
De novo comparative transcriptome analysis of genes involved in fruit morphology of pumpkin cultivars with extreme size difference and development of EST-SSR markers.

PubMed

Xanthopoulou, Aliki; Ganopoulos, Ioannis; Psomopoulos, Fotis; Manioudaki, Maria; Moysiadis, Theodoros; Kapazoglou, Aliki; Osathanunkul, Maslin; Michailidou, Sofia; Kalivas, Apostolos; Tsaftaris, Athanasios; Nianiou-Obeidat, Irini; Madesis, Panagiotis

2017-07-30

The genetic basis of fruit size and shape was investigated for the first time in Cucurbita species and genetic loci associated with fruit morphology have been identified. Although extensive genomic resources are available at present for tomato (Solanum lycopersicum), cucumber (Cucumis sativus), melon (Cucumis melo) and watermelon (Citrullus lanatus), genomic databases for Cucurbita species are limited. Recently, our group reported the generation of pumpkin (Cucurbita pepo) transcriptome databases from two contrasting cultivars with extreme fruit sizes. In the current study we used these databases to perform comparative transcriptome analysis in order to identify genes with potential roles in fruit morphology and fruit size. Differential Gene Expression (DGE) analysis between cv. 'Munchkin' (small-fruit) and cv. 'Big Moose' (large-fruit) revealed a variety of candidate genes associated with fruit morphology with significant differences in gene expression between the two cultivars. In addition, we have set the framework for generating EST-SSR markers, which discriminate different C. pepo cultivars and show transferability to related Cucurbitaceae species. The results of the present study will contribute to both further understanding the molecular mechanisms regulating fruit morphology and furthermore identifying the factors that determine fruit size. Moreover, they may lead to the development of molecular marker tools for selecting genotypes with desired morphological traits. Copyright © 2017. Published by Elsevier B.V.
Gene expression complex networks: synthesis, identification, and analysis.

PubMed

Lopes, Fabrício M; Cesar, Roberto M; Costa, Luciano Da F

2011-10-01

Thanks to recent advances in molecular biology, allied to an ever increasing amount of experimental data, the functional state of thousands of genes can now be extracted simultaneously by using methods such as cDNA microarrays and RNA-Seq. Particularly important related investigations are the modeling and identification of gene regulatory networks from expression data sets. Such a knowledge is fundamental for many applications, such as disease treatment, therapeutic intervention strategies and drugs design, as well as for planning high-throughput new experiments. Methods have been developed for gene networks modeling and identification from expression profiles. However, an important open problem regards how to validate such approaches and its results. This work presents an objective approach for validation of gene network modeling and identification which comprises the following three main aspects: (1) Artificial Gene Networks (AGNs) model generation through theoretical models of complex networks, which is used to simulate temporal expression data; (2) a computational method for gene network identification from the simulated data, which is founded on a feature selection approach where a target gene is fixed and the expression profile is observed for all other genes in order to identify a relevant subset of predictors; and (3) validation of the identified AGN-based network through comparison with the original network. The proposed framework allows several types of AGNs to be generated and used in order to simulate temporal expression data. The results of the network identification method can then be compared to the original network in order to estimate its properties and accuracy. Some of the most important theoretical models of complex networks have been assessed: the uniformly-random Erdös-Rényi (ER), the small-world Watts-Strogatz (WS), the scale-free Barabási-Albert (BA), and geographical networks (GG). The experimental results indicate that the inference method was sensitive to average degree variation, decreasing its network recovery rate with the increase of . The signal size was important for the inference method to get better accuracy in the network identification rate, presenting very good results with small expression profiles. However, the adopted inference method was not sensible to recognize distinct structures of interaction among genes, presenting a similar behavior when applied to different network topologies. In summary, the proposed framework, though simple, was adequate for the validation of the inferred networks by identifying some properties of the evaluated method, which can be extended to other inference methods.
Gene Expression in Uterine Leiomyoma from Tumors Likely to Be Growing (from Black Women over 35) and Tumors Likely to Be Non-Growing (from White Women over 35)

PubMed Central

Davis, Barbara J.; Risinger, John I.; Chandramouli, Gadisetti V. R.; Bushel, Pierre R.; Baird, Donna Day; Peddada, Shyamal D.

2013-01-01

The study of uterine leiomyomata (fibroids) provides a unique opportunity to investigate the physiological and molecular determinants of hormone dependent tumor growth and spontaneous tumor regression. We conducted a longitudinal clinical study of premenopausal women with leiomyoma that showed significantly different growth rates between white and black women depending on their age. Growth rates for leiomyoma were on average much higher from older black women than for older white women, and we now report gene expression pattern differences in tumors from these two groups of study participants. Total RNA from 52 leiomyoma and 8 myometrial samples were analyzed using Affymetrix Gene Chip expression arrays. Gene expression data was first compared between all leiomyoma and normal myometrium and then between leiomyoma from older black women (age 35 or older) and from older white women. Genes that were found significant in pairwise comparisons were further analyzed for canonical pathways, networks and biological functions using the Ingenuity Pathway Analysis (IPA) software. Whereas our comparison of leiomyoma to myometrium produced a very large list of genes highly similar to numerous previous studies, distinct sets of genes and signaling pathways were identified in comparisons of older black and white women whose tumors were likely to be growing and non-growing, respectively. Key among these were genes associated with regulation of apoptosis. To our knowledge, this is the first study to compare two groups of tumors that are likely to have different growth rates in order to reveal molecular signals likely to be influential in tumor growth. PMID:23785396
Differential accumulation of retroelements and diversification of NB-LRR disease resistance genes in duplicated regions following polyploidy in the ancestor of soybean.

PubMed

Innes, Roger W; Ameline-Torregrosa, Carine; Ashfield, Tom; Cannon, Ethalinda; Cannon, Steven B; Chacko, Ben; Chen, Nicolas W G; Couloux, Arnaud; Dalwani, Anita; Denny, Roxanne; Deshpande, Shweta; Egan, Ashley N; Glover, Natasha; Hans, Christian S; Howell, Stacy; Ilut, Dan; Jackson, Scott; Lai, Hongshing; Mammadov, Jafar; Del Campo, Sara Martin; Metcalf, Michelle; Nguyen, Ashley; O'Bleness, Majesta; Pfeil, Bernard E; Podicheti, Ram; Ratnaparkhe, Milind B; Samain, Sylvie; Sanders, Iryna; Ségurens, Béatrice; Sévignac, Mireille; Sherman-Broyles, Sue; Thareau, Vincent; Tucker, Dominic M; Walling, Jason; Wawrzynski, Adam; Yi, Jing; Doyle, Jeff J; Geffroy, Valérie; Roe, Bruce A; Maroof, M A Saghai; Young, Nevin D

2008-12-01

The genomes of most, if not all, flowering plants have undergone whole genome duplication events during their evolution. The impact of such polyploidy events is poorly understood, as is the fate of most duplicated genes. We sequenced an approximately 1 million-bp region in soybean (Glycine max) centered on the Rpg1-b disease resistance gene and compared this region with a region duplicated 10 to 14 million years ago. These two regions were also compared with homologous regions in several related legume species (a second soybean genotype, Glycine tomentella, Phaseolus vulgaris, and Medicago truncatula), which enabled us to determine how each of the duplicated regions (homoeologues) in soybean has changed following polyploidy. The biggest change was in retroelement content, with homoeologue 2 having expanded to 3-fold the size of homoeologue 1. Despite this accumulation of retroelements, over 77% of the duplicated low-copy genes have been retained in the same order and appear to be functional. This finding contrasts with recent analyses of the maize (Zea mays) genome, in which only about one-third of duplicated genes appear to have been retained over a similar time period. Fluorescent in situ hybridization revealed that the homoeologue 2 region is located very near a centromere. Thus, pericentromeric localization, per se, does not result in a high rate of gene inactivation, despite greatly accelerated retrotransposon accumulation. In contrast to low-copy genes, nucleotide-binding-leucine-rich repeat disease resistance gene clusters have undergone dramatic species/homoeologue-specific duplications and losses, with some evidence for partitioning of subfamilies between homoeologues.
Complete mitochondrial genome of the aluminum-tolerant fungus Rhodotorula taiwanensis RS1 and comparative analysis of Basidiomycota mitochondrial genomes

PubMed Central

Zhao, Xue Qiang; Aizawa, Tomoko; Schneider, Jessica; Wang, Chao; Shen, Ren Fang; Sunairi, Michio

2013-01-01

The complete mitochondrial genome of Rhodotorula taiwanensis RS1, an aluminum-tolerant Basidiomycota fungus, was determined and compared with the known mitochondrial genomes of 12 Basidiomycota species. The mitochondrial genome of R. taiwanensis RS1 is a circular DNA molecule of 40,392 bp and encodes the typical 15 mitochondrial proteins, 23 tRNAs, and small and large rRNAs as well as 10 intronic open reading frames. These genes are apparently transcribed in two directions and do not show syntenies in gene order with other investigated Basidiomycota species. The average G+C content (41%) of the mitochondrial genome of R. taiwanensis RS1 is the highest among the Basidiomycota species. Two introns were detected in the sequence of the atp9 gene of R. taiwanensis RS1, but not in that of other Basidiomycota species. Rhodotorula taiwanensis is the first species of the genus Rhodotorula whose full mitochondrial genome has been sequenced; and the data presented here supply valuable information for understanding the evolution of fungal mitochondrial genomes and researching the mechanism of aluminum tolerance in microorganisms. PMID:23427135
The complete genomes of Lactobacillus plantarum and Lactobacillus johnsonii reveal extensive differences in chromosome organization and gene content.

PubMed

Boekhorst, Jos; Siezen, Roland J; Zwahlen, Marie-Camille; Vilanova, David; Pridmore, Raymond D; Mercenier, Annick; Kleerebezem, Michiel; de Vos, Willem M; Brüssow, Harald; Desiere, Frank

2004-11-01

The first comprehensive comparative analysis of lactobacilli was done by comparing the genomes of Lactobacillus plantarum (3.3 Mb) and Lactobacillus johnsonii (2.0 Mb). L. johnsonii is predominantly found in the gastrointestinal tract, while L. plantarum is also found on plants and plant-derived material, and is used in a variety of industrial fermentations. The L. plantarum and L. johnsonii chromosomes have only 28 regions with conservation of gene order, totalling about 0.75 Mb; these regions are not co-linear, indicating major chromosomal rearrangements. Metabolic reconstruction indicates many differences between L. johnsonii and L. plantarum: numerous enzymes involved in sugar metabolism and in biosynthesis of amino acids, nucleotides, fatty acids and cofactors are lacking in L. johnsonii. Major differences were seen in the number and types of putative extracellular proteins, which are of interest because of their possible role in host-microbe interactions. The differences between L. plantarum and L. johnsonii, both in genome organization and gene content, are exceptionally large for two bacteria of the same genus, emphasizing the difficulty in taxonomic classification of lactobacilli.
Comparative genomics reveals phylogenetic distribution patterns of secondary metabolites in Amycolatopsis species.

PubMed

Adamek, Martina; Alanjary, Mohammad; Sales-Ortells, Helena; Goodfellow, Michael; Bull, Alan T; Winkler, Anika; Wibberg, Daniel; Kalinowski, Jörn; Ziemert, Nadine

2018-06-01

Genome mining tools have enabled us to predict biosynthetic gene clusters that might encode compounds with valuable functions for industrial and medical applications. With the continuously increasing number of genomes sequenced, we are confronted with an overwhelming number of predicted clusters. In order to guide the effective prioritization of biosynthetic gene clusters towards finding the most promising compounds, knowledge about diversity, phylogenetic relationships and distribution patterns of biosynthetic gene clusters is necessary. Here, we provide a comprehensive analysis of the model actinobacterial genus Amycolatopsis and its potential for the production of secondary metabolites. A phylogenetic characterization, together with a pan-genome analysis showed that within this highly diverse genus, four major lineages could be distinguished which differed in their potential to produce secondary metabolites. Furthermore, we were able to distinguish gene cluster families whose distribution correlated with phylogeny, indicating that vertical gene transfer plays a major role in the evolution of secondary metabolite gene clusters. Still, the vast majority of the diverse biosynthetic gene clusters were derived from clusters unique to the genus, and also unique in comparison to a database of known compounds. Our study on the locations of biosynthetic gene clusters in the genomes of Amycolatopsis' strains showed that clusters acquired by horizontal gene transfer tend to be incorporated into non-conserved regions of the genome thereby allowing us to distinguish core and hypervariable regions in Amycolatopsis genomes. Using a comparative genomics approach, it was possible to determine the potential of the genus Amycolatopsis to produce a huge diversity of secondary metabolites. Furthermore, the analysis demonstrates that horizontal and vertical gene transfer play an important role in the acquisition and maintenance of valuable secondary metabolites. Our results cast light on the interconnections between secondary metabolite gene clusters and provide a way to prioritize biosynthetic pathways in the search and discovery of novel compounds.
Mitochondrial comparative genomics and phylogenetic signal assessment of mtDNA among arbuscular mycorrhizal fungi.

PubMed

Nadimi, Maryam; Daubois, Laurence; Hijri, Mohamed

2016-05-01

Mitochondrial (mt) genes, such as cytochrome C oxidase genes (cox), have been widely used for barcoding in many groups of organisms, although this approach has been less powerful in the fungal kingdom due to the rapid evolution of their mt genomes. The use of mt genes in phylogenetic studies of Dikarya has been met with success, while early diverging fungal lineages remain less studied, particularly the arbuscular mycorrhizal fungi (AMF). Advances in next-generation sequencing have substantially increased the number of publically available mtDNA sequences for the Glomeromycota. As a result, comparison of mtDNA across key AMF taxa can now be applied to assess the phylogenetic signal of individual mt coding genes, as well as concatenated subsets of coding genes. Here we show comparative analyses of publically available mt genomes of Glomeromycota, augmented with two mtDNA genomes that were newly sequenced for this study (Rhizophagus irregularis DAOM240159 and Glomus aggregatum DAOM240163), resulting in 16 complete mtDNA datasets. R. irregularis isolate DAOM240159 and G. aggregatum isolate DAOM240163 showed mt genomes measuring 72,293bp and 69,505bp with G+C contents of 37.1% and 37.3%, respectively. We assessed the phylogenies inferred from single mt genes and complete sets of coding genes, which are referred to as "supergenes" (16 concatenated coding genes), using Shimodaira-Hasegawa tests, in order to identify genes that best described AMF phylogeny. We found that rnl, nad5, cox1, and nad2 genes, as well as concatenated subset of these genes, provided phylogenies that were similar to the supergene set. This mitochondrial genomic analysis was also combined with principal coordinate and partitioning analyses, which helped to unravel certain evolutionary relationships in the Rhizophagus genus and for G. aggregatum within the Glomeromycota. We showed evidence to support the position of G. aggregatum within the R. irregularis 'species complex'. Copyright © 2016 Elsevier Inc. All rights reserved.
A functional promoter shift of a chloroplast gene: a transcriptional fusion between a novel psbA gene copy and the trnK (UUU) gene in Pinus contorta.

PubMed

Lidholm, J; Gustafsson, P

1992-11-01

A comparative transcription analysis of the chloroplast trnK-psbA-trnH region of the two pine species Pinus contorta and Pinus sylvestris is reported. The chloroplast genome of P. contorta has previously been shown to contain a duplicated psbA gene copy integrated closely upstream of the split trnK gene. This rearrangement has resulted in the gene order psbAI-trnK-psbAII-trnH, where psbAII is the ancestral psbA gene copy. In P. sylvestris, a species which lacks the psbA duplication, transcription of the trnK gene originates from a position 291 bp upstream of the trnK 5' exon, adjacent to a canonical promoter structure. In P. contorta, the corresponding promoter structure has been separated from the trnK gene by the insertion of psbAI, and has, in addition, been partially deleted. Analysis of the transcriptional organization of the trnK-psbA-trnH region of the two pine species revealed that the trnK gene in P. contorta is transcriptionally fused to the inserted psbAI gene copy. As a result, trnK is under the control of the psbA promoter in this species and has therefore acquired psbA-like expression characteristics. In P. sylvestris, accumulation of trnK transcripts is not significantly higher in light-grown than in dark-grown seedlings. In contrast, the level of trnK transcripts in P. contorta is approximately 12-fold higher in the light than in the dark. When light-grown seedlings of the two pine species were compared, an approximately 20-fold higher level of trnK RNAs was found in P. contorta. In both pine species, evidence was obtained for trnK-psbA and psbA-trnH co-transcription.
Perceptron ensemble of graph-based positive-unlabeled learning for disease gene identification.

PubMed

Jowkar, Gholam-Hossein; Mansoori, Eghbal G

2016-10-01

Identification of disease genes, using computational methods, is an important issue in biomedical and bioinformatics research. According to observations that diseases with the same or similar phenotype have the same biological characteristics, researchers have tried to identify genes by using machine learning tools. In recent attempts, some semi-supervised learning methods, called positive-unlabeled learning, is used for disease gene identification. In this paper, we present a Perceptron ensemble of graph-based positive-unlabeled learning (PEGPUL) on three types of biological attributes: gene ontologies, protein domains and protein-protein interaction networks. In our method, a reliable set of positive and negative genes are extracted using co-training schema. Then, the similarity graph of genes is built using metric learning by concentrating on multi-rank-walk method to perform inference from labeled genes. At last, a Perceptron ensemble is learned from three weighted classifiers: multilevel support vector machine, k-nearest neighbor and decision tree. The main contributions of this paper are: (i) incorporating the statistical properties of gene data through choosing proper metrics, (ii) statistical evaluation of biological features, and (iii) noise robustness characteristic of PEGPUL via using multilevel schema. In order to assess PEGPUL, we have applied it on 12950 disease genes with 949 positive genes from six class of diseases and 12001 unlabeled genes. Compared with some popular disease gene identification methods, the experimental results show that PEGPUL has reasonable performance. Copyright © 2016 Elsevier Ltd. All rights reserved.
MADS-Box gene diversity in seed plants 300 million years ago.

PubMed

Becker, A; Winter, K U; Meyer, B; Saedler, H; Theissen, G

2000-10-01

MADS-box genes encode a family of transcription factors which control diverse developmental processes in flowering plants ranging from root development to flower and fruit development. Through phylogeny reconstructions, most of these genes can be subdivided into defined monophyletic gene clades whose members share similar expression patterns and functions. Therefore, the establishment of the diversity of gene clades was probably an important event in land plant evolution. In order to determine when these clades originated, we isolated cDNAs of 19 different MADS-box genes from Gnetum gnemon, a gymnosperm model species and thus a representative of the sister group of the angiosperms. Phylogeny reconstructions involving all published MADS-box genes were then used to identify gene clades containing putative orthologs from both angiosperm and gymnosperm lineages. Thus, the minimal number of MADS-box genes that were already present in the last common ancestor of extant gymnosperms and angiosperms was determined. Comparative expression studies involving pairs of putatively orthologous genes revealed a diversity of patterns that has been largely conserved since the time when the angiosperm and gymnosperm lineages separated. Taken together, our data suggest that there were already at least seven different MADS-box genes present at the base of extant seed plants about 300 MYA. These genes were probably already quite diverse in terms of both sequence and function. In addition, our data demonstrate that the MADS-box gene families of extant gymnosperms and angiosperms are of similar complexities.
Identifying arsenic trioxide (ATO) functions in leukemia cells by using time series gene expression profiles.

PubMed

Yang, Hong; Lin, Shan; Cui, Jingru

2014-02-10

Arsenic trioxide (ATO) is presently the most active single agent in the treatment of acute promyelocytic leukemia (APL). In order to explore the molecular mechanism of ATO in leukemia cells with time series, we adopted bioinformatics strategy to analyze expression changing patterns and changes in transcription regulation modules of time series genes filtered from Gene Expression Omnibus database (GSE24946). We totally screened out 1847 time series genes for subsequent analysis. The KEGG (Kyoto encyclopedia of genes and genomes) pathways enrichment analysis of these genes showed that oxidative phosphorylation and ribosome were the top 2 significantly enriched pathways. STEM software was employed to compare changing patterns of gene expression with assigned 50 expression patterns. We screened out 7 significantly enriched patterns and 4 tendency charts of time series genes. The result of Gene Ontology showed that functions of times series genes mainly distributed in profiles 41, 40, 39 and 38. Seven genes with positive regulation of cell adhesion function were enriched in profile 40, and presented the same first increased model then decreased model as profile 40. The transcription module analysis showed that they mainly involved in oxidative phosphorylation pathway and ribosome pathway. Overall, our data summarized the gene expression changes in ATO treated K562-r cell lines with time and suggested that time series genes mainly regulated cell adhesive. Furthermore, our result may provide theoretical basis of molecular biology in treating acute promyelocytic leukemia. Copyright © 2013 Elsevier B.V. All rights reserved.
[Community structure and phylogenetic analysis of cyanobacteria in cryoconite from surface of the Glacier No. 1 in the Tianshan Mountains].

PubMed

Ni, Xuejiao; Qi, Xing'e; Gu, Yanling; Zheng, Xiaoji; Dong, Juan; Ni, Yongqing; Cheng, Guodong

2014-11-04

The purpose of this study is to characterize the community composition and phylogenetic analysis of cyanobacteria from supraglacial cryoconite of the Glacier No. 1 in the Tianshan Mountains, China. We amplified 16S rRNA genes from the extracted cryoconite DNA by PCR with 2 pairs of cyanobacteria-specific primers. Amplificon was used to construct 16S rRNA genes clone library. The estimation of species richness, diversity indices, and rarefaction curve of the 16S rRNA genes library were determined based on representative phylotypes (OTUs). Analysis of 16S rRNA gene sequences allowed grouping of 101 clones into 12 phylotypes (OTUs) using a cut-off of 97% identity. The phylogenetic analysis revealed that most of sequences affiliated to the order Oscillatoriales and Chroococcales except that three were unclassified. The clone library was dominated by representatives of the order Oscillatoriales (81% of the total clones), and the most abundant organisms within this order were in the genus Phormidium (68 clones) including clones grouping into four phylotypes. The only clone of Chroococcales was closely related to the genus Chamaesiphon with 97% similarity. In addition, comparison of soil chemical properties between different habitats indicated that supraglacial cryoconite supported significantly higher the content of available phosphorus and potassium, nitrate nitrogen and organic matter compared with the forefield of the Glacier No. 1. The diversity index of cyanobacteria were relatively high in supraglacial cryoconite of the Glacier No. 1 in the Tianshan Mountains. The community structure was dominated by members of the genus Phormidium. This study may enrich our knowledge on biogeochemical processes and ecological distribution of cyanobacterial populations in glacial ecosystem.
Generation of cell lines for drug discovery through random activation of gene expression: application to the human histamine H3 receptor.

PubMed

Song, J; Doucette, C; Hanniford, D; Hunady, K; Wang, N; Sherf, B; Harrington, J J; Brunden, K R; Stricker-Krongrad, A

2005-06-01

Target-based high-throughput screening (HTS) plays an integral role in drug discovery. The implementation of HTS assays generally requires high expression levels of the target protein, and this is typically accomplished using recombinant cDNA methodologies. However, the isolated gene sequences to many drug targets have intellectual property claims that restrict the ability to implement drug discovery programs. The present study describes the pharmacological characterization of the human histamine H3 receptor that was expressed using random activation of gene expression (RAGE), a technology that over-expresses proteins by up-regulating endogenous genes rather than introducing cDNA expression vectors into the cell. Saturation binding analysis using [125I]iodoproxyfan and RAGE-H3 membranes revealed a single class of binding sites with a K(D) value of 0.77 nM and a B(max) equal to 756 fmol/mg of protein. Competition binding studies showed that the rank order of potency for H3 agonists was N(alpha)-methylhistamine approximately (R)-alpha- methylhistamine > histamine and that the rank order of potency for H3 antagonists was clobenpropit > iodophenpropit > thioperamide. The same rank order of potency for H3 agonists and antagonists was observed in the functional assays as in the binding assays. The Fluorometic Imaging Plate Reader assays in RAGE-H3 cells gave high Z' values for agonist and antagonist screening, respectively. These results reveal that the human H3 receptor expressed with the RAGE technology is pharmacologically comparable to that expressed through recombinant methods. Moreover, the level of expression of the H3 receptor in the RAGE-H3 cells is suitable for HTS and secondary assays.
Evidence for a close phylogenetic relationship between Melissococcus pluton, the causative agent of European foulbrood disease, and the genus Enterococcus.

PubMed

Cai, J; Collins, M D

1994-04-01

The 16S rRNA gene sequence of Melissococcus pluton, the causative agent of European foulbrood disease, was determined in order to investigate the phylogenetic relationships between this organism and other low-G + C-content gram-positive bacteria. A comparative sequence analysis revealed that M. pluton is a close phylogenetic relative of the genus Enterococcus.

The multiple sex chromosomes of platypus and echidna are not completely identical and several share homology with the avian Z

PubMed Central

Rens, Willem; O'Brien, Patricia CM; Grützner, Frank; Clarke, Oliver; Graphodatskaya, Daria; Tsend-Ayush, Enkhjargal; Trifonov, Vladimir A; Skelton, Helen; Wallis, Mary C; Johnston, Steve; Veyrunes, Frederic; Graves, Jennifer AM; Ferguson-Smith, Malcolm A

2007-01-01

Background Sex-determining systems have evolved independently in vertebrates. Placental mammals and marsupials have an XY system, birds have a ZW system. Reptiles and amphibians have different systems, including temperature-dependent sex determination, and XY and ZW systems that differ in origin from birds and placental mammals. Monotremes diverged early in mammalian evolution, just after the mammalian clade diverged from the sauropsid clade. Our previous studies showed that male platypus has five X and five Y chromosomes, no SRY, and DMRT1 on an X chromosome. In order to investigate monotreme sex chromosome evolution, we performed a comparative study of platypus and echidna by chromosome painting and comparative gene mapping. Results Chromosome painting reveals a meiotic chain of nine sex chromosomes in the male echidna and establishes their order in the chain. Two of those differ from those in the platypus, three of the platypus sex chromosomes differ from those of the echidna and the order of several chromosomes is rearranged. Comparative gene mapping shows that, in addition to bird autosome regions, regions of bird Z chromosomes are homologous to regions in four platypus X chromosomes, that is, X1, X2, X3, X5, and in chromosome Y1. Conclusion Monotreme sex chromosomes are easiest to explain on the hypothesis that autosomes were added sequentially to the translocation chain, with the final additions after platypus and echidna divergence. Genome sequencing and contig anchoring show no homology yet between platypus and therian Xs; thus, monotremes have a unique XY sex chromosome system that shares some homology with the avian Z. PMID:18021405
A Dynamical Model Reveals Gene Co-Localizations in Nucleus

PubMed Central

Yao, Ye; Lin, Wei; Hennessy, Conor; Fraser, Peter; Feng, Jianfeng

2011-01-01

Co-localization of networks of genes in the nucleus is thought to play an important role in determining gene expression patterns. Based upon experimental data, we built a dynamical model to test whether pure diffusion could account for the observed co-localization of genes within a defined subnuclear region. A simple standard Brownian motion model in two and three dimensions shows that preferential co-localization is possible for co-regulated genes without any direct interaction, and suggests the occurrence may be due to a limitation in the number of available transcription factors. Experimental data of chromatin movements demonstrates that fractional rather than standard Brownian motion is more appropriate to model gene mobilizations, and we tested our dynamical model against recent static experimental data, using a sub-diffusion process by which the genes tend to colocalize more easily. Moreover, in order to compare our model with recently obtained experimental data, we studied the association level between genes and factors, and presented data supporting the validation of this dynamic model. As further applications of our model, we applied it to test against more biological observations. We found that increasing transcription factor number, rather than factory number and nucleus size, might be the reason for decreasing gene co-localization. In the scenario of frequency- or amplitude-modulation of transcription factors, our model predicted that frequency-modulation may increase the co-localization between its targeted genes. PMID:21760760
Transcriptional modulation of some Staphylococcus aureus iron-regulated genes during growth in vitro and in a tissue cage model in vivo.

PubMed

Allard, Marianne; Moisan, Hélène; Brouillette, Eric; Gervais, Alain L; Jacques, Mario; Lacasse, Pierre; Diarra, Moussa S; Malouin, François

2006-06-01

Staphylococcus aureus can proliferate in iron-limited environments such as the mammalian host. The transcriptional profiles of 460 genes (iron-regulated, putative Fur-regulated, membrane transport, pathogenesis) obtained for S. aureus grown in iron-restricted environments in vitro and in vivo were compared in order to identify new iron-regulated genes and to evaluate their potential as possible therapeutic targets in vivo. Iron deprivation was created in vitro by 2,2-dipyridyl, and in vivo, S. aureus was grown in tissue cages implanted in mice. Bacterial RNA was obtained from each growth condition and cDNA probes were co-hybridized on DNA arrays. Thirty-six upregulated and 11 downregulated genes were commonly modulated in animals and in the low-iron medium. Real-time PCR confirmed the iron-dependent modulation of four novel genes (SACOL0161, 2170, 2369, 2431) with a Fur box motif. Some genes expressed in the dipyridyl medium were not expressed in vivo (e.g., copA, frpA, SACOL1045). Downregulated genes included an iron-storage protein gene and genes of the succinate dehydrogenase complex, reminiscent of a small RNA-dependent regulation thus far only demonstrated in Gram-negative bacteria. The expression of iron-regulated genes in distinct low-iron environments provided insight into their relative importance in vitro and in vivo and their usefulness for vaccine and drug development.
Influence of tetracycline on tetracycline-resistant heterotrophs and tet genes in activated sludge process.

PubMed

Yu, Jie; Liu, Dongfang; Li, Kexun

2015-03-01

The concentrations of tetracycline-intermediate resistant, tetracycline-resistant heterotrophic bacteria, and total heterotrophic bacteria were examined to assess the influence of tetracycline on tetracycline-resistant heterotrophs by the R2A agar cultivation method in the tetracycline fortified activated sludge process and in the natural background. Results showed that the percentages of both tetracycline-intermediate resistant and tetracycline-resistant heterotrophic bacteria in total heterotrophic bacteria were significantly increased, after tetracycline was fed to activated sludge for a 3 months period under four different operating conditions, as compared with the background. In order to investigate the mechanism of activated sludge resistance to tetracycline, polymerase chain reaction experiments were carried out to analyze the existence and evolution of tet genes in the presence of tetracycline. Results revealed that only tet A and tet B genes out of the 11 target tet genes were observed in tetracycline treated activated sludge while no tet gene was detected in background. This indicated that tet A gene could accumulate in activated sludge with slower and continuous influent, while the accumulation of tet B gene could be attributed to shorter hydraulic retention time. Therefore, it was proposed in this study that tetracycline-resistant genes created by efflux pumps spread earlier and quicker to encode resistance to tetracycline, which facilitated the increase in tetracycline-resistance.
An improved Pearson's correlation proximity-based hierarchical clustering for mining biological association between genes.

PubMed

Booma, P M; Prabhakaran, S; Dhanalakshmi, R

2014-01-01

Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality.
An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes

PubMed Central

Booma, P. M.; Prabhakaran, S.; Dhanalakshmi, R.

2014-01-01

Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality. PMID:25136661
Gene Regulatory Network Inferences Using a Maximum-Relevance and Maximum-Significance Strategy

PubMed Central

Liu, Wei; Zhu, Wen; Liao, Bo; Chen, Xiangtao

2016-01-01

Recovering gene regulatory networks from expression data is a challenging problem in systems biology that provides valuable information on the regulatory mechanisms of cells. A number of algorithms based on computational models are currently used to recover network topology. However, most of these algorithms have limitations. For example, many models tend to be complicated because of the “large p, small n” problem. In this paper, we propose a novel regulatory network inference method called the maximum-relevance and maximum-significance network (MRMSn) method, which converts the problem of recovering networks into a problem of how to select the regulator genes for each gene. To solve the latter problem, we present an algorithm that is based on information theory and selects the regulator genes for a specific gene by maximizing the relevance and significance. A first-order incremental search algorithm is used to search for regulator genes. Eventually, a strict constraint is adopted to adjust all of the regulatory relationships according to the obtained regulator genes and thus obtain the complete network structure. We performed our method on five different datasets and compared our method to five state-of-the-art methods for network inference based on information theory. The results confirm the effectiveness of our method. PMID:27829000
Expression variability of co-regulated genes differentiates Saccharomyces cerevisiae strains

PubMed Central

2011-01-01

Background Saccharomyces cerevisiae (Baker's yeast) is found in diverse ecological niches and is characterized by high adaptive potential under challenging environments. In spite of recent advances on the study of yeast genome diversity, little is known about the underlying gene expression plasticity. In order to shed new light onto this biological question, we have compared transcriptome profiles of five environmental isolates, clinical and laboratorial strains at different time points of fermentation in synthetic must medium, during exponential and stationary growth phases. Results Our data unveiled diversity in both intensity and timing of gene expression. Genes involved in glucose metabolism and in the stress response elicited during fermentation were among the most variable. This gene expression diversity increased at the onset of stationary phase (diauxic shift). Environmental isolates showed lower average transcript abundance of genes involved in the stress response, assimilation of nitrogen and vitamins, and sulphur metabolism, than other strains. Nitrogen metabolism genes showed significant variation in expression among the environmental isolates. Conclusions Wild type yeast strains respond differentially to the stress imposed by nutrient depletion, ethanol accumulation and cell density increase, during fermentation of glucose in synthetic must medium. Our results support previous data showing that gene expression variability is a source of phenotypic diversity among closely related organisms. PMID:21507216
The expression of the Saccharomyces cerevisiae HAL1 gene increases salt tolerance in transgenic watermelon [Citrullus lanatus (Thunb.) Matsun. & Nakai.].

PubMed

Ellul, P; Ríos, G; Atarés, A; Roig, L A; Serrano, R; Moreno, V

2003-08-01

An optimised Agrobacterium-mediated gene transfer protocol was developed in order to obtain watermelon transgenic plants [Citrullus lanatus (Thunb.) Matsun. & Nakai.]. Transformation efficiencies ranged from 2.8% to 5.3%, depending on the cultivar. The method was applied to obtain genetically engineered watermelon plants expressing the Saccharomyces cerevisiae HAL1 gene related to salt tolerance. In order to enhance its constitutive expression in plants, the HAL1 gene was cloned in a pBiN19 plasmid under control of the 35S promoter with a double enhancer sequence from the cauliflower mosaic virus and the RNA4 leader sequence of the alfalfa mosaic virus. This vector was introduced into Agrobacterium tumefaciens strain LBA4404 for further inoculation of watermelon half-cotyledon explants. The introduction of both the neomycin phosphotransferase II and HAL1 genes was assessed in primary transformants (TG1) by polymerase chain reaction analysis and Southern hybridisation. The expression of the HAL1 gene was determined by Northern analysis, and the diploid level of transgenic plants was confirmed by flow cytometry. The presence of the selectable marker gene in the expected Mendelian ratios was demonstrated in TG2 progenies. The TG2 kanamycin-resistant plantlets elongated better and produced new roots and leaves in culture media supplemented with NaCl compared with the control. Salt tolerance was confirmed in a semi-hydroponic system (EC=6 dS m(-1)) on the basis of the higher growth performance of homozygous TG3 lines with respect to their respective azygous control lines without the transgene. The halotolerance observed confirmed the inheritance of the trait and supports the potential usefulness of the HAL1 gene of S. cerevisiae as a molecular tool for genetic engineering of salt-stress protection in other crop species.
Comparative transcriptomes analysis of the wing disc between two silkworm strains with different size of wings

PubMed Central

Zhang, Jing; Blessing, Danso; Wu, Chenyu; Liu, Na; Li, Juan; Qin, Sheng

2017-01-01

Wings of Bombyx mori (B. mori) develop from the primordium, and different B. mori strains have different wing types. In order to identify the key factors influencing B. mori wing development, we chose strains P50 and U11, which are typical for normal wing and minute wing phenotypes, respectively. We dissected the wing disc on the 1st-day of wandering stage (P50D1 and U11D1), 2nd-day of wandering stage (P50D2 and U11D2), and 3rd-day of wandering stage (P50D3 and U11D3). Subsequently, RNA-sequencing (RNA-Seq) was performed on both strains in order to construct their gene expression profiles. P50 exhibited 628 genes differentially expressed to U11, 324 up-regulated genes, and 304 down-regulated genes. Five enriched gene ontology (GO) terms were identified by GO enrichment analysis based on these differentially expressed genes (DEGs). KEGG enrichment analysis results showed that the DEGs were enriched in five pathways; of these, we identified three pathways related to the development of wings. The three pathways include amino sugar and nucleotide sugar metabolism pathway, proteasome signaling pathway, and the Hippo signaling pathway. The representative genes in the enrichment pathways were further verified by quantitative real-time reverse transcription polymerase chain reaction (qRT-PCR). The RNA-Seq and qRT-PCR results were largely consistent with each other. Our results also revealed that the significantly different genes obtained in our study might be involved in the development of the size of B. mori wings. In addition, several KEGG enriched pathways might be involved in the regulation of the pathways of wing formation. These results provide a basis for further research of wing development in B. mori. PMID:28617839
The mitochondrial genomes of the human hookworms, Ancylostoma duodenale and Necator americanus (Nematoda: Secernentea).

PubMed

Hu, Min; Chilton, Neil B; Gasser, Robin B

2002-02-01

The complete mitochondrial genome sequences were determined for two species of human hookworms, Ancylostoma duodenale (13,721 bp) and Necator americanus (13,604 bp). The circular hookworm genomes are amongst the smallest reported to date for any metazoan organism. Their relatively small size relates mainly to a reduced length in the AT-rich region. Both hookworm genomes encode 12 protein, two ribosomal RNA and 22 transfer RNA genes, but lack the ATP synthetase subunit 8 gene, which is consistent with three other species of Secernentea studied to date. All genes are transcribed in the same direction and have a nucleotide composition high in A and T, but low in G and C. The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. For both hookworm species, genes were arranged in the same order as for Caenorhabditis elegans, except for the presence of a non-coding region between genes nad3 and nad5. In A. duodenale, this non-coding region is predicted to form a stem-and-loop structure which is not present in N. americanus. The mitochondrial genome structure for both hookworms differs from Ascaris suum only in the location of the AT-rich region, whereas there are substantial differences when compared with Onchocerca volvulus, including four gene or gene-block translocations and the positions of some transfer RNA genes and the AT-rich region. Based on genome organisation and amino acid sequence identity, A. duodenale and N. americanus were more closely related to C. elegans than to A. suum or O. volvulus (all secernentean nematodes), consistent with a previous phylogenetic study using ribosomal DNA sequence data. Determination of the complete mitochondrial genome sequences for two human hookworms (the first members of the order Strongylida ever sequenced) provides a foundation for studying the systematics, population genetics and ecology of these and other nematodes of socio-economic importance.
Establishing Substantial Equivalence: Transcriptomics

NASA Astrophysics Data System (ADS)

Baudo, María Marcela; Powers, Stephen J.; Mitchell, Rowan A. C.; Shewry, Peter R.

Regulatory authorities in Western Europe require transgenic crops to be substantially equivalent to conventionally bred forms if they are to be approved for commercial production. One way to establish substantial equivalence is to compare the transcript profiles of developing grain and other tissues of transgenic and conventionally bred lines, in order to identify any unintended effects of the transformation process. We present detailed protocols for transcriptomic comparisons of developing wheat grain and leaf material, and illustrate their use by reference to our own studies of lines transformed to express additional gluten protein genes controlled by their own endosperm-specific promoters. The results show that the transgenes present in these lines (which included those encoding marker genes) did not have any significant unpredicted effects on the expression of endogenous genes and that the transgenic plants were therefore substantially equivalent to the corresponding parental lines.
The effect of a gold coin fine on C-reactive protein test ordering in a tertiary referral emergency department.

PubMed

Mallows, James L

2013-12-16

To examine the effect of an education campaign based around a gold coin fine on ordering of C-reactive protein (CRP) tests. A retrospective analysis of CRP test ordering before and after the intervention in the emergency department (ED) of a tertiary referral hospital in metropolitan Sydney that sees about 60,000 patients per annum. The date of the intervention - 2 August 2013 - corresponded with Jeans for Genes Day. Number of CRP tests ordered in the ED. 1290 CRP tests were ordered before the intervention (1-31 July), and 394 were ordered after the intervention (2-31 August). This decrease in CRP test ordering was despite an increased number of ED presentations in August compared with July (5219 v 5497 presentations). This represented an absolute reduction in the rate of CRP test ordering of 17.6% (95% CI, 16.2%-18.9%; P < 0.001). The threat of a gold coin fine for ordering a CRP test, as part of a broader education campaign, significantly reduced the number of CRP tests ordered in a tertiary referral ED.
Metabolic Adaptation to Nutrients Involves Coregulation of Gene Expression by the RNA Helicase Dbp2 and the Cyc8 Corepressor in Saccharomyces cerevisiae.

PubMed

Wang, Siwen; Xing, Zheng; Pascuzzi, Pete E; Tran, Elizabeth J

2017-07-05

Cells fine-tune their metabolic programs according to nutrient availability in order to maintain homeostasis. This is achieved largely through integrating signaling pathways and the gene expression program, allowing cells to adapt to nutritional change. Dbp2, a member of the DEAD-box RNA helicase family in Saccharomyces cerevisiae , has been proposed to integrate gene expression with cellular metabolism. Prior work from our laboratory has reported the necessity of DBP2 in proper gene expression, particularly for genes involved in glucose-dependent regulation. Here, by comparing differentially expressed genes in dbp2 ∆ to those of 700 other deletion strains from other studies, we find that CYC8 and TUP1 , which form a complex and inhibit transcription of numerous genes, corepress a common set of genes with DBP2 Gene ontology (GO) annotations reveal that these corepressed genes are related to cellular metabolism, including respiration, gluconeogenesis, and alternative carbon-source utilization genes. Consistent with a direct role in metabolic gene regulation, loss of either DBP2 or CYC8 results in increased cellular respiration rates. Furthermore, we find that corepressed genes have a propensity to be associated with overlapping long noncoding RNAs and that upregulation of these genes in the absence of DBP2 correlates with decreased binding of Cyc8 to these gene promoters. Taken together, this suggests that Dbp2 integrates nutrient availability with energy homeostasis by maintaining repression of glucose-repressed, Cyc8-targeted genes across the genome. Copyright © 2017 Wang et al.
The complete chloroplast genome sequence of the chlorophycean green alga Scenedesmus obliquus reveals a compact gene organization and a biased distribution of genes on the two DNA strands

PubMed Central

de Cambiaire, Jean-Charles; Otis, Christian; Lemieux, Claude; Turmel, Monique

2006-01-01

Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. While the basal position of the Prasinophyceae is well established, the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae (UTC) remains uncertain. The five complete chloroplast DNA (cpDNA) sequences currently available for representatives of these classes display considerable variability in overall structure, gene content, gene density, intron content and gene order. Among these genomes, that of the chlorophycean green alga Chlamydomonas reinhardtii has retained the least ancestral features. The two single-copy regions, which are separated from one another by the large inverted repeat (IR), have similar sizes, rather than unequal sizes, and differ radically in both gene contents and gene organizations relative to the single-copy regions of prasinophyte and ulvophyte cpDNAs. To gain insights into the various changes that underwent the chloroplast genome during the evolution of chlorophycean green algae, we have sequenced the cpDNA of Scenedesmus obliquus, a member of a distinct chlorophycean lineage. Results The 161,452 bp IR-containing genome of Scenedesmus features single-copy regions of similar sizes, encodes 96 genes, i.e. only two additional genes (infA and rpl12) relative to its Chlamydomonas homologue and contains seven group I and two group II introns. It is clearly more compact than the four UTC algal cpDNAs that have been examined so far, displays the lowest proportion of short repeats among these algae and shows a stronger bias in clustering of genes on the same DNA strand compared to Chlamydomonas cpDNA. Like the latter genome, Scenedesmus cpDNA displays only a few ancestral gene clusters. The two chlorophycean genomes share 11 gene clusters that are not found in previously sequenced trebouxiophyte and ulvophyte cpDNAs as well as a few genes that have an unusual structure; however, their single-copy regions differ considerably in gene content. Conclusion Our results underscore the remarkable plasticity of the chlorophycean chloroplast genome. Owing to this plasticity, only a sketchy portrait could be drawn for the chloroplast genome of the last common ancestor of Scenedesmus and Chlamydomonas. PMID:16638149
Comparative Analysis and Distribution of Omega-3 lcPUFA Biosynthesis Genes in Marine Molluscs

PubMed Central

Surm, Joachim M.; Prentis, Peter J.; Pavasovic, Ana

2015-01-01

Recent research has identified marine molluscs as an excellent source of omega-3 long-chain polyunsaturated fatty acids (lcPUFAs), based on their potential for endogenous synthesis of lcPUFAs. In this study we generated a representative list of fatty acyl desaturase (Fad) and elongation of very long-chain fatty acid (Elovl) genes from major orders of Phylum Mollusca, through the interrogation of transcriptome and genome sequences, and various publicly available databases. We have identified novel and uncharacterised Fad and Elovl sequences in the following species: Anadara trapezia, Nerita albicilla, Nerita melanotragus, Crassostrea gigas, Lottia gigantea, Aplysia californica, Loligo pealeii and Chlamys farreri. Based on alignments of translated protein sequences of Fad and Elovl genes, the haeme binding motif and histidine boxes of Fad proteins, and the histidine box and seventeen important amino acids in Elovl proteins, were highly conserved. Phylogenetic analysis of aligned reference sequences was used to reconstruct the evolutionary relationships for Fad and Elovl genes separately. Multiple, well resolved clades for both the Fad and Elovl sequences were observed, suggesting that repeated rounds of gene duplication best explain the distribution of Fad and Elovl proteins across the major orders of molluscs. For Elovl sequences, one clade contained the functionally characterised Elovl5 proteins, while another clade contained proteins hypothesised to have Elovl4 function. Additional well resolved clades consisted only of uncharacterised Elovl sequences. One clade from the Fad phylogeny contained only uncharacterised proteins, while the other clade contained functionally characterised delta-5 desaturase proteins. The discovery of an uncharacterised Fad clade is particularly interesting as these divergent proteins may have novel functions. Overall, this paper presents a number of novel Fad and Elovl genes suggesting that many mollusc groups possess most of the required enzymes for the synthesis of lcPUFAs. PMID:26308548
Increased expression of a set of genes enriched in oxygen binding function discloses a predisposition of breast cancer bone metastases to generate metastasis spread in multiple organs.

PubMed

Capulli, Mattia; Angelucci, Adriano; Driouch, Keltouma; Garcia, Teresa; Clement-Lacroix, Philippe; Martella, Francesco; Ventura, Luca; Bologna, Mauro; Flamini, Stefano; Moreschini, Oreste; Lidereau, Rosette; Ricevuto, Enrico; Muraca, Maurizio; Teti, Anna; Rucci, Nadia

2012-11-01

Bone is the preferential site of distant metastasis in breast carcinoma (BrCa). Patients with metastasis restricted to bone (BO) usually show a longer overall survival compared to patients who rapidly develop multiple metastases also involving liver and lung. Hence, molecular predisposition to generate bone and visceral metastases (BV) represents a clear indication of poor clinical outcome. We performed microarray analysis with two different chip platforms, Affymetrix and Agilent, on bone metastasis samples from BO and BV patients. The unsupervised hierarchical clustering of the resulting transcriptomes correlated with the clinical progression, segregating the BO from the BV profiles. Matching the twofold significantly regulated genes from Affymetrix and Agilent chips resulted in a 15-gene signature with 13 upregulated and two downregulated genes in BV versus BO bone metastasis samples. In order to validate the resulting signature, we isolated different MDA-MB-231 clonal subpopulations that metastasize only in the bone (MDA-BO) or in bone and visceral tissues (MDA-BV). Six of the signature genes were also significantly upregulated in MDA-BV compared to MDA-BO clones. A group of upregulated genes, including Hemoglobin B (HBB), were involved in oxygen metabolism, and in vitro functional analysis of HBB revealed that its expression in the MDA subpopulations was associated with a reduced production of hydrogen peroxide. Expression of HBB was detected in primary BrCa tissue but not in normal breast epithelial cells. Metastatic lymph nodes were frequently more positive for HBB compared to the corresponding primary tumors, whereas BO metastases had a lower expression than BV metastases, suggesting a positive correlation between HBB and ability of bone metastasis to rapidly spread to other organs. We propose that HBB, along with other genes involved in oxygen metabolism, confers a more aggressive metastatic phenotype in BrCa cells disseminated to bone. Copyright © 2012 American Society for Bone and Mineral Research.
Reserch of the gene polymorphism TOX3 / LOC643714 and the risk of breast cancer development in persons exposed to ionizing radiation after Chornobyl disaster.

PubMed

Polinyk, S I; Rybchenko, L A; Klimyk, B T

2017-12-01

The objective of this work was to identify and compare the polymorphism of the rs3803662 polymorphism of the TOX3/LOC643714 gene in breast cancer patients who have undergone ionizing radiation due to the Chornobyl accident and in patients without ionizing radiation (IR) in the history. The determination of the rs3803662 polymorphism of the TOX3/LOC643714 gene was per formed by polymerase chain reaction (PCR) in 83 patients with breast cancer: 42 subjects who were exposed to ion izing radiation due to the Chornobyl accident, 41 people without ionizing radiation in history and 17 controls in Ukraine without cancer pathology. In order to compare the obtained data on spontaneous and radiation associated breast cancer and to calculate the differences in the frequencies of alleles and the risk of oncopathology, data from literature on control groups of the populations of the Russian Federation, Sweden, and the United Kingdom were used. Comparing with the literature data and the group of exposed subjects, the homozygous carriers of the minor alleles of the TOX3/LOC643714 ТТ gene revealed an increased risk of developing breast cancer: OR = 2.89, p = 0.02 (CI 95% 1.17 7,16). In subjects without the influence of IR in history, the carrier of homozygous minor axis of the gene TOX3/LOC643714 ТТ is also associated with the risk of breast cancer: OR = 3.83, p = 0.0002 (CI 95% 0.82-14.14). In the homozygous carriers of the minor alleles of the TOX3 / LOC643714 gene exposed to IR, there was no increase in the risk of developing breast cancer (OR = 0.65, p = 0.46, CI 95% 0.21-2.04) compared with the con trol group of Ukrainian population. The carrier of homozygous minor alleles of the TOX3/LOC643714 gene is not a risk factor for the devel opment of breast cancer under conditions of exposure to ionizing radiation in the study group of the Ukrainian population. S. I. Polinyk, L. A. Rybchenko, B. T. Klimyk.
Genomewide annotation and comparative genomics of cytochrome P450 monooxygenases (P450s) in the polypore species Bjerkandera adusta, Ganoderma sp. and Phlebia brevispora.

PubMed

Syed, Khajamohiddin; Nelson, David R; Riley, Robert; Yadav, Jagjit S

2013-01-01

Genomewide annotation of cytochrome P450 monooxygenases (P450s) in three white-rot species of the fungal order Polyporales, namely Bjerkandera adusta, Ganoderma sp. and Phlebia brevispora, revealed a large contingent of P450 genes (P450ome) in their genomes. A total of 199 P450 genes in B. adusta and 209 P450 genes each in Ganoderma sp. and P. brevispora were identified. These P450omes were classified into families and subfamilies as follows: B. adusta (39 families, 86 subfamilies), Ganoderma sp. (41 families, 105 subfamilies) and P. brevispora (42 families, 111 subfamilies). Of note, the B. adusta genome lacked the CYP505 family (P450foxy), a group of P450-CPR fusion proteins. The three polypore species revealed differential enrichment of individual P450 families in their genomes. The largest CYP families in the three genomes were CYP5144 (67 P450s), CYP5359 (46 P450s) and CYP5344 (43 P450s) in B. adusta, Ganoderma sp. and P. brevispora, respectively. Our analyses showed that tandem gene duplications led to expansions in certain P450 families. An estimated 33% (72 P450s), 28% (55 P450s) and 23% (49 P450s) of P450ome genes were duplicated in P. brevispora, B. adusta and Ganoderma sp., respectively. Family-wise comparative analysis revealed that 22 CYP families are common across the three Polypore species. Comparative P450ome analysis with Ganoderma lucidum revealed the presence of 143 orthologs and 56 paralogs in Ganoderma sp. Multiple P450s were found near the characteristic biosynthetic genes for secondary metabolites, namely polyketide synthase (PKS), non-ribosomal peptide synthetase (NRPS), terpene cyclase and terpene synthase in the three genomes, suggesting a likely role of these P450s in secondary metabolism in these Polyporales. Overall, the three species had a richer P450 diversity both in terms of the P450 genes and P450 subfamilies as compared to the model white-rot and brown-rot polypore species Phanerochaete chrysosporium and Postia placenta.
Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals.

PubMed

Popova, Olga V; Mikhailov, Kirill V; Nikitin, Mikhail A; Logacheva, Maria D; Penin, Aleksey A; Muntyan, Maria S; Kedrova, Olga S; Petrov, Nikolai B; Panchin, Yuri V; Aleoshin, Vladimir V

2016-01-01

Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha-an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia.

Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals

PubMed Central

Popova, Olga V.; Mikhailov, Kirill V.; Nikitin, Mikhail A.; Logacheva, Maria D.; Penin, Aleksey A.; Muntyan, Maria S.; Kedrova, Olga S.; Petrov, Nikolai B.; Panchin, Yuri V.

2016-01-01

Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha—an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia. PMID:27755612
The mitochondrial genomes of Amphiascoides atopus and Schizopera knabeni (Harpacticoida: Miraciidae) reveal similarities between the copepod orders Harpacticoida and Poecilostomatoida.

PubMed

Easton, Erin E; Darrow, Emily M; Spears, Trisha; Thistle, David

2014-03-15

Members of subclass Copepoda are abundant, diverse, and-as a result of their variety of ecological roles in marine and freshwater environments-important, but their phylogenetic interrelationships are unclear. Recent studies of arthropods have used gene arrangements in the mitochondrial (mt) genome to infer phylogenies, but for copepods, only seven complete mt genomes have been published. These data revealed several within-order and few among-order similarities. To increase the data available for comparisons, we sequenced the complete mt genome (13,831base pairs) of Amphiascoides atopus and 10,649base pairs of the mt genome of Schizopera knabeni (both in the family Miraciidae of the order Harpacticoida). Comparison of our data to those for Tigriopus japonicus (family Harpacticidae, order Harpacticoida) revealed similarities in gene arrangement among these three species that were consistent with those found within and among families of other copepod orders. Comparison of the mt genomes of our species with those known from other copepod orders revealed the arrangement of mt genes of our Harpacticoida species to be more similar to that of Sinergasilus polycolpus (order Poecilostomatoida) than to that of T. japonicus. The similarities between S. polycolpus and our species are the first to be noted across the boundaries of copepod orders and support the possibility that mt-gene arrangement might be used to infer copepod phylogenies. We also found that our two species had extremely truncated transfer RNAs and that gene overlaps occurred much more frequently than has been reported for other copepod mt genomes. Published by Elsevier B.V.
Suppression of prolactin gene expression in GH cells correlates with site-specific DNA methylation.

PubMed

Zhang, Z X; Kumar, V; Rivera, R T; Pasion, S G; Chisholm, J; Biswas, D K

1989-10-01

Prolactin- (PRL) producing and nonproducing subclones of the GH line of (rat) pituitary tumor cells have been compared to elucidate the regulatory mechanisms of PRL gene expression. Particular emphasis was placed on delineating the molecular basis of the suppressed state of the PRL gene in the prolactin-nonproducing (PRL-) GH subclone (GH(1)2C1). We examined six methylatable cytosine residues (5, -CCGG- and 1, -GCGC-) within the 30-kb region of the PRL gene in these subclones. This analysis revealed that -CCGG-sequences of the transcribed region, and specifically, one in the fourth exon of the PRL gene, were heavily methylated in the PRL-, GH(1)2C1 cells. Furthermore, the inhibition of PRL gene expression in GH(1)2C1 was reversed by short-term treatment of the cells with a sublethal concentration of azacytidine (AzaC), an inhibitor of DNA methylation. The reversion of PRL gene expression by AzaC was correlated with the concurrent demethylation of the same -CCGG- sequences in the transcribed region of PRL gene. An inverse correlation between PRL gene expression and the level of methylation of the internal -C- residues in the specific -CCGG-sequence of the transcribed region of the PRL gene was demonstrated. The DNase I sensitivity of these regions of the PRL gene in PRL+, PRL-, and AzaC-treated cells was also consistent with an inverse relationship between methylation state, a higher order of structural modification, and gene expression.(ABSTRACT TRUNCATED AT 250 WORDS)
A framework for analyzing the relationship between gene expression and morphological, topological, and dynamical patterns in neuronal networks.

PubMed

de Arruda, Henrique Ferraz; Comin, Cesar Henrique; Miazaki, Mauro; Viana, Matheus Palhares; Costa, Luciano da Fontoura

2015-04-30

A key point in developmental biology is to understand how gene expression influences the morphological and dynamical patterns that are observed in living beings. In this work we propose a methodology capable of addressing this problem that is based on estimating the mutual information and Pearson correlation between the intensity of gene expression and measurements of several morphological properties of the cells. A similar approach is applied in order to identify effects of gene expression over the system dynamics. Neuronal networks were artificially grown over a lattice by considering a reference model used to generate artificial neurons. The input parameters of the artificial neurons were determined according to two distinct patterns of gene expression and the dynamical response was assessed by considering the integrate-and-fire model. As far as single gene dependence is concerned, we found that the interaction between the gene expression and the network topology, as well as between the former and the dynamics response, is strongly affected by the gene expression pattern. In addition, we observed a high correlation between the gene expression and some topological measurements of the neuronal network for particular patterns of gene expression. To our best understanding, there are no similar analyses to compare with. A proper understanding of gene expression influence requires jointly studying the morphology, topology, and dynamics of neurons. The proposed framework represents a first step towards predicting gene expression patterns from morphology and connectivity. Copyright © 2015. Published by Elsevier B.V.
The effect of polymorphism in gene of insulin-like growth factor-I on the serum periparturient concentration in Holstein dairy cows.

PubMed

Mirzaei, A; Sharifiyazdi, H; Ahmadi, M R; Ararooti, T; Ghasrodashti, A Rowshan; Kadivar, A

2012-10-01

To investigate the relationship between polymorphism within the 5'-untranslated region (5'-UTR) of IGF-I gene and its periparturient concentration in Iranian Holstein dairy cows. Blood samples (5 mL, n = 37) were collected by caudal venipuncture from each animal into sample tubes containing the EDTA and DNA was extracted from blood. In order to measure IGF-I concentration the collection of blood samples (n = 111) was also done at 14 d before calving (prepartum), 25 and 45 d postpartum. We found evidence for a significant effect of C to T mutation in position 512 of IGF-I gene on its serum concentration in dairy cows in Iran. Cows with CC genotype had significantly higher concentration (Mean±SD) of IGF-I at 14 d prepartum (91.8±18.1) µg/L compared to those with TT genotype (73.3±14.4) µg/L (P=0.04). A significant trend (quadratic) was found for IGF-I concentration, as higher in CC cows compared to ones with TT genotype, during the 14 d before calving to 45 d postpartum (P=0.01). We concluded that C/T transition in the promoter region of IGF-I gene can influence the serum concentration of IGF-I in periparturient dairy cows.
fabH deletion increases DHA production in Escherichia coli expressing Pfa genes.

PubMed

Giner-Robles, Laura; Lázaro, Beatriz; de la Cruz, Fernando; Moncalián, Gabriel

2018-06-08

Some marine bacteria, such as Moritella marina, produce the nutraceutical docosahexaenoic acid (DHA) thanks to a specific enzymatic complex called Pfa synthase. Escherichia coli heterologously expressing the pfa gene cluster from M. marina also produces DHA. The aim of this study was to find genetic or metabolic conditions to increase DHA production in E. coli. First, we analysed the effect of the antibiotic cerulenin, showing that DHA production increased twofold. Then, we tested a series of single gene knockout mutations affecting fatty acid biosynthesis, in order to optimize the synthesis of DHA. The most effective mutant, fabH, showed a threefold increase compared to wild type strain. The combination of cerulenin inhibition and fabH deletion rendered a 6.5-fold improvement compared to control strain. Both strategies seem to have the same mechanism of action, in which fatty acid synthesis via the canonical pathway (fab pathway) is affected in its first catalytic step, which allows the substrates to be used by the heterologous pathway to synthesize DHA. DHA-producing E. coli strain that carries a fabH gene deletion boosts DHA production by tuning down the competing canonical biosynthesis pathway. Our approach can be used for optimization of DHA production in different organisms.
A Streamlined Protocol for Molecular Testing of the DMD Gene within a Diagnostic Laboratory: A Combination of Array Comparative Genomic Hybridization and Bidirectional Sequence Analysis

PubMed Central

Marquis-Nicholson, Renate; Lai, Daniel; Love, Jennifer M.; Love, Donald R.

2013-01-01

Purpose. The aim of this study was to develop a streamlined mutation screening protocol for the DMD gene in order to confirm a clinical diagnosis of Duchenne or Becker muscular dystrophy in affected males and to clarify the carrier status of female family members. Methods. Sequence analysis and array comparative genomic hybridization (aCGH) were used to identify mutations in the dystrophin DMD gene. We analysed genomic DNA from six individuals with a range of previously characterised mutations and from eight individuals who had not previously undergone any form of molecular analysis. Results. We successfully identified the known mutations in all six patients. A molecular diagnosis was also made in three of the four patients with a clinical diagnosis who had not undergone prior genetic screening, and testing for familial mutations was successfully completed for the remaining four patients. Conclusion. The mutation screening protocol described here meets best practice guidelines for molecular testing of the DMD gene in a diagnostic laboratory. The aCGH method is a superior alternative to more conventional assays such as multiplex ligation-dependent probe amplification (MLPA). The combination of aCGH and sequence analysis will detect mutations in 98% of patients with the Duchenne or Becker muscular dystrophy. PMID:23476807
KERIS: kaleidoscope of gene responses to inflammation between species

PubMed Central

Li, Peng; Tompkins, Ronald G; Xiao, Wenzhong

2017-01-01

A cornerstone of modern biomedical research is the use of animal models to study disease mechanisms and to develop new therapeutic approaches. In order to help the research community to better explore the similarities and differences of genomic response between human inflammatory diseases and murine models, we developed KERIS: kaleidoscope of gene responses to inflammation between species (available at http://www.igenomed.org/keris/). As of June 2016, KERIS includes comparisons of the genomic response of six human inflammatory diseases (burns, trauma, infection, sepsis, endotoxin and acute respiratory distress syndrome) and matched mouse models, using 2257 curated samples from the Inflammation and the Host Response to Injury Glue Grant studies and other representative studies in Gene Expression Omnibus. A researcher can browse, query, visualize and compare the response patterns of genes, pathways and functional modules across different diseases and corresponding murine models. The database is expected to help biologists choosing models when studying the mechanisms of particular genes and pathways in a disease and prioritizing the translation of findings from disease models into clinical studies. PMID:27789704
Characterization of embryo-specific genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sung, Z.R.

1988-01-01

The objective of the proposed research is to characterize the structure and function of a set of genes whose expression is regulated in embryo development, and that are not expressed in mature tissues -- the embryogenic genes. In order to isolate these genes, we immunized a rabbit with total extracts of somatic embryos of carrot, and enriched the anti-embryo antiserum for antibodies reacting with extracts of carrot somatic embryos. Using this enriched antiserum, we screened a lambda gt11 cDNA library constructed from embryo poly A{sup +} RNA, and isolated 10 cDNA clones that detect embryogenic mRNAs. Monospecific antibodies have beenmore » purified for proteins corresponding to each cDNA sequence. Four cDNA clones were further characterized in terms of the expression of their corresponding mRNA and protein in somatic embryos of carrot. In some cases, comparable gene sequences or products have been detected in somatic and zygotic embryos of other plant species. The characteristics of these 4 cDNA clones -- clone Nos. 8, 59, and 66 -- are described in this report. 3 figs.« less
Comparative Genome and Proteome Analysis of Anopheles gambiae and Drosophila melanogaster

NASA Astrophysics Data System (ADS)

Zdobnov, Evgeny M.; von Mering, Christian; Letunic, Ivica; Torrents, David; Suyama, Mikita; Copley, Richard R.; Christophides, George K.; Thomasova, Dana; Holt, Robert A.; Subramanian, G. Mani; Mueller, Hans-Michael; Dimopoulos, George; Law, John H.; Wells, Michael A.; Birney, Ewan; Charlab, Rosane; Halpern, Aaron L.; Kokoza, Elena; Kraft, Cheryl L.; Lai, Zhongwu; Lewis, Suzanna; Louis, Christos; Barillas-Mury, Carolina; Nusskern, Deborah; Rubin, Gerald M.; Salzberg, Steven L.; Sutton, Granger G.; Topalis, Pantelis; Wides, Ron; Wincker, Patrick; Yandell, Mark; Collins, Frank H.; Ribeiro, Jose; Gelbart, William M.; Kafatos, Fotis C.; Bork, Peer

2002-10-01

Comparison of the genomes and proteomes of the two diptera Anopheles gambiae and Drosophila melanogaster, which diverged about 250 million years ago, reveals considerable similarities. However, numerous differences are also observed; some of these must reflect the selection and subsequent adaptation associated with different ecologies and life strategies. Almost half of the genes in both genomes are interpreted as orthologs and show an average sequence identity of about 56%, which is slightly lower than that observed between the orthologs of the pufferfish and human (diverged about 450 million years ago). This indicates that these two insects diverged considerably faster than vertebrates. Aligned sequences reveal that orthologous genes have retained only half of their intron/exon structure, indicating that intron gains or losses have occurred at a rate of about one per gene per 125 million years. Chromosomal arms exhibit significant remnants of homology between the two species, although only 34% of the genes colocalize in small ``microsyntenic'' clusters, and major interarm transfers as well as intra-arm shuffling of gene order are detected.
A Comparative Encyclopedia of DNA Elements in the Mouse Genome

PubMed Central

Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D.; Shen, Yin; Pervouchine, Dmitri D.; Djebali, Sarah; Thurman, Bob; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K.; Williams, Brian A.; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M. A.; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T.; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D.; Bansal, Mukul S.; Keller, Cheryl A.; Morrissey, Christapher S.; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S.; Cayting, Philip; Kawli, Trupti; Boyle, Alan P.; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S.; Cline, Melissa S.; Erickson, Drew T.; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A.; Rosenbloom, Kate R.; de Sousa, Beatriz Lacerda; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W. James; Santos, Miguel Ramalho; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J.; Wilken, Matthew S.; Reh, Thomas A.; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P.; Neph, Shane; Humbert, Richard; Hansen, R. Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E.; Orkin, Stuart H.; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J.; Blobel, Gerd A.; Good, Peter J.; Lowdon, Rebecca F.; Adams, Leslie B.; Zhou, Xiao-Qiao; Pazin, Michael J.; Feingold, Elise A.; Wold, Barbara; Taylor, James; Kellis, Manolis; Mortazavi, Ali; Weissman, Sherman M.; Stamatoyannopoulos, John; Snyder, Michael P.; Guigo, Roderic; Gingeras, Thomas R.; Gilbert, David M.; Hardison, Ross C.; Beer, Michael A.; Ren, Bing

2014-01-01

Summary As the premier model organism in biomedical research, the laboratory mouse shares the majority of protein-coding genes with humans, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications, and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of other sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases. PMID:25409824
A comparative encyclopedia of DNA elements in the mouse genome.

PubMed

Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D; Shen, Yin; Pervouchine, Dmitri D; Djebali, Sarah; Thurman, Robert E; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K; Williams, Brian A; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M A; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D; Bansal, Mukul S; Kellis, Manolis; Keller, Cheryl A; Morrissey, Christapher S; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S; Cayting, Philip; Kawli, Trupti; Boyle, Alan P; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S; Cline, Melissa S; Erickson, Drew T; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A; Rosenbloom, Kate R; Lacerda de Sousa, Beatriz; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W James; Ramalho Santos, Miguel; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J; Wilken, Matthew S; Reh, Thomas A; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P; Neph, Shane; Humbert, Richard; Hansen, R Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E; Orkin, Stuart H; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J; Blobel, Gerd A; Cao, Xiaoyi; Zhong, Sheng; Wang, Ting; Good, Peter J; Lowdon, Rebecca F; Adams, Leslie B; Zhou, Xiao-Qiao; Pazin, Michael J; Feingold, Elise A; Wold, Barbara; Taylor, James; Mortazavi, Ali; Weissman, Sherman M; Stamatoyannopoulos, John A; Snyder, Michael P; Guigo, Roderic; Gingeras, Thomas R; Gilbert, David M; Hardison, Ross C; Beer, Michael A; Ren, Bing

2014-11-20

The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Evaluation of the mutagenicity and carcinogenicity of motor vehicle emissions in short-term bioassays.

PubMed Central

Lewtas, J

1983-01-01

Incomplete combustion of fuel in motor vehicles results in the emission of submicron carbonaceous particles which, after cooling and dilution, contain varying quantities of extractable organic constituents. These organics are mutagenic in bacteria. Confirmatory bioassays in mammalian cells provide the capability of detecting chromosomal and DNA damage in addition to gene mutations. In order to evaluate the mutagenicity of these organics in mammalian cells, extractable organics from particle emissions from several diesel and gasoline vehicles were compared in a battery of microbial, mammalian cell and in vivo bioassays. The mammalian cell mutagenicity bioassays were selected to detect gene mutations, DNA damage, and chromosomal effects. Carcinogenesis bioassays conducted included short-term assays for oncogenic transformation and skin tumorigenesis. The results in different assay systems are compared both qualitatively and quantitatively. Good quantitative correlations were observed between several mutagenesis and carcinogenesis bioassays for this series of diesel and gasoline emissions. PMID:6186475
Plant comparative genetics after 10 years.

PubMed

Gale, M D; Devos, K M

1998-10-23

The past 10 years have seen the discovery of unexpected levels of conservation of gene content and gene orders over millions of years of evolution within grasses, crucifers, legumes, some trees, and Solanaceae crops. Within the grasses, which include the three 500-million-ton-plus-per-year crops (wheat, maize, and rice), and the crucifers, which include all the Brassica crops, colinearity looks good enough to do most map-based cloning only in the small genome model species, rice and Arabidopsis. Elsewhere, knowledge gained in a few major crops is being pooled and applied across the board. The extrapolation of information from the well-studied species to orphan crops, which include many tropical species, is providing a solid base for their improvement. Genome rearrangements are giving new insights into evolution. In fact, comparative genetics is the key that will unlock the secrets of crop plants with genomes larger than that of humans.
Agave: a biofuel feedstock for arid and semi-arid environments

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gross, Stephen; Martin, Jeffrey; Simpson, June

2011-05-31

Efficient production of plant-based, lignocellulosic biofuels relies upon continued improvement of existing biofuel feedstock species, as well as the introduction of newfeedstocks capable of growing on marginal lands to avoid conflicts with existing food production and minimize use of water and nitrogen resources. To this end, specieswithin the plant genus Agave have recently been proposed as new biofuel feedstocks. Many Agave species are adapted to hot and arid environments generally unsuitable forfood production, yet have biomass productivity rates comparable to other second-generation biofuel feedstocks such as switchgrass and Miscanthus. Agavesachieve remarkable heat tolerance and water use efficiency in part throughmore » a Crassulacean Acid Metabolism (CAM) mode of photosynthesis, but the genes andregulatory pathways enabling CAM and thermotolerance in agaves remain poorly understood. We seek to accelerate the development of agave as a new biofuelfeedstock through genomic approaches using massively-parallel sequencing technologies. First, we plan to sequence the transcriptome of A. tequilana to provide adatabase of protein-coding genes to the agave research community. Second, we will compare transcriptome-wide gene expression of agaves under different environmentalconditions in order to understand genetic pathways controlling CAM, water use efficiency, and thermotolerance. Finally, we aim to compare the transcriptome of A.tequilana with that of other Agave species to gain further insight into molecular mechanisms underlying traits desirable for biofuel feedstocks. These genomicapproaches will provide sequence and gene expression information critical to the breeding and domestication of Agave species suitable for biofuel production.« less
Methylation of the oxytocin receptor gene in clinically depressed patients compared to controls: The role of OXTR rs53576 genotype.

PubMed

Reiner, I; Van IJzendoorn, M H; Bakermans-Kranenburg, M J; Bleich, S; Beutel, M; Frieling, H

2015-06-01

The emerging field of epigenetics provides a biological basis for gene-environment interactions relevant to depression. We focus on DNA methylation of exon 1 and 2 of the oxytocin receptor gene (OXTR) promoter. The research aims of the current study were to compare OXTR DNA methylation of depressed patients with healthy control subjects and to investigate possible influences of the OXTR rs53576 genotype. The sample of the present study consisted of 43 clinically depressed women recruited from a psychosomatic inpatient unit and 42 healthy, female control subjects - mean age 30 years (SD = 9). DNA methylation profiles of the OXTR gene were assessed from leukocyte DNA by means of bisulfite sequencing. Depressed female patients had decreased OXTR exon 1 DNA methylation compared to non-depressed women. The association between depression and methylation level was moderated by OXTR rs53576 genotype. Exon 2 methylation was associated with OXTR rs53576 genotype but not with depression. Our findings suggest exon-specific methylation mechanisms. Exon 1 methylation appears to be associated with depressive phenotypes whereas exon 2 methylation is influenced by genotype. Previously reported divergent associations between OXTR genotype and depression might be explained by varying exon 1 methylation. In order to further understand the etiology of depression, research on the interplay between genotype, environmental influences and exon-specific methylation patterns is needed. Copyright © 2015 Elsevier Ltd. All rights reserved.
Leveraging network analytics to infer patient syndrome and identify causal genes in rare disease cases.

PubMed

Krämer, Andreas; Shah, Sohela; Rebres, Robert Anthony; Tang, Susan; Richards, Daniel Rene

2017-08-11

Next-generation sequencing is widely used to identify disease-causing variants in patients with rare genetic disorders. Identifying those variants from whole-genome or exome data can be both scientifically challenging and time consuming. A significant amount of time is spent on variant annotation, and interpretation. Fully or partly automated solutions are therefore needed to streamline and scale this process. We describe Phenotype Driven Ranking (PDR), an algorithm integrated into Ingenuity Variant Analysis, that uses observed patient phenotypes to prioritize diseases and genes in order to expedite causal-variant discovery. Our method is based on a network of phenotype-disease-gene relationships derived from the QIAGEN Knowledge Base, which allows for efficient computational association of phenotypes to implicated diseases, and also enables scoring and ranking. We have demonstrated the utility and performance of PDR by applying it to a number of clinical rare-disease cases, where the true causal gene was known beforehand. It is also shown that PDR compares favorably to a representative alternative tool.
Eotaxin-3 and a uniquely conserved gene-expression profile in eosinophilic esophagitis

PubMed Central

Blanchard, Carine; Wang, Ning; Stringer, Keith F.; Mishra, Anil; Fulkerson, Patricia C.; Abonia, J. Pablo; Jameson, Sean C.; Kirby, Cassie; Konikoff, Michael R.; Collins, Margaret H.; Cohen, Mitchell B.; Akers, Rachel; Hogan, Simon P.; Assa’ad, Amal H.; Putnam, Philip E.; Aronow, Bruce J.; Rothenberg, Marc E.

2006-01-01

Eosinophilic esophagitis (EE) is an emerging disorder with a poorly understood pathogenesis. In order to define disease mechanisms, we took an empirical approach analyzing esophageal tissue by a genome-wide microarray expression analysis. EE patients had a striking transcript signature involving 1% of the human genome that was remarkably conserved across sex, age, and allergic status and was distinct from that associated with non-EE chronic esophagitis. Notably, the gene encoding the eosinophil-specific chemoattractant eotaxin-3 (also known as CCL26) was the most highly induced gene in EE patients compared with its expression level in healthy individuals. Esophageal eotaxin-3 mRNA and protein levels strongly correlated with tissue eosinophilia and mastocytosis. Furthermore, a single-nucleotide polymorphism in the human eotaxin-3 gene was associated with disease susceptibility. Finally, mice deficient in the eotaxin receptor (also known as CCR3) were protected from experimental EE. These results implicate eotaxin-3 as a critical effector molecule for EE and provide insight into disease pathogenesis. PMID:16453027
minet: A R/Bioconductor package for inferring large transcriptional networks using mutual information.

PubMed

Meyer, Patrick E; Lafitte, Frédéric; Bontempi, Gianluca

2008-10-29

This paper presents the R/Bioconductor package minet (version 1.1.6) which provides a set of functions to infer mutual information networks from a dataset. Once fed with a microarray dataset, the package returns a network where nodes denote genes, edges model statistical dependencies between genes and the weight of an edge quantifies the statistical evidence of a specific (e.g transcriptional) gene-to-gene interaction. Four different entropy estimators are made available in the package minet (empirical, Miller-Madow, Schurmann-Grassberger and shrink) as well as four different inference methods, namely relevance networks, ARACNE, CLR and MRNET. Also, the package integrates accuracy assessment tools, like F-scores, PR-curves and ROC-curves in order to compare the inferred network with a reference one. The package minet provides a series of tools for inferring transcriptional networks from microarray data. It is freely available from the Comprehensive R Archive Network (CRAN) as well as from the Bioconductor website.
Phylogeny and mitochondrial gene order variation in Lophotrochozoa in the light of new mitogenomic data from Nemertea

PubMed Central

Podsiadlowski, Lars; Braband, Anke; Struck, Torsten H; von Döhren, Jörn; Bartolomaeus, Thomas

2009-01-01

Background The new animal phylogeny established several taxa which were not identified by morphological analyses, most prominently the Ecdysozoa (arthropods, roundworms, priapulids and others) and Lophotrochozoa (molluscs, annelids, brachiopods and others). Lophotrochozoan interrelationships are under discussion, e.g. regarding the position of Nemertea (ribbon worms), which were discussed to be sister group to e.g. Mollusca, Brachiozoa or Platyhelminthes. Mitochondrial genomes contributed well with sequence data and gene order characters to the deep metazoan phylogeny debate. Results In this study we present the first complete mitochondrial genome record for a member of the Nemertea, Lineus viridis. Except two trnP and trnT, all genes are located on the same strand. While gene order is most similar to that of the brachiopod Terebratulina retusa, sequence based analyses of mitochondrial genes place nemerteans close to molluscs, phoronids and entoprocts without clear preference for one of these taxa as sister group. Conclusion Almost all recent analyses with large datasets show good support for a taxon comprising Annelida, Mollusca, Brachiopoda, Phoronida and Nemertea. But the relationships among these taxa vary between different studies. The analysis of gene order differences gives evidence for a multiple independent occurrence of a large inversion in the mitochondrial genome of Lophotrochozoa and a re-inversion of the same part in gastropods. We hypothesize that some regions of the genome have a higher chance for intramolecular recombination than others and gene order data have to be analysed carefully to detect convergent rearrangement events. PMID:19660126

The mitochondrial genome sequence of Enterobius vermicularis (Nematoda: Oxyurida)--an idiosyncratic gene order and phylogenetic information for chromadorean nematodes.

PubMed

Kang, Seokha; Sultana, Tahera; Eom, Keeseon S; Park, Yung Chul; Soonthornpong, Nathan; Nadler, Steven A; Park, Joong-Ki

2009-01-15

The complete mitochondrial genome sequence was determined for the human pinworm Enterobius vermicularis (Oxyurida: Nematoda) and used to infer its phylogenetic relationship to other major groups of chromadorean nematodes. The E. vermicularis genome is a 14,010-bp circular DNA molecule that encodes 36 genes (12 proteins, 22 tRNAs, and 2 rRNAs). This mtDNA genome lacks atp8, as reported for almost all other nematode species investigated. Phylogenetic analyses (maximum parsimony, maximum likelihood, neighbor joining, and Bayesian inference) of nucleotide sequences for the 12 protein-coding genes of 25 nematode species placed E. vermicularis, a representative of the order Oxyurida, as sister to the main Ascaridida+Rhabditida group. Tree topology comparisons using statistical tests rejected an alternative hypothesis favoring a closer relationship among Ascaridida, Spirurida, and Oxyurida, which has been supported from most studies based on nuclear ribosomal DNA sequences. Unlike the relatively conserved gene arrangement found for most chromadorean taxa, E. vermicularis mtDNA gene order is very unique, not sharing similarity to any other nematode species reported to date. This lack of gene order similarity may represent idiosyncratic gene rearrangements unique to this specific lineage of the oxyurids. To more fully understand the extent of gene rearrangement and its evolutionary significance within the nematode phylogenetic framework, additional mitochondrial genomes representing a greater evolutionary diversity of species must be characterized.
SORBS2 and TLR3 induce premature senescence in primary human fibroblasts and keratinocytes

PubMed Central

2013-01-01

Background Genetic aberrations are required for the progression of HPV-induced cervical precancers. A prerequisite for clonal expansion of cancer cells is unlimited proliferative capacity. In a cell culture model for cervical carcinogenesis loss of genes located on chromosome 4q35→qter and chromosome 10p14-p15 were found to be associated with escape from senescence. Moreover, by LOH and I-FISH analyses a higher frequency of allele loss of these regions was also observed in cervical carcinomas as compared to CIN3. The aim of this study was to identify candidate senescence-related genes located on chromosome 4q35→qter and chromosome 10p14-p15 which may contribute to clonal expansion at the transition of CIN3 to cancer. Methods Microarray expression analyses were used to identify candidate genes down-regulated in cervical carcinomas as compared to CIN3. In order to relate these genes with the process of senescence their respective cDNAs were overexpressed in HPV16-immortalized keratinocytes as well as in primary human fibroblasts and keratinocytes using lentivirus mediated gene transduction. Results Overall fifteen genes located on chromosome 4q35→qter and chromosome 10p14-p15 were identified. Ten of these genes could be validated in biopsies by RT-PCR. Of interest is the novel finding that SORBS2 and TLR3 can induce senescence in primary human fibroblasts and keratinocytes but not in HPV-immortalized cell lines. Intriguingly, the endogenous expression of both genes increases during finite passaging of primary keratinocytes in vitro. Conclusions The relevance of the genes SORBS2 and TLR3 in the process of cellular senescence warrants further investigation. In ongoing experiments we are investigating whether this increase in gene expression is also characteristic of replicative senescence. PMID:24165198
In silico analysis of expressed sequence tags from Trichostrongylus vitrinus (Nematoda): comparison of the automated ESTExplorer workflow platform with conventional database searches.

PubMed

Nagaraj, Shivashankar H; Gasser, Robin B; Nisbet, Alasdair J; Ranganathan, Shoba

2008-01-01

The analysis of expressed sequence tags (EST) offers a rapid and cost effective approach to elucidate the transcriptome of an organism, but requires several computational methods for assembly and annotation. Researchers frequently analyse each step manually, which is laborious and time consuming. We have recently developed ESTExplorer, a semi-automated computational workflow system, in order to achieve the rapid analysis of EST datasets. In this study, we evaluated EST data analysis for the parasitic nematode Trichostrongylus vitrinus (order Strongylida) using ESTExplorer, compared with database matching alone. We functionally annotated 1776 ESTs obtained via suppressive-subtractive hybridisation from T. vitrinus, an important parasitic trichostrongylid of small ruminants. Cluster and comparative genomic analyses of the transcripts using ESTExplorer indicated that 290 (41%) sequences had homologues in Caenorhabditis elegans, 329 (42%) in parasitic nematodes, 202 (28%) in organisms other than nematodes, and 218 (31%) had no significant match to any sequence in the current databases. Of the C. elegans homologues, 90 were associated with 'non-wildtype' double-stranded RNA interference (RNAi) phenotypes, including embryonic lethality, maternal sterility, sterile progeny, larval arrest and slow growth. We could functionally classify 267 (38%) sequences using the Gene Ontologies (GO) and establish pathway associations for 230 (33%) sequences using the Kyoto Encyclopedia of Genes and Genomes (KEGG). Further examination of this EST dataset revealed a number of signalling molecules, proteases, protease inhibitors, enzymes, ion channels and immune-related genes. In addition, we identified 40 putative secreted proteins that could represent potential candidates for developing novel anthelmintics or vaccines. We further compared the automated EST sequence annotations, using ESTExplorer, with database search results for individual T. vitrinus ESTs. ESTExplorer reliably and rapidly annotated 301 ESTs, with pathway and GO information, eliminating 60 low quality hits from database searches. We evaluated the efficacy of ESTExplorer in analysing EST data, and demonstrate that computational tools can be used to accelerate the process of gene discovery in EST sequencing projects. The present study has elucidated sets of relatively conserved and potentially novel genes for biological investigation, and the annotated EST set provides further insight into the molecular biology of T. vitrinus, towards the identification of novel drug targets.
Applying dynamic Bayesian networks to perturbed gene expression data.

PubMed

Dojer, Norbert; Gambin, Anna; Mizera, Andrzej; Wilczyński, Bartek; Tiuryn, Jerzy

2006-05-08

A central goal of molecular biology is to understand the regulatory mechanisms of gene transcription and protein synthesis. Because of their solid basis in statistics, allowing to deal with the stochastic aspects of gene expressions and noisy measurements in a natural way, Bayesian networks appear attractive in the field of inferring gene interactions structure from microarray experiments data. However, the basic formalism has some disadvantages, e.g. it is sometimes hard to distinguish between the origin and the target of an interaction. Two kinds of microarray experiments yield data particularly rich in information regarding the direction of interactions: time series and perturbation experiments. In order to correctly handle them, the basic formalism must be modified. For example, dynamic Bayesian networks (DBN) apply to time series microarray data. To our knowledge the DBN technique has not been applied in the context of perturbation experiments. We extend the framework of dynamic Bayesian networks in order to incorporate perturbations. Moreover, an exact algorithm for inferring an optimal network is proposed and a discretization method specialized for time series data from perturbation experiments is introduced. We apply our procedure to realistic simulations data. The results are compared with those obtained by standard DBN learning techniques. Moreover, the advantages of using exact learning algorithm instead of heuristic methods are analyzed. We show that the quality of inferred networks dramatically improves when using data from perturbation experiments. We also conclude that the exact algorithm should be used when it is possible, i.e. when considered set of genes is small enough.
A HaemAtlas: characterizing gene expression in differentiated human blood cells.

PubMed

Watkins, Nicholas A; Gusnanto, Arief; de Bono, Bernard; De, Subhajyoti; Miranda-Saavedra, Diego; Hardie, Debbie L; Angenent, Will G J; Attwood, Antony P; Ellis, Peter D; Erber, Wendy; Foad, Nicola S; Garner, Stephen F; Isacke, Clare M; Jolley, Jennifer; Koch, Kerstin; Macaulay, Iain C; Morley, Sarah L; Rendon, Augusto; Rice, Kate M; Taylor, Niall; Thijssen-Timmer, Daphne C; Tijssen, Marloes R; van der Schoot, C Ellen; Wernisch, Lorenz; Winzer, Thilo; Dudbridge, Frank; Buckley, Christopher D; Langford, Cordelia F; Teichmann, Sarah; Göttgens, Berthold; Ouwehand, Willem H

2009-05-07

Hematopoiesis is a carefully controlled process that is regulated by complex networks of transcription factors that are, in part, controlled by signals resulting from ligand binding to cell-surface receptors. To further understand hematopoiesis, we have compared gene expression profiles of human erythroblasts, megakaryocytes, B cells, cytotoxic and helper T cells, natural killer cells, granulocytes, and monocytes using whole genome microarrays. A bioinformatics analysis of these data was performed focusing on transcription factors, immunoglobulin superfamily members, and lineage-specific transcripts. We observed that the numbers of lineage-specific genes varies by 2 orders of magnitude, ranging from 5 for cytotoxic T cells to 878 for granulocytes. In addition, we have identified novel coexpression patterns for key transcription factors involved in hematopoiesis (eg, GATA3-GFI1 and GATA2-KLF1). This study represents the most comprehensive analysis of gene expression in hematopoietic cells to date and has identified genes that play key roles in lineage commitment and cell function. The data, which are freely accessible, will be invaluable for future studies on hematopoiesis and the role of specific genes and will also aid the understanding of the recent genome-wide association studies.
A HaemAtlas: characterizing gene expression in differentiated human blood cells

PubMed Central

Gusnanto, Arief; de Bono, Bernard; De, Subhajyoti; Miranda-Saavedra, Diego; Hardie, Debbie L.; Angenent, Will G. J.; Attwood, Antony P.; Ellis, Peter D.; Erber, Wendy; Foad, Nicola S.; Garner, Stephen F.; Isacke, Clare M.; Jolley, Jennifer; Koch, Kerstin; Macaulay, Iain C.; Morley, Sarah L.; Rendon, Augusto; Rice, Kate M.; Taylor, Niall; Thijssen-Timmer, Daphne C.; Tijssen, Marloes R.; van der Schoot, C. Ellen; Wernisch, Lorenz; Winzer, Thilo; Dudbridge, Frank; Buckley, Christopher D.; Langford, Cordelia F.; Teichmann, Sarah; Göttgens, Berthold; Ouwehand, Willem H.

2009-01-01

Hematopoiesis is a carefully controlled process that is regulated by complex networks of transcription factors that are, in part, controlled by signals resulting from ligand binding to cell-surface receptors. To further understand hematopoiesis, we have compared gene expression profiles of human erythroblasts, megakaryocytes, B cells, cytotoxic and helper T cells, natural killer cells, granulocytes, and monocytes using whole genome microarrays. A bioinformatics analysis of these data was performed focusing on transcription factors, immunoglobulin superfamily members, and lineage-specific transcripts. We observed that the numbers of lineage-specific genes varies by 2 orders of magnitude, ranging from 5 for cytotoxic T cells to 878 for granulocytes. In addition, we have identified novel coexpression patterns for key transcription factors involved in hematopoiesis (eg, GATA3-GFI1 and GATA2-KLF1). This study represents the most comprehensive analysis of gene expression in hematopoietic cells to date and has identified genes that play key roles in lineage commitment and cell function. The data, which are freely accessible, will be invaluable for future studies on hematopoiesis and the role of specific genes and will also aid the understanding of the recent genome-wide association studies. PMID:19228925
DOE Office of Scientific and Technical Information (OSTI.GOV)

Price, Morgan N.; Arkin, Adam P.; Alm, Eric J.

Operons are a major feature of all prokaryotic genomes, but how and why operon structures vary is not well understood. To elucidate the life-cycle of operons, we compared gene order between Escherichia coli K12 and its relatives and identified the recently formed and destroyed operons in E. coli. This allowed us to determine how operons form, how they become closely spaced, and how they die. Our findings suggest that operon evolution is driven by selection on gene expression patterns. First, both operon creation and operon destruction lead to large changes in gene expression patterns. For example, the removal of lysAmore » and ruvA from ancestral operons that contained essential genes allowed their expression to respond to lysine levels and DNA damage, respectively. Second, some operons have undergone accelerated evolution, with multiple new genes being added during a brief period. Third, although most operons are closely spaced because of a neutral bias towards deletion and because of selection against large overlaps, highly expressed operons tend to be widely spaced because of regulatory fine-tuning by intervening sequences. Although operon evolution seems to be adaptive, it need not be optimal: new operons often comprise functionally unrelated genes that were already in proximity before the operon formed.« less
Identification of differentially regulated genes in human patent ductus arteriosus

PubMed Central

Parikh, Pratik; Bai, Haiqing; Swartz, Michael F; Alfieris, George M

2016-01-01

In order to identify differentially expressed genes that are specific to the ductus arteriosus, 18 candidate genes were evaluated in matched ductus arteriosus and aortic samples from infants with coarctation of the aorta. The cell specificity of the gene's promoters was assessed by performing transient transfection studies in primary cells derived from several patients. Segments of ductus arteriosus and aorta were isolated from infants requiring repair for coarctation of the aorta and used for mRNA quantitation and culturing of cells. Differences in expression were determined by quantitative PCR using the ΔΔCt method. Promoter regions of six of these genes were cloned into luciferase reporter plasmids for transient transfection studies in matched human ductus arteriosus and aorta cells. Transcription factor AP-2b and phospholipase A2 were significantly up-regulated in ductus arteriosus compared to aorta in whole tissues and cultured cells, respectively. In transient transfection experiments, Angiotensin II type 1 receptor and Prostaglandin E receptor 4 promoters consistently gave higher expression in matched ductus arteriosus versus aorta cells from multiple patients. Taken together, these results demonstrate that several genes are differentially expressed in ductus arteriosus and that their promoters may be used to drive ductus arteriosus-enriched transgene expression. PMID:27465141
Frequent genomic imbalances suggest commonly altered tumour genes in human hepatocarcinogenesis

PubMed Central

Niketeghad, F; Decker, H J; Caselmann, W H; Lund, P; Geissler, F; Dienes, H P; Schirmacher, P

2001-01-01

Hepatocellular carcinoma (HCC) is one of the most frequent-occurring malignant tumours worldwide, but molecular changes of tumour DNA, with the exception of viral integrations and p53 mutations, are poorly understood. In order to search for common macro-imbalances of genomic tumour DNA, 21 HCCs and 3 HCC-cell lines were characterized by comparative genomic hybridization (CGH), subsequent database analyses and in selected cases by fluorescence in situ hybridization (FISH). Chromosomal subregions of 1q, 8q, 17q and 20q showed frequent gains of genomic material, while losses were most prevalent in subregions of 4q, 6q, 13q and 16q. Deleted regions encompass tumour suppressor genes, like RB-1 and the cadherin gene cluster, some of them previously identified as potential target genes in HCC development. Several potential growth- or transformation-promoting genes located in chromosomal subregions showed frequent gains of genomic material. The present study provides a basis for further genomic and expression analyses in HCCs and in addition suggests chromosome 4q to carry a so far unidentified tumour suppressor gene relevant for HCC development. © 2001 Cancer Research Campaign http://www.bjcancer.com PMID:11531255
Improved luciferase gene expression using ultrasound targeted microbubble destruction therapy in swine

NASA Astrophysics Data System (ADS)

Noble, Misty L.; Song, Shuxian; Sun, Ryan R.; Fan, Luping; DiBlasi, Robert M.; O'Kelly-Priddy, Colleen; Loeb, Keith R.; Miao, Carol H.

2012-11-01

Ultrasound (US) targeted microbubble (MB) destruction (UTMD) has been shown to be an effective method in delivering drugs and plasmid DNA (pDNA) into cells. We previously reported successful gene transfection of a reporter luciferase gene, pGL4, into livers of mice and rats using UTMD. The challenge is to translate and achieve similar gene expression in large animals, like swine, where the treated tissue volume is substantially larger. The scale-up study requires proportionally increased amount of pDNA/MBs delivered to tissues and an equivalent increase in US energy. We use different MBs and surgical strategies to retain most of pDNA/MB locally during US application in order to maximize the effect of UTMD in gene transfection. Our results show significant increase in luciferase expression in swine injected with MBs and exposed to 2.7 MPa US. We obtained up to 1800-fold enhancement in the pig experiment using Definity® MBs, and 2000-fold and 6300-fold enhancement in two pig studies using RN18 MBs compared to sham. These results represent an important developmental step towards US mediated gene delivery in large animals and clinical trials.
Comparative genomic analysis of the Tribolium immune system

PubMed Central

Zou, Zhen; Evans, Jay D; Lu, Zhiqiang; Zhao, Picheng; Williams, Michael; Sumathipala, Niranji; Hetru, Charles; Hultmark, Dan; Jiang, Haobo

2007-01-01

Background Tribolium castaneum is a species of Coleoptera, the largest and most diverse order of all eukaryotes. Components of the innate immune system are hardly known in this insect, which is in a key phylogenetic position to inform us about genetic innovations accompanying the evolution of holometabolous insects. We have annotated immunity-related genes and compared them with homologous molecules from other species. Results Around 300 candidate defense proteins are identified based on sequence similarity to homologs known to participate in immune responses. In most cases, paralog counts are lower than those of Drosophila melanogaster or Anopheles gambiae but are substantially higher than those of Apis mellifera. The genome contains probable orthologs for nearly all members of the Toll, IMD, and JAK/STAT pathways. While total numbers of the clip-domain serine proteinases are approximately equal in the fly (29), mosquito (32) and beetle (30), lineage-specific expansion of the family is discovered in all three species. Sixteen of the thirty-one serpin genes form a large cluster in a 50 kb region that resulted from extensive gene duplications. Among the nine Toll-like proteins, four are orthologous to Drosophila Toll. The presence of scavenger receptors and other related proteins indicates a role of cellular responses in the entire system. The structures of some antimicrobial peptides drastically differ from those in other orders of insects. Conclusion A framework of information on Tribolium immunity is established, which may serve as a stepping stone for future genetic analyses of defense responses in a nondrosophiline genetic model insect. PMID:17727709
Differential protein modulation by ketoprofen and ibuprofen underlines different cellular response by gastric epithelium.

PubMed

Brandolini, Laura; d'Angelo, Michele; Antonosante, Andrea; Villa, Sara; Cristiano, Loredana; Castelli, Vanessa; Benedetti, Elisabetta; Catanesi, Mariano; Aramini, Andrea; Luini, Alberto; Parashuraman, Seetharaman; Mayo, Emilia; Giordano, Antonio; Cimini, Annamaria; Allegretti, Marcello

2018-03-01

Ketoprofen L-lysine salt (KLS), is widely used due to its analgesic efficacy and tolerability, and L-lysine was reported to increase the solubility and the gastric tolerance of ketoprofen. In a recent report, L-lysine salification has been shown to exert a gastroprotective effect due to its specific ability to counteract the NSAIDs-induced oxidative stress and up-regulate gastroprotective proteins. In order to derive further insights into the safety and efficacy profile of KLS, in this study we additionally compared the effect of lysine and arginine, another amino acid counterion commonly used for NSAIDs salification, in control and in ethanol challenged human gastric mucosa model. KLS is widely used for the control of post-surgical pain and for the management of pain and fever in inflammatory conditions in children and adults. It is generally well tolerated in pediatric patients, and data from three studies in >900 children indicate that oral administration is well tolerated when administered for up to 3 weeks after surgery. Since only few studies have so far investigated the effect of ketoprofen on gastric mucosa maintenance and adaptive mechanisms, in the second part of the study we applied the cMap approach to compare ketoprofen-induced and ibuprofen-induced gene expression profiles in order to explore compound-specific targeted biological pathways. Among the several genes exclusively modulated by ketoprofen, our attention was particularly focused on genes involved in the maintenance of gastric mucosa barrier integrity (cell junctions, morphology, and viability). The hypothesis was further validated by Real-time PCR. © 2017 Wiley Periodicals, Inc.
Phylogenetic Resolution of Deep Eukaryotic and Fungal Relationships Using Highly Conserved Low-Copy Nuclear Genes

PubMed Central

Ren, Ren; Sun, Yazhou; Zhao, Yue; Geiser, David

2016-01-01

Abstract A comprehensive and reliable eukaryotic tree of life is important for many aspects of biological studies from comparative developmental and physiological analyses to translational medicine and agriculture. Both gene-rich and taxon-rich approaches are effective strategies to improve phylogenetic accuracy and are greatly facilitated by marker genes that are universally distributed, well conserved, and orthologous among divergent eukaryotes. In this article, we report the identification of 943 low-copy eukaryotic genes and we show that many of these genes are promising tools in resolving eukaryotic phylogenies, despite the challenges of determining deep eukaryotic relationships. As a case study, we demonstrate that smaller subsets of ∼20 and 52 genes could resolve controversial relationships among widely divergent taxa and provide strong support for deep relationships such as the monophyly and branching order of several eukaryotic supergroups. In addition, the use of these genes resulted in fungal phylogenies that are congruent with previous phylogenomic studies that used much larger datasets, and successfully resolved several difficult relationships (e.g., forming a highly supported clade with Microsporidia, Mitosporidium and Rozella sister to other fungi). We propose that these genes are excellent for both gene-rich and taxon-rich analyses and can be applied at multiple taxonomic levels and facilitate a more complete understanding of the eukaryotic tree of life. PMID:27604879
[Preliminary analysis of retinal gene expression profile of diabetic rat].

PubMed

Mei, Yan; Zhou, Hong-ying; Xiang, Tao; Lu, You-guang; Li, Ai-dong; Tang, En-jie; Yang, Hui-jun

2005-10-01

Establishing the retinal gene expression profiles of non-diabetic rat and diabetic rat and comparing the profiles in order to analyze the possible genes related with diabetic retinopathy. The whole retinal transcriptional fragments of non-diabetic rat and 8-week diabetic rat were obtained by restriction fragments differential display-PCR (RFDD-PCR). Bioinformatic analysis of retinal gene expression was performed using soft wares, including Fragment Analysis. After comparison of the expression profiles, the related gene fragments of diabetic retinopathy were initially selected as the target gene of further approach. A total of 3639 significant fragments were obtained. By means of more than 3-fold contrast of fluorescent intensity as the differential expression standard, the authors got 840 differential fragments, accounting for 23.08% of the expressed numbers and including 5 visual related genes, 13 excitatory neruotransmitter genes and 3 inhibitory neurotransmitter genes. At the 8th week, the expression of Rhodopsin kinase, beta-arrestin, Phosducinìrod photoreceptor cGMP-gated channel and Rpe65 as well as iGlu R1-4 were down-regulated. mGluRs and GABA-Rs were all up-regulated, whereas the expression of GlyR was unchanged. These results prompt again that the changes in retinal nervous layer of rat have occurred at an early stage of diabetes. The genes expression pattern of visual related genes and excitatory and inhibitory neurotransmitters in rat diabetic retina have been involved in neuro-dysfunctions of diabetic retina.
Evaluation of endogenous control gene(s) for gene expression studies in human blood exposed to 60Co γ-rays ex vivo.

PubMed

Vaiphei, S Thangminlal; Keppen, Joshua; Nongrum, Saibadaiahun; Chaubey, R C; Kma, L; Sharan, R N

2015-01-01

In gene expression studies, it is critical to normalize data using a stably expressed endogenous control gene in order to obtain accurate and reliable results. However, we currently do not have a universally applied endogenous control gene for normalization of data for gene expression studies, particularly those involving (60)Co γ-ray-exposed human blood samples. In this study, a comparative assessment of the gene expression of six widely used housekeeping endogenous control genes, namely 18S, ACTB, B2M, GAPDH, MT-ATP6 and CDKN1A, was undertaken for a range of (60)Co γ-ray doses (0.5, 1.0, 2.0 and 4.0 Gy) at 8.4 Gy min(-1) at 0 and 24 h post-irradiation time intervals. Using the NormFinder algorithm, real-time PCR data obtained from six individuals (three males and three females) were analyzed with respect to the threshold cycle (Ct) value and abundance, ΔCt pair-wise comparison, intra- and inter-group variability assessments, etc. GAPDH, either alone or in combination with 18S, was found to be the most suitable endogenous control gene and should be used in gene expression studies, especially those involving qPCR of γ-ray-exposed human blood samples. © The Author 2014. Published by Oxford University Press on behalf of The Japan Radiation Research Society and Japanese Society for Radiation Oncology.
Improving draft genome contiguity with reference-derived in silico mate-pair libraries.

PubMed

Grau, José Horacio; Hackl, Thomas; Koepfli, Klaus-Peter; Hofreiter, Michael

2018-05-01

Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available. In order to improve genome contiguity, we have developed Cross-Species Scaffolding-a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico. We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.
Daddy, where did (PS)I come from?

PubMed

Baymann, F; Brugna, M; Mühlenhoff, U; Nitschke, W

2001-10-30

The reacton centre I (RCI)-type photosystems from plants, cyano-, helio- and green sulphur bacteria are compared and the essential properties of an archetypal RCI are deduced. Species containing RCI-type photosystems most probably cluster together on a common branch of the phylogenetic tree. The predicted branching order is green sulphur, helio- and cyanobacteria. Striking similarities between RCI- and RCII-type photosystems recently became apparent in the three-dimensional structures of photosystem I (PSI), PSII and RCII. The phylogenetic relationship between all presently known photosystems is analysed suggesting (a) RCI as the ancestral photosystem and (b) the descendence of PSII from RCI via gene duplication and gene splitting. An evolutionary model trying to rationalise available data is presented.
Draft Genome Sequence of Pseudomonas sp. EpS/L25, Isolated from the Medicinal Plant Echinacea purpurea and Able To Synthesize Antimicrobial Compounds.

PubMed

Presta, Luana; Bosi, Emanuele; Fondi, Marco; Maida, Isabel; Perrin, Elena; Miceli, Elisangela; Maggini, Valentina; Bogani, Patrizia; Firenzuoli, Fabio; Di Pilato, Vincenzo; Rossolini, Gian Maria; Mengoni, Alessio; Fani, Renato

2016-05-05

We announce here the draft genome sequence of Pseudomonas sp. strain EpS/L25, isolated from the stem/leaves of the medicinal plant Echinacea purpurea This genome will allow for comparative genomics in order to identify genes associated with the production of bioactive compounds and antibiotic resistance. Copyright © 2016 Presta et al.
Predicting response to primary chemotherapy: gene expression profiling of paraffin-embedded core biopsy tissue.

PubMed

Mina, Lida; Soule, Sharon E; Badve, Sunil; Baehner, Fredrick L; Baker, Joffre; Cronin, Maureen; Watson, Drew; Liu, Mei-Lan; Sledge, George W; Shak, Steve; Miller, Kathy D

2007-06-01

Primary chemotherapy provides an ideal opportunity to correlate gene expression with response to treatment. We used paraffin-embedded core biopsies from a completed phase II trial to identify genes that correlate with response to primary chemotherapy. Patients with newly diagnosed stage II or III breast cancer were treated with sequential doxorubicin 75 mg/M2 q2 wks x 3 and docetaxel 40 mg/M2 weekly x 6; treatment order was randomly assigned. Pretreatment core biopsy samples were interrogated for genes that might correlate with pathologic complete response (pCR). In addition to the individual genes, the correlation of the Oncotype DX Recurrence Score with pCR was examined. Of 70 patients enrolled in the parent trial, core biopsies samples with sufficient RNA for gene analyses were available from 45 patients; 9 (20%) had inflammatory breast cancer (IBC). Six (14%) patients achieved a pCR. Twenty-two of the 274 candidate genes assessed correlated with pCR (p < 0.05). Genes correlating with pCR could be grouped into three large clusters: angiogenesis-related genes, proliferation related genes, and invasion-related genes. Expression of estrogen receptor (ER)-related genes and Recurrence Score did not correlate with pCR. In an exploratory analysis we compared gene expression in IBC to non-inflammatory breast cancer; twenty-four (9%) of the genes were differentially expressed (p < 0.05), 5 were upregulated and 19 were downregulated in IBC. Gene expression analysis on core biopsy samples is feasible and identifies candidate genes that correlate with pCR to primary chemotherapy. Gene expression in IBC differs significantly from noninflammatory breast cancer.
Transcriptomic Analysis of Persistent Infection with Foot-and-Mouth Disease Virus in Cattle Suggests Impairment of Apoptosis and Cell-Mediated Immunity in the Nasopharynx

DOE PAGES

Eschbaumer, Michael; Stenfeldt, Carolina; Smoliga, George R.; ...

2016-09-19

In order to investigate the mechanisms of persistent foot-and-mouth disease virus (FMDV) infection in cattle, transcriptome alterations associated with the FMDV carrier state were characterized using a bovine whole-transcriptome microarray. Eighteen cattle (8 vaccinated with a recombinant FMDV A vaccine, 10 non-vaccinated) were challenged with FMDV A 24 Cruzeiro, and the gene expression profiles of nasopharyngeal tissues collected between 21 and 35 days after challenge were compared between 11 persistently infected carriers and 7 non-carriers. Carriers and non-carrierswere further compared to 2 naive animals that had been neither vaccinated nor challenged. At a controlled false-discovery rate of 10% and amore » minimum difference in expression of 50%, 648 genes were differentially expressed between FMDV carriers and non-carriers, and most (467) had higher expression in carriers.Among these, genes associated with cellular proliferation and the immune response–such as chemokines, cytokines and genes regulating T and B cells–were significantly over represented. Differential gene expression was significantly correlated between non-vaccinated and vaccinated animals (biological correlation +0.97), indicating a similar transcriptome profile across these groups. Genes related to prostaglandin E 2 production and the induction of regulatoryT cells were over expressed in carriers. In contrast, tissues from non-carrier animals expressed higher levels of complement regulators and pro-apoptotic genes that could promote virus clearance. Furthermore, based on these findings, we propose a working hypothesis for FMDV persistence in nasopharyngeal tissues of cattle, in which the virus may be maintained by an impairment of apoptosis and the local suppression of cell-mediated antiviral immunity by inducible regulatoryT cells.« less

Transcriptomic Analysis of Persistent Infection with Foot-and-Mouth Disease Virus in Cattle Suggests Impairment of Apoptosis and Cell-Mediated Immunity in the Nasopharynx

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eschbaumer, Michael; Stenfeldt, Carolina; Smoliga, George R.

In order to investigate the mechanisms of persistent foot-and-mouth disease virus (FMDV) infection in cattle, transcriptome alterations associated with the FMDV carrier state were characterized using a bovine whole-transcriptome microarray. Eighteen cattle (8 vaccinated with a recombinant FMDV A vaccine, 10 non-vaccinated) were challenged with FMDV A 24 Cruzeiro, and the gene expression profiles of nasopharyngeal tissues collected between 21 and 35 days after challenge were compared between 11 persistently infected carriers and 7 non-carriers. Carriers and non-carrierswere further compared to 2 naive animals that had been neither vaccinated nor challenged. At a controlled false-discovery rate of 10% and amore » minimum difference in expression of 50%, 648 genes were differentially expressed between FMDV carriers and non-carriers, and most (467) had higher expression in carriers.Among these, genes associated with cellular proliferation and the immune response–such as chemokines, cytokines and genes regulating T and B cells–were significantly over represented. Differential gene expression was significantly correlated between non-vaccinated and vaccinated animals (biological correlation +0.97), indicating a similar transcriptome profile across these groups. Genes related to prostaglandin E 2 production and the induction of regulatoryT cells were over expressed in carriers. In contrast, tissues from non-carrier animals expressed higher levels of complement regulators and pro-apoptotic genes that could promote virus clearance. Furthermore, based on these findings, we propose a working hypothesis for FMDV persistence in nasopharyngeal tissues of cattle, in which the virus may be maintained by an impairment of apoptosis and the local suppression of cell-mediated antiviral immunity by inducible regulatoryT cells.« less
Early Developmental and Evolutionary Origins of Gene Body DNA Methylation Patterns in Mammalian Placentas

PubMed Central

Schroeder, Diane I.; Jayashankar, Kartika; Douglas, Kory C.; Thirkill, Twanda L.; York, Daniel; Dickinson, Pete J.; Williams, Lawrence E.; Samollow, Paul B.; Ross, Pablo J.; Bannasch, Danika L.; Douglas, Gordon C.; LaSalle, Janine M.

2015-01-01

Over the last 20-80 million years the mammalian placenta has taken on a variety of morphologies through both divergent and convergent evolution. Recently we have shown that the human placenta genome has a unique epigenetic pattern of large partially methylated domains (PMDs) and highly methylated domains (HMDs) with gene body DNA methylation positively correlating with level of gene expression. In order to determine the evolutionary conservation of DNA methylation patterns and transcriptional regulatory programs in the placenta, we performed a genome-wide methylome (MethylC-seq) analysis of human, rhesus macaque, squirrel monkey, mouse, dog, horse, and cow placentas as well as opossum extraembryonic membrane. We found that, similar to human placenta, mammalian placentas and opossum extraembryonic membrane have globally lower levels of methylation compared to somatic tissues. Higher relative gene body methylation was the conserved feature across all mammalian placentas, despite differences in PMD/HMDs and absolute methylation levels. Specifically, higher methylation over the bodies of genes involved in mitosis, vesicle-mediated transport, protein phosphorylation, and chromatin modification was observed compared with the rest of the genome. As in human placenta, higher methylation is associated with higher gene expression and is predictive of genic location across species. Analysis of DNA methylation in oocytes and preimplantation embryos shows a conserved pattern of gene body methylation similar to the placenta. Intriguingly, mouse and cow oocytes and mouse early embryos have PMD/HMDs but their placentas do not, suggesting that PMD/HMDs are a feature of early preimplantation methylation patterns that become lost during placental development in some species and following implantation of the embryo. PMID:26241857
High resolution physical mapping of single gene fragments on pachytene chromosome 4 and 7 of Rosa.

PubMed

Kirov, Ilya V; Van Laere, Katrijn; Khrustaleva, Ludmila I

2015-07-02

Rosaceae is a family containing many economically important fruit and ornamental species. Although fluorescence in situ hybridization (FISH)-based physical mapping of plant genomes is a valuable tool for map-based cloning, comparative genomics and evolutionary studies, no studies using high resolution physical mapping have been performed in this family. Previously we proved that physical mapping of single-copy genes as small as 1.1 kb is possible on mitotic metaphase chromosomes of Rosa wichurana using Tyramide-FISH. In this study we aimed to further improve the physical map of Rosa wichurana by applying high resolution FISH to pachytene chromosomes. Using high resolution Tyramide-FISH and multicolor Tyramide-FISH, 7 genes (1.7-3 kb) were successfully mapped on pachytene chromosomes 4 and 7 of Rosa wichurana. Additionally, by using multicolor Tyramide-FISH three closely located genes were simultaneously visualized on chromosome 7. A detailed map of heterochromatine/euchromatine patterns of chromosome 4 and 7 was developed with indication of the physical position of these 7 genes. Comparison of the gene order between Rosa wichurana and Fragaria vesca revealed a poor collinearity for chromosome 7, but a perfect collinearity for chromosome 4. High resolution physical mapping of short probes on pachytene chromosomes of Rosa wichurana was successfully performed for the first time. Application of Tyramide-FISH on pachytene chromosomes allowed the mapping resolution to be increased up to 20 times compared to mitotic metaphase chromosomes. High resolution Tyramide-FISH and multicolor Tyramide-FISH might become useful tools for further physical mapping of single-copy genes and for the integration of physical and genetic maps of Rosa wichurana and other members of the Rosaceae.
Comparative DNA microarray analysis of human monocyte derived dendritic cells and MUTZ-3 cells exposed to the moderate skin sensitizer cinnamaldehyde

DOE Office of Scientific and Technical Information (OSTI.GOV)

Python, Francois; Goebel, Carsten; Aeby, Pierre

2009-09-15

The number of studies involved in the development of in vitro skin sensitization tests has increased since the adoption of the EU 7th amendment to the cosmetics directive proposing to ban animal testing for cosmetic ingredients by 2013. Several studies have recently demonstrated that sensitizers induce a relevant up-regulation of activation markers such as CD86, CD54, IL-8 or IL-1{beta} in human myeloid cell lines (e.g., U937, MUTZ-3, THP-1) or in human peripheral blood monocyte-derived dendritic cells (PBMDCs). The present study aimed at the identification of new dendritic cell activation markers in order to further improve the in vitro evaluation ofmore » the sensitizing potential of chemicals. We have compared the gene expression profiles of PBMDCs and the human cell line MUTZ-3 after a 24-h exposure to the moderate sensitizer cinnamaldehyde. A list of 80 genes modulated in both cell types was obtained and a set of candidate marker genes was selected for further analysis. Cells were exposed to selected sensitizers and non-sensitizers for 24 h and gene expression was analyzed by quantitative real-time reverse transcriptase-polymerase chain reaction. Results indicated that PIR, TRIM16 and two Nrf2-regulated genes, CES1 and NQO1, are modulated by most sensitizers. Up-regulation of these genes could also be observed in our recently published DC-activation test with U937 cells. Due to their role in DC activation, these new genes may help to further refine the in vitro approaches for the screening of the sensitizing properties of a chemical.« less
Elephant Transcriptome Provides Insights into the Evolution of Eutherian Placentation

PubMed Central

Hou, Zhuo-Cheng; Sterner, Kirstin N.; Romero, Roberto; Than, Nandor Gabor; Gonzalez, Juan M.; Weckle, Amy; Xing, Jun; Benirschke, Kurt; Goodman, Morris; Wildman, Derek E.

2012-01-01

The chorioallantoic placenta connects mother and fetus in eutherian pregnancies. In order to understand the evolution of the placenta and provide further understanding of placenta biology, we sequenced the transcriptome of a term placenta of an African elephant (Loxodonta africana) and compared these data with RNA sequence and microarray data from other eutherian placentas including human, mouse, and cow. We characterized the composition of 55,910 expressed sequence tag (i.e., cDNA) contigs using our custom annotation pipeline. A Markov algorithm was used to cluster orthologs of human, mouse, cow, and elephant placenta transcripts. We found 2,963 genes are commonly expressed in the placentas of these eutherian mammals. Gene ontology categories previously suggested to be important for placenta function (e.g., estrogen receptor signaling pathway, cell motion and migration, and adherens junctions) were significantly enriched in these eutherian placenta–expressed genes. Genes duplicated in different lineages and also specifically expressed in the placenta contribute to the great diversity observed in mammalian placenta anatomy. We identified 1,365 human lineage–specific, 1,235 mouse lineage–specific, 436 cow lineage–specific, and 904 elephant-specific placenta-expressed (PE) genes. The most enriched clusters of human-specific PE genes are signal/glycoprotein and immunoglobulin, and humans possess a deeply invasive human hemochorial placenta that comes into direct contact with maternal immune cells. Inference of phylogenetically conserved and derived transcripts demonstrates the power of comparative transcriptomics to trace placenta evolution and variation across mammals and identified candidate genes that may be important in the normal function of the human placenta, and their dysfunction may be related to human pregnancy complications. PMID:22546564
The complete mitochondrial genome of the three-spot seahorse, Hippocampus trimaculatus (Teleostei, Syngnathidae).

PubMed

Chang, Chia-Hao; Shao, Kwang-Tsao; Lin, Yeong-Shin; Liao, Yun-Chih

2013-12-01

The complete mitochondrial genome of the three-spot seahorse was sequenced using a polymerase chain reaction-based method. The total length of mitochondrial DNA is 16,535 bp and includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a control region. The mitochondrial gene order of the three-spot seahorse also conforms to the distinctive vertebrate mitochondrial gene order. The base composition of the genome is A (32.7%), T (29.3%), C (23.4%), and G (14.6%) with an A + T-rich hallmark as that of other vertebrate mitochondrial genomes.
Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family

PubMed Central

Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H

2005-01-01

Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476
Mitochondrial genomes of praying mantises (Dictyoptera, Mantodea): rearrangement, duplication, and reassignment of tRNA genes.

PubMed

Ye, Fei; Lan, Xu-E; Zhu, Wen-Bo; You, Ping

2016-05-09

Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects.
Mitochondrial genomes of praying mantises (Dictyoptera, Mantodea): rearrangement, duplication, and reassignment of tRNA genes

PubMed Central

Ye, Fei; Lan, Xu-e; Zhu, Wen-bo; You, Ping

2016-01-01

Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects. PMID:27157299
Differential transcriptional control of the two tRNA(fMet) genes of Escherichia coli K-12.

PubMed

Nagase, T; Ishii, S; Imamoto, F

1988-07-15

The metZ gene of Escherichia coli, which encodes the tRNA(f1Met), was cloned. Using the nucleotide sequence, in vitro transcription, and S1 nuclease mapping analyses, we identified the promoter region, transcriptional start point, the two tandem tRNA(f1Met) structural genes separated by an intergenic space of 33 bp, and the two Rho-independent transcriptional termination sites, in that order. We compared the promoter region of the metZ gene with that of the metY gene, which encodes the tRNA(f2Met) and is located in the promoter-proximal portion of the nusA operon. A G + C-rich sequence (5'-GCGCATCCAC-3'), similar to the corresponding sequence of the rrn promoters that are under stringent control, was found between the Pribnow box and the transcriptional start point of the metZ promoter, but not in the metY promoter region. We therefore examined the effect of guanosine 3'-diphosphate, 5'-diphosphate (ppGpp), the chemical mediator of stringent control, and found that ppGpp inhibited the transcription of the metZ gene, but not that of the metY gene. These data suggested that the promoters for metZ and metY have different physiological functions and are regulated by different mechanisms.
Chromosomal position effects in chicken lysozyme gene transgenic mice are correlated with suppression of DNase I hypersensitive site formation.

PubMed Central

Huber, M C; Bosch, F X; Sippel, A E; Bonifer, C

1994-01-01

The complete chicken lysozyme gene locus is expressed copy number dependently and at a high level in macrophages of transgenic mice. Gene expression independent of genomic position can only be achieved by the concerted action of all cis regulatory elements located on the lysozyme gene domain. Position independency of expression is lost if one essential cis regulatory region is deleted. Here we compared the DNase I hypersensitive site (DHS) pattern formed on the chromatin of position independently and position dependently expressed transgenes in order to assess the influence of deletions within the gene domain on active chromatin formation. We demonstrate, that in position independently expressed transgene all DHSs are formed with the authentic relative frequency on all genes. This is not the case for position dependently expressed transgenes. Our results show that the formation of a DHS during cellular differentiation does not occur autonomously. In case essential regulatory elements of the chicken lysozyme gene domain are lacking, the efficiency of DHS formation on remaining cis regulatory elements during myeloid differentiation is reduced and influenced by the chromosomal position. Hence, no individual regulatory element on the lysozyme domain is capable of organizing the chromatin structure of the whole locus in a dominant fashion. Images PMID:7937145
Transcriptome Analysis and Discovery of Genes Involved in Immune Pathways from Coelomocytes of Sea Cucumber (Apostichopus japonicus) after Vibrio splendidus Challenge.

PubMed

Gao, Qiong; Liao, Meijie; Wang, Yingeng; Li, Bin; Zhang, Zheng; Rong, Xiaojun; Chen, Guiping; Wang, Lan

2015-07-17

Vibrio splendidus is identified as one of the major pathogenic factors for the skin ulceration syndrome in sea cucumber (Apostichopus japonicus), which has vastly limited the development of the sea cucumber culture industry. In order to screen the immune genes involving Vibrio splendidus challenge in sea cucumber and explore the molecular mechanism of this process, the related transcriptome and gene expression profiling of resistant and susceptible biotypes of sea cucumber with Vibrio splendidus challenge were collected for analysis. A total of 319,455,942 trimmed reads were obtained, which were assembled into 186,658 contigs. After that, 89,891 representative contigs (without isoform) were clustered. The analysis of the gene expression profiling identified 358 differentially expression genes (DEGs) in the bacterial-resistant group, and 102 DEGs in the bacterial-susceptible group, compared with that in control group. According to the reported references and annotation information from BLAST, GO and KEGG, 30 putative bacterial-resistant genes and 19 putative bacterial-susceptible genes were identified from DEGs. The qRT-PCR results were consistent with the RNA-Seq results. Furthermore, many DGEs were involved in immune signaling related pathways, such as Endocytosis, Lysosome, MAPK, Chemokine and the ERBB signaling pathway.
Transcriptome Analysis and Discovery of Genes Involved in Immune Pathways from Coelomocytes of Sea Cucumber (Apostichopus japonicus) after Vibrio splendidus Challenge

PubMed Central

Gao, Qiong; Liao, Meijie; Wang, Yingeng; Li, Bin; Zhang, Zheng; Rong, Xiaojun; Chen, Guiping; Wang, Lan

2015-01-01

Vibrio splendidus is identified as one of the major pathogenic factors for the skin ulceration syndrome in sea cucumber (Apostichopus japonicus), which has vastly limited the development of the sea cucumber culture industry. In order to screen the immune genes involving Vibrio splendidus challenge in sea cucumber and explore the molecular mechanism of this process, the related transcriptome and gene expression profiling of resistant and susceptible biotypes of sea cucumber with Vibrio splendidus challenge were collected for analysis. A total of 319,455,942 trimmed reads were obtained, which were assembled into 186,658 contigs. After that, 89,891 representative contigs (without isoform) were clustered. The analysis of the gene expression profiling identified 358 differentially expression genes (DEGs) in the bacterial-resistant group, and 102 DEGs in the bacterial-susceptible group, compared with that in control group. According to the reported references and annotation information from BLAST, GO and KEGG, 30 putative bacterial-resistant genes and 19 putative bacterial-susceptible genes were identified from DEGs. The qRT-PCR results were consistent with the RNA-Seq results. Furthermore, many DGEs were involved in immune signaling related pathways, such as Endocytosis, Lysosome, MAPK, Chemokine and the ERBB signaling pathway. PMID:26193268
Sequence analysis of three mitochondrial DNA molecules reveals interesting differences among Saccharomyces yeasts

PubMed Central

Langkjær, R. B.; Casaregola, S.; Ussery, D. W.; Gaillardin, C.; Piškur, J.

2003-01-01

The complete sequences of mitochondrial DNA (mtDNA) from the two budding yeasts Saccharomyces castellii and Saccharomyces servazzii, consisting of 25 753 and 30 782 bp, respectively, were analysed and compared to Saccharomyces cerevisiae mtDNA. While some of the traits are very similar among Saccharomyces yeasts, others have highly diverged. The two mtDNAs are much more compact than that of S.cerevisiae and contain fewer introns and intergenic sequences, although they have almost the same coding potential. A few genes contain group I introns, but group II introns, otherwise found in S.cerevisiae mtDNA, are not present. Surprisingly, four genes (ATP6, COX2, COX3 and COB) in the mtDNA of S.servazzii contain, in total, five +1 frameshifts. mtDNAs of S.castellii, S.servazzii and S.cerevisiae contain all genes on the same strand, except for one tRNA gene. On the other hand, the gene order is very different. Several gene rearrangements have taken place upon separation of the Saccharomyces lineages, and even a part of the transcription units have not been preserved. It seems that the mechanism(s) involved in the generation of the rearrangements has had to ensure that all genes stayed encoded by the same DNA strand. PMID:12799436
Comparative sequence analysis of a region on human chromosome 13q14, frequently deleted in B-cell chronic lymphocytic leukemia, and its homologous region on mouse chromosome 14.

PubMed

Kapanadze, B; Makeeva, N; Corcoran, M; Jareborg, N; Hammarsund, M; Baranova, A; Zabarovsky, E; Vorontsova, O; Merup, M; Gahrton, G; Jansson, M; Yankovsky, N; Einhorn, S; Oscier, D; Grandér, D; Sangfelt, O

2000-12-15

Previous studies have indicated the presence of a putative tumor suppressor gene on human chromosome 13q14, commonly deleted in patients with B-cell chronic lymphocytic leukemia (B-CLL). We have recently identified a minimally deleted region encompassing parts of two adjacent genes, termed LEU1 and LEU2 (leukemia-associated genes 1 and 2), and several additional transcripts. In addition, 50 kb centromeric to this region we have identified another gene, LEU5/RFP2. To elucidate further the complex genomic organization of this region, we have identified, mapped, and sequenced the homologous region in the mouse. Fluorescence in situ hybridization analysis demonstrated that the region maps to mouse chromosome 14. The overall organization and gene order in this region were found to be highly conserved in the mouse. Sequence comparison between the human deletion hotspot region and its homologous mouse region revealed a high degree of sequence conservation with an overall score of 74%. However, our data also show that in terms of transcribed sequences, only two of those, human LEU2 and LEU5/RFP2, are clearly conserved, strengthening the case for these genes as putative candidate B-CLL tumor suppressor genes.
Evolution of Olfactory Receptor Genes in Primates Dominated by Birth-and-Death Process

PubMed Central

Dong, Dong; He, Guimei; Zhang, Shuyi

2009-01-01

Olfactory receptor (OR) is a large family of G protein–coupled receptors that can detect odorant in order to generate the sense of smell. They constitute one of the largest multiple gene families in animals including primates. To better understand the variation in odor perception and evolution of OR genes among primates, we computationally identified OR gene repertoires in orangutans, marmosets, and mouse lemurs and investigated the birth-and-death process of OR genes in the primate lineage. The results showed that 1) all the primate species studied have no more than 400 intact OR genes, fewer than rodents and canine; 2) Despite the similar number of OR genes in the genome, the makeup of the OR gene repertoires between different primate species is quite different as they had undergone dramatic birth-and-death evolution with extensive gene losses in the lineages leading to current species; 3) Apes and Old World monkey (OWM) have similar fraction of pseudogenes, whereas New World monkey (NWM) have fewer pseudogenes. To measure the selective pressure that had affected the OR gene repertoires in primates, we compared the ratio of nonsynonymous with synonymous substitution rates by using 70 one-to-one orthologous quintets among five primate species. We found that OR genes showed relaxed selective constraints in apes (humans, chimpanzees, and orangutans) than in OWMs (macaques) and NWMs (marmosets). We concluded that OR gene repertoires in primates have evolved in such a way to adapt to their respective living environments. Differential selective constraints might play important role in the primate OR gene evolution in each primate species. PMID:20333195
Phylogenetic Analysis of Genome Rearrangements among Five Mammalian Orders

PubMed Central

Luo, Haiwei; Arndt, William; Zhang, Yiwei; Shi, Guanqun; Alekseyev, Max; Tang, Jijun; Hughes, Austin L.; Friedman, Robert

2015-01-01

Evolutionary relationships among placental mammalian orders have been controversial. Whole genome sequencing and new computational methods offer opportunities to resolve the relationships among 10 genomes belonging to the mammalian orders Primates, Rodentia, Carnivora, Perissodactyla and Artiodactyla. By application of the double cut and join distance metric, where gene order is the phylogenetic character, we computed genomic distances among the sampled mammalian genomes. With a marsupial outgroup, the gene order tree supported a topology in which Rodentia fell outside the cluster of Primates, Carnivora, Perissodactyla, and Artiodactyla. Results of breakpoint reuse rate and synteny block length analyses were consistent with the prediction of random breakage model, which provided a diagnostic test to support use of gene order as an appropriate phylogenetic character in this study. We the influence of rate differences among lineages and other factors that may contribute to different resolutions of mammalian ordinal relationships by different methods of phylogenetic reconstruction. PMID:22929217
Genetic map of the Bacillus stearothermophilus NUB36 chromosome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Vallier, H.; Welker, N.E.

1990-02-01

A circular genetic map of Bacillus stearothermophilus NUB36 was constructed by transduction with bacteriophage TP-42C and protoplast fusion. Sixty-four genes were tentatively assigned a cognate Bacillus subtilis gene based on growth response to intermediates or end products of metabolism, cross-feeding, accumulation of intermediates, or their relative order in a linkage group. Although the relative position of many genes on the Bacillus subtilis genetic map appears to be similar, some differences were detected. The tentative order of the genes in the Bacillus stearothermophilus aro region is aspB-aroBAFEC-tyra-hisH-(trp), whereas it is aspB-aroE-tyrA-hisH-(trp)-aroHBF in Bacillus subtilis. The aroA, aroC, and aroG genes inmore » Bacillus subtilis are located in another region. The tentative order of genes in the trp operon of Bacillus stearothermophilus is trpFCDABE, whereas it is trpABFCDE in Bacillus subtilis.« less
An unexpectedly large and loosely packed mitochondrial genome in the charophycean green alga Chlorokybus atmophyticus

PubMed Central

Turmel, Monique; Otis, Christian; Lemieux, Claude

2007-01-01

Background The Streptophyta comprises all land plants and six groups of charophycean green algae. The scaly biflagellate Mesostigma viride (Mesostigmatales) and the sarcinoid Chlorokybus atmophyticus (Chlorokybales) represent the earliest diverging lineages of this phylum. In trees based on chloroplast genome data, these two charophycean green algae are nested in the same clade. To validate this relationship and gain insight into the ancestral state of the mitochondrial genome in the Charophyceae, we sequenced the mitochondrial DNA (mtDNA) of Chlorokybus and compared this genome sequence with those of three other charophycean green algae and the bryophytes Marchantia polymorpha and Physcomitrella patens. Results The Chlorokybus genome differs radically from its 42,424-bp Mesostigma counterpart in size, gene order, intron content and density of repeated elements. At 201,763-bp, it is the largest mtDNA yet reported for a green alga. The 70 conserved genes represent 41.4% of the genome sequence and include nad10 and trnL(gag), two genes reported for the first time in a streptophyte mtDNA. At the gene order level, the Chlorokybus genome shares with its Chara, Chaetosphaeridium and bryophyte homologues eight to ten gene clusters including about 20 genes. Notably, some of these clusters exhibit gene linkages not previously found outside the Streptophyta, suggesting that they originated early during streptophyte evolution. In addition to six group I and 14 group II introns, short repeated sequences accounting for 7.5% of the genome were identified. Mitochondrial trees were unable to resolve the correct position of Mesostigma, due to analytical problems arising from accelerated sequence evolution in this lineage. Conclusion The Chlorokybus and Mesostigma mtDNAs exemplify the marked fluidity of the mitochondrial genome in charophycean green algae. The notion that the mitochondrial genome was constrained to remain compact during charophycean evolution is no longer tenable. Our data raise the possibility that the emergence of land plants was not associated with a substantial gain of intergenic sequences by the mitochondrial genome. PMID:17537252
The RAD24 (= Rs1) Gene Product of Saccharomyces cerevisiae Participates in Two Different Pathways of DNA Repair

PubMed Central

Eckardt-Schupp, Friederike; Siede, Wolfram; Game, John C.

1987-01-01

The moderately UV- and X-ray-sensitive mutant of Saccharomyces cerevisiae originally designated rs1 complements all rad and mms mutants available. Therefore, the new nomination rad24-1 according to the RAD nomenclature is suggested. RAD24 maps on chromosome V, close to RAD3 (1.3 cM). In order to associate the RAD24 gene with one of the three repair pathways, double mutants of rad24 and various representative genes of each pathway were constructed. The UV and X-ray sensitivities of the double mutants compared to the single mutants indicate that RAD24 is involved in excision repair of UV damage (RAD3 epistasis group), as well as in recombination repair of UV and X-ray damage (RAD52 epistasis group). Properties of the mutant are discussed which hint at the control of late steps in the pathways. PMID:3549445

[Late-replicating regions in salivary gland polytene chromosomes of Drosophila melanogaster].

PubMed

Kolesnikov, T D; Andreenkova, N G; Beliaeva, E S; Goncharov, F P; Zykova, T Iu; Boldyreva, L V; Pokholkova, g V; Zhimulev, I F

2013-01-01

About 240 specific regions that are replicated at the very end of the S-phase have been identified in D. melanogaster polytene chromosomes. These regions have a repressive chromatine state, low gene density, long intergenic distances and are enriched in tissue specific genes. In polytene chromosomes, about a quarter of these regions have no enough time to complete replication. As a result, underreplication zones represented by fewer DNA copy number, appear. We studied 60 chromosome regions that demonstrated the most pronounced under-replication. By comparing the location of these regions on a molecular map with syntenic blocks found earlier for Drosophila species by von Grotthuss et al., 2010, we have shown that across the genus Drosophila, these regions tend to have conserved gene order. This forces us to assume the existence of evolutionary mechanisms aimed at maintaining the integrity of these regions.
Clericuzio-type poikiloderma with neutropenia syndrome in three sibs with mutations in the C16orf57 gene: delineation of the phenotype.

PubMed

Concolino, D; Roversi, G; Muzzi, G L; Sestito, S; Colombo, E A; Volpi, L; Larizza, L; Strisciuglio, P

2010-10-01

We report on three sibs who have autosomal recessive Clericuzio-type poikiloderma neutropenia (PN) syndrome. Recently, this consanguineous family was reported and shown to be informative in identifying the C16orf57 gene as the causative gene for this syndrome. Here we present the clinical data in detail. PN is a distinct and recognizable entity belonging to the group of poikiloderma syndromes among which Rothmund-Thomson is perhaps the best described and understood. PN is characterized by cutaneous poikiloderma, hyperkeratotic nails, generalized hyperkeratosis on palms and soles, neutropenia, short stature, and recurrent pulmonary infections. In order to delineate the phenotype of this rare genodermatosis, the clinical presentation together with the molecular investigations in our patients are reported and compared to those from the literature. Copyright © 2010 Wiley-Liss, Inc.
From noise to synthetic nucleoli: can synthetic biology achieve new insights?

PubMed

Ciechonska, Marta; Grob, Alice; Isalan, Mark

2016-04-18

Synthetic biology aims to re-organise and control biological components to make functional devices. Along the way, the iterative process of designing and testing gene circuits has the potential to yield many insights into the functioning of the underlying chassis of cells. Thus, synthetic biology is converging with disciplines such as systems biology and even classical cell biology, to give a new level of predictability to gene expression, cell metabolism and cellular signalling networks. This review gives an overview of the contributions that synthetic biology has made in understanding gene expression, in terms of cell heterogeneity (noise), the coupling of growth and energy usage to expression, and spatiotemporal considerations. We mainly compare progress in bacterial and mammalian systems, which have some of the most-developed engineering frameworks. Overall, one view of synthetic biology can be neatly summarised as "creating in order to understand."
Convergent evolution and adaptation of Pseudomonas aeruginosa within patients with cystic fibrosis.

PubMed

Marvig, Rasmus Lykke; Sommer, Lea Mette; Molin, Søren; Johansen, Helle Krogh

2015-01-01

Little is known about how within-host evolution compares between genotypically different strains of the same pathogenic species. We sequenced the whole genomes of 474 longitudinally collected clinical isolates of Pseudomonas aeruginosa sampled from 34 children and young individuals with cystic fibrosis. Our analysis of 36 P. aeruginosa lineages identified convergent molecular evolution in 52 genes. This list of genes suggests a role in host adaptation for remodeling of regulatory networks and central metabolism, acquisition of antibiotic resistance and loss of extracellular virulence factors. Furthermore, we find an ordered succession of mutations in key regulatory networks. Accordingly, mutations in downstream transcriptional regulators were contingent upon mutations in upstream regulators, suggesting that remodeling of regulatory networks might be important in adaptation. The characterization of genes involved in host adaptation may help in predicting bacterial evolution in patients with cystic fibrosis and in the design of future intervention strategies.
Unscrambling butterfly oogenesis

PubMed Central

2013-01-01

Background Butterflies are popular model organisms to study physiological mechanisms underlying variability in oogenesis and egg provisioning in response to environmental conditions. Nothing is known, however, about; the developmental mechanisms governing butterfly oogenesis, how polarity in the oocyte is established, or which particular maternal effect genes regulate early embryogenesis. To gain insights into these developmental mechanisms and to identify the conserved and divergent aspects of butterfly oogenesis, we analysed a de novo ovarian transcriptome of the Speckled Wood butterfly Pararge aegeria (L.), and compared the results with known model organisms such as Drosophila melanogaster and Bombyx mori. Results A total of 17306 contigs were annotated, with 30% possibly novel or highly divergent sequences observed. Pararge aegeria females expressed 74.5% of the genes that are known to be essential for D. melanogaster oogenesis. We discuss the genes involved in all aspects of oogenesis, including vitellogenesis and choriogenesis, plus those implicated in hormonal control of oogenesis and transgenerational hormonal effects in great detail. Compared to other insects, a number of significant differences were observed in; the genes involved in stem cell maintenance and differentiation in the germarium, establishment of oocyte polarity, and in several aspects of maternal regulation of zygotic development. Conclusions This study provides valuable resources to investigate a number of divergent aspects of butterfly oogenesis requiring further research. In order to fully unscramble butterfly oogenesis, we also now also have the resources to investigate expression patterns of oogenesis genes under a range of environmental conditions, and to establish their function. PMID:23622113
Polymorphisms in the leptin gene promoter in Brazilian beef herds.

PubMed

Guimarães, R C; Azevedo, J S N; Corrêa, S C; Campelo, J E G; Barbosa, E M; Gonçalves, E C; Silva Filho, E

2016-12-02

Brazil is the world's largest producer of beef cattle; however, the quality of its herds needs to be improved. The use of molecular markers as auxiliary tools in selecting animals for reproduction with high pattern for beef production would significantly improve the quality of the final beef product in Brazil. The leptin gene has been demonstrated to be an excellent candidate gene for bovine breeding. The objective of this study was to sequence and compare the leptin gene promoter of Brazil's important cattle breeds in order to identify polymorphisms in it. Blood samples of the Nellore, Guzerat, Tabapuã, and Senepol breeds were collected for genomic DNA extraction. The genomic DNA was used as a template for polymerase chain reaction (PCR) to amplify a 1575-bp fragment, which in turn was sequenced, aligned, and compared between animals of different breeds. Twenty-three single nucleotide polymorphic sites, including transitions and transversions, were detected at positions -1457, -1452, -1446, -1397, -1392, -1361, -1238, -963,-901, -578, -516, -483, -478, -470, -432, -430, -292, -282, -272, -211, -202, -170, and -147. Additionally, two insertion sites at positions -680 and -416 and two deletion sites at positions -1255 and -1059 were detected. As the promoter region of the leptin gene has been demonstrated to vary among breeds, these variations must be tested for their use as potential molecular markers for artificial selection of animals for enhanced beef production in different systems of bovine production in Brazil.
The mechanism of improved intracellular organic selenium and glutathione contents in selenium-enriched Candida utilis by acid stress.

PubMed

Zhang, Gao-Chuan; Wang, Da-Hui; Wang, Dong-Hua; Wei, Gong-Yuan

2017-03-01

Batch culture of Candida utilis CCTCC M 209298 for the preparation of selenium (Se)-enriched yeast was carried out under different pH conditions, and maximal intracellular organic Se and glutathione (GSH) contents were obtained in a moderate acid stress environment (pH 3.5). In order to elucidate the physiological mechanism of improved performance of Se-enriched yeast by acid stress, assays of the key enzymes involved in GSH biosynthesis and determinations of energy supply and regeneration were performed. The results indicated that moderate acid stress increased the activity of γ-glutamylcysteine synthetase and the ratios of NADH/NAD + and ATP/ADP, although no significant changes in intracellular pH were observed. In addition, the molecular mechanism of moderate acid stress favoring the improvement of Se-yeast performance was revealed by comparing whole transcriptomes of yeast cells cultured at pH 3.5 and 5.5. Comparative analysis of RNA-Seq data indicated that 882 genes were significantly up-regulated by moderate acid stress. Functional annotation of the up-regulated genes based on gene ontology and the Kyoto Encyclopedia of Genes and Genome (KEGG) pathway showed that these genes are involved in ATP synthesis and sulfur metabolism, including the biosynthesis of methionine, cysteine, and GSH in yeast cells. Increased intracellular ATP supply and more amounts of sulfur-containing substances in turn contributed to Na 2 SeO 3 assimilation and biotransformation, which ultimately improved the performance of the Se-enriched C. utilis.
Comparative sequence analysis of the potato cyst nematode resistance locus H1 reveals a major lack of co-linearity between three haplotypes in potato (Solanum tuberosum ssp.).

PubMed

Finkers-Tomczak, Anna; Bakker, Erin; de Boer, Jan; van der Vossen, Edwin; Achenbach, Ute; Golas, Tomasz; Suryaningrat, Suwardi; Smant, Geert; Bakker, Jaap; Goverse, Aska

2011-02-01

The H1 locus confers resistance to the potato cyst nematode Globodera rostochiensis pathotypes 1 and 4. It is positioned at the distal end of chromosome V of the diploid Solanum tuberosum genotype SH83-92-488 (SH) on an introgression segment derived from S. tuberosum ssp. andigena. Markers from a high-resolution genetic map of the H1 locus (Bakker et al. in Theor Appl Genet 109:146-152, 2004) were used to screen a BAC library to construct a physical map covering a 341-kb region of the resistant haplotype coming from SH. For comparison, physical maps were also generated of the two haplotypes from the diploid susceptible genotype RH89-039-16 (S. tuberosum ssp. tuberosum/S. phureja), spanning syntenic regions of 700 and 319 kb. Gene predictions on the genomic segments resulted in the identification of a large cluster consisting of variable numbers of the CC-NB-LRR type of R genes for each haplotype. Furthermore, the regions were interspersed with numerous transposable elements and genes coding for an extensin-like protein and an amino acid transporter. Comparative analysis revealed a major lack of gene order conservation in the sequences of the three closely related haplotypes. Our data provide insight in the evolutionary mechanisms shaping the H1 locus and will facilitate the map-based cloning of the H1 resistance gene.
Patterns of Piscirickettsia salmonis load in susceptible and resistant families of Salmo salar.

PubMed

Dettleff, Phillip; Bravo, Cristian; Patel, Alok; Martinez, Victor

2015-07-01

The pathogen Piscirickettsia salmonis produces a systemic aggressive infection that involves several organs and tissues in salmonids. In spite of the great economic losses caused by this pathogen in the Atlantic salmon (Salmo salar) industry, very little is known about the resistance mechanisms of the host to this pathogen. In this paper, for the first time, we aimed to identify the bacterial load in head kidney and muscle of Atlantic salmon exhibiting differential familiar mortality. Furthermore, in order to assess the patterns of gene expression of immune related genes in susceptible and resistant families, a set of candidate genes was evaluated using deep sequencing of the transcriptome. The results showed that the bacterial load was significantly lower in resistant fish, when compared with the susceptible individuals. Based on the candidate genes analysis, we infer that the resistant hosts triggered up-regulation of specific genes (such as for example the LysC), which may explain a decrease in the bacterial load in head kidney, while the susceptible fish presented an exacerbated innate response, which is unable to exert an effective response against the bacteria. Interestingly, we found a higher bacterial load in muscle when compared with head kidney. We argue that this is possible due to the availability of an additional source of iron in muscle. Besides, the results show that the resistant fish could not be a likely reservoir of the bacteria. Copyright © 2015 Elsevier Ltd. All rights reserved.
The mitochondrial genome of the deep-sea glass sponge Lophophysema eversa (Porifera, Hexacinellida, Hyalonematidae).

PubMed

Zhang, Yanjie; Sun, Jin; Li, Xinzheng; Qiu, Jian-Wen

2016-01-01

We reported a nearly complete mitochondrial genome (mitogenome) from the glass sponge Lophophysema eversa, the second mitogenome in the order Amphidiscosida and the ninth in the class Hexactinellida. It is 20,651 base pairs in length and contains 39 genes including 13 protein-coding genes, 2 ribosomal RNA subunit genes and 24 tRNA genes. The gene content and order of L. eversa are identical to those of Tabachnickia sp., the other species with a sequenced mitogenome in Amphidiscosida, except with two additional tRNAs and three tRNA translocations. The cob gene has a +1 translational frameshift. These results will contribute to a better understanding of the phylogeny of glass sponges.
Comprehensive molecular screening by next generation sequencing reveals a distinctive mutational profile of KIT/PDGFRA genes and novel genomic alterations: results from a 20-year cohort of patients with GIST from north-western Greece.

PubMed

Mavroeidis, Leonidas; Metaxa-Mariatou, Vassiliki; Papoudou-Bai, Alexandra; Lampraki, Angeliki Maria; Kostadima, Lida; Tsinokou, Ilias; Zarkavelis, George; Papadaki, Alexandra; Petrakis, Dimitrios; Gκoura, Stefania; Kampletsas, Eleftherios; Nasioulas, George; Batistatou, Anna; Pentheroudakis, George

2018-01-01

Gastrointestinal stromal tumours (GIST) are mesenchymal neoplasms that usually carry an activating mutation in KIT or platelet-derived growth factor receptor alpha ( PDGFRA ) genes with predictive and prognostic significance. We investigated the extended mutational status of GIST in a patient population of north-western Greece in order to look at geopraphic/genotypic distinctive traits. Clinicopathological and molecular data of 38 patients diagnosed from 1996 to 2016 with GIST in the region of Epirus in Greece were retrospectively assessed. Formalin-fixed paraffin-embedded tumours were successfully analysed for mutations in 54 genes with oncogenic potential. Next generation sequencing was conducted by using the Ion AmpliSeqCancer Hotspot Panel V.2 for DNA analysis (Thermofisher Scientific). Among 38 tumours, 24 (63.16%) and seven (18.42%) of the tumours harboured mutations in the KIT and PDGFRA genes, respectively, while seven (18.42%) tumours were negative for either KIT or PDGFRA mutation. No mutations were detected in five (13.16%) cases. Concomitant mutations of BRAF and fibroblast growth factor receptor 3 ( FGFR3 ) genes were observed in two patients with KIT gene mutation. Two patients with KIT / PDGFRA wild-type GIST had mutations in either KRAS or phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha ( PIK3CA ) genes. There was no significant survival difference regarding the exonic site of mutation in either KIT or PDGFRA gene. The presence of a mutation in pathway effectors downstream of KIT or PDGFRA , such as BRAF , KRAS or PIK3CA , was associated with poor prognosis. Adverse prognosticators were also high mitotic index and the advanced disease status at diagnosis. We report comparable incidence of KIT and PDGFRA mutation in patients with GIST from north-western Greece as compared with cohorts from other regions. Interestingly, we identified rare mutations on RAS , BRAF and PIK3CA genes in patients with poor prognosis.
A comparative study of RNA-Seq and microarray data analysis on the two examples of rectal-cancer patients and Burkitt Lymphoma cells.

PubMed

Wolff, Alexander; Bayerlová, Michaela; Gaedcke, Jochen; Kube, Dieter; Beißbarth, Tim

2018-01-01

Pipeline comparisons for gene expression data are highly valuable for applied real data analyses, as they enable the selection of suitable analysis strategies for the dataset at hand. Such pipelines for RNA-Seq data should include mapping of reads, counting and differential gene expression analysis or preprocessing, normalization and differential gene expression in case of microarray analysis, in order to give a global insight into pipeline performances. Four commonly used RNA-Seq pipelines (STAR/HTSeq-Count/edgeR, STAR/RSEM/edgeR, Sailfish/edgeR, TopHat2/Cufflinks/CuffDiff)) were investigated on multiple levels (alignment and counting) and cross-compared with the microarray counterpart on the level of gene expression and gene ontology enrichment. For these comparisons we generated two matched microarray and RNA-Seq datasets: Burkitt Lymphoma cell line data and rectal cancer patient data. The overall mapping rate of STAR was 98.98% for the cell line dataset and 98.49% for the patient dataset. Tophat's overall mapping rate was 97.02% and 96.73%, respectively, while Sailfish had only an overall mapping rate of 84.81% and 54.44%. The correlation of gene expression in microarray and RNA-Seq data was moderately worse for the patient dataset (ρ = 0.67-0.69) than for the cell line dataset (ρ = 0.87-0.88). An exception were the correlation results of Cufflinks, which were substantially lower (ρ = 0.21-0.29 and 0.34-0.53). For both datasets we identified very low numbers of differentially expressed genes using the microarray platform. For RNA-Seq we checked the agreement of differentially expressed genes identified in the different pipelines and of GO-term enrichment results. In conclusion the combination of STAR aligner with HTSeq-Count followed by STAR aligner with RSEM and Sailfish generated differentially expressed genes best suited for the dataset at hand and in agreement with most of the other transcriptomics pipelines.
Understanding genetic regulatory networks

NASA Astrophysics Data System (ADS)

Kauffman, Stuart

2003-04-01

Random Boolean networks (RBM) were introduced about 35 years ago as first crude models of genetic regulatory networks. RBNs are comprised of N on-off genes, connected by a randomly assigned regulatory wiring diagram where each gene has K inputs, and each gene is controlled by a randomly assigned Boolean function. This procedure samples at random from the ensemble of all possible NK Boolean networks. The central ideas are to study the typical, or generic properties of this ensemble, and see 1) whether characteristic differences appear as K and biases in Boolean functions are introducted, and 2) whether a subclass of this ensemble has properties matching real cells. Such networks behave in an ordered or a chaotic regime, with a phase transition, "the edge of chaos" between the two regimes. Networks with continuous variables exhibit the same two regimes. Substantial evidence suggests that real cells are in the ordered regime. A key concept is that of an attractor. This is a reentrant trajectory of states of the network, called a state cycle. The central biological interpretation is that cell types are attractors. A number of properties differentiate the ordered and chaotic regimes. These include the size and number of attractors, the existence in the ordered regime of a percolating "sea" of genes frozen in the on or off state, with a remainder of isolated twinkling islands of genes, a power law distribution of avalanches of gene activity changes following perturbation to a single gene in the ordered regime versus a similar power law distribution plus a spike of enormous avalanches of gene changes in the chaotic regime, and the existence of branching pathway of "differentiation" between attractors induced by perturbations in the ordered regime. Noise is serious issue, since noise disrupts attractors. But numerical evidence suggests that attractors can be made very stable to noise, and meanwhile, metaplasias may be a biological manifestation of noise. As we learn more about the wiring diagram and constraints on rules controlling real genes, we can build refined ensembles reflecting these properties, study the generic properties of the refined ensembles, and hope to gain insight into the dynamics of real cells.
Complete Genome Analysis of Thermus parvatiensis and Comparative Genomics of Thermus spp. Provide Insights into Genetic Variability and Evolution of Natural Competence as Strategic Survival Attributes

PubMed Central

Tripathi, Charu; Mishra, Harshita; Khurana, Himani; Dwivedi, Vatsala; Kamra, Komal; Negi, Ram K.; Lal, Rup

2017-01-01

Thermophilic environments represent an interesting niche. Among thermophiles, the genus Thermus is among the most studied genera. In this study, we have sequenced the genome of Thermus parvatiensis strain RL, a thermophile isolated from Himalayan hot water springs (temperature >96°C) using PacBio RSII SMRT technique. The small genome (2.01 Mbp) comprises a chromosome (1.87 Mbp) and a plasmid (143 Kbp), designated in this study as pTP143. Annotation revealed a high number of repair genes, a squeezed genome but containing highly plastic plasmid with transposases, integrases, mobile elements and hypothetical proteins (44%). We performed a comparative genomic study of the group Thermus with an aim of analysing the phylogenetic relatedness as well as niche specific attributes prevalent among the group. We compared the reference genome RL with 16 Thermus genomes to assess their phylogenetic relationships based on 16S rRNA gene sequences, average nucleotide identity (ANI), conserved marker genes (31 and 400), pan genome and tetranucleotide frequency. The core genome of the analyzed genomes contained 1,177 core genes and many singleton genes were detected in individual genomes, reflecting a conserved core but adaptive pan repertoire. We demonstrated the presence of metagenomic islands (chromosome:5, plasmid:5) by recruiting raw metagenomic data (from the same niche) against the genomic replicons of T. parvatiensis. We also dissected the CRISPR loci wide all genomes and found widespread presence of this system across Thermus genomes. Additionally, we performed a comparative analysis of competence loci wide Thermus genomes and found evidence for recent horizontal acquisition of the locus and continued dispersal among members reflecting that natural competence is a beneficial survival trait among Thermus members and its acquisition depicts unending evolution in order to accomplish optimal fitness. PMID:28798737
Comparative mapping in the Fagaceae and beyond with EST-SSRs

PubMed Central

2012-01-01

Background Genetic markers and linkage mapping are basic prerequisites for comparative genetic analyses, QTL detection and map-based cloning. A large number of mapping populations have been developed for oak, but few gene-based markers are available for constructing integrated genetic linkage maps and comparing gene order and QTL location across related species. Results We developed a set of 573 expressed sequence tag-derived simple sequence repeats (EST-SSRs) and located 397 markers (EST-SSRs and genomic SSRs) on the 12 oak chromosomes (2n = 2x = 24) on the basis of Mendelian segregation patterns in 5 full-sib mapping pedigrees of two species: Quercus robur (pedunculate oak) and Quercus petraea (sessile oak). Consensus maps for the two species were constructed and aligned. They showed a high degree of macrosynteny between these two sympatric European oaks. We assessed the transferability of EST-SSRs to other Fagaceae genera and a subset of these markers was mapped in Castanea sativa, the European chestnut. Reasonably high levels of macrosynteny were observed between oak and chestnut. We also obtained diversity statistics for a subset of EST-SSRs, to support further population genetic analyses with gene-based markers. Finally, based on the orthologous relationships between the oak, Arabidopsis, grape, poplar, Medicago, and soybean genomes and the paralogous relationships between the 12 oak chromosomes, we propose an evolutionary scenario of the 12 oak chromosomes from the eudicot ancestral karyotype. Conclusions This study provides map locations for a large set of EST-SSRs in two oak species of recognized biological importance in natural ecosystems. This first step toward the construction of a gene-based linkage map will facilitate the assignment of future genome scaffolds to pseudo-chromosomes. This study also provides an indication of the potential utility of new gene-based markers for population genetics and comparative mapping within and beyond the Fagaceae. PMID:22931513
New genes in the evolution of the neural crest differentiation program

PubMed Central

2007-01-01

Background Development of the vertebrate head depends on the multipotency and migratory behavior of neural crest derivatives. This cell population is considered a vertebrate innovation and, accordingly, chordate ancestors lacked neural crest counterparts. The identification of neural crest specification genes expressed in the neural plate of basal chordates, in addition to the discovery of pigmented migratory cells in ascidians, has challenged this hypothesis. These new findings revive the debate on what is new and what is ancient in the genetic program that controls neural crest formation. Results To determine the origin of neural crest genes, we analyzed Phenotype Ontology annotations to select genes that control the development of this tissue. Using a sequential blast pipeline, we phylogenetically classified these genes, as well as those associated with other tissues, in order to define tissue-specific profiles of gene emergence. Of neural crest genes, 9% are vertebrate innovations. Our comparative analyses show that, among different tissues, the neural crest exhibits a particularly high rate of gene emergence during vertebrate evolution. A remarkable proportion of the new neural crest genes encode soluble ligands that control neural crest precursor specification into each cell lineage, including pigmented, neural, glial, and skeletal derivatives. Conclusion We propose that the evolution of the neural crest is linked not only to the recruitment of ancestral regulatory genes but also to the emergence of signaling peptides that control the increasingly complex lineage diversification of this plastic cell population. PMID:17352807
Gene expression profiles in primary pancreatic tumors and metastatic lesions of Ela-c-myc transgenic mice.

PubMed

Thakur, Archana; Bollig, Aliccia; Wu, Jiusheng; Liao, Dezhong J

2008-01-24

Pancreatic carcinoma usually is a fatal disease with no cure, mainly due to its invasion and metastasis prior to diagnosis. We analyzed the gene expression profiles of paired primary pancreatic tumors and metastatic lesions from Ela-c-myc transgenic mice in order to identify genes that may be involved in the pancreatic cancer progression. Differentially expressed selected genes were verified by semi-quantitative and quantitative RT-PCR. To further evaluate the relevance of some of the selected differentially expressed genes, we investigated their expression pattern in human pancreatic cancer cell lines with high and low metastatic potentials. Data indicate that genes involved in posttranscriptional regulation were a major functional category of upregulated genes in both primary pancreatic tumors (PT) and liver metastatic lesions (LM) compared to normal pancreas (NP). In particular, differential expression for splicing factors, RNA binding/pre-mRNA processing factors and spliceosome related genes were observed, indicating that RNA processing and editing related events may play critical roles in pancreatic tumor development and progression. High expression of insulin growth factor binding protein-1 (Igfbp1) and Serine proteinase inhibitor A1 (Serpina1), and low levels or absence of Wt1 gene expression were exclusive to liver metastatic lesion samples. We identified Igfbp1, Serpina1 and Wt1 genes that are likely to be clinically useful biomarkers for prognostic or therapeutic purposes in metastatic pancreatic cancer, particularly in pancreatic cancer where c-Myc is overexpressed.
The pig X and Y Chromosomes: structure, sequence, and evolution

PubMed Central

Skinner, Benjamin M.; Sargent, Carole A.; Churcher, Carol; Hunt, Toby; Herrero, Javier; Loveland, Jane E.; Dunn, Matt; Louzada, Sandra; Fu, Beiyuan; Chow, William; Gilbert, James; Austin-Guest, Siobhan; Beal, Kathryn; Carvalho-Silva, Denise; Cheng, William; Gordon, Daria; Grafham, Darren; Hardy, Matt; Harley, Jo; Hauser, Heidi; Howden, Philip; Howe, Kerstin; Lachani, Kim; Ellis, Peter J.I.; Kelly, Daniel; Kerry, Giselle; Kerwin, James; Ng, Bee Ling; Threadgold, Glen; Wileman, Thomas; Wood, Jonathan M.D.; Yang, Fengtang; Harrow, Jen; Affara, Nabeel A.; Tyler-Smith, Chris

2016-01-01

We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which are protein coding. Gene order closely matches that found in primates (including humans) and carnivores (including cats and dogs), which is inferred to be ancestral. Nevertheless, several protein-coding genes present on the human X Chromosome were absent from the pig, and 38 pig-specific X-chromosomal genes were annotated, 22 of which were olfactory receptors. The pig Y-specific Chromosome sequence generated here comprises 30 megabases (Mb). A 15-Mb subset of this sequence was assembled, revealing two clusters of male-specific low copy number genes, separated by an ampliconic region including the HSFY gene family, which together make up most of the short arm. Both clusters contain palindromes with high sequence identity, presumably maintained by gene conversion. Many of the ancestral X-related genes previously reported in at least one mammalian Y Chromosome are represented either as active genes or partial sequences. This sequencing project has allowed us to identify genes—both single copy and amplified—on the pig Y Chromosome, to compare the pig X and Y Chromosomes for homologous sequences, and thereby to reveal mechanisms underlying pig X and Y Chromosome evolution. PMID:26560630
CCDC22 gene polymorphism is associated with advanced stages of endometriosis in a sample of Brazilian women.

PubMed

de Oliveira Francisco, Daniela; de Paula Andres, Marina; Gueuvoghlanian-Silva, Bárbara Yasmim; Podgaec, Sergio; Fridman, Cintia

2017-07-01

Based on the assumption that genetic factors are involved in the etiology of endometriosis, this study aimed to investigate the possibility of rs498679 (TLR4 gene), rs1799964 (TNF-α gene), rs3024496 (IL-10 gene), and rs2294021 (CCDC22 gene) polymorphisms being associated with the occurrence of this disease in a sample of Brazilian women. We conducted a case-control study with 100 women with histological confirmation of endometriosis (endometriosis group) and 100 women submitted to laparoscopy for benign disorders, in which the absence of endometriosis was confirmed (control group). All samples were genotyped by real-time PCR technique for rs498679, rs1799964, rs3024496, and rs2294021 polymorphisms. No significant difference was observed in genotypic or allelic frequencies between control and endometriosis groups for rs498679 (TLR4 gene), rs1799964 (TNF-α gene), rs3024496 (IL-10 gene), neither when comparing endometriosis subgroups (I-II versus III-IV). On the other hand, significant difference between stages I-II and III-IV of the disease was found in genotypic and allelic frequencies for the rs2294021 (CCDC22 gene) SNP (p = 0.048 and p = 0.017, respectively). Our results suggest that the rs2294021 (CCDC22 gene) polymorphism could be associated with increased susceptibility to endometriosis in Brazilian women when the allele C is present. In order to clarify this result, further studies should be conducted on a larger population.
Short and long-term genome stability analysis of prokaryotic genomes.

PubMed

Brilli, Matteo; Liò, Pietro; Lacroix, Vincent; Sagot, Marie-France

2013-05-08

Gene organization dynamics is actively studied because it provides useful evolutionary information, makes functional annotation easier and often enables to characterize pathogens. There is therefore a strong interest in understanding the variability of this trait and the possible correlations with life-style. Two kinds of events affect genome organization: on one hand translocations and recombinations change the relative position of genes shared by two genomes (i.e. the backbone gene order); on the other, insertions and deletions leave the backbone gene order unchanged but they alter the gene neighborhoods by breaking the syntenic regions. A complete picture about genome organization evolution therefore requires to account for both kinds of events. We developed an approach where we model chromosomes as graphs on which we compute different stability estimators; we consider genome rearrangements as well as the effect of gene insertions and deletions. In a first part of the paper, we fit a measure of backbone gene order conservation (hereinafter called backbone stability) against phylogenetic distance for over 3000 genome comparisons, improving existing models for the divergence in time of backbone stability. Intra- and inter-specific comparisons were treated separately to focus on different time-scales. The use of multiple genomes of a same species allowed to identify genomes with diverging gene order with respect to their conspecific. The inter-species analysis indicates that pathogens are more often unstable with respect to non-pathogens. In a second part of the text, we show that in pathogens, gene content dynamics (insertions and deletions) have a much more dramatic effect on genome organization stability than backbone rearrangements. In this work, we studied genome organization divergence taking into account the contribution of both genome order rearrangements and genome content dynamics. By studying species with multiple sequenced genomes available, we were able to explore genome organization stability at different time-scales and to find significant differences for pathogen and non-pathogen species. The output of our framework also allows to identify the conserved gene clusters and/or partial occurrences thereof, making possible to explore how gene clusters assembled during evolution.

Gene expression of stem cells at different stages of ontological human development.

PubMed

Allegra, Adolfo; Altomare, Roberta; Curcio, Patrizia; Santoro, Alessandra; Lo Monte, Attilio I; Mazzola, Sergio; Marino, Angelo

2013-10-01

To compare multipotent mesenchymal stem cells (MSCs) obtained from chorionic villi (CV), amniotic fluid (AF) and placenta, with regard to their phenotype and gene expression, in order to understand if MSCs derived from different extra-embryonic tissues, at different stages of human ontological development, present distinct stemness characteristics. MSCs obtained from 30 samples of CV, 30 of AF and 10 placentas (obtained from elective caesarean sections) were compared. MSCs at second confluence cultures were characterized by immunophenotypic analysis with flow cytometry using FACS CANTO II. The expression of the genes Oct-4 (Octamer-binding transcription factor 4, also known as POU5F1), Sox-2 (SRY box-containing factor 2), Nanog, Rex-1 (Zfp-42) and Pax-6 (Paired Box Protein-6), was analyzed. Real-time quantitative PCR was performed by ABI Prism 7700, after RNA isolation and retro-transcription in cDNA. Statistical analysis was performed using non-parametric test Kruskal-Wallis (XLSTAT 2011) and confirmed by REST software, to estimate fold changes between samples. Each gene was defined differentially expressed if p-value was <0.05. Cells from all samples were negative for haematopoietic antigens CD45, CD34, CD117 and CD33 and positive for the typical MSCs antigens CD13, CD73 and CD90. Nevertheless, MSCs from AF and placentas showed different fluorescence intensity, reflecting the heterogeneity of these tissues. The gene expression of OCT-4, SOX-2, NANOG was not significantly different among the three groups. In AF, REX-1 and PAX-6 showed a higher expression in comparison to CV. MSCs of different extra-embryonic tissues showed no differences in immunophenotype when collected from second confluence cultures. The expression of OCT-4, NANOG and SOX-2 was not significantly different, demonstrating that all fetal sources are suitable for obtaining MSCs. These results open new possibilities for the clinical use of MSCs derived from easily accessible sources, in order to develop new protocols for clinical and experimental research. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Extensive Conserved Synteny of Genes between the Karyotypes of Manduca sexta and Bombyx mori Revealed by BAC-FISH Mapping

PubMed Central

Tanaka-Okuyama, Makiko; Shibata, Fukashi; Yoshido, Atsuo; Marec, František; Wu, Chengcang; Zhang, Hongbin; Goldsmith, Marian R.

2009-01-01

Background Genome sequencing projects have been completed for several species representing four highly diverged holometabolous insect orders, Diptera, Hymenoptera, Coleoptera, and Lepidoptera. The striking evolutionary diversity of insects argues a need for efficient methods to apply genome information from such models to genetically uncharacterized species. Constructing conserved synteny maps plays a crucial role in this task. Here, we demonstrate the use of fluorescence in situ hybridization with bacterial artificial chromosome probes as a powerful tool for physical mapping of genes and comparative genome analysis in Lepidoptera, which have numerous and morphologically uniform holokinetic chromosomes. Methodology/Principal Findings We isolated 214 clones containing 159 orthologs of well conserved single-copy genes of a sequenced lepidopteran model, the silkworm, Bombyx mori, from a BAC library of a sphingid with an unexplored genome, the tobacco hornworm, Manduca sexta. We then constructed a BAC-FISH karyotype identifying all 28 chromosomes of M. sexta by mapping 124 loci using the corresponding BAC clones. BAC probes from three M. sexta chromosomes also generated clear signals on the corresponding chromosomes of the convolvulus hawk moth, Agrius convolvuli, which belongs to the same subfamily, Sphinginae, as M. sexta. Conclusions/Significance Comparison of the M. sexta BAC physical map with the linkage map and genome sequence of B. mori pointed to extensive conserved synteny including conserved gene order in most chromosomes. Only a few rearrangements, including three inversions, three translocations, and two fission/fusion events were estimated to have occurred after the divergence of Bombycidae and Sphingidae. These results add to accumulating evidence for the stability of lepidopteran genomes. Generating signals on A. convolvuli chromosomes using heterologous M. sexta probes demonstrated that BAC-FISH with orthologous sequences can be used for karyotyping a wide range of related and genetically uncharacterized species, significantly extending the ability to develop synteny maps for comparative and functional genomics. PMID:19829706
Microbial Community Response of an Organohalide Respiring Enrichment Culture to Permanganate Oxidation.

PubMed

Sutton, Nora B; Atashgahi, Siavash; Saccenti, Edoardo; Grotenhuis, Tim; Smidt, Hauke; Rijnaarts, Huub H M

2015-01-01

While in situ chemical oxidation is often used to remediate tetrachloroethene (PCE) contaminated locations, very little is known about its influence on microbial composition and organohalide respiration (OHR) activity. Here, we investigate the impact of oxidation with permanganate on OHR rates, the abundance of organohalide respiring bacteria (OHRB) and reductive dehalogenase (rdh) genes using quantitative PCR, and microbial community composition through sequencing of 16S rRNA genes. A PCE degrading enrichment was repeatedly treated with low (25 μmol), medium (50 μmol), or high (100 μmol) permanganate doses, or no oxidant treatment (biotic control). Low and medium treatments led to higher OHR rates and enrichment of several OHRB and rdh genes, as compared to the biotic control. Improved degradation rates can be attributed to enrichment of (1) OHRB able to also utilize Mn oxides as a terminal electron acceptor and (2) non-dechlorinating community members of the Clostridiales and Deltaproteobacteria possibly supporting OHRB by providing essential co-factors. In contrast, high permanganate treatment disrupted dechlorination beyond cis-dichloroethene and caused at least a 2-4 orders of magnitude reduction in the abundance of all measured OHRB and rdh genes, as compared to the biotic control. High permanganate treatments resulted in a notably divergent microbial community, with increased abundances of organisms affiliated with Campylobacterales and Oceanospirillales capable of dissimilatory Mn reduction, and decreased abundance of presumed supporters of OHRB. Although OTUs classified within the OHR-supportive order Clostridiales and OHRB increased in abundance over the course of 213 days following the final 100 μmol permanganate treatment, only limited regeneration of PCE dechlorination was observed in one of three microcosms, suggesting strong chemical oxidation treatments can irreversibly disrupt OHR. Overall, this detailed investigation into dose-dependent changes of microbial composition and activity due to permanganate treatment provides insight into the mechanisms of OHR stimulation or disruption upon chemical oxidation.
Microbial Community Response of an Organohalide Respiring Enrichment Culture to Permanganate Oxidation

PubMed Central

Sutton, Nora B.; Atashgahi, Siavash; Saccenti, Edoardo; Grotenhuis, Tim; Smidt, Hauke; Rijnaarts, Huub H. M.

2015-01-01

While in situ chemical oxidation is often used to remediate tetrachloroethene (PCE) contaminated locations, very little is known about its influence on microbial composition and organohalide respiration (OHR) activity. Here, we investigate the impact of oxidation with permanganate on OHR rates, the abundance of organohalide respiring bacteria (OHRB) and reductive dehalogenase (rdh) genes using quantitative PCR, and microbial community composition through sequencing of 16S rRNA genes. A PCE degrading enrichment was repeatedly treated with low (25 μmol), medium (50 μmol), or high (100 μmol) permanganate doses, or no oxidant treatment (biotic control). Low and medium treatments led to higher OHR rates and enrichment of several OHRB and rdh genes, as compared to the biotic control. Improved degradation rates can be attributed to enrichment of (1) OHRB able to also utilize Mn oxides as a terminal electron acceptor and (2) non-dechlorinating community members of the Clostridiales and Deltaproteobacteria possibly supporting OHRB by providing essential co-factors. In contrast, high permanganate treatment disrupted dechlorination beyond cis-dichloroethene and caused at least a 2–4 orders of magnitude reduction in the abundance of all measured OHRB and rdh genes, as compared to the biotic control. High permanganate treatments resulted in a notably divergent microbial community, with increased abundances of organisms affiliated with Campylobacterales and Oceanospirillales capable of dissimilatory Mn reduction, and decreased abundance of presumed supporters of OHRB. Although OTUs classified within the OHR-supportive order Clostridiales and OHRB increased in abundance over the course of 213 days following the final 100 μmol permanganate treatment, only limited regeneration of PCE dechlorination was observed in one of three microcosms, suggesting strong chemical oxidation treatments can irreversibly disrupt OHR. Overall, this detailed investigation into dose-dependent changes of microbial composition and activity due to permanganate treatment provides insight into the mechanisms of OHR stimulation or disruption upon chemical oxidation. PMID:26244346
Degradation of endogenous and exogenous genes of genetically modified rice with Cry1Ab during food processing.

PubMed

Zhang, Wei; Xing, Fuguo; Selvaraj, Jonathan Nimal; Liu, Yang

2014-05-01

In order to assess the degradation of endogenous and exogenous genes during food processing, genetically modified rice with Cry1Ab was used as raw material to produce 4 processed foods: steamed rice, rice noodles, rice crackers, and sweet rice wine. The results showed various processing procedures caused different degrees of degradation of both endogenous and exogenous genes. During the processing of steamed rice and rice noodles, the procedures were so mild that only genes larger than 1500 bp were degraded, and no degradation of NOS terminator and Hpt gene was detected. For rice crackers, frying was the most severe procedure, followed by microwaving, baking, boiling, 1st drying, and 2nd drying. For sweet rice wine, fermentation had more impact on degradation of genes than the other processing procedures. All procedures in this study did not lead to degradation of genes to below 200 bp, except for NOS terminator. In the case of stability of the genes studied during processing of rice crackers and sweet rice wine, SPS gene was the most, followed by the Cry1Ab gene, Hpt gene, Pubi promoter, and NOS terminator. In our study, we gained some information about the degradation of endogenous and exogenous genes during 4 foods processing, compared the different stabilities between endogenous and exogenous genes, and analyzed different effects of procedure on degradation of genes. In addition, the fragments of endogenous and exogenous genes about 200 bp could be detected in final products, except NOS terminator. As a result, we provided some base information about risk assessment of genetically modified (GM) food and appropriate length of fragment to detect GM component in processed foods. © 2014 Institute of Food Technologists®
A comparative cDNA microarray analysis reveals a spectrum of genes regulated by Pax6 in mouse lens

PubMed Central

Chauhan, Bharesh K.; Reed, Nathan A.; Yang, Ying; Čermák, Lukáš; Reneker, Lixing; Duncan, Melinda K.; Cvekl, Aleš

2007-01-01

Background Pax6 is a transcription factor that is required for induction, growth, and maintenance of the lens; however, few direct target genes of Pax6 are known. Results In this report, we describe the results of a cDNA microarray analysis of lens transcripts from transgenic mice over-expressing Pax6 in lens fibre cells in order to narrow the field of potential direct Pax6 target genes. This study revealed that the transcript levels were significantly altered for 508 of the 9700 genes analysed, including five genes encoding the cell adhesion molecules β1-integrin, JAM1, L1 CAM, NCAM-140 and neogenin. Notably, comparisons between the genes differentially expressed in Pax6 heterozygous and Pax6 over-expressing lenses identified 13 common genes, including paralemmin, GDIβ, ATF1, Hrp12 and Brg1. Immunohistochemistry and Western blotting demonstrated that Brg1 is expressed in the embryonic and neonatal (2-week-old) but not in 14-week adult lenses, and confirmed altered expression in transgenic lenses over-expressing Pax6. Furthermore, EMSA demonstrated that the BRG1 promoter contains Pax6 binding sites, further supporting the proposition that it is directly regulated by Pax6. Conclusions These results provide a list of genes with possible roles in lens biology and cataracts that are directly or indirectly regulated by Pax6. PMID:12485166
Limitations of cytochrome oxidase I for the barcoding of Neritidae (Mollusca: Gastropoda) as revealed by Bayesian analysis.

PubMed

Chee, S Y

2015-05-25

The mitochondrial DNA (mtDNA) cytochrome oxidase I (COI) gene has been universally and successfully utilized as a barcoding gene, mainly because it can be amplified easily, applied across a wide range of taxa, and results can be obtained cheaply and quickly. However, in rare cases, the gene can fail to distinguish between species, particularly when exposed to highly sensitive methods of data analysis, such as the Bayesian method, or when taxa have undergone introgressive hybridization, over-splitting, or incomplete lineage sorting. Such cases require the use of alternative markers, and nuclear DNA markers are commonly used. In this study, a dendrogram produced by Bayesian analysis of an mtDNA COI dataset was compared with that of a nuclear DNA ATPS-α dataset, in order to evaluate the efficiency of COI in barcoding Malaysian nerites (Neritidae). In the COI dendrogram, most of the species were in individual clusters, except for two species: Nerita chamaeleon and N. histrio. These two species were placed in the same subcluster, whereas in the ATPS-α dendrogram they were in their own subclusters. Analysis of the ATPS-α gene also placed the two genera of nerites (Nerita and Neritina) in separate clusters, whereas COI gene analysis placed both genera in the same cluster. Therefore, in the case of the Neritidae, the ATPS-α gene is a better barcoding gene than the COI gene.
A 16-Gene Signature Distinguishes Anaplastic Astrocytoma from Glioblastoma

PubMed Central

Rao, Soumya Alige Mahabala; Srinivasan, Sujaya; Patric, Irene Rosita Pia; Hegde, Alangar Sathyaranjandas; Chandramouli, Bangalore Ashwathnarayanara; Arimappamagan, Arivazhagan; Santosh, Vani; Kondaiah, Paturu; Rao, Manchanahalli R. Sathyanarayana; Somasundaram, Kumaravel

2014-01-01

Anaplastic astrocytoma (AA; Grade III) and glioblastoma (GBM; Grade IV) are diffusely infiltrating tumors and are called malignant astrocytomas. The treatment regimen and prognosis are distinctly different between anaplastic astrocytoma and glioblastoma patients. Although histopathology based current grading system is well accepted and largely reproducible, intratumoral histologic variations often lead to difficulties in classification of malignant astrocytoma samples. In order to obtain a more robust molecular classifier, we analysed RT-qPCR expression data of 175 differentially regulated genes across astrocytoma using Prediction Analysis of Microarrays (PAM) and found the most discriminatory 16-gene expression signature for the classification of anaplastic astrocytoma and glioblastoma. The 16-gene signature obtained in the training set was validated in the test set with diagnostic accuracy of 89%. Additionally, validation of the 16-gene signature in multiple independent cohorts revealed that the signature predicted anaplastic astrocytoma and glioblastoma samples with accuracy rates of 99%, 88%, and 92% in TCGA, GSE1993 and GSE4422 datasets, respectively. The protein-protein interaction network and pathway analysis suggested that the 16-genes of the signature identified epithelial-mesenchymal transition (EMT) pathway as the most differentially regulated pathway in glioblastoma compared to anaplastic astrocytoma. In addition to identifying 16 gene classification signature, we also demonstrated that genes involved in epithelial-mesenchymal transition may play an important role in distinguishing glioblastoma from anaplastic astrocytoma. PMID:24475040
Dysregulated Pathway Identification of Alzheimer's Disease Based on Internal Correlation Analysis of Genes and Pathways.

PubMed

Kong, Wei; Mou, Xiaoyang; Di, Benteng; Deng, Jin; Zhong, Ruxing; Wang, Shuaiqun

2017-11-20

Dysregulated pathway identification is an important task which can gain insight into the underlying biological processes of disease. Current pathway-identification methods focus on a set of co-expression genes and single pathways and ignore the correlation between genes and pathways. The method proposed in this study, takes into account the internal correlations not only between genes but also pathways to identifying dysregulated pathways related to Alzheimer's disease (AD), the most common form of dementia. In order to find the significantly differential genes for AD, mutual information (MI) is used to measure interdependencies between genes other than expression valves. Then, by integrating the topology information from KEGG, the significant pathways involved in the feature genes are identified. Next, the distance correlation (DC) is applied to measure the pairwise pathway crosstalks since DC has the advantage of detecting nonlinear correlations when compared to Pearson correlation. Finally, the pathway pairs with significantly different correlations between normal and AD samples are known as dysregulated pathways. The molecular biology analysis demonstrated that many dysregulated pathways related to AD pathogenesis have been discovered successfully by the internal correlation detection. Furthermore, the insights of the dysregulated pathways in the development and deterioration of AD will help to find new effective target genes and provide important theoretical guidance for drug design. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Metatranscriptome sequence analysis reveals diel periodicity of microbial community gene expression in the ocean's interior

NASA Astrophysics Data System (ADS)

Vislova, A.; Aylward, F.; Sosa, O.; DeLong, E.

2016-02-01

Previous work has revealed diel periodicity of gene expression in key metabolic pathways in both autotrophic and heterotrophic microbes in the surface ocean. In this study, we investigated patterns of diel periodicity of gene expression in depth profiles (25, 75, 125 and 250 meters). We postulated that microbial diel transcriptional signals would be increasingly dampened with depth, and that the timing of peak expression of specific transcripts would be shifted in time between depths, in accordance with depth-dependent diel light variability. Bacterioplankton were sampled from four depths every four hours at station ALOHA (22° 45' N 158° W) over 2 days. RNA was extracted from cells preserved on filters, converted to cDNA, and sequenced on the Illumina platform. Surprisingly, harmonic regression analysis revealed an increasing proportion of genes with diel periodic expression patterns with increasing depth between 25- 125 meters. At 250 meters, the proportion of genes exhibiting diel expression patterns decreased an order of magnitude compared to the photic zone. Community composition, functional gene categories, and diel patterns of gene expression were significantly different between the photic zone and 250 meter samples. The signals driving diel periodic gene expression in microbes at 250 meters is under further investigation. These data are now beginning provide a better understanding of the tempo and mode of microbial dynamics among specific taxa, throughout the ocean's interior.
Pyramiding and evaluation of three dominant brown planthopper resistance genes in the elite indica rice 9311 and its hybrids.

PubMed

Hu, Jie; Cheng, Mingxing; Gao, Guanjun; Zhang, Qinglu; Xiao, Jinghua; He, Yuqing

2013-07-01

Brown planthopper (BPH), Nilaparvata lugens Stål, is the most devastating insect pest in rice-producing areas. Three dominant BPH resistance genes (Bph14, Bph15, Bph18) were pyramided into elite indica rice 9311 and its hybrids using marker-assisted selection. Gene effectiveness was evaluated on the basis of seedling and adult rice resistance, honeydew weight and survival rate of BPH. All three genes affected BPH growth and development and antibiotic factors, resulting in both seedling and adult resistance. Bph15 had the greatest effect on conferring resistance to BPH. The results showed an additive effect of pyramiding genes, the order of the gene effect being 14/15/18 ≥ 14/15 > 15/18 ≥ 15 > 14/18 ≥ 14 ≥ 18 > none. The pyramided or single-gene introgression hybrids showed greater resistance than conventional hybrids, although the heterozygous genotypes had weaker effects than the corresponding homozygous genotypes. Furthermore, field trial data demonstrated that yields of improved 9311 lines were higher than or similar to that of the control under natural field conditions. These improved versions can be immediately used in hybrid improvement and production. Compared with controls, pyramided lines and hybrids with three genes showed the strongest resistance to BPH, without a yield decrease. © 2012 Society of Chemical Industry.
Functional genomics of commercial baker's yeasts that have different abilities for sugar utilization and high-sucrose tolerance under different sugar conditions.

PubMed

Tanaka-Tsuno, Fumiko; Mizukami-Murata, Satomi; Murata, Yoshinori; Nakamura, Toshihide; Ando, Akira; Takagi, Hiroshi; Shima, Jun

2007-10-01

In the modern baking industry, high-sucrose-tolerant (HS) and maltose-utilizing (LS) yeast were developed using breeding techniques and are now used commercially. Sugar utilization and high-sucrose tolerance differ significantly between HS and LS yeasts. We analysed the gene expression profiles of HS and LS yeasts under different sucrose conditions in order to determine their basic physiology. Two-way hierarchical clustering was performed to obtain the overall patterns of gene expression. The clustering clearly showed that the gene expression patterns of LS yeast differed from those of HS yeast. Quality threshold clustering was used to identify the gene clusters containing upregulated genes (cluster 1) and downregulated genes (cluster 2) under high-sucrose conditions. Clusters 1 and 2 contained numerous genes involved in carbon and nitrogen metabolism, respectively. The expression level of the genes involved in the metabolism of glycerol and trehalose, which are known to be osmoprotectants, in LS yeast was higher than that in HS yeast under sucrose concentrations of 5-40%. No clear correlation was found between the expression level of the genes involved in the biosynthesis of the osmoprotectants and the intracellular contents of the osmoprotectants. The present gene expression data were compared with data previously reported in a comprehensive analysis of a gene deletion strain collection. Welch's t-test for this comparison showed that the relative growth rates of the deletion strains whose deletion occurred in genes belonging to cluster 1 were significantly higher than the average growth rates of all deletion strains. Copyright 2007 John Wiley & Sons, Ltd.
Comparative genomic analysis of the MHC: the evolution of class I duplication blocks, diversity and complexity from shark to man.

PubMed

Kulski, Jerzy K; Shiina, Takashi; Anzai, Tatsuya; Kohara, Sakae; Inoko, Hidetoshi

2002-12-01

The major histocompatibility complex (MHC) genomic region is composed of a group of linked genes involved functionally with the adaptive and innate immune systems. The class I and class II genes are intrinsic features of the MHC and have been found in all the jawed vertebrates studied so far. The MHC genomic regions of the human and the chicken (B locus) have been fully sequenced and mapped, and the mouse MHC sequence is almost finished. Information on the MHC genomic structures (size, complexity, genic and intergenic composition and organization, gene order and number) of other vertebrates is largely limited or nonexistent. Therefore, we are mapping, sequencing and analyzing the MHC genomic regions of different human haplotypes and at least eight nonhuman species. Here, we review our progress with these sequences and compare the human MHC structure with that of the nonhuman primates (chimpanzee and rhesus macaque), other mammals (pigs, mice and rats) and nonmammalian vertebrates such as birds (chicken and quail), bony fish (medaka, pufferfish and zebrafish) and cartilaginous fish (nurse shark). This comparison reveals a complex MHC structure for mammals and a relatively simpler design for nonmammalian animals with a hypothetical prototypic structure for the shark. In the mammalian MHC, there are two to five different class I duplication blocks embedded within a framework of conserved nonclass I and/or nonclass II genes. With a few exceptions, the class I framework genes are absent from the MHC of birds, bony fish and sharks. Comparative genomics of the MHC reveal a highly plastic region with major structural differences between the mammalian and nonmammalian vertebrates. Additional genomic data are needed on animals of the reptilia, crocodilia and marsupial classes to find the origins of the class I framework genes and examples of structures that may be intermediate between the simple and complex MHC organizations of birds and mammals, respectively.
Divergent and convergent modes of interaction between wheat and Puccinia graminis f. sp. tritici isolates revealed by the comparative gene co-expression network and genome analyses.

PubMed

Rutter, William B; Salcedo, Andres; Akhunova, Alina; He, Fei; Wang, Shichen; Liang, Hanquan; Bowden, Robert L; Akhunov, Eduard

2017-04-12

Two opposing evolutionary constraints exert pressure on plant pathogens: one to diversify virulence factors in order to evade plant defenses, and the other to retain virulence factors critical for maintaining a compatible interaction with the plant host. To better understand how the diversified arsenals of fungal genes promote interaction with the same compatible wheat line, we performed a comparative genomic analysis of two North American isolates of Puccinia graminis f. sp. tritici (Pgt). The patterns of inter-isolate divergence in the secreted candidate effector genes were compared with the levels of conservation and divergence of plant-pathogen gene co-expression networks (GCN) developed for each isolate. Comprative genomic analyses revealed substantial level of interisolate divergence in effector gene complement and sequence divergence. Gene Ontology (GO) analyses of the conserved and unique parts of the isolate-specific GCNs identified a number of conserved host pathways targeted by both isolates. Interestingly, the degree of inter-isolate sub-network conservation varied widely for the different host pathways and was positively associated with the proportion of conserved effector candidates associated with each sub-network. While different Pgt isolates tended to exploit similar wheat pathways for infection, the mode of plant-pathogen interaction varied for different pathways with some pathways being associated with the conserved set of effectors and others being linked with the diverged or isolate-specific effectors. Our data suggest that at the intra-species level pathogen populations likely maintain divergent sets of effectors capable of targeting the same plant host pathways. This functional redundancy may play an important role in the dynamic of the "arms-race" between host and pathogen serving as the basis for diverse virulence strategies and creating conditions where mutations in certain effector groups will not have a major effect on the pathogen's ability to infect the host.
A comparative study of the inner ear structures of artiodactyls and early cetaceans

DOE Office of Scientific and Technical Information (OSTI.GOV)

Klingshirn, M.A.; Luo, Z.

1994-12-31

It has been suggested that the order Cetacea (whales and porpoises) are closely related to artiodactyls, even-hoofed ungulate mammals such as the pig and cow. Paleontological and molecular data strongly supports this concept of phylogenetic relationships. In a study of DNA sequences of two mitochondrial ribosomal gene segments of cetaceans, the artiodactyls were found to be closest related to Cetaceans. These well accepted studies on the phylogenetic affinities of artiodactyls and cetaceans cause us to conduct a comparative study of the bony structure of the inner ear of these two taxa.
Comparative genome analysis of Pseudomonas genomes including Populus-associated isolates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jun, Se Ran; Wassenaar, Trudy; Nookaew, Intawat

The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches including the rhizosphere and endosphere of many plants influencing phylogenetic diversity and heterogeneity. In this study, comparative genome analysis was performed on over one thousand Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides. Based on average amino acid identity, genomic clusters were identified within the Pseudomonas genus, which showed agreements with clades by NCBI and cliques by IMG. The P. fluorescens group was organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. The speciesmore » P. aeruginosa showed clear distinction in their genomic relatedness compared to other Pseudomonas species groups based on the pan and core genome analysis. The 19 isolates of our 21 Populus-associated isolates formed three distinct subgroups within the P. fluorescens major group, supported by pathway profiles analysis, while two isolates were more closely related to P. chlororaphis and P. putida. The specific genes to Populus-associated subgroups were identified where genes specific to subgroup 1 include several sensory systems such as proteins which act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor; specific genes to subgroup 2 contain unique hypothetical genes; and genes specific to subgroup 3 organisms have a different hydrolase activity. IMPORTANCE The comparative genome analyses of the genus Pseudomonas that included Populus-associated isolates resulted in novel insights into high diversity of Pseudomonas. Consistent and robust genomic clusters with phylogenetic homogeneity were identified, which resolved species-clades that are not clearly defined by 16S rRNA gene sequence analysis alone. The genomic clusters may be reflective of distinct ecological niches to which the organisms have adapted, but this needs to be experimentally characterized with ecologically relevant phenotype properties. This study justifies the need to sequence multiple isolates, especially from P. fluorescens group in order to study functional capabilities from a pangenomic perspective. This information will prove useful when choosing Pseudomonas strains for use to promote growth and increase disease resistance in plants.« less
Comparative genome analysis of Pseudomonas genomes including Populus-associated isolates

DOE PAGES

Jun, Se Ran; Wassenaar, Trudy; Nookaew, Intawat; ...

2016-01-01

The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches including the rhizosphere and endosphere of many plants influencing phylogenetic diversity and heterogeneity. In this study, comparative genome analysis was performed on over one thousand Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides. Based on average amino acid identity, genomic clusters were identified within the Pseudomonas genus, which showed agreements with clades by NCBI and cliques by IMG. The P. fluorescens group was organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. The speciesmore » P. aeruginosa showed clear distinction in their genomic relatedness compared to other Pseudomonas species groups based on the pan and core genome analysis. The 19 isolates of our 21 Populus-associated isolates formed three distinct subgroups within the P. fluorescens major group, supported by pathway profiles analysis, while two isolates were more closely related to P. chlororaphis and P. putida. The specific genes to Populus-associated subgroups were identified where genes specific to subgroup 1 include several sensory systems such as proteins which act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor; specific genes to subgroup 2 contain unique hypothetical genes; and genes specific to subgroup 3 organisms have a different hydrolase activity. IMPORTANCE The comparative genome analyses of the genus Pseudomonas that included Populus-associated isolates resulted in novel insights into high diversity of Pseudomonas. Consistent and robust genomic clusters with phylogenetic homogeneity were identified, which resolved species-clades that are not clearly defined by 16S rRNA gene sequence analysis alone. The genomic clusters may be reflective of distinct ecological niches to which the organisms have adapted, but this needs to be experimentally characterized with ecologically relevant phenotype properties. This study justifies the need to sequence multiple isolates, especially from P. fluorescens group in order to study functional capabilities from a pangenomic perspective. This information will prove useful when choosing Pseudomonas strains for use to promote growth and increase disease resistance in plants.« less
Investigation of the 5' flanking region and exon 3 polymorphisms of IGF-1 gene showed moderate association with semen quality in Sanjabi breed rams.

PubMed

Bakhtiar, R; Abdolmohammadi, A; Hajarian, H; Nikousefat, Z; Kalantar-Neyestanaki, D

2017-12-01

In this study, semen samples were collected from 96 Sanjabi rams in order to investigate the IGF-1 gene polymorphisms and their relationship with the characteristics of semen quality and testicular size. The dimensions of scrotal length, width and circumference were measured during autumn and spring over two years. Blood samples were simultaneously collected from jugular vein to extract DNA. PCR was performed using specific primers to amplify 294 and 272bp fragments including 5' regulatory region and exon 3 of IGF-1 gene, respectively. PCR products were digested by BFOI and Eco88l restriction enzymes, respectively. Two genotypes including AA (194 and 100bp), AB (294, 194 and 100bp) and all possible genotypes including CC (182 and 90bp), CT (272, 182, and 90bp) and TT (272bp) were observed for 5' flanking region and exon 3 of IGF-1 gene, respectively. The significant differences among IGF-1 genotypes for testicular dimensions were not observed. However, the polymorphism of 5' flanking region in the studied population had significant effect on individual motility and percent morphology traits. Animals with AB genotype had significantly higher individual motility compared with AA genotype (P < 0.05). Also, animals with AA genotype had significantly the highest percent morphology compared with AB genotype (P < 0.1). The exon 3 of IGF-1 gene had significant effect on individual motility, concentration, morphology and water test traits. Animals with CT genotype had the highest sperm concentration (P < 0.1) and water test (P < 0.05) compared to CC and TT genotypes. Moreover, animals with TT genotype had significantly the highest percent morphology compared with other genotypes (P < 0.05). Briefly, the results indicated that individual motility, concentration, percent morphology and water test traits could be in association with IGF-1 genotypes. It might be concluded that polymorphisms in IGF-1gene can be considered to develop male fertility in future and for using in selection process of better animals under masker assisted selection programs. Copyright © 2017 Elsevier Inc. All rights reserved.
Identification of novel genes potentially involved in somatic embryogenesis in chicory (Cichorium intybus L.)

PubMed Central

2010-01-01

Background In our laboratory we use cultured chicory (Cichorium intybus) explants as a model to investigate cell reactivation and somatic embryogenesis and have produced 2 chicory genotypes (K59, C15) sharing a similar genetic background. K59 is a responsive genotype (embryogenic) capable of undergoing complete cell reactivation i.e. cell de- and re-differentiation leading to somatic embryogenesis (SE), whereas C15 is a non-responsive genotype (non-embryogenic) and is unable to undergo SE. Previous studies [1] showed that the use of the β-D-glucosyl Yariv reagent (β-GlcY) that specifically binds arabinogalactan-proteins (AGPs) blocked somatic embryo production in chicory root explants. This observation indicates that β-GlcY is a useful tool for investigating somatic embryogenesis (SE) in chicory. In addition, a putative AGP (DT212818) encoding gene was previously found to be significantly up-regulated in the embryogenic K59 chicory genotype as compared to the non-embryogenic C15 genotype suggesting that this AGP could be involved in chicory re-differentiation [2]. In order to improve our understanding of the molecular and cellular regulation underlying SE in chicory, we undertook a detailed cytological study of cell reactivation events in K59 and C15 genotypes, and used microarray profiling to compare gene expression in these 2 genotypes. In addition we also used β-GlcY to block SE in order to identify genes potentially involved in this process. Results Microscopy confirmed that only the K59, but not the C15 genotype underwent complete cell reactivation leading to SE formation. β-GlcY-treatment of explants blocked in vitro SE induction, but not cell reactivation, and induced cell wall modifications. Microarray analyses revealed that 78 genes were differentially expressed between induced K59 and C15 genotypes. The expression profiles of 19 genes were modified by β-GlcY-treatment. Eight genes were both differentially expressed between K59 and C15 genotypes during SE induction and transcriptionally affected by β-GlcY-treatment: AGP (DT212818), 26 S proteasome AAA ATPase subunit 6 (RPT6), remorin (REM), metallothionein-1 (MT1), two non-specific lipid transfer proteins genes (SDI-9 and DEA1), 3-hydroxy-3-methylglutaryl-CoA reductase (HMG-CoA reductase), and snakin 2 (SN2). These results suggest that the 8 genes, including the previously-identified AGP gene (DT212818), could be involved in cell fate determination events leading to SE commitment in chicory. Conclusion The use of two different chicory genotypes differing in their responsiveness to SE induction, together with β-GlcY-treatment represented an efficient tool to discriminate cell reactivation from the SE morphogenetic pathway. Such an approach, together with microarray analyses, permitted us to identify several putative key genes related to the SE morphogenetic pathway in chicory. PMID:20565992
Component identification of electron transport chains in curdlan-producing Agrobacterium sp. ATCC 31749 and its genome-specific prediction using comparative genome and phylogenetic trees analysis.

PubMed

Zhang, Hongtao; Setubal, Joao Carlos; Zhan, Xiaobei; Zheng, Zhiyong; Yu, Lijun; Wu, Jianrong; Chen, Dingqiang

2011-06-01

Agrobacterium sp. ATCC 31749 (formerly named Alcaligenes faecalis var. myxogenes) is a non-pathogenic aerobic soil bacterium used in large scale biotechnological production of curdlan. However, little is known about its genomic information. DNA partial sequence of electron transport chains (ETCs) protein genes were obtained in order to understand the components of ETC and genomic-specificity in Agrobacterium sp. ATCC 31749. Degenerate primers were designed according to ETC conserved sequences in other reported species. DNA partial sequences of ETC genes in Agrobacterium sp. ATCC 31749 were cloned by the PCR method using degenerate primers. Based on comparative genomic analysis, nine electron transport elements were ascertained, including NADH ubiquinone oxidoreductase, succinate dehydrogenase complex II, complex III, cytochrome c, ubiquinone biosynthesis protein ubiB, cytochrome d terminal oxidase, cytochrome bo terminal oxidase, cytochrome cbb (3)-type terminal oxidase and cytochrome caa (3)-type terminal oxidase. Similarity and phylogenetic analyses of these genes revealed that among fully sequenced Agrobacterium species, Agrobacterium sp. ATCC 31749 is closest to Agrobacterium tumefaciens C58. Based on these results a comprehensive ETC model for Agrobacterium sp. ATCC 31749 is proposed.

Complete mitochondrial genome of the aluminum-tolerant fungus Rhodotorula taiwanensis RS1 and comparative analysis of Basidiomycota mitochondrial genomes.

PubMed

Zhao, Xue Qiang; Aizawa, Tomoko; Schneider, Jessica; Wang, Chao; Shen, Ren Fang; Sunairi, Michio

2013-04-01

The complete mitochondrial genome of Rhodotorula taiwanensis RS1, an aluminum-tolerant Basidiomycota fungus, was determined and compared with the known mitochondrial genomes of 12 Basidiomycota species. The mitochondrial genome of R. taiwanensis RS1 is a circular DNA molecule of 40,392 bp and encodes the typical 15 mitochondrial proteins, 23 tRNAs, and small and large rRNAs as well as 10 intronic open reading frames. These genes are apparently transcribed in two directions and do not show syntenies in gene order with other investigated Basidiomycota species. The average G+C content (41%) of the mitochondrial genome of R. taiwanensis RS1 is the highest among the Basidiomycota species. Two introns were detected in the sequence of the atp9 gene of R. taiwanensis RS1, but not in that of other Basidiomycota species. Rhodotorula taiwanensis is the first species of the genus Rhodotorula whose full mitochondrial genome has been sequenced; and the data presented here supply valuable information for understanding the evolution of fungal mitochondrial genomes and researching the mechanism of aluminum tolerance in microorganisms. © 2013 The Authors. Published by Blackwell Publishing Ltd.
High variability of mitochondrial gene order among fungi.

PubMed

Aguileta, Gabriela; de Vienne, Damien M; Ross, Oliver N; Hood, Michael E; Giraud, Tatiana; Petit, Elsa; Gabaldón, Toni

2014-02-01

From their origin as an early alpha proteobacterial endosymbiont to their current state as cellular organelles, large-scale genomic reorganization has taken place in the mitochondria of all main eukaryotic lineages. So far, most studies have focused on plant and animal mitochondrial (mt) genomes (mtDNA), but fungi provide new opportunities to study highly differentiated mtDNAs. Here, we analyzed 38 complete fungal mt genomes to investigate the evolution of mtDNA gene order among fungi. In particular, we looked for evidence of nonhomologous intrachromosomal recombination and investigated the dynamics of gene rearrangements. We investigated the effect that introns, intronic open reading frames (ORFs), and repeats may have on gene order. Additionally, we asked whether the distribution of transfer RNAs (tRNAs) evolves independently to that of mt protein-coding genes. We found that fungal mt genomes display remarkable variation between and within the major fungal phyla in terms of gene order, genome size, composition of intergenic regions, and presence of repeats, introns, and associated ORFs. Our results support previous evidence for the presence of mt recombination in all fungal phyla, a process conspicuously lacking in most Metazoa. Overall, the patterns of rearrangements may be explained by the combined influences of recombination (i.e., most likely nonhomologous and intrachromosomal), accumulated repeats, especially at intergenic regions, and to a lesser extent, mobile element dynamics.
Regulatory heterochronies and loose temporal scaling between sea star and sea urchin regulatory circuits.

PubMed

Gildor, Tsvia; Hinman, Veronica; Ben-Tabou-De-Leon, Smadar

2017-01-01

It has long been argued that heterochrony, a change in relative timing of a developmental process, is a major source of evolutionary innovation. Heterochronic changes of regulatory gene activation could be the underlying molecular mechanism driving heterochronic changes through evolution. Here, we compare the temporal expression profiles of key regulatory circuits between sea urchin and sea star, representative of two classes of Echinoderms that shared a common ancestor about 500 million years ago. The morphologies of the sea urchin and sea star embryos are largely comparable, yet, differences in certain mesodermal cell types and ectodermal patterning result in distinct larval body plans. We generated high resolution temporal profiles of 17 mesodermally-, endodermally- and ectodermally-expressed regulatory genes in the sea star, Patiria miniata, and compared these to their orthologs in the Mediterranean sea urchin, Paracentrotus lividus. We found that the maternal to zygotic transition is delayed in the sea star compared to the sea urchin, in agreement with the longer cleavage stage in the sea star. Interestingly, the order of gene activation shows the highest variation in the relatively diverged mesodermal circuit, while the correlations of expression dynamics are the highest in the strongly conserved endodermal circuit. We detected loose scaling of the developmental rates of these species and observed interspecies heterochronies within all studied regulatory circuits. Thus, after 500 million years of parallel evolution, mild heterochronies between the species are frequently observed and the tight temporal scaling observed for closely related species no longer holds.
A deer (subfamily Cervinae) genetic linkage map and the evolution of ruminant genomes.

PubMed Central

Slate, Jon; Van Stijn, Tracey C; Anderson, Rayna M; McEwan, K Mary; Maqbool, Nauman J; Mathias, Helen C; Bixley, Matthew J; Stevens, Deirdre R; Molenaar, Adrian J; Beever, Jonathan E; Galloway, Susan M; Tate, Michael L

2002-01-01

Comparative maps between ruminant species and humans are increasingly important tools for the discovery of genes underlying economically important traits. In this article we present a primary linkage map of the deer genome derived from an interspecies hybrid between red deer (Cervus elaphus) and Père David's deer (Elaphurus davidianus). The map is approximately 2500 cM long and contains >600 markers including both evolutionary conserved type I markers and highly polymorphic type II markers (microsatellites). Comparative mapping by annotation and sequence similarity (COMPASS) was demonstrated to be a useful tool for mapping bovine and ovine ESTs in deer. Using marker order as a phylogenetic character and comparative map information from human, mouse, deer, cattle, and sheep, we reconstructed the karyotype of the ancestral Pecoran mammal and identified the chromosome rearrangements that have occurred in the sheep, cattle, and deer lineages. The deer map and interspecies hybrid pedigrees described here are a valuable resource for (1) predicting the location of orthologs to human genes in ruminants, (2) mapping QTL in farmed and wild deer populations, and (3) ruminant phylogenetic studies. PMID:11973312
Bile Stress Response in Listeria monocytogenes LO28: Adaptation, Cross-Protection, and Identification of Genetic Loci Involved in Bile Resistance

PubMed Central

Begley, Máire; Gahan, Cormac G. M.; Hill, Colin

2002-01-01

Bile is one of many barriers that Listeria monocytogenes must overcome in the human gastrointestinal tract in order to infect and cause disease. We demonstrated that stationary-phase cultures of L. monocytogenes LO28 were able to tolerate concentrations of bovine, porcine, and human bile and bile acids well in excess of those encountered in vivo. Strain LO28 was relatively bile resistant compared with other clinical isolates of L. monocytogenes, as well as with Listeria innocua, Salmonella enterica serovar Typhimurium LT2, and Lactobacillus sakei. While exponential-phase L. monocytogenes LO28 cells were exquisitely sensitive to unconjugated bile acids, prior adaptation to sublethal levels of bile acids or heterologous stresses, such as acid, heat, salt, or sodium dodecyl sulfate (SDS), significantly enhanced bile resistance. This adaptive response was independent of protein synthesis, and in the cases of bile and SDS adaptation, occurred in seconds. In order to identify genetic loci involved in the bile tolerance phenotype of L. monocytogenes LO28, transposon (Tn917) and plasmid (pORI19) integration banks were screened for bile-sensitive mutants. The disrupted genes included a homologue of the capA locus required for capsule formation in Bacillus anthracis; a gene encoding the transcriptional regulator ZurR; a homologue of an Escherichia coli gene, lytB, involved in isoprenoid biosynthesis; a gene encoding a homologue of the Bacillus subtilis membrane protein YxiO; and a gene encoding an amino acid transporter with a putative role in pH homeostasis, gadE. Interestingly, all of the identified loci play putative roles in maintenance of the cell envelope or in stress responses. PMID:12450822
RNA expression in a cartilaginous fish cell line reveals ancient 3′ noncoding regions highly conserved in vertebrates

PubMed Central

Forest, David; Nishikawa, Ryuhei; Kobayashi, Hiroshi; Parton, Angela; Bayne, Christopher J.; Barnes, David W.

2007-01-01

We have established a cartilaginous fish cell line [Squalus acanthias embryo cell line (SAE)], a mesenchymal stem cell line derived from the embryo of an elasmobranch, the spiny dogfish shark S. acanthias. Elasmobranchs (sharks and rays) first appeared >400 million years ago, and existing species provide useful models for comparative vertebrate cell biology, physiology, and genomics. Comparative vertebrate genomics among evolutionarily distant organisms can provide sequence conservation information that facilitates identification of critical coding and noncoding regions. Although these genomic analyses are informative, experimental verification of functions of genomic sequences depends heavily on cell culture approaches. Using ESTs defining mRNAs derived from the SAE cell line, we identified lengthy and highly conserved gene-specific nucleotide sequences in the noncoding 3′ UTRs of eight genes involved in the regulation of cell growth and proliferation. Conserved noncoding 3′ mRNA regions detected by using the shark nucleotide sequences as a starting point were found in a range of other vertebrate orders, including bony fish, birds, amphibians, and mammals. Nucleotide identity of shark and human in these regions was remarkably well conserved. Our results indicate that highly conserved gene sequences dating from the appearance of jawed vertebrates and representing potential cis-regulatory elements can be identified through the use of cartilaginous fish as a baseline. Because the expression of genes in the SAE cell line was prerequisite for their identification, this cartilaginous fish culture system also provides a physiologically valid tool to test functional hypotheses on the role of these ancient conserved sequences in comparative cell biology. PMID:17227856
Expressed sequence tags from poplar wood tissues--a comparative analysis from multiple libraries.

PubMed

Déjardin, A; Leplé, J-C; Lesage-Descauses, M-C; Costa, G; Pilate, G

2004-01-01

Xylogenesis involves successive developmental processes--cambial division, cell expansion and differentiation, cell death--each occurring along a gradient from the cambium to the pith of the stem. Taking advantage of the high level of organisation of wood tissues, we isolated cambial zone (CZ), differentiating xylem (DX) and mature xylem (MX) from both tension wood (TW) and opposite wood (OW) of bent poplars. Four different cDNA libraries were then constructed and used to generate 10,062 EST, reflecting the genes expressed in the different wood tissues. For the most abundant clusters, the EST distributions were compared between libraries in order to identify genes specific or over-represented at some specific developmental stages. They clearly showed a developmental shift between CZ and DX, whereas there is a continuity of development between DX and MX. CZ was mainly characterized by clusters of genes involved in cell cycle, protein synthesis and fate. Interestingly, two clusters with no assigned function were found specific to the cambial zone. In DX and MX, clusters were mostly involved in methylation of lignin precursors and microtubule cytoskeleton. In addition, in DX, EST from TW and OW were compared: five clusters of arabinogalactan proteins, one for sucrose synthase and one for fructokinase were specific or over-represented in TW. Moreover, a putative transcription factor and a cluster of unknown function were also identified in DX-TW. The informative comparison of multiple libraries prepared from wood tissues led to the identification of genes--some with still unknown functions--putatively involved in xylogenesis and tension wood formation.
Lessons learned from the initial sequencing of the pig genome: comparative analysis of an 8 Mb region of pig chromosome 17

PubMed Central

Hart, Elizabeth A; Caccamo, Mario; Harrow, Jennifer L; Humphray, Sean J; Gilbert, James GR; Trevanion, Steve; Hubbard, Tim; Rogers, Jane; Rothschild, Max F

2007-01-01

Background We describe here the sequencing, annotation and comparative analysis of an 8 Mb region of pig chromosome 17, which provides a useful test region to assess coverage and quality for the pig genome sequencing project. We report our findings comparing the annotation of draft sequence assembled at different depths of coverage. Results Within this region we annotated 71 loci, of which 53 are orthologous to human known coding genes. When compared to the syntenic regions in human (20q13.13-q13.33) and mouse (chromosome 2, 167.5 Mb-178.3 Mb), this region was found to be highly conserved with respect to gene order. The most notable difference between the three species is the presence of a large expansion of zinc finger coding genes and pseudogenes on mouse chromosome 2 between Edn3 and Phactr3 that is absent from pig and human. All of our annotation has been made publicly available in the Vertebrate Genome Annotation browser, VEGA. We assessed the impact of coverage on sequence assembly across this region and found, as expected, that increased sequence depth resulted in fewer, longer contigs. One-third of our annotated loci could not be fully re-aligned back to the low coverage version of the sequence, principally because the transcripts are fragmented over several contigs. Conclusion We have demonstrated the considerable advantages of sequencing at increased read depths and discuss the implications that lower coverage sequence may have on subsequent comparative and functional studies, particularly those involving complex loci such as GNAS. PMID:17705864
Estimating gene function with least squares nonnegative matrix factorization.

PubMed

Wang, Guoli; Ochs, Michael F

2007-01-01

Nonnegative matrix factorization is a machine learning algorithm that has extracted information from data in a number of fields, including imaging and spectral analysis, text mining, and microarray data analysis. One limitation with the method for linking genes through microarray data in order to estimate gene function is the high variance observed in transcription levels between different genes. Least squares nonnegative matrix factorization uses estimates of the uncertainties on the mRNA levels for each gene in each condition, to guide the algorithm to a local minimum in normalized chi2, rather than a Euclidean distance or divergence between the reconstructed data and the data itself. Herein, application of this method to microarray data is demonstrated in order to predict gene function.
Distribution, diversity and abundance of bacterial laccase-like genes in different particle size fractions of sediments in a subtropical mangrove ecosystem.

PubMed

Luo, Ling; Zhou, Zhi-Chao; Gu, Ji-Dong

2015-10-01

This study investigated the diversity and abundance of bacterial lacasse-like genes in different particle size fractions, namely sand, silt, and clay of sediments in a subtropical mangrove ecosystem. Moreover, the effects of nutrient conditions on bacterial laccase-like communities as well as the correlation between nutrients and, both the abundance and diversity indices of laccase-like bacteria in particle size fractions were also studied. Compared to bulk sediments, Bacteroidetes, Caldithrix, Cyanobacteria and Chloroflexi were dominated in all 3 particle-size fractions of intertidal sediment (IZ), but Actinobacteria and Firmicutes were lost after the fractionation procedures used. The diversity index of IZ fractions decreased in the order of bulk > clay > silt > sand. In fractions of mangrove forest sediment (MG), Verrucomicrobia was found in silt, and both Actinobacteria and Bacteroidetes appeared in clay, but no new species were found in sand. The declining order of diversity index in MG fractions was clay > silt > sand > bulk. Furthermore, the abundance of lacasse-like bacteria varied with different particle-size fractions significantly (p < 0.05), and decreased in the order of sand > clay > silt in both IZ and MG fractions. Additionally, nutrient availability was found to significantly affect the diversity and community structure of laccase-like bacteria (p < 0.05), while the total organic carbon contents were positively related to the abundance of bacterial laccase-like genes in particle size fractions (p < 0.05). Therefore, this study further provides evidence that bacterial laccase plays a vital role in turnover of sediment organic matter and cycling of nutrients.
A greedy, graph-based algorithm for the alignment of multiple homologous gene lists.

PubMed

Fostier, Jan; Proost, Sebastian; Dhoedt, Bart; Saeys, Yvan; Demeester, Piet; Van de Peer, Yves; Vandepoele, Klaas

2011-03-15

Many comparative genomics studies rely on the correct identification of homologous genomic regions using accurate alignment tools. In such case, the alphabet of the input sequences consists of complete genes, rather than nucleotides or amino acids. As optimal multiple sequence alignment is computationally impractical, a progressive alignment strategy is often employed. However, such an approach is susceptible to the propagation of alignment errors in early pairwise alignment steps, especially when dealing with strongly diverged genomic regions. In this article, we present a novel accurate and efficient greedy, graph-based algorithm for the alignment of multiple homologous genomic segments, represented as ordered gene lists. Based on provable properties of the graph structure, several heuristics are developed to resolve local alignment conflicts that occur due to gene duplication and/or rearrangement events on the different genomic segments. The performance of the algorithm is assessed by comparing the alignment results of homologous genomic segments in Arabidopsis thaliana to those obtained by using both a progressive alignment method and an earlier graph-based implementation. Especially for datasets that contain strongly diverged segments, the proposed method achieves a substantially higher alignment accuracy, and proves to be sufficiently fast for large datasets including a few dozens of eukaryotic genomes. http://bioinformatics.psb.ugent.be/software. The algorithm is implemented as a part of the i-ADHoRe 3.0 package.
Unstable genomes elevate transcriptome dynamics

PubMed Central

Stevens, Joshua B.; Liu, Guo; Abdallah, Batoul Y.; Horne, Steven D.; Ye, Karen J.; Bremer, Steven W.; Ye, Christine J.; Krawetz, Stephen A.; Heng, Henry H.

2015-01-01

The challenge of identifying common expression signatures in cancer is well known, however the reason behind this is largely unclear. Traditionally variation in expression signatures has been attributed to technological problems, however recent evidence suggests that chromosome instability (CIN) and resultant karyotypic heterogeneity may be a large contributing factor. Using a well-defined model of immortalization, we systematically compared the pattern of genome alteration and expression dynamics during somatic evolution. Co-measurement of global gene expression and karyotypic alteration throughout the immortalization process reveals that karyotype changes influence gene expression as major structural and numerical karyotypic alterations result in large gene expression deviation. Replicate samples from stages with stable genomes are more similar to each other than are replicate samples with karyotypic heterogeneity. Karyotypic and gene expression change during immortalization is dynamic as each stage of progression has a unique expression pattern. This was further verified by comparing global expression in two replicates grown in one flask with known karyotypes. Replicates with higher karyotypic instability were found to be less similar than replicates with stable karyotypes. This data illustrates the karyotype, transcriptome, and transcriptome determined pathways are in constant flux during somatic cellular evolution (particularly during the macroevolutionary phase) and this flux is an inextricable feature of CIN and essential for cancer formation. The findings presented here underscore the importance of understanding the evolutionary process of cancer in order to design improved treatment modalities. PMID:24122714
Silk gene expression of theridiid spiders: implications for male-specific silk use.

PubMed

Correa-Garhwal, Sandra M; Chaw, R Crystal; Clarke, Thomas H; Ayoub, Nadia A; Hayashi, Cheryl Y

2017-06-01

Spiders (order Araneae) rely on their silks for essential tasks, such as dispersal, prey capture, and reproduction. Spider silks are largely composed of spidroins, members of a protein family that are synthesized in silk glands. As needed, silk stored in silk glands is extruded through spigots on the spinnerets. Nearly all studies of spider silks have been conducted on females; thus, little is known about male silk biology. To shed light on silk use by males, we compared silk gene expression profiles of mature males to those of females from three cob-web weaving species (Theridiidae). We de novo assembled species-specific male transcriptomes from Latrodectus hesperus, Latrodectus geometricus, and Steatoda grossa followed by differential gene expression analyses. Consistent with their complement of silk spigots, male theridiid spiders express appreciable amounts of aciniform, major ampullate, minor ampullate, and pyriform spidroin genes but not tubuliform spidroin genes. The relative expression levels of particular spidroin genes varied between sexes and species. Because mature males desert their prey-capture webs and become cursorial in their search for mates, we anticipated that major ampullate (dragline) spidroin genes would be the silk genes most highly expressed by males. Indeed, major ampullate spidroin genes had the highest expression in S. grossa males. However, minor ampullate spidroin genes were the most highly expressed spidroin genes in L. geometricus and L. hesperus males. Our expression profiling results suggest species-specific adaptive divergence of silk use by male theridiids. Copyright © 2017 The Authors. Published by Elsevier GmbH.. All rights reserved.
confFuse: High-Confidence Fusion Gene Detection across Tumor Entities.

PubMed

Huang, Zhiqin; Jones, David T W; Wu, Yonghe; Lichter, Peter; Zapatka, Marc

2017-01-01

Background: Fusion genes play an important role in the tumorigenesis of many cancers. Next-generation sequencing (NGS) technologies have been successfully applied in fusion gene detection for the last several years, and a number of NGS-based tools have been developed for identifying fusion genes during this period. Most fusion gene detection tools based on RNA-seq data report a large number of candidates (mostly false positives), making it hard to prioritize candidates for experimental validation and further analysis. Selection of reliable fusion genes for downstream analysis becomes very important in cancer research. We therefore developed confFuse, a scoring algorithm to reliably select high-confidence fusion genes which are likely to be biologically relevant. Results: confFuse takes multiple parameters into account in order to assign each fusion candidate a confidence score, of which score ≥8 indicates high-confidence fusion gene predictions. These parameters were manually curated based on our experience and on certain structural motifs of fusion genes. Compared with alternative tools, based on 96 published RNA-seq samples from different tumor entities, our method can significantly reduce the number of fusion candidates (301 high-confidence from 8,083 total predicted fusion genes) and keep high detection accuracy (recovery rate 85.7%). Validation of 18 novel, high-confidence fusions detected in three breast tumor samples resulted in a 100% validation rate. Conclusions: confFuse is a novel downstream filtering method that allows selection of highly reliable fusion gene candidates for further downstream analysis and experimental validations. confFuse is available at https://github.com/Zhiqin-HUANG/confFuse.
Linking Genes to Cardiovascular Diseases: Gene Action and Gene–Environment Interactions

PubMed Central

2016-01-01

A unique myocardial characteristic is its ability to grow/remodel in order to adapt; this is determined partly by genes and partly by the environment and the milieu intérieur. In the “post-genomic” era, a need is emerging to elucidate the physiologic functions of myocardial genes, as well as potential adaptive and maladaptive modulations induced by environmental/epigenetic factors. Genome sequencing and analysis advances have become exponential lately, with escalation of our knowledge concerning sometimes controversial genetic underpinnings of cardiovascular diseases. Current technologies can identify candidate genes variously involved in diverse normal/abnormal morphomechanical phenotypes, and offer insights into multiple genetic factors implicated in complex cardiovascular syndromes. The expression profiles of thousands of genes are regularly ascertained under diverse conditions. Global analyses of gene expression levels are useful for cataloging genes and correlated phenotypes, and for elucidating the role of genes in maladies. Comparative expression of gene networks coupled to complex disorders can contribute insights as to how “modifier genes” influence the expressed phenotypes. Increasingly, a more comprehensive and detailed systematic understanding of genetic abnormalities underlying, for example, various genetic cardiomyopathies is emerging. Implementing genomic findings in cardiology practice may well lead directly to better diagnosing and therapeutics. There is currently evolving a strong appreciation for the value of studying gene anomalies, and doing so in a non-disjointed, cohesive manner. However, it is challenging for many—practitioners and investigators—to comprehend, interpret, and utilize the clinically increasingly accessible and affordable cardiovascular genomics studies. This survey addresses the need for fundamental understanding in this vital area. PMID:26545598
Convergent evolution of the genomes of marine mammals

USGS Publications Warehouse

Foote, Andrew D.; Liu, Yue; Thomas, Gregg W.C.; Vinař, Tomáš; Alföldi, Jessica; Deng, Jixin; Dugan, Shannon; van Elk, Cornelis E.; Hunter, Margaret; Joshi, Vandita; Khan, Ziad; Kovar, Christie; Lee, Sandra L.; Lindblad-Toh, Kerstin; Mancia, Annalaura; Nielsen, Rasmus; Qin, Xiang; Qu, Jiaxin; Raney, Brian J.; Vijay, Nagarjun; Wolf, Jochen B. W.; Hahn, Matthew W.; Muzny, Donna M.; Worley, Kim C.; Gilbert, M. Thomas P.; Gibbs, Richard A.

2015-01-01

Marine mammals from different mammalian orders share several phenotypic traits adapted to the aquatic environment and therefore represent a classic example of convergent evolution. To investigate convergent evolution at the genomic level, we sequenced and performed de novo assembly of the genomes of three species of marine mammals (the killer whale, walrus and manatee) from three mammalian orders that share independently evolved phenotypic adaptations to a marine existence. Our comparative genomic analyses found that convergent amino acid substitutions were widespread throughout the genome and that a subset of these substitutions were in genes evolving under positive selection and putatively associated with a marine phenotype. However, we found higher levels of convergent amino acid substitutions in a control set of terrestrial sister taxa to the marine mammals. Our results suggest that, whereas convergent molecular evolution is relatively common, adaptive molecular convergence linked to phenotypic convergence is comparatively rare.
Convergent evolution of the genomes of marine mammals

PubMed Central

Foote, Andrew D.; Liu, Yue; Thomas, Gregg W.C.; Vinař, Tomáš; Alföldi, Jessica; Deng, Jixin; Dugan, Shannon; van Elk, Cornelis E.; Hunter, Margaret E.; Joshi, Vandita; Khan, Ziad; Kovar, Christie; Lee, Sandra L.; Lindblad-Toh, Kerstin; Mancia, Annalaura; Nielsen, Rasmus; Qin, Xiang; Qu, Jiaxin; Raney, Brian J.; Vijay, Nagarjun; Wolf, Jochen B. W.; Hahn, Matthew W.; Muzny, Donna M.; Worley, Kim C.; Gilbert, M. Thomas P.; Gibbs, Richard A.

2015-01-01

Marine mammals from different mammalian orders share several phenotypic traits adapted to the aquatic environment and are therefore a classic example of convergent evolution. To investigate convergent evolution at the genomic level, we sequenced and de novo assembled the genomes of three species of marine mammals (the killer whale, walrus and manatee) from three mammalian orders that share independently evolved phenotypic adaptations to a marine existence. Our comparative genomic analyses found that convergent amino acid substitutions were widespread throughout the genome, and that a subset were in genes evolving under positive selection and putatively associated with a marine phenotype. However, we found higher levels of convergent amino acid substitutions in a control set of terrestrial sister taxa to the marine mammals. Our results suggest that while convergent molecular evolution is relatively common, adaptive molecular convergence linked to phenotypic convergence is comparatively rare. PMID:25621460
Isolation and characterization of major histocompatibility complex class IIB genes from the nurse shark.

PubMed

Bartl, S; Weissman, I L

1994-01-04

The major histocompatibility complex (MHC) contains a set of linked genes which encode cell surface proteins involved in the binding of small peptide antigens for their subsequent recognition by T lymphocytes. MHC proteins share structural features and the presence and location of polymorphic residues which play a role in the binding of antigens. In order to compare the structure of these molecules and gain insights into their evolution, we have isolated two MHC class IIB genes from the nurse shark, Ginglymostoma cirratum. Two clones, most probably alleles, encode proteins which differ by 13 amino acids located in the putative antigen-binding cleft. The protein structure and the location of polymorphic residues are similar to their mammalian counterparts. Although these genes appear to encode a typical MHC protein, no T-cell-mediated responses have been demonstrated in cartilaginous fish. The nurse shark represents the most phylogenetically primitive organism in which both class IIA [Kasahara, M., Vazquez, M., Sato, K., McKinney, E.C. & Flajnik, M.F. (1992) Proc. Natl. Acad. Sci USA 89, 6688-6692] and class IIB genes, presumably encoding the alpha/beta heterodimer, have been isolated.
Genome-Wide Prediction of the Polymorphic Ser Gene Family in Tetrahymena thermophila Based on Motif Analysis

PubMed Central

Ponsuwanna, Patrath; Kümpornsin, Krittikorn; Chookajorn, Thanat

2014-01-01

Even though antigenic variation is employed among parasitic protozoa for host immune evasion, Tetrahymena thermophila, a free-living ciliate, can also change its surface protein antigens. These cysteine-rich glycosylphosphatidylinositol (GPI)-linked surface proteins are encoded by a family of polymorphic Ser genes. Despite the availability of T. thermophila genome, a comprehensive analysis of the Ser family is limited by its high degree of polymorphism. In order to overcome this problem, a new approach was adopted by searching for Ser candidates with common motif sequences, namely length-specific repetitive cysteine pattern and GPI anchor site. The candidate genes were phylogenetically compared with the previously identified Ser genes and classified into subtypes. Ser candidates were often found to be located as tandem arrays of the same subtypes on several chromosomal scaffolds. Certain Ser candidates located in the same chromosomal arrays were transcriptionally expressed at specific T. thermophila developmental stages. These Ser candidates selected by the motif analysis approach can form the foundation for a systematic identification of the entire Ser gene family, which will contribute to the understanding of their function and the basis of T. thermophila antigenic variation. PMID:25133747
[Construction and characterization of liposomal magnetofection system in pig kidney cells].

PubMed

Chen, Wenjie; Cui, Haixin; Zhao, Xiang; Cui, Jinhui; Wang, Yan; Sun, Changjiao

2014-06-01

Magnetic nano gene vector is one of the non-viral gene vectors, modified by functional group to bind cationic transfect reagents. Coupling magnetofection with the universal lipofection we developed a novel somatic cell transfection method as the so-called liposomal magnetofection (LMF). This approach is potential to provide somatic cell cloning with stable genetic cell lines to cultivate transgenic animals. In order to construct such liposomal magnetic gene vectors complexes system, we used nano magnetic gene vector to combine with liposomal cationic transfect reagents by molecular self-assembly. This vectors system successfully carried exogenous gene and then transfected animal somatic cells. Here, we conducted atomic force microscopy (AFM), zeta potential-diameter analysis and other characterization experiments to investegate the size distribution and morphology of magnetic nanoparticles, the way of the vectors to load and concentrate DNA molecules. Our data reveal that, the LMF of Pig Kidney cells exhibited higher transfection efficiency comparing with the transfection mediated by the commercial lipofectamine2000. Moreover, LMF method overcomes the constraint of transient expression mediated by lipofection. Meanwhile, MTT assay showed low cytotoxicity of LMF. Hence, LMF is a feasible, low cytotoxic and effective method of cell transfection.

Copy number variants analysis in a cohort of isolated and syndromic developmental delay/intellectual disability reveals novel genomic disorders, position effects and candidate disease genes.

PubMed

Di Gregorio, E; Riberi, E; Belligni, E F; Biamino, E; Spielmann, M; Ala, U; Calcia, A; Bagnasco, I; Carli, D; Gai, G; Giordano, M; Guala, A; Keller, R; Mandrile, G; Arduino, C; Maffè, A; Naretto, V G; Sirchia, F; Sorasio, L; Ungari, S; Zonta, A; Zacchetti, G; Talarico, F; Pappi, P; Cavalieri, S; Giorgio, E; Mancini, C; Ferrero, M; Brussino, A; Savin, E; Gandione, M; Pelle, A; Giachino, D F; De Marchi, M; Restagno, G; Provero, P; Cirillo Silengo, M; Grosso, E; Buxbaum, J D; Pasini, B; De Rubeis, S; Brusco, A; Ferrero, G B

2017-10-01

Array-comparative genomic hybridization (array-CGH) is a widely used technique to detect copy number variants (CNVs) associated with developmental delay/intellectual disability (DD/ID). Identification of genomic disorders in DD/ID. We performed a comprehensive array-CGH investigation of 1,015 consecutive cases with DD/ID and combined literature mining, genetic evidence, evolutionary constraint scores, and functional information in order to assess the pathogenicity of the CNVs. We identified non-benign CNVs in 29% of patients. Amongst the pathogenic variants (11%), detected with a yield consistent with the literature, we found rare genomic disorders and CNVs spanning known disease genes. We further identified and discussed 51 cases with likely pathogenic CNVs spanning novel candidate genes, including genes encoding synaptic components and/or proteins involved in corticogenesis. Additionally, we identified two deletions spanning potential Topological Associated Domain (TAD) boundaries probably affecting the regulatory landscape. We show how phenotypic and genetic analyses of array-CGH data allow unraveling complex cases, identifying rare disease genes, and revealing unexpected position effects. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Welcome to pandoraviruses at the ‘Fourth TRUC’ club

PubMed Central

Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier

2015-01-01

Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9–2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the ‘Fourth TRUC’ club, encompassing distinct life forms compared with cellular organisms. PMID:26042093
Incorporating interaction networks into the determination of functionally related hit genes in genomic experiments with Markov random fields

PubMed Central

Robinson, Sean; Nevalainen, Jaakko; Pinna, Guillaume; Campalans, Anna; Radicella, J. Pablo; Guyon, Laurent

2017-01-01

Abstract Motivation: Incorporating gene interaction data into the identification of ‘hit’ genes in genomic experiments is a well-established approach leveraging the ‘guilt by association’ assumption to obtain a network based hit list of functionally related genes. We aim to develop a method to allow for multivariate gene scores and multiple hit labels in order to extend the analysis of genomic screening data within such an approach. Results: We propose a Markov random field-based method to achieve our aim and show that the particular advantages of our method compared with those currently used lead to new insights in previously analysed data as well as for our own motivating data. Our method additionally achieves the best performance in an independent simulation experiment. The real data applications we consider comprise of a survival analysis and differential expression experiment and a cell-based RNA interference functional screen. Availability and implementation: We provide all of the data and code related to the results in the paper. Contact: sean.j.robinson@utu.fi or laurent.guyon@cea.fr Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28881978
Welcome to pandoraviruses at the 'Fourth TRUC' club.

PubMed

Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier

2015-01-01

Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9-2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the 'Fourth TRUC' club, encompassing distinct life forms compared with cellular organisms.
[Expression analysis of a transformer gene in Daphnia pulex after RNAi].

PubMed

Guo, C Y; Chen, P; Zhang, M M; Ning, J J; Wang, С L; Wang, D L; Zhao, Y L

2016-01-01

In order to explore the importance of the transformer (tra) gene in reproductive mode switching in Daphnia pulex, we studied the effect of silencing of this gene using RNA interference (RNAi). We obtained Dptra dsRNA by constructing and using a dsRNA expression vector and transcription method in vitro. D. pulex individuals in different reproductive modes were treated by soaking in a solution of Dptra dsRNA. We then assayed the expression of the endogenous Dptra mRNA after RNAi treatment using RT-PCR and obtained the suppression ratio. Expression of the tra gene in the RNAi groups was down-regulated compared with the controls after 16 h (p < 0.05). We also analyzed the effect of RNAi on the expression of the TRA protein using Western blot, which showed that the expression level of the TRA protein was reduced after RNAi treatment. Our experimental results showed that soaking of D. pulex adults in tra-specific dsRNA transcribed in vitro can specifically reduce the level of tra mRNA and also reduce the expression of the TRA protein, demonstrating effective in vivo silencing of the tra gene.
Genetics, Genomics and Evolution of Ergot Alkaloid Diversity

PubMed Central

Young, Carolyn A.; Schardl, Christopher L.; Panaccione, Daniel G.; Florea, Simona; Takach, Johanna E.; Charlton, Nikki D.; Moore, Neil; Webb, Jennifer S.; Jaromczyk, Jolanta

2015-01-01

The ergot alkaloid biosynthesis system has become an excellent model to study evolutionary diversification of specialized (secondary) metabolites. This is a very diverse class of alkaloids with various neurotropic activities, produced by fungi in several orders of the phylum Ascomycota, including plant pathogens and protective plant symbionts in the family Clavicipitaceae. Results of comparative genomics and phylogenomic analyses reveal multiple examples of three evolutionary processes that have generated ergot-alkaloid diversity: gene gains, gene losses, and gene sequence changes that have led to altered substrates or product specificities of the enzymes that they encode (neofunctionalization). The chromosome ends appear to be particularly effective engines for gene gains, losses and rearrangements, but not necessarily for neofunctionalization. Changes in gene expression could lead to accumulation of various pathway intermediates and affect levels of different ergot alkaloids. Genetic alterations associated with interspecific hybrids of Epichloë species suggest that such variation is also selectively favored. The huge structural diversity of ergot alkaloids probably represents adaptations to a wide variety of ecological situations by affecting the biological spectra and mechanisms of defense against herbivores, as evidenced by the diverse pharmacological effects of ergot alkaloids used in medicine. PMID:25875294
Over, and Underexpression of Endothelin 1 and TGF-Beta Family Ligands and Receptors in Lung Tissue of Broilers with Pulmonary Hypertension

PubMed Central

Dominguez-Avila, Norma; Ruiz-Castañeda, Gabriel; González-Ramírez, Javier; Fernandez-Jaramillo, Nora; Escoto, Jorge; Sánchez-Muñoz, Fausto; Marquez-Velasco, Ricardo; Bojalil, Rafael; Espinosa-Cervantes, Román; Sánchez, Fausto

2013-01-01

Transforming growth factor beta (TGFβ) is a family of genes that play a key role in mediating tissue remodeling in various forms of acute and chronic lung disease. In order to assess their role on pulmonary hypertension in broilers, we determined mRNA expression of genes of the TGFβ family and endothelin 1 in lung samples from 4-week-old chickens raised either under normal or cold temperature conditions. Both in control and cold-treated groups of broilers, endothelin 1 mRNA expression levels in lungs from ascitic chickens were higher than levels from healthy birds (P < 0.05), whereas levels in animals with cardiac failure were intermediate. Conversely, TGFβ2 and TGFβ3 gene expression in lungs were higher in healthy animals than in ascitic animals in both groups (P < 0.05). TGFβ1, TβRI, and TβRII mRNA gene expression among healthy, ascitic, and chickens with cardiac failure showed no differences (P > 0.05). BAMBI mRNA gene expression was lowest in birds with ascites only in the control group as compared with the values from healthy birds (P < 0.05). PMID:24286074
Mutational screening in genes related with porto-pulmonary hypertension: An analysis of 6 cases.

PubMed

Pousada, Guillermo; Baloira, Adolfo; Valverde, Diana

2017-04-07

Portopulmonary hypertension (PPH) is a rare disease with a low incidence and without a clearly-identified genetic component. The aim of this work was to check genes and genetic modifiers related to pulmonary arterial hypertension in patients with PPH in order to clarify the molecular basis of the pathology. We selected a total of 6 patients with PPH and amplified the exonic regions and intronic flanking regions of the relevant genes and regions of interest of the genetic modifiers. Six patients diagnosed with PPH were analyzed and compared to 55 healthy individuals. Potentially-pathogenic mutations were identified in the analyzed genes of 5 patients. None of these mutations, which are highly conserved throughout evolution, were detected in the control patients nor different databases analyzed (1000 Genomes, ExAC and DECIPHER). After analyzing for genetic modifiers, we found different variations that could favor the onset of the disease. The genetic analysis carried out in this small cohort of patients with PPH revealed a large number of mutations, with the ENG gene showing the greatest mutational frequency. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Phylogenetic modeling of lateral gene transfer reconstructs the pattern and relative timing of speciations

PubMed Central

Szöllősi, Gergely J.; Boussau, Bastien; Abby, Sophie S.; Tannier, Eric; Daubin, Vincent

2012-01-01

The timing of the evolution of microbial life has largely remained elusive due to the scarcity of prokaryotic fossil record and the confounding effects of the exchange of genes among possibly distant species. The history of gene transfer events, however, is not a series of individual oddities; it records which lineages were concurrent and thus provides information on the timing of species diversification. Here, we use a probabilistic model of genome evolution that accounts for differences between gene phylogenies and the species tree as series of duplication, transfer, and loss events to reconstruct chronologically ordered species phylogenies. Using simulations we show that we can robustly recover accurate chronologically ordered species phylogenies in the presence of gene tree reconstruction errors and realistic rates of duplication, transfer, and loss. Using genomic data we demonstrate that we can infer rooted species phylogenies using homologous gene families from complete genomes of 10 bacterial and archaeal groups. Focusing on cyanobacteria, distinguished among prokaryotes by a relative abundance of fossils, we infer the maximum likelihood chronologically ordered species phylogeny based on 36 genomes with 8,332 homologous gene families. We find the order of speciation events to be in full agreement with the fossil record and the inferred phylogeny of cyanobacteria to be consistent with the phylogeny recovered from established phylogenomics methods. Our results demonstrate that lateral gene transfers, detected by probabilistic models of genome evolution, can be used as a source of information on the timing of evolution, providing a valuable complement to the limited prokaryotic fossil record. PMID:23043116
Human-Specific Duplication and Mosaic Transcripts: The Recent Paralogous Structure of Chromosome 22

PubMed Central

Bailey, Jeffrey A. ; Yavor, Amy M. ; Viggiano, Luigi ; Misceo, Doriana ; Horvath, Juliann E. ; Archidiacono, Nicoletta ; Schwartz, Stuart ; Rocchi, Mariano ; Eichler, Evan E.

2002-01-01

In recent decades, comparative chromosomal banding, chromosome painting, and gene-order studies have shown strong conservation of gross chromosome structure and gene order in mammals. However, findings from the human genome sequence suggest an unprecedented degree of recent (<35 million years ago) segmental duplication. This dynamism of segmental duplications has important implications in disease and evolution. Here we present a chromosome-wide view of the structure and evolution of the most highly homologous duplications (⩾1 kb and ⩾90%) on chromosome 22. Overall, 10.8% (3.7/33.8 Mb) of chromosome 22 is duplicated, with an average sequence identity of 95.4%. To organize the duplications into tractable units, intron-exon structure and well-defined duplication boundaries were used to define 78 duplicated modules (minimally shared evolutionary segments) with 157 copies on chromosome 22. Analysis of these modules provides evidence for the creation or modification of 11 novel transcripts. Comparative FISH analyses of human, chimpanzee, gorilla, orangutan, and macaque reveal qualitative and quantitative differences in the distribution of these duplications—consistent with their recent origin. Several duplications appear to be human specific, including a ∼400-kb duplication (99.4%–99.8% sequence identity) that transposed from chromosome 14 to the most proximal pericentromeric region of chromosome 22. Experimental and in silico data further support a pericentromeric gradient of duplications where the most recent duplications transpose adjacent to the centromere. Taken together, these data suggest that segmental duplications have been an ongoing process of primate genome evolution, contributing to recent gene innovation and the dynamic transformation of genome architecture within and among closely related species. PMID:11731936
Novel relationships among ten fish model species revealed based on a phylogenomic analysis using ESTs.

PubMed

Steinke, Dirk; Salzburger, Walter; Meyer, Axel

2006-06-01

The power of comparative phylogenomic analyses also depends on the amount of data that are included in such studies. We used expressed sequence tags (ESTs) from fish model species as a proof of principle approach in order to test the reliability of using ESTs for phylogenetic inference. As expected, the robustness increases with the amount of sequences. Although some progress has been made in the elucidation of the phylogeny of teleosts, relationships among the main lineages of the derived fish (Euteleostei) remain poorly defined and are still debated. We performed a phylogenomic analysis of a set of 42 of orthologous genes from 10 available fish model systems from seven different orders (Salmoniformes, Siluriformes, Cypriniformes, Tetraodontiformes, Cyprinodontiformes, Beloniformes, and Perciformes) of euteleostean fish to estimate divergence times and evolutionary relationships among those lineages. All 10 fish species serve as models for developmental, aquaculture, genomic, and comparative genetic studies. The phylogenetic signal and the strength of the contribution of each of the 42 orthologous genes were estimated with randomly chosen data subsets. Our study revealed a molecular phylogeny of higher-level relationships of derived teleosts, which indicates that the use of multiple genes produces robust phylogenies, a finding that is expected to apply to other phylogenetic issues among distantly related taxa. Our phylogenomic analyses confirm that the euteleostean superorders Ostariophysi and Acanthopterygii are monophyletic and the Protacanthopterygii and Ostariophysi are sister clades. In addition, and contrary to the traditional phylogenetic hypothesis, our analyses determine that killifish (Cyprinodontiformes), medaka (Beloniformes), and cichlids (Perciformes) appear to be more closely related to each other than either of them is to pufferfish (Tetraodontiformes). All 10 lineages split before or during the fragmentation of the supercontinent Pangea in the Jurassic.
Phylum Level Change in the Cecal and Fecal Gut Communities of Rats Fed Diets Containing Different Fermentable Substrates Supports a Role for Nitrogen as a Factor Contributing to Community Structure

PubMed Central

Kalmokoff, Martin; Franklin, Jeff; Petronella, Nicholas; Green, Judy; Brooks, Stephen P.J.

2015-01-01

Fermentation differs between the proximal and distal gut but little is known regarding how the bacterial communities differ or how they are influenced by diet. In order to investigate this, we compared community diversity in the cecum and feces of rats by 16S rRNA gene content and DNA shot gun metagenomics after feeding purified diets containing different fermentable substrates. Gut community composition was dependent on the source of fermentable substrate included in the diet. Cecal communities were dominated by Firmicutes, and contained a higher abundance of Lachnospiraceae compared to feces. In feces, community structure was shifted by varying degrees depending on diet towards the Bacteroidetes, although this change was not always evident from 16S rRNA gene data. Multi-dimensional scaling analysis (PCoA) comparing cecal and fecal metagenomes grouped by location within the gut rather than by diet, suggesting that factors in addition to substrate were important for community change in the distal gut. Differentially abundant genes in each environment supported this shift away from the Firmicutes in the cecum (e.g., motility) towards the Bacteroidetes in feces (e.g., Bacteroidales transposons). We suggest that this phylum level change reflects a shift to ammonia as the primary source of nitrogen used to support continued microbial growth in the distal gut. PMID:25954902
Phylum level change in the cecal and fecal gut communities of rats fed diets containing different fermentable substrates supports a role for nitrogen as a factor contributing to community structure.

PubMed

Kalmokoff, Martin; Franklin, Jeff; Petronella, Nicholas; Green, Judy; Brooks, Stephen P J

2015-05-06

Fermentation differs between the proximal and distal gut but little is known regarding how the bacterial communities differ or how they are influenced by diet. In order to investigate this, we compared community diversity in the cecum and feces of rats by 16S rRNA gene content and DNA shot gun metagenomics after feeding purified diets containing different fermentable substrates. Gut community composition was dependent on the source of fermentable substrate included in the diet. Cecal communities were dominated by Firmicutes, and contained a higher abundance of Lachnospiraceae compared to feces. In feces, community structure was shifted by varying degrees depending on diet towards the Bacteroidetes, although this change was not always evident from 16S rRNA gene data. Multi-dimensional scaling analysis (PCoA) comparing cecal and fecal metagenomes grouped by location within the gut rather than by diet, suggesting that factors in addition to substrate were important for community change in the distal gut. Differentially abundant genes in each environment supported this shift away from the Firmicutes in the cecum (e.g., motility) towards the Bacteroidetes in feces (e.g., Bacteroidales transposons). We suggest that this phylum level change reflects a shift to ammonia as the primary source of nitrogen used to support continued microbial growth in the distal gut.
Concentration of facultative pathogenic bacteria and antibiotic resistance genes during sewage treatment and in receiving rivers.

PubMed

Heß, Stefanie; Lüddeke, Frauke; Gallert, Claudia

2016-10-01

Whereas the hygienic condition of drinking and bathing water by law must be monitored by culture-based methods, for quantification of microbes and antibiotic resistance in soil or the aquatic environment, often molecular genetic assays are used. For comparison of both methods, knowledge of their correlation is necessary. Therefore the population of total bacteria, Escherichia coli, enterococci and staphylococci during sewage treatment and in receiving river water was compared by agar plating and quantitative polymerase chain reaction (qPCR) assays. In parallel, all samples were investigated for clinically relevant antibiotic resistance genes. Whereas plating and qPCR data for total bacteria correlated well in sewage after primary treatment, qPCR data of river water indicated higher cell numbers for E. coli. It is unknown if these cells are 'only' not growing under standard conditions or if they are dead. Corresponding to the amount of non-culturable cells, the 'breakpoints' for monitoring water quality should be adapted. The abundances of clinically relevant antibiotic resistance genes in river water were in the same order of magnitude or even higher than in treated sewage. For estimation of the health risk it is important to investigate which species carry respective genes and whether these genes are disseminated via gene transfer.
Metallothionein Gene Family in the Sea Urchin Paracentrotus lividus: Gene Structure, Differential Expression and Phylogenetic Analysis

PubMed Central

Ragusa, Maria Antonietta; Nicosia, Aldo; Costa, Salvatore; Cuttitta, Angela; Gianguzza, Fabrizio

2017-01-01

Metallothioneins (MT) are small and cysteine-rich proteins that bind metal ions such as zinc, copper, cadmium, and nickel. In order to shed some light on MT gene structure and evolution, we cloned seven Paracentrotus lividus MT genes, comparing them to Echinodermata and Chordata genes. Moreover, we performed a phylogenetic analysis of 32 MTs from different classes of echinoderms and 13 MTs from the most ancient chordates, highlighting the relationships between them. Since MTs have multiple roles in the cells, we performed RT-qPCR and in situ hybridization experiments to understand better MT functions in sea urchin embryos. Results showed that the expression of MTs is regulated throughout development in a cell type-specific manner and in response to various metals. The MT7 transcript is expressed in all tissues, especially in the stomach and in the intestine of the larva, but it is less metal-responsive. In contrast, MT8 is ectodermic and rises only at relatively high metal doses. MT5 and MT6 expression is highly stimulated by metals in the mesenchyme cells. Our results suggest that the P. lividus MT family originated after the speciation events by gene duplications, evolving developmental and environmental sub-functionalization. PMID:28417916
Quality controls in cellular immunotherapies: rapid assessment of clinical grade dendritic cells by gene expression profiling.

PubMed

Castiello, Luciano; Sabatino, Marianna; Zhao, Yingdong; Tumaini, Barbara; Ren, Jiaqiang; Ping, Jin; Wang, Ena; Wood, Lauren V; Marincola, Francesco M; Puri, Raj K; Stroncek, David F

2013-02-01

Cell-based immunotherapies are among the most promising approaches for developing effective and targeted immune response. However, their clinical usefulness and the evaluation of their efficacy rely heavily on complex quality control assessment. Therefore, rapid systematic methods are urgently needed for the in-depth characterization of relevant factors affecting newly developed cell product consistency and the identification of reliable markers for quality control. Using dendritic cells (DCs) as a model, we present a strategy to comprehensively characterize manufactured cellular products in order to define factors affecting their variability, quality and function. After generating clinical grade human monocyte-derived mature DCs (mDCs), we tested by gene expression profiling the degrees of product consistency related to the manufacturing process and variability due to intra- and interdonor factors, and how each factor affects single gene variation. Then, by calculating for each gene an index of variation we selected candidate markers for identity testing, and defined a set of genes that may be useful comparability and potency markers. Subsequently, we confirmed the observed gene index of variation in a larger clinical data set. In conclusion, using high-throughput technology we developed a method for the characterization of cellular therapies and the discovery of novel candidate quality assurance markers.
Promises and challenges of eco-physiological genomics in the field: tests of drought responses in switchgrass

DOE PAGES

Lovell, John T.; Shakirov, Eugene V.; Schwartz, Scott; ...

2016-05-31

Identifying the physiological and genetic basis of stress tolerance in plants has proven to be critical to understanding adaptation in both agricultural and natural systems. However, many discoveries were initially made in the controlled conditions of greenhouses or laboratories, not in the field. To test the comparability of drought responses across field and greenhouse environments, we undertook three independent experiments using the switchgrass reference genotype Alamo AP13. We analyzed physiological and gene expression variation across four locations, two sampling times, and three years. Relatively similar physiological responses and expression coefficients of variation across experiments masked highly dissimilar gene expression responsesmore » to drought. Critically, a drought experiment utilizing small pots in the greenhouse elicited nearly identical physiological changes as an experiment conducted in the field, but an order of magnitude more differentially expressed genes. However, we were able to define a suite of several hundred genes that were differentially expressed across all experiments. This list was strongly enriched in photosynthesis, water status, and reactive oxygen species responsive genes. The strong across-experiment correlations between physiological plasticity—but not differential gene expression—highlight the complex and diverse genetic mechanisms that can produce phenotypically similar responses to various soil water deficits.« less
System analysis identifies distinct and common functional networks governed by transcription factor ASCL1, in glioma and small cell lung cancer.

PubMed

Donakonda, Sainitin; Sinha, Swati; Dighe, Shrinivas Nivrutti; Rao, Manchanahalli R Satyanarayana

2017-07-25

ASCL1 is a basic Helix-Loop-Helix transcription factor (TF), which is involved in various cellular processes like neuronal development and signaling pathways. Transcriptome profiling has shown that ASCL1 overexpression plays an important role in the development of glioma and Small Cell Lung Carcinoma (SCLC), but distinct and common molecular mechanisms regulated by ASCL1 in these cancers are unknown. In order to understand how it drives the cellular functional network in these two tumors, we generated a gene expression profile in a glioma cell line (U87MG) to identify ASCL1 gene targets by an si RNA silencing approach and then compared this with a publicly available dataset of similarly silenced SCLC (NCI-H1618 cells). We constructed TF-TF and gene-gene interactions, as well as protein interaction networks of ASCL1 regulated genes in glioma and SCLC cells. Detailed network analysis uncovered various biological processes governed by ASCL1 target genes in these two tumor cell lines. We find that novel ASCL1 functions related to mitosis and signaling pathways influencing development and tumor growth are affected in both glioma and SCLC cells. In addition, we also observed ASCL1 governed functional networks that are distinct to glioma and SCLC.
Annotated Gene and Proteome Data Support Recognition of Interconnections Between the Results of Different Experiments in Space Research

NASA Astrophysics Data System (ADS)

Bauer, Johann; Wehland, Markus; Pietsch, Jessica; Sickmann, Albert; Weber, Gerhard; Grimm, Daniela

2016-06-01

In a series of studies, human thyroid and endothelial cells exposed to real or simulated microgravity were analyzed in terms of changes in gene expression patterns or protein content. Due to the limitation of available cells in many space research experiments, comparative and control experiments had to be done in a serial manner. Therefore, detected genes or proteins were annotated with gene names and SwissProt numbers, in order to allow searches for interconnections between results obtained in different experiments by different methods. A crosscheck of several studies on the behavior of cytoskeletal genes and proteins suggested that clusters of cytoskeletal components change differently under the influence of microgravity and/or vibration in different cell types. The result that LOX and ISG15 gene expression were clearly altered during the Shenzhou-8 spaceflight mission could be estimated by comparison with the results of other experiments. The more than 100-fold down-regulation of LOX supports our hypothesis that the amount and stability of extracellular matrix have a great influence on the formation of three-dimensional aggregates under microgravity. The approximately 40-fold up-regulation of ISG15 cannot yet be explained in detail, but strongly suggests that ISGylation, an alternative form of posttranslational modification, plays a role in longterm cultures.
Long-range comparison of human and mouse Sprr loci to identify conserved noncoding sequences involved in coordinate regulation

PubMed Central

Martin, Natalia; Patel, Satyakam; Segre, Julia A.

2004-01-01

Mammalian epidermis provides a permeability barrier between an organism and its environment. Under homeostatic conditions, epidermal cells produce structural proteins, which are cross-linked in an orderly fashion to form a cornified envelope (CE). However, under genetic or environmental stress, specific genes are induced to rapidly build a temporary barrier. Small proline-rich (SPRR) proteins are the primary constituents of the CE. Under stress the entire family of 14 Sprr genes is upregulated. The Sprr genes are clustered within the larger epidermal differentiation complex on mouse chromosome 3, human chromosome 1q21. The clustering of the Sprr genes and their upregulation under stress suggest that these genes may be coordinately regulated. To identify enhancer elements that regulate this stress response activation of the Sprr locus, we utilized bioinformatic tools and classical biochemical dissection. Long-range comparative sequence analysis identified conserved noncoding sequences (CNSs). Clusters of epidermal-specific DNaseI-hypersensitive sites (HSs) mapped to specific CNSs. Increased prevalence of these HSs in barrier-deficient epidermis provides in vivo evidence of the regulation of the Sprr locus by these conserved sequences. Individual components of these HSs were cloned, and one was shown to have strong enhancer activity specific to conditions when the Sprr genes are coordinately upregulated. PMID:15574822

Promises and Challenges of Eco-Physiological Genomics in the Field: Tests of Drought Responses in Switchgrass1[OPEN

PubMed Central

Schwartz, Scott; Lowry, David B.; Aspinwall, Michael J.; Palacio-Mejia, Juan Diego; Hawkes, Christine V.; Fay, Philip A.

2016-01-01

Identifying the physiological and genetic basis of stress tolerance in plants has proven to be critical to understanding adaptation in both agricultural and natural systems. However, many discoveries were initially made in the controlled conditions of greenhouses or laboratories, not in the field. To test the comparability of drought responses across field and greenhouse environments, we undertook three independent experiments using the switchgrass reference genotype Alamo AP13. We analyzed physiological and gene expression variation across four locations, two sampling times, and three years. Relatively similar physiological responses and expression coefficients of variation across experiments masked highly dissimilar gene expression responses to drought. Critically, a drought experiment utilizing small pots in the greenhouse elicited nearly identical physiological changes as an experiment conducted in the field, but an order of magnitude more differentially expressed genes. However, we were able to define a suite of several hundred genes that were differentially expressed across all experiments. This list was strongly enriched in photosynthesis, water status, and reactive oxygen species responsive genes. The strong across-experiment correlations between physiological plasticity—but not differential gene expression—highlight the complex and diverse genetic mechanisms that can produce phenotypically similar responses to various soil water deficits. PMID:27246097
Viral Vectors for Gene Delivery to the Central Nervous System

PubMed Central

Lentz, Thomas B.; Gray, Steven J.; Samulski, R. Jude

2011-01-01

The potential benefits of gene therapy for neurological diseases such as Parkinson’s, Amyotrophic Lateral Sclerosis (ALS), Epilepsy, and Alzheimer’s are enormous. Even a delay in the onset of severe symptoms would be invaluable to patients suffering from these and other diseases. Significant effort has been placed in developing vectors capable of delivering therapeutic genes to the CNS in order to treat neurological disorders. At the forefront of potential vectors, viral systems have evolved to efficiently deliver their genetic material to a cell. The biology of different viruses offers unique solutions to the challenges of gene therapy, such as cell targeting, transgene expression and vector production. It is important to consider the natural biology of a vector when deciding whether it will be the most effective for a specific therapeutic function. In this review, we outline desired features of the ideal vector for gene delivery to the CNS and discuss how well available viral vectors compare to this model. Adeno-associated virus, retrovirus, adenovirus and herpesvirus vectors are covered. Focus is placed on features of the natural biology that have made these viruses effective tools for gene delivery with emphasis on their application in the CNS. Our goal is to provide insight into features of the optimal vector and which viral vectors can provide these features. PMID:22001604
Amino acid transporter expansions associated with the evolution of obligate endosymbiosis in sap-feeding insects (Hemiptera: sternorrhyncha).

PubMed

Dahan, Romain A; Duncan, Rebecca P; Wilson, Alex C C; Dávalos, Liliana M

2015-03-25

Mutualistic obligate endosymbioses shape the evolution of endosymbiont genomes, but their impact on host genomes remains unclear. Insects of the sub-order Sternorrhyncha (Hemiptera) depend on bacterial endosymbionts for essential amino acids present at low abundances in their phloem-based diet. This obligate dependency has been proposed to explain why multiple amino acid transporter genes are maintained in the genomes of the insect hosts. We implemented phylogenetic comparative methods to test whether amino acid transporters have proliferated in sternorrhynchan genomes at rates grater than expected by chance. By applying a series of methods to reconcile gene and species trees, inferring the size of gene families in ancestral lineages, and simulating the null process of birth and death in multi-gene families, we uncovered a 10-fold increase in duplication rate in the AAAP family of amino acid transporters within Sternorrhyncha. This gene family expansion was unmatched in other closely related clades lacking endosymbionts that provide essential amino acids. Our findings support the influence of obligate endosymbioses on host genome evolution by both inferring significant expansions of gene families involved in symbiotic interactions, and discovering increases in the rate of duplication associated with multiple emergences of obligate symbiosis in Sternorrhyncha.
Conserved gene clusters in bacterial genomes provide further support for the primacy of RNA

NASA Technical Reports Server (NTRS)

Siefert, J. L.; Martin, K. A.; Abdi, F.; Widger, W. R.; Fox, G. E.

1997-01-01

Five complete bacterial genome sequences have been released to the scientific community. These include four (eu)Bacteria, Haemophilus influenzae, Mycoplasma genitalium, M. pneumoniae, and Synechocystis PCC 6803, as well as one Archaeon, Methanococcus jannaschii. Features of organization shared by these genomes are likely to have arisen very early in the history of the bacteria and thus can be expected to provide further insight into the nature of early ancestors. Results of a genome comparison of these five organisms confirm earlier observations that gene order is remarkably unpreserved. There are, nevertheless, at least 16 clusters of two or more genes whose order remains the same among the four (eu)Bacteria and these are presumed to reflect conserved elements of coordinated gene expression that require gene proximity. Eight of these gene orders are essentially conserved in the Archaea as well. Many of these clusters are known to be regulated by RNA-level mechanisms in Escherichia coli, which supports the earlier suggestion that this type of regulation of gene expression may have arisen very early. We conclude that although the last common ancestor may have had a DNA genome, it likely was preceded by progenotes with an RNA genome.
Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.).

PubMed

Huis, Rudy; Hawkins, Simon; Neutelings, Godfrey

2010-04-19

Quantitative real-time PCR (qRT-PCR) is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs). Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L). Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs) and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH) as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups.qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59). LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both geNorm-designated- and NormFinder-designated-reference genes. The use of 2 different statistical algorithms results in the identification of different combinations of flax HKGs for expression data normalization. Despite such differences, the use of geNorm-designated- and NormFinder-designated-reference genes enabled us to accurately compare the expression levels of a flax MYB gene in different organs and tissues. Our identification and validation of suitable flax HKGs will facilitate future developmental transcriptomic studies in this economically-important plant.
Factors influencing the removal of antibiotic-resistant bacteria and antibiotic resistance genes by the electrokinetic treatment.

PubMed

Li, Hongna; Li, Binxu; Zhang, Zhiguo; Tian, Yunlong; Ye, Jing; Lv, Xiwu; Zhu, Changxiong

2018-09-30

The performance of the electrokinetic remediation process on the removal of antibiotic-resistant bacteria (ARB) and antibiotic resistance genes (ARGs) was evaluated with different influencing factors. With chlortetracycline (CTC), oxytetracycline (OTC), and tetracycline (TC) as template chemicals, the removal of both ARB and ARGs was enhanced with the increase of voltage gradient (0.4-1.2 V cm -1 ) and prolonged reaction time (3-14 d). The greatest removal (26.01-31.48% for ARB, 37.93-83.10% for ARGs) was obtained applying a voltage of 1.2 V cm -1 , leading to the highest electrical consumption. The effect of polarity reversal intervals on the inactivation ratio of ARB followed the order of 0 h (66.06-80.00%) > 12 h (17.07-24.75%) > 24 h (10.44-13.93%). Lower pH, higher current density, and more evenly-distributed voltage drop was observed with a polarity reversal interval of 12 h compared with that of 24 h, leading to more efficient electrochemical reactions in soil. Compared with sul genes, tet genes were more vulnerable to be attacked in an electric field. It was mainly attributed to the lower abundance of tet genes (except tetM) and the varied effects of electrokinetic remediation process on different ARGs. Moreover, a relatively less removal ratio of tetC and tetG was obtained mainly due to the mechanism of the efflux pump upregulation. Both tet and sul genes were positively correlated with TC-resistant bacteria. The efflux pump genes like tetG and the cellular protection genes like tetM showed different correlations with ARB. This study enhances the current understanding on the removal strategies of ARB and ARGs, and it provides important parameters for their destruction by the electrokinetic treatment. Copyright © 2018 Elsevier Inc. All rights reserved.
Tumor SHB gene expression affects disease characteristics in human acute myeloid leukemia.

PubMed

Jamalpour, Maria; Li, Xiujuan; Cavelier, Lucia; Gustafsson, Karin; Mostoslavsky, Gustavo; Höglund, Martin; Welsh, Michael

2017-10-01

The mouse Shb gene coding for the Src Homology 2-domain containing adapter protein B has recently been placed in context of BCRABL1-induced myeloid leukemia in mice and the current study was performed in order to relate SHB to human acute myeloid leukemia (AML). Publicly available AML databases were mined for SHB gene expression and patient survival. SHB gene expression was determined in the Uppsala cohort of AML patients by qPCR. Cell proliferation was determined after SHB gene knockdown in leukemic cell lines. Despite a low frequency of SHB gene mutations, many tumors overexpressed SHB mRNA compared with normal myeloid blood cells. AML patients with tumors expressing low SHB mRNA displayed longer survival times. A subgroup of AML exhibiting a favorable prognosis, acute promyelocytic leukemia (APL) with a PMLRARA translocation, expressed less SHB mRNA than AML tumors in general. When examining genes co-expressed with SHB in AML tumors, four other genes ( PAX5, HDAC7, BCORL1, TET1) related to leukemia were identified. A network consisting of these genes plus SHB was identified that relates to certain phenotypic characteristics, such as immune cell, vascular and apoptotic features. SHB knockdown in the APL PMLRARA cell line NB4 and the monocyte/macrophage cell line MM6 adversely affected proliferation, linking SHB gene expression to tumor cell expansion and consequently to patient survival. It is concluded that tumor SHB gene expression relates to AML survival and its subgroup APL. Moreover, this gene is included in a network of genes that plays a role for an AML phenotype exhibiting certain immune cell, vascular and apoptotic characteristics.
Global Expression Profiling in Atopic Eczema Reveals Reciprocal Expression of Inflammatory and Lipid Genes

PubMed Central

Sääf, Annika M.; Tengvall-Linder, Maria; Chang, Howard Y.; Adler, Adam S.; Wahlgren, Carl-Fredrik; Scheynius, Annika; Nordenskjöld, Magnus; Bradley, Maria

2008-01-01

Background Atopic eczema (AE) is a common chronic inflammatory skin disorder. In order to dissect the genetic background several linkage and genetic association studies have been performed. Yet very little is known about specific genes involved in this complex skin disease, and the underlying molecular mechanisms are not fully understood. Methodology/Findings We used human DNA microarrays to identify a molecular picture of the programmed responses of the human genome to AE. The transcriptional program was analyzed in skin biopsy samples from lesional and patch-tested skin from AE patients sensitized to Malassezia sympodialis (M. sympodialis), and corresponding biopsies from healthy individuals. The most notable feature of the global gene-expression pattern observed in AE skin was a reciprocal expression of induced inflammatory genes and repressed lipid metabolism genes. The overall transcriptional response in M. sympodialis patch-tested AE skin was similar to the gene-expression signature identified in lesional AE skin. In the constellation of genes differentially expressed in AE skin compared to healthy control skin, we have identified several potential susceptibility genes that may play a critical role in the pathological condition of AE. Many of these genes, including genes with a role in immune responses, lipid homeostasis, and epidermal differentiation, are localized on chromosomal regions previously linked to AE. Conclusions/Significance Through genome-wide expression profiling, we were able to discover a distinct reciprocal expression pattern of induced inflammatory genes and repressed lipid metabolism genes in skin from AE patients. We found a significant enrichment of differentially expressed genes in AE with cytobands associated to the disease, and furthermore new chromosomal regions were found that could potentially guide future region-specific linkage mapping in AE. The full data set is available at http://microarray-pubs.stanford.edu/eczema. PMID:19107207
Microarray-based gene expression analysis of strong seed dormancy in rice cv. N22 and less dormant mutant derivatives.

PubMed

Wu, Tao; Yang, Chunyan; Ding, Baoxu; Feng, Zhiming; Wang, Qian; He, Jun; Tong, Jianhua; Xiao, Langtao; Jiang, Ling; Wan, Jianmin

2016-02-01

Seed dormancy in rice is an important trait related to the pre-harvest sprouting resistance. In order to understand the molecular mechanisms of seed dormancy, gene expression was investigated by transcriptome analysis using seeds of the strongly dormant cultivar N22 and its less dormant mutants Q4359 and Q4646 at 24 days after heading (DAH). Microarray data revealed more differentially expressed genes in Q4359 than in Q4646 compared to N22. Most genes differing between Q4646 and N22 also differed between Q4359 and N22. GO analysis of genes differentially expressed in both Q4359 and Q4646 revealed that some genes such as those for starch biosynthesis were repressed, whereas metabolic genes such as those for carbohydrate metabolism were enhanced in Q4359 and Q4646 seeds relative to N22. Expression of some genes involved in cell redox homeostasis and chromatin remodeling differed significantly only between Q4359 and N22. The results suggested a close correlation between cell redox homeostasis, chromatin remodeling and seed dormancy. In addition, some genes involved in ABA signaling were down-regulated, and several genes involved in GA biosynthesis and signaling were up-regulated. These observations suggest that reduced seed dormancy in Q4359 was regulated by ABA-GA antagonism. A few differentially expressed genes were located in the regions containing qSdn-1 and qSdn-5 suggesting that they could be candidate genes underlying seed dormancy. Our work provides useful leads to further determine the underling mechanisms of seed dormancy and for cloning seed dormancy genes from N22. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Transcription of the herpes simplex virus 1 genome during productive and quiescent infection of neuronal and nonneuronal cells.

PubMed

Harkness, Justine M; Kader, Muhamuda; DeLuca, Neal A

2014-06-01

Herpes simplex virus 1 (HSV-1) can undergo a productive infection in nonneuronal and neuronal cells such that the genes of the virus are transcribed in an ordered cascade. HSV-1 can also establish a more quiescent or latent infection in peripheral neurons, where gene expression is substantially reduced relative to that in productive infection. HSV mutants defective in multiple immediate early (IE) gene functions are highly defective for later gene expression and model some aspects of latency in vivo. We compared the expression of wild-type (wt) virus and IE gene mutants in nonneuronal cells (MRC5) and adult murine trigeminal ganglion (TG) neurons using the Illumina platform for cDNA sequencing (RNA-seq). RNA-seq analysis of wild-type virus revealed that expression of the genome mostly followed the previously established kinetics, validating the method, while highlighting variations in gene expression within individual kinetic classes. The accumulation of immediate early transcripts differed between MRC5 cells and neurons, with a greater abundance in neurons. Analysis of a mutant defective in all five IE genes (d109) showed dysregulated genome-wide low-level transcription that was more highly attenuated in MRC5 cells than in TG neurons. Furthermore, a subset of genes in d109 was more abundantly expressed over time in neurons. While the majority of the viral genome became relatively quiescent, the latency-associated transcript was specifically upregulated. Unexpectedly, other genes within repeat regions of the genome, as well as the unique genes just adjacent the repeat regions, also remained relatively active in neurons. The relative permissiveness of TG neurons to viral gene expression near the joint region is likely significant during the establishment and reactivation of latency. During productive infection, the genes of HSV-1 are transcribed in an ordered cascade. HSV can also establish a more quiescent or latent infection in peripheral neurons. HSV mutants defective in multiple immediate early (IE) genes establish a quiescent infection that models aspects of latency in vivo. We simultaneously quantified the expression of all the HSV genes in nonneuronal and neuronal cells by RNA-seq analysis. The results for productive infection shed further light on the nature of genes and promoters of different kinetic classes. In quiescent infection, there was greater transcription across the genome in neurons than in nonneuronal cells. In particular, the transcription of the latency-associated transcript (LAT), IE genes, and genes in the unique regions adjacent to the repeats persisted in neurons. The relative activity of this region of the genome in the absence of viral activators suggests a more dynamic state for quiescent genomes persisting in neurons. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
An Approximation to the Temporal Order in Endogenous Circadian Rhythms of Genes Implicated in Human Adipose Tissue Metabolism

PubMed Central

GARAULET, MARTA; ORDOVÁS, JOSÉ M.; GÓMEZ-ABELLÁN, PURIFICACIÓN; MARTÍNEZ, JOSE A.; MADRID, JUAN A.

2015-01-01

Although it is well established that human adipose tissue (AT) shows circadian rhythmicity, published studies have been discussed as if tissues or systems showed only one or few circadian rhythms at a time. To provide an overall view of the internal temporal order of circadian rhythms in human AT including genes implicated in metabolic processes such as energy intake and expenditure, insulin resistance, adipocyte differentiation, dyslipidemia, and body fat distribution. Visceral and subcutaneous abdominal AT biopsies (n = 6) were obtained from morbid obese women (BMI ≥ 40 kg/m2). To investigate rhythmic expression pattern, AT explants were cultured during 24-h and gene expression was analyzed at the following times: 08:00, 14:00, 20:00, 02:00 h using quantitative real-time PCR. Clock genes, glucocorticoid metabolism-related genes, leptin, adiponectin and their receptors were studied. Significant differences were found both in achrophases and relative-amplitude among genes (P <0.05). Amplitude of most genes rhythms was high (>30%). When interpreting the phase map of gene expression in both depots, data indicated that circadian rhythmicity of the genes studied followed a predictable physiological pattern, particularly for subcutaneous AT. Interesting are the relationships between adiponectin, leptin, and glucocorticoid metabolism-related genes circadian profiles. Their metabolic significance is discussed. Visceral AT behaved in a different way than subcutaneous for most of the genes studied. For every gene, protein mRNA levels fluctuated during the day in synchrony with its receptors. We have provided an overall view of the internal temporal order of circadian rhythms in human adipose tissue. PMID:21520059
A limited innate immune response is induced by a replication-defective herpes simplex virus vector following delivery to the murine central nervous system

PubMed Central

Zeier, Zane; Aguilar, J Santiago; Lopez, Cecilia M; Devi-Rao, G B; Watson, Zachary L; Baker, Henry V; Wagner, Edward K; Bloom, David C

2010-01-01

Herpes simplex virus type 1 (HSV-1)–based vectors readily transduce neurons and have a large payload capacity, making them particularly amenable to gene therapy applications within the central nervous system (CNS). Because aspects of the host responses to HSV-1 vectors in the CNS are largely unknown, we compared the host response of a nonreplicating HSV-1 vector to that of a replication-competent HSV-1 virus using microarray analysis. In parallel, HSV-1 gene expression was tracked using HSV-specific oligonucleotide-based arrays in order to correlate viral gene expression with observed changes in host response. Microarray analysis was performed following stereotactic injection into the right hippocampal formation of mice with either a replication-competent HSV-1 or a nonreplicating recombinant of HSV-1, lacking the ICP4 gene (ICP4−). Genes that demonstrated a significant change (P < .001) in expression in response to the replicating HSV-1 outnumbered those that changed in response to mock or nonreplicating vector by approximately 3-fold. Pathway analysis revealed that both the replicating and nonreplicating vectors induced robust antigen presentation but only mild interferon, chemokine, and cytokine signaling responses. The ICP4− vector was restricted in several of the Toll-like receptor-signaling pathways, indicating reduced stimulation of the innate immune response. These array analyses suggest that although the nonreplicating vector induces detectable activation of immune response pathways, the number and magnitude of the induced response is dramatically restricted compared to the replicating vector, and with the exception of antigen presentation, host gene expression induced by the non-replicating vector largely resembles mock infection. PMID:20095947
In silico pathway analysis in cervical carcinoma reveals potential new targets for treatment

PubMed Central

van Dam, Peter A.; van Dam, Pieter-Jan H. H.; Rolfo, Christian; Giallombardo, Marco; van Berckelaer, Christophe; Trinh, Xuan Bich; Altintas, Sevilay; Huizing, Manon; Papadimitriou, Kostas; Tjalma, Wiebren A. A.; van Laere, Steven

2016-01-01

An in silico pathway analysis was performed in order to improve current knowledge on the molecular drivers of cervical cancer and detect potential targets for treatment. Three publicly available Affymetrix gene expression data-sets (GSE5787, GSE7803, GSE9750) were retrieved, vouching for a total of 9 cervical cancer cell lines (CCCLs), 39 normal cervical samples, 7 CIN3 samples and 111 cervical cancer samples (CCSs). Predication analysis of microarrays was performed in the Affymetrix sets to identify cervical cancer biomarkers. To select cancer cell-specific genes the CCSs were compared to the CCCLs. Validated genes were submitted to a gene set enrichment analysis (GSEA) and Expression2Kinases (E2K). In the CCSs a total of 1,547 probe sets were identified that were overexpressed (FDR < 0.1). Comparing to CCCLs 560 probe sets (481 unique genes) had a cancer cell-specific expression profile, and 315 of these genes (65%) were validated. GSEA identified 5 cancer hallmarks enriched in CCSs (P < 0.01 and FDR < 0.25) showing that deregulation of the cell cycle is a major component of cervical cancer biology. E2K identified a protein-protein interaction (PPI) network of 162 nodes (including 20 drugable kinases) and 1626 edges. This PPI-network consists of 5 signaling modules associated with MYC signaling (Module 1), cell cycle deregulation (Module 2), TGFβ-signaling (Module 3), MAPK signaling (Module 4) and chromatin modeling (Module 5). Potential targets for treatment which could be identified were CDK1, CDK2, ABL1, ATM, AKT1, MAPK1, MAPK3 among others. The present study identified important driver pathways in cervical carcinogenesis which should be assessed for their potential therapeutic drugability. PMID:26701206
HYPOTHESIS SETTING AND ORDER STATISTIC FOR ROBUST GENOMIC META-ANALYSIS.

PubMed

Song, Chi; Tseng, George C

2014-01-01

Meta-analysis techniques have been widely developed and applied in genomic applications, especially for combining multiple transcriptomic studies. In this paper, we propose an order statistic of p-values ( r th ordered p-value, rOP) across combined studies as the test statistic. We illustrate different hypothesis settings that detect gene markers differentially expressed (DE) "in all studies", "in the majority of studies", or "in one or more studies", and specify rOP as a suitable method for detecting DE genes "in the majority of studies". We develop methods to estimate the parameter r in rOP for real applications. Statistical properties such as its asymptotic behavior and a one-sided testing correction for detecting markers of concordant expression changes are explored. Power calculation and simulation show better performance of rOP compared to classical Fisher's method, Stouffer's method, minimum p-value method and maximum p-value method under the focused hypothesis setting. Theoretically, rOP is found connected to the naïve vote counting method and can be viewed as a generalized form of vote counting with better statistical properties. The method is applied to three microarray meta-analysis examples including major depressive disorder, brain cancer and diabetes. The results demonstrate rOP as a more generalizable, robust and sensitive statistical framework to detect disease-related markers.
An information-gain approach to detecting three-way epistatic interactions in genetic association studies

PubMed Central

Hu, Ting; Chen, Yuanzhu; Kiralis, Jeff W; Collins, Ryan L; Wejse, Christian; Sirugo, Giorgio; Williams, Scott M; Moore, Jason H

2013-01-01

Background Epistasis has been historically used to describe the phenomenon that the effect of a given gene on a phenotype can be dependent on one or more other genes, and is an essential element for understanding the association between genetic and phenotypic variations. Quantifying epistasis of orders higher than two is very challenging due to both the computational complexity of enumerating all possible combinations in genome-wide data and the lack of efficient and effective methodologies. Objectives In this study, we propose a fast, non-parametric, and model-free measure for three-way epistasis. Methods Such a measure is based on information gain, and is able to separate all lower order effects from pure three-way epistasis. Results Our method was verified on synthetic data and applied to real data from a candidate-gene study of tuberculosis in a West African population. In the tuberculosis data, we found a statistically significant pure three-way epistatic interaction effect that was stronger than any lower-order associations. Conclusion Our study provides a methodological basis for detecting and characterizing high-order gene-gene interactions in genetic association studies. PMID:23396514
Microsporidian polar tube proteins: highly divergent but closely linked genes encode PTP1 and PTP2 in members of the evolutionarily distant Antonospora and Encephalitozoon groups.

PubMed

Polonais, Valérie; Prensier, Gérard; Méténier, Guy; Vivarès, Christian P; Delbac, Frédéric

2005-09-01

The spore polar tube is a unique organelle required for cell invasion by fungi-related microsporidian parasites. Two major polar tube proteins (PTP1 and PTP2) are encoded by two tandemly arranged genes in Encephalitozoon species. A look at Antonospora (Nosema) locustae contigs (http://jbpc.mbl.edu/Nosema/Contigs/) revealed significant conservation in the order and orientation of various genes, despite high sequence divergence features, when comparing with Encephalitozoon cuniculi complete genome. This syntenic relationship between distantly related Encephalitozoon and Antonospora genera has been successfully exploited to identify ptp1 and ptp2 genes in two insect-infecting species assigned to the Antonospora clade (A. locustae and Paranosema grylli). Targeting of respective proteins to the polar tube was demonstrated through immunolocalization experiments with antibodies raised against recombinant proteins. Both PTPs were extracted from spores with 100mM dithiothreitol. Evidence for PTP1 mannosylation was obtained in studied species, supporting a key role of PTP1 in interactions with host cell surface.
Spina Bifida: Pathogenesis, Mechanisms, and Genes in Mice and Humans

PubMed Central

Abou Chaar, Mohamad K.; Ahmad-Annuar, Azlina

2017-01-01

Spina bifida is among the phenotypes of the larger condition known as neural tube defects (NTDs). It is the most common central nervous system malformation compatible with life and the second leading cause of birth defects after congenital heart defects. In this review paper, we define spina bifida and discuss the phenotypes seen in humans as described by both surgeons and embryologists in order to compare and ultimately contrast it to the leading animal model, the mouse. Our understanding of spina bifida is currently limited to the observations we make in mouse models, which reflect complete or targeted knockouts of genes, which perturb the whole gene(s) without taking into account the issue of haploinsufficiency, which is most prominent in the human spina bifida condition. We thus conclude that the need to study spina bifida in all its forms, both aperta and occulta, is more indicative of the spina bifida in surviving humans and that the measure of deterioration arising from caudal neural tube defects, more commonly known as spina bifida, must be determined by the level of the lesion both in mouse and in man. PMID:28286691
Oligonucleotide microarray analysis reveals dysregulation of energy-related metabolism in insulin-sensitive tissues of type 2 diabetes patients.

PubMed

Wang, M; Wang, X C; Zhao, L; Zhang, Y; Yao, L L; Lin, Y; Peng, Y D; Hu, R M

2014-06-17

Impaired insulin action within skeletal muscle, adipose tissue, and the liver is an important characteristic of type 2 diabetes (T2D). In order to identify common underlying defects in insulin-sensitive tissues that may be involved in the pathogenesis of T2D, the gene expression profiles of skeletal muscle, visceral adipose tissue, and liver from autopsy donors with or without T2D were examined using oligonucleotide microarrays and quantitative reverse transcriptase-PCR. Compared with controls, 691 genes were commonly dysregulated in these three insulin-sensitive tissues of humans with T2D. These co-expressed genes were enriched within the mitochondrion, with suggested involvement in energy metabolic processes such as glycolysis and gluconeogenesis, fatty acid beta oxidative, tricarboxylic acid cycle, and electron transport. Genes related to energy metabolism were mostly downregulated in diabetic skeletal muscle and visceral adipose tissue, while they were upregulated in the diabetic liver. This observed dysregulation in energy-related metabolism may be the underlying factor leading to the molecular mechanisms responsible for the insulin resistance of patients with T2D.
Comparative genomics reveals convergent evolution between the bamboo-eating giant and red pandas.

PubMed

Hu, Yibo; Wu, Qi; Ma, Shuai; Ma, Tianxiao; Shan, Lei; Wang, Xiao; Nie, Yonggang; Ning, Zemin; Yan, Li; Xiu, Yunfang; Wei, Fuwen

2017-01-31

Phenotypic convergence between distantly related taxa often mirrors adaptation to similar selective pressures and may be driven by genetic convergence. The giant panda (Ailuropoda melanoleuca) and red panda (Ailurus fulgens) belong to different families in the order Carnivora, but both have evolved a specialized bamboo diet and adaptive pseudothumb, representing a classic model of convergent evolution. However, the genetic bases of these morphological and physiological convergences remain unknown. Through de novo sequencing the red panda genome and improving the giant panda genome assembly with added data, we identified genomic signatures of convergent evolution. Limb development genes DYNC2H1 and PCNT have undergone adaptive convergence and may be important candidate genes for pseudothumb development. As evolutionary responses to a bamboo diet, adaptive convergence has occurred in genes involved in the digestion and utilization of bamboo nutrients such as essential amino acids, fatty acids, and vitamins. Similarly, the umami taste receptor gene TAS1R1 has been pseudogenized in both pandas. These findings offer insights into genetic convergence mechanisms underlying phenotypic convergence and adaptation to a specialized bamboo diet.
Phylogenetic analysis of DNA and RNA polymerases from a Moniliophthora perniciosa mitochondrial plasmid reveals probable lateral gene transfer.

PubMed

Andrade, B S; Góes-Neto, A

2015-10-30

The filamentous fungus Moniliophthora perniciosa is a hemibiotrophic basidiomycete that causes witches' broom disease of cacao (Theobroma cacao L.). Many fungal mitochondrial plasmids are DNA and RNA polymerase-encoding invertrons with terminal inverted repeats and 5'-linked proteins. The aim of this study was to carry out comparative and phylogenetic analyses of DNA and RNA polymerases for all known linear mitochondrial plasmids in fungi. We performed these analyses at both gene and protein levels and assessed differences between fungal and viral polymerases in order to test the lateral gene transfer (LGT) hypothesis. We analyzed all mitochondrial plasmids of the invertron type within the fungal clade, including five from Ascomycota, seven from Basidiomycota, and one from Chytridiomycota. All phylogenetic analyses generated similar tree topologies regardless of the methods and datasets used. It is likely that DNA and RNA polymerase genes were inserted into the mitochondrial genomes of the 13 fungal species examined in our study as a result of different LGT events. These findings are important for a better understanding of the evolutionary relationships between fungal mitochondrial plasmids.

Comparative genomics reveals convergent evolution between the bamboo-eating giant and red pandas

PubMed Central

Hu, Yibo; Wu, Qi; Ma, Shuai; Ma, Tianxiao; Shan, Lei; Wang, Xiao; Nie, Yonggang; Ning, Zemin; Yan, Li; Xiu, Yunfang; Wei, Fuwen

2017-01-01

Phenotypic convergence between distantly related taxa often mirrors adaptation to similar selective pressures and may be driven by genetic convergence. The giant panda (Ailuropoda melanoleuca) and red panda (Ailurus fulgens) belong to different families in the order Carnivora, but both have evolved a specialized bamboo diet and adaptive pseudothumb, representing a classic model of convergent evolution. However, the genetic bases of these morphological and physiological convergences remain unknown. Through de novo sequencing the red panda genome and improving the giant panda genome assembly with added data, we identified genomic signatures of convergent evolution. Limb development genes DYNC2H1 and PCNT have undergone adaptive convergence and may be important candidate genes for pseudothumb development. As evolutionary responses to a bamboo diet, adaptive convergence has occurred in genes involved in the digestion and utilization of bamboo nutrients such as essential amino acids, fatty acids, and vitamins. Similarly, the umami taste receptor gene TAS1R1 has been pseudogenized in both pandas. These findings offer insights into genetic convergence mechanisms underlying phenotypic convergence and adaptation to a specialized bamboo diet. PMID:28096377
Analysis of high-throughput biological data using their rank values.

PubMed

Dembélé, Doulaye

2018-01-01

High-throughput biological technologies are routinely used to generate gene expression profiling or cytogenetics data. To achieve high performance, methods available in the literature become more specialized and often require high computational resources. Here, we propose a new versatile method based on the data-ordering rank values. We use linear algebra, the Perron-Frobenius theorem and also extend a method presented earlier for searching differentially expressed genes for the detection of recurrent copy number aberration. A result derived from the proposed method is a one-sample Student's t-test based on rank values. The proposed method is to our knowledge the only that applies to gene expression profiling and to cytogenetics data sets. This new method is fast, deterministic, and requires a low computational load. Probabilities are associated with genes to allow a statistically significant subset selection in the data set. Stability scores are also introduced as quality parameters. The performance and comparative analyses were carried out using real data sets. The proposed method can be accessed through an R package available from the CRAN (Comprehensive R Archive Network) website: https://cran.r-project.org/web/packages/fcros .
Phylogenetic Analysis of Aedes aegypti Based on Mitochondrial ND4 Gene Sequences in Almadinah, Saudi Arabia.

PubMed

Ali, Khalil H Al; El-Badry, Ayman A; Ali, Mouhanad Al; El-Sayed, Wael S M; El-Beshbishy, Hesham A

2016-06-01

Aedes aegypti is the main vector of the yellow fever and dengue virus. This mosquito has become the major indirect cause of morbidity and mortality of the human worldwide. Dengue virus activity has been reported recently in the western areas of Saudi Arabia. There is no vaccine for dengue virus until now, and the control of the disease depends on the control of the vector. The present study has aimed to perform phylogenetic analysis of Aedes aegypti based on mitochondrial NADH dehydrogenase subunit 4 ( ND4 ) gene at Almadinah, Saudi Arabia in order to get further insight into the epidemiology and transmission of this vector. Mitochondrial ND4 gene was sequenced in the eight isolated Aedes aegypti mosquitoes from Almadinah, Saudi Arabia, sequences were aligned, and phylogenetic analysis were performed and compared with 54 sequences of Aedes reported in the previous studies from Mexico, Thailand, Brazil, and Africa. Our results suggest that increased gene flow among Aedes aegypti populations occurs between Africa and Saudi Arabia. Phylogenetic relationship analysis showed two genetically distinct Aedes aegypti in Saudi Arabia derived from dual African ancestor.
Prediction of cancer class with majority voting genetic programming classifier using gene expression data.

PubMed

Paul, Topon Kumar; Iba, Hitoshi

2009-01-01

In order to get a better understanding of different types of cancers and to find the possible biomarkers for diseases, recently, many researchers are analyzing the gene expression data using various machine learning techniques. However, due to a very small number of training samples compared to the huge number of genes and class imbalance, most of these methods suffer from overfitting. In this paper, we present a majority voting genetic programming classifier (MVGPC) for the classification of microarray data. Instead of a single rule or a single set of rules, we evolve multiple rules with genetic programming (GP) and then apply those rules to test samples to determine their labels with majority voting technique. By performing experiments on four different public cancer data sets, including multiclass data sets, we have found that the test accuracies of MVGPC are better than those of other methods, including AdaBoost with GP. Moreover, some of the more frequently occurring genes in the classification rules are known to be associated with the types of cancers being studied in this paper.
[Progress in transgenic fish techniques and application].

PubMed

Ye, Xing; Tian, Yuan-Yuan; Gao, Feng-Ying

2011-05-01

Transgenic technique provides a new way for fish breeding. Stable lines of growth hormone gene transfer carps, salmon and tilapia, as well as fluorescence protein gene transfer zebra fish and white cloud mountain minnow have been produced. The fast growth characteristic of GH gene transgenic fish will be of great importance to promote aquaculture production and economic efficiency. This paper summarized the progress in transgenic fish research and ecological assessments. Microinjection is still the most common used method, but often resulted in multi-site and multi-copies integration. Co-injection of transposon or meganuclease will greatly improve the efficiency of gene transfer and integration. "All fish" gene or "auto gene" should be considered to produce transgenic fish in order to eliminate misgiving on food safety and to benefit expression of the transferred gene. Environmental risk is the biggest obstacle for transgenic fish to be commercially applied. Data indicates that transgenic fish have inferior fitness compared with the traditional domestic fish. However, be-cause of the genotype-by-environment effects, it is difficult to extrapolate simple phenotypes to the complex ecological interactions that occur in nature based on the ecological consequences of the transgenic fish determined in the laboratory. It is critical to establish highly naturalized environments for acquiring reliable data that can be used to evaluate the environ-mental risk. Efficacious physical and biological containment strategies remain to be crucial approaches to ensure the safe application of transgenic fish technology.
Comparison of gene expression profiles between pansensitive and multidrug-resistant strains of Mycobacterium tuberculosis.

PubMed

Peñuelas-Urquides, K; González-Escalante, L; Villarreal-Treviño, L; Silva-Ramírez, B; Gutiérrez-Fuentes, D J; Mojica-Espinosa, R; Rangel-Escareño, C; Uribe-Figueroa, L; Molina-Salinas, G M; Dávila-Velderrain, J; Castorena-Torres, F; Bermúdez de León, M; Said-Fernández, S

2013-09-01

Mycobacterium tuberculosis has developed resistance to anti-tuberculosis first-line drugs. Multidrug-resistant strains complicate the control of tuberculosis and have converted it into a worldwide public health problem. Mutational studies of target genes have tried to envisage the resistance in clinical isolates; however, detection of these mutations in some cases is not sufficient to identify drug resistance, suggesting that other mechanisms are involved. Therefore, the identification of new markers of susceptibility or resistance to first-line drugs could contribute (1) to specifically diagnose the type of M. tuberculosis strain and prescribe an appropriate therapy, and (2) to elucidate the mechanisms of resistance in multidrug-resistant strains. In order to identify specific genes related to resistance in M. tuberculosis, we compared the gene expression profiles between the pansensitive H37Rv strain and a clinical CIBIN:UMF:15:99 multidrug-resistant isolate using microarray analysis. Quantitative real-time PCR confirmed that in the clinical multidrug-resistant isolate, the esxG, esxH, rpsA, esxI, and rpmI genes were upregulated, while the lipF, groES, and narG genes were downregulated. The modified genes could be involved in the mechanisms of resistance to first-line drugs in M. tuberculosis and could contribute to increased efficiency in molecular diagnosis approaches of infections with drug-resistant strains.
Classification of a large microarray data set: Algorithm comparison and analysis of drug signatures

PubMed Central

Natsoulis, Georges; El Ghaoui, Laurent; Lanckriet, Gert R.G.; Tolley, Alexander M.; Leroy, Fabrice; Dunlea, Shane; Eynon, Barrett P.; Pearson, Cecelia I.; Tugendreich, Stuart; Jarnagin, Kurt

2005-01-01

A large gene expression database has been produced that characterizes the gene expression and physiological effects of hundreds of approved and withdrawn drugs, toxicants, and biochemical standards in various organs of live rats. In order to derive useful biological knowledge from this large database, a variety of supervised classification algorithms were compared using a 597-microarray subset of the data. Our studies show that several types of linear classifiers based on Support Vector Machines (SVMs) and Logistic Regression can be used to derive readily interpretable drug signatures with high classification performance. Both methods can be tuned to produce classifiers of drug treatments in the form of short, weighted gene lists which upon analysis reveal that some of the signature genes have a positive contribution (act as “rewards” for the class-of-interest) while others have a negative contribution (act as “penalties”) to the classification decision. The combination of reward and penalty genes enhances performance by keeping the number of false positive treatments low. The results of these algorithms are combined with feature selection techniques that further reduce the length of the drug signatures, an important step towards the development of useful diagnostic biomarkers and low-cost assays. Multiple signatures with no genes in common can be generated for the same classification end-point. Comparison of these gene lists identifies biological processes characteristic of a given class. PMID:15867433
A comparison of complete mitochondrial genomes of silver carp hypophthalmichthys molitrix and bighead carp hypophthalmichthys nobilis: Implications for their taxonomic relationship and phylogeny

USGS Publications Warehouse

Li, S.-F.; Xu, J.-W.; Yang, Q.-L.; Wang, C.H.; Chen, Q.; Chapman, D.C.; Lu, G.

2009-01-01

Based upon morphological characters, Silver carp Hypophthalmichthys molitrix and bighead carp Hypophthalmichthys nobilis (or Aristichthys nobilis) have been classified into either the same genus or two distinct genera. Consequently, the taxonomic relationship of the two species at the generic level remains equivocal. This issue is addressed by sequencing complete mitochondrial genomes of H. molitrix and H. nobilis, comparing their mitogenome organization, structure and sequence similarity, and conducting a comprehensive phylogenetic analysis of cyprinid species. As with other cyprinid fishes, the mitogenomes of the two species were structurally conserved, containing 37 genes including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA (tRNAs) genes and a putative control region (D-loop). Sequence similarity between the two mitogenomes varied in different genes or regions, being highest in the tRNA genes (98??8%), lowest in the control region (89??4%) and intermediate in the protein-coding genes (94??2%). Analyses of the sequence comparison and phylogeny using concatenated protein sequences support the view that the two species belong to the genus Hypophthalmichthys. Further studies using nuclear markers and involving more closely related species, and the systematic combination of traditional biology and molecular biology are needed in order to confirm this conclusion. ?? 2009 The Fisheries Society of the British Isles.
Identifying biologically relevant putative mechanisms in a given phenotype comparison

PubMed Central

Hanoudi, Samer; Donato, Michele; Draghici, Sorin

2017-01-01

A major challenge in life science research is understanding the mechanism involved in a given phenotype. The ability to identify the correct mechanisms is needed in order to understand fundamental and very important phenomena such as mechanisms of disease, immune systems responses to various challenges, and mechanisms of drug action. The current data analysis methods focus on the identification of the differentially expressed (DE) genes using their fold change and/or p-values. Major shortcomings of this approach are that: i) it does not consider the interactions between genes; ii) its results are sensitive to the selection of the threshold(s) used, and iii) the set of genes produced by this approach is not always conducive to formulating mechanistic hypotheses. Here we present a method that can construct networks of genes that can be considered putative mechanisms. The putative mechanisms constructed by this approach are not limited to the set of DE genes, but also considers all known and relevant gene-gene interactions. We analyzed three real datasets for which both the causes of the phenotype, as well as the true mechanisms were known. We show that the method identified the correct mechanisms when applied on microarray datasets from mouse. We compared the results of our method with the results of the classical approach, showing that our method produces more meaningful biological insights. PMID:28486531
Transcriptome Analysis of Early Responsive Genes in Rice during Magnaporthe oryzae Infection.

PubMed

Wang, Yiming; Kwon, Soon Jae; Wu, Jingni; Choi, Jaeyoung; Lee, Yong-Hwan; Agrawal, Ganesh Kumar; Tamogami, Shigeru; Rakwal, Randeep; Park, Sang-Ryeol; Kim, Beom-Gi; Jung, Ki-Hong; Kang, Kyu Young; Kim, Sang Gon; Kim, Sun Tae

2014-12-01

Rice blast disease caused by Magnaporthe oryzae is one of the most serious diseases of cultivated rice (Oryza sativa L.) in most rice-growing regions of the world. In order to investigate early response genes in rice, we utilized the transcriptome analysis approach using a 300 K tilling microarray to rice leaves infected with compatible and incompatible M. oryzae strains. Prior to the microarray experiment, total RNA was validated by measuring the differential expression of rice defense-related marker genes (chitinase 2, barwin, PBZ1, and PR-10) by RT-PCR, and phytoalexins (sakuranetin and momilactone A) with HPLC. Microarray analysis revealed that 231 genes were up-regulated (>2 fold change, p < 0.05) in the incompatible interaction compared to the compatible one. Highly expressed genes were functionally characterized into metabolic processes and oxidation-reduction categories. The oxidative stress response was induced in both early and later infection stages. Biotic stress overview from MapMan analysis revealed that the phytohormone ethylene as well as signaling molecules jasmonic acid and salicylic acid is important for defense gene regulation. WRKY and Myb transcription factors were also involved in signal transduction processes. Additionally, receptor-like kinases were more likely associated with the defense response, and their expression patterns were validated by RT-PCR. Our results suggest that candidate genes, including receptor-like protein kinases, may play a key role in disease resistance against M. oryzae attack.
Drosophila mitochondrial DNA: a novel gene order.

PubMed Central

Clary, D O; Goddard, J M; Martin, S C; Fauron, C M; Wolstenholme, D R

1982-01-01

Part of the replication origin-containing A+T-rich region of the Drosophila yakuba mtDNA molecule and segments on either side of this region have been sequenced, and the genes within them identified. The data confirm that the small and large rRNA genes lie in tandem adjacent to that side of the A+T-rich region which is replicated first, and establish that a tRNAval gene lies between the two rRNA genes and that URF1 follows the large rRNA gene. The data further establish that the genes for tRNAile, tRNAgln, tRNAf-met and URF2 lie in the order given, on the opposite side of the A+T-rich region to the rRNA genes and, except for tRNAgln, are contained in the opposite strand to the rRNA, tRNAval and URF1 genes. This is in contrast to mammalian mtDNAs where all of these genes are located on the side of the replication origin which is replicated last, within the order tRNAphe, small (12S) rRNA, tRNAval, large (16S) rRNA, tRNAleu, URF1, tRNAile, tRNAgln, tRNAf-met and URF2, and, except tRNAgln, are all contained in the same (H) strand. In D. yakuba URF1 and URF2, the triplet AGA appears to specify an amino acid, which is again different from the situation found in mammalian mtDNAs, where AGA is used only as a rare termination codon. PMID:6294611
Molecular cytogenetic analysis of Xq critical regions in premature ovarian failure

PubMed Central

2013-01-01

Background One of the frequent reasons for unsuccessful conception is premature ovarian failure/primary ovarian insufficiency (POF/POI) that is defined as the loss of functional follicles below the age of 40 years. Among the genetic causes the most common one involves the X chromosome, as in Turner syndrome, partial X deletion and X-autosome translocations. Here we report a case of a 27-year-old female patient referred to genetic counselling because of premature ovarian failure. The aim of this case study to perform molecular genetic and cytogenetic analyses in order to identify the exact genetic background of the pathogenic phenotype. Results For premature ovarian failure disease diagnostics we performed the Fragile mental retardation 1 gene analysis using Southern blot technique and Repeat Primed PCR in order to identify the relationship between the Fragile mental retardation 1 gene premutation status and the premature ovarion failure disease. At this early onset, the premature ovarian failure affected patient we detected one normal allele of Fragile mental retardation 1 gene and we couldn’t verify the methylated allele, therefore we performed the cytogenetic analyses using G-banding and fluorescent in situ hybridization methods and a high resolution molecular cytogenetic method, the array comparative genomic hybridization technique. For this patient applying the G-banding, we identified a large deletion on the X chromosome at the critical region (ChrX q21.31-q28) which is associated with the premature ovarian failure phenotype. In order to detect the exact breakpoints, we used a special cytogenetic array ISCA plus CGH array and we verified a 67.355 Mb size loss at the critical region which include total 795 genes. Conclusions We conclude for this case study that the karyotyping is definitely helpful in the evaluation of premature ovarian failure patients, to identify the non submicroscopic chromosomal rearrangement, and using the array CGH technique we can contribute to the most efficient detection and mapping of exact deletion breakpoints of the deleted Xq region. PMID:24359613
Plastid genome evolution across the genus Cuscuta (Convolvulaceae): two clades within subgenus Grammica exhibit extensive gene loss.

PubMed

Braukmann, Thomas; Kuzmina, Maria; Stefanovic, Sasa

2013-02-01

The genus Cuscuta (Convolvulaceae, the morning glory family) is one of the most intensely studied lineages of parasitic plants. Whole plastome sequencing of four Cuscuta species has demonstrated changes to both plastid gene content and structure. The presence of photosynthetic genes under purifying selection indicates that Cuscuta is cryptically photosynthetic. However, the tempo and mode of plastid genome evolution across the diversity of this group (~200 species) remain largely unknown. A comparative investigation of plastid genome content, grounded within a phylogenetic framework, was conducted using a slot-blot Southern hybridization approach. Cuscuta was extensively sampled (~56% of species), including groups previously suggested to possess more altered plastomes compared with other members of this genus. A total of 56 probes derived from all categories of protein-coding genes, typically found within the plastomes of flowering plants, were used. The results indicate that two clades within subgenus Grammica (clades 'O' and 'K') exhibit substantially more plastid gene loss relative to other members of Cuscuta. All surveyed members of the 'O' clade show extensive losses of plastid genes from every category of genes typically found in the plastome, including otherwise highly conserved small and large ribosomal subunits. The extent of plastid gene losses within this clade is similar in magnitude to that observed previously in some non-asterid holoparasites, in which the very presence of a plastome has been questioned. The 'K' clade also exhibits considerable loss of plastid genes. Unlike in the 'O' clade, in which all species seem to be affected, the losses in clade 'K' progress phylogenetically, following a pattern consistent with the Evolutionary Transition Series hypothesis. This clade presents an ideal opportunity to study the reduction of the plastome of parasites 'in action'. The widespread plastid gene loss in these two clades is hypothesized to be a consequence of the complete loss of photosynthesis. Additionally, taxa that would be the best candidates for entire plastome sequencing are identified in order to investigate further the loss of photosynthesis and reduction of the plastome within Cuscuta.
Plastid genome evolution across the genus Cuscuta (Convolvulaceae): two clades within subgenus Grammica exhibit extensive gene loss

PubMed Central

Braukmann, Thomas

2013-01-01

The genus Cuscuta (Convolvulaceae, the morning glory family) is one of the most intensely studied lineages of parasitic plants. Whole plastome sequencing of four Cuscuta species has demonstrated changes to both plastid gene content and structure. The presence of photosynthetic genes under purifying selection indicates that Cuscuta is cryptically photosynthetic. However, the tempo and mode of plastid genome evolution across the diversity of this group (~200 species) remain largely unknown. A comparative investigation of plastid genome content, grounded within a phylogenetic framework, was conducted using a slot-blot Southern hybridization approach. Cuscuta was extensively sampled (~56% of species), including groups previously suggested to possess more altered plastomes compared with other members of this genus. A total of 56 probes derived from all categories of protein-coding genes, typically found within the plastomes of flowering plants, were used. The results indicate that two clades within subgenus Grammica (clades ‘O’ and ‘K’) exhibit substantially more plastid gene loss relative to other members of Cuscuta. All surveyed members of the ‘O’ clade show extensive losses of plastid genes from every category of genes typically found in the plastome, including otherwise highly conserved small and large ribosomal subunits. The extent of plastid gene losses within this clade is similar in magnitude to that observed previously in some non-asterid holoparasites, in which the very presence of a plastome has been questioned. The ‘K’ clade also exhibits considerable loss of plastid genes. Unlike in the ‘O’ clade, in which all species seem to be affected, the losses in clade ‘K’ progress phylogenetically, following a pattern consistent with the Evolutionary Transition Series hypothesis. This clade presents an ideal opportunity to study the reduction of the plastome of parasites ‘in action’. The widespread plastid gene loss in these two clades is hypothesized to be a consequence of the complete loss of photosynthesis. Additionally, taxa that would be the best candidates for entire plastome sequencing are identified in order to investigate further the loss of photosynthesis and reduction of the plastome within Cuscuta. PMID:23349139
An N-targeting real-time PCR strategy for the accurate detection of spring viremia of carp virus.

PubMed

Shao, Ling; Xiao, Yu; He, Zhengkan; Gao, Longying

2016-03-01

Spring viremia of carp virus (SVCV) is a highly pathogenic agent of several economically important Cyprinidae fish species. Currently, there are no effective vaccines or drugs for this virus, and prevention of the disease mostly relies on prompt diagnosis. Previously, nested RT-PCR and RT-qPCR detection methods based on the glycoprotein gene G have been developed. However, the high genetic diversity of the G gene seriously limits the reliability of those methods. Compared with the G gene, phylogenetic analyses indicate that the nucleoprotein gene N is more conserved. Furthermore, studies in other members of the Rhabdoviridae family reveals that their gene transcription level follows the order N>P>M>G>L, indicating that an N gene based RT-PCR should have higher sensitivity. Therefore, two pairs of primers and two corresponding probes targeting the conserved regions of the N gene were designed. RT-qPCR assays demonstrated all primers and probes could detect phylogenetically distant isolates specifically and efficiently. Moreover, in artificially infected fish, the detected copy numbers of the N gene were much higher than those of the G gene in all tissues, and both the N and G gene copy numbers were highest in the kidney and spleen. Testing in 1100 farm-raised fish also showed that the N-targeting strategy was more reliable than the G-targeting methods. The method developed in this study provides a reliable tool for the rapid diagnosis of SVCV. Copyright © 2015 Elsevier B.V. All rights reserved.
Characterization and Functional Analysis of Five MADS-Box B Class Genes Related to Floral Organ Identification in Tagetes erecta.

PubMed

Ai, Ye; Zhang, Chunling; Sun, Yalin; Wang, Weining; He, Yanhong; Bao, Manzhu

2017-01-01

According to the floral organ development ABC model, B class genes specify petal and stamen identification. In order to study the function of B class genes in flower development of Tagetes erecta, five MADS-box B class genes were identified and their expression and putative functions were studied. Sequence comparisons and phylogenetic analyses indicated that there were one PI-like gene-TePI, two euAP3-like genes-TeAP3-1 and TeAP3-2, and two TM6-like genes-TeTM6-1 and TeTM6-2 in T. erecta. Strong expression levels of these genes were detected in stamens of the disk florets, but little or no expression was detected in bracts, receptacles or vegetative organs. Yeast hybrid experiments of the B class proteins showed that TePI protein could form a homodimer and heterodimers with all the other four B class proteins TeAP3-1, TeAP3-2, TeTM6-1 and TeTM6-2. No homodimer or interaction was observed between the euAP3 and TM6 clade members. Over-expression of five B class genes of T. erecta in Nicotiana rotundifolia showed that only the transgenic plants of 35S::TePI showed altered floral morphology compared with the non-transgenic line. This study could contribute to the understanding of the function of B class genes in flower development of T. erecta, and provide a theoretical basis for further research to change floral organ structures and create new materials for plant breeding.
Transcriptome Analysis of Calcium- and Hormone-Related Gene Expressions during Different Stages of Peanut Pod Development

PubMed Central

Li, Yan; Meng, Jingjing; Yang, Sha; Guo, Feng; Zhang, Jialei; Geng, Yun; Cui, Li; Wan, Shubo; Li, Xinguo

2017-01-01

Peanut is one of the calciphilous plants. Calcium serves as a ubiquitous central hub in a large number of signaling pathways. In the field, free calcium ion (Ca2+)-deficient soil can result in unfilled pods. Four pod stages were analyzed to determine the relationship between Ca2+ excretion and pod development. Peanut shells showed Ca2+ excretion at all four stages; however, both the embryo of Stage 4 (S4) and the red skin of Stage 3 (S3) showed Ca2+ absorbance. These results showed that embryo and red skin of peanut need Ca2+ during development. In order to survey the relationship among calcium, hormone and seed development from gene perspective, we further analyzed the seed transcriptome at Stage 2 (S2), S3, and S4. About 70 million high quality clean reads were generated, which were assembled into 58,147 unigenes. By comparing these three stages, total 4,457 differentially expressed genes were identified. In these genes, 53 Ca2+ related genes, 40 auxin related genes, 15 gibberellin genes, 20 ethylene related genes, 2 abscisic acid related genes, and 7 cytokinin related genes were identified. Additionally, a part of them were validated by qRT-PCR. Most of their expressions changed during the pod development. Since some reports showed that Ca2+ signal transduction pathway is involved in hormone regulation pathway, these results implied that peanut seed development might be regulated by the collaboration of Ca2+ signal transduction pathway and hormone regulation pathway. PMID:28769950
Mucosal CCR1 gene expression as a marker of molecular activity in Crohn's disease: preliminary data.

PubMed

Dobre, Maria; Mănuc, Teodora Ecaterina; Milanesi, Elena; Pleşea, Iancu Emil; Ţieranu, Eugen Nicolae; Popa, Caterina; Mănuc, Mircea; Preda, Carmen Monica; Ţieranu, Ioana; Diculescu, Mihai Mircea; Ionescu, Elena Mirela; Becheanu, Gabriel

2017-01-01

A series of mechanisms of immune response, inflammation and apoptosis have been demonstrated to contribute to the appearance and evolution of Crohn's disease (CD) through the overexpression of several cytokines and chemokines in a susceptible host. The aim of this study was to identify the differences in gene expression profiles analyzing a panel of candidate genes in the mucosa from patients with active CD (CD-A), patients in remission (CD-R), and normal controls. Nine individuals were enrolled in the study: six CD patients (three with active lesions, three with mucosal healing) and three controls without inflammatory bowel disease (IBD) seen on endoscopy. All the individuals underwent mucosal biopsy during colonoscopy. Gene expression levels of 84 genes previously associated with CD were evaluated by polymerase chain reaction (PCR) array. Ten genes out of 84 were found significantly differentially expressed in CD-A (CCL11, CCL25, DEFA5, GCG, IL17A, LCN2, REG1A, STAT3, MUC1, CCR1) and eight genes in CD-R (CASP1, IL23A, STAT1, STAT3, TNF, CCR1, CCL5, and HSP90B1) when compared to controls. A quantitative gene expression analysis revealed that CCR1 gene was more expressed in CD-A than in CD-R. Our data suggest that CCR1 gene may be a putative marker of molecular activity of Crohn's disease. Following these preliminary data, a confirmation in larger cohort studies could represent a useful method in order to identify new therapeutic targets.
[Sequencing and analysis of the resistome of Streptomyces fradiae ATCC19609 in order to develop a test system for screening of new antimicrobial agents].

PubMed

Vatlin, A A; Bekker, O B; Lysenkova, L N; Korolev, A M; Shchekotikhin, A E; Danilenko, V N

2016-06-01

The paper provides the annotation and data on sequencing the antibiotic resistance genes in Streptomyces fradiae strain ATCC19609, highly sensitive to different antibiotics. Genome analysis revealed four groups of genes that determined the resistome of the tested strain. These included classical antibiotic resistance genes (nine aminoglycoside phosphotransferase genes, two beta-lactamase genes, and the genes of puromycin N-acetyltransferase, phosphinothricin N-acetyltransferase, and aminoglycoside acetyltransferase); the genes of ATP-dependent ABC transporters, involved in the efflux of antibiotics from the cell (MacB-2, BcrA, two-subunit MDR1); the genes of positive and negative regulation of transcription (whiB and padR families); and the genes of post-translational modification (serine-threonine protein kinases). A comparative characteristic of aminoglycoside phosphotransferase genes in S. fradiae ATCC19609, S. lividans TK24, and S. albus J1074, the causative agent of actinomycosis, is provided. The possibility of using the S. fradiae strain ATCC19609 as the test system for selection of the macrolide antibiotic oligomycin A derivatives with different levels of activity is demonstrated. Analysis of more than 20 semisynthetic oligomycin A derivatives made it possible to divide them into three groups according to the level of activity: inactive (>1 nmol/disk), 10 substances; with medium activity level (0.05–1 nmol/disk), 12 substances; and more active (0.01–0.05 nmol/disk), 2 substances. Important for the activity of semisynthetic derivatives is the change in the position of the 33rd carbon atom in the oligomycin A molecule.
GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.

PubMed

Doungpan, Narumol; Engchuan, Worrawat; Chan, Jonathan H; Meechai, Asawin

2016-12-05

Gene expression has been used to identify disease gene biomarkers, but there are ongoing challenges. Single gene or gene-set biomarkers are inadequate to provide sufficient understanding of complex disease mechanisms and the relationship among those genes. Network-based methods have thus been considered for inferring the interaction within a group of genes to further study the disease mechanism. Recently, the Gene-Network-based Feature Set (GNFS), which is capable of handling case-control and multiclass expression for gene biomarker identification, has been proposed, partly taking into account of network topology. However, its performance relies on a greedy search for building subnetworks and thus requires further improvement. In this work, we establish a new approach named Gene Sub-Network-based Feature Selection (GSNFS) by implementing the GNFS framework with two proposed searching and scoring algorithms, namely gene-set-based (GS) search and parent-node-based (PN) search, to identify subnetworks. An additional dataset is used to validate the results. The two proposed searching algorithms of the GSNFS method for subnetwork expansion are concerned with the degree of connectivity and the scoring scheme for building subnetworks and their topology. For each iteration of expansion, the neighbour genes of a current subnetwork, whose expression data improved the overall subnetwork score, is recruited. While the GS search calculated the subnetwork score using an activity score of a current subnetwork and the gene expression values of its neighbours, the PN search uses the expression value of the corresponding parent of each neighbour gene. Four lung cancer expression datasets were used for subnetwork identification. In addition, using pathway data and protein-protein interaction as network data in order to consider the interaction among significant genes were discussed. Classification was performed to compare the performance of the identified gene subnetworks with three subnetwork identification algorithms. The two searching algorithms resulted in better classification and gene/gene-set agreement compared to the original greedy search of the GNFS method. The identified lung cancer subnetwork using the proposed searching algorithm resulted in an improvement of the cross-dataset validation and an increase in the consistency of findings between two independent datasets. The homogeneity measurement of the datasets was conducted to assess dataset compatibility in cross-dataset validation. The lung cancer dataset with higher homogeneity showed a better result when using the GS search while the dataset with low homogeneity showed a better result when using the PN search. The 10-fold cross-dataset validation on the independent lung cancer datasets showed higher classification performance of the proposed algorithms when compared with the greedy search in the original GNFS method. The proposed searching algorithms provide a higher number of genes in the subnetwork expansion step than the greedy algorithm. As a result, the performance of the subnetworks identified from the GSNFS method was improved in terms of classification performance and gene/gene-set level agreement depending on the homogeneity of the datasets used in the analysis. Some common genes obtained from the four datasets using different searching algorithms are genes known to play a role in lung cancer. The improvement of classification performance and the gene/gene-set level agreement, and the biological relevance indicated the effectiveness of the GSNFS method for gene subnetwork identification using expression data.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.