Crimean-Congo Hemorrhagic Fever
2004-01-01
aminocaproic acid were also indicated. Much emphasis was also placed on preventing reinfection, including the necessity of remov- ing blood crusts from...The se- quence is approximately 60% identical both at the nucleotide and amino acid levels to the L segment of Dugbe virus, the only other Nairovirus...However, more recent data based on nucleic acid sequence analysis have revealed extensive genetic diversity. The first published CCHFV sequence
NASA Astrophysics Data System (ADS)
Karakatsanis, L. P.; Pavlos, G. P.; Iliopoulos, A. C.; Pavlos, E. G.; Clark, P. M.; Duke, J. L.; Monos, D. S.
2018-09-01
This study combines two independent domains of science, the high throughput DNA sequencing capabilities of Genomics and complexity theory from Physics, to assess the information encoded by the different genomic segments of exonic, intronic and intergenic regions of the Major Histocompatibility Complex (MHC) and identify possible interactive relationships. The dynamic and non-extensive statistical characteristics of two well characterized MHC sequences from the homozygous cell lines, PGF and COX, in addition to two other genomic regions of comparable size, used as controls, have been studied using the reconstructed phase space theorem and the non-extensive statistical theory of Tsallis. The results reveal similar non-linear dynamical behavior as far as complexity and self-organization features. In particular, the low-dimensional deterministic nonlinear chaotic and non-extensive statistical character of the DNA sequences was verified with strong multifractal characteristics and long-range correlations. The nonlinear indices repeatedly verified that MHC sequences, whether exonic, intronic or intergenic include varying levels of information and reveal an interaction of the genes with intergenic regions, whereby the lower the number of genes in a region, the less the complexity and information content of the intergenic region. Finally we showed the significance of the intergenic region in the production of the DNA dynamics. The findings reveal interesting content information in all three genomic elements and interactive relationships of the genes with the intergenic regions. The results most likely are relevant to the whole genome and not only to the MHC. These findings are consistent with the ENCODE project, which has now established that the non-coding regions of the genome remain to be of relevance, as they are functionally important and play a significant role in the regulation of expression of genes and coordination of the many biological processes of the cell.
Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya
2015-01-01
Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930
Sampson, Juliana K.; Sheth, Nihar U.; Koparde, Vishal N.; Scalora, Allison F.; Serrano, Myrna G.; Lee, Vladimir; Roberts, Catherine H.; Jameson-Lee, Max; Ferreira-Gonzalez, Andrea; Manjili, Masoud H.; Buck, Gregory A.; Neale, Michael C.; Toor, Amir A.
2016-01-01
Summary Whole exome sequencing (WES) was performed on stem cell transplant donor-recipient (D-R) pairs to determine the extent of potential antigenic variation at a molecular level. In a small cohort of D-R pairs, a high frequency of sequence variation was observed between the donor and recipient exomes independent of human leucocyte antigen (HLA) matching. Nonsynonymous, nonconservative single nucleotide polymorphisms were approximately twice as frequent in HLA-matched unrelated, compared with related D-R pairs. When mapped to individual chromosomes, these polymorphic nucleotides were uniformly distributed across the entire exome. In conclusion, WES reveals extensive nucleotide sequence variation in the exomes of HLA-matched donors and recipients. PMID:24749631
USDA-ARS?s Scientific Manuscript database
The P. ultimum DAOM BR144 (=CBS 805.95 = ATCC200006) genome (42.8 Mb) encodes 15,290 genes, and has extensive sequence similarity and synteny with related Phytophthora spp., including the potato late blight pathogen Phytophthora infestans. Whole transcriptome sequencing revealed expression of 86 % o...
High levels of diversity characterize mandrill (Mandrillus sphinx) Mhc-DRB sequences.
Abbott, Kristin M; Wickings, E Jean; Knapp, Leslie A
2006-08-01
The major histocompatibility complex (MHC) is highly polymorphic in most primate species studied thus far. The rhesus macaque (Macaca mulatta) has been studied extensively and the Mhc-DRB region demonstrates variability similar to humans. The extent of MHC diversity is relatively unknown for other Old World monkeys (OWM), especially among genera other than Macaca. A molecular survey of the Mhc-DRB region in mandrills (Mandrillus sphinx) revealed extensive variability, suggesting that other OWMs may also possess high levels of Mhc-DRB polymorphism. In the present study, 33 Mhc-DRB loci were identified from only 13 animals. Eleven were wild-born and presumed to be unrelated and two were captive-born twins. Two to seven different sequences were identified for each individual, suggesting that some mandrills may have as many as four Mhc-DRB loci on a single haplotype. From these sequences, representatives of at least six Mhc-DRB loci or lineages were identified. As observed in other primates, some new lineages may have arisen through the process of gene conversion. These findings indicate that mandrills have Mhc-DRB diversity not unlike rhesus macaques and humans.
Sampson, Juliana K; Sheth, Nihar U; Koparde, Vishal N; Scalora, Allison F; Serrano, Myrna G; Lee, Vladimir; Roberts, Catherine H; Jameson-Lee, Max; Ferreira-Gonzalez, Andrea; Manjili, Masoud H; Buck, Gregory A; Neale, Michael C; Toor, Amir A
2014-08-01
Whole exome sequencing (WES) was performed on stem cell transplant donor-recipient (D-R) pairs to determine the extent of potential antigenic variation at a molecular level. In a small cohort of D-R pairs, a high frequency of sequence variation was observed between the donor and recipient exomes independent of human leucocyte antigen (HLA) matching. Nonsynonymous, nonconservative single nucleotide polymorphisms were approximately twice as frequent in HLA-matched unrelated, compared with related D-R pairs. When mapped to individual chromosomes, these polymorphic nucleotides were uniformly distributed across the entire exome. In conclusion, WES reveals extensive nucleotide sequence variation in the exomes of HLA-matched donors and recipients. © 2014 John Wiley & Sons Ltd.
Jo, Yeong Deuk; Choi, Yoomi; Kim, Dong-Hwan; Kim, Byung-Dong; Kang, Byoung-Cheorl
2014-07-04
Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp. We obtained the complete mitochondrial genome sequences of the pepper CMS line FS4401 (507,452 bp) and the fertile line Jeju (511,530 bp). Comparative analysis between mitochondrial genomes of peppers and tobacco that are included in Solanaceae revealed extensive DNA rearrangements and poor conservation in non-coding DNA. In comparison between pepper lines, FS4401 and Jeju mitochondrial DNAs contained the same complement of protein coding genes except for one additional copy of an atp6 gene (ψatp6-2) in FS4401. In terms of genome structure, we found eighteen syntenic blocks in the two mitochondrial genomes, which have been rearranged in each genome. By contrast, sequences between syntenic blocks, which were specific to each line, accounted for 30,380 and 17,847 bp in FS4401 and Jeju, respectively. The previously-reported CMS candidate genes, orf507 and ψatp6-2, were located on the edges of the largest sequence segments that were specific to FS4401. In this region, large number of small sequence segments which were absent or found on different locations in Jeju mitochondrial genome were combined together. The incorporation of repeats and overlapping of connected sequence segments by a few nucleotides implied that extensive rearrangements by homologous recombination might be involved in evolution of this region. Further analysis using mtDNA pairs from other plant species revealed common features of DNA regions around CMS-associated genes. Although large portion of sequence context was shared by mitochondrial genomes of CMS and male-fertile pepper lines, extensive genome rearrangements were detected. CMS candidate genes located on the edges of highly-rearranged CMS-specific DNA regions and near to repeat sequences. These characteristics were detected among CMS-associated genes in other species, implying a common mechanism might be involved in the evolution of CMS-associated genes.
Bhagwat, Basdeo; Dickison, Virginia; Ding, Xinlun; Walker, Melanie; Bernardy, Michael; Bouthillier, Michel; Creelman, Alexa; DeYoung, Robyn; Li, Yinzi; Nie, Xianzhou; Wang, Aiming; Xiang, Yu; Sanfaçon, Hélène
2016-06-01
In this study, we report the genome sequence of five isolates of strawberry mottle virus (family Secoviridae, order Picornavirales) from strawberry field samples with decline symptoms collected in Eastern Canada. The Canadian isolates differed from the previously characterized European isolate 1134 in that they had a longer RNA2, resulting in a 239-amino-acid extension of the C-terminal region of the polyprotein. Sequence analysis suggests that reassortment and recombination occurred among the isolates. Phylogenetic analysis revealed that the Canadian isolates are diverse, grouping in two separate branches along with isolates from Europe and the Americas.
Characterization of a prototype strain of hepatitis E virus.
Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H
1992-01-15
A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia.
Feldman, H.R.; Franseen, E.K.; Joeckel, R.M.; Heckel, P.H.
2005-01-01
Pennsylvanian glacioeustatic cyclothems exposed in Kansas and adjacent areas provide a unique opportunity to test models of the impact of relative sea level and climate on stratal architecture. A succession of eight of these high-frequency sequences, traced along dip for 500 km, reveal that modest climate shifts from relatively dry-seasonal to relatively wet-seasonal with a duration of several sequences (???600,000 to 1 million years) had a dominant impact on facies, sediment dispersal patterns, and sequence architecture. The climate shifts documented herein are intermediate, both in magnitude and duration, between previously documented longer-term climate shifts throughout much of the Pennsylvanian and shorter-term shifts described within individual sequences. Climate indicators are best preserved at sequence boundaries and in incised-valley fills of the lowstand systems tracts (LST). Relatively drier climate indicators include high-chroma paleosols, typically with pedogenic carbonates, and plant assemblages that are dominated by gymnosperms, mostly xerophytic walchian conifers. The associated valleys are small (4 km wide and >20 m deep), and dominated by quartz sandstones derived from distant source areas, reflecting large drainage networks. Transgressive systems tracts (TST) in all eight sequences gen erally are characterized by thin, extensive limestones and thin marine shales, suggesting that the dominant control on TST facies distribution was the sequestration of siliciclastic sediment in updip positions. Highstand systems tracts (HST) were significantly impacted by the intermediate-scale climate cycle in that HSTs from relatively drier climates consist of thin marine shales overlain by extensive, thick regressive limestones, whereas HSTs from relatively wetter climates are dominated by thick marine shales. Previously documented relative sea-level changes do not track the climate cycles, indicating that climate played a role distinct from that of relative sea-level change. These intermediate-scale modest climate shifts had a dominant impact on sequence architecture. This independent measure of climate and relative sea level may allow the testing of models of climate and sediment supply based on modern systems. Copyright ?? 2005, SEPM.
Underwound DNA under Tension: Structure, Elasticity, and Sequence-Dependent Behaviors
NASA Astrophysics Data System (ADS)
Sheinin, Maxim Y.; Forth, Scott; Marko, John F.; Wang, Michelle D.
2011-09-01
DNA melting under torsion plays an important role in a wide variety of cellular processes. In the present Letter, we have investigated DNA melting at the single-molecule level using an angular optical trap. By directly measuring force, extension, torque, and angle of DNA, we determined the structural and elastic parameters of torsionally melted DNA. Our data reveal that under moderate forces, the melted DNA assumes a left-handed structure as opposed to an open bubble conformation and is highly torsionally compliant. We have also discovered that at low forces melted DNA properties are highly dependent on DNA sequence. These results provide a more comprehensive picture of the global DNA force-torque phase diagram.
May, Jared; Johnson, Philip; Saleem, Huma
2017-01-01
ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526
Characterization of a prototype strain of hepatitis E virus.
Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H
1992-01-01
A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia. Images PMID:1731327
Not All Order Memory Is Equal: Test Demands Reveal Dissociations in Memory for Sequence Information
ERIC Educational Resources Information Center
Jonker, Tanya R.; MacLeod, Colin M.
2017-01-01
Remembering the order of a sequence of events is a fundamental feature of episodic memory. Indeed, a number of formal models represent temporal context as part of the memory system, and memory for order has been researched extensively. Yet, the nature of the code(s) underlying sequence memory is still relatively unknown. Across 4 experiments that…
DNA-DNA hybridization values and their relationship to whole-genome sequence similarities.
Goris, Johan; Konstantinidis, Konstantinos T; Klappenbach, Joel A; Coenye, Tom; Vandamme, Peter; Tiedje, James M
2007-01-01
DNA-DNA hybridization (DDH) values have been used by bacterial taxonomists since the 1960s to determine relatedness between strains and are still the most important criterion in the delineation of bacterial species. Since the extent of hybridization between a pair of strains is ultimately governed by their respective genomic sequences, we examined the quantitative relationship between DDH values and genome sequence-derived parameters, such as the average nucleotide identity (ANI) of common genes and the percentage of conserved DNA. A total of 124 DDH values were determined for 28 strains for which genome sequences were available. The strains belong to six important and diverse groups of bacteria for which the intra-group 16S rRNA gene sequence identity was greater than 94 %. The results revealed a close relationship between DDH values and ANI and between DNA-DNA hybridization and the percentage of conserved DNA for each pair of strains. The recommended cut-off point of 70 % DDH for species delineation corresponded to 95 % ANI and 69 % conserved DNA. When the analysis was restricted to the protein-coding portion of the genome, 70 % DDH corresponded to 85 % conserved genes for a pair of strains. These results reveal extensive gene diversity within the current concept of "species". Examination of reciprocal values indicated that the level of experimental error associated with the DDH method is too high to reveal the subtle differences in genome size among the strains sampled. It is concluded that ANI can accurately replace DDH values for strains for which genome sequences are available.
Gent, Jonathan I; Wang, Na; Dawe, R Kelly
2017-06-21
Paradoxically, centromeres are known both for their characteristic repeat sequences (satellite DNA) and for being epigenetically defined. Maize (Zea mays mays) is an attractive model for studying centromere positioning because many of its large (~2 Mb) centromeres are not dominated by satellite DNA. These centromeres, which we call complex centromeres, allow for both assembly into reference genomes and for mapping short reads from ChIP-seq with antibodies to centromeric histone H3 (cenH3). We found frequent complex centromeres in maize and its wild relatives Z. mays parviglumis, Z. mays mexicana, and particularly Z. mays huehuetenangensis. Analysis of individual plants reveals minor variation in the positions of complex centromeres among siblings. However, such positional shifts are stochastic and not heritable, consistent with prior findings that centromere positioning is stable at the population level. Centromeres are also stable in multiple F1 hybrid contexts. Analysis of repeats in Z. mays and other species (Zea diploperennis, Zea luxurians, and Tripsacum dactyloides) reveals tenfold differences in abundance of the major satellite CentC, but similar high levels of sequence polymorphism in individual CentC copies. Deviation from the CentC consensus has little or no effect on binding of cenH3. These data indicate that complex centromeres are neither a peculiarity of cultivation nor inbreeding in Z. mays. While extensive arrays of CentC may be the norm for other Zea and Tripsacum species, these data also reveal that a wide diversity of DNA sequences and multiple types of genetic elements in and near centromeres support centromere function and constrain centromere positions.
Sequence modelling and an extensible data model for genomic database
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Peter Wei-Der
1992-01-01
The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of these information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMS's do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data modelmore » that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the Extensible Object Model'', to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implement the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.« less
Sequence modelling and an extensible data model for genomic database
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Peter Wei-Der
1992-01-01
The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of these information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMS`s do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data modelmore » that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the ``Extensible Object Model``, to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implement the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.« less
Structure and regulation of the Yersinia pestis yscBCDEF operon.
Haddix, P L; Straley, S C
1992-01-01
We have investigated the physical and genetic structure and regulation of the Yersinia pestis yscBCDEF region, previously called lcrC. DNA sequence analysis showed that this region is homologous to the corresponding part of the ysc locus of Yersinia enterocolitica and suggested that the yscBCDEF cistrons belong to a single operon on the low-calcium response virulence plasmid pCD1. Promoter activity measurements of ysc subclones indicated that yscBCDEF constitutes a suboperon of the larger ysc region by revealing promoter activity in a clone containing the 3' end of yscD, intact yscE and yscF, and part of yscG. These experiments also revealed an additional weak promoter upstream of yscD. Northern (RNA) analysis with a yscD probe showed that operon transcription is thermally induced and downregulated in the presence of Ca2+. Primer extension of operon transcripts suggested that two promoters, a moderate-level constitutive one and a stronger, calcium-downregulated one, control full-length operon transcription at 37 degrees C. Primer extension provided additional support for the proposed designation of a yscBCDEF suboperon by identifying a 5' end within yscF, for which relative abundances in the presence and absence of Ca2+ revealed regulation that is distinct from that for transcripts initiating farther upstream. YscB and YscC were expressed in Escherichia coli by using a high-level transcription system. Attempts to express YscD were only partially successful, but they revealed interesting regulation at the translational level. Images PMID:1624469
DOE Office of Scientific and Technical Information (OSTI.GOV)
Williams, Kelly P.
2013-10-03
This package assists in genome assembly. extendFromReads takes as input a set of Illumina (eg, MiSeq) DNA sequencing reads, a query seed sequence and a direction to extend the seed. The algorithm collects all seed-- ]matching reads (flipping reverse-- ]orientation hits), trims off the seed and additional sequence in the other direction, sorts the remaining sequences alphabetically, and prints them aligned without gaps from the point of seed trimming. This produces a visual display distinguishing the flanks of multi- ]copy seeds. A companion script hitMates.pl collects the mates of seed-- ]hi]ng reads, whose alignment reveals longer extensions from the seed.more » The collect/trim/sort strategy was made iterative and scaled up in the script denovo.pl, for de novo contig assembly. An index is pre-- ]built using indexReads.pl that for each unique 21-- ]mer found in all the reads, records its gfate h of extension (whether extendable, blocked by low coverage, or blocked by branching after a duplicated sequence) and other characteristics. Importantly, denovo.pl records all branchings that follow a branching contig endpoint, providing contig- ]extension information« less
The contribution of alu elements to mutagenic DNA double-strand break repair.
Morales, Maria E; White, Travis B; Streva, Vincent A; DeFreece, Cecily B; Hedges, Dale J; Deininger, Prescott L
2015-03-01
Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both the rate and nature of DNA repair events.
Genomic structure of the human D-site binding protein (DBP) gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shutler, G.; Glassco, T.; Kang, Xiaolin
1996-06-15
The human gene for the D-Site Binding Protein (DBP) has been sequenced and characterized. This gene is a member of the b/ZIP family of transcription factors and is one of three genes forming the PAR sub-family. DBP has been implicated in the diurnal regulation of a variety of liver-specific genes. Examination of the genomic structure of DBP reveals that the gene is divided into four exons and is contained within a relatively compact region of approximately 6 kb. These exons appear to correspond to functional divisions the DBP protein. Exon 1 contains a long 5{prime} UTR, and conservation between themore » rat and the human genes of the presence of small open reading frames within this region suggests that is may play a role in translational control. Exon 2 contains a limited region of similarity to the other PAR domain genes, which may be part of a potential activation domain. Exon 3 contains the PAR domain and differs by only 1 of 71 amino acids between rat and human. Exon 4, containing both the basic and the leucine zipper domains, is likewise highly conserved. The overall degree of homology between the rat and the human cDNA sequences is 82% for the nucleic acid sequence and 92% for the protein sequence. comparison of the rat and human proximal promoters reveals extensive sequence conservation, with two previously characterized DNA binding sites being conserved at the functional and sequence levels. 31 refs., 4 figs.« less
Komisarczuk, Anna Z; Kongshaug, Heidi; Nilsen, Frank
2018-02-01
Na + /K + -ATPase has a key function in a variety of physiological processes including membrane excitability, osmoregulation, regulation of cell volume, and transport of nutrients. While knowledge about Na + /K + -ATPase function in osmoregulation in crustaceans is extensive, the role of this enzyme in other physiological and developmental processes is scarce. Here, we report characterization, transcriptional distribution and likely functions of the newly identified L. salmonis Na + /K + -ATPase (LsalNa + /K + -ATPase) α subunit in various developmental stages. The complete mRNA sequence was identified, with 3003 bp open reading frame encoding a putative protein of 1001 amino acids. Putative protein sequence of LsalNa + /K + -ATPase revealed all typical features of Na + /K + -ATPase and demonstrated high sequence identity to other invertebrate and vertebrate species. Quantitative RT-PCR analysis revealed higher LsalNa + /K + -ATPase transcript level in free-living stages in comparison to parasitic stages. In situ hybridization analysis of copepodids and adult lice revealed LsalNa + /K + -ATPase transcript localization in a wide variety of tissues such as nervous system, intestine, reproductive system, and subcuticular and glandular tissue. RNAi mediated knock-down of LsalNa + /K + -ATPase caused locomotion impairment, and affected reproduction and feeding. Morphological analysis of dsRNA treated animals revealed muscle degeneration in larval stages, severe changes in the oocyte formation and maturation in females and abnormalities in tegmental glands. Thus, the study represents an important foundation for further functional investigation and identification of physiological pathways in which Na + /K + -ATPase is directly or indirectly involved. Copyright © 2018 Elsevier Inc. All rights reserved.
Modic Type 1 Changes: Detection Performance of Fat-Suppressed Fluid-Sensitive MRI Sequences.
Finkenstaedt, Tim; Del Grande, Filippo; Bolog, Nicolae; Ulrich, Nils; Tok, Sina; Kolokythas, Orpheus; Steurer, Johann; Andreisek, Gustav; Winklhofer, Sebastian
2018-02-01
To assess the performance of fat-suppressed fluid-sensitive MRI sequences compared to T1-weighted (T1w) / T2w sequences for the detection of Modic 1 end-plate changes on lumbar spine MRI. Sagittal T1w, T2w, and fat-suppressed fluid-sensitive MRI images of 100 consecutive patients (consequently 500 vertebral segments; 52 female, mean age 74 ± 7.4 years; 48 male, mean age 71 ± 6.3 years) were retrospectively evaluated. We recorded the presence (yes/no) and extension (i. e., Likert-scale of height, volume, and end-plate extension) of Modic I changes in T1w/T2w sequences and compared the results to fat-suppressed fluid-sensitive sequences (McNemar/Wilcoxon-signed-rank test). Fat-suppressed fluid-sensitive sequences revealed significantly more Modic I changes compared to T1w/T2w sequences (156 vs. 93 segments, respectively; p < 0.001). The extension of Modic I changes in fat-suppressed fluid-sensitive sequences was significantly larger compared to T1w/T2w sequences (height: 2.53 ± 0.82 vs. 2.27 ± 0.79, volume: 2.35 ± 0.76 vs. 2.1 ± 0.65, end-plate: 2.46 ± 0.76 vs. 2.19 ± 0.81), (p < 0.05). Modic I changes that were only visible in fat-suppressed fluid-sensitive sequences but not in T1w/T2w sequences were significantly smaller compared to Modic I changes that were also visible in T1w/T2w sequences (p < 0.05). In conclusion, fat-suppressed fluid-sensitive MRI sequences revealed significantly more Modic I end-plate changes and demonstrated a greater extent compared to standard T1w/T2w imaging. · When the Modic classification was defined in 1988, T2w sequences were heavily T2-weighted and thus virtually fat-suppressed.. · Nowadays, the bright fat signal in T2w images masks edema-like changes.. · The conventional definition of Modic I changes is not fully applicable anymore.. · Fat-suppressed fluid-sensitive MRI sequences revealed more/greater extent of Modic I changes.. · Finkenstaedt T, Del Grande F, Bolog N et al. Modic Type 1 Changes: Detection Performance of Fat-Suppressed Fluid-Sensitive MRI Sequences. Fortschr Röntgenstr 2018; 190: 152 - 160. © Georg Thieme Verlag KG Stuttgart · New York.
Characterization and Exploitation of CRISPR Loci in Bifidobacterium longum
Hidalgo-Cantabrana, Claudio; Crawley, Alexandra B.; Sanchez, Borja; Barrangou, Rodolphe
2017-01-01
Diverse CRISPR-Cas systems provide adaptive immunity in many bacteria and most archaea, via a DNA-encoded, RNA-mediated, nucleic-acid targeting mechanism. Over time, CRISPR loci expand via iterative uptake of invasive DNA sequences into the CRISPR array during the adaptation process. These genetic vaccination cards thus provide insights into the exposure of strains to phages and plasmids in space and time, revealing the historical predatory exposure of a strain. These genetic loci thus constitute a unique basis for genotyping of strains, with potential of resolution at the strain-level. Here, we investigate the occurrence and diversity of CRISPR-Cas systems in the genomes of various Bifidobacterium longum strains across three sub-species. Specifically, we analyzed the genomic content of 66 genomes belonging to B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis, and identified 25 strains that carry 29 total CRISPR-Cas systems. We identify various Type I and Type II CRISPR-Cas systems that are widespread in this species, notably I-C, I-E, and II-C. Noteworthy, Type I-C systems showed extended CRISPR arrays, with extensive spacer diversity. We show how these hypervariable loci can be used to gain insights into strain origin, evolution and phylogeny, and can provide discriminatory sequences to distinguish even clonal isolates. By investigating CRISPR spacer sequences, we reveal their origin and implicate phages and prophages as drivers of CRISPR immunity expansion in this species, with redundant targeting of select prophages. Analysis of CRISPR spacer origin also revealed novel PAM sequences. Our results suggest that CRISPR-Cas immune systems are instrumental in mounting diversified viral resistance in B. longum, and show that these sequences are useful for typing across three subspecies. PMID:29033911
Characterization and Exploitation of CRISPR Loci in Bifidobacterium longum.
Hidalgo-Cantabrana, Claudio; Crawley, Alexandra B; Sanchez, Borja; Barrangou, Rodolphe
2017-01-01
Diverse CRISPR-Cas systems provide adaptive immunity in many bacteria and most archaea, via a DNA-encoded, RNA-mediated, nucleic-acid targeting mechanism. Over time, CRISPR loci expand via iterative uptake of invasive DNA sequences into the CRISPR array during the adaptation process. These genetic vaccination cards thus provide insights into the exposure of strains to phages and plasmids in space and time, revealing the historical predatory exposure of a strain. These genetic loci thus constitute a unique basis for genotyping of strains, with potential of resolution at the strain-level. Here, we investigate the occurrence and diversity of CRISPR-Cas systems in the genomes of various Bifidobacterium longum strains across three sub-species. Specifically, we analyzed the genomic content of 66 genomes belonging to B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis , and identified 25 strains that carry 29 total CRISPR-Cas systems. We identify various Type I and Type II CRISPR-Cas systems that are widespread in this species, notably I-C, I-E, and II-C. Noteworthy, Type I-C systems showed extended CRISPR arrays, with extensive spacer diversity. We show how these hypervariable loci can be used to gain insights into strain origin, evolution and phylogeny, and can provide discriminatory sequences to distinguish even clonal isolates. By investigating CRISPR spacer sequences, we reveal their origin and implicate phages and prophages as drivers of CRISPR immunity expansion in this species, with redundant targeting of select prophages. Analysis of CRISPR spacer origin also revealed novel PAM sequences. Our results suggest that CRISPR-Cas immune systems are instrumental in mounting diversified viral resistance in B. longum , and show that these sequences are useful for typing across three subspecies.
Trading genes along the silk road: mtDNA sequences and the origin of central Asian populations.
Comas, D; Calafell, F; Mateu, E; Pérez-Lezaun, A; Bosch, E; Martínez-Arias, R; Clarimon, J; Facchini, F; Fiori, G; Luiselli, D; Pettener, D; Bertranpetit, J
1998-01-01
Central Asia is a vast region at the crossroads of different habitats, cultures, and trade routes. Little is known about the genetics and the history of the population of this region. We present the analysis of mtDNA control-region sequences in samples of the Kazakh, the Uighurs, the lowland Kirghiz, and the highland Kirghiz, which we have used to address both the population history of the region and the possible selective pressures that high altitude has on mtDNA genes. Central Asian mtDNA sequences present features intermediate between European and eastern Asian sequences, in several parameters-such as the frequencies of certain nucleotides, the levels of nucleotide diversity, mean pairwise differences, and genetic distances. Several hypotheses could explain the intermediate position of central Asia between Europe and eastern Asia, but the most plausible would involve extensive levels of admixture between Europeans and eastern Asians in central Asia, possibly enhanced during the Silk Road trade and clearly after the eastern and western Eurasian human groups had diverged. Lowland and highland Kirghiz mtDNA sequences are very similar, and the analysis of molecular variance has revealed that the fraction of mitochondrial genetic variance due to altitude is not significantly different from zero. Thus, it seems unlikely that altitude has exerted a major selective pressure on mitochondrial genes in central Asian populations. PMID:9837835
Molecular Simulations of Sequence-Specific Association of Transmembrane Proteins in Lipid Bilayers
NASA Astrophysics Data System (ADS)
Doxastakis, Manolis; Prakash, Anupam; Janosi, Lorant
2011-03-01
Association of membrane proteins is central in material and information flow across the cellular membranes. Amino-acid sequence and the membrane environment are two critical factors controlling association, however, quantitative knowledge on such contributions is limited. In this work, we study the dimerization of helices in lipid bilayers using extensive parallel Monte Carlo simulations with recently developed algorithms. The dimerization of Glycophorin A is examined employing a coarse-grain model that retains a level of amino-acid specificity, in three different phospholipid bilayers. Association is driven by a balance of protein-protein and lipid-induced interactions with the latter playing a major role at short separations. Following a different approach, the effect of amino-acid sequence is studied using the four transmembrane domains of the epidermal growth factor receptor family in identical lipid environments. Detailed characterization of dimer formation and estimates of the free energy of association reveal that these helices present significant affinity to self-associate with certain dimers forming non-specific interfaces.
Hammond, R; Smith, D R; Diener, T O
1989-01-01
The Columnea latent viroid (CLV) occurs latently in certain Columnea erythrophae plants grown commercially. In potato and tomato, CLV causes potato spindle tuber viroid (PSTV)-like symptoms. Its nucleotide sequence and proposed secondary structure reveal that CLV consists of a single-stranded circular RNA of 370 nucleotides which can assume a rod-like structure with extensive base-pairing characteristic of all known viroids. The electrophoretic mobility of circular CLV under nondenaturing conditions suggests a potential tertiary structure. CLV contains extensive sequence homologies to the PSTV group of viroids but contains a central conserved region identical to that of hop stunt viroid (HSV). CLV also shares some biological properties with each of the two types of viroids. Most probably, CLV is the result of intracellular RNA recombination between an HSV-type and one or more PSTV-type viroids replicating in the same plant. Images PMID:2602114
Comparative immunogenomics of molluscs.
Schultz, Jonathan H; Adema, Coen M
2017-10-01
Comparative immunology, studying both vertebrates and invertebrates, provided the earliest descriptions of phagocytosis as a general immune mechanism. However, the large scale of animal diversity challenges all-inclusive investigations and the field of immunology has developed by mostly emphasizing study of a few vertebrate species. In addressing the lack of comprehensive understanding of animal immunity, especially that of invertebrates, comparative immunology helps toward management of invertebrates that are food sources, agricultural pests, pathogens, or transmit diseases, and helps interpret the evolution of animal immunity. Initial studies showed that the Mollusca (second largest animal phylum), and invertebrates in general, possess innate defenses but lack the lymphocytic immune system that characterizes vertebrate immunology. Recognizing the reality of both common and taxon-specific immune features, and applying up-to-date cell and molecular research capabilities, in-depth studies of a select number of bivalve and gastropod species continue to reveal novel aspects of molluscan immunity. The genomics era heralded a new stage of comparative immunology; large-scale efforts yielded an initial set of full molluscan genome sequences that is available for analyses of full complements of immune genes and regulatory sequences. Next-generation sequencing (NGS), due to lower cost and effort required, allows individual researchers to generate large sequence datasets for growing numbers of molluscs. RNAseq provides expression profiles that enable discovery of immune genes and genome sequences reveal distribution and diversity of immune factors across molluscan phylogeny. Although computational de novo sequence assembly will benefit from continued development and automated annotation may require some experimental validation, NGS is a powerful tool for comparative immunology, especially increasing coverage of the extensive molluscan diversity. To date, immunogenomics revealed new levels of complexity of molluscan defense by indicating sequence heterogeneity in individual snails and bivalves, and members of expanded immune gene families are expressed differentially to generate pathogen-specific defense responses. Copyright © 2017 Elsevier Ltd. All rights reserved.
The resurrection genome of Boea hygrometrica: A blueprint for survival of dehydration.
Xiao, Lihong; Yang, Ge; Zhang, Liechi; Yang, Xinhua; Zhao, Shuang; Ji, Zhongzhong; Zhou, Qing; Hu, Min; Wang, Yu; Chen, Ming; Xu, Yu; Jin, Haijing; Xiao, Xuan; Hu, Guipeng; Bao, Fang; Hu, Yong; Wan, Ping; Li, Legong; Deng, Xin; Kuang, Tingyun; Xiang, Chengbin; Zhu, Jian-Kang; Oliver, Melvin J; He, Yikun
2015-05-05
"Drying without dying" is an essential trait in land plant evolution. Unraveling how a unique group of angiosperms, the Resurrection Plants, survive desiccation of their leaves and roots has been hampered by the lack of a foundational genome perspective. Here we report the ∼1,691-Mb sequenced genome of Boea hygrometrica, an important resurrection plant model. The sequence revealed evidence for two historical genome-wide duplication events, a compliment of 49,374 protein-coding genes, 29.15% of which are unique (orphan) to Boea and 20% of which (9,888) significantly respond to desiccation at the transcript level. Expansion of early light-inducible protein (ELIP) and 5S rRNA genes highlights the importance of the protection of the photosynthetic apparatus during drying and the rapid resumption of protein synthesis in the resurrection capability of Boea. Transcriptome analysis reveals extensive alternative splicing of transcripts and a focus on cellular protection strategies. The lack of desiccation tolerance-specific genome organizational features suggests the resurrection phenotype evolved mainly by an alteration in the control of dehydration response genes.
Stiebens, Victor A; Merino, Sonia E; Chain, Frédéric J J; Eizaguirre, Christophe
2013-04-30
In evolutionary and conservation biology, parasitism is often highlighted as a major selective pressure. To fight against parasites and pathogens, genetic diversity of the immune genes of the major histocompatibility complex (MHC) are particularly important. However, the extensive degree of polymorphism observed in these genes makes it difficult to conduct thorough population screenings. We utilized a genotyping protocol that uses 454 amplicon sequencing to characterize the MHC class I in the endangered loggerhead sea turtle (Caretta caretta) and to investigate their evolution at multiple relevant levels of organization. MHC class I genes revealed signatures of trans-species polymorphism across several reptile species. In the studied loggerhead turtle individuals, it results in the maintenance of two ancient allelic lineages. We also found that individuals carrying an intermediate number of MHC class I alleles are larger than those with either a low or high number of alleles. Multiple modes of evolution seem to maintain MHC diversity in the loggerhead turtles, with relatively high polymorphism for an endangered species.
Extensive concerted evolution of rice paralogs and the road to regaining independence.
Wang, Xiyin; Tang, Haibao; Bowers, John E; Feltus, Frank A; Paterson, Andrew H
2007-11-01
Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the approximately 0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, approximately 8% of japonica paralogs produced 5-7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while approximately 70-MY-old "paleologs" resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice-sorghum divergence approximately 41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity--that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5-7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization.
Yasukochi, Yoshiki; Satta, Yoko
2015-03-25
The human cytochrome P450 (CYP) 2D6 gene is a member of the CYP2D gene subfamily, along with the CYP2D7P and CYP2D8P pseudogenes. Although the CYP2D6 enzyme has been studied extensively because of its clinical importance, the evolution of the CYP2D subfamily has not yet been fully understood. Therefore, the goal of this study was to reveal the evolutionary process of the human drug metabolic system. Here, we investigate molecular evolution of the CYP2D subfamily in primates by comparing 14 CYP2D sequences from humans to New World monkey genomes. Window analysis and statistical tests revealed that entire genomic sequences of paralogous genes were extensively homogenized by gene conversion during molecular evolution of CYP2D genes in primates. A neighbor-joining tree based on genomic sequences at the nonsubstrate recognition sites showed that CYP2D6 and CYP2D8 genes were clustered together due to gene conversion. In contrast, a phylogenetic tree using amino acid sequences at substrate recognition sites did not cluster the CYP2D6 and CYP2D8 genes, suggesting that the functional constraint on substrate specificity is one of the causes for purifying selection at the substrate recognition sites. Our results suggest that the CYP2D gene subfamily in primates has evolved to maintain the regioselectivity for a substrate hydroxylation activity between individual enzymes, even though extensive gene conversion has occurred across CYP2D coding sequences. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Yasukochi, Yoshiki; Satta, Yoko
2015-01-01
The human cytochrome P450 (CYP) 2D6 gene is a member of the CYP2D gene subfamily, along with the CYP2D7P and CYP2D8P pseudogenes. Although the CYP2D6 enzyme has been studied extensively because of its clinical importance, the evolution of the CYP2D subfamily has not yet been fully understood. Therefore, the goal of this study was to reveal the evolutionary process of the human drug metabolic system. Here, we investigate molecular evolution of the CYP2D subfamily in primates by comparing 14 CYP2D sequences from humans to New World monkey genomes. Window analysis and statistical tests revealed that entire genomic sequences of paralogous genes were extensively homogenized by gene conversion during molecular evolution of CYP2D genes in primates. A neighbor-joining tree based on genomic sequences at the nonsubstrate recognition sites showed that CYP2D6 and CYP2D8 genes were clustered together due to gene conversion. In contrast, a phylogenetic tree using amino acid sequences at substrate recognition sites did not cluster the CYP2D6 and CYP2D8 genes, suggesting that the functional constraint on substrate specificity is one of the causes for purifying selection at the substrate recognition sites. Our results suggest that the CYP2D gene subfamily in primates has evolved to maintain the regioselectivity for a substrate hydroxylation activity between individual enzymes, even though extensive gene conversion has occurred across CYP2D coding sequences. PMID:25808902
Origin of secondary potash deposits; a case from Miocene evaporites of NW Central Iran
NASA Astrophysics Data System (ADS)
Rahimpour-Bonab, H.; Kalantarzadeh, Z.
2005-04-01
In early Miocene times, an extensive carbonate shelf developed in Central Iran and during several cycles of sea-level fluctuations, evaporite-bearing carbonate sequences of the Qom Formation were deposited. However, in the early-middle Miocene, development of restricted marine conditions led to a facies change from shelf carbonates of the Qom Formation to the evaporite series of the M 1 member of the overlying Lower Red Formation. This member is a facies mosaic of lagoonal and salina evaporites (mainly halite beds) admixed with wadi siliciclastics. The purpose of this study, which focuses on two salt mines in the northwestern portion of Central Iran in the Zanjan province, was to reveal the origin, sedimentary environment, and diagenesis of these potash-bearing evaporite sequences. Petrographic examination revealed the following mineral assemblage: halite, gypsum, anhydrite and carnallite as primary precipitates, and langbeinite and aphthitalite as secondary metamorphic potash salts. In the Iljaq mine, distorted halite beds are dominated by burial and deformational textures and a great deal of secondary potash salts. In the Qarah-Aghaje mine, however, the bedded halite shows pristine primary textures and is devoid of the secondary potash salts. High bromine content of most evaporite minerals suggests their marine origin, and confirms the absence of the extensive meteoric alterations and subsequent bromine depletions. Potash salts are mainly secondary, and resulted from diagenetic replacements of distorted halite beds during thermal and dynamic metamorphism in a burial setting.
Gruber, Barry L.; Couto, Ana Rita; Armas, Jácome Bruges; Brown, Matthew A.; Finzel, Kathleen; Terkeltaub, Robert A.
2015-01-01
This report describes a 32-year-old woman presenting since childhood with progressive calcium pyrophosphate disease (CPPD), characterized by severe arthropathy and chondrocalcinosis involving multiple peripheral joints and intervertebral disks. Because ANKH mutations have been previously described in familial CPPD, the proband’s DNA was assessed at this locus by direct sequencing of promoter and coding regions and revealed 3 sequence variants in ANKH. Sequences of exon 1 revealed a novel isolated nonsynonymous mutation (c.13 C>T), altering amino acid in codon 5 from proline to serine (CCG>TCG). Sequencing of parental DNA revealed an identical mutation in the proband’s father but not the mother. Subsequent clinical evaluation demonstrated extensive chondrocalcinosis and degenerative arthropathy in the proband’s father. In summary, we report a novel mutation, not previously described, in ANKH exon 1, wherein serine replaces proline, in a case of early-onset severe CPPD associated with metabolic abnormalities, with similar findings in the proband’s father. PMID:22647861
Gruber, Barry L; Couto, Ana Rita; Armas, Jácome Bruges; Brown, Matthew A; Finzel, Kathleen; Terkeltaub, Robert A
2012-06-01
This report describes a 32-year-old woman presenting since childhood with progressive calcium pyrophosphate disease (CPPD), characterized by severe arthropathy and chondrocalcinosis involving multiple peripheral joints and intervertebral disks. Because ANKH mutations have been previously described in familial CPPD, the proband's DNA was assessed at this locus by direct sequencing of promoter and coding regions and revealed 3 sequence variants in ANKH. Sequences of exon 1 revealed a novel isolated nonsynonymous mutation (c.13 C>T), altering amino acid in codon 5 from proline to serine (CCG>TCG). Sequencing of parental DNA revealed an identical mutation in the proband's father but not the mother. Subsequent clinical evaluation demonstrated extensive chondrocalcinosis and degenerative arthropathy in the proband's father. In summary, we report a novel mutation, not previously described, in ANKH exon 1, wherein serine replaces proline, in a case of early-onset severe CPPD associated with metabolic abnormalities, with similar findings in the proband's father.
Wu, Jia Qian; Du, Jiang; Rozowsky, Joel; Zhang, Zhengdong; Urban, Alexander E; Euskirchen, Ghia; Weissman, Sherman; Gerstein, Mark; Snyder, Michael
2008-01-03
Recent studies of the mammalian transcriptome have revealed a large number of additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced. We have interrogated the transcribed loci in 420 selected ENCyclopedia Of DNA Elements (ENCODE) regions using rapid amplification of cDNA ends (RACE) sequencing. We analyzed annotated known gene regions, but primarily we focused on novel transcriptionally active regions (TARs), which were previously identified by high-density oligonucleotide tiling arrays and on random regions that were not believed to be transcribed. We found RACE sequencing to be very sensitive and were able to detect low levels of transcripts in specific cell types that were not detectable by microarrays. We also observed many instances of sense-antisense transcripts; further analysis suggests that many of the antisense transcripts (but not all) may be artifacts generated from the reverse transcription reaction. Our results show that the majority of the novel TARs analyzed (60%) are connected to other novel TARs or known exons. Of previously unannotated random regions, 17% were shown to produce overlapping transcripts. Furthermore, it is estimated that 9% of the novel transcripts encode proteins. We conclude that RACE sequencing is an efficient, sensitive, and highly accurate method for characterization of the transcriptome of specific cell/tissue types. Using this method, it appears that much of the genome is represented in polyA+ RNA. Moreover, a fraction of the novel RNAs can encode protein and are likely to be functional.
Song, Yang; Zhang, Yong; Fan, Qin; Cui, Hui; Yan, Dongmei; Zhu, Shuangli; Tang, Haishu; Sun, Qiang; Wang, Dongyan; Xu, Wenbo
2017-02-23
Human enterovirus B106 (EV-B106) is a new member of the enterovirus B species. To date, only three nucleotide sequences of EV-B106 have been published, and only one full-length genome sequence (the Yunnan strain 148/YN/CHN/12) is available in the GenBank database. In this study, we conducted phylogenetic characterisation of four EV-B106 strains isolated in Xinjiang, China. Pairwise comparisons of the nucleotide sequences and the deduced amino acid sequences revealed that the four Xinjiang EV-B106 strains had only 80.5-80.8% nucleotide identity and 95.4-97.3% amino acid identity with the Yunnan EV-B106 strain, indicating high mutagenicity. Similarity plots and bootscanning analyses revealed that frequent intertypic recombination occurred in all four Xinjiang EV-B106 strains in the non-structural region. These four strains may share a donor sequence with the EV-B85 strain, which circulated in Xinjiang in 2011, indicating extensive genetic exchanges between these strains. All Xinjiang EV-B106 strains were temperature-sensitive. An antibody seroprevalence study against EV-B106 in two Xinjiang prefectures also showed low titres of neutralizing antibodies, suggesting limited exposure and transmission in the population. This study contributes the whole genome sequences of EV-B106 to the GenBank database and provides valuable information regarding the molecular epidemiology of EV-B106 in China.
Song, Yang; Zhang, Yong; Fan, Qin; Cui, Hui; Yan, Dongmei; Zhu, Shuangli; Tang, Haishu; Sun, Qiang; Wang, Dongyan; Xu, Wenbo
2017-01-01
Human enterovirus B106 (EV-B106) is a new member of the enterovirus B species. To date, only three nucleotide sequences of EV-B106 have been published, and only one full-length genome sequence (the Yunnan strain 148/YN/CHN/12) is available in the GenBank database. In this study, we conducted phylogenetic characterisation of four EV-B106 strains isolated in Xinjiang, China. Pairwise comparisons of the nucleotide sequences and the deduced amino acid sequences revealed that the four Xinjiang EV-B106 strains had only 80.5–80.8% nucleotide identity and 95.4–97.3% amino acid identity with the Yunnan EV-B106 strain, indicating high mutagenicity. Similarity plots and bootscanning analyses revealed that frequent intertypic recombination occurred in all four Xinjiang EV-B106 strains in the non-structural region. These four strains may share a donor sequence with the EV-B85 strain, which circulated in Xinjiang in 2011, indicating extensive genetic exchanges between these strains. All Xinjiang EV-B106 strains were temperature-sensitive. An antibody seroprevalence study against EV-B106 in two Xinjiang prefectures also showed low titres of neutralizing antibodies, suggesting limited exposure and transmission in the population. This study contributes the whole genome sequences of EV-B106 to the GenBank database and provides valuable information regarding the molecular epidemiology of EV-B106 in China. PMID:28230168
Fanali, Gabriella; Ascenzi, Paolo; Bernardi, Giorgio; Fasano, Mauro
2012-01-01
Serum albumin (SA) is a circulating protein providing a depot and carrier for many endogenous and exogenous compounds. At least seven major binding sites have been identified by structural and functional investigations mainly in human SA. SA is conserved in vertebrates, with at least 49 entries in protein sequence databases. The multiple sequence analysis of this set of entries leads to the definition of a cladistic tree for the molecular evolution of SA orthologs in vertebrates, thus showing the clustering of the considered species, with lamprey SAs (Lethenteron japonicum and Petromyzon marinus) in a separate outgroup. Sequence analysis aimed at searching conserved domains revealed that most SA sequences are made up by three repeated domains (about 600 residues), as extensively characterized for human SA. On the contrary, lamprey SAs are giant proteins (about 1400 residues) comprising seven repeated domains. The phylogenetic analysis of the SA family reveals a stringent correlation with the taxonomic classification of the species available in sequence databases. A focused inspection of the sequences of ligand binding sites in SA revealed that in all sites most residues involved in ligand binding are conserved, although the versatility towards different ligands could be peculiar of higher organisms. Moreover, the analysis of molecular links between the different sites suggests that allosteric modulation mechanisms could be restricted to higher vertebrates.
Diversity and Evolution of Mycobacterium tuberculosis: Moving to Whole-Genome-Based Approaches
Niemann, Stefan; Supply, Philip
2014-01-01
Genotyping of clinical Mycobacterium tuberculosis complex (MTBC) strains has become a standard tool for epidemiological tracing and for the investigation of the local and global strain population structure. Of special importance is the analysis of the expansion of multidrug (MDR) and extensively drug-resistant (XDR) strains. Classical genotyping and, more recently, whole-genome sequencing have revealed that the strains of the MTBC are more diverse than previously anticipated. Globally, several phylogenetic lineages can be distinguished whose geographical distribution is markedly variable. Strains of particular (sub)lineages, such as Beijing, seem to be more virulent and associated with enhanced resistance levels and fitness, likely fueling their spread in certain world regions. The upcoming generalization of whole-genome sequencing approaches will expectedly provide more comprehensive insights into the molecular and epidemiological mechanisms involved and lead to better diagnostic and therapeutic tools. PMID:25190252
Sequencing of Seven Haloarchaeal Genomes Reveals Patterns of Genomic Flux
Lynch, Erin A.; Langille, Morgan G. I.; Darling, Aaron; Wilbanks, Elizabeth G.; Haltiner, Caitlin; Shao, Katie S. Y.; Starr, Michael O.; Teiling, Clotilde; Harkins, Timothy T.; Edwards, Robert A.; Eisen, Jonathan A.; Facciotti, Marc T.
2012-01-01
We report the sequencing of seven genomes from two haloarchaeal genera, Haloferax and Haloarcula. Ease of cultivation and the existence of well-developed genetic and biochemical tools for several diverse haloarchaeal species make haloarchaea a model group for the study of archaeal biology. The unique physiological properties of these organisms also make them good candidates for novel enzyme discovery for biotechnological applications. Seven genomes were sequenced to ∼20×coverage and assembled to an average of 50 contigs (range 5 scaffolds - 168 contigs). Comparisons of protein-coding gene compliments revealed large-scale differences in COG functional group enrichment between these genera. Analysis of genes encoding machinery for DNA metabolism reveals genera-specific expansions of the general transcription factor TATA binding protein as well as a history of extensive duplication and horizontal transfer of the proliferating cell nuclear antigen. Insights gained from this study emphasize the importance of haloarchaea for investigation of archaeal biology. PMID:22848480
Park, Tae-Ho; Park, Beom-Seok; Kim, Jin-A; Hong, Joon Ki; Jin, Mina; Seol, Young-Joo; Mun, Jeong-Hwan
2011-01-01
As a part of the Multinational Genome Sequencing Project of Brassica rapa, linkage group R9 and R3 were sequenced using a bacterial artificial chromosome (BAC) by BAC strategy. The current physical contigs are expected to cover approximately 90% euchromatins of both chromosomes. As the project progresses, BAC selection for sequence extension becomes more limited because BAC libraries are restriction enzyme-specific. To support the project, a random sheared fosmid library was constructed. The library consists of 97536 clones with average insert size of approximately 40 kb corresponding to seven genome equivalents, assuming a Chinese cabbage genome size of 550 Mb. The library was screened with primers designed at the end of sequences of nine points of scaffold gaps where BAC clones cannot be selected to extend the physical contigs. The selected positive clones were end-sequenced to check the overlap between the fosmid clones and the adjacent BAC clones. Nine fosmid clones were selected and fully sequenced. The sequences revealed two completed gap filling and seven sequence extensions, which can be used for further selection of BAC clones confirming that the fosmid library will facilitate the sequence completion of B. rapa. Copyright © 2011. Published by Elsevier Ltd.
Length and sequence variability in mitochondrial control region of the milkfish, Chanos chanos.
Ravago, Rachel G; Monje, Virginia D; Juinio-Meñez, Marie Antonette
2002-01-01
Extensive length variability was observed in the mitochondrial control region of the milkfish, Chanos chanos. The nucleotide sequence of the control region and flanking regions was determined. Length variability and heteroplasmy was due to the presence of varying numbers of a 41-bp tandemly repeated sequence and a 48-bp insertion/deletion (indel). The structure and organization of the milkfish control region is similar to that of other teleost fish and vertebrates. However, extensive variation in the copy number of tandem repeats (4-20 copies) and the presence of a relatively large (48-bp) indel, are apparently uncommon in teleost fish control region sequences reported to date. High sequence variability of control region peripheral domains indicates the potential utility of selected regions as markers for population-level studies.
Subtype Distribution of Blastocystis Isolates in Sebha, Libya
Abdulsalam, Awatif M.; Ithoi, Init; Al-Mekhlafi, Hesham M.; Al-Mekhlafi, Abdulsalam M.; Ahmed, Abdulhamid; Surin, Johari
2013-01-01
Background Blastocystis is a genetically diverse and a common intestinal parasite of humans with a controversial pathogenic potential. This study was carried out to identify the Blastocystis subtypes and their association with demographic and socioeconomic factors among outpatients living in Sebha city, Libya. Methods/Findings Blastocystis in stool samples were cultured followed by isolation, PCR amplification of a partial SSU rDNA gene, cloning, and sequencing. The DNA sequences of isolated clones showed 98.3% to 100% identity with the reference Blastocystis isolates from the Genbank. Multiple sequence alignment showed polymorphism from one to seven base substitution and/or insertion/deletion in several groups of non-identical nucleotides clones. Phylogenetic analysis revealed three assemblage subtypes (ST) with ST1 as the most prevalent (51.1%) followed by ST2 (24.4%), ST3 (17.8%) and mixed infections of two concurrent subtypes (6.7%). Blastocystis ST1 infection was significantly associated with female (P = 0.009) and low educational level (P = 0.034). ST2 was also significantly associated with low educational level (P= 0.008) and ST3 with diarrhoea (P = 0.008). Conclusion Phylogenetic analysis of Libyan Blastocystis isolates identified three different subtypes; with ST1 being the predominant subtype and its infection was significantly associated with female gender and low educational level. More extensive studies are needed in order to relate each Blastocystis subtype with clinical symptoms and potential transmission sources in this community. PMID:24376805
Subtype distribution of Blastocystis isolates in Sebha, Libya.
Abdulsalam, Awatif M; Ithoi, Init; Al-Mekhlafi, Hesham M; Al-Mekhlafi, Abdulsalam M; Ahmed, Abdulhamid; Surin, Johari
2013-01-01
Blastocystis is a genetically diverse and a common intestinal parasite of humans with a controversial pathogenic potential. This study was carried out to identify the Blastocystis subtypes and their association with demographic and socioeconomic factors among outpatients living in Sebha city, Libya. Blastocystis in stool samples were cultured followed by isolation, PCR amplification of a partial SSU rDNA gene, cloning, and sequencing. The DNA sequences of isolated clones showed 98.3% to 100% identity with the reference Blastocystis isolates from the Genbank. Multiple sequence alignment showed polymorphism from one to seven base substitution and/or insertion/deletion in several groups of non-identical nucleotides clones. Phylogenetic analysis revealed three assemblage subtypes (ST) with ST1 as the most prevalent (51.1%) followed by ST2 (24.4%), ST3 (17.8%) and mixed infections of two concurrent subtypes (6.7%). ST1 infection was significantly associated with female (P = 0.009) and low educational level (P = 0.034). ST2 was also significantly associated with low educational level (P= 0.008) and ST3 with diarrhoea (P = 0.008). Phylogenetic analysis of Libyan Blastocystis isolates identified three different subtypes; with ST1 being the predominant subtype and its infection was significantly associated with female gender and low educational level. More extensive studies are needed in order to relate each Blastocystis subtype with clinical symptoms and potential transmission sources in this community.
Zhang, Ran; Yin, Yinliang; Zhang, Yujun; Li, Kexin; Zhu, Hongxia; Gong, Qin; Wang, Jianwu; Hu, Xiaoxiang; Li, Ning
2012-01-01
As the number of transgenic livestock increases, reliable detection and molecular characterization of transgene integration sites and copy number are crucial not only for interpreting the relationship between the integration site and the specific phenotype but also for commercial and economic demands. However, the ability of conventional PCR techniques to detect incomplete and multiple integration events is limited, making it technically challenging to characterize transgenes. Next-generation sequencing has enabled cost-effective, routine and widespread high-throughput genomic analysis. Here, we demonstrate the use of next-generation sequencing to extensively characterize cattle harboring a 150-kb human lactoferrin transgene that was initially analyzed by chromosome walking without success. Using this approach, the sites upstream and downstream of the target gene integration site in the host genome were identified at the single nucleotide level. The sequencing result was verified by event-specific PCR for the integration sites and FISH for the chromosomal location. Sequencing depth analysis revealed that multiple copies of the incomplete target gene and the vector backbone were present in the host genome. Upon integration, complex recombination was also observed between the target gene and the vector backbone. These findings indicate that next-generation sequencing is a reliable and accurate approach for the molecular characterization of the transgene sequence, integration sites and copy number in transgenic species. PMID:23185606
Båge, Tove; Lagervall, Maria; Jansson, Leif; Lundeberg, Joakim; Yucel-Lindberg, Tülay
2012-01-01
Periodontitis is a chronic inflammatory disease affecting the soft tissue and bone that surrounds the teeth. Despite extensive research, distinctive genes responsible for the disease have not been identified. The objective of this study was to elucidate transcriptome changes in periodontitis, by investigating gene expression profiles in gingival tissue obtained from periodontitis-affected and healthy gingiva from the same patient, using RNA-sequencing. Gingival biopsies were obtained from a disease-affected and a healthy site from each of 10 individuals diagnosed with periodontitis. Enrichment analysis performed among uniquely expressed genes for the periodontitis-affected and healthy tissues revealed several regulated pathways indicative of inflammation for the periodontitis-affected condition. Hierarchical clustering of the sequenced biopsies demonstrated clustering according to the degree of inflammation, as observed histologically in the biopsies, rather than clustering at the individual level. Among the top 50 upregulated genes in periodontitis-affected tissues, we investigated two genes which have not previously been demonstrated to be involved in periodontitis. These included interferon regulatory factor 4 and chemokine (C-C motif) ligand 18, which were also expressed at the protein level in gingival biopsies from patients with periodontitis. In conclusion, this study provides a first step towards a quantitative comprehensive insight into the transcriptome changes in periodontitis. We demonstrate for the first time site-specific local variation in gene expression profiles of periodontitis-affected and healthy tissues obtained from patients with periodontitis, using RNA-seq. Further, we have identified novel genes expressed in periodontitis tissues, which may constitute potential therapeutic targets for future treatment strategies of periodontitis. PMID:23029519
Putaporntip, Chaturong; Jongwutiwes, Somchai; Thongaree, Siriporn; Seethamchai, Sunee; Grynberg, Priscila; Hughes, Austin L.
2010-01-01
Although malaria parasites infecting non-human primates are important models for human malaria, little is known of the ecology of infection by these parasites in the wild. We extensively sequenced cytochrome b (cytb) of malaria parasites (Apicomplexa: Haemosporida) from free-living Southeast Asian monkeys Macaca nemestrina and M. fascicularis. The two most commonly observed taxa were P. inui and Hepatocystis sp., but certain other sequences did not cluster closely with any previously sequenced species. Most of the major clades of parasites were found in both Macaca species; and the two most commonly occurring parasite infected the two Macaca species at approximately equal levels. However, P. inui showed evidence of genetic differentiation between the populations infecting the two Macaca species, suggesting limited movement of this parasite among hosts. Moreover, coinfection with Plasmodium and Hepatocystis species occurred significantly less frequently than expected on the basis of the rates of infection with either taxon alone, suggesting the possibility of competitive exclusion. The results revealed unexpectedly complex communities of Plasmodium and Hepatocystis taxa infecting wild Southeast Asian monkeys. Parasite taxa differed with respect to both the frequency of between-host movement and their frequency of coinfection. PMID:20646216
Gamalinda, Michael; Woolford, John L
2014-11-01
Numerous ribosomal proteins have a striking bipartite architecture: a globular body positioned on the ribosomal exterior and an internal loop buried deep into the rRNA core. In eukaryotes, a significant number of conserved r-proteins have evolved extra amino- or carboxy-terminal tail sequences, which thread across the solvent-exposed surface. The biological importance of these extended domains remains to be established. In this study, we have investigated the universally conserved internal loop and the eukaryote-specific extensions of yeast L4. We show that in contrast to findings with bacterial L4, deleting the internal loop of yeast L4 causes severely impaired growth and reduced levels of large ribosomal subunits. We further report that while depleting the entire L4 protein blocks early assembly steps in yeast, deletion of only its extended internal loop affects later steps in assembly, revealing a second role for L4 during ribosome biogenesis. Surprisingly, deletion of the entire eukaryote-specific carboxy-terminal tail of L4 has no effect on viability, production of 60S subunits, or translation. These unexpected observations provide impetus to further investigate the functions of ribosomal protein extensions, especially eukaryote-specific examples, in ribosome assembly and function. © 2014 Gamalinda and Woolford; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Gómez, Africa; Serra, Manuel; Carvalho, Gary R; Lunt, David H
2002-07-01
Continental lake-dwelling zooplanktonic organisms have long been considered cosmopolitan species with little geographic variation in spite of the isolation of their habitats. Evidence of morphological cohesiveness and high dispersal capabilities support this interpretation. However, this view has been challenged recently as many such species have been shown either to comprise cryptic species complexes or to exhibit marked population genetic differentiation and strong phylogeographic structuring at a regional scale. Here we investigate the molecular phylogeny of the cosmopolitan passively dispersing rotifer Brachionus plicatilis (Rotifera: Monogononta) species complex using nucleotide sequence variation from both nuclear (ribosomal internal transcribed spacer 1, ITS1) and mitochondrial (cytochrome c oxidase subunit I, COI) genes. Analysis of rotifer resting eggs from 27 salt lakes in the Iberian Peninsula plus lakes from four continents revealed nine genetically divergent lineages. The high level of sequence divergence, absence of hybridization, and extensive sympatry observed support the specific status of these lineages. Sequence divergence estimates indicate that the B. plicatilis complex began diversifying many millions of years ago, yet has showed relatively high levels of morphological stasis. We discuss these results in relation to the ecology and genetics of aquatic invertebrates possessing dispersive resting propagules and address the apparent contradiction between zooplanktonic population structure and their morphological stasis.
Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming
2015-01-01
Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming
2015-01-01
Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
Extensive Concerted Evolution of Rice Paralogs and the Road to Regaining Independence
Wang, Xiyin; Tang, Haibao; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.
2007-01-01
Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the ∼0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, ∼8% of japonica paralogs produced 5–7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while ∼70-MY-old “paleologs” resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice–sorghum divergence ∼41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity—that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5–7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization. PMID:18039882
Miller, Emily J.; Neaves, Linda E.; Zenger, Kyall R.; Herbert, Catherine A.
2017-01-01
The tammar wallaby (Notamacropus eugenii) is one of the most intensively studied of all macropodids and was the first Australasian marsupial to have its genome sequenced. However, comparatively little is known about genetic diversity and differentiation amongst the morphologically distinct allopatric populations of tammar wallabies found in Western (WA) and South Australia (SA). Here we compare autosomal and Y-linked microsatellite genotypes, as well as sequence data (~600 bp) from the mitochondrial DNA (mtDNA) control region (CR) in tammar wallabies from across its distribution. Levels of diversity at autosomal microsatellite loci were typically high in the WA mainland and Kangaroo Island (SA) populations (A = 8.9–10.6; He = 0.77–0.78) but significantly reduced in other endemic island populations (A = 3.8–4.1; He = 0.41–0.48). Autosomal and Y-linked microsatellite loci revealed a pattern of significant differentiation amongst populations, especially between SA and WA. The Kangaroo Island and introduced New Zealand population showed limited differentiation. Multiple divergent mtDNA CR haplotypes were identified within both SA and WA populations. The CR haplotypes of tammar wallabies from SA and WA show reciprocal monophyly and are highly divergent (14.5%), with levels of sequence divergence more typical of different species. Within WA tammar wallabies, island populations each have unique clusters of highly related CR haplotypes and each is most closely related to different WA mainland haplotypes. Y-linked microsatellite haplotypes show a similar pattern of divergence although levels of diversity are lower. In light of these differences, we suggest that two subspecies of tammar wallaby be recognized; Notamacropus eugenii eugenii in SA and N. eugenii derbianus in WA. The extensive neutral genetic diversity and inter-population differentiation identified within tammar wallabies should further increase the species value and usefulness as a model organism. PMID:28257440
Du, Pufeng; Wang, Lusheng
2014-01-01
One of the fundamental tasks in biology is to identify the functions of all proteins to reveal the primary machinery of a cell. Knowledge of the subcellular locations of proteins will provide key hints to reveal their functions and to understand the intricate pathways that regulate biological processes at the cellular level. Protein subcellular location prediction has been extensively studied in the past two decades. A lot of methods have been developed based on protein primary sequences as well as protein-protein interaction network. In this paper, we propose to use the protein-protein interaction network as an infrastructure to integrate existing sequence based predictors. When predicting the subcellular locations of a given protein, not only the protein itself, but also all its interacting partners were considered. Unlike existing methods, our method requires neither the comprehensive knowledge of the protein-protein interaction network nor the experimentally annotated subcellular locations of most proteins in the protein-protein interaction network. Besides, our method can be used as a framework to integrate multiple predictors. Our method achieved 56% on human proteome in absolute-true rate, which is higher than the state-of-the-art methods. PMID:24466278
Izquierdo, Javier A; Sizova, Maria V; Lynd, Lee R
2010-06-01
The enrichment from nature of novel microbial communities with high cellulolytic activity is useful in the identification of novel organisms and novel functions that enhance the fundamental understanding of microbial cellulose degradation. In this work we identify predominant organisms in three cellulolytic enrichment cultures with thermophilic compost as an inoculum. Community structure based on 16S rRNA gene clone libraries featured extensive representation of clostridia from cluster III, with minor representation of clostridial clusters I and XIV and a novel Lutispora species cluster. Our studies reveal different levels of 16S rRNA gene diversity, ranging from 3 to 18 operational taxonomic units (OTUs), as well as variability in community membership across the three enrichment cultures. By comparison, glycosyl hydrolase family 48 (GHF48) diversity analyses revealed a narrower breadth of novel clostridial genes associated with cultured and uncultured cellulose degraders. The novel GHF48 genes identified in this study were related to the novel clostridia Clostridium straminisolvens and Clostridium clariflavum, with one cluster sharing as little as 73% sequence similarity with the closest known relative. In all, 14 new GHF48 gene sequences were added to the known diversity of 35 genes from cultured species.
Characterization of a highly polymorphic region 5′ to JH in the human immunoglobulin heavy chain
Silva, Alcino J.; Johnson, John P.; White, Raymond L.
1987-01-01
A cloned DNA segment 1.25 kilobases (kb) upstream from the joining segments of the human heavy chain immunoglobulin gene revealed extensive polymorphic variation at this locus, and the polymorphic pattern was stably transmitted to the next generation. Genomic restriction analysis showed that the polymorphism was caused by insertions/deletions within an MspI/BamHI fragment. Sequencing of one allele, 848 base pairs (bp) long, revealed eleven 50-base-pair tandem repeats. A second allele, 648 bp long, was cloned from a human genomic cosmid library, sequenced, and found to contain four fewer repeats than the first allele. A survey of 186 chromosomes from unrelated individuals of primarily northern European descent revealed at least six alleles. Images PMID:2884636
The UK10K project identifies rare variants in health and disease.
Walter, Klaudia; Min, Josine L; Huang, Jie; Crooks, Lucy; Memari, Yasin; McCarthy, Shane; Perry, John R B; Xu, ChangJiang; Futema, Marta; Lawson, Daniel; Iotchkova, Valentina; Schiffels, Stephan; Hendricks, Audrey E; Danecek, Petr; Li, Rui; Floyd, James; Wain, Louise V; Barroso, Inês; Humphries, Steve E; Hurles, Matthew E; Zeggini, Eleftheria; Barrett, Jeffrey C; Plagnol, Vincent; Richards, J Brent; Greenwood, Celia M T; Timpson, Nicholas J; Durbin, Richard; Soranzo, Nicole
2015-10-01
The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results.
Bowen, Christopher D.; Renner, Daniel W.; Shreve, Jacob T.; Tafuri, Yolanda; Payne, Kimberly M.; Dix, Richard D.; Kinchington, Paul R.; Gatherer, Derek; Szpara, Moriah L.
2016-01-01
Herpes simplex virus 1 (HSV-1) is a widespread global pathogen, of which the strain KOS is one of the most extensively studied. Previous sequence studies revealed that KOS does not cluster with other strains of North American geographic origin, but instead clustered with Asian strains. We sequenced a historical isolate of the original KOS strain, called KOS63, along with a separately isolated strain attributed to the same source individual, termed KOS79. Genomic analyses revealed that KOS63 closely resembled other recently sequenced isolates of KOS and was of Asian origin, but that KOS79 was a genetically unrelated strain that clustered in genetic distance analyses with HSV-1 strains of North American/European origin. These data suggest that the human source of KOS63 and KOS79 could have been infected with two genetically unrelated strains of disparate geographic origins. A PCR RFLP test was developed for rapid identification of these strains. PMID:26950505
Bowen, Christopher D; Renner, Daniel W; Shreve, Jacob T; Tafuri, Yolanda; Payne, Kimberly M; Dix, Richard D; Kinchington, Paul R; Gatherer, Derek; Szpara, Moriah L
2016-05-01
Herpes simplex virus 1 (HSV-1) is a widespread global pathogen, of which the strain KOS is one of the most extensively studied. Previous sequence studies revealed that KOS does not cluster with other strains of North American geographic origin, but instead clustered with Asian strains. We sequenced a historical isolate of the original KOS strain, called KOS63, along with a separately isolated strain attributed to the same source individual, termed KOS79. Genomic analyses revealed that KOS63 closely resembled other recently sequenced isolates of KOS and was of Asian origin, but that KOS79 was a genetically unrelated strain that clustered in genetic distance analyses with HSV-1 strains of North American/European origin. These data suggest that the human source of KOS63 and KOS79 could have been infected with two genetically unrelated strains of disparate geographic origins. A PCR RFLP test was developed for rapid identification of these strains. Copyright © 2016 Elsevier Inc. All rights reserved.
Effects of the HN gene c-terminal extensions on the Newcastle disease virus virulence
USDA-ARS?s Scientific Manuscript database
The hemagglutinin-neuraminidase (HN) of Newcastle disease virus (NDV) is a multifunctional protein that has receptor recognition, neuraminidase and fusion promotion activities. Sequence analysis revealed that the HN gene of many extremely low virulence NDV strains encodes a larger open reading frame...
Robertson, L. J.; Hermansen, L.; Gjerde, B. K.; Strand, E.; Alvsvåg, J. O.; Langeland, N.
2006-01-01
During the autumn and winter of 2004 and 2005, an extensive outbreak of waterborne giardiasis occurred in Bergen, Norway. Over 1,500 patients were diagnosed with giardiasis. Analysis of water from the implicated source revealed low numbers of Giardia cysts, but the initial contamination event probably occurred up to 10 weeks previously. While sewage leakage from a residential area is now considered to be the probable source of contamination, during the episode waste from one particular septic tank was thought to be a possible source. Genotyping of cysts from the septic tank demonstrated that they were assemblage A cysts, although the sequences were not identical to any previously published sequences. For the β-giardin gene, the closest published subgenotype was subgenotype A3; for the gdh gene, the closest published subgenotype was subgenotype A2. Genotyping of cysts from 21 patient samples revealed that they were assemblage B cysts; thus, the septic tank was unlikely to be the contamination source. Sequencing of the β-giardin and gdh genes from patient samples and a comparison of the sequences gave complex results. For the β-giardin gene, three isolates had sequences identical to subgenotype B3 sequences. However, other isolates had between one and four single-nucleotide polymorphisms (SNPs). For the gdh gene, none of the sequences were identical to the sequence published for subgenotype B3, and the sequences had between one and three SNPs. One isolate, which was identical to subgenotype B3 at the β-giardin gene, was more similar to subgenotype B2 at the gdh gene. Grouping the isolates on the basis of SNPs resulted in different groups for the two genes. The results are discussed in relation to giardiasis in Norway and to other Giardia genotyping studies. PMID:16517674
Sequence-Dependent Persistence Length of Long DNA
NASA Astrophysics Data System (ADS)
Chuang, Hui-Min; Reifenberger, Jeffrey G.; Cao, Han; Dorfman, Kevin D.
2017-12-01
Using a high-throughput genome-mapping approach, we obtained circa 50 million measurements of the extension of internal human DNA segments in a 41 nm ×41 nm nanochannel. The underlying DNA sequences, obtained by mapping to the reference human genome, are 2.5-393 kilobase pairs long and contain percent GC contents between 32.5% and 60%. Using Odijk's theory for a channel-confined wormlike chain, these data reveal that the DNA persistence length increases by almost 20% as the percent GC content increases. The increased persistence length is rationalized by a model, containing no adjustable parameters, that treats the DNA as a statistical terpolymer with a sequence-dependent intrinsic persistence length and a sequence-independent electrostatic persistence length.
Laopichienpong, Nararat; Muangmai, Narongrit; Supikamolseni, Arrjaree; Twilprawat, Panupon; Chanhome, Lawan; Suntrarachun, Sunutcha; Peyachoknagul, Surin; Srikulnath, Kornsorn
2016-12-15
DNA barcodes of mitochondrial cytochrome c oxidase I (COI), cytochrome b (Cytb) genes, and their combined data sets were constructed from 35 snake species in Thailand. No barcoding gap was detected in either of the two genes from the observed intra- and interspecific sequence divergences. Intra- and interspecific sequence divergences of the COI gene differed 14 times, with barcode cut-off scores ranging over 2%-4% for threshold values differentiated among most of the different species; the Cytb gene differed 6 times with cut-off scores ranging over 2%-6%. Thirty-five specific nucleotide mutations were also found at interspecific level in the COI gene, identifying 18 snake species, but no specific nucleotide mutation was observed for Cytb in any single species. This suggests that COI barcoding was a better marker than Cytb. Phylogenetic clustering analysis indicated that most species were represented by monophyletic clusters, suggesting that these snake species could be clearly differentiated using COI barcodes. However, the two-marker combination of both COI and Cytb was more effective, differentiating snake species by over 2%-4%, and reducing species numbers in the overlap value between intra- and interspecific divergences. Three species delimitation algorithms (general mixed Yule-coalescent, automatic barcoding gap detection, and statistical parsimony network analysis) were extensively applied to a wide range of snakes based on both barcodes. This revealed cryptic diversity for eleven snake species in Thailand. In addition, eleven accessions from the database previously grouped under the same species were represented at different species level, suggesting either high genetic diversity, or the misidentification of these sequences in the database as a consequence of cryptic species. Copyright © 2016 Elsevier B.V. All rights reserved.
Identification of novel Theileria genotypes from Grant's gazelle
Hooge, Janis; Howe, Laryssa; Ezenwa, Vanessa O.
2015-01-01
Blood samples collected from Grant's gazelles (Nanger granti) in Kenya were screened for hemoparasites using a combination of microscopic and molecular techniques. All 69 blood smears examined by microscopy were positive for hemoparasites. In addition, Theileria/Babesia DNA was detected in all 65 samples screened by PCR for a ~450-base pair fragment of the V4 hypervariable region of the 18S rRNA gene. Sequencing and BLAST analysis of a subset of PCR amplicons revealed widespread co-infection (25/39) and the existence of two distinct Grant's gazelle Theileria subgroups. One group of 11 isolates clustered as a subgroup with previously identified Theileria ovis isolates from small ruminants from Europe, Asia and Africa; another group of 3 isolates clustered with previously identified Theileria spp. isolates from other African antelope. Based on extensive levels of sequence divergence (1.2–2%) from previously reported Theileria species within Kenya and worldwide, the Theileria isolates detected in Grant's gazelles appear to represent at least two novel Theileria genotypes. PMID:25973394
Identification of novel Theileria genotypes from Grant's gazelle.
Hooge, Janis; Howe, Laryssa; Ezenwa, Vanessa O
2015-08-01
Blood samples collected from Grant's gazelles (Nanger granti) in Kenya were screened for hemoparasites using a combination of microscopic and molecular techniques. All 69 blood smears examined by microscopy were positive for hemoparasites. In addition, Theileria/Babesia DNA was detected in all 65 samples screened by PCR for a ~450-base pair fragment of the V4 hypervariable region of the 18S rRNA gene. Sequencing and BLAST analysis of a subset of PCR amplicons revealed widespread co-infection (25/39) and the existence of two distinct Grant's gazelle Theileria subgroups. One group of 11 isolates clustered as a subgroup with previously identified Theileria ovis isolates from small ruminants from Europe, Asia and Africa; another group of 3 isolates clustered with previously identified Theileria spp. isolates from other African antelope. Based on extensive levels of sequence divergence (1.2-2%) from previously reported Theileria species within Kenya and worldwide, the Theileria isolates detected in Grant's gazelles appear to represent at least two novel Theileria genotypes.
Tertiary structural propensities reveal fundamental sequence/structure relationships.
Zheng, Fan; Zhang, Jian; Grigoryan, Gevorg
2015-05-05
Extracting useful generalizations from the continually growing Protein Data Bank (PDB) is of central importance. We hypothesize that the PDB contains valuable quantitative information on the level of local tertiary structural motifs (TERMs). We show that by breaking a protein structure into its constituent TERMs, and querying the PDB to characterize the natural ensemble matching each, we can estimate the compatibility of the structure with a given amino acid sequence through a metric we term "structure score." Considering submissions from recent Critical Assessment of Structure Prediction (CASP) experiments, we found a strong correlation (R = 0.69) between structure score and model accuracy, with poorly predicted regions readily identifiable. This performance exceeds that of leading atomistic statistical energy functions. Furthermore, TERM-based analysis of two prototypical multi-state proteins rapidly produced structural insights fully consistent with prior extensive experimental studies. We thus find that TERM-based analysis should have considerable utility for protein structural biology. Copyright © 2015 Elsevier Ltd. All rights reserved.
Paraskevis, D; Magiorkinis, M; Vandamme, A M; Kostrikis, L G; Hatzakis, A
2001-03-01
Human immunodeficiency virus type 1 (HIV-1) has been classified into three main groups and 11 distinct subtypes. Moreover, several circulating recombinant forms (CRFs) of HIV-1 have been recently documented to have spread widely causing extensive HIV-1 epidemics. A subtype, initially designated I (CRF04_cpx), was documented in Cyprus and Greece and was found to comprise regions of sequence derived from subtypes A and G as well as regions of unclassified sequence. Re-analysis of the three full-length CRF04_cpx sequences that were available revealed a mosaic genomic organization of unique complexity comprising regions of sequence from at least five distinct subtypes, A, G, H, K and unclassified regions. These strains account for approximately 2% of the total HIV-1-infected population in Greece, thus providing evidence of the great capability of HIV-1 to recombine and produce highly divergent strains which can be spread successfully through different infection routes.
DeMaere, Matthew Z.
2016-01-01
Background Chromosome conformation capture, coupled with high throughput DNA sequencing in protocols like Hi-C and 3C-seq, has been proposed as a viable means of generating data to resolve the genomes of microorganisms living in naturally occuring environments. Metagenomic Hi-C and 3C-seq datasets have begun to emerge, but the feasibility of resolving genomes when closely related organisms (strain-level diversity) are present in the sample has not yet been systematically characterised. Methods We developed a computational simulation pipeline for metagenomic 3C and Hi-C sequencing to evaluate the accuracy of genomic reconstructions at, above, and below an operationally defined species boundary. We simulated datasets and measured accuracy over a wide range of parameters. Five clustering algorithms were evaluated (2 hard, 3 soft) using an adaptation of the extended B-cubed validation measure. Results When all genomes in a sample are below 95% sequence identity, all of the tested clustering algorithms performed well. When sequence data contains genomes above 95% identity (our operational definition of strain-level diversity), a naive soft-clustering extension of the Louvain method achieves the highest performance. Discussion Previously, only hard-clustering algorithms have been applied to metagenomic 3C and Hi-C data, yet none of these perform well when strain-level diversity exists in a metagenomic sample. Our simple extension of the Louvain method performed the best in these scenarios, however, accuracy remained well below the levels observed for samples without strain-level diversity. Strain resolution is also highly dependent on the amount of available 3C sequence data, suggesting that depth of sequencing must be carefully considered during experimental design. Finally, there appears to be great scope to improve the accuracy of strain resolution through further algorithm development. PMID:27843713
Resolving the tips of the tree of life: How much mitochondrialdata doe we need?
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bonett, Ronald M.; Macey, J. Robert; Boore, Jeffrey L.
2005-04-29
Mitochondrial (mt) DNA sequences are used extensively to reconstruct evolutionary relationships among recently diverged animals,and have constituted the most widely used markers for species- and generic-level relationships for the last decade or more. However, most studies to date have employed relatively small portions of the mt-genome. In contrast, complete mt-genomes primarily have been used to investigate deep divergences, including several studies of the amount of mt sequence necessary to recover ancient relationships. We sequenced and analyzed 24 complete mt-genomes from a group of salamander species exhibiting divergences typical of those in many species-level studies. We present the first comprehensive investigationmore » of the amount of mt sequence data necessary to consistently recover the mt-genome tree at this level, using parsimony and Bayesian methods. Both methods of phylogenetic analysis revealed extremely similar results. A surprising number of well supported, yet conflicting, relationships were found in trees based on fragments less than {approx}2000 nucleotides (nt), typical of the vast majority of the thousands of mt-based studies published to date. Large amounts of data (11,500+ nt) were necessary to consistently recover the whole mt-genome tree. Some relationships consistently were recovered with fragments of all sizes, but many nodes required the majority of the mt-genome to stabilize, particularly those associated with short internal branches. Although moderate amounts of data (2000-3000 nt) were adequate to recover mt-based relationships for which most nodes were congruent with the whole mt-genome tree, many thousands of nucleotides were necessary to resolve rapid bursts of evolution. Recent advances in genomics are making collection of large amounts of sequence data highly feasible, and our results provide the basis for comparative studies of other closely related groups to optimize mt sequence sampling and phylogenetic resolution at the ''tips'' of the Tree of Life.« less
Xiong, Haoyu; Barker, Stephen C.; Burger, Thomas D.; Raoult, Didier; Shao, Renfu
2013-01-01
The typical mitochondrial (mt) genomes of bilateral animals consist of 37 genes on a single circular chromosome. The mt genomes of the human body louse, Pediculus humanus, and the human head louse, Pediculus capitis, however, are extensively fragmented and contain 20 minichromosomes, with one to three genes on each minichromosome. Heteroplasmy, i.e. nucleotide polymorphisms in the mt genome within individuals, has been shown to be significantly higher in the mt cox1 gene of human lice than in humans and other animals that have the typical mt genomes. To understand whether the extent of heteroplasmy in human lice is associated with mt genome fragmentation, we sequenced the entire coding regions of all of the mt minichromosomes of six human body lice and six human head lice from Ethiopia, China and France with an Illumina HiSeq platform. For comparison, we also sequenced the entire coding regions of the mt genomes of seven species of ticks, which have the typical mitochondrial genome organization of bilateral animals. We found that the level of heteroplasmy varies significantly both among the human lice and among the ticks. The human lice from Ethiopia have significantly higher level of heteroplasmy than those from China and France (Pt<0.05). The tick, Amblyomma cajennense, has significantly higher level of heteroplasmy than other ticks (Pt<0.05). Our results indicate that heteroplasmy level can be substantially variable within a species and among closely related species, and does not appear to be determined by single factors such as genome fragmentation. PMID:24058467
Xiong, Haoyu; Barker, Stephen C; Burger, Thomas D; Raoult, Didier; Shao, Renfu
2013-01-01
The typical mitochondrial (mt) genomes of bilateral animals consist of 37 genes on a single circular chromosome. The mt genomes of the human body louse, Pediculus humanus, and the human head louse, Pediculus capitis, however, are extensively fragmented and contain 20 minichromosomes, with one to three genes on each minichromosome. Heteroplasmy, i.e. nucleotide polymorphisms in the mt genome within individuals, has been shown to be significantly higher in the mt cox1 gene of human lice than in humans and other animals that have the typical mt genomes. To understand whether the extent of heteroplasmy in human lice is associated with mt genome fragmentation, we sequenced the entire coding regions of all of the mt minichromosomes of six human body lice and six human head lice from Ethiopia, China and France with an Illumina HiSeq platform. For comparison, we also sequenced the entire coding regions of the mt genomes of seven species of ticks, which have the typical mitochondrial genome organization of bilateral animals. We found that the level of heteroplasmy varies significantly both among the human lice and among the ticks. The human lice from Ethiopia have significantly higher level of heteroplasmy than those from China and France (Pt<0.05). The tick, Amblyomma cajennense, has significantly higher level of heteroplasmy than other ticks (Pt<0.05). Our results indicate that heteroplasmy level can be substantially variable within a species and among closely related species, and does not appear to be determined by single factors such as genome fragmentation.
Complete genome sequence of lymphocystis disease virus isolated from China.
Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang
2004-07-01
Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1.
Complete Genome Sequence of Lymphocystis Disease Virus Isolated from China
Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang
2004-01-01
Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1. PMID:15194775
Bayne, Charlie F; Widawski, Max E; Gao, Feng; Masab, Mohammed H; Chattopadhyay, Maitreyi; Murawski, Allison M; Sansevere, Robert M; Lerner, Bryan D; Castillo, Rinaldys J; Griesman, Trevor; Fu, Jiantao; Hibben, Jennifer K; Garcia-Perez, Alma D; Simon, Anne E; Kushner, David B
2018-07-01
Noncoding RNAs use their sequence and/or structure to mediate function(s). The 5' portion (166 nt) of the 356-nt noncoding satellite RNA C (satC) of Turnip crinkle virus (TCV) was previously modeled to contain a central region with two stem-loops (H6 and H7) and a large connecting hairpin (H2). We now report that in vivo functional selection (SELEX) experiments assessing sequence/structure requirements in H2, H6, and H7 reveal that H6 loop sequence motifs were recovered at nonrandom rates and only some residues are proposed to base-pair with accessible complementary sequences within the 5' central region. In vitro SHAPE of SELEX winners indicates that the central region is heavily base-paired, such that along with the lower stem and H2 region, one extensive hairpin exists composing the entire 5' region. As these SELEX winners are highly fit, these characteristics facilitate satRNA amplification in association with TCV in plants. Copyright © 2018 Elsevier Inc. All rights reserved.
Mitochondrial DNA variation of indigenous goats in Narok and Isiolo counties of Kenya.
Kibegwa, F M; Githui, K E; Jung'a, J O; Badamana, M S; Nyamu, M N
2016-06-01
Phylogenetic relationships among and genetic variability within 60 goats from two different indigenous breeds in Narok and Isiolo counties in Kenya and 22 published goat samples were analysed using mitochondrial control region sequences. The results showed that there were 54 polymorphic sites in a 481-bp sequence and 29 haplotypes were determined. The mean haplotype diversity and nucleotide diversity were 0.981 ± 0.006 and 0.019 ± 0.001, respectively. The phylogenetic analysis in combination with goat haplogroup reference sequences from GenBank showed that all goat sequences were clustered into two haplogroups (A and G), of which haplogroup A was the commonest in the two populations. A very high percentage (99.90%) of the genetic variation was distributed within the regions, and a smaller percentage (0.10%) distributed among regions as revealed by the analysis of molecular variance (amova). This amova results showed that the divergence between regions was not statistically significant. We concluded that the high levels of intrapopulation diversity in Isiolo and Narok goats and the weak phylogeographic structuring suggested that there existed strong gene flow among goat populations probably caused by extensive transportation of goats in history. © 2015 Blackwell Verlag GmbH.
Zeng, Xu; Yuan, Zhengrong; Tong, Xin; Li, Qiushi; Gao, Weiwei; Qin, Minjian; Liu, Zhihua
2012-05-01
Oryzoideae (Poaceae) plants have economic and ecological value. However, the phylogenetic position of some plants is not clear, such as Hygroryza aristata (Retz.) Nees. and Porteresia coarctata (Roxb.) Tateoka (syn. Oryza coarctata). Comprehensive molecular phylogenetic studies have been carried out on many genera in the Poaceae. The different DNA sequences, including nuclear and chloroplast sequences, had been extensively employed to determine relationships at both higher and lower taxonomic levels in the Poaceae. Chloroplast DNA ndhF gene and atpB-rbcL spacer were used to construct phylogenetic trees and estimate the divergence time of Oryzoideae, Bambusoideae, Panicoideae, Pooideae and so on. Complete sequences of atpB-rbcL and ndhF were generated for 17 species representing six species of the Oryzoideae and related subfamilies. Nicotiana tabacum L. was the outgroup species. The two DNA datasets were analyzed, using Maximum Parsimony and Bayesian analysis methods. The molecular phylogeny revealed that H. aristata (Retz.) Nees was the sister to Chikusichloa aquatica Koidz. Moreover, P. coarctata (Roxb.) Tateoka was in the genus Oryza. Furthermore, the result of evolution analysis, which based on the ndhF marker, indicated that the time of origin of Oryzoideae might be 31 million years ago.
Iterative refinement of structure-based sequence alignments by Seed Extension
Kim, Changhoon; Tai, Chin-Hsien; Lee, Byungkook
2009-01-01
Background Accurate sequence alignment is required in many bioinformatics applications but, when sequence similarity is low, it is difficult to obtain accurate alignments based on sequence similarity alone. The accuracy improves when the structures are available, but current structure-based sequence alignment procedures still mis-align substantial numbers of residues. In order to correct such errors, we previously explored the possibility of replacing the residue-based dynamic programming algorithm in structure alignment procedures with the Seed Extension algorithm, which does not use a gap penalty. Here, we describe a new procedure called RSE (Refinement with Seed Extension) that iteratively refines a structure-based sequence alignment. Results RSE uses SE (Seed Extension) in its core, which is an algorithm that we reported recently for obtaining a sequence alignment from two superimposed structures. The RSE procedure was evaluated by comparing the correctly aligned fractions of residues before and after the refinement of the structure-based sequence alignments produced by popular programs. CE, DaliLite, FAST, LOCK2, MATRAS, MATT, TM-align, SHEBA and VAST were included in this analysis and the NCBI's CDD root node set was used as the reference alignments. RSE improved the average accuracy of sequence alignments for all programs tested when no shift error was allowed. The amount of improvement varied depending on the program. The average improvements were small for DaliLite and MATRAS but about 5% for CE and VAST. More substantial improvements have been seen in many individual cases. The additional computation times required for the refinements were negligible compared to the times taken by the structure alignment programs. Conclusion RSE is a computationally inexpensive way of improving the accuracy of a structure-based sequence alignment. It can be used as a standalone procedure following a regular structure-based sequence alignment or to replace the traditional iterative refinement procedures based on residue-level dynamic programming algorithm in many structure alignment programs. PMID:19589133
The Oral Microbiota in Health and Disease: An Overview of Molecular Findings.
Siqueira, José F; Rôças, Isabela N
2017-01-01
Culture-independent nucleic acid technologies have been extensively applied to the analysis of oral bacterial communities associated with healthy and diseased conditions. These methods have confirmed and substantially expanded the findings from culture studies to reveal the oral microbial inhabitants and candidate pathogens associated with the major oral diseases. Over 1000 bacterial distinct species-level taxa have been identified in the oral cavity and studies using next-generation DNA sequencing approaches indicate that the breadth of bacterial diversity may be even much larger. Nucleic acid technologies have also been helpful in profiling bacterial communities and identifying disease-related patterns. This chapter provides an overview of the diversity and taxonomy of oral bacteria associated with health and disease.
Systematic Characterization and Comparative Analysis of the Rabbit Immunoglobulin Repertoire
Lavinder, Jason J.; Hoi, Kam Hon; Reddy, Sai T.; Wine, Yariv; Georgiou, George
2014-01-01
Rabbits have been used extensively as a model system for the elucidation of the mechanism of immunoglobulin diversification and for the production of antibodies. We employed Next Generation Sequencing to analyze Ig germline V and J gene usage, CDR3 length and amino acid composition, and gene conversion frequencies within the functional (transcribed) IgG repertoire of the New Zealand white rabbit (Oryctolagus cuniculus). Several previously unannotated rabbit heavy chain variable (VH) and light chain variable (VL) germline elements were deduced bioinformatically using multidimensional scaling and k-means clustering methods. We estimated the gene conversion frequency in the rabbit at 23% of IgG sequences with a mean gene conversion tract length of 59±36 bp. Sequencing and gene conversion analysis of the chicken, human, and mouse repertoires revealed that gene conversion occurs much more extensively in the chicken (frequency 70%, tract length 79±57 bp), was observed to a small, yet statistically significant extent in humans, but was virtually absent in mice. PMID:24978027
Cox, Laura A; Glenn, Jeremy P; Spradling, Kimberly D; Nijland, Mark J; Garcia, Roy; Nathanielsz, Peter W; Ford, Stephen P
2012-06-15
The pregnant sheep has provided seminal insights into reproduction related to animal and human development (ovarian function, fertility, implantation, fetal growth, parturition and lactation). Fetal sheep physiology has been extensively studied since 1950, contributing significantly to the basis for our understanding of many aspects of fetal development and behaviour that remain in use in clinical practice today. Understanding mechanisms requires the combination of systems approaches uniquely available in fetal sheep with the power of genomic studies. Absence of the full range of sheep genomic resources has limited the full realization of the power of this model, impeding progress in emerging areas of pregnancy biology such as developmental programming. We have examined the expressed fetal sheep heart transcriptome using high-throughput sequencing technologies. In so doing we identified 36,737 novel transcripts and describe genes, gene variants and pathways relevant to fundamental developmental mechanisms. Genes with the highest expression levels and with novel exons in the fetal heart transcriptome are known to play central roles in muscle development. We show that high-throughput sequencing methods can generate extensive transcriptome information in the absence of an assembled and annotated genome for that species. The gene sequence data obtained provide a unique genomic resource for sheep specific genetic technology development and, combined with the polymorphism data, augment annotation and assembly of the sheep genome. In addition, identification and pathway analysis of novel fetal sheep heart transcriptome splice variants is a first step towards revealing mechanisms of genetic variation and gene environment interactions during fetal heart development.
Cox, Laura A; Glenn, Jeremy P; Spradling, Kimberly D; Nijland, Mark J; Garcia, Roy; Nathanielsz, Peter W; Ford, Stephen P
2012-01-01
The pregnant sheep has provided seminal insights into reproduction related to animal and human development (ovarian function, fertility, implantation, fetal growth, parturition and lactation). Fetal sheep physiology has been extensively studied since 1950, contributing significantly to the basis for our understanding of many aspects of fetal development and behaviour that remain in use in clinical practice today. Understanding mechanisms requires the combination of systems approaches uniquely available in fetal sheep with the power of genomic studies. Absence of the full range of sheep genomic resources has limited the full realization of the power of this model, impeding progress in emerging areas of pregnancy biology such as developmental programming. We have examined the expressed fetal sheep heart transcriptome using high-throughput sequencing technologies. In so doing we identified 36,737 novel transcripts and describe genes, gene variants and pathways relevant to fundamental developmental mechanisms. Genes with the highest expression levels and with novel exons in the fetal heart transcriptome are known to play central roles in muscle development. We show that high-throughput sequencing methods can generate extensive transcriptome information in the absence of an assembled and annotated genome for that species. The gene sequence data obtained provide a unique genomic resource for sheep specific genetic technology development and, combined with the polymorphism data, augment annotation and assembly of the sheep genome. In addition, identification and pathway analysis of novel fetal sheep heart transcriptome splice variants is a first step towards revealing mechanisms of genetic variation and gene environment interactions during fetal heart development. PMID:22508961
Zn-metalloprotease sequences in extremophiles
NASA Astrophysics Data System (ADS)
Holden, T.; Dehipawala, S.; Golebiewska, U.; Cheung, E.; Tremberger, G., Jr.; Williams, E.; Schneider, P.; Gadura, N.; Lieberman, D.; Cheung, T.
2010-09-01
The Zn-metalloprotease family contains conserved amino acid structures such that the nucleotide fluctuation at the DNA level would exhibit correlated randomness as described by fractal dimension. A nucleotide sequence fractal dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide. The structure's vibration modes can also be studied using a Gaussian Network Model. The vibration measure and fractal dimension values form a two-dimensional plot with a standard vector metric that can be used for comparison of structures. The preference for amino acid usage in extremophiles may suppress nucleotide fluctuations that could be analyzed in terms of fractal dimension and Shannon entropy. A protein level cold adaptation study of the thermolysin Zn-metalloprotease family using molecular dynamics simulation was reported recently and our results show that the associated nucleotide fluctuation suppression is consistent with a regression pattern generated from the sequences's fractal dimension and entropy values (R-square { 0.98, N =5). It was observed that cold adaptation selected for high entropy and low fractal dimension values. Extension to the Archaemetzincin M54 family in extremophiles reveals a similar regression pattern (R-square = 0.98, N = 6). It was observed that the metalloprotease sequences of extremely halophilic organisms possess high fractal dimension and low entropy values as compared with non-halophiles. The zinc atom is usually bonded to the histidine residue, which shows limited levels of vibration in the Gaussian Network Model. The variability of the fractal dimension and entropy for a given protein structure suggests that extremophiles would have evolved after mesophiles, consistent with the bias usage of non-prebiotic amino acids by extremophiles. It may be argued that extremophiles have the capacity to offer extinction protection during drastic changes in astrobiological environments.
Konami, Y; Yamamoto, K; Osawa, T; Irimura, T
1995-04-01
The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).
Qu, Chun-Pu; Xu, Zhi-Ru; Liu, Guan-Jun; Liu, Chun; Li, Yang; Wei, Zhi-Gang; Liu, Gui-Feng
2010-01-01
In aerobic organisms, protection against oxidative damage involves the combined action of highly specialized antioxidant enzymes, such as copper-zinc superoxide dismutase. In this work, a cDNA clone which encodes a copper-zinc superoxide dismutase gene, named PS-CuZnSOD, has been identified from P. sibiricum Laxm. by the rapid amplification of cDNA ends method (RACE). Analysis of the nucleotide sequence reveals that the PS-CuZnSOD gene cDNA clone consists of 669 bp, containing 87 bp in the 5' untranslated region; 459 bp in the open reading frame (ORF) encoding 152 amino acids; and 123 bp in 3' untranslated region. The gene accession nucleotide sequence number in GenBank is GQ472846. Sequence analysis indicates that the protein, like most plant superoxide dismutases (SOD), includes two conserved ecCuZnSOD signatures that are from the amino acids 43 to 51, and from the amino acids 137 to 148, and it has a signal peptide extension in the front of the N-terminus (1-16 aa). Expression analysis by real-time quantitative PCR reveals that the PS-CuZnSOD gene is expressed in leaves, stems and underground stems. PS-CuZnSOD gene expression can be induced by 3% NaHCO(3). The different mRNA levels' expression of PS-CuZnSOD show the gene's different expression modes in leaves, stems and underground stems under the salinity-alkalinity stress.
Transposon diversity in Arabidopsis thaliana
Le, Quang Hien; Wright, Stephen; Yu, Zhihui; Bureau, Thomas
2000-01-01
Recent availability of extensive genome sequence information offers new opportunities to analyze genome organization, including transposon diversity and accumulation, at a level of resolution that was previously unattainable. In this report, we used sequence similarity search and analysis protocols to perform a fine-scale analysis of a large sample (≈17.2 Mb) of the Arabidopsis thaliana (Columbia) genome for transposons. Consistent with previous studies, we report that the A. thaliana genome harbors diverse representatives of most known superfamilies of transposons. However, our survey reveals a higher density of transposons of which over one-fourth could be classified into a single novel transposon family designated as Basho, which appears unrelated to any previously known superfamily. We have also identified putative transposase-coding ORFs for miniature inverted-repeat transposable elements (MITEs), providing clues into the mechanism of mobility and origins of the most abundant transposons associated with plant genes. In addition, we provide evidence that most mined transposons have a clear distribution preference for A + T-rich sequences and show that structural variation for many mined transposons is partly due to interelement recombination. Taken together, these findings further underscore the complexity of transposons within the compact genome of A. thaliana. PMID:10861007
Characterization of the human gene (TBXAS1) encoding thromboxane synthase.
Miyata, A; Yokoyama, C; Ihara, H; Bandoh, S; Takeda, O; Takahashi, E; Tanabe, T
1994-09-01
The gene encoding human thromboxane synthase (TBXAS1) was isolated from a human EMBL3 genomic library using human platelet thromboxane synthase cDNA as a probe. Nucleotide sequencing revealed that the human thromboxane synthase gene spans more than 75 kb and consists of 13 exons and 12 introns, of which the splice donor and acceptor sites conform to the GT/AG rule. The exon-intron boundaries of the thromboxane synthase gene were similar to those of the human cytochrome P450 nifedipine oxidase gene (CYP3A4) except for introns 9 and 10, although the primary sequences of these enzymes exhibited 35.8% identity each other. The 1.2-kb of the 5'-flanking region sequence contained potential binding sites for several transcription factors (AP-1, AP-2, GATA-1, CCAAT box, xenobiotic-response element, PEA-3, LF-A1, myb, basic transcription element and cAMP-response element). Primer-extension analysis indicated the multiple transcription-start sites, and the major start site was identified as an adenine residue located 142 bases upstream of the translation-initiation site. However, neither a typical TATA box nor a typical CAAT box is found within the 100-b upstream of the translation-initiation site. Southern-blot analysis revealed the presence of one copy of the thromboxane synthase gene per haploid genome. Furthermore, a fluorescence in situ hybridization study revealed that the human gene for thromboxane synthase is localized to band q33-q34 of the long arm of chromosome 7. A tissue-distribution study demonstrated that thromboxane synthase mRNA is widely expressed in human tissues and is particularly abundant in peripheral blood leukocyte, spleen, lung and liver. The low but significant levels of mRNA were observed in kidney, placenta and thymus.
Gu, Lei-Lei; Li, Xin-Hua; Han, Yue; Zhang, Dong-Hua; Gong, Qi-Ming; Zhang, Xin-Xin
2014-02-25
Glycogen storage disease type Ia (GSD-Ia) is an autosomal recessive genetic disorder resulting in hypoglycemia, hepatomegaly and growth retardation. It is caused by mutations in the G6PC gene encoding Glucose-6-phosphatase. To date, over 80 mutations have been identified in the G6PC gene. Here we reported a novel mutation found in a Chinese patient with abnormal transaminases, hypoglycemia, hepatomegaly and short stature. Direct sequencing of the coding region and splicing-sites in the G6PC gene revealed a novel no-stop mutation, p.*358Yext*43, leading to a 43 amino-acid extension of G6Pase. The expression level of mutant G6Pase transcripts was only 7.8% relative to wild-type transcripts. This mutation was not found in 120 chromosomes from 60 unrelated healthy control subjects using direct sequencing, and was further confirmed by digestion with Rsa I restriction endonuclease. In conclusion, we revealed a novel no-stop mutation in this study which expands the spectrum of mutations in the G6PC gene. The molecular genetic analysis was indispensable to the diagnosis of GSD-Ia for the patient. Copyright © 2013 Elsevier B.V. All rights reserved.
Wang, Tao; Chen, Zuqin; Xie, Yue; Hou, Rong; Wu, Qidun; Gu, Xiaobing; Lai, Weiming; Peng, Xuerong; Yang, Guangyou
2015-06-25
Cryptosporidium spp. have been extensively reported to cause significant diarrheal disease in humans and domestic animals. On the contrary, little information is available on the prevalence and characterization of Cryptosporidium in wild animals in China, especially in giant pandas. The aim of the present study was to detect Cryptosporidium infections and identify Cryptosporidium species at the molecular level in both captive and wild giant pandas in Sichuan province, China. Using a PCR approach, we amplified and sequenced the 18S rRNA gene from 322 giant pandas fecal samples (122 from 122 captive individuals and 200 collected from four habitats) in Sichuan province, China. The Cryptosporidium species/genotypes were identified via a BLAST comparison against published Cryptosporidium sequences available in GenBank followed by phylogenetic analysis. The results revealed that both captive and wild giant pandas were infected with a single Cryptosporidium species, C. andersoni, at a prevalence of 15.6% (19/122) and 0.5% (1/200) in captive and wild giant pandas, respectively. The present study revealed the existence of C. andersoni in both captive and wild giant panda fecal samples for the first time, and also provided useful fundamental data for further research on the molecular epidemiology and control of Cryptosporidium infection in giant pandas.
ERIC Educational Resources Information Center
Hinchcliffe, Edward H.
2005-01-01
Cinemicrography--the capture of moving cellular sequences through the microscope--has been influential in revealing the dynamic nature of cellular behavior. One of the more dramatic cellular events is mitosis, the division of sister chromatids into two daughter cells. Mitosis has been extensively studied in a variety of organisms, both…
USDA-ARS?s Scientific Manuscript database
Aldehydes are major secondary lipid oxidation products (LOPs) from heating vegetable oils and deep frying. The routes and reactions that generate aldehydes have been extensively investigated, but the sequences and kinetics of their formation in oils are poorly defined. In this study, a platform comb...
Wang, Ping; Ingram-Smith, Cheryl; Hadley, Jill A.; Miller, Karen J.
1999-01-01
Periplasmic cyclic β-glucans of Rhizobium species provide important functions during plant infection and hypo-osmotic adaptation. In Sinorhizobium meliloti (also known as Rhizobium meliloti), these molecules are highly modified with phosphoglycerol and succinyl substituents. We have previously identified an S. meliloti Tn5 insertion mutant, S9, which is specifically impaired in its ability to transfer phosphoglycerol substituents to the cyclic β-glucan backbone (M. W. Breedveld, J. A. Hadley, and K. J. Miller, J. Bacteriol. 177:6346–6351, 1995). In the present study, we have cloned, sequenced, and characterized this mutation at the molecular level. By using the Tn5 flanking sequences (amplified by inverse PCR) as a probe, an S. meliloti genomic library was screened, and two overlapping cosmid clones which functionally complement S9 were isolated. A 3.1-kb HindIII-EcoRI fragment found in both cosmids was shown to fully complement mutant S9. Furthermore, when a plasmid containing this 3.1-kb fragment was used to transform Rhizobium leguminosarum bv. trifolii TA-1JH, a strain which normally synthesizes only neutral cyclic β-glucans, anionic glucans containing phosphoglycerol substituents were produced, consistent with the functional expression of an S. meliloti phosphoglycerol transferase gene. Sequence analysis revealed the presence of two major, overlapping open reading frames within the 3.1-kb fragment. Primer extension analysis revealed that one of these open reading frames, ORF1, was transcribed and its transcription was osmotically regulated. This novel locus of S. meliloti is designated the cgm (cyclic glucan modification) locus, and the product encoded by ORF1 is referred to as CgmB. PMID:10419956
Wan, Yizhen; Schwaninger, Heidi R; Baldo, Angela M; Labate, Joanne A; Zhong, Gan-Yuan; Simon, Charles J
2013-07-05
Grapes are one of the most economically important fruit crops. There are about 60 species in the genus Vitis. The phylogenetic relationships among these species are of keen interest for the conservation and use of this germplasm. We selected 309 accessions from 48 Vitis species,varieties, and outgroups, examined ~11 kb (~3.4 Mb total) of aligned nuclear DNA sequences from 27 unlinked genes in a phylogenetic context, and estimated divergence times based on fossil calibrations. Vitis formed a strongly supported clade. There was substantial support for species and less for the higher-level groupings (series). As estimated from extant taxa, the crown age of Vitis was 28 Ma and the divergence of subgenera (Vitis and Muscadinia) occurred at ~18 Ma. Higher clades in subgenus Vitis diverged 16 - 5 Ma with overlapping confidence intervals, and ongoing divergence formed extant species at 12 - 1.3 Ma. Several species had species-specific SNPs. NeighborNet analysis showed extensive reticulation at the core of subgenus Vitis representing the deeper nodes, with extensive reticulation radiating outward. Fitch Parsimony identified North America as the origin of the most recent common ancestor of extant Vitis species. Phylogenetic patterns suggested origination of the genus in North America, fragmentation of an ancestral range during the Miocene, formation of extant species in the late Miocene-Pleistocene, and differentiation of species in the context of Pliocene-Quaternary tectonic and climatic change. Nuclear SNPs effectively resolved relationships at and below the species level in grapes and rectified several misclassifications of accessions in the repositories. Our results challenge current higher-level classifications, reveal the abundance of genetic diversity in the genus that is potentially available for crop improvement, and provide a valuable resource for species delineation, germplasm conservation and use.
Combined pituitary hormone deficiency (CPHD) due to a complete PROP1 deletion.
Abrão, M G; Leite, M V; Carvalho, L R; Billerbeck, A E C; Nishi, M Y; Barbosa, A S; Martin, R M; Arnhold, I J P; Mendonca, B B
2006-09-01
PROP1 mutations are the most common cause of genetic combined pituitary hormone deficiency (CPHD). The aim of this study was to investigate the PROP1 gene in two siblings with CPHD. Pituitary function and imaging assessment and molecular analysis of PROP1. Two siblings, born to consanguineous parents, presented with GH deficiency associated with other pituitary hormone deficiencies (TSH, PRL and gonadotrophins). The male sibling also had an evolving cortisol deficiency. Pituitary size was evaluated by magnetic resonance imaging (MRI). PROP1 gene analysis was performed by polymerase chain reaction (PCR), automatic sequencing and Southern blotting. Amplification of sequence tag sites (STS) and the Q8N6H0 gene flanking PROP1 were performed to define the extension of PROP1 deletion. MRI revealed a hypoplastic anterior pituitary in the girl at 14 years and pituitary enlargement in the boy at 18 years. The PROP1 gene failed to amplify in both siblings, whereas other genes were amplified. Southern blotting analysis revealed the PROP1 band in the controls and confirmed complete PROP1 deletion in both siblings. The extension of the deletion was 18.4 kb. The region flanking PROP1 contains several Alu core sequences that might have facilitated stem-loop-mediated excision of PROP1. We report here a complete deletion of PROP1 in two siblings with CPHD phenotype.
Priest, Henry D; Fox, Samuel E; Rowley, Erik R; Murray, Jessica R; Michael, Todd P; Mockler, Todd C
2014-01-01
Brachypodium distachyon is a close relative of many important cereal crops. Abiotic stress tolerance has a significant impact on productivity of agriculturally important food and feedstock crops. Analysis of the transcriptome of Brachypodium after chilling, high-salinity, drought, and heat stresses revealed diverse differential expression of many transcripts. Weighted Gene Co-Expression Network Analysis revealed 22 distinct gene modules with specific profiles of expression under each stress. Promoter analysis implicated short DNA sequences directly upstream of module members in the regulation of 21 of 22 modules. Functional analysis of module members revealed enrichment in functional terms for 10 of 22 network modules. Analysis of condition-specific correlations between differentially expressed gene pairs revealed extensive plasticity in the expression relationships of gene pairs. Photosynthesis, cell cycle, and cell wall expression modules were down-regulated by all abiotic stresses. Modules which were up-regulated by each abiotic stress fell into diverse and unique gene ontology GO categories. This study provides genomics resources and improves our understanding of abiotic stress responses of Brachypodium.
NASA Astrophysics Data System (ADS)
Bertrand-Sarfati, Janine; Moussine-Pouchkine, Alexis
1988-08-01
The Atar Group, part of the Upper Proterozoic sequence covering the West African craton, stable since 2000 Ma, is characterized by an alternation of extensive carbonate beds and mixed siliciclastic and carbonate facies. The carbonate beds comprise essentially columnar stromatolite biostromes and bioherms which reflect sublittoral environments. The mixed facies contain a variety of laterally discontinuous facies which imply more variable environmental conditions. The settings of the mixed facies are not always clear but they do not contain thick sequences of high-energy facies. Few obvious facies sequences are discernable; those that are present are considered to be punctuated aggradational cycles (PACs) and they always start with biostromes of columnar stromatolites with very few sediments. Composite sequences are interpreted as due to shallowing upward or increasing energy environments that may be laterally contiguous, despite the fact that the contacts are not gradational. However, much of the stratigraphic sequence cannot be subdivided into cycles and seems to consist of unrelated individual facies, bound by sharp boundaries. The basin analysis reveals that biostromes of columnar stromatolites start after an instantaneous geological event corresponding to a sea-level rise. Consequently, their appearance can be considered as a time-line. We describe, in the Atar Group and its equivalents, three sedimentation trends, all of which are interpreted to be of shallowing upward character. The Atar Group appears to have been deposited in an epeiric sea (i.e. an extremely flat ramp). There are two contrasting styles of sedimentation: (1) after the submergence of the whole area, columnar stromatolites built extensive biostromes; (2) during the stable phase, sediments are deposited in a mosaic of laterally-discontinuous facies. Tidal influence cannot be recognized in the sequence, neither can a salinity increase toward the land; both common features in published epeiric sea models. A cratonic sedimentation area such as this is characterized by its size and flatness. Only during the stable phase of the cycle does small-scale topographic relief lead to deposition of a mosaic of facies. The sedimentation is storm- and wave-dominated.
Comparative systems biology across an evolutionary gradient within the Shewanella genus.
Konstantinidis, Konstantinos T; Serres, Margrethe H; Romine, Margaret F; Rodrigues, Jorge L M; Auchtung, Jennifer; McCue, Lee-Ann; Lipton, Mary S; Obraztsova, Anna; Giometti, Carol S; Nealson, Kenneth H; Fredrickson, James K; Tiedje, James M
2009-09-15
To what extent genotypic differences translate to phenotypic variation remains a poorly understood issue of paramount importance for several cornerstone concepts of microbiology including the species definition. Here, we take advantage of the completed genomic sequences, expressed proteomic profiles, and physiological studies of 10 closely related Shewanella strains and species to provide quantitative insights into this issue. Our analyses revealed that, despite extensive horizontal gene transfer within these genomes, the genotypic and phenotypic similarities among the organisms were generally predictable from their evolutionary relatedness. The power of the predictions depended on the degree of ecological specialization of the organisms evaluated. Using the gradient of evolutionary relatedness formed by these genomes, we were able to partly isolate the effect of ecology from that of evolutionary divergence and to rank the different cellular functions in terms of their rates of evolution. Our ranking also revealed that whole-cell protein expression differences among these organisms, when the organisms were grown under identical conditions, were relatively larger than differences at the genome level, suggesting that similarity in gene regulation and expression should constitute another important parameter for (new) species description. Collectively, our results provide important new information toward beginning a systems-level understanding of bacterial species and genera.
Guédon, Yann; d'Aubenton-Carafa, Yves; Thermes, Claude
2006-03-01
The most commonly used models for analysing local dependencies in DNA sequences are (high-order) Markov chains. Incorporating knowledge relative to the possible grouping of the nucleotides enables to define dedicated sub-classes of Markov chains. The problem of formulating lumpability hypotheses for a Markov chain is therefore addressed. In the classical approach to lumpability, this problem can be formulated as the determination of an appropriate state space (smaller than the original state space) such that the lumped chain defined on this state space retains the Markov property. We propose a different perspective on lumpability where the state space is fixed and the partitioning of this state space is represented by a one-to-many probabilistic function within a two-level stochastic process. Three nested classes of lumped processes can be defined in this way as sub-classes of first-order Markov chains. These lumped processes enable parsimonious reparameterizations of Markov chains that help to reveal relevant partitions of the state space. Characterizations of the lumped processes on the original transition probability matrix are derived. Different model selection methods relying either on hypothesis testing or on penalized log-likelihood criteria are presented as well as extensions to lumped processes constructed from high-order Markov chains. The relevance of the proposed approach to lumpability is illustrated by the analysis of DNA sequences. In particular, the use of lumped processes enables to highlight differences between intronic sequences and gene untranslated region sequences.
Pietan, Lucas L.; Spradling, Theresa A.
2016-01-01
In animals, mitochondrial DNA (mtDNA) typically occurs as a single circular chromosome with 13 protein-coding genes and 22 tRNA genes. The various species of lice examined previously, however, have shown mitochondrial genome rearrangements with a range of chromosome sizes and numbers. Our research demonstrates that the mitochondrial genomes of two species of chewing lice found on pocket gophers, Geomydoecus aurei and Thomomydoecus minor, are fragmented with the 1,536 base-pair (bp) cytochrome-oxidase subunit I (cox1) gene occurring as the only protein-coding gene on a 1,916–1,964 bp minicircular chromosome in the two species, respectively. The cox1 gene of T. minor begins with an atypical start codon, while that of G. aurei does not. Components of the non-protein coding sequence of G. aurei and T. minor include a tRNA (isoleucine) gene, inverted repeat sequences consistent with origins of replication, and an additional non-coding region that is smaller than the non-coding sequence of other lice with such fragmented mitochondrial genomes. Sequences of cox1 minichromosome clones for each species reveal extensive length and sequence heteroplasmy in both coding and noncoding regions. The highly variable non-gene regions of G. aurei and T. minor have little sequence similarity with one another except for a 19-bp region of phylogenetically conserved sequence with unknown function. PMID:27589589
Dallas, David C.; Guerrero, Andres; Khaldi, Nora; Castillo, Patricia A.; Martin, William F.; Smilowitz, Jennifer T.; Bevins, Charles L.; Barile, Daniela; German, J. Bruce; Lebrilla, Carlito B.
2013-01-01
Milk is traditionally considered an ideal source of the basic elemental nutrients required by infants. More detailed examination is revealing that milk represents a more functional ensemble of components with benefits to both infants and mothers. A comprehensive peptidomics method was developed and used to analyze human milk yielding an extensive array of protein products present in the fluid. Over 300 milk peptides were identified originating from major and many minor protein components of milk. As expected, the majority of peptides derived from β-casein, however no peptide fragments from the major milk proteins lactoferrin, α-lactalbumin and secretory immunoglobulin A were identified. Proteolysis in the mammary gland is selective—released peptides were drawn only from specific proteins and typically from only select parts of the parent sequence. A large number of the peptides showed significant sequence overlap with peptides with known antimicrobial or immunomodulatory functions. Antibacterial assays showed the milk peptide mixtures inhibited the growth of Escherichia coli and Staphylococcus aureus. The pre-digestion of milk proteins and the consequent release antibacterial peptides may provide a selective advantage through evolution by protecting both the mother's mammary gland and her nursing offspring from infection. PMID:23586814
Martin-Eauclaire, Marie-France; Salvatierra, Juan; Bosmans, Frank; Bougis, Pierre E
2016-09-01
We report the detailed chemical, immunological and pharmacological characterization of the α-toxin Bot IX from the Moroccan scorpion Buthus occitanus tunetanus venom. Bot IX, which consists of 70 amino acids, is a highly atypical toxin. It carries a unique N-terminal sequence extension and is highly lethal in mice. Voltage clamp recordings on oocytes expressing rat Nav1.2 or insect BgNav1 reveal that, similar to other α-like toxins, Bot IX inhibits fast inactivation of both variants. Moreover, Bot IX belongs to the same structural/immunological group as the α-like toxin Bot I. Remarkably, radioiodinated Bot IX competes efficiently with the classical α-toxin AaH II from Androctonus australis, and displays one of the highest affinities for Nav channels. © 2016 Federation of European Biochemical Societies.
Analysis of noise-induced temporal correlations in neuronal spike sequences
NASA Astrophysics Data System (ADS)
Reinoso, José A.; Torrent, M. C.; Masoller, Cristina
2016-11-01
We investigate temporal correlations in sequences of noise-induced neuronal spikes, using a symbolic method of time-series analysis. We focus on the sequence of time-intervals between consecutive spikes (inter-spike-intervals, ISIs). The analysis method, known as ordinal analysis, transforms the ISI sequence into a sequence of ordinal patterns (OPs), which are defined in terms of the relative ordering of consecutive ISIs. The ISI sequences are obtained from extensive simulations of two neuron models (FitzHugh-Nagumo, FHN, and integrate-and-fire, IF), with correlated noise. We find that, as the noise strength increases, temporal order gradually emerges, revealed by the existence of more frequent ordinal patterns in the ISI sequence. While in the FHN model the most frequent OP depends on the noise strength, in the IF model it is independent of the noise strength. In both models, the correlation time of the noise affects the OP probabilities but does not modify the most probable pattern.
Ventura, Marco; Zink, Ralf; Fitzgerald, Gerald F; van Sinderen, Douwe
2005-01-01
The incorporation and delivery of bifidobacterial strains as probiotic components in many food preparations expose these microorganisms to a multitude of environmental insults, including heat and osmotic stresses. We characterized the dnaK gene region of Bifidobacterium breve UCC 2003. Sequence analysis of the dnaK locus revealed four genes with the organization dnaK-grpE-dnaJ-ORF1, whose deduced protein products display significant similarity to corresponding chaperones found in other bacteria. Northern hybridization and real-time LightCycler PCR analysis revealed that the transcription of the dnaK operon was strongly induced by osmotic shock but was not induced significantly by heat stress. A 4.4-kb polycistronic mRNA, which represented the transcript of the complete dnaK gene region, was detected. Many other small transcripts, which were assumed to have resulted from intensive processing or degradation of this polycistronic mRNA, were identified. The transcription start site of the dnaK operon was determined by primer extension. Phylogenetic analysis of the available bifidobacterial grpE and dnaK genes suggested that the evolutionary development of these genes has been similar. The phylogeny derived from the various bifidobacterial grpE and dnaK sequences is consistent with that derived from 16S rRNA. The use of these genes in bifidobacterial species as an alternative or complement to the 16S rRNA gene marker provides sequence signatures that allow a high level of discrimination between closely related species of this genus.
Lu, Changrui; Smith, Angela M; Fuchs, Ryan T; Ding, Fang; Rajashankar, Kanagalaghatta; Henkin, Tina M; Ke, Ailong
2011-01-01
Three distinct classes of S-adenosyl-l-methionine (SAM)-responsive riboswitches have been identified that regulate bacterial gene expression at the levels of transcription attenuation or translation inhibition. The SMK box (SAM-III) translational riboswitch has been identified in the SAM synthetase gene in members of the Lactobacillales. Here we report the 2.2-Å crystal structure of the Enterococcus faecalis SMK box riboswitch. The Y-shaped riboswitch organizes its conserved nucleotides around a three-way junction for SAM recognition. The Shine-Dalgarno sequence, which is sequestered by base-pairing with the anti–Shine-Dalgarno sequence in response to SAM binding, also directly participates in SAM recognition. The riboswitch makes extensive interactions with the adenosine and sulfonium moieties of SAM but does not appear to recognize the tail of the methionine moiety. We captured a structural snapshot of the SMK box riboswitch sampling the near-cognate ligand S-adenosyl-l-homocysteine (SAH) in which SAH was found to adopt an alternative conformation and fails to make several key interactions. PMID:18806797
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, C.; Smith, A.M.; Fuchs, R.T.
2010-01-07
Three distinct classes of S-adenosyl-L-methionine (SAM)-responsive riboswitches have been identified that regulate bacterial gene expression at the levels of transcription attenuation or translation inhibition. The SMK box (SAM-III) translational riboswitch has been identified in the SAM synthetase gene in members of the Lactobacillales. Here we report the 2.2-{angstrom} crystal structure of the Enterococcus faecalis SMK box riboswitch. The Y-shaped riboswitch organizes its conserved nucleotides around a three-way junction for SAM recognition. The Shine-Dalgarno sequence, which is sequestered by base-pairing with the anti-Shine-Dalgarno sequence in response to SAM binding, also directly participates in SAM recognition. The riboswitch makes extensive interactions withmore » the adenosine and sulfonium moieties of SAM but does not appear to recognize the tail of the methionine moiety. We captured a structural snapshot of the SMK box riboswitch sampling the near-cognate ligand S-adenosyl-L-homocysteine (SAH) in which SAH was found to adopt an alternative conformation and fails to make several key interactions.« less
HuH-7 reference genome profile: complex karyotype composed of massive loss of heterozygosity.
Kasai, Fumio; Hirayama, Noriko; Ozawa, Midori; Satoh, Motonobu; Kohara, Arihiro
2018-05-17
Human cell lines represent a valuable resource as in vitro experimental models. A hepatoma cell line, HuH-7 (JCRB0403), has been used extensively in various research fields and a number of studies using this line have been published continuously since it was established in 1982. However, an accurate genome profile, which can be served as a reliable reference, has not been available. In this study, we performed M-FISH, SNP microarray and amplicon sequencing to characterize the cell line. Single cell analysis of metaphases revealed a high level of heterogeneity with a mode of 60 chromosomes. Cytogenetic results demonstrated chromosome abnormalities involving every chromosome in addition to a massive loss of heterozygosity, which accounts for 55.3% of the genome, consistent with the homozygous variants seen in the sequence analysis. We provide empirical data that the HuH-7 cell line is composed of highly heterogeneous cell populations, suggesting that besides cell line authentication, the quality of cell lines needs to be taken into consideration in the future use of tumor cell lines.
Assignment of the human PAX4 gene to chromosome band 7q32 by fluorescence in situ hybridization.
Tamura, T; Izumikawa, Y; Kishino, T; Soejima, H; Jinno, Y; Niikawa, N
1994-01-01
Of the nine known members of a human paired box-containing gene family (Pax), only PAX4 has not been precisely localized. We screened a cosmid library of human genomic DNA using polymerase chain reaction products for PAX4 as a probe and isolated three positive cosmid clones. Sequence analysis revealed that at least two of them had exon-like sequences and showed extensive homology to Pax-4 in the mouse. These two cosmid clones were mapped to human chromosome band 7q32 by fluorescence in situ hybridization.
Gao, Feng; Simon, Anne E.
2016-01-01
Programmed -1 ribosomal frameshifting (-1 PRF) is used by many positive-strand RNA viruses for translation of required products. Despite extensive studies, it remains unresolved how cis-elements just downstream of the recoding site promote a precise level of frameshifting. The Umbravirus Pea enation mosaic virus RNA2 expresses its RNA polymerase by -1 PRF of the 5′-proximal ORF (p33). Three hairpins located in the vicinity of the recoding site are phylogenetically conserved among Umbraviruses. The central Recoding Stimulatory Element (RSE), located downstream of the p33 termination codon, is a large hairpin with two asymmetric internal loops. Mutational analyses revealed that sequences throughout the RSE and the RSE lower stem (LS) structure are important for frameshifting. SHAPE probing of mutants indicated the presence of higher order structure, and sequences in the LS may also adapt an alternative conformation. Long-distance pairing between the RSE and a 3′ terminal hairpin was less critical when the LS structure was stabilized. A basal level of frameshifting occurring in the absence of the RSE increases to 72% of wild-type when a hairpin upstream of the slippery site is also deleted. These results suggest that suppression of frameshifting may be needed in the absence of an active RSE conformation. PMID:26578603
Putaporntip, Chaturong; Hughes, Austin L; Jongwutiwes, Somchai
2013-01-01
The merozoite surface protein-1 (MSP-1) is a candidate target for the development of blood stage vaccines against malaria. Polymorphism in MSP-1 can be useful as a genetic marker for strain differentiation in malarial parasites. Although sequence diversity in the MSP-1 locus has been extensively analyzed in field isolates of Plasmodium falciparum and P. vivax, the extent of variation in its homologues in P. ovale curtisi and P. ovale wallikeri, remains unknown. Analysis of the mitochondrial cytochrome b sequences of 10 P. ovale isolates from symptomatic malaria patients from diverse endemic areas of Thailand revealed co-existence of P. ovale curtisi (n = 5) and P. ovale wallikeri (n = 5). Direct sequencing of the PCR-amplified products encompassing the entire coding region of MSP-1 of P. ovale curtisi (PocMSP-1) and P. ovale wallikeri (PowMSP-1) has identified 3 imperfect repeated segments in the former and one in the latter. Most amino acid differences between these proteins were located in the interspecies variable domains of malarial MSP-1. Synonymous nucleotide diversity (πS) exceeded nonsynonymous nucleotide diversity (πN) for both PocMSP-1 and PowMSP-1, albeit at a non-significant level. However, when MSP-1 of both these species was considered together, πS was significantly greater than πN (p<0.0001), suggesting that purifying selection has shaped diversity at this locus prior to speciation. Phylogenetic analysis based on conserved domains has placed PocMSP-1 and PowMSP-1 in a distinct bifurcating branch that probably diverged from each other around 4.5 million years ago. The MSP-1 sequences support that P. ovale curtisi and P. ovale wallikeri are distinct species. Both species are sympatric in Thailand. The low level of sequence diversity in PocMSP-1 and PowMSP-1 among Thai isolates could stem from persistent low prevalence of these species, limiting the chance of outcrossing at this locus.
Putaporntip, Chaturong; Hughes, Austin L.; Jongwutiwes, Somchai
2013-01-01
Background The merozoite surface protein-1 (MSP-1) is a candidate target for the development of blood stage vaccines against malaria. Polymorphism in MSP-1 can be useful as a genetic marker for strain differentiation in malarial parasites. Although sequence diversity in the MSP-1 locus has been extensively analyzed in field isolates of Plasmodium falciparum and P. vivax, the extent of variation in its homologues in P. ovale curtisi and P. ovale wallikeri, remains unknown. Methodology/Principal Findings Analysis of the mitochondrial cytochrome b sequences of 10 P. ovale isolates from symptomatic malaria patients from diverse endemic areas of Thailand revealed co-existence of P. ovale curtisi (n = 5) and P. ovale wallikeri (n = 5). Direct sequencing of the PCR-amplified products encompassing the entire coding region of MSP-1 of P. ovale curtisi (PocMSP-1) and P. ovale wallikeri (PowMSP-1) has identified 3 imperfect repeated segments in the former and one in the latter. Most amino acid differences between these proteins were located in the interspecies variable domains of malarial MSP-1. Synonymous nucleotide diversity (πS) exceeded nonsynonymous nucleotide diversity (πN) for both PocMSP-1 and PowMSP-1, albeit at a non-significant level. However, when MSP-1 of both these species was considered together, πS was significantly greater than πN (p<0.0001), suggesting that purifying selection has shaped diversity at this locus prior to speciation. Phylogenetic analysis based on conserved domains has placed PocMSP-1 and PowMSP-1 in a distinct bifurcating branch that probably diverged from each other around 4.5 million years ago. Conclusion/Significance The MSP-1 sequences support that P. ovale curtisi and P. ovale wallikeri are distinct species. Both species are sympatric in Thailand. The low level of sequence diversity in PocMSP-1 and PowMSP-1 among Thai isolates could stem from persistent low prevalence of these species, limiting the chance of outcrossing at this locus. PMID:23536840
Unique transposon landscapes are pervasive across Drosophila melanogaster genomes
Rahman, Reazur; Chirn, Gung-wei; Kanodia, Abhay; Sytnikova, Yuliya A.; Brembs, Björn; Bergman, Casey M.; Lau, Nelson C.
2015-01-01
To understand how transposon landscapes (TLs) vary across animal genomes, we describe a new method called the Transposon Insertion and Depletion AnaLyzer (TIDAL) and a database of >300 TLs in Drosophila melanogaster (TIDAL-Fly). Our analysis reveals pervasive TL diversity across cell lines and fly strains, even for identically named sub-strains from different laboratories such as the ISO1 strain used for the reference genome sequence. On average, >500 novel insertions exist in every lab strain, inbred strains of the Drosophila Genetic Reference Panel (DGRP), and fly isolates in the Drosophila Genome Nexus (DGN). A minority (<25%) of transposon families comprise the majority (>70%) of TL diversity across fly strains. A sharp contrast between insertion and depletion patterns indicates that many transposons are unique to the ISO1 reference genome sequence. Although TL diversity from fly strains reaches asymptotic limits with increasing sequencing depth, rampant TL diversity causes unsaturated detection of TLs in pools of flies. Finally, we show novel transposon insertions negatively correlate with Piwi-interacting RNA (piRNA) levels for most transposon families, except for the highly-abundant roo retrotransposon. Our study provides a useful resource for Drosophila geneticists to understand how transposons create extensive genomic diversity in fly cell lines and strains. PMID:26578579
Ben Said, Mourad; Ben Asker, Alaa; Belkahia, Hanène; Ghribi, Raoua; Selmi, Rachid; Messadi, Lilia
2018-05-12
Anaplasma marginale, which is responsible for bovine anaplasmosis in tropical and subtropical regions, is a tick-borne obligatory intraerythrocytic bacterium of cattle and wild ruminants. In Tunisia, information about the genetic diversity and the phylogeny of A. marginale strains are limited to the msp4 gene analysis. The purpose of this study is to investigate A. marginale isolates infecting 16 cattle located in different bioclimatic areas of northern Tunisia with single gene analysis and multilocus sequence typing methods on the basis of seven partial genes (dnaA, ftsZ, groEL, lipA, secY, recA and sucB). The single gene analysis confirmed the presence of different and novel heterogenic A. marginale strains infecting cattle from the north of Tunisia. The concatenated sequence analysis showed a phylogeographical resolution at the global level and that most of the Tunisian sequence types (STs) formed a separate cluster from a South African isolate and from all New World isolates and strains. By combining the characteristics of each single locus with those of the multi-loci scheme, these results provide a more detailed understanding on the diversity and the evolution of Tunisian A. marginale strains. Copyright © 2018 Elsevier GmbH. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Machlin, S.M.; Hanson, R.S.
The nucleotide sequence of a cloned 2.5-kilobase-pair SmaI fragment containing the methanol dehydrogenase (MDH) structural gene from Methylobacterium organophilum XX was determined. A single open reading frame with a coding capacity of 626 amino acids (molecular weight, 66,000) was identified on one stand, and N-terminal sequencing of purified MDH revealed that 27 of these residues constituted a putative signal peptide. Primer extension mapping of in vivo transcripts indicated that the start of mRNA synthesis was 160 to 170 base pairs upstream of the ATG codon. Northern (RNA) blot analysis further demonstrated that the transcript was 2.1 kilobase pairs in lengthmore » and therefore appeared to encode only MDH.« less
Timing, sequencing, and executive control in repetitive movement production.
Krampe, Ralf Th; Mayr, Ulrich; Kliegl, Reinhold
2005-06-01
The authors demonstrate that the timing and sequencing of target durations require low-level timing and executive control. Sixteen young (M-sub(age) = 19 years) and 16 older (M-sub(age) = 70 years) adults participated in 2 experiments. In Experiment 1, individual mean-variance functions for low-level timing (isochronous tapping) and the sequencing of multiple targets (rhythm production) revealed (a) a dissociation of low-level timing and sequencing in both age groups, (b) negligible age differences for low-level timing, and (c) large age differences for sequencing. Experiment 2 supported the distinction between low-level timing and executive functions: Selection against a dominant rhythm and switching between rhythms impaired performances in both age groups and induced pronounced perseveration of the dominant pattern in older adults. ((c) 2005 APA, all rights reserved).
Archaebacterial rhodopsin sequences: Implications for evolution
NASA Technical Reports Server (NTRS)
Lanyi, J. K.
1991-01-01
It was proposed over 10 years ago that the archaebacteria represent a separate kingdom which diverged very early from the eubacteria and eukaryotes. It follows that investigations of archaebacterial characteristics might reveal features of early evolution. So far, two genes, one for bacteriorhodopsin and another for halorhodopsin, both from Halobacterium halobium, have been sequenced. We cloned and sequenced the gene coding for the polypeptide of another one of these rhodopsins, a halorhodopsin in Natronobacterium pharaonis. Peptide sequencing of cyanogen bromide fragments, and immuno-reactions of the protein and synthetic peptides derived from the C-terminal gene sequence, confirmed that the open reading frame was the structural gene for the pharaonis halorhodopsin polypeptide. The flanking DNA sequences of this gene, as well as those of other bacterial rhodopsins, were compared to previously proposed archaebacterial consensus sequences. In pairwise comparisons of the open reading frame with DNA sequences for bacterio-opsin and halo-opsin from Halobacterium halobium, silent divergences were calculated. These indicate very considerable evolutionary distance between each pair of genes, even in the dame organism. In spite of this, three protein sequences show extensive similarities, indicating strong selective pressures.
In vivo binding of PRDM9 reveals interactions with noncanonical genomic sites
Grey, Corinne; Clément, Julie A.J.; Buard, Jérôme; Leblanc, Benjamin; Gut, Ivo; Gut, Marta; Duret, Laurent
2017-01-01
In mouse and human meiosis, DNA double-strand breaks (DSBs) initiate homologous recombination and occur at specific sites called hotspots. The localization of these sites is determined by the sequence-specific DNA binding domain of the PRDM9 histone methyl transferase. Here, we performed an extensive analysis of PRDM9 binding in mouse spermatocytes. Unexpectedly, we identified a noncanonical recruitment of PRDM9 to sites that lack recombination activity and the PRDM9 binding consensus motif. These sites include gene promoters, where PRDM9 is recruited in a DSB-dependent manner. Another subset reveals DSB-independent interactions between PRDM9 and genomic sites, such as the binding sites for the insulator protein CTCF. We propose that these DSB-independent sites result from interactions between hotspot-bound PRDM9 and genomic sequences located on the chromosome axis. PMID:28336543
Reanalysis of RNA-Sequencing Data Reveals Several Additional Fusion Genes with Multiple Isoforms
Kangaspeska, Sara; Hultsch, Susanne; Edgren, Henrik; Nicorici, Daniel; Murumägi, Astrid; Kallioniemi, Olli
2012-01-01
RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts. PMID:23119097
Reanalysis of RNA-sequencing data reveals several additional fusion genes with multiple isoforms.
Kangaspeska, Sara; Hultsch, Susanne; Edgren, Henrik; Nicorici, Daniel; Murumägi, Astrid; Kallioniemi, Olli
2012-01-01
RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts.
Xu, Teng; Qin, Song; Hu, Yongwu; Song, Zhijian; Ying, Jianchao; Li, Peizhen; Dong, Wei; Zhao, Fangqing; Yang, Huanming; Bao, Qiyu
2016-01-01
Arthrospira platensis is a multi-cellular and filamentous non-N2-fixing cyanobacterium that is capable of performing oxygenic photosynthesis. In this study, we determined the nearly complete genome sequence of A. platensis YZ. A. platensis YZ genome is a single, circular chromosome of 6.62 Mb in size. Phylogenetic and comparative genomic analyses revealed that A. platensis YZ was more closely related to A. platensis NIES-39 than Arthrospira sp. PCC 8005 and A. platensis C1. Broad gene gains were identified between A. platensis YZ and three other Arthrospira speices, some of which have been previously demonstrated that can be laterally transferred among different species, such as restriction-modification systems-coding genes. Moreover, unprecedented extensive chromosomal rearrangements among different strains were observed. The chromosomal rearrangements, particularly the chromosomal inversions, were analysed and estimated to be closely related to palindromes that involved long inverted repeat sequences and the extensively distributed type IIR restriction enzyme in the Arthrospira genome. In addition, species from genus Arthrospira unanimously contained the highest rate of repetitive sequence compared with the other species of order Oscillatoriales, suggested that sequence duplication significantly contributed to Arthrospira genome phylogeny. These results provided in-depth views into the genomic phylogeny and structural variation of A. platensis, as well as provide a valuable resource for functional genomics studies. PMID:27330141
van den Broek, M; Bolat, I; Nijkamp, J F; Ramos, E; Luttik, M A H; Koopman, F; Geertman, J M; de Ridder, D; Pronk, J T; Daran, J-M
2015-09-01
Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. Copyright © 2015, van den Broek et al.
van den Broek, M.; Bolat, I.; Nijkamp, J. F.; Ramos, E.; Luttik, M. A. H.; Koopman, F.; Geertman, J. M.; de Ridder, D.; Pronk, J. T.
2015-01-01
Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. PMID:26150454
Design and cloning strategies for constructing shRNA expression vectors
McIntyre, Glen J; Fanning, Gregory C
2006-01-01
Background Short hairpin RNA (shRNA) encoded within an expression vector has proven an effective means of harnessing the RNA interference (RNAi) pathway in mammalian cells. A survey of the literature revealed that shRNA vector construction can be hindered by high mutation rates and the ensuing sequencing is often problematic. Current options for constructing shRNA vectors include the use of annealed complementary oligonucleotides (74 % of surveyed studies), a PCR approach using hairpin containing primers (22 %) and primer extension of hairpin templates (4 %). Results We considered primer extension the most attractive method in terms of cost. However, in initial experiments we encountered a mutation frequency of 50 % compared to a reported 20 – 40 % for other strategies. By modifying the technique to be an isothermal reaction using the DNA polymerase Phi29, we reduced the error rate to 10 %, making primer extension the most efficient and cost-effective approach tested. We also found that inclusion of a restriction site in the loop could be exploited for confirming construct integrity by automated sequencing, while maintaining intended gene suppression. Conclusion In this study we detail simple improvements for constructing and sequencing shRNA that overcome current limitations. We also compare the advantages of our solutions against proposed alternatives. Our technical modifications will be of tangible benefit to researchers looking for a more efficient and reliable shRNA construction process. PMID:16396676
Horner, David S; Lefkimmiatis, Konstantinos; Reyes, Aurelio; Gissi, Carmela; Saccone, Cecilia; Pesole, Graziano
2007-01-01
Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies. PMID:17288612
Delatorre, Edson; Miranda, Milene; Tschoeke, Diogo A; Carvalho de Sequeira, Patrícia; Alves Sampaio, Simone; Barbosa-Lima, Giselle; Rangel Vieira, Yasmine; Leomil, Luciana; Bozza, Fernando A; Cerbino-Neto, José; Bozza, Patricia T; Ribeiro Nogueira, Rita Maria; Brasil, Patrícia; Thompson, Fabiano L; de Filippis, Ana M B; Souza, Thiago Moreno L
2018-05-17
Descriptive clinical data help to reveal factors that may provoke Zika virus (ZIKV) neuropathology. The case of a 24-year-old female with a ZIKV-associated severe acute neurological disorder was studied. The levels of ZIKV in the cerebrospinal fluid (CSF) were 50 times higher than the levels in other compartments. An acute anti-flavivirus IgG, together with enhanced TNF-alpha levels, may have contributed to ZIKV invasion in the CSF, whereas the unbiased genome sequencing [obtained by next-generation sequencing (NGS)] of the CSF revealed that no virus mutations were associated with the anatomic compartments (CSF, serum, saliva and urine).
NASA Technical Reports Server (NTRS)
Waas, A.; Babcock, C., Jr.
1986-01-01
A series of experiments was carried out to determine the mechanism of failure in compressively loaded laminated plates with a circular cutout. Real time holographic interferometry and photomicrography are used to observe the progression of failure. These observations together with post experiment plate sectioning and deplying for interior damage observation provide useful information for modelling the failure process. It is revealed that the failure is initiated as a localised instability in the zero layers, at the hole surface. With increasing load extensive delamination cracking is observed. The progression of failure is by growth of these delaminations induced by delamination buckling. Upon reaching a critical state, catastrophic failure of the plate is observed. The levels of applied load and the rate at which these events occur depend on the plate stacking sequence.
Nearing saturation of cancer driver gene discovery.
Hsiehchen, David; Hsieh, Antony
2018-06-15
Extensive sequencing efforts of cancer genomes such as The Cancer Genome Atlas (TCGA) have been undertaken to uncover bona fide cancer driver genes which has enhanced our understanding of cancer and revealed therapeutic targets. However, the number of driver gene mutations is bounded, indicating that there must be a point when further sequencing efforts will be excessive. We found that there was a significant positive correlation between sample size and identified driver gene mutations across 33 cancers sequenced by the TCGA, which is expected if additional sequencing is still leading to the identification of more driver genes. However, the rate of new cancer driver genes being discovered with larger samples is declining rapidly. Our analysis provides a general guide for determining which cancer types would likely benefit from additional sequencing efforts, particularly those with relatively high rates of cancer driver gene discovery. Our results argue that past strategies of indiscriminately sequencing as many specimens as possible for all cancer types is becoming inefficient. In addition, without significant investments into applying our knowledge of cancer genomes, we risk sequencing more cancer genomes for the sake of sequencing rather than meaningful patient benefit.
An extensive program of periodic alternative splicing linked to cell cycle progression
Dominguez, Daniel; Tsai, Yi-Hsuan; Weatheritt, Robert; Wang, Yang; Blencowe, Benjamin J; Wang, Zefeng
2016-01-01
Progression through the mitotic cell cycle requires periodic regulation of gene function at the levels of transcription, translation, protein-protein interactions, post-translational modification and degradation. However, the role of alternative splicing (AS) in the temporal control of cell cycle is not well understood. By sequencing the human transcriptome through two continuous cell cycles, we identify ~1300 genes with cell cycle-dependent AS changes. These genes are significantly enriched in functions linked to cell cycle control, yet they do not significantly overlap genes subject to periodic changes in steady-state transcript levels. Many of the periodically spliced genes are controlled by the SR protein kinase CLK1, whose level undergoes cell cycle-dependent fluctuations via an auto-inhibitory circuit. Disruption of CLK1 causes pleiotropic cell cycle defects and loss of proliferation, whereas CLK1 over-expression is associated with various cancers. These results thus reveal a large program of CLK1-regulated periodic AS intimately associated with cell cycle control. DOI: http://dx.doi.org/10.7554/eLife.10288.001 PMID:27015110
Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H
2005-01-01
Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476
Extraordinary Structured Noncoding RNAs Revealed by Bacterial Metagenome Analysis
Weinberg, Zasha; Perreault, Jonathan; Meyer, Michelle M.; Breaker, Ronald R.
2012-01-01
Estimates of the total number of bacterial species1-3 suggest that existing DNA sequence databases carry only a tiny fraction of the total amount of DNA sequence space represented by this division of life. Indeed, environmental DNA samples have been shown to encode many previously unknown classes of proteins4 and RNAs5. Bioinformatics searches6-10 of genomic DNA from bacteria commonly identify novel noncoding RNAs (ncRNAs)10-12 such as riboswitches13,14. In rare instances, RNAs that exhibit more extensive sequence and structural conservation across a wide range of bacteria are encountered15,16. Given that large structured RNAs are known to carry out complex biochemical functions such as protein synthesis and RNA processing reactions, identifying more RNAs of great size and intricate structure is likely to reveal additional biochemical functions that can be achieved by RNA. We applied an updated computational pipeline17 to discover ncRNAs that rival the known large ribozymes in size and structural complexity or that are among the most abundant RNAs in bacteria that encode them. These RNAs would have been difficult or impossible to detect without examining environmental DNA sequences, suggesting that numerous RNAs with extraordinary size, structural complexity, or other exceptional characteristics remain to be discovered in unexplored sequence space. PMID:19956260
Beck, Heather J.; Fleming, Ian M. C.
2016-01-01
Analysis of the Escherichia coli transcriptome identified a unique subset of messenger RNAs (mRNAs) that contain a conventional untranslated leader and Shine-Dalgarno (SD) sequence upstream of the gene’s start codon while also containing an AUG triplet at the mRNA’s 5’- terminus (5’-uAUG). Fusion of the coding sequence specified by the 5’-terminal putative AUG start codon to a lacZ reporter gene, as well as primer extension inhibition assays, reveal that the majority of the 5’-terminal upstream open reading frames (5’-uORFs) tested support some level of lacZ translation, indicating that these mRNAs can function both as leaderless and canonical SD-leadered mRNAs. Although some of the uORFs were expressed at low levels, others were expressed at levels close to that of the respective downstream genes and as high as the naturally leaderless cI mRNA of bacteriophage λ. These 5’-terminal uORFs potentially encode peptides of varying lengths, but their functions, if any, are unknown. In an effort to determine whether expression from the 5’-terminal uORFs impact expression of the immediately downstream cistron, we examined expression from the downstream coding sequence after mutations were introduced that inhibit efficient 5’-uORF translation. These mutations were found to affect expression from the downstream cistrons to varying degrees, suggesting that some 5’-uORFs may play roles in downstream regulation. Since the 5’-uAUGs found on these conventionally leadered mRNAs can function to bind ribosomes and initiate translation, this indicates that canonical mRNAs containing 5’-uAUGs should be examined for their potential to function also as leaderless mRNAs. PMID:27467758
Soetens, Oriane; De Bel, Annelies; Echahidi, Fedoua; Vancutsem, Ellen; Vandoorslaer, Kristof; Piérard, Denis
2012-01-01
The performance of matrix-assisted laser desorption–ionization time of flight mass spectrometry (MALDI-TOF MS) for species identification of Prevotella was evaluated and compared with 16S rRNA gene sequencing. Using a Bruker database, 62.7% of the 102 clinical isolates were identified to the species level and 73.5% to the genus level. Extension of the commercial database improved these figures to, respectively, 83.3% and 89.2%. MALDI-TOF MS identification of Prevotella is reliable but needs a more extensive database. PMID:22301022
ERIC Educational Resources Information Center
Allen, Nancy J.; DeLauro, Kimberly A.; Perry, Julia K.; Carman, Carol A.
2017-01-01
Previous research has found a positive relationship between students who had completed a sequence of developmental reading and writing courses and success in a reading-intensive college-level course. This study replicates and expands upon the previous research of Goldstein and Perin (2008) by utilizing a differently diverse sample and an…
Recombination of polynucleotide sequences using random or defined primers
Arnold, Frances H.; Shao, Zhixin; Affholter, Joseph A.; Zhao, Huimin H; Giver, Lorraine J.
2000-01-01
A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.
Recombination of polynucleotide sequences using random or defined primers
Arnold, Frances H.; Shao, Zhixin; Affholter, Joseph A.; Zhao, Huimin; Giver, Lorraine J.
2001-01-01
A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.
Winkfein, R J; Nishikawa, S; Connor, W; Dixon, G H
1993-07-01
A synthetic oligonucleotide primer, designed from marsupial protamine protein-sequence data [Balhorn, R., Corzett, M., Matrimas, J. A., Cummins, J. & Faden, B. (1989) Analysis of protamines isolated from two marsupials, the ring-tailed wallaby and gray short-tailed opossum, J. Cell. Biol. 107] was used to amplify, via the polymerase chain reaction, protamine sequences from a North American opossum (Didelphis marsupialis) cDNA. Using the amplified sequences as probes, several protamine cDNA clones were isolated. The protein sequence, predicted from the cDNA sequences, consisted of 57 amino acids, contained a large number of arginine residues and exhibited the sequence ARYR at its amino terminus, which is conserved in avian and most eutherian mammal protamines. Like the true protamines of trout and chicken, the opossum protamine lacked cysteine residues, distinguishing it from placental mammalian protamine 1 (P1 or stable) protamines. Examination of the protamine gene, isolated by polymerase-chain-reaction amplification of genomic DNA, revealed the presence of an intron dividing the protamine-coding region, a common characteristic of all mammalian P1 genes. In addition, extensive sequence identity in the 5' and 3' flanking regions between mouse and opossum sequences classify the marsupial protamine as being closely related to placental mammal P1. Protamine transcripts, in both birds and mammals, are present in two size classes, differing by the length of their poly(A) tails (either short or long). Examination of opossum protamine transcripts by Northern hybridization revealed four distinct mRNA species in the total RNA fraction, two of which were enriched in the poly(A)-rich fraction. Northern-blot analysis, using an intron-specific probe, revealed the presence of intron sequences in two of the four protamine transcripts. If expressed, the corresponding protein from intron-containing transcripts would differ from spliced transcripts by length (49 versus 57 amino acids) and would contain a cysteine residue.
Early history of European domestic cattle as revealed by ancient DNA.
Bollongino, R; Edwards, C J; Alt, K W; Burger, J; Bradley, D G
2006-03-22
We present an extensive ancient DNA analysis of mainly Neolithic cattle bones sampled from archaeological sites along the route of Neolithic expansion, from Turkey to North-Central Europe and Britain. We place this first reasonable population sample of Neolithic cattle mitochondrial DNA sequence diversity in context to illustrate the continuity of haplotype variation patterns from the first European domestic cattle to the present. Interestingly, the dominant Central European pattern, a starburst phylogeny around the modal sequence, T3, has a Neolithic origin, and the reduced diversity within this cluster in the ancient samples accords with their shorter history of post-domestic accumulation of mutation.
Directional analysis of CO2 persistence at a rural site.
Pérez, Isidro A; Sánchez, M Luisa; García, M Ángeles; Paredes, Vanessa
2011-09-01
Conditional probability was used to establish persistence of CO(2) concentrations at a rural site. Measurements extended over three years and were performed with a CO(2) continuous monitor and a sodar. Concentrations in the usual range at this site were proposed as the truncation level to calculate conditional probability, allowing us to determine the extent of CO(2) sequences. Extension of episodes may be inferred from these values. Persistence of wind directions revealed two groups of sectors, one with a persistence of about 16 h and another of about 9 h. Cumulative distribution of CO(2) was calculated in each wind sector and three groups, associated with different concentration origins, were established. One group was linked to transport and local sources, another to the rural environment, and a third to transport of clean air masses. Daily evolution of concentrations revealed major differences during the night and monthly analysis allowed us to associate group 1 with the vegetation cycle and group 3 with wind speed from December to April. Persistence of concentrations was obtained, and group 3 values were lower for concentrations above the truncation level, whereas persistence of groups 1 and 2 was similar. However, group 3 persistence was, in general, between group 1 and 2 persistence for concentrations below the truncation level. Copyright © 2011 Elsevier B.V. All rights reserved.
Lu, Xinli; Kang, Xianjiang; Liu, Yongjian; Cui, Ze; Guo, Wei; Zhao, Cuiying; Li, Yan; Chen, Suliang; Li, Jingyun; Zhang, Yuqi; Zhao, Hongru
2017-01-01
New human immunodeficiency virus type 1 (HIV-1) diagnoses are increasing rapidly in Hebei. The aim of this study presents the most extensive HIV-1 molecular epidemiology investigation in Hebei province in China thus far. We have carried out the most extensive systematic cross-sectional study based on newly diagnosed HIV-1 positive individuals in 2013, and characterized the molecular epidemiology of HIV-1 based on full length gag-partial pol gene sequences in the whole of Hebei. Nine HIV-1 genotypes based on full length gag-partial pol gene sequence were identified among 610 newly diagnosed naïve individuals. The four main genotypes were circulating recombinant form (CRF)01_AE (53.4%), CRF07_BC (23.4%), subtype B (15.9%), and unique recombinant forms URFs (4.9%). Within 1 year, three new genotypes (subtype A1, CRF55_01B, CRF65_cpx), unknown before in Hebei, were first found among men who have sex with men (MSM). All nine genotypes were identified in the sexually contracted HIV-1 population. Among 30 URFs, six recombinant patterns were revealed, including CRF01_AE/BC (40.0%), CRF01_AE/B (23.3%), B/C (16.7%), CRF01_AE/C (13.3%), CRF01_AE/B/A2 (3.3%) and CRF01_AE/BC/A2 (3.3%), plus two potential CRFs. This study elucidated the complicated characteristics of HIV-1 molecular epidemiology in a low HIV-1 prevalence northern province of China and revealed the high level of HIV-1 genetic diversity. All nine HIV-1 genotypes circulating in Hebei have spread out of their initial risk groups into the general population through sexual contact, especially through MSM. This highlights the urgency of HIV prevention and control in China.
Lu, Xinli; Kang, Xianjiang; Liu, Yongjian; Cui, Ze; Guo, Wei; Zhao, Cuiying; Li, Yan; Chen, Suliang; Li, Jingyun; Zhang, Yuqi; Zhao, Hongru
2017-01-01
New human immunodeficiency virus type 1 (HIV-1) diagnoses are increasing rapidly in Hebei. The aim of this study presents the most extensive HIV-1 molecular epidemiology investigation in Hebei province in China thus far. We have carried out the most extensive systematic cross-sectional study based on newly diagnosed HIV-1 positive individuals in 2013, and characterized the molecular epidemiology of HIV-1 based on full length gag-partial pol gene sequences in the whole of Hebei. Nine HIV-1 genotypes based on full length gag-partial pol gene sequence were identified among 610 newly diagnosed naïve individuals. The four main genotypes were circulating recombinant form (CRF)01_AE (53.4%), CRF07_BC (23.4%), subtype B (15.9%), and unique recombinant forms URFs (4.9%). Within 1 year, three new genotypes (subtype A1, CRF55_01B, CRF65_cpx), unknown before in Hebei, were first found among men who have sex with men (MSM). All nine genotypes were identified in the sexually contracted HIV-1 population. Among 30 URFs, six recombinant patterns were revealed, including CRF01_AE/BC (40.0%), CRF01_AE/B (23.3%), B/C (16.7%), CRF01_AE/C (13.3%), CRF01_AE/B/A2 (3.3%) and CRF01_AE/BC/A2 (3.3%), plus two potential CRFs. This study elucidated the complicated characteristics of HIV-1 molecular epidemiology in a low HIV-1 prevalence northern province of China and revealed the high level of HIV-1 genetic diversity. All nine HIV-1 genotypes circulating in Hebei have spread out of their initial risk groups into the general population through sexual contact, especially through MSM. This highlights the urgency of HIV prevention and control in China. PMID:28178737
Benfey, PN; Takatsuji, H; Ren, L; Shah, DM; Chua, NH
1990-01-01
We have analyzed expression from deletion derivatives of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) 5[prime]-upstream region in transgenic petunia flowers and seedlings. In seedlings, expression was strongest in root cortex cells and in trichomes. High-level expression in petals and in seedling roots was conferred by large (>500 base-pair) stretches of sequence, but was lost when smaller fragments were analyzed individually. This apparent requirement for extensive sequence suggests that combinations of cis-elements that are widely separated control tissue-specific expression from the EPSPS promoter. We have also used the high-level, petal-specific expression of the EPSPS promoter to change petal color in two mutant petunia lines. PMID:12354968
Foundations for a syntatic pattern recognition system for genomic DNA sequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Searles, D.B.
1993-03-01
The goal of the proposed work is the creation of a software system that will perform sophisticated pattern recognition and related functions at a level of abstraction and with expressive power beyond current general-purpose pattern-matching systems for biological sequences; and with a more uniform language, environment, and graphical user interface, and with greater flexibility, extensibility, embeddability, and ability to incorporate other algorithms, than current special-purpose analytic software.
Miniprimer PCR, a New Lens for Viewing the Microbial World▿ †
Isenbarger, Thomas A.; Finney, Michael; Ríos-Velázquez, Carlos; Handelsman, Jo; Ruvkun, Gary
2008-01-01
Molecular methods based on the 16S rRNA gene sequence are used widely in microbial ecology to reveal the diversity of microbial populations in environmental samples. Here we show that a new PCR method using an engineered polymerase and 10-nucleotide “miniprimers” expands the scope of detectable sequences beyond those detected by standard methods using longer primers and Taq polymerase. After testing the method in silico to identify divergent ribosomal genes in previously cloned environmental sequences, we applied the method to soil and microbial mat samples, which revealed novel 16S rRNA gene sequences that would not have been detected with standard primers. Deeply divergent sequences were discovered with high frequency and included representatives that define two new division-level taxa, designated CR1 and CR2, suggesting that miniprimer PCR may reveal new dimensions of microbial diversity. PMID:18083877
NASA Astrophysics Data System (ADS)
Yoshida, S.
2000-11-01
High-frequency stratigraphic sequences that comprise the Desert Member of the Blackhawk Formation, the Lower Castlegate Sandstone, and the Buck Tongue in the Green River area of Utah display changes in sequence architecture from marine deposits to marginal marine deposits to an entirely nonmarine section. Facies and sequence architecture differ above and below the regionally extensive Castlegate sequence boundary, which separates two low-frequency (106-year cyclicity) sequences. Below this surface, high-frequency sequences are identified and interpreted as comprising the highstand systems tract of the low-frequency Blackhawk sequence. Each high-frequency sequence has a local incised valley system on top of the wave-dominated delta, and coastal plain to shallow marine deposits are preserved. Above the Castlegate sequence boundary, in contrast, a regionally extensive sheet sandstone of fluvial to estuarine origin with laterally continuous internal erosional surfaces occurs. These deposits above the Castlegate sequence boundary are interpreted as the late lowstand to early transgressive systems tracts of the low-frequency Castlegate sequence. The base-level changes that generated both the low- and high-frequency sequences are attributed to crustal response to fluctuations in compressive intraplate stress on two different time scales. The low-frequency stratigraphic sequences are attributed to changes in the long-term regional subsidence rate and regional tilting of foreland basin fill. High-frequency sequences probably reflect the response of anisotropic basement to tectonism. Sequence architecture changes rapidly across the faulted margin of the underlying Paleozoic Paradox Basin. The high-frequency sequences are deeply eroded and stack above the Paradox Basin, but display less relief and become conformable updip. These features indicate that the area above the Paradox Basin was more prone to vertical structural movements during formation of the Blackhawk-Lower Castlegate succession.
Milos, Patrice
2008-04-01
Helicos BioSciences Corporation is a life sciences company developing revolutionary new single molecule sequencing technology to provide the path to the US$1000 genome. True Single Molecule Sequencing (tSMS) will drive advancements in pharmacogenomics that can enable a better understanding of an individual's susceptibility to disease, develop more effective disease diagnoses and differentiate response to disease therapies. During 2007, genome-wide disease-association studies, the encylopedia of DNA elements (ENCODE) and the published genome sequence of two individuals have revealed human genome variation far more extensive than originally believed. These also demonstrated that common variations explain only a fraction of the genetic basis of disease. Therefore, the capability to understand an individual genome is critical in setting the foundation for the next great revolution in healthcare. Helicos is committed to this vision and will provide cost-effective genome sequencing and comprehensive analysis of the transcribed genome that can unlock the era of personalized healthcare.
Ringwald, M; Schuh, R; Vestweber, D; Eistetter, H; Lottspeich, F; Engel, J; Dölz, R; Jähnig, F; Epplen, J; Mayer, S
1987-01-01
We have determined the amino acid sequence of the Ca2+-dependent cell adhesion molecule uvomorulin as it appears on the cell surface. The extracellular part of the molecule exhibits three internally repeated domains of 112 residues which are most likely generated by gene duplication. Each of the repeated domains contains two highly conserved units which could represent putative Ca2+-binding sites. Secondary structure predictions suggest that the putative Ca2+-binding units are located in external loops at the surface of the protein. The protein sequence exhibits a single membrane-spanning region and a cytoplasmic domain. Sequence comparison reveals extensive homology to the chicken L-CAM. Both uvomorulin and L-CAM are identical in 65% of their entire amino acid sequence suggesting a common origin for both CAMs. Images Fig. 1. Fig. 4. Fig. 7. PMID:3501370
Jiménez, Diego Javier; Andreote, Fernando Dini; Chaves, Diego; Montaña, José Salvador; Osorio-Forero, Cesar; Junca, Howard; Zambrano, María Mercedes; Baena, Sandra
2012-01-01
A taxonomic and annotated functional description of microbial life was deduced from 53 Mb of metagenomic sequence retrieved from a planktonic fraction of the Neotropical high Andean (3,973 meters above sea level) acidic hot spring El Coquito (EC). A classification of unassembled metagenomic reads using different databases showed a high proportion of Gammaproteobacteria and Alphaproteobacteria (in total read affiliation), and through taxonomic affiliation of 16S rRNA gene fragments we observed the presence of Proteobacteria, micro-algae chloroplast and Firmicutes. Reads mapped against the genomes Acidiphilium cryptum JF-5, Legionella pneumophila str. Corby and Acidithiobacillus caldus revealed the presence of transposase-like sequences, potentially involved in horizontal gene transfer. Functional annotation and hierarchical comparison with different datasets obtained by pyrosequencing in different ecosystems showed that the microbial community also contained extensive DNA repair systems, possibly to cope with ultraviolet radiation at such high altitudes. Analysis of genes involved in the nitrogen cycle indicated the presence of dissimilatory nitrate reduction to N2 (narGHI, nirS, norBCDQ and nosZ), associated with Proteobacteria-like sequences. Genes involved in the sulfur cycle (cysDN, cysNC and aprA) indicated adenylsulfate and sulfite production that were affiliated to several bacterial species. In summary, metagenomic sequence data provided insight regarding the structure and possible functions of this hot spring microbial community, describing some groups potentially involved in the nitrogen and sulfur cycling in this environment. PMID:23251687
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cuomo, Christina A.; Guldener, Ulrich; Xu, Jin Rong
2007-09-07
We sequenced and annotated the genome of the filamentous fungus Fusarium graminearum, a major pathogen of cultivated cereals. Very few repetitive sequences were detected, and the process of repeat-induced point mutation, in which duplicated sequences are subject to extensive mutation, may partially account for the reduced repeat content and apparent low number of paralogous (ancestrally duplicated) genes. A second strain of F. graminearum contained more than 10,000 single-nucleotide polymorphisms, which were frequently located near telomeres and within other discrete chromosomal segments. Many highly polymorphic regions contained sets of genes implicated in plant-fungus interactions and were unusually divergent, with higher ratesmore » of recombination. These regions of genome innovation may result from selection due to interactions of F. graminearum with its plant hosts.« less
Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T
1993-01-01
Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. Images PMID:8497043
Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng
2012-01-01
To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing. PMID:23202944
Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai
2013-08-01
To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattering across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded that of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to human and from human to humans in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Funkenstein, B.; Leary, S.L.; Stein, J.C.
1988-03-01
The Gus-s/sup ..cap alpha../ allele of the mouse ..beta..-glucuronidase gene exhibits a high degree of inducibility by androgens due to its linkage with the Gus-r/sup ..cap alpha../ regulatory locus. The authors isolated Gus-s/sup ..cap alpha../ on a 28-kilobase pair fragment of mouse chromosome 5 and found that it contains 12 exons and 11 intervening sequences spanning 14 kilobase pairs of this genomic segment. The mRNA cap site was identified by ribonuclease protection and primer extension analyses which revealed an unusually short 5' noncoding sequence of 12 nucleotides. Proximal regulatory sequences in the 5'-flanking DNA and the complete sequence of themore » Gus-s/sup ..cap alpha../ mRNA transcript were also determined. Comparison of the amino acid sequence determined from the Gus-s/sup ..cap alpha../ nucleotide sequence with that of human ..beta..-glucuronidase indicated that the two human mRNA species differ due to alternate splicing of an exon homologous to exon 6 of the mouse gene.« less
Quiapim, Andréa C.; Brito, Michael S.; Bernardes, Luciano A.S.; daSilva, Idalete; Malavazi, Iran; DePaoli, Henrique C.; Molfetta-Machado, Jeanne B.; Giuliatti, Silvana; Goldman, Gustavo H.; Goldman, Maria Helena S.
2009-01-01
The success of plant reproduction depends on pollen-pistil interactions occurring at the stigma/style. These interactions vary depending on the stigma type: wet or dry. Tobacco (Nicotiana tabacum) represents a model of wet stigma, and its stigmas/styles express genes to accomplish the appropriate functions. For a large-scale study of gene expression during tobacco pistil development and preparation for pollination, we generated 11,216 high-quality expressed sequence tags (ESTs) from stigmas/styles and created the TOBEST database. These ESTs were assembled in 6,177 clusters, from which 52.1% are pistil transcripts/genes of unknown function. The 21 clusters with the highest number of ESTs (putative higher expression levels) correspond to genes associated with defense mechanisms or pollen-pistil interactions. The database analysis unraveled tobacco sequences homologous to the Arabidopsis (Arabidopsis thaliana) genes involved in specifying pistil identity or determining normal pistil morphology and function. Additionally, 782 independent clusters were examined by macroarray, revealing 46 stigma/style preferentially expressed genes. Real-time reverse transcription-polymerase chain reaction experiments validated the pistil-preferential expression for nine out of 10 genes tested. A search for these 46 genes in the Arabidopsis pistil data sets demonstrated that only 11 sequences, with putative equivalent molecular functions, are expressed in this dry stigma species. The reverse search for the Arabidopsis pistil genes in the TOBEST exposed a partial overlap between these dry and wet stigma transcriptomes. The TOBEST represents the most extensive survey of gene expression in the stigmas/styles of wet stigma plants, and our results indicate that wet and dry stigmas/styles express common as well as distinct genes in preparation for the pollination process. PMID:19052150
Integrated systems analysis reveals a molecular network underlying autism spectrum disorders
Li, Jingjing; Shi, Minyi; Ma, Zhihai; Zhao, Shuchun; Euskirchen, Ghia; Ziskin, Jennifer; Urban, Alexander; Hallmayer, Joachim; Snyder, Michael
2014-01-01
Autism is a complex disease whose etiology remains elusive. We integrated previously and newly generated data and developed a systems framework involving the interactome, gene expression and genome sequencing to identify a protein interaction module with members strongly enriched for autism candidate genes. Sequencing of 25 patients confirmed the involvement of this module in autism, which was subsequently validated using an independent cohort of over 500 patients. Expression of this module was dichotomized with a ubiquitously expressed subcomponent and another subcomponent preferentially expressed in the corpus callosum, which was significantly affected by our identified mutations in the network center. RNA-sequencing of the corpus callosum from patients with autism exhibited extensive gene mis-expression in this module, and our immunochemical analysis showed that the human corpus callosum is predominantly populated by oligodendrocyte cells. Analysis of functional genomic data further revealed a significant involvement of this module in the development of oligodendrocyte cells in mouse brain. Our analysis delineates a natural network involved in autism, helps uncover novel candidate genes for this disease and improves our understanding of its molecular pathology. PMID:25549968
Babu, Peram Ravindra; Rao, Khareedu Venkateswara; Reddy, Vudem Dashavantha
2013-01-15
Flax CYPome analysis resulted in the identification of 334 putative cytochrome P450 (CYP450) genes in the cultivated flax genome. Classification of flax CYP450 genes based on the sequence similarity with Arabidopsis orthologs and CYP450 nomenclature, revealed 10 clans representing 44 families and 98 subfamilies. CYP80, CYP83, CYP92, CYP702, CYP705, CYP708, CYP728, CYP729, CYP733 and CYP736 families are absent in the flax genome. The subfamily members exhibited conserved sequences, length of exons and phasing of introns. Similarity search of the genomic resources of wild flax species Linum bienne with CYP450 coding sequences of the cultivated flax, revealed the presence of 127 CYP450 gene orthologs, indicating amplification of novel CYP450 genes in the cultivated flax. Seven families CYP73, 74, 75, 76, 77, 84 and 709, coding for enzymes associated with phenylpropanoid/fatty acid metabolism, showed extensive gene amplification in the flax. About 59% of the flax CYP450 genes were present in the EST libraries. Copyright © 2012 Elsevier B.V. All rights reserved.
Sánchez-Navarro, J A; Pallás, V
1997-01-01
The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of A1MV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, A1MV should be considered as a true ilarvirus instead of forming a distinct group of viruses.
Non-3D domain swapped crystal structure of truncated zebrafish alphaA crystallin
Laganowsky, A; Eisenberg, D
2010-01-01
In previous work on truncated alpha crystallins (Laganowsky et al., Protein Sci 2010; 19:1031–1043), we determined crystal structures of the alpha crystallin core, a seven beta-stranded immunoglobulin-like domain, with its conserved C-terminal extension. These extensions swap into neighboring cores forming oligomeric assemblies. The extension is palindromic in sequence, binding in either of two directions. Here, we report the crystal structure of a truncated alphaA crystallin (AAC) from zebrafish (Danio rerio) revealing C-terminal extensions in a non three-dimensional (3D) domain swapped, “closed” state. The extension is quasi-palindromic, bound within its own zebrafish core domain, lying in the opposite direction to that of bovine AAC, which is bound within an adjacent core domain (Laganowsky et al., Protein Sci 2010; 19:1031–1043). Our findings establish that the C-terminal extension of alpha crystallin proteins can be either 3D domain swapped or non-3D domain swapped. This duality provides another molecular mechanism for alpha crystallin proteins to maintain the polydispersity that is crucial for eye lens transparency. PMID:20669149
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation
Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob
2014-01-01
As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Rhizobium etli asparaginase II
Huerta-Saquero, Alejandro; Evangelista-Martínez, Zahaed; Moreno-Enriquez, Angélica; Perez-Rueda, Ernesto
2013-01-01
Bacterial l-asparaginase has been a universal component of therapies for childhood acute lymphoblastic leukemia since the 1970s. Two principal enzymes derived from Escherichia coli and Erwinia chrysanthemi are the only options clinically approved to date. We recently reported a study of recombinant l-asparaginase (AnsA) from Rhizobium etli and described an increasing type of AnsA family members. Sequence analysis revealed four conserved motifs with notable differences with respect to the conserved regions of amino acid sequences of type I and type II l-asparaginases, particularly in comparison with therapeutic enzymes from E. coli and E. chrysanthemi. These differences suggested a distinct immunological specificity. Here, we report an in silico analysis that revealed immunogenic determinants of AnsA. Also, we used an extensive approach to compare the crystal structures of E. coli and E. chrysantemi asparaginases with a computational model of AnsA and identified immunogenic epitopes. A three-dimensional model of AsnA revealed, as expected based on sequence dissimilarities, completely different folding and different immunogenic epitopes. This approach could be very useful in transcending the problem of immunogenicity in two major ways: by chemical modifications of epitopes to reduce drug immunogenicity, and by site-directed mutagenesis of amino acid residues to diminish immunogenicity without reduction of enzymatic activity. PMID:22895060
Rhizobium etli asparaginase II: an alternative for acute lymphoblastic leukemia (ALL) treatment.
Huerta-Saquero, Alejandro; Evangelista-Martínez, Zahaed; Moreno-Enriquez, Angélica; Perez-Rueda, Ernesto
2013-01-01
Bacterial L-asparaginase has been a universal component of therapies for childhood acute lymphoblastic leukemia since the 1970s. Two principal enzymes derived from Escherichia coli and Erwinia chrysanthemi are the only options clinically approved to date. We recently reported a study of recombinant L-asparaginase (AnsA) from Rhizobium etli and described an increasing type of AnsA family members. Sequence analysis revealed four conserved motifs with notable differences with respect to the conserved regions of amino acid sequences of type I and type II L-asparaginases, particularly in comparison with therapeutic enzymes from E. coli and E. chrysanthemi. These differences suggested a distinct immunological specificity. Here, we report an in silico analysis that revealed immunogenic determinants of AnsA. Also, we used an extensive approach to compare the crystal structures of E. coli and E. chrysantemi asparaginases with a computational model of AnsA and identified immunogenic epitopes. A three-dimensional model of AsnA revealed, as expected based on sequence dissimilarities, completely different folding and different immunogenic epitopes. This approach could be very useful in transcending the problem of immunogenicity in two major ways: by chemical modifications of epitopes to reduce drug immunogenicity, and by site-directed mutagenesis of amino acid residues to diminish immunogenicity without reduction of enzymatic activity.
Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A
1996-01-01
In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965
Mechanism for DNA transposons to generate introns on genomic scales
Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.
2017-01-01
Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113
Dynamic evolution at pericentromeres.
Hall, Anne E; Kettler, Gregory C; Preuss, Daphne
2006-03-01
Pericentromeres are exceptional genomic regions: in animals they contain extensive segmental duplications implicated in gene creation, and in plants they sustain rearrangements and insertions uncommon in euchromatin. To examine the mechanisms and patterns of plant pericentromere evolution, we compared pericentromere sequence from four Brassicaceae species separated by <15 million years (Myr). This flowering plant family is ideal for studying relationships between genome reorganization and pericentromere evolution-its members have undergone recent polyploidization and hybridization, with close relatives changing in genome size and chromosome number. Through sequence and hybridization analyses, we examined regions from Arabidopsis arenosa, Capsella rubella, and Olimarabidopsis pumila that are homologous to Arabidopsis thaliana pericentromeres (peri-CENs) III and V, and used FISH to demonstrate they have been maintained near centromere satellite arrays in each species. Sequence analysis revealed a set of highly conserved genes, yet we discovered substantial differences in intergenic length and species-specific changes in sequence content and gene density. We discovered that A. thaliana has undergone recent, significant expansions within its pericentromeres, in some cases measuring hundreds of kilobases; these findings are in marked contrast to euchromatic segments in these species that exhibit only minor length changes. While plant pericentromeres do contain some duplications, we did not find evidence of extensive segmental duplications, as has been documented in primates. Our data support a model in which plant pericentromeres may experience selective pressures distinct from euchromatin, tolerating rapid, dynamic changes in structure and sequence content, including large insertions of mobile elements, 5S rDNA arrays and pseudogenes.
Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K P; Woo, Patrick C Y
2015-10-22
Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10-49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n=2), Pichia (Candida) norvegensis (n=2), Candida tropicalis (n=1) and Saccharomyces cerevisiae (n=1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study.
Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K. P.; Woo, Patrick C. Y.
2015-01-01
Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10–49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n = 2), Pichia (Candida) norvegensis (n = 2), Candida tropicalis (n = 1) and Saccharomyces cerevisiae (n = 1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study. PMID:26506340
2013-01-01
Background Grapes are one of the most economically important fruit crops. There are about 60 species in the genus Vitis. The phylogenetic relationships among these species are of keen interest for the conservation and use of this germplasm. We selected 309 accessions from 48 Vitis species,varieties, and outgroups, examined ~11 kb (~3.4 Mb total) of aligned nuclear DNA sequences from 27 unlinked genes in a phylogenetic context, and estimated divergence times based on fossil calibrations. Results Vitis formed a strongly supported clade. There was substantial support for species and less for the higher-level groupings (series). As estimated from extant taxa, the crown age of Vitis was 28 Ma and the divergence of subgenera (Vitis and Muscadinia) occurred at ~18 Ma. Higher clades in subgenus Vitis diverged 16 – 5 Ma with overlapping confidence intervals, and ongoing divergence formed extant species at 12 – 1.3 Ma. Several species had species-specific SNPs. NeighborNet analysis showed extensive reticulation at the core of subgenus Vitis representing the deeper nodes, with extensive reticulation radiating outward. Fitch Parsimony identified North America as the origin of the most recent common ancestor of extant Vitis species. Conclusions Phylogenetic patterns suggested origination of the genus in North America, fragmentation of an ancestral range during the Miocene, formation of extant species in the late Miocene-Pleistocene, and differentiation of species in the context of Pliocene-Quaternary tectonic and climatic change. Nuclear SNPs effectively resolved relationships at and below the species level in grapes and rectified several misclassifications of accessions in the repositories. Our results challenge current higher-level classifications, reveal the abundance of genetic diversity in the genus that is potentially available for crop improvement, and provide a valuable resource for species delineation, germplasm conservation and use. PMID:23826735
Detection and characterization of hepatitis A virus circulating in Egypt.
Hamza, Hazem; Abd-Elshafy, Dina Nadeem; Fayed, Sayed A; Bahgat, Mahmoud Mohamed; El-Esnawy, Nagwa Abass; Abdel-Mobdy, Emam
2017-07-01
Hepatitis A virus (HAV) still poses a considerable problem worldwide. In the current study, hepatitis A virus was recovered from wastewater samples collected from three wastewater treatment plants over one year. Using RT-PCR, HAV was detected in 43 out of 68 samples (63.2%) representing both inlet and outlet. Eleven positive samples were subjected to sequencing targeting the VP1-2A junction region. Phylogenetic analysis revealed that all samples belonged to subgenotype IB with few substitutions at the amino acid level. The complete sequence of one isolate (HAV/Egy/BI-11/2015) showed that the similarity at the amino acid level was not reflected at the nucleotide level. However, the deduced amino acid sequence derived from the complete nucleotide sequence showed distinct substitutions in the 2B, 2C, and 3A regions. Recombination analysis revealed a recombination event between X75215 (subgenotype IA) and AF268396 (subgenotype IB) involving a portion of the 2B nonstructural protein coding region (nucleotides 3757-3868) assuming the herein characterized sequence an actual recombinant. Despite the role of recombination in picornaviruses evolution, its involvement in HAV evolution has rarely been reported, and this may be due to the limited available complete HAV sequences. To our knowledge, this represents the first characterized complete sequence of an Egyptian isolate and the described recombination event provides an important update on the circulating HAV strains in Egypt.
Carreno, R A; Barta, J R
1998-11-01
The small subunit ribosomal RNA (SSU rRNA) genes of hippoboscid (Ornithoica vicina Walker) and tabanid (Chrysops niger Macquart) Diptera were sequenced to determine their phylogenetic position within the order and to determine whether or not extensive hypervariable regions in this gene are widespread in the Diptera. A parsimony analysis of an alignment containing 8 dipteran sequences produced a single most parsimonious tree that placed O. vicina as sister group to Drosophila melanogaster Meigen. The tabanid Chrysops niger was sister group to the asilomorphan taxa, and the sister group to the Brachycera was a Tipula sp. although this relationship was not supported by bootstrap analysis. The hippoboscid and tabanid sequences contain extensive hypervariable regions in the V2, V4, V6, and V7 regions as do other Diptera. When these regions of the alignment were excluded from the phylogenetic analysis, a single most parsimonious tree was found. This tree had an identical overall topology to the tree obtained from the total data set. The hypervariable regions in parts of the dipteran SSU rRNA genes were more extensive in the nematocerous dipteran sequences used in this study than in the other dipteran representatives; these hypervariable regions may be of more utility in inferring relationship among species and subspecies than at the suprageneric level.
Complete study demonstrating the absence of rhabdovirus in a distinct Sf9 cell line.
Hashimoto, Yoshifumi; Macri, Daniel; Srivastava, Indresh; McPherson, Clifton; Felberbaum, Rachael; Post, Penny; Cox, Manon
2017-01-01
A putative novel rhabdovirus (SfRV) was previously identified in a Spodoptera frugiperda cell line (Sf9 cells [ATCC CRL-1711 lot 58078522]) by next generation sequencing and extensive bioinformatic analysis. We performed an extensive analysis of our Sf9 cell bank (ATCC CRL-1711 lot 5814 [Sf9L5814]) to determine whether this virus was already present in cells obtained from ATCC in 1987. Inverse PCR of DNA isolated from Sf9 L5814 cellular DNA revealed integration of SfRV sequences in the cellular genome. RT-PCR of total RNA showed a deletion of 320 nucleotides in the SfRV RNA that includes the transcriptional motifs for genes X and L. Concentrated cell culture supernatant was analyzed by sucrose density gradient centrifugation and revealed a single band at a density of 1.14 g/ml. This fraction was further analysed by electron microscopy and showed amorphous and particulate debris that did not resemble a rhabdovirus in morphology or size. SDS-PAGE analysis confirmed that the protein composition did not contain the typical five rhabdovirus structural proteins and LC-MS/MS analysis revealed primarily of exosomal marker proteins, the SfRV N protein, and truncated forms of SfRV N, P, and G proteins. The SfRV L gene fragment RNA sequence was recovered from the supernatant after ultracentrifugation of the 1.14 g/ml fraction treated with diethyl ether suggesting that the SfRV L gene fragment sequence is not associated with a diethyl ether resistant nucleocapsid. Interestingly, the 1.14 g/ml fraction was able to transfer baculovirus DNA into Sf9L5814 cells, consistent with the presence of functional exosomes. Our results demonstrate the absence of viral particles in ATCC CRL-1711 lot 5814 Sf9 cells in contrast to a previous study that suggested the presence of infectious rhabdoviral particles in Sf9 cells from a different lot. This study highlights how cell lines with different lineages may present different virosomes and therefore no general conclusions can be drawn across Sf9 cells from different laboratories.
Modeling of the Ebola Virus Delta Peptide Reveals a Potential Lytic Sequence Motif
Gallaher, William R.; Garry, Robert F.
2015-01-01
Filoviruses, such as Ebola and Marburg viruses, cause severe outbreaks of human infection, including the extensive epidemic of Ebola virus disease (EVD) in West Africa in 2014. In the course of examining mutations in the glycoprotein gene associated with 2014 Ebola virus (EBOV) sequences, a differential level of conservation was noted between the soluble form of glycoprotein (sGP) and the full length glycoprotein (GP), which are both encoded by the GP gene via RNA editing. In the region of the proteins encoded after the RNA editing site sGP was more conserved than the overlapping region of GP when compared to a distant outlier species, Tai Forest ebolavirus. Half of the amino acids comprising the “delta peptide”, a 40 amino acid carboxy-terminal fragment of sGP, were identical between otherwise widely divergent species. A lysine-rich amphipathic peptide motif was noted at the carboxyl terminus of delta peptide with high structural relatedness to the cytolytic peptide of the non-structural protein 4 (NSP4) of rotavirus. EBOV delta peptide is a candidate viroporin, a cationic pore-forming peptide, and may contribute to EBOV pathogenesis. PMID:25609303
Honey Bee Infecting Lake Sinai Viruses.
Daughenbaugh, Katie F; Martin, Madison; Brutscher, Laura M; Cavigli, Ian; Garcia, Emma; Lavin, Matt; Flenniken, Michelle L
2015-06-23
Honey bees are critical pollinators of important agricultural crops. Recently, high annual losses of honey bee colonies have prompted further investigation of honey bee infecting viruses. To better characterize the recently discovered and very prevalent Lake Sinai virus (LSV) group, we sequenced currently circulating LSVs, performed phylogenetic analysis, and obtained images of LSV2. Sequence analysis resulted in extension of the LSV1 and LSV2 genomes, the first detection of LSV4 in the US, and the discovery of LSV6 and LSV7. We detected LSV1 and LSV2 in the Varroa destructor mite, and determined that a large proportion of LSV2 is found in the honey bee gut, suggesting that vector-mediated, food-associated, and/or fecal-oral routes may be important for LSV dissemination. Pathogen-specific quantitative PCR data, obtained from samples collected during a small-scale monitoring project, revealed that LSV2, LSV1, Black queen cell virus (BQCV), and Nosema ceranae were more abundant in weak colonies than strong colonies within this sample cohort. Together, these results enhance our current understanding of LSVs and illustrate the importance of future studies aimed at investigating the role of LSVs and other pathogens on honey bee health at both the individual and colony levels.
Selective chemical labeling reveals the genome-wide distribution of 5-hydroxymethylcytosine.
Song, Chun-Xiao; Szulwach, Keith E; Fu, Ye; Dai, Qing; Yi, Chengqi; Li, Xuekun; Li, Yujing; Chen, Chih-Hsin; Zhang, Wen; Jian, Xing; Wang, Jing; Zhang, Li; Looney, Timothy J; Zhang, Baichen; Godley, Lucy A; Hicks, Leslie M; Lahn, Bruce T; Jin, Peng; He, Chuan
2011-01-01
In contrast to 5-methylcytosine (5-mC), which has been studied extensively, little is known about 5-hydroxymethylcytosine (5-hmC), a recently identified epigenetic modification present in substantial amounts in certain mammalian cell types. Here we present a method for determining the genome-wide distribution of 5-hmC. We use the T4 bacteriophage β-glucosyltransferase to transfer an engineered glucose moiety containing an azide group onto the hydroxyl group of 5-hmC. The azide group can be chemically modified with biotin for detection, affinity enrichment and sequencing of 5-hmC-containing DNA fragments in mammalian genomes. Using this method, we demonstrate that 5-hmC is present in human cell lines beyond those previously recognized. We also find a gene expression level-dependent enrichment of intragenic 5-hmC in mouse cerebellum and an age-dependent acquisition of this modification in specific gene bodies linked to neurodegenerative disorders.
Woods, D E; Edge, M D; Colten, H R
1984-01-01
Complementary DNA (cDNA) clones corresponding to the major histocompatibility (MHC) class III antigen, complement protein C2, have been isolated from human liver cDNA libraries with the use of a complex mixture of synthetic oligonucleotides (17 mer) that contains 576 different oligonucleotide sequences. The C2 cDNA were used to identify a DNA restriction enzyme fragment length polymorphism that provides a genetic marker within the MHC that was not detectable at the protein level. An extensive search for genomic polymorphisms using a cDNA clone for another MHC class III gene, factor B, failed to reveal any DNA variants. The genomic variants detected with the C2 cDNA probe provide an additional genetic marker for analysis of MHC-linked diseases. Images PMID:6086718
Conservation of chromosome 1 in turtles over 66 million years.
Mühlmann-Díaz, M C; Ulsh, B A; Whicker, F W; Hinton, T G; Congdon, J D; Robinson, J F; Bedford, J S
2001-01-01
Fluorescence in situ hybridization of a whole chromosome 1-specific probe from the yellow-bellied slider turtle (Trachemys scripta) to cells from four other species of turtle ranging from a desert tortoise to a loggerhead sea turtle resulted in specific and exclusive hybridization to chromosome 1 in all five species. Previous observations of conservation in the giemsa banding pattern and chromosome morphology and number among turtles are thus extended to the DNA sequence level, revealing a cytogenetic stability of chromosome 1 in these turtles during the past 66-144 million years. This contrasts with the situation for various hominoid species where, in many instances, extensive chromosomal rearrangements have been reported in one third of that time period. Our probe, which was prepared by microdissecting whole chromosomes from embryonic T. scripta fibroblasts and amplifying using DOP-PCR, is the first report of a whole-chromosome FISH probe for any reptile.
Ventura, Marco; Canchaya, Carlos; Zink, Ralf; Fitzgerald, Gerald F; van Sinderen, Douwe
2004-10-01
The bacterial heat shock response is characterized by the elevated expression of a number of chaperone complexes, including the GroEL and GroES proteins. The groES and groEL genes are highly conserved among eubacteria and are typically arranged as an operon. Genome analysis of Bifidobacterium breve UCC 2003 revealed that the groES and groEL genes are located in different chromosomal regions. The heat inducibility of the groEL and groES genes of B. breve UCC 2003 was verified by slot blot analysis. Northern blot analyses showed that the cspA gene is cotranscribed with the groEL gene, while the groES gene is transcribed as a monocistronic unit. The transcription initiation sites of these two mRNAs were determined by primer extension. Sequence and transcriptional analyses of the region flanking the groEL and groES genes of various bifidobacteria revealed similar groEL-cspA and groES gene units, suggesting a novel genetic organization of these chaperones. Phylogenetic analysis of the available bifidobacterial groES and groEL genes suggested that these genes evolved differently. Discrepancies in the phylogenetic positioning of groES-based trees make this gene an unreliable molecular marker. On the other hand, the bifidobacterial groEL gene sequences can be used as an alternative to current methods for tracing Bifidobacterium species, particularly because they allow a high level of discrimination between closely related species of this genus.
A core gut microbiome in obese and lean twins.
Turnbaugh, Peter J; Hamady, Micah; Yatsunenko, Tanya; Cantarel, Brandi L; Duncan, Alexis; Ley, Ruth E; Sogin, Mitchell L; Jones, William J; Roe, Bruce A; Affourtit, Jason P; Egholm, Michael; Henrissat, Bernard; Heath, Andrew C; Knight, Rob; Gordon, Jeffrey I
2009-01-22
The human distal gut harbours a vast ensemble of microbes (the microbiota) that provide important metabolic capabilities, including the ability to extract energy from otherwise indigestible dietary polysaccharides. Studies of a few unrelated, healthy adults have revealed substantial diversity in their gut communities, as measured by sequencing 16S rRNA genes, yet how this diversity relates to function and to the rest of the genes in the collective genomes of the microbiota (the gut microbiome) remains obscure. Studies of lean and obese mice suggest that the gut microbiota affects energy balance by influencing the efficiency of calorie harvest from the diet, and how this harvested energy is used and stored. Here we characterize the faecal microbial communities of adult female monozygotic and dizygotic twin pairs concordant for leanness or obesity, and their mothers, to address how host genotype, environmental exposure and host adiposity influence the gut microbiome. Analysis of 154 individuals yielded 9,920 near full-length and 1,937,461 partial bacterial 16S rRNA sequences, plus 2.14 gigabases from their microbiomes. The results reveal that the human gut microbiome is shared among family members, but that each person's gut microbial community varies in the specific bacterial lineages present, with a comparable degree of co-variation between adult monozygotic and dizygotic twin pairs. However, there was a wide array of shared microbial genes among sampled individuals, comprising an extensive, identifiable 'core microbiome' at the gene, rather than at the organismal lineage, level. Obesity is associated with phylum-level changes in the microbiota, reduced bacterial diversity and altered representation of bacterial genes and metabolic pathways. These results demonstrate that a diversity of organismal assemblages can nonetheless yield a core microbiome at a functional level, and that deviations from this core are associated with different physiological states (obese compared with lean).
Jolliffe, Evan A; Keegan, B Mark; Flanagan, Eoin P
2018-04-21
Longitudinally-extensive T2-hyperintense spinal cord lesions (≥3 vertebral segments) are associated with neuromyelitis optical spectrum disorder but occur with other disorders including spinal cord sarcoidosis. When linear dorsal subpial enhancement is accompanied by central cord/canal enhancement the axial post-gadolinium sequences may reveal a "trident" pattern that has previously been shown to be strongly suggestive of spinal cord sarcoidosis. We report a case in which the patient was initially diagnosed with neuromyelitis optical spectrum disorder, but where the "trident" sign ultimately led to the correct diagnosis of spinal cord sarcoidosis. Copyright © 2018. Published by Elsevier B.V.
Evidence for Differential Glycosylation of Trophoblast Cell Types*
Chen, Qiushi; Pang, Poh-Choo; Cohen, Marie E.; Longtine, Mark S.; Schust, Danny J.; Haslam, Stuart M.; Blois, Sandra M.; Dell, Anne; Clark, Gary F.
2016-01-01
Human placental villi are surfaced by the syncytiotrophoblast (STB), with a layer of cytotrophoblasts (CTB) positioned just beneath the STB. STB in normal term pregnancies is exposed to maternal immune cells in the placental intervillous space. Extravillous cytotrophoblasts (EVT) invade the decidua and spiral arteries, where they act in conjunction with natural killer (NK) cells to convert the spiral arteries into flaccid conduits for maternal blood that support a 3–4 fold increase in the rate of maternal blood flow into the placental intervillous space. The functional roles of these distinct trophoblast subtypes during pregnancy suggested that they could be differentially glycosylated. Glycomic analysis of these trophoblasts has revealed the expression of elevated levels of biantennary N-glycans in STB and CTB, with the majority of them bearing a bisecting GlcNAc. N-glycans terminated with polylactosamine extensions were also detected at low levels. A subset of the N-glycans linked to these trophoblasts were sialylated, primarily with terminal NeuAcα2–3Gal sequences. EVT were decorated with the same N-glycans as STB and CTB, except in different proportions. The level of bisecting type N-glycans was reduced, but the level of N-glycans decorated with polylactosamine sequences were substantially elevated compared with the other types of trophoblasts. The level of triantennary and tetraantennary N-glycans was also elevated in EVT. The sialylated N-glycans derived from EVT were completely susceptible to an α2–3 specific neuraminidase (sialidase S). The possibility exists that the N-glycans associated with these different trophoblast subpopulations could act as functional groups. These potential relationships will be considered. PMID:26929217
Evidence for Differential Glycosylation of Trophoblast Cell Types.
Chen, Qiushi; Pang, Poh-Choo; Cohen, Marie E; Longtine, Mark S; Schust, Danny J; Haslam, Stuart M; Blois, Sandra M; Dell, Anne; Clark, Gary F
2016-06-01
Human placental villi are surfaced by the syncytiotrophoblast (STB), with a layer of cytotrophoblasts (CTB) positioned just beneath the STB. STB in normal term pregnancies is exposed to maternal immune cells in the placental intervillous space. Extravillous cytotrophoblasts (EVT) invade the decidua and spiral arteries, where they act in conjunction with natural killer (NK) cells to convert the spiral arteries into flaccid conduits for maternal blood that support a 3-4 fold increase in the rate of maternal blood flow into the placental intervillous space. The functional roles of these distinct trophoblast subtypes during pregnancy suggested that they could be differentially glycosylated. Glycomic analysis of these trophoblasts has revealed the expression of elevated levels of biantennary N-glycans in STB and CTB, with the majority of them bearing a bisecting GlcNAc. N-glycans terminated with polylactosamine extensions were also detected at low levels. A subset of the N-glycans linked to these trophoblasts were sialylated, primarily with terminal NeuAcα2-3Gal sequences. EVT were decorated with the same N-glycans as STB and CTB, except in different proportions. The level of bisecting type N-glycans was reduced, but the level of N-glycans decorated with polylactosamine sequences were substantially elevated compared with the other types of trophoblasts. The level of triantennary and tetraantennary N-glycans was also elevated in EVT. The sialylated N-glycans derived from EVT were completely susceptible to an α2-3 specific neuraminidase (sialidase S). The possibility exists that the N-glycans associated with these different trophoblast subpopulations could act as functional groups. These potential relationships will be considered. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Sutton, Deanna A.; Swenson, Cheryl L.; Bailey, Chris J.; Wiederhold, Nathan P.; Nelson, Nathan C.; Thompson, Elizabeth H.; Wickes, Brian L.; French, Stephanie; Fu, Jianmin; Vilar-Saavedra, Paulo
2014-01-01
Infections caused by Penicillium species are rare in dogs, and the prognosis in these cases is poor. An unknown species of Penicillium was isolated from a bone lesion in a young dog with osteomyelitis of the right ilium. Extensive diagnostic evaluation did not reveal evidence of dissemination. Resolution of lameness and clinical stability of disease were achieved with intravenous phospholipid-complexed amphotericin B initially, followed by long-term combination therapy with terbinafine and ketoconazole. A detailed morphological and molecular characterization of the mold was undertaken. Sequence analysis of the internal transcribed spacer revealed the isolate to be closely related to Penicillium menonorum and Penicillium pimiteouiense. Additional sequence analysis of β-tubulin, calmodulin, minichromosome maintenance factor, DNA-dependent RNA polymerase, and pre-rRNA processing protein revealed the isolate to be a novel species; the name Penicillium canis sp. nov. is proposed. Morphologically, smooth, ovoid conidia, a greenish gray colony color, slow growth on all media, and a failure to form ascomata distinguish this species from closely related Penicillium species. PMID:24789186
Langlois, Daniel K; Sutton, Deanna A; Swenson, Cheryl L; Bailey, Chris J; Wiederhold, Nathan P; Nelson, Nathan C; Thompson, Elizabeth H; Wickes, Brian L; French, Stephanie; Fu, Jianmin; Vilar-Saavedra, Paulo; Peterson, Stephen W
2014-07-01
Infections caused by Penicillium species are rare in dogs, and the prognosis in these cases is poor. An unknown species of Penicillium was isolated from a bone lesion in a young dog with osteomyelitis of the right ilium. Extensive diagnostic evaluation did not reveal evidence of dissemination. Resolution of lameness and clinical stability of disease were achieved with intravenous phospholipid-complexed amphotericin B initially, followed by long-term combination therapy with terbinafine and ketoconazole. A detailed morphological and molecular characterization of the mold was undertaken. Sequence analysis of the internal transcribed spacer revealed the isolate to be closely related to Penicillium menonorum and Penicillium pimiteouiense. Additional sequence analysis of β-tubulin, calmodulin, minichromosome maintenance factor, DNA-dependent RNA polymerase, and pre-rRNA processing protein revealed the isolate to be a novel species; the name Penicillium canis sp. nov. is proposed. Morphologically, smooth, ovoid conidia, a greenish gray colony color, slow growth on all media, and a failure to form ascomata distinguish this species from closely related Penicillium species. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Panwar, Priyankar; Verma, A K; Dubey, Ashutosh
2018-05-01
Barnyard ( Echinochloa frumentacea ) and finger ( Eleusine coracana ) millet growing at northwestern Himalaya were explored for the α-amylase inhibitor (α-AI). The mature seeds of barnyard millet variety PRJ1 had maximum α-AI activity which increases in different developmental stage. α-AI was purified up to 22.25-fold from barnyard millet variety PRJ1. Semi-quantitative PCR of different developmental stages of barnyard millet seeds showed increased levels of the transcript from 7 to 28 days. Sequence analysis revealed that it contained 315 bp nucleotide which encodes 104 amino acid sequence with molecular weight 10.72 kDa. The predicted 3D structure of α-AI was 86.73% similar to a bifunctional inhibitor of ragi. In silico analysis of 71 α-AI protein sequences were carried out for biochemical features, homology search, multiple sequence alignment, phylogenetic tree construction, motif, and superfamily distribution of protein sequences. Analysis of multiple sequence alignment revealed the existence of conserved regions NPLP[S/G]CRWYVV[S/Q][Q/R]TCG[V/I] throughout sequences. Superfam analysis revealed that α-AI protein sequences were distributed among seven different superfamilies.
Characterizing genomic alterations in cancer by complementary functional associations.
Kim, Jong Wook; Botvinnik, Olga B; Abudayyeh, Omar; Birger, Chet; Rosenbluh, Joseph; Shrestha, Yashaswi; Abazeed, Mohamed E; Hammerman, Peter S; DiCara, Daniel; Konieczkowski, David J; Johannessen, Cory M; Liberzon, Arthur; Alizad-Rahvar, Amir Reza; Alexe, Gabriela; Aguirre, Andrew; Ghandi, Mahmoud; Greulich, Heidi; Vazquez, Francisca; Weir, Barbara A; Van Allen, Eliezer M; Tsherniak, Aviad; Shao, Diane D; Zack, Travis I; Noble, Michael; Getz, Gad; Beroukhim, Rameen; Garraway, Levi A; Ardakani, Masoud; Romualdi, Chiara; Sales, Gabriele; Barbie, David A; Boehm, Jesse S; Hahn, William C; Mesirov, Jill P; Tamayo, Pablo
2016-05-01
Systematic efforts to sequence the cancer genome have identified large numbers of mutations and copy number alterations in human cancers. However, elucidating the functional consequences of these variants, and their interactions to drive or maintain oncogenic states, remains a challenge in cancer research. We developed REVEALER, a computational method that identifies combinations of mutually exclusive genomic alterations correlated with functional phenotypes, such as the activation or gene dependency of oncogenic pathways or sensitivity to a drug treatment. We used REVEALER to uncover complementary genomic alterations associated with the transcriptional activation of β-catenin and NRF2, MEK-inhibitor sensitivity, and KRAS dependency. REVEALER successfully identified both known and new associations, demonstrating the power of combining functional profiles with extensive characterization of genomic alterations in cancer genomes.
Wang, Jing; Moore, Nicole E.; Murray, Zak L.; McInnes, Kate; White, Daniel J.; Tompkins, Daniel M.
2015-01-01
Bats harbour a diverse array of viruses, including significant human pathogens. Extensive metagenomic studies of material from bats, in particular guano, have revealed a large number of novel or divergent viral taxa that were previously unknown. New Zealand has only two extant indigenous terrestrial mammals, which are both bats, Mystacina tuberculata (the lesser short-tailed bat) and Chalinolobus tuberculatus (the long-tailed bat). Until the human introduction of exotic mammals, these species had been isolated from all other terrestrial mammals for over 1 million years (potentially over 16 million years for M. tuberculata). Four bat guano samples were collected from M. tuberculata roosts on the isolated offshore island of Whenua hou (Codfish Island) in New Zealand. Metagenomic analysis revealed that this species still hosts a plethora of divergent viruses. Whilst the majority of viruses detected were likely to be of dietary origin, some putative vertebrate virus sequences were identified. Papillomavirus, polyomavirus, calicivirus and hepevirus were found in the metagenomic data and subsequently confirmed using independent PCR assays and sequencing. The new hepevirus and calicivirus sequences may represent new genera within these viral families. Our findings may provide an insight into the origins of viral families, given their detection in an isolated host species. PMID:25900137
Wang, Jing; Moore, Nicole E; Murray, Zak L; McInnes, Kate; White, Daniel J; Tompkins, Daniel M; Hall, Richard J
2015-08-01
Bats harbour a diverse array of viruses, including significant human pathogens. Extensive metagenomic studies of material from bats, in particular guano, have revealed a large number of novel or divergent viral taxa that were previously unknown. New Zealand has only two extant indigenous terrestrial mammals, which are both bats, Mystacina tuberculata (the lesser short-tailed bat) and Chalinolobus tuberculatus (the long-tailed bat). Until the human introduction of exotic mammals, these species had been isolated from all other terrestrial mammals for over 1 million years (potentially over 16 million years for M. tuberculata). Four bat guano samples were collected from M. tuberculata roosts on the isolated offshore island of Whenua hou (Codfish Island) in New Zealand. Metagenomic analysis revealed that this species still hosts a plethora of divergent viruses. Whilst the majority of viruses detected were likely to be of dietary origin, some putative vertebrate virus sequences were identified. Papillomavirus, polyomavirus, calicivirus and hepevirus were found in the metagenomic data and subsequently confirmed using independent PCR assays and sequencing. The new hepevirus and calicivirus sequences may represent new genera within these viral families. Our findings may provide an insight into the origins of viral families, given their detection in an isolated host species.
Ventura, Marco; Jankovic, Ivana; Walker, D. Carey; Pridmore, R. David; Zink, Ralf
2002-01-01
We have identified and sequenced the genes encoding the aggregation-promoting factor (APF) protein from six different strains of Lactobacillus johnsonii and Lactobacillus gasseri. Both species harbor two apf genes, apf1 and apf2, which are in the same orientation and encode proteins of 257 to 326 amino acids. Multiple alignments of the deduced amino acid sequences of these apf genes demonstrate a very strong sequence conservation of all of the genes with the exception of their central regions. Northern blot analysis showed that both genes are transcribed, reaching their maximum expression during the exponential phase. Primer extension analysis revealed that apf1 and apf2 harbor a putative promoter sequence that is conserved in all of the genes. Western blot analysis of the LiCl cell extracts showed that APF proteins are located on the cell surface. Intact cells of L. johnsonii revealed the typical cell wall architecture of S-layer-carrying gram-positive eubacteria, which could be selectively removed with LiCl treatment. In addition, the amino acid composition, physical properties, and genetic organization were found to be quite similar to those of S-layer proteins. These results suggest that APF is a novel surface protein of the Lactobacillus acidophilus B-homology group which might belong to an S-layer-like family. PMID:12450842
Turmel, Monique; Otis, Christian; Lemieux, Claude
2005-01-01
Background The Streptophyta comprise all land plants and six monophyletic groups of charophycean green algae. Phylogenetic analyses of four genes from three cellular compartments support the following branching order for these algal lineages: Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Coleochaetales and Charales, with the last lineage being sister to land plants. Comparative analyses of the Mesostigma viride (Mesostigmatales) and land plant chloroplast genome sequences revealed that this genome experienced many gene losses, intron insertions and gene rearrangements during the evolution of charophyceans. On the other hand, the chloroplast genome of Chaetosphaeridium globosum (Coleochaetales) is highly similar to its land plant counterparts in terms of gene content, intron composition and gene order, indicating that most of the features characteristic of land plant chloroplast DNA (cpDNA) were acquired from charophycean green algae. To gain further insight into when the highly conservative pattern displayed by land plant cpDNAs originated in the Streptophyta, we have determined the cpDNA sequences of the distantly related zygnematalean algae Staurastrum punctulatum and Zygnema circumcarinatum. Results The 157,089 bp Staurastrum and 165,372 bp Zygnema cpDNAs encode 121 and 125 genes, respectively. Although both cpDNAs lack an rRNA-encoding inverted repeat (IR), they are substantially larger than Chaetosphaeridium and land plant cpDNAs. This increased size is explained by the expansion of intergenic spacers and introns. The Staurastrum and Zygnema genomes differ extensively from one another and from their streptophyte counterparts at the level of gene order, with the Staurastrum genome more closely resembling its land plant counterparts than does Zygnema cpDNA. Many intergenic regions in Zygnema cpDNA harbor tandem repeats. The introns in both Staurastrum (8 introns) and Zygnema (13 introns) cpDNAs represent subsets of those found in land plant cpDNAs. They represent 16 distinct insertion sites, only five of which are shared by the two zygnematalean genomes. Three of these insertions sites have not been identified in Chaetosphaeridium cpDNA. Conclusion The chloroplast genome experienced substantial changes in overall structure, gene order, and intron content during the evolution of the Zygnematales. Most of the features considered earlier as typical of land plant cpDNAs probably originated before the emergence of the Zygnematales and Coleochaetales. PMID:16236178
Khamrin, Pattara; Okitsu, Shoko; Ushijima, Hiroshi; Maneekarn, Niwat
2013-07-01
Epidemiological surveillance of human bocavirus (HBoV) was conducted on fecal specimens collected from hospitalized children with diarrhea in Chiang Mai, Thailand in 2011. By partial sequence analysis of VP1 gene, an unusual strain of HBoV (CMH-S011-11), was initially identified as HBoV4. The complete genome sequence of CMH-S011-11 was performed and analyzed further to clarify whether it was a recombinant strain or a new HBoV variant. Analysis of complete genome sequence revealed that the coding sequence starting from NS1, NP1 to VP1/VP2 was 4795 nucleotides long. Interestingly, the nucleotide sequence of NS1 gene of CMH-S011-11 was most closely related to the HBoV2 reference strains detected in Pakistan, which contradicted to the initial genotyping result of the partial VP1 region in the previous study. In addition, comparison of NP1 nucleotide sequence of CMH-S011-11 with those of other HBoV1-4 reference strains also revealed a high level of sequence identity with HBoV2. On the other hand, nucleotide sequence of VP1/VP2 gene of CMH-S011-11 was most closely related to those of HBoV4 reference strains detected in Nigeria. The overall full-length sequence analysis revealed that this CMH-S011-11 was grouped within HBoV4 species, but located in a separate branch from other HBoV4 prototype strains. Recombination analysis revealed that CMH-S011-11 was the result of recombination between HBoV2 and HBoV4 strains with the break point located near the start codon of VP2. Copyright © 2013 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Searles, D.B.
1993-03-01
The goal of the proposed work is the creation of a software system that will perform sophisticated pattern recognition and related functions at a level of abstraction and with expressive power beyond current general-purpose pattern-matching systems for biological sequences; and with a more uniform language, environment, and graphical user interface, and with greater flexibility, extensibility, embeddability, and ability to incorporate other algorithms, than current special-purpose analytic software.
Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi
2013-11-20
With the remarkable increase of genomic sequence data of microorganisms, novel tools are needed for comprehensive analyses of the big sequence data available. The self-organizing map (SOM) is an effective tool for clustering and visualizing high-dimensional data, such as oligonucleotide composition on one map. By modifying the conventional SOM, we developed batch-learning SOM (BLSOM), which allowed classification of sequence fragments (e.g., 1 kb) according to phylotypes, solely depending on oligonucleotide composition. Metagenomics studies of uncultivable microorganisms in clinical and environmental samples should allow extensive surveys of genes important in life sciences. BLSOM is most suitable for phylogenetic assignment of metagenomic sequences, because fragmental sequences can be clustered according to phylotypes, solely depending on oligonucleotide composition. We first constructed oligonucleotide BLSOMs for all available sequences from genomes of known species, and by mapping metagenomic sequences on these large-scale BLSOMs, we can predict phylotypes of individual metagenomic sequences, revealing a microbial community structure of uncultured microorganisms, including viruses. BLSOM has shown that influenza viruses isolated from humans and birds clearly differ in oligonucleotide composition. Based on this host-dependent oligonucleotide composition, we have proposed strategies for predicting directional changes of virus sequences and for surveilling potentially hazardous strains when introduced into humans from non-human sources.
Sakai, Hiroaki; Kanamori, Hiroyuki; Arai-Kichise, Yuko; Shibata-Hatta, Mari; Ebana, Kaworu; Oono, Youko; Kurita, Kanako; Fujisawa, Hiroko; Katagiri, Satoshi; Mukai, Yoshiyuki; Hamada, Masao; Itoh, Takeshi; Matsumoto, Takashi; Katayose, Yuichi; Wakasa, Kyo; Yano, Masahiro; Wu, Jianzhong
2014-01-01
Having a deep genetic structure evolved during its domestication and adaptation, the Asian cultivated rice (Oryza sativa) displays considerable physiological and morphological variations. Here, we describe deep whole-genome sequencing of the aus rice cultivar Kasalath by using the advanced next-generation sequencing (NGS) technologies to gain a better understanding of the sequence and structural changes among highly differentiated cultivars. The de novo assembled Kasalath sequences represented 91.1% (330.55 Mb) of the genome and contained 35 139 expressed loci annotated by RNA-Seq analysis. We detected 2 787 250 single-nucleotide polymorphisms (SNPs) and 7393 large insertion/deletion (indel) sites (>100 bp) between Kasalath and Nipponbare, and 2 216 251 SNPs and 3780 large indels between Kasalath and 93-11. Extensive comparison of the gene contents among these cultivars revealed similar rates of gene gain and loss. We detected at least 7.39 Mb of inserted sequences and 40.75 Mb of unmapped sequences in the Kasalath genome in comparison with the Nipponbare reference genome. Mapping of the publicly available NGS short reads from 50 rice accessions proved the necessity and the value of using the Kasalath whole-genome sequence as an additional reference to capture the sequence polymorphisms that cannot be discovered by using the Nipponbare sequence alone. PMID:24578372
Safary, Azam; Moniri, Rezvan; Hamzeh-Mivehroud, Maryam; Dastmalchi, Siavoush
2016-01-01
Purpose: Robust pharmaceutical and industrial enzymes from extremophile microorganisms are main source of enzymes with tremendous stability under harsh conditions which make them potential tools for commercial and biotechnological applications. Methods: The genome of a Gram-positive halo-thermotolerant Bacillus sp. SL1, new isolate from Saline Lake, was investigated for the presence of genes coding for potentially pharmaceutical enzymes. We determined gene sequences for the enzymes laccase (CotA), l-asparaginase (ansA3, ansA1), glutamate-specific endopeptidase (blaSE), l-arabinose isomerase (araA2), endo-1,4-β mannosidase (gmuG), glutaminase (glsA), pectate lyase (pelA), cellulase (bglC1), aldehyde dehydrogenase (ycbD) and allantoinases (pucH) in the genome of Bacillus sp. SL1. Results: Based on the DNA sequence alignment results, six of the studied enzymes of Bacillus sp. SL-1 showed 100% similarity at the nucleotide level to the same genes of B. licheniformis 14580 demonstrating extensive organizational relationship between these two strains. Despite high similarities between the B. licheniformis and Bacillus sp. SL-1 genomes, there are minor differences in the sequences of some enzyme. Approximately 30% of the enzyme sequences revealed more than 99% identity with some variations in nucleotides leading to amino acid substitution in protein sequences. Conclusion: Molecular characterization of this new isolate provides useful information regarding evolutionary relationship between B. subtilis and B. licheniformis species. Since, the most industrial processes are often performed in harsh conditions, enzymes from such halo-thermotolerant bacteria may provide economically and industrially appealing biocatalysts to be used under specific physicochemical situations in medical, pharmaceutical, chemical and other industries. PMID:28101462
Safary, Azam; Moniri, Rezvan; Hamzeh-Mivehroud, Maryam; Dastmalchi, Siavoush
2016-12-01
Purpose: Robust pharmaceutical and industrial enzymes from extremophile microorganisms are main source of enzymes with tremendous stability under harsh conditions which make them potential tools for commercial and biotechnological applications. Methods: The genome of a Gram-positive halo-thermotolerant Bacillus sp. SL1, new isolate from Saline Lake, was investigated for the presence of genes coding for potentially pharmaceutical enzymes. We determined gene sequences for the enzymes laccase (CotA), l-asparaginase (ansA3, ansA1), glutamate-specific endopeptidase (blaSE), l-arabinose isomerase (araA2), endo-1,4-β mannosidase (gmuG), glutaminase (glsA), pectate lyase (pelA), cellulase (bglC1), aldehyde dehydrogenase (ycbD) and allantoinases (pucH) in the genome of Bacillus sp. SL1. Results: Based on the DNA sequence alignment results, six of the studied enzymes of Bacillus sp. SL-1 showed 100% similarity at the nucleotide level to the same genes of B. licheniformis 14580 demonstrating extensive organizational relationship between these two strains. Despite high similarities between the B. licheniformis and Bacillus sp. SL-1 genomes, there are minor differences in the sequences of some enzyme. Approximately 30% of the enzyme sequences revealed more than 99% identity with some variations in nucleotides leading to amino acid substitution in protein sequences. Conclusion: Molecular characterization of this new isolate provides useful information regarding evolutionary relationship between B. subtilis and B. licheniformis species. Since, the most industrial processes are often performed in harsh conditions, enzymes from such halo-thermotolerant bacteria may provide economically and industrially appealing biocatalysts to be used under specific physicochemical situations in medical, pharmaceutical, chemical and other industries.
Ginkgo and Welwitschia Mitogenomes Reveal Extreme Contrasts in Gymnosperm Mitochondrial Evolution.
Guo, Wenhu; Grewe, Felix; Fan, Weishu; Young, Gregory J; Knoop, Volker; Palmer, Jeffrey D; Mower, Jeffrey P
2016-06-01
Mitochondrial genomes (mitogenomes) of flowering plants are well known for their extreme diversity in size, structure, gene content, and rates of sequence evolution and recombination. In contrast, little is known about mitogenomic diversity and evolution within gymnosperms. Only a single complete genome sequence is available, from the cycad Cycas taitungensis, while limited information is available for the one draft sequence, from Norway spruce (Picea abies). To examine mitogenomic evolution in gymnosperms, we generated complete genome sequences for the ginkgo tree (Ginkgo biloba) and a gnetophyte (Welwitschia mirabilis). There is great disparity in size, sequence conservation, levels of shared DNA, and functional content among gymnosperm mitogenomes. The Cycas and Ginkgo mitogenomes are relatively small, have low substitution rates, and possess numerous genes, introns, and edit sites; we infer that these properties were present in the ancestral seed plant. By contrast, the Welwitschia mitogenome has an expanded size coupled with accelerated substitution rates and extensive loss of these functional features. The Picea genome has expanded further, to more than 4 Mb. With regard to structural evolution, the Cycas and Ginkgo mitogenomes share a remarkable amount of intergenic DNA, which may be related to the limited recombinational activity detected at repeats in Ginkgo Conversely, the Welwitschia mitogenome shares almost no intergenic DNA with any other seed plant. By conducting the first measurements of rates of DNA turnover in seed plant mitogenomes, we discovered that turnover rates vary by orders of magnitude among species. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Abebe-Akele, Feseha; Tisa, Louis S; Cooper, Vaughn S; Hatcher, Philip J; Abebe, Eyualem; Thomas, W Kelley
2015-07-18
Entomopathogenic associations between nematodes in the genera Steinernema and Heterorhabdus with their cognate bacteria from the bacterial genera Xenorhabdus and Photorhabdus, respectively, are extensively studied for their potential as biological control agents against invasive insect species. These two highly coevolved associations were results of convergent evolution. Given the natural abundance of bacteria, nematodes and insects, it is surprising that only these two associations with no intermediate forms are widely studied in the entomopathogenic context. Discovering analogous systems involving novel bacterial and nematode species would shed light on the evolutionary processes involved in the transition from free living organisms to obligatory partners in entomopathogenicity. We report the complete genome sequence of a new member of the enterobacterial genus Serratia that forms a putative entomopathogenic complex with Caenorhabditis briggsae. Analysis of the 5.04 MB chromosomal genome predicts 4599 protein coding genes, seven sets of ribosomal RNA genes, 84 tRNA genes and a 64.8 KB plasmid encoding 74 genes. Comparative genomic analysis with three of the previously sequenced Serratia species, S. marcescens DB11 and S. proteamaculans 568, and Serratia sp. AS12, revealed that these four representatives of the genus share a core set of ~3100 genes and extensive structural conservation. The newly identified species shares a more recent common ancestor with S. marcescens with 99% sequence identity in rDNA sequence and orthology across 85.6% of predicted genes. Of the 39 genes/operons implicated in the virulence, symbiosis, recolonization, immune evasion and bioconversion, 21 (53.8%) were present in Serratia while 33 (84.6%) and 35 (89%) were present in Xenorhabdus and Photorhabdus EPN bacteria respectively. The majority of unique sequences in Serratia sp. SCBI (South African Caenorhabditis briggsae Isolate) are found in ~29 genomic islands of 5 to 65 genes and are enriched in putative functions that are biologically relevant to an entomopathogenic lifestyle, including non-ribosomal peptide synthetases, bacteriocins, fimbrial biogenesis, ushering proteins, toxins, secondary metabolite secretion and multiple drug resistance/efflux systems. By revealing the early stages of adaptation to this lifestyle, the Serratia sp. SCBI genome underscores the fact that in EPN formation the composite end result - killing, bioconversion, cadaver protection and recolonization- can be achieved by dissimilar mechanisms. This genome sequence will enable further study of the evolution of entomopathogenic nematode-bacteria complexes.
Biswas et al. describe an “exceptional responder” lung adenocarcinoma patient who survived with metastatic lung adenocarcinoma for 7 years while undergoing single or combination ERBB2-directed therapies. Whole-genome, whole-exome, and high-coverage ion-torrent targeted sequencing were used to demonstrate extreme genomic heterogeneity between the lung and lymph node metastatic
USDA-ARS?s Scientific Manuscript database
An extensive phylogenetic analysis and genus-level taxonomic revision of Paranoplocephala Lühe, 1910 -like cestodes (Cyclophyllidea, Anoplocephalidae) are presented. The phylogenetic analysis is based on DNA sequences of two partial mitochondrial genes, i.e. cytochrome c oxidase subunit 1 (cox1) and...
ERIC Educational Resources Information Center
Tamvacakis, Arianna N.; Senatore, Adriano; Katz, Paul S.
2015-01-01
The sea slug "Hermissenda crassicornis" (Mollusca, Gastropoda, Nudibranchia) has been studied extensively in associative learning paradigms. However, lack of genetic information previously hindered molecular-level investigations. Here, the "Hermissenda" brain transcriptome was sequenced and assembled de novo, producing 165,743…
Huber, Thomas; Herwerth, Marina; Alberts, Esther; Kirschke, Jan S; Zimmer, Claus; Ilg, Ruediger
2017-02-01
Adult-onset vanishing white-matter disease (VWM) is a rare autosomal recessive disease with neurological symptoms such as ataxia and paraparesis, showing extensive white-matter hyperintensities (WMH) on magnetic resonance (MR) imaging. Besides symptom-specific scores like the International Cooperative Ataxia Rating Scale (ICARS), there is no established tool to monitor disease progression. Because of extensive WMH, visual comparison of MR images is challenging. Here, we report the results of an automated method of segmentation to detect alterations in T2-weighted fluid-attenuated-inversion-recovery (FLAIR) sequences in a one-year follow-up study of a clinically stable patient with genetically diagnosed VWM. Signal alterations in MR imaging were quantified with a recently published WMH segmentation method by means of extreme value distribution (EVD). Our analysis revealed progressive FLAIR alterations of 5.84% in the course of one year, whereas no significant WMH change could be detected in a stable multiple sclerosis (MS) control group. This result demonstrates that automated EVD-based segmentation allows a precise and rapid quantification of extensive FLAIR alterations like in VWM and might be a powerful tool for the clinical and scientific monitoring of degenerative white-matter diseases and potential therapeutic interventions.
Carrot Juice Fermentations as Man-Made Microbial Ecosystems Dominated by Lactic Acid Bacteria.
Wuyts, Sander; Van Beeck, Wannes; Oerlemans, Eline F M; Wittouck, Stijn; Claes, Ingmar J J; De Boeck, Ilke; Weckx, Stefan; Lievens, Bart; De Vuyst, Luc; Lebeer, Sarah
2018-06-15
Spontaneous vegetable fermentations, with their rich flavors and postulated health benefits, are regaining popularity. However, their microbiology is still poorly understood, therefore raising concerns about food safety. In addition, such spontaneous fermentations form interesting cases of man-made microbial ecosystems. Here, samples from 38 carrot juice fermentations were collected through a citizen science initiative, in addition to three laboratory fermentations. Culturing showed that Enterobacteriaceae were outcompeted by lactic acid bacteria (LAB) between 3 and 13 days of fermentation. Metabolite-target analysis showed that lactic acid and mannitol were highly produced, as well as the biogenic amine cadaverine. High-throughput 16S rRNA gene sequencing revealed that mainly species of Leuconostoc and Lactobacillus (as identified by 8 and 20 amplicon sequence variants [ASVs], respectively) mediated the fermentations in subsequent order. The analyses at the DNA level still detected a high number of Enterobacteriaceae , but their relative abundance was low when RNA-based sequencing was performed to detect presumptive metabolically active bacterial cells. In addition, this method greatly reduced host read contamination. Phylogenetic placement indicated a high LAB diversity, with ASVs from nine different phylogenetic groups of the Lactobacillus genus complex. However, fermentation experiments with isolates showed that only strains belonging to the most prevalent phylogenetic groups preserved the fermentation dynamics. The carrot juice fermentation thus forms a robust man-made microbial ecosystem suitable for studies on LAB diversity and niche specificity. IMPORTANCE The usage of fermented food products by professional chefs is steadily growing worldwide. Meanwhile, this interest has also increased at the household level. However, many of these artisanal food products remain understudied. Here, an extensive microbial analysis was performed of spontaneous fermented carrot juices which are used as nonalcoholic alternatives for wine in a Belgian Michelin star restaurant. Samples were collected through an active citizen science approach with 38 participants, in addition to three laboratory fermentations. Identification of the main microbial players revealed that mainly species of Leuconostoc and Lactobacillus mediated the fermentations in subsequent order. In addition, a high diversity of lactic acid bacteria was found; however, fermentation experiments with isolates showed that only strains belonging to the most prevalent lactic acid bacteria preserved the fermentation dynamics. Finally, this study showed that the usage of RNA-based 16S rRNA amplicon sequencing greatly reduces host read contamination. Copyright © 2018 American Society for Microbiology.
2011-01-01
Background The classical perspective that interspecific hybridization in animals is rare has been changing due to a growing list of empirical examples showing the occurrence of gene flow between closely related species. Using sequence data from cyt b mitochondrial gene and three intron nuclear genes (RPL9, c-myc, and RPL3) we investigated patterns of nucleotide polymorphism and divergence between two closely related toad species R. marina and R. schneideri. By comparing levels of differentiation at nuclear and mtDNA levels we were able to describe patterns of introgression and infer the history of hybridization between these species. Results All nuclear loci are essentially concordant in revealing two well differentiated groups of haplotypes, corresponding to the morphologically-defined species R. marina and R. schneideri. Mitochondrial DNA analysis also revealed two well-differentiated groups of haplotypes but, in stark contrast with the nuclear genealogies, all R. schneideri sequences are clustered with sequences of R. marina from the right Amazon bank (RAB), while R. marina sequences from the left Amazon bank (LAB) are monophyletic. An Isolation-with-Migration (IM) analysis using nuclear data showed that R. marina and R. schneideri diverged at ≈ 1.69 Myr (early Pleistocene), while R. marina populations from LAB and RAB diverged at ≈ 0.33 Myr (middle Pleistocene). This time of divergence is not consistent with the split between LAB and RAB populations obtained with mtDNA data (≈ 1.59 Myr), which is notably similar to the estimate obtained with nuclear genes between R. marina and R. schneideri. Coalescent simulations of mtDNA phylogeny under the speciation history inferred from nuclear genes rejected the hypothesis of incomplete lineage sorting to explain the conflicting signal between mtDNA and nuclear-based phylogenies. Conclusions The cytonuclear discordance seems to reflect the occurrence of interspecific hybridization between these two closely related toad species. Overall, our results suggest a phenomenon of extensive mtDNA unidirectional introgression from the previously occurring R. schneideri into the invading R. marina. We hypothesize that climatic-induced range shifts during the Pleistocene/Holocene may have played an important role in the observed patterns of introgression. PMID:21939538
Dynamic modeling and optimization for space logistics using time-expanded networks
NASA Astrophysics Data System (ADS)
Ho, Koki; de Weck, Olivier L.; Hoffman, Jeffrey A.; Shishko, Robert
2014-12-01
This research develops a dynamic logistics network formulation for lifecycle optimization of mission sequences as a system-level integrated method to find an optimal combination of technologies to be used at each stage of the campaign. This formulation can find the optimal transportation architecture considering its technology trades over time. The proposed methodologies are inspired by the ground logistics analysis techniques based on linear programming network optimization. Particularly, the time-expanded network and its extension are developed for dynamic space logistics network optimization trading the quality of the solution with the computational load. In this paper, the methodologies are applied to a human Mars exploration architecture design problem. The results reveal multiple dynamic system-level trades over time and give recommendation of the optimal strategy for the human Mars exploration architecture. The considered trades include those between In-Situ Resource Utilization (ISRU) and propulsion technologies as well as the orbit and depot location selections over time. This research serves as a precursor for eventual permanent settlement and colonization of other planets by humans and us becoming a multi-planet species.
Gianoglio, Silvia; Moglia, Andrea; Acquadro, Alberto; Comino, Cinzia; Portis, Ezio
2017-01-01
Changes to the cytosine methylation status of DNA, driven by the activity of C5 methyltransferases (C5-MTases) and demethylases, exert an important influence over development, transposon movement, gene expression and imprinting. Three groups of C5-MTase enzymes have been identified in plants, namely MET (methyltransferase 1), CMT (chromomethyltransferases) and DRM (domains rearranged methyltransferases). Here the repertoire of genes encoding C5-MTase and demethylase by the globe artichoke (Cynara cardunculus var. scolymus) is described, based on sequence homology, a phylogenetic analysis and a characterization of their functional domains. A total of ten genes encoding C5-MTase (one MET, five CMTs and four DRMs) and five demethylases was identified. An analysis of their predicted product's protein structure suggested an extensive level of conservation has been retained by the C5-MTases. Transcriptional profiling based on quantitative real time PCR revealed a number of differences between the genes encoding maintenance and de novo methyltransferases, sometimes in a tissue- or development-dependent manner, which implied a degree of functional specialization.
Warejko, Jillian K; Schueler, Markus; Vivante, Asaf; Tan, Weizhen; Daga, Ankana; Lawson, Jennifer A; Braun, Daniela A; Shril, Shirlee; Amann, Kassaundra; Somers, Michael J G; Rodig, Nancy M; Baum, Michelle A; Daouk, Ghaleb; Traum, Avram Z; Kim, Heung Bae; Vakili, Khashayar; Porras, Diego; Lock, James; Rivkin, Michael J; Chaudry, Gulraiz; Smoot, Leslie B; Singh, Michael N; Smith, Edward R; Mane, Shrikant M; Lifton, Richard P; Stein, Deborah R; Ferguson, Michael A; Hildebrandt, Friedhelm
2018-04-01
Midaortic syndrome (MAS) is a rare cause of severe childhood hypertension characterized by narrowing of the abdominal aorta in children and is associated with extensive vascular disease. It may occur as part of a genetic syndrome, such as neurofibromatosis, or as consequence of a pathological inflammatory disease. However, most cases are considered idiopathic. We hypothesized that in a high percentage of these patients, a monogenic cause of disease may be detected by evaluating whole exome sequencing data for mutations in 1 of 38 candidate genes previously described to cause vasculopathy. We studied a cohort of 36 individuals from 35 different families with MAS by exome sequencing. In 15 of 35 families (42.9%), we detected likely causal dominant mutations. In 15 of 35 (42.9%) families with MAS, whole exome sequencing revealed a mutation in one of the genes previously associated with vascular disease ( NF1 , JAG1 , ELN , GATA6 , and RNF213 ). Ten of the 15 mutations have not previously been reported. This is the first report of ELN , RNF213 , or GATA6 mutations in individuals with MAS. Mutations were detected in NF1 (6/15 families), JAG1 (4/15 families), ELN (3/15 families), and one family each for GATA6 and RNF213 Eight individuals had syndromic disease and 7 individuals had isolated MAS. Whole exome sequencing can provide conclusive molecular genetic diagnosis in a high fraction of individuals with syndromic or isolated MAS. Establishing an etiologic diagnosis may reveal genotype/phenotype correlations for MAS in the future and should, therefore, be performed routinely in MAS. © 2018 American Heart Association, Inc.
How many novel eukaryotic 'kingdoms'? Pitfalls and limitations of environmental DNA surveys
Berney, Cédric; Fahrni, José; Pawlowski, Jan
2004-01-01
Background Over the past few years, the use of molecular techniques to detect cultivation-independent, eukaryotic diversity has proven to be a powerful approach. Based on small-subunit ribosomal RNA (SSU rRNA) gene analyses, these studies have revealed the existence of an unexpected variety of new phylotypes. Some of them represent novel diversity in known eukaryotic groups, mainly stramenopiles and alveolates. Others do not seem to be related to any molecularly described lineage, and have been proposed to represent novel eukaryotic kingdoms. In order to review the evolutionary importance of this novel high-level eukaryotic diversity critically, and to test the potential technical and analytical pitfalls and limitations of eukaryotic environmental DNA surveys (EES), we analysed 484 environmental SSU rRNA gene sequences, including 81 new sequences from sediments of the small river, the Seymaz (Geneva, Switzerland). Results Based on a detailed screening of an exhaustive alignment of eukaryotic SSU rRNA gene sequences and the phylogenetic re-analysis of previously published environmental sequences using Bayesian methods, our results suggest that the number of novel higher-level taxa revealed by previously published EES was overestimated. Three main sources of errors are responsible for this situation: (1) the presence of undetected chimeric sequences; (2) the misplacement of several fast-evolving sequences; and (3) the incomplete sampling of described, but yet unsequenced eukaryotes. Additionally, EES give a biased view of the diversity present in a given biotope because of the difficult amplification of SSU rRNA genes in some taxonomic groups. Conclusions Environmental DNA surveys undoubtedly contribute to reveal many novel eukaryotic lineages, but there is no clear evidence for a spectacular increase of the diversity at the kingdom level. After re-analysis of previously published data, we found only five candidate lineages of possible novel high-level eukaryotic taxa, two of which comprise several phylotypes that were found independently in different studies. To ascertain their taxonomic status, however, the organisms themselves have now to be identified. PMID:15176975
Chaban, Bonnie; Chu, Shirley; Hendrick, Steven; Waldner, Cheryl; Hill, Janet E.
2012-01-01
The detection and subspeciation of Campylobacter fetus subsp. venerealis (CFV) from veterinary samples is important for both clinical and economic reasons. Campylobacter fetus subsp. venerealis is the causative agent of bovine genital campylobacteriosis, a venereal disease that can lead to serious reproductive problems in cattle, and strict international regulations require animals and animal products to be CFV-free for trade. This study evaluated methods reported in the literature for CFV detection and reports the translation of an extensively tested CFV-specific polymerase chain reaction (PCR) primer set; including the VenSF/VenSR primers and a real-time, quantitative PCR (qPCR) platform using SYBR Green chemistry. Three methods of preputial sample preparation for direct qPCR were evaluated and a heat lysis DNA extraction method was shown to allow for CFV detection at the level of approximately one cell equivalent per reaction (or 1.0 × 103 CFU/mL) from prepuce. The optimized sample preparation and qPCR protocols were then used to evaluate 3 western Canadian bull cohorts, which included 377 bulls, for CFV. The qPCR assay detected 11 positive bulls for the CFV-specific parA gene target. DNA sequence data confirmed the identity of the amplified product and revealed that positive samples were comprised of 2 sequence types; one identical to previously reported CFV parA gene sequences and one with a 9% sequence divergence. These results add valuable information towards our understanding of an important CFV subspeciation target and offer a significantly improved format for an internationally recognized PCR test. PMID:23277694
Comparative analysis of the small RNA transcriptomes of Pinus contorta and Oryza sativa
Morin, Ryan D.; Aksay, Gozde; Dolgosheina, Elena; Ebhardt, H. Alexander; Magrini, Vincent; Mardis, Elaine R.; Sahinalp, S. Cenk; Unrau, Peter J.
2008-01-01
The diversity of microRNAs and small-interfering RNAs has been extensively explored within angiosperms by focusing on a few key organisms such as Oryza sativa and Arabidopsis thaliana. A deeper division of the plants is defined by the radiation of the angiosperms and gymnosperms, with the latter comprising the commercially important conifers. The conifers are expected to provide important information regarding the evolution of highly conserved small regulatory RNAs. Deep sequencing provides the means to characterize and quantitatively profile small RNAs in understudied organisms such as these. Pyrosequencing of small RNAs from O. sativa revealed, as expected, ∼21- and ∼24-nt RNAs. The former contained known microRNAs, and the latter largely comprised intergenic-derived sequences likely representing heterochromatin siRNAs. In contrast, sequences from Pinus contorta were dominated by 21-nt small RNAs. Using a novel sequence-based clustering algorithm, we identified sequences belonging to 18 highly conserved microRNA families in P. contorta as well as numerous clusters of conserved small RNAs of unknown function. Using multiple methods, including expressed sequence folding and machine learning algorithms, we found a further 53 candidate novel microRNA families, 51 appearing specific to the P. contorta library. In addition, alignment of small RNA sequences to the O. sativa genome revealed six perfectly conserved classes of small RNA that included chloroplast transcripts and specific types of genomic repeats. The conservation of microRNAs and other small RNAs between the conifers and the angiosperms indicates that important RNA silencing processes were highly developed in the earliest spermatophytes. Genomic mapping of all sequences to the O. sativa genome can be viewed at http://microrna.bcgsc.ca/cgi-bin/gbrowse/rice_build_3/. PMID:18323537
NASA Astrophysics Data System (ADS)
Ielpi, Alessandro
2012-07-01
A late Pliocene incised valley fill to lacustrine succession, which contains an interbedded brown coal seam (< 20 m thick), is examined in terms of facies analysis, physical stratigraphy and sequence architecture. The succession (< 50 m thick) constitutes the first depositional event of the Castelnuovo Synthem, which is the oldest unconformity bounded stratigraphic unit of the nonmarine Upper Valdarno Basin, Northern Apennines (Italy). The integration of field surveys and borehole logs identified the following event sequence: first valley filling stages by coarse alluvial fan and channelised streams; the progressive setting of low gradient floodbasins with shallow floodplain lakes; subsequent major waterlogging and extensive peat mire development; and system drowning and establishment of permanent lacustrine conditions. The deposits are grouped in a set of nested valley fills and are arranged as high-frequency depositional sequences. The sequences are bounded by minor erosive truncations and have distinctive upward trends: lowstand system tract thinning; transgressive system tract thickening; highstand system tract thinning and eventual non-deposition; and the smoothing of along-sequence boundary sub-aerial incisions. Such features fit in with the notion of an idealised model where second-order (high-frequency) fluctuations, modulated by first-order (low-frequency) base-level rising, have short-lived standing + falling phases and prolonged transgressions, respectively. Furthermore, the general sequence architecture reveals how a mixed palustrine-siliciclastic system differs substantially from a purely siliciclastic one. In the transgressive phases, terrigenous starvation induces prevailing peat accumulation, generating abnormally thick transgressive system tracts that eventually come to occupy much of the same transgression-generated accommodation space. In the highstand phases, the development of thick highstand system tracts is then prevented by sediment upstream trapping due to retrogressive fluvial aggradations, probably coupled with low-accommodation settings inherited from the transgressive phases.
He, Ran; Zu, Li-Dong; Yao, Peng; Chen, Xin; Wang, En-Duo
2009-02-01
In human cytoplasm, nine aminoacyl-tRNA synthetases (aaRSs) and three protein factors form a multi-synthetase complex (MSC). Human cytosolic methionyl-tRNA synthetase (hcMetRS) is a component of the MSC. Sequence alignment revealed that hcMetRS has an N-terminal extension of 267 amino acid residues. This extension can be divided into three sub-domains: GST-like, GN, and GC sub-domains. The effect of each sub-domain in the N-terminal extension of hcMetRS on enzymatic activity and incorporation into the MSC was studied. The results of cellular assay showed that the GST-like sub-domain was responsible for the incorporation of hcMetRS into the MSC. The entire N-terminal extension of hcMetRS is indispensable for the enzymatic activity. Deletion mutagenesis revealed that a seven-amino acid motif within the sub-domain GC was important for the activity of amino acid activation. A conserved proline residue within the seven-amino acid motif was crucial, while the other six residues were moderately important for the amino acid activation activity. Thus, the last 15 residues of previously defined N-terminal extension of hcMetRS was a part of the catalytic domain; whereas the first 252 residues of hcMetRS constitute the N-terminal extended domain of hcMetRS. The formerly defined N-terminal extension of hcMetRS possesses two functions of two different domains.
Arnold, Frances H.; Shao, Zhixin; Zhao, Huimin; Giver, Lorraine J.
2002-01-01
A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.
2013-01-01
Background Birds have a ZZ male: ZW female sex chromosome system and while the Z-linked DMRT1 gene is necessary for testis development, the exact mechanism of sex determination in birds remains unsolved. This is partly due to the poor annotation of the W chromosome, which is speculated to carry a female determinant. Few genes have been mapped to the W and little is known of their expression. Results We used RNA-seq to produce a comprehensive profile of gene expression in chicken blastoderms and embryonic gonads prior to sexual differentiation. We found robust sexually dimorphic gene expression in both tissues pre-dating gonadogenesis, including sex-linked and autosomal genes. This supports the hypothesis that sexual differentiation at the molecular level is at least partly cell autonomous in birds. Different sets of genes were sexually dimorphic in the two tissues, indicating that molecular sexual differentiation is tissue specific. Further analyses allowed the assembly of full-length transcripts for 26 W chromosome genes, providing a view of the W transcriptome in embryonic tissues. This is the first extensive analysis of W-linked genes and their expression profiles in early avian embryos. Conclusion Sexual differentiation at the molecular level is established in chicken early in embryogenesis, before gonadal sex differentiation. We find that the W chromosome is more transcriptionally active than previously thought, expand the number of known genes to 26 and present complete coding sequences for these W genes. This includes two novel W-linked sequences and three small RNAs reassigned to the W from the Un_Random chromosome. PMID:23531366
Yan, Jianmin; Campbell, James H.; Glick, Bernard R.; Smith, Matthew D.; Liang, Yan
2014-01-01
The translocon at the outer envelope membrane of chloroplasts (Toc) mediates the recognition and initial import into the organelle of thousands of nucleus-encoded proteins. These proteins are translated in the cytosol as precursor proteins with cleavable amino-terminal targeting sequences called transit peptides. The majority of the known Toc components that mediate chloroplast protein import were originally identified in pea, and more recently have been studied most extensively in Arabidopsis. With the completion of the tomato genome sequencing project, it is now possible to identify putative homologues of the chloroplast import components in tomato. In the work reported here, the Toc GTPase cDNAs from tomato were identified, cloned and analyzed. The analysis revealed that there are four Toc159 homologues (slToc159-1, -2, -3 and -4) and two Toc34 homologues (slToc34-1 and -2) in tomato, and it was shown that tomato Toc159 and Toc34 homologues share high sequence similarity with the comparable import apparatus components from Arabidopsis and pea. Thus, tomato is a valid model for further study of this system. The expression level of Toc complex components was also investigated in different tissues during tomato development. The two tomato Toc34 homologues are expressed at higher levels in non-photosynthetic tissues, whereas, the expression of two tomato Toc159 homologues, slToc159-1 and slToc159-4, were higher in photosynthetic tissues, and the expression patterns of slToc159-2 was not significantly different in photosynthetic and non-photosynthetic tissues, and slToc159-3 expression was limited to a few select tissues. PMID:24751891
Hsieh, Tsung-Han; Liu, Yun-Ru; Chang, Ting-Yu; Liang, Muh-Lii; Chen, Hsin-Hung; Wang, Hsei-Wei; Yen, Yun; Wong, Tai-Tong
2018-03-27
Pediatric central nervous system germ cell tumors (CNSGCTs) are rare and heterogeneous neoplasms, which can be divided into germinomas and nongerminomatous germ cell tumors (NGGCTs). NGGCTs are further subdivided into mature teratomas and nongerminomatous malignant GCTs (NGMGCTs). Clinical outcomes suggest that NGMGCTs have poor prognosis and survival and that they require more extensive radiotherapy and adjuvant chemotherapy. However, the mechanisms underlying this difference are still unclear. DNA methylation alteration is generally acknowledged to cause therapeutic resistance in cancers. We hypothesized that the pediatric NGMGCTs exhibit a different genome-wide DNA methylation pattern, which is involved in the mechanism of its therapeutic resistance. We performed methylation and hydroxymethylation DNA immunoprecipitation sequencing, mRNA expression microarray, and small RNA sequencing (smRNA-seq) to determine methylation-regulated genes, including microRNAs (miRNAs). The expression levels of 97 genes and 8 miRNAs were correlated with promoter DNA methylation and hydroxymethylation status, such as the miR-199/-214 cluster, and treatment with DNA demethylating agent 5-aza-2'-deoxycytidine elevated its expression level. Furthermore, smRNA-seq analysis showed 27 novel miRNA candidates with differential expression between germinomas and NGMGCTs. Overexpresssion of miR-214-3p in NCCIT cells leads to reduced expression of the pro-apoptotic protein BCL2-like 11 and induces cisplatin resistance. We interrogated the differential DNA methylation patterns between germinomas and NGMGCTs and proposed a mechanism for chemoresistance in NGMGCTs. In addition, our sequencing data provide a roadmap for further pediatric CNSGCT research and potential targets for the development of new therapeutic strategies.
RNA Editing During Sexual Development Occurs in Distantly Related Filamentous Ascomycetes
Teichert, Ines; Dahlmann, Tim A.; Kück, Ulrich
2017-01-01
RNA editing is a post-transcriptional process that modifies RNA molecules leading to transcript sequences that differ from their template DNA. A-to-I editing was found to be widely distributed in nuclear transcripts of metazoa, but was detected in fungi only recently in a study of the filamentous ascomycete Fusarium graminearum that revealed extensive A-to-I editing of mRNAs in sexual structures (fruiting bodies). Here, we searched for putative RNA editing events in RNA-seq data from Sordaria macrospora and Pyronema confluens, two distantly related filamentous ascomycetes, and in data from the Taphrinomycete Schizosaccharomyces pombe. Like F. graminearum, S. macrospora is a member of the Sordariomycetes, whereas P. confluens belongs to the early-diverging group of Pezizomycetes. We found extensive A-to-I editing in RNA-seq data from sexual mycelium from both filamentous ascomycetes, but not in vegetative structures. A-to-I editing was not detected in different stages of meiosis of S. pombe. A comparison of A-to-I editing in S. macrospora with F. graminearum and P. confluens, respectively, revealed little conservation of individual editing sites. An analysis of RNA-seq data from two sterile developmental mutants of S. macrospora showed that A-to-I editing is strongly reduced in these strains. Sequencing of cDNA fragments containing more than one editing site from P. confluens showed that at the beginning of sexual development, transcripts were incompletely edited or unedited, whereas in later stages transcripts were more extensively edited. Taken together, these data suggest that A-to-I RNA editing is an evolutionary conserved feature during fruiting body development in filamentous ascomycetes. PMID:28338982
RNA Editing During Sexual Development Occurs in Distantly Related Filamentous Ascomycetes.
Teichert, Ines; Dahlmann, Tim A; Kück, Ulrich; Nowrousian, Minou
2017-04-01
RNA editing is a post-transcriptional process that modifies RNA molecules leading to transcript sequences that differ from their template DNA. A-to-I editing was found to be widely distributed in nuclear transcripts of metazoa, but was detected in fungi only recently in a study of the filamentous ascomycete Fusarium graminearum that revealed extensive A-to-I editing of mRNAs in sexual structures (fruiting bodies). Here, we searched for putative RNA editing events in RNA-seq data from Sordaria macrospora and Pyronema confluens, two distantly related filamentous ascomycetes, and in data from the Taphrinomycete Schizosaccharomyces pombe. Like F. graminearum, S. macrospora is a member of the Sordariomycetes, whereas P. confluens belongs to the early-diverging group of Pezizomycetes. We found extensive A-to-I editing in RNA-seq data from sexual mycelium from both filamentous ascomycetes, but not in vegetative structures. A-to-I editing was not detected in different stages of meiosis of S. pombe. A comparison of A-to-I editing in S. macrospora with F. graminearum and P. confluens, respectively, revealed little conservation of individual editing sites. An analysis of RNA-seq data from two sterile developmental mutants of S. macrospora showed that A-to-I editing is strongly reduced in these strains. Sequencing of cDNA fragments containing more than one editing site from P. confluens showed that at the beginning of sexual development, transcripts were incompletely edited or unedited, whereas in later stages transcripts were more extensively edited. Taken together, these data suggest that A-to-I RNA editing is an evolutionary conserved feature during fruiting body development in filamentous ascomycetes. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
A proposal to rename the hyperthermophile Pyrococcus woesei as Pyrococcus furiosus subsp. woesei.
Kanoksilapatham, Wirojne; González, Juan M; Maeder, Dennis L; DiRuggiero, Jocelyne; Robb, Frank T
2004-10-01
Pyrococcus species are hyperthermophilic members of the order Thermococcales, with optimal growth temperatures approaching 100 degrees C. All species grow heterotrophically and produce H2 or, in the presence of elemental sulfur (S(o)), H2S. Pyrococcus woesei and P. furiosus were isolated from marine sediments at the same Vulcano Island beach site and share many morphological and physiological characteristics. We report here that the rDNA operons of these strains have identical sequences, including their intergenic spacer regions and part of the 23S rRNA. Both species grow rapidly and produce H2 in the presence of 0.1% maltose and 10-100 microM sodium tungstate in S(o)-free medium. However, P. woesei shows more extensive autolysis than P. furiosus in the stationary phase. Pyrococcus furiosus and P. woesei share three closely related families of insertion sequences (ISs). A Southern blot performed with IS probes showed extensive colinearity between the genomes of P. woesei and P. furiosus. Cloning and sequencing of ISs that were in different contexts in P. woesei and P. furiosus revealed that the napA gene in P. woesei is disrupted by a type III IS element, whereas in P. furiosus, this gene is intact. A type I IS element, closely linked to the napA gene, was observed in the same context in both P. furiosus and P. woesei genomes. Our results suggest that the IS elements are implicated in genomic rearrangements and reshuffling in these closely related strains. We propose to rename P. woesei a subspecies of P. furiosus based on their identical rDNA operon sequences, many common IS elements that are shared genomic markers, and the observation that all P. woesei nucleotide sequences deposited in GenBank to date are > 99% identical to P. furiosus sequences.
Molecular Population Genetics of the Alcohol Dehydrogenase Gene Region of DROSOPHILA MELANOGASTER
Aquadro, Charles F.; Desse, Susan F.; Bland, Molly M.; Langley, Charles H.; Laurie-Ahlberg, Cathy C.
1986-01-01
Variation in the DNA restriction map of a 13-kb region of chromosome II including the alcohol dehydrogenase structural gene (Adh) was examined in Drosophila melanogaster from natural populations. Detailed analysis of 48 D. melanogaster lines representing four eastern United States populations revealed extensive DNA sequence variation due to base substitutions, insertions and deletions. Cloning of this region from several lines allowed characterization of length variation as due to unique sequence insertions or deletions [nine sizes; 21–200 base pairs (bp)] or transposable element insertions (several sizes, 340 bp to 10.2 kb, representing four different elements). Despite this extensive variation in sequences flanking the Adh gene, only one length polymorphism is clearly associated with altered Adh expression (a copia element approximately 250 bp 5' to the distal transcript start site). Nonetheless, the frequency spectra of transposable elements within and between Drosophila species suggests they are slightly deleterious. Strong nonrandom associations are observed among Adh region sequence variants, ADH allozyme (Fast vs. Slow), ADH enzyme activity and the chromosome inversion ln(2L) t. Phylogenetic analysis of restriction map haplotypes suggest that the major twofold component of ADH activity variation (high vs. low, typical of Fast and Slow allozymes, respectively) is due to sequence variation tightly linked to and possibly distinct from that underlying the allozyme difference. The patterns of nucleotide and haplotype variation for Fast and Slow allozyme lines are consistent with the recent increase in frequency and spread of the Fast haplotype associated with high ADH activity. These data emphasize the important role of evolutionary history and strong nonrandom associations among tightly linked sequence variation as determinants of the patterns of variation observed in natural populations. PMID:3026893
pyPaSWAS: Python-based multi-core CPU and GPU sequence alignment.
Warris, Sven; Timal, N Roshan N; Kempenaar, Marcel; Poortinga, Arne M; van de Geest, Henri; Varbanescu, Ana L; Nap, Jan-Peter
2018-01-01
Our previously published CUDA-only application PaSWAS for Smith-Waterman (SW) sequence alignment of any type of sequence on NVIDIA-based GPUs is platform-specific and therefore adopted less than could be. The OpenCL language is supported more widely and allows use on a variety of hardware platforms. Moreover, there is a need to promote the adoption of parallel computing in bioinformatics by making its use and extension more simple through more and better application of high-level languages commonly used in bioinformatics, such as Python. The novel application pyPaSWAS presents the parallel SW sequence alignment code fully packed in Python. It is a generic SW implementation running on several hardware platforms with multi-core systems and/or GPUs that provides accurate sequence alignments that also can be inspected for alignment details. Additionally, pyPaSWAS support the affine gap penalty. Python libraries are used for automated system configuration, I/O and logging. This way, the Python environment will stimulate further extension and use of pyPaSWAS. pyPaSWAS presents an easy Python-based environment for accurate and retrievable parallel SW sequence alignments on GPUs and multi-core systems. The strategy of integrating Python with high-performance parallel compute languages to create a developer- and user-friendly environment should be considered for other computationally intensive bioinformatics algorithms.
Extensive Mobilome-Driven Genome Diversification in Mouse Gut-Associated Bacteroides vulgatus mpk
Lange, Anna; Beier, Sina; Steimle, Alex; Autenrieth, Ingo B.; Huson, Daniel H.; Frick, Julia-Stefanie
2016-01-01
Like many other Bacteroides species, Bacteroides vulgatus strain mpk, a mouse fecal isolate which was shown to promote intestinal homeostasis, utilizes a variety of mobile elements for genome evolution. Based on sequences collected by Pacific Biosciences SMRT sequencing technology, we discuss the challenges of assembling and studying a bacterial genome of high plasticity. Additionally, we conducted comparative genomics comparing this commensal strain with the B. vulgatus type strain ATCC 8482 as well as multiple other Bacteroides and Parabacteroides strains to reveal the most important differences and identify the unique features of B. vulgatus mpk. The genome of B. vulgatus mpk harbors a large and diverse set of mobile element proteins compared with other sequenced Bacteroides strains. We found evidence of a number of different horizontal gene transfer events and a genome landscape that has been extensively altered by different mobilization events. A CRISPR/Cas system could be identified that provides a possible mechanism for preventing the integration of invading external DNA. We propose that the high genome plasticity and the introduced genome instabilities of B. vulgatus mpk arising from the various mobilization events might play an important role not only in its adaptation to the challenging intestinal environment in general, but also in its ability to interact with the gut microbiota. PMID:27071651
A survey of the sorghum transcriptome using single-molecule long reads
Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...
2016-06-24
Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novelmore » splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.« less
A survey of the sorghum transcriptome using single-molecule long reads
Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.
2016-01-01
Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290
2013-01-01
Inherited retinal degenerative diseases (RDDs) display wide variation in their mode of inheritance, underlying genetic defects, age of onset, and phenotypic severity. Molecular mechanisms have not been delineated for many retinal diseases, and treatment options are limited. In most instances, genotype-phenotype correlations have not been elucidated because of extensive clinical and genetic heterogeneity. Next-generation sequencing (NGS) methods, including exome, genome, transcriptome and epigenome sequencing, provide novel avenues towards achieving comprehensive understanding of the genetic architecture of RDDs. Whole-exome sequencing (WES) has already revealed several new RDD genes, whereas RNA-Seq and ChIP-Seq analyses are expected to uncover novel aspects of gene regulation and biological networks that are involved in retinal development, aging and disease. In this review, we focus on the genetic characterization of retinal and macular degeneration using NGS technology and discuss the basic framework for further investigations. We also examine the challenges of NGS application in clinical diagnosis and management. PMID:24112618
Khanna, Namita; Ghosh, Ananta Kumar; Huntemann, Marcel; Deshpande, Shweta; Han, James; Chen, Amy; Kyrpides, Nikos; Mavrommatis, Kostas; Szeto, Ernest; Markowitz, Victor; Ivanova, Natalia; Pagani, Ioanna; Pati, Amrita; Pitluck, Sam; Nolan, Matt; Woyke, Tanja; Teshima, Hazuki; Chertkov, Olga; Daligault, Hajnalka; Davenport, Karen; Gu, Wei; Munk, Christine; Zhang, Xiaojing; Bruce, David; Detter, Chris; Xu, Yan; Quintana, Beverly; Reitenga, Krista; Kunde, Yulia; Green, Lance; Erkkila, Tracy; Han, Cliff; Brambilla, Evelyne-Marie; Lang, Elke; Klenk, Hans-Peter; Goodwin, Lynne; Chain, Patrick; Das, Debabrata
2013-12-20
Enterobacter sp. IIT-BT 08 belongs to Phylum: Proteobacteria, Class: Gammaproteobacteria, Order: Enterobacteriales, Family: Enterobacteriaceae. The organism was isolated from the leaves of a local plant near the Kharagpur railway station, Kharagpur, West Bengal, India. It has been extensively studied for fermentative hydrogen production because of its high hydrogen yield. For further enhancement of hydrogen production by strain development, complete genome sequence analysis was carried out. Sequence analysis revealed that the genome was linear, 4.67 Mbp long and had a GC content of 56.01%. The genome properties encode 4,393 protein-coding and 179 RNA genes. Additionally, a putative pathway of hydrogen production was suggested based on the presence of formate hydrogen lyase complex and other related genes identified in the genome. Thus, in the present study we describe the specific properties of the organism and the generation, annotation and analysis of its genome sequence as well as discuss the putative pathway of hydrogen production by this organism.
A corticostriatal deficit promotes temporal distortion of automatic action in ageing
Matamales, Miriam; Skrbis, Zala; Bailey, Matthew R; Balsam, Peter D; Balleine, Bernard W; Götz, Jürgen
2017-01-01
The acquisition of motor skills involves implementing action sequences that increase task efficiency while reducing cognitive loads. This learning capacity depends on specific cortico-basal ganglia circuits that are affected by normal ageing. Here, combining a series of novel behavioural tasks with extensive neuronal mapping and targeted cell manipulations in mice, we explored how ageing of cortico-basal ganglia networks alters the microstructure of action throughout sequence learning. We found that, after extended training, aged mice produced shorter actions and displayed squeezed automatic behaviours characterised by ultrafast oligomeric action chunks that correlated with deficient reorganisation of corticostriatal activity. Chemogenetic disruption of a striatal subcircuit in young mice reproduced age-related within-sequence features, and the introduction of an action-related feedback cue temporarily restored normal sequence structure in aged mice. Our results reveal static properties of aged cortico-basal ganglia networks that introduce temporal limits to action automaticity, something that can compromise procedural learning in ageing. PMID:29058672
Adaptation to chronic MG132 reduces oxidative toxicity by a CuZnSOD-dependent mechanism
Leak, Rehana K.; Zigmond, Michael J.; Liou, Anthony K. F.
2010-01-01
To study whether and how cells adapt to chronic cellular stress, we exposed PC12 cells to the proteasome inhibitor MG132 (0.1 μM) for 2 weeks and longer. This treatment reduced chymotrypsin-like proteasome activity by 47% and was associated with protection against both 6-hydroxydopamine (6-OHDA, 100 μM) and higher dose MG132 (40 μM). Protection developed slowly over the course of the first 2 weeks of exposure and was chronic thereafter. There was no change in total glutathione levels after MG132. Buthionine sulfoximine (100 μM) reduced glutathione levels by 60%, but exacerbated 6-OHDA toxicity to the same extent in both MG132-treated and control cells and failed to reduce MG132-induced protection. Chronic MG132 resulted in elevated antioxidant proteins CuZn superoxide dismutase (SOD, +55%), MnSOD (+21%), and catalase (+15%), as well as chaperone heat shock protein 70 (+42%). Examination of SOD enzyme activity revealed higher levels of CuZnSOD (+40%), with no change in MnSOD. We further assessed the mechanism of protection by reducing CuZnSOD levels with two independent siRNA sequences, both of which successfully attenuated protection against 6-OHDA. Previous reports suggested that artificial overexpression of CuZnSOD in dopaminergic cells is protective. Our data complement such observations, revealing that dopaminergic cells are also able to use endogenous CuZnSOD in self-defensive adaptations to chronic stress, and that they can even do so in the face of extensive glutathione loss. PMID:18466318
Layered Deposits of Arabia Terra and Meridiani Planum: Keys to the Habitability of Ancient Mars
NASA Technical Reports Server (NTRS)
Allen, Carlton C.; Oehler, Dorothy Z.; Paris, Kristen N.; Venechuk, Elizabeth M.
2006-01-01
Understanding the habitability of ancient Mars is a key goal in the exploration of that planet. Evidence for conditions favorable to early life must be sought in ancient sedimentary rocks, such as those of Arabia Terra and Meridiani Planum. Arabia Terra, the northernmost extension of the ancient highlands, is dominated by cratered plains and minor ridged units. These plains extend south into the adjacent Meridiani Planum. The Opportunity rover landed in northern Meridiani, close to the border with Arabia. High resolution MOC images reveal extensive layered sequences across much of the Arabia and Meridiani region. These layers have been interpreted as eroded remnants of sedimentary rock deposits (Edgett, 2005). The layered sequences are concentrated in the SW quadrant of Arabia and in northern Meridiani. Preliminary mapping by Edgett (2005) distinguished four large scale layered sequences in the Arabia and Meridiani region. These have dimensions of hundreds to more than 1,000 km. MOLA altimetry shows that each of the sequences can attain a thickness of 200 to 400 m, with a total thickness greater than 1 km. The sequences are generally flat lying, with regional slopes of a few degrees. Much finer layering is evident within a number of craters. The plains and ridged units of the Arabia and Meridiani region were originally mapped as Noachian based on crater statistics, particularly the number of large craters (Scott and Carr, 1978). The layered sequences in the current study postdate many, but not all, of these large craters. The layered sequences have partially or totally filled a number of craters with diameters ranging from 20 to over 50 km. The topmost layered sequence, as well as the lower two sequences, have intermediate thermal inertia, as derived from THEMIS, indicative of moderate induration. The TES spectra from the lower sequences include features indicative of basalt. Some areas of the topmost sequence, which includes the Opportunity landing site, have TES spectra dominated by hematite. Just below this topmost sequence lies a sequence with higher thermal inertia, indicative of more indurated or coarser grained material. The TES spectra of this sequence lack distinctive mineral features, and the rocks may be obscured by a thin coating of dust. The layers have been extensively eroded. The uppermost sequences are characterized by deeply scalloped boundaries. Filled craters have been partially exhumed. Finely layered deposits within craters have been strongly dissected. Landforms uniquely attributable to wind erosion are rare, but erosive styles and geomorphology characteristic of water and possibly ice are present. The layered sequences in Arabia Terra and Meridiani Planum likely reflect an epoch when the planet was much more habitable than it is today. Several areas in these layered sequences are under intensive study as candidate landing sites for the 2009 Mars Science Laboratory.
Identifying mRNA sequence elements for target recognition by human Argonaute proteins
Li, Jingjing; Kim, TaeHyung; Nutiu, Razvan; Ray, Debashish; Hughes, Timothy R.; Zhang, Zhaolei
2014-01-01
It is commonly known that mammalian microRNAs (miRNAs) guide the RNA-induced silencing complex (RISC) to target mRNAs through the seed-pairing rule. However, recent experiments that coimmunoprecipitate the Argonaute proteins (AGOs), the central catalytic component of RISC, have consistently revealed extensive AGO-associated mRNAs that lack seed complementarity with miRNAs. We herein test the hypothesis that AGO has its own binding preference within target mRNAs, independent of guide miRNAs. By systematically analyzing the data from in vivo cross-linking experiments with human AGOs, we have identified a structurally accessible and evolutionarily conserved region (∼10 nucleotides in length) that alone can accurately predict AGO–mRNA associations, independent of the presence of miRNA binding sites. Within this region, we further identified an enriched motif that was replicable on independent AGO-immunoprecipitation data sets. We used RNAcompete to enumerate the RNA-binding preference of human AGO2 to all possible 7-mer RNA sequences and validated the AGO motif in vitro. These findings reveal a novel function of AGOs as sequence-specific RNA-binding proteins, which may aid miRNAs in recognizing their targets with high specificity. PMID:24663241
Pulseq: A rapid and hardware-independent pulse sequence prototyping framework.
Layton, Kelvin J; Kroboth, Stefan; Jia, Feng; Littin, Sebastian; Yu, Huijun; Leupold, Jochen; Nielsen, Jon-Fredrik; Stöcker, Tony; Zaitsev, Maxim
2017-04-01
Implementing new magnetic resonance experiments, or sequences, often involves extensive programming on vendor-specific platforms, which can be time consuming and costly. This situation is exacerbated when research sequences need to be implemented on several platforms simultaneously, for example, at different field strengths. This work presents an alternative programming environment that is hardware-independent, open-source, and promotes rapid sequence prototyping. A novel file format is described to efficiently store the hardware events and timing information required for an MR pulse sequence. Platform-dependent interpreter modules convert the file to appropriate instructions to run the sequence on MR hardware. Sequences can be designed in high-level languages, such as MATLAB, or with a graphical interface. Spin physics simulation tools are incorporated into the framework, allowing for comparison between real and virtual experiments. Minimal effort is required to implement relatively advanced sequences using the tools provided. Sequences are executed on three different MR platforms, demonstrating the flexibility of the approach. A high-level, flexible and hardware-independent approach to sequence programming is ideal for the rapid development of new sequences. The framework is currently not suitable for large patient studies or routine scanning although this would be possible with deeper integration into existing workflows. Magn Reson Med 77:1544-1552, 2017. © 2016 International Society for Magnetic Resonance in Medicine. © 2016 International Society for Magnetic Resonance in Medicine.
Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao
2018-06-15
Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied by the accumulation of somatic mutations into the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here, we explored within-host genetic diversity of HPV by performing deep-sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52, and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC) and were deep sequenced. After constructing a reference viral genome sequence for each specimen, nucleotide positions showing changes with >0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with various numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the trinucleotide context encompassing substituted bases revealed that TpCpN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep-sequencing analyses, we show for the first time a comprehensive snapshot of the within-host genetic diversity of high-risk HPVs during cervical carcinogenesis. Quasispecies harboring minor nucleotide variations in viral whole-genome sequences were extensively observed across different grades of CIN and cervical cancer. Among the within-host variations, C-to-T transitions, a characteristic change mediated by cellular APOBEC cytosine deaminases, were predominantly detected throughout the whole viral genome, most strikingly in low-grade CIN lesions. The results strongly suggest that within-host variations of the HPV genome are primarily generated through the interaction with host cell DNA-editing enzymes and that such within-host variability is an evolutionary source of the genetic diversity of HPVs. Copyright © 2018 American Society for Microbiology.
Clonal evolution in breast cancer revealed by single nucleus genome sequencing.
Wang, Yong; Waters, Jill; Leung, Marco L; Unruh, Anna; Roh, Whijae; Shi, Xiuqing; Chen, Ken; Scheet, Paul; Vattathil, Selina; Liang, Han; Multani, Asha; Zhang, Hong; Zhao, Rui; Michor, Franziska; Meric-Bernstam, Funda; Navin, Nicholas E
2014-08-14
Sequencing studies of breast tumour cohorts have identified many prevalent mutations, but provide limited insight into the genomic diversity within tumours. Here we developed a whole-genome and exome single cell sequencing approach called nuc-seq that uses G2/M nuclei to achieve 91% mean coverage breadth. We applied this method to sequence single normal and tumour nuclei from an oestrogen-receptor-positive (ER(+)) breast cancer and a triple-negative ductal carcinoma. In parallel, we performed single nuclei copy number profiling. Our data show that aneuploid rearrangements occurred early in tumour evolution and remained highly stable as the tumour masses clonally expanded. In contrast, point mutations evolved gradually, generating extensive clonal diversity. Using targeted single-molecule sequencing, many of the diverse mutations were shown to occur at low frequencies (<10%) in the tumour mass. Using mathematical modelling we found that the triple-negative tumour cells had an increased mutation rate (13.3×), whereas the ER(+) tumour cells did not. These findings have important implications for the diagnosis, therapeutic treatment and evolution of chemoresistance in breast cancer.
Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic
2015-01-01
The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338
Liang, Lisa; Morrison, Ludivine Coudière; Tatari, Nazanin; Stromecki, Margaret; Fresnoza, Agnes; Porter, Christopher J; Del Bigio, Marc R; Hawkins, Cynthia; Chan, Jennifer A; Ryken, Timothy C; Taylor, Michael D; Ramaswamy, Vijay; Werbowetski-Ogilvie, Tamra E
2018-06-21
The extensive heterogeneity both between and within the medulloblastoma (MB) subgroups underscores a critical need for variant-specific biomarkers and therapeutic strategies. We previously identified a role for the CD271/p75 neurotrophin receptor (p75NTR) in regulating stem/progenitor cells in the SHH MB subgroup. Here we demonstrate the utility of CD271 as a novel diagnostic and prognostic marker for SHH MB using immunohistochemical analysis and transcriptome data across 763 primary tumors. RNA sequencing of CD271+ and CD271- cells revealed molecularly distinct, co-existing cellular subsets both in vitro and in vivo. MAPK/ERK signaling was upregulated in the CD271+ population, and inhibiting this pathway reduced endogenous CD271 levels, stem/progenitor cell proliferation, and cell survival as well as cell migration in vitro. Treatment with the MEK inhibitor selumetinib extended survival and reduced CD271 levels in vivo; whereas, treatment with vismodegib, a well-known smoothened (SMO) inhibitor currently in clinical trials for the treatment of recurrent SHH medulloblastoma, had no significant effect in our models. Our study demonstrates the clinical utility of CD271 as both a diagnostic and prognostic tool for SHH MB tumors and reveals a novel role for MEK inhibitors in targeting CD271+ SHH MB cells. Copyright ©2018, American Association for Cancer Research.
Sequence analysis of RNase MRP RNA reveals its origination from eukaryotic RNase P RNA
Zhu, Yanglong; Stribinskis, Vilius; Ramos, Kenneth S.; Li, Yong
2006-01-01
RNase MRP is a eukaryote-specific endoribonuclease that generates RNA primers for mitochondrial DNA replication and processes precursor rRNA. RNase P is a ubiquitous endoribonuclease that cleaves precursor tRNA transcripts to produce their mature 5′ termini. We found extensive sequence homology of catalytic domains and specificity domains between their RNA subunits in many organisms. In Candida glabrata, the internal loop of helix P3 is 100% conserved between MRP and P RNAs. The helix P8 of MRP RNA from microsporidia Encephalitozoon cuniculi is identical to that of P RNA. Sequence homology can be widely spread over the whole molecule of MRP RNA and P RNA, such as those from Dictyostelium discoideum. These conserved nucleotides between the MRP and P RNAs strongly support the hypothesis that the MRP RNA is derived from the P RNA molecule in early eukaryote evolution. PMID:16540690
Low, Van Lun; Adler, Peter H; Takaoka, Hiroyuki; Ya'cob, Zubaidah; Lim, Phaik Eem; Tan, Tiong Kai; Lim, Yvonne A L; Chen, Chee Dhang; Norma-Rashid, Yusoff; Sofian-Azirun, Mohd
2014-01-01
The population genetic structure of Simulium tani was inferred from mitochondria-encoded sequences of cytochrome c oxidase subunits I (COI) and II (COII) along an elevational gradient in Cameron Highlands, Malaysia. A statistical parsimony network of 71 individuals revealed 71 haplotypes in the COI gene and 43 haplotypes in the COII gene; the concatenated sequences of the COI and COII genes revealed 71 haplotypes. High levels of genetic diversity but low levels of genetic differentiation were observed among populations of S. tani at five elevations. The degree of genetic diversity, however, was not in accordance with an altitudinal gradient, and a Mantel test indicated that elevation did not have a limiting effect on gene flow. No ancestral haplotype of S. tani was found among the populations. Pupae with unique structural characters at the highest elevation showed a tendency to form their own haplotype cluster, as revealed by the COII gene. Tajima's D, Fu's Fs, and mismatch distribution tests revealed population expansion of S. tani in Cameron Highlands. A strong correlation was found between nucleotide diversity and the levels of dissolved oxygen in the streams where S. tani was collected.
Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction.
Mai, Uyen; Sayyari, Erfan; Mirarab, Siavash
2017-01-01
Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods.
Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
Sayyari, Erfan; Mirarab, Siavash
2017-01-01
Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods. PMID:28800608
Symmetry of proprioceptive sense in female soccer players.
Iwańska, Dagmara; Karczewska, Magdalena; Madej, Anna; Urbanik, Czesław
2015-01-01
The purpose of the study was to assess the symmetry of proprioceptive sense among female soccer players when trying to reproduce isometric knee extensions (right and left) and to analyze the impact of a given level of muscle force on proprioception. The study involved 12 soccer players aged 19.5 ± 2.65 years. Soccer players performed a control measurement of a maximum 3s (knee at the 90°) position in the joint. Subsequently, 70%, 50%, and 30% of the maximum voluntary contraction (MVC) were all calculated and then reproduced by each subject with feedback. Next, the players reproduced the predefined muscle contraction values in three sequences: A - 50%, 70%, 30%; B - 50%, 30%, 70%; C - 70%, 30%, 50% of MVC without visual control. In every sequence, the participants found obtaining the value of 30% of MVC the most difficult. The value they reproduced most accurately was 70% of MVC. Both trial II and trial III demonstrated that the symmetry index SI significantly differed from values considered acceptable (SIRa). In each successive sequence the largest asymmetry occurred while reproducing the lowest values of MVC (30%) (p < 0.05). High level of prioprioceptive sense is important to soccer players due to the extensive overload associated with dynamics stops or changes in direction while running. Special attention should be paid to develop skills in sensing force of varying levels. It was much harder to reproduce the predefined values if there was no feedback.
Some New Sets of Sequences of Fuzzy Numbers with Respect to the Partial Metric
Ozluk, Muharrem
2015-01-01
In this paper, we essentially deal with Köthe-Toeplitz duals of fuzzy level sets defined using a partial metric. Since the utilization of Zadeh's extension principle is quite difficult in practice, we prefer the idea of level sets in order to construct some classical notions. In this paper, we present the sets of bounded, convergent, and null series and the set of sequences of bounded variation of fuzzy level sets, based on the partial metric. We examine the relationships between these sets and their classical forms and give some properties including definitions, propositions, and various kinds of partial metric spaces of fuzzy level sets. Furthermore, we study some of their properties like completeness and duality. Finally, we obtain the Köthe-Toeplitz duals of fuzzy level sets with respect to the partial metric based on a partial ordering. PMID:25695102
Pan, Zhangyuan; Li, Shengdi; Liu, Qiuyue; Wang, Zhen; Zhou, Zhengkui; Di, Ran; Miao, Benpeng; Hu, Wenping; Wang, Xiangyu; Hu, Xiaoxiang; Xu, Ze; Wei, Dongkai; He, Xiaoyun; Yuan, Liyun; Guo, Xiaofei; Liang, Benmeng; Wang, Ruichao; Li, Xiaoyu; Cao, Xiaohan; Dong, Xinlong; Xia, Qing; Shi, Hongcai; Hao, Geng; Yang, Jean; Luosang, Cuicheng; Zhao, Yiqiang; Jin, Mei; Zhang, Yingjie; Lv, Shenjin; Li, Fukuan; Ding, Guohui; Chu, Mingxing; Li, Yixue
2018-04-01
Animal domestication has been extensively studied, but the process of feralization remains poorly understood. Here, we performed whole-genome sequencing of 99 sheep and identified a primary genetic divergence between 2 heterogeneous populations in the Tibetan Plateau, including 1 semi-feral lineage. Selective sweep and candidate gene analysis revealed local adaptations of these sheep associated with sensory perception, muscle strength, eating habit, mating process, and aggressive behavior. In particular, a horn-related gene, RXFP2, showed signs of rapid evolution specifically in the semi-feral breeds. A unique haplotype and repressed horn-related tissue expression of RXFP2 were correlated with higher horn length, as well as spiral and horizontally extended horn shape. Semi-feralization has an extensive impact on diverse phenotypic traits of sheep. By acquiring features like those of their wild ancestors, semi-feral sheep were able to regain fitness while in frequent contact with wild surroundings and rare human interventions. This study provides a new insight into the evolution of domestic animals when human interventions are no longer dominant.
Gayral, Philippe; Noa-Carrazana, Juan-Carlos; Lescot, Magali; Lheureux, Fabrice; Lockhart, Benham E. L.; Matsumoto, Takashi; Piffanelli, Pietro; Iskra-Caruana, Marie-Line
2008-01-01
Sequencing of plant nuclear genomes reveals the widespread presence of integrated viral sequences known as endogenous pararetroviruses (EPRVs). Banana is one of the three plant species known to harbor infectious EPRVs. Musa balbisiana carries integrated copies of Banana streak virus (BSV), which are infectious by releasing virions in interspecific hybrids. Here, we analyze the organization of the EPRV of BSV Goldfinger (BSGfV) present in the wild diploid M. balbisiana cv. Pisang Klutuk Wulung (PKW) revealed by the study of Musa bacterial artificial chromosome resources and interspecific genetic cross. cv. PKW contains two similar EPRVs of BSGfV. Genotyping of these integrants and studies of their segregation pattern show an allelic insertion. Despite the fact that integrated BSGfV has undergone extensive rearrangement, both EPRVs contain the full-length viral genome. The high degree of sequence conservation between the integrated and episomal form of the virus indicates a recent integration event; however, only one allele is infectious. Analysis of BSGfV EPRV segregation among an F1 population from an interspecific genetic cross revealed that these EPRV sequences correspond to two alleles originating from a single integration event. We describe here for the first time the full genomic and genetic organization of the two EPRVs of BSGfV present in cv. PKW in response to the challenge facing both scientists and breeders to identify and generate genetic resources free from BSV. We discuss the consequences of this unique host-pathogen interaction in terms of genetic and genomic plant defenses versus strategies of infectious BSGfV EPRVs. PMID:18417582
DOE Office of Scientific and Technical Information (OSTI.GOV)
John C. Meeks
2001-12-31
Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9more » Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.« less
Accurate phylogenetic classification of DNA fragments based onsequence composition
DOE Office of Scientific and Technical Information (OSTI.GOV)
McHardy, Alice C.; Garcia Martin, Hector; Tsirigos, Aristotelis
2006-05-01
Metagenome studies have retrieved vast amounts of sequenceout of a variety of environments, leading to novel discoveries and greatinsights into the uncultured microbial world. Except for very simplecommunities, diversity makes sequence assembly and analysis a verychallenging problem. To understand the structure a 5 nd function ofmicrobial communities, a taxonomic characterization of the obtainedsequence fragments is highly desirable, yet currently limited mostly tothose sequences that contain phylogenetic marker genes. We show that forclades at the rank of domain down to genus, sequence composition allowsthe very accurate phylogenetic 10 characterization of genomic sequence.We developed a composition-based classifier, PhyloPythia, for de novophylogenetic sequencemore » characterization and have trained it on adata setof 340 genomes. By extensive evaluation experiments we show that themethodis accurate across all taxonomic ranks considered, even forsequences that originate fromnovel organisms and are as short as 1kb.Application to two metagenome datasets 15 obtained from samples ofphosphorus-removing sludge showed that the method allows the accurateclassification at genus level of most sequence fragments from thedominant populations, while at the same time correctly characterizingeven larger parts of the samples at higher taxonomic levels.« less
Serrano, Soraya; Huarte, Nerea; Rujas, Edurne; Andreu, David; Nieva, José L; Jiménez, María Angeles
2017-10-17
Despite extensive characterization of the human immunodeficiency virus type 1 (HIV-1) hydrophobic fusion peptide (FP), the structure-function relationships underlying its extraordinary degree of conservation remain poorly understood. Specifically, the fact that the tandem repeat of the FLGFLG tripeptide is absolutely conserved suggests that high hydrophobicity may not suffice to unleash FP function. Here, we have compared the nuclear magnetic resonance (NMR) structures adopted in nonpolar media by two FP surrogates, wtFP-tag and scrFP-tag, which had equal hydrophobicity but contained wild-type and scrambled core sequences LFLGFLG and FGLLGFL, respectively. In addition, these peptides were tagged at their C-termini with an epitope sequence that folded independently, thereby allowing Western blot detection without interfering with FP structure. We observed similar α-helical FP conformations for both specimens dissolved in the low-polarity medium 25% (v/v) 1,1,1,3,3,3-hexafluoro-2-propanol (HFIP), but important differences in contact with micelles of the membrane mimetic dodecylphosphocholine (DPC). Thus, whereas wtFP-tag preserved a helix displaying a Gly-rich ridge, the scrambled sequence lost in great part the helical structure upon being solubilized in DPC. Western blot analyses further revealed the capacity of wtFP-tag to assemble trimers in membranes, whereas membrane oligomers were not observed in the case of the scrFP-tag sequence. We conclude that, beyond hydrophobicity, preserving sequence order is an important feature for defining the secondary structures and oligomeric states adopted by the HIV FP in membranes.
Genetic and phylogenetic divergence of feline immunodeficiency virus in the puma (Puma concolor).
Carpenter, M A; Brown, E W; Culver, M; Johnson, W E; Pecon-Slattery, J; Brousset, D; O'Brien, S J
1996-01-01
Feline immunodeficiency virus (FIV) is a lentivirus which causes an AIDS-like disease in domestic cats (Felis catus). A number of other felid species, including the puma (Puma concolor), carry a virus closely related to domestic cat FIV. Serological testing revealed the presence of antibodies to FIV in 22% of 434 samples from throughout the geographic range of the puma. FIV-Pco pol gene sequences isolated from pumas revealed extensive sequence diversity, greater than has been documented in the domestic cat. The puma sequences formed two highly divergent groups, analogous to the clades which have been defined for domestic cat and lion (Panthera leo) FIV. The puma clade A was made up of samples from Florida and California, whereas clade B consisted of samples from other parts of North America, Central America, and Brazil. The difference between these two groups was as great as that reported among three lion FIV clades. Within puma clades, sequence variation is large, comparable to between-clade differences seen for domestic cat clades, allowing recognition of 15 phylogenetic lineages (subclades) among puma FIV-Pco. Large sequence divergence among isolates, nearly complete species monophyly, and widespread geographic distribution suggest that FIV-Pco has evolved within the puma species for a long period. The sequence data provided evidence for vertical transmission of FIV-Pco from mothers to their kittens, for coinfection of individuals by two different viral strains, and for cross-species transmission of FIV from a domestic cat to a puma. These factors may all be important for understanding the epidemiology and natural history of FIV in the puma. PMID:8794304
2013-01-01
Background Candida albicans is a ubiquitous opportunistic fungal pathogen that afflicts immunocompromised human hosts. With rare and transient exceptions the yeast is diploid, yet despite its clinical relevance the respective sequences of its two homologous chromosomes have not been completely resolved. Results We construct a phased diploid genome assembly by deep sequencing a standard laboratory wild-type strain and a panel of strains homozygous for particular chromosomes. The assembly has 700-fold coverage on average, allowing extensive revision and expansion of the number of known SNPs and indels. This phased genome significantly enhances the sensitivity and specificity of allele-specific expression measurements by enabling pooling and cross-validation of signal across multiple polymorphic sites. Additionally, the diploid assembly reveals pervasive and unexpected patterns in allelic differences between homologous chromosomes. Firstly, we see striking clustering of indels, concentrated primarily in the repeat sequences in promoters. Secondly, both indels and their repeat-sequence substrate are enriched near replication origins. Finally, we reveal an intimate link between repeat sequences and indels, which argues that repeat length is under selective pressure for most eukaryotes. This connection is described by a concise one-parameter model that explains repeat-sequence abundance in C. albicans as a function of the indel rate, and provides a general framework to interpret repeat abundance in species ranging from bacteria to humans. Conclusions The phased genome assembly and insights into repeat plasticity will be valuable for better understanding allele-specific phenomena and genome evolution. PMID:24025428
Kis, Mihaly; Burbridge, Emma; Brock, Ian W; Heggie, Laura; Dix, Philip J; Kavanagh, Tony A
2004-03-01
Native horseradish (Armoracia rusticana) peroxidase, HRP (EC 1.11.1.7), isoenzyme C is synthesized with N-terminal and C-terminal peptide extensions, believed to be associated with protein targeting. This study aimed to explore the specific functions of these extensions, and to generate transgenic plants with expression patterns suitable for exploring the role of peroxidase in plant development and defence. Transgenic Nicotiana tabacum (tobacco) plants expressing different versions of a synthetic horseradish peroxidase, HRP, isoenzyme C gene were constructed. The gene was engineered to include additional sequences coding for either the natural N-terminal or the C-terminal extension or both. These constructs were placed under the control of a constitutive promoter (CaMV-35S) or the tobacco RUBISCO-SSU light inducible promoter (SSU) and introduced into tobacco using Agrobacterium-mediated transformation. To study the effects of the N- and C-terminal extensions, the localization of recombinant peroxidase was determined using biochemical and molecular techniques. Transgenic tobacco plants can exhibit a ten-fold increase in peroxidase activity compared with wild-type tobacco levels, and the majority of this activity is located in the symplast. The N-terminal extension is essential for the production of high levels of recombinant protein, while the C-terminal extension has little effect. Differences in levels of enzyme activity and recombinant protein are reflected in transcript levels. There is no evidence to support either preferential secretion or vacuolar targeting of recombinant peroxidase in this heterologous expression system. This leads us to question the postulated targeting roles of these peptide extensions. The N-terminal extension is essential for high level expression and appears to influence transcript stability or translational efficiency. Plants have been generated with greatly elevated cytosolic peroxidase activity, and smaller increases in apoplastic activity. These will be valuable for exploring the role of these enzymes in stress amelioration and plant development.
KIS, MIHALY; BURBRIDGE, EMMA; BROCK, IAN W.; HEGGIE, LAURA; DIX, PHILIP J.; KAVANAGH, TONY A.
2004-01-01
• Background and Aims Native horseradish (Armoracia rusticana) peroxidase, HRP (EC 1.11.1.7), isoenzyme C is synthesized with N‐terminal and C‐terminal peptide extensions, believed to be associated with protein targeting. This study aimed to explore the specific functions of these extensions, and to generate transgenic plants with expression patterns suitable for exploring the role of peroxidase in plant development and defence. • Methods Transgenic Nicotiana tabacum (tobacco) plants expressing different versions of a synthetic horseradish peroxidase, HRP, isoenzyme C gene were constructed. The gene was engineered to include additional sequences coding for either the natural N‐terminal or the C‐terminal extension or both. These constructs were placed under the control of a constitutive promoter (CaMV‐35S) or the tobacco RUBISCO‐SSU light inducible promoter (SSU) and introduced into tobacco using Agrobacterium‐mediated transformation. To study the effects of the N‐ and C‐terminal extensions, the localization of recombinant peroxidase was determined using biochemical and molecular techniques. • Key Results Transgenic tobacco plants can exhibit a ten‐fold increase in peroxidase activity compared with wild‐type tobacco levels, and the majority of this activity is located in the symplast. The N‐terminal extension is essential for the production of high levels of recombinant protein, while the C‐terminal extension has little effect. Differences in levels of enzyme activity and recombinant protein are reflected in transcript levels. • Conclusions There is no evidence to support either preferential secretion or vacuolar targeting of recombinant peroxidase in this heterologous expression system. This leads us to question the postulated targeting roles of these peptide extensions. The N‐terminal extension is essential for high level expression and appears to influence transcript stability or translational efficiency. Plants have been generated with greatly elevated cytosolic peroxidase activity, and smaller increases in apoplastic activity. These will be valuable for exploring the role of these enzymes in stress amelioration and plant development. PMID:14749254
Sequence analysis of a bitter taste receptor gene repertoires in different ruminant species
USDA-ARS?s Scientific Manuscript database
Bitter taste has been extensively studied in mammalian species and is associated with sensitivity to toxins and with food choices that avoid dangerous substances in the diet. At the molecular level, bitter compounds are sensed by bitter taste receptor proteins (T2R) present at the surface of taste r...
USDA-ARS?s Scientific Manuscript database
Premise of the study: Prunus L. phylogeny has extensively studied using cpDNA sequences. CpDNA has a slow rate of evolution which is beneficial to determine species relationships at a deeper level. However, a limitation of the chloroplast based phylogenies is its transfer by interspecific hybridizat...
2013-01-01
Background The Brassica B genome is known to carry several important traits, yet there has been limited analyses of its underlying genome structure, especially in comparison to the closely related A and C genomes. A bacterial artificial chromosome (BAC) library of Brassica nigra was developed and screened with 17 genes from a 222 kb region of A. thaliana that had been well characterised in both the Brassica A and C genomes. Results Fingerprinting of 483 apparently non-redundant clones defined physical contigs for the corresponding regions in B. nigra. The target region is duplicated in A. thaliana and six homologous contigs were found in B. nigra resulting from the whole genome triplication event shared by the Brassiceae tribe. BACs representative of each region were sequenced to elucidate the level of microscale rearrangements across the Brassica species divide. Conclusions Although the B genome species separated from the A/C lineage some 6 Mya, comparisons between the three paleopolyploid Brassica genomes revealed extensive conservation of gene content and sequence identity. The level of fractionation or gene loss varied across genomes and genomic regions; however, the greatest loss of genes was observed to be common to all three genomes. One large-scale chromosomal rearrangement differentiated the B genome suggesting such events could contribute to the lack of recombination observed between B genome species and those of the closely related A/C lineage. PMID:23586706
de Welzen, Lynne; Eldholm, Vegard; Maharaj, Kashmeel; Manson, Abigail L.; Earl, Ashlee M.
2017-01-01
ABSTRACT Genetics-based drug susceptibility testing has improved the diagnosis of drug-resistant tuberculosis but is limited by our lack of knowledge of all resistance mechanisms. Next-generation sequencing has assisted in identifying the principal genetic mechanisms of resistance for many drugs, but a significant proportion of phenotypic drug resistance is unexplained genetically. Few studies have formally compared the transcriptomes of susceptible and resistant Mycobacterium tuberculosis strains. We carried out comparative whole-genome transcriptomics of extensively drug-resistant (XDR) clinical isolates using RNA sequencing (RNA-seq) to find novel transcription-mediated mechanisms of resistance. We identified a promoter mutation (t to c) at position −11 (t−11c) relative to the start codon of ethA that reduces the expression of a monooxygenase (EthA) that activates ethionamide. (In this article, nucleotide changes are lowercase and amino acid substitutions are uppercase.) Using a flow cytometry-based reporter assay, we show that the reduced transcription of ethA is not due to transcriptional repression by ethR. Clinical strains harboring this mutation were resistant to ethionamide. Other ethA promoter mutations were identified in a global genomic survey of resistant M. tuberculosis strains. These results demonstrate a new mechanism of ethionamide resistance that can cause high-level resistance when it is combined with other ethionamide resistance-conferring mutations. Our study revealed many other genes which were highly up- or downregulated in XDR strains, including a toxin-antitoxin module (mazF5 mazE5) and tRNAs (leuX and thrU). This suggests that global transcriptional modifications could contribute to resistance or the maintenance of bacterial fitness have also occurred in XDR strains. PMID:28993337
Puerma, Eva; Orengo, Dorcas J; Salguero, David; Papaceit, Montserrat; Segarra, Carmen; Aguadé, Montserrat
2014-09-01
Inversions are an integral part of structural variation within species, and they play a leading role in genome reorganization across species. Work at both the cytological and genome sequence levels has revealed heterogeneity in the distribution of inversion breakpoints, with some regions being recurrently used. Breakpoint reuse at the molecular level has mostly been assessed for fixed inversions through genome sequence comparison, and therefore rather broadly. Here, we have identified and sequenced the breakpoints of two polymorphic inversions-E1 and E2 that share a breakpoint-in the extant Est and E1 + 2 chromosomal arrangements of Drosophila subobscura. The breakpoints are two medium-sized repeated motifs that mediated the inversions by two different mechanisms: E1 via staggered breaks and subsequent repair and E2 via repeat-mediated ectopic recombination. The fine delimitation of the shared breakpoint revealed its strict reuse at the molecular level regardless of which was the intermediate arrangement. The occurrence of other rearrangements in the most proximal and distal extended breakpoint regions reveals the broad reuse of these regions. This differential degree of fragility might be related to their sharing the presence outside the inverted region of snoRNA-encoding genes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.
Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong
2014-05-01
We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in mango cp genome, 91 found to be protein coding. Sequence analysis revealed cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in mango cp genome supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.
NASA Astrophysics Data System (ADS)
Farrell, J.
2010-12-01
Farrell, Jamie M. Smith, Robert Massin, Fred White, Bonnie Department of Geology and Geophysics University of Utah Salt Lake City, Utah 84112 Seismicity has persisted along a zone south of the Yellowstone volcanic field in the Gros Ventre Range, Wyoming, and on the eastern edge of the asesimic Quat. high slip-rate Teton fault. Concentrated seismicity has in this area occurs in sporadic sequences documented since 1923 with notable earthquakes in the decade preceding the deadly 1925 Gros Ventre slide that eventually lead to the failure of a dam blocked by the slide in 1927. Notable seismicity of the Gros Ventre region, using data from the Teton, Yellowstone and USArray seismic networks, has continued in the last decade with sequences in 2002, 2004, culminating in an energetic sequence beginning in May, 2009 through a sequence of more than 180 well located earthquakes mainly from August 5 to August 17 of 0.5
Biodiversity of Environmental Leptospira: Improving Identification and Revisiting the Diagnosis.
Thibeaux, Roman; Girault, Dominique; Bierque, Emilie; Soupé-Gilbert, Marie-Estelle; Rettinger, Anna; Douyère, Anthony; Meyer, Michael; Iraola, Gregorio; Picardeau, Mathieu; Goarant, Cyrille
2018-01-01
Leptospirosis is an important environmental disease and a major threat to human health causing at least 1 million clinical infections annually. There has recently been a growing interest in understanding the environmental lifestyle of Leptospira . However, Leptospira isolation from complex environmental samples is difficult and time-consuming and few tools are available to identify Leptospira isolates at the species level. Here, we propose a polyphasic isolation and identification scheme, which might prove useful to recover and identify environmental isolates and select those to be submitted to whole-genome sequencing. Using this approach, we recently described 12 novel Leptospira species for which we propose names. We also show that MALDI-ToF MS allows rapid and reliable identification and provide an extensive database of Leptospira MALDI-ToF mass spectra, which will be valuable to researchers in the leptospirosis community for species identification. Lastly, we also re-evaluate some of the current techniques for the molecular diagnosis of leptospirosis taking into account the extensive and recently revealed biodiversity of Leptospira in the environment. In conclusion, we describe our method for isolating Leptospira from the environment, confirm the usefulness of mass spectrometry for species identification and propose names for 12 novel species. This also offers the opportunity to refine current molecular diagnostic tools.
Qualitative and quantitative assessment of Illumina's forensic STR and SNP kits on MiSeq FGx™.
Sharma, Vishakha; Chow, Hoi Yan; Siegel, Donald; Wurmbach, Elisa
2017-01-01
Massively parallel sequencing (MPS) is a powerful tool transforming DNA analysis in multiple fields ranging from medicine, to environmental science, to evolutionary biology. In forensic applications, MPS offers the ability to significantly increase the discriminatory power of human identification as well as aid in mixture deconvolution. However, before the benefits of any new technology can be employed, a thorough evaluation of its quality, consistency, sensitivity, and specificity must be rigorously evaluated in order to gain a detailed understanding of the technique including sources of error, error rates, and other restrictions/limitations. This extensive study assessed the performance of Illumina's MiSeq FGx MPS system and ForenSeq™ kit in nine experimental runs including 314 reaction samples. In-depth data analysis evaluated the consequences of different assay conditions on test results. Variables included: sample numbers per run, targets per run, DNA input per sample, and replications. Results are presented as heat maps revealing patterns for each locus. Data analysis focused on read numbers (allele coverage), drop-outs, drop-ins, and sequence analysis. The study revealed that loci with high read numbers performed better and resulted in fewer drop-outs and well balanced heterozygous alleles. Several loci were prone to drop-outs which led to falsely typed homozygotes and therefore to genotype errors. Sequence analysis of allele drop-in typically revealed a single nucleotide change (deletion, insertion, or substitution). Analyses of sequences, no template controls, and spurious alleles suggest no contamination during library preparation, pooling, and sequencing, but indicate that sequencing or PCR errors may have occurred due to DNA polymerase infidelities. Finally, we found utilizing Illumina's FGx System at recommended conditions does not guarantee 100% outcomes for all samples tested, including the positive control, and required manual editing due to low read numbers and/or allele drop-in. These findings are important for progressing towards implementation of MPS in forensic DNA testing.
2014-01-01
Background Plasmodium vivax is a protozoan parasite with an extensive worldwide distribution, being highly prevalent in Asia as well as in Mesoamerica and South America. In southern Mexico, P. vivax transmission has been endemic and recent studies suggest that these parasites have unique biological and genetic features. The msp1 gene has shown high rate of nucleotide substitutions, deletions, insertions, and its mosaic structure reveals frequent events of recombination, maybe between highly divergent parasite isolates. Methods The nucleotide sequence variation in the polymorphic icb5-6 fragment of the msp1 gene of Mexican and worldwide isolates was analysed. To understand how genotype diversity arises, disperses and persists in Mexico, the genetic structure and genealogical relationships of local isolates were examined. To identify new sequence hybrids and their evolutionary relationships with other P. vivax isolates circulating worldwide two haplotype networks were constructed questioning that two portions of the icb5-6 have different evolutionary history. Results Twelve new msp1 icb5-6 haplotypes of P. vivax from Mexico were identified. These nucleotide sequences show mosaic structure comprising three partially conserved and two variable subfragments and resulted into five different sequence types. The variable subfragment sV1 has undergone recombination events and resulted in hybrid sequences and the haplotype network allocated the Mexican haplotypes to three lineages, corresponding to the Sal I and Belem types, and other more divergent group. In contrast, the network from icb5-6 fragment but not sV1 revealed that the Mexican haplotypes belong to two separate lineages, none of which are closely related to Sal I or Belem sequences. Conclusions These results suggest that the new hybrid haplotypes from southern Mexico were the result of at least three different recombination events. These rearrangements likely resulted from the recombination between haplotypes of highly divergent lineages that are frequently distributed in South America and Asia and diversified rapidly. PMID:24472213
A review of bioinformatic methods for forensic DNA analyses.
Liu, Yao-Yuan; Harbison, SallyAnn
2018-03-01
Short tandem repeats, single nucleotide polymorphisms, and whole mitochondrial analyses are three classes of markers which will play an important role in the future of forensic DNA typing. The arrival of massively parallel sequencing platforms in forensic science reveals new information such as insights into the complexity and variability of the markers that were previously unseen, along with amounts of data too immense for analyses by manual means. Along with the sequencing chemistries employed, bioinformatic methods are required to process and interpret this new and extensive data. As more is learnt about the use of these new technologies for forensic applications, development and standardization of efficient, favourable tools for each stage of data processing is being carried out, and faster, more accurate methods that improve on the original approaches have been developed. As forensic laboratories search for the optimal pipeline of tools, sequencer manufacturers have incorporated pipelines into sequencer software to make analyses convenient. This review explores the current state of bioinformatic methods and tools used for the analyses of forensic markers sequenced on the massively parallel sequencing (MPS) platforms currently most widely used. Copyright © 2017 Elsevier B.V. All rights reserved.
Nullomers and High Order Nullomers in Genomic Sequences
Vergni, Davide; Santoni, Daniele
2016-01-01
A nullomer is an oligomer that does not occur as a subsequence in a given DNA sequence, i.e. it is an absent word of that sequence. The importance of nullomers in several applications, from drug discovery to forensic practice, is now debated in the literature. Here, we investigated the nature of nullomers, whether their absence in genomes has just a statistical explanation or it is a peculiar feature of genomic sequences. We introduced an extension of the notion of nullomer, namely high order nullomers, which are nullomers whose mutated sequences are still nullomers. We studied different aspects of them: comparison with nullomers of random sequences, CpG distribution and mean helical rise. In agreement with previous results we found that the number of nullomers in the human genome is much larger than expected by chance. Nevertheless antithetical results were found when considering a random DNA sequence preserving dinucleotide frequencies. The analysis of CpG frequencies in nullomers and high order nullomers revealed, as expected, a high CpG content but it also highlighted a strong dependence of CpG frequencies on the dinucleotide position, suggesting that nullomers have their own peculiar structure and are not simply sequences whose CpG frequency is biased. Furthermore, phylogenetic trees were built on eleven species based on both the similarities between the dinucleotide frequencies and the number of nullomers two species share, showing that nullomers are fairly conserved among close species. Finally the study of mean helical rise of nullomers sequences revealed significantly high mean rise values, reinforcing the hypothesis that those sequences have some peculiar structural features. The obtained results show that nullomers are the consequence of the peculiar structure of DNA (also including biased CpG frequency and CpGs islands), so that the hypermutability model, also taking into account CpG islands, seems to be not sufficient to explain nullomer phenomenon. Finally, high order nullomers could emphasize those features that already make simple nullomers useful in several applications. PMID:27906971
The proximal-to-distal sequence in upper-limb motions on multiple levels and time scales.
Serrien, Ben; Baeyens, Jean-Pierre
2017-10-01
The proximal-to-distal sequence is a phenomenon that can be observed in a large variety of motions of the upper limbs in both humans and other mammals. The mechanisms behind this sequence are not completely understood and motor control theories able to explain this phenomenon are currently incomplete. The aim of this narrative review is to take a theoretical constraints-led approach to the proximal-to-distal sequence and provide a broad multidisciplinary overview of relevant literature. This sequence exists at multiple levels (brain, spine, muscles, kinetics and kinematics) and on multiple time scales (motion, motor learning and development, growth and possibly even evolution). We hypothesize that the proximodistal spatiotemporal direction on each time scale and level provides part of the organismic constraints that guide the dynamics at the other levels and time scales. The constraint-led approach in this review may serve as a first onset towards integration of evidence and a framework for further experimentation to reveal the dynamics of the proximal-to-distal sequence. Copyright © 2017 Elsevier B.V. All rights reserved.
Starrett, James; Hedin, Marshal; Ayoub, Nadia; Hayashi, Cheryl Y
2013-07-25
Hemocyanins are multimeric copper-containing hemolymph proteins involved in oxygen binding and transport in all major arthropod lineages. Most arachnids have seven primary subunits (encoded by paralogous genes a-g), which combine to form a 24-mer (4×6) quaternary structure. Within some spider lineages, however, hemocyanin evolution has been a dynamic process with extensive paralog duplication and loss. We have obtained hemocyanin gene sequences from numerous representatives of the spider infraorders Mygalomorphae and Araneomorphae in order to infer the evolution of the hemocyanin gene family and estimate spider relationships using these conserved loci. Our hemocyanin gene tree is largely consistent with the previous hypotheses of paralog relationships based on immunological studies, but reveals some discrepancies in which paralog types have been lost or duplicated in specific spider lineages. Analyses of concatenated hemocyanin sequences resolved deep nodes in the spider phylogeny and recovered a number of clades that are supported by other molecular studies, particularly for mygalomorph taxa. The concatenated data set is also used to estimate dates of higher-level spider divergences and suggests that the diversification of extant mygalomorphs preceded that of extant araneomorphs. Spiders are diverse in behavior and respiratory morphology, and our results are beneficial for comparative analyses of spider respiration. Lastly, the conserved hemocyanin sequences allow for the inference of spider relationships and ancient divergence dates. Copyright © 2013 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
French, J.A.; Watney, W.L.
A significant number of petroleum reservoirs within the Kansas City Group in central and western Kansas are dominantly oolitic grainstones that cap 10- to 30-m-thick, shallowing-upward, carbonate-rich depositional sequences. Coeval units that occur at and near the surface in southeastern Kansas contain similar porous lithofacies that have been examined in detail via cores, outcrops, and an extensive log database to better understand the equivalent reservoirs. These studies suggest that individual oolitic, reservoir-quality units in the Bethany Falls Limestone (equivalent to the K zone in the subsurface) developed at several relative sea level stands that occurred during development of a highstandmore » systems tract within this depositional sequence. As many as three grain-rich parasequences may occur at a given location. The occurrence of multiple parasequences indicates a relatively complex history of K-zone deposition, which likely resulted in significant effects on reservoir architecture. Two-dimensional forward modeling of this sequence with our interactive, PC-based software has revealed that limited combinations of parameters such as shelf configuration, eustasy, sedimentation rates, and subsidence rates generate strata successions similar to those observed. Sensitivity analysis coupled with regional characterization of processes suggest ranges of values that these parameters could have had during deposition of these units. The ultimate goal of this modeling is to improve our ability to predict facies development in areas of potential and known hydrocarbon accumulations.« less
Chau, Man Ling; Aung, Kyaw Thu; Hapuarachchi, Hapuarachchige Chanditha; Lee, Pei Sze Valarie; Lim, Pei Ying; Kang, Joanne Su Lin; Ng, Youming; Yap, Hooi Ming; Yuk, Hyun-Gyun; Gutiérrez, Ramona Alikiiteaga; Ng, Lee Ching
2017-02-28
As the preparation of salads involves extensive handling and the use of uncooked ingredients, they are particularly vulnerable to microbial contamination. This study aimed to determine the microbial safety and quality of pre-packed salads and salad bar ingredients sold in Singapore, so as to identify public health risks that could arise from consuming salads and to determine areas for improvement in the management of food safety. The most frequently encountered organism in pre-packed salad samples was B. cereus, particularly in pasta salads (33.3%, 10/30). The most commonly detected organism in salad bar ingredients was L. monocytogenes, in particular seafood ingredients (44.1%, 15/34), largely due to contaminated smoked salmon. Further investigation showed that 21.6% (37/171) of the pre-packed smoked salmon sold in supermarkets contained L. monocytogenes. Significantly higher prevalence of L. monocytogenes and higher Standard Plate Count were detected in smoked salmon at salad bars compared to pre-packed smoked salmon in supermarkets, which suggested multiplication of the organism as the products move down the supply chain. Further molecular analysis revealed that L. monocytogenes Sequence Type (ST) 2 and ST87 were present in a particular brand of pre-packed salmon products over a 4-year period, implying a potential persistent contamination problem at the manufacturing level. Our findings highlighted a need to improve manufacturing and retail hygiene processes as well as to educate vulnerable populations to avoid consuming food prone to L. monocytogenes contamination.
Molecular complexity of successive bacterial epidemics deconvoluted by comparative pathogenomics.
Beres, Stephen B; Carroll, Ronan K; Shea, Patrick R; Sitkiewicz, Izabela; Martinez-Gutierrez, Juan Carlos; Low, Donald E; McGeer, Allison; Willey, Barbara M; Green, Karen; Tyrrell, Gregory J; Goldman, Thomas D; Feldgarden, Michael; Birren, Bruce W; Fofanov, Yuriy; Boos, John; Wheaton, William D; Honisch, Christiane; Musser, James M
2010-03-02
Understanding the fine-structure molecular architecture of bacterial epidemics has been a long-sought goal of infectious disease research. We used short-read-length DNA sequencing coupled with mass spectroscopy analysis of SNPs to study the molecular pathogenomics of three successive epidemics of invasive infections involving 344 serotype M3 group A Streptococcus in Ontario, Canada. Sequencing the genome of 95 strains from the three epidemics, coupled with analysis of 280 biallelic SNPs in all 344 strains, revealed an unexpectedly complex population structure composed of a dynamic mixture of distinct clonally related complexes. We discovered that each epidemic is dominated by micro- and macrobursts of multiple emergent clones, some with distinct strain genotype-patient phenotype relationships. On average, strains were differentiated from one another by only 49 SNPs and 11 insertion-deletion events (indels) in the core genome. Ten percent of SNPs are strain specific; that is, each strain has a unique genome sequence. We identified nonrandom temporal-spatial patterns of strain distribution within and between the epidemic peaks. The extensive full-genome data permitted us to identify genes with significantly increased rates of nonsynonymous (amino acid-altering) nucleotide polymorphisms, thereby providing clues about selective forces operative in the host. Comparative expression microarray analysis revealed that closely related strains differentiated by seemingly modest genetic changes can have significantly divergent transcriptomes. We conclude that enhanced understanding of bacterial epidemics requires a deep-sequencing, geographically centric, comparative pathogenomics strategy.
Bouma, Arnold H.; Feeley, Mary H.; Kindinger, Jack G.; Stelting, Charles E.; Hilde, Thomas W.C.
1981-01-01
A high-resolution seismic reflection survey was conducted in a small area of the upper Louisiana Continental Slope known as Green Canyon Area. This area includes tracts 427, 428, 471, 472, 515, and 516, that will be offered for sale in March 1982 as part of Lease Sale 67.The sea floor of this region is, slightly hummocky and is underlain by salt diapirs that are mantled by early Tertiary shale. Most of the shale is overlain by younger Tertiary and Quaternary deposits, although locally some of the shale protrudes the sea floor. Because of proximity to older Mississippi River sources, the sediments are thick. The sediment cover shows an abundance of geologic phenomena such as horsts, grabens, growth faults, normal faults, and consolidation faults, zones with distinct and indistinct parallel reflections, semi-transparent zones, distorted zones, and angular unconformities.The major feature of this region is a N-S linear zone of uplifted and intruded sedimentary deposits formed due to diapiric intrusion.Small scale graben development over the crest of the structure can be attributed to extension and collapse. Large scale undulations of reflections well off the flanks of the uplifted structure suggest sediment creep and slumping. Dipping of parallel reflections show block faulting and tilting.Air gun (5 and 40 cubic inch) records reveal at least five major sequences that show masked onlap and slumping in their lower parts grading into more distinct parallel reflections in their upper parts. Such sequences can be related to local uplift and sea level changes. Minisparker records of this area show similar sequences but on a smaller scale. The distinct parallel reflections often onlap the diapir flanks. The highly reflective parts of these sequences may represent turbidite-type deposition, possibly at times of lower sea level. The acoustically more transparent parts of each sequence may represent deposits containing primarily hemipelagic and pelagic sediment.A complex ridge system is present along the west side of the area and distinct parallel reflections onlap onto this structure primarily from the east. Much of this deposition may be ascribed to sedimentation within a submarine canyon whose position is controlled by this ridge.
Single-cell sequencing reveals karyotype heterogeneity in murine and human malignancies.
Bakker, Bjorn; Taudt, Aaron; Belderbos, Mirjam E; Porubsky, David; Spierings, Diana C J; de Jong, Tristan V; Halsema, Nancy; Kazemier, Hinke G; Hoekstra-Wakker, Karina; Bradley, Allan; de Bont, Eveline S J M; van den Berg, Anke; Guryev, Victor; Lansdorp, Peter M; Colomé-Tatché, Maria; Foijer, Floris
2016-05-31
Chromosome instability leads to aneuploidy, a state in which cells have abnormal numbers of chromosomes, and is found in two out of three cancers. In a chromosomal instable p53 deficient mouse model with accelerated lymphomagenesis, we previously observed whole chromosome copy number changes affecting all lymphoma cells. This suggests that chromosome instability is somehow suppressed in the aneuploid lymphomas or that selection for frequently lost/gained chromosomes out-competes the CIN-imposed mis-segregation. To distinguish between these explanations and to examine karyotype dynamics in chromosome instable lymphoma, we use a newly developed single-cell whole genome sequencing (scWGS) platform that provides a complete and unbiased overview of copy number variations (CNV) in individual cells. To analyse these scWGS data, we develop AneuFinder, which allows annotation of copy number changes in a fully automated fashion and quantification of CNV heterogeneity between cells. Single-cell sequencing and AneuFinder analysis reveals high levels of copy number heterogeneity in chromosome instability-driven murine T-cell lymphoma samples, indicating ongoing chromosome instability. Application of this technology to human B cell leukaemias reveals different levels of karyotype heterogeneity in these cancers. Our data show that even though aneuploid tumours select for particular and recurring chromosome combinations, single-cell analysis using AneuFinder reveals copy number heterogeneity. This suggests ongoing chromosome instability that other platforms fail to detect. As chromosome instability might drive tumour evolution, karyotype analysis using single-cell sequencing technology could become an essential tool for cancer treatment stratification.
Sagi-Dain, Lena; Shemer, Lilach; Zelnik, Nathanel; Zoabi, Yusri; Orit, Sadeh; Adir, Vardit; Schif, Aharon; Peleg, Amir
2018-06-01
Charcot-Marie-Tooth (CMT) is a heterogeneous group of progressive disorders, characterized by chronic motor and sensory polyneuropathy. This hereditary disorder is related to numerous genes and varying inheritance patterns. Thus, many patients do not reach a final genetic diagnosis. We describe a 13-year-old girl presenting with progressive bilateral leg weakness and gait instability. Extensive laboratory studies and spinal magnetic resonance imaging scan were normal. Nerve conduction studies revealed severe lower limb peripheral neuropathy with prominent demyelinative component. Following presumptive diagnosis of chronic inflammatory demyelinating polyneuropathy, the patient received treatment with steroids and intravenous immunoglobulins courses for several months, with no apparent improvement. Whole-exome sequencing revealed a novel heterozygous c.2209C>T (p.Arg737Trp) mutation in the MARS gene (OMIM 156560). This gene has recently been related to CMT type 2U. In-silico prediction programs classified this mutation as a probable cause for protein malfunction. Allele frequency data reported this variant in 0.003% of representative Caucasian population. Family segregation analysis study revealed that the patient had inherited the variant from her 60-years old mother, reported as healthy. Neurologic examination of the mother demonstrated decreased tendon reflexes, while nerve conduction studies were consistent with demyelinative and axonal sensory-motor polyneuropathy. Our report highlights the importance of next-generation sequencing approach to facilitate the proper molecular diagnosis of highly heterogeneous neurologic disorders. Amongst other numerous benefits, this approach might prevent unnecessary diagnostic testing and potentially harmful medical treatment. © 2018 Peripheral Nerve Society.
Human structural variation: mechanisms of chromosome rearrangements
Weckselblatt, Brooke; Rudd, M. Katharine
2015-01-01
Chromosome structural variation (SV) is a normal part of variation in the human genome, but some classes of SV can cause neurodevelopmental disorders. Analysis of the DNA sequence at SV breakpoints can reveal mutational mechanisms and risk factors for chromosome rearrangement. Large-scale SV breakpoint studies have become possible recently owing to advances in next-generation sequencing (NGS) including whole-genome sequencing (WGS). These findings have shed light on complex forms of SV such as triplications, inverted duplications, insertional translocations, and chromothripsis. Sequence-level breakpoint data resolve SV structure and determine how genes are disrupted, fused, and/or misregulated by breakpoints. Recent improvements in breakpoint sequencing have also revealed non-allelic homologous recombination (NAHR) between paralogous long interspersed nuclear element (LINE) or human endogenous retrovirus (HERV) repeats as a cause of deletions, duplications, and translocations. This review covers the genomic organization of simple and complex constitutional SVs, as well as the molecular mechanisms of their formation. PMID:26209074
Rapid diversification of five Oryza AA genomes associated with rice adaptation.
Zhang, Qun-Jie; Zhu, Ting; Xia, En-Hua; Shi, Chao; Liu, Yun-Long; Zhang, Yun; Liu, Yuan; Jiang, Wen-Kai; Zhao, You-Jie; Mao, Shu-Yan; Zhang, Li-Ping; Huang, Hui; Jiao, Jun-Ying; Xu, Ping-Zhen; Yao, Qiu-Yang; Zeng, Fan-Chun; Yang, Li-Li; Gao, Ju; Tao, Da-Yun; Wang, Yue-Ju; Bennetzen, Jeffrey L; Gao, Li-Zhi
2014-11-18
Comparative genomic analyses among closely related species can greatly enhance our understanding of plant gene and genome evolution. We report de novo-assembled AA-genome sequences for Oryza nivara, Oryza glaberrima, Oryza barthii, Oryza glumaepatula, and Oryza meridionalis. Our analyses reveal massive levels of genomic structural variation, including segmental duplication and rapid gene family turnover, with particularly high instability in defense-related genes. We show, on a genomic scale, how lineage-specific expansion or contraction of gene families has led to their morphological and reproductive diversification, thus enlightening the evolutionary process of speciation and adaptation. Despite strong purifying selective pressures on most Oryza genes, we documented a large number of positively selected genes, especially those genes involved in flower development, reproduction, and resistance-related processes. These diversifying genes are expected to have played key roles in adaptations to their ecological niches in Asia, South America, Africa and Australia. Extensive variation in noncoding RNA gene numbers, function enrichment, and rates of sequence divergence might also help account for the different genetic adaptations of these rice species. Collectively, these resources provide new opportunities for evolutionary genomics, numerous insights into recent speciation, a valuable database of functional variation for crop improvement, and tools for efficient conservation of wild rice germplasm.
Oh, Dong-Ha; Hong, Hyewon; Lee, Sang Yeol; Yun, Dae-Jin; Bohnert, Hans J.; Dassanayake, Maheshi
2014-01-01
Schrenkiella parvula (formerly Thellungiella parvula), a close relative of Arabidopsis (Arabidopsis thaliana) and Brassica crop species, thrives on the shores of Lake Tuz, Turkey, where soils accumulate high concentrations of multiple-ion salts. Despite the stark differences in adaptations to extreme salt stresses, the genomes of S. parvula and Arabidopsis show extensive synteny. S. parvula completes its life cycle in the presence of Na+, K+, Mg2+, Li+, and borate at soil concentrations lethal to Arabidopsis. Genome structural variations, including tandem duplications and translocations of genes, interrupt the colinearity observed throughout the S. parvula and Arabidopsis genomes. Structural variations distinguish homologous gene pairs characterized by divergent promoter sequences and basal-level expression strengths. Comparative RNA sequencing reveals the enrichment of ion-transport functions among genes with higher expression in S. parvula, while pathogen defense-related genes show higher expression in Arabidopsis. Key stress-related ion transporter genes in S. parvula showed increased copy number, higher transcript dosage, and evidence for subfunctionalization. This extremophyte offers a framework to identify the requisite adjustments of genomic architecture and expression control for a set of genes found in most plants in a way to support distinct niche adaptation and lifestyles. PMID:24563282
Rapid diversification of five Oryza AA genomes associated with rice adaptation
Zhang, Qun-Jie; Zhu, Ting; Xia, En-Hua; Shi, Chao; Liu, Yun-Long; Zhang, Yun; Liu, Yuan; Jiang, Wen-Kai; Zhao, You-Jie; Mao, Shu-Yan; Zhang, Li-Ping; Huang, Hui; Jiao, Jun-Ying; Xu, Ping-Zhen; Yao, Qiu-Yang; Zeng, Fan-Chun; Yang, Li-Li; Gao, Ju; Tao, Da-Yun; Wang, Yue-Ju; Bennetzen, Jeffrey L.; Gao, Li-Zhi
2014-01-01
Comparative genomic analyses among closely related species can greatly enhance our understanding of plant gene and genome evolution. We report de novo-assembled AA-genome sequences for Oryza nivara, Oryza glaberrima, Oryza barthii, Oryza glumaepatula, and Oryza meridionalis. Our analyses reveal massive levels of genomic structural variation, including segmental duplication and rapid gene family turnover, with particularly high instability in defense-related genes. We show, on a genomic scale, how lineage-specific expansion or contraction of gene families has led to their morphological and reproductive diversification, thus enlightening the evolutionary process of speciation and adaptation. Despite strong purifying selective pressures on most Oryza genes, we documented a large number of positively selected genes, especially those genes involved in flower development, reproduction, and resistance-related processes. These diversifying genes are expected to have played key roles in adaptations to their ecological niches in Asia, South America, Africa and Australia. Extensive variation in noncoding RNA gene numbers, function enrichment, and rates of sequence divergence might also help account for the different genetic adaptations of these rice species. Collectively, these resources provide new opportunities for evolutionary genomics, numerous insights into recent speciation, a valuable database of functional variation for crop improvement, and tools for efficient conservation of wild rice germplasm. PMID:25368197
Honey Bee Infecting Lake Sinai Viruses
Daughenbaugh, Katie F.; Martin, Madison; Brutscher, Laura M.; Cavigli, Ian; Garcia, Emma; Lavin, Matt; Flenniken, Michelle L.
2015-01-01
Honey bees are critical pollinators of important agricultural crops. Recently, high annual losses of honey bee colonies have prompted further investigation of honey bee infecting viruses. To better characterize the recently discovered and very prevalent Lake Sinai virus (LSV) group, we sequenced currently circulating LSVs, performed phylogenetic analysis, and obtained images of LSV2. Sequence analysis resulted in extension of the LSV1 and LSV2 genomes, the first detection of LSV4 in the US, and the discovery of LSV6 and LSV7. We detected LSV1 and LSV2 in the Varroa destructor mite, and determined that a large proportion of LSV2 is found in the honey bee gut, suggesting that vector-mediated, food-associated, and/or fecal-oral routes may be important for LSV dissemination. Pathogen-specific quantitative PCR data, obtained from samples collected during a small-scale monitoring project, revealed that LSV2, LSV1, Black queen cell virus (BQCV), and Nosema ceranae were more abundant in weak colonies than strong colonies within this sample cohort. Together, these results enhance our current understanding of LSVs and illustrate the importance of future studies aimed at investigating the role of LSVs and other pathogens on honey bee health at both the individual and colony levels. PMID:26110586
Ravi, Anuradha; Estensmo, Eva Lena F; Abée-Lund, Trine M L'; Foley, Steven L; Allgaier, Bernhard; Martin, Camilia R; Claud, Erika C; Rudi, Knut
2017-11-01
BackgroundThe preterm infant gut microbiota is vulnerable to different biotic and abiotic factors. Although the development of this microbiota has been extensively studied, the mobilome-i.e. the mobile genetic elements (MGEs) in the gut microbiota-has not been considered. Therefore, the aim of this study was to investigate the association of the mobilome with birth weight and hospital location in the preterm infant gut microbiota.MethodsThe data set consists of fecal samples from 62 preterm infants with and without necrotizing enterocolitis (NEC) from three different hospitals. We analyzed the gut microbiome by using 16S rRNA amplicon sequencing, shot-gun metagenome sequencing, and quantitative PCR. Predictive models and other data analyses were performed using MATLAB and QIIME.ResultSThe microbiota composition was significantly different between NEC-positive and NEC-negative infants and significantly different between hospitals. An operational taxanomic unit (OTU) showed strong positive and negative correlation with NEC and birth weight, respectively, whereas none showed significance for mode of delivery. Metagenome analyses revealed high levels of conjugative plasmids with MGEs and virulence genes. Results from quantitative PCR showed that the plasmid signature genes were significantly different between hospitals and in NEC-positive infants.ConclusionOur results point toward an association of the mobilome with hospital location in preterm infants.
Parallel gene analysis with allele-specific padlock probes and tag microarrays
Banér, Johan; Isaksson, Anders; Waldenström, Erik; Jarvius, Jonas; Landegren, Ulf; Nilsson, Mats
2003-01-01
Parallel, highly specific analysis methods are required to take advantage of the extensive information about DNA sequence variation and of expressed sequences. We present a scalable laboratory technique suitable to analyze numerous target sequences in multiplexed assays. Sets of padlock probes were applied to analyze single nucleotide variation directly in total genomic DNA or cDNA for parallel genotyping or gene expression analysis. All reacted probes were then co-amplified and identified by hybridization to a standard tag oligonucleotide array. The technique was illustrated by analyzing normal and pathogenic variation within the Wilson disease-related ATP7B gene, both at the level of DNA and RNA, using allele-specific padlock probes. PMID:12930977
Schermer, Elizabeth R.; Gillaspy, J.R.; Lamb, R.
2007-01-01
Structural analysis of the Lopez Structural Complex, a major Late Cretaceous terrane-bounding fault zone in the San Juan thrust system, reveals a sequence of events that provides insight into accretionary wedge mechanics and regional tectonics. After formation of regional ductile flattening and shear-related fabrics, the area was crosscut by brittle structures including: (1) southwest-vergent thrusts, (2) extension veins and normal faults related to northwest-southeast extension, and (3) conjugate strike-slip structures that record northwest-southeast extension and northeast-southwest shortening. Aragonite-bearing veins are associated with thrust and normal faults, but only rarely with strike-slip faults. High-pressure, low-temperature (HP-LT) minerals constrain the conditions for brittle deformation to ???20 km and <250 ??C. The presence of similar structures elsewhere indicates that the brittle structural sequence is typical of the San Juan nappes. Sustained HP-LT conditions are possible only if structures formed in an accretionary prism during active subduction, which suggests that these brittle structures record internal wedge deformation at depth and early during uplift of the San Juan nappes. The structures are consistent with orogen-normal shortening and vertical thickening followed by vertical thinning and along-strike extension. The kinematic evolution may be related initially to changes in wedge strength, followed by response to overthickening of the wedge in an unbuttressed, obliquely convergent setting. The change in vein mineralogy indicates that exhumation occurred prior to the strike-slip event. The pressure and temperature conditions and spatial and temporal extent of small faults associated with fluid flow suggest a link between these structures and the silent earthquake process. ?? 2007 Geological Society of America.
Kaplan, Oktay I; Berber, Burak; Hekim, Nezih; Doluca, Osman
2016-11-02
Many studies show that short non-coding sequences are widely conserved among regulatory elements. More and more conserved sequences are being discovered since the development of next generation sequencing technology. A common approach to identify conserved sequences with regulatory roles relies on topological changes such as hairpin formation at the DNA or RNA level. G-quadruplexes, non-canonical nucleic acid topologies with little established biological roles, are increasingly considered for conserved regulatory element discovery. Since the tertiary structure of G-quadruplexes is strongly dependent on the loop sequence which is disregarded by the generally accepted algorithm, we hypothesized that G-quadruplexes with similar topology and, indirectly, similar interaction patterns, can be determined using phylogenetic clustering based on differences in the loop sequences. Phylogenetic analysis of 52 G-quadruplex forming sequences in the Escherichia coli genome revealed two conserved G-quadruplex motifs with a potential regulatory role. Further analysis revealed that both motifs tend to form hairpins and G quadruplexes, as supported by circular dichroism studies. The phylogenetic analysis as described in this work can greatly improve the discovery of functional G-quadruplex structures and may explain unknown regulatory patterns. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Early Miocene sequence development across the New Jersey margin
Monteverde, D.H.; Mountain, Gregory S.; Miller, K.G.
2008-01-01
Sequence stratigraphy provides an understanding of the interplay between eustasy, sediment supply and accommodation in the sedimentary construction of passive margins. We used this approach to follow the early to middle Miocene growth of the New Jersey margin and analyse the connection between relative changes of sea level and variable sediment supply. Eleven candidate sequence boundaries were traced in high-resolution multi-channel seismic profiles across the inner margin and matched to geophysical log signatures and lithologic changes in ODP Leg 150X onshore coreholes. Chronologies at these drill sites were then used to assign ages to the intervening seismic sequences. We conclude that the regional and global correlation of early Miocene sequences suggests a dominant role of global sea-level change but margin progradation was controlled by localized sediment contribution and that local conditions played a large role in sequence formation and preservation. Lowstand deposits were regionally restricted and their locations point to both single and multiple sediment sources. The distribution of highstand deposits, by contrast, documents redistribution by along shelf currents. We find no evidence that sea level fell below the elevation of the clinoform rollover, and the existence of extensive lowstand deposits seaward of this inflection point indicates efficient cross-shelf sediment transport mechanisms despite the apparent lack of well-developed fluvial drainage. ?? 2008 The Authors. Journal compilation ?? 2008 Blackwell Publishing.
PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities.
Troshin, Peter V; Postis, Vincent Lg; Ashworth, Denise; Baldwin, Stephen A; McPherson, Michael J; Barton, Geoffrey J
2011-03-07
Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS) that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/.
Production of Supra-regular Spatial Sequences by Macaque Monkeys.
Jiang, Xinjian; Long, Tenghai; Cao, Weicong; Li, Junru; Dehaene, Stanislas; Wang, Liping
2018-06-18
Understanding and producing embedded sequences in language, music, or mathematics, is a central characteristic of our species. These domains are hypothesized to involve a human-specific competence for supra-regular grammars, which can generate embedded sequences that go beyond the regular sequences engendered by finite-state automata. However, is this capacity truly unique to humans? Using a production task, we show that macaque monkeys can be trained to produce time-symmetrical embedded spatial sequences whose formal description requires supra-regular grammars or, equivalently, a push-down stack automaton. Monkeys spontaneously generalized the learned grammar to novel sequences, including longer ones, and could generate hierarchical sequences formed by an embedding of two levels of abstract rules. Compared to monkeys, however, preschool children learned the grammars much faster using a chunking strategy. While supra-regular grammars are accessible to nonhuman primates through extensive training, human uniqueness may lie in the speed and learning strategy with which they are acquired. Copyright © 2018 Elsevier Ltd. All rights reserved.
Low diversity in the mitogenome of sperm whales revealed by next-generation sequencing
Alana Alexander; Debbie Steel; Beth Slikas; Kendra Hoekzema; Colm Carraher; Matthew Parks; Richard Cronn; C. Scott Baker
2012-01-01
Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20...
Hurtado, Luis A.; Lee, Eun Jung; Mateos, Mariana
2013-01-01
Phylogeographic studies of animals with low vagility and restricted to patchy habitats of the supralittoral zone, can uncover unknown diversity and shed light on processes that shaped evolution along a continent’s edge. The Pacific coast between southern California and central Mexico, including the megadiverse Gulf of California, offers a remarkable setting to study biological diversification in the supralittoral. A complex geological history coupled with cyclical fluctuations in temperature and sea level provided ample opportunities for diversification of supralittoral organisms. Indeed, a previous phylogeographic study of Ligia, a supralittoral isopod that has limited dispersal abilities and is restricted to rocky patches, revealed high levels of morphologically cryptic diversity. Herein, we examined phylogeographic patterns of Tylos, another supralittoral isopod with limited dispersal potential, but whose habitat (i.e., sandy shores) appears to be more extensive and connected than that of Ligia. We conducted Maximum Likelihood and Bayesian phylogenetic analyses on mitochondrial and nuclear DNA sequences. These analyses revealed multiple highly divergent lineages with discrete regional distributions, despite the recognition of a single valid species for this region. A traditional species-diagnostic morphological trait distinguished several of these lineages. The phylogeographic patterns of Tylos inside the Gulf of California show a deep and complex history. In contrast, patterns along the Pacific region between southern California and the Baja Peninsula indicate a recent range expansion, probably postglacial and related to changes in sea surface temperature (SST). In general, the phylogeographic patterns of Tylos differed from those of Ligia. Differences in the extension and connectivity of the habitats occupied by Tylos and Ligia may account for the different degrees of population isolation experienced by these two isopods and their contrasting phylogeographic patterns. Identification of divergent lineages of Tylos in the study area is important for conservation, as some populations are threatened by human activities. PMID:23844103
Takahata, Satoshi; Yago, Takumi; Iwabuchi, Keisuke; Hirakawa, Hideki; Suzuki, Yutaka; Onodera, Yasuyuki
2016-01-01
Spinach (Spinacia oleracea, 2n = 12) and sugar beet (Beta vulgaris, 2n = 18) are important crop members of the family Chenopodiaceae ss Sugar beet has a basic chromosome number of 9 and a cosexual breeding system, as do most members of the Chenopodiaceae ss. family. By contrast, spinach has a basic chromosome number of 6 and, although certain cultivars and genotypes produce monoecious plants, is considered to be a dioecious species. The loci determining male and monoecious sexual expression were mapped to different loci on the spinach sex chromosomes. In this study, a linkage map with 46 mapped protein-coding sequences was constructed for the spinach sex chromosomes. Comparison of the linkage map with a reference genome sequence of sugar beet revealed that the spinach sex chromosomes exhibited extensive synteny with sugar beet chromosomes 4 and 9. Tightly linked protein-coding genes linked to the male-determining locus in spinach corresponded to genes located in or around the putative pericentromeric and centromeric regions of sugar beet chromosomes 4 and 9, supporting the observation that recombination rates were low in the vicinity of the male-determining locus. The locus for monoecism was confined to a chromosomal segment corresponding to a region of approximately 1.7Mb on sugar beet chromosome 9, which may facilitate future positional cloning of the locus. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
DNA methylation dynamics during early plant life.
Bouyer, Daniel; Kramdi, Amira; Kassam, Mohamed; Heese, Maren; Schnittger, Arp; Roudier, François; Colot, Vincent
2017-09-25
Cytosine methylation is crucial for gene regulation and silencing of transposable elements in mammals and plants. While this epigenetic mark is extensively reprogrammed in the germline and early embryos of mammals, the extent to which DNA methylation is reset between generations in plants remains largely unknown. Using Arabidopsis as a model, we uncovered distinct DNA methylation dynamics over transposable element sequences during the early stages of plant development. Specifically, transposable elements and their relics show invariably high methylation at CG sites but increasing methylation at CHG and CHH sites. This non-CG methylation culminates in mature embryos, where it reaches saturation for a large fraction of methylated CHH sites, compared to the typical 10-20% methylation level observed in seedlings or adult plants. Moreover, the increase in CHH methylation during embryogenesis matches the hypomethylated state in the early endosperm. Finally, we show that interfering with the embryo-to-seedling transition results in the persistence of high CHH methylation levels after germination, specifically over sequences that are targeted by the RNA-directed DNA methylation (RdDM) machinery. Our findings indicate the absence of extensive resetting of DNA methylation patterns during early plant life and point instead to an important role of RdDM in reinforcing DNA methylation of transposable element sequences in every cell of the mature embryo. Furthermore, we provide evidence that this elevated RdDM activity is a specific property of embryogenesis.
Sebaihia, Mohammed; Preston, Andrew; Maskell, Duncan J.; Kuzmiak, Holly; Connell, Terry D.; King, Natalie D.; Orndorff, Paul E.; Miyamoto, David M.; Thomson, Nicholas R.; Harris, David; Goble, Arlette; Lord, Angela; Murphy, Lee; Quail, Michael A.; Rutter, Simon; Squares, Robert; Squares, Steven; Woodward, John; Parkhill, Julian; Temple, Louise M.
2006-01-01
Bordetella avium is a pathogen of poultry and is phylogenetically distinct from Bordetella bronchiseptica, Bordetella pertussis, and Bordetella parapertussis, which are other species in the Bordetella genus that infect mammals. In order to understand the evolutionary relatedness of Bordetella species and further the understanding of pathogenesis, we obtained the complete genome sequence of B. avium strain 197N, a pathogenic strain that has been extensively studied. With 3,732,255 base pairs of DNA and 3,417 predicted coding sequences, it has the smallest genome and gene complement of the sequenced bordetellae. In this study, the presence or absence of previously reported virulence factors from B. avium was confirmed, and the genetic bases for growth characteristics were elucidated. Over 1,100 genes present in B. avium but not in B. bronchiseptica were identified, and most were predicted to encode surface or secreted proteins that are likely to define an organism adapted to the avian rather than the mammalian respiratory tracts. These include genes coding for the synthesis of a polysaccharide capsule, hemagglutinins, a type I secretion system adjacent to two very large genes for secreted proteins, and unique genes for both lipopolysaccharide and fimbrial biogenesis. Three apparently complete prophages are also present. The BvgAS virulence regulatory system appears to have polymorphisms at a poly(C) tract that is involved in phase variation in other bordetellae. A number of putative iron-regulated outer membrane proteins were predicted from the sequence, and this regulation was confirmed experimentally for five of these. PMID:16885469
NASA Astrophysics Data System (ADS)
Sun, Z.; Zhou, D.
2013-12-01
Complete sedimentary sequences and weak erosion make the transition zone of the South China Sea the optimal place to study the entire evolution history of marginal sea basins, as well as the transition mechanism from active subduction to passive extension. 2D long cable seismic profiles revealed that both Baiyun and Liwan sag in the northeast South China Sea margin were lack of large controlling faults, especially in Liwan sag, syn-rift sequences waved above the basement. Dome-like uplifts(serpetinite uplifts?) or diapirs(?) came from below the basement, caused the syn-rift sequences pushed up around 36Ma(T80). Gravity inversion based on seismic reflection indicated that the dome has a lower density and a lower layer velocity than normal crust. Also around the Continent-Ocean Boundary (COB), a small segment similar to the lower crust was exposed. Between this exposed segment and the Cenozoic oceanic crust, mantle seems to be exhumed along the breakup point. Between the COB and roughly the shelf break, high velocity lower crust was discriminated in the northeast continental margin. Structures in northeast South China Sea seems having many similarities with Newfoundland-Iberia margin, by serpentinite(?) dome and exhumed mantle, although spreading rate here is intermediate. In fact, regional background suggests that there might be another interpretation: transition from Mesozoic subduction to Cenozoic extension occurred through paleo oceanic crust breakup in the northeast, which in turn retained Mesozoic subduction system beneath the northeast continental margin. Confined with magnetic anomaly, Bouguer gravity gradient anomaly, and well drilling lithological evidences, Cenozoic Baiyun sag developed upon Mesozoic fore-arc, while Cenozoic Liwan sag developed upon Mesozoic accretionary prism. The high velocity lower crust was caused by both remnant subducted slab and by Oceanic-Continent interaction due to subduction. There might also be serpentinite dome and exhumed mantle, but may be caused by extension and breakup of paleo oceanic slab, not the depth-dependent extension. IODP drillings are needed to test all these scientific conjectures.
Georgi, Enrico; Walter, Mathias C; Pfalzgraf, Marie-Theres; Northoff, Bernd H; Holdt, Lesca M; Scholz, Holger C; Zoeller, Lothar; Zange, Sabine; Antwerpen, Markus H
2017-01-01
Brucellosis, a worldwide common bacterial zoonotic disease, has become quite rare in Northern and Western Europe. However, since 2014 a significant increase of imported infections caused by Brucella (B.) melitensis has been noticed in Germany. Patients predominantly originated from Middle East including Turkey and Syria. These circumstances afforded an opportunity to gain insights into the population structure of Brucella strains. Brucella-isolates from 57 patients were recovered between January 2014 and June 2016 with culture confirmed brucellosis by the National Consultant Laboratory for Brucella. Their whole genome sequences were generated using the Illumina MiSeq platform. A whole genome-based SNP typing assay was developed in order to resolve geographically attributed genetic clusters. Results were compared to MLVA typing results, the current gold-standard of Brucella typing. In addition, sequences were examined for possible genetic variation within target regions of molecular diagnostic assays. Phylogenetic analyses revealed spatial clustering and distinguished strains from different patients in either case, whereas multiple isolates from a single patient or technical replicates showed identical SNP and MLVA profiles. By including WGS data from the NCBI database, five major genotypes were identified. Notably, strains originating from Turkey showed a high diversity and grouped into seven subclusters of genotype II. MLVA analysis congruently clustered all isolates and predominantly matched the East Mediterranean genetic clade. This study confirms whole-genome based SNP-analysis as a powerful tool for accurate typing of B. melitensis. Furthermore it allows special allocation and therefore provides useful information on the geographic origin for trace-back analysis. However, the lack of reliable metadata in public databases often prevents a resolution below geographic regions or country levels and corresponding precise trace-back analysis. Once this obstacle is resolved, WGS-derived bacterial typing adds an important method to complement epidemiological surveys during outbreak investigations. This is the first report of a detailed genetic investigation of an extensive collection of B. melitensis strains isolated from human cases in Germany.
Use of 16S rRNA gene for identification of a broad range of clinically relevant bacterial pathogens
Srinivasan, Ramya; Karaoz, Ulas; Volegova, Marina; ...
2015-02-06
According to World Health Organization statistics of 2011, infectious diseases remain in the top five causes of mortality worldwide. However, despite sophisticated research tools for microbial detection, rapid and accurate molecular diagnostics for identification of infection in humans have not been extensively adopted. Time-consuming culture-based methods remain to the forefront of clinical microbial detection. The 16S rRNA gene, a molecular marker for identification of bacterial species, is ubiquitous to members of this domain and, thanks to ever-expanding databases of sequence information, a useful tool for bacterial identification. In this study, we assembled an extensive repository of clinical isolates (n =more » 617), representing 30 medically important pathogenic species and originally identified using traditional culture-based or non-16S molecular methods. This strain repository was used to systematically evaluate the ability of 16S rRNA for species level identification. To enable the most accurate species level classification based on the paucity of sequence data accumulated in public databases, we built a Naïve Bayes classifier representing a diverse set of high-quality sequences from medically important bacterial organisms. We show that for species identification, a model-based approach is superior to an alignment based method. Overall, between 16S gene based and clinical identities, our study shows a genus-level concordance rate of 96% and a species-level concordance rate of 87.5%. We point to multiple cases of probable clinical misidentification with traditional culture based identification across a wide range of gram-negative rods and gram-positive cocci as well as common gram-negative cocci.« less
Use of 16S rRNA Gene for Identification of a Broad Range of Clinically Relevant Bacterial Pathogens
Srinivasan, Ramya; Karaoz, Ulas; Volegova, Marina; MacKichan, Joanna; Kato-Maeda, Midori; Miller, Steve; Nadarajan, Rohan; Brodie, Eoin L.; Lynch, Susan V.
2015-01-01
According to World Health Organization statistics of 2011, infectious diseases remain in the top five causes of mortality worldwide. However, despite sophisticated research tools for microbial detection, rapid and accurate molecular diagnostics for identification of infection in humans have not been extensively adopted. Time-consuming culture-based methods remain to the forefront of clinical microbial detection. The 16S rRNA gene, a molecular marker for identification of bacterial species, is ubiquitous to members of this domain and, thanks to ever-expanding databases of sequence information, a useful tool for bacterial identification. In this study, we assembled an extensive repository of clinical isolates (n = 617), representing 30 medically important pathogenic species and originally identified using traditional culture-based or non-16S molecular methods. This strain repository was used to systematically evaluate the ability of 16S rRNA for species level identification. To enable the most accurate species level classification based on the paucity of sequence data accumulated in public databases, we built a Naïve Bayes classifier representing a diverse set of high-quality sequences from medically important bacterial organisms. We show that for species identification, a model-based approach is superior to an alignment based method. Overall, between 16S gene based and clinical identities, our study shows a genus-level concordance rate of 96% and a species-level concordance rate of 87.5%. We point to multiple cases of probable clinical misidentification with traditional culture based identification across a wide range of gram-negative rods and gram-positive cocci as well as common gram-negative cocci. PMID:25658760
Characteristics of Human Brain Activity during the Evaluation of Service-to-Service Brand Extension
Yang, Taeyang; Lee, Seungji; Seomoon, Eunbi; Kim, Sung-Phil
2018-01-01
Brand extension is a marketing strategy to apply the previously established brand name into new goods or service. A number of studies have reported the characteristics of human event-related potentials (ERPs) in response to the evaluation of goods-to-goods brand extension. In contrast, human brain responses to the evaluation of service extension are relatively unexplored. The aim of this study was investigating cognitive processes underlying the evaluation of service-to-service brand extension with electroencephalography (EEG). A total of 56 text stimuli composed of service brand name (S1) followed by extended service name (S2) were presented to participants. The EEG of participants was recorded while participants were asked to evaluate whether a given brand extension was acceptable or not. The behavioral results revealed that participants could evaluate brand extension though they had little knowledge about the extended services, indicating the role of brand in the evaluation of the services. Additionally, we developed a method of grouping brand extension stimuli according to the fit levels obtained from behavioral responses, instead of grouping of stimuli a priori. The ERP analysis identified three components during the evaluation of brand extension: N2, P300, and N400. No difference in the N2 amplitude was found among the different levels of a fit between S1 and S2. The P300 amplitude for the low level of fit was greater than those for higher levels (p < 0.05). The N400 amplitude was more negative for the mid- and high-level fits than the low level. The ERP results of P300 and N400 indicate that the early stage of brain extension evaluation might first detect low-fit brand extension as an improbable target followed by the late stage of the integration of S2 into S1. Along with previous findings, our results demonstrate different cognitive evaluation of service-to-service brand extension from goods-to-goods. PMID:29479313
Characteristics of Human Brain Activity during the Evaluation of Service-to-Service Brand Extension.
Yang, Taeyang; Lee, Seungji; Seomoon, Eunbi; Kim, Sung-Phil
2018-01-01
Brand extension is a marketing strategy to apply the previously established brand name into new goods or service. A number of studies have reported the characteristics of human event-related potentials (ERPs) in response to the evaluation of goods-to-goods brand extension. In contrast, human brain responses to the evaluation of service extension are relatively unexplored. The aim of this study was investigating cognitive processes underlying the evaluation of service-to-service brand extension with electroencephalography (EEG). A total of 56 text stimuli composed of service brand name (S1) followed by extended service name (S2) were presented to participants. The EEG of participants was recorded while participants were asked to evaluate whether a given brand extension was acceptable or not. The behavioral results revealed that participants could evaluate brand extension though they had little knowledge about the extended services, indicating the role of brand in the evaluation of the services. Additionally, we developed a method of grouping brand extension stimuli according to the fit levels obtained from behavioral responses, instead of grouping of stimuli a priori . The ERP analysis identified three components during the evaluation of brand extension: N2, P300, and N400. No difference in the N2 amplitude was found among the different levels of a fit between S1 and S2. The P300 amplitude for the low level of fit was greater than those for higher levels ( p < 0.05). The N400 amplitude was more negative for the mid- and high-level fits than the low level. The ERP results of P300 and N400 indicate that the early stage of brain extension evaluation might first detect low-fit brand extension as an improbable target followed by the late stage of the integration of S2 into S1. Along with previous findings, our results demonstrate different cognitive evaluation of service-to-service brand extension from goods-to-goods.
ERIC Educational Resources Information Center
Saraç, Hatice Sezgi
2018-01-01
In this study, it was aimed to compare two distinct methodologies of grammar instruction: task-based and form-focused teaching. Within the application procedure, which lasted for one academic term, two groups of tertiary level learners (N = 53) were exposed to the same sequence of target structures, extensive writing activities and evaluation…
Calcium-activated potassium (BK) channels are encoded by duplicate slo1 genes in teleost fishes.
Rohmann, Kevin N; Deitcher, David L; Bass, Andrew H
2009-07-01
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue.
Calcium-Activated Potassium (BK) Channels Are Encoded by Duplicate slo1 Genes in Teleost Fishes
Deitcher, David L.; Bass, Andrew H.
2009-01-01
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue. PMID:19321796
Koludarov, Ivan; Jackson, Timothy N W; Sunagar, Kartik; Nouwens, Amanda; Hendrikx, Iwan; Fry, Bryan G
2014-12-22
Research into snake venoms has revealed extensive variation at all taxonomic levels. Lizard venoms, however, have received scant research attention in general, and no studies of intraclade variation in lizard venom composition have been attempted to date. Despite their iconic status and proven usefulness in drug design and discovery, highly venomous helodermatid lizards (gila monsters and beaded lizards) have remained neglected by toxinological research. Proteomic comparisons of venoms of three helodermatid lizards in this study has unravelled an unusual similarity in venom-composition, despite the long evolutionary time (~30 million years) separating H. suspectum from the other two species included in this study (H. exasperatum and H. horridum). Moreover, several genes encoding the major helodermatid toxins appeared to be extremely well-conserved under the influence of negative selection (but with these results regarded as preliminary due to the scarcity of available sequences). While the feeding ecologies of all species of helodermatid lizard are broadly similar, there are significant morphological differences between species, which impact upon relative niche occupation.
Neuroimaging with functional near infrared spectroscopy: From formation to interpretation
NASA Astrophysics Data System (ADS)
Herrera-Vega, Javier; Treviño-Palacios, Carlos G.; Orihuela-Espina, Felipe
2017-09-01
Functional Near Infrared Spectroscopy (fNIRS) is gaining momentum as a functional neuroimaging modality to investigate the cerebral hemodynamics subsequent to neural metabolism. As other neuroimaging modalities, it is neuroscience's tool to understand brain systems functions at behaviour and cognitive levels. To extract useful knowledge from functional neuroimages it is critical to understand the series of transformations applied during the process of the information retrieval and how they bound the interpretation. This process starts with the irradiation of the head tissues with infrared light to obtain the raw neuroimage and proceeds with computational and statistical analysis revealing hidden associations between pixels intensities and neural activity encoded to end up with the explanation of some particular aspect regarding brain function.To comprehend the overall process involved in fNIRS there is extensive literature addressing each individual step separately. This paper overviews the complete transformation sequence through image formation, reconstruction and analysis to provide an insight of the final functional interpretation.
Extensive Mobilome-Driven Genome Diversification in Mouse Gut-Associated Bacteroides vulgatus mpk.
Lange, Anna; Beier, Sina; Steimle, Alex; Autenrieth, Ingo B; Huson, Daniel H; Frick, Julia-Stefanie
2016-04-25
Like many other Bacteroides species, Bacteroides vulgatus strain mpk, a mouse fecal isolate which was shown to promote intestinal homeostasis, utilizes a variety of mobile elements for genome evolution. Based on sequences collected by Pacific Biosciences SMRT sequencing technology, we discuss the challenges of assembling and studying a bacterial genome of high plasticity. Additionally, we conducted comparative genomics comparing this commensal strain with the B. vulgatus type strain ATCC 8482 as well as multiple other Bacteroides and Parabacteroides strains to reveal the most important differences and identify the unique features of B. vulgatus mpk. The genome of B. vulgatus mpk harbors a large and diverse set of mobile element proteins compared with other sequenced Bacteroides strains. We found evidence of a number of different horizontal gene transfer events and a genome landscape that has been extensively altered by different mobilization events. A CRISPR/Cas system could be identified that provides a possible mechanism for preventing the integration of invading external DNA. We propose that the high genome plasticity and the introduced genome instabilities of B. vulgatus mpk arising from the various mobilization events might play an important role not only in its adaptation to the challenging intestinal environment in general, but also in its ability to interact with the gut microbiota. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Low, Van Lun; Adler, Peter H.; Takaoka, Hiroyuki; Ya’cob, Zubaidah; Lim, Phaik Eem; Tan, Tiong Kai; Lim, Yvonne A. L.; Chen, Chee Dhang; Norma-Rashid, Yusoff; Sofian-Azirun, Mohd
2014-01-01
The population genetic structure of Simulium tani was inferred from mitochondria-encoded sequences of cytochrome c oxidase subunits I (COI) and II (COII) along an elevational gradient in Cameron Highlands, Malaysia. A statistical parsimony network of 71 individuals revealed 71 haplotypes in the COI gene and 43 haplotypes in the COII gene; the concatenated sequences of the COI and COII genes revealed 71 haplotypes. High levels of genetic diversity but low levels of genetic differentiation were observed among populations of S. tani at five elevations. The degree of genetic diversity, however, was not in accordance with an altitudinal gradient, and a Mantel test indicated that elevation did not have a limiting effect on gene flow. No ancestral haplotype of S. tani was found among the populations. Pupae with unique structural characters at the highest elevation showed a tendency to form their own haplotype cluster, as revealed by the COII gene. Tajima’s D, Fu’s Fs, and mismatch distribution tests revealed population expansion of S. tani in Cameron Highlands. A strong correlation was found between nucleotide diversity and the levels of dissolved oxygen in the streams where S. tani was collected. PMID:24941043
Mountain gorilla genomes reveal the impact of long-term population decline and inbreeding
Ayub, Qasim; Szpak, Michal; Frandsen, Peter; Chen, Yuan; Yngvadottir, Bryndis; Cooper, David N.; de Manuel, Marc; Hernandez-Rodriguez, Jessica; Lobon, Irene; Siegismund, Hans R.; Pagani, Luca; Quail, Michael A.; Hvilsom, Christina; Mudakikwa, Antoine; Eichler, Evan E.; Cranfield, Michael R.; Marques-Bonet, Tomas; Tyler-Smith, Chris; Scally, Aylwyn
2015-01-01
Mountain gorillas are an endangered great ape subspecies and a prominent focus for conservation, yet we know little about their genomic diversity and evolutionary past. We sequenced whole genomes from multiple wild individuals and compared the genomes of all four Gorilla subspecies. We found that the two eastern subspecies have experienced a prolonged population decline over the past 100,000 years, resulting in very low genetic diversity and an increased overall burden of deleterious variation. A further recent decline in the mountain gorilla population has led to extensive inbreeding, such that individuals are typically homozygous at 34% of their sequence, leading to the purging of severely deleterious recessive mutations from the population. We discuss the causes of their decline and the consequences for their future survival. PMID:25859046
van der Meulen, Sjoerd B; de Jong, Anne; Kok, Jan
2016-01-01
RNA sequencing has revolutionized genome-wide transcriptome analyses, and the identification of non-coding regulatory RNAs in bacteria has thus increased concurrently. Here we reveal the transcriptome map of the lactic acid bacterial paradigm Lactococcus lactis MG1363 by employing differential RNA sequencing (dRNA-seq) and a combination of manual and automated transcriptome mining. This resulted in a high-resolution genome annotation of L. lactis and the identification of 60 cis-encoded antisense RNAs (asRNAs), 186 trans-encoded putative regulatory RNAs (sRNAs) and 134 novel small ORFs. Based on the putative targets of asRNAs, a novel classification is proposed. Several transcription factor DNA binding motifs were identified in the promoter sequences of (a)sRNAs, providing insight in the interplay between lactococcal regulatory RNAs and transcription factors. The presence and lengths of 14 putative sRNAs were experimentally confirmed by differential Northern hybridization, including the abundant RNA 6S that is differentially expressed depending on the available carbon source. For another sRNA, LLMGnc_147, functional analysis revealed that it is involved in carbon uptake and metabolism. L. lactis contains 13% leaderless mRNAs (lmRNAs) that, from an analysis of overrepresentation in GO classes, seem predominantly involved in nucleotide metabolism and DNA/RNA binding. Moreover, an A-rich sequence motif immediately following the start codon was uncovered, which could provide novel insight in the translation of lmRNAs. Altogether, this first experimental genome-wide assessment of the transcriptome landscape of L. lactis and subsequent sRNA studies provide an extensive basis for the investigation of regulatory RNAs in L. lactis and related lactococcal species.
Transposable elements in fish chromosomes: a study in the marine cobia species.
Costa, G W W F; Cioffi, M B; Bertollo, L A C; Molina, W F
2013-01-01
Rachycentron canadum, a unique representative of the Rachycentridae family, has been the subject of considerable biotechnological interest due to its potential use in marine fish farming. This species has undergone extensive research concerning the location of genes and multigene families on its chromosomes. Although most of the genome of some organisms is composed of repeated DNA sequences, aspects of the origin and dispersion of these elements are still largely unknown. The physical mapping of repetitive sequences on the chromosomes of R. canadum proved to be relevant for evolutionary and applied purposes. Therefore, here, we present the mapping by fluorescence in situ hybridization of the transposable element (TE) Tol2, the non-LTR retrotransposons Rex1 and Rex3, together with the 18S and 5S rRNA genes in the chromosome of this species. The Tol2 TE, belonging to the family of hAT transposons, is homogeneously distributed in the euchromatic regions of the chromosomes but with huge colocalization with the 18S rDNA sites. The hybridization signals for Rex1 and Rex3 revealed a semi-arbitrary distribution pattern, presenting differentiated dispersion in euchromatic and heterochromatic regions. Rex1 elements are associated preferentially in heterochromatic regions, while Rex3 shows a scarce distribution in the euchromatic regions of the chromosomes. The colocalization of TEs with 18S and 5S rDNA revealed complex chromosomal regions of repetitive sequences. In addition, the nonpreferential distribution of Rex1 and Rex3 in all heterochromatic regions, as well as the preferential distribution of the Tol2 transposon associated with 18S rDNA sequences, reveals a distinct pattern of organization of TEs in the genome of this species. A heterogeneous chromosomal colonization of TEs may confer different evolutionary rates to the heterochromatic regions of this species.
Oral imazalil exposure induces gut microbiota dysbiosis and colonic inflammation in mice.
Jin, Cuiyuan; Zeng, Zhaoyang; Fu, Zhengwei; Jin, Yuanxiang
2016-10-01
The fungicide imazalil (IMZ) is used extensively in vegetable and fruit plantations and as a post-harvest treatment to avoid rot. Here, we revealed that ingestion of 25, 50 and 100 mg IMZ kg(-1) body weight for 28 d induced gut microbiota dysbiosis and colonic inflammation in mice. The relative abundance of Bacteroidetes, Firmicutes and Actinobacteria in the cecal contents decreased significantly after exposure to 100 mg kg(-1) IMZ for 28 d. In feces, the relative abundance in Bacteroidetes, Firmicutes and Actinobacteria decreased significantly after being exposed to 100 mg kg(-1) IMZ for 1, 14 and 7 d, respectively. High throughput sequencing of the V3-V4 region of the bacterial 16S rRNA gene revealed a significant reduction in the richness and diversity of microbiota in cecal contents and feces of IMZ-treated mice. Operational taxonomic units (OTUs) analysis identified 49.3% of OTUs changed in cecal contents, while 55.6% of OTUs changed in the feces after IMZ exposure. Overall, at the phylum level, the relative abundance of Firmicutes, Proteobacteria and Actinobacteria increased and that of Bacteroidetes decreased in IMZ-treated groups. At the genus level, the abundance of Lactobacillus and Bifidobacterium decreased while those of Deltaproteobacteria and Desulfovibrio increased in response to IMZ exposure. In addition, it was observed that IMZ exposure could induce colonic inflammation characterized by infiltration of inflammatory cells, elevated levels of lipocalin-2 (lcn-2) in the feces, and increased mRNA levels of Tnf-α, IL-1β, IL-22 and IFN-γ in the colon. Our findings strongly suggest that ingestion of IMZ has some risks to human health. Copyright © 2016 Elsevier Ltd. All rights reserved.
Liu, Juanxu; Wei, Qian; Wang, Rongmin; Yang, Weiyuan; Ma, Yueyue; Chen, Guoju
2017-01-01
Petal senescence is a complex programmed process. It has been demonstrated previously that treatment with ethylene, a plant hormone involved in senescence, can extensively alter transcriptome and proteome profiles in plants. However, little is known regarding the impact of ethylene on posttranslational modification (PTM) or the association between PTM and the proteome. Protein degradation is one of the hallmarks of senescence, and ubiquitination, a major PTM in eukaryotes, plays important roles in protein degradation. In this study, we first obtained reference petunia (Petunia hybrida) transcriptome data via RNA sequencing. Next, we quantitatively investigated the petunia proteome and ubiquitylome and the association between them in petunia corollas following ethylene treatment. In total, 51,799 unigenes, 3,606 proteins, and 2,270 ubiquitination sites were quantified 16 h after ethylene treatment. Treatment with ethylene resulted in 14,448 down-regulated and 6,303 up-regulated unigenes (absolute log2 fold change > 1 and false discovery rate < 0.001), 284 down-regulated and 233 up-regulated proteins, and 320 up-regulated and 127 down-regulated ubiquitination sites using a 1.5-fold threshold (P < 0.05), indicating that global ubiquitination levels increase during ethylene-mediated corolla senescence in petunia. Several putative ubiquitin ligases were up-regulated at the protein and transcription levels. Our results showed that the global proteome and ubiquitylome were negatively correlated and that ubiquitination could be involved in the degradation of proteins during ethylene-mediated corolla senescence in petunia. Ethylene regulates hormone signaling transduction pathways at both the protein and ubiquitination levels in petunia corollas. In addition, our results revealed that ethylene increases the ubiquitination levels of proteins involved in endoplasmic reticulum-associated degradation. PMID:27810942
Vargo, Edward L.; Crissman, Jonathan R.; Booth, Warren; Santangelo, Richard G.; Mukha, Dmitry V.; Schal, Coby
2014-01-01
Understanding the population structure of species that disperse primarily by human transport is essential to predicting and controlling human-mediated spread of invasive species. The German cockroach (Blattella germanica) is a widespread urban invader that can actively disperse within buildings but is spread solely by human-mediated dispersal over longer distances; however, its population structure is poorly understood. Using microsatellite markers we investigated population structure at several spatial scales, from populations within single apartment buildings to populations from several cities across the U.S. and Eurasia. Both traditional measures of genetic differentiation and Bayesian clustering methods revealed increasing levels of genetic differentiation at greater geographic scales. Our results are consistent with active dispersal of cockroaches largely limited to movement within a building. Their low levels of genetic differentiation, yet limited active spread between buildings, suggests a greater likelihood of human-mediated dispersal at more local scales (within a city) than at larger spatial scales (within and between continents). About half the populations from across the U.S. clustered together with other U.S. populations, and isolation by distance was evident across the U.S. Levels of genetic differentiation among Eurasian cities were greater than those in the U.S. and greater than those between the U.S. and Eurasia, but no clear pattern of structure at the continent level was detected. MtDNA sequence variation was low and failed to reveal any geographical structure. The weak genetic structure detected here is likely due to a combination of historical admixture among populations and periodic population bottlenecks and founder events, but more extensive studies are needed to determine whether signatures of global movement may be present in this species. PMID:25020136
Extensive Geographic Mosaicism in Avian Influenza Viruses from Gulls in the Northern Hemisphere
Wille, Michelle; Robertson, Gregory J.; Whitney, Hugh; Bishop, Mary Anne; Runstadler, Jonathan A.; Lang, Andrew S.
2011-01-01
Due to limited interaction of migratory birds between Eurasia and America, two independent avian influenza virus (AIV) gene pools have evolved. There is evidence of low frequency reassortment between these regions, which has major implications in global AIV dynamics. Indeed, all currently circulating lineages of the PB1 and PA segments in North America are of Eurasian origin. Large-scale analyses of intercontinental reassortment have shown that viruses isolated from Charadriiformes (gulls, terns, and shorebirds) are the major contributor of these outsider events. To clarify the role of gulls in AIV dynamics, specifically in movement of genes between geographic regions, we have sequenced six gull AIV isolated in Alaska and analyzed these along with 142 other available gull virus sequences. Basic investigations of host species and the locations and times of isolation reveal biases in the available sequence information. Despite these biases, our analyses reveal a high frequency of geographic reassortment in gull viruses isolated in America. This intercontinental gene mixing is not found in the viruses isolated from gulls in Eurasia. This study demonstrates that gulls are important as vectors for geographically reassorted viruses, particularly in America, and that more surveillance effort should be placed on this group of birds. PMID:21697989
Grieco, Maria Angela B; Cavalcante, Janaina J V; Cardoso, Alexander M; Vieira, Ricardo P; Machado, Ednildo A; Clementino, Maysa M; Medeiros, Marcelo N; Albano, Rodolpho M; Garcia, Eloi S; de Souza, Wanderley; Constantino, Reginaldo; Martins, Orlando B
2013-01-01
Termites inhabit tropical and subtropical areas where they contribute to structure and composition of soils by efficiently degrading biomass with aid of resident gut microbiota. In this study, culture-independent molecular analysis was performed based on bacterial and archaeal 16S rRNA clone libraries to describe the gut microbial communities within Cornitermes cumulans, a South American litter-feeding termite. Our data reveal extensive bacterial diversity, mainly composed of organisms from the phyla Spirochaetes, Bacteroidetes, Firmicutes, Actinobacteria, and Fibrobacteres. In contrast, a low diversity of archaeal 16S rRNA sequences was found, comprising mainly members of the Crenarchaeota phylum. The diversity of archaeal methanogens was further analyzed by sequencing clones from a library for the mcrA gene, which encodes the enzyme methyl coenzyme reductase, responsible for catalyzing the last step in methane production, methane being an important greenhouse gas. The mcrA sequences were diverse and divided phylogenetically into three clades related to uncultured environmental archaea and methanogens found in different termite species. C. cumulans is a litter-feeding, mound-building termite considered a keystone species in natural ecosystems and also a pest in agriculture. Here, we describe the archaeal and bacterial communities within this termite, revealing for the first time its intriguing microbiota.
O'Connell, Kerry Joan; Motherway, Mary O'Connell; Liedtke, Andrea; Fitzgerald, Gerald F; Paul Ross, R; Stanton, Catherine; Zomer, Aldert; van Sinderen, Douwe
2014-06-01
Members of the genus Bifidobacterium are commonly found in the gastrointestinal tracts of mammals, including humans, where their growth is presumed to be dependent on various diet- and/or host-derived carbohydrates. To understand transcriptional control of bifidobacterial carbohydrate metabolism, we investigated two genetic carbohydrate utilization clusters dedicated to the metabolism of raffinose-type sugars and melezitose. Transcriptomic and gene inactivation approaches revealed that the raffinose utilization system is positively regulated by an activator protein, designated RafR. The gene cluster associated with melezitose metabolism was shown to be subject to direct negative control by a LacI-type transcriptional regulator, designated MelR1, in addition to apparent indirect negative control by means of a second LacI-type regulator, MelR2. In silico analysis, DNA-protein interaction, and primer extension studies revealed the MelR1 and MelR2 operator sequences, each of which is positioned just upstream of or overlapping the correspondingly regulated promoter sequences. Similar analyses identified the RafR binding operator sequence located upstream of the rafB promoter. This study indicates that transcriptional control of gene clusters involved in carbohydrate metabolism in bifidobacteria is subject to conserved regulatory systems, representing either positive or negative control.
Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing.
Zuo, Chunman; Blow, Matthew; Sreedasyam, Avinash; Kuo, Rita C; Ramamoorthy, Govindarajan Kunde; Torres-Jerez, Ivone; Li, Guifen; Wang, Mei; Dilworth, David; Barry, Kerrie; Udvardi, Michael; Schmutz, Jeremy; Tang, Yuhong; Xu, Ying
2018-01-01
Switchgrass ( Panicum virgatum L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts. We present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures. Overall, substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.
Bag, Sudeep; Al Rwahnih, Maher; Li, Ashley; Gonzalez, Asaul; Rowhani, Adib; Uyemoto, Jerry K; Sudarshana, Mysore R
2015-06-01
In spring 2013, 5-year-old nectarine (Prunus persica) trees, grafted on peach rootstock Nemaguard, were found stunted in a propagation block in California. These trees had been propagated from budwood of three nectarine cultivars imported from France and cleared through the post-entry quarantine procedure. Examination of the canopy failed to reveal any obvious symptoms. However, examination of the trunks, after stripping the bark, revealed extensive pitting on the woody cylinder. To investigate the etiological agent, double-stranded RNA was extracted from bark scrapings from the scion and rootstock portions, and a cDNA library was prepared and sequenced using the Illumina platform. BLAST analysis of the contigs generated by the de novo assembly of sequence reads indicated the presence of a novel luteovirus. Complete sequence of the viral genome was determined by sequencing of three overlapping cDNA clones generated by reverse transcription-polymerase chain reaction (RT-PCR) and by rapid amplification of the 5'- and 3'-termini. The virus genome was comprised of 4,991 nucleotides with a gene organization similar to members of the genus Luteovirus (family Luteoviridae). The presence of the virus, tentatively named Nectarine stem pitting-associated virus, was confirmed in symptomatic trees by RT-PCR. Discovery of a new virus in nectarine trees after post-entry quarantine indicates the importance of including (i) metagenomic analysis by next-generation sequencing approach as an essential tool to assess the plant health status, and (ii) examination of the woody cylinders as part of the indexing process.
Saijuntha, Weerachai; Sithithaworn, Paiboon; Duenngai, Kunyarat; Kiatsopit, Nadda; Andrews, Ross H; Petney, Trevor N
2011-03-01
Multilocus enzyme electrophoresis (MEE) and DNA sequencing of the mitochondrial cytochrome c oxidase subunit 1 (CO1) gene were used to genetically compare four species of echinostomes of human health importance. Fixed genetic differences among adults of Echinostoma revolutum, Echinostoma malayanum, Echinoparyphium recurvatum and Hypoderaeum conoideum were detected at 51-75% of the enzyme loci examined, while interspecific differences in CO1 sequence were detected at 16-32 (8-16%) of the 205 alignment positions. The results of the MEE analyses also revealed fixed genetic differences between E. revolutum from Thailand and Lao PDR at five (19%) of 27 loci, which could either represent genetic variation between geographically separated populations of a single species, or the existence of a cryptic (i.e. genetically distinct but morphologically similar) species. However, there was no support for the existence of cryptic species within E. revolutum based on the CO1 sequence between the two geographical areas sampled. Genetic variation in CO1 sequence was also detected among E. malayanum from three different species of snail intermediate host. Separate phylogenetic analyses of the MEE and DNA sequence data revealed that the two species of Echinostoma (E. revolutum and E. malayanum) did not form a monophyletic clade. These results, together with the large number of morphologically similar species with inadequate descriptions, poor specific diagnoses and extensive synonymy, suggest that the morphological characters used for species taxonomy of echinostomes in South-East Asia should be reconsidered according to the concordance of biology, morphology and molecular classification. Copyright © 2010 Elsevier B.V. All rights reserved.
Diebolder, Philipp; Keller, Armin; Haase, Stephanie; Schlegelmilch, Anne; Kiefer, Jonathan D; Karimi, Tamana; Weber, Tobias; Moldenhauer, Gerhard; Kehm, Roland; Eis-Hübinger, Anna M; Jäger, Dirk; Federspil, Philippe A; Herold-Mende, Christel; Dyckhoff, Gerhard; Kontermann, Roland E; Arndt, Michaela AE; Krauss, Jürgen
2014-01-01
The development of efficient strategies for generating fully human monoclonal antibodies with unique functional properties that are exploitable for tailored therapeutic interventions remains a major challenge in the antibody technology field. Here, we present a methodology for recovering such antibodies from antigen-encountered human B cell repertoires. As the source for variable antibody genes, we cloned immunoglobulin G (IgG)-derived B cell repertoires from lymph nodes of 20 individuals undergoing surgery for head and neck cancer. Sequence analysis of unselected “LYmph Node Derived Antibody Libraries” (LYNDAL) revealed a naturally occurring distribution pattern of rearranged antibody sequences, representing all known variable gene families and most functional germline sequences. To demonstrate the feasibility for selecting antibodies with therapeutic potential from these repertoires, seven LYNDAL from donors with high serum titers against herpes simplex virus (HSV) were panned on recombinant glycoprotein B of HSV-1. Screening for specific binders delivered 34 single-chain variable fragments (scFvs) with unique sequences. Sequence analysis revealed extensive somatic hypermutation of enriched clones as a result of affinity maturation. Binding of scFvs to common glycoprotein B variants from HSV-1 and HSV-2 strains was highly specific, and the majority of analyzed antibody fragments bound to the target antigen with nanomolar affinity. From eight scFvs with HSV-neutralizing capacity in vitro, the most potent antibody neutralized 50% HSV-2 at 4.5 nM as a dimeric (scFv)2. We anticipate our approach to be useful for recovering fully human antibodies with therapeutic potential. PMID:24256717
Diebolder, Philipp; Keller, Armin; Haase, Stephanie; Schlegelmilch, Anne; Kiefer, Jonathan D; Karimi, Tamana; Weber, Tobias; Moldenhauer, Gerhard; Kehm, Roland; Eis-Hübinger, Anna M; Jäger, Dirk; Federspil, Philippe A; Herold-Mende, Christel; Dyckhoff, Gerhard; Kontermann, Roland E; Arndt, Michaela A E; Krauss, Jürgen
2014-01-01
The development of efficient strategies for generating fully human monoclonal antibodies with unique functional properties that are exploitable for tailored therapeutic interventions remains a major challenge in the antibody technology field. Here, we present a methodology for recovering such antibodies from antigen-encountered human B cell repertoires. As the source for variable antibody genes, we cloned immunoglobulin G (IgG)-derived B cell repertoires from lymph nodes of 20 individuals undergoing surgery for head and neck cancer. Sequence analysis of unselected “LYmph Node Derived Antibody Libraries” (LYNDAL) revealed a naturally occurring distribution pattern of rearranged antibody sequences, representing all known variable gene families and most functional germline sequences. To demonstrate the feasibility for selecting antibodies with therapeutic potential from these repertoires, seven LYNDAL from donors with high serum titers against herpes simplex virus (HSV) were panned on recombinant glycoprotein B of HSV-1. Screening for specific binders delivered 34 single-chain variable fragments (scFvs) with unique sequences. Sequence analysis revealed extensive somatic hypermutation of enriched clones as a result of affinity maturation. Binding of scFvs to common glycoprotein B variants from HSV-1 and HSV-2 strains was highly specific, and the majority of analyzed antibody fragments bound to the target antigen with nanomolar affinity. From eight scFvs with HSV-neutralizing capacity in vitro,the most potent antibody neutralized 50% HSV-2 at 4.5 nM as a dimeric (scFv)2. We anticipate our approach to be useful for recovering fully human antibodies with therapeutic potential.
Singh, Pushpendra; Benjak, Andrej; Schuenemann, Verena J.; Herbig, Alexander; Avanzi, Charlotte; Busso, Philippe; Nieselt, Kay; Krause, Johannes; Vera-Cabrera, Lucio; Cole, Stewart T.
2015-01-01
Mycobacterium lepromatosis is an uncultured human pathogen associated with diffuse lepromatous leprosy and a reactional state known as Lucio's phenomenon. By using deep sequencing with and without DNA enrichment, we obtained the near-complete genome sequence of M. lepromatosis present in a skin biopsy from a Mexican patient, and compared it with that of Mycobacterium leprae, which has undergone extensive reductive evolution. The genomes display extensive synteny and are similar in size (∼3.27 Mb). Protein-coding genes share 93% nucleotide sequence identity, whereas pseudogenes are only 82% identical. The events that led to pseudogenization of 50% of the genome likely occurred before divergence from their most recent common ancestor (MRCA), and both M. lepromatosis and M. leprae have since accumulated new pseudogenes or acquired specific deletions. Functional comparisons suggest that M. lepromatosis has lost several enzymes required for amino acid synthesis whereas M. leprae has a defective heme pathway. M. lepromatosis has retained all functions required to infect the Schwann cells of the peripheral nervous system and therefore may also be neuropathogenic. A phylogeographic survey of 227 leprosy biopsies by differential PCR revealed that 221 contained M. leprae whereas only six, all from Mexico, harbored M. lepromatosis. Phylogenetic comparisons indicate that M. lepromatosis is closer than M. leprae to the MRCA, and a Bayesian dating analysis suggests that they diverged from their MRCA approximately 13.9 Mya. Thus, despite their ancient separation, the two leprosy bacilli are remarkably conserved and still cause similar pathologic conditions. PMID:25831531
Trombetta, Beniamino; Sellitto, Daniele; Scozzari, Rosaria; Cruciani, Fulvio
2014-01-01
It has long been believed that the male-specific region of the human Y chromosome (MSY) is genetically independent from the X chromosome. This idea has been recently dismissed due to the discovery that X–Y gametologous gene conversion may occur. However, the pervasiveness of this molecular process in the evolution of sex chromosomes has yet to be exhaustively analyzed. In this study, we explored how pervasive X–Y gene conversion has been during the evolution of the youngest stratum of the human sex chromosomes. By comparing about 0.5 Mb of human–chimpanzee gametologous sequences, we identified 19 regions in which extensive gene conversion has occurred. From our analysis, two major features of these emerged: 1) Several of them are evolutionarily conserved between the two species and 2) almost all of the 19 hotspots overlap with regions where X–Y crossing-over has been previously reported to be involved in sex reversal. Furthermore, in order to explore the dynamics of X–Y gametologous conversion in recent human evolution, we resequenced these 19 hotspots in 68 widely divergent Y haplogroups and used publicly available single nucleotide polymorphism data for the X chromosome. We found that at least ten hotspots are still active in humans. Hence, the results of the interspecific analysis are consistent with the hypothesis of widespread reticulate evolution within gametologous sequences in the differentiation of hominini sex chromosomes. In turn, intraspecific analysis demonstrates that X–Y gene conversion may modulate human sex-chromosome-sequence evolution to a greater extent than previously thought. PMID:24817545
Extensive sequencing of seven human genomes to characterize benchmark reference materials
Zook, Justin M.; Catoe, David; McDaniel, Jennifer; Vang, Lindsay; Spies, Noah; Sidow, Arend; Weng, Ziming; Liu, Yuling; Mason, Christopher E.; Alexander, Noah; Henaff, Elizabeth; McIntyre, Alexa B.R.; Chandramohan, Dhruva; Chen, Feng; Jaeger, Erich; Moshrefi, Ali; Pham, Khoa; Stedman, William; Liang, Tiffany; Saghbini, Michael; Dzakula, Zeljko; Hastie, Alex; Cao, Han; Deikus, Gintaras; Schadt, Eric; Sebra, Robert; Bashir, Ali; Truty, Rebecca M.; Chang, Christopher C.; Gulbahce, Natali; Zhao, Keyan; Ghosh, Srinka; Hyland, Fiona; Fu, Yutao; Chaisson, Mark; Xiao, Chunlin; Trow, Jonathan; Sherry, Stephen T.; Zaranek, Alexander W.; Ball, Madeleine; Bobe, Jason; Estep, Preston; Church, George M.; Marks, Patrick; Kyriazopoulou-Panagiotopoulou, Sofia; Zheng, Grace X.Y.; Schnall-Levin, Michael; Ordonez, Heather S.; Mudivarti, Patrice A.; Giorda, Kristina; Sheng, Ying; Rypdal, Karoline Bjarnesdatter; Salit, Marc
2016-01-01
The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly. PMID:27271295
Diversity of the Cronobacter Genus as Revealed by Multilocus Sequence Typing
Joseph, S.; Sonbol, H.; Hariri, S.; Desai, P.; McClelland, M.
2012-01-01
Cronobacter (previously known as Enterobacter sakazakii) is a diverse bacterial genus consisting of seven species: C. sakazakii, C. malonaticus, C. turicensis, C. universalis, C. muytjensii, C. dublinensis, and C. condimenti. In this study, we have used a multilocus sequence typing (MLST) approach employing the alleles of 7 genes (atpD, fusA, glnS, gltB, gyrB, infB, and ppsA; total length, 3,036 bp) to investigate the phylogenetic relationship of 325 Cronobacter species isolates. Strains were chosen on the basis of their species, geographic and temporal distribution, source, and clinical outcome. The earliest strain was isolated from milk powder in 1950, and the earliest clinical strain was isolated in 1953. The existence of seven species was supported by MLST. Intraspecific variation ranged from low diversity in C. sakazakii to extensive diversity within some species, such as C. muytjensii and C. dublinensis, including evidence of gene conversion between species. The predominant species from clinical sources was found to be C. sakazakii. C. sakazakii sequence type 4 (ST4) was the predominant sequence type of cerebral spinal fluid isolates from cases of meningitis. PMID:22785185
Bottacini, Francesca; Morrissey, Ruth; Roberts, Richard John; James, Kieran; van Breen, Justin; Egan, Muireann; Lambert, Jolanda; van Limpt, Kees; Knol, Jan; Motherway, Mary O’Connell; van Sinderen, Douwe
2018-01-01
Abstract Bifidobacterium breve represents one of the most abundant bifidobacterial species in the gastro-intestinal tract of breast-fed infants, where their presence is believed to exert beneficial effects. In the present study whole genome sequencing, employing the PacBio Single Molecule, Real-Time (SMRT) sequencing platform, combined with comparative genome analysis allowed the most extensive genetic investigation of this taxon. Our findings demonstrate that genes encoding Restriction/Modification (R/M) systems constitute a substantial part of the B. breve variable gene content (or variome). Using the methylome data generated by SMRT sequencing, combined with targeted Illumina bisulfite sequencing (BS-seq) and comparative genome analysis, we were able to detect methylation recognition motifs and assign these to identified B. breve R/M systems, where in several cases such assignments were confirmed by restriction analysis. Furthermore, we show that R/M systems typically impose a very significant barrier to genetic accessibility of B. breve strains, and that cloning of a methyltransferase-encoding gene may overcome such a barrier, thus allowing future functional investigations of members of this species. PMID:29294107
Guo, Zixiao; Li, Xinnian; He, Ziwen; Yang, Yuchen; Wang, Wenqing; Zhong, Cairong; Greenberg, Anthony J; Wu, Chung-I; Duke, Norman C; Shi, Suhua
2018-04-01
The projected increases in sea levels are expected to affect coastal ecosystems. Tropical communities, anchored by mangrove trees and having experienced frequent past sea level changes, appear to be vibrant at present. However, any optimism about the resilience of these ecosystems is premature because the impact of past climate events may not be reflected in the current abundance. To assess the impact of historical sea level changes, we conducted an extensive genetic diversity survey on the Indo-Malayan coast, a hotspot with a large global mangrove distribution. A survey of 26 populations in six species reveals extremely low genome-wide nucleotide diversity and hence very small effective population sizes (N e ) in all populations. Whole-genome sequencing of three mangrove species further shows the decline in N e to be strongly associated with the speed of past changes in sea level. We also used a recent series of flooding events in Yalong Bay, southern China, to test the robustness of mangroves to sea level changes in relation to their genetic diversity. The events resulted in the death of half of the mangrove trees in this area. Significantly, less genetically diverse mangrove species suffered much greater destruction. The dieback was accompanied by a drastic reduction in local invertebrate biodiversity. We thus predict that tropical coastal communities will be seriously endangered as the global sea level rises. Well-planned coastal development near mangrove forests will be essential to avert this crisis. © 2017 John Wiley & Sons Ltd.
Vectors as Epidemiological Sentinels: Patterns of Within-Tick Borrelia burgdorferi Diversity
Walter, Katharine S.; Carpi, Giovanna; Evans, Benjamin R.; Caccone, Adalgisa; Diuk-Wasser, Maria A.
2016-01-01
Hosts including humans, other vertebrates, and arthropods, are frequently infected with heterogeneous populations of pathogens. Within-host pathogen diversity has major implications for human health, epidemiology, and pathogen evolution. However, pathogen diversity within-hosts is difficult to characterize and little is known about the levels and sources of within-host diversity maintained in natural populations of disease vectors. Here, we examine genomic variation of the Lyme disease bacteria, Borrelia burgdorferi (Bb), in 98 individual field-collected tick vectors as a model for study of within-host processes. Deep population sequencing reveals extensive and previously undocumented levels of Bb variation: the majority (~70%) of ticks harbor mixed strain infections, which we define as levels Bb diversity pre-existing in a diverse inoculum. Within-tick diversity is thus a sample of the variation present within vertebrate hosts. Within individual ticks, we detect signatures of positive selection. Genes most commonly under positive selection across ticks include those involved in dissemination in vertebrate hosts and evasion of the vertebrate immune complement. By focusing on tick-borne Bb, we show that vectors can serve as epidemiological and evolutionary sentinels: within-vector pathogen diversity can be a useful and unbiased way to survey circulating pathogen diversity and identify evolutionary processes occurring in natural transmission cycles. PMID:27414806
Li, Xi; Mu, Xinli; Yang, Yunxing; Hua, Xiaoting; Yang, Qing; Wang, Nanfei; Du, Xiaoxing; Ruan, Zhi; Shen, Xiaoqiang; Yu, Yunsong
2016-04-01
Tigecycline (TIG) resistance is a growing concern because this antibiotic is regarded as one of the last resorts to treat infections caused by multidrug-resistant and extensively drug-resistant (XDR) bacteria. Information regarding TIG-resistant Escherichia coli isolates is scarce. In this study, we report the emergence of high-level TIG resistance in a longitudinal series of XDR E. coli isolates collected during TIG treatment. Whole-genome sequencing was performed for six E. coli strains harbouring bla(NDM-5) and genomic comparison revealed two amino acid substitutions. Mutation in rpsJ could be a significant factor conferring TIG resistance in these isolates. The fitness cost of TIG resistance in resistant strains was evaluated by determining the relative growth rate, indicating that TIG resistance reduced fitness by ca. 7%. This study is the first report to demonstrate high-level TIG resistance in E. coli in vivo. In addition, we report the first treatment-emergent minimum inhibitory concentration (MIC) development of TIG from 1mg/L to 64 mg/L in E. coli. Clinicians should be aware of the risk of an increase in the MIC of TIG under therapy. Copyright © 2016 Elsevier B.V. and the International Society of Chemotherapy. All rights reserved.
Characterization of kinetoplast DNA from Phytomonas serpens.
Sá-Carvalho, D; Perez-Morga, D; Traub-Cseko, Y M
1993-01-01
The restriction enzyme digestion of kinetoplast DNA from four Phytomonas serpens isolates shows an overall similar band pattern. One minicircle from isolate 30T was cloned and sequenced, showing low levels of homology but the same general features and organization as described for minicircles of other trypanosomatids. Extensive regions of the minicircle are composed by G and T on the H strand. These regions are very repetitive and similar to regions in a minicircle of Crithidia oncopelti and to telomeric sequences of Saccharomyces cerevisiae. Conserved Sequence Block 3, present in all trypanosomatids, is one nucleotide different from the consensus in P. serpens and provides a basis to differentiate P. serpens from other trypanosomatids. Electron microscopy of kinetoplast DNA evidenced a network with organization similar to other trypanosomatids and the measurement of minicircles confirmed the size of about 1.45 kb of the sequenced minicircle.
Influence of summer water-level variability on St. Lawrence River-wetland fish assemblages
McKenna, J.E.; Barkley, J.L.; Johnson, J. H.
2008-01-01
Water-level and associated variability are substantial influences on wetland and shallow aquatic communities. The Akwesasne Wetland Complex is an extensive St. Lawrence River system affected by water regulation. The responses of fish assemblages to short-term summer water-level variation were examined throughout this section of the St. Lawrence River and its tributaries. An influence of water-level variability was detected on abundance of three common species [bluntnose minnow (Pimephales notatus), rock bass (Amboplites rupestris), and white sucker (Catastomus commersonii)] and explained 30-44% of variation. This influence has implications for water regulation and natural resource management, and a larger scope evaluation may reveal more extensive effects.
NASA Astrophysics Data System (ADS)
LoPresto, Michael C.
2018-05-01
In a recent "AstroNote," I described a simple exercise on the mass-luminosity relation for main sequence stars as an example of exposing students in a general education science course of lower mathematical level to the use of quantitative skills such as collecting and analyzing data. Here I present another attempt at a meaningful experience for such students that again involves both the gathering and analysis of numerical data and comparison with accepted result, this time on the relationship of the mass and lifetimes of main sequence stars. This experiment can stand alone or be used as an extension of the previous mass-luminosity relationship experiment.
Typing Clostridium difficile strains based on tandem repeat sequences
2009-01-01
Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
Functional mapping of language networks in the normal brain using a word-association task.
Ghosh, Shantanu; Basu, Amrita; Kumaran, Senthil S; Khushu, Subash
2010-08-01
Language functions are known to be affected in diverse neurological conditions, including ischemic stroke, traumatic brain injury, and brain tumors. Because language networks are extensive, interpretation of functional data depends on the task completed during evaluation. The aim was to map the hemodynamic consequences of word association using functional magnetic resonance imaging (fMRI) in normal human subjects. Ten healthy subjects underwent fMRI scanning with a postlexical access semantic association task vs lexical processing task. The fMRI protocol involved a T2*-weighted gradient-echo echo-planar imaging (GE-EPI) sequence (TR 4523 ms, TE 64 ms, flip angle 90°) with alternate baseline and activation blocks. A total of 78 scans were taken (interscan interval = 3 s) with a total imaging time of 587 s. Functional data were processed in Statistical Parametric Mapping software (SPM2) with 8-mm Gaussian kernel by convolving the blood oxygenation level-dependent (BOLD) signal with an hemodynamic response function estimated by general linear method to generate SPM{t} and SPM{F} maps. Single subject analysis of the functional data (FWE-corrected, P≤0.001) revealed extensive activation in the frontal lobes, with overlaps among middle frontal gyrus (MFG), superior, and inferior frontal gyri. BOLD activity was also found in the medial frontal gyrus, middle occipital gyrus (MOG), anterior fusiform gyrus, superior and inferior parietal lobules, and to a smaller extent, the thalamus and right anterior cerebellum. Group analysis (FWE-corrected, P≤0.001) revealed neural recruitment of bilateral lingual gyri, left MFG, bilateral MOG, left superior occipital gyrus, left fusiform gyrus, bilateral thalami, and right cerebellar areas. Group data analysis revealed a cerebellar-occipital-fusiform-thalamic network centered around bilateral lingual gyri for word association, thereby indicating how these areas facilitate language comprehension by activating a semantic association network of words processed postlexical access. This finding is important when assessing the extent of cognitive damage and/or recovery and can be used for presurgical planning after optimization.
Analysis of human herpesvirus-6 IE1 sequence variation in clinical samples.
Stanton, Richard; Wilkinson, Gavin W G; Fox, Julie D
2003-12-01
Herpesvirus immediate early (IE) proteins are known to play key roles in establishing productive infections, regulating reactivation from latency, and creating a cellular environment favourable to viral replication. Human herpesvirus-6 (HHV-6) IE genes have not been studied as intensively as their homologues in the prototype betaherpesvirus human cytomegalovirus (HCMV). Whilst the HCMV IE1 gene is relatively conserved, early studies indicated that HHV-6 IE1 exhibited a high level of sequence variation between HHV-6A and HHV-6B isolates, although the observation was based primarily on virus stocks that had been isolated and propagated in vitro. In this study, we investigated the level of HHV-6 IE1 sequence variation in vivo by direct sequencing of circulating virus in clinical samples without prior in vitro culture. Sequences exactly matching those reported for reference HHV-6 isolates were identified in clinical samples, thus the HHV-6 laboratory strains used in the majority of in vitro studies appear to be representative of virus circulating in vivo with respect to the IE1 gene. The HHV-6 IE1 sequence is also conserved in reference strains that had been passaged extensively in vitro. The high degree of divergence between variant A and B type IE1 sequences was confirmed, but interestingly HHV-6B IE1 sequences were observed to further segregate into two distinct subgroups, with the laboratory strains Z29 and HST representative of these two subgroups. Within each HHV-6B subgroup, a remarkably high level of homology was observed. Thus the HHV-6 IE1 sequence appears highly stable, underlining its potential importance to the viral life cycle. Copyright 2003 Wiley-Liss, Inc.
Multi-region and single-cell sequencing reveal variable genomic heterogeneity in rectal cancer.
Liu, Mingshan; Liu, Yang; Di, Jiabo; Su, Zhe; Yang, Hong; Jiang, Beihai; Wang, Zaozao; Zhuang, Meng; Bai, Fan; Su, Xiangqian
2017-11-23
Colorectal cancer is a heterogeneous group of malignancies with complex molecular subtypes. While colon cancer has been widely investigated, studies on rectal cancer are very limited. Here, we performed multi-region whole-exome sequencing and single-cell whole-genome sequencing to examine the genomic intratumor heterogeneity (ITH) of rectal tumors. We sequenced nine tumor regions and 88 single cells from two rectal cancer patients with tumors of the same molecular classification and characterized their mutation profiles and somatic copy number alterations (SCNAs) at the multi-region and the single-cell levels. A variable extent of genomic heterogeneity was observed between the two patients, and the degree of ITH increased when analyzed on the single-cell level. We found that major SCNAs were early events in cancer development and inherited steadily. Single-cell sequencing revealed mutations and SCNAs which were hidden in bulk sequencing. In summary, we studied the ITH of rectal cancer at regional and single-cell resolution and demonstrated that variable heterogeneity existed in two patients. The mutational scenarios and SCNA profiles of two patients with treatment naïve from the same molecular subtype are quite different. Our results suggest each tumor possesses its own architecture, which may result in different diagnosis, prognosis, and drug responses. Remarkable ITH exists in the two patients we have studied, providing a preliminary impression of ITH in rectal cancer.
McQuade, L R; Hill, R J; Francis, D
1994-01-01
B chromosomes, despite their common occurrence throughout the animal and plant kingdoms, have not been investigated extensively at the molecular level. While the majority of B chromosomes occurring in animals have been described as heterochromatic, only a few researchers have examined the DNA of these chromosomes beyond this gross cytological level. This is the case in the largest of the gliding marsupial possums, the greater glider, Petauroides volans. To examine the molecular composition and localization of B-chromosome DNA sequences in P. volans, a combination of micromanipulation and the polymerase chain reaction was used in this study to isolate and then amplify the DNA of the B chromosomes. Localization of the isolated B-chromosome sequences to metaphase chromosomes was investigated using fluorescence in situ hybridization. The B chromosomes in this species are shown to be composed of a heterogeneous mixture of sequences, some of which are unique to the B chromosomes, while others exhibit homology to the centromeric regions of the autosomal complement.
EuroPineDB: a high-coverage web database for maritime pine transcriptome
2011-01-01
Background Pinus pinaster is an economically and ecologically important species that is becoming a woody gymnosperm model. Its enormous genome size makes whole-genome sequencing approaches are hard to apply. Therefore, the expressed portion of the genome has to be characterised and the results and annotations have to be stored in dedicated databases. Description EuroPineDB is the largest sequence collection available for a single pine species, Pinus pinaster (maritime pine), since it comprises 951 641 raw sequence reads obtained from non-normalised cDNA libraries and high-throughput sequencing from adult (xylem, phloem, roots, stem, needles, cones, strobili) and embryonic (germinated embryos, buds, callus) maritime pine tissues. Using open-source tools, sequences were optimally pre-processed, assembled, and extensively annotated (GO, EC and KEGG terms, descriptions, SNPs, SSRs, ORFs and InterPro codes). As a result, a 10.5× P. pinaster genome was covered and assembled in 55 322 UniGenes. A total of 32 919 (59.5%) of P. pinaster UniGenes were annotated with at least one description, revealing at least 18 466 different genes. The complete database, which is designed to be scalable, maintainable, and expandable, is freely available at: http://www.scbi.uma.es/pindb/. It can be retrieved by gene libraries, pine species, annotations, UniGenes and microarrays (i.e., the sequences are distributed in two-colour microarrays; this is the only conifer database that provides this information) and will be periodically updated. Small assemblies can be viewed using a dedicated visualisation tool that connects them with SNPs. Any sequence or annotation set shown on-screen can be downloaded. Retrieval mechanisms for sequences and gene annotations are provided. Conclusions The EuroPineDB with its integrated information can be used to reveal new knowledge, offers an easy-to-use collection of information to directly support experimental work (including microarray hybridisation), and provides deeper knowledge on the maritime pine transcriptome. PMID:21762488
Takaesu, Azusa; Watanabe, Kiyotaka; Takai, Shinji; Sasaki, Yukako; Orino, Koichi
2008-01-01
Background Iron-storage protein, ferritin plays a central role in iron metabolism. Ferritin has dual function to store iron and segregate iron for protection of iron-catalyzed reactive oxygen species. Tissue ferritin is composed of two kinds of subunits (H: heavy chain or heart-type subunit; L: light chain or liver-type subunit). Ferritin gene expression is controlled at translational level in iron-dependent manner or at transcriptional level in iron-independent manner. However, sequencing analysis of marine mammalian ferritin subunits has not yet been performed fully. The purpose of this study is to reveal cDNA-derived amino acid sequences of cetacean ferritin H and L subunits, and demonstrate the possibility of expression of these subunits, especially H subunit, by iron. Methods Sequence analyses of cetacean ferritin H and L subunits were performed by direct sequencing of polymerase chain reaction (PCR) fragments from cDNAs generated via reverse transcription-PCR of leukocyte total RNA prepared from blood samples of six different dolphin species (Pseudorca crassidens, Lagenorhynchus obliquidens, Grampus griseus, Globicephala macrorhynchus, Tursiops truncatus, and Delphinapterus leucas). The putative iron-responsive element sequence in the 5'-untranslated region of the six different dolphin species was revealed by direct sequencing of PCR fragments obtained using leukocyte genomic DNA. Results Dolphin H and L subunits consist of 182 and 174 amino acids, respectively, and amino acid sequence identities of ferritin subunits among these dolphins are highly conserved (H: 99–100%, (99→98) ; L: 98–100%). The conserved 28 bp IRE sequence was located -144 bp upstream from the initiation codon in the six different dolphin species. Conclusion These results indicate that six different dolphin species have conserved ferritin sequences, and suggest that these genes are iron-dependently expressed. PMID:18954429
Ravikumar, Komandur Elayavilli; Wagholikar, Kavishwar B; Li, Dingcheng; Kocher, Jean-Pierre; Liu, Hongfang
2015-06-06
Advances in the next generation sequencing technology has accelerated the pace of individualized medicine (IM), which aims to incorporate genetic/genomic information into medicine. One immediate need in interpreting sequencing data is the assembly of information about genetic variants and their corresponding associations with other entities (e.g., diseases or medications). Even with dedicated effort to capture such information in biological databases, much of this information remains 'locked' in the unstructured text of biomedical publications. There is a substantial lag between the publication and the subsequent abstraction of such information into databases. Multiple text mining systems have been developed, but most of them focus on the sentence level association extraction with performance evaluation based on gold standard text annotations specifically prepared for text mining systems. We developed and evaluated a text mining system, MutD, which extracts protein mutation-disease associations from MEDLINE abstracts by incorporating discourse level analysis, using a benchmark data set extracted from curated database records. MutD achieves an F-measure of 64.3% for reconstructing protein mutation disease associations in curated database records. Discourse level analysis component of MutD contributed to a gain of more than 10% in F-measure when compared against the sentence level association extraction. Our error analysis indicates that 23 of the 64 precision errors are true associations that were not captured by database curators and 68 of the 113 recall errors are caused by the absence of associated disease entities in the abstract. After adjusting for the defects in the curated database, the revised F-measure of MutD in association detection reaches 81.5%. Our quantitative analysis reveals that MutD can effectively extract protein mutation disease associations when benchmarking based on curated database records. The analysis also demonstrates that incorporating discourse level analysis significantly improved the performance of extracting the protein-mutation-disease association. Future work includes the extension of MutD for full text articles.
76 FR 37241 - Airworthiness Directives; Airbus Model A318, A319, A320, and A321 Series Airplanes
Federal Register 2010, 2011, 2012, 2013, 2014
2011-06-27
... Aircraft Monitoring] warnings during the landing gear retraction or extension sequence. * * * * * This... [Electronic Centralised Aircraft Monitoring] warnings during the landing gear retraction or extension sequence... [Electronic Centralised Aircraft [[Page 37243
Variations in the crustal structure beneath western Turkey
NASA Astrophysics Data System (ADS)
Saunders, Paul; Priestley, Keith; Taymaz, Tuncay
1998-08-01
We use teleseismic receiver functions to investigate the crustal structure at two locations in western Turkey using seismic data recorded on small arrays of temporary broad-band seismographs. The results from these analyses are compared with receiver function results from the GDSN station ANTO on the Anatolian Plateau in central Turkey. The crust is ~ 30 km thick in the region of western Turkey where active normal faulting reveals present-day extension in the upper crust and alkali-basaltic volcanism reveals recent extension within the subcrustal lithosphere The crust is ~ 34 km thick further east where crustal extension is still evident but less pronounced. In the Anatolian Plateau, which is not currently extending, the crust is ~ 38 km thick. The level of extension estimated from these measurements of crustal thickness implies a β -factor of ~ 1.2. This value agrees with the amount of extension estimated in the upper crust from the integrated seismic strain rate (β -factor of ~ 1.3), from surface faulting(β -factor of ~ 1.25) and from the amount of extension in the subcrustal lithosphere estimated from the volcanism (β -factor < 2), all indicating that the extension is approximately uniformly distributed vertically throughout the lithosphere. The Moho transition in this region appears to thin slightly as the degree of extension increases westwards.
NASA Astrophysics Data System (ADS)
Tsai, M. C.
2017-12-01
High strain accumulation across the fold-and-thrust belt in Southwestern Taiwan are revealed by the Continuous GPS (cGPS) and SAR interferometry. This high strain is generally accommodated by the major active structures in fold-and-thrust belt of western Foothills in SW Taiwan connected to the accretionary wedge in the incipient are-continent collision zone. The active structures across the high strain accumulation include the deformation front around the Tainan Tableland, the Hochiali, Hsiaokangshan, Fangshan and Chishan faults. Among these active structures, the deformation pattern revealed from cGPS and SAR interferometry suggest that the Fangshan transfer fault may be a left-lateral fault zone with thrust component accommodating the westward differential motion of thrust sheets on both side of the fault. In addition, the Chishan fault connected to the splay fault bordering the lower-slope and upper-slope of the accretionary wedge which could be the major seismogenic fault and an out-of-sequence thrust fault in SW Taiwan. The big earthquakes resulted from the reactivation of out-of-sequence thrusts have been observed along the Nankai accretionary wedge, thus the assessment of the major seismogenic structures by strain accumulation between the frontal décollement and out-of-sequence thrusts is a crucial topic. According to the background seismicity, the low seismicity and mid-crust to mantle events are observed inland and the lower- and upper- slope domain offshore SW Taiwan, which rheologically implies the upper crust of the accretionary wedge is more or less aseimic. This result may suggest that the excess fluid pressure from the accretionary wedge not only has significantly weakened the prism materials as well as major fault zone, but also makes the accretionary wedge landward extension, which is why the low seismicity is observed in SW Taiwan area. Key words: Continuous GPS, SAR interferometry, strain rate, out-of-sequence thrust.
Heterogeneity of DNA methylation in multifocal prostate cancer.
Serenaite, Inga; Daniunaite, Kristina; Jankevicius, Feliksas; Laurinavicius, Arvydas; Petroska, Donatas; Lazutka, Juozas R; Jarmalaite, Sonata
2015-01-01
Most prostate cancer (PCa) cases are multifocal, and separate foci display histological and molecular heterogeneity. DNA hypermethylation is a frequent alteration in PCa, but interfocal heterogeneity of these changes has not been extensively investigated. Ten pairs of foci from multifocal PCa and 15 benign prostatic hyperplasia (BPH) samples were obtained from prostatectomy specimens, resulting altogether in 35 samples. Methylation-specific PCR (MSP) was used to evaluate methylation status of nine tumor suppressor genes (TSGs), and a set of selected TSGs was quantitatively analyzed for methylation intensity by pyrosequencing. Promoter sequences of the RASSF1 and ESR1 genes were methylated in all paired PCa foci, and frequent (≥75 %) DNA methylation was detected in RARB, GSTP1, and ABCB1 genes. MSP revealed different methylation status of at least one gene in separate foci in 8 out of 10 multifocal tumors. The mean methylation level of ESR1, GSTP1, RASSF1, and RARB differed between the paired foci of all PCa cases. The intensity of DNA methylation in these TSGs was significantly higher in PCa cases than in BPH (p < 0.001). Hierarchical cluster analysis revealed a divergent methylation profile of paired PCa foci, while the foci from separate cases with biochemical recurrence showed similar methylation profile and the highest mean levels of DNA methylation. Our findings suggest that PCa tissue is heterogeneous, as between paired foci differences in DNA methylation status were found. Common epigenetic profile of recurrent tumors can be inferred from our data.
Turgeon, J; Bernatchez, L
2001-04-01
The comparative molecular phylogeography of regional fish fauna has revealed the wide distribution of young clades in freshwater fishes of formerly glaciated areas as well as interspecific incongruences in their refugial origins and recolonization routes. In this study, we employed single-strand conformation polymorphism (SSCP) and sequence analyses to describe mitochondrial DNA (mtDNA) polymorphism among 27 populations of the lake cisco (Coregonus artedi) from its entire range of distribution in order to evaluate the hypothesis of dual glacial refuges proposed by Bernatchez & Dodson against the traditional view that this species is solely of Mississippian origin. Results indicate that this taxon is composed of two closely related groups that are widely distributed and intermixed over most of the sampled range. The estimated level of divergence (0.48%), the contrast in the geographical distribution of each group, as well as the general distribution of C. artedi in North America together support the hypothesis that one group dispersed from a Mississippian refuge via the proglacial lakes, while the other is of Atlantic origin and also took advantages of earlier dispersal routes towards eastern Hudson Bay drainages. However, the signal of past range fragmentation revealed by a nested clade analysis was weak, and did not allow to formally exclude the hypothesis of a single Mississippian origin for both lineages. Comparisons with the phylogeographic patterns of other Nearctic freshwater fishes suggest that the salinity tolerance and thermal sensitivity of lake cisco may have been determinant for its extensive postglacial dispersal. The presence or co-occurrence of sympatric or allopatric eco/morphotypes were not found to be necessarily associated with the presence of both haplotype groups.
Olsen, Anne Berit; Gulla, Snorre; Steinum, Terje; Colquhoun, Duncan J; Nilsen, Hanne K; Duchaud, Eric
2017-06-01
Skin ulcer development in sea-reared salmonids, commonly associated with Tenacibaculum spp., is a significant fish welfare- and economical problem in Norwegian aquaculture. A collection of 89 Tenacibaculum isolates was subjected to multilocus sequence analysis (MLSA). The isolates were retrieved from outbreaks of clinical disease in farms spread along the Norwegian coast line from seven different fish species over a period of 19 years. MLSA analysis reveals considerable genetic diversity, but allows identification of four main clades. One clade encompasses isolates belonging to the species T. dicentrarchi, whereas three clades encompass bacteria that likely represent novel, as yet undescribed species. The study identified T. maritimum in lumpsucker, T. ovolyticum in halibut, and has extended the host and geographic range for T. soleae, isolated from wrasse. The overall lack of clonality and host specificity, with some indication of geographical range restriction argue for local epidemics involving multiple strains. The diversity of Tenacibaculum isolates from fish displaying ulcerative disease may complicate vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.
Pan, Zhangyuan; Li, Shengdi; Liu, Qiuyue; Wang, Zhen; Zhou, Zhengkui; Di, Ran; Miao, Benpeng; Hu, Wenping; Wang, Xiangyu; Hu, Xiaoxiang; Xu, Ze; Wei, Dongkai; He, Xiaoyun; Yuan, Liyun; Guo, Xiaofei; Liang, Benmeng; Wang, Ruichao; Li, Xiaoyu; Cao, Xiaohan; Dong, Xinlong; Xia, Qing; Shi, Hongcai; Hao, Geng; Yang, Jean; Luosang, Cuicheng; Zhao, Yiqiang; Jin, Mei; Zhang, Yingjie; Lv, Shenjin; Li, Fukuan; Ding, Guohui; Chu, Mingxing; Li, Yixue
2018-01-01
Abstract Background Animal domestication has been extensively studied, but the process of feralization remains poorly understood. Results Here, we performed whole-genome sequencing of 99 sheep and identified a primary genetic divergence between 2 heterogeneous populations in the Tibetan Plateau, including 1 semi-feral lineage. Selective sweep and candidate gene analysis revealed local adaptations of these sheep associated with sensory perception, muscle strength, eating habit, mating process, and aggressive behavior. In particular, a horn-related gene, RXFP2, showed signs of rapid evolution specifically in the semi-feral breeds. A unique haplotype and repressed horn-related tissue expression of RXFP2 were correlated with higher horn length, as well as spiral and horizontally extended horn shape. Conclusions Semi-feralization has an extensive impact on diverse phenotypic traits of sheep. By acquiring features like those of their wild ancestors, semi-feral sheep were able to regain fitness while in frequent contact with wild surroundings and rare human interventions. This study provides a new insight into the evolution of domestic animals when human interventions are no longer dominant. PMID:29668959
Li, Ying; Ji, Xiaoting; Song, Weiling; Guo, Yingshu
2013-04-03
A cross-circular amplification system for sensitive detection of adenosine triphosphate (ATP) in cancer cells was developed based on aptamer-target interaction, magnetic microbeads (MBs)-assisted strand displacement amplification and target recycling. Here we described a new recognition probe possessing two parts, the ATP aptamer and the extension part. The recognition probe was firstly immobilized on the surface of MBs and hybridized with its complementary sequence to form a duplex. When combined with ATP, the probe changed its conformation, revealing the extension part in single-strand form, which further served as a toehold for subsequent target recycling. The released complementary sequence of the probe acted as the catalyst of the MB-assisted strand displacement reaction. Incorporated with target recycling, a large amount of biotin-tagged MB complexes were formed to stimulate the generation of chemiluminescence (CL) signal in the presence of luminol and H2O2 by incorporating with streptavidin-HRP, reaching a detection limit of ATP as low as 6.1×10(-10)M. Moreover, sample assays of ATP in Ramos Burkitt's lymphoma B cells were performed, which confirmed the reliability and practicality of the protocol. Copyright © 2013 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Fuentealba, Claudio; Sánchez-Matamoros, Gloria; Badillo, Edelmira; Trigueros, María
2017-01-01
This study is part of a more extensive research project that addresses the understanding of the derivative concept in university students with prior instruction in differential calculus. In particular, we focus on the analysis of students' responses to a sequence of tasks that require a high level of understanding of the concept, and complement…
A sequence-based survey of the complex structural organization of tumor genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Collins, Colin; Raphael, Benjamin J.; Volik, Stanislav
2008-04-03
The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using End Sequencing Profiling (ESP), which relies on paired-end sequencing of cloned tumor genomes. In this study, brain, breast, ovary and prostate tumors along with three breast cancer cell lines were surveyed with ESP yielding the largest available collection of sequence-ready tumor genome breakpoints and providing evidence that some rearrangements may be recurrent. Sequencing and fluorescence in situ hybridization (FISH) confirmed translocations and complex tumor genome structures that include coamplification and packaging of disparate genomic loci with associated molecular heterogeneity. Comparison ofmore » the tumor genomes suggests recurrent rearrangements. Some are likely to be novel structural polymorphisms, whereas others may be bona fide somatic rearrangements. A recurrent fusion transcript in breast tumors and a constitutional fusion transcript resulting from a segmental duplication were identified. Analysis of end sequences for single nucleotide polymorphisms (SNPs) revealed candidate somatic mutations and an elevated rate of novel SNPs in an ovarian tumor. These results suggest that the genomes of many epithelial tumors may be far more dynamic and complex than previously appreciated and that genomic fusions including fusion transcripts and proteins may be common, possibly yielding tumor-specific biomarkers and therapeutic targets.« less
NASA Astrophysics Data System (ADS)
Bie, L.; Rietbrock, A.; Agurto-Detzel, H.
2017-12-01
The forearc region in subduction zones deforms in response to relative movement on the plate interface throughout the earthquake cycle. Megathrust earthquakes may alter the stress field in the forearc areas from compression to extension, resulting in normal faulting earthquakes. Recent cases include the 2011 Iwaki sequence following the Tohoku-Oki earthquake in Japan, and 2010 Pichilemu sequence after the Maule earthquake in central Chile. Given the closeness of these normal fault events to residential areas, and their shallow depth, they may pose equivalent, if not higher, seismic risk in comparison to earthquakes on the megathrust. Here, we focus on the 2010 Pichilemu sequence following the Mw 8.8 Maule earthquake in central Chile, where the Nazca Plate subducts beneath the South American Plate. Previous studies have clearly delineated the Pichilemu normal fault structure. However, it is not clear whether the Pichilemu events fully released the extensional stress exerted by the Maule mainshock, or the forearc area is still controlled by extensional stress. A 3 months displacement time-series, constructed by radar satellite images, clearly shows continuous aseismic deformation along the Pichilemu fault. Kinematic inversion reveals peak afterslip of 25 cm at shallow depth, equivalent to a Mw 5.4 earthquake. We identified a Mw 5.3 earthquake 2 months after the Pichilemu sequence from both geodetic and seismic observations. Nonlinear inversion from geodetic data suggests that this event ruptured a normal fault conjugate to the Pichilemu fault, at a depth of 4.5 km, consistent with the result obtained from independent moment tensor inversion. We relocated aftershocks in the Pichilemu area using relative arrivals time and a 3D velocity model. The spatial correlation between geodetic deformation and aftershocks reveals three additional areas which may have experienced aseismic slip at depth. Both geodetic displacement and aftershock distribution show a conjugated L-shape feature. This pattern coincides with weak zones depicted by high vp/vs and low vs in the upper crust of this region, suggesting fluid control of seismic and aseismic activities in the Pichilemu area.
MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads
Lukjancenko, Oksana; Thomsen, Martin Christen Frølund; Maddalena Sperotto, Maria; Lund, Ole; Møller Aarestrup, Frank; Sicheritz-Pontén, Thomas
2017-01-01
An increasing amount of species and gene identification studies rely on the use of next generation sequence analysis of either single isolate or metagenomics samples. Several methods are available to perform taxonomic annotations and a previous metagenomics benchmark study has shown that a vast number of false positive species annotations are a problem unless thresholds or post-processing are applied to differentiate between correct and false annotations. MGmapper is a package to process raw next generation sequence data and perform reference based sequence assignment, followed by a post-processing analysis to produce reliable taxonomy annotation at species and strain level resolution. An in-vitro bacterial mock community sample comprised of 8 genuses, 11 species and 12 strains was previously used to benchmark metagenomics classification methods. After applying a post-processing filter, we obtained 100% correct taxonomy assignments at species and genus level. A sensitivity and precision at 75% was obtained for strain level annotations. A comparison between MGmapper and Kraken at species level, shows MGmapper assigns taxonomy at species level using 84.8% of the sequence reads, compared to 70.5% for Kraken and both methods identified all species with no false positives. Extensive read count statistics are provided in plain text and excel sheets for both rejected and accepted taxonomy annotations. The use of custom databases is possible for the command-line version of MGmapper, and the complete pipeline is freely available as a bitbucked package (https://bitbucket.org/genomicepidemiology/mgmapper). A web-version (https://cge.cbs.dtu.dk/services/MGmapper) provides the basic functionality for analysis of small fastq datasets. PMID:28467460
MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads.
Petersen, Thomas Nordahl; Lukjancenko, Oksana; Thomsen, Martin Christen Frølund; Maddalena Sperotto, Maria; Lund, Ole; Møller Aarestrup, Frank; Sicheritz-Pontén, Thomas
2017-01-01
An increasing amount of species and gene identification studies rely on the use of next generation sequence analysis of either single isolate or metagenomics samples. Several methods are available to perform taxonomic annotations and a previous metagenomics benchmark study has shown that a vast number of false positive species annotations are a problem unless thresholds or post-processing are applied to differentiate between correct and false annotations. MGmapper is a package to process raw next generation sequence data and perform reference based sequence assignment, followed by a post-processing analysis to produce reliable taxonomy annotation at species and strain level resolution. An in-vitro bacterial mock community sample comprised of 8 genuses, 11 species and 12 strains was previously used to benchmark metagenomics classification methods. After applying a post-processing filter, we obtained 100% correct taxonomy assignments at species and genus level. A sensitivity and precision at 75% was obtained for strain level annotations. A comparison between MGmapper and Kraken at species level, shows MGmapper assigns taxonomy at species level using 84.8% of the sequence reads, compared to 70.5% for Kraken and both methods identified all species with no false positives. Extensive read count statistics are provided in plain text and excel sheets for both rejected and accepted taxonomy annotations. The use of custom databases is possible for the command-line version of MGmapper, and the complete pipeline is freely available as a bitbucked package (https://bitbucket.org/genomicepidemiology/mgmapper). A web-version (https://cge.cbs.dtu.dk/services/MGmapper) provides the basic functionality for analysis of small fastq datasets.
Nirasawa, Satoru; Nakahara, Kazuhiko; Takahashi, Saori
2018-02-27
Paenidase is the first microorganism-derived D-aspartyl endopeptidase that specifically recognizes an internal D-Asp residue to cleave [D-Asp]-X peptide bonds. Using peptide sequences obtained from the protein, we performed PCR with degenerate primers to amplify the paenidase I-encoding gene. Nucleotide sequencing revealed that mature paenidase I consists of 322 amino acid residues and that the protein is encoded as a pro-protein with a 197-amino-acid N-terminal extension compared to the mature protein. Paenidase I exhibits amino acid sequence similarity to several penicillin-binding proteins. In addition, paenidase I was classified into peptidase family S12 based on a MEROPS database search. Family S12 contains serine-type D-Ala-D-Ala carboxypeptidases that have three active site residues (Ser, Lys, and Tyr) in the conserved motifs Ser-Xaa-Thr-Lys and Tyr-Xaa-Asn. These motifs were conserved in the primary structure of paenidase I, and the role of these residues was confirmed by site-directed mutagenesis.
Hargreaves, Adam D; Zhou, Long; Christensen, Josef; Marlétaz, Ferdinand; Liu, Shiping; Li, Fang; Jansen, Peter Gildsig; Spiga, Enrico; Hansen, Matilde Thye; Pedersen, Signe Vendelbo Horn; Biswas, Shameek; Serikawa, Kyle; Fox, Brian A; Taylor, William R; Mulley, John Frederick; Zhang, Guojie; Heller, R Scott; Holland, Peter W H
2017-07-18
The sand rat Psammomys obesus is a gerbil species native to deserts of North Africa and the Middle East, and is constrained in its ecology because high carbohydrate diets induce obesity and type II diabetes that, in extreme cases, can lead to pancreatic failure and death. We report the sequencing of the sand rat genome and discovery of an unusual, extensive, and mutationally biased GC-rich genomic domain. This highly divergent genomic region encompasses several functionally essential genes, and spans the ParaHox cluster which includes the insulin-regulating homeobox gene Pdx1. The sequence of sand rat Pdx1 has been grossly affected by GC-biased mutation, leading to the highest divergence observed for this gene across the Bilateria. In addition to genomic insights into restricted caloric intake in a desert species, the discovery of a localized chromosomal region subject to elevated mutation suggests that mutational heterogeneity within genomes could influence the course of evolution.
PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities
2011-01-01
Background Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS) that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. Results The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. Conclusions PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/. PMID:21385349
Setliff, Ian; McDonnell, Wyatt J; Raju, Nagarajan; Bombardi, Robin G; Murji, Amyn A; Scheepers, Cathrine; Ziki, Rutendo; Mynhardt, Charissa; Shepherd, Bryan E; Mamchak, Alusha A; Garrett, Nigel; Karim, Salim Abdool; Mallal, Simon A; Crowe, James E; Morris, Lynn; Georgiev, Ivelin S
2018-06-13
Characterization of single antibody lineages within infected individuals has provided insights into the development of Env-specific antibodies. However, a systems-level understanding of the humoral response against HIV-1 is limited. Here, we interrogated the antibody repertoires of multiple HIV-infected donors from an infection-naive state through acute and chronic infection using next-generation sequencing. This analysis revealed the existence of "public" antibody clonotypes that were shared among multiple HIV-infected individuals. The HIV-1 reactivity for representative antibodies from an identified public clonotype shared by three donors was confirmed. Furthermore, a meta-analysis of publicly available antibody repertoire sequencing datasets revealed antibodies with high sequence identity to known HIV-reactive antibodies, even in repertoires that were reported to be HIV naive. The discovery of public antibody clonotypes in HIV-infected individuals represents an avenue of significant potential for better understanding antibody responses to HIV-1 infection, as well as for clonotype-specific vaccine development. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Dorado, Erika Jimena; Okoth, Sheila Akinyi; Montenegro, Lidia Madeline; Diaz, Gustavo; Barnwell, John W.; Udhayakumar, Venkatachalam; Murillo Solano, Claribel
2016-01-01
Most Plasmodium falciparum-detecting rapid diagnostic tests (RDTs) target histidine-rich protein 2 (PfHRP2). However, P. falciparum isolates with deletion of the pfhrp2 gene and its homolog gene, pfhrp3, have been detected. We carried out an extensive investigation on 365 P. falciparum dried blood samples collected from seven P. falciparum endemic sites in Colombia between 2003 and 2012 to genetically characterise and geographically map pfhrp2- and/or pfhrp3-negative P. falciparum parasites in the country. We found a high proportion of pfhrp2-negative parasites only in Amazonas (15/39; 38.5%), and these parasites were also pfhrp3-negative. These parasites were collected between 2008 and 2009 in Amazonas, while pfhrp3-negative parasites (157/365, 43%) were found in all the sites and from each of the sample collection years evaluated (2003 to 2012). We also found that all pfhrp2- and/or pfhrp3-negative parasites were also negative for one or both flanking genes. Six sub-population clusters were established with 93.3% (14/15) of the pfhrp2-negative parasites grouped in the same cluster and sharing the same haplotype. This haplotype corresponded with the genetic lineage BV1, a multidrug resistant strain that caused two outbreaks reported in Peru between 2010 and 2013. We found this BV1 lineage in the Colombian Amazon as early as 2006. Two new clonal lineages were identified in these parasites from Colombia: the genetic lineages EV1 and F. PfHRP2 sequence analysis revealed high genetic diversity at the amino acid level, with 17 unique sequences identified among 53 PfHRP2 sequences analysed. The use of PfHRP2-based RDTs is not recommended in Amazonas because of the high proportion of parasites with pfhrp2 deletion (38.5%), and implementation of new strategies for malaria diagnosis and control in Amazonas must be prioritised. Moreover, studies to monitor and genetically characterise pfhrp2-negative P. falciparum parasites in the Americas are warranted, given the extensive human migration occurring in the region. PMID:27636709
Ling, Shaoping; Hu, Zheng; Yang, Zuyu; Yang, Fang; Li, Yawei; Lin, Pei; Chen, Ke; Dong, Lili; Cao, Lihua; Tao, Yong; Hao, Lingtong; Chen, Qingjian; Gong, Qiang; Wu, Dafei; Li, Wenjie; Zhao, Wenming; Tian, Xiuyun; Hao, Chunyi; Hungate, Eric A; Catenacci, Daniel V T; Hudson, Richard R; Li, Wen-Hsiung; Lu, Xuemei; Wu, Chung-I
2015-11-24
The prevailing view that the evolution of cells in a tumor is driven by Darwinian selection has never been rigorously tested. Because selection greatly affects the level of intratumor genetic diversity, it is important to assess whether intratumor evolution follows the Darwinian or the non-Darwinian mode of evolution. To provide the statistical power, many regions in a single tumor need to be sampled and analyzed much more extensively than has been attempted in previous intratumor studies. Here, from a hepatocellular carcinoma (HCC) tumor, we evaluated multiregional samples from the tumor, using either whole-exome sequencing (WES) (n = 23 samples) or genotyping (n = 286) under both the infinite-site and infinite-allele models of population genetics. In addition to the many single-nucleotide variations (SNVs) present in all samples, there were 35 "polymorphic" SNVs among samples. High genetic diversity was evident as the 23 WES samples defined 20 unique cell clones. With all 286 samples genotyped, clonal diversity agreed well with the non-Darwinian model with no evidence of positive Darwinian selection. Under the non-Darwinian model, MALL (the number of coding region mutations in the entire tumor) was estimated to be greater than 100 million in this tumor. DNA sequences reveal local diversities in small patches of cells and validate the estimation. In contrast, the genetic diversity under a Darwinian model would generally be orders of magnitude smaller. Because the level of genetic diversity will have implications on therapeutic resistance, non-Darwinian evolution should be heeded in cancer treatments even for microscopic tumors.
Ling, Shaoping; Hu, Zheng; Yang, Zuyu; Yang, Fang; Li, Yawei; Lin, Pei; Chen, Ke; Dong, Lili; Cao, Lihua; Tao, Yong; Hao, Lingtong; Chen, Qingjian; Gong, Qiang; Wu, Dafei; Li, Wenjie; Zhao, Wenming; Tian, Xiuyun; Hao, Chunyi; Hungate, Eric A.; Catenacci, Daniel V. T.; Hudson, Richard R.; Li, Wen-Hsiung; Lu, Xuemei; Wu, Chung-I
2015-01-01
The prevailing view that the evolution of cells in a tumor is driven by Darwinian selection has never been rigorously tested. Because selection greatly affects the level of intratumor genetic diversity, it is important to assess whether intratumor evolution follows the Darwinian or the non-Darwinian mode of evolution. To provide the statistical power, many regions in a single tumor need to be sampled and analyzed much more extensively than has been attempted in previous intratumor studies. Here, from a hepatocellular carcinoma (HCC) tumor, we evaluated multiregional samples from the tumor, using either whole-exome sequencing (WES) (n = 23 samples) or genotyping (n = 286) under both the infinite-site and infinite-allele models of population genetics. In addition to the many single-nucleotide variations (SNVs) present in all samples, there were 35 “polymorphic” SNVs among samples. High genetic diversity was evident as the 23 WES samples defined 20 unique cell clones. With all 286 samples genotyped, clonal diversity agreed well with the non-Darwinian model with no evidence of positive Darwinian selection. Under the non-Darwinian model, MALL (the number of coding region mutations in the entire tumor) was estimated to be greater than 100 million in this tumor. DNA sequences reveal local diversities in small patches of cells and validate the estimation. In contrast, the genetic diversity under a Darwinian model would generally be orders of magnitude smaller. Because the level of genetic diversity will have implications on therapeutic resistance, non-Darwinian evolution should be heeded in cancer treatments even for microscopic tumors. PMID:26561581
Cheng, Kun; Rong, Xiaoying; Huang, Ying
2016-09-01
Homologous recombination is increasingly being recognized as a driving force in microbial evolution. However, recombination in streptomycetes, a rich source of diverse secondary metabolites, particularly among different species, remains minimally investigated. In this study, the largest sample of Streptomyces species to date, consisting of 142 type strains spanning the genus, with available sequences of 16S rRNA, atpD, gyrB, recA, rpoB and trpB genes, were collected and subjected to a comprehensive population genetic analysis to generate an overall estimate of the level of Streptomyces interspecies genetic exchange and its effect on the evolution of this genus. The results indicate frequent homologous recombination among Streptomyces species, which occurred three times more frequently and was nearly 14 times more important than point mutation in nucleotide sequence divergence (ρ/θw=3.10, r/m=13.74). As a result, a facilitating effect on the evolutionary process and confusion in phylogenetic relationships were observed, as well as a number of specific transfer events of the six gene fragments. A resultant phylogenetic network depicted extensive horizontal genetic exchange which decays clonality in streptomycetes. Moreover, seven evolutionary lineage groups were identified in the present sample in the Structure analysis, generally consistent with morphological and physiological data, and the contribution of recombination was detected to be varied among them. Our analyses demonstrated a reticulate evolution within Streptomyces due to the high level of interspecies gene exchange, which greatly challenges the traditional tree-shaped phylogeny in this genus and may advance our evolutionary understanding of a genuine Streptomyces species. Copyright © 2016 Elsevier Inc. All rights reserved.
Detection of Emerging Vaccine-Related Polioviruses by Deep Sequencing.
Sahoo, Malaya K; Holubar, Marisa; Huang, ChunHong; Mohamed-Hadley, Alisha; Liu, Yuanyuan; Waggoner, Jesse J; Troy, Stephanie B; Garcia-Garcia, Lourdes; Ferreyra-Reyes, Leticia; Maldonado, Yvonne; Pinsky, Benjamin A
2017-07-01
Oral poliovirus vaccine can mutate to regain neurovirulence. To date, evaluation of these mutations has been performed primarily on culture-enriched isolates by using conventional Sanger sequencing. We therefore developed a culture-independent, deep-sequencing method targeting the 5' untranslated region (UTR) and P1 genomic region to characterize vaccine-related poliovirus variants. Error analysis of the deep-sequencing method demonstrated reliable detection of poliovirus mutations at levels of <1%, depending on read depth. Sequencing of viral nucleic acids from the stool of vaccinated, asymptomatic children and their close contacts collected during a prospective cohort study in Veracruz, Mexico, revealed no vaccine-derived polioviruses. This was expected given that the longest duration between sequenced sample collection and the end of the most recent national immunization week was 66 days. However, we identified many low-level variants (<5%) distributed across the 5' UTR and P1 genomic region in all three Sabin serotypes, as well as vaccine-related viruses with multiple canonical mutations associated with phenotypic reversion present at high levels (>90%). These results suggest that monitoring emerging vaccine-related poliovirus variants by deep sequencing may aid in the poliovirus endgame and efforts to ensure global polio eradication. Copyright © 2017 Sahoo et al.
Distinguishing Functional DNA Words; A Method for Measuring Clustering Levels
NASA Astrophysics Data System (ADS)
Moghaddasi, Hanieh; Khalifeh, Khosrow; Darooneh, Amir Hossein
2017-01-01
Functional DNA sub-sequences and genome elements are spatially clustered through the genome just as keywords in literary texts. Therefore, some of the methods for ranking words in texts can also be used to compare different DNA sub-sequences. In analogy with the literary texts, here we claim that the distribution of distances between the successive sub-sequences (words) is q-exponential which is the distribution function in non-extensive statistical mechanics. Thus the q-parameter can be used as a measure of words clustering levels. Here, we analyzed the distribution of distances between consecutive occurrences of 16 possible dinucleotides in human chromosomes to obtain their corresponding q-parameters. We found that CG as a biologically important two-letter word concerning its methylation, has the highest clustering level. This finding shows the predicting ability of the method in biology. We also proposed that chromosome 18 with the largest value of q-parameter for promoters of genes is more sensitive to dietary and lifestyle. We extended our study to compare the genome of some selected organisms and concluded that the clustering level of CGs increases in higher evolutionary organisms compared to lower ones.
Hussain, Hirra; Fisher, David I; Abbott, W Mark; Roth, Robert G; Dickson, Alan J
2017-10-01
Certain recombinant proteins are deemed "difficult to express" in mammalian expression systems requiring significant cell and/or process engineering to abrogate expression bottlenecks. With increasing demand for the production of recombinant proteins in mammalian cells, low protein yields can have significant consequences for industrial processes. To investigate the molecular mechanisms that restrict expression of recombinant proteins, naturally secreted model proteins were analyzed from the tissue inhibitors of metalloproteinase (TIMP) protein family. In particular, TIMP-2 and TIMP-3 were subjected to detailed study. TIMP proteins share significant sequence homology (∼50% identity and ∼70% similarity in amino acid sequence). However, they show marked differences in secretion in mammalian expression systems despite this extensive sequence homology. Using these two proteins as models, this study characterized the molecular mechanisms responsible for poor recombinant protein production. Our results reveal that both TIMP-2 and TIMP-3 are detectable at mRNA and protein level within the cell but only TIMP-2 is secreted effectively into the extracellular medium. Analysis of protein localization and the nature of intracellular protein suggest TIMP-3 is severely limited in its post-translational processing. To overcome this challenge, modification of the TIMP-3 sequence to include a furin protease-cleavable pro-sequence resulted in secretion of the modified TIMP-3 protein, however, incomplete processing was observed. Based on the TIMP-3 data, the protein engineering approach was optimized and successfully applied in combination with cell engineering, the overexpression of furin, to another member of the TIMP protein family (the poorly expressed TIMP-4). Use of the described protein engineering strategy resulted in successful secretion of poorly (TIMP-4) and non-secreted (TIMP-3) targets, and presents a novel strategy to enhance the production of "difficult" recombinant targets. Biotechnol. Bioeng. 2017;114: 2348-2359. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Turner, Peter C; Yomano, Lorraine P; Jarboe, Laura R; York, Sean W; Baggett, Christy L; Moritz, Brélan E; Zentz, Emily B; Shanmugam, K T; Ingram, Lonnie O
2012-04-01
Escherichia coli KO11 (ATCC 55124) was engineered in 1990 to produce ethanol by chromosomal insertion of the Zymomonas mobilis pdc and adhB genes into E. coli W (ATCC 9637). KO11FL, our current laboratory version of KO11, and its parent E. coli W were sequenced, and contigs assembled into genomic sequences using optical NcoI restriction maps as templates. E. coli W contained plasmids pRK1 (102.5 kb) and pRK2 (5.4 kb), but KO11FL only contained pRK2. KO11FL optical maps made with AflII and with BamHI showed a tandem repeat region, consisting of at least 20 copies of a 10-kb unit. The repeat region was located at the insertion site for the pdc, adhB, and chloramphenicol-resistance genes. Sequence coverage of these genes was about 25-fold higher than average, consistent with amplification of the foreign genes that were inserted as circularized DNA. Selection for higher levels of chloramphenicol resistance originally produced strains with higher pdc and adhB expression, and hence improved fermentation performance, by increasing the gene copy number. Sequence data for an earlier version of KO11, ATCC 55124, indicated that multiple copies of pdc adhB were present. Comparison of the W and KO11FL genomes showed large inversions and deletions in KO11FL, mostly enabled by IS10, which is absent from W but present at 30 sites in KO11FL. The early KO11 strain ATCC 55124 had no rearrangements, contained only one IS10, and lacked most accumulated single nucleotide polymorphisms (SNPs) present in KO11FL. Despite rearrangements and SNPs in KO11FL, fermentation performance was equal to that of ATCC 55124.
Negrisolo, Enrico; Kuhl, Heiner; Forcato, Claudio; Vitulo, Nicola; Reinhardt, Richard; Patarnello, Tomaso; Bargelloni, Luca
2010-12-01
Comparative genomics holds the promise to magnify the information obtained from individual genome sequencing projects, revealing common features conserved across genomes and identifying lineage-specific characteristics. To implement such a comparative approach, a robust phylogenetic framework is required to accurately reconstruct evolution at the genome level. Among vertebrate taxa, teleosts represent the second best characterized group, with high-quality draft genome sequences for five model species (Danio rerio, Gasterosteus aculeatus, Oryzias latipes, Takifugu rubripes, and Tetraodon nigroviridis), and several others are in the finishing lane. However, the relationships among the acanthomorph teleost model fishes remain an unresolved taxonomic issue. Here, a genomic region spanning over 1.2 million base pairs was sequenced in the teleost fish Dicentrarchus labrax. Together with genomic data available for the above fish models, the new sequence was used to identify unique orthologous genomic regions shared across all target taxa. Different strategies were applied to produce robust multiple gene and genomic alignments spanning from 11,802 to 186,474 amino acid/nucleotide positions. Ten data sets were analyzed according to Bayesian inference, maximum likelihood, maximum parsimony, and neighbor joining methods. Extensive analyses were performed to explore the influence of several factors (e.g., alignment methodology, substitution model, data set partitions, and long-branch attraction) on the tree topology. Although a general consensus was observed for a closer relationship between G. aculeatus (Gasterosteidae) and Di. labrax (Moronidae) with the atherinomorph O. latipes (Beloniformes) sister taxon of this clade, with the tetraodontiform group Ta. rubripes and Te. nigroviridis (Tetraodontiformes) representing a more distantly related taxon among acanthomorph model fish species, conflicting results were obtained between data sets and methods, especially with respect to the choice of alignment methodology applied to noncoding parts of the genomic region under study. This may limit the use of intergenic/noncoding sequences in phylogenomics until more robust alignment algorithms are developed.
Häger, K P; Wind, C
1997-06-15
Subunit monomers and oligomers of crystalloid-type legumins are major components of SDS-soluble fractions from Metasequoia glyptostroboides (Dawn redwood, Taxodiaceae) seed proteins. The subunits are made up of disulfide linked alpha-polypeptides and beta-polypeptides with molecular masses of 33 kDa and 23-25 kDa, respectively. Unusually for legumins, those from Metasequoia are glycosylated and the carbohydrate moieties are residing in the C-terminal region of the respective beta-polypeptides. A Metasequoia endosperm cDNA library has been constructed and legumin-encoding transcripts representing two divergent gene subfamilies have been characterized. Intersubfamily comparisons reveal 75% identity at the amino acid level and the values range from 53-35% when the legumin precursors deduced were compared with those from angiosperms. The predicted sequences together with data from amino acid sequencing prove that post-translational processing of Metasequoia prolegumins is directed to two different processing sites, each of them specific for one of the legumin subfamilies. The sites involved differ in their relative position and in the junction to be cleaved: Metasequoia legumin precursors MgLeg18 and MgLeg26 contain the conventional post-translational Asn-Gly processing site, which is generally regarded as highly conserved. In contrast, the MgLeg4 precursor is lacking this site and post-translational cleavage is directed to an unusual Asn-Thr processing site located in its hypervariable region, causing N-terminal extension of the beta-polypeptide relative to those hitherto known. Evidence is given that the unusual variant of processing also occurs in other conifers. Phylogenetic analysis reveals the precursors concerned as representatives of a distinct legumin subfamily, originating from duplication of an ancestral gene prior to or at the beginning of Taxodiaceae diversification.
Kuhn, G C S; Teo, C H; Schwarzacher, T; Heslop-Harrison, J S
2009-05-01
Satellite DNA (satDNA) is a major component of genomes but relatively little is known about the fine-scale organization of unrelated satDNAs residing at the same chromosome location, and the sequence structure and dynamics of satDNA junctions. We studied the organization and sequence junctions of two nonhomologous satDNAs, pBuM and DBC-150, in three species from the neotropical Drosophila buzzatii cluster (repleta group). In situ hybridization to microchromosomes, interphase nuclei and extended DNA fibers showed frequent interspersion of the two satellites in D. gouveai, D. antonietae and, to a lesser extent, D. seriema. We isolated by PCR six pBuM x DBC-150 junctions: four are exclusive to D. gouveai and two are exclusive to D. antonietae. The six junction breakpoints occur at different positions within monomers, suggesting independent origin. Four junctions showed abrupt transitions between the two satellites, whereas two junctions showed a distinct 10 bp tandem duplication before the junction. Unlike pBuM, DBC-150 junction repeats are more variable than randomly cloned monomers and showed diagnostic features in common to a 3-monomer higher-order repeat seen in the sister species D. serido. The high levels of interspersion between pBuM and DBC-150 repeats suggest extensive rearrangements between the two satellites, maybe favored by specific features of the microchromosomes. Our interpretation is that the junctions evolved by multiples events of illegitimate recombination between nonhomologous satDNA repeats, with subsequent rounds of unequal crossing-over expanding the copy number of some of the junctions.
Rosenbom, Sónia; Costa, Vânia; Chen, Shanyuan; Khalatbari, Leili; Yusefi, Gholam Hosein; Abdukadir, Ablimit; Yangzom, Chamba; Kebede, Fanuel; Teclai, Redae; Yohannes, Hagos; Hagos, Futsum; Moehlman, Patricia D; Beja-Pereira, Albano
2015-04-01
All extant equid species are grouped in a single genus - Equus. Among those, ass-like equids have remained particularly unstudied and their phylogenetic relations were poorly understood, most probably because they inhabit extreme environments in remote geographic areas. To gain further insights into the evolutionary history of ass-like equids, we have used a non-invasive sampling approach to collect representative fecal samples of extant African and Asiatic ass-like equid populations across their distribution range and mitochondrial DNA (mtDNA) sequencing analyses to examine intraspecific genetic diversity and population structure, and to reconstruct phylogenetic relations among wild ass species/subspecies. Sequence analyses of 410 base pairs of the fast evolving mtDNA control region identified the Asiatic wild ass population of Kalamaili (China) as the one displaying the highest diversity among all wild ass populations. Phylogenetic analyses of complete cytochrome b sequences revealed that African and Asiatic wild asses shared a common ancestor approximately 2.3Mya and that diversification in both groups occurred much latter, probably driven by climatic events during the Pleistocene. Inferred genetic relationships among Asiatic wild ass species do not support E. kiang monophyly, highlighting the need of more extensive studies in order to clarify the taxonomic status of species/subspecies belonging to this branch of the Equus phylogeny. These results highlight the importance of re-assessing the evolutionary history of ass-like equid species, and urge to extend studies at the population level to efficiently design conservation and management actions for these threatened species. Copyright © 2015 Elsevier Inc. All rights reserved.
Ali, M A; Al-Hemaid, F M; Lee, J; Hatamleh, A A; Gyulai, G; Rahman, M O
2015-10-02
The present study explored the systematic inventory of Echinops L. (Asteraceae) of Saudi Arabia, with special reference to the molecular typing of Echinops abuzinadianus Chaudhary, an endemic species to Saudi Arabia, based on the internal transcribed spacer (ITS) sequences (ITS1-5.8S-ITS2) of nuclear ribosomal DNA. A sequence similarity search using BLAST and a phylogenetic analysis of the ITS sequence of E. abuzinadianus revealed a high level of sequence similarity with E. glaberrimus DC. (section Ritropsis). The novel primary sequence and the secondary structure of ITS2 of E. abuzinadianus could potentially be used for molecular genotyping.
Cell cycle, differentiation and tissue-independent expression of ribosomal protein L37.
Su, S; Bird, R C
1995-09-15
A unique human cDNA (hG1.16) that encodes a mRNA of 450 nucleotides was isolated from a subtractive library derived from HeLa cells. The relative expression level of hG1.16 during different cell-cycle phases was determined by Northern-blot analysis of cells synchronized by double-thymidine block and serum deprivation/refeeding. hG1.16 was constitutively expressed during all phases of the cell cycle, including the quiescent phase when even most constitutively expressed genes experience some suppression of expression. The expression level of hG1.16 did not change during terminal differentiation of myoblasts to myotubes, during which cells become permanently post-mitotic. Examination of other tissues revealed that the relative expression level of hG1.16 was constitutive in all embryonic mouse tissues examined, including brain, eye, heart, kidney, liver, lung and skeletal muscle. This was unusual in that expression was not down-modulated during differentiation and did not vary appreciably between tissue types. Analysis by inter-species Northern-blot analysis revealed that hG1.16 was highly conserved among all vertebrates studied (from fish to humans but not in insects). DNA sequence analysis of hG1.16 revealed a high level of similarity to rat ribosomal protein L37, identifying hG1.16 as a new member of this multigene family. The deduced amino acid sequence of hG1.16 was identical to rat ribosomal protein L37 that contained 97 amino acids, many of which are highly positively charged (15 arginine and 14 lysine residues with a predicted M(r) of 11,065). hG1.16 protein has a single C2-C2 zinc-finger-like motif which is also present in rat ribosomal protein L37. Using primers designed from the sequence of hG1.16, unique bovine and rat cDNAs were also isolated by 5'-rapid-amplification of cDNA ends. DNA sequences of bovine and rat G1.16, clones were 92.8% and 92.2% similar to human G1.16 while the deduced amino acid sequences derived from bovine and rat cDNAs each differed by a single amino acid from the sequence of hG1.16 and the published rat L37 sequence. Southern-blot analysis revealed that hG1.16 exists in multiple copies in human, rat and mouse genomes. These G1.16 clones encode unique human, rat and bovine members of the ribosomal protein L37 gene family, which are constitutively expressed even during transitions from quiescence to active cell proliferation or terminal differentiation, in all tissues and all vertebrates investigated.
The origin and phylogeography of dog rabies virus
Bourhy, Hervé; Reynes, Jean-Marc; Dunham, Eleca J.; Dacheux, Laurent; Larrous, Florence; Huong, Vu Thi Que; Xu, Gelin; Yan, Jiaxin; Miranda, Mary Elizabeth G.; Holmes, Edward C.
2012-01-01
Rabies is a progressively fatal and incurable viral encephalitis caused by a lyssavirus infection. Almost all of the 55 000 annual rabies deaths in humans result from infection with dog rabies viruses (RABV). Despite the importance of rabies for human health, little is known about the spread of RABV in dog populations, and patterns of biodiversity have only been studied in limited geographical space. To address these questions on a global scale, we sequenced 62 new isolates and performed an extensive comparative analysis of RABV gene sequence data, representing 192 isolates sampled from 55 countries. From this, we identified six clades of RABV in non-flying mammals, each of which has a distinct geographical distribution, most likely reflecting major physical barriers to gene flow. Indeed, a detailed analysis of phylogeographic structure revealed only limited viral movement among geographical localities. Using Bayesian coalescent methods we also reveal that the sampled lineages of canid RABV derive from a common ancestor that originated within the past 1500 years. Additionally, we found no evidence for either positive selection or widespread population bottlenecks during the global expansion of canid RABV. Overall, our study reveals that the stochastic processes of genetic drift and population subdivision are the most important factors shaping the global phylogeography of canid RABV. PMID:18931062
Zaburannyi, Nestor; Bunk, Boyke; Maier, Josef; Overmann, Jörg
2016-01-01
Here, we report the complete genome sequence of the type strain of the myxobacterial genus Chondromyces, Chondromyces crocatus Cm c5. It presents one of the largest prokaryotic genomes featuring a single circular chromosome and no plasmids. Analysis revealed an enlarged set of tRNA genes, along with reduced pressure on preferred codon usage compared to that of other bacterial genomes. The large coding capacity and the plethora of encoded secondary metabolite biosynthetic gene clusters are in line with the capability of Cm c5 to produce an arsenal of antibacterial, antifungal, and cytotoxic compounds. Known pathways of the ajudazol, chondramide, chondrochloren, crocacin, crocapeptin, and thuggacin compound families are complemented by many more natural compound biosynthetic gene clusters in the chromosome. Whole-genome comparison of the fruiting-body-forming type strain (Cm c5, DSM 14714) to an accustomed laboratory strain which has lost this ability (nonfruiting phenotype, Cm c5 fr−) revealed genetic changes in three loci. In addition to the low synteny found with the closest sequenced representative of the same family, Sorangium cellulosum, extensive genetic information duplication and broad application of eukaryotic-type signal transduction systems are hallmarks of this 11.3-Mbp prokaryotic genome. PMID:26773087
Liang, Chen; Rong, Liwei; Russell, Rodney S.; Wainberg, Mark A.
2000-01-01
We have studied the role of an RNA region at nucleotides (nt) +200 to +233, just downstream of the 5′ long terminal repeat, in encapsidation of human immunodeficiency virus type 1 genomic RNA. Three deletion mutations, namely, BH-D0, BH-D1, and BH-D2, were generated to eliminate sequences at positions nt +200 to +219, +200 to +226, and +200 to +233. The result in each case was decreased levels of packaging of viral RNA into the mutated viruses, with the BH-D2 virus being the most severely affected. Consistently, all three deletions resulted in impaired viral infectiousness and the BH-D2 mutation showed the most dramatic impact in this regard. Further analysis revealed additional defects in Gag precursor processing and in the extension efficiency of the tRNA3Lys primer in reverse transcription reactions performed with these mutated viruses. To shed further light on the function of these deleted sequences in viral replication, the mutated viruses were cultured in MT-2 cells over prolonged periods to enable them to reacquire wild-type replication kinetics. Sequencing of the reverted viruses revealed point mutations in both the noncoding region and the gag gene. In the case of the BH-D0 revertant, two mutations were observed at positions G112A in the U5 region, termed M1, and T24I in the nucleocapsid protein, termed MNC, respectively. Either of these two mutations was able to confer wild-type replication capacity on BH-D0. In the case of BH-D1, each of the M1 mutations, a mutation termed M2, i.e., C227T, just downstream of the primer binding site, a mutation termed MP2 (T12I) in the p2 protein, and the MNC mutation were observed. A combination of either M1 and M2 or MP2 and MNC was able to rescue BH-D1. In the case of the BH-D2 deletion-containing viruses, three point mutations, i.e., M1, MP2, and MNC, were observed and the presence of all three was required to restore viral replication to wild-type levels. PMID:10864634
Wang, Y L; Beach, M J; Rodwell, V W
1989-01-01
We have cloned and sequenced a 505-base-pair (bp) segment of DNA situated upstream of mvaA, the structural gene for (S)-3-hydroxy-3-methylglutaryl coenzyme A reductase (EC 1.1.1.88) of Pseudomonas mevalonii. The DNA segment that we characterized includes the promoter region for the mva operon. Nuclease S1 mapping and primer extension analysis showed that mvaA is the promoter-proximal gene of the mva operon. Transcription initiates at -56 bp relative to the first A (+1) of the translation start site. Transcription in vivo was induced by mevalonate. Structural features of the mva promoter region include an 80-bp A + T-rich region, and -12, -24 consensus sequences that resemble sequences of sigma 54 promoters in enteric organisms. The relative amplitudes of catalytic activity, enzyme protein, and mvaA mRNA are consistent with a model of regulation of this operon at the transcriptional level. Images PMID:2477360
Kosinski, Jan; Gajda, Michal J; Cymerman, Iwona A; Kurowski, Michal A; Pawlowski, Marcin; Boniecki, Michal; Obarska, Agnieszka; Papaj, Grzegorz; Sroczynska-Obuchowicz, Paulina; Tkaczuk, Karolina L; Sniezynska, Paulina; Sasin, Joanna M; Augustyn, Anna; Bujnicki, Janusz M; Feder, Marcin
2005-01-01
In the course of CASP6, we generated models for all targets using a new version of the "FRankenstein's monster approach." Previously (in CASP5) we were able to build many very accurate full-atom models by selection and recombination of well-folded fragments obtained from crude fold recognition (FR) results, followed by optimization of the sequence-structure fit and assessment of alternative alignments on the structural level. This procedure was however very arduous, as most of the steps required extensive visual and manual input from the human modeler. Now, we have automated the most tedious steps, such as superposition of alternative models, extraction of best-scoring fragments, and construction of a hybrid "monster" structure, as well as generation of alternative alignments in the regions that remain poorly scored in the refined hybrid model. We have also included the ROSETTA method to construct those parts of the target for which no reasonable structures were generated by FR methods (such as long insertions and terminal extensions). The analysis of successes and failures of the current version of the FRankenstein approach in modeling of CASP6 targets reveals that the considerably streamlined and automated method performs almost as well as the initial, mostly manual version, which suggests that it may be a useful tool for accurate protein structure prediction even in the hands of nonexperts. 2005 Wiley-Liss, Inc.
Kilpert, Fabian; Podsiadlowski, Lars
2010-06-01
This study presents the complete mitochondrial (mt) genome of the amphipod Caprella mutica, an east-Asian species, which recently invaded the coastal regions of North America, Europe, and New Zealand. It is the first complete sequence of a member of the amphipod subclade Caprellidea. The mt genome has a total length of 15,427 bp and is organized in a circular double-strand molecule. All 37 mt genes are present, including the common set of 22 tRNAs. Particularly noticeable is the duplication of the control region (CR). The additional CR is located between nad6 and cob, and is almost identical to the original one. The most extensive changes in the gene order affect nad5 and a block consisting of trnH, nad4, nad4L, and trnP-all inserted near the original CR. The gene nad5 is also inverted. Furthermore, a comparison with the pancrustacean ground pattern reveals additional changes of individual tRNA genes. Some of these changes are also shared by Metacrangonyx longipes and Parhyale hawaiensis. These arrangements were found only in amphipods and might be considered as apomorphic by character states of Amphipoda. In all the three species, there is good evidence that trnG originated from a rare duplication/remolding event of the adjacent trnW gene. Nevertheless, each of the three available amphipod mitogenome sequences also bears unique rearrangements. C. mutica, however, shows the most extensive rearrangement in comparison with the pancrustacean ground pattern.
Mountain gorilla genomes reveal the impact of long-term population decline and inbreeding.
Xue, Yali; Prado-Martinez, Javier; Sudmant, Peter H; Narasimhan, Vagheesh; Ayub, Qasim; Szpak, Michal; Frandsen, Peter; Chen, Yuan; Yngvadottir, Bryndis; Cooper, David N; de Manuel, Marc; Hernandez-Rodriguez, Jessica; Lobon, Irene; Siegismund, Hans R; Pagani, Luca; Quail, Michael A; Hvilsom, Christina; Mudakikwa, Antoine; Eichler, Evan E; Cranfield, Michael R; Marques-Bonet, Tomas; Tyler-Smith, Chris; Scally, Aylwyn
2015-04-10
Mountain gorillas are an endangered great ape subspecies and a prominent focus for conservation, yet we know little about their genomic diversity and evolutionary past. We sequenced whole genomes from multiple wild individuals and compared the genomes of all four Gorilla subspecies. We found that the two eastern subspecies have experienced a prolonged population decline over the past 100,000 years, resulting in very low genetic diversity and an increased overall burden of deleterious variation. A further recent decline in the mountain gorilla population has led to extensive inbreeding, such that individuals are typically homozygous at 34% of their sequence, leading to the purging of severely deleterious recessive mutations from the population. We discuss the causes of their decline and the consequences for their future survival. Copyright © 2015, American Association for the Advancement of Science.
Linkage of Higher Education with Agricultural Research, Extension and Development in Ethiopia
ERIC Educational Resources Information Center
Belay, Kassa
2008-01-01
High-level agricultural manpower training in Ethiopian institutions of higher education (AIHE)specializing in agriculture and related fields was studied. The study reveals that high-level agricultural manpower training began in the early 1950s and that, at present, the country has seven institutions of higher learning, which train students in…
In-cell RNA structure probing with SHAPE-MaP.
Smola, Matthew J; Weeks, Kevin M
2018-06-01
This protocol is an extension to: Nat. Protoc. 10, 1643-1669 (2015); doi:10.1038/nprot.2015.103; published online 01 October 2015RNAs play key roles in many cellular processes. The underlying structure of RNA is an important determinant of how transcripts function, are processed, and interact with RNA-binding proteins and ligands. RNA structure analysis by selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) takes advantage of the reactivity of small electrophilic chemical probes that react with the 2'-hydroxyl group to assess RNA structure at nucleotide resolution. When coupled with mutational profiling (MaP), in which modified nucleotides are detected as internal miscodings during reverse transcription and then read out by massively parallel sequencing, SHAPE yields quantitative per-nucleotide measurements of RNA structure. Here, we provide an extension to our previous in vitro SHAPE-MaP protocol with detailed guidance for undertaking and analyzing SHAPE-MaP probing experiments in live cells. The MaP strategy works for both abundant-transcriptome experiments and for cellular RNAs of low to moderate abundance, which are not well examined by whole-transcriptome methods. In-cell SHAPE-MaP, performed in roughly 3 d, can be applied in cell types ranging from bacteria to cultured mammalian cells and is compatible with a variety of structure-probing reagents. We detail several strategies by which in-cell SHAPE-MaP can inform new biological hypotheses and emphasize downstream analyses that reveal sequence or structure motifs important for RNA interactions in cells.
Treviño-Quintanilla, Luis Gerardo; Escalante, Adelfo; Caro, Alma Delia; Martínez, Alfredo; González, Ricardo; Puente, José Luis; Bolívar, Francisco; Gosset, Guillermo
2007-01-01
The capacity to utilize sucrose as a carbon and energy source (Scr(+) phenotype) is a highly variable trait among Escherichia coli strains. In this study, seven enteropathogenic E. coli (EPEC) strains from different sources were studied for their capacity to grow using sucrose. Liquid media cultures showed that all analyzed strains have the Scr(+) phenotype and two distinct groups were defined: one of five and another of two strains displaying doubling times of 67 and 125 min, respectively. The genes conferring the Scr(+) phenotype in one of the fast-growing strains (T19) were cloned and sequenced. Comparative sequence analysis revealed that this strain possesses the scr regulon genes scrKYABR, encoding phosphoenolpyruvate:phosphotransferase system-dependent sucrose transport and utilization activities. Transcript level quantification revealed sucrose-dependent induction of scrK and scrR genes in fast-growing strains, whereas no transcripts were detected in slow-growing strains. Sequence comparison analysis revealed that the scr genes in strain T19 are almost identical to those present in the scr regulon of prototype EPEC E2348/69 and in both strains, the scr genes are inserted in the chromosomal intergenic region of hypothetical genes ygcE and ygcF. Comparison of the ygcE-ygcF intergenic region sequence of strains MG1655, enterohemorrhagic EDL933, uropathogenic ECFT073 and EPEC T19-E2348/69 revealed that the number of extragenic highly repeated iap sequences corresponded to nine, four, two and none, respectively. These results show that the iap sequence-containing chromosomal ygcE-ygcF intergenic region is highly variable in E. coli. Copyright (c) 2007 S. Karger AG, Basel.
Tóth, Annamária; Hausknecht, Anton; Krisai-Greilhuber, Irmgard; Papp, Tamás; Vágvölgyi, Csaba; Nagy, László G.
2013-01-01
Reconciling traditional classifications, morphology, and the phylogenetic relationships of brown-spored agaric mushrooms has proven difficult in many groups, due to extensive convergence in morphological features. Here, we address the monophyly of the Bolbitiaceae, a family with over 700 described species and examine the higher-level relationships within the family using a newly constructed multilocus dataset (ITS, nrLSU rDNA and EF1-alpha). We tested whether the fast-evolving Internal Transcribed Spacer (ITS) sequences can be accurately aligned across the family, by comparing the outcome of two iterative alignment refining approaches (an automated and a manual) and various indel-treatment strategies. We used PRANK to align sequences in both cases. Our results suggest that – although PRANK successfully evades overmatching of gapped sites, referred previously to as alignment overmatching – it infers an unrealistically high number of indel events with natively generated guide-trees. This 'alignment undermatching' could be avoided by using more rigorous (e.g. ML) guide trees. The trees inferred in this study support the monophyly of the core Bolbitiaceae, with the exclusion of Panaeolus, Agrocybe, and some of the genera formerly placed in the family. Bolbitius and Conocybe were found monophyletic, however, Pholiotina and Galerella require redefinition. The phylogeny revealed that stipe coverage type is a poor predictor of phylogenetic relationships, indicating the need for a revision of the intrageneric relationships within Conocybe. PMID:23418526
Mammella, Marco A; Martin, Frank N; Cacciola, Santa O; Coffey, Michael D; Faedda, Roberto; Schena, Leonardo
2013-06-01
Genetic variation within the heterothallic cosmopolitan plant pathogen Phytophthora nicotianae was determined in 96 isolates from a wide range of hosts and geographic locations by characterizing four mitochondrial (10% of the genome) and three nuclear loci. In all, 52 single-nucleotide polymorphisms (SNPs) (an average of 1 every 58 bp) and 313 sites with gaps representing 5,450 bases enabled the identification of 50 different multilocus mitochondrial haplotypes. Similarly, 24 SNPs (an average of 1 every 69 bp), with heterozygosity observed at each locus, were observed in three nuclear regions (hyp, scp, and β-tub) differentiating 40 multilocus nuclear genotypes. Both mitochondrial and nuclear markers revealed a high level of dispersal of isolates and an inconsistent geographic structuring of populations. However, a specific association was observed for host of origin and genetic grouping with both nuclear and mitochondrial sequences. In particular, the majority of citrus isolates from Italy, California, Florida, Syria, Albania, and the Philippines clustered in the same mitochondrial group and shared at least one nuclear allele. A similar association was also observed for isolates recovered from Nicotiana and Solanum spp. The present study suggests an important role of nursery populations in increasing genetic recombination within the species and the existence of extensive phenomena of migration of isolates that have been likely spread worldwide with infected plant material.
Fitzgerald, Timothy L; Powell, Jonathan J; Stiller, Jiri; Weese, Terri L; Abe, Tomoko; Zhao, Guangyao; Jia, Jizeng; McIntyre, C Lynne; Li, Zhongyi; Manners, John M; Kazan, Kemal
2015-01-01
Reverse genetic techniques harnessing mutational approaches are powerful tools that can provide substantial insight into gene function in plants. However, as compared to diploid species, reverse genetic analyses in polyploid plants such as bread wheat can present substantial challenges associated with high levels of sequence and functional similarity amongst homoeologous loci. We previously developed a high-throughput method to identify deletions of genes within a physically mutagenized wheat population. Here we describe our efforts to combine multiple homoeologous deletions of three candidate disease susceptibility genes (TaWRKY11, TaPFT1 and TaPLDß1). We were able to produce lines featuring homozygous deletions at two of the three homoeoloci for all genes, but this was dependent on the individual mutants used in crossing. Intriguingly, despite extensive efforts, viable lines possessing homozygous deletions at all three homoeoloci could not be produced for any of the candidate genes. To investigate deletion size as a possible reason for this phenomenon, we developed an amplicon sequencing approach based on synteny to Brachypodium distachyon to assess the size of the deletions removing one candidate gene (TaPFT1) in our mutants. These analyses revealed that genomic deletions removing the locus are relatively large, resulting in the loss of multiple additional genes. The implications of this work for the use of heavy ion mutagenesis for reverse genetic analyses in wheat are discussed.
Fitzgerald, Timothy L.; Powell, Jonathan J.; Stiller, Jiri; Weese, Terri L.; Abe, Tomoko; Zhao, Guangyao; Jia, Jizeng; McIntyre, C. Lynne; Li, Zhongyi; Manners, John M.; Kazan, Kemal
2015-01-01
Reverse genetic techniques harnessing mutational approaches are powerful tools that can provide substantial insight into gene function in plants. However, as compared to diploid species, reverse genetic analyses in polyploid plants such as bread wheat can present substantial challenges associated with high levels of sequence and functional similarity amongst homoeologous loci. We previously developed a high-throughput method to identify deletions of genes within a physically mutagenized wheat population. Here we describe our efforts to combine multiple homoeologous deletions of three candidate disease susceptibility genes (TaWRKY11, TaPFT1 and TaPLDß1). We were able to produce lines featuring homozygous deletions at two of the three homoeoloci for all genes, but this was dependent on the individual mutants used in crossing. Intriguingly, despite extensive efforts, viable lines possessing homozygous deletions at all three homoeoloci could not be produced for any of the candidate genes. To investigate deletion size as a possible reason for this phenomenon, we developed an amplicon sequencing approach based on synteny to Brachypodium distachyon to assess the size of the deletions removing one candidate gene (TaPFT1) in our mutants. These analyses revealed that genomic deletions removing the locus are relatively large, resulting in the loss of multiple additional genes. The implications of this work for the use of heavy ion mutagenesis for reverse genetic analyses in wheat are discussed. PMID:25719507
Douroupi, Triantafyllia G; Papassideri, Issidora S; Stravopodis, Dimitrios J; Margaritis, Lukas H
2005-12-05
A full-length cDNA clone, designated Udp1, was isolated from Urtica dioica (stinging nettle), using a polymerase chain reaction based strategy. The putative Udp1 protein is characterized by a cleavable N-terminal signal sequence, likely responsible for the rough endoplasmic reticulum entry and a 310 amino acids mature protein, containing all the important residues, which are evolutionary conserved among different members of the plant peroxidase family. A unique structural feature of the Udp1 peroxidase is defined into the short carboxyl-terminal extension, which could be associated with the vacuolar targeting process. Udp1 peroxidase is differentially regulated at the transcriptional level and is specifically expressed in the roots. Interestingly, wounding and ultraviolet radiation stress cause an ectopic induction of the Udp1 gene expression in the aerial parts of the plant. A genomic DNA fragment encoding the Udp1 peroxidase was also cloned and fully sequenced, revealing a structural organization of three exons and two introns. The phylogenetic relationships of the Udp1 protein to the Arabidopsis thaliana peroxidase family members were also examined and, in combination with the homology modelling approach, dictated the presence of distinct structural elements, which could be specifically involved in the determination of substrate recognition and subcellular localization of the Udp1 peroxidase.
Massana, Ramon; Gobet, Angélique; Audic, Stéphane; Bass, David; Bittner, Lucie; Boutte, Christophe; Chambouvet, Aurélie; Christen, Richard; Claverie, Jean-Michel; Decelle, Johan; Dolan, John R; Dunthorn, Micah; Edvardsen, Bente; Forn, Irene; Forster, Dominik; Guillou, Laure; Jaillon, Olivier; Kooistra, Wiebe H C F; Logares, Ramiro; Mahé, Frédéric; Not, Fabrice; Ogata, Hiroyuki; Pawlowski, Jan; Pernice, Massimo C; Probert, Ian; Romac, Sarah; Richards, Thomas; Santini, Sébastien; Shalchian-Tabrizi, Kamran; Siano, Raffaele; Simon, Nathalie; Stoeck, Thorsten; Vaulot, Daniel; Zingone, Adriana; de Vargas, Colomban
2015-10-01
Although protists are critical components of marine ecosystems, they are still poorly characterized. Here we analysed the taxonomic diversity of planktonic and benthic protist communities collected in six distant European coastal sites. Environmental deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) from three size fractions (pico-, nano- and micro/mesoplankton), as well as from dissolved DNA and surface sediments were used as templates for tag pyrosequencing of the V4 region of the 18S ribosomal DNA. Beta-diversity analyses split the protist community structure into three main clusters: picoplankton-nanoplankton-dissolved DNA, micro/mesoplankton and sediments. Within each cluster, protist communities from the same site and time clustered together, while communities from the same site but different seasons were unrelated. Both DNA and RNA-based surveys provided similar relative abundances for most class-level taxonomic groups. Yet, particular groups were overrepresented in one of the two templates, such as marine alveolates (MALV)-I and MALV-II that were much more abundant in DNA surveys. Overall, the groups displaying the highest relative contribution were Dinophyceae, Diatomea, Ciliophora and Acantharia. Also, well represented were Mamiellophyceae, Cryptomonadales, marine alveolates and marine stramenopiles in the picoplankton, and Monadofilosa and basal Fungi in sediments. Our extensive and systematic sequencing of geographically separated sites provides the most comprehensive molecular description of coastal marine protist diversity to date. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Ben Chobba, Ines; Elleuch, Amine; Ayadi, Imen; Khannous, Lamia; Namsi, Ahmed; Cerqueira, Frederique; Drira, Noureddine; Gharsallah, Néji; Vallaeys, Tatiana
2013-01-01
Endophytic flora plays a vital role in the colonization and survival of host plants, especially in harsh environments, such as arid regions. This flora may, however, contain pathogenic species responsible for various troublesome host diseases. The present study is aimed at investigating the diversity of both cultivable and non-cultivable endophytic fungal floras in the internal tissues (roots and leaves) of Tunisian date palm trees (Phoenix dactylifera). Accordingly, 13 isolates from both root and leaf samples, exhibiting distinct colony morphology, were selected from potato dextrose agar (PDA) medium and identified by a sequence match search wherein their 18S–28S internal transcribed spacer (ITS) sequences were compared to those available in public databases. These findings revealed that the cultivable root and leaf isolates fell into two groups, namely Nectriaceae and Pleosporaceae. Additionally, total DNA from palm roots and leaves was further extracted and ITS fragments were amplified. Restriction fragment length polymorphism (RFLP) analysis of the ITS from 200 fungal clones (leaves: 100; roots: 100) using HaeIII restriction enzyme revealed 13 distinct patterns that were further sequenced and led to the identification of Alternaria, Cladosporium, Davidiella (Cladosporium teleomorph), Pythium, Curvularia, and uncharacterized fungal endophytes. Both approaches confirmed that while the roots were predominantly colonized by Fusaria (members of the Nectriaceae family), the leaves were essentially colonized by Alternaria (members of the Pleosporaceae family). Overall, the findings of the present study constitute, to the authors’ knowledge, the first extensive report on the diversity of endophytic fungal flora associated with date palm trees (P. dactylifera). PMID:24302709
Zapata, Luis; Ding, Jia; Willing, Eva-Maria; Hartwig, Benjamin; Bezdan, Daniela; Jiao, Wen-Biao; Patel, Vipul; Velikkakam James, Geo; Koornneef, Maarten; Ossowski, Stephan; Schneeberger, Korbinian
2016-07-12
Resequencing or reference-based assemblies reveal large parts of the small-scale sequence variation. However, they typically fail to separate such local variation into colinear and rearranged variation, because they usually do not recover the complement of large-scale rearrangements, including transpositions and inversions. Besides the availability of hundreds of genomes of diverse Arabidopsis thaliana accessions, there is so far only one full-length assembled genome: the reference sequence. We have assembled 117 Mb of the A. thaliana Landsberg erecta (Ler) genome into five chromosome-equivalent sequences using a combination of short Illumina reads, long PacBio reads, and linkage information. Whole-genome comparison against the reference sequence revealed 564 transpositions and 47 inversions comprising ∼3.6 Mb, in addition to 4.1 Mb of nonreference sequence, mostly originating from duplications. Although rearranged regions are not different in local divergence from colinear regions, they are drastically depleted for meiotic recombination in heterozygotes. Using a 1.2-Mb inversion as an example, we show that such rearrangement-mediated reduction of meiotic recombination can lead to genetically isolated haplotypes in the worldwide population of A. thaliana Moreover, we found 105 single-copy genes, which were only present in the reference sequence or the Ler assembly, and 334 single-copy orthologs, which showed an additional copy in only one of the genomes. To our knowledge, this work gives first insights into the degree and type of variation, which will be revealed once complete assemblies will replace resequencing or other reference-dependent methods.
Molecular recognition of the Tes LIM2-3 domains by the actin-related protein Arp7A.
Boëda, Batiste; Knowles, Phillip P; Briggs, David C; Murray-Rust, Judith; Soriano, Erika; Garvalov, Boyan K; McDonald, Neil Q; Way, Michael
2011-04-01
Actin-related proteins (Arps) are a highly conserved family of proteins that have extensive sequence and structural similarity to actin. All characterized Arps are components of large multimeric complexes associated with chromatin or the cytoskeleton. In addition, the human genome encodes five conserved but largely uncharacterized "orphan" Arps, which appear to be mostly testis-specific. Here we show that Arp7A, which has 43% sequence identity with β-actin, forms a complex with the cytoskeletal proteins Tes and Mena in the subacrosomal layer of round spermatids. The N-terminal 65-residue extension to the actin-like fold of Arp7A interacts directly with Tes. The crystal structure of the 1-65(Arp7A)·LIM2-3(Tes)·EVH1(Mena) complex reveals that residues 28-49 of Arp7A contact the LIM2-3 domains of Tes. Two alanine residues from Arp7A that occupy equivalent apolar pockets in both LIM domains as well as an intervening GPAK linker that binds the LIM2-3 junction are critical for the Arp7A-Tes interaction. Equivalent occupied apolar pockets are also seen in the tandem LIM domain structures of LMO4 and Lhx3 bound to unrelated ligands. Our results indicate that apolar pocket interactions are a common feature of tandem LIM domain interactions, but ligand specificity is principally determined by the linker sequence.
The Grapevine and Wine Microbiome: Insights from High-Throughput Amplicon Sequencing
Morgan, Horatio H.; du Toit, Maret; Setati, Mathabatha E.
2017-01-01
From the time when microbial activity in wine fermentation was first demonstrated, the microbial ecology of the vineyard, grape, and wine has been extensively investigated using culture-based methods. However, the last 2 decades have been characterized by an important change in the approaches used for microbial examination, due to the introduction of DNA-based community fingerprinting methods such as DGGE, SSCP, T-RFLP, and ARISA. These approaches allowed for the exploration of microbial community structures without the need to cultivate, and have been extensively applied to decipher the microbial populations associated with the grapevine as well as the microbial dynamics throughout grape berry ripening and wine fermentation. These techniques are well-established for the rapid more sensitive profiling of microbial communities; however, they often do not provide direct taxonomic information and possess limited ability to detect the presence of rare taxa and taxa with low abundance. Consequently, the past 5 years have seen an upsurge in the application of high-throughput sequencing methods for the in-depth assessment of the grapevine and wine microbiome. Although a relatively new approach in wine sciences, these methods reveal a considerably greater diversity than previously reported, and identified several species that had not yet been reported. The aim of the current review is to highlight the contribution of high-throughput next generation sequencing and metagenomics approaches to vineyard microbial ecology especially unraveling the influence of vineyard management practices on microbial diversity. PMID:28553266
The Grapevine and Wine Microbiome: Insights from High-Throughput Amplicon Sequencing.
Morgan, Horatio H; du Toit, Maret; Setati, Mathabatha E
2017-01-01
From the time when microbial activity in wine fermentation was first demonstrated, the microbial ecology of the vineyard, grape, and wine has been extensively investigated using culture-based methods. However, the last 2 decades have been characterized by an important change in the approaches used for microbial examination, due to the introduction of DNA-based community fingerprinting methods such as DGGE, SSCP, T-RFLP, and ARISA. These approaches allowed for the exploration of microbial community structures without the need to cultivate, and have been extensively applied to decipher the microbial populations associated with the grapevine as well as the microbial dynamics throughout grape berry ripening and wine fermentation. These techniques are well-established for the rapid more sensitive profiling of microbial communities; however, they often do not provide direct taxonomic information and possess limited ability to detect the presence of rare taxa and taxa with low abundance. Consequently, the past 5 years have seen an upsurge in the application of high-throughput sequencing methods for the in-depth assessment of the grapevine and wine microbiome. Although a relatively new approach in wine sciences, these methods reveal a considerably greater diversity than previously reported, and identified several species that had not yet been reported. The aim of the current review is to highlight the contribution of high-throughput next generation sequencing and metagenomics approaches to vineyard microbial ecology especially unraveling the influence of vineyard management practices on microbial diversity.
Zhang, Likui; Kang, Manyu; Huang, Yangchao; Yang, Lixiang
2016-05-01
The diversity and ecological significance of bacteria and archaea in deep-sea environments have been thoroughly investigated, but eukaryotic microorganisms in these areas, such as fungi, are poorly understood. To elucidate fungal diversity in calcareous deep-sea sediments in the Southwest India Ridge (SWIR), the internal transcribed spacer (ITS) regions of rRNA genes from two sediment metagenomic DNA samples were amplified and sequenced using the Illumina sequencing platform. The results revealed that 58-63 % and 36-42 % of the ITS sequences (97 % similarity) belonged to Basidiomycota and Ascomycota, respectively. These findings suggest that Basidiomycota and Ascomycota are the predominant fungal phyla in the two samples. We also found that Agaricomycetes, Leotiomycetes, and Pezizomycetes were the major fungal classes in the two samples. At the species level, Thelephoraceae sp. and Phialocephala fortinii were major fungal species in the two samples. Despite the low relative abundance, unidentified fungal sequences were also observed in the two samples. Furthermore, we found that there were slight differences in fungal diversity between the two sediment samples, although both were collected from the SWIR. Thus, our results demonstrate that calcareous deep-sea sediments in the SWIR harbor diverse fungi, which augment the fungal groups in deep-sea sediments. This is the first report of fungal communities in calcareous deep-sea sediments in the SWIR revealed by Illumina sequencing.
NASA Astrophysics Data System (ADS)
Attou, Ahmed; Hamoumi, Naima
2004-07-01
In the Oulad Abbou syncline, western coastal Meseta, the Silurian deposits exhibit siliciclastic or mixed siliciclastic/carbonate tidal facies that recorded alkaline basalt flows and syn-sedimentary deformations. These facies are staked into peritidal shallowing upward sequences reflecting the evolution from an infratidal to a supratidal environment. These sequences recorded low-amplitude and high-frequency sea-level variations. The built-up of these rhythmic sequences is related to distensive tectonic that allowed the development of isolated platform from extensive siliciclastic influx. This tectonic event is well recorded in the palaeogeographic evolution of the northern Gondwana platform during the Lower Palaeozoic time. To cite this article: A. Attou, N. Hamoumi, C. R. Geoscience 336 (2004).
Reverse Genetics and High Throughput Sequencing Methodologies for Plant Functional Genomics
Ben-Amar, Anis; Daldoul, Samia; Reustle, Götz M.; Krczal, Gabriele; Mliki, Ahmed
2016-01-01
In the post-genomic era, increasingly sophisticated genetic tools are being developed with the long-term goal of understanding how the coordinated activity of genes gives rise to a complex organism. With the advent of the next generation sequencing associated with effective computational approaches, wide variety of plant species have been fully sequenced giving a wealth of data sequence information on structure and organization of plant genomes. Since thousands of gene sequences are already known, recently developed functional genomics approaches provide powerful tools to analyze plant gene functions through various gene manipulation technologies. Integration of different omics platforms along with gene annotation and computational analysis may elucidate a complete view in a system biology level. Extensive investigations on reverse genetics methodologies were deployed for assigning biological function to a specific gene or gene product. We provide here an updated overview of these high throughout strategies highlighting recent advances in the knowledge of functional genomics in plants. PMID:28217003
Differences in a ribosomal DNA sequence of Strongylus species allows identification of single eggs.
Campbell, A J; Gasser, R B; Chilton, N B
1995-03-01
In the current study, molecular techniques were evaluated for the species identification of individual strongyle eggs. Adult worms of Strongylus edentatus, S. equinus and S. vulgaris were collected at necropsy from horses from Australia and the U.S.A. Genomic DNA was isolated and a ribosomal transcribed spacer (ITS-2) amplified and sequenced using polymerase chain reaction (PCR) techniques. The length of the ITS-2 sequence of S. edentatus, S. equinus and S. vulgaris ranged between 217 and 235 nucleotides. Extensive sequence analysis demonstrated a low degree (0-0.9%) of intraspecific variation in the ITS-2 for the Strongylus species examined, whereas the levels of interspecific differences (13-29%) were significantly greater. Interspecific differences in the ITS-2 sequences allowed unequivocal species identification of single worms and eggs using PCR-linked restriction fragment length polymorphism. These results demonstrate the potential of the ribosomal spacers as genetic markers for species identification of single strongyle eggs from horse faeces.
Unprecedented high-resolution view of bacterial operon architecture revealed by RNA sequencing.
Conway, Tyrrell; Creecy, James P; Maddox, Scott M; Grissom, Joe E; Conkle, Trevor L; Shadid, Tyler M; Teramoto, Jun; San Miguel, Phillip; Shimada, Tomohiro; Ishihama, Akira; Mori, Hirotada; Wanner, Barry L
2014-07-08
We analyzed the transcriptome of Escherichia coli K-12 by strand-specific RNA sequencing at single-nucleotide resolution during steady-state (logarithmic-phase) growth and upon entry into stationary phase in glucose minimal medium. To generate high-resolution transcriptome maps, we developed an organizational schema which showed that in practice only three features are required to define operon architecture: the promoter, terminator, and deep RNA sequence read coverage. We precisely annotated 2,122 promoters and 1,774 terminators, defining 1,510 operons with an average of 1.98 genes per operon. Our analyses revealed an unprecedented view of E. coli operon architecture. A large proportion (36%) of operons are complex with internal promoters or terminators that generate multiple transcription units. For 43% of operons, we observed differential expression of polycistronic genes, despite being in the same operons, indicating that E. coli operon architecture allows fine-tuning of gene expression. We found that 276 of 370 convergent operons terminate inefficiently, generating complementary 3' transcript ends which overlap on average by 286 nucleotides, and 136 of 388 divergent operons have promoters arranged such that their 5' ends overlap on average by 168 nucleotides. We found 89 antisense transcripts of 397-nucleotide average length, 7 unannotated transcripts within intergenic regions, and 18 sense transcripts that completely overlap operons on the opposite strand. Of 519 overlapping transcripts, 75% correspond to sequences that are highly conserved in E. coli (>50 genomes). Our data extend recent studies showing unexpected transcriptome complexity in several bacteria and suggest that antisense RNA regulation is widespread. Importance: We precisely mapped the 5' and 3' ends of RNA transcripts across the E. coli K-12 genome by using a single-nucleotide analytical approach. Our resulting high-resolution transcriptome maps show that ca. one-third of E. coli operons are complex, with internal promoters and terminators generating multiple transcription units and allowing differential gene expression within these operons. We discovered extensive antisense transcription that results from more than 500 operons, which fully overlap or extensively overlap adjacent divergent or convergent operons. The genomic regions corresponding to these antisense transcripts are highly conserved in E. coli (including Shigella species), although it remains to be proven whether or not they are functional. Our observations of features unearthed by single-nucleotide transcriptome mapping suggest that deeper layers of transcriptional regulation in bacteria are likely to be revealed in the future. Copyright © 2014 Conway et al.
Expansion of the Preimmune Antibody Repertoire by Junctional Diversity in Bos taurus
Liljavirta, Jenni; Niku, Mikael; Pessa-Morikawa, Tiina; Ekman, Anna; Iivanainen, Antti
2014-01-01
Cattle have a limited range of immunoglobulin genes which are further diversified by antigen independent somatic hypermutation in fetuses. Junctional diversity generated during somatic recombination contributes to antibody diversity but its relative significance has not been comprehensively studied. We have investigated the importance of terminal deoxynucleotidyl transferase (TdT) -mediated junctional diversity to the bovine immunoglobulin repertoire. We also searched for new bovine heavy chain diversity (IGHD) genes as the information of the germline sequences is essential to define the junctional boundaries between gene segments. New heavy chain variable genes (IGHV) were explored to address the gene usage in the fetal recombinations. Our bioinformatics search revealed five new IGHD genes, which included the longest IGHD reported so far, 154 bp. By genomic sequencing we found 26 new IGHV sequences that represent potentially new IGHV genes or allelic variants. Sequence analysis of immunoglobulin heavy chain cDNA libraries of fetal bone marrow, ileum and spleen showed 0 to 36 nontemplated N-nucleotide additions between variable, diversity and joining genes. A maximum of 8 N nucleotides were also identified in the light chains. The junctional base profile was biased towards A and T nucleotide additions (64% in heavy chain VD, 52% in heavy chain DJ and 61% in light chain VJ junctions) in contrast to the high G/C content which is usually observed in mice. Sequence analysis also revealed extensive exonuclease activity, providing additional diversity. B-lymphocyte specific TdT expression was detected in bovine fetal bone marrow by reverse transcription-qPCR and immunofluorescence. These results suggest that TdT-mediated junctional diversity and exonuclease activity contribute significantly to the size of the cattle preimmune antibody repertoire already in the fetal period. PMID:24926997
Miller, Melissa A; Burgess, Tristan L; Dodd, Erin M; Rhyan, Jack C; Jang, Spencer S; Byrne, Barbara A; Gulland, Frances M D; Murray, Michael J; Toy-Choutka, Sharon; Conrad, Patricia A; Field, Cara L; Sidor, Inga F; Smith, Woutrina A
2017-04-01
We characterize Brucella infection in a wild southern sea otter ( Enhydra lutris nereis) with osteolytic lesions similar to those reported in other marine mammals and humans. This otter stranded twice along the central California coast, US over a 1-yr period and was handled extensively at two wildlife rehabilitation facilities, undergoing multiple surgeries and months of postsurgical care. Ultimately the otter was euthanized due to severe, progressive neurologic disease. Necropsy and postmortem radiographs revealed chronic, severe osteoarthritis spanning the proximal interphalangeal joint of the left hind fifth digit. Numerous coccobacilli within the joint were strongly positive on Brucella immunohistochemical labelling, and Brucella sp. was isolated in pure culture from this lesion. Sparse Brucella-immunopositive bacteria were also observed in the cytoplasm of a pulmonary vascular monocyte, and multifocal granulomas were observed in the spinal cord and liver on histopathology. Findings from biochemical characterization, 16S ribosomal DNA, and bp26 gene sequencing of the bacterial isolate were identical to those from marine-origin brucellae isolated from cetaceans and phocids. Although omp2a gene sequencing revealed 100% homology with marine Brucella spp. infecting pinnipeds, whales, and humans, omp2b gene sequences were identical only to pinniped-origin isolates. Multilocus sequence typing classified the sea otter isolate as ST26, a sequence type previously associated only with cetaceans. Our data suggest that the sea otter Brucella strain represents a novel marine lineage that is distinct from both Brucella pinnipedialis and Brucella ceti. Prior reports document the zoonotic potential of the marine brucellae. Isolation of Brucella sp. from a stranded sea otter highlights the importance of wearing personal protective equipment when handling sea otters and other marine mammals as part of wildlife conservation and rehabilitation efforts.
Testing Extension Services through AKAP Models
ERIC Educational Resources Information Center
De Rosa, Marcello; Bartoli, Luca; La Rocca, Giuseppe
2014-01-01
Purpose: The aim of the paper is to analyse the attitude of Italian farms in gaining access to agricultural extension services (AES). Design/methodology/approach: The ways Italian farms use AES are described through the AKAP (Awareness, Knowledge, Adoption, Product) sequence. This article investigated the AKAP sequence by submitting a…
Blake, Jonathon; Riddell, Andrew; Theiss, Susanne; Gonzalez, Alexis Perez; Haase, Bettina; Jauch, Anna; Janssen, Johannes W. G.; Ibberson, David; Pavlinic, Dinko; Moog, Ute; Benes, Vladimir; Runz, Heiko
2014-01-01
Balanced chromosome abnormalities (BCAs) occur at a high frequency in healthy and diseased individuals, but cost-efficient strategies to identify BCAs and evaluate whether they contribute to a phenotype have not yet become widespread. Here we apply genome-wide mate-pair library sequencing to characterize structural variation in a patient with unclear neurodevelopmental disease (NDD) and complex de novo BCAs at the karyotype level. Nucleotide-level characterization of the clinically described BCA breakpoints revealed disruption of at least three NDD candidate genes (LINC00299, NUP205, PSMD14) that gave rise to abnormal mRNAs and could be assumed as disease-causing. However, unbiased genome-wide analysis of the sequencing data for cryptic structural variation was key to reveal an additional submicroscopic inversion that truncates the schizophrenia- and bipolar disorder-associated brain transcription factor ZNF804A as an equally likely NDD-driving gene. Deep sequencing of fluorescent-sorted wild-type and derivative chromosomes confirmed the clinically undetected BCA. Moreover, deep sequencing further validated a high accuracy of mate-pair library sequencing to detect structural variants larger than 10 kB, proposing that this approach is powerful for clinical-grade genome-wide structural variant detection. Our study supports previous evidence for a role of ZNF804A in NDD and highlights the need for a more comprehensive assessment of structural variation in karyotypically abnormal individuals and patients with neurocognitive disease to avoid diagnostic deception. PMID:24625750
FlyBase: genes and gene models
Drysdale, Rachel A.; Crosby, Madeline A.
2005-01-01
FlyBase (http://flybase.org) is the primary repository of genetic and molecular data of the insect family Drosophilidae. For the most extensively studied species, Drosophila melanogaster, a wide range of data are presented in integrated formats. Data types include mutant phenotypes, molecular characterization of mutant alleles and aberrations, cytological maps, wild-type expression patterns, anatomical images, transgenic constructs and insertions, sequence-level gene models and molecular classification of gene product functions. There is a growing body of data for other Drosophila species; this is expected to increase dramatically over the next year, with the completion of draft-quality genomic sequences of an additional 11 Drosphila species. PMID:15608223
Wang, Yong-Liang; Wang, Yu-Jun; Luan, Jun-Bo; Yan, Gen-Hong; Liu, Shu-Sheng; Wang, Xiao-Wei
2013-01-01
Background The whitefly Bemisa tabaci is a species complex of more than 31 cryptic species which include some of the most destructive invasive pests of crops worldwide. Among them, Middle East-Asia Minor 1 (MEAM1) and Mediterranean have invaded many countries and displaced the native whitefly species. The successful invasion of the two species is largely due to their wide range of host plants, high resistance to insecticides and remarkable tolerance to environmental stresses. However, the molecular differences between invasive and indigenous whiteflies remain largely unknown. Methodology/Principal Findings Here the global transcriptional difference between the two invasive whitefly species (MEAM1, MED) and one indigenous whitefly species (Asia II 3) were analyzed using the Illumina sequencing. Our analysis indicated that 2,422 genes between MEAM1 and MED; 3,073 genes between MEAM1 and Asia II 3; and 3,644 genes between MED and Asia II 3 were differentially expressed. Gene Ontology enrichment analysis revealed that the differently expressed genes between the invasive and indigenous whiteflies were significantly enriched in the term of ‘oxidoreductase activity’. Pathway enrichment analysis showed that carbohydrate, amino acid and glycerolipid metabolisms were more active in MEAM1 and MED than in Asia II 3, which may contribute to their differences in biological characteristics. Our analysis also illustrated that the majority of genes involved in ‘drug metabolic pathway’ were expressed at a higher level in MEAM1 and MED than in Asia II 3. Taken together, these results revealed that the genes related to basic metabolism and detoxification were expressed at an elevated level in the invasive whiteflies, which might be responsible for their higher resistance to insecticides and environmental stresses. Conclusions/Significance The extensive comparison of MEAM1, MED and Asia II 3 gene expression may serve as an invaluable resource for revealing the molecular mechanisms underlying their biological differences and the whitefly invasion. PMID:23667457
Yao, Xuan; Li, Juanjuan; Liu, Jianping; Liu, Kede
2015-10-01
The molecular mechanisms of abscisic acid (ABA) signalling have been studied for many years; however, how mitochondria-localized proteins play roles in ABA signalling remains unclear. Here an Arabidopsis mitochondria-localized protein RRL (RETARDED ROOT GROWTH-LIKE) was shown to function in ABA signalling. A previous study had revealed that the Arabidopsis mitochondria-localized protein RRG (RETARDED ROOT GROWTH) is required for cell division in the root meristem. RRL shares 54% and 57% identity at the nucleotide and amino acid sequences, respectively, with RRG; nevertheless, RRL shows a different function in Arabidopsis. In this study, disruption of RRL decreased ABA sensitivity whereas overexpression of RRL increased ABA sensitivity during seed germination and seedling growth. High expression levels of RRL were found in germinating seeds and developing seedlings, as revealed by β-glucuronidase (GUS) staining of ProRRL-GUS transgenic lines. The analyses of the structure and function of mitochondria in the knockout rrl mutant showed that the disruption of RRL causes extensively internally vacuolated mitochondria and reduced ABA-stimulated reactive oxygen species (ROS) production. Previous studies have revealed that the expression of alternative oxidase (AOX) in the alternative respiratory pathway is increased by mitochondrial retrograde regulation to regain ROS levels when the mitochondrial electron transport chain is impaired. The APETALA2 (AP2)-type transcription factor ABI4 is a regulator of ALTERNATIVE OXIDASE1a (AOX1a) in mitochondrial retrograde signalling. This study showed that ABA-induced AOX1a and ABI4 expression was inhibited in the rrl mutant, suggesting that RRL is probably involved in ABI4-mediated mitochondrial retrograde signalling. Furthermore, the results revealed that ABI4 is a downstream regulatory factor in RRL-mediated ABA signalling in seed germination and seedling growth. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Levels of integration in cognitive control and sequence processing in the prefrontal cortex.
Bahlmann, Jörg; Korb, Franziska M; Gratton, Caterina; Friederici, Angela D
2012-01-01
Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex.
Levels of Integration in Cognitive Control and Sequence Processing in the Prefrontal Cortex
Bahlmann, Jörg; Korb, Franziska M.; Gratton, Caterina; Friederici, Angela D.
2012-01-01
Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex. PMID:22952762
2018-01-01
New, as yet undiscovered aptamers for Protein A were identified by applying next generation sequencing (NGS) to a previously selected aptamer pool. This pool was obtained in a classical SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiment using the FluMag-SELEX procedure followed by cloning and Sanger sequencing. PA#2/8 was identified as the only Protein A-binding aptamer from the Sanger sequence pool, and was shown to be able to bind intact cells of Staphylococcus aureus. In this study, we show the extension of the SELEX results by re-sequencing of the same aptamer pool using a medium throughput NGS approach and data analysis. Both data pools were compared. They confirm the selection of a highly complex and heterogeneous oligonucleotide pool and show consistently a high content of orphans as well as a similar relative frequency of certain sequence groups. But in contrast to the Sanger data pool, the NGS pool was clearly dominated by one sequence group containing the known Protein A-binding aptamer PA#2/8 as the most frequent sequence in this group. In addition, we found two new sequence groups in the NGS pool represented by PA-C10 and PA-C8, respectively, which also have high specificity for Protein A. Comparative affinity studies reveal differences between the aptamers and confirm that PA#2/8 remains the most potent sequence within the selected aptamer pool reaching affinities in the low nanomolar range of KD = 20 ± 1 nM. PMID:29495282
Stoltenburg, Regina; Strehlitz, Beate
2018-02-24
New, as yet undiscovered aptamers for Protein A were identified by applying next generation sequencing (NGS) to a previously selected aptamer pool. This pool was obtained in a classical SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiment using the FluMag-SELEX procedure followed by cloning and Sanger sequencing. PA#2/8 was identified as the only Protein A-binding aptamer from the Sanger sequence pool, and was shown to be able to bind intact cells of Staphylococcus aureus . In this study, we show the extension of the SELEX results by re-sequencing of the same aptamer pool using a medium throughput NGS approach and data analysis. Both data pools were compared. They confirm the selection of a highly complex and heterogeneous oligonucleotide pool and show consistently a high content of orphans as well as a similar relative frequency of certain sequence groups. But in contrast to the Sanger data pool, the NGS pool was clearly dominated by one sequence group containing the known Protein A-binding aptamer PA#2/8 as the most frequent sequence in this group. In addition, we found two new sequence groups in the NGS pool represented by PA-C10 and PA-C8, respectively, which also have high specificity for Protein A. Comparative affinity studies reveal differences between the aptamers and confirm that PA#2/8 remains the most potent sequence within the selected aptamer pool reaching affinities in the low nanomolar range of K D = 20 ± 1 nM.
NASA Astrophysics Data System (ADS)
El-Azabi, M. H.; El-Araby, A.
2005-01-01
The Middle Triassic-Lower Cretaceous (pre-Late Albian) succession of Arif El-Naga anticline comprises various distinctive facies and environments that are connected with eustatic relative sea-level changes, local/regional tectonism, variable sediment influx and base-level changes. It displays six unconformity-bounded depositional sequences. The Triassic deposits are divided into a lower clastic facies (early Middle Triassic sequence) and an upper carbonate unit (late Middle- and latest Middle/early Late Triassic sequences). The early Middle Triassic sequence consists of sandstone with shale/mudstone interbeds that formed under variable regimes, ranging from braided fluvial, lower shoreface to beach foreshore. The marine part of this sequence marks retrogradational and progradational parasequences of transgressive- and highstand systems tract deposits respectively. Deposition has taken place under warm semi-arid climate and a steady supply of clastics. The late Middle- and latest Middle/early Late Triassic sequences are carbonate facies developed on an extensive shallow marine shelf under dry-warm climate. The late Middle Triassic sequence includes retrogradational shallow subtidal oyster rudstone and progradational lower intertidal lime-mudstone parasequences that define the transgressive- and highstand systems tracts respectively. It terminates with upper intertidal oncolitic packstone with bored upper surface. The next latest Middle/early Late Triassic sequence is marked by lime-mudstone, packstone/grainstone and algal stromatolitic bindstone with minor shale/mudstone. These lower intertidal/shallow subtidal deposits of a transgressive-systems tract are followed upward by progradational highstand lower intertidal lime-mudstone deposits. The overlying Jurassic deposits encompass two different sequences. The Lower Jurassic sequence is made up of intercalating lower intertidal lime-mudstone and wave-dominated beach foreshore sandstone which formed during a short period of rising sea-level with a relative increase in clastic supply. The Middle-Upper Jurassic sequence is represented by cycles of cross-bedded sandstone topped with thin mudstone that accumulated by northerly flowing braided-streams accompanying regional uplift of the Arabo-Nubian shield. It is succeeded by another regressive fluvial sequence of Early Cretaceous age due to a major eustatic sea-level fall. The Lower Cretaceous sequence is dominated by sandy braided-river deposits with minor overbank fines and basal debris flow conglomerate.
Hughes, M. S.; Hoey, E. M.; Coyle, P. V.
1993-01-01
Ten coxsackievirus B4 (CVB4) strains isolated from clinical and environmental sources in Northern Ireland in 1985-7, were compared at the nucleotide sequence level. Dideoxynucleotide sequencing of a polymerase chain reaction (PCR) amplified fragment, spanning the VP1/P2A genomic region, classified the isolates into two distinct groups or genotypes as defined by Rico-Hesse and colleagues for poliovirus type 1. Isolates within each group shared approximately 99% sequence identity at the nucleotide level whereas < or = 86% sequence identity was shared between groups. One isolate derived from a clinical specimen in 1987 was grouped with six CVB4 isolates recovered from the aquatic environment in 1986-7. The second group comprised CVB4 isolates from clinical specimens in 1985-6. Both groups were different at the nucleotide level from the prototype strain isolated in 1950. It was concluded that the method could be used to sub-type CVB4 isolates and would be of value in epidemiological studies of CVB4. Predicted amino acid sequences revealed non-conservation of the tyrosine residue at the VP1/P2A cleavage site but were of little value in distinguishing CVB4 variants. PMID:8386098
Phylogenetic analysis of Demodex caprae based on mitochondrial 16S rDNA sequence.
Zhao, Ya-E; Hu, Li; Ma, Jun-Xian
2013-11-01
Demodex caprae infests the hair follicles and sebaceous glands of goats worldwide, which not only seriously impairs goat farming, but also causes a big economic loss. However, there are few reports on the DNA level of D. caprae. To reveal the taxonomic position of D. caprae within the genus Demodex, the present study conducted phylogenetic analysis of D. caprae based on mt16S rDNA sequence data. D. caprae adults and eggs were obtained from a skin nodule of the goat suffering demodicidosis. The mt16S rDNA sequences of individual mite were amplified using specific primers, and then cloned, sequenced, and aligned. The sequence divergence, genetic distance, and transition/transversion rate were computed, and the phylogenetic trees in Demodex were reconstructed. Results revealed the 339-bp partial sequences of six D. caprae isolates were obtained, and the sequence identity was 100% among isolates. The pairwise divergences between D. caprae and Demodex canis or Demodex folliculorum or Demodex brevis were 22.2-24.0%, 24.0-24.9%, and 22.9-23.2%, respectively. The corresponding average genetic distances were 2.840, 2.926, and 2.665, and the average transition/transversion rates were 0.70, 0.55, and 0.54, respectively. The divergences, genetic distances, and transition/transversion rates of D. caprae versus the other three species all reached interspecies level. The five phylogenetic trees all presented that D. caprae clustered with D. brevis first, and then with D. canis, D. folliculorum, and Demodex injai in sequence. In conclusion, D. caprae is an independent species, and it is closer to D. brevis than to D. canis, D. folliculorum, or D. injai.
Dai, Hongying; Wu, Guodong; Wu, Michael; Zhi, Degui
2016-01-01
Next-generation sequencing data pose a severe curse of dimensionality, complicating traditional "single marker-single trait" analysis. We propose a two-stage combined p-value method for pathway analysis. The first stage is at the gene level, where we integrate effects within a gene using the Sequence Kernel Association Test (SKAT). The second stage is at the pathway level, where we perform a correlated Lancaster procedure to detect joint effects from multiple genes within a pathway. We show that the Lancaster procedure is optimal in Bahadur efficiency among all combined p-value methods. The Bahadur efficiency,[Formula: see text], compares sample sizes among different statistical tests when signals become sparse in sequencing data, i.e. ε →0. The optimal Bahadur efficiency ensures that the Lancaster procedure asymptotically requires a minimal sample size to detect sparse signals ([Formula: see text]). The Lancaster procedure can also be applied to meta-analysis. Extensive empirical assessments of exome sequencing data show that the proposed method outperforms Gene Set Enrichment Analysis (GSEA). We applied the competitive Lancaster procedure to meta-analysis data generated by the Global Lipids Genetics Consortium to identify pathways significantly associated with high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, triglycerides, and total cholesterol.
Mung bean proteins and peptides: nutritional, functional and bioactive properties.
Yi-Shen, Zhu; Shuai, Sun; FitzGerald, Richard
2018-01-01
To date, no extensive literature review exists regarding potential uses of mung bean proteins and peptides. As mung bean has long been widely used as a food source, early studies evaluated mung bean nutritional value against the Food and Agriculture Organization of the United Nations (FAO)/the World Health Organization (WHO) amino acids dietary recommendations. The comparison demonstrated mung bean to be a good protein source, except for deficiencies in sulphur-containing amino acids, methionine and cysteine. Methionine and cysteine residues have been introduced into the 8S globulin through protein engineering technology. Subsequently, purified mung bean proteins and peptides have facilitated the study of their structural and functional properties. Two main types of extraction methods have been reported for isolation of proteins and peptides from mung bean flours, permitting sequencing of major proteins present in mung bean, including albumins and globulins (notably 8S globulin). However, the sequence for albumin deposited in the UniProt database differs from other sequences reported in the literature. Meanwhile, a limited number of reports have revealed other useful bioactivities for proteins and hydrolysed peptides, including angiotensin-converting enzyme inhibitory activity, anti-fungal activity and trypsin inhibitory activity. Consequently, several mung bean hydrolysed peptides have served as effective food additives to prevent proteolysis during storage. Ultimately, further research will reveal other nutritional, functional and bioactive properties of mung bean for uses in diverse applications.
Genetic diversity and recombination analysis of sweepoviruses from Brazil
2012-01-01
Background Monopartite begomoviruses (genus Begomovirus, family Geminiviridae) that infect sweet potato (Ipomoea batatas) around the world are known as sweepoviruses. Because sweet potato plants are vegetatively propagated, the accumulation of viruses can become a major constraint for root production. Mixed infections of sweepovirus species and strains can lead to recombination, which may contribute to the generation of new recombinant sweepoviruses. Results This study reports the full genome sequence of 34 sweepoviruses sampled from a sweet potato germplasm bank and commercial fields in Brazil. These sequences were compared with others from public nucleotide sequence databases to provide a comprehensive overview of the genetic diversity and patterns of genetic exchange in sweepoviruses isolated from Brazil, as well as to review the classification and nomenclature of sweepoviruses in accordance with the current guidelines proposed by the Geminiviridae Study Group of the International Committee on Taxonomy of Viruses (ICTV). Co-infections and extensive recombination events were identified in Brazilian sweepoviruses. Analysis of the recombination breakpoints detected within the sweepovirus dataset revealed that most recombination events occurred in the intergenic region (IR) and in the middle of the C1 open reading frame (ORF). Conclusions The genetic diversity of sweepoviruses was considerably greater than previously described in Brazil. Moreover, recombination analysis revealed that a genomic exchange is responsible for the emergence of sweepovirus species and strains and provided valuable new information for understanding the diversity and evolution of sweepoviruses. PMID:23082767
O'Connell, Kerry Joan; O'Connell Motherway, Mary; Liedtke, Andrea; Fitzgerald, Gerald F.; Ross, R. Paul; Stanton, Catherine; Zomer, Aldert
2014-01-01
Members of the genus Bifidobacterium are commonly found in the gastrointestinal tracts of mammals, including humans, where their growth is presumed to be dependent on various diet- and/or host-derived carbohydrates. To understand transcriptional control of bifidobacterial carbohydrate metabolism, we investigated two genetic carbohydrate utilization clusters dedicated to the metabolism of raffinose-type sugars and melezitose. Transcriptomic and gene inactivation approaches revealed that the raffinose utilization system is positively regulated by an activator protein, designated RafR. The gene cluster associated with melezitose metabolism was shown to be subject to direct negative control by a LacI-type transcriptional regulator, designated MelR1, in addition to apparent indirect negative control by means of a second LacI-type regulator, MelR2. In silico analysis, DNA-protein interaction, and primer extension studies revealed the MelR1 and MelR2 operator sequences, each of which is positioned just upstream of or overlapping the correspondingly regulated promoter sequences. Similar analyses identified the RafR binding operator sequence located upstream of the rafB promoter. This study indicates that transcriptional control of gene clusters involved in carbohydrate metabolism in bifidobacteria is subject to conserved regulatory systems, representing either positive or negative control. PMID:24705323
Mung bean proteins and peptides: nutritional, functional and bioactive properties
Yi-Shen, Zhu; Shuai, Sun; FitzGerald, Richard
2018-01-01
To date, no extensive literature review exists regarding potential uses of mung bean proteins and peptides. As mung bean has long been widely used as a food source, early studies evaluated mung bean nutritional value against the Food and Agriculture Organization of the United Nations (FAO)/the World Health Organization (WHO) amino acids dietary recommendations. The comparison demonstrated mung bean to be a good protein source, except for deficiencies in sulphur-containing amino acids, methionine and cysteine. Methionine and cysteine residues have been introduced into the 8S globulin through protein engineering technology. Subsequently, purified mung bean proteins and peptides have facilitated the study of their structural and functional properties. Two main types of extraction methods have been reported for isolation of proteins and peptides from mung bean flours, permitting sequencing of major proteins present in mung bean, including albumins and globulins (notably 8S globulin). However, the sequence for albumin deposited in the UniProt database differs from other sequences reported in the literature. Meanwhile, a limited number of reports have revealed other useful bioactivities for proteins and hydrolysed peptides, including angiotensin-converting enzyme inhibitory activity, anti-fungal activity and trypsin inhibitory activity. Consequently, several mung bean hydrolysed peptides have served as effective food additives to prevent proteolysis during storage. Ultimately, further research will reveal other nutritional, functional and bioactive properties of mung bean for uses in diverse applications. PMID:29545737
Chen, Gang; Wang, Feng; Dillenburger, Barbara C.; Friedman, Robert M.; Chen, Li M.; Gore, John C.; Avison, Malcolm J.; Roe, Anna W.
2011-01-01
Functional magnetic resonance imaging (fMRI), at high magnetic field strength can suffer from serious degradation of image quality because of motion and physiological noise, as well as spatial distortions and signal losses due to susceptibility effects. Overcoming such limitations is essential for sensitive detection and reliable interpretation of fMRI data. These issues are particularly problematic in studies of awake animals. As part of our initial efforts to study functional brain activations in awake, behaving monkeys using fMRI at 4.7T, we have developed acquisition and analysis procedures to improve image quality with encouraging results. We evaluated the influence of two main variables on image quality. First, we show how important the level of behavioral training is for obtaining good data stability and high temporal signal-to-noise ratios. In initial sessions, our typical scan session lasted 1.5 hours, partitioned into short (<10 minutes) runs. During reward periods and breaks between runs, the monkey exhibited movements resulting in considerable image misregistrations. After a few months of extensive behavioral training, we were able to increase the length of individual runs and the total length of each session. The monkey learned to wait until the end of a block for fluid reward, resulting in longer periods of continuous acquisition. Each additional 60 training sessions extended the duration of each session by 60 minutes, culminating, after about 140 training sessions, in sessions that last about four hours. As a result, the average translational movement decreased from over 500 μm to less than 80 μm, a displacement close to that observed in anesthetized monkeys scanned in a 7 T horizontal scanner. Another major source of distortion at high fields arises from susceptibility variations. To reduce such artifacts, we used segmented gradient-echo echo-planar imaging (EPI) sequences. Increasing the number of segments significantly decreased susceptibility artifacts and image distortion. Comparisons of images from functional runs using four segments with those using a single-shot EPI sequence revealed a roughly two-fold improvement in functional signal-to-noise-ratio and 50% decrease in distortion. These methods enabled reliable detection of neural activation and permitted blood-oxygenation-level-dependent (BOLD) based mapping of early visual areas in monkeys using a volume coil. In summary, both extensive behavioral training of monkeys and application of segmented gradient-echo EPI sequence improved signal-to-noise and image quality. Understanding the effects these factors have is important for the application of high field imaging methods to the detection of sub-millimeter functional structures in the awake monkey brain. PMID:22055855
2010-01-01
Background Newcastle disease (ND), caused by Newcastle disease virus (NDV), is a highly contagious disease of birds and has been one of the major causes of economic losses in the poultry industry. Despite routine vaccination programs, sporadic cases have occasionally occurred in the country and remain a constant threat to commercial poultry. Hence, the present study was aimed to characterize NDV isolates obtained from clinical cases in various locations of Malaysia between 2004 and 2007 based on sequence and phylogenetic analysis of partial F gene and C-terminus extension length of HN gene. Results The coding region of eleven NDV isolates fusion (F) gene and carboxyl terminal region of haemagglutinin-neuraminidase (HN) gene including extensions were amplified by reverse transcriptase PCR and directly sequenced. All the isolates have shown to have non-synonymous to synonymous base substitution rate ranging between 0.081 - 0.264 demonstrating presence of negative selection. Analysis based on F gene showed the characterized isolates possess three different types of protease cleavage site motifs; namely 112RRQKRF117, 112RRRKRF117 and 112GRQGRL117 and appear to show maximum identities with isolates in the region such as cockatoo/14698/90 (Indonesia), Ch/2000 (China), local isolate AF2240 indicating the high similarity of isolates circulating in the South East Asian countries. Meanwhile, one of the isolates resembles commonly used lentogenic vaccine strains. On further characterization of the HN gene, Malaysian isolates had C-terminus extensions of 0, 6 and 11 amino acids. Analysis of the phylogenetic tree revealed that the existence of three genetic groups; namely, genotype II, VII and VIII. Conclusions The study concluded that the occurrence of three types of NDV genotypes and presence of varied carboxyl terminus extension lengths among Malaysian isolates incriminated for sporadic cases. PMID:20691110
Berhanu, Ayalew; Ideris, Aini; Omar, Abdul R; Bejo, Mohd Hair
2010-08-08
Newcastle disease (ND), caused by Newcastle disease virus (NDV), is a highly contagious disease of birds and has been one of the major causes of economic losses in the poultry industry. Despite routine vaccination programs, sporadic cases have occasionally occurred in the country and remain a constant threat to commercial poultry. Hence, the present study was aimed to characterize NDV isolates obtained from clinical cases in various locations of Malaysia between 2004 and 2007 based on sequence and phylogenetic analysis of partial F gene and C-terminus extension length of HN gene. The coding region of eleven NDV isolates fusion (F) gene and carboxyl terminal region of haemagglutinin-neuraminidase (HN) gene including extensions were amplified by reverse transcriptase PCR and directly sequenced. All the isolates have shown to have non-synonymous to synonymous base substitution rate ranging between 0.081 - 0.264 demonstrating presence of negative selection. Analysis based on F gene showed the characterized isolates possess three different types of protease cleavage site motifs; namely 112RRQKRF117, 112RRRKRF117 and 112GRQGRL117 and appear to show maximum identities with isolates in the region such as cockatoo/14698/90 (Indonesia), Ch/2000 (China), local isolate AF2240 indicating the high similarity of isolates circulating in the South East Asian countries. Meanwhile, one of the isolates resembles commonly used lentogenic vaccine strains. On further characterization of the HN gene, Malaysian isolates had C-terminus extensions of 0, 6 and 11 amino acids. Analysis of the phylogenetic tree revealed that the existence of three genetic groups; namely, genotype II, VII and VIII. The study concluded that the occurrence of three types of NDV genotypes and presence of varied carboxyl terminus extension lengths among Malaysian isolates incriminated for sporadic cases.
Regularized rare variant enrichment analysis for case-control exome sequencing data.
Larson, Nicholas B; Schaid, Daniel J
2014-02-01
Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.
Hand, Melanie L.; Spangenberg, German C.; Forster, John W.; Cogan, Noel O. I.
2013-01-01
Chloroplast genome sequences are of broad significance in plant biology, due to frequent use in molecular phylogenetics, comparative genomics, population genetics, and genetic modification studies. The present study used a second-generation sequencing approach to determine and assemble the plastid genomes (plastomes) of four representatives from the agriculturally important Lolium-Festuca species complex of pasture grasses (Lolium multiflorum, Festuca pratensis, Festuca altissima, and Festuca ovina). Total cellular DNA was extracted from either roots or leaves, was sequenced, and the output was filtered for plastome-related reads. A comparison between sources revealed fewer plastome-related reads from root-derived template but an increase in incidental bacterium-derived sequences. Plastome assembly and annotation indicated high levels of sequence identity and a conserved organization and gene content between species. However, frequent deletions within the F. ovina plastome appeared to contribute to a smaller plastid genome size. Comparative analysis with complete plastome sequences from other members of the Poaceae confirmed conservation of most grass-specific features. Detailed analysis of the rbcL–psaI intergenic region, however, revealed a “hot-spot” of variation characterized by independent deletion events. The evolutionary implications of this observation are discussed. The complete plastome sequences are anticipated to provide the basis for potential organelle-specific genetic modification of pasture grasses. PMID:23550121
Sequence analysis of Jembrana disease virus strains reveals a genetically stable lentivirus.
Desport, Moira; Stewart, Meredith E; Mikosza, Andrew S; Sheridan, Carol A; Peterson, Shane E; Chavand, Olivier; Hartaningsih, Nining; Wilcox, Graham E
2007-06-01
Jembrana disease virus (JDV) is a lentivirus associated with an acute disease syndrome with a 20% case fatality rate in Bos javanicus (Bali cattle) in Indonesia, occurring after a short incubation period and with no recurrence of the disease after recovery. Partial regions of gag and pol and the entire env were examined for sequence variation in DNA samples from cases of Jembrana disease obtained from Bali, Sumatra and South Kalimantan in Indonesian Borneo. A high level of nucleotide conservation (97-100%) was observed in gag sequences from samples taken in Bali and Sumatra, indicating that the source of JDV in Sumatra was most likely to have originated from Bali. The pol sequences and, unexpectedly, the env sequences from Bali samples were also well conserved with low nucleotide (96-99%) and amino acid substitutions (95-99%). However, the sample from South Kalimantan (JDV(KAL/01)) contained more divergent sequences, particularly in env (88% identity). Phylogenetic analysis revealed that the JDV(KAL/01)env sequences clustered with the sequence from the Pulukan sample (Bali) from 2001. JDV appears to be remarkably stable genetically and has undergone minor genetic changes over a period of nearly 20 years in Bali despite becoming endemic in the cattle population of the island.
Satellite DNA Sequences in Canidae and Their Chromosome Distribution in Dog and Red Fox.
Vozdova, Miluse; Kubickova, Svatava; Cernohorska, Halina; Fröhlich, Jan; Rubes, Jiri
2016-01-01
Satellite DNA is a characteristic component of mammalian centromeric heterochromatin, and a comparative analysis of its evolutionary dynamics can be used for phylogenetic studies. We analysed satellite and satellite-like DNA sequences available in NCBI for 4 species of the family Canidae (red fox, Vulpes vulpes, VVU; domestic dog, Canis familiaris, CFA; arctic fox, Vulpes lagopus, VLA; raccoon dog, Nyctereutes procyonoides procyonoides, NPR) by comparative sequence analysis, which revealed 86-90% intraspecies and 76-79% interspecies similarity. Comparative fluorescence in situ hybridisation in the red fox and dog showed signals of the red fox satellite probe in canine and vulpine autosomal centromeres, on VVUY, B chromosomes, and in the distal parts of VVU9q and VVU10p which were shown to contain nucleolus organiser regions. The CFA satellite probe stained autosomal centromeres only in the dog. The CFA satellite-like DNA did not show any significant sequence similarity with the satellite DNA of any species analysed and was localised to the centromeres of 9 canine chromosome pairs. No significant heterochromatin block was detected on the B chromosomes of the red fox. Our results show extensive heterogeneity of satellite sequences among Canidae and prove close evolutionary relationships between the red and arctic fox. © 2017 S. Karger AG, Basel.
Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada
2002-07-01
Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Birth and death of genes linked to chromosomal inversion
Furuta, Yoshikazu; Kawai, Mikihiko; Yahara, Koji; Takahashi, Noriko; Handa, Naofumi; Tsuru, Takeshi; Oshima, Kenshiro; Yoshida, Masaru; Azuma, Takeshi; Hattori, Masahira; Uchiyama, Ikuo; Kobayashi, Ichizo
2011-01-01
The birth and death of genes is central to adaptive evolution, yet the underlying genome dynamics remain elusive. The availability of closely related complete genome sequences helps to follow changes in gene contents and clarify their relationship to overall genome organization. Helicobacter pylori, bacteria in our stomach, are known for their extreme genome plasticity through mutation and recombination and will make a good target for such an analysis. In comparing their complete genome sequences, we found that gain and loss of genes (loci) for outer membrane proteins, which mediate host interaction, occurred at breakpoints of chromosomal inversions. Sequence comparison there revealed a unique mechanism of DNA duplication: DNA duplication associated with inversion. In this process, a DNA segment at one chromosomal locus is copied and inserted, in an inverted orientation, into a distant locus on the same chromosome, while the entire region between these two loci is also inverted. Recognition of this and three more inversion modes, which occur through reciprocal recombination between long or short sequence similarity or adjacent to a mobile element, allowed reconstruction of synteny evolution through inversion events in this species. These results will guide the interpretation of extensive DNA sequencing results for understanding long- and short-term genome evolution in various organisms and in cancer cells. PMID:21212362
Dominguez, Gustavo A; Quattro, Joseph M; Denslow, Nancy D; Kroll, Kevin J; Prucha, Melinda S; Porak, Wesley F; Grier, Harry J; Sabo-Attwood, Tara L
2012-09-01
Fish vitellogenin synthesized and released from the liver of oviparous animals is taken up into oocytes by the vitellogenin receptor. This is an essential process in providing nutrient yolk to developing embryos to ensure successful reproduction. Here we disclose the full length vtgr cDNA sequence for largemouth bass (LMB) that reveals greater than 90% sequence homology with other fish vtgr sequences. We classify LMB Vtgr as a member of the low density lipoprotein receptor superfamily based on conserved domains and categorize as the short variant that is devoid of the O-glycan segment. Phylogenetic analysis places LMB Vtgr sequence into a well-supported monophyletic group of fish Vtgr. Real-time PCR showed that the greatest levels of LMB vtgr mRNA expression occurred in previtellogenic ovarian tissues. In addition, we reveal the effects of insulin, 17beta-estradiol (E(2)), and 11-ketotestosterone (11-KT) in modulation of vtgr, esr, and ar mRNAs in previtellogenic oocytes. Insulin increased vtgr expression levels in follicles ex vivo while exposure to E(2) or 11-KT did not result in modulation of expression. However, both steroids were able to repress insulin-induced vtgr transcript levels. Coexposure with insulin and E(2) or of insulin and 11-KT increased ovarian esr2b and ar mRNA levels, respectively, which suggest a role for these nuclear receptors in insulin-mediated signaling pathways. These data provide the first evidence for the ordered stage-specific expression of LMB vtgr during the normal reproductive process and the hormonal influence of insulin and sex steroids on controlling vtgr transcript levels in ovarian tissues.
Dominguez, Gustavo A.; Quattro, Joseph M.; Denslow, Nancy D.; Kroll, Kevin J.; Prucha, Melinda S.; Porak, Wesley F.; Grier, Harry J.; Sabo-Attwood, Tara L.
2012-01-01
ABSTRACT Fish vitellogenin synthesized and released from the liver of oviparous animals is taken up into oocytes by the vitellogenin receptor. This is an essential process in providing nutrient yolk to developing embryos to ensure successful reproduction. Here we disclose the full length vtgr cDNA sequence for largemouth bass (LMB) that reveals greater than 90% sequence homology with other fish vtgr sequences. We classify LMB Vtgr as a member of the low density lipoprotein receptor superfamily based on conserved domains and categorize as the short variant that is devoid of the O-glycan segment. Phylogenetic analysis places LMB Vtgr sequence into a well-supported monophyletic group of fish Vtgr. Real-time PCR showed that the greatest levels of LMB vtgr mRNA expression occurred in previtellogenic ovarian tissues. In addition, we reveal the effects of insulin, 17beta-estradiol (E2), and 11-ketotestosterone (11-KT) in modulation of vtgr, esr, and ar mRNAs in previtellogenic oocytes. Insulin increased vtgr expression levels in follicles ex vivo while exposure to E2 or 11-KT did not result in modulation of expression. However, both steroids were able to repress insulin-induced vtgr transcript levels. Coexposure with insulin and E2 or of insulin and 11-KT increased ovarian esr2b and ar mRNA levels, respectively, which suggest a role for these nuclear receptors in insulin-mediated signaling pathways. These data provide the first evidence for the ordered stage-specific expression of LMB vtgr during the normal reproductive process and the hormonal influence of insulin and sex steroids on controlling vtgr transcript levels in ovarian tissues. PMID:22786822
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells.
Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang
2018-01-01
Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. © 2018 Han et al.; Published by Cold Spring Harbor Laboratory Press.
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells
Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang
2018-01-01
Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. PMID:29208629
Chakraborty, Sandipan; Chatterjee, Barnali; Basu, Soumalee
2012-07-01
A collective approach of sequence analysis, phylogenetic tree and in silico prediction of amyloidogenecity using bioinformatics tools have been used to correlate the observed species-specific variations in IAPP sequences with the amyloid forming propensity. Observed substitution patterns indicate that probable changes in local hydrophobicity are instrumental in altering the aggregation propensity of the peptide. In particular, residues at 17th, 22nd and 23rd positions of the IAPP peptide are found to be crucial for amyloid formation. Proline25 primarily dictates the observed non-amyloidogenecity in rodents. Furthermore, extensive molecular dynamics simulation of 0.24 μs have been carried out with human IAPP (hIAPP) fragment 19-27, the portion showing maximum sequence variation across different species, to understand the native folding characteristic of this region. Principal component analysis in combination with free energy landscape analysis illustrates a four residue turn spanning from residue 22 to 25. The results provide a structural insight into the intramolecular β-sheet structure of amylin which probably is the template for nucleation of fibril formation and growth, a pathogenic feature of type II diabetes. Copyright © 2012 Elsevier B.V. All rights reserved.
Rapid phylogenetic dissection of prokaryotic community structure in tidal flat using pyrosequencing.
Kim, Bong-Soo; Kim, Byung Kwon; Lee, Jae-Hak; Kim, Myungjin; Lim, Young Woon; Chun, Jongsik
2008-08-01
Dissection of prokaryotic community structure is prerequisite to understand their ecological roles. Various methods are available for such a purpose which amplification and sequencing of 16S rRNA genes gained its popularity. However, conventional methods based on Sanger sequencing technique require cloning process prior to sequencing, and are expensive and labor-intensive. We investigated prokaryotic community structure in tidal flat sediments, Korea, using pyrosequencing and a subsequent automated bioinformatic pipeline for the rapid and accurate taxonomic assignment of each amplicon. The combination of pyrosequencing and bioinformatic analysis showed that bacterial and archaeal communities were more diverse than previously reported in clone library studies. Pyrosequencing analysis revealed 21 bacterial divisions and 37 candidate divisions. Proteobacteria was the most abundant division in the bacterial community, of which Gamma-and Delta-Proteobacteria were the most abundant. Similarly, 4 archaeal divisions were found in tidal flat sediments. Euryarchaeota was the most abundant division in the archaeal sequences, which were further divided into 8 classes and 11 unclassified euryarchaeota groups. The system developed here provides a simple, in-depth and automated way of dissecting a prokaryotic community structure without extensive pretreatment such as cloning.
VarMod: modelling the functional effects of non-synonymous variants
Pappalardo, Morena; Wass, Mark N.
2014-01-01
Unravelling the genotype–phenotype relationship in humans remains a challenging task in genomics studies. Recent advances in sequencing technologies mean there are now thousands of sequenced human genomes, revealing millions of single nucleotide variants (SNVs). For non-synonymous SNVs present in proteins the difficulties of the problem lie in first identifying those nsSNVs that result in a functional change in the protein among the many non-functional variants and in turn linking this functional change to phenotype. Here we present VarMod (Variant Modeller) a method that utilises both protein sequence and structural features to predict nsSNVs that alter protein function. VarMod develops recent observations that functional nsSNVs are enriched at protein–protein interfaces and protein–ligand binding sites and uses these characteristics to make predictions. In benchmarking on a set of nearly 3000 nsSNVs VarMod performance is comparable to an existing state of the art method. The VarMod web server provides extensive resources to investigate the sequence and structural features associated with the predictions including visualisation of protein models and complexes via an interactive JSmol molecular viewer. VarMod is available for use at http://www.wasslab.org/varmod. PMID:24906884
Jones, Susan C.; Jenkins, Tracie M.
2016-01-01
The goal of this study was to infer Heterotermes (Froggatt) (Dictyoptera: Rhinotermitidae) species diversity on the island of Puerto Rico from phylogenetic analyses of DNA sequence data from two mitochondrial genes, 16S rRNA and cytochrome oxidase II (COII). This termite genus is a structural pest known to be well adapted to arid environments in subtropical and tropical regions worldwide including Puerto Rico and many other Caribbean islands. Extensive sampling was accomplished across Puerto Rico, and phylogenetic analyses of individual gene sequences from these samples indicated robust datasets of congruent gene tree topologies showing three monophyletic groups: H. cardini (Snyder), H. convexinotatus (Snyder), and H. tenuis (Hagen). We found that H. cardini and H. convexinotatus were widespread in the arid coastal regions of Puerto Rico, whereas H. tenuis was uncommon and may represent a relatively new introduction. We found only H. convexinotatus on Culebra Island. We provide strong evidence that Puerto Rico may be linked to the Heterotermes in southern Florida, USA, since its GenBank 16S sequence was identical to that of seven Puerto Rican H. cardini sequences. Our study represents the first records of H. cardini from Puerto Rico and Grand Bahama.
A tale of two sequences: microRNA-target chimeric reads.
Broughton, James P; Pasquinelli, Amy E
2016-04-04
In animals, a functional interaction between a microRNA (miRNA) and its target RNA requires only partial base pairing. The limited number of base pair interactions required for miRNA targeting provides miRNAs with broad regulatory potential and also makes target prediction challenging. Computational approaches to target prediction have focused on identifying miRNA target sites based on known sequence features that are important for canonical targeting and may miss non-canonical targets. Current state-of-the-art experimental approaches, such as CLIP-seq (cross-linking immunoprecipitation with sequencing), PAR-CLIP (photoactivatable-ribonucleoside-enhanced CLIP), and iCLIP (individual-nucleotide resolution CLIP), require inference of which miRNA is bound at each site. Recently, the development of methods to ligate miRNAs to their target RNAs during the preparation of sequencing libraries has provided a new tool for the identification of miRNA target sites. The chimeric, or hybrid, miRNA-target reads that are produced by these methods unambiguously identify the miRNA bound at a specific target site. The information provided by these chimeric reads has revealed extensive non-canonical interactions between miRNAs and their target mRNAs, and identified many novel interactions between miRNAs and noncoding RNAs.
Telex Language in Thailand: Its Importance, Its Teaching and the Materials Required.
ERIC Educational Resources Information Center
Savangvarorose, Bang-Orn
A 1985 survey of telex use in Thailand revealed that 90 out of 100 companies send and receive between 1 and 50 telexes a day, that telex is used more extensively than any other form of written communication for business, and that there are typical telex styles and formats. Information on business education on the university level reveals that only…
NASA Astrophysics Data System (ADS)
Stanković, Ana; Nadachowski, Adam; Doan, Karolina; Stefaniak, Krzysztof; Baca, Mateusz; Socha, Paweł; Wegleński, Piotr; Ridush, Bogdan
2010-05-01
The Late Pleistocene has been a period of significant population and species turnover and extinctions among the large mammal fauna. Massive climatic and environmental changes during Pleistocene significantly influenced the distribution and also genetic diversity of plants and animals. The model of glacial refugia and habitat contraction to southern peninsulas in Europe as areas for the survival of temperate animal species during unfavourable Pleistocene glaciations is at present widely accepted. However, both molecular data and the fossil record indicate the presence of northern and perhaps north-eastern refugia in Europe. In recent years, much new palaeontological data have been obtained in the Crimean Peninsula, Ukraine, following extensive investigations. The red deer (Cervus elaphus) samples for aDNA studies were collected in Emine-Bair-Khosar Cave, situated on the north edge of Lower Plateau of the Chatyrdag Massif (Crimean Mountains). The cave is a vertical shaft, which functioned as a huge mega-trap over a long period of time (probably most of the Pleistocene). The bone assemblages provided about 5000 bones belonging to more than 40 species. The C. elaphus bones were collected from three different stratigraphical levels, radiocarbon dated by accelerator mass spectrometry (AMS) method. The bone fragments of four specimens of red deer were used for the DNA isolation and analysis. The mtDNA (Cytochome b) was successfully isolated from three bone fragments and the cytochrome b sequences were amplified by multiplex PCR. The sequences obtained so far allowed for the reconstruction of only preliminary phylogenetic trees. A fragment of metatarsus from level dated to ca. 48,500±2,000 years BP, yielded a sequence of 513 bp, allowing to locate the specimen on the phylogenetic tree within modern C. elaphus specimens from southern and middle Europe. The second bone fragment, a fragment of mandible, collected from level dated approximately to ca. 33,500±400 years BP, yielded a sequence (696 bp) locating this specimen much closer to the modern C. elaphus specimens from China and Far East. From the third bone fragment (metatarsus), dated between ca. 12,000 years BP and 30,000 years BP, the sequence of only 346 bp has been obtained. It locates this specimen between European and Asiatic haplogroups. The preliminary results of analysis of the DNA from Crimean C. elaphus fossils reveal the great genetic heterogeneity and a complex phylogeographical pattern of the material studied. The obtained results support the opinion that Crimean Peninsula was the most north-eastern refugium in Europe during Late Pleistocene playing a major role in recolonization and dispersal processes of temperate species during and after the Late Pleistocene in this part of the Euro-Asian continent.
Sakakibara, Yasumbumi
2018-02-13
Keio University's Yasumbumi Sakakibara on "MetaVelvet: An Extension of Velvet Assembler to de novo Metagenome Assembly from Short Sequence Reads" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sakakibara, Yasumbumi
2011-10-13
Keio University's Yasumbumi Sakakibara on "MetaVelvet: An Extension of Velvet Assembler to de novo Metagenome Assembly from Short Sequence Reads" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.
Jennings, W Bryan; Wogel, Henrique; Bilate, Marcos; Salles, Rodrigo de O L; Buckup, Paulo A
2016-09-01
The microhylid frogs belonging to the genus Arcovomer have been reported from lowland Atlantic Rainforest in the Brazilian states of Espírito Santo, Rio de Janeiro, and São Paulo. Here, we use DNA barcoding to assess levels of genetic divergence between apparently isolated populations in Espírito Santo and Rio de Janeiro. Our mtDNA data consisting of cytochrome oxidase subunit I (COI) nucleotide sequences reveals 13.2% uncorrected and 30.4% TIM2 + I + Γ corrected genetic divergences between these two populations. This level of divergence exceeds the suggested 10% uncorrected divergence threshold for elevating amphibian populations to candidate species using this marker, which implies that the Espírito Santo population is a species distinct from Arcovomer passarellii. Calibration of our model-corrected sequence divergence estimates suggests that the time of population divergence falls between 12 and 29 million years ago.
Fajardo, Diego; Schlautman, Brandon; Steffan, Shawn; Polashock, James; Vorsa, Nicholi; Zalapa, Juan
2014-02-25
This is the first de novo assembly and annotation of a complete mitochondrial genome in the Ericales order from the American cranberry (Vaccinium macrocarpon Ait.). Moreover, only four complete Asterid mitochondrial genomes have been made publicly available. The cranberry mitochondrial genome was assembled and reconstructed from whole genome 454 Roche GS-FLX and Illumina shotgun sequences. Compared with other Asterids, the reconstruction of the genome revealed an average size mitochondrion (459,678 nt) with relatively little repetitive sequences and DNA of plastid origin. The complete mitochondrial genome of cranberry was annotated obtaining a total of 34 genes classified based on their putative function, plus three ribosomal RNAs, and 17 transfer RNAs. Maternal organellar cranberry inheritance was inferred by analyzing gene variation in the cranberry mitochondria and plastid genomes. The annotation of cranberry mitochondrial genome revealed the presence of two copies of tRNA-Sec and a selenocysteine insertion sequence (SECIS) element which were lost in plants during evolution. This is the first report of a land plant possessing selenocysteine insertion machinery at the sequence level. Published by Elsevier B.V.
The organization and expression of the mdm2 gene.
de Oca Luna, R M; Tabor, A D; Eberspaecher, H; Hulboy, D L; Worth, L L; Colman, M S; Finlay, C A; Lozano, G
1996-05-01
The mdm2 gene encodes a zinc finger protein that negatively regulates p53 function by binding and masking the p53 transcriptional activation domain. Two different promoters control expression of mdm2, one of which is also transactivated by p53. We cloned and characterized the mdm2 gene from a murine 129 library. It contained at least 12 exons and spanned approximately 25 kb of DNA. Sequencing of the mdm2 gene revealed three nucleotide differences that resulted in amino acid substitutions in the previously published mdm2 sequence. Sequencing of normal BalbC/J DNA and the original cosmid clone isolated from the 3T3DM cell line revealed that they are identical, suggesting that the published sequence is in error at these three positions. In addition, we analyzed the expression pattern of mdm2 and found ubiquitous low-level expression throughout embryo development and in adult tissues. Analysis of mRNA from numerous tissues for several mdm2 spliced variants that had been identified in the transformed 3T3DM cell line revealed that these variants could not be detected in the developing embryo or in adult tissues.
Pasta, Saloni Yatin; Raman, Bakthisaran; Ramakrishna, Tangirala; Rao, Ch Mohan
2002-11-29
Several small heat shock proteins contain a well conserved alpha-crystallin domain, flanked by an N-terminal domain and a C-terminal extension, both of which vary in length and sequence. The structural and functional role of the C-terminal extension of small heat shock proteins, particularly of alphaA- and alphaB-crystallins, is not well understood. We have swapped the C-terminal extensions between alphaA- and alphaB-crystallins and generated two novel chimeric proteins, alphaABc and alphaBAc. We have investigated the domain-swapped chimeras for structural and functional alterations. We have used thermal and non-thermal models of protein aggregation and found that the chimeric alphaB with the C-terminal extension of alphaA-crystallin, alphaBAc, exhibits dramatically enhanced chaperone-like activity. Interestingly, however, the chimeric alphaA with the C-terminal extension of alphaB-crystallin, alphaABc, has almost lost its activity. Pyrene solubilization and bis-1-anilino-8-naphthalenesulfonate binding studies show that alphaBAc exhibits more solvent-exposed hydrophobic pockets than alphaA, alphaB, or alphaABc. Significant tertiary structural changes are revealed by tryptophan fluorescence and near-UV CD studies upon swapping the C-terminal extensions. The far-UV CD spectrum of alphaBAc differs from that of alphaB-crystallin whereas that of alphaABc overlaps with that of alphaA-crystallin. Gel filtration chromatography shows alteration in the size of the proteins upon swapping the C-terminal extensions. Our study demonstrates that the unstructured C-terminal extensions play a crucial role in the structure and chaperone activity, in addition to generally believed electrostatic "solubilizer" function.
Christie, Andrew E.; Fontanilla, Tiana M.; Nesbit, Katherine T.; Lenz, Petra H.
2013-01-01
Diel vertical migration and seasonal diapause are critical life history events for the copepod Calanus finmarchicus. While much is known about these behaviors phenomenologically, little is known about their molecular underpinnings. Recent studies in insects suggest that some circadian genes/proteins also contribute to the establishment of seasonal diapause. Thus, it is possible that in Calanus these distinct timing regimes share some genetic components. To begin to address this possibility, we used the well-established Drosophila melanogaster circadian system as a reference for mining clock transcripts from a 200,000+ sequence Calanus transcriptome; the proteins encoded by the identified transcripts were also deduced and characterized. Sequences encoding homologs of the Drosophila core clock proteins CLOCK, CYCLE, PERIOD and TIMELESS were identified, as was one encoding CRYPTOCHROME 2, a core clock protein in ancestral insect systems, but absent in Drosophila. Calanus transcripts encoding proteins known to modulate the Drosophila core clock were also identified and characterized, e.g. CLOCKWORK ORANGE, DOUBLETIME, SHAGGY and VRILLE. Alignment and structural analyses of the deduced Calanus proteins with their Drosophila counterparts revealed extensive sequence conservation, particularly in functional domains. Interestingly, reverse BLAST analyses of these sequences against all arthropod proteins typically revealed non-Drosophila isoforms to be most similar to the Calanus queries. This, in combination with the presence of both CRYPTOCHROME 1 (a clock input pathway protein) and CRYPTOCHROME 2 in Calanus, suggests that the organization of the copepod circadian system is an ancestral one, more similar to that of insects like Danaus plexippus than to that of Drosophila. PMID:23727418
Matrix metalloproteinases outside vertebrates.
Marino-Puertas, Laura; Goulas, Theodoros; Gomis-Rüth, F Xavier
2017-11-01
The matrix metalloproteinase (MMP) family belongs to the metzincin clan of zinc-dependent metallopeptidases. Due to their enormous implications in physiology and disease, MMPs have mainly been studied in vertebrates. They are engaged in extracellular protein processing and degradation, and present extensive paralogy, with 23 forms in humans. One characteristic of MMPs is a ~165-residue catalytic domain (CD), which has been structurally studied for 14 MMPs from human, mouse, rat, pig and the oral-microbiome bacterium Tannerella forsythia. These studies revealed close overall coincidence and characteristic structural features, which distinguish MMPs from other metzincins and give rise to a sequence pattern for their identification. Here, we reviewed the literature available on MMPs outside vertebrates and performed database searches for potential MMP CDs in invertebrates, plants, fungi, viruses, protists, archaea and bacteria. These and previous results revealed that MMPs are widely present in several copies in Eumetazoa and higher plants (Tracheophyta), but have just token presence in eukaryotic algae. A few dozen sequences were found in Ascomycota (within fungi) and in double-stranded DNA viruses infecting invertebrates (within viruses). In contrast, a few hundred sequences were found in archaea and >1000 in bacteria, with several copies for some species. Most of the archaeal and bacterial phyla containing potential MMPs are present in human oral and gut microbiomes. Overall, MMP-like sequences are present across all kingdoms of life, but their asymmetric distribution contradicts the vertical descent model from a eubacterial or archaeal ancestor. This article is part of a Special Issue entitled: Matrix Metalloproteinases edited by Rafael Fridman. Copyright © 2017 Elsevier B.V. All rights reserved.
Not all order memory is equal: Test demands reveal dissociations in memory for sequence information.
Jonker, Tanya R; MacLeod, Colin M
2017-02-01
Remembering the order of a sequence of events is a fundamental feature of episodic memory. Indeed, a number of formal models represent temporal context as part of the memory system, and memory for order has been researched extensively. Yet, the nature of the code(s) underlying sequence memory is still relatively unknown. Across 4 experiments that manipulated encoding task, we found evidence for 3 dissociable facets of order memory. Experiment 1 introduced a test requiring a judgment of which of 2 alternatives had immediately followed a word during encoding. This measure revealed better retention of interitem associations following relational encoding (silent reading) than relatively item-specific encoding (judging referent size), a pattern consistent with that observed in previous research using order reconstruction tests. In sharp contrast, Experiment 2 demonstrated the reverse pattern: Memory for the studied order of 2 sequentially presented items was actually better following item-specific encoding than following relational encoding. Experiment 3 reproduced this dissociation in a single experiment using both tests. Experiment 4 extended these findings by further dissociating the roles of relational encoding and item strength in the 2 tests. Taken together, these results indicate that memory for event sequence is influenced by (a) interitem associations, (b) the emphasized directionality of an association, and (c) an item's strength independent of other items. Memory for order is more complicated than has been portrayed in theories of memory and its nuances should be carefully considered when designing tests and models of temporal and relational memory. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Tennessen, Jacob A.; Govindarajulu, Rajanikanth; Ashman, Tia-Lynn; Liston, Aaron
2014-01-01
Whole-genome duplications are radical evolutionary events that have driven speciation and adaptation in many taxa. Higher-order polyploids have complex histories often including interspecific hybridization and dynamic genomic changes. This chromosomal reshuffling is poorly understood for most polyploid species, despite their evolutionary and agricultural importance, due to the challenge of distinguishing homologous sequences from each other. Here, we use dense linkage maps generated with targeted sequence capture to improve the diploid strawberry (Fragaria vesca) reference genome and to disentangle the subgenomes of the wild octoploid progenitors of cultivated strawberry, Fragaria virginiana and Fragaria chiloensis. Our novel approach, POLiMAPS (Phylogenetics Of Linkage-Map-Anchored Polyploid Subgenomes), leverages sequence reads to associate informative interhomeolog phylogenetic markers with linkage groups and reference genome positions. In contrast to a widely accepted model, we find that one of the four subgenomes originates with the diploid cytoplasm donor F. vesca, one with the diploid Fragaria iinumae, and two with an unknown ancestor close to F. iinumae. Extensive unidirectional introgression has converted F. iinumae-like subgenomes to be more F. vesca-like, but never the reverse, due either to homoploid hybridization in the F. iinumae-like diploid ancestors or else strong selection spreading F. vesca-like sequence among subgenomes through homeologous exchange. In addition, divergence between homeologous chromosomes has been substantially augmented by interchromosomal rearrangements. Our phylogenetic approach reveals novel aspects of the complicated web of genetic exchanges that occur during polyploid evolution and suggests a path forward for unraveling other agriculturally and ecologically important polyploid genomes. PMID:25477420
Alguacil, Maria del Mar; Torrecillas, Emma; Lozano, Zenaida; Roldán, Antonio
2011-01-01
Arbuscular mycorrhizal fungi (AMF) play important roles as plant protection agents, reducing or suppressing nematode colonization. However, it has never been investigated whether the galls produced in roots by nematode infection are colonized by AMF. This study tested whether galls produced by Meloidogyne incognita infection in Prunus persica roots are colonized by AMF. We also determined the changes in AMF composition and biodiversity mediated by infection with this root-knot nematode. DNA from galls and roots of plants infected by M. incognita and from roots of noninfected plants was extracted, amplified, cloned, and sequenced using AMF-specific primers. Phylogenetic analysis using the small-subunit (SSU) ribosomal DNA (rDNA) data set revealed 22 different AMF sequence types (17 Glomus sequence types, 3 Paraglomus sequence types, 1 Scutellospora sequence type, and 1 Acaulospora sequence type). The highest AMF diversity was found in uninfected roots, followed by infected roots and galls. This study indicates that the galls produced in P. persica roots due to infection with M. incognita were colonized extensively by a community of AMF, belonging to the families Paraglomeraceae and Glomeraceae, that was different from the community detected in roots. Although the function of the AMF in the galls is still unknown, we hypothesize that they act as protection agents against opportunistic pathogens. PMID:21984233
Alguacil, Maria del Mar; Torrecillas, Emma; Lozano, Zenaida; Roldán, Antonio
2011-12-01
Arbuscular mycorrhizal fungi (AMF) play important roles as plant protection agents, reducing or suppressing nematode colonization. However, it has never been investigated whether the galls produced in roots by nematode infection are colonized by AMF. This study tested whether galls produced by Meloidogyne incognita infection in Prunus persica roots are colonized by AMF. We also determined the changes in AMF composition and biodiversity mediated by infection with this root-knot nematode. DNA from galls and roots of plants infected by M. incognita and from roots of noninfected plants was extracted, amplified, cloned, and sequenced using AMF-specific primers. Phylogenetic analysis using the small-subunit (SSU) ribosomal DNA (rDNA) data set revealed 22 different AMF sequence types (17 Glomus sequence types, 3 Paraglomus sequence types, 1 Scutellospora sequence type, and 1 Acaulospora sequence type). The highest AMF diversity was found in uninfected roots, followed by infected roots and galls. This study indicates that the galls produced in P. persica roots due to infection with M. incognita were colonized extensively by a community of AMF, belonging to the families Paraglomeraceae and Glomeraceae, that was different from the community detected in roots. Although the function of the AMF in the galls is still unknown, we hypothesize that they act as protection agents against opportunistic pathogens.
Maruyama, Kyonoshin; Todaka, Daisuke; Mizoi, Junya; Yoshida, Takuya; Kidokoro, Satoshi; Matsukura, Satoko; Takasaki, Hironori; Sakurai, Tetsuya; Yamamoto, Yoshiharu Y.; Yoshiwara, Kyouko; Kojima, Mikiko; Sakakibara, Hitoshi; Shinozaki, Kazuo; Yamaguchi-Shinozaki, Kazuko
2012-01-01
The genomes of three plants, Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and soybean (Glycine max), have been sequenced, and their many genes and promoters have been predicted. In Arabidopsis, cis-acting promoter elements involved in cold- and dehydration-responsive gene expression have been extensively analysed; however, the characteristics of such cis-acting promoter sequences in cold- and dehydration-inducible genes of rice and soybean remain to be clarified. In this study, we performed microarray analyses using the three species, and compared characteristics of identified cold- and dehydration-inducible genes. Transcription profiles of the cold- and dehydration-responsive genes were similar among these three species, showing representative upregulated (dehydrin/LEA) and downregulated (photosynthesis-related) genes. All (46 = 4096) hexamer sequences in the promoters of the three species were investigated, revealing the frequency of conserved sequences in cold- and dehydration-inducible promoters. A core sequence of the abscisic acid-responsive element (ABRE) was the most conserved in dehydration-inducible promoters of all three species, suggesting that transcriptional regulation for dehydration-inducible genes is similar among these three species, with the ABRE-dependent transcriptional pathway. In contrast, for cold-inducible promoters, the conserved hexamer sequences were diversified among these three species, suggesting the existence of diverse transcriptional regulatory pathways for cold-inducible genes among the species. PMID:22184637
Effect of genome sequence on the force-induced unzipping of a DNA molecule.
Singh, N; Singh, Y
2006-02-01
We considered a dsDNA polymer in which distribution of bases are random at the base pair level but ordered at a length of 18 base pairs and calculated its force elongation behaviour in the constant extension ensemble. The unzipping force F(y) vs. extension y is found to have a series of maxima and minima. By changing base pairs at selected places in the molecule we calculated the change in F(y) curve and found that the change in the value of force is of the order of few pN and the range of the effect depending on the temperature, can spread over several base pairs. We have also discussed briefly how to calculate in the constant force ensemble a pause or a jump in the extension-time curve from the knowledge of F(y).
Fernández, Helena; Hughes, Sandrine; Vigne, Jean-Denis; Helmer, Daniel; Hodgins, Greg; Miquel, Christian; Hänni, Catherine; Luikart, Gordon; Taberlet, Pierre
2006-01-01
Goats were among the first farm animals domesticated, ≈10,500 years ago, contributing to the rise of the “Neolithic revolution.” Previous genetic studies have revealed that contemporary domestic goats (Capra hircus) show far weaker intercontinental population structuring than other livestock species, suggesting that goats have been transported more extensively. However, the timing of these extensive movements in goats remains unknown. To address this question, we analyzed mtDNA sequences from 19 ancient goat bones (7,300–6,900 years old) from one of the earliest Neolithic sites in southwestern Europe. Phylogenetic analysis revealed that two highly divergent goat lineages coexisted in each of the two Early Neolithic layers of this site. This finding indicates that high mtDNA diversity was already present >7,000 years ago in European goats, far from their areas of initial domestication in the Near East. These results argue for substantial gene flow among goat populations dating back to the early neolithisation of Europe and for a dual domestication scenario in the Near East, with two independent but essentially contemporary origins (of both A and C domestic lineages) and several more remote and/or later origins. PMID:17030824
Limitations and potentials of current motif discovery algorithms
Hu, Jianjun; Li, Bin; Kihara, Daisuke
2005-01-01
Computational methods for de novo identification of gene regulation elements, such as transcription factor binding sites, have proved to be useful for deciphering genetic regulatory networks. However, despite the availability of a large number of algorithms, their strengths and weaknesses are not sufficiently understood. Here, we designed a comprehensive set of performance measures and benchmarked five modern sequence-based motif discovery algorithms using large datasets generated from Escherichia coli RegulonDB. Factors that affect the prediction accuracy, scalability and reliability are characterized. It is revealed that the nucleotide and the binding site level accuracy are very low, while the motif level accuracy is relatively high, which indicates that the algorithms can usually capture at least one correct motif in an input sequence. To exploit diverse predictions from multiple runs of one or more algorithms, a consensus ensemble algorithm has been developed, which achieved 6–45% improvement over the base algorithms by increasing both the sensitivity and specificity. Our study illustrates limitations and potentials of existing sequence-based motif discovery algorithms. Taking advantage of the revealed potentials, several promising directions for further improvements are discussed. Since the sequence-based algorithms are the baseline of most of the modern motif discovery algorithms, this paper suggests substantial improvements would be possible for them. PMID:16284194
D'hooge, Roseline; Cagnie, Barbara; Crombez, Geert; Vanderstraeten, Guy; Achten, Eric; Danneels, Lieven
2013-03-01
After cessation of a low-back pain (LBP) episode, alterations in trunk muscle behavior, despite recovery from pain, have been hypothesized to play a pathogenic role in the recurrence of LBP. This study aimed to identify the presence of lumbar muscle dysfunction during the remission of recurrent LBP, while performing a low-load trunk-extension movement. Thirteen participants with unilateral recurrent LBP were tested at least 1 month after cessation of the previous LBP episode and were compared with a healthy control group without any history of LBP (n=13). Also, differences between previously painful and nonpainful sides were examined. Muscle functional magnetic resonance imaging, based on quantitative T2-imaging, was used to examine muscle tissue characteristics (T2 rest) and muscle recruitment (T2 shift) during prone trunk extension. The lumbar multifidus, erector spinae, quadratus lumborum, and psoas were bilaterally visualized on 2 lumbar levels using a T2-weighted (spin-echo multicontrast) magnetic resonance imaging sequence. Linear mixed model analysis revealed a significantly lower T2 rest (P=0.044) and a significantly higher T2 shift (P=0.034) solely for the multifidus in the LBP group compared with the control group. No significant differences between pain sides were found. Lower T2-rest values have been suggested to correlate with a conversion of the multifidus' fiber typing toward the glycolytic muscle spectrum. Elevated T2 shifts correspond with increased levels of metabolic activity in the multifidus in the LBP group, for which several hypotheses can be put forward. Taken together, these findings provide evidence of concurrent alterations in the multifidus structure and activity in individuals with unilateral recurrent LBP, despite being pain free and functionally recovered.
Failla, A J; Vasquez, A A; Hudson, P; Fujimoto, M; Ram, J L
2016-02-01
Establishing reliable methods for the identification of benthic chironomid communities is important due to their significant contribution to biomass, ecology and the aquatic food web. Immature larval specimens are more difficult to identify to species level by traditional morphological methods than their fully developed adult counterparts, and few keys are available to identify the larval species. In order to develop molecular criteria to identify species of chironomid larvae, larval and adult chironomids from Western Lake Erie were subjected to both molecular and morphological taxonomic analysis. Mitochondrial cytochrome c oxidase I (COI) barcode sequences of 33 adults that were identified to species level by morphological methods were grouped with COI sequences of 189 larvae in a neighbor-joining taxon-ID tree. Most of these larvae could be identified only to genus level by morphological taxonomy (only 22 of the 189 sequenced larvae could be identified to species level). The taxon-ID tree of larval sequences had 45 operational taxonomic units (OTUs, defined as clusters with >97% identity or individual sequences differing from nearest neighbors by >3%; supported by analysis of all larval pairwise differences), of which seven could be identified to species or 'species group' level by larval morphology. Reference sequences from the GenBank and BOLD databases assigned six larval OTUs with presumptive species level identifications and confirmed one previously assigned species level identification. Sequences from morphologically identified adults in the present study grouped with and further classified the identity of 13 larval OTUs. The use of morphological identification and subsequent DNA barcoding of adult chironomids proved to be beneficial in revealing possible species level identifications of larval specimens. Sequence data from this study also contribute to currently inadequate public databases relevant to the Great Lakes region, while the neighbor-joining analysis reported here describes the application and confirmation of a useful tool that can accelerate identification and bioassessment of chironomid communities.
Failla, Andrew Joseph; Vasquez, Adrian Amelio; Hudson, Patrick L.; Fujimoto, Masanori; Ram, Jeffrey L.
2016-01-01
Establishing reliable methods for the identification of benthic chironomid communities is important due to their significant contribution to biomass, ecology and the aquatic food web. Immature larval specimens are more difficult to identify to species level by traditional morphological methods than their fully developed adult counterparts, and few keys are available to identify the larval species. In order to develop molecular criteria to identify species of chironomid larvae, larval and adult chironomids from Western Lake Erie were subjected to both molecular and morphological taxonomic analysis. Mitochondrial cytochrome c oxidase I (COI) barcode sequences of 33 adults that were identified to species level by morphological methods were grouped with COI sequences of 189 larvae in a neighbor-joining taxon-ID tree. Most of these larvae could be identified only to genus level by morphological taxonomy (only 22 of the 189 sequenced larvae could be identified to species level). The taxon-ID tree of larval sequences had 45 operational taxonomic units (OTUs, defined as clusters with >97% identity or individual sequences differing from nearest neighbors by >3%; supported by analysis of all larval pairwise differences), of which seven could be identified to species or ‘species group’ level by larval morphology. Reference sequences from the GenBank and BOLD databases assigned six larval OTUs with presumptive species level identifications and confirmed one previously assigned species level identification. Sequences from morphologically identified adults in the present study grouped with and further classified the identity of 13 larval OTUs. The use of morphological identification and subsequent DNA barcoding of adult chironomids proved to be beneficial in revealing possible species level identifications of larval specimens. Sequence data from this study also contribute to currently inadequate public databases relevant to the Great Lakes region, while the neighbor-joining analysis reported here describes the application and confirmation of a useful tool that can accelerate identification and bioassesment of chironomid communities.
Egener, Tanja; Martin, Dietmar E.; Sarkar, Abhijit; Reinhold-Hurek, Barbara
2001-01-01
The endophytic diazotroph Azoarcus sp. strain BH72 is capable of infecting rice roots and of expressing the nitrogenase (nif) genes there. In order to study the genetic background for nitrogen fixation in strain BH72, the structural genes of nitrogenase (nifHDK) were cloned and sequenced. The sequence analysis revealed an unusual gene organization: downstream of nifHDK, a ferredoxin gene (fdxN; 59% amino acid sequence identity to R. capsulatus FdxN) and open reading frames showing 52 and 36% amino acid sequence identity to nifY of Pseudomonas stutzeri A15 and ORF1 of Azotobacter vinelandii were located. Northern blot analysis, reverse transcriptase PCR and primer extension analysis revealed that these six genes are located on one transcript transcribed from a ς54-type promoter. Shorter transcripts sequentially missing genes of the 3′ part of the full-length mRNA were more abundantly detected. Mutational analyses suggested that FdxN is an important but not the essential electron donor for dinitrogenase reductase. An in-frame deletion of fdxN resulted in reduced growth rates (59% ± 9%) and nitrogenase activities (81%) in nitrogen-fixing pure cultures in comparison to the wild type. Nitrogenase activity was fully complemented in an fdxN mutant which carried a nifH promoter-driven fdxN gene in trans. Also, in coculture with the ascomycete Acremonium alternatum, where strain BH72 develops intracytoplasmic membrane stacks, the nitrogenase activity in the fdxN deletion mutant was decreased to 56% of the wild-type level. Surprisingly, the fdxN deletion also had an effect on the rapid “switch-off” of nitrogenase activity in response to ammonium. Wild-type strain BH72 and the deletion mutant complemented with fdxN in trans showed a rapid reversible inactivation of acetylene reduction, while the deletion mutant did not cease to reduce acetylene. In concordance with the hypothesis that changes in the redox state of NifH or electron flux towards nitrogenase may be involved in the mechanism of physiological nitrogenase switch-off, our results suggest that the ferredoxin may be a component involved in this process. PMID:11371540
Pingault, Lise; Choulet, Frédéric; Alberti, Adriana; Glover, Natasha; Wincker, Patrick; Feuillet, Catherine; Paux, Etienne
2015-02-10
Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of the genome structure on gene organization, function, and regulation. However, because of the lack of a reference genome sequence, such studies have long been hampered and our knowledge of the wheat gene space is still limited. The access to the reference sequence of the wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before. By combining this sequence with RNA-seq data, we construct a fine transcriptome map of the chromosome 3B. More than 8,800 transcription sites are identified, that are distributed throughout the entire chromosome. Expression level, expression breadth, alternative splicing as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level. Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.
Windsor, Aaron J.; Schranz, M. Eric; Formanová, Nataša; Gebauer-Jung, Steffi; Bishop, John G.; Schnabelrauch, Domenica; Kroymann, Juergen; Mitchell-Olds, Thomas
2006-01-01
Comparative genomics provides insight into the evolutionary dynamics that shape discrete sequences as well as whole genomes. To advance comparative genomics within the Brassicaceae, we have end sequenced 23,136 medium-sized insert clones from Boechera stricta, a wild relative of Arabidopsis (Arabidopsis thaliana). A significant proportion of these sequences, 18,797, are nonredundant and display highly significant similarity (BLASTn e-value ≤ 10−30) to low copy number Arabidopsis genomic regions, including more than 9,000 annotated coding sequences. We have used this dataset to identify orthologous gene pairs in the two species and to perform a global comparison of DNA regions 5′ to annotated coding regions. On average, the 500 nucleotides upstream to coding sequences display 71.4% identity between the two species. In a similar analysis, 61.4% identity was observed between 5′ noncoding sequences of Brassica oleracea and Arabidopsis, indicating that regulatory regions are not as diverged among these lineages as previously anticipated. By mapping the B. stricta end sequences onto the Arabidopsis genome, we have identified nearly 2,000 conserved blocks of microsynteny (bracketing 26% of the Arabidopsis genome). A comparison of fully sequenced B. stricta inserts to their homologous Arabidopsis genomic regions indicates that indel polymorphisms >5 kb contribute substantially to the genome size difference observed between the two species. Further, we demonstrate that microsynteny inferred from end-sequence data can be applied to the rapid identification and cloning of genomic regions of interest from nonmodel species. These results suggest that among diploid relatives of Arabidopsis, small- to medium-scale shotgun sequencing approaches can provide rapid and cost-effective benefits to evolutionary and/or functional comparative genomic frameworks. PMID:16607030
Discovery of Novel Wall Teichoic Acid Inhibitors as Effective anti-MRSA β-lactam Combination Agents
Wang, Hao; Gill, Charles J.; Lee, Sang H.; Mann, Paul; Zuck, Paul; Meredith, Timothy C.; Murgolo, Nicholas; She, Xinwei; Kales, Susan; Liang, Lianzhu; Liu, Jenny; Wu, Jin; Maria, John Santa; Su, Jing; Pan, Jianping; Hailey, Judy; Mcguinness, Debra; Tan, Christopher M.; Flattery, Amy; Walker, Suzanne; Black, Todd; Roemer, Terry
2013-01-01
Summary Innovative strategies are needed to combat drug resistance associated with methicillin-resistant Staphylococcus aureus (MRSA). Here, we investigate the potential of wall teichoic acid (WTA) biosynthesis inhibitors as combination agents to restore β-lactam efficacy against MRSA. Performing a whole cell pathway-based screen we identified a series of WTA inhibitors (WTAIs) targeting the WTA transporter protein, TarG. Whole genome sequencing of WTAI resistant isolates across two methicillin-resistant Staphylococci spp. revealed TarG as their common target, as well as a broad assortment of drug resistant bypass mutants mapping to earlier steps of WTA biosynthesis. Extensive in vitro microbiological analysis and animal infection studies provide strong genetic and pharmacological evidence of the potential effectiveness of WTAIs as anti-MRSA β-lactam combination agents. This work also highlights the emerging role of whole genome sequencing in antibiotic mode-of-action and resistance studies. PMID:23438756
Casein expression in cytotoxic T lymphocytes.
Grusby, M J; Mitchell, S C; Nabavi, N; Glimcher, L H
1990-01-01
A cDNA that expresses a mRNA restricted to cytotoxic T lymphocytes (CTL) and mammary tissue has been isolated and characterized. The deduced amino acid sequence from this cDNA shows extensive homology with the previously reported amino acid sequence for rat alpha-casein. Indeed, the presence of a six-residue-repeated motif that is specific for rodent alpha-caseins strongly supports the identification of this cDNA as mouse alpha-casein. Northern (RNA) blot analysis of many hematopoietic cell types revealed that this gene is restricted to CTL, being expressed in four of six CTL lines examined. Furthermore, CTL that express this gene were also found to express other members of the casein gene family, such as beta- and kappa-casein. These results suggest that caseins may be important in CTL function, and their potential role in CTL-mediated lysis is discussed. Images PMID:2395885
The transmission dynamics and diversity of human metapneumovirus in Peru.
Pollett, Simon; Trovão, Nidia S; Tan, Yi; Eden, John-Sebastian; Halpin, Rebecca A; Bera, Jayati; Das, Suman R; Wentworth, David; Ocaña, Victor; Mendocilla, Silvia M; Álvarez, Carlos; Calisto, Maria E; Garcia, Josefina; Halsey, Eric; Ampuero, Julia S; Nelson, Martha I; Leguia, Mariana
2017-12-29
The transmission dynamics of human metapneumovirus (HMPV) in tropical countries remain unclear. Further understanding of the genetic diversity of the virus could aid in HMPV vaccine design and improve our understanding of respiratory virus transmission dynamics in low- and middle-income countries. We examined the evolution of HMPV in Peru through phylogenetic analysis of 61 full genome HMPV sequences collected in three ecologically diverse regions of Peru (Lima, Piura, and Iquitos) during 2008-2012, comprising the largest data set of HMPV whole genomes sequenced from any tropical country to date. We revealed extensive genetic diversity generated by frequent viral introductions, with little evidence of local persistence. While considerable viral traffic between non-Peruvian countries and Peru was observed, HMPV epidemics in Peruvian locales were more frequently epidemiologically linked with other sites within Peru. We showed that Iquitos experienced greater HMPV traffic than the similar sized city of Piura by both Bayesian and maximum likelihood methods. There is extensive HMPV genetic diversity even within smaller and relatively less connected cities of Peru and this virus is spatially fluid. Greater diversity of HMPV in Iquitos compared to Piura may relate to higher volumes of human movement, including air traffic to this location. © 2017 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
A sequence-dependent rigid-base model of DNA
NASA Astrophysics Data System (ADS)
Gonzalez, O.; Petkevičiutė, D.; Maddocks, J. H.
2013-02-01
A novel hierarchy of coarse-grain, sequence-dependent, rigid-base models of B-form DNA in solution is introduced. The hierarchy depends on both the assumed range of energetic couplings, and the extent of sequence dependence of the model parameters. A significant feature of the models is that they exhibit the phenomenon of frustration: each base cannot simultaneously minimize the energy of all of its interactions. As a consequence, an arbitrary DNA oligomer has an intrinsic or pre-existing stress, with the level of this frustration dependent on the particular sequence of the oligomer. Attention is focussed on the particular model in the hierarchy that has nearest-neighbor interactions and dimer sequence dependence of the model parameters. For a Gaussian version of this model, a complete coarse-grain parameter set is estimated. The parameterized model allows, for an oligomer of arbitrary length and sequence, a simple and explicit construction of an approximation to the configuration-space equilibrium probability density function for the oligomer in solution. The training set leading to the coarse-grain parameter set is itself extracted from a recent and extensive database of a large number of independent, atomic-resolution molecular dynamics (MD) simulations of short DNA oligomers immersed in explicit solvent. The Kullback-Leibler divergence between probability density functions is used to make several quantitative assessments of our nearest-neighbor, dimer-dependent model, which is compared against others in the hierarchy to assess various assumptions pertaining both to the locality of the energetic couplings and to the level of sequence dependence of its parameters. It is also compared directly against all-atom MD simulation to assess its predictive capabilities. The results show that the nearest-neighbor, dimer-dependent model can successfully resolve sequence effects both within and between oligomers. For example, due to the presence of frustration, the model can successfully predict the nonlocal changes in the minimum energy configuration of an oligomer that are consequent upon a local change of sequence at the level of a single point mutation.
A sequence-dependent rigid-base model of DNA.
Gonzalez, O; Petkevičiūtė, D; Maddocks, J H
2013-02-07
A novel hierarchy of coarse-grain, sequence-dependent, rigid-base models of B-form DNA in solution is introduced. The hierarchy depends on both the assumed range of energetic couplings, and the extent of sequence dependence of the model parameters. A significant feature of the models is that they exhibit the phenomenon of frustration: each base cannot simultaneously minimize the energy of all of its interactions. As a consequence, an arbitrary DNA oligomer has an intrinsic or pre-existing stress, with the level of this frustration dependent on the particular sequence of the oligomer. Attention is focussed on the particular model in the hierarchy that has nearest-neighbor interactions and dimer sequence dependence of the model parameters. For a Gaussian version of this model, a complete coarse-grain parameter set is estimated. The parameterized model allows, for an oligomer of arbitrary length and sequence, a simple and explicit construction of an approximation to the configuration-space equilibrium probability density function for the oligomer in solution. The training set leading to the coarse-grain parameter set is itself extracted from a recent and extensive database of a large number of independent, atomic-resolution molecular dynamics (MD) simulations of short DNA oligomers immersed in explicit solvent. The Kullback-Leibler divergence between probability density functions is used to make several quantitative assessments of our nearest-neighbor, dimer-dependent model, which is compared against others in the hierarchy to assess various assumptions pertaining both to the locality of the energetic couplings and to the level of sequence dependence of its parameters. It is also compared directly against all-atom MD simulation to assess its predictive capabilities. The results show that the nearest-neighbor, dimer-dependent model can successfully resolve sequence effects both within and between oligomers. For example, due to the presence of frustration, the model can successfully predict the nonlocal changes in the minimum energy configuration of an oligomer that are consequent upon a local change of sequence at the level of a single point mutation.
Suzuki, Keijiro; Sugawara, Takeshi; Ishida, Yoji; Suwabe, Akira
2013-02-01
Congenital coagulation factor VII (FVII) deficiency is a rare coagulation disease. We investigated the molecular mechanisms of this FVII deficiency in a patient with compound heterozygous mutations. A 22-year-old Japanese female was diagnosed with asymptomatic FVII deficiency. The FVII activity and antigen were greatly reduced (activity, 13.0%; antigen, 10.8%). We analyzed the F7 gene of this patient and characterized mutant FVII proteins using in vitro expression studies. Sequence analysis revealed that the patient was compound heterozygous with a point mutation (p.Leu13Pro) in the central hydrophobic core of the signal peptides and a novel non-sense mutation (p.Tyr294*) in the catalytic domain. Expression studies revealed that mutant FVII with p.Leu13Pro (FVII13P) showed less accumulation in the cells (17.5%) and less secretion into the medium (64.8%) than wild type showed. Truncated FVII resulting from p.Tyr294* (FVII294X) was also decreased in the cells (32.0%), but was not secreted into the medium. Pulse-chase experiments revealed that both mutants were extensively degraded intracellularly compared to wild type. The majority of FVII13P cannot translocate into endoplasmic reticulum (ER). However, a small amount of FVII13P was processed normally with post-translational modifications and was secreted into the medium. The fact that FVII294X was observed only in ER suggests that it is retained in ER. Proteasome apparently plays a central role in these degradations. These findings demonstrate that both mutant FVIIs impaired secretion through ineffective translocation to and retention in ER with extensive intracellular degradation, resulting in an insufficient phenotype. Copyright © 2012 Elsevier Ltd. All rights reserved.
Determinants of Child Attachment in the Years Postpartum in a High-Risk Sample of Immigrant Women.
Lecompte, Vanessa; Rousseau, Cécile
2017-10-07
Our goal was to examine maternal mental health and associated stresses in a sample of high-risk immigrant mothers, and its association with child insecure attachment in the years following childbirth. Mothers and their child (Mage = 37 months) were recruited through a Health and Social Service organization in the Parc-Extension neighborhood in Montreal, Quebec. Mothers completed the Hopkins Symptoms Checklist (HSCL-25), the Multidimensional Scale of Perceived Social Support (MPSS) and a sociodemographic questionnaire that included questions on premature delivery and birth weight. Attachment behaviors were coded out of a videotaped free play sequence using the Preschool and Early School-Age Attachment Rating Scales (PARS). Analysis revealed high levels of clinical anxiety and depression, low social support and low attachment security. Significant mean differences and associations were found between anxiety, depression, social support, preterm delivery and child attachment. These results underscore the importance of screening for anxiety and depression early in the postnatal years, in order to prevent associated consequences such as child insecure attachment. Results also highlight the importance of building positive social networks, especially with immigrant populations.
Adriano, E A; Okamura, B
2017-02-01
The Myxozoa demonstrate extensive morphological simplification and miniaturization relative to their free-living cnidarian ancestors. This is particularly pronounced in the highly derived myxosporeans, which develop as plasmodia and pseudoplasmodia. To date, motility in these stages has been linked with membrane deformation (e.g. as pseudopodia and mobile folds). Here we illustrate a motile, elongate plasmodium that undergoes coordinated undulatory locomotion, revealing remarkable convergence to a functional worm at the cellular level. Ultrastructural and confocal analyses of these plasmodia identify a highly differentiated external layer containing an actin-rich network, long tubular mitochondria, abundant microtubules, a secreted glycocalyx layer, and an internal region where sporogony occurs and which contains homogeneously distributed granular/fibrillar material. We consider how some of these features may support motility. We also describe the species based on spore morphology and SSU rDNA sequence data, undertake molecular phylogenetic analysis to place it within an early-diverging clade of the ceratomyxids, and evaluate the resultant implications for classification (validity of the genus Meglitschia) and for inferring early host environments (freshwater) of ceratomyxids.
Schönberger, Jan; Möhlenbruch, Markus; Seitz, Angelika; Bußmann, Cornelia; Bächli, Heidi; Kölker, Stefan
2017-07-01
Spontaneous intracranial hypotension is a rarely diagnosed cause of headache, especially in children and adolescents. It is due to cerebrospinal fluid (CSF) leakage via spinal fistulae occurring without major trauma. An adolescent patient presented with a 3-month history of strictly postural headache. Cranial magnetic resonance imaging (MRI) showed pronounced Chiari-like prolapse of the cerebellar tonsils, narrow ventricles and enlarged cerebral veins. On spinal MRI, myelographic sequences revealed a large collection of CSF around the first sacral roots. CT myelography proved extensive spinal CSF leakage. Hence, we applied epidural patches at multiple levels. Afterwards, symptoms and radiologic findings, including Chiari-like displacement, completely resolved. A Chiari-like descent of the cerebellar tonsils alone does not secure the diagnosis of a Chiari I malformation. Especially if other findings indicate spinal CSF leakage, a systematic work-up should be initiated. In most cases, interventional techniques seal the leak successfully, resulting in a favorable outcome. Copyright © 2017 European Paediatric Neurology Society. Published by Elsevier Ltd. All rights reserved.
Genetic Regulation of Fibroblast Activation and Proliferation in Cardiac Fibrosis.
Park, Shuin; Ranjbarvazirj, Sara; Lay, Fides D; Zhao, Peng; Miller, Mark J; Dhaliwal, Jasmeet S; Huertas-Vazquez, Adriana; Wu, Xiuju; Qiao, Rong; Soffer, Justin M; Mikkola, Hanna K A; Lusis, Aldons J; Ardehali, Reza
2018-06-27
Background -Genetic diversity and the heterogeneous nature of cardiac fibroblasts (CFbs) have hindered characterization of the molecular mechanisms that regulate cardiac fibrosis. The Hybrid Mouse Diversity Panel (HMDP) offers a valuable tool to examine genetically diverse cardiac fibroblasts and their role in fibrosis. Methods -Three strains of mice (C57BL/6J, C3H/HeJ, and KK/HlJ) were selected from the HMDP and treated with either isoproterenol (ISO) or saline by an intraperitoneally implanted osmotic pump. After 21 days, cardiac function and levels of fibrosis were measured by echocardiography and trichrome staining, respectively. Activation and proliferation of CFbs were measured by in vitro and in vivo assays under normal and injury conditions. RNA-sequencing was done on isolated CFbs from each strain and results were analyzed by Ingenuity Pathway Analysis (IPA) and validated by reverse transcription-qPCR, immunohistochemistry, and ELISA. Results -ISO treatment in C57BL/6J, C3H/HeJ, and KK/HlJ mice resulted in minimal, moderate, and extensive levels of fibrosis, respectively (n = 7-8 hearts/condition). Isolated CFbs treated with ISO exhibited strain-specific increases in the levels of activation but showed comparable levels of proliferation. Similar results were found in vivo , with fibroblast activation, and not proliferation, correlating with the differential levels of cardiac fibrosis after ISO treatment. RNA-sequencing revealed that CFbs from each strain exhibit unique gene expression changes in response to ISO. We identified Ltbp2 as a commonly upregulated gene after ISO treatment. Expression of LTBP2 was elevated and specifically localized in the fibrotic regions of the myocardium after injury in mice and in human heart failure patients. Conclusions -This study highlights the importance of genetic variation in cardiac fibrosis by using multiple inbred mouse strains to characterize CFbs and their response to ISO treatment. Our data suggest that, while fibroblast activation is a response that parallels the extent of scar formation, proliferation may not necessarily correlate with levels of fibrosis. Additionally, by comparing CFbs from multiple strains, we identified pathways as potential therapeutic targets and LTBP2 as a marker for fibrosis, with relevance to patients with underlying myocardial fibrosis.
Reddy, M K; Nair, S; Singh, B N; Mudgil, Y; Tewari, K K; Sopory, S K
2001-01-24
We report the cloning and sequencing of both cDNA and genomic DNA of a 33 kDa chloroplast ribonucleoprotein (33RNP) from pea. The analysis of the predicted amino acid sequence of the cDNA clone revealed that the encoded protein contains two RNA binding domains, including the conserved consensus ribonucleoprotein sequences CS-RNP1 and CS-RNP2, on the C-terminus half and the presence of a putative transit peptide sequence in the N-terminus region. The phylogenetic and multiple sequence alignment analysis of pea chloroplast RNP along with RNPs reported from the other plant sources revealed that the pea 33RNP is very closely related to Nicotiana sylvestris 31RNP and 28RNP and also to 31RNP and 28RNP of Arabidopsis and spinach, respectively. The pea 33RNP was expressed in Escherichia coli and purified to homogeneity. The in vitro import of precursor protein into chloroplasts confirmed that the N-terminus putative transit peptide is a bona fide transit peptide and 33RNP is localized in the chloroplast. The nucleic acid-binding properties of the recombinant protein, as revealed by South-Western analysis, showed that 33RNP has higher binding affinity for poly (U) and oligo dT than for ssDNA and dsDNA. The steady state transcript level was higher in leaves than in roots and the expression of this gene is light stimulated. Sequence analysis of the genomic clone revealed that the gene contains four exons and three introns. We have also isolated and analyzed the 5' flanking region of the pea 33RNP gene.
Biochemistry and genetics of actinomycete cellulases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilson, D.B.
1992-01-01
The order Actinomycetales includes a number of genera that contain species that actively degrade cellulose and these include both mesophilic and facultative thermophilic species. Cellulases produced by strains from two of the genera containing thermophilic organisms have been studied extensively: Microbispora bispora and Thermomonospora fusca. Fractionation of M. bispora cellulases has identified six different enzymes, all of which were purified to near homogeneity and partially characterized. Two of these enzymes appear to be exocellulases and gave synergism with each other and with the endocellulases. The structural genes of five M. bispora cellulases have been cloned and one was sequenced. Fractionationmore » of T. fusca cellulases has identified five different enzymes, all of which were purified to near homogeneity and partially characterized. One of the T. fusca enzymes gives synergism in the hydrolysis of crystalline cellulose with several T. fusca endocellulases and with Trichoderma reesei CBHI but not with T. reesei CBHII. Each T. fusca cellulase contains distinct catalytic and cellulose binding domains. The structural genes of four of the T. fusca endoglucanases have been cloned and sequenced, while three cellulase genes have been cloned from T. curvata. The T. fusca cellulase genes are expressed at a low level in Escherichia coli, but at a high level in Streptomyces lividans. Sequence comparisons have shown that there are no significant amino acid homologies between any of the catalytic domains of the four T. fusca cellulases, but each of them shows extensive homology to several other cellulases and fits in one of the five existing cellulase gene families. 73 refs., 8 figs., 4 tabs.« less
Gene structure and evolution of transthyretin in the order Chiroptera.
Khwanmunee, Jiraporn; Leelawatwattana, Ladda; Prapunpoj, Porntip
2016-02-01
Bats are mammals in the order Chiroptera. Although many extensive morphologic and molecular genetics analyses have been attempted, phylogenetic relationships of bats has not been completely resolved. The paraphyly of microbats is of particular controversy that needs to be confirmed. In this study, we attempted to use the nucleotide sequence of transthyretin (TTR) intron 1 to resolve the relationship among bats. To explore its utility, the complete sequences of TTR gene and intron 1 region of bats in Vespertilionidae: genus Eptesicus (Eptesicus fuscus) and genus Myotis (Myotis brandtii, Myotis davidii, and Myotis lucifugus), and Pteropodidae (Pteropus alecto and Pteropus vampyrus) were extracted from the retrieved sequences, whereas those of Rhinoluphus affinis and Scotophilus kuhlii were amplified and sequenced. The derived overall amino sequences of bat TTRs were found to be very similar to those in other eutherians but differed from those in other classes of vertebrates. However, missing of amino acids from N-terminal or C-terminal region was observed. The phylogenetic analysis of amino acid sequences suggested bat and other eutherian TTRs lineal descent from a single most recent common ancestor which differed from those of non-placental mammals and the other classes of vertebrates. The splicing of bat TTR precursor mRNAs was similar to those of other eutherian but different from those of marsupial, bird, reptile and amphibian. Based on TTR intron 1 sequence, the inferred evolutionary relationship within Chiroptera revealed more closely relatedness of R. affinis to megabats than to microbats. Accordingly, the paraphyly of microbats was suggested.
Naganeeswaran, Sudalaimuthu Asari; Subbian, Elain Apshara; Ramaswamy, Manimekalai
2012-01-01
Phytophthora megakarya, the causative agent of cacao black pod disease in West African countries causes an extensive loss of yield. In this study we have analyzed 4 libraries of ESTs derived from Phytophthora megakarya infected cocoa leaf and pod tissues. Totally 6379 redundant sequences were retrieved from ESTtik database and EST processing was performed using seqclean tool. Clustering and assembling using CAP3 generated 3333 non-redundant (907 contigs and 2426 singletons) sequences. The primary sequence analysis of 3333 non-redundant sequences showed that the GC percentage was 42.7 and the sequence length ranged from 101 - 2576 nucleotides. Further, functional analysis (Blast, Interproscan, Gene ontology and KEGG search) were executed and 1230 orthologous genes were annotated. Totally 272 enzymes corresponding to 114 metabolic pathways were identified. Functional annotation revealed that most of the sequences are related to molecular function, stress response and biological processes. The annotated enzymes are aldehyde dehydrogenase (E.C: 1.2.1.3), catalase (E.C: 1.11.1.6), acetyl-CoA C-acetyltransferase (E.C: 2.3.1.9), threonine ammonia-lyase (E.C: 4.3.1.19), acetolactate synthase (E.C: 2.2.1.6), O-methyltransferase (E.C: 2.1.1.68) which play an important role in amino acid biosynthesis and phenyl propanoid biosynthesis. All this information was stored in MySQL database management system to be used in future for reconstruction of biotic stress response pathway in cocoa.
Song, Ha Yeun; Mabuchi, Kohji; Satoh, Takashi P; Moore, Jon A; Yamanoue, Yusuke; Miya, Masaki; Nishida, Mutsumi
2014-06-01
Percomorpha, comprising about 60% of modern teleost fishes, has been described as the "(unresolved) bush at the top" of the tree, with its intrarelationships still being ambiguous owing to huge diversity (>15,000 species). Recent molecular phylogenetic studies based on extensive taxon and character sampling, however, have revealed a number of unexpected clades of Percomorpha, and one of which is composed of Syngnathoidei (seahorses, pipefishes, and their relatives) plus several groups distributed across three different orders. To circumscribe the clade more definitely, we sampled several candidate taxa with reference to the previous studies and newly determined whole mitochondrial genome (mitogenome) sequences for 16 percomorph species across syngnathoids, dactylopterids, and their putatively closely-related fishes (Mullidae, Callionymoidei, Malacanthidae). Unambiguously aligned sequences (13,872 bp) from those 16 species plus 78 percomorphs and two outgroups (total 96 species) were subjected to partitioned Bayesian and maximum likelihood analyses. The resulting trees revealed a highly supported clade comprising seven families in Syngnathoidei (Gasterosteiformes), Dactylopteridae (Scorpaeniformes), Mullidae in Percoidei and two families in Callionymoidei (Perciformes). We herein proposed to call this clade "Syngnathiformes" following the latest nuclear DNA studies with some revisions on the included families. Copyright © 2014 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Suzuki, Kazuo; Yasunami, Michio; Matsuda, Yoichi
1996-09-01
Embryonic TEA domain-containing factor (ETF) belongs to the family of proteins structurally related to transcriptional enhancer factor-1 (TEF-1) and is implicated in neural development. Isolation and characterization of the cosmid clones encoding the mouse ETF gene (Etdf) revealed that Etdf spans approximately 17.9 kb and consists of 12 exons. The exon-intron structure of Etdf closely resembles that of the Drosophila scalloped gene, indicating that these genes may have evolved from a common ancestor. Then multiple transcription initiation sites revealed by S1 protection and primer extension analyses are consistent with the absence of the canonical TATA and CAAT boxes in themore » 5{prime}-flanking region, which contains many potential regulatory sequences, such as the E-box, N-box, Sp1 element, GATA-1 element, TAATGARAT element, and B2 short interspersed element (SINE) as well as several direct and inverted repeat sequences. The Etdf locus was assigned to the proximal region of mouse chromosome 7 using fluorescence in situ hybridization and linkage mapping analyses. These results provide the molecular basis for studying the regulation, in vivo function, and evolution of Etdf. 29 refs., 5 figs., 1 tab.« less
Suzuki, K; Yasunami, M; Matsuda, Y; Maeda, T; Kobayashi, H; Terasaki, H; Ohkubo, H
1996-09-01
Embryonic TEA domain-containing factor (ETF) belongs to the family of proteins structurally related to transcriptional enhancer factor-1 (TEF-1) and is implicated in neural development. Isolation and characterization of the cosmid clones encoding the mouse ETF gene (Etdf) revealed that Etdf spans approximately 17.9 kb and consists of 12 exons. The exon-intron structure of Etdf closely resembles that of the Drosophila scalloped gene, indicating that these genes may have evolved from a common ancestor. The multiple transcription initiation sites revealed by S1 protection and primer extension analyses are consistent with the absence of the canonical TATA and CAAT boxes in the 5'-flanking region, which contains many potential regulatory sequences, such as the E-box, N-box, Sp1 element, GATA-1 element, TAATGARAT element, and B2 short interspersed element (SINE) as well as several direct and inverted repeat sequences. The Etdf locus was assigned to the proximal region of mouse chromosome 7 using fluorescence in situ hybridization and linkage mapping analyses. These results provide the molecular basis for studying the regulation, in vivo function, and evolution of Etdf.
Gao, Feng; Song, Weibo; Katz, Laura A.
2014-01-01
In most lineages, diversity among gene family members results from gene duplication followed by sequence divergence. Because of the genome rearrangements during the development of somatic nuclei, gene family evolution in ciliates involves more complex processes. Previous work on the ciliate Chilodonella uncinata revealed that macronuclear β-tubulin gene family members are generated by alternative processing, in which germline regions are alternatively used in multiple macronuclear chromosomes. To further study genome evolution in this ciliate, we analyzed its transcriptome and found that: 1) alternative processing is extensive among gene families; and 2) such gene families are likely to be C. uncinata-specific. We characterized additional macronuclear and micronuclear copies of one candidate alternatively processed gene family -- a protein kinase domain containing protein (PKc) -- from two C. uncinata strains. Analysis of the PKc sequences reveals: 1) multiple PKc gene family members in the macronucleus share some identical regions flanked by divergent regions; and 2) the shared identical regions are processed from a single micronuclear chromosome. We discuss analogous processes in lineages across the eukaryotic tree of life to provide further insights on the impact of genome structure on gene family evolution in eukaryotes. PMID:24749903
Woebken, Dagmar; Burow, Luke C.; Prufert-Bebout, Leslie; ...
2012-01-12
N 2 fixation is a key process in photosynthetic microbial mats to support the nitrogen demands associated with primary production. Despite its importance, groups that actively fix N 2 and contribute to the input of organic N in these ecosystems still remain largely unclear. To investigate the active diazotrophic community in microbial mats from the Elkhorn Slough estuary, Monterey Bay, CA, USA, we conducted an extensive combined approach, including biogeochemical, molecular and high-resolution secondary ion mass spectrometry (NanoSIMS) analyses. Detailed analysis of dinitrogenase reductase (nifH) transcript clone libraries from mat samples that fixed N 2 at night indicated that cyanobacterialmore » nifH transcripts were abundant and formed a novel monophyletic lineage. Independent NanoSIMS analysis of 15N2-incubated samples revealed significant incorporation of 15N into small, non-heterocystous cyanobacterial filaments. Mat-derived enrichment cultures yielded a unicyanobacterial culture with similar filaments (named Elkhorn Slough Filamentous Cyanobacterium-1 (ESFC-1)) that contained nifH gene sequences grouping with the novel cyanobacterial lineage identified in the transcript clone libraries, displaying up to 100% amino-acid sequence identity. The 16S rRNA gene sequence recovered from this enrichment allowed for the identification of related sequences from Elkhorn Slough mats and revealed great sequence diversity in this cluster. Furthermore, by combining 15N 2 tracer experiments, fluorescence in situ hybridization and NanoSIMS, in situ N 2 fixation activity by the novel ESFC-1 group was demonstrated, suggesting that this group may be the most active cyanobacterial diazotroph in the Elkhorn Slough mat. Pyrotag sequences affiliated with ESFC-1 were recovered from mat samples throughout 2009, demonstrating the prevalence of this group. Here, this work illustrates that combining standard and single-cell analyses can link phylogeny and function to identify previously unknown key functional groups in complex ecosystems.« less
Pedraza-Reyes, M; Gutiérrez-Corona, F; Nicholson, W L
1994-01-01
Bacterial spores are highly resistant to killing by UV radiation and exhibit unique DNA photochemistry. UV irradiation of spore DNA results in formation of spore photoproduct (SP), the thymine dimer 5-thyminyl-5,6-dihydrothymine. Repair of SP occurs during germination of Bacillus subtilis spores by two distinct routes, either by the general nucleotide excision repair (uvr) pathway or by a novel SP-specific monomerization reaction mediated by the enzyme SP lyase, which is encoded by the spl gene. Repair of SP occurs early in spore germination and is independent of de novo protein synthesis, suggesting that the SP repair enzymes are synthesized during sporulation and are packaged in the dormant spore. To test this hypothesis, the expression of a translational spl-lacZ fusion integrated at the spl locus was monitored during B. subtilis growth and sporulation. beta-Galactosidase expression from the spl-lacZ fusion was silent during vegetative growth and was not DNA damage inducible, but it was activated at morphological stage III of sporulation specifically in the forespore compartment, coincident with activation of expression of the stage III marker enzyme glucose dehydrogenase. Expression of the spl-lacZ fusion was shown to be dependent upon the sporulation-specific RNA polymerase containing the sigma-G factor (E sigma G), as spl-lacZ expression was abolished in a mutant harboring a deletion in the sigG gene and restored by expression of the sigG gene in trans. Primer extension analysis of spl mRNA revealed a major extension product initiating upstream from a small open reading frame of unknown function which precedes spl, and it revealed two other shorter minor extension products. All three extension products were present in higher quantities during sporulation and after sigG induction. The three putative transcripts are all preceded by sequences which share homology with the consensus sigma-G factor-type promoter sequence, but in vitro transcription by purified sigma-G RNA polymerase was detected only from the promoter corresponding to the major extension product. The open reading frame-spl operon therefore appears to be an additional member of the sigma-G regulon, which also includes as members the small, acid-soluble spore proteins which are in large part responsible for spore DNA photochemistry. Therefore, sporulating bacteria appear to coordinately regulate genes whose products not only alter spore DNA photochemistry but also repair the major spore-specific photoproduct during germination Images PMID:8021181
An extension of command shaping methods for controlling residual vibration using frequency sampling
NASA Technical Reports Server (NTRS)
Singer, Neil C.; Seering, Warren P.
1992-01-01
The authors present an extension to the impulse shaping technique for commanding machines to move with reduced residual vibration. The extension, called frequency sampling, is a method for generating constraints that are used to obtain shaping sequences which minimize residual vibration in systems such as robots whose resonant frequencies change during motion. The authors present a review of impulse shaping methods, a development of the proposed extension, and a comparison of results of tests conducted on a simple model of the space shuttle robot arm. Frequency shaping provides a method for minimizing the impulse sequence duration required to give the desired insensitivity.
Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu
Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui
2015-01-01
Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat. PMID:26132381
Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu.
Zhang, Yanlin; Luo, Guangbin; Liu, Dongcheng; Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui
2015-01-01
Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat.
Evidence for polymorphism in the cytochrome P450 2D50 gene in horses.
Corado, C R; McKemie, D S; Young, A; Knych, H K
2016-06-01
Metabolism is an essential factor in the clearance of many drugs and as such plays a major role in the establishment of dosage regimens and withdrawal times. CYP2D6, the human orthologue to equine CYP2D50, is a drug-metabolizing enzyme that is highly polymorphic in humans leading to widely differing levels of metabolic activity. As CYP2D6 is highly polymorphic, in this study it was hypothesized that the gene coding for the equine orthologue, CYP2D50, may also be prone to polymorphism. Blood samples were collected from 150 horses, the CYP2D50 gene was cloned and sequenced; and full-length sequences were analyzed for single nucleotide polymorphisms (SNPs), deletions, or insertions. Pharmacokinetic data were collected from a subset of horses following the administration of a single oral dose of tramadol and probit analysis used to calculate metabolic ratios. Prior to drug administration, the ability of recombinant CYP2D50 to metabolize tramadol to O-desmethyltramadol was confirmed. Sequencing of CYP2D50 identified 126 exonic SNPs, with 31 of those appearing in multiple horses. Oral administration of tramadol to a subset of these horses revealed variable metabolic ratios (tramadol: O-desmethyltramadol) in individual horses and separation into three metabolic groups. While a limited number of horses of primarily a single breed were studied, the variability in tramadol metabolism to O-desmethyltramadol between horses and preliminary evidence of what appears to be poor, extensive, and ultra-rapid metabolizers supports further study of the potential for genetic polymorphisms in the CYP2D50 gene in horses. © 2015 John Wiley & Sons Ltd.
Wang, Xiao-Wei; Zhao, Qiong-Yi; Luan, Jun-Bo; Wang, Yu-Jun; Yan, Gen-Hong; Liu, Shu-Sheng
2012-10-04
Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whiteflies species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), respectively. More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identify data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation. We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies which are helpful for the investigation of association between allelic and phenotypes. Our data present the most comprehensive sequences for the indigenous whitefly species Asia II 3. The extensive comparisons of Asia II 3, MEAM1 and MED transcriptomes will serve as an invaluable resource for revealing the genetic basis of whitefly invasion and the molecular mechanisms underlying their biological differences.
2012-01-01
Background Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whiteflies species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), respectively. Results More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identify data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation. We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies which are helpful for the investigation of association between allelic and phenotypes. Conclusions Our data present the most comprehensive sequences for the indigenous whitefly species Asia II 3. The extensive comparisons of Asia II 3, MEAM1 and MED transcriptomes will serve as an invaluable resource for revealing the genetic basis of whitefly invasion and the molecular mechanisms underlying their biological differences. PMID:23036081
Transcriptomics of the Bed Bug (Cimex lectularius)
Rajarapu, Swapna P.; Jones, Susan C.; Mittapalli, Omprakash
2011-01-01
Background Bed bugs (Cimex lectularius) are blood-feeding insects poised to become one of the major pests in households throughout the United States. Resistance of C. lectularius to insecticides/pesticides is one factor thought to be involved in its sudden resurgence. Despite its high-impact status, scant knowledge exists at the genomic level for C. lectularius. Hence, we subjected the C. lectularius transcriptome to 454 pyrosequencing in order to identify potential genes involved in pesticide resistance. Methodology and Principal Findings Using 454 pyrosequencing, we obtained a total of 216,419 reads with 79,596,412 bp, which were assembled into 35,646 expressed sequence tags (3902 contigs and 31744 singletons). Nearly 85.9% of the C. lectularius sequences showed similarity to insect sequences, but 44.8% of the deduced proteins of C. lectularius did not show similarity with sequences in the GenBank non-redundant database. KEGG analysis revealed putative members of several detoxification pathways involved in pesticide resistance. Lamprin domains, Protein Kinase domains, Protein Tyrosine Kinase domains and cytochrome P450 domains were among the top Pfam domains predicted for the C. lectularius sequences. An initial assessment of putative defense genes, including a cytochrome P450 and a glutathione-S-transferase (GST), revealed high transcript levels for the cytochrome P450 (CYP9) in pesticide-exposed versus pesticide-susceptible C. lectularius populations. A significant number of single nucleotide polymorphisms (296) and microsatellite loci (370) were predicted in the C. lectularius sequences. Furthermore, 59 putative sequences of Wolbachia were retrieved from the database. Conclusions To our knowledge this is the first study to elucidate the genetic makeup of C. lectularius. This pyrosequencing effort provides clues to the identification of potential detoxification genes involved in pesticide resistance of C. lectularius and lays the foundation for future functional genomics studies. PMID:21283830
Identification of a human synaptotagmin-1 mutation that perturbs synaptic vesicle cycling
Baker, Kate; Gordon, Sarah L.; Grozeva, Detelina; van Kogelenberg, Margriet; Roberts, Nicola Y.; Pike, Michael; Blair, Edward; Hurles, Matthew E.; Chong, W. Kling; Baldeweg, Torsten; Kurian, Manju A.; Boyd, Stewart G.; Cousin, Michael A.; Raymond, F. Lucy
2015-01-01
Synaptotagmin-1 (SYT1) is a calcium-binding synaptic vesicle protein that is required for both exocytosis and endocytosis. Here, we describe a human condition associated with a rare variant in SYT1. The individual harboring this variant presented with an early onset dyskinetic movement disorder, severe motor delay, and profound cognitive impairment. Structural MRI was normal, but EEG showed extensive neurophysiological disturbances that included the unusual features of low-frequency oscillatory bursts and enhanced paired-pulse depression of visual evoked potentials. Trio analysis of whole-exome sequence identified a de novo SYT1 missense variant (I368T). Expression of rat SYT1 containing the equivalent human variant in WT mouse primary hippocampal cultures revealed that the mutant form of SYT1 correctly localizes to nerve terminals and is expressed at levels that are approximately equal to levels of endogenous WT protein. The presence of the mutant SYT1 slowed synaptic vesicle fusion kinetics, a finding that agrees with the previously demonstrated role for I368 in calcium-dependent membrane penetration. Expression of the I368T variant also altered the kinetics of synaptic vesicle endocytosis. Together, the clinical features, electrophysiological phenotype, and in vitro neuronal phenotype associated with this dominant negative SYT1 mutation highlight presynaptic mechanisms that mediate human motor control and cognitive development. PMID:25705886
Balancing of B6 Vitamers Is Essential for Plant Development and Metabolism in Arabidopsis
Tohge, Takayuki; Fernie, Alisdair R.
2016-01-01
Vitamin B6 comprises a family of compounds that is essential for all organisms, most notable among which is the cofactor pyridoxal 5′-phosphate (PLP). Other forms of vitamin B6 include pyridoxamine 5′-phosphate (PMP), pyridoxine 5′-phosphate (PNP), and the corresponding nonphosphorylated derivatives. While plants can biosynthesize PLP de novo, they also have salvage pathways that serve to interconvert the different vitamers. The selective contribution of these various pathways to cellular vitamin B6 homeostasis in plants is not fully understood. Although biosynthesis de novo has been extensively characterized, the salvage pathways have received comparatively little attention in plants. Here, we show that the PMP/PNP oxidase PDX3 is essential for balancing B6 vitamer levels in Arabidopsis thaliana. In the absence of PDX3, growth and development are impaired and the metabolite profile is altered. Surprisingly, RNA sequencing reveals strong induction of stress-related genes in pdx3, particularly those associated with biotic stress that coincides with an increase in salicylic acid levels. Intriguingly, exogenous ammonium rescues the growth and developmental phenotype in line with a severe reduction in nitrate reductase activity that may be due to the overaccumulation of PMP in pdx3. Our analyses demonstrate an important link between vitamin B6 homeostasis and nitrogen metabolism. PMID:26858304
Georgi, Enrico; Walter, Mathias C.; Pfalzgraf, Marie-Theres; Northoff, Bernd H.; Holdt, Lesca M.; Scholz, Holger C.; Zoeller, Lothar
2017-01-01
Brucellosis, a worldwide common bacterial zoonotic disease, has become quite rare in Northern and Western Europe. However, since 2014 a significant increase of imported infections caused by Brucella (B.) melitensis has been noticed in Germany. Patients predominantly originated from Middle East including Turkey and Syria. These circumstances afforded an opportunity to gain insights into the population structure of Brucella strains. Brucella-isolates from 57 patients were recovered between January 2014 and June 2016 with culture confirmed brucellosis by the National Consultant Laboratory for Brucella. Their whole genome sequences were generated using the Illumina MiSeq platform. A whole genome-based SNP typing assay was developed in order to resolve geographically attributed genetic clusters. Results were compared to MLVA typing results, the current gold-standard of Brucella typing. In addition, sequences were examined for possible genetic variation within target regions of molecular diagnostic assays. Phylogenetic analyses revealed spatial clustering and distinguished strains from different patients in either case, whereas multiple isolates from a single patient or technical replicates showed identical SNP and MLVA profiles. By including WGS data from the NCBI database, five major genotypes were identified. Notably, strains originating from Turkey showed a high diversity and grouped into seven subclusters of genotype II. MLVA analysis congruently clustered all isolates and predominantly matched the East Mediterranean genetic clade. This study confirms whole-genome based SNP-analysis as a powerful tool for accurate typing of B. melitensis. Furthermore it allows special allocation and therefore provides useful information on the geographic origin for trace-back analysis. However, the lack of reliable metadata in public databases often prevents a resolution below geographic regions or country levels and corresponding precise trace-back analysis. Once this obstacle is resolved, WGS-derived bacterial typing adds an important method to complement epidemiological surveys during outbreak investigations. This is the first report of a detailed genetic investigation of an extensive collection of B. melitensis strains isolated from human cases in Germany. PMID:28388689
Wernegreen, Jennifer J
2017-09-15
Ancient associations between insects and bacteria provide models to study intimate host-microbe interactions. Currently, a wealth of genome sequence data for long-term, obligately intracellular (primary) endosymbionts of insects reveals profound genomic consequences of this specialized bacterial lifestyle. Those consequences include severe genome reduction and extreme base compositions. This minireview highlights the utility of genome sequence data to understand how, and why, endosymbionts have been pushed to such extremes, and to illuminate the functional consequences of such extensive genome change. While the static snapshots provided by individual endosymbiont genomes are valuable, comparative analyses of multiple genomes have shed light on evolutionary mechanisms. Namely, genome comparisons have told us that selection is important in fine-tuning gene content, but at the same time, mutational pressure and genetic drift contribute to genome degradation. Examples from Blochmannia, the primary endosymbiont of the ant tribe Camponotini, illustrate the value and constraints of genome sequence data, and exemplify how genomes can serve as a springboard for further comparative and experimental inquiry. Copyright © 2017. Published by Elsevier Inc.
Banik, Tenley J.; Wallace, Paul J.; Höskuldsson, Ármann; Miller, Calvin F.; Bacon, Charles R.; Furbish, David J.
2013-01-01
Products of subglacial volcanism can illuminate reconstructions of paleo-environmental conditions on both local and regional scales. Competing interpretations of Pleistocene conditions in south Iceland have been proposed based on an extensive sequence of repeating lava-and-hyaloclastite deposits in the Síða district. We propose here a new eruptive model and refine the glacial environment during eruption based on field research and analytical data for the Síða district lava/hyaloclastite units. Field observations from this and previous studies reveal a repeating sequence of cogenetic lava and hyaloclastite deposits extending many kilometers from their presumed eruptive source. Glasses from lava selvages and unaltered hyaloclastites have very low H2O, S, and CO2 concentrations, indicating significant degassing at or close to atmospheric pressure prior to quenching. We also present a scenario that demonstrates virtual co-emplacement of the two eruptive products. Our data and model results suggest repeated eruptions under thin ice or partially subaerial conditions, rather than eruption under a thick ice sheet or subglacial conditions as previously proposed.
Pandin, Caroline; Le Coq, Dominique; Deschamps, Julien; Védie, Régis; Rousseau, Thierry; Aymerich, Stéphane; Briandet, Romain
2018-04-24
Bacillus subtilis QST713 is extensively used as a biological control agent in agricultural fields including in the button mushroom culture, Agaricus bisporus. This last use exploits its inhibitory activity against microbial pathogens such as Trichoderma aggressivum f. europaeum, the main button mushroom green mould competitor. Here, we report the complete genome sequence of this bacterium with a genome size of 4 233 757 bp, 4263 predicted genes and an average GC content of 45.9%. Based on phylogenomic analyses, strain QST713 is finally designated as Bacillus velezensis. Genomic analyses revealed two clusters encoding potential new antimicrobials with NRPS and TransATPKS synthetase. B. velezensis QST713 genome also harbours several genes previously described as being involved in surface colonization and biofilm formation. This strain shows a strong ability to form in vitro spatially organized biofilm and to antagonize T. aggressivum. The availability of this genome sequence could bring new elements to understand the interactions with micro or/and macroorganisms in crops. Copyright © 2018 Elsevier B.V. All rights reserved.
Hynönen, Ulla; Rasinkangas, Pia; Satokari, Reetta; Paulin, Lars; de Vos, Willem M; Pietilä, Taija E; Kant, Ravi; Palva, Airi
2016-06-01
In our previous studies on the intestinal microbiota in irritable bowel syndrome (IBS), we identified a bacterial phylotype with higher abundance in patients suffering from diarrhea than in healthy controls. In the present work, we have isolated in pure culture strain RT94, belonging to this phylotype, determined its whole genome sequence and performed an extensive genomic analysis and phenotypical testing. This revealed strain RT94 to be a strict anaerobe apparently belonging to a novel species with only 94% similarity in the 16S rRNA gene sequence to the closest relatives Ruminococcus torques and Ruminococcus lactaris. The G + C content of strain RT94 is 45.2 mol% and the major long-chain cellular fatty acids are C16:0, C18:0 and C14:0. The isolate is metabolically versatile but not a mucus or cellulose utilizer. It produces acetate, ethanol, succinate, lactate and formate, but very little butyrate, as end products of glucose metabolism. The mechanisms underlying the association of strain RT94 with diarrhea-type IBS are discussed. Copyright © 2016 Elsevier Ltd. All rights reserved.
Deep Sequencing of the Oral Microbiome Reveals Signatures of Periodontal Disease
Ghodsi, Mohammad; Sommer, Daniel D.; Gibbons, Theodore R.; Treangen, Todd J.; Chang, Yi-Chien; Li, Shan; Stine, O. Colin; Hasturk, Hatice; Kasif, Simon; Segrè, Daniel; Pop, Mihai; Amar, Salomon
2012-01-01
The oral microbiome, the complex ecosystem of microbes inhabiting the human mouth, harbors several thousands of bacterial types. The proliferation of pathogenic bacteria within the mouth gives rise to periodontitis, an inflammatory disease known to also constitute a risk factor for cardiovascular disease. While much is known about individual species associated with pathogenesis, the system-level mechanisms underlying the transition from health to disease are still poorly understood. Through the sequencing of the 16S rRNA gene and of whole community DNA we provide a glimpse at the global genetic, metabolic, and ecological changes associated with periodontitis in 15 subgingival plaque samples, four from each of two periodontitis patients, and the remaining samples from three healthy individuals. We also demonstrate the power of whole-metagenome sequencing approaches in characterizing the genomes of key players in the oral microbiome, including an unculturable TM7 organism. We reveal the disease microbiome to be enriched in virulence factors, and adapted to a parasitic lifestyle that takes advantage of the disrupted host homeostasis. Furthermore, diseased samples share a common structure that was not found in completely healthy samples, suggesting that the disease state may occupy a narrow region within the space of possible configurations of the oral microbiome. Our pilot study demonstrates the power of high-throughput sequencing as a tool for understanding the role of the oral microbiome in periodontal disease. Despite a modest level of sequencing (∼2 lanes Illumina 76 bp PE) and high human DNA contamination (up to ∼90%) we were able to partially reconstruct several oral microbes and to preliminarily characterize some systems-level differences between the healthy and diseased oral microbiomes. PMID:22675498
Keller, J.; Rousseau-Gueutin, M.; Martin, G.E.; Morice, J.; Boutte, J.; Coissac, E.; Ourari, M.; Aïnouche, M.; Salmon, A.; Cabello-Hurtado, F.
2017-01-01
Abstract The Fabaceae family is considered as a model system for understanding chloroplast genome evolution due to the presence of extensive structural rearrangements, gene losses and localized hypermutable regions. Here, we provide sequences of four chloroplast genomes from the Lupinus genus, belonging to the underinvestigated Genistoid clade. Notably, we found in Lupinus species the functional loss of the essential rps16 gene, which was most likely replaced by the nuclear rps16 gene that encodes chloroplast and mitochondrion targeted RPS16 proteins. To study the evolutionary fate of the rps16 gene, we explored all available plant chloroplast, mitochondrial and nuclear genomes. Whereas no plant mitochondrial genomes carry an rps16 gene, many plants still have a functional nuclear and chloroplast rps16 gene. Ka/Ks ratios revealed that both chloroplast and nuclear rps16 copies were under purifying selection. However, due to the dual targeting of the nuclear rps16 gene product and the absence of a mitochondrial copy, the chloroplast gene may be lost. We also performed comparative analyses of lupine plastomes (SNPs, indels and repeat elements), identified the most variable regions and examined their phylogenetic utility. The markers identified here will help to reveal the evolutionary history of lupines, Genistoids and closely related clades. PMID:28338826
Garrison, J.R.; Van Den, Bergh; Barker, C.E.; Tabet, D.E.
1997-01-01
This Field Excursion will visit outcrops of the fluvial-deltaic Upper Cretaceous (Turonian) Ferron Sandstone Member of the Mancos Shale, known as the Last Chance delta or Upper Ferron Sandstone. This field guide and the field stops will outline the architecture and depositional sequence stratigraphy of the Upper Ferron Sandstone clastic wedge and explore the stratigraphic positions and compositions of major coal zones. The implications of the architecture and stratigraphy of the Ferron fluvial-deltaic complex for coal and coalbed methane resources will be discussed. Early works suggested that the southwesterly derived deltaic deposits of the the upper Ferron Sandstone clastic wedge were a Type-2 third-order depositional sequence, informally called the Ferron Sequence. These works suggested that the Ferron Sequence is separated by a type-2 sequence boundary from the underlying 3rd-order Hyatti Sequence, which has its sediment source from the northwest. Within the 3rd-order depositional sequence, the deltaic events of the Ferron clastic wedge, recognized as parasequence sets, appear to be stacked into progradational, aggradational, and retrogradational patterns reflecting a generally decreasing sediment supply during an overall slow sea-level rise. The architecture of both near-marine facies and non-marine fluvial facies exhibit well defined trends in response to this decrease in available sediment. Recent studies have concluded that, unless coincident with a depositional sequence boundary, regionally extensive coal zones occur at the tops of the parasequence sets within the Ferron clastic wedge. These coal zones consist of coal seams and their laterally equivalent fissile carbonaceous shales, mudstones, and siltstones, paleosols, and flood plain mudstones. Although the compositions of coal zones vary along depositional dip, the presence of these laterally extensive stratigraphic horizons, above parasequence sets, provides a means of correlating and defining the tops of depositional parasequence sets in both near-marine and non-marine parts of fluvial-deltaic depositional sequences. Ongoing field studies, based on this concept of coal zone stratigraphy, and detailed stratigraphic mapping, have documented the existence of at least 12 parasequence sets within the Last Chance delta clastic wedge. These parasequence sets appear to form four high frequency, 4th-order depositional sequences. The dramatic erosional unconformities, associated with these 4th-order sequence boundaries, indicate that there was up to 20-30 m of erosion, signifying locally substantial base-level drops. These base-level drops were accompanied by a basin ward shift in paleo-shorelines by as much as 5-7 km. These 4th-order Upper Ferron Sequences are superimposed on the 3rd-order sea-level rise event and the 3rd-order, sediment supply/accommodation space driven, stratigraphie architecture of the Upper Ferron Sandstone. The fluvial deltaic architecture shows little response to these 4th-order sea-level events. Coal zones generally thicken landward relative to the mean position of the landward pinch-out of the underlying parasequence set, but after some distance landward, they decrease in thickness. Coal zones also generally thin seaward relative to the mean position of the landward pinch-out of the underlying parasequence set. The coal is thickest in the region between this landward pinch-out and the position of maximum zone thickness. Data indicate that the proportion of coal in the coal zone decreases progressively landward from the landward pinch-out. The effects of differential compaction and differences in original pre-peat swamp topography have the effect of adding perturbations to the general trends. These coal zone systematics have major impact on approaches to exploration and production, and the resource accessment of both coal and coalbed methane.
Phylogenetic stratigraphy in the Guerrero Negro hypersaline microbial mat.
Harris, J Kirk; Caporaso, J Gregory; Walker, Jeffrey J; Spear, John R; Gold, Nicholas J; Robertson, Charles E; Hugenholtz, Philip; Goodrich, Julia; McDonald, Daniel; Knights, Dan; Marshall, Paul; Tufo, Henry; Knight, Rob; Pace, Norman R
2013-01-01
The microbial mats of Guerrero Negro (GN), Baja California Sur, Mexico historically were considered a simple environment, dominated by cyanobacteria and sulfate-reducing bacteria. Culture-independent rRNA community profiling instead revealed these microbial mats as among the most phylogenetically diverse environments known. A preliminary molecular survey of the GN mat based on only ∼1500 small subunit rRNA gene sequences discovered several new phylum-level groups in the bacterial phylogenetic domain and many previously undetected lower-level taxa. We determined an additional ∼119,000 nearly full-length sequences and 28,000 >200 nucleotide 454 reads from a 10-layer depth profile of the GN mat. With this unprecedented coverage of long sequences from one environment, we confirm the mat is phylogenetically stratified, presumably corresponding to light and geochemical gradients throughout the depth of the mat. Previous shotgun metagenomic data from the same depth profile show the same stratified pattern and suggest that metagenome properties may be predictable from rRNA gene sequences. We verify previously identified novel lineages and identify new phylogenetic diversity at lower taxonomic levels, for example, thousands of operational taxonomic units at the family-genus levels differ considerably from known sequences. The new sequences populate parts of the bacterial phylogenetic tree that previously were poorly described, but indicate that any comprehensive survey of GN diversity has only begun. Finally, we show that taxonomic conclusions are generally congruent between Sanger and 454 sequencing technologies, with the taxonomic resolution achieved dependent on the abundance of reference sequences in the relevant region of the rRNA tree of life.
Comparison of Next-Generation Sequencing Systems
Liu, Lin; Li, Yinhu; Li, Siliang; Hu, Ni; He, Yimin; Pong, Ray; Lin, Danni; Lu, Lihua; Law, Maggie
2012-01-01
With fast development and wide applications of next-generation sequencing (NGS) technologies, genomic sequence information is within reach to aid the achievement of goals to decode life mysteries, make better crops, detect pathogens, and improve life qualities. NGS systems are typically represented by SOLiD/Ion Torrent PGM from Life Sciences, Genome Analyzer/HiSeq 2000/MiSeq from Illumina, and GS FLX Titanium/GS Junior from Roche. Beijing Genomics Institute (BGI), which possesses the world's biggest sequencing capacity, has multiple NGS systems including 137 HiSeq 2000, 27 SOLiD, one Ion Torrent PGM, one MiSeq, and one 454 sequencer. We have accumulated extensive experience in sample handling, sequencing, and bioinformatics analysis. In this paper, technologies of these systems are reviewed, and first-hand data from extensive experience is summarized and analyzed to discuss the advantages and specifics associated with each sequencing system. At last, applications of NGS are summarized. PMID:22829749
LEE, Seung-Hun; PARK, Sang-Joon; KWAK, Dongmi; KIM, Kyoo-Tae
2017-01-01
A 16-year-old female Indian peafowl (Pavo cristatus) died two days after recognition of conjunctivitis in the right eye, anorexia and depression. Gross necropsy revealed a thick pseudomembrane under the eyelid and hydropericardium. Histopathological examination revealed hepatocellular necrosis, sinusoidal and vascular congestion and infiltrated inflammatory cells. Infiltration by inflammatory cells was noted in the epicardium. The lungs had mild interstitial pneumonia with the extensive congestion within the capillaries of the air sacs. Tubular interstitial congestion and necrosis was noted in the kidneys. Bacterial culture and nucleotide sequencing of the inflammatory specimens identified the causative agent as Serratia marcescens, an uncommon bacterium in birds. In summary, this study describes the sudden death of an Indian peafowl due to S. marcescens infection, which is rarely seen in animals. PMID:29081475
Co-circulation of West Nile virus and distinct insect-specific flaviviruses in Turkey.
Ergünay, Koray; Litzba, Nadine; Brinkmann, Annika; Günay, Filiz; Sarıkaya, Yasemen; Kar, Sırrı; Örsten, Serra; Öter, Kerem; Domingo, Cristina; Erisoz Kasap, Özge; Özkul, Aykut; Mitchell, Luke; Nitsche, Andreas; Alten, Bülent; Linton, Yvonne-Marie
2017-03-20
Active vector surveillance provides an efficient tool for monitoring the presence or spread of emerging or re-emerging vector-borne viruses. This study was undertaken to investigate the circulation of flaviviruses. Mosquitoes were collected from 58 locations in 10 provinces across the Aegean, Thrace and Mediterranean Anatolian regions of Turkey in 2014 and 2015. Following morphological identification, mosquitoes were pooled and screened by nested and real-time PCR assays. Detected viruses were further characterised by sequencing. Positive pools were inoculated onto cell lines for virus isolation. Next generation sequencing was employed for genomic characterisation of the isolates. A total of 12,711 mosquito specimens representing 15 species were screened in 594 pools. Eleven pools (2%) were reactive in the virus screening assays. Sequencing revealed West Nile virus (WNV) in one Culex pipiens (s.l.) pool from Thrace. WNV sequence corresponded to lineage one clade 1a but clustered distinctly from the Turkish prototype isolate. In 10 pools, insect-specific flaviviruses were characterised as Culex theileri flavivirus in 5 pools of Culex theileri and one pool of Cx. pipiens (s.l.), Ochlerotatus caspius flavivirus in two pools of Aedes (Ochlerotatus) caspius, Flavivirus AV-2011 in one pool of Culiseta annulata, and an undetermined flavivirus in one pool of Uranotaenia unguiculata from the Aegean and Thrace regions. DNA forms or integration of the detected insect-specific flaviviruses were not observed. A virus strain, tentatively named as "Ochlerotatus caspius flavivirus Turkey", was isolated from an Ae. caspius pool in C6/36 cells. The viral genome comprised 10,370 nucleotides with a putative polyprotein of 3,385 amino acids that follows the canonical flavivirus polyprotein organisation. Sequence comparisons and phylogenetic analyses revealed the close relationship of this strain with Ochlerotatus caspius flavivirus from Portugal and Hanko virus from Finland. Several conserved structural and amino acid motifs were identified. We identified WNV and several distinct insect-specific flaviviruses during an extensive biosurveillance study of mosquitoes in various regions of Turkey in 2014 and 2015. Ongoing circulation of WNV is revealed, with an unprecedented genetic diversity. A probable replicating form of an insect flavivirus identified only in DNA form was detected.
Laursen, J R; di Liu, H; Wu, X J; Yoshino, T P
1997-11-01
Sublethal heat-shock of cells of the Bge (Biomphalaria glabrata embryonic) snail cell line resulted in increased or new expression of metabolically labeled polypeptides of approximately 21.5, 41, 70, and 74 kDa molecular mass. Regulation of this response appeared to be at the transcriptional level since a similar protein banding pattern was seen upon SDS-PAGE/fluorographic analysis of polypeptides produced by in vitro translation of total RNA from cells subjected to heat shock. Using a yeast (Saccharomyces cerevisiae) 70-kDa heat-shock protein (HSP70) probe to screen a cDNA library from heat-treated Bge cells, we isolated a full-length cDNA clone encoding a putative Bge HSP70. The cDNA was 2453 bp in length and contained an open reading frame of 1908 bp encoding a 636-amino-acid polypeptide with calculated molecular mass of 70,740 Da. Comparison of a conserved region of 209 amino acid residues revealed > 80% identity between the deduced amino acid sequence of Bge HSP70 and that of yeast (81%), the human blood fluke Schistosoma mansoni (for which B. glabrata serves as intermediate host) (81%), Drosophila (81%), human (84%), and the marine gastropod Aplysia californica (88%, 90%). In addition to the extensive sharing of sequence homology, the identification of several eukaryotic HSP70 signature sequences and an N-linked glycosylation site characteristic of cytoplasmic HSPs strongly support the identity of the Bge cDNA as encoding an authentic HSP70. Results of a Northern blot analysis, using Bge HSP70 clone-specific probes, indicated that gene expression was heat inducible and not constitutively expressed. This is the first reported sequence of an inducible HSP70 from cells originating from a freshwater gastropod and provides a first step in the development of a genetic transformation system for molluscs of medical importance.
Angeloni, Debora; ter Elst, Arja; Wei, Ming Hui; van der Veen, Anneke Y; Braga, Eleonora A; Klimov, Eugene A; Timmer, Tineke; Korobeinikova, Luba; Lerman, Michael I; Buys, Charles H C M
2006-07-01
Homozygous deletions or loss of heterozygosity (LOH) at human chromosome band 3p12 are consistent features of lung and other malignancies, suggesting the presence of a tumor suppressor gene(s) (TSG) at this location. Only one gene has been cloned thus far from the overlapping region deleted in lung and breast cancer cell lines U2020, NCI H2198, and HCC38. It is DUTT1 (Deleted in U Twenty Twenty), also known as ROBO1, FLJ21882, and SAX3, according to HUGO. DUTT1, the human ortholog of the fly gene ROBO, has homology with NCAM proteins. Extensive analyses of DUTT1 in lung cancer have not revealed any mutations, suggesting that another gene(s) at this location could be of importance in lung cancer initiation and progression. Here, we report the discovery of a new, small, homozygous deletion in the small cell lung cancer (SCLC) cell line GLC20, nested in the overlapping, critical region. The deletion was delineated using several polymorphic markers and three overlapping P1 phage clones. Fiber-FISH experiments revealed the deletion was approximately 130 kb. Comparative genomic sequence analysis uncovered short sequence elements highly conserved among mammalian genomes and the chicken genome. The discovery of two EST clusters within the deleted region led to the isolation of two noncoding RNA (ncRNA) genes. These were subsequently found differentially expressed in various tumors when compared to their normal tissues. The ncRNA and other highly conserved sequence elements in the deleted region may represent miRNA targets of importance in cancer initiation or progression. Published 2006 Wiley-Liss, Inc.
Marandino, Ana; Tomás, Gonzalo; Panzera, Yanina; Greif, Gonzalo; Parodi-Talice, Adriana; Hernández, Martín; Techera, Claudia; Hernández, Diego; Pérez, Ruben
2017-10-01
Infectious bronchitis virus (Gammacoronavirus, Coronaviridae) is a genetically variable RNA virus that causes one of the most persistent respiratory diseases in poultry. The virus is classified in genotypes and lineages with different epidemiological relevance. Two lineages of the GI genotype (11 and 16) have been widely circulating for decades in South America. GI-11 is an exclusive South American lineage while the GI-16 lineage is distributed in Asia, Europe and South America. Here, we obtained the whole genome of two Uruguayan strains of the GI-11 and GI-16 lineages using Illumina high-throughput sequencing. The strains here sequenced are the first obtained in South America for the infectious bronchitis virus and provide new insights into the origin, spreading and evolution of viral variants. The complete genome of the GI-11 and GI-16 strains have 27,621 and 27,638 nucleotides, respectively, and possess the same genomic organization. Phylogenetic incongruence analysis reveals that both strains have a mosaic genome that arose by recombination between Euro Asiatic strains of the GI-16 lineage and ancestral South American GI-11 viruses. The recombination occurred in South America and produced two viral variants that have retained the full-length S1 sequences of the parental lineages but are extremely similar in the rest of their genomes. These recombinant virus have been extraordinary successful, persisting in the continent for several years with a notorious wide geographic distribution. Our findings reveal a singular viral dynamics and emphasize the importance of complete genomic characterization to understand the emergence and evolutionary history of viral variants. Copyright © 2017 Elsevier B.V. All rights reserved.
The blood DNA virome in 8,000 humans.
Moustafa, Ahmed; Xie, Chao; Kirkness, Ewen; Biggs, William; Wong, Emily; Turpaz, Yaron; Bloom, Kenneth; Delwart, Eric; Nelson, Karen E; Venter, J Craig; Telenti, Amalio
2017-03-01
The characterization of the blood virome is important for the safety of blood-derived transfusion products, and for the identification of emerging pathogens. We explored non-human sequence data from whole-genome sequencing of blood from 8,240 individuals, none of whom were ascertained for any infectious disease. Viral sequences were extracted from the pool of sequence reads that did not map to the human reference genome. Analyses sifted through close to 1 Petabyte of sequence data and performed 0.5 trillion similarity searches. With a lower bound for identification of 2 viral genomes/100,000 cells, we mapped sequences to 94 different viruses, including sequences from 19 human DNA viruses, proviruses and RNA viruses (herpesviruses, anelloviruses, papillomaviruses, three polyomaviruses, adenovirus, HIV, HTLV, hepatitis B, hepatitis C, parvovirus B19, and influenza virus) in 42% of the study participants. Of possible relevance to transfusion medicine, we identified Merkel cell polyomavirus in 49 individuals, papillomavirus in blood of 13 individuals, parvovirus B19 in 6 individuals, and the presence of herpesvirus 8 in 3 individuals. The presence of DNA sequences from two RNA viruses was unexpected: Hepatitis C virus is revealing of an integration event, while the influenza virus sequence resulted from immunization with a DNA vaccine. Age, sex and ancestry contributed significantly to the prevalence of infection. The remaining 75 viruses mostly reflect extensive contamination of commercial reagents and from the environment. These technical problems represent a major challenge for the identification of novel human pathogens. Increasing availability of human whole-genome sequences will contribute substantial amounts of data on the composition of the normal and pathogenic human blood virome. Distinguishing contaminants from real human viruses is challenging.
Norman, Paul J.; Norberg, Steven J.; Guethlein, Lisbeth A.; Nemat-Gorgani, Neda; Royce, Thomas; Wroblewski, Emily E.; Dunn, Tamsen; Mann, Tobias; Alicata, Claudia; Hollenbach, Jill A.; Chang, Weihua; Shults Won, Melissa; Gunderson, Kevin L.; Abi-Rached, Laurent; Ronaghi, Mostafa; Parham, Peter
2017-01-01
The most polymorphic part of the human genome, the MHC, encodes over 160 proteins of diverse function. Half of them, including the HLA class I and II genes, are directly involved in immune responses. Consequently, the MHC region strongly associates with numerous diseases and clinical therapies. Notoriously, the MHC region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp MHC region from genomic DNA. For 95 MHC homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative MHC reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the MHC region shows the approach accurately determines the sequences of the highly polymorphic HLA class I and HLA class II genes and the complex structural diversity of complement factor C4A/C4B. It has also uncovered extensive and unexpected diversity in other MHC genes; an example is MUC22, which encodes a lung mucin and exhibits more coding sequence alleles than any HLA class I or II gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference MHC haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome. PMID:28360230
Song, B K; Pan, M Z; Lau, Y L; Wan, K L
2014-07-29
Commercial flocks infected by Eimeria species parasites, including Eimeria maxima, have an increased risk of developing clinical or subclinical coccidiosis; an intestinal enteritis associated with increased mortality rates in poultry. Currently, infection control is largely based on chemotherapy or live vaccines; however, drug resistance is common and vaccines are relatively expensive. The development of new cost-effective intervention measures will benefit from unraveling the complex genetic mechanisms that underlie host-parasite interactions, including the identification and characterization of genes encoding proteins such as phosphatidylinositol 4-phosphate 5-kinase (PIP5K). We previously identified a PIP5K coding sequence within the E. maxima genome. In this study, we analyzed two bacterial artificial chromosome clones presenting a ~145-kb E. maxima (Weybridge strain) genomic region spanning the PIP5K gene locus. Sequence analysis revealed that ~95% of the simple sequence repeats detected were located within regions comparable to the previously described feature-rich segments of the Eimeria tenella genome. Comparative sequence analysis with the orthologous E. maxima (Houghton strain) region revealed a moderate level of conserved synteny. Unique segmental organizations and telomere-like repeats were also observed in both genomes. A number of incomplete transposable elements were detected and further scrutiny of these elements in both orthologous segments revealed interesting nesting events, which may play a role in facilitating genome plasticity in E. maxima. The current analysis provides more detailed information about the genome organization of E. maxima and may help to reveal genotypic differences that are important for expression of traits related to pathogenicity and virulence.
NASA Astrophysics Data System (ADS)
Liu, Yun; Song, Shuqun; Chen, Tiantian; Li, Caiwen
2017-04-01
Pyrosequencing of the 18S rRNA gene has been widely adopted to study the eukaryotic diversity in various types of environments, and has an advantage over traditional morphology methods in exploring unknown microbial communities. To comprehensively assess the diversity and community composition of marine protists in the coastal waters of China, we applied both morphological observations and high-throughput sequencing of the V2 and V3 regions of 18S rDNA simultaneously to analyze samples collected from the surface layer of the Yellow and East China Seas. Dinoflagellates, diatoms and ciliates were the three dominant protistan groups as revealed by the two methods. Diatoms were the first dominant protistan group in the microscopic observations, with Skeletonema mainly distributed in the nearshore eutrophic waters and Chaetoceros in higher temperature and higher pH waters. The mixotrophic dinoflagellates, Gymnodinium and Gyrodinium, were more competitive in the oligotrophic waters. The pyrosequencing method revealed an extensive diversity of dinoflagellates. Chaetoceros was the only dominant diatom group in the pyrosequencing dataset. Gyrodinium represented the most abundant reads and dominated the offshore oligotrophic protistan community as they were in the microscopic observations. The dominance of parasitic dinoflagellates in the pyrosequencing dataset, which were overlooked in the morphological observations, indicates more attention should be paid to explore the potential role of this group. Both methods provide coherent clustering of samples. Nutrient levels, salinity and pH were the main factors influencing the distribution of protists. This study demonstrates that different primer pairs used in the pyrosequencing will indicate different protistan community structures. A suitable marker may reveal more comprehensive composition of protists and provide valuable information on environmental drivers.
Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation.
Dueck, Hannah; Khaladkar, Mugdha; Kim, Tae Kyung; Spaethling, Jennifer M; Francis, Chantal; Suresh, Sangita; Fisher, Stephen A; Seale, Patrick; Beck, Sheryl G; Bartfai, Tamas; Kuhn, Bernhard; Eberwine, James; Kim, Junhyong
2015-06-09
Differentiation of metazoan cells requires execution of different gene expression programs but recent single-cell transcriptome profiling has revealed considerable variation within cells of seeming identical phenotype. This brings into question the relationship between transcriptome states and cell phenotypes. Additionally, single-cell transcriptomics presents unique analysis challenges that need to be addressed to answer this question. We present high quality deep read-depth single-cell RNA sequencing for 91 cells from five mouse tissues and 18 cells from two rat tissues, along with 30 control samples of bulk RNA diluted to single-cell levels. We find that transcriptomes differ globally across tissues with regard to the number of genes expressed, the average expression patterns, and within-cell-type variation patterns. We develop methods to filter genes for reliable quantification and to calibrate biological variation. All cell types include genes with high variability in expression, in a tissue-specific manner. We also find evidence that single-cell variability of neuronal genes in mice is correlated with that in rats consistent with the hypothesis that levels of variation may be conserved. Single-cell RNA-sequencing data provide a unique view of transcriptome function; however, careful analysis is required in order to use single-cell RNA-sequencing measurements for this purpose. Technical variation must be considered in single-cell RNA-sequencing studies of expression variation. For a subset of genes, biological variability within each cell type appears to be regulated in order to perform dynamic functions, rather than solely molecular noise.