dna sequence pattern: Topics by Science.gov

Sample records for dna sequence pattern

CpG PatternFinder: a Windows-based utility program for easy and rapid identification of the CpG methylation status of DNA.

PubMed

Xu, Yi-Hua; Manoharan, Herbert T; Pitot, Henry C

2007-09-01

The bisulfite genomic sequencing technique is one of the most widely used techniques to study sequence-specific DNA methylation because of its unambiguous ability to reveal DNA methylation status to the order of a single nucleotide. One characteristic feature of the bisulfite genomic sequencing technique is that a number of sample sequence files will be produced from a single DNA sample. The PCR products of bisulfite-treated DNA samples cannot be sequenced directly because they are heterogeneous in nature; therefore they should be cloned into suitable plasmids and then sequenced. This procedure generates an enormous number of sample DNA sequence files as well as adding extra bases belonging to the plasmids to the sequence, which will cause problems in the final sequence comparison. Finding the methylation status for each CpG in each sample sequence is not an easy job. As a result CpG PatternFinder was developed for this purpose. The main functions of the CpG PatternFinder are: (i) to analyze the reference sequence to obtain CpG and non-CpG-C residue position information; (ii) to tailor sample sequence files (delete insertions and mark deletions from the sample sequence files) based on a configuration of ClustalW multiple alignment; (iii) to align sample sequence files with a reference file to obtain bisulfite conversion efficiency and CpG methylation status; and (iv) to produce graphics, highlighted aligned sequence text and a summary report which can be easily exported to the Microsoft Office suite. CpG PatternFinder is designed to operate cooperatively with BioEdit, a freeware program available on the internet. It can handle up to 100 files of sample DNA sequences simultaneously, and the total CpG pattern analysis process can be finished in minutes. CpG PatternFinder is an ideal software tool for DNA methylation studies to determine the differential methylation pattern in a large number of individuals in a population.
Previously we developed the CpG Analyzer program; CpG PatternFinder is our further effort to create software tools for DNA methylation studies.
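The comparison step described in this abstract (reading CpG methylation status and bisulfite conversion efficiency off a read aligned to a reference) can be sketched in a few lines of Python. This is a minimal illustration, not the CpG PatternFinder implementation: the function names, the toy sequences, and the assumption that the read is already aligned to equal length without gaps are all hypothetical.

```python
def cpg_positions(reference):
    """Return 0-based indices of every C that is immediately followed by G."""
    return [i for i in range(len(reference) - 1) if reference[i:i + 2] == "CG"]


def call_methylation(reference, read):
    """Classify reference cytosines using an aligned bisulfite read.

    Assumption: `read` is already aligned to `reference` with equal length.
    A CpG C that still reads C is called methylated; one that reads T is
    called unmethylated. Non-CpG cytosines are used to estimate the
    conversion efficiency (fraction of them that read as T).
    """
    if len(reference) != len(read):
        raise ValueError("reference and read must be aligned to equal length")
    cpg = set(cpg_positions(reference))
    calls = {"C": "methylated", "T": "unmethylated"}
    cpg_status = {}
    converted = 0
    informative_non_cpg = 0
    for i, (ref_base, read_base) in enumerate(zip(reference, read)):
        if ref_base != "C":
            continue
        if i in cpg:
            cpg_status[i] = calls.get(read_base, "ambiguous")
        elif read_base in ("C", "T"):
            informative_non_cpg += 1
            converted += read_base == "T"
    efficiency = converted / informative_non_cpg if informative_non_cpg else float("nan")
    return cpg_status, efficiency


# Toy example (made-up sequences, not data from the paper).
reference = "ACGTCCGATCAACGT"
read = "ACGTTTGATTAACGT"
status, efficiency = call_methylation(reference, read)
print(status)
print(efficiency)
```

A real pipeline would first trim plasmid bases and handle insertions and deletions from the multiple alignment, which this sketch deliberately leaves out.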
A Programmable DNA Double-Write Material: Synergy of Photolithography and Self-Assembly Nanofabrication.

PubMed

Song, Youngjun; Takahashi, Tsukasa; Kim, Sejung; Heaney, Yvonne C; Warner, John; Chen, Shaochen; Heller, Michael J

2017-01-11

We demonstrate a DNA double-write process that uses UV to pattern a uniquely designed DNA write material, which produces two distinct binding identities for hybridizing two different complementary DNA sequences. The process requires no modification to the DNA by chemical reagents and allows programmed DNA self-assembly and further UV patterning in the UV exposed and nonexposed areas. Multilayered DNA patterning with hybridization of fluorescently labeled complementary DNA sequences, biotin probe/fluorescent streptavidin complexes, and DNA patterns with 500 nm line widths were all demonstrated.
Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes

PubMed Central

Huang, Yongjie; Mrázek, Jan

2014-01-01

Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877
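As a small illustration of one pattern class surveyed in this abstract, the sketch below locates palindromes, taken here to mean sites equal to their own reverse complement. It covers only the counting step; the null model used to judge over- or under-representation is not reproduced, and the function names and toy sequence are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def is_palindrome(site):
    """True if the site equals its own reverse complement."""
    return site == site.translate(COMPLEMENT)[::-1]


def find_palindromes(seq, length):
    """Return start indices of all reverse-complement palindromes of a given length."""
    return [i for i in range(len(seq) - length + 1) if is_palindrome(seq[i:i + length])]


# Toy sequence containing GAATTC and GGATCC.
toy = "AAGAATTCTTGGATCCA"
print(find_palindromes(toy, 6))
```

Comparing such counts against counts in suitably randomized sequences is what turns them into a statement about representation.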
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
Multiplex analysis of DNA

DOEpatents

Church, George M.; Kieffer-Higgins, Stephen

1992-01-01

This invention features vectors and a method for sequencing DNA. The method includes the steps of: a) ligating the DNA into a vector comprising a tag sequence, the tag sequence includes at least 15 bases, wherein the tag sequence will not hybridize to the DNA under stringent hybridization conditions and is unique in the vector, to form a hybrid vector, b) treating the hybrid vector in a plurality of vessels to produce fragments comprising the tag sequence, wherein the fragments differ in length and terminate at a fixed known base or bases, wherein the fixed known base or bases differs in each vessel, c) separating the fragments from each vessel according to their size, d) hybridizing the fragments with an oligonucleotide able to hybridize specifically with the tag sequence, and e) detecting the pattern of hybridization of the tag sequence, wherein the pattern reflects the nucleotide sequence of the DNA.
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family me... This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
Unlinking the methylome pattern from nucleotide sequence, revealed by large-scale in vivo genome engineering and methylome editing in medaka fish

PubMed Central

Nakamura, Ryohei; Uno, Ayako; Kumagai, Masahiko; Fukushima, Hiroto S.; Morishita, Shinichi; Takeda, Hiroyuki

2017-01-01

The heavily methylated vertebrate genomes are punctuated by stretches of poorly methylated DNA sequences that usually mark gene regulatory regions. It is known that the methylation state of these regions confers transcriptional control over their associated genes. Given its governance of the transcriptome, cellular functions and identity, the genome-wide DNA methylation pattern is tightly regulated and evidently predefined. However, how the methylation pattern is determined in vivo remains enigmatic. Based on in silico and in vitro evidence, recent studies proposed that the regional hypomethylated state is primarily determined by local DNA sequence, e.g., high CpG density and presence of specific transcription factor binding sites. Nonetheless, the dependency of DNA methylation on nucleotide sequence has not been carefully validated in vertebrates in vivo. Herein, with the use of medaka (Oryzias latipes) as a model, the sequence dependency of DNA methylation was intensively tested in vivo. Our statistical modeling confirmed the strong statistical association between nucleotide sequence pattern and methylation state in the medaka genome. However, by manipulating the methylation state of a number of genomic sequences and reintegrating them into medaka embryos, we demonstrated that artificially conferred DNA methylation states were predominantly and robustly maintained in vivo, regardless of their sequences and endogenous states. This feature was also observed in the medaka transgene that had passed across generations. Thus, despite the observed statistical association, nucleotide sequence was unable to autonomously determine its own methylation state in medaka in vivo. Our results apparently argue against the notion that nucleotide sequence governs DNA methylation, but instead suggest the involvement of other epigenetic factors in defining and maintaining the DNA methylation landscape.
Further investigation in other vertebrate models in vivo will be needed for the generalization of our observations made in medaka. PMID:29267279
Hairpin Bisulfite Sequencing: Synchronous Methylation Analysis on Complementary DNA Strands of Individual Chromosomes.

PubMed

Giehr, Pascal; Walter, Jörn

2018-01-01

The accurate and quantitative detection of 5-methylcytosine is of great importance in the field of epigenetics. The method of choice is usually bisulfite sequencing because of the high resolution and the possibility to combine it with next generation sequencing. Nevertheless, this method also has its limitations. Following the bisulfite treatment, DNA strands are no longer complementary, such that in a subsequent PCR amplification the methylation pattern information of only one of the two DNA strands is preserved. Several years ago Hairpin Bisulfite sequencing was developed as a method to obtain the pattern information on complementary DNA strands. The method requires fragmentation (usually by enzymatic cleavage) of genomic DNA followed by a covalent linking of both DNA strands through ligation of a short DNA hairpin oligonucleotide to both strands. The ligated covalently linked dsDNA products are then subjected to a conventional bisulfite treatment during which all unmodified cytosines are converted to uracils. During the treatment the DNA is denatured, forming noncomplementary ssDNA circles. These circles serve as a template for a locus specific PCR to amplify chromosomal patterns of the region of interest. As a result one ends up with a linearized product, which contains the methylation information of both complementary DNA strands.
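The loss of strand complementarity after bisulfite treatment, which motivates the hairpin approach in this abstract, can be shown with a short in silico conversion. The sketch is illustrative only: the helper names, the six-base toy duplex, and the choice of methylated positions are assumptions, and the hairpin ligation itself is not modeled.

```python
# U is complemented to A so converted strands can be compared.
COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def reverse_complement(seq):
    """Reverse complement of a DNA (or bisulfite-converted) strand."""
    return seq.translate(COMPLEMENT)[::-1]


def bisulfite_convert(seq, methylated=()):
    """Convert every unmethylated C to U; indices in `methylated` stay C."""
    protected = set(methylated)
    return "".join(
        "U" if base == "C" and i not in protected else base
        for i, base in enumerate(seq)
    )


top = "ACGTCA"
bottom = reverse_complement(top)  # the complementary strand, written 5' to 3'

# Symmetric CpG methylation: index 1 on the top strand, index 3 on the bottom.
top_bs = bisulfite_convert(top, methylated=[1])
bottom_bs = bisulfite_convert(bottom, methylated=[3])

print(top_bs, bottom_bs)
print(reverse_complement(top_bs) == bottom_bs)  # strands no longer pair perfectly
```

Because the two converted strands diverge, a PCR product derived from one strand says nothing about the other, which is the gap the covalent hairpin link is meant to close.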
Applications of statistical physics and information theory to the analysis of DNA sequences

NASA Astrophysics Data System (ADS)

Grosse, Ivo

2000-10-01

DNA carries the genetic information of most living organisms, and the goal of genome projects is to uncover that genetic information. One basic task in the analysis of DNA sequences is the recognition of protein coding genes. Powerful computer programs for gene recognition have been developed, but most of them are based on statistical patterns that vary from species to species. In this thesis I address the question of whether there exist universal statistical patterns that are different in coding and noncoding DNA of all living species, regardless of their phylogenetic origin. In search of such species-independent patterns I study the mutual information function of genomic DNA sequences, and find that it shows persistent period-three oscillations. To understand the biological origin of the observed period-three oscillations, I compare the mutual information function of genomic DNA sequences to the mutual information function of stochastic model sequences. I find that the pseudo-exon model is able to reproduce the mutual information function of genomic DNA sequences. Moreover, I find that a generalization of the pseudo-exon model can connect the existence and the functional form of long-range correlations to the presence and the length distributions of coding and noncoding regions. Based on these theoretical studies I am able to find an information-theoretical quantity, the average mutual information (AMI), whose probability distributions are significantly different in coding and noncoding DNA, while they are almost identical in all studied species. These findings show that there exist universal statistical patterns that are different in coding and noncoding DNA of all studied species, and they suggest that the AMI may be used to identify genes in different living species, irrespective of their taxonomic origin.
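The mutual information function mentioned in this abstract can be estimated from pair frequencies of bases separated by k positions. The sketch below uses the standard definition I(k) = sum over base pairs (a, b) of p_ab(k) log2[p_ab(k) / (p_a p_b)]. The toy generator with position-dependent base composition is an assumption made for illustration; it is not the pseudo-exon model of the thesis.

```python
import random
from collections import Counter
from math import log2


def mutual_information(seq, k):
    """Mutual information in bits between bases separated by k positions."""
    pairs = Counter(zip(seq, seq[k:]))
    n = sum(pairs.values())
    first, second = Counter(), Counter()
    for (a, b), count in pairs.items():
        first[a] += count
        second[b] += count
    # (c/n) * log2((c/n) / ((first/n) * (second/n))) simplifies to the form below.
    return sum(
        (count / n) * log2(count * n / (first[a] * second[b]))
        for (a, b), count in pairs.items()
    )


def toy_coding_sequence(n_codons, seed=0):
    """Hypothetical 'coding-like' sequence: each codon position has its own base bias."""
    rng = random.Random(seed)
    weights = [(1, 1, 7, 1), (7, 1, 1, 1), (1, 1, 1, 1)]  # weights for A, C, G, T
    return "".join(
        rng.choices("ACGT", weights=w)[0] for _ in range(n_codons) for w in weights
    )


seq = toy_coding_sequence(10_000)
profile = {k: mutual_information(seq, k) for k in range(1, 10)}
for k, bits in profile.items():
    print(k, round(bits, 4))
```

With this toy composition the values at k = 3, 6 and 9 should come out higher than at the neighbouring distances, a simple analogue of the period-three signal described above.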
Fragmentation of contaminant and endogenous DNA in ancient samples determined by shotgun sequencing; prospects for human palaeogenomics.

PubMed

García-Garcerà, Marc; Gigli, Elena; Sanchez-Quinto, Federico; Ramirez, Oscar; Calafell, Francesc; Civit, Sergi; Lalueza-Fox, Carles

2011-01-01

Despite the successful retrieval of genomes from past remains, the prospects for human palaeogenomics remain unclear because of the difficulty of distinguishing contaminant from endogenous DNA sequences. Previous sequence data generated on high-throughput sequencing platforms indicate that fragmentation of ancient DNA sequences is a characteristic trait primarily arising due to depurination processes that create abasic sites leading to DNA breaks. METHODOLOGY/PRINCIPAL FINDINGS: To investigate whether this pattern is present in ancient remains from a temperate environment, we have 454-FLX pyrosequenced different samples dated between 5,500 and 49,000 years ago: a bone from an extinct goat (Myotragus balearicus) that was treated with a depurinating agent (bleach), an Iberian lynx bone not subjected to any treatment, a human Neolithic sample from Barcelona (Spain), and a Neandertal sample from the El Sidrón site (Asturias, Spain). The efficiency of retrieval of endogenous sequences is below 1% in all cases. We have used the non-human samples to identify human sequences (0.35 and 1.4%, respectively) that we positively know are contaminants. We observed that bleach treatment appears to create a depurination-associated fragmentation pattern in resulting contaminant sequences that is indistinguishable from previously described endogenous sequences. Furthermore, the nucleotide composition pattern observed in 5' and 3' ends of contaminant sequences is much more complex than the flat pattern previously described in some Neandertal contaminants. Although much research on samples with known contaminant histories is needed, our results suggest that endogenous and contaminant sequences cannot be distinguished by the fragmentation pattern alone.
Molecular Analysis of Dehalococcoides 16S Ribosomal DNA from Chloroethene-Contaminated Sites throughout North America and Europe

PubMed Central

Hendrickson, Edwin R.; Payne, Jo Ann; Young, Roslyn M.; Starr, Mark G.; Perry, Michael P.; Fahnestock, Stephen; Ellis, David E.; Ebersole, Richard C.

2002-01-01

The environmental distribution of Dehalococcoides group organisms and their association with chloroethene-contaminated sites were examined. Samples from 24 chloroethene-dechlorinating sites scattered throughout North America and Europe were tested for the presence of members of the Dehalococcoides group by using a PCR assay developed to detect Dehalococcoides 16S rRNA gene (rDNA) sequences. Sequences identified by sequence analysis as sequences of members of the Dehalococcoides group were detected at 21 sites. Full dechlorination of chloroethenes to ethene occurred at these sites. Dehalococcoides sequences were not detected in samples from three sites at which partial dechlorination of chloroethenes occurred, where dechlorination appeared to stop at 1,2-cis-dichloroethene. Phylogenetic analysis of the 16S rDNA amplicons confirmed that Dehalococcoides sequences formed a unique 16S rDNA group. These 16S rDNA sequences were divided into three subgroups based on specific base substitution patterns in variable regions 2 and 6 of the Dehalococcoides 16S rDNA sequence. Analyses also demonstrated that specific base substitution patterns were signature patterns. The specific base substitutions distinguished the three sequence subgroups phylogenetically. These results demonstrated that members of the Dehalococcoides group are widely distributed in nature and can be found in a variety of geological formations and in different climatic zones. Furthermore, the association of these organisms with full dechlorination of chloroethenes suggests that they are promising candidates for engineered bioremediation and may be important contributors to natural attenuation of chloroethenes. PMID:11823182
Methylation patterns of repetitive DNA sequences in germ cells of Mus musculus.

PubMed

Sanford, J; Forrester, L; Chapman, V; Chandley, A; Hastie, N

1984-03-26

The major and the minor satellite sequences of Mus musculus were undermethylated in both sperm and oocyte DNAs relative to the amount of undermethylation observed in adult somatic tissue DNA. This hypomethylation was specific for satellite sequences in sperm DNA. Dispersed repetitive and low copy sequences show a high degree of methylation in sperm DNA; however, a dispersed repetitive sequence was undermethylated in oocyte DNA. This finding suggests a difference in the amount of total genomic DNA methylation between sperm and oocyte DNA. The methylation levels of the minor satellite sequences did not change during spermiogenesis, and were not associated with the onset of meiosis or a specific stage in sperm development.
A Glimpse into the Satellite DNA Library in Characidae Fish (Teleostei, Characiformes)

PubMed Central

Utsunomia, Ricardo; Ruiz-Ruano, Francisco J.; Silva, Duílio M. Z. A.; Serrano, Érica A.; Rosa, Ivana F.; Scudeler, Patrícia E. S.; Hashimoto, Diogo T.; Oliveira, Claudio; Camacho, Juan Pedro M.; Foresti, Fausto

2017-01-01

Satellite DNA (satDNA) is an abundant fraction of repetitive DNA in eukaryotic genomes and plays an important role in genome organization and evolution. In general, satDNA sequences follow a concerted evolutionary pattern through the intragenomic homogenization of different repeat units. In addition, the satDNA library hypothesis predicts that related species share a series of satDNA variants descended from a common ancestor species, with differential amplification of different satDNA variants. The finding of the same satDNA family in species belonging to different genera within Characidae fish provided the opportunity to test both the concerted evolution and library hypotheses. For this purpose, we analyzed sequence variation and abundance of this satDNA family in ten species, by a combination of next generation sequencing (NGS), PCR and Sanger sequencing, and fluorescence in situ hybridization (FISH). We found extensive between-species variation in the number and size of pericentromeric FISH signals. At the genomic level, the analysis of thousands of DNA sequences obtained by Illumina sequencing and PCR amplification allowed us to define 150 haplotypes which were linked in a common minimum spanning tree, where different patterns of concerted evolution were apparent. This also provided a glimpse into the satDNA library of this group of species. Consistent with the library hypothesis, different variants of this satDNA showed large differences in abundance between species, from highly abundant to simply relictual variants. PMID:28855916
DNA Photo Lithography with Cinnamate-based Photo-Bio-Nano-Glue

NASA Astrophysics Data System (ADS)

Feng, Lang; Li, Minfeng; Romulus, Joy; Sha, Ruojie; Royer, John; Wu, Kun-Ta; Xu, Qin; Seeman, Nadrian; Weck, Marcus; Chaikin, Paul

2013-03-01

We present a technique to make patterned functional surfaces, using a cinnamate photo cross-linker and photolithography. We have designed and modified a complementary set of single DNA strands to incorporate a pair of opposing cinnamate molecules. On exposure to 360nm UV, the cinnamate makes a highly specific covalent bond permanently linking only the complementary strands containing the cinnamates. We have studied this specific and efficient crosslinking with cinnamate-containing DNA in solution and on particles. UV addressability allows us to pattern surfaces functionally. The entire surface is coated with a DNA sequence A incorporating cinnamate. DNA strands A'B with one end containing a complementary cinnamated sequence A' attached to another sequence B, are then hybridized to the surface. UV photolithography is used to bind the A'B strand in a specific pattern. The system is heated and the unbound DNA is washed away. The pattern is then observed by thermo-reversibly hybridizing either fluorescently dyed B' strands complementary to B, or colloids coated with B' strands. Our techniques can be used to reversibly and/or permanently bind, via DNA linkers, an assortment of molecules, proteins and nanostructures. Potential applications range from advanced self-assembly, such as templated self-replication schemes recently reported, to designed physical and chemical patterns, to high-resolution multi-functional DNA surfaces for genetic detection or DNA computing.
Reduced representation bisulphite sequencing of the cattle genome reveals DNA methylation patterns

USDA-ARS's Scientific Manuscript database

Using reduced representation bisulphite sequencing (RRBS), we obtained the first single-base-resolution maps of bovine DNA methylation in ten somatic tissues. In total, we observed 1,868,049 cytosines in the CG-enriched regions. Similar to the methylation patterns in other species, the CG context wa...
Informational structure of genetic sequences and nature of gene splicing

NASA Astrophysics Data System (ADS)

Trifonov, E. N.

1991-10-01

Only about 1/20 of the DNA of higher organisms codes for proteins, by means of the classical triplet code. The rest of the DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence "talks" to various biomolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus being a composite structure in which one and the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of Gnomic, the language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and the necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their "ontogenetic" way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications, is to resolve such conflicts. New data are presented strongly indicating that gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.
Discovery and information-theoretic characterization of transcription factor binding sites that act cooperatively.

PubMed

Clifford, Jacob; Adami, Christoph

2015-09-02

Transcription factor binding to the surface of DNA regulatory regions is one of the primary causes of regulating gene expression levels. A probabilistic approach to model protein-DNA interactions at the sequence level is through position weight matrices (PWMs) that estimate the joint probability of a DNA binding site sequence by assuming positional independence within the DNA sequence. Here we construct conditional PWMs that depend on the motif signatures in the flanking DNA sequence, by conditioning known binding site loci on the presence or absence of additional binding sites in the flanking sequence of each site's locus. Pooling known sites with similar flanking sequence patterns allows for the estimation of the conditional distribution function over the binding site sequences. We apply our model to the Dorsal transcription factor binding sites active in patterning the Dorsal-Ventral axis of Drosophila development. We find that those binding sites that cooperate with nearby Twist sites on average contain about 0.5 bits of information about the presence of Twist transcription factor binding sites in the flanking sequence. We also find that Dorsal binding site detectors conditioned on flanking sequence information make better predictions about what is a Dorsal site relative to background DNA than detection without information about flanking sequence features.
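A position weight matrix and its conditional variant, as described in this abstract, can be sketched as follows. The idea of pooling sites by flanking context is taken from the abstract; everything else (function names, the pseudocount of 1, a uniform background of 0.25, and the made-up six-base sites) is an assumption for illustration and not the authors' code or data.

```python
from collections import Counter
from math import log2

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0):
    """Column-wise base probabilities from equal-length binding sites."""
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + 4 * pseudocount
    pwm = []
    for column in zip(*sites):
        counts = Counter(column)
        pwm.append({b: (counts[b] + pseudocount) / total for b in BASES})
    return pwm


def log_odds_score(pwm, seq, background=0.25):
    """Log2 odds of `seq` under the PWM versus a uniform background."""
    return sum(log2(column[base] / background) for column, base in zip(pwm, seq))


def information_content(pwm, background=0.25):
    """Total relative entropy (bits) of the PWM against the background."""
    return sum(
        p * log2(p / background)
        for column in pwm
        for p in column.values()
        if p > 0
    )


# Hypothetical sites pooled by flanking context (a partner site nearby or not).
with_partner = ["GGGAAA", "GGGAAT", "GGGAAA", "GGGTAA"]
without_partner = ["GGAAAC", "GGTAAC", "GGAATC", "GCAAAC"]

pwm_with = build_pwm(with_partner)
pwm_without = build_pwm(without_partner)
candidate = "GGGAAA"
print(log_odds_score(pwm_with, candidate), log_odds_score(pwm_without, candidate))
```

Scoring a candidate under the matrix that matches its flanking context is the conditional step; a single pooled matrix would ignore that information.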
Patterns of Viral DNA Integration in Cells Transformed by Wild Type or DNA-Binding Protein Mutants of Adenovirus Type 5 and Effect of Chemical Carcinogens on Integration

PubMed Central

Dorsch-Häsler, Karoline; Fisher, Paul B.; Weinstein, I. Bernard; Ginsberg, Harold S.

1980-01-01

The integration pattern of viral DNA was studied in a number of cell lines transformed by wild-type adenovirus type 5 (Ad5 WT) and two mutants of the DNA-binding protein gene, H5ts125 and H5ts107. The effect of chemical carcinogens on the integration of viral DNA was also investigated. Liquid hybridization (C0t) analyses showed that rat embryo cells transformed by Ad5 WT usually contained only the left-hand end of the viral genome, whereas cell lines transformed by H5ts125 or H5ts107 at either the semipermissive (36°C) or nonpermissive (39.5°C) temperature often contained one to five copies of all or most of the entire adenovirus genome. The arrangement of the integrated adenovirus DNA sequences was determined by cleavage of transformed cell DNA with restriction endonucleases XbaI, EcoRI, or HindIII followed by transfer of separated fragments to nitrocellulose paper and hybridization according to the technique of E. M. Southern (J. Mol. Biol. 98: 503-517, 1975). It was found that the adenovirus genome is integrated as a linear sequence covalently linked to host cell DNA; that the viral DNA is integrated into different host DNA sequences in each cell line studied; that in cell lines that contain multiple copies of the Ad5 genome the viral DNA sequences can be integrated in a single set of host cell DNA sequences and not as concatemers; and that chemical carcinogens do not alter the extent or pattern of viral DNA integration. PMID:6246266
Comparison of sequencing-based methods to profile DNA methylation and identification of monoallelic epigenetic modifications

USDA-ARS's Scientific Manuscript database

Analysis of DNA methylation patterns relies increasingly on sequencing-based profiling methods. The four most frequently used sequencing-based technologies are the bisulfite-based methods MethylC-seq and reduced representation bisulfite sequencing (RRBS), and the enrichment-based techniques methylat...
Sequence Dependencies of DNA Deformability and Hydration in the Minor Groove

PubMed Central

Yonetani, Yoshiteru; Kono, Hidetoshi

2009-01-01

DNA deformability and hydration are both sequence-dependent and are essential in specific DNA sequence recognition by proteins. However, the relationship between the two is not well understood. Here, systematic molecular dynamics simulations of 136 DNA sequences that differ from each other in their central tetramer revealed that sequence dependence of hydration is clearly correlated with that of deformability. We show that this correlation can be illustrated by four typical cases. Most rigid basepair steps are highly likely to form an ordered hydration pattern composed of one water molecule forming a bridge between the bases of distinct strands, but a few exceptions favor another ordered hydration composed of two water molecules forming such a bridge. Steps with medium deformability can display both of these hydration patterns with frequent transition. Highly flexible steps do not have any stable hydration pattern. A detailed picture of this correlation demonstrates that motions of hydration water molecules and DNA bases are tightly coupled with each other at the atomic level. These results contribute to our understanding of the entropic contribution from water molecules in protein or drug binding and could be applied for the purpose of predicting binding sites. PMID:19686662

SINE sequences detect DNA fingerprints in salmonid fishes.

PubMed

Spruell, P; Thorgaard, G H

1996-04-01

DNA probes homologous to two previously described salmonid short interspersed nuclear elements (SINEs) detected DNA fingerprint patterns in 14 species of salmonid fishes. The probes showed more homology to some species than to others and little homology to three nonsalmonid fishes. The DNA fingerprint patterns derived from the SINE probes are individual-specific and inherited in a Mendelian manner. Probes derived from different regions of the same SINE detect only partially overlapping banding patterns, reflecting a more complex SINE structure than has been previously reported. Like the human Alu sequence, the SINEs found in salmonids could provide useful genetic markers and primer sites for PCR-based techniques. These elements may be more desirable for some applications than traditional DNA fingerprinting probes that detect tandemly repeated arrays.
Vander Lugt correlation of DNA sequence data

NASA Astrophysics Data System (ADS)

Christens-Barry, William A.; Hawk, James F.; Martin, James C.

1990-12-01

DNA, the molecule containing the genetic code of an organism, is a linear chain of subunits. It is the sequence of subunits, of which there are four kinds, that constitutes the unique blueprint of an individual. This sequence is the focus of a large number of analyses performed by an army of geneticists, biologists, and computer scientists. Most of these analyses entail searches for specific subsequences within the larger set of sequence data. Thus, most analyses are essentially pattern recognition or correlation tasks. Yet, there are special features to such analysis that influence the strategy and methods of an optical pattern recognition approach. While the serial processing employed in digital electronic computers remains the main engine of sequence analyses, there is no fundamental reason that more efficient parallel methods cannot be used. We describe an approach using optical pattern recognition (OPR) techniques based on matched spatial filtering. This allows parallel comparison of large blocks of sequence data. In this study we have simulated a Vander Lugt architecture implementing our approach. Searches for specific target sequence strings within a block of DNA sequence from the ColE1 plasmid are performed.
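The correlation view of sequence search described in this abstract can be imitated digitally: encode each base as one of four binary channels, slide the target along the sequence, and look for offsets where the correlation reaches its maximum. The sketch below is a serial stand-in for the optical matched filter, not a simulation of the Vander Lugt architecture; the names and the toy sequence are hypothetical.

```python
BASES = "ACGT"


def one_hot(seq):
    """Encode a sequence as four binary channels, one per base."""
    return [[1 if base == b else 0 for base in seq] for b in BASES]


def correlate(sequence, target):
    """Cross-correlation score (number of matching bases) at every offset."""
    seq_channels = one_hot(sequence)
    target_channels = one_hot(target)
    n, m = len(sequence), len(target)
    return [
        sum(
            s[offset + j] * t[j]
            for s, t in zip(seq_channels, target_channels)
            for j in range(m)
        )
        for offset in range(n - m + 1)
    ]


def find_matches(sequence, target):
    """Offsets where the correlation peak equals the target length (exact match)."""
    scores = correlate(sequence, target)
    return [offset for offset, score in enumerate(scores) if score == len(target)]


block = "TTGACGTAAGACGT"
print(correlate(block, "GACG"))
print(find_matches(block, "GACG"))
```

Lowering the peak threshold below the target length would report approximate matches, which is one practical attraction of a correlation formulation.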
Evolution of rDNA in Nicotiana Allopolyploids: A Potential Link between rDNA Homogenization and Epigenetics

PubMed Central

Kovarik, Ales; Dadejova, Martina; Lim, Yoong K.; Chase, Mark W.; Clarkson, James J.; Knapp, Sandra; Leitch, Andrew R.

2008-01-01

Background The evolution and biology of rDNA have interested biologists for many years, in part, because of two intriguing processes: (1) nucleolar dominance and (2) sequence homogenization. We review patterns of evolution in rDNA in the angiosperm genus Nicotiana to determine consequences of allopolyploidy on these processes. Scope Allopolyploid species of Nicotiana are ideal for studying rDNA evolution because phylogenetic reconstruction of DNA sequences has revealed patterns of species divergence and their parents. From these studies we also know that polyploids formed over widely different timeframes (thousands to millions of years), enabling comparative and temporal studies of rDNA structure, activity and chromosomal distribution. In addition studies on synthetic polyploids enable the consequences of de novo polyploidy on rDNA activity to be determined. Conclusions We propose that rDNA epigenetic expression patterns established even in F1 hybrids have a material influence on the likely patterns of divergence of rDNA. It is the active rDNA units that are vulnerable to homogenization, which probably acts to reduce mutational load across the active array. Those rDNA units that are epigenetically silenced may be less vulnerable to sequence homogenization. Selection cannot act on these silenced genes, and they are likely to accumulate mutations and eventually be eliminated from the genome. It is likely that whole silenced arrays will be deleted in polyploids of 1 million years of age and older. PMID:18310159
DNA methylation polymorphism in a set of elite rice cultivars and its possible contribution to inter-cultivar differential gene expression.

PubMed

Wang, Yongming; Lin, Xiuyun; Dong, Bo; Wang, Yingdian; Liu, Bao

2004-01-01

RAPD (randomly amplified polymorphic DNA) and ISSR (inter-simple sequence repeat) fingerprinting on HpaII/MspI-digested genomic DNA of nine elite japonica rice cultivars implies inter-cultivar DNA methylation polymorphism. Using both DNA fragments isolated from RAPD or ISSR gels and selected low-copy sequences as probes, methylation-sensitive Southern blot analysis confirms the existence of extensive DNA methylation polymorphism in both genes and DNA repeats among the rice cultivars. The cultivar-specific methylation patterns are stably maintained, and can be used as reliable molecular markers. Transcriptional analysis of four selected sequences (RdRP, AC9, HSP90 and MMR) on leaves and roots from normal and 5-azacytidine-treated seedlings of three representative cultivars shows an association between the transcriptional activity of one of the genes, the mismatch repair (MMR) gene, and its CG methylation patterns.
Genome organization and DNA methylation patterns of B chromosomes in the red fox and Chinese raccoon dogs.

PubMed

Bugno-Poniewierska, Monika; Solek, Przemysław; Wronski, Mariusz; Potocki, Leszek; Jezewska-Witkowska, Grażyna; Wnuk, Maciej

2014-12-01

The molecular structure of B chromosomes (Bs) is relatively well studied. Previous research demonstrates that Bs of various species usually contain two types of repetitive DNA sequences, satellite DNA and ribosomal DNA, but Bs also contain genes encoding histone proteins and many others. However, many questions remain regarding the origin and function of these chromosomes. Here, we focused on the comparative cytogenetic characteristics of the red fox and Chinese raccoon dog B chromosomes with particular attention to the distribution of repetitive DNA sequences and their methylation status. We confirmed that the small Bs of the red fox show a typical fluorescent telomeric distal signal, whereas medium-sized Bs of the Chinese raccoon dog were characterized by clusters of telomeric sequences along their length. We also found different DNA methylation patterns for the B chromosomes of both species. Therefore, we concluded that DNA methylation may maintain the transcriptional inactivation of DNA sequences localized to B chromosomes and may prevent genetic unbalancing and several negative phenotypic effects. © 2014 The Authors.
Species-specific Typing of DNA Based on Palindrome Frequency Patterns

PubMed Central

Lamprea-Burgunder, Estelle; Ludin, Philipp; Mäser, Pascal

2011-01-01

DNA in its natural, double-stranded form may contain palindromes, sequences which read the same from either side because they are identical to their reverse complement on the sister strand. Short palindromes are underrepresented in all kinds of genomes. The frequency distribution of short palindromes exhibits more than twice the inter-species variance of non-palindromic sequences, which renders palindromes optimally suited for the typing of DNA. Here, we show that based on palindrome frequency, DNA sequences can be discriminated to the level of species of origin. By plotting the ratios of actual occurrence to expectancy, we generate palindrome frequency patterns that make it possible to cluster different sequences of the same genome and to assign plasmids, and in some cases even viruses, to their respective host genomes. This finding will be of use in the growing field of metagenomics. PMID:21429991
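A ratio of actual occurrence to expectancy can be sketched as follows. The expectation used here, window count times the product of single-base frequencies, is an assumption made for illustration; the record does not say which expectancy model the authors used, and the function names are hypothetical.

```python
from collections import Counter
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def palindromes(k):
    """All length-k words equal to their own reverse complement."""
    words = ("".join(p) for p in product("ACGT", repeat=k))
    return [w for w in words if w == reverse_complement(w)]

def observed_expected(seq, k=4):
    """Observed/expected ratio for each length-k palindrome in seq.

    Expected counts assume independent bases drawn at the frequencies
    observed in seq (an illustrative null model).
    """
    n = len(seq)
    freq = {b: c / n for b, c in Counter(seq).items()}
    windows = n - k + 1
    counts = Counter(seq[i:i + k] for i in range(windows))
    ratios = {}
    for w in palindromes(k):
        expected = windows
        for b in w:
            expected *= freq.get(b, 0.0)
        if expected > 0:
            ratios[w] = counts[w] / expected
    return ratios
```

The resulting dictionary is one "palindrome frequency pattern"; comparing such vectors between sequences is the kind of clustering the abstract describes.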
Application of a time-dependent coalescence process for inferring the history of population size changes from DNA sequence data.

PubMed

Polanski, A; Kimmel, M; Chakraborty, R

1998-05-12

Distribution of pairwise differences of nucleotides from data on a sample of DNA sequences from a given segment of the genome has been used in the past to draw inferences about the past history of population size changes. However, all earlier methods assume a given model of population size changes (such as sudden expansion), parameters of which (e.g., time and amplitude of expansion) are fitted to the observed distributions of nucleotide differences among pairwise comparisons of all DNA sequences in the sample. Our theory indicates that for any time-dependent population size, N(tau) (in which time tau is counted backward from present), a time-dependent coalescence process yields the distribution, p(tau), of the time of coalescence between two DNA sequences randomly drawn from the population. Prediction of p(tau) and N(tau) requires the use of a reverse Laplace transform known to be unstable. Nevertheless, simulated data obtained from three models of monotone population change (stepwise, exponential, and logistic) indicate that the pattern of a past population size change leaves its signature on the pattern of DNA polymorphism. Application of the theory to the published mtDNA sequences indicates that the current mtDNA sequence variation is not inconsistent with a logistic growth of the human population.
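For orientation, the textbook form of the pairwise coalescence-time density under a time-dependent population size is p(tau) = exp(-integral from 0 to tau of du/N(u)) / N(tau). The sketch below evaluates it numerically. It assumes a haploid population with time measured in generations; the paper's own scaling and notation may differ, and the function name is hypothetical.

```python
import math

def coalescence_density(N, tau, steps=10000):
    """Density of the coalescence time of two lineages at time tau.

    N is a function returning the population size at time u before the
    present. The integral of 1/N(u) uses the trapezoidal rule.
    """
    if tau == 0:
        return 1.0 / N(0.0)
    h = tau / steps
    integral = 0.5 * (1.0 / N(0.0) + 1.0 / N(tau))
    for i in range(1, steps):
        integral += 1.0 / N(i * h)
    integral *= h
    return math.exp(-integral) / N(tau)
```

With a constant size the density reduces to an exponential with mean N, which gives a quick sanity check. Going the other way, from an observed p(tau) back to N(tau), is the unstable inversion the abstract refers to and is not attempted here.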
Cations Form Sequence Selective Motifs within DNA Grooves via a Combination of Cation-Pi and Ion-Dipole/Hydrogen Bond Interactions

PubMed Central

Stewart, Mikaela; Dunlap, Tori; Dourlain, Elizabeth; Grant, Bryce; McFail-Isom, Lori

2013-01-01

The fine conformational subtleties of DNA structure modulate many fundamental cellular processes including gene activation/repression, cellular division, and DNA repair. Most of these cellular processes rely on the conformational heterogeneity of specific DNA sequences. Factors including those structural characteristics inherent in the particular base sequence as well as those induced through interaction with solvent components combine to produce fine DNA structural variation including helical flexibility and conformation. Cation-pi interactions between solvent cations or their first hydration shell waters and the faces of DNA bases form sequence selectively and contribute to DNA structural heterogeneity. In this paper, we detect and characterize the binding patterns found in cation-pi interactions between solvent cations and DNA bases in a set of high resolution x-ray crystal structures. Specifically, we found that monovalent cations (Tl⁺) and the polarized first hydration shell waters of divalent cations (Mg²⁺, Ca²⁺) form cation-pi interactions with DNA bases stabilizing unstacked conformations. When these cation-pi interactions are combined with electrostatic interactions a pattern of specific binding motifs is formed within the grooves. PMID:23940752
Gene Identification Algorithms Using Exploratory Statistical Analysis of Periodicity

NASA Astrophysics Data System (ADS)

Mukherjee, Shashi Bajaj; Sen, Pradip Kumar

2010-10-01

Studying periodic patterns is a natural line of attack for gene identification and similar DNA sequence recognition problems, yet surprisingly little significant work has been done in this direction. This paper studies statistical properties of the DNA sequences of complete genomes using a new technique. A DNA sequence is converted to a numeric sequence using various types of mappings, and a standard Fourier technique is applied to study the periodicity. Distinct statistical behaviour of periodicity parameters is found in coding and non-coding sequences, which can be used to distinguish between these parts. Here DNA sequences of Drosophila melanogaster were analyzed with significant accuracy.
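The mapping-plus-Fourier step can be illustrated with a small sketch. It assumes the binary indicator mapping (one 0/1 sequence per base), which is only one of the possible mappings, and it evaluates the discrete Fourier transform at a single frequency. The record does not specify the authors' mappings or periodicity statistics, so the function names and the normalisation are illustrative.

```python
import cmath

def indicator(seq, base):
    """Numeric mapping: 1.0 where seq has the given base, else 0.0."""
    return [1.0 if ch == base else 0.0 for ch in seq]

def power_at_period(seq, period):
    """Summed DFT power of the four indicator sequences at frequency 1/period,
    divided by the sequence length."""
    n = len(seq)
    total = 0.0
    for base in "ACGT":
        x = indicator(seq, base)
        coeff = sum(v * cmath.exp(-2j * cmath.pi * i / period)
                    for i, v in enumerate(x))
        total += abs(coeff) ** 2
    return total / n
```

A strong value at period 3 relative to neighbouring periods is the widely reported signature of protein-coding regions, which is why this statistic can help separate coding from non-coding sequence.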
Counting Patterns in Degenerated Sequences

NASA Astrophysics Data System (ADS)

Nuel, Grégory

Biological sequences such as DNA or proteins are always obtained through a sequencing process which might produce some uncertainty. As a result, such sequences are usually written in a degenerated alphabet where some symbols may correspond to several possible letters (e.g. the IUPAC DNA alphabet). When counting patterns in such degenerated sequences, the question that naturally arises is how to deal with degenerated positions. Since most (usually 99%) of the positions are not degenerated, it is considered harmless to discard the degenerated positions in order to get an observation, but the exact consequences of such a practice are unclear. In this paper, we introduce a rigorous method to take into account the uncertainty of sequencing for biological sequences (DNA, proteins). We first introduce a Forward-Backward approach to compute the marginal distribution of the constrained sequence and use it both to perform an Expectation-Maximization estimation of parameters and to derive a heterogeneous Markov distribution for the constrained sequence. This distribution is then used along with known DFA-based pattern approaches to obtain the exact distribution of the pattern count under the constraints. As an illustration, we consider an EST dataset from the EMBL database. Despite the fact that only 1% of the positions in this dataset are degenerated, we show that not taking into account these positions might lead to erroneous observations, further proving the interest of our approach.
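To make the problem concrete, the sketch below contrasts the usual practice of discarding degenerated windows with a naive expected count. It is a deliberate simplification, not the paper's method: each degenerated symbol is assumed to be uniform over its IUPAC letters and independent of its neighbours, whereas the authors use a Markov model with Forward-Backward and DFA-based computation. The function names are hypothetical.

```python
# IUPAC nucleotide codes and the letters each one may stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def expected_count(seq, pattern):
    """Expected number of pattern occurrences, assuming each degenerated
    symbol is uniform over its possible letters (illustrative assumption)."""
    m = len(pattern)
    total = 0.0
    for i in range(len(seq) - m + 1):
        prob = 1.0
        for sym, letter in zip(seq[i:i + m], pattern):
            options = IUPAC[sym]
            prob *= 1.0 / len(options) if letter in options else 0.0
        total += prob
    return total

def count_discarding_degenerate(seq, pattern):
    """Count exact matches only, ignoring windows with degenerated symbols."""
    m = len(pattern)
    return sum(1 for i in range(len(seq) - m + 1) if seq[i:i + m] == pattern)
```

The gap between the two numbers on the same input gives a rough sense of how much information the discarding practice throws away.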
DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation.

PubMed

Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H; Proukakis, Christos

2017-01-01

Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array "waves", and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance.
DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation

PubMed Central

Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M.; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H.

2017-01-01

Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array “waves”, and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance. PMID:28683077
Kangaroo – A pattern-matching program for biological sequences

PubMed Central

2002-01-01

Background Biologists are often interested in performing a simple database search to identify proteins or genes that contain a well-defined sequence pattern. Many databases do not provide straightforward or readily available query tools to perform simple searches, such as identifying transcription binding sites, protein motifs, or repetitive DNA sequences. However, in many cases simple pattern-matching searches can reveal a wealth of information. We present in this paper a regular expression pattern-matching tool that was used to identify short repetitive DNA sequences in human coding regions for the purpose of identifying potential mutation sites in mismatch repair deficient cells. Results Kangaroo is a web-based regular expression pattern-matching program that can search for patterns in DNA, protein, or coding region sequences in ten different organisms. The program is implemented to facilitate a wide range of queries with no restriction on the length or complexity of the query expression. The program is accessible on the web at http://bioinfo.mshri.on.ca/kangaroo/ and the source code is freely distributed at http://sourceforge.net/projects/slritools/. Conclusion A low-level simple pattern-matching application can prove to be a useful tool in many research settings. For example, Kangaroo was used to identify potential genetic targets in a human colorectal cancer variant that is characterized by a high frequency of mutations in coding regions containing mononucleotide repeats. PMID:12150718
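The kind of query described in this record, finding mononucleotide repeats with a regular expression, can be sketched in a few lines. This uses Python's standard `re` module rather than Kangaroo itself, whose query syntax is not shown in the abstract; the function name and the default minimum length are illustrative choices.

```python
import re

def mononucleotide_repeats(seq, min_len=6):
    """Return (start, base, length) for each run of a single base
    that is at least min_len long."""
    # ([ACGT]) captures one base; \1{n,} requires n or more further copies.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_len - 1))
    return [(m.start(), m.group(1), len(m.group(0)))
            for m in pattern.finditer(seq)]
```

Applied to coding sequences, the reported runs are candidate sites for the repeat-associated mutations mentioned in the conclusion.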
Regulatory sequence analysis tools.

PubMed

van Helden, Jacques

2003-07-01

The web resource Regulatory Sequence Analysis Tools (RSAT) (http://rsat.ulb.ac.be/rsat) offers a collection of software tools dedicated to the prediction of regulatory sites in non-coding DNA sequences. These tools include sequence retrieval, pattern discovery, pattern matching, genome-scale pattern matching, feature-map drawing, random sequence generation and other utilities. Alternative formats are supported for the representation of regulatory motifs (strings or position-specific scoring matrices) and several algorithms are proposed for pattern discovery. RSAT currently holds >100 fully sequenced genomes and these data are regularly updated from GenBank.
Applications of alignment-free methods in epigenomics.

PubMed

Pinello, Luca; Lo Bosco, Giosuè; Yuan, Guo-Cheng

2014-05-01

Epigenetic mechanisms play an important role in the regulation of cell type-specific gene activities, yet how epigenetic patterns are established and maintained remains poorly understood. Recent studies have supported a role of DNA sequences in recruitment of epigenetic regulators. Alignment-free methods have been applied to identify distinct sequence features that are associated with epigenetic patterns and to predict epigenomic profiles. Here, we review recent advances in such applications, including the methods to map DNA sequence to feature space, sequence comparison and prediction models. Computational studies using these methods have provided important insights into the epigenetic regulatory mechanisms.
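One common way to map a DNA sequence to feature space without alignment is a k-mer frequency vector. The sketch below is a generic illustration of that idea; the review covers several feature maps and distance measures, and the names and the choice of Euclidean distance here are assumptions.

```python
import math
from collections import Counter
from itertools import product

def kmer_vector(seq, k=3):
    """Normalised k-mer frequencies in fixed lexicographic order (A, C, G, T)."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[km] for km in kmers)
    return [counts[km] / total if total else 0.0 for km in kmers]

def euclidean(u, v):
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
```

Vectors of this kind can be compared directly or passed to a classifier, which is how sequence features are typically linked to epigenomic profiles.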
Nonparametric Bayesian clustering to detect bipolar methylated genomic loci.

PubMed

Wu, Xiaowei; Sun, Ming-An; Zhu, Hongxiao; Xie, Hehuang

2015-01-16

With recent development in sequencing technology, a large number of genome-wide DNA methylation studies have generated massive amounts of bisulfite sequencing data. The analysis of DNA methylation patterns helps researchers understand epigenetic regulatory mechanisms. Highly variable methylation patterns reflect stochastic fluctuations in DNA methylation, whereas well-structured methylation patterns imply deterministic methylation events. Among these methylation patterns, bipolar patterns are important as they may originate from allele-specific methylation (ASM) or cell-specific methylation (CSM). Utilizing nonparametric Bayesian clustering followed by hypothesis testing, we have developed a novel statistical approach to identify bipolar methylated genomic regions in bisulfite sequencing data. Simulation studies demonstrate that the proposed method achieves good performance in terms of specificity and sensitivity. We used the method to analyze data from mouse brain and human blood methylomes. The bipolar methylated segments detected are found highly consistent with the differentially methylated regions identified by using purified cell subsets. Bipolar DNA methylation often indicates epigenetic heterogeneity caused by ASM or CSM. With allele-specific events filtered out or appropriately taken into account, our proposed approach sheds light on the identification of cell-specific genes/pathways under strong epigenetic control in a heterogeneous cell population.
Methylation pattern of fish lymphocystis disease virus DNA.

PubMed

Wagner, H; Simon, D; Werner, E; Gelderblom, H; Darai, C; Flügel, R M

1985-03-01

The content and distribution of 5-methylcytosine in DNA from fish lymphocystis disease virus was analyzed by high-pressure liquid chromatography, nearest-neighbor analysis, and with restriction endonucleases. We found that 22% of all C residues were methylated, including methylation of the following dinucleotide sequences: CpG to 75%, CpC to ca. 1%, and CpA to 2 to 5%. Comparison of relative digestion of viral DNA with MspI and HpaII indicated that CCGG sequences were almost completely methylated at the inner C. The degree of methylation of GCGC was much lower. The methylation pattern of fish lymphocystis disease virus DNA differed from that of the host cell DNA.
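The MspI/HpaII comparison rests on both enzymes recognising CCGG, with HpaII blocked when the inner C is methylated and MspI cutting regardless of inner-C methylation. A minimal in-silico sketch of that comparison follows, assuming a linear molecule and cleavage between the two Cs; the function names and the way methylated sites are passed in are illustrative.

```python
def ccgg_sites(seq):
    """0-based start positions of every CCGG site."""
    return [i for i in range(len(seq) - 3) if seq[i:i + 4] == "CCGG"]

def fragment_lengths(seq, methylated_inner_c, enzyme):
    """Fragment lengths after digesting a linear molecule.

    methylated_inner_c is a set of site start positions whose inner C is
    methylated. HpaII skips those sites; MspI cuts them anyway.
    """
    cuts = []
    for i in ccgg_sites(seq):
        if enzyme == "HpaII" and i in methylated_inner_c:
            continue
        cuts.append(i + 1)  # cleavage between the two Cs: C^CGG
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]
```

Sites that are cut by MspI but left intact by HpaII are the ones inferred to carry inner-C methylation, which is the logic behind the digestion comparison in the abstract.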
Methylation pattern of fish lymphocystis disease virus DNA.

PubMed Central

Wagner, H; Simon, D; Werner, E; Gelderblom, H; Darai, C; Flügel, R M

1985-01-01

The content and distribution of 5-methylcytosine in DNA from fish lymphocystis disease virus was analyzed by high-pressure liquid chromatography, nearest-neighbor analysis, and with restriction endonucleases. We found that 22% of all C residues were methylated, including methylation of the following dinucleotide sequences: CpG to 75%, CpC to ca. 1%, and CpA to 2 to 5%. Comparison of relative digestion of viral DNA with MspI and HpaII indicated that CCGG sequences were almost completely methylated at the inner C. The degree of methylation of GCGC was much lower. The methylation pattern of fish lymphocystis disease virus DNA differed from that of the host cell DNA. PMID:3973962

Winnowing DNA for Rare Sequences: Highly Specific Sequence and Methylation Based Enrichment

PubMed Central

Thompson, Jason D.; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

2012-01-01

Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue. PMID:22355378
BiQ Analyzer HT: locus-specific analysis of DNA methylation by high-throughput bisulfite sequencing

PubMed Central

Lutsik, Pavlo; Feuerbach, Lars; Arand, Julia; Lengauer, Thomas; Walter, Jörn; Bock, Christoph

2011-01-01

Bisulfite sequencing is a widely used method for measuring DNA methylation in eukaryotic genomes. The assay provides single-base pair resolution and, given sufficient sequencing depth, its quantitative accuracy is excellent. High-throughput sequencing of bisulfite-converted DNA can be applied either genome wide or targeted to a defined set of genomic loci (e.g. using locus-specific PCR primers or DNA capture probes). Here, we describe BiQ Analyzer HT (http://biq-analyzer-ht.bioinf.mpi-inf.mpg.de/), a user-friendly software tool that supports locus-specific analysis and visualization of high-throughput bisulfite sequencing data. The software facilitates the shift from time-consuming clonal bisulfite sequencing to the more quantitative and cost-efficient use of high-throughput sequencing for studying locus-specific DNA methylation patterns. In addition, it is useful for locus-specific visualization of genome-wide bisulfite sequencing data. PMID:21565797
Direct bisulfite sequencing for examination of DNA methylation with gene and nucleotide resolution from brain tissues.

PubMed

Parrish, R Ryley; Day, Jeremy J; Lubin, Farah D

2012-07-01

DNA methylation is an epigenetic modification that is essential for the development and mature function of the central nervous system. Due to the relevance of this modification to the transcriptional control of gene expression, it is often necessary to examine changes in DNA methylation patterns with both gene and single-nucleotide resolution. Here, we describe an in-depth basic protocol for direct bisulfite sequencing of DNA isolated from brain tissue, which will permit direct assessment of methylation status at individual genes as well as individual cytosine molecules/nucleotides within a genomic region. This method yields analysis of DNA methylation patterns that is robust, accurate, and reproducible, thereby allowing insights into the role of alterations in DNA methylation in brain tissue.
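Reading methylation status from a bisulfite sequence comes down to comparing it with the unconverted reference: an unmethylated C reads as T after conversion and amplification, while a methylated C stays C. The sketch below assumes a read already aligned to the reference on the same strand with equal length, and it assumes non-CpG cytosines are unmethylated so that they can serve as a conversion control. It is an illustration only, not the protocol's analysis pipeline, and the function name is hypothetical.

```python
def call_methylation(reference, read):
    """Per-CpG methylation calls plus the non-CpG conversion rate.

    Returns (calls, rate): calls maps each CpG cytosine position to
    'methylated', 'unmethylated' or 'ambiguous'; rate is the fraction of
    non-CpG cytosines read as T, or None if there are none.
    """
    if len(reference) != len(read):
        raise ValueError("reference and read must be aligned to equal length")
    calls = {}
    converted = 0
    total_non_cpg = 0
    for i, (ref, obs) in enumerate(zip(reference, read)):
        if ref != "C":
            continue
        is_cpg = i + 1 < len(reference) and reference[i + 1] == "G"
        if is_cpg:
            if obs == "C":
                calls[i] = "methylated"
            elif obs == "T":
                calls[i] = "unmethylated"
            else:
                calls[i] = "ambiguous"
        elif obs in ("C", "T"):
            total_non_cpg += 1
            converted += 1 if obs == "T" else 0
    rate = converted / total_non_cpg if total_non_cpg else None
    return calls, rate
```

A low conversion rate suggests incomplete bisulfite treatment, in which case retained Cs at CpG sites should not be taken as evidence of methylation.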
Complexity and Entropy Analysis of DNMT1 Gene

USDA-ARS's Scientific Manuscript database

Background: The application of complexity information on DNA sequence and protein in biological processes is well established in this study. Available sequences for the DNMT1 gene, a maintenance methyltransferase responsible for copying DNA methylation patterns to the daughter strands durin...
Nonneutral mitochondrial DNA variation in humans and chimpanzees

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nachman, M.W.; Aquadro, C.F.; Brown, W.M.

1996-03-01

We sequenced the NADH dehydrogenase subunit 3 (ND3) gene from a sample of 61 humans, five common chimpanzees, and one gorilla to test whether patterns of mitochondrial DNA (mtDNA) variation are consistent with a neutral model of molecular evolution. Within humans and within chimpanzees, the ratio of replacement to silent nucleotide substitutions was higher than observed in comparisons between species, contrary to neutral expectations. To test the generality of this result, we reanalyzed published human RFLP data from the entire mitochondrial genome. Gains of restriction sites relative to a known human mtDNA sequence were used to infer unambiguous nucleotide substitutions. We also compared the complete mtDNA sequences of three humans. Both the RFLP data and the sequence data reveal a higher ratio of replacement to silent nucleotide substitutions within humans than is seen between species. This pattern is observed at most or all human mitochondrial genes and is inconsistent with a strictly neutral model. These data suggest that many mitochondrial protein polymorphisms are slightly deleterious, consistent with studies of human mitochondrial diseases. 59 refs., 2 figs., 8 tabs.
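A comparison of replacement-to-silent ratios within and between species is naturally laid out as a 2x2 table, and one standard way to test such a table is Fisher's exact test. The abstract does not state which test the authors used, so the sketch below is only an illustration of the general approach, with a hypothetical function name; any counts passed to it would be the reader's own, not figures from the paper.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the table [[a, b], [c, d]].

    For example: rows = replacement / silent changes,
    columns = polymorphic within species / fixed between species.
    """
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        # Hypergeometric probability of x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum all tables at least as unlikely as the observed one.
    return sum(prob(x) for x in range(lo, hi + 1)
               if prob(x) <= p_obs * (1 + 1e-9))
```

A small p-value would indicate that the replacement-to-silent ratio differs between the within-species and between-species classes, which is the pattern the abstract reports as inconsistent with strict neutrality.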
GENETIC STRUCTURE OF CREEK CHUB (SEMOTILUS ATROMACULATUS) POPULATIONS IN COAL MINING-IMPACTED AREAS OF THE EASTERN UNITED STATES, AS DETERMINED BY MTDNA SEQUENCING AND AFLP ANALYSIS

EPA Science Inventory

Analysis of intraspecific patterns in genetic diversity of stream fishes provides a potentially powerful method for assessing the status and trends in the condition of aquatic ecosystems. We analyzed mitochondrial DNA (mtDNA) sequences (590 bases of cytochrome B) and nuclear DNA...
Regional differences in mitochondrial DNA methylation in human post-mortem brain tissue.

PubMed

Devall, Matthew; Smith, Rebecca G; Jeffries, Aaron; Hannon, Eilis; Davies, Matthew N; Schalkwyk, Leonard; Mill, Jonathan; Weedon, Michael; Lunnon, Katie

2017-01-01

DNA methylation is an important epigenetic mechanism involved in gene regulation, with alterations in DNA methylation in the nuclear genome being linked to numerous complex diseases. Mitochondrial DNA methylation is a phenomenon that is receiving ever-increasing interest, particularly in diseases characterized by mitochondrial dysfunction; however, most studies have been limited to the investigation of specific target regions. Analyses spanning the entire mitochondrial genome have been limited, potentially due to the amount of input DNA required. Further, mitochondrial genetic studies have been previously confounded by nuclear-mitochondrial pseudogenes. Methylated DNA Immunoprecipitation Sequencing is a technique widely used to profile DNA methylation across the nuclear genome; however, reads mapped to mitochondrial DNA are often discarded. Here, we have developed an approach to control for nuclear-mitochondrial pseudogenes within Methylated DNA Immunoprecipitation Sequencing data. We highlight the utility of this approach in identifying differences in mitochondrial DNA methylation across regions of the human brain and pre-mortem blood. We were able to correlate mitochondrial DNA methylation patterns between the cortex, cerebellum and blood. We identified 74 nominally significant differentially methylated regions (p < 0.05) in the mitochondrial genome, between anatomically separate cortical regions and the cerebellum in matched samples (N = 3 matched donors). Further analysis identified eight significant differentially methylated regions between the total cortex and cerebellum after correcting for multiple testing. Using unsupervised hierarchical clustering analysis of the mitochondrial DNA methylome, we were able to identify tissue-specific patterns of mitochondrial DNA methylation between blood, cerebellum and cortex. Our study represents a comprehensive analysis of the mitochondrial methylome using pre-existing Methylated DNA Immunoprecipitation Sequencing data to identify brain region-specific patterns of mitochondrial DNA methylation.
Ancient DNA sequence revealed by error-correcting codes.

PubMed

Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo

2015-07-10

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.
Ancient DNA sequence revealed by error-correcting codes

PubMed Central

Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo

2015-01-01

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228
Single-strand conformation polymorphism (SSCP)-based mutation scanning approaches to fingerprint sequence variation in ribosomal DNA of ascaridoid nematodes.

PubMed

Zhu, X Q; Gasser, R B

1998-06-01

In this study, we assessed single-strand conformation polymorphism (SSCP)-based approaches for their capacity to fingerprint sequence variation in ribosomal DNA (rDNA) of ascaridoid nematodes of veterinary and/or human health significance. The second internal transcribed spacer region (ITS-2) of rDNA was utilised as the target region because it is known to provide species-specific markers for this group of parasites. ITS-2 was amplified by PCR from genomic DNA derived from individual parasites and subjected to analysis. Direct SSCP analysis of amplicons from seven taxa (Toxocara vitulorum, Toxocara cati, Toxocara canis, Toxascaris leonina, Baylisascaris procyonis, Ascaris suum and Parascaris equorum) showed that the single-strand (ss) ITS-2 patterns produced allowed their unequivocal identification to species. While no variation in SSCP patterns was detected in the ITS-2 within four species for which multiple samples were available, the method allowed the direct display of four distinct sequence types of ITS-2 among individual worms of T. cati. Comparison of SSCP/sequencing with the methods of dideoxy fingerprinting (ddF) and restriction endonuclease fingerprinting (REF) revealed that also ddF allowed the definition of the four sequence types, whereas REF displayed three of four. The findings indicate the usefulness of the SSCP-based approaches for the identification of ascaridoid nematodes to species, the direct display of sequence variation in rDNA and the detection of population variation. The ability to fingerprint microheterogeneity in ITS-2 rDNA using such approaches also has implications for studying fundamental aspects relating to mutational change in rDNA.
Mitochondrial DNA mutations in single human blood cells.

PubMed

Yao, Yong-Gang; Kajigaya, Sachiko; Young, Neal S

2015-09-01

Determination of mitochondrial DNA (mtDNA) sequences from extremely small amounts of DNA extracted from tissue of limited amounts and/or degraded samples is frequently employed in medical, forensic, and anthropologic studies. Polymerase chain reaction (PCR) amplification followed by DNA cloning is a routine method, especially to examine heteroplasmy of mtDNA mutations. In this review, we compare the mtDNA mutation patterns detected by three different sequencing strategies. Cloning and sequencing methods that are based on PCR amplification of DNA extracted from either single cells or pooled cells yield a high frequency of mutations, partly due to the artifacts introduced by PCR and/or the DNA cloning process. Direct sequencing of PCR product which has been amplified from DNA in individual cells is able to detect the low levels of mtDNA mutations present within a cell. We further summarize the findings in our recent studies that utilized this single cell method to assay mtDNA mutation patterns in different human blood cells. Our data show that many somatic mutations observed in the end-stage differentiated cells are found in hematopoietic stem cells (HSCs) and progenitors within the CD34(+) cell compartment. Accumulation of mtDNA variations in the individual CD34(+) cells is affected by both aging and family genetic background. Granulocytes harbor higher numbers of mutations compared with the other cells, such as CD34(+) cells and lymphocytes. Serial assessment of mtDNA mutations in a population of single CD34(+) cells obtained from the same donor over time suggests stability of some somatic mutations. CD34(+) cell clones from a donor marked by specific mtDNA somatic mutations can be found in the recipient after transplantation. The significance of these findings is discussed in terms of the lineage tracing of HSCs, aging effect on accumulation of mtDNA mutations and the usage of mtDNA sequence in forensic identification. Copyright © 2015 Elsevier B.V. All rights reserved.
Reduced representation bisulphite sequencing of the ten bovine somatic tissues reveals DNA methylation patterns

USDA-ARS's Scientific Manuscript database

As a major component of epigenetics, DNA methylation has been shown to function widely in individual development and various diseases. It has been well studied in model organisms and humans, but data for economically important animals are limited. Using reduced representation bisulphite sequencing (RRBS),...
Sequence-Dependent Diastereospecific and Diastereodivergent Crosslinking of DNA by Decarbamoylmitomycin C.

PubMed

Aguilar, William; Paz, Manuel M; Vargas, Anayatzinc; Clement, Cristina C; Cheng, Shu-Yuan; Champeil, Elise

2018-04-20

Mitomycin C (MC), a potent antitumor drug, and decarbamoylmitomycin C (DMC), a derivative lacking the carbamoyl group, form highly cytotoxic DNA interstrand crosslinks. The major interstrand crosslink formed by DMC is the C1'' epimer of the major crosslink formed by MC. The molecular basis for the stereochemical configuration exhibited by DMC was investigated using biomimetic synthesis. The formation of DNA-DNA crosslinks by DMC is diastereospecific and diastereodivergent: only the 1''S-diastereomer of the initially formed monoadduct can form crosslinks at GpC sequences, and only the 1''R-diastereomer of the monoadduct can form crosslinks at CpG sequences. We also show that CpG and GpC sequences react with divergent diastereoselectivity in the first alkylation step: 1''S stereochemistry is favored at GpC sequences and 1''R stereochemistry is favored at CpG sequences. Therefore, the first alkylation step results, at each sequence, in the selective formation of the diastereomer able to generate an interstrand DNA-DNA crosslink after the "second arm" alkylation. Examination of the known DNA adduct pattern obtained after treatment of cancer cell cultures with DMC indicates that the GpC sequence is the major target for the formation of DNA-DNA crosslinks in vivo by this drug. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Negatively supercoiled simian virus 40 DNA contains Z-DNA segments within transcriptional enhancer sequences

NASA Technical Reports Server (NTRS)

Nordheim, A.; Rich, A.

1983-01-01

Three 8-base pair (bp) segments of alternating purine-pyrimidine from the simian virus 40 enhancer region form Z-DNA on negative supercoiling; minichromosome DNase I-hypersensitive sites determined by others bracket these three segments. A survey of transcriptional enhancer sequences reveals a pattern of potential Z-DNA-forming regions which occur in pairs 50-80 bp apart. This may influence local chromatin structure and may be related to transcriptional activation.
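The sequence feature described in this abstract, runs of alternating purines and pyrimidines at least 8 bp long, is simple to scan for. The sketch below is only an illustration and not the authors' method: the function names and the `min_len` parameter are hypothetical, and alternation alone does not show that a segment actually adopts the Z conformation.

```python
PURINES = set("AG")
PYRIMIDINES = set("CT")


def _alternates(a, b):
    """True if one base is a purine and the other a pyrimidine."""
    return (a in PURINES and b in PYRIMIDINES) or (a in PYRIMIDINES and b in PURINES)


def alternating_pu_py_runs(seq, min_len=8):
    """Find maximal runs of strictly alternating purine/pyrimidine bases.

    Returns a list of (start, end, subsequence) tuples, where start is
    0-based and end is exclusive. Bases other than A, C, G, T break a run.
    """
    seq = seq.upper()
    runs = []
    start = 0
    for i in range(1, len(seq) + 1):
        # A run ends at the end of the sequence or where two neighbours
        # fail to alternate between the purine and pyrimidine classes.
        if i == len(seq) or not _alternates(seq[i - 1], seq[i]):
            if i - start >= min_len:
                runs.append((start, i, seq[start:i]))
            start = i
    return runs
```

For example, `alternating_pu_py_runs("AAACGCGCGCGTTT")` reports the single 10-bp run `ACGCGCGCGT` starting at index 2. Checking whether such runs occur in pairs 50-80 bp apart would be a further step on top of this output.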
Distinct DNA methylation patterns associated with active and inactive centromeres of the maize B chromosome.

PubMed

Koo, Dal-Hoe; Han, Fangpu; Birchler, James A; Jiang, Jiming

2011-06-01

Centromeres are determined by poorly understood epigenetic mechanisms. Centromeres can be activated or inactivated without changing the underlying DNA sequences. However, virtually nothing is known about the epigenetic transition of a centromere from an active to an inactive state because of the lack of examples of the same centromere exhibiting alternative forms and being distinguishable from other centromeres. The centromere of the supernumerary B chromosome of maize provides such an opportunity because its functional core can be cytologically tracked, and an inactive version of the centromere is available. We developed a DNA fiber-based technique that can be used to assess the levels of cytosine methylation associated with repetitive DNA sequences. We report that DNA sequences in the normal B centromere exhibit hypomethylation. This methylation pattern is not affected by the genetic background or structural rearrangement of the B chromosome, but is slightly changed when the B chromosome is transferred to oat as an addition chromosome. In contrast, an inactive version of this same centromere exhibits hypermethylation, indicating that the inactive centromere was modified into a different epigenetic state at the DNA level.
Early history of European domestic cattle as revealed by ancient DNA.

PubMed

Bollongino, R; Edwards, C J; Alt, K W; Burger, J; Bradley, D G

2006-03-22

We present an extensive ancient DNA analysis of mainly Neolithic cattle bones sampled from archaeological sites along the route of Neolithic expansion, from Turkey to North-Central Europe and Britain. We place this first reasonable population sample of Neolithic cattle mitochondrial DNA sequence diversity in context to illustrate the continuity of haplotype variation patterns from the first European domestic cattle to the present. Interestingly, the dominant Central European pattern, a starburst phylogeny around the modal sequence, T3, has a Neolithic origin, and the reduced diversity within this cluster in the ancient samples accords with their shorter history of post-domestic accumulation of mutation.
Multiplexed Sequence Encoding: A Framework for DNA Communication.

PubMed

Zakeri, Bijan; Carr, Peter A; Lu, Timothy K

2016-01-01

Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication (data encoding, data transfer, and data extraction) and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system, Multiplexed Sequence Encoding (MuSE), that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA.
The Gene Construction Kit: a new computer program for manipulating and presenting DNA constructs.

PubMed

Gross, R H

1990-06-01

The Gene Construction Kit is a new tool for manipulating and displaying DNA sequence information. Constructs can be displayed either graphically or as formatted sequence. Segments of DNA can be cut out with restriction enzymes and pasted into other sites. The program keeps track of staggered ends and notifies the user of incompatibilities and offers a choice of ligation options. Each segment of a construct can have its own defined thickness, pattern, direction and color. The sequence listing can be displayed in any font and style in user defined grouping. Nucleotide positions can be displayed as can restriction sites and protein sequences. The DNA can be displayed as either single- or double-stranded. Restriction sites can be readily marked. Alternative views of the DNA can be maintained and the history of the construct automatically stored. Gel electrophoresis patterns can be generated and can be used in cloning project design. Extensive comments can be stored with the construct and can be searched rapidly for key words. High quality illustrations showing multiple editable constructs with added graphics and text information can be generated for slides, posters or publication.
A compositional segmentation of the human mitochondrial genome is related to heterogeneities in the guanine mutation rate

PubMed Central

Samuels, David C.; Boys, Richard J.; Henderson, Daniel A.; Chinnery, Patrick F.

2003-01-01

We applied a hidden Markov model segmentation method to the human mitochondrial genome to identify patterns in the sequence, to compare these patterns to the gene structure of mtDNA and to see whether these patterns reveal additional characteristics important for our understanding of genome evolution, structure and function. Our analysis identified three segmentation categories based upon the sequence transition probabilities. Category 2 segments corresponded to the tRNA and rRNA genes, with a greater strand-symmetry in these segments. Category 1 and 3 segments covered the protein-coding genes and almost all of the non-coding D-loop. Compared to category 1, the mtDNA segments assigned to category 3 had much lower guanine abundance. A comparison to two independent databases of mitochondrial mutations and polymorphisms showed that the high substitution rate of guanine in human mtDNA is largest in the category 3 segments. Analysis of synonymous mutations showed the same pattern. This suggests that this heterogeneity in the mutation rate is partly independent of respiratory chain function and is a direct property of the genome sequence itself. This has important implications for our understanding of mtDNA evolution and its use as a ‘molecular clock’ to determine the rate of population and species divergence. PMID:14530452
Mitochondrial DNA heteroplasmy in the emerging field of massively parallel sequencing

PubMed Central

Just, Rebecca S.; Irwin, Jodi A.; Parson, Walther

2015-01-01

Long an important and useful tool in forensic genetic investigations, mitochondrial DNA (mtDNA) typing continues to mature. Research in the last few years has demonstrated both that data from the entire molecule will have practical benefits in forensic DNA casework, and that massively parallel sequencing (MPS) methods will make full mitochondrial genome (mtGenome) sequencing of forensic specimens feasible and cost-effective. A spate of recent studies has employed these new technologies to assess intraindividual mtDNA variation. However, in several instances, contamination and other sources of mixed mtDNA data have been erroneously identified as heteroplasmy. Well vetted mtGenome datasets based on both Sanger and MPS sequences have found authentic point heteroplasmy in approximately 25% of individuals when minor component detection thresholds are in the range of 10–20%, along with positional distribution patterns in the coding region that differ from patterns of point heteroplasmy in the well-studied control region. A few recent studies that examined very low-level heteroplasmy are concordant with these observations when the data are examined at a common level of resolution. In this review we provide an overview of considerations related to the use of MPS technologies to detect mtDNA heteroplasmy. In addition, we examine published reports on point heteroplasmy to characterize features of the data that will assist in the evaluation of future mtGenome data developed by any typing method. PMID:26009256

The Control Region of Mitochondrial DNA Shows an Unusual CpG and Non-CpG Methylation Pattern

PubMed Central

Bellizzi, Dina; D'Aquila, Patrizia; Scafone, Teresa; Giordano, Marco; Riso, Vincenzo; Riccio, Andrea; Passarino, Giuseppe

2013-01-01

DNA methylation is a common epigenetic modification of the mammalian genome. Conflicting data regarding the possible presence of methylated cytosines within mitochondrial DNA (mtDNA) have been reported. To clarify this point, we analysed the methylation status of the mtDNA control region (D-loop) in human and murine DNA samples from blood and cultured cells by bisulphite sequencing and methylated/hydroxymethylated DNA immunoprecipitation assays. We found methylated and hydroxymethylated cytosines in the L-strand of all samples analysed. MtDNA methylation particularly occurs within non-C-phosphate-G (non-CpG) nucleotides, mainly in the promoter region of the heavy strand and in conserved sequence blocks, suggesting its involvement in regulating mtDNA replication and/or transcription. We observed DNA methyltransferases within the mitochondria, but the inactivation of Dnmt1, Dnmt3a, and Dnmt3b in mouse embryonic stem (ES) cells results in a reduction of the CpG methylation, while the non-CpG methylation appears to be unaffected. This suggests that D-loop epigenetic modification is only partially established by these enzymes. Our data show that DNA methylation occurs in the mtDNA control region of mammals, not only at symmetrical CpG dinucleotides, typical of the nuclear genome, but also in a peculiar non-CpG pattern previously reported for plants and fungi. The molecular mechanisms responsible for this pattern remain an open question. PMID:23804556
Nucleosome Positioning and Epigenetics

NASA Astrophysics Data System (ADS)

Schwab, David; Bruinsma, Robijn

2008-03-01

The role of chromatin structure in gene regulation has recently taken center stage in the field of epigenetics, phenomena that change the phenotype without changing the DNA sequence. Recent work has also shown that nucleosomes, a complex of DNA wrapped around a histone octamer, experience a sequence dependent energy landscape due to the variation in DNA bend stiffness with sequence composition. In this talk, we consider the role nucleosome positioning might play in the formation of heterochromatin, a compact form of DNA generically responsible for gene silencing. In particular, we discuss how different patterns of nucleosome positions, periodic or random, could either facilitate or suppress heterochromatin stability and formation.
qPMS9: An Efficient Algorithm for Quorum Planted Motif Search

NASA Astrophysics Data System (ADS)

Nicolae, Marius; Rajasekaran, Sanguthevar

2015-01-01

Discovering patterns in biological sequences is a crucial problem. For example, the identification of patterns in DNA sequences has resulted in the determination of open reading frames, identification of gene promoter elements, intron/exon splicing sites, and shRNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have led to domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, discovery of short functional motifs, etc. In this paper we focus on the identification of an important class of patterns, namely, motifs. We study the (l, d) motif search problem or Planted Motif Search (PMS). PMS receives as input n strings and two integers l and d. It returns all sequences M of length l that occur in each input string, where each occurrence differs from M in at most d positions. Another formulation is quorum PMS (qPMS), where the motif appears in at least q% of the strings. We introduce qPMS9, a parallel exact qPMS algorithm that offers significant runtime improvements on DNA and protein datasets. qPMS9 solves the challenging DNA (l, d)-instances (28, 12) and (30, 13). The source code is available at https://code.google.com/p/qpms9/.
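The (l, d) problem statement in this abstract can be made concrete with a naive enumeration. The sketch below is not qPMS9: it simply tries every string of length l over the alphabet, so it is only usable for very small l, and the function names and the default `alphabet` are assumptions made for illustration.

```python
from itertools import product


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def occurs(motif, s, d):
    """True if some window of s differs from motif in at most d positions."""
    l = len(motif)
    return any(hamming(motif, s[i:i + l]) <= d for i in range(len(s) - l + 1))


def quorum_pms(strings, l, d, q=100, alphabet="ACGT"):
    """Brute-force quorum planted motif search.

    Returns, in lexicographic order of the alphabet, every length-l string
    that occurs (with at most d mismatches) in at least q percent of the
    input strings. q=100 gives the plain PMS formulation.
    """
    motifs = []
    for candidate in product(alphabet, repeat=l):
        motif = "".join(candidate)
        hits = sum(occurs(motif, s, d) for s in strings)
        # Integer comparison avoids floating-point rounding on the quorum.
        if hits * 100 >= q * len(strings):
            motifs.append(motif)
    return motifs
```

With `strings = ["ACGTAC", "TTACGA", "GGACGT"]`, `quorum_pms(strings, 3, 0)` returns `["ACG"]`, the only exact 3-mer shared by all three strings, while lowering the quorum to `q=60` also admits `CGT` and `TAC`, which each appear in two of the three. The cost grows as |alphabet|^l, which is why instances such as (28, 12) need a dedicated algorithm.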
Separating endogenous ancient DNA from modern day contamination in a Siberian Neandertal

PubMed Central

Skoglund, Pontus; Northoff, Bernd H.; Shunkov, Michael V.; Derevianko, Anatoli P.; Pääbo, Svante; Krause, Johannes; Jakobsson, Mattias

2014-01-01

One of the main impediments for obtaining DNA sequences from ancient human skeletons is the presence of contaminating modern human DNA molecules in many fossil samples and laboratory reagents. However, DNA fragments isolated from ancient specimens show a characteristic DNA damage pattern caused by miscoding lesions that differs from present day DNA sequences. Here, we develop a framework for evaluating the likelihood of a sequence originating from a model with postmortem degradation—summarized in a postmortem degradation score—which allows the identification of DNA fragments that are unlikely to originate from present day sources. We apply this approach to a contaminated Neandertal specimen from Okladnikov Cave in Siberia to isolate its endogenous DNA from modern human contaminants and show that the reconstructed mitochondrial genome sequence is more closely related to the variation of Western Neandertals than what was discernible from previous analyses. Our method opens up the potential for genomic analysis of contaminated fossil material. PMID:24469802
Temporal patterns of damage and decay kinetics of DNA retrieved from plant herbarium specimens.

PubMed

Weiß, Clemens L; Schuenemann, Verena J; Devos, Jane; Shirsekar, Gautam; Reiter, Ella; Gould, Billie A; Stinchcombe, John R; Krause, Johannes; Burbano, Hernán A

2016-06-01

Herbaria archive a record of changes of worldwide plant biodiversity harbouring millions of specimens that contain DNA suitable for genome sequencing. To profit from this resource, it is fundamental to understand in detail the process of DNA degradation in herbarium specimens. We investigated patterns of DNA fragmentation and nucleotide misincorporation by analysing 86 herbarium samples spanning the last 300 years using Illumina shotgun sequencing. We found an exponential decay relationship between DNA fragmentation and time, and estimated a per nucleotide fragmentation rate of 1.66 × 10^-4 per year, which is six times faster than the rate estimated for ancient bones. Additionally, we found that strand breaks occur especially before purines, and that depurination-driven DNA breakage occurs constantly through time and can to a great extent explain decreasing fragment length over time. Similar to what has been found analysing ancient DNA from bones, we found a strong correlation between the deamination-driven accumulation of cytosine to thymine substitutions and time, which reinforces the importance of substitution patterns to authenticate the ancient/historical nature of DNA fragments. Accurate estimations of DNA degradation through time will allow informed decisions about laboratory and computational procedures to take advantage of the vast collection of worldwide herbarium specimens.
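The cytosine-to-thymine substitution pattern mentioned in this abstract is usually summarised as a per-position frequency measured from the 5' end of each read. The sketch below shows one minimal way to tabulate it. It is not the authors' pipeline: the function name, the `max_pos` parameter and the input format (ungapped, equal-length reference/read pairs oriented 5' to 3' along the read) are assumptions made for illustration.

```python
def c_to_t_by_position(pairs, max_pos=10):
    """Fraction of reference C bases read as T at each 5' position.

    pairs: iterable of (reference, read) strings of equal length, aligned
    without gaps and oriented 5'->3' along the read.
    Returns a list of length max_pos; entry i is the C->T fraction at
    0-based position i, or None if no reference C was seen there.
    """
    c_total = [0] * max_pos
    c_to_t = [0] * max_pos
    for ref, read in pairs:
        for i, (r, b) in enumerate(zip(ref.upper(), read.upper())):
            if i >= max_pos:
                break
            if r == "C":
                c_total[i] += 1
                if b == "T":
                    c_to_t[i] += 1
    return [ct / n if n else None for ct, n in zip(c_to_t, c_total)]
```

In damaged DNA the fraction is expected to be elevated at the first few positions and to fall off towards the interior of the read, so a flat profile near zero would argue against an old origin for the fragments.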
A new family of satellite DNA sequences as a major component of centromeric heterochromatin in owls (Strigiformes).

PubMed

Yamada, Kazuhiko; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2004-03-01

We isolated a new family of satellite DNA sequences from HaeIII- and EcoRI-digested genomic DNA of the Blakiston's fish owl (Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved in the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species.
Rényi continuous entropy of DNA sequences.

PubMed

Vinga, Susana; Almeida, Jonas S

2004-12-07

Entropy measures of DNA sequences estimate their randomness or, inversely, their repeatability. L-block Shannon discrete entropy accounts for the empirical distribution of all length-L words and has convergence problems for finite sequences. A new entropy measure that extends Shannon's formalism is proposed. Renyi's quadratic entropy, calculated with the Parzen window density estimation method applied to CGR/USM continuous maps of DNA sequences, constitutes a novel technique to evaluate sequence global randomness without some of the drawbacks of the former method. The asymptotic behaviour of this new measure was analytically deduced and the calculation of entropies for several synthetic and experimental biological sequences was performed. The results obtained were compared with the distributions of the null model of randomness obtained by simulation. The biological sequences have shown a different p-value according to the kernel resolution of Parzen's method, which might indicate an unknown level of organization of their patterns. This new technique can be very useful in the study of DNA sequence complexity and provide additional tools for DNA entropy estimation. The main MATLAB applications developed and additional material are available at the webpage . Specialized functions can be obtained from the authors.
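As a rough illustration of the quantity this abstract describes, the sketch below maps a sequence to chaos game representation (CGR) points and evaluates the standard Parzen-window estimator of Rényi quadratic entropy with an isotropic Gaussian kernel, H2 = -ln[(1/N^2) Σ_i Σ_j G(x_i - x_j; 2σ²I)]. It is not the authors' MATLAB code. The corner assignment, the starting point (0.5, 0.5), the default `sigma` and the function names are assumptions, and boundary effects of the unit square are ignored.

```python
import math

# Assumed corner convention for the CGR unit square.
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_points(seq):
    """Map a DNA sequence to CGR coordinates, one point per base.

    Each point is the midpoint between the previous point and the corner
    of the current base; bases other than A, C, G, T are skipped.
    """
    x, y = 0.5, 0.5
    points = []
    for base in seq.upper():
        if base not in CORNERS:
            continue
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2.0, (y + cy) / 2.0
        points.append((x, y))
    return points


def renyi_quadratic_entropy(points, sigma=0.1):
    """Parzen-window estimate of Renyi quadratic entropy for 2-D points.

    Uses a Gaussian kernel of variance sigma**2 per axis, so the pairwise
    term is a Gaussian of variance 2*sigma**2 evaluated at x_i - x_j.
    """
    n = len(points)
    if n == 0:
        raise ValueError("at least one point is required")
    norm = 1.0 / (4.0 * math.pi * sigma ** 2)
    total = 0.0
    for x1, y1 in points:
        for x2, y2 in points:
            d2 = (x1 - x2) ** 2 + (y1 - y2) ** 2
            total += norm * math.exp(-d2 / (4.0 * sigma ** 2))
    return -math.log(total / (n * n))
```

A homopolymer, whose CGR points collapse towards one corner, gives a lower value than a sequence whose points spread over the square. The double loop is quadratic in sequence length, so long sequences would need a vectorised or subsampled variant.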
Cloning and sequence analysis of Hemonchus contortus HC58cDNA.

PubMed

Muleke, Charles I; Ruofeng, Yan; Lixin, Xu; Xinwen, Bo; Xiangrui, Li

2007-06-01

The complete coding sequence of Hemonchus contortus HC58cDNA was generated by rapid amplification of cDNA ends and polymerase chain reaction using primers based on the 5' and 3' ends of the parasite mRNA, accession no. AF305964. The HC58cDNA gene was 851 bp long, with an open reading frame of 717 bp encoding a precursor of 239 amino acids, corresponding to a protein of approximately 27 kDa. Analysis of the amino acid sequence revealed conserved residues of cysteine, histidine, asparagine, occluding loop pattern, hemoglobinase motif and glutamine of the oxyanion hole characteristic of cathepsin B like proteases (CBL). Comparison of the predicted amino acid sequences showed the protein shared 33.5-58.7% identity to cathepsin B homologues in the papain clan CA family (family C1). Phylogenetic analysis revealed close evolutionary proximity of the protein sequence to counterpart sequences in the CBL, suggesting that HC58cDNA was a member of the papain family.
Evolution in the block: common elements of 5S rDNA organization and evolutionary patterns in distant fish genera.

PubMed

Campo, Daniel; García-Vázquez, Eva

2012-01-01

The 5S rDNA is organized in the genome as tandemly repeated copies of a structural unit composed of a coding sequence plus a nontranscribed spacer (NTS). The coding region is highly conserved in evolution, whereas the NTS varies in both length and sequence. It has been proposed that 5S rRNA genes are members of a gene family that has arisen through concerted evolution. In this study, we describe the molecular organization and evolution of the 5S rDNA in the genera Lepidorhombus and Scophthalmus (Scophthalmidae) and compare it with the already known 5S rDNA of the very different genera Merluccius (Merluccidae) and Salmo (Salmoninae), to identify common structural elements or patterns for understanding 5S rDNA evolution in fish. High intra- and interspecific diversity within the 5S rDNA family in all the genera can be explained by a combination of duplications, deletions, and transposition events. Sequence blocks with high similarity in all the 5S rDNA members across species were identified for the four studied genera, with evidence of intense gene conversion within noncoding regions. We propose a model to explain the evolution of the 5S rDNA, in which the evolutionary units are blocks of nucleotides rather than the entire sequences or single nucleotides. This model implies a "two-speed" evolution: slow within blocks (homogenized by recombination) and fast within the gene family (diversified by duplications and deletions).
Regulatory link between DNA methylation and active demethylation in Arabidopsis

PubMed Central

Lei, Mingguang; Zhang, Huiming; Julian, Russell; Tang, Kai; Xie, Shaojun; Zhu, Jian-Kang

2015-01-01

De novo DNA methylation through the RNA-directed DNA methylation (RdDM) pathway and active DNA demethylation play important roles in controlling genome-wide DNA methylation patterns in plants. Little is known about how cells manage the balance between DNA methylation and active demethylation activities. Here, we report the identification of a unique RdDM target sequence, where DNA methylation is required for maintaining proper active DNA demethylation of the Arabidopsis genome. In a genetic screen for cellular antisilencing factors, we isolated several REPRESSOR OF SILENCING 1 (ros1) mutant alleles, as well as many RdDM mutants, which showed drastically reduced ROS1 gene expression and, consequently, transcriptional silencing of two reporter genes. A helitron transposon element (TE) in the ROS1 gene promoter negatively controls ROS1 expression, whereas DNA methylation of an RdDM target sequence between ROS1 5′ UTR and the promoter TE region antagonizes this helitron TE in regulating ROS1 expression. This RdDM target sequence is also targeted by ROS1, and defective DNA demethylation in loss-of-function ros1 mutant alleles causes DNA hypermethylation of this sequence and concomitantly causes increased ROS1 expression. Our results suggest that this sequence in the ROS1 promoter region serves as a DNA methylation monitoring sequence (MEMS) that senses DNA methylation and active DNA demethylation activities. Therefore, the ROS1 promoter functions like a thermostat (i.e., methylstat) to sense DNA methylation levels and regulates DNA methylation by controlling ROS1 expression. PMID:25733903
Profiling the genome-wide DNA methylation pattern of porcine ovaries using reduced representation bisulfite sequencing.

PubMed

Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi

2016-02-25

Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found that hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation occurred in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5' and 3' flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed markedly from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome.
A symmetry model for genetic coding via a wallpaper group composed of the traditional four bases and an imaginary base E: towards category theory-like systematization of molecular/genetic biology.

PubMed

Sawamura, Jitsuki; Morishita, Shigeru; Ishigooka, Jun

2014-05-07

Previously, we suggested prototypal models that describe some clinical states based on group postulates. Here, we demonstrate a group/category theory-like model for molecular/genetic biology as an alternative application of our previous model. Specifically, we focus on deoxyribonucleic acid (DNA) base sequences. We construct a wallpaper pattern based on a five-letter cruciform motif with letters C, A, T, G, and E. Whereas the first four letters represent the standard DNA bases, the fifth is introduced for ease in formulating group operations that reproduce insertions and deletions of DNA base sequences. A basic group Z5 = {r, u, d, l, n} of operations is defined for the wallpaper pattern, with which a sequence of points can be generated corresponding to changes of a base in a DNA sequence by following the orbit of a point of the pattern under operations in group Z5. Other manipulations of DNA sequence can be treated using a vector-like notation 'Dj' corresponding to a DNA sequence but based on the five-letter base set; also, 'Dj's are expressed graphically. Insertions and deletions of a series of letters 'E' are admitted to assist in describing DNA recombination. Likewise, a vector-like notation Rj can be constructed for sequences of ribonucleic acid (RNA). The wallpaper group B = {Z5×∞, ●} (an ∞-fold Cartesian product of Z5) acts on Dj (or Rj) yielding changes to Dj (or Rj) denoted by 'Dj◦B(j→k) = Dk' (or 'Rj◦B(j→k) = Rk'). Based on the operations of this group, two types of groups-a modulo 5 linear group and a rotational group over the Gaussian plane, acting on the five bases-are linked as parts of the wallpaper group for broader applications. As a result, changes, insertions/deletions and DNA (RNA) recombination (partial/total conversion) are described. As an exploratory study, a notation for the canonical "central dogma" via a category theory-like way is presented for future developments. 
Despite the large incompleteness of our methodology, there is fertile ground to consider a symmetry model for genetic coding based on our specific wallpaper group. A more integrated formulation containing "central dogma" for future molecular/genetic biology remains to be explored.
Linear and Nonlinear Statistical Characterization of DNA

NASA Astrophysics Data System (ADS)

Norio Oiwa, Nestor; Goldman, Carla; Glazier, James

2002-03-01

We find spatial order in the distribution of protein-coding (including RNAs) and control segments of GenBank genomic sequences, irrespective of ATCG content. This is achieved by correlations, histograms, fractal dimensions and singularity spectra. Estimates of these quantities in complete nuclear genomes indicate that coding sequences are long-range correlated and that their disposition is self-similar (multifractal) in eukaryotes. These characteristics are absent in prokaryotes, where there are few noncoding sequences, suggesting that 'junk' DNA plays a relevant role in genome structure and function. Concerning the genetic message of ATCG sequences, we build a random walk (Levy flight), using DNA symmetry arguments, where we associate A, T, C and G with left, right, down and up steps, respectively. Nonlinear analysis of mitochondrial DNA walks reveals a multifractal pattern based on palindromic sequences, which fold into hairpins and loops.
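The base-to-step mapping described in this abstract can be illustrated with a minimal Python sketch. It is an assumption-laden illustration rather than the authors' method: it takes unit steps only (no Levy-flight step lengths), skips any symbol other than A, T, C and G, and the function name `dna_walk` is hypothetical.

```python
# Step directions as stated in the abstract: A left, T right, C down, G up.
STEPS = {"A": (-1, 0), "T": (1, 0), "C": (0, -1), "G": (0, 1)}


def dna_walk(seq):
    """Return the (x, y) positions visited by the walk, starting at the origin."""
    x, y = 0, 0
    positions = [(x, y)]
    for base in seq.upper():
        if base not in STEPS:
            continue  # assumption: ambiguous symbols such as N are skipped
        dx, dy = STEPS[base]
        x, y = x + dx, y + dy
        positions.append((x, y))
    return positions


if __name__ == "__main__":
    # GAATTC equals its own reverse complement, so its walk closes on itself.
    print(dna_walk("GAATTC"))
```

Because A/T and C/G are assigned opposite directions, any sequence that equals its own reverse complement has equal A and T counts and equal C and G counts, so its walk ends where it started.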
Comparative whole genome DNA methylation profiling of cattle sperm and somatic tissues reveals striking hypomethylated patterns in sperm

USDA-ARS's Scientific Manuscript database

Using whole-genome bisulfite sequencing (WGBS), we profiled the DNA methylome of cattle sperm through comparison with three bovine somatic tissues (mammary gland, brain and blood). Large differences between them were observed in the methylation patterns of global CpGs, pericentromeric satellites, p...
[Corn plant DNA methylation pattern changes upon fractional UV-C irradiation].

PubMed

Kravets, A P; Sokolova, D A; Vengzhen, G S; Grodzinskiĭ, D M

2013-01-01

The relationship between changes in the methylation pattern of functionally different parts of DNA and the yield of chromosomal aberrations was studied under fractionated UV-C irradiation. Restriction analysis (HpaII, MspI and MboI enzymes) was combined with subsequent PCR (internal transcribed spacer primers ITS1 and ITS4, and inter simple sequence repeat (ISSR) 14b primers). The results indicate changes in the methylation pattern of the satellite and transcriptionally active parts of DNA under fractionated irradiation, depending on the fractionation time intervals. The role of changes in the DNA methylation pattern in the development of radiation damage and the induction of protective reactions of the organism is discussed.
Analysis of a four generation family reveals the widespread sequence-dependent maintenance of allelic DNA methylation in somatic and germ cells

PubMed Central

Tang, Aifa; Huang, Yi; Li, Zesong; Wan, Shengqing; Mou, Lisha; Yin, Guangliang; Li, Ning; Xie, Jun; Xia, Yudong; Li, Xianxin; Luo, Liya; Zhang, Junwen; Chen, Shen; Wu, Song; Sun, Jihua; Sun, Xiaojuan; Jiang, Zhimao; Chen, Jing; Li, Yingrui; Wang, Jian; Wang, Jun; Cai, Zhiming; Gui, Yaoting

2016-01-01

Differential methylation of the homologous chromosomes, a well-known mechanism leading to genomic imprinting and X-chromosome inactivation, is widely reported at the non-imprinted regions on autosomes. To evaluate the transgenerational DNA methylation patterns in human, we analyzed the DNA methylomes of somatic and germ cells in a four-generation family. We found that allelic asymmetry of DNA methylation was pervasive at the non-imprinted loci and was likely regulated by cis-acting genetic variants. We also observed that the allelic methylation patterns for the vast majority of the cis-regulated loci were shared between the somatic and germ cells from the same individual. These results demonstrated the interaction between genetic and epigenetic variations and suggested the possibility of widespread sequence-dependent transmission of DNA methylation during spermatogenesis. PMID:26758766
Multiplexed Sequence Encoding: A Framework for DNA Communication

PubMed Central

Zakeri, Bijan; Carr, Peter A.; Lu, Timothy K.

2016-01-01

Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication—data encoding, data transfer & data extraction—and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system—Multiplexed Sequence Encoding (MuSE)—that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA. PMID:27050646
Ancient genomics

PubMed Central

Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic

2015-01-01

The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338
Artificial Intelligence, DNA Mimicry, and Human Health.

PubMed

Stefano, George B; Kream, Richard M

2017-08-14

The molecular evolution of genomic DNA across diverse plant and animal phyla involved dynamic registrations of sequence modifications to maintain existential homeostasis to increasingly complex patterns of environmental stressors. As an essential corollary, driver effects of positive evolutionary pressure are hypothesized to effect concerted modifications of genomic DNA sequences to meet expanded platforms of regulatory controls for successful implementation of advanced physiological requirements. It is also clearly apparent that preservation of updated registries of advantageous modifications of genomic DNA sequences requires coordinate expansion of convergent cellular proofreading/error correction mechanisms that are encoded by reciprocally modified genomic DNA. Computational expansion of operationally defined DNA memory extends to coordinate modification of coding and previously under-emphasized noncoding regions that now appear to represent essential reservoirs of untapped genetic information amenable to evolutionary driven recruitment into the realm of biologically active domains. Additionally, expansion of DNA memory potential via chemical modification and activation of noncoding sequences is targeted to vertical augmentation and integration of an expanded cadre of transcriptional and epigenetic regulatory factors affecting linear coding of protein amino acid sequences within open reading frames.
Ancient DNA studies: new perspectives on old samples

PubMed Central

2012-01-01

In spite of past controversies, the field of ancient DNA is now a reliable research area due to recent methodological improvements. A series of recent large-scale studies have revealed the true potential of ancient DNA samples to study the processes of evolution and to test models and assumptions commonly used to reconstruct patterns of evolution and to analyze population genetics and palaeoecological changes. Recent advances in DNA technologies, such as next-generation sequencing, make it possible to recover DNA information from archaeological and paleontological remains, allowing us to go back in time and study the genetic relationships between extinct organisms and their contemporary relatives. With the next-generation sequencing methodologies, DNA sequences can be retrieved even from samples (for example human remains) for which the technical pitfalls of classical methodologies required stringent criteria to guarantee the reliability of the results. In this paper, we review the methodologies applied to ancient DNA analysis and the perspectives that next-generation sequencing applications provide in this field. PMID:22697611

Detection of Cytosine methylation in ancient DNA from five Native American populations using bisulfite sequencing.

PubMed

Smith, Rick W A; Monroe, Cara; Bolnick, Deborah A

2015-01-01

While cytosine methylation has been widely studied in extant populations, relatively few studies have analyzed methylation in ancient DNA. Most existing studies of epigenetic marks in ancient DNA have inferred patterns of methylation in highly degraded samples using post-mortem damage to cytosines as a proxy for cytosine methylation levels. However, this approach limits the inference of methylation compared with direct bisulfite sequencing, the current gold standard for analyzing cytosine methylation at single nucleotide resolution. In this study, we used direct bisulfite sequencing to assess cytosine methylation in ancient DNA from the skeletal remains of 30 Native Americans ranging in age from approximately 230 to 4500 years before present. Unmethylated cytosines were converted to uracils by treatment with sodium bisulfite, bisulfite products of a CpG-rich retrotransposon were pyrosequenced, and C-to-T ratios were quantified for a single CpG position. We found that cytosine methylation is readily recoverable from most samples, given adequate preservation of endogenous nuclear DNA. In addition, our results indicate that the precision of cytosine methylation estimates is inversely correlated with aDNA preservation, such that samples of low DNA concentration show higher variability in measures of percent methylation than samples of high DNA concentration. In particular, samples in this study with a DNA concentration above 0.015 ng/μL generated the most consistent measures of cytosine methylation. This study presents evidence of cytosine methylation in a large collection of ancient human remains, and indicates that it is possible to analyze epigenetic patterns in ancient populations using direct bisulfite sequencing approaches.
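The quantification step in this abstract rests on simple arithmetic: after bisulfite treatment an unmethylated cytosine reads as T, while a methylated cytosine still reads as C. The Python sketch below uses the common convention of percent methylation = C / (C + T) at a single CpG position. Whether the study computed it in exactly this form is an assumption, and the function name and input values are hypothetical.

```python
def percent_methylation(c_signal, t_signal):
    """Percent methylation at one CpG from C and T signal after bisulfite conversion.

    Assumption: methylated cytosines read as C and unmethylated ones as T,
    so the methylated fraction is C / (C + T).
    """
    total = c_signal + t_signal
    if total <= 0:
        raise ValueError("no signal at this position")
    return 100.0 * c_signal / total


if __name__ == "__main__":
    # Hypothetical peak heights, not data from the study.
    print(percent_methylation(30, 10))
```

With low-concentration samples the C and T signals are small, so a change of a few units moves the percentage substantially, which is consistent with the higher variability the abstract reports for poorly preserved samples.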
Storage and utilization of HLA genomic data--new approaches to HLA typing.

PubMed

Helmberg, W

2000-01-01

Currently available DNA-based HLA typing assays can provide detailed information about sequence motifs of a tested sample. It is still a common practice, however, for information acquired by high-resolution sequence specific oligonucleotide probe (SSOP) typing or sequence specific priming (SSP) to be presented in a low-resolution serological format. Unfortunately, this representation can lead to significant loss of useful data in many cases. An alternative to assigning allele equivalents to such DNA typing results is simply to store the observed typing pattern and utilize the information with the help of Virtual DNA Analysis (VDA). Interpretation of the stored typing patterns can then be updated based on newly defined alleles, assuming the sequence motifs detected by the typing reagents are known. Rather than updating reagent specificities in individual laboratories, such updates should be performed in a central, publicly available sequence database. By referring to this database, HLA genomic data can then be stored and transferred between laboratories without loss of information. The 13th International Histocompatibility Workshop offers an ideal opportunity to begin building this common database for the entire human MHC.
Immune-Related Transcriptome of Coptotermes formosanus Shiraki Workers: The Defense Mechanism

PubMed Central

Hussain, Abid; Li, Yi-Feng; Cheng, Yu; Liu, Yang; Chen, Chuan-Cheng; Wen, Shuo-Yang

2013-01-01

Formosan subterranean termites, Coptotermes formosanus Shiraki, live socially in microbial-rich habitats. To understand the molecular mechanism by which termites combat pathogenic microbes, a full-length normalized cDNA library and four Suppression Subtractive Hybridization (SSH) libraries were constructed from termite workers infected with entomopathogenic fungi (Metarhizium anisopliae and Beauveria bassiana), Gram-positive Bacillus thuringiensis and Gram-negative Escherichia coli, and the libraries were analyzed. From the high quality normalized cDNA library, 439 immune-related sequences were identified. These sequences were categorized as pattern recognition receptors (47 sequences), signal modulators (52 sequences), signal transducers (137 sequences), effectors (39 sequences) and others (164 sequences). From the SSH libraries, 27, 17, 22 and 15 immune-related genes were identified from each SSH library treated with M. anisopliae, B. bassiana, B. thuringiensis and E. coli, respectively. When the normalized cDNA library was compared with the SSH libraries, 37 immune-related clusters were found in common; 56 clusters were identified in the SSH libraries, and 259 were identified in the normalized cDNA library. The immune-related gene expression pattern was further investigated using quantitative real time PCR (qPCR). Important immune-related genes were characterized, and their potential functions were discussed based on the integrated analysis of the results. We suggest that normalized cDNA and SSH libraries enable us to discover functional genes in the transcriptome. The results remarkably expand our knowledge about immune-inducible genes in C. formosanus Shiraki and enable the future development of novel control strategies for the management of Formosan subterranean termites. PMID:23874972
Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

PubMed

Bergman, C M; Kreitman, M

2001-08-01

Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
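The notion of an ungapped conserved block can be illustrated with a short Python sketch that scans a pairwise alignment and records the length of every run of identical, gap-free columns. This is a simplification under stated assumptions: the study's actual block definition (alignment method, minimum length, number of species) is not given in the abstract, and `conserved_blocks`, its `min_len` parameter and the toy alignment are hypothetical.

```python
from statistics import median


def conserved_blocks(aln_a, aln_b, min_len=1):
    """Lengths of ungapped runs of identical columns in a pairwise alignment.

    Assumption: both strings come from the same alignment, with '-' as the gap
    character; a mismatch or a gap in either sequence ends the current block.
    """
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    lengths, run = [], 0
    for a, b in zip(aln_a.upper(), aln_b.upper()):
        if a == b and a != "-":
            run += 1
        else:
            if run >= min_len:
                lengths.append(run)
            run = 0
    if run >= min_len:
        lengths.append(run)
    return lengths


if __name__ == "__main__":
    blocks = conserved_blocks("ACGT-ACGTA", "ACGA-ACG-A")
    print(blocks, median(blocks))
```

The median of the returned list corresponds to the kind of summary statistic the abstract reports as a median block length.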
Effect of ionic strength and cationic DNA affinity binders on the DNA sequence selective alkylation of guanine N7-positions by nitrogen mustards

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hartley, J.A.; Forrow, S.M.; Souhami, R.L.

Large variations in alkylation intensities exist among guanines in a DNA sequence following treatment with chemotherapeutic alkylating agents such as nitrogen mustards, and the substituent attached to the reactive group can impose a distinct sequence preference for reaction. In order to understand further the structural and electrostatic factors which determine the sequence selectivity of alkylation reactions, the effect of increased ionic strength, the intercalator ethidium bromide, AT-specific minor groove binders distamycin A and netropsin, and the polyamine spermine on guanine N7-alkylation by L-phenylalanine mustard (L-Pam), uracil mustard (UM), and quinacrine mustard (QM) was investigated with a modification of the guanine-specific chemical cleavage technique for DNA sequencing. The result differed with both the nitrogen mustard and the cationic agent used. The effect, which resulted in both enhancement and suppression of alkylation sites, was most striking in the case of netropsin and distamycin A, which differed from each other. DNA footprinting indicated that selective binding to AT sequences in the minor groove of DNA can have long-range effects on the alkylation pattern of DNA in the major groove.
Karyotype Analysis of Four Vicia Species using In Situ Hybridization with Repetitive Sequences

PubMed Central

NAVRÁTILOVÁ, ALICE; NEUMANN, PAVEL; MACAS, JIŘÍ

2003-01-01

Mitotic chromosomes of four Vicia species (V. sativa, V. grandiflora, V. pannonica and V. narbonensis) were subjected to in situ hybridization with probes derived from conserved plant repetitive DNA sequences (18S–25S and 5S rDNA, telomeres) and genus‐specific satellite repeats (VicTR‐A and VicTR‐B). Numbers and positions of hybridization signals provided cytogenetic landmarks suitable for unambiguous identification of all chromosomes, and establishment of the karyotypes. The VicTR‐A and ‐B sequences, in particular, produced highly informative banding patterns that alone were sufficient for discrimination of all chromosomes. However, these patterns were not conserved among species and thus could not be employed for identification of homologous chromosomes. This fact, together with observed variations in positions and numbers of rDNA loci, suggests considerable divergence between karyotypes of the species studied. PMID:12770847
New families of site-specific repetitive DNA sequences that comprise constitutive heterochromatin of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia).

PubMed

Yamada, Kazuhiko; Kamimura, Eikichi; Kondo, Mariko; Tsuchiya, Kimiyuki; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2006-02-01

We molecularly cloned new families of site-specific repetitive DNA sequences from BglII- and EcoRI-digested genomic DNA of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia) and characterized them by chromosome in situ hybridization and filter hybridization. They were classified into six different types of repetitive DNA sequence families according to chromosomal distribution and genome organization. The hybridization patterns of the sequences were consistent with the distribution of C-positive bands and/or Hoechst-stained heterochromatin. The centromeric major satellite DNA and sex chromosome-specific and telomeric region-specific repetitive sequences were conserved in the same genus (Mesocricetus) but divergent in different genera. The chromosome-2-specific sequence was conserved in two genera, Mesocricetus and Cricetulus, and a low copy number of repetitive sequences on the heterochromatic chromosome arms were conserved in the subfamily Cricetinae but not in the subfamily Calomyscinae. By contrast, the other type of repetitive sequences on the heterochromatic chromosome arms, which had sequence similarities to a LINE sequence of rodents, was conserved through the three subfamilies, Cricetinae, Calomyscinae and Murinae. The nucleotide divergence of the repetitive sequences of heterochromatin was well correlated with the phylogenetic relationships of the Cricetinae species, and each sequence has been independently amplified and diverged in the same genome.
A streamlined method for analysing genome-wide DNA methylation patterns from low amounts of FFPE DNA.

PubMed

Ludgate, Jackie L; Wright, James; Stockwell, Peter A; Morison, Ian M; Eccles, Michael R; Chatterjee, Aniruddha

2017-08-31

Formalin fixed paraffin embedded (FFPE) tumor samples are a major source of DNA from patients in cancer research. However, FFPE is a challenging material to work with due to macromolecular fragmentation and nucleic acid crosslinking. FFPE tissue poses particular challenges for methylation analysis and for preparing sequencing-based libraries relying on bisulfite conversion. Successful bisulfite conversion is a key requirement for sequencing-based methylation analysis. Here we describe a complete and streamlined workflow for preparing next generation sequencing libraries for methylation analysis from FFPE tissues. This includes counting cells from FFPE blocks and extracting DNA from FFPE slides, testing bisulfite conversion efficiency with a polymerase chain reaction (PCR) based test, preparing reduced representation bisulfite sequencing libraries and massively parallel sequencing. The main features and advantages of this protocol are: (i) an optimized method for extracting good quality DNA from FFPE tissues; (ii) an efficient bisulfite conversion and next generation sequencing library preparation protocol that uses 50 ng DNA from FFPE tissue; and (iii) incorporation of a PCR-based test to assess bisulfite conversion efficiency prior to sequencing. We provide a complete workflow and an integrated protocol for performing DNA methylation analysis at the genome-scale and we believe this will facilitate clinical epigenetic research that involves the use of FFPE tissue.
A DNA microarray for identification of selected Korean birds based on mitochondrial cytochrome c oxidase I gene sequences.

PubMed

Chung, In-Hyuk; Yoo, Hye Sook; Eah, Jae-Yong; Yoon, Hyun-Kyu; Jung, Jin-Wook; Hwang, Seung Yong; Kim, Chang-Bae

2010-10-01

DNA barcoding with the gene encoding cytochrome c oxidase I (COI) in the mitochondrial genome has been proposed as a standard marker to identify and discover animal species. Some migratory wild birds are suspected of transmitting avian influenza and pose a threat to aircraft safety because of bird strikes. We have previously reported the COI gene sequences of 92 Korean bird species. In the present study, we developed a DNA microarray to identify 17 selected bird species on the basis of nucleotide diversity. We designed and synthesized 19 specific oligonucleotide probes; these probes were arrayed on a silylated glass slide. The length of the probes was 19-24 bps. The COI sequences amplified from the tissues of the selected birds were labeled with a fluorescent probe for microarray hybridization, and unique hybridization patterns were detected for each selected species. These patterns may be considered diagnostic patterns for species identification. This microarray system will provide a sensitive and a high-throughput method for identification of Korean birds.
mtDNA sequence diversity in Africa.

PubMed Central

Watson, E.; Bauer, K.; Aman, R.; Weiss, G.; von Haeseler, A.; Pääbo, S.

1996-01-01

mtDNA sequences were determined from 241 individuals from nine ethnic groups in Africa. When they were compared with published data from other groups, it was found that the !Kung, Mbuti, and Biaka show on the order of 10 times more sequence differences between the three groups, as well as between those and the other groups (the Fulbe, Hausa, Tuareg, Songhai, Kanuri, Yoruba, Mandenka, Somali, Tukana, and Kikuyu), than these other groups do between one another. Furthermore, the pairwise sequence distributions, patterns of coalescence events, and numbers of variable positions relative to the mean sequence difference indicate that the former three groups have been of constant size over time, whereas the latter have expanded in size. We suggest that this reflects subsistence patterns in that the populations that have expanded in size are food producers whereas those that have not are hunters and gatherers. PMID:8755932
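A pairwise sequence distribution of the kind mentioned in this abstract is a tally of how many pairs of individuals differ at 0, 1, 2, ... positions. The Python sketch below shows that tally for equal-length, already aligned sequences. It is illustrative only: the function name and toy sequences are hypothetical, and the study's own handling of gaps, ambiguous bases and sample weighting is not described in the abstract.

```python
from collections import Counter
from itertools import combinations


def pairwise_differences(seqs):
    """Count how many sequence pairs differ at each number of positions.

    Assumption: all sequences are aligned and of equal length.
    """
    counts = Counter()
    for a, b in combinations(seqs, 2):
        if len(a) != len(b):
            raise ValueError("sequences must be aligned to equal length")
        counts[sum(x != y for x, y in zip(a, b))] += 1
    return counts


if __name__ == "__main__":
    # Toy sequences, not data from the study.
    print(pairwise_differences(["ACGT", "ACGA", "TCGA"]))
```

The shape of this histogram is what analyses of this kind interpret: the abstract contrasts groups inferred to have been of constant size with groups inferred to have expanded.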
Effects of 16S rDNA sampling on estimates of the number of endosymbiont lineages in sucking lice

PubMed Central

Burleigh, J. Gordon; Light, Jessica E.; Reed, David L.

2016-01-01

Phylogenetic trees can reveal the origins of endosymbiotic lineages of bacteria and detect patterns of co-evolution with their hosts. Although taxon sampling can greatly affect phylogenetic and co-evolutionary inference, most hypotheses of endosymbiont relationships are based on few available bacterial sequences. Here we examined how different sampling strategies of Gammaproteobacteria sequences affect estimates of the number of endosymbiont lineages in parasitic sucking lice (Insecta: Phthiraptera: Anoplura). We estimated the number of louse endosymbiont lineages using both newly obtained and previously sequenced 16S rDNA bacterial sequences and more than 42,000 16S rDNA sequences from other Gammaproteobacteria. We also performed parametric and nonparametric bootstrapping experiments to examine the effects of phylogenetic error and uncertainty on these estimates. Sampling of 16S rDNA sequences affects the estimates of endosymbiont diversity in sucking lice until we reach a threshold of genetic diversity, the size of which depends on the sampling strategy. Sampling by maximizing the diversity of 16S rDNA sequences is more efficient than randomly sampling available 16S rDNA sequences. Although simulation results validate estimates of multiple endosymbiont lineages in sucking lice, the bootstrap results suggest that the precise number of endosymbiont origins is still uncertain. PMID:27547523
Best practices for mapping replication origins in eukaryotic chromosomes.

PubMed

Besnard, Emilie; Desprat, Romain; Ryan, Michael; Kahli, Malik; Aladjem, Mirit I; Lemaitre, Jean-Marc

2014-09-02

Understanding the regulatory principles ensuring complete DNA replication in each cell division is critical for deciphering the mechanisms that maintain genomic stability. Recent advances in genome sequencing technology facilitated complete mapping of DNA replication sites and helped move the field from observing replication patterns at a handful of single loci to analyzing replication patterns genome-wide. These advances address issues, such as the relationship between replication initiation events, transcription, and chromatin modifications, and identify potential replication origin consensus sequences. This unit summarizes the technological and fundamental aspects of replication profiling and briefly discusses novel insights emerging from mining large datasets, published in the last 3 years, and also describes DNA replication dynamics on a whole-genome scale. Copyright © 2014 John Wiley & Sons, Inc.
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

PubMed

Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

2014-11-01

As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. 
To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
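The three figures of merit quoted in this abstract (accuracy, F-measure, and MCC) can all be derived from a binary confusion matrix. The following is a minimal sketch of the standard definitions, not the authors' evaluation code; the function name and the counts in the usage note are hypothetical and are not taken from the paper.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, F-measure, MCC) for a binary confusion matrix.

    tp/fp/tn/fn are counts of true positives, false positives,
    true negatives and false negatives (hypothetical inputs).
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure: harmonic mean of precision and recall.
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    # Matthews correlation coefficient; defined as 0 when a margin is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, f_measure, mcc
```

For example, calling `binary_metrics(40, 10, 40, 10)` on made-up counts gives an accuracy and F-measure of about 0.8 but an MCC of only 0.6, which illustrates why an MCC near 0.34 can accompany an accuracy near 67% on a roughly balanced dataset.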
Amplification of the major satellite DNA family (FA-SAT) in a cat fibrosarcoma might be related to chromosomal instability.

PubMed

Santos, Sara; Chaves, Raquel; Adega, Filomena; Bastos, Estela; Guedes-Pinto, Henrique

2006-01-01

Most mammalian chromosomes have satellite DNA sequences located at or near the centromeres, organized in arrays of variable size and higher order structure. The implications of these specific repetitive DNA sequences and their organization for centromere function are still quite cloudy. In contrast to most mammalian species, the domestic cat seems to have the major satellite DNA family (FA-SAT) localized primarily at the telomeres and secondarily at the centromeres of the chromosomes. In the present work, we analyzed chromosome preparations from a fibrosarcoma, in comparison with nontumor cells (epithelial tissue) from the same individual, by in situ hybridization of the FA-SAT cat satellite DNA family. This repetitive sequence was found to be amplified in the cat tumor chromosomes analyzed. The amplification of these satellite DNA sequences in the cat chromosomes with variable number and appearance (marker chromosomes) is discussed and might be related to mitotic instability, which could explain the exhibition of complex patterns of chromosome aberrations detected in the fibrosarcoma analyzed.
SHARAKU: an algorithm for aligning and clustering read mapping profiles of deep sequencing in non-coding RNA processing.

PubMed

Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi

2016-06-15

Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After sequencing, the short reads are mapped to a reference genome, and specific mapping patterns, called read mapping profiles, can be detected that are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods in correctly clustering the read mapping profiles with respect to 5'-end processing and 3'-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA004502.
Contact: yasu@bio.keio.ac.jp. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Shotgun Bisulfite Sequencing of the Betula platyphylla Genome Reveals the Tree’s DNA Methylation Patterning

PubMed Central

Su, Chang; Wang, Chao; He, Lin; Yang, Chuanping; Wang, Yucheng

2014-01-01

DNA methylation plays a critical role in the regulation of gene expression. Most studies of DNA methylation have been performed in herbaceous plants, and little is known about the methylation patterns in tree genomes. In the present study, we generated a map of methylated cytosines at single base pair resolution for Betula platyphylla (white birch) by bisulfite sequencing combined with transcriptomics to analyze DNA methylation and its effects on gene expression. We obtained a detailed view of the function of DNA methylation sequence composition and distribution in the genome of B. platyphylla. There are 34,460 genes in the whole genome of birch, and 31,297 genes are methylated. Conservatively, we estimated that 14.29% of genomic cytosines are methylcytosines in birch. Among the methylation sites, the CHH context accounts for 48.86%, and is the largest proportion. Combined transcriptome and methylation analysis showed that the genes with moderate methylation levels had higher expression levels than genes with high and low methylation. In addition, methylated genes are highly enriched for the GO subcategories of binding activities, catalytic activities, cellular processes, response to stimulus and cell death, suggesting that methylation mediates these pathways in birch trees. PMID:25514241
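The CHH context mentioned in this abstract is one of three sequence contexts conventionally used for plant cytosine methylation: CG, CHG, and CHH, where H stands for A, C, or T. The following is a minimal sketch of how a cytosine can be assigned to a context from the two bases that follow it. It assumes a forward-strand sequence only, and the function names are hypothetical; it is not the pipeline used in the study.

```python
from collections import Counter


def cytosine_context(seq, i):
    """Classify the cytosine at index i as 'CG', 'CHG' or 'CHH'.

    Returns None when the context cannot be determined (sequence end
    or an ambiguous base such as N). Forward strand only.
    """
    seq = seq.upper()
    if seq[i] != "C":
        raise ValueError("position is not a cytosine")
    nxt = seq[i + 1:i + 3]
    if len(nxt) >= 1 and nxt[0] == "G":
        return "CG"
    if len(nxt) < 2 or nxt[0] not in "ACT" or nxt[1] not in "ACGT":
        return None
    return "CHG" if nxt[1] == "G" else "CHH"


def count_contexts(seq):
    """Count forward-strand cytosines per context, skipping undetermined ones."""
    counts = Counter()
    for i, base in enumerate(seq.upper()):
        if base == "C":
            context = cytosine_context(seq, i)
            if context is not None:
                counts[context] += 1
    return counts
```

A full analysis would also scan the reverse complement, since cytosines on the opposite strand appear as G on the forward strand.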
DNA-imprinted polymer nanoparticles with monodispersity and prescribed DNA-strand patterns

NASA Astrophysics Data System (ADS)

Trinh, Tuan; Liao, Chenyi; Toader, Violeta; Barłóg, Maciej; Bazzi, Hassan S.; Li, Jianing; Sleiman, Hanadi F.

2018-02-01

As colloidal self-assembly increasingly approaches the complexity of natural systems, an ongoing challenge is to generate non-centrosymmetric structures. For example, patchy, Janus or living crystallization particles have significantly advanced the area of polymer assembly. It has remained difficult, however, to devise polymer particles that associate in a directional manner, with controlled valency and recognition motifs. Here, we present a method to transfer DNA patterns from a DNA cage to a polymeric nanoparticle encapsulated inside the cage in three dimensions. The resulting DNA-imprinted particles (DIPs), which are 'moulded' on the inside of the DNA cage, consist of a monodisperse crosslinked polymer core with a predetermined pattern of different DNA strands covalently 'printed' on their exterior, and further assemble with programmability and directionality. The number, orientation and sequence of DNA strands grafted onto the polymeric core can be controlled during the process, and the strands are addressable independently of each other.
Efficient Mining of Interesting Patterns in Large Biological Sequences

PubMed Central

Rashid, Md. Mamunur; Karim, Md. Rezaul; Jeong, Byeong-Soo

2012-01-01

Pattern discovery in biological sequences (e.g., DNA sequences) is one of the most challenging tasks in computational biology and bioinformatics. So far, in most approaches, the number of occurrences is a major measure of determining whether a pattern is interesting or not. In computational biology, however, a pattern that is not frequent may still be considered very informative if its actual support frequency exceeds the prior expectation by a large margin. In this paper, we propose a new interesting measure that can provide meaningful biological information. We also propose an efficient index-based method for mining such interesting patterns. Experimental results show that our approach can find interesting patterns within an acceptable computation time. PMID:23105928
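The idea that an infrequent pattern can still be informative when its observed support exceeds its prior expectation can be illustrated with a simple observed-versus-expected comparison. The sketch below is an assumption-laden illustration, not the measure proposed in the paper: it takes the expectation from a zero-order model in which bases occur independently at their observed frequencies, and the function name is hypothetical.

```python
from collections import Counter


def observed_expected(seq, pattern):
    """Return (observed, expected) counts of pattern in seq.

    Observed counts include overlapping occurrences. The expectation
    assumes independent bases drawn at the sequence's own base
    frequencies (a simplifying assumption for illustration).
    """
    seq, pattern = seq.upper(), pattern.upper()
    n, k = len(seq), len(pattern)
    windows = n - k + 1
    if k == 0 or windows <= 0:
        return 0, 0.0
    observed = sum(1 for i in range(windows) if seq[i:i + k] == pattern)
    base_counts = Counter(seq)
    probability = 1.0
    for base in pattern:
        probability *= base_counts[base] / n
    return observed, probability * windows
```

The ratio of the two returned values gives a crude interestingness score: values well above 1 mark patterns that occur more often than base composition alone would predict.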
Efficient mining of interesting patterns in large biological sequences.

PubMed

Rashid, Md Mamunur; Karim, Md Rezaul; Jeong, Byeong-Soo; Choi, Ho-Jin

2012-03-01

Pattern discovery in biological sequences (e.g., DNA sequences) is one of the most challenging tasks in computational biology and bioinformatics. So far, in most approaches, the number of occurrences is a major measure of determining whether a pattern is interesting or not. In computational biology, however, a pattern that is not frequent may still be considered very informative if its actual support frequency exceeds the prior expectation by a large margin. In this paper, we propose a new interesting measure that can provide meaningful biological information. We also propose an efficient index-based method for mining such interesting patterns. Experimental results show that our approach can find interesting patterns within an acceptable computation time.
Genetic variation patterns of American chestnut populations at EST-SSRs

Treesearch

Oliver Gailing; C. Dana Nelson

2017-01-01

The objective of this study is to analyze patterns of genetic variation at genic expressed sequence tag - simple sequence repeats (EST-SSRs) and at chloroplast DNA markers in populations of American chestnut (Castanea dentata Borkh.) to assist in conservation and breeding efforts. Allelic diversity at EST-SSRs decreased significantly from southwest to northeast along...

Tissue-specific DNA methylation is conserved across human, mouse, and rat, and driven by primary sequence conservation.

PubMed

Zhou, Jia; Sears, Renee L; Xing, Xiaoyun; Zhang, Bo; Li, Daofeng; Rockweiler, Nicole B; Jang, Hyo Sik; Choudhary, Mayank N K; Lee, Hyung Joo; Lowdon, Rebecca F; Arand, Jason; Tabers, Brianne; Gu, C Charles; Cicero, Theodore J; Wang, Ting

2017-09-12

Uncovering mechanisms of epigenome evolution is an essential step towards understanding the evolution of different cellular phenotypes. While studies have confirmed DNA methylation as a conserved epigenetic mechanism in mammalian development, little is known about the conservation of tissue-specific genome-wide DNA methylation patterns. Using a comparative epigenomics approach, we identified and compared the tissue-specific DNA methylation patterns of rat against those of mouse and human across three shared tissue types. We confirmed that tissue-specific differentially methylated regions are strongly associated with tissue-specific regulatory elements. Comparisons between species revealed that at a minimum 11-37% of tissue-specific DNA methylation patterns are conserved, a phenomenon that we define as epigenetic conservation. Conserved DNA methylation is accompanied by conservation of other epigenetic marks including histone modifications. Although a significant amount of locus-specific methylation is epigenetically conserved, the majority of tissue-specific DNA methylation is not conserved across the species and tissue types that we investigated. Examination of the genetic underpinning of epigenetic conservation suggests that primary sequence conservation is a driving force behind epigenetic conservation. In contrast, evolutionary dynamics of tissue-specific DNA methylation are best explained by the maintenance or turnover of binding sites for important transcription factors. Our study extends the limited literature of comparative epigenomics and suggests a new paradigm for epigenetic conservation without genetic conservation through analysis of transcription factor binding sites.
Identification of apple cultivars on the basis of simple sequence repeat markers.

PubMed

Liu, G S; Zhang, Y G; Tao, R; Fang, J G; Dai, H Y

2014-09-12

DNA markers are useful tools that play an important role in plant cultivar identification. They are usually based on polymerase chain reaction (PCR) and include simple sequence repeats (SSRs), inter-simple sequence repeats, and random amplified polymorphic DNA. However, DNA markers have not been used effectively for the complete identification of plant cultivars because of the lack of known DNA fingerprints. Recently, a novel approach called the cultivar identification diagram (CID) strategy was developed to facilitate the use of DNA markers for separating individual plants. The CID was designed so that a polymorphic marker generated from each PCR directly allows cultivar samples to be separated at each step. Therefore, it can be used to identify cultivars and varieties easily with fewer primers. In this study, 60 apple cultivars, including a few main cultivars in fields and varieties from descendants (Fuji x Telamon), were examined. Of the 20 pairs of SSR primers screened, 8 pairs gave reproducible, polymorphic DNA amplification patterns. The banding patterns obtained from these 8 primers were used to construct a CID map. Each cultivar or variety in this study was distinguished from the others completely, indicating that this method can be used for efficient cultivar identification. The result contributes to studies on germplasm resources and the seedling industry in fruit trees.
A symmetry model for genetic coding via a wallpaper group composed of the traditional four bases and an imaginary base E: Towards category theory-like systematization of molecular/genetic biology

PubMed Central

2014-01-01

Background Previously, we suggested prototypal models that describe some clinical states based on group postulates. Here, we demonstrate a group/category theory-like model for molecular/genetic biology as an alternative application of our previous model. Specifically, we focus on deoxyribonucleic acid (DNA) base sequences. Results We construct a wallpaper pattern based on a five-letter cruciform motif with letters C, A, T, G, and E. Whereas the first four letters represent the standard DNA bases, the fifth is introduced for ease in formulating group operations that reproduce insertions and deletions of DNA base sequences. A basic group Z5 = {r, u, d, l, n} of operations is defined for the wallpaper pattern, with which a sequence of points can be generated corresponding to changes of a base in a DNA sequence by following the orbit of a point of the pattern under operations in group Z5. Other manipulations of DNA sequence can be treated using a vector-like notation ‘Dj’ corresponding to a DNA sequence but based on the five-letter base set; also, ‘Dj’s are expressed graphically. Insertions and deletions of a series of letters ‘E’ are admitted to assist in describing DNA recombination. Likewise, a vector-like notation Rj can be constructed for sequences of ribonucleic acid (RNA). The wallpaper group B = {Z5×∞, ●} (an ∞-fold Cartesian product of Z5) acts on Dj (or Rj) yielding changes to Dj (or Rj) denoted by ‘Dj◦B(j→k) = Dk’ (or ‘Rj◦B(j→k) = Rk’). Based on the operations of this group, two types of groups—a modulo 5 linear group and a rotational group over the Gaussian plane, acting on the five bases—are linked as parts of the wallpaper group for broader applications. As a result, changes, insertions/deletions and DNA (RNA) recombination (partial/total conversion) are described. As an exploratory study, a notation for the canonical “central dogma” via a category theory-like way is presented for future developments. 
Conclusions Despite the large incompleteness of our methodology, there is fertile ground to consider a symmetry model for genetic coding based on our specific wallpaper group. A more integrated formulation containing “central dogma” for future molecular/genetic biology remains to be explored. PMID:24885369
Generate Optimized Genetic Rhythm for Enzyme Expression in Non-native systems

DOE Office of Scientific and Technical Information (OSTI.GOV)

2016-11-03

Most amino acids are represented by more than one codon, resulting in redundancy in the genetic code. Silent codon substitutions that do not alter the amino acid sequence still have an effect on protein expression. We have developed an algorithm, GoGREEN, to enhance the expression of foreign proteins in a host organism. GoGREEN selects codons according to frequency patterns seen in the gene of interest using the codon usage table from the host organism. GoGREEN is also designed to accommodate gaps in the sequence. This software takes as input (1) the aligned protein sequences for genes the user wishes to express, (2) the codon usage table for the host organism, and (3) the DNA sequence for the target protein found in the host organism. The program will select codons based on codon usage patterns for the target DNA sequence. The program will also select codons for “gaps” found in the aligned protein sequences using the codon usage table from the host organism.
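To make the role of a host codon usage table concrete, the sketch below back-translates a protein by picking, for each residue, the codon the host uses most often. This is a deliberately simpler technique than the one the abstract describes: it does not reproduce GoGREEN's frequency-pattern matching or its gap handling. The table `EXAMPLE_USAGE` holds made-up illustrative frequencies for a hypothetical host, and all names are assumptions.

```python
# Hypothetical codon usage table: amino acid -> {codon: relative frequency}.
# The frequencies are illustrative only and do not describe a real organism.
EXAMPLE_USAGE = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.50, "TTA": 0.14, "TTG": 0.13,
          "CTT": 0.11, "CTC": 0.08, "CTA": 0.04},
    "*": {"TAA": 0.61, "TGA": 0.30, "TAG": 0.09},
}


def back_translate(protein, usage):
    """Return a DNA string using the most frequent host codon per residue.

    Raises KeyError for residues missing from the usage table.
    """
    codons = []
    for residue in protein.upper():
        table = usage[residue]
        # Pick the codon with the highest relative frequency in the host.
        codons.append(max(table, key=table.get))
    return "".join(codons)
```

With the illustrative table above, `back_translate("MKL*", EXAMPLE_USAGE)` yields `ATGAAACTGTAA`. Always choosing the single most frequent codon ignores the positional frequency patterns that the abstract says GoGREEN takes from the gene of interest.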
Chromosome Evolution in Connection with Repetitive Sequences and Epigenetics in Plants.

PubMed

Li, Shu-Fen; Su, Ting; Cheng, Guang-Qian; Wang, Bing-Xiao; Li, Xu; Deng, Chuan-Liang; Gao, Wu-Jun

2017-10-24

Chromosome evolution is a fundamental aspect of evolutionary biology. The evolution of chromosome size, structure and shape, number, and the change in DNA composition suggest the high plasticity of nuclear genomes at the chromosomal level. Repetitive DNA sequences, which represent a conspicuous fraction of every eukaryotic genome, particularly in plants, are found to be tightly linked with plant chromosome evolution. Different classes of repetitive sequences have distinct distribution patterns on the chromosomes. Mounting evidence shows that repetitive sequences may play multiple generative roles in shaping the chromosome karyotypes in plants. Furthermore, recent development in our understanding of the repetitive sequences and plant chromosome evolution has elucidated the involvement of a spectrum of epigenetic modification. In this review, we focused on the recent evidence relating to the distribution pattern of repetitive sequences in plant chromosomes and highlighted their potential relevance to chromosome evolution in plants. We also discussed the possible connections between evolution and epigenetic alterations in chromosome structure and repatterning, such as heterochromatin formation, centromere function, and epigenetic-associated transposable element inactivation.
Cloning and characterization of transferrin cDNA and rapid detection of transferrin gene polymorphism in rainbow trout (Oncorhynchus mykiss).

PubMed

Tange, N; Jong-Young, L; Mikawa, N; Hirono, I; Aoki, T

1997-12-01

A cDNA clone of rainbow trout (Oncorhynchus mykiss) transferrin was obtained from a liver cDNA library. The 2537-bp cDNA sequence contained an open reading frame encoding 691 amino acids and the 5' and 3' noncoding regions. The amino acid sequences at the iron-binding sites and the two N-linked glycosylation sites, and the cysteine residues were consistent with known, conserved vertebrate transferrin cDNA sequences. Single N-linked glycosylation sites existed on the N- and C-lobe. The deduced amino acid sequence of the rainbow trout transferrin cDNA had 92.9% identities with transferrin of coho salmon (Oncorhynchus kisutch); 85%, Atlantic salmon (Salmo salar); 67.3%, medaka (Oryzias latipes); 61.3% Atlantic cod (Gadus morhua); and 59.7%, Japanese flounder (Paralichthys olivaceus). The long and accurate polymerase chain reaction (LA-PCR) was used to amplify approximately 6.5 kb of the transferrin gene from rainbow trout genomic DNA. Restriction fragment length polymorphisms (RFLPs) of the LA-PCR products revealed three digestion patterns in 22 samples.
Alu repeated DNAs are differentially methylated in primate germ cells.

PubMed Central

Rubin, C M; VandeVoort, C A; Teplitz, R L; Schmid, C W

1994-01-01

A significant fraction of Alu repeats in human sperm DNA, previously found to be unmethylated, is nearly completely methylated in DNA from many somatic tissues. A similar fraction of unmethylated Alus is observed here in sperm DNA from rhesus monkey. However, Alus are almost completely methylated at the restriction sites tested in monkey follicular oocyte DNA. The Alu methylation patterns in mature male and female monkey germ cells are consistent with Alu methylation in human germ cell tumors. Alu sequences are hypomethylated in seminoma DNAs and more methylated in a human ovarian dysgerminoma. These results contrast with methylation patterns reported for germ cell single-copy, CpG island, satellite, and L1 sequences. The function of Alu repeats is not known, but differential methylation of Alu repeats in the male and female germ lines suggests that they may serve as markers for genomic imprinting or in maintaining differences in male and female meiosis. PMID:7800508
Directed nucleation assembly of DNA tile complexes for barcode-patterned lattices

NASA Astrophysics Data System (ADS)

Yan, Hao; Labean, Thomas H.; Feng, Liping; Reif, John H.

2003-07-01

The programmed self-assembly of patterned aperiodic molecular structures is a major challenge in nanotechnology and has numerous potential applications for nanofabrication of complex structures and useful devices. Here we report the construction of an aperiodic patterned DNA lattice (barcode lattice) by a self-assembly process of directed nucleation of DNA tiles around a scaffold DNA strand. The input DNA scaffold strand, constructed by ligation of shorter synthetic oligonucleotides, provides layers of the DNA lattice with barcode patterning information represented by the presence or absence of DNA hairpin loops protruding out of the lattice plane. Self-assembly of multiple DNA tiles around the scaffold strand was shown to result in a patterned lattice containing barcode information of 01101. We have also demonstrated the reprogramming of the system to another patterning. An inverted barcode pattern of 10010 was achieved by modifying the scaffold strands and one of the strands composing each tile. A ribbon lattice, consisting of repetitions of the barcode pattern with expected periodicity, was also constructed by the addition of sticky ends. The patterning of both classes of lattices was clearly observable via atomic force microscopy. These results represent a step toward implementation of a visual readout system capable of converting information encoded on a 1D DNA strand into a 2D form readable by advanced microscopic techniques. A functioning visual output method would not only increase the readout speed of DNA-based computers, but may also find use in other sequence identification techniques such as mutation or allele mapping.
Directed nucleation assembly of DNA tile complexes for barcode-patterned lattices.

PubMed

Yan, Hao; LaBean, Thomas H; Feng, Liping; Reif, John H

2003-07-08

The programmed self-assembly of patterned aperiodic molecular structures is a major challenge in nanotechnology and has numerous potential applications for nanofabrication of complex structures and useful devices. Here we report the construction of an aperiodic patterned DNA lattice (barcode lattice) by a self-assembly process of directed nucleation of DNA tiles around a scaffold DNA strand. The input DNA scaffold strand, constructed by ligation of shorter synthetic oligonucleotides, provides layers of the DNA lattice with barcode patterning information represented by the presence or absence of DNA hairpin loops protruding out of the lattice plane. Self-assembly of multiple DNA tiles around the scaffold strand was shown to result in a patterned lattice containing barcode information of 01101. We have also demonstrated the reprogramming of the system to another patterning. An inverted barcode pattern of 10010 was achieved by modifying the scaffold strands and one of the strands composing each tile. A ribbon lattice, consisting of repetitions of the barcode pattern with expected periodicity, was also constructed by the addition of sticky ends. The patterning of both classes of lattices was clearly observable via atomic force microscopy. These results represent a step toward implementation of a visual readout system capable of converting information encoded on a 1D DNA strand into a 2D form readable by advanced microscopic techniques. A functioning visual output method would not only increase the readout speed of DNA-based computers, but may also find use in other sequence identification techniques such as mutation or allele mapping.
Survey of protein–DNA interactions in Aspergillus oryzae on a genomic scale

PubMed Central

Wang, Chao; Lv, Yangyong; Wang, Bin; Yin, Chao; Lin, Ying; Pan, Li

2015-01-01

The genome-scale delineation of in vivo protein–DNA interactions is key to understanding genome function. Only ∼5% of transcription factors (TFs) in the Aspergillus genus have been identified using traditional methods. Although the Aspergillus oryzae genome contains >600 TFs, knowledge of the in vivo genome-wide TF-binding sites (TFBSs) in aspergilli remains limited because of the lack of high-quality antibodies. We investigated the landscape of in vivo protein–DNA interactions across the A. oryzae genome through coupling the DNase I digestion of intact nuclei with massively parallel sequencing and the analysis of cleavage patterns in protein–DNA interactions at single-nucleotide resolution. The resulting map identified overrepresented de novo TF-binding motifs from genomic footprints, and provided the detailed chromatin remodeling patterns and the distribution of digital footprints near transcription start sites. The TFBSs of 19 known Aspergillus TFs were also identified based on DNase I digestion data surrounding potential binding sites in conjunction with TF binding specificity information. We observed that the cleavage patterns of TFBSs were dependent on the orientation of TF motifs and independent of strand orientation, consistent with the DNA shape features of binding motifs with flanking sequences. PMID:25883143
Contrasting Patterns of rDNA Homogenization within the Zygosaccharomyces rouxii Species Complex

PubMed Central

Chand Dakal, Tikam; Giudici, Paolo; Solieri, Lisa

2016-01-01

Arrays of repetitive ribosomal DNA (rDNA) sequences are generally expected to evolve as a coherent family, where repeats within such a family are more similar to each other than to orthologs in related species. The continuous homogenization of repeats within individual genomes is a recombination process termed concerted evolution. Here, we investigated the extent and the direction of concerted evolution in 43 yeast strains of the Zygosaccharomyces rouxii species complex (Z. rouxii, Z. sapae, Z. mellis), by analyzing two portions of the 35S rDNA cistron, namely the D1/D2 domains at the 5’ end of the 26S rRNA gene and the segment including the internal transcribed spacers (ITS) 1 and 2 (ITS regions). We demonstrate that intra-genomic rDNA sequence variation is unusually frequent in this clade and that rDNA arrays in single genomes consist of an intermixing of Z. rouxii, Z. sapae and Z. mellis-like sequences, putatively evolved by reticulate evolutionary events that involved repeated hybridization between lineages. The levels and distribution of sequence polymorphisms vary across rDNA repeats in different individuals, reflecting four patterns of rDNA evolution: I) rDNA repeats that are homogeneous within a genome but are chimeras derived from two parental lineages via recombination: Z. rouxii in the ITS region and Z. sapae in the D1/D2 region; II) intra-genomic rDNA repeats that retain polymorphisms only in ITS regions; III) rDNA repeats that vary only in their D1/D2 domains; IV) heterogeneous rDNA arrays that have both polymorphic ITS and D1/D2 regions. We argue that an ongoing process of homogenization following allodiplodization or incomplete lineage sorting gave rise to divergent evolutionary trajectories in different strains, depending upon temporal, structural and functional constraints. We discuss the consequences of these findings for Zygosaccharomyces species delineation and, more in general, for yeast barcoding. PMID:27501051
Foundations for a syntactic pattern recognition system for genomic DNA sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Searles, D.B.

1993-03-01

The goal of the proposed work is the creation of a software system that will perform sophisticated pattern recognition and related functions at a level of abstraction and with expressive power beyond current general-purpose pattern-matching systems for biological sequences; and with a more uniform language, environment, and graphical user interface, and with greater flexibility, extensibility, embeddability, and ability to incorporate other algorithms, than current special-purpose analytic software.
Differential principal component analysis of ChIP-seq.

PubMed

Ji, Hongkai; Li, Xia; Wang, Qian-fei; Ning, Yang

2013-04-23

We propose differential principal component analysis (dPCA) for analyzing multiple ChIP-sequencing datasets to identify differential protein-DNA interactions between two biological conditions. dPCA integrates unsupervised pattern discovery, dimension reduction, and statistical inference into a single framework. It uses a small number of principal components to summarize concisely the major multiprotein synergistic differential patterns between the two conditions. For each pattern, it detects and prioritizes differential genomic loci by comparing the between-condition differences with the within-condition variation among replicate samples. dPCA provides a unique tool for efficiently analyzing large amounts of ChIP-sequencing data to study dynamic changes of gene regulation across different biological conditions. We demonstrate this approach through analyses of differential chromatin patterns at transcription factor binding sites and promoters as well as allele-specific protein-DNA interactions.
FA-SAT Is an Old Satellite DNA Frozen in Several Bilateria Genomes

PubMed Central

Chaves, Raquel; Ferreira, Daniela; Mendes-da-Silva, Ana; Meles, Susana; Adega, Filomena

2017-01-01

In recent years, a growing body of evidence has recognized the tandem repeat sequences, and specifically satellite DNA, as a functional class of sequences in the genomic “dark matter.” Using an original, complementary, and thus an eclectic experimental design, we show that the cat archetypal satellite DNA sequence, FA-SAT, is “frozen” conservatively in several Bilateria genomes. We found different genomic FA-SAT architectures, and the interspersion pattern was conserved. In Carnivora genomes, the FA-SAT-related sequences are also amplified, with the predominance of a specific FA-SAT variant, at the heterochromatic regions. We inspected the cat genome project to locate FA-SAT array flanking regions and revealed an intensive intermingling with transposable elements. Our results also show that FA-SAT-related sequences are transcribed and that the most abundant FA-SAT variant is not always the most transcribed. We thus conclude that the DNA sequences of FA-SAT and their transcripts are “frozen” in these genomes. Future work is needed to disclose any putative function that these sequences may play in these genomes. PMID:29608678
Epigenomics of Development in Populus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Strauss, Steve; Freitag, Michael; Mockler, Todd

2013-01-10

We conducted research to determine the role of epigenetic modifications during tree development using poplar (Populus trichocarpa), a model woody feedstock species. Using methylated DNA immunoprecipitation (MeDIP) or chromatin immunoprecipitation (ChIP), followed by high-throughput sequencing, we are analyzed DNA and histone methylation patterns in the P. trichocarpa genome in relation to four biological processes: bud dormancy and release, mature organ maintenance, in vitro organogenesis, and methylation suppression. Our project is now completed. We have 1) produced 22 transgenic events for a gene involved in DNA methylation suppression and studied its phenotypic consequences; 2) completed sequencing of methylated DNA from elevenmore » target tissues in wildtype P. trichocarpa; 3) updated our customized poplar genome browser using the open-source software tools (2.13) and (V2.2) of the P. trichocarpa genome; 4) produced summary data for genome methylation in P. trichocarpa, including distribution of methylation across chromosomes and in and around genes; 5) employed bioinformatic and statistical methods to analyze differences in methylation patterns among tissue types; and 6) used bisulfite sequencing of selected target genes to confirm bioinformatics and sequencing results, and gain a higher-resolution view of methylation at selected genes 7) compared methylation patterns to expression using available microarray data. 
Our main findings of biological significance are the identification of extensive regions of the genome that display developmental variation in DNA methylation; highly distinctive gene-associated methylation profiles in reproductive tissues, particularly male catkins; a strong whole genome/all tissue inverse association of methylation at gene bodies and promoters with gene expression; a lack of evidence that tissue specificity of gene expression is associated with gene methylation; and evidence that genome methylation is a significant impediment to tissue dedifferentiation and redifferentiation in vitro.
Germline transformation of the butterfly Bicyclus anynana.

PubMed

Marcus, Jeffrey M; Ramos, Diane M; Monteiro, Antónia

2004-08-07

Ecological and evolutionary theory has frequently been inspired by the diversity of colour patterns on the wings of butterflies. More recently, these varied patterns have also become model systems for studying the evolution of developmental mechanisms. A technique that will facilitate our understanding of butterfly colour-pattern development is germline transformation. Germline transformation permits functional tests of candidate gene products and of cis-regulatory regions, and provides a means of generating new colour-pattern mutants by insertional mutagenesis. We report the successful transformation of the African satyrid butterfly Bicyclus anynana with two different transposable element vectors, Hermes and piggyBac, each carrying EGFP coding sequences driven by the 3XP3 synthetic enhancer that drives gene expression in the eyes. Candidate lines identified by screening for EGFP in adult eyes were later confirmed by PCR amplification of a fragment of the EGFP coding sequence from genomic DNA. Flanking DNA surrounding the insertions was amplified by inverse PCR and sequenced. Transformation rates were 5% for piggyBac and 10.2% for Hermes. Ultimately, the new data generated by these techniques may permit an integrated understanding of the developmental genetics of colour-pattern formation and of the ecological and evolutionary processes in which these patterns play a role.
A DNA-based pattern classifier with in vitro learning and associative recall for genomic characterization and biosensing without explicit sequence knowledge.

PubMed

Lee, Ju Seok; Chen, Junghuei; Deaton, Russell; Kim, Jin-Woo

2014-01-01

Genetic material extracted from in situ microbial communities has high promise as an indicator of biological system status. However, the challenge is to access genomic information from all organisms at the population or community scale to monitor the biosystem's state. Hence, there is a need for a better diagnostic tool that provides a holistic view of a biosystem's genomic status. Here, we introduce an in vitro methodology for genomic pattern classification of biological samples that taps large amounts of genetic information from all genes present and uses that information to detect changes in genomic patterns and classify them. We developed a biosensing protocol, termed Biological Memory, that has in vitro computational capabilities to "learn" and "store" genomic sequence information directly from genomic samples without knowledge of their explicit sequences, and that discovers differences in vitro between previously unknown inputs and learned memory molecules. The Memory protocol was designed and optimized based upon (1) common in vitro recombinant DNA operations using 20-base random probes, including polymerization, nuclease digestion, and magnetic bead separation, to capture a snapshot of the genomic state of a biological sample as a DNA memory and (2) the thermal stability of DNA duplexes between new input and the memory to detect similarities and differences. For efficient read out, a microarray was used as an output method. When the microarray-based Memory protocol was implemented to test its capability and sensitivity using genomic DNA from two model bacterial strains, i.e., Escherichia coli K12 and Bacillus subtilis, results indicate that the Memory protocol can "learn" input DNA, "recall" similar DNA, differentiate between dissimilar DNA, and detect relatively small concentration differences in samples. 
This study demonstrated not only the in vitro information processing capabilities of DNA, but also its promise as a genomic pattern classifier that could access information from all organisms in a biological system without explicit genomic information. The Memory protocol has high potential for many applications, including in situ biomonitoring of ecosystems, screening for diseases, biosensing of pathological features in water and food supplies, and non-biological information processing of memory devices, among many.
Alteration of gene expression in human hepatocellular carcinoma with integrated hepatitis B virus DNA.

PubMed

Tamori, Akihiro; Yamanishi, Yoshihiro; Kawashima, Shuichi; Kanehisa, Minoru; Enomoto, Masaru; Tanaka, Hiromu; Kubo, Shoji; Shiomi, Susumu; Nishiguchi, Shuhei

2005-08-15

Integration of hepatitis B virus (HBV) DNA into the human genome is one of the most important steps in HBV-related carcinogenesis. This study attempted to find the link between HBV DNA, the adjoining cellular sequence, and altered gene expression in hepatocellular carcinoma (HCC) with integrated HBV DNA. We examined 15 cases of HCC infected with HBV by cassette ligation-mediated PCR. The human DNA adjacent to the integrated HBV DNA was sequenced. Protein coding sequences were searched for in the human sequence. In five cases with HBV DNA integration, from which good quality RNA was extracted, gene expression was examined by cDNA microarray analysis. The human DNA sequence successive to integrated HBV DNA was determined in the 15 HCCs. Eight protein-coding regions were involved: ras-responsive element binding protein 1, calmodulin 1, mixed lineage leukemia 2 (MLL2), FLJ333655, LOC220272, LOC255345, LOC220220, and LOC168991. The MLL2 gene was expressed in three cases with HBV DNA integrated into exon 3 of MLL2 and in one case with HBV DNA integrated into intron 3 of MLL2. Gene expression analysis suggested that two HCCs with HBV integrated into MLL2 had similar patterns of gene expression compared with three HCCs with HBV integrated into other loci of human chromosomes. HBV DNA was integrated at random sites of human DNA, and the MLL2 gene was one of the targets for integration. Our results suggest that HBV DNA might modulate human genes near integration sites, followed by integration site-specific expression of such genes during hepatocarcinogenesis.
DNA methylation and targeted sequencing of methyltransferases family genes in canine acute myeloid leukaemia, modelling human myeloid leukaemia.

PubMed

Bronzini, I; Aresu, L; Paganin, M; Marchioretto, L; Comazzi, S; Cian, F; Riondato, F; Marconato, L; Martini, V; Te Kronnie, G

2017-09-01

Tumours show aberrant DNA methylation patterns, being hypermethylated or hypomethylated compared with normal tissues. In human acute myeloid leukaemia (hAML), mutations in DNA methyltransferase (DNMT3A) are associated with more aggressive tumour behaviour. As AML is lethal in dogs, we defined global DNA methylation content and screened the C-terminal domain of the DNMT3 family of genes for sequence variants in 39 canine acute myeloid leukaemia (cAML) cases. A heterogeneous pattern of DNA methylation was found among cAML samples, with subsets of cases being hypermethylated or hypomethylated compared with healthy controls; four recurrent single nucleotide variations (SNVs) were found in the DNMT3L gene. Although SNVs were not directly correlated with whole genome DNA methylation levels, all hypomethylated cAML cases were homozygous for the deleterious mutation at p.Arg222Trp. This study contributes to the understanding of genetic modifications in cAML, leading up to studies that will elucidate the role of methylome alterations in the pathogenesis of AML in dogs. © 2016 John Wiley & Sons Ltd.
Contrasting patterns of evolution of 45S and 5S rDNA families uncover new aspects in the genome constitution of the agronomically important grass Thinopyrum intermedium (Triticeae).

PubMed

Mahelka, Václav; Kopecky, David; Baum, Bernard R

2013-09-01

We employed sequencing of clones and in situ hybridization (genomic and fluorescent in situ hybridization [GISH and rDNA-FISH]) to characterize both the sequence variation and genomic organization of 45S (herein ITS1-5.8S-ITS2 region) and 5S (5S gene + nontranscribed spacer) ribosomal DNA (rDNA) families in the allohexaploid grass Thinopyrum intermedium. Both rDNA families are organized within several rDNA loci within all three subgenomes of the allohexaploid species, but the two families have undergone different patterns of evolution. The 45S rDNA family has evolved in a concerted manner: internal transcribed spacer (ITS) sequences residing within the arrays of two of the three subgenomes were homogenized toward one major ribotype, whereas the third subgenome contained a minor proportion of distinct unhomogenized copies. Homogenization mechanisms such as unequal crossover and/or gene conversion were coupled with the loss of certain 45S rDNA loci. Unlike in the 45S family, the data suggest that neither interlocus homogenization among homeologous chromosomes nor locus loss occurred in 5S rDNA. Consistent with other Triticeae, the 5S rDNA family in intermediate wheatgrass comprised two distinct array types: the long- and short-spacer unit classes. Within the long and short units, we distinguished five and three different types, respectively, likely representing homeologous unit classes donated by putative parental species. Although the major ITS ribotype corresponds in our phylogenetic analysis to the E-genome species, the minor ribotype corresponds to Dasypyrum. 5S sequences suggested contributions from Pseudoroegneria, Dasypyrum, and Aegilops. The contribution from Aegilops to the intermediate wheatgrass genome is a new finding with implications for wheat improvement. We discuss rDNA evolution and the potential origin of intermediate wheatgrass.

High-resolution characterization of sequence signatures due to non-random cleavage of cell-free DNA.

PubMed

Chandrananda, Dineika; Thorne, Natalie P; Bahlo, Melanie

2015-06-17

High-throughput sequencing of cell-free DNA fragments found in human plasma has been used to non-invasively detect fetal aneuploidy, monitor organ transplants and investigate tumor DNA. However, many biological properties of this extracellular genetic material remain unknown. Research that further characterizes circulating DNA could substantially increase its diagnostic value by allowing the application of more sophisticated bioinformatics tools that lead to an improved signal-to-noise ratio in the sequencing data. In this study, we investigate various features of cell-free DNA in plasma using deep-sequencing data from two pregnant women (>70X, >50X) and compare them with matched cellular DNA. We utilize a descriptive approach to examine how the biological cleavage of cell-free DNA affects different sequence signatures such as fragment lengths, sequence motifs at fragment ends and the distribution of cleavage sites along the genome. We show that the size distributions of these cell-free DNA molecules are dependent on their autosomal and mitochondrial origin as well as the genomic location within chromosomes. DNA mapping to particular microsatellites and alpha repeat elements displays unique size signatures. By correlating the mapping locations of these fragments with ENCODE annotation of chromatin organization, we show how cell-free fragments occur in clusters along the genome, localizing to nucleosomal arrays, and are preferentially cleaved at linker regions. Our work further demonstrates that cell-free autosomal DNA cleavage is sequence dependent. The region spanning up to 10 positions on either side of the DNA cleavage site shows a consistent pattern of preference for specific nucleotides. This sequence motif is present in cleavage sites localized to nucleosomal cores and linker regions but is absent in nucleosome-free mitochondrial DNA. These background signals in cell-free DNA sequencing data stem from the non-random biological cleavage of these fragments. 
This sequence structure can be harnessed to improve bioinformatics algorithms, in particular for CNV and structural variant detection. Descriptive measures for cell-free DNA features developed here could also be used in biomarker analysis to monitor the changes that occur during different pathological conditions.
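The nucleotide preference around cleavage sites described in this record can be illustrated with a per-position frequency table. The following is only a minimal sketch, not the authors' pipeline: the function name `end_motif_frequencies` and the toy input windows are hypothetical, and it assumes that equal-length sequence windows centred on each cleavage site have already been extracted from aligned reads.

```python
from collections import Counter


def end_motif_frequencies(windows):
    """Return per-position A/C/G/T frequencies for equal-length windows.

    Each window is assumed (hypothetically) to be a short sequence
    centred on one fragment cleavage site.
    """
    if not windows:
        return []
    width = len(windows[0])
    if any(len(w) != width for w in windows):
        raise ValueError("all windows must have the same length")
    freqs = []
    for pos in range(width):
        # Count the bases observed at this offset across all windows.
        counts = Counter(w[pos].upper() for w in windows)
        total = sum(counts.values())
        freqs.append({base: counts.get(base, 0) / total for base in "ACGT"})
    return freqs


# Toy data, invented for illustration only.
toy_windows = ["ACGT", "ACGA", "TCGT", "ACCT"]
toy_freqs = end_motif_frequencies(toy_windows)
```

A position where one base has a frequency far from the background composition would be a candidate for the kind of cleavage-site preference the abstract reports.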
Biophysics of protein-DNA interactions and chromosome organization

PubMed Central

Marko, John F.

2014-01-01

The function of DNA in cells depends on its interactions with protein molecules, which recognize and act on base sequence patterns along the double helix. These notes aim to introduce basic polymer physics of DNA molecules, biophysics of protein-DNA interactions and their study in single-DNA experiments, and some aspects of large-scale chromosome structure. Mechanisms for control of chromosome topology will also be discussed. PMID:25419039
The problems and promise of DNA barcodes for species diagnosis of primate biomaterials

PubMed Central

Lorenz, Joseph G; Jackson, Whitney E; Beck, Jeanne C; Hanner, Robert

2005-01-01

The Integrated Primate Biomaterials and Information Resource (www.IPBIR.org) provides essential research reagents to the scientific community by establishing, verifying, maintaining, and distributing DNA and RNA derived from primate cell cultures. The IPBIR uses mitochondrial cytochrome c oxidase subunit I sequences to verify the identity of samples for quality control purposes in the accession, cell culture, DNA extraction processes and prior to shipping to end users. As a result, IPBIR is accumulating a database of ‘DNA barcodes’ for many species of primates. However, this quality control process is complicated by taxon specific patterns of ‘universal primer’ failure, as well as the amplification or co-amplification of nuclear pseudogenes of mitochondrial origins. To overcome these difficulties, taxon specific primers have been developed, and reverse transcriptase PCR is utilized to exclude these extraneous sequences from amplification. DNA barcoding of primates has applications to conservation and law enforcement. Depositing barcode sequences in a public database, along with primer sequences, trace files and associated quality scores, makes this species identification technique widely accessible. Reference DNA barcode sequences should be derived from, and linked to, specimens of known provenance in web-accessible collections in order to validate this system of molecular diagnostics. PMID:16214744
Design pattern mining using distributed learning automata and DNA sequence alignment.

PubMed

Esmaeilpour, Mansour; Naderifar, Vahideh; Shukur, Zarina

2014-01-01

Over the last decade, design patterns have been used extensively to generate reusable solutions to frequently encountered problems in software engineering and object-oriented programming. A design pattern is a repeatable software design solution that provides a template for solving various instances of a general problem. This paper describes a new method for mining design patterns, isolating them and the relationships between them, and a related tool, DLA-DNA, covering all implemented patterns and all projects used for evaluation. DLA-DNA, which is based on distributed learning automata (DLA) and deoxyribonucleic acid (DNA) sequence alignment, achieves acceptable precision and recall relative to the other evaluated tools. The proposed method mines structural design patterns in object-oriented source code and extracts the strong and weak relationships between them, enabling analyzers and programmers to determine the dependency rate of each object, component, and other section of the code for parameter passing and modular programming. The proposed model detects design patterns, and the strengths of their relationships, better than the other available tools, namely Pinot, PTIDEJ and DPJF. The results demonstrate that, whether the source code is built in a standard or a non-standard way with respect to the design patterns, the results of the proposed method are close to those of DPJF and better than those of Pinot and PTIDEJ. The proposed model was tested on several source codes and compared with other related models and available tools; the results show that the precision and recall of the proposed method are, on average, 20% and 9.6% higher than those of Pinot, 27% and 31% higher than those of PTIDEJ, and 3.3% and 2% higher than those of DPJF, respectively. The proposed method is organized in two steps: in the first step, elemental design patterns are identified; in the second step, these are composed to recognize actual design patterns.
Identification of genes from pattern formation, tyrosine kinase, and potassium channel families by DNA amplification

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kamb, A.; Weir, M.; Rudy, B.

1989-06-01

The study of gene family members has been aided by the isolation of related genes on the basis of DNA homology. The authors have adapted the polymerase chain reaction to screen animal genomes very rapidly and reliably for likely gene family members. Using conserved amino acid sequences to design degenerate oligonucleotide primers, they have shown that the genome of the nematode Caenorhabditis elegans contains sequences homologous to many Drosophila genes involved in pattern formation, including the segment polarity gene wingless (vertebrate int-1), and homeobox sequences characteristic of the Antennapedia, engrailed, and paired families. In addition, they have used this method to show that C. elegans contains at least five different sequences homologous to genes in the tyrosine kinase family. Lastly, they have isolated six potassium channel sequences from humans, a result that validates the utility of the method with large genomes and suggests that human potassium channel gene diversity may be extensive.
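Designing degenerate primers from conserved amino acid sequences, as this record describes, involves a trade-off: the more codons each residue can be encoded by, the more distinct oligonucleotides the primer pool must contain. The sketch below estimates that pool size using codon counts from the standard genetic code. The function name `primer_degeneracy` and the example peptides are hypothetical and are not taken from the paper.

```python
from math import prod

# Number of codons encoding each amino acid in the standard genetic code.
CODON_COUNT = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def primer_degeneracy(peptide):
    """Number of distinct DNA sequences that can encode `peptide`."""
    return prod(CODON_COUNT[aa] for aa in peptide.upper())


# Hypothetical peptides chosen only to show the range of outcomes.
low = primer_degeneracy("WFMNDK")   # rich in one- and two-codon residues
high = primer_degeneracy("LSR")     # three six-codon residues
```

Under these assumptions, a stretch rich in Trp and Met gives a far smaller pool than one rich in Leu, Ser, or Arg, which is one reason such residues are usually favoured when choosing where to place a degenerate primer.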
Directed nucleation assembly of DNA tile complexes for barcode-patterned lattices

PubMed Central

Yan, Hao; LaBean, Thomas H.; Feng, Liping; Reif, John H.

2003-01-01

The programmed self-assembly of patterned aperiodic molecular structures is a major challenge in nanotechnology and has numerous potential applications for nanofabrication of complex structures and useful devices. Here we report the construction of an aperiodic patterned DNA lattice (barcode lattice) by a self-assembly process of directed nucleation of DNA tiles around a scaffold DNA strand. The input DNA scaffold strand, constructed by ligation of shorter synthetic oligonucleotides, provides layers of the DNA lattice with barcode patterning information represented by the presence or absence of DNA hairpin loops protruding out of the lattice plane. Self-assembly of multiple DNA tiles around the scaffold strand was shown to result in a patterned lattice containing barcode information of 01101. We have also demonstrated the reprogramming of the system to another patterning. An inverted barcode pattern of 10010 was achieved by modifying the scaffold strands and one of the strands composing each tile. A ribbon lattice, consisting of repetitions of the barcode pattern with expected periodicity, was also constructed by the addition of sticky ends. The patterning of both classes of lattices was clearly observable via atomic force microscopy. These results represent a step toward implementation of a visual readout system capable of converting information encoded on a 1D DNA strand into a 2D form readable by advanced microscopic techniques. A functioning visual output method would not only increase the readout speed of DNA-based computers, but may also find use in other sequence identification techniques such as mutation or allele mapping. PMID:12821776
cDNA-AFLP analysis reveals differential gene expression in compatible interaction of wheat challenged with Puccinia striiformis f. sp. tritici

PubMed Central

Wang, Xiaojie; Tang, Chunlei; Zhang, Gang; Li, Yingchun; Wang, Chenfang; Liu, Bo; Qu, Zhipeng; Zhao, Jie; Han, Qingmei; Huang, Lili; Chen, Xianming; Kang, Zhensheng

2009-01-01

Background Puccinia striiformis f. sp. tritici is a fungal pathogen causing stripe rust, one of the most important wheat diseases worldwide. The fungus is strictly biotrophic and thus completely dependent on living host cells for its reproduction, which makes it difficult to study genes of the pathogen. In spite of its economic importance, little is known about the molecular basis of the compatible interaction between the pathogen and its wheat host. In this study, we identified wheat and P. striiformis genes associated with the infection process by conducting a large-scale transcriptomic analysis using cDNA-AFLP. Results Of the total 54,912 transcript derived fragments (TDFs) obtained using cDNA-AFLP with 64 primer pairs, 2,306 (4.2%) displayed altered expression patterns after inoculation, of which 966 were up-regulated and 1,340 down-regulated. Of the 208 TDFs selected for sequencing, 186 produced reliable sequences, of which 74 (40%) had known functions through BLAST searching the GenBank database. The majority of the latter group had predicted gene products involved in energy (13%), signal transduction (5.4%), disease/defence (5.9%) and metabolism (5% of the sequenced TDFs). BLAST searching of the wheat stem rust fungus genome database identified 18 TDFs possibly from the stripe rust pathogen, of which 9 were validated as being of pathogen origin using PCR-based assays followed by sequencing confirmation. Of the 186 reliable TDFs, 29 homologous to genes known to play a role in disease/defence or signal transduction, or to uncharacterized genes, were further selected for validation of cDNA-AFLP expression patterns using qRT-PCR analyses. Results confirmed the altered expression patterns of 28 (96.5%) genes revealed by the cDNA-AFLP technique. Conclusion The results show that cDNA-AFLP is a reliable technique for studying expression patterns of genes involved in the wheat-stripe rust interactions. 
Genes involved in compatible interactions between wheat and the stripe rust pathogen were identified and their expression patterns were determined. The present study should be helpful in elucidating the molecular basis of the infection process, and identifying genes that can be targeted for inhibiting the growth and reproduction of the pathogen. Moreover, this study can also be used to elucidate the defence responses of the genes that were of plant origin. PMID:19566949
A parallelized binary search tree

USDA-ARS's Scientific Manuscript database

PTTRNFNDR is an unsupervised statistical learning algorithm that detects patterns in DNA sequences, protein sequences, or any natural language texts that can be decomposed into letters of a finite alphabet. PTTRNFNDR performs complex mathematical computations and its processing time increases when i...
Molecular Analysis and Genomic Organization of Major DNA Satellites in Banana (Musa spp.)

PubMed Central

Čížková, Jana; Hřibová, Eva; Humplíková, Lenka; Christelová, Pavla; Suchánková, Pavla; Doležel, Jaroslav

2013-01-01

Satellite DNA sequences consist of tandemly arranged repetitive units up to thousands of nucleotides long in head-to-tail orientation. The evolutionary processes by which satellites arise and evolve include unequal crossing over, gene conversion, transposition and extrachromosomal circular DNA formation. Large blocks of satellite DNA are often observed in heterochromatic regions of chromosomes and are a typical component of centromeric and telomeric regions. Satellite-rich loci may show specific banding patterns and facilitate chromosome identification and analysis of structural chromosome changes. Unlike many other genomes, nuclear genomes of banana (Musa spp.) are poor in satellite DNA and the information on this class of DNA remains limited. The banana cultivars are seed-sterile clones originating mostly from natural intra-specific crosses within M. acuminata (A genome) and inter-specific crosses between M. acuminata and M. balbisiana (B genome). Previous studies revealed the closely related nature of the A and B genomes, including similarities in repetitive DNA. In this study we focused on two main banana DNA satellites, which were previously identified in silico. Their genomic organization and molecular diversity were analyzed in a set of nineteen Musa accessions, including representatives of A, B and S (M. schizocarpa) genomes and their inter-specific hybrids. The two DNA satellites showed a high level of sequence conservation within, and a high homology between, Musa species. FISH with probes for the satellite DNA sequences, rRNA genes and a single-copy BAC clone 2G17 resulted in characteristic chromosome banding patterns in M. acuminata and M. balbisiana which may aid in determining genomic constitution in interspecific hybrids. In addition to improving the knowledge on Musa satellite DNA, our study increases the number of cytogenetic markers and the number of individual chromosomes that can be identified in Musa. PMID:23372772
Molecular analysis and genomic organization of major DNA satellites in banana (Musa spp.).

PubMed

Čížková, Jana; Hřibová, Eva; Humplíková, Lenka; Christelová, Pavla; Suchánková, Pavla; Doležel, Jaroslav

2013-01-01

Satellite DNA sequences consist of tandemly arranged repetitive units up to thousands of nucleotides long in head-to-tail orientation. The evolutionary processes by which satellites arise and evolve include unequal crossing over, gene conversion, transposition and extrachromosomal circular DNA formation. Large blocks of satellite DNA are often observed in heterochromatic regions of chromosomes and are a typical component of centromeric and telomeric regions. Satellite-rich loci may show specific banding patterns and facilitate chromosome identification and analysis of structural chromosome changes. Unlike many other genomes, nuclear genomes of banana (Musa spp.) are poor in satellite DNA and the information on this class of DNA remains limited. The banana cultivars are seed-sterile clones originating mostly from natural intra-specific crosses within M. acuminata (A genome) and inter-specific crosses between M. acuminata and M. balbisiana (B genome). Previous studies revealed the closely related nature of the A and B genomes, including similarities in repetitive DNA. In this study we focused on two main banana DNA satellites, which were previously identified in silico. Their genomic organization and molecular diversity were analyzed in a set of nineteen Musa accessions, including representatives of A, B and S (M. schizocarpa) genomes and their inter-specific hybrids. The two DNA satellites showed a high level of sequence conservation within, and a high homology between, Musa species. FISH with probes for the satellite DNA sequences, rRNA genes and a single-copy BAC clone 2G17 resulted in characteristic chromosome banding patterns in M. acuminata and M. balbisiana which may aid in determining genomic constitution in interspecific hybrids. In addition to improving the knowledge on Musa satellite DNA, our study increases the number of cytogenetic markers and the number of individual chromosomes that can be identified in Musa.
Transcription Factors Bind Thousands of Active and Inactive Regions in the Drosophila Blastoderm

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Xiao-Yong; MacArthur, Stewart; Bourgon, Richard

2008-01-10

Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. Here, we use whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched in bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over forty well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. 
Together these observations suggest that many of these poorly-bound regions are not involved in early-embryonic transcriptional regulation, and a significant proportion may be nonfunctional. Surprisingly, for five of the six factors, their recognition sites are not unambiguously more constrained evolutionarily than the immediate flanking DNA, even in more highly bound and presumably functional regions, indicating that comparative DNA sequence analysis is limited in its ability to identify functional transcription factor targets.
In Vivo Control of CpG and Non-CpG DNA Methylation by DNA Methyltransferases

PubMed Central

Arand, Julia; Spieler, David; Karius, Tommy; Branco, Miguel R.; Meilinger, Daniela; Meissner, Alexander; Jenuwein, Thomas; Xu, Guoliang; Leonhardt, Heinrich; Wolf, Verena; Walter, Jörn

2012-01-01

The enzymatic control of the setting and maintenance of symmetric and non-symmetric DNA methylation patterns in a particular genome context is not well understood. Here, we describe a comprehensive analysis of DNA methylation patterns generated by high resolution sequencing of hairpin-bisulfite amplicons of selected single copy genes and repetitive elements (LINE1, B1, IAP-LTR-retrotransposons, and major satellites). The analysis unambiguously identifies a substantial amount of regional incomplete methylation maintenance, i.e. hemimethylated CpG positions, with variant degrees among cell types. Moreover, non-CpG cytosine methylation is confined to ESCs and exclusively catalysed by Dnmt3a and Dnmt3b. This sequence position–, cell type–, and region-dependent non-CpG methylation is strongly linked to neighboring CpG methylation and requires the presence of Dnmt3L. The generation of a comprehensive data set of 146,000 CpG dyads was used to apply and develop parameter estimated hidden Markov models (HMM) to calculate the relative contribution of DNA methyltransferases (Dnmts) for de novo and maintenance DNA methylation. The comparative modelling included wild-type ESCs and mutant ESCs deficient for Dnmt1, Dnmt3a, Dnmt3b, or Dnmt3a/3b, respectively. The HMM analysis identifies a considerable de novo methylation activity for Dnmt1 at certain repetitive elements and single copy sequences. Dnmt3a and Dnmt3b contribute de novo function. However, both enzymes are also essential to maintain symmetrical CpG methylation at distinct repetitive and single copy sequences in ESCs. PMID:22761581
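Hairpin-bisulfite sequencing reports the methylation state of both strands at each CpG dyad, which is what lets the study above distinguish hemimethylated from fully methylated positions. The sketch below only tallies dyad states; it is not the hidden Markov model the authors describe. The function names and the boolean-pair input format are assumptions made for illustration.

```python
from collections import Counter

STATES = ("fully methylated", "hemimethylated", "unmethylated")


def classify_dyad(upper_methylated, lower_methylated):
    """Classify one CpG dyad from the methylation calls on its two strands."""
    if upper_methylated and lower_methylated:
        return "fully methylated"
    if upper_methylated or lower_methylated:
        return "hemimethylated"
    return "unmethylated"


def dyad_state_fractions(dyads):
    """Fraction of dyads in each state; `dyads` is a list of (upper, lower) booleans."""
    if not dyads:
        raise ValueError("no dyads supplied")
    counts = Counter(classify_dyad(u, l) for u, l in dyads)
    total = sum(counts.values())
    return {state: counts[state] / total for state in STATES}


# Toy calls, invented for illustration only.
toy_dyads = [(True, True), (True, False), (False, True), (False, False)]
toy_fractions = dyad_state_fractions(toy_dyads)
```

Comparing such fractions between wild-type and Dnmt-deficient cells would give a first, model-free view of maintenance failure before any HMM is fitted.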
In vitro excision of adeno-associated virus DNA from recombinant plasmids: Isolation of an enzyme fraction from HeLa cells that cleaves DNA at poly(G) sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gottlieb, J.; Muzyczka, N.

1988-06-01

When circular recombinant plasmids containing adeno-associated virus (AAV) DNA sequences are transfected into human cells, the AAV provirus is rescued. Using these circular AAV plasmids as substrates, the authors isolated an enzyme fraction from HeLa cell nuclear extracts that excises intact AAV DNA in vitro from vector DNA and produces linear DNA products. The recognition signal for the enzyme is a polypurine-polypyrimidine sequence which is at least 9 residues long and rich in G·C base pairs. Such sequences are present in AAV recombinant plasmids as part of the first 15 base pairs of the AAV terminal repeat and in some cases as the result of cloning the AAV genome by G·C tailing. The isolated enzyme fraction does not have significant endonucleolytic activity on single-stranded or double-stranded DNA. Plasmid DNA that is transfected into tissue culture cells is cleaved in vivo to produce a pattern of DNA fragments similar to that seen with purified enzyme in vitro. The activity has been called endo R for rescue, and its behavior suggests that it may have a role in recombination of cellular chromosomes.
Creation of a type IIS restriction endonuclease with a long recognition sequence

PubMed Central

Lippow, Shaun M.; Aha, Patti M.; Parker, Matthew H.; Blake, William J.; Baynes, Brian M.; Lipovšek, Daša

2009-01-01

Type IIS restriction endonucleases cleave DNA outside their recognition sequences, and are therefore particularly useful in the assembly of DNA from smaller fragments. A limitation of type IIS restriction endonucleases in assembly of long DNA sequences is the relative abundance of their target sites. To facilitate ligation-based assembly of extremely long pieces of DNA, we have engineered a new type IIS restriction endonuclease that combines the specificity of the homing endonuclease I-SceI with the type IIS cleavage pattern of FokI. We linked a non-cleaving mutant of I-SceI, which conveys to the chimeric enzyme its specificity for an 18-bp DNA sequence, to the catalytic domain of FokI, which cuts DNA at a defined site outside the target site. Whereas previously described chimeric endonucleases do not produce type IIS-like precise DNA overhangs suitable for ligation, our chimeric endonuclease cleaves double-stranded DNA exactly 2 and 6 nt from the target site to generate homogeneous, 5′, four-base overhangs, which can be ligated with 90% fidelity. We anticipate that these enzymes will be particularly useful in manipulation of DNA fragments larger than a thousand bases, which are very likely to contain target sites for all natural type IIS restriction endonucleases. PMID:19304757
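The cleavage geometry described in the abstract (cuts 2 and 6 nt from the target site, leaving a 5′ four-base overhang) can be sketched as simple string arithmetic. This is an illustration only: it assumes the top strand is cut 2 nt and the bottom strand 6 nt downstream of the site, and the 18-bp site used in the example is the commonly cited I-SceI recognition sequence, taken here as an assumption rather than from the abstract. The function name and return format are hypothetical.

```python
def type_iis_cut(top_strand, site, top_offset=2, bottom_offset=6):
    """Locate `site` and return the pieces left by a staggered downstream cut.

    Offsets are counted in nucleotides from the 3' end of the site on the
    top strand. Returns None if the site is absent or the cut falls off
    the end of the sequence.
    """
    start = top_strand.find(site)
    if start == -1:
        return None
    site_end = start + len(site)
    top_cut = site_end + top_offset
    bottom_cut = site_end + bottom_offset
    if bottom_cut > len(top_strand):
        return None
    return {
        "left_top": top_strand[:top_cut],             # top strand of the left fragment
        "overhang": top_strand[top_cut:bottom_cut],   # single-stranded 5' overhang, top-strand reading
        "right_duplex": top_strand[bottom_cut:],      # fully double-stranded remainder
    }

# Assumed 18-bp target site (commonly cited I-SceI sequence)
SITE = "TAGGGATAACAGGGTAAT"
sequence = "CC" + SITE + "AC" + "GTCA" + "TTTT"
result = type_iis_cut(sequence, SITE)
```

Because the overhang lies outside the recognition site, its four bases are set by the flanking sequence, which is what makes such enzymes convenient for ordered fragment assembly.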
A Tandemly Arranged Pattern of Two 5S rDNA Arrays in Amolops mantzorum (Anura, Ranidae).

PubMed

Liu, Ting; Song, Menghuan; Xia, Yun; Zeng, Xiaomao

2017-01-01

In an attempt to extend the knowledge of the 5S rDNA organization in anurans, the 5S rDNA sequences of Amolops mantzorum were isolated, characterized, and mapped by FISH. Two forms of 5S rDNA, type I (209 bp) and type II (about 870 bp), were found in specimens investigated from various populations. Both contained a 118-bp coding sequence and were readily differentiated by their non-transcribed spacer (NTS) sizes and compositions. Four probes (the 5S rDNA coding sequences, the type I NTS, the type II NTS, and the entire type II 5S rDNA sequences) were respectively labeled with TAMRA or digoxigenin to hybridize with mitotic chromosomes for samples of all localities. All probes showed the same signals, which appeared in every centromeric region and in the telomeric regions of chromosome 5, without differences within or between populations. Evidently, the type I and type II 5S rDNA arrays are arranged in tandem, in contrast with other frogs and fishes recorded to date. More interestingly, all the probes detected centromeric regions in all karyotypes, suggesting the presence of a satellite DNA family derived from 5S rDNA. © 2017 S. Karger AG, Basel.
Genomic in situ hybridization in interspecific hybrids of scallops (Bivalvia, Pectinidae) and localization of the satellite DNA Cf303, and the vertebrate telomeric sequences (TTAGGG)n on chromosomes of scallop Chlamys farreri (Jones & Preston, 1904)

PubMed Central

Hu, Liping; Jiang, Liming; Bi, Ke; Liao, Huan; Yang, Zujing; Huang, Xiaoting; Bao, Zhenmin

2018-01-01

Mitotic chromosome preparations of the interspecific hybrids Chlamys farreri (Jones & Preston, 1904) × Patinopecten yessoensis (Jay, 1857), C. farreri × Argopecten irradians (Lamarck, 1819) and C. farreri × Mimachlamys nobilis (Reeve, 1852) were used to compare two different scallop genomes on a single slide. Although genomic in situ hybridization (GISH) using genomic DNA from each scallop species as probe painted mitotic chromosomes of the interspecific hybrids, the painting results were not uniform; instead, they showed species-specific distribution patterns of fluorescent signals among the chromosomes. The most prominent GISH-bands were mainly located at centromeric or telomeric regions of scallop chromosomes. In order to illustrate the sequence constitution of the GISH-bands, the satellite Cf303 sequences of C. farreri and the vertebrate telomeric (TTAGGG)n sequences were used to map mitotic chromosomes of C. farreri by fluorescence in situ hybridization (FISH). The results indicated that the GISH-banding pattern presented by the chromosomes of C. farreri is mainly due to the distribution of the satellite Cf303 DNA, therefore suggesting that the GISH-banding patterns found in the other three scallops could also be the result of the chromosomal distribution of other species-specific satellite DNAs. PMID:29675138
Singular over-representation of an octameric palindrome, HIP1, in DNA from many cyanobacteria.

PubMed

Robinson, N J; Robinson, P J; Gupta, A; Bleasby, A J; Whitton, B A; Morby, A P

1995-03-11

An octameric palindrome (5'-GCGATCGC-3') is abundant in cyanobacterial sequences within databases (GenBank/EMBL) and was designated HIP1 (highly iterated palindrome). The frequency of occurrence of all 256 octameric palindromes has now been determined in sub-databases revealing large and unique over-representation of HIP1 in cyanobacterial entries. DNA sequences from other bacteria were searched for any over-represented octameric palindromes analogous to HIP1. Only two sequences were identified, in the genomes of a thermophile and halophilic archaebacteria, although these were less abundant than HIP1 in cyanobacteria and relate to codon usage. To test the proposed widespread distribution of HIP1 in DNA from the cyanobacterium Synechococcus PCC 6301, randomly selected genomic clones were partly sequenced. HIP1 constituted 2.5% of the novel sequences, equivalent to a site on average once every 320 nucleotides. An oligonucleotide including HIP1 was also tested in PCR. Multiple products were obtained using template DNA from cyanobacterial strains in which HIP1 is abundant in known sequences, and some strains generated characteristic HIP-PCR banding patterns. However, analysis of DNA from one strain (not previously represented in databases) by random sequencing, HIP-PCR and PvuI digestion, confirms that not all cyanobacterial genomes are rich in HIP1.
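The figure of 256 octameric palindromes follows from the fact that a reverse-complement palindrome of length 8 is fully determined by its first four bases (4^4 = 256). The sketch below enumerates them and counts overlapping occurrences of HIP1 in a toy sequence. The helper names and the toy sequence are hypothetical; this is not the database search used in the study.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def octameric_palindromes():
    """All 8-mers equal to their own reverse complement (first half fixes the rest)."""
    halves = ("".join(bases) for bases in product("ACGT", repeat=4))
    return [half + reverse_complement(half) for half in halves]

def count_overlapping(seq, word):
    """Count occurrences of `word` in `seq`, allowing overlaps."""
    count = 0
    position = seq.find(word)
    while position != -1:
        count += 1
        position = seq.find(word, position + 1)
    return count

HIP1 = "GCGATCGC"
palindromes = octameric_palindromes()

# Toy sequence with three HIP1 sites, two of which overlap
toy = "AAGCGATCGCTTGCGATCGCGATCGCAA"
hip1_sites = count_overlapping(toy, HIP1)

# One 8-nt site every 320 nt corresponds to 8 / 320 = 2.5% of the sequence
hip1_fraction = 8 / 320
```

Counting every palindrome in `palindromes` against the same sequence gives the kind of frequency table from which over-representation of a single word can be judged.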
Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm

PubMed Central

Glunčić, Matko; Paar, Vladimir

2013-01-01

The main feature of the global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of a complete K-string ensemble, which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in the GRM diagram. In this way, we obtain a very fast, efficient and highly automated repeat-finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. The GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes). PMID:22977183
Comparative analysis of DNA methylation polymorphism in drought sensitive (HPKC2) and tolerant (HPK4) genotypes of horse Gram (Macrotyloma uniflorum).

PubMed

Bhardwaj, Jyoti; Mahajan, Monika; Yadav, Sudesh Kumar

2013-08-01

DNA methylation is known as an epigenetic modification that affects gene expression in plants. Variation in CpG methylation behavior was studied in two natural horse gram (Macrotyloma uniflorum [Lam.] Verdc.) genotypes, HPKC2 (drought-sensitive) and HPK4 (drought-tolerant). The methylation pattern in both genotypes was studied through methylation-sensitive amplified polymorphism. The results revealed that methylation was higher in HPKC2 (10.1%) than in HPK4 (8.6%). Sequencing demonstrated sequence homology with the DRE binding factor (cbf1), the POZ/BTB protein, and the Ty1-copia retrotransposon among some of the polymorphic fragments showing alteration in methylation behavior. Differences in DNA methylation patterns could explain the differential drought tolerance and the epigenetic signature of these two horse gram genotypes.

Wavelet analysis of frequency chaos game signal: a time-frequency signature of the C. elegans DNA.

PubMed

Messaoudi, Imen; Oueslati, Afef Elloumi; Lachiri, Zied

2014-12-01

Challenging tasks are encountered in the field of bioinformatics. The choice of the genomic sequence's mapping technique is one of the most fastidious tasks. A judicious choice helps in examining the distribution of periodic patterns that accord with the underlying structure of genomes. Despite that, searching for a coding technique that can highlight all the information contained in the DNA has not yet attracted the attention it deserves. In this paper, we propose a new mapping technique based on the chaos game theory that we call the frequency chaos game signal (FCGS). The particularity of the FCGS coding resides in exploiting the statistical properties of the genomic sequence itself. This may reflect important structural and organizational features of DNA. To prove the usefulness of the FCGS approach in the detection of different local periodic patterns, we use the wavelet analysis because it provides access to information that can be obscured by other time-frequency methods such as the Fourier analysis. Thus, we apply the continuous wavelet transform (CWT) with the complex Morlet wavelet as a mother wavelet function. Scalograms that relate to the organism Caenorhabditis elegans (C. elegans) exhibit a multitude of periodic organizations of specific DNA sequences.
Importance Sampling of Word Patterns in DNA and Protein Sequences

PubMed Central

Chan, Hock Peng; Chen, Louis H.Y.

2010-01-01

Monte Carlo methods can provide accurate p-value estimates of word counting test statistics and are easy to implement. They are especially attractive when an asymptotic theory is absent or when either the search sequence or the word pattern is too short for the application of asymptotic formulae. Naive direct Monte Carlo is undesirable for the estimation of small probabilities because the associated rare events of interest are seldom generated. We propose instead efficient importance sampling algorithms that use controlled insertion of the desired word patterns on randomly generated sequences. The implementation is illustrated on word patterns of biological interest: palindromes and inverted repeats, patterns arising from position-specific weight matrices (PSWMs), and co-occurrences of pairs of motifs. PMID:21128856
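For orientation, the naive direct Monte Carlo estimator that the abstract contrasts with importance sampling can be sketched as follows. This is not the authors' importance sampling algorithm. It assumes an i.i.d. uniform background over A, C, G and T, and the function names, default simulation count, and seed are hypothetical.

```python
import random

def count_word(seq, word):
    """Count occurrences of `word` in `seq`, allowing overlaps."""
    count = 0
    position = seq.find(word)
    while position != -1:
        count += 1
        position = seq.find(word, position + 1)
    return count

def naive_mc_pvalue(word, seq_len, observed, n_sim=2000, seed=0):
    """Estimate P(count >= observed) by simulating random sequences.

    Assumes independent, uniformly distributed bases; a real analysis
    would substitute a background model fitted to the data.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_sim):
        simulated = "".join(rng.choices("ACGT", k=seq_len))
        if count_word(simulated, word) >= observed:
            hits += 1
    return hits / n_sim

# An event that always occurs and one that cannot occur at this length
p_trivial = naive_mc_pvalue("GAATTC", 100, observed=0, n_sim=50)
p_impossible = naive_mc_pvalue("GAATTC", 20, observed=10, n_sim=50)
```

When the true probability is far smaller than 1/n_sim, this estimator will usually return exactly zero, which is the weakness that motivates importance sampling.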
New Stopping Criteria for Segmenting DNA Sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Wentian

2001-06-18

We propose a solution to the problem of choosing a stopping criterion when segmenting inhomogeneous DNA sequences with complex statistical patterns. This new stopping criterion is based on the Bayesian information criterion in the model selection framework. When this criterion is applied to a telomere of S. cerevisiae and the complete sequence of E. coli, borders of biologically meaningful units were identified, and a more reasonable number of domains was obtained. We also introduce a measure called segmentation strength which can be used to control the delineation of large domains. The relationship between the average domain size and the threshold of segmentation strength is determined for several genome sequences.
Aberrant DNA methylation patterns of spermatozoa in men with unexplained infertility.

PubMed

Urdinguio, Rocío G; Bayón, Gustavo F; Dmitrijeva, Marija; Toraño, Estela G; Bravo, Cristina; Fraga, Mario F; Bassas, Lluís; Larriba, Sara; Fernández, Agustín F

2015-05-01

Are there DNA methylation alterations in sperm that could explain the reduced biological fertility of male partners from couples with unexplained infertility? DNA methylation patterns, not only at specific loci but also at Alu Yb8 repetitive sequences, are altered in infertile individuals compared with fertile controls. Aberrant DNA methylation of sperm has been associated with human male infertility in patients demonstrating either deficiencies in the process of spermatogenesis or low semen quality. Case and control prospective study. This study compares 46 sperm samples obtained from 17 normospermic fertile men and 29 normospermic infertile patients. Illumina Infinium HD Human Methylation 450K arrays were used to identify genomic regions showing differences in sperm DNA methylation patterns between five fertile and seven infertile individuals. Additionally, global DNA methylation of sperm was measured using the Methylamp Global DNA Methylation Quantification Ultra kit (Epigentek) in 14 samples, and DNA methylation at several repetitive sequences (LINE-1, Alu Yb8, NBL2, D4Z4) measured by bisulfite pyrosequencing in 44 sperm samples. A sperm-specific DNA methylation pattern was obtained by comparing the sperm methylomes with the DNA methylomes of differentiated somatic cells using data obtained from methylation arrays (Illumina 450 K) of blood, neural and glial cells deposited in public databases. In this study we conduct, for the first time, a genome-wide study to identify alterations of sperm DNA methylation in individuals with unexplained infertility that may account for the differences in their biological fertility compared with fertile individuals. We have identified 2752 CpGs showing aberrant DNA methylation patterns, and more importantly, these differentially methylated CpGs were significantly associated with CpG sites which are specifically methylated in sperm when compared with somatic cells. 
We also found statistically significant (P < 0.001) associations between DNA hypomethylation and regions corresponding to those which, in somatic cells, are enriched in the repressive histone mark H3K9me3, and between DNA hypermethylation and regions enriched in H3K4me1 and CTCF, suggesting that the relationship between chromatin context and aberrant DNA methylation of sperm in infertile men could be locus-dependent. Finally, we also show that DNA methylation levels, not only at specific loci but also at several repetitive sequences (LINE-1, Alu Yb8, NBL2, D4Z4), were lower in sperm than in somatic cells. Interestingly, Alu Yb8 repetitive sequences in sperm samples from infertile patients showed significantly lower DNA methylation levels than those from controls. Our results are descriptive and further studies would be needed to elucidate the functional effects of aberrant DNA methylation on male fertility. Overall, our data suggest that aberrant sperm DNA methylation might contribute to fertility impairment in couples with unexplained infertility and they provide a promising basis for future research. This work has been financially supported by Fundación Científica de la AECC (to R.G.U.); IUOPA (to G.F.B.); FICYT (to E.G.T.); the Spanish National Research Council (CSIC; 200820I172 to M.F.F.); Fundación Ramón Areces (to M.F.F.); the Plan Nacional de I+D+I 2008-2011/2013-2016/FEDER (PI11/01728 to A.F.F., PI12/01080 to M.F.F. and PI12/00361 to S.L.); the PN de I+D+I 2008-2011 and the Generalitat de Catalunya (2009SGR01490). A.F.F. is sponsored by ISCIII-Subdirección General de Evaluación y Fomento de la Investigación (CP11/00131). S.L. is sponsored by the Researchers Stabilization Program from the Spanish National Health System (CES09/020). The IUOPA is supported by the Obra Social Cajastur, Spain. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved.
Foundations for a syntactic pattern recognition system for genomic DNA sequences. [Annual] report, 1 December 1991--31 March 1993

DOE Office of Scientific and Technical Information (OSTI.GOV)

Searles, D.B.

1993-03-01

The goal of the proposed work is the creation of a software system that will perform sophisticated pattern recognition and related functions at a level of abstraction and with expressive power beyond current general-purpose pattern-matching systems for biological sequences; and with a more uniform language, environment, and graphical user interface, and with greater flexibility, extensibility, embeddability, and ability to incorporate other algorithms, than current special-purpose analytic software.
Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster

PubMed Central

Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.

1993-01-01

Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)n (8 Mb), (AAGAG)n (7 Mb) and (AATAT)n (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654
Chromosome Evolution in Connection with Repetitive Sequences and Epigenetics in Plants

PubMed Central

Li, Shu-Fen; Su, Ting; Cheng, Guang-Qian; Wang, Bing-Xiao; Li, Xu; Deng, Chuan-Liang; Gao, Wu-Jun

2017-01-01

Chromosome evolution is a fundamental aspect of evolutionary biology. The evolution of chromosome size, structure and shape, number, and the change in DNA composition suggest the high plasticity of nuclear genomes at the chromosomal level. Repetitive DNA sequences, which represent a conspicuous fraction of every eukaryotic genome, particularly in plants, are found to be tightly linked with plant chromosome evolution. Different classes of repetitive sequences have distinct distribution patterns on the chromosomes. Mounting evidence shows that repetitive sequences may play multiple generative roles in shaping the chromosome karyotypes in plants. Furthermore, recent development in our understanding of the repetitive sequences and plant chromosome evolution has elucidated the involvement of a spectrum of epigenetic modification. In this review, we focused on the recent evidence relating to the distribution pattern of repetitive sequences in plant chromosomes and highlighted their potential relevance to chromosome evolution in plants. We also discussed the possible connections between evolution and epigenetic alterations in chromosome structure and repatterning, such as heterochromatin formation, centromere function, and epigenetic-associated transposable element inactivation. PMID:29064432
Meta-Analysis of Mitochondrial DNA Variation in the Iberian Peninsula

PubMed Central

Barral-Arca, Ruth; Pischedda, Sara; Gómez-Carballa, Alberto; Pastoriza, Ana; Mosquera-Miguel, Ana; López-Soto, Manuel; Martinón-Torres, Federico; Álvarez-Iglesias, Vanesa; Salas, Antonio

2016-01-01

The Iberian Peninsula has been the focus of attention of numerous studies dealing with mitochondrial DNA (mtDNA) variation, most of them targeting the control region segment. In the present study we sequenced the control region of 3,024 Spanish individuals from areas where available data were still limited. We also compiled mtDNA haplotypes from the literature involving 4,588 sequences and 28 population groups or small regions. We meta-analyzed all these data in order to shed further light on patterns of geographic variation, taking advantage of the large sample size and geographic coverage, in contrast with the atomized sampling strategy of previous work. The results indicate that the main mtDNA haplogroups show primarily clinal geographic patterns across the Iberian geography, roughly along a North-South axis. Haplogroup HV0 (where haplogroup V is nested) is more prevalent in the Franco-Cantabrian region, in good agreement with previous findings that identified this area as a climate refuge during the Last Glacial Maximum (LGM), prior to a subsequent demographic re-expansion towards Central Europe and the Mediterranean. Typical sub-Saharan and North African lineages are slightly more prevalent in South Iberia, although at low frequencies; this pattern has been shaped mainly by the transatlantic slave trade and the Arab invasion of the Iberian Peninsula. The results also indicate that summary statistics that aim to measure molecular variation, or AMOVA, have limited sensitivity to detect population substructure, in contrast to patterns revealed by phylogeographic analysis. Overall, the results suggest that mtDNA variation in Iberia is substantially stratified. These patterns might be relevant in biomedical studies given that stratification is a common cause of false positives in case-control mtDNA association studies, and should also be considered when weighting the DNA evidence in forensic casework, which is strongly dependent on haplotype frequencies. PMID:27441366
Transcription blockage by stable H-DNA analogs in vitro

PubMed Central

Pandey, Shristi; Ogloblina, Anna M.; Belotserkovskii, Boris P.; Dolinnaya, Nina G.; Yakubovskaya, Marianna G.; Mirkin, Sergei M.; Hanawalt, Philip C.

2015-01-01

DNA sequences that can form unusual secondary structures are implicated in regulating gene expression and causing genomic instability. H-palindromes are an important class of such DNA sequences that can form an intramolecular triplex structure, H-DNA. Within an H-palindrome, the H-DNA and canonical B-DNA are in a dynamic equilibrium that shifts toward H-DNA with increased negative supercoiling. The interplay between H- and B-DNA and the fact that the process of transcription affects supercoiling makes it difficult to elucidate the effects of H-DNA upon transcription. We constructed a stable structural analog of H-DNA that cannot flip into B-DNA, and studied the effects of this structure on transcription by T7 RNA polymerase in vitro. We found multiple transcription blockage sites adjacent to and within sequences engaged in this triplex structure. Triplex-mediated transcription blockage varied significantly with changes in ambient conditions: it was exacerbated in the presence of Mn2+ or by increased concentrations of K+ and Li+. Analysis of the detailed pattern of the blockage suggests that RNA polymerase is sterically hindered by H-DNA and has difficulties in unwinding triplex DNA. The implications of these findings for the biological roles of triple-stranded DNA structures are discussed. PMID:26101261
Phylogeography, intraspecific structure and sex-biased dispersal of Dall's porpoise, Phocoenoides dalli, revealed by mitochondrial and microsatellite DNA analyses.

PubMed

Escorza-Treviño, S; Dizon, A E

2000-08-01

Mitochondrial DNA (mtDNA) control-region sequences and microsatellite loci length polymorphisms were used to estimate phylogeographical patterns (historical patterns underlying contemporary distribution), intraspecific population structure and gender-biased dispersal of Phocoenoides dalli dalli across its entire range. One-hundred and thirteen animals from several geographical strata were sequenced over 379 bp of mtDNA, resulting in 58 mtDNA haplotypes. Analysis using F(ST) values (based on haplotype frequencies) and phi(ST) values (based on frequencies and genetic distances between haplotypes) yielded statistically significant separation (bootstrap values P < 0.05) among most of the stocks currently used for management purposes. A minimum spanning network of haplotypes showed two very distinctive clusters, differentially occupied by western and eastern populations, with some common widespread haplotypes. This suggests some degree of phyletic radiation from west to east, superimposed on gene flow. Highly male-biased migration was detected for several population comparisons. Nuclear microsatellite DNA markers (119 individuals and six loci) provided additional support for population subdivision and gender-biased dispersal detected in the mtDNA sequences. Analysis using F(ST) values (based on allelic frequencies) yielded statistically significant separation between some, but not all, populations distinguished by mtDNA analysis. R(ST) values (based on frequencies of and genetic distance between alleles) showed no statistically significant subdivision. Again, highly male-biased dispersal was detected for all population comparisons, suggesting, together with morphological and reproductive data, the existence of sexual selection. Our molecular results argue for nine distinct dalli-type populations that should be treated as separate units for management purposes.
Epigenetically-inherited centromere and neocentromere DNA replicates earliest in S-phase.

PubMed

Koren, Amnon; Tsai, Hung-Ji; Tirosh, Itay; Burrack, Laura S; Barkai, Naama; Berman, Judith

2010-08-19

Eukaryotic centromeres are maintained at specific chromosomal sites over many generations. In the budding yeast Saccharomyces cerevisiae, centromeres are genetic elements defined by a DNA sequence that is both necessary and sufficient for function; whereas, in most other eukaryotes, centromeres are maintained by poorly characterized epigenetic mechanisms in which DNA has a less definitive role. Here we use the pathogenic yeast Candida albicans as a model organism to study the DNA replication properties of centromeric DNA. By determining the genome-wide replication timing program of the C. albicans genome, we discovered that each centromere is associated with a replication origin that is the first to fire on its respective chromosome. Importantly, epigenetic formation of new ectopic centromeres (neocentromeres) was accompanied by shifts in replication timing, such that a neocentromere became the first to replicate and became associated with origin recognition complex (ORC) components. Furthermore, changing the level of the centromere-specific histone H3 isoform led to a concomitant change in levels of ORC association with centromere regions, further supporting the idea that centromere proteins determine origin activity. Finally, analysis of centromere-associated DNA revealed a replication-dependent sequence pattern characteristic of constitutively active replication origins. This strand-biased pattern is conserved, together with centromere position, among related strains and species, in a manner independent of primary DNA sequence. Thus, inheritance of centromere position is correlated with a constitutively active origin of replication that fires at a distinct early time. We suggest a model in which the distinct timing of DNA replication serves as an epigenetic mechanism for the inheritance of centromere position.
An SRY mutation causing human sex reversal resolves a general mechanism of structure-specific DNA recognition: application to the four-way DNA junction.

PubMed

Peters, R; King, C Y; Ukiyama, E; Falsafi, S; Donahoe, P K; Weiss, M A

1995-04-11

SRY, a genetic "master switch" for male development in mammals, exhibits two biochemical activities: sequence-specific recognition of duplex DNA and sequence-independent binding to the sharp angles of four-way DNA junctions. Here, we distinguish between these activities by analysis of a mutant SRY associated with human sex reversal (46, XY female with pure gonadal dysgenesis). The substitution (I68T in human SRY) alters a nonpolar side chain in the minor-groove DNA recognition alpha-helix of the HMG box [Haqq, C.M., King, C.-Y., Ukiyama, E., Haqq, T.N., Falsafi, S., Donahoe, P.K., & Weiss, M.A. (1994) Science 266, 1494-1500]. The native (but not mutant) side chain inserts between specific base pairs in duplex DNA, interrupting base stacking at a site of induced DNA bending. Isotope-aided 1H-NMR spectroscopy demonstrates that analogous side-chain insertion occurs on binding of SRY to a four-way junction, establishing a shared mechanism of sequence- and structure-specific DNA binding. Although the mutant DNA-binding domain exhibits >50-fold reduction in sequence-specific DNA recognition, near wild-type affinity for four-way junctions is retained. Our results (i) identify a shared SRY-DNA contact at a site of either induced or intrinsic DNA bending, (ii) demonstrate that this contact is not required to bind an intrinsically bent DNA target, and (iii) rationalize patterns of sequence conservation or diversity among HMG boxes. Clinical association of the I68T mutation with human sex reversal supports the hypothesis that specific DNA recognition by SRY is required for male sex determination.
Cloning and restriction enzyme mapping of ribosomal DNA of Giardia duodenalis, Giardia ardeae and Giardia muris.

PubMed

van Keulen, H; Campbell, S R; Erlandsen, S L; Jarroll, E L

1991-06-01

In an attempt to study Giardia at the DNA sequence level, the rRNA genes of three species, Giardia duodenalis, Giardia ardeae and Giardia muris were cloned and restriction enzyme maps were constructed. The rDNA repeats of these Giardia show completely different restriction enzyme recognition patterns. The size of the rDNA repeat ranges from approximately 5.6 kb in G. duodenalis to 7.6 kb in both G. muris and G. ardeae. These size differences are mainly attributable to the variation in length of the spacer. Minor differences exist among these Giardia in the sizes of their small subunit rRNA and the internal transcribed spacer between small and large subunit rRNA. The genetic maps were constructed by sequence analysis of the DNA around the 5' and 3' ends of the mature rRNA genes and between the rRNA covering the 5.8S rRNA gene and internal transcribed spacer. Comparison of the 5.8S rDNA and 3' end of large subunit rDNA from these three Giardia species showed considerable sequence variation, but the rDNA sequences of G. duodenalis and G. ardeae appear more closely related to each other than to G. muris.
Characterization of kinetoplast DNA from Phytomonas serpens.

PubMed

Sá-Carvalho, D; Perez-Morga, D; Traub-Cseko, Y M

1993-01-01

The restriction enzyme digestion of kinetoplast DNA from four Phytomonas serpens isolates shows an overall similar band pattern. One minicircle from isolate 30T was cloned and sequenced, showing low levels of homology but the same general features and organization as described for minicircles of other trypanosomatids. Extensive regions of the minicircle are composed of G and T on the H strand. These regions are very repetitive and similar to regions in a minicircle of Crithidia oncopelti and to telomeric sequences of Saccharomyces cerevisiae. Conserved Sequence Block 3, present in all trypanosomatids, is one nucleotide different from the consensus in P. serpens and provides a basis to differentiate P. serpens from other trypanosomatids. Electron microscopy of kinetoplast DNA revealed a network with organization similar to other trypanosomatids, and the measurement of minicircles confirmed the size of about 1.45 kb of the sequenced minicircle.
Evaluation of next generation mtGenome sequencing using the Ion Torrent Personal Genome Machine (PGM)

PubMed Central

Parson, Walther; Strobl, Christina; Huber, Gabriela; Zimmermann, Bettina; Gomes, Sibylle M.; Souto, Luis; Fendt, Liane; Delport, Rhena; Langit, Reina; Wootton, Sharon; Lagacé, Robert; Irwin, Jodi

2013-01-01

Insights into the human mitochondrial phylogeny have been primarily achieved by sequencing full mitochondrial genomes (mtGenomes). In forensic genetics (partial) mtGenome information can be used to assign haplotypes to their phylogenetic backgrounds, which may, in turn, have characteristic geographic distributions that would offer useful information in a forensic case. In addition and perhaps even more relevant in the forensic context, haplogroup-specific patterns of mutations form the basis for quality control of mtDNA sequences. The current method for establishing (partial) mtDNA haplotypes is Sanger-type sequencing (STS), which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing (NGS) technologies, the body of available mtDNA data can potentially be extended much more quickly and cost-efficiently. Customized chemistries, laboratory workflows and data analysis packages could support the community and increase the utility of mtDNA analysis in forensics. We have evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing. A total of 64 mtGenomes (>1 million bases) were established that yielded high concordance with the corresponding STS haplotypes (<0.02% differences). About two-thirds of the differences were observed in or around homopolymeric sequence stretches. In addition, the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of alignment software would be desirable to facilitate the application of NGS in mtDNA forensic genetics. PMID:23948325
DNA methylation assessment from human slow- and fast-twitch skeletal muscle fibers

PubMed Central

Begue, Gwénaëlle; Raue, Ulrika; Jemiolo, Bozena

2017-01-01

A new application of the reduced representation bisulfite sequencing method was developed using low-DNA input to investigate the epigenetic profile of human slow- and fast-twitch skeletal muscle fibers. Successful library construction was completed with as little as 15 ng of DNA, and high-quality sequencing data were obtained with 32 ng of DNA. Analysis identified 143,160 differentially methylated CpG sites across 14,046 genes. In both fiber types, selected genes predominantly expressed in slow or fast fibers were hypomethylated, which was supported by the RNA-sequencing analysis. These are the first fiber type-specific methylation data from human skeletal muscle and provide a unique platform for future research. NEW & NOTEWORTHY This study validates a low-DNA input reduced representation bisulfite sequencing method for human muscle biopsy samples to investigate the methylation patterns at a fiber type-specific level. These are the first fiber type-specific methylation data reported from human skeletal muscle and thus provide initial insight into basal state differences in myosin heavy chain I and IIa muscle fibers among young, healthy men. PMID:28057818
DNATagger, colors for codons.

PubMed

Scherer, N M; Basso, D M

2008-09-16

DNATagger is a web-based tool for coloring and editing DNA, RNA and protein sequences and alignments. It is dedicated to the visualization of protein coding sequences and also protein sequence alignments to facilitate the comprehension of evolutionary processes in sequence analysis. The distinctive feature of DNATagger is the use of codons as informative units for coloring DNA and RNA sequences. The codons are colored according to their corresponding amino acids. It is the first program that colors codons in DNA sequences without being affected by "out-of-frame" gaps of alignments. It can handle single gaps and gaps inside the triplets. The program also provides the possibility to edit the alignments and change color patterns and translation tables. DNATagger is a JavaScript application, following the W3C guidelines, designed to work on standards-compliant web browsers. It therefore requires no installation and is platform independent. The web-based DNATagger is available as free and open source software at http://www.inf.ufrgs.br/~dmbasso/dnatagger/.
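The codon-based coloring described above can be illustrated with a minimal sketch. DNATagger itself is a JavaScript application and its source is not reproduced here; the Python function name tag_codons, the "-" gap character, and the use of the standard genetic code are assumptions made for illustration. The idea shown is that gap columns are skipped while triplets are collected, so a gap inside a codon does not shift the reading frame.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def tag_codons(aligned, gap="-"):
    """Label every alignment column with the amino acid of its codon.

    Gap columns and a trailing incomplete codon are labelled None.
    Codons with ambiguous bases are labelled "X".
    """
    labels = [None] * len(aligned)
    columns, codon = [], ""
    for i, ch in enumerate(aligned.upper()):
        if ch == gap:
            continue  # gaps never consume a codon position
        columns.append(i)
        codon += ch
        if len(codon) == 3:
            amino = CODON_TABLE.get(codon, "X")
            for c in columns:
                labels[c] = amino
            columns, codon = [], ""
    return labels


print(tag_codons("AT-GGCC"))
```

A color map keyed by amino acid letter could then be applied to the returned labels; which colors to use is left open here.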
The implication of DNA bending energy for nucleosome positioning and sliding.

PubMed

Liu, Guoqing; Xing, Yongqiang; Zhao, Hongyu; Cai, Lu; Wang, Jianying

2018-06-11

The nucleosome not only directly affects cellular processes, such as DNA replication, recombination, and transcription, but also serves as a fundamentally important target of epigenetic modifications. Our previous study indicated that the bending property of DNA is important in nucleosome formation, particularly in predicting the dyad positions of nucleosomes on a DNA segment. Here, we investigated the role of bending energy in nucleosome positioning and sliding in depth to decipher the sequence-directed mechanism. The results show that bending energy is a good physical index to predict the free energy in the process of nucleosome reconstitution in vitro. Our data also imply that at least 20% of the nucleosomes in budding yeast do not adopt canonical positioning, in which underlying sequences wrapped around histones are structurally symmetric. We also revealed distinct patterns of bending energy profile for distinctly organized chromatin structures, such as well-positioned nucleosomes, fuzzy nucleosomes, and linker regions, and discussed nucleosome sliding in terms of bending energy. We proposed that the stability of a nucleosome is positively correlated with the strength of the bending anisotropy of the DNA segment, and that both accessibility and directionality of nucleosome sliding are likely to be modulated by diverse patterns of DNA bending energy profile.
The chemical structure of DNA sequence signals for RNA transcription

NASA Technical Reports Server (NTRS)

George, D. G.; Dayhoff, M. O.

1982-01-01

The proposed recognition sites for RNA transcription for E. coli RNA polymerase, bacteriophage T7 RNA polymerase, and eukaryotic RNA polymerase Pol II are evaluated in the light of the requirements for efficient recognition. It is shown that although there is good experimental evidence that specific nucleic acid sequence patterns are involved in transcriptional regulation in bacteria and bacterial viruses, among the sequences now available, only in the case of the promoters recognized by bacteriophage T7 polymerase does it seem likely that the pattern is sufficient. It is concluded that the eukaryotic pattern that is investigated is not restrictive enough to serve as a recognition site.

The landscape of actionable genomic alterations in cell-free circulating tumor DNA from 21,807 advanced cancer patients.

PubMed

Zill, Oliver A; Banks, Kimberly C; Fairclough, Stephen R; Mortimer, Stefanie; Vowles, James V; Mokhtari, Reza; Gandara, David R; Mack, Philip C; Odegaard, Justin I; Nagy, Rebecca J; Baca, Arthur M; Eltoukhy, Helmy; Chudova, Darya I; Lanman, Richard B; Talasaz, AmirAli

2018-05-18

Cell-free DNA (cfDNA) sequencing provides a non-invasive method for obtaining actionable genomic information to guide personalized cancer treatment, but the presence of multiple alterations in circulation related to treatment and tumor heterogeneity complicates the interpretation of the observed variants. Experimental Design: We describe the somatic mutation landscape of 70 cancer genes from cfDNA deep-sequencing analysis of 21,807 patients with treated, late-stage cancers across >50 cancer types. To facilitate interpretation of the genomic complexity of circulating tumor DNA in advanced, treated cancer patients, we developed methods to identify cfDNA copy-number driver alterations and cfDNA clonality. Patterns and prevalence of cfDNA alterations in major driver genes for non-small cell lung, breast, and colorectal cancer largely recapitulated those from tumor tissue sequencing compendia (TCGA and COSMIC; r=0.90-0.99), with the principal differences in alteration prevalence being due to patient treatment. This highly sensitive cfDNA sequencing assay revealed numerous subclonal tumor-derived alterations, expected as a result of clonal evolution, but leading to an apparent departure from mutual exclusivity in treatment-naïve tumors. Upon applying novel cfDNA clonality and copy-number driver identification methods, robust mutual exclusivity was observed among predicted truncal driver cfDNA alterations (FDR=5×10^-7 for EGFR and ERBB2), in effect distinguishing tumor-initiating alterations from secondary alterations. Treatment-associated resistance, including both novel alterations and parallel evolution, was common in the cfDNA cohort and was enriched in patients with targetable driver alterations (>18.6% patients). Together these retrospective analyses of a large cfDNA sequencing data set reveal subclonal structures and emerging resistance in advanced solid tumors. Copyright ©2018, American Association for Cancer Research.
Differential repetitive DNA composition in the centromeric region of chromosomes of Amazonian lizard species in the family Teiidae

PubMed Central

Carvalho, Natalia D. M.; Carmo, Edson; Neves, Rogerio O.; Schneider, Carlos Henrique; Gross, Maria Claudia

2016-01-01

Differences in heterochromatin distribution patterns and its composition were observed in Amazonian teiid species. Studies have shown repetitive DNA harbors heterochromatic blocks which are located in centromeric and telomeric regions in Ameiva ameiva (Linnaeus, 1758), Kentropyx calcarata (Spix, 1825), Kentropyx pelviceps (Cope, 1868), and Tupinambis teguixin (Linnaeus, 1758). In Cnemidophorus sp.1, repetitive DNA has multiple signals along all chromosomes. The aim of this study was to characterize moderately and highly repetitive DNA sequences by Cot1-DNA from Ameiva ameiva and Cnemidophorus sp.1 genomes through cloning and DNA sequencing, as well as mapping them chromosomally to better understand their organization and genome dynamics. The results of sequencing of DNA libraries obtained by Cot1-DNA showed that different microsatellites, transposons, retrotransposons, and some gene families also comprise the fraction of repetitive DNA in the teiid species. FISH using Cot1-DNA probes isolated from both Ameiva ameiva and Cnemidophorus sp.1 showed these sequences mainly located in heterochromatic centromeric and telomeric regions in Ameiva ameiva, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin chromosomes, indicating they play structural and functional roles in the genome of these species. In Cnemidophorus sp.1, Cot1-DNA probe isolated from Ameiva ameiva had multiple interstitial signals on chromosomes, whereas mapping of Cot1-DNA isolated from the Ameiva ameiva and Cnemidophorus sp.1 highlighted centromeric regions of some chromosomes. Thus, the data obtained showed that many repetitive DNA classes are part of the genome of Ameiva ameiva, Cnemidophorus sp.1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin, and these sequences are shared among the analyzed teiid species, but they were not always allocated at the same chromosome position. PMID:27551343
Differential repetitive DNA composition in the centromeric region of chromosomes of Amazonian lizard species in the family Teiidae.

PubMed

Carvalho, Natalia D M; Carmo, Edson; Neves, Rogerio O; Schneider, Carlos Henrique; Gross, Maria Claudia

2016-01-01

Differences in heterochromatin distribution patterns and its composition were observed in Amazonian teiid species. Studies have shown repetitive DNA harbors heterochromatic blocks which are located in centromeric and telomeric regions in Ameiva ameiva (Linnaeus, 1758), Kentropyx calcarata (Spix, 1825), Kentropyx pelviceps (Cope, 1868), and Tupinambis teguixin (Linnaeus, 1758). In Cnemidophorus sp.1, repetitive DNA has multiple signals along all chromosomes. The aim of this study was to characterize moderately and highly repetitive DNA sequences by Cot1-DNA from Ameiva ameiva and Cnemidophorus sp.1 genomes through cloning and DNA sequencing, as well as mapping them chromosomally to better understand its organization and genome dynamics. The results of sequencing of DNA libraries obtained by Cot1-DNA showed that different microsatellites, transposons, retrotransposons, and some gene families also comprise the fraction of repetitive DNA in the teiid species. FISH using Cot1-DNA probes isolated from both Ameiva ameiva and Cnemidophorus sp.1 showed these sequences mainly located in heterochromatic centromeric and telomeric regions in Ameiva ameiva, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin chromosomes, indicating they play structural and functional roles in the genome of these species. In Cnemidophorus sp.1, Cot1-DNA probe isolated from Ameiva ameiva had multiple interstitial signals on chromosomes, whereas mapping of Cot1-DNA isolated from the Ameiva ameiva and Cnemidophorus sp.1 highlighted centromeric regions of some chromosomes. Thus, the data obtained showed that many repetitive DNA classes are part of the genome of Ameiva ameiva, Cnemidophorus sp.1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin, and these sequences are shared among the analyzed teiid species, but they were not always allocated at the same chromosome position.
ngs.plot: Quick mining and visualization of next-generation sequencing data by integrating genomic databases.

PubMed

Shen, Li; Shao, Ningyi; Liu, Xiaochuan; Nestler, Eric

2014-04-15

Understanding the relationship between the millions of functional DNA elements and their protein regulators, and how they work in conjunction to manifest diverse phenotypes, is key to advancing our understanding of the mammalian genome. Next-generation sequencing technology is now used widely to probe these protein-DNA interactions and to profile gene expression at a genome-wide scale. As the cost of DNA sequencing continues to fall, the interpretation of the ever increasing amount of data generated represents a considerable challenge. We have developed ngs.plot - a standalone program to visualize enrichment patterns of DNA-interacting proteins at functionally important regions based on next-generation sequencing data. We demonstrate that ngs.plot is not only efficient but also scalable. We use a few examples to demonstrate that ngs.plot is easy to use and yet very powerful to generate figures that are publication ready. We conclude that ngs.plot is a useful tool to help fill the gap between massive datasets and genomic information in this era of big sequencing data.
ngs.plot: Quick mining and visualization of next-generation sequencing data by integrating genomic databases

PubMed Central

2014-01-01

Background Understanding the relationship between the millions of functional DNA elements and their protein regulators, and how they work in conjunction to manifest diverse phenotypes, is key to advancing our understanding of the mammalian genome. Next-generation sequencing technology is now used widely to probe these protein-DNA interactions and to profile gene expression at a genome-wide scale. As the cost of DNA sequencing continues to fall, the interpretation of the ever increasing amount of data generated represents a considerable challenge. Results We have developed ngs.plot – a standalone program to visualize enrichment patterns of DNA-interacting proteins at functionally important regions based on next-generation sequencing data. We demonstrate that ngs.plot is not only efficient but also scalable. We use a few examples to demonstrate that ngs.plot is easy to use and yet very powerful to generate figures that are publication ready. Conclusions We conclude that ngs.plot is a useful tool to help fill the gap between massive datasets and genomic information in this era of big sequencing data. PMID:24735413
The 5S rDNA in two Abracris grasshoppers (Ommatolampidinae: Acrididae): molecular and chromosomal organization.

PubMed

Bueno, Danilo; Palacios-Gimenez, Octavio Manuel; Martí, Dardo Andrea; Mariguela, Tatiane Casagrande; Cabral-de-Mello, Diogo Cavalcanti

2016-08-01

The 5S ribosomal DNA (rDNA) sequences are subject to dynamic evolution at chromosomal and molecular levels, evolving through concerted and/or birth-and-death fashion. Among grasshoppers, the chromosomal location for this sequence was established for some species, but little molecular information was obtained to infer evolutionary patterns. Here, we integrated data from chromosomal and nucleotide sequence analysis for 5S rDNA in two Abracris species aiming to identify evolutionary dynamics. For both species, two arrays were identified, a larger sequence (named type-I) that consisted of the entire 5S rDNA gene plus NTS (non-transcribed spacer) and a smaller one (named type-II) with a truncated 5S rDNA gene plus short NTS that was considered a pseudogene. For type-I sequences, the gene-corresponding region contained the internal control region and poly-T motif, and the NTS presented partial transposable elements. Between the species, nucleotide differences for type-I were noticed, while type-II was identical, suggesting pseudogenization in a common ancestor. From the chromosomal point of view, type-II was placed in one bivalent, while type-I occurred in multiple copies in distinct chromosomes. In Abracris, the evolution of 5S rDNA was apparently influenced by the chromosomal distribution of clusters (single or multiple location), resulting in a mixed mechanism integrating concerted and birth-and-death evolution depending on the unit.
Phylogenetic patterns in populations of Chilean species of the genus Orestias (Teleostei: Cyprinodontidae): results of mitochondrial DNA analysis.

PubMed

Lüssen, Arne; Falk, Thomas M; Villwock, Wolfgang

2003-10-01

Patterns of molecular genetic differentiation among taxa of the "agassii species complex" (Parenti, 1984) were analysed based on partial mtDNA control region sequences. Special attention has been paid to Chilean populations of Orestias agassii and species from isolated lakes of northern Chile, e.g., O. agassii, Orestias chungarensis, Orestias parinacotensis, Orestias laucaensis, and Orestias ascotanensis. Orestias tschudii, Orestias luteus, and Orestias ispi were analysed comparatively. Our findings support the utility of mtDNA control region sequences for phylogenetic studies within the "agassii species complex" and confirmed the monophyly of this particular lineage, excluding O. luteus. However, the monophyly of further morphologically defined lineages within the "agassii complex" appears doubtful. No support was found for the utility of these data sets for inferring phylogenetic relationships between more distantly related taxa originating from Lake Titicaca.
Optimization of cDNA-AFLP experiments using genomic sequence data.

PubMed

Kivioja, Teemu; Arvas, Mikko; Saloheimo, Markku; Penttilä, Merja; Ukkonen, Esko

2005-06-01

cDNA amplified fragment length polymorphism (cDNA-AFLP) is one of the few genome-wide level expression profiling methods capable of finding genes that have not yet been cloned or even predicted from sequence but have interesting expression patterns under the studied conditions. In cDNA-AFLP, a complex cDNA mixture is divided into small subsets using restriction enzymes and selective PCR. A large cDNA-AFLP experiment can require a substantial amount of resources, such as hundreds of PCR amplifications and gel electrophoresis runs, followed by manual cutting of a large number of bands from the gels. Our aim was to test whether this workload can be reduced by rational design of the experiment. We used the available genomic sequence information to optimize cDNA-AFLP experiments beforehand so that as many transcripts as possible could be profiled with a given amount of resources. Optimization of the selection of both restriction enzymes and selective primers for cDNA-AFLP experiments has not been performed previously. The in silico tests performed suggest that substantial amounts of resources can be saved by the optimization of cDNA-AFLP experiments.
Molecular characterization of the probiotic strain Bacillus cereus var. toyoi NCIMB 40112 and differentiation from food poisoning strains.

PubMed

Klein, Günter

2011-07-01

Bacillus cereus var. toyoi strain NCIMB 40112 (Toyocerin), a probiotic authorized in the European Union as feed additive for swine, bovines, poultry, and rabbits, was characterized by DNA fingerprinting applying pulsed-field gel electrophoresis and multilocus sequence typing and was compared with reference strains (of clinical and environmental origins). The probiotic strain was clearly characterized by pulsed-field gel electrophoresis using the restriction enzymes Apa I and Sma I resulting in unique DNA patterns. The comparison to the clinical reference strain B. cereus DSM 4312 was done with the same restriction enzymes, and again a clear differentiation of the two strains was possible by the resulting DNA patterns. The use of the restriction enzymes Apa I and Sma I is recommended for further studies. Furthermore, multilocus sequence typing analysis revealed a sequence type (ST 111) that was different from all known STs of B. cereus strains from food poisoning incidents. Thus, a strain characterization and differentiation from food poisoning strains for the probiotic strain was possible. Copyright ©, International Association for Food Protection
Global DNA methylation analysis using methyl-sensitive amplification polymorphism (MSAP).

PubMed

Yaish, Mahmoud W; Peng, Mingsheng; Rothstein, Steven J

2014-01-01

DNA methylation is a crucial epigenetic process which helps control gene transcription activity in eukaryotes. Information regarding the methylation status of a regulatory sequence of a particular gene provides important knowledge of this transcriptional control. DNA methylation can be detected using several methods, including sodium bisulfite sequencing and restriction digestion using methylation-sensitive endonucleases. Methyl-Sensitive Amplification Polymorphism (MSAP) is a technique used to study the global DNA methylation status of an organism and hence to distinguish between two individuals based on the DNA methylation status determined by the differential digestion pattern. Therefore, this technique is a useful method for DNA methylation mapping and positional cloning of differentially methylated genes. In this technique, genomic DNA is first digested with a methylation-sensitive restriction enzyme such as HpaII, and then the DNA fragments are ligated to adaptors in order to facilitate their amplification. Digestion using a methylation-insensitive isoschizomer of HpaII, MspI, is used in a parallel digestion reaction as a loading control in the experiment. Subsequently, these fragments are selectively amplified by fluorescently labeled primers. PCR products from different individuals are compared, and once an interesting polymorphic locus is recognized, the desired DNA fragment can be isolated from a denaturing polyacrylamide gel, sequenced and identified based on DNA sequence similarity to other sequences available in the database. We will use analysis of met1, ddm1, and atmbd9 mutants and wild-type plants treated with a cytidine analogue, 5-azaC, or zebularine to demonstrate how to assess the genetic modulation of DNA methylation in Arabidopsis. It should be noted that although MSAP is a reliable technique for fishing out polymorphic methylated loci, its power is limited to the restriction recognition sites of the enzymes used in the genomic DNA digestion.
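The limitation noted at the end of this abstract can be made concrete with a small sketch. It assumes the recognition sequence CCGG for HpaII and MspI (a well-known fact not stated in the abstract) and uses hypothetical helper names; it only locates candidate sites in a sequence string and estimates what share of CpG dinucleotides sit inside such a site, and does not model digestion, ligation, or amplification.

```python
import re

# Recognition sequence shared by HpaII and MspI (assumption: not given in the abstract).
RECOGNITION_SITE = "CCGG"


def recognition_sites(seq):
    """Return 0-based start positions of every CCGG site in seq."""
    return [m.start() for m in re.finditer(RECOGNITION_SITE, seq.upper())]


def cpg_fraction_in_sites(seq):
    """Fraction of CpG dinucleotides that lie inside a CCGG site.

    Each CCGG site contains exactly one CpG and sites cannot overlap,
    so the fraction is simply (number of sites) / (number of CpGs).
    """
    seq = seq.upper()
    total_cpg = seq.count("CG")
    if total_cpg == 0:
        return 0.0
    return len(recognition_sites(seq)) / total_cpg


example = "ATCCGGTACGTTCCGGACGA"  # made-up sequence for illustration
print(recognition_sites(example), cpg_fraction_in_sites(example))
```

CpGs outside the recognition sites are invisible to a digestion-based assay of this kind, which is the restriction the abstract points out.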
A phylogenetic study of Laeliinae (Orchidaceae) based on combined nuclear and plastid DNA sequences

PubMed Central

van den Berg, Cássio; Higgins, Wesley E.; Dressler, Robert L.; Whitten, W. Mark; Soto-Arenas, Miguel A.; Chase, Mark W.

2009-01-01

Background and Aims Laeliinae are a neotropical orchid subtribe with approx. 1500 species in 50 genera. In this study, an attempt is made to assess generic alliances based on molecular phylogenetic analysis of DNA sequence data. Methods Six DNA datasets were gathered: plastid trnL intron, trnL-F spacer, matK gene and trnK introns upstream and downstream from matK and nuclear ITS rDNA. Data were analysed with maximum parsimony (MP) and Bayesian analysis with mixed models (BA). Key Results Although relationships between Laeliinae and outgroups are well supported, within the subtribe sequence variation is low considering the broad taxonomic range covered. Localized incongruence between the ITS and plastid trees was found. A combined tree followed the ITS trees more closely, but the levels of support obtained with MP were low. The Bayesian analysis recovered more well-supported nodes. The trees from combined MP and BA allowed eight generic alliances to be recognized within Laeliinae, all of which show trends in morphological characters but lack unambiguous synapomorphies. Conclusions By using combined plastid and nuclear DNA data in conjunction with mixed-models Bayesian inference, it is possible to delimit smaller groups within Laeliinae and discuss general patterns of pollination and hybridization compatibility. Furthermore, these small groups can now be used for further detailed studies to explain morphological evolution and diversification patterns within the subtribe. PMID:19423551
Determining geographical spread pattern of MERS-CoV by distance method using Kimura model

NASA Astrophysics Data System (ADS)

Amiroch, Siti; Rohmatullah, Arif

2017-03-01

Middle East Respiratory Syndrome Coronavirus (MERS-CoV) causes a respiratory syndrome that attacks the respiratory tract, with presentations ranging from mild to severe acute illness marked by fever, cough and shortness of breath. The cases are linked to countries in the Arabian Peninsula (Middle East), and 356 deaths have been reported due to the spread of the MERS epidemic. The data used for MERS are DNA sequences taken from GenBank, the online database in the United States that stores the results of molecular biology experiments from all over the world (http://www.ncbi.nlm.nih.gov). Here bioinformatics plays an important role in reading DNA sequences and genetic information, using software supported by the availability of the Internet, while the analysis is carried out and verified with mathematical methods. In similar research conducted by molecular biologists and physicians, DNA sequence analysis is done with existing software such as BLAST, and the geographical distribution pattern of MERS in the Arabian Peninsula is determined with programs such as Clustal W, Bayesian methods, Phylip, etc. In this study, the authors use Matlab simulation for all steps, from sequence alignment, through counting the transition and transversion substitutions for each sequence and its location, to constructing a phylogenetic tree that describes the pattern of spread of the MERS epidemic. The mathematical analysis consists of deriving the formula for the Kimura evolutionary model and constructing the phylogenetic tree (the pattern of MERS epidemic distribution) with the neighbor-joining algorithm. Finally, the resulting pattern of geographical spread comprises 6 groups of the MERS epidemic, and it turns out that nearly all the MERS viruses that spread in the Arabian Peninsula are almost the same as the virus sequence found in al-Hasa.
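The counting of transitions and transversions and the Kimura distance mentioned in this abstract can be sketched as follows. The study itself used Matlab; this Python version is only an illustration, and it assumes the Kimura two-parameter model, d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitions and transversions among compared sites. The function names are hypothetical, and the neighbor-joining step is not shown.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def count_substitutions(a, b):
    """Count transitions, transversions and compared sites for two aligned sequences.

    Columns containing a gap or an ambiguous base in either sequence are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = sites = 0
    for x, y in zip(a.upper(), b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue
        sites += 1
        if x == y:
            continue
        pair = {x, y}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1  # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    return transitions, transversions, sites


def kimura_distance(a, b):
    """Kimura two-parameter distance between two aligned sequences."""
    ts, tv, n = count_substitutions(a, b)
    if n == 0:
        raise ValueError("no comparable sites")
    p, q = ts / n, tv / n
    # math.log raises ValueError when the sequences are too divergent
    # for the model (argument not positive).
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


print(kimura_distance("AAAAAAAAAA", "GAAAAAAAAC"))
```

A matrix of such pairwise distances is the usual input to a neighbor-joining tree builder.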
Inferring patterns in mitochondrial DNA sequences through hypercube independent spanning trees.

PubMed

Silva, Eduardo Sant Ana da; Pedrini, Helio

2016-03-01

Given a graph G, a set of spanning trees rooted at a vertex r of G is said to be vertex/edge independent if, for each vertex v of G, v≠r, the paths from r to v in any pair of trees are vertex/edge disjoint. Independent spanning trees (ISTs) provide a number of advantages in data broadcasting due to their fault-tolerant properties. For this reason, some studies have addressed the issue by providing mechanisms for constructing independent spanning trees efficiently. In this work, we investigate how to construct independent spanning trees on hypercubes, which are generated based upon spanning binomial trees, and how to use them to predict mitochondrial DNA sequence parts through paths on the hypercube. The prediction works both for inferring mitochondrial DNA sequences comprised of six bases and for inferring anomalies that probably should not belong to the mitochondrial DNA standard. Copyright © 2016 Elsevier Ltd. All rights reserved.
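
As a rough illustration of the building block named in this abstract, the sketch below constructs a single spanning binomial tree of an n-dimensional hypercube rooted at vertex 0. It does not reproduce the authors' construction of independent spanning trees or their DNA encoding; the rule "the parent of a vertex is obtained by clearing its lowest set bit" is one standard way to obtain a binomial tree, and the function names are hypothetical.

```python
def binomial_tree_parent(v):
    """Parent of vertex v in the binomial spanning tree rooted at 0 (None for the root)."""
    if v == 0:
        return None
    return v & (v - 1)  # clear the lowest set bit


def path_from_root(v):
    """Vertices on the tree path from the root 0 to v, in order."""
    path = [v]
    while v != 0:
        v = v & (v - 1)
        path.append(v)
    return path[::-1]


def spanning_tree_edges(n):
    """All (parent, child) edges of the binomial spanning tree of the n-cube."""
    return [(v & (v - 1), v) for v in range(1, 1 << n)]
```

Every edge joins two vertices that differ in exactly one bit, so each tree edge is also a hypercube edge, and the path from the root to v has as many edges as v has set bits.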
The Organization of Repetitive DNA in the Genomes of Amazonian Lizard Species in the Family Teiidae.

PubMed

Carvalho, Natalia D M; Pinheiro, Vanessa S S; Carmo, Edson J; Goll, Leonardo G; Schneider, Carlos H; Gross, Maria C

2015-01-01

Repetitive DNA is the largest fraction of the eukaryote genome and comprises tandem and dispersed sequences. It presents variations in relation to its composition, number of copies, distribution, dynamics, and genome organization, and participates in the evolutionary diversification of different vertebrate species. Repetitive sequences are usually located in the heterochromatin of centromeric and telomeric regions of chromosomes, contributing to chromosomal structures. Therefore, the aim of this study was to physically map repetitive DNA sequences (5S rDNA, telomeric sequences, tropomyosin gene 1, and retroelements Rex1 and SINE) of mitotic chromosomes of Amazonian species of teiids (Ameiva ameiva, Cnemidophorus sp. 1, Kentropyx calcarata, Kentropyx pelviceps, and Tupinambis teguixin) to understand their genome organization and karyotype evolution. The mapping of repetitive sequences revealed a distinct pattern in Cnemidophorus sp. 1, whereas the other species showed all sequences interspersed in the heterochromatic region. Physical mapping of the tropomyosin 1 gene was performed for the first time in lizards and showed that in addition to being functional, this gene has a structural function similar to the mapped repetitive elements as it is located preferentially in centromeric regions and termini of chromosomes. © 2016 S. Karger AG, Basel.
Transcriptional dynamics of the developing sweet cherry (Prunus avium L.) fruit: sequencing, annotation and expression profiling of exocarp-associated genes

PubMed Central

Alkio, Merianne; Jonas, Uwe; Declercq, Myriam; Van Nocker, Steven; Knoche, Moritz

2014-01-01

The exocarp, or skin, of fleshy fruit is a specialized tissue that protects the fruit, attracts seed-dispersing fruit eaters, and has large economic relevance for fruit quality. Development of the exocarp involves regulated activities of many genes. This research analyzed global gene expression in the exocarp of developing sweet cherry (Prunus avium L., ‘Regina’), a fruit crop species with few public genomic resources. A catalog of transcript models (contigs) representing expressed genes was constructed from de novo assembled short complementary DNA (cDNA) sequences generated from developing fruit between flowering and maturity at 14 time points. Expression levels in each sample were estimated for 34 695 contigs from numbers of reads mapping to each contig. Contigs were annotated functionally based on BLAST, gene ontology and InterProScan analyses. Coregulated genes were detected using partitional clustering of expression patterns. The results are discussed with emphasis on genes putatively involved in cuticle deposition, cell wall metabolism and sugar transport. The high temporal resolution of the expression patterns presented here reveals finely tuned developmental specialization of individual members of gene families. Moreover, the de novo assembled sweet cherry fruit transcriptome with 7760 full-length protein coding sequences and over 20 000 other, annotated cDNA sequences together with their developmental expression patterns is expected to accelerate molecular research on this important tree fruit crop. PMID:26504533
Association of Tissue-Specific DNA Methylation Alterations with α-Thalassemia Southeast Asian Deletion

PubMed Central

Pangeson, Tanapat; Sanguansermsri, Phanchana; Sanguansermsri, Torpong; Seeratanachot, Teerapat; Suwanakhon, Narutchala; Srikummool, Metawee; Kaewkong, Worasak; Mahingsa, Khwanruedee

2017-01-01

In the wild-type allele, DNA methylation levels of 10 consecutive CpG sites adjacent to the upstream 5′-breakpoint of α-thalassemia Southeast Asian (SEA) deletion are not different between placenta and leukocytes. However, no previous study has reported the map of DNA methylation in the SEA allele. This report aims to show that the SEA mutation is associated with DNA methylation changes, resulting in differential methylation between placenta and leukocytes. Methylation-sensitive high-resolution analysis was used to compare DNA methylation among placenta, leukocytes, and unmethylated control DNA. The result indicates that the DNA methylation between placenta and leukocyte DNA is different and shows that the CpG status of both is not fully unmethylated. Mapping of individual CpG sites was performed by targeted bisulfite sequencing. The DNA methylation level of the 10 consecutive CpG sites was different between placenta and leukocyte DNA. When the 10th CpG of the mutation allele was considered as a hallmark for comparing DNA methylation level, it was totally different from the unmethylated 10th CpG of the wild-type allele. Finally, the distinct DNA methylation patterns between both DNA were extracted. In total, 24 patterns were found in leukocyte samples and 9 patterns were found in placenta samples. This report shows that the large deletion is associated with DNA methylation change. In further studies for clinical application, the distinct DNA methylation pattern might be a potential marker for detecting cell-free fetal DNA. PMID:29162979
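
The per-CpG mapping and pattern counting described in this abstract can be sketched as follows. This is a simplified illustration, not the authors' pipeline: it assumes each bisulfite read is already aligned to the reference on the same strand and without gaps, so that a retained C at a CpG is read as methylated and a T as unmethylated. The function names and the M/U/? encoding are hypothetical.

```python
from collections import Counter


def cpg_positions(reference):
    """0-based positions of the C in every CpG dinucleotide of the reference."""
    ref = reference.upper()
    return [i for i in range(len(ref) - 1) if ref[i:i + 2] == "CG"]


def methylation_pattern(reference, read):
    """One call per CpG: 'M' (C retained), 'U' (converted to T), '?' (anything else)."""
    if len(reference) != len(read):
        raise ValueError("read must be aligned to the reference without gaps")
    read = read.upper()
    calls = []
    for pos in cpg_positions(reference):
        base = read[pos]
        calls.append("M" if base == "C" else "U" if base == "T" else "?")
    return "".join(calls)


def count_patterns(reference, reads):
    """Count how often each distinct methylation pattern occurs among the reads."""
    return Counter(methylation_pattern(reference, r) for r in reads)
```

The number of keys in the returned counter corresponds to the number of distinct methylation patterns in a sample.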
DNA-Demethylase Regulated Genes Show Methylation-Independent Spatiotemporal Expression Patterns

PubMed Central

Schumann, Ulrike; Lee, Joanne; Kazan, Kemal; Ayliffe, Michael; Wang, Ming-Bo

2017-01-01

Recent research has indicated that a subset of defense-related genes is downregulated in the Arabidopsis DNA demethylase triple mutant rdd (ros1 dml2 dml3) resulting in increased susceptibility to the fungal pathogen Fusarium oxysporum. In rdd plants these downregulated genes contain hypermethylated transposable element sequences (TE) in their promoters, suggesting that this methylation represses gene expression in the mutant and that these sequences are actively demethylated in wild-type plants to maintain gene expression. In this study, the tissue-specific and pathogen-inducible expression patterns of rdd-downregulated genes were investigated and the individual role of ROS1, DML2, and DML3 demethylases in these spatiotemporal regulation patterns was determined. Large differences in defense gene expression were observed between pathogen-infected and uninfected tissues and between root and shoot tissues in both WT and rdd plants; however, only subtle changes in promoter TE methylation patterns occurred. Therefore, while TE hypermethylation caused decreased gene expression in rdd plants, it did not dramatically affect spatiotemporal gene regulation, suggesting that this latter regulation is largely methylation independent. Analysis of ros1-3, dml2-1, and dml3-1 single gene mutant lines showed that promoter TE hypermethylation and defense-related gene repression was predominantly, but not exclusively, due to loss of ROS1 activity. These data demonstrate that DNA demethylation of TE sequences, largely by ROS1, promotes defense-related gene expression but does not control spatiotemporal expression in Arabidopsis. Summary: Ros1-mediated DNA demethylation of promoter transposable elements is essential for activation of defense-related gene expression in response to fungal infection in Arabidopsis thaliana. PMID:28894455
Rhizobium favelukesii sp. nov., isolated from the root nodules of alfalfa (Medicago sativa L).

PubMed

Torres Tejerizo, Gonzalo; Rogel, Marco Antonio; Ormeño-Orrillo, Ernesto; Althabegoiti, María Julia; Nilsson, Juliet Fernanda; Niehaus, Karsten; Schlüter, Andreas; Pühler, Alfred; Del Papa, María Florencia; Lagares, Antonio; Martínez-Romero, Esperanza; Pistorio, Mariano

2016-11-01

Strains LPU83T and Or191 of the genus Rhizobium were isolated from the root nodules of alfalfa, grown in acid soils from Argentina and the USA. These two strains, which shared the same plasmid pattern, lipopolysaccharide profile, insertion-sequence fingerprint, 16S rRNA gene sequence and PCR-fingerprinting pattern, were different from reference strains representing species of the genus Rhizobium with validly published names. On the basis of previously reported data and from new DNA-DNA hybridization results, phenotypic characterization and phylogenetic analyses, strains LPU83T and Or191 can be considered to be representatives of a novel species of the genus Rhizobium, for which the name Rhizobium favelukesii sp. nov. is proposed. The type strain of this species is LPU83T (=CECT 9014T=LMG 29160T), for which an improved draft-genome sequence is available.
Identifying active foraminifera in the Sea of Japan using metatranscriptomic approach

NASA Astrophysics Data System (ADS)

Lejzerowicz, Franck; Voltsky, Ivan; Pawlowski, Jan

2013-02-01

Metagenetics represents an efficient and rapid tool to describe environmental diversity patterns of microbial eukaryotes based on ribosomal DNA sequences. However, the results of metagenetic studies are often biased by the presence of extracellular DNA molecules that are persistent in the environment, especially in deep-sea sediment. As an alternative, short-lived RNA molecules constitute a good proxy for the detection of active species. Here, we used a metatranscriptomic approach based on RNA-derived (cDNA) sequences to study the diversity of the deep-sea benthic foraminifera and compared it to the metagenetic approach. We analyzed 257 ribosomal DNA and cDNA sequences obtained from seven sediment samples collected in the Sea of Japan at depths ranging from 486 to 3665 m. The DNA and RNA-based approaches gave a similar view of the taxonomic composition of the foraminiferal assemblage, but differed in some important points. First, the cDNA dataset was dominated by sequences of rotaliids and robertiniids, suggesting that these calcareous species, some of which have been observed in Rose Bengal stained samples, are the most active component of the foraminiferal community. Second, the richness of monothalamous (single-chambered) foraminifera was particularly high in DNA extracts from the deepest samples, confirming that this group of foraminifera is abundant but not necessarily very active in the deep-sea sediments. Finally, the high divergence of undetermined sequences in the cDNA dataset indicates the limits of our database and lack of knowledge about some active but possibly rare species. Our study demonstrates the capability of the metatranscriptomic approach to detect active foraminiferal species and prompts its use in future high-throughput sequencing-based environmental surveys.
Genetic variation among the Mapuche Indians from the Patagonian region of Argentina: mitochondrial DNA sequence variation and allele frequencies of several nuclear genes.

PubMed

Ginther, C; Corach, D; Penacino, G A; Rey, J A; Carnese, F R; Hutz, M H; Anderson, A; Just, J; Salzano, F M; King, M C

1993-01-01

DNA samples from 60 Mapuche Indians, representing 39 maternal lineages, were genetically characterized for (1) nucleotide sequences of the mtDNA control region; (2) presence or absence of a nine base duplication in mtDNA region V; (3) HLA loci DRB1 and DQA1; (4) variation at three nuclear genes with short tandem repeats; and (5) variation at the polymorphic marker D2S44. The genetic profile of the Mapuche population was compared to other Amerinds and to worldwide populations. Two highly polymorphic portions of the mtDNA control region, comprising 650 nucleotides, were amplified by the polymerase chain reaction (PCR) and directly sequenced. The 39 maternal lineages were defined by two or three generation families identified by the Mapuches. These 39 lineages included 19 different mtDNA sequences that could be grouped into four classes. The same classes of sequences appear in other Amerinds from North, Central, and South American populations separated by thousands of miles, suggesting that the origin of the mtDNA patterns predates the migration to the Americas. The mtDNA sequence similarity between Amerind populations suggests that the migration throughout the Americas occurred rapidly relative to the mtDNA mutation rate. HLA DRB1 alleles 1602 and 1402 were frequent among the Mapuches. These alleles also occur at high frequency among other Amerinds in North and South America, but not among Spanish, Chinese or African-American populations. The high frequency of these alleles throughout the Americas, and their specificity to the Americas, supports the hypothesis that Mapuches and other Amerind groups are closely related.(ABSTRACT TRUNCATED AT 250 WORDS)

Some maternal lineages of domestic horses may have origins in East Asia revealed with further evidence of mitochondrial genomes and HVR-1 sequences.

PubMed

Ma, Hongying; Wu, Yajiang; Xiang, Hai; Yang, Yunzhou; Wang, Min; Zhao, Chunjiang; Wu, Changxin

2018-01-01

There are large populations of indigenous horses (Equus caballus) in China and some other parts of East Asia. However, their matrilineal genetic diversity and origin remain poorly understood. Using a combination of mitochondrial DNA (mtDNA) and hypervariable region (HVR-1) sequences, we aim to investigate the origin of matrilineal inheritance in these domestic horses. To investigate patterns of matrilineal inheritance in domestic horses, we conducted a phylogenetic study using 31 de novo mtDNA genomes together with 317 others from GenBank. In terms of the updated phylogeny, a total of 5,180 horse mitochondrial HVR-1 sequences were analyzed. Eighteen haplogroups (Aw-Rw) were uncovered from the analysis of the whole mitochondrial genomes. Most of them have a divergence time before the earliest domestication of wild horses (about 5,800 years ago) and during the Upper Paleolithic (35-10 KYA). The distribution of some haplogroups shows geographic patterns. The Lw haplogroup contained a significantly higher proportion of European horses than the horses from other regions, while haplogroups Jw, Rw, and some maternal lineages of Cw have a higher frequency in the horses from East Asia. The 5,180 sequences of horse mitochondrial HVR-1 form nine major haplogroups (A-I). We revealed a corresponding relationship between the haplotypes of HVR-1 and those of whole mitochondrial DNA sequences. The data of the HVR-1 sequences also suggest that Jw, Rw, and some haplotypes of Cw may have originated in East Asia while Lw probably formed in Europe. Our study supports the hypothesis of multiple origins of the maternal lineage of domestic horses and suggests that some maternal lineages of domestic horses may have originated from East Asia.
Hybridization and massive mtDNA unidirectional introgression between the closely related Neotropical toads Rhinella marina and R. schneideri inferred from mtDNA and nuclear markers

PubMed Central

2011-01-01

Background The classical perspective that interspecific hybridization in animals is rare has been changing due to a growing list of empirical examples showing the occurrence of gene flow between closely related species. Using sequence data from cyt b mitochondrial gene and three intron nuclear genes (RPL9, c-myc, and RPL3) we investigated patterns of nucleotide polymorphism and divergence between two closely related toad species R. marina and R. schneideri. By comparing levels of differentiation at nuclear and mtDNA levels we were able to describe patterns of introgression and infer the history of hybridization between these species. Results All nuclear loci are essentially concordant in revealing two well differentiated groups of haplotypes, corresponding to the morphologically-defined species R. marina and R. schneideri. Mitochondrial DNA analysis also revealed two well-differentiated groups of haplotypes but, in stark contrast with the nuclear genealogies, all R. schneideri sequences are clustered with sequences of R. marina from the right Amazon bank (RAB), while R. marina sequences from the left Amazon bank (LAB) are monophyletic. An Isolation-with-Migration (IM) analysis using nuclear data showed that R. marina and R. schneideri diverged at ≈ 1.69 Myr (early Pleistocene), while R. marina populations from LAB and RAB diverged at ≈ 0.33 Myr (middle Pleistocene). This time of divergence is not consistent with the split between LAB and RAB populations obtained with mtDNA data (≈ 1.59 Myr), which is notably similar to the estimate obtained with nuclear genes between R. marina and R. schneideri. Coalescent simulations of mtDNA phylogeny under the speciation history inferred from nuclear genes rejected the hypothesis of incomplete lineage sorting to explain the conflicting signal between mtDNA and nuclear-based phylogenies. 
Conclusions The cytonuclear discordance seems to reflect the occurrence of interspecific hybridization between these two closely related toad species. Overall, our results suggest a phenomenon of extensive mtDNA unidirectional introgression from the previously occurring R. schneideri into the invading R. marina. We hypothesize that climatic-induced range shifts during the Pleistocene/Holocene may have played an important role in the observed patterns of introgression. PMID:21939538
Transcription blockage by stable H-DNA analogs in vitro.

PubMed

Pandey, Shristi; Ogloblina, Anna M; Belotserkovskii, Boris P; Dolinnaya, Nina G; Yakubovskaya, Marianna G; Mirkin, Sergei M; Hanawalt, Philip C

2015-08-18

DNA sequences that can form unusual secondary structures are implicated in regulating gene expression and causing genomic instability. H-palindromes are an important class of such DNA sequences that can form an intramolecular triplex structure, H-DNA. Within an H-palindrome, the H-DNA and canonical B-DNA are in a dynamic equilibrium that shifts toward H-DNA with increased negative supercoiling. The interplay between H- and B-DNA and the fact that the process of transcription affects supercoiling makes it difficult to elucidate the effects of H-DNA upon transcription. We constructed a stable structural analog of H-DNA that cannot flip into B-DNA, and studied the effects of this structure on transcription by T7 RNA polymerase in vitro. We found multiple transcription blockage sites adjacent to and within sequences engaged in this triplex structure. Triplex-mediated transcription blockage varied significantly with changes in ambient conditions: it was exacerbated in the presence of Mn(2+) or by increased concentrations of K(+) and Li(+). Analysis of the detailed pattern of the blockage suggests that RNA polymerase is sterically hindered by H-DNA and has difficulties in unwinding triplex DNA. The implications of these findings for the biological roles of triple-stranded DNA structures are discussed. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Quantification of DNA cleavage specificity in Hi-C experiments.

PubMed

Meluzzi, Dario; Arya, Gaurav

2016-01-08

Hi-C experiments produce large numbers of DNA sequence read pairs that are typically analyzed to deduce genomewide interactions between arbitrary loci. A key step in these experiments is the cleavage of cross-linked chromatin with a restriction endonuclease. Although this cleavage should happen specifically at the enzyme's recognition sequence, an unknown proportion of cleavage events may involve other sequences, owing to the enzyme's star activity or to random DNA breakage. A quantitative estimation of these non-specific cleavages may enable simulating realistic Hi-C read pairs for validation of downstream analyses, monitoring the reproducibility of experimental conditions and investigating biophysical properties that correlate with DNA cleavage patterns. Here we describe a computational method for analyzing Hi-C read pairs to estimate the fractions of cleavages at different possible targets. The method relies on expressing an observed local target distribution downstream of aligned reads as a linear combination of known conditional local target distributions. We validated this method using Hi-C read pairs obtained by computer simulation. Application of the method to experimental Hi-C datasets from murine cells revealed interesting similarities and differences in patterns of cleavage across the various experiments considered. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
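
The idea of expressing an observed distribution as a linear combination of known distributions can be illustrated with a deliberately reduced case. The sketch below assumes only two known component distributions (for example, "specific" and "non-specific" cleavage), whereas the abstract's method handles several possible targets. It solves for the single mixing fraction by least squares. The function name and the two-component restriction are assumptions made for illustration.

```python
def estimate_mixture_fraction(observed, dist_a, dist_b):
    """Least-squares f in observed ~ f * dist_a + (1 - f) * dist_b, clipped to [0, 1]."""
    if not (len(observed) == len(dist_a) == len(dist_b)):
        raise ValueError("all distributions must have the same number of bins")
    # Minimizing sum((o - b - f * (a - b))^2) over f gives the closed form below.
    numerator = sum((o - b) * (a - b) for o, a, b in zip(observed, dist_a, dist_b))
    denominator = sum((a - b) ** 2 for a, b in zip(dist_a, dist_b))
    if denominator == 0:
        raise ValueError("component distributions are identical")
    return min(1.0, max(0.0, numerator / denominator))
```

With more than two components the same idea becomes a constrained linear least-squares problem over a vector of fractions.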
Rapid in silico cloning of genes using expressed sequence tags (ESTs).

PubMed

Gill, R W; Sanseau, P

2000-01-01

Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs are the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathological states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we will review the current status of EST databases, the pros and cons of EST-type data and describe possible strategies to retrieve meaningful information.
Accelerated construction of a regional DNA-barcode reference library: Caddisflies (Trichoptera) in the Great Smoky Mountains National Park

USGS Publications Warehouse

Zhou, X.; Robinson, J.L.; Geraci, C.J.; Parker, C.R.; Flint, O.S.; Etnier, D.A.; Ruiter, D.; DeWalt, R.E.; Jacobus, L.M.; Hebert, P.D.N.

2011-01-01

Deoxyribonucleic acid (DNA) barcoding is an effective tool for species identification and lifestage association in a wide range of animal taxa. We developed a strategy for rapid construction of a regional DNA-barcode reference library and used the caddisflies (Trichoptera) of the Great Smoky Mountains National Park (GSMNP) as a model. Nearly 1000 cytochrome c oxidase subunit I (COI) sequences, representing 209 caddisfly species previously recorded from GSMNP, were obtained from the global Trichoptera Barcode of Life campaign. Most of these sequences were collected from outside the GSMNP area. Another 645 COI sequences, representing 80 species, were obtained from specimens collected in a 3-d bioblitz (short-term, intense sampling program) in GSMNP. The joint collections provided barcode coverage for 212 species, 91% of the GSMNP fauna. Inclusion of samples from other localities greatly expedited construction of the regional DNA-barcode reference library. This strategy increased intraspecific divergence and decreased average distances to nearest neighboring species, but the DNA-barcode library was able to differentiate 93% of the GSMNP Trichoptera species examined. Global barcoding projects will aid construction of regional DNA-barcode libraries, but local surveys make crucial contributions to progress by contributing rare or endemic species and full-length barcodes generated from high-quality DNA. DNA taxonomy is not a goal of our present work, but the investigation of COI divergence patterns in caddisflies is providing new insights into broader biodiversity patterns in this group and has directed attention to various issues, ranging from the need to re-evaluate species taxonomy with integrated morphological and molecular evidence to the necessity of an appropriate interpretation of barcode analyses and its implications in understanding species diversity (in contrast to a simple claim for barcoding failure).
Molecular evolution of the leptin exon 3 in some species of the family Canidae.

PubMed

Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek

2003-01-01

The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris)--16 different breeds, the Chinese raccoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) was studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical.
Design Pattern Mining Using Distributed Learning Automata and DNA Sequence Alignment

PubMed Central

Esmaeilpour, Mansour; Naderifar, Vahideh; Shukur, Zarina

2014-01-01

Context Over the last decade, design patterns have been used extensively to generate reusable solutions to frequently encountered problems in software engineering and object oriented programming. A design pattern is a repeatable software design solution that provides a template for solving various instances of a general problem. Objective This paper describes a new method for pattern mining that isolates design patterns and the relationships between them, and a related tool, DLA-DNA, applied to all implemented patterns and all projects used for evaluation. DLA-DNA, which is based on distributed learning automata (DLA) and deoxyribonucleic acid (DNA) sequence alignment, achieves acceptable precision and recall compared with the other evaluated tools. Method The proposed method mines structural design patterns in object oriented source code and extracts the strong and weak relationships between them, enabling analyzers and programmers to determine the dependency rate of each object, component, and other section of the code for parameter passing and modular programming. The proposed model can detect design patterns, and the strengths of their relationships, better than the other available tools, namely Pinot, PTIDEJ and DPJF. Results The results demonstrate that whether the source code is built in a standard or non-standard way with respect to the design patterns, the result of the proposed method is close to that of DPJF and better than those of Pinot and PTIDEJ. The proposed model was tested on several source codes and compared with other related models and available tools; the results show that the precision and recall of the proposed method are, on average, 20% and 9.6% higher than Pinot, 27% and 31% higher than PTIDEJ, and 3.3% and 2% higher than DPJF, respectively. Conclusion The primary idea of the proposed method is organized in two steps: in the first step, elemental design patterns are identified, while in the second step, they are composed to recognize actual design patterns. PMID:25243670
Methylation Pattern of Radish (Raphanus sativus) Nuclear Ribosomal RNA Genes

PubMed Central

Delseny, Michel; Laroche, Monique; Penon, Paul

1984-01-01

The methylation pattern of radish Raphanus sativus nuclear rDNA has been investigated using the Hpa II, Msp I, and Hha I restriction enzymes. The presence of numerous target sites for these enzymes has been shown using cloned rDNA fragments. A large fraction of the numerous rDNA units are heavily methylated, being completely resistant to Hpa II and Hha I. However, specific sites are constantly available in another fraction of the units and are therefore unmethylated. The use of different probes allowed us to demonstrate that hypomethylated sites are present in different regions. Major hypomethylated Hha I sites have been mapped in the 5′ portion of 25S rRNA coding sequence. Among the hypomethylated fraction, different methylation patterns coexist. It has been possible to demonstrate that methylation patterns are specific for particular units. The Hha I pattern of rDNA in tissues of different developmental stages was analyzed. Evidence for possible tissue specific differences in the methylation pattern is reported. PMID:16663896
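
To make the enzyme logic in this abstract concrete, the sketch below locates candidate target sites in a sequence. It relies on the commonly documented recognition sequences (CCGG for both Hpa II and Msp I, GCGC for Hha I). The point of the isoschizomer pair is that Hpa II is blocked by methylation of the internal cytosine while Msp I is not, so a site cut by Msp I but resistant to Hpa II is inferred to be methylated. The function name and the example sequence are hypothetical.

```python
import re

# Recognition sequences; HpaII and MspI share a site but differ in methylation sensitivity.
RECOGNITION_SITES = {"HpaII": "CCGG", "MspI": "CCGG", "HhaI": "GCGC"}


def find_sites(sequence, enzyme):
    """0-based start positions of every recognition site of the enzyme (overlaps allowed)."""
    site = RECOGNITION_SITES[enzyme]
    # A lookahead matches without consuming characters, so overlapping sites are reported.
    return [m.start() for m in re.finditer(f"(?={site})", sequence.upper())]
```

For instance, `find_sites("AACCGGTTGCGCCCGG", "HpaII")` reports the two CCGG sites of that made-up sequence.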
High resolution optical DNA mapping

NASA Astrophysics Data System (ADS)

Baday, Murat

Many types of diseases, including cancer and autism, are associated with copy-number variations in the genome. Most of these variations could not be identified with existing sequencing and optical DNA mapping methods. We have developed a multi-color super-resolution technique, with potential for high throughput and low cost, which can allow us to recognize more of these variations. Our technique has made a 10-fold improvement in the resolution of optical DNA mapping. Using a 180 kb BAC clone as a model system, we resolved dense patterns from 108 fluorescent labels of two different colors representing two different sequence motifs. Overall, a detailed DNA map with 100 bp resolution was achieved, which has the potential to reveal detailed information about genetic variance and to facilitate medical diagnosis of genetic disease.
Things fall apart: biological species form unconnected parsimony networks.

PubMed

Hart, Michael W; Sunday, Jennifer

2007-10-22

The generality of operational species definitions is limited by problematic definitions of between-species divergence. A recent phylogenetic species concept based on a simple objective measure of statistically significant genetic differentiation uses between-species application of statistical parsimony networks that are typically used for population genetic analysis within species. Here we review recent phylogeographic studies and reanalyse several mtDNA barcoding studies using this method. We found that (i) alignments of DNA sequences typically fall apart into a separate subnetwork for each Linnean species (but with a higher rate of true positives for mtDNA data) and (ii) DNA sequences from single species typically stick together in a single haplotype network. Departures from these patterns are usually consistent with hybridization or cryptic species diversity.
BiRen: predicting enhancers with a deep-learning-based model using the DNA sequence alone.

PubMed

Yang, Bite; Liu, Feng; Ren, Chao; Ouyang, Zhangyi; Xie, Ziwei; Bo, Xiaochen; Shu, Wenjie

2017-07-01

Enhancer elements are noncoding stretches of DNA that play key roles in controlling gene expression programmes. Despite major efforts to develop accurate enhancer prediction methods, identifying enhancer sequences continues to be a challenge in the annotation of mammalian genomes. One of the major issues is the lack of large, sufficiently comprehensive and experimentally validated enhancers for humans or other species. Thus, the development of computational methods based on limited experimentally validated enhancers and deciphering the transcriptional regulatory code encoded in the enhancer sequences is urgent. We present a deep-learning-based hybrid architecture, BiRen, which predicts enhancers using the DNA sequence alone. Our results demonstrate that BiRen can learn common enhancer patterns directly from the DNA sequence and exhibits superior accuracy, robustness and generalizability in enhancer prediction relative to other state-of-the-art enhancer predictors based on sequence characteristics. BiRen will enable researchers to acquire a deeper understanding of the regulatory code of enhancer sequences. The BiRen method can be freely accessed at https://github.com/wenjiegroup/BiRen. Contact: shuwj@bmi.ac.cn or boxc@bmi.ac.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Many human accelerated regions are developmental enhancers

PubMed Central

Capra, John A.; Erwin, Genevieve D.; McKinsey, Gabriel; Rubenstein, John L. R.; Pollard, Katherine S.

2013-01-01

The genetic changes underlying the dramatic differences in form and function between humans and other primates are largely unknown, although it is clear that gene regulatory changes play an important role. To identify regulatory sequences with potentially human-specific functions, we and others used comparative genomics to find non-coding regions conserved across mammals that have acquired many sequence changes in humans since divergence from chimpanzees. These regions are good candidates for performing human-specific regulatory functions. Here, we analysed the DNA sequence, evolutionary history, histone modifications, chromatin state and transcription factor (TF) binding sites of a combined set of 2649 non-coding human accelerated regions (ncHARs) and predicted that at least 30% of them function as developmental enhancers. We prioritized the predicted ncHAR enhancers using analysis of TF binding site gain and loss, along with the functional annotations and expression patterns of nearby genes. We then tested both the human and chimpanzee sequence for 29 ncHARs in transgenic mice, and found 24 novel developmental enhancers active in both species, 17 of which had very consistent patterns of activity in specific embryonic tissues. Of these ncHAR enhancers, five drove expression patterns suggestive of different activity for the human and chimpanzee sequence at embryonic day 11.5. The changes to human non-coding DNA in these ncHAR enhancers may modify the complex patterns of gene expression necessary for proper development in a human-specific manner and are thus promising candidates for understanding the genetic basis of human-specific biology. PMID:24218637
Identification of full-length proviral DNA of porcine endogenous retrovirus from Chinese Wuzhishan miniature pigs inbred.

PubMed

Ma, Yuyuan; Lv, Maomin; Xu, Shu; Wu, Jianmin; Tian, Kegong; Zhang, Jingang

2010-07-01

The existence of porcine endogenous retrovirus (PERV) hinders the use of pigs in clinical xenotransplantation to alleviate the shortage of human transplants. Chinese miniature pigs are potential organ donors for xenotransplantation in China. However, adequate information on the molecular characteristics of PERV from Chinese miniature pigs has so far not been available. Here we describe the cloning and characterization of full-length proviral DNA of PERV from inbred Chinese Wuzhishan miniature pigs (WZSP). Full-length nucleotide sequences of PERV-WZSP and other PERVs were aligned, and a phylogenetic tree was constructed from the deduced amino-acid sequences of env. The results demonstrated that the full-length proviral DNA of PERV-WZSP belongs to the gammaretroviruses and shares high similarity with other PERVs. Sequence analysis also suggested that different patterns of LTR existed in the same porcine germ line and that a partial PERV-C sequence may recombine with a PERV-A sequence in the LTR. (c) 2008 Elsevier Ltd. All rights reserved.
Distinct biological subtypes and patterns of genome evolution in lymphoma revealed by circulating tumor DNA.

PubMed

Scherer, Florian; Kurtz, David M; Newman, Aaron M; Stehr, Henning; Craig, Alexander F M; Esfahani, Mohammad Shahrokh; Lovejoy, Alexander F; Chabon, Jacob J; Klass, Daniel M; Liu, Chih Long; Zhou, Li; Glover, Cynthia; Visser, Brendan C; Poultsides, George A; Advani, Ranjana H; Maeda, Lauren S; Gupta, Neel K; Levy, Ronald; Ohgami, Robert S; Kunder, Christian A; Diehn, Maximilian; Alizadeh, Ash A

2016-11-09

Patients with diffuse large B cell lymphoma (DLBCL) exhibit marked diversity in tumor behavior and outcomes, yet the identification of poor-risk groups remains challenging. In addition, the biology underlying these differences is incompletely understood. We hypothesized that characterization of mutational heterogeneity and genomic evolution using circulating tumor DNA (ctDNA) profiling could reveal molecular determinants of adverse outcomes. To address this hypothesis, we applied cancer personalized profiling by deep sequencing (CAPP-Seq) analysis to tumor biopsies and cell-free DNA samples from 92 lymphoma patients and 24 healthy subjects. At diagnosis, the amount of ctDNA was found to strongly correlate with clinical indices and was independently predictive of patient outcomes. We demonstrate that ctDNA genotyping can classify transcriptionally defined tumor subtypes, including DLBCL cell of origin, directly from plasma. By simultaneously tracking multiple somatic mutations in ctDNA, our approach outperformed immunoglobulin sequencing and radiographic imaging for the detection of minimal residual disease and facilitated noninvasive identification of emergent resistance mutations to targeted therapies. In addition, we identified distinct patterns of clonal evolution distinguishing indolent follicular lymphomas from those that transformed into DLBCL, allowing for potential noninvasive prediction of histological transformation. Collectively, our results demonstrate that ctDNA analysis reveals biological factors that underlie lymphoma clinical outcomes and could facilitate individualized therapy. Copyright © 2016, American Association for the Advancement of Science.
Identification of tissue-specific cell death using methylation patterns of circulating DNA

PubMed Central

Lehmann-Werman, Roni; Neiman, Daniel; Zemmour, Hai; Moss, Joshua; Magenheim, Judith; Vaknin-Dembinsky, Adi; Rubertsson, Sten; Nellgård, Bengt; Blennow, Kaj; Zetterberg, Henrik; Spalding, Kirsty; Haller, Michael J.; Wasserfall, Clive H.; Schatz, Desmond A.; Greenbaum, Carla J.; Dorrell, Craig; Grompe, Markus; Zick, Aviad; Hubert, Ayala; Maoz, Myriam; Fendrich, Volker; Bartsch, Detlef K.; Golan, Talia; Ben Sasson, Shmuel A.; Zamir, Gideon; Razin, Aharon; Cedar, Howard; Shapiro, A. M. James; Glaser, Benjamin; Shemer, Ruth; Dor, Yuval

2016-01-01

Minimally invasive detection of cell death could prove an invaluable resource in many physiologic and pathologic situations. Cell-free circulating DNA (cfDNA) released from dying cells is emerging as a diagnostic tool for monitoring cancer dynamics and graft failure. However, existing methods rely on differences in DNA sequences in source tissues, so that cell death cannot be identified in tissues with a normal genome. We developed a method of detecting tissue-specific cell death in humans based on tissue-specific methylation patterns in cfDNA. We interrogated tissue-specific methylome databases to identify cell type-specific DNA methylation signatures and developed a method to detect these signatures in mixed DNA samples. We isolated cfDNA from plasma or serum of donors, treated the cfDNA with bisulfite, PCR-amplified the cfDNA, and sequenced it to quantify cfDNA carrying the methylation markers of the cell type of interest. Pancreatic β-cell DNA was identified in the circulation of patients with recently diagnosed type-1 diabetes and islet-graft recipients; oligodendrocyte DNA was identified in patients with relapsing multiple sclerosis; neuronal/glial DNA was identified in patients after traumatic brain injury or cardiac arrest; and exocrine pancreas DNA was identified in patients with pancreatic cancer or pancreatitis. This proof-of-concept study demonstrates that the tissue origins of cfDNA and thus the rate of death of specific cell types can be determined in humans. The approach can be adapted to identify cfDNA derived from any cell type in the body, offering a minimally invasive window for diagnosing and monitoring a broad spectrum of human pathologies as well as providing a better understanding of normal tissue dynamics. PMID:26976580
Function-Based Algorithms for Biological Sequences

ERIC Educational Resources Information Center

Mohanty, Pragyan Sheela P.

2015-01-01

Two problems at two different abstraction levels of computational biology are studied. At the molecular level, efficient pattern matching algorithms in DNA sequences are presented. For gene order data, an efficient data structure is presented capable of storing all gene re-orderings in a systematic manner. A common characteristic of presented…
Base damage, local sequence context and TP53 mutation hotspots: a molecular dynamics study of benzo[a]pyrene induced DNA distortion and mutability

PubMed Central

Menzies, Georgina E.; Reed, Simon H.; Brancale, Andrea; Lewis, Paul D.

2015-01-01

The mutational pattern for the TP53 tumour suppressor gene in lung tumours differs from that of other cancer types by having a higher frequency of G:C>T:A transversions. The aetiology of this differing mutation pattern is still unknown. Benzo[a]pyrene diol epoxide (BPDE) is a potent cigarette smoke carcinogen that forms guanine adducts at TP53 CpG mutation hotspot sites including codons 157, 158, 245, 248 and 273. We performed molecular modelling of BPDE-adducted TP53 duplex sequences to determine the degree of local distortion caused by adducts which could influence the ability of nucleotide excision repair. We show that BPDE-adducted codon 157 has greater structural distortion than other TP53 G:C>T:A hotspot sites and that sequence context more distal to adjacent bases must influence local distortion. Using TP53 trinucleotide mutation signatures for lung cancer in smokers and non-smokers, we further show that codons 157 and 273 have the highest mutation probability in smokers. Combining this information with adduct structural data, we predict that G:C>T:A mutations at codon 157 in lung tumours of smokers are predominantly caused by BPDE. Our results provide insight into how different DNA sequence contexts show variability in DNA distortion at mutagen adduct sites that could compromise DNA repair at well characterized cancer related mutation hotspots. PMID:26400171
Optimal Ancient DNA Yields from the Inner Ear Part of the Human Petrous Bone.

PubMed

Pinhasi, Ron; Fernandes, Daniel; Sirak, Kendra; Novak, Mario; Connell, Sarah; Alpaslan-Roodenberg, Songül; Gerritsen, Fokke; Moiseyev, Vyacheslav; Gromov, Andrey; Raczky, Pál; Anders, Alexandra; Pietrusewsky, Michael; Rollefson, Gary; Jovanovic, Marija; Trinhhoang, Hiep; Bar-Oz, Guy; Oxenham, Marc; Matsumura, Hirofumi; Hofreiter, Michael

2015-01-01

The invention and development of next or second generation sequencing methods has resulted in a dramatic transformation of ancient DNA research and allowed shotgun sequencing of entire genomes from fossil specimens. However, although there are exceptions, most fossil specimens contain only low (~ 1% or less) percentages of endogenous DNA. The only skeletal element for which a systematically higher endogenous DNA content compared to other skeletal elements has been shown is the petrous part of the temporal bone. In this study we investigate whether (a) different parts of the petrous bone of archaeological human specimens give different percentages of endogenous DNA yields, (b) there are significant differences in average DNA read lengths, damage patterns and total DNA concentration, and (c) it is possible to obtain endogenous ancient DNA from petrous bones from hot environments. We carried out intra-petrous comparisons for ten petrous bones from specimens from Holocene archaeological contexts across Eurasia dated between 10,000-1,800 calibrated years before present (cal. BP). We obtained shotgun DNA sequences from three distinct areas within the petrous: a spongy part of trabecular bone (part A), the dense part of cortical bone encircling the osseous inner ear, or otic capsule (part B), and the dense part within the otic capsule (part C). Our results confirm that dense bone parts of the petrous bone can provide high endogenous aDNA yields and indicate that endogenous DNA fractions for part C can exceed those obtained for part B by up to 65-fold and those from part A by up to 177-fold, while total endogenous DNA concentrations are up to 126-fold and 109-fold higher for these comparisons. Our results also show that while endogenous yields from part C were lower than 1% for samples from hot (both arid and humid) parts, the DNA damage patterns indicate that at least some of the reads originate from ancient DNA molecules, potentially enabling ancient DNA analyses of samples from hot regions that are otherwise not amenable to ancient DNA analyses.
Useful DNA polymorphisms are identified by snapback, a midrepetitive element in Tribolium castaneum.

PubMed

Stuart, J J; De Gortari, M J; Hall, P S; Maxwell, M E; Mocelin, G; Brown, S J; Muir, W M

1996-06-01

The red flour beetle, Tribolium castaneum, is both a pest of stored grain products and an important experimental organism. To improve its facility as a genetic model, we are developing DNA fingerprinting methods for this insect. A Tribolium DNA fragment, snapback-1 (SBI), identified among sequences that reassociate before a Cot of 0.03 mol.s/L, was found to produce a banding pattern in restriction endonuclease digested genomic DNA that is characteristic of a midrepetitive element. DNA fingerprints of individual beetles demonstrated that unvarying inherited DNA polymorphism is revealed, and that polymorphism is inherited in a dominant Mendelian fashion. Linkage between bands was minimal. The sequence of SBI was determined, and hybridization experiments indicated that SBI is a fragment of a larger midrepetitive element. Fingerprinting individuals with known inbreeding coefficients indicated that SBI loci have relatively high mutation rates. The possibility that SBI is a fragment of a transposable element is discussed.

Re-entrant DNA gels

PubMed Central

Bomboi, Francesca; Romano, Flavio; Leo, Manuela; Fernandez-Castanon, Javier; Cerbino, Roberto; Bellini, Tommaso; Bordi, Federico; Filetici, Patrizia; Sciortino, Francesco

2016-01-01

DNA is acquiring a primary role in material development, self-assembling by design into complex supramolecular aggregates, the building block of a new-materials world. Using DNA nanoconstructs to translate sophisticated theoretical intuitions into experimental realizations by closely matching idealized models of colloidal particles is a much less explored avenue. Here we experimentally show that an appropriate selection of competing interactions enciphered in multiple DNA sequences results in the successful design of a one-pot DNA hydrogel that melts both on heating and on cooling. The relaxation time, measured by light scattering, slows down dramatically in a limited window of temperatures. The phase diagram displays a peculiar re-entrant shape, the hallmark of the competition between different bonding patterns. Our study shows that it is possible to rationally design biocompatible bulk materials with unconventional phase diagrams and tuneable properties by encoding into DNA sequences both the particle shape and the physics of the collective response. PMID:27767029
Improved Prediction of Non-methylated Islands in Vertebrates Highlights Different Characteristic Sequence Patterns

PubMed Central

Vingron, Martin

2016-01-01

Non-methylated islands (NMIs) of DNA are genomic regions that are important for gene regulation and development. A recent study of genome-wide non-methylation data in vertebrates by Long et al. (eLife 2013;2:e00348) has shown that many experimentally identified non-methylated regions do not overlap with classically defined CpG islands which are computationally predicted using simple DNA sequence features. This is especially true in cold-blooded vertebrates such as Danio rerio (zebrafish). In order to investigate how predictive DNA sequence is of a region’s methylation status, we applied a supervised learning approach using a spectrum kernel support vector machine, to see if a more complex model and supervised learning can be used to improve non-methylated island prediction and to understand the sequence properties of these regions. We demonstrate that DNA sequence is highly predictive of methylation status, and that in contrast to existing CpG island prediction methods our method is able to provide more useful predictions of NMIs genome-wide in all vertebrate organisms that were studied. Our results also show that in cold-blooded vertebrates (Anolis carolinensis, Xenopus tropicalis and Danio rerio) where genome-wide classical CpG island predictions consist primarily of false positives, longer primarily AT-rich DNA sequence features are able to identify these regions much more accurately. PMID:27984582
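
The spectrum kernel mentioned above compares two sequences through their k-mer counts. The following is a minimal sketch of that idea only, not the authors' implementation; the function names and the default k = 3 are illustrative assumptions, and a real classifier would pass such kernel values to a support vector machine library.

```python
from collections import Counter


def kmer_spectrum(seq, k=3):
    """Count every overlapping k-mer in seq (case-insensitive)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(seq_a, seq_b, k=3):
    """Spectrum kernel value: dot product of the two k-mer count vectors.

    k=3 is an arbitrary illustrative default, not a value from the study.
    """
    spec_a = kmer_spectrum(seq_a, k)
    spec_b = kmer_spectrum(seq_b, k)
    # A Counter returns 0 for k-mers that are absent from spec_b.
    return sum(count * spec_b[kmer] for kmer, count in spec_a.items())
```

Two sequences that share many k-mers get a large kernel value, while sequences with no k-mer in common get zero.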
Sequence-Level Mechanisms of Human Epigenome Evolution

PubMed Central

Prendergast, James G.D.; Chambers, Emily V.; Semple, Colin A.M.

2014-01-01

DNA methylation and chromatin states play key roles in development and disease. However, the extent of recent evolutionary divergence in the human epigenome and the influential factors that have shaped it are poorly understood. To determine the links between genome sequence and human epigenome evolution, we examined the divergence of DNA methylation and chromatin states following segmental duplication events in the human lineage. Chromatin and DNA methylation states were found to have been generally well conserved following a duplication event, with the evolution of the epigenome largely uncoupled from the total number of genetic changes in the surrounding DNA sequence. However, the epigenome at tissue-specific, distal regulatory regions was observed to be unusually prone to diverge following duplication, with particular sequence differences, altering known sequence motifs, found to be associated with divergence in patterns of DNA methylation and chromatin. Alu elements were found to have played a particularly prominent role in shaping human epigenome evolution, and we show that human-specific AluY insertion events are strongly linked to the evolution of the DNA methylation landscape and gene expression levels, including at key neurological genes in the human brain. Studying paralogous regions within the same sample enables the study of the links between genome and epigenome evolution while controlling for biological and technical variation. We show DNA methylation and chromatin divergence between duplicated regions are linked to the divergence of particular genetic motifs, with Alu elements having played a disproportionate role in the evolution of the epigenome in the human lineage. PMID:24966180
DNA Sequences Proximal to Human Mitochondrial DNA Deletion Breakpoints Prevalent in Human Disease Form G-quadruplexes, a Class of DNA Structures Inefficiently Unwound by the Mitochondrial Replicative Twinkle Helicase*

PubMed Central

Bharti, Sanjay Kumar; Sommers, Joshua A.; Zhou, Jun; Kaplan, Daniel L.; Spelbrink, Johannes N.; Mergny, Jean-Louis; Brosh, Robert M.

2014-01-01

Mitochondrial DNA deletions are prominent in human genetic disorders, cancer, and aging. It is thought that stalling of the mitochondrial replication machinery during DNA synthesis is a prominent source of mitochondrial genome instability; however, the precise molecular determinants of defective mitochondrial replication are not well understood. In this work, we performed a computational analysis of the human mitochondrial genome using the “Pattern Finder” G-quadruplex (G4) predictor algorithm to assess whether G4-forming sequences reside in close proximity (within 20 base pairs) to known mitochondrial DNA deletion breakpoints. We then used this information to map G4P sequences with deletions characteristic of representative mitochondrial genetic disorders and also those identified in various cancers and aging. Circular dichroism and UV spectral analysis demonstrated that mitochondrial G-rich sequences near deletion breakpoints prevalent in human disease form G-quadruplex DNA structures. A biochemical analysis of purified recombinant human Twinkle protein (gene product of c10orf2) showed that the mitochondrial replicative helicase inefficiently unwinds well characterized intermolecular and intramolecular G-quadruplex DNA substrates, as well as a unimolecular G4 substrate derived from a mitochondrial sequence that nests a deletion breakpoint described in human renal cell carcinoma. Although G4 has been implicated in the initiation of mitochondrial DNA replication, our current findings suggest that mitochondrial G-quadruplexes are also likely to be a source of instability for the mitochondrial genome by perturbing the normal progression of the mitochondrial replication machinery, including DNA unwinding by Twinkle helicase. PMID:25193669
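
The proximity analysis described above can be sketched as follows. This is not the "Pattern Finder" algorithm, whose rules are not given in the abstract; it assumes the widely used canonical motif of four G-runs (three or more Gs each) separated by loops of 1 to 7 bases, scans one strand only, and uses hypothetical function names. The 20-base window is taken from the abstract.

```python
import re

# Assumed canonical G4 motif: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+
G4_MOTIF = re.compile(r"G{3,}(?:[ACGT]{1,7}G{3,}){3}")


def find_g4_candidates(seq):
    """Return (start, end) half-open spans of non-overlapping motif matches."""
    return [(m.start(), m.end()) for m in G4_MOTIF.finditer(seq.upper())]


def g4_near_breakpoint(seq, breakpoint, window=20):
    """True if any candidate lies within `window` bases of a 0-based breakpoint."""
    for start, end in find_g4_candidates(seq):
        # The last base of the match is at index end - 1.
        if start - window <= breakpoint <= end - 1 + window:
            return True
    return False
```

A fuller analysis would also scan the complementary strand (C-runs) and treat the mitochondrial genome as circular; both are omitted here for brevity.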
Phylogeographic patterns of Armillaria ostoyae in the western United States

Treesearch

J. W. Hanna; N. B. Klopfenstein; M. -S. Kim; G. I. McDonald; J. A. Moore

2007-01-01

Nuclear ribosomal DNA regions (i.e. large subunit, internal transcribed spacer, 5.8S and intergenic spacer) were sequenced using a direct-polymerase chain reaction method from Armillaria ostoyae genets collected from the western USA. Many of the A. ostoyae genets contained heterogeneity among rDNA repeats, indicating intragenomic variation and likely intraspecific...
The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

PubMed

Holland, M J; Holland, J P; Thill, G P; Jackson, K A

1981-02-10

Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing of messenger RNA. Finally, there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5' noncoding portions of these glycolytic genes.
DNA methylation of retrotransposons, DNA transposons and genes in sugar beet (Beta vulgaris L.).

PubMed

Zakrzewski, Falk; Schmidt, Martin; Van Lijsebettens, Mieke; Schmidt, Thomas

2017-06-01

The methylation of cytosines shapes the epigenetic landscape of plant genomes, coordinates transgenerational epigenetic inheritance, represses the activity of transposable elements (TEs), affects gene expression and, hence, can influence the phenotype. Sugar beet (Beta vulgaris ssp. vulgaris), an important crop that accounts for 30% of worldwide sugar needs, has a relatively small genome size (758 Mbp) consisting of approximately 485 Mbp repetitive DNA (64%), in particular satellite DNA, retrotransposons and DNA transposons. Genome-wide cytosine methylation in the sugar beet genome was studied in leaves and leaf-derived callus with a focus on repetitive sequences, including retrotransposons and DNA transposons, the major groups of repetitive DNA sequences, and compared with gene methylation. Genes showed a specific methylation pattern for CG, CHG (H = A, C or T) and CHH sites, whereas the TE pattern differed, depending on the TE class (class 1, retrotransposons and class 2, DNA transposons). Along genes and TEs, CG and CHG methylation was higher than that of adjacent genomic regions. In contrast to the relatively low CHH methylation in retrotransposons and genes, the level of CHH methylation in DNA transposons was strongly increased, pointing to a functional role of asymmetric methylation in DNA transposon silencing. Comparison of genome-wide DNA methylation between sugar beet leaves and callus revealed a differential methylation upon tissue culture. Potential epialleles were hypomethylated (lower methylation) at CG and CHG sites in retrotransposons and genes and hypermethylated (higher methylation) at CHH sites in DNA transposons of callus when compared with leaves. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
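
The three cytosine contexts named above (CG, CHG and CHH, where H is A, C or T) can be assigned from the two bases that follow each cytosine. The sketch below illustrates that classification on a single strand; the function names are hypothetical and it is not taken from the study's pipeline.

```python
from collections import Counter


def cytosine_context(seq, i):
    """Classify the cytosine at 0-based index i as 'CG', 'CHG' or 'CHH'.

    Returns None if seq[i] is not C, or if the context cannot be decided
    (too close to the 3' end, or an ambiguous base such as N follows).
    """
    seq = seq.upper()
    if seq[i] != "C":
        return None
    following = seq[i + 1:i + 3]
    if following[:1] == "G":
        return "CG"
    if len(following) < 2 or any(base not in "ACGT" for base in following):
        return None
    # The first following base is now known to be H (A, C or T).
    return "CHG" if following[1] == "G" else "CHH"


def context_counts(seq):
    """Tally the contexts of all classifiable cytosines on the given strand."""
    counts = Counter()
    for i in range(len(seq)):
        context = cytosine_context(seq, i)
        if context is not None:
            counts[context] += 1
    return counts
```

Cytosines on the opposite strand would be handled by running the same function on the reverse complement.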
DnaSAM: Software to perform neutrality testing for large datasets with complex null models.

PubMed

Eckert, Andrew J; Liechty, John D; Tearse, Brandon R; Pande, Barnaly; Neale, David B

2010-05-01

Patterns of DNA sequence polymorphisms can be used to understand the processes of demography and adaptation within natural populations. High-throughput generation of DNA sequence data has historically been the bottleneck with respect to data processing and experimental inference. Advances in marker technologies have largely solved this problem. Currently, the limiting step is computational, with most molecular population genetic software allowing a gene-by-gene analysis through a graphical user interface. An easy-to-use analysis program that allows both high-throughput processing of multiple sequence alignments along with the flexibility to simulate data under complex demographic scenarios is currently lacking. We introduce a new program, named DnaSAM, which allows high-throughput estimation of DNA sequence diversity and neutrality statistics from experimental data along with the ability to test those statistics via Monte Carlo coalescent simulations. These simulations are conducted using the ms program, which is able to incorporate several genetic parameters (e.g. recombination) and demographic scenarios (e.g. population bottlenecks). The output is a set of diversity and neutrality statistics with associated probability values under a user-specified null model that are stored in an easy-to-manipulate text file. © 2009 Blackwell Publishing Ltd.
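
As an illustration of the kind of diversity statistics discussed above, the sketch below computes two standard estimators from a gap-free alignment: the mean number of pairwise differences (pi) and Watterson's theta. It is not DnaSAM code; the function names are hypothetical and the alignment is assumed to contain equal-length sequences with no missing data.

```python
from itertools import combinations


def segregating_sites(alignment):
    """Number of alignment columns with more than one distinct base."""
    return sum(1 for column in zip(*alignment) if len(set(column)) > 1)


def pairwise_diversity(alignment):
    """Mean number of differences over all sequence pairs (pi, per locus)."""
    pairs = list(combinations(alignment, 2))
    total = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total / len(pairs)


def watterson_theta(alignment):
    """Watterson's estimator: S divided by the harmonic number a1 = sum(1/i)."""
    n = len(alignment)
    a1 = sum(1.0 / i for i in range(1, n))
    return segregating_sites(alignment) / a1
```

The difference between the two estimates is the numerator of Tajima's D, one common neutrality statistic; its significance is typically assessed against simulated data, as the abstract describes.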
Predicting DNA binding proteins using support vector machine with hybrid fractal features.

PubMed

Niu, Xiao-Hui; Hu, Xue-Hai; Shi, Feng; Xia, Jing-Bo

2014-02-21

DNA-binding proteins play a vitally important role in many biological processes. Prediction of DNA-binding proteins from amino acid sequence is a significant but not yet fully resolved scientific problem. Chaos game representation (CGR) investigates the patterns hidden in protein sequences, and visually reveals previously unknown structure. Fractal dimensions (FD) are good tools to measure sizes of complex, highly irregular geometric objects. In order to extract the intrinsic correlation with DNA-binding property from protein sequences, the CGR algorithm, fractal dimension and amino acid composition are applied to formulate the numerical features of protein samples in this paper. Seven groups of features are extracted, which can be computed directly from the primary sequence, and each group is evaluated by the 10-fold cross-validation test and Jackknife test. Comparing the results of numerical experiments, the group of amino acid composition and fractal dimension (21-dimension vector) gives the best result, with an average accuracy of 81.82% and an average Matthews correlation coefficient (MCC) of 0.6017. The resulting predictor is also compared with the existing method DNA-Prot and shows better performance. © 2013 The Authors. Published by Elsevier Ltd. All rights reserved.
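
The Matthews correlation coefficient reported above is computed from the four cells of a binary confusion matrix. A minimal sketch follows; the function name and the convention of returning 0.0 when the denominator vanishes are assumptions, and no counts from the study are reproduced.

```python
from math import sqrt


def matthews_corrcoef(tp, tn, fp, fn):
    """MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))."""
    denominator = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention assumed here: report 0.0 when a row or column sum is zero.
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator
```

The value ranges from -1 (complete disagreement) through 0 (no better than chance) to +1 (perfect prediction), which makes it more informative than accuracy alone when the two classes are unbalanced.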
enoLOGOS: a versatile web tool for energy normalized sequence logos

PubMed Central

Workman, Christopher T.; Yin, Yutong; Corcoran, David L.; Ideker, Trey; Stormo, Gary D.; Benos, Panayiotis V.

2005-01-01

enoLOGOS is a web-based tool that generates sequence logos from various input sources. Sequence logos have become a popular way to graphically represent DNA and amino acid sequence patterns from a set of aligned sequences. Each position of the alignment is represented by a column of stacked symbols with its total height reflecting the information content in this position. Currently, the available web servers are able to create logo images from a set of aligned sequences, but none of them generates weighted sequence logos directly from energy measurements or other sources. With the advent of high-throughput technologies for estimating the contact energy of different DNA sequences, tools that can create logos directly from binding affinity data are useful to researchers. enoLOGOS generates sequence logos from a variety of input data, including energy measurements, probability matrices, alignment matrices, count matrices and aligned sequences. Furthermore, enoLOGOS can represent the mutual information of different positions of the consensus sequence, a unique feature of this tool. Another web interface for our software, C2H2-enoLOGOS, generates logos for the DNA-binding preferences of the C2H2 zinc-finger transcription factor family members. enoLOGOS and C2H2-enoLOGOS are accessible over the web at . PMID:15980495
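
The column heights described above correspond to per-position information content. The sketch below computes the classic value for DNA from aligned sequences, log2(4) minus the Shannon entropy of the column, without any small-sample correction; it does not reproduce enoLOGOS's energy-based weighting, and the function names are hypothetical.

```python
from collections import Counter
from math import log2


def column_information(column, alphabet="ACGT"):
    """Information content (bits) of one alignment column."""
    counts = Counter(column)
    total = sum(counts[base] for base in alphabet)
    if total == 0:
        return 0.0  # column holds only gaps or ambiguous symbols
    entropy = 0.0
    for base in alphabet:
        if counts[base]:
            p = counts[base] / total
            entropy -= p * log2(p)
    return log2(len(alphabet)) - entropy


def logo_heights(aligned_seqs):
    """Total stack height for every position of an alignment."""
    return [column_information(column) for column in zip(*aligned_seqs)]
```

In a logo, each symbol's height within the stack is its frequency multiplied by the column's information content, so a fully conserved DNA position reaches 2 bits and a uniformly mixed position collapses to 0.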
Deppdb--DNA electrostatic potential properties database: electrostatic properties of genome DNA.

PubMed

Osypov, Alexander A; Krutinin, Gleb G; Kamzolova, Svetlana G

2010-06-01

The electrostatic properties of genome DNA influence its interactions with different proteins, in particular, the regulation of transcription by RNA-polymerases. DEPPDB--DNA Electrostatic Potential Properties Database--was developed to hold and provide all available information on the electrostatic properties of genome DNA combined with its sequence and annotation of biological and structural properties of genome elements and whole genomes. Genomes in DEPPDB are organized on a taxonomical basis. Currently, the database contains all the completely sequenced bacterial and viral genomes according to NCBI RefSeq. General properties of the genome DNA electrostatic potential profile and principles of its formation are revealed. This potential correlates with the GC content but does not correspond to it exactly and strongly depends on both the sequence arrangement and its context (flanking regions). Analysis of the promoter regions for bacterial and viral RNA polymerases revealed a correspondence between the scale of these proteins' physical properties and electrostatic profile patterns. We also discovered a direct correlation between the potential value and the binding frequency of RNA polymerase to DNA, supporting the idea of the role of electrostatics in these interactions. This matches a pronounced tendency of the promoter regions to possess higher values of the electrostatic potential.
Exploring the roles of DNA methylation in the metal-reducing bacterium Shewanella oneidensis MR-1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bendall, Matthew L.; Luong, Khai; Wetmore, Kelly M.

2013-08-30

We performed whole genome analyses of DNA methylation in Shewanella oneidensis MR-1 to examine its possible role in regulating gene expression and other cellular processes. Single-Molecule Real Time (SMRT) sequencing revealed extensive methylation of adenine (N6mA) throughout the genome. These methylated bases were located in five sequence motifs, including three novel targets for Type I restriction/modification enzymes. The sequence motifs targeted by putative methyltransferases were determined via SMRT sequencing of gene knockout mutants. In addition, we found S. oneidensis MR-1 cultures grown under various culture conditions displayed different DNA methylation patterns. However, the small number of differentially methylated sites could not be directly linked to the much larger number of differentially expressed genes in these conditions, suggesting DNA methylation is not a major regulator of gene expression in S. oneidensis MR-1. The enrichment of methylated GATC motifs in the origin of replication indicates DNA methylation may regulate genome replication in a manner similar to that seen in Escherichia coli. Furthermore, comparative analyses suggest that many Gammaproteobacteria, including all members of the Shewanellaceae family, may also utilize DNA methylation to regulate genome replication.
Identification of human papillomavirus (HPV) 16 DNA integration and the ensuing patterns of methylation in HPV-associated head and neck squamous cell carcinoma cell lines.

PubMed

Hatano, Takashi; Sano, Daisuke; Takahashi, Hideaki; Hyakusoku, Hiroshi; Isono, Yasuhiro; Shimada, Shoko; Sawakuma, Kae; Takada, Kentaro; Oikawa, Ritsuko; Watanabe, Yoshiyuki; Yamamoto, Hiroyuki; Itoh, Fumio; Myers, Jeffrey N; Oridate, Nobuhiko

2017-04-01

Recent studies showed that human papillomavirus (HPV) integration contributes to the genomic instability seen in HPV-associated head and neck squamous cell carcinoma (HPV-HNSCC). However, the epigenetic alterations induced after HPV integration remain unclear. To identify the molecular details of HPV16 DNA integration and the ensuing patterns of methylation in HNSCC, we performed next-generation sequencing using a target-enrichment method for the effective identification of HPV16 integration breakpoints as well as the characterization of genomic sequences adjacent to HPV16 integration breakpoints with three HPV16-related HNSCC cell lines. The DNA methylation levels of the integrated HPV16 genome and of the adjacent human genome were also analyzed by bisulfite pyrosequencing. We found various integration loci, including novel integration sites. Integration loci were located predominantly in the intergenic region, with a significant enrichment of the microhomologous sequences between the human and HPV16 genomes at the integration breakpoints. Furthermore, various levels of methylation within both the human genome and the integrated HPV genome at the integration breakpoints in each integrant were observed. Allele-specific methylation analysis suggested that the HPV16 integrants remained hypomethylated when the flanking host genome was hypomethylated. After integration into highly methylated human genome regions, however, the HPV16 DNA became methylated. In conclusion, we found novel integration sites and methylation patterns in HPV-HNSCC using our unique method. These findings may provide insights into the viral integration mechanism and the virus-associated carcinogenesis of HPV-HNSCC. © 2016 UICC.
Deep sequencing reveals distinct patterns of DNA methylation in prostate cancer.

PubMed

Kim, Jung H; Dhanasekaran, Saravana M; Prensner, John R; Cao, Xuhong; Robinson, Daniel; Kalyana-Sundaram, Shanker; Huang, Christina; Shankar, Sunita; Jing, Xiaojun; Iyer, Matthew; Hu, Ming; Sam, Lee; Grasso, Catherine; Maher, Christopher A; Palanisamy, Nallasivam; Mehra, Rohit; Kominsky, Hal D; Siddiqui, Javed; Yu, Jindan; Qin, Zhaohui S; Chinnaiyan, Arul M

2011-07-01

Beginning with precursor lesions, aberrant DNA methylation marks the entire spectrum of prostate cancer progression. We mapped the global DNA methylation patterns in select prostate tissues and cell lines using MethylPlex-next-generation sequencing (M-NGS). Hidden Markov model-based next-generation sequence analysis identified ~68,000 methylated regions per sample. While global CpG island (CGI) methylation was not differential between benign adjacent and cancer samples, overall promoter CGI methylation significantly increased from ~12.6% in benign samples to 19.3% and 21.8% in localized and metastatic cancer tissues, respectively (P-value < 2 × 10^-16). We found distinct patterns of promoter methylation around transcription start sites, where methylation occurred not only on the CGIs, but also on flanking regions and CGI sparse promoters. Among the 6691 methylated promoters in prostate tissues, 2481 differentially methylated regions (DMRs) are cancer-specific, including numerous novel DMRs. A novel cancer-specific DMR in the WFDC2 promoter showed frequent methylation in cancer (17/22 tissues, 6/6 cell lines), but not in the benign tissues (0/10) and normal PrEC cells. Integration of LNCaP DNA methylation and H3K4me3 data suggested an epigenetic mechanism for alternate transcription start site utilization, and these modifications segregated into distinct regions when present on the same promoter. Finally, we observed differences in repeat element methylation, particularly LINE-1, between ERG gene fusion-positive and -negative cancers, and we confirmed this observation using pyrosequencing on a tissue panel. This comprehensive methylome map will further our understanding of epigenetic regulation in prostate cancer progression.
MySSP: Non-stationary evolutionary sequence simulation, including indels

PubMed Central

Rosenberg, Michael S.

2007-01-01

MySSP is a new program for the simulation of DNA sequence evolution across a phylogenetic tree. Although many programs are available for sequence simulation, MySSP is unique in its inclusion of indels, flexibility in allowing for non-stationary patterns, and output of ancestral sequences. Some of these features can individually be found in existing programs, but not all have previously been available in a single package. PMID:19325855
Complete cpDNA genome sequence of Smilax china and phylogenetic placement of Liliales--influences of gene partitions and taxon sampling.

PubMed

Liu, Juan; Qi, Zhe-Chen; Zhao, Yun-Peng; Fu, Cheng-Xin; Jenny Xiang, Qiu-Yun

2012-09-01

The complete nucleotide sequence of the chloroplast genome (cpDNA) of Smilax china L. (Smilacaceae) is reported. It is the first complete cp genome sequence in Liliales. Genomic analyses were conducted to examine the rate and pattern of cpDNA genome evolution in Smilax relative to other major lineages of monocots. The cpDNA genomic sequences were combined with those available for Lilium to evaluate the phylogenetic position of Liliales and to investigate the influence of taxon sampling, gene sampling, gene function, natural selection, and substitution rate on phylogenetic inference in monocots. Phylogenetic analyses using sequence data of gene groups partitioned according to gene function, selection force, and total substitution rate demonstrated evident impacts of these factors on phylogenetic inference of monocots and the placement of Liliales, suggesting potential evolutionary convergence or adaptation of some cpDNA genes in monocots. Our study also demonstrated that reduced taxon sampling reduced the bootstrap support for the placement of Liliales in the cpDNA phylogenomic analysis. Analyses of sequences of 77 protein genes with some missing data and sequences of 81 genes (all protein genes plus the rRNA genes) support a sister relationship of Liliales to the commelinids-Asparagales clade, consistent with the APG III system. Analyses of 63 cpDNA protein genes for 32 taxa with few missing data, however, support a sister relationship of Liliales (represented by Smilax and Lilium) to Dioscoreales-Pandanales. Topology tests indicated that these two alignments do not significantly differ given any of these three cpDNA genomic sequence data sets. Furthermore, we found no saturation effect of the data, suggesting that the cpDNA genomic sequence data used in the study are appropriate for monocot phylogenetic study and that long-branch attraction is unlikely to explain the two well-supported, conflicting placements of Liliales. Further analyses using sufficient nuclear data remain necessary to evaluate these two phylogenetic hypotheses regarding the position of Liliales and to address the causes of signal conflict among genes and partitions. Copyright © 2012 Elsevier Inc. All rights reserved.
Sequence and pattern of expression of a bovine homologue of a human mitochondrial transport protein associated with Grave's disease.

PubMed

Fiermonte, G; Runswick, M J; Walker, J E; Palmieri, F

1992-01-01

A human cDNA has been isolated previously from a thyroid library with the aid of serum from a patient with Grave's disease. It encodes a protein belonging to the mitochondrial metabolite carrier family, referred to as the Grave's disease carrier protein (GDC). Using primers based on this sequence, overlapping cDNAs encoding the bovine homologue of the GDC have been isolated from total bovine heart poly(A)+ cDNA. The bovine protein is 18 amino acids shorter than the published human sequence, but if a frame shift requiring the removal of one nucleotide is introduced into the human cDNA sequence, the human and bovine proteins become identical in their C-terminal regions, and 308 out of 330 amino acids are conserved over their entire sequences. The bovine cDNA has been used to investigate the expression of the GDC in various bovine tissues. In the tissues that were examined, the GDC is most strongly expressed in the thyroid, but substantial amounts of its mRNA were also detected in liver, lung and kidney, and lesser amounts in heart and skeletal muscle.
spads 1.0: a toolbox to perform spatial analyses on DNA sequence data sets.

PubMed

Dellicour, Simon; Mardulyn, Patrick

2014-05-01

SPADS 1.0 (for 'Spatial and Population Analysis of DNA Sequences') is a population genetic toolbox for characterizing genetic variability within and among populations from DNA sequences. In view of the drastic increase in genetic information available through sequencing methods, spads was specifically designed to deal with multilocus data sets of DNA sequences. It computes several summary statistics from populations or groups of populations, performs input file conversions for other population genetic programs and implements locus-by-locus and multilocus versions of two clustering algorithms to study the genetic structure of populations. The toolbox also includes two MATLAB and R functions, GDISPAL and GDIVPAL, to display differentiation and diversity patterns across landscapes. These functions aim to generate interpolating surfaces based on multilocus distance and diversity indices. In the case of multiple loci, such surfaces can represent a useful alternative to the multiple pie chart maps traditionally used in phylogeography to represent the spatial distribution of genetic diversity. These coloured surfaces can also be used to compare different data sets or different diversity and/or distance measures estimated on the same data set. © 2013 John Wiley & Sons Ltd.
New dye-labeled terminators for improved DNA sequencing patterns.

PubMed Central

Rosenblum, B B; Lee, L G; Spurgeon, S L; Khan, S H; Menchen, S M; Heiner, C R; Chen, S M

1997-01-01

We have used two new dye sets for automated dye-labeled terminator DNA sequencing. One set consists of four, 4,7-dichlororhodamine dyes (d-rhodamines). The second set consists of energy-transfer dyes that use the 5-carboxy-d-rhodamine dyes as acceptor dyes and the 5- or 6-carboxy isomers of 4'-aminomethylfluorescein as the donor dye. Both dye sets utilize a new linker between the dye and the nucleotide, and both provide more even peak heights in terminator sequencing than the dye-terminators consisting of unsubstituted rhodamine dyes. The unsubstituted rhodamine terminators produced electropherograms in which weak G peaks are observed after A peaks and occasionally C peaks. The number of weak G peaks has been reduced or eliminated with the new dye terminators. The general improvement in peak evenness improves accuracy for the automated base-calling software. The improved signal-to-noise ratio of the energy-transfer dye-labeled terminators combined with more even peak heights results in successful sequencing of high molecular weight DNA templates such as bacterial artificial chromosome DNA. PMID:9358158
Identification of unique repeated patterns, location of mutation in DNA finger printing using artificial intelligence technique.

PubMed

Mukunthan, B; Nagaveni, N

2014-01-01

In genetic engineering, the conventional techniques and algorithms that forensic scientists use to identify individuals from their DNA profiles involve complex computational steps and mathematical formulae, and locating a mutation in a genomic sequence in the laboratory remains a demanding task. This novel approach can address problems that have no algorithmic solution or whose available solutions are too complex to find. Combining bioinformatics with neural network techniques yields an efficient DNA pattern analysis algorithm with utmost prediction accuracy.

Heritable Epigenomic Changes to the Maize Methylome Resulting from Tissue Culture.

PubMed

Han, Zhaoxue; Crisp, Peter A; Stelpflug, Scott; Kaeppler, Shawn M; Li, Qing; Springer, Nathan M

2018-05-30

DNA methylation can contribute to the maintenance of genome integrity and regulation of gene expression. In most situations, DNA methylation patterns are inherited quite stably. However, changes in DNA methylation can occur at some loci as a result of tissue culture, resulting in somaclonal variation. To investigate heritable epigenetic changes as a consequence of tissue culture, a sequence-capture bisulfite sequencing approach was implemented to monitor context-specific DNA methylation patterns in ~15 Mb of the maize genome for a population of plants that had been regenerated from tissue culture. Plants that have been regenerated from tissue culture exhibit gains and losses of DNA methylation at a subset of genomic regions. There was evidence for a high rate of homozygous changes to DNA methylation levels that occur consistently in multiple independent tissue culture lines, suggesting that some loci are either targeted or hotspots for epigenetic variation. The consistent changes inherited following tissue culture include both gains and losses of DNA methylation and can affect CG, CHG or both contexts within a region. Only a subset of the tissue culture changes observed in callus plants are observed in the primary regenerants, but the majority of DNA methylation changes present in primary regenerants are passed on to offspring. This study provides insights into the susceptibility of some loci and potential mechanisms that could contribute to altered DNA methylation and epigenetic state that occur during tissue culture in plant species. Copyright © 2018, Genetics.
Global DNA methylation analysis reveals miR-214-3p contributes to cisplatin resistance in pediatric intracranial nongerminomatous malignant germ cell tumors.

PubMed

Hsieh, Tsung-Han; Liu, Yun-Ru; Chang, Ting-Yu; Liang, Muh-Lii; Chen, Hsin-Hung; Wang, Hsei-Wei; Yen, Yun; Wong, Tai-Tong

2018-03-27

Pediatric central nervous system germ cell tumors (CNSGCTs) are rare and heterogeneous neoplasms, which can be divided into germinomas and nongerminomatous germ cell tumors (NGGCTs). NGGCTs are further subdivided into mature teratomas and nongerminomatous malignant GCTs (NGMGCTs). Clinical outcomes suggest that NGMGCTs have poor prognosis and survival and that they require more extensive radiotherapy and adjuvant chemotherapy. However, the mechanisms underlying this difference are still unclear. DNA methylation alteration is generally acknowledged to cause therapeutic resistance in cancers. We hypothesized that the pediatric NGMGCTs exhibit a different genome-wide DNA methylation pattern, which is involved in the mechanism of their therapeutic resistance. We performed methylation and hydroxymethylation DNA immunoprecipitation sequencing, mRNA expression microarray, and small RNA sequencing (smRNA-seq) to determine methylation-regulated genes, including microRNAs (miRNAs). The expression levels of 97 genes and 8 miRNAs were correlated with promoter DNA methylation and hydroxymethylation status, such as the miR-199/-214 cluster, and treatment with DNA demethylating agent 5-aza-2'-deoxycytidine elevated its expression level. Furthermore, smRNA-seq analysis showed 27 novel miRNA candidates with differential expression between germinomas and NGMGCTs. Overexpression of miR-214-3p in NCCIT cells leads to reduced expression of the pro-apoptotic protein BCL2-like 11 and induces cisplatin resistance. We interrogated the differential DNA methylation patterns between germinomas and NGMGCTs and proposed a mechanism for chemoresistance in NGMGCTs. In addition, our sequencing data provide a roadmap for further pediatric CNSGCT research and potential targets for the development of new therapeutic strategies.
Genomic sequencing and in vivo footprinting of an expression-specific DNase I-hypersensitive site of avian vitellogenin II promoter reveal a demethylation of a mCpG and a change in specific interactions of proteins with DNA.

PubMed Central

Saluz, H P; Feavers, I M; Jiricny, J; Jost, J P

1988-01-01

Genomic sequencing was used to study the in vivo methylation pattern of two CpG sites in the promoter region of the avian vitellogenin gene. The CpG at position +10 was fully methylated in DNA isolated from tissues that do not express the gene but was unmethylated in the liver of mature hens and estradiol-treated roosters. In the latter tissue, this site became demethylated and DNase I hypersensitive after estradiol treatment. A second CpG (position -52) was unmethylated in all tissues examined. In vivo genomic footprinting with dimethyl sulfate revealed different patterns of DNA protection in silent and expressed genes. In rooster liver cells, at least 10 base pairs of DNA, including the methylated CpG, were protected by protein(s). Gel-shift assays indicated that a protein factor, present in rooster liver nuclear extract, bound at this site only when it was methylated. In hen liver cells, the same unmethylated CpG lies within a protected region of approximately 20 base pairs. In vitro DNase I protection and gel-shift assays indicate that this sequence is bound by a protein, which binds both double- and single-stranded DNA. For the latter substrate, this factor was shown to bind solely the noncoding (i.e., mRNA-like) strand. PMID:3413118
Molecular organization and phylogenetic analysis of 5S rDNA in crustaceans of the genus Pollicipes reveal birth-and-death evolution and strong purifying selection.

PubMed

Perina, Alejandra; Seoane, David; González-Tizón, Ana M; Rodríguez-Fariña, Fernanda; Martínez-Lage, Andrés

2011-10-17

In higher eukaryotes, the 5S ribosomal DNA (5S rDNA) is organized in tandem arrays with repeat units that consist of a transcribed region (5S) and a variable nontranscribed spacer (NTS). Until recently the 5S rDNA was thought to be subject to concerted evolution; however, in several taxa, sequence divergence levels between the 5S and the NTS were found to be higher than expected under this model. Many studies have since shown that birth-and-death processes and selection can drive the evolution of 5S rDNA. Analyses of 5S rDNA evolution find several 5S rDNA types in the genome, with low levels of nucleotide variation in the 5S and a highly divergent spacer region. Molecular organization and nucleotide sequence of the 5S ribosomal DNA multigene family (5S rDNA) were investigated in three Pollicipes species in an evolutionary context. The nucleotide sequence variation revealed that several 5S rDNA variants occur in Pollicipes genomes. They are clustered in up to seven different types based on differences in their nontranscribed spacers (NTS). Five different units of 5S rDNA were characterized in P. pollicipes and two different units in P. elegans and P. polymerus. Analysis of these sequences showed that identical types were shared among species and that two pseudogenes were present. We predicted the secondary structure and characterized the upstream and downstream conserved elements. Phylogenetic analysis showed an among-species clustering pattern of 5S rDNA types. These results suggest that the evolution of Pollicipes 5S rDNA is driven by birth-and-death processes with strong purifying selection.
Molecular organization and phylogenetic analysis of 5S rDNA in crustaceans of the genus Pollicipes reveal birth-and-death evolution and strong purifying selection

PubMed Central

2011-01-01

Background: In higher eukaryotes, the 5S ribosomal DNA (5S rDNA) is organized in tandem arrays with repeat units that consist of a transcribed region (5S) and a variable nontranscribed spacer (NTS). Until recently the 5S rDNA was thought to be subject to concerted evolution; however, in several taxa, sequence divergence levels between the 5S and the NTS were found to be higher than expected under this model. Many studies have since shown that birth-and-death processes and selection can drive the evolution of 5S rDNA. Analyses of 5S rDNA evolution find several 5S rDNA types in the genome, with low levels of nucleotide variation in the 5S and a highly divergent spacer region. Molecular organization and nucleotide sequence of the 5S ribosomal DNA multigene family (5S rDNA) were investigated in three Pollicipes species in an evolutionary context. Results: The nucleotide sequence variation revealed that several 5S rDNA variants occur in Pollicipes genomes. They are clustered in up to seven different types based on differences in their nontranscribed spacers (NTS). Five different units of 5S rDNA were characterized in P. pollicipes and two different units in P. elegans and P. polymerus. Analysis of these sequences showed that identical types were shared among species and that two pseudogenes were present. We predicted the secondary structure and characterized the upstream and downstream conserved elements. Phylogenetic analysis showed an among-species clustering pattern of 5S rDNA types. Conclusions: These results suggest that the evolution of Pollicipes 5S rDNA is driven by birth-and-death processes with strong purifying selection. PMID:22004418
Genetic differences between blood- and brain-derived viral sequences from human immunodeficiency virus type 1-infected patients: evidence of conserved elements in the V3 region of the envelope protein of brain-derived sequences.

PubMed Central

Korber, B T; Kunstman, K J; Patterson, B K; Furtado, M; McEvilly, M M; Levy, R; Wolinsky, S M

1994-01-01

Human immunodeficiency virus type 1 (HIV-1) sequences were generated from blood and from brain tissue obtained by stereotactic biopsy from six patients undergoing a diagnostic neurosurgical procedure. Proviral DNA was directly amplified by nested PCR, and 8 to 36 clones from each sample were sequenced. Phylogenetic analysis of intrapatient envelope V3-V5 region HIV-1 DNA sequence sets revealed that brain viral sequences were clustered relative to the blood viral sequences, suggestive of tissue-specific compartmentalization of the virus in four of the six cases. In the other two cases, the blood and brain virus sequences were intermingled in the phylogenetic analyses, suggesting trafficking of virus between the two tissues. Slide-based PCR-driven in situ hybridization of two of the patients' brain biopsy samples confirmed our interpretation of the intrapatient phylogenetic analyses. Interpatient V3 region brain-derived sequence distances were significantly less than blood-derived sequence distances. Relative to the tip of the loop, the set of brain-derived viral sequences had a tendency towards negative or neutral charge compared with the set of blood-derived viral sequences. Entropy calculations were used as a measure of the variability at each position in alignments of blood and brain viral sequences. A relatively conserved set of positions was found, with a significantly lower entropy in the brain- than in the blood-derived viral sequences. These sites constitute a brain "signature pattern," or a noncontiguous set of amino acids in the V3 region conserved in viral sequences derived from brain tissue. This brain-derived signature pattern was also well preserved among isolates previously characterized in vitro as macrophage tropic. Macrophage-monocyte tropism may be the biological constraint that results in the conservation of the viral brain signature pattern. PMID:7933130
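The per-position entropy measure mentioned in this abstract can be illustrated with a minimal sketch. It assumes Shannon entropy in bits, which the abstract does not specify, and the function names and the short toy sequences are hypothetical placeholders, not data from the study.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residues at one alignment position."""
    counts = Counter(column)
    total = len(column)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def alignment_entropy(sequences):
    """Entropy at every position of a set of equal-length aligned sequences."""
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    # zip(*sequences) walks the alignment column by column.
    return [column_entropy(col) for col in zip(*sequences)]


# Made-up toy alignments, used only to show the call pattern.
toy_brain = ["GPGR", "GPGR", "GPGR", "GPGK"]
toy_blood = ["GPGR", "GLGK", "GPGQ", "APGR"]

for pos, (h_brain, h_blood) in enumerate(
    zip(alignment_entropy(toy_brain), alignment_entropy(toy_blood)), start=1
):
    print(f"position {pos}: brain {h_brain:.2f} bits, blood {h_blood:.2f} bits")
```

A position where every sequence carries the same residue has zero entropy, so conserved "signature" positions would show up as low values in one set relative to the other.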
5-Methyldeoxycytidine in the Physarum minichromosome containing the ribosomal RNA genes.

PubMed Central

Cooney, C A; Matthews, H R; Bradbury, E M

1984-01-01

5-Methyldeoxycytidine (5MC) was analyzed by high pressure liquid chromatography (HPLC) and by restriction enzyme digestion in rDNA isolated from Physarum polycephalum. rDNA from Physarum M3C strain microplasmodia has a significant 5MC content (about half that of the whole genomic DNA). This rDNA contains many C5MCGG sites because it is clearly digested further by Msp I than by Hpa II. However, most 5MC is in other sites. In particular, alternating CG sequences appear to be highly methylated. HPLC of deoxyribonucleosides shows that most of the transcribed regions contain little or no 5MC. Restriction digestion indicates that there is little or no 5MC in any of the transcribed regions including the transcription origin and adjacent sequences. Over 90% of the total 5MC is in or near the central nontranscribed spacer and most methylated restriction sites are in inverted repeats of this spacer. rDNA is very heterogeneous with respect to 5MC. The 5MC pattern doesn't appear to change with inactivation of the rRNA genes during reversible differentiation from microplasmodia (growing) to microsclerotia (dormant), showing that inactivation is due to changes in other chromatin variables. The 5MC pattern is different between Physarum strains. The possible involvement of this 5MC in rDNA chromatin structure and in cruciform and Z-DNA formation is discussed. PMID:6322108
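The Msp I versus Hpa II comparison in this abstract rests on both enzymes recognizing CCGG, with Hpa II blocked when the internal cytosine is methylated while Msp I still cuts. The sketch below models only that rule; it is a simplification (it ignores methylation of the outer cytosine and exact cut positions), and the function names and example sequence are hypothetical.

```python
def ccgg_sites(seq):
    """Return 0-based start indices of every CCGG recognition site."""
    seq = seq.upper()
    return [i for i in range(len(seq) - 3) if seq[i:i + 4] == "CCGG"]


def digestible_sites(seq, methylated_positions, enzyme):
    """Sites an enzyme can cut, given 0-based indices of methylated cytosines.

    Simplified model: Msp I cuts every CCGG; Hpa II cuts only when the
    internal C (index start + 1) is unmethylated.
    """
    sites = ccgg_sites(seq)
    if enzyme == "MspI":
        return sites
    if enzyme == "HpaII":
        return [i for i in sites if (i + 1) not in methylated_positions]
    raise ValueError(f"unsupported enzyme: {enzyme}")


# Hypothetical fragment with two CCGG sites; the second has a methylated internal C.
fragment = "AACCGGTTCCGGAA"
methylated = {9}

print("Msp I sites: ", digestible_sites(fragment, methylated, "MspI"))
print("Hpa II sites:", digestible_sites(fragment, methylated, "HpaII"))
```

Sites cut by Msp I but not by Hpa II are the candidates for C5MCGG, which is the inference the abstract draws from the more extensive Msp I digestion.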
Mutations Affecting Expression of the rosy Locus in Drosophila melanogaster

PubMed Central

Lee, Chong Sung; Curtis, Daniel; McCarron, Margaret; Love, Carol; Gray, Mark; Bender, Welcome; Chovnick, Arthur

1987-01-01

The rosy locus in Drosophila melanogaster codes for the enzyme xanthine dehydrogenase (XDH). Previous studies defined a "control element" near the 5' end of the gene, where variant sites affected the amount of rosy mRNA and protein produced. We have determined the DNA sequence of this region from both genomic and cDNA clones, and from the ry+10 underproducer strain. This variant strain had many sequence differences, so that the site of the regulatory change could not be fixed. A mutagenesis was also undertaken to isolate new regulatory mutations. We induced 376 new mutations with 1-ethyl-1-nitrosourea (ENU) and screened them to isolate those that reduced the amount of XDH protein produced, but did not change the properties of the enzyme. Genetic mapping was used to find mutations located near the 5' end of the gene. DNA from each of seven mutants was cloned and sequenced through the 5' region. Mutant base changes were identified in all seven; they appear to affect splicing and translation of the rosy mRNA. In a related study (T. P. Keith et al. 1987), the genomic and cDNA sequences are extended through the 3' end of the gene; the combined sequences define the processing pattern of the rosy transcript and predict the amino acid sequence of XDH. PMID:3036645
Rapid identification of causative species in patients with Old World leishmaniasis.

PubMed Central

Minodier, P; Piarroux, R; Gambarelli, F; Joblet, C; Dumon, H

1997-01-01

Conventional methods for the identification of species of Leishmania parasite causing infections have limitations. By using a DNA-based alternative, the present study tries to develop a new tool for this purpose. Thirty-three patients living in Marseilles (in the south of France) were suffering from visceral or cutaneous leishmaniasis. DNA of the parasite in clinical samples (bone marrow, peripheral blood, or skin) from these patients was amplified by PCR and directly sequenced. The sequences observed were compared to those of 30 strains of the genus causing Old World leishmaniasis collected in Europe, Africa, or Asia. In the analysis of the sequences of the strains, two different sequence patterns for Leishmania infantum, one sequence for Leishmania donovani, one sequence for Leishmania major, two sequences for Leishmania tropica, and one sequence for Leishmania aethiopica were obtained. Four sequences were observed among the strains from the patients: one was similar to the sequence for the L. major strains, two were identical to the sequences for the L. infantum strains, and the last sequence was not observed within the strains but had a high degree of homology with the sequences of the L. infantum and L. donovani strains. The L. infantum strains from all immunocompetent patients had the same sequence. The L. infantum strains from immunodeficient patients suffering from visceral leishmaniasis had three different sequences. This fact might signify that some variants of L. infantum acquire pathogenicity exclusively in immunocompromised patients. To dispense with the sequencing step, a restriction assay with HaeIII was used. Some restriction patterns might support genetic exchanges in members of the genus Leishmania. PMID:9316906
Molecular evolution of the leptin exon 3 in some species of the family Canidae

PubMed Central

Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek

2003-01-01

The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris) – 16 different breeds, the Chinese raccoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) was studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical. PMID:12939206
Centromeric and non-centromeric satellite DNA organisation differs in holocentric Rhynchospora species.

PubMed

Ribeiro, Tiago; Marques, André; Novák, Petr; Schubert, Veit; Vanzela, André L L; Macas, Jiri; Houben, Andreas; Pedrosa-Harand, Andrea

2017-03-01

Satellite DNA repeats (or satDNA) are fast-evolving sequences usually associated with condensed heterochromatin. To test whether the chromosomal organisation of centromeric and non-centromeric satDNA differs in species with holocentric chromosomes, we identified and characterised the major satDNA families in the holocentric Cyperaceae species Rhynchospora ciliata (2n = 10), R. globosa (2n = 50) and R. tenuis (2n = 2x = 4 and 2n = 4x = 8). While conserved centromeric repeats (present in R. ciliata and R. tenuis) revealed linear signals at both chromatids, non-centromeric, species-specific satDNAs formed distinct clusters along the chromosomes. Colocalisation of both repeat types resulted in a ladder-like hybridisation pattern at mitotic chromosomes. In interphase, the centromeric satDNA was dispersed while non-centromeric satDNA clustered and partly colocalised to chromocentres. Despite the banding-like hybridisation patterns of the clustered satDNA, the identification of chromosome pairs was impaired due to the irregular hybridisation patterns of the homologues in R. tenuis and R. ciliata. These differences are probably caused by restricted or impaired meiotic recombination as reported for R. tenuis, or alternatively by complex chromosome rearrangements or unequal condensation of homologous metaphase chromosomes. Thus, holocentricity influences the chromosomal organisation leading to differences in the distribution patterns and condensation dynamics of centromeric and non-centromeric satDNA.
Genetic characterization of the Bifidobacterium breve UCC 2003 hrcA locus.

PubMed

Ventura, Marco; Canchaya, Carlos; Bernini, Valentina; Del Casale, Antonio; Dellaglio, Franco; Neviani, Erasmo; Fitzgerald, Gerald F; van Sinderen, Douwe

2005-12-01

The bacterial heat shock response is characterized by the elevated expression of a number of chaperone complexes and transcriptional regulators, including the DnaJ and the HrcA proteins. Genome analysis of Bifidobacterium breve UCC 2003 revealed a second copy of a dnaJ gene, named dnaJ2, which is flanked by the hrcA gene in a genetic constellation that appears to be unique to the actinobacteria. Phylogenetic analysis using 53 bacterial dnaJ sequences, including both dnaJ1 and dnaJ2 sequences, suggests that these genes have followed a different evolutionary development. Furthermore, the B. breve UCC 2003 dnaJ2 gene seems to be regulated in a manner that is different from that of the previously characterized dnaJ1 gene. The dnaJ2 gene, which was shown to be part of a 2.3-kb bicistronic operon with hrcA, was induced by osmotic shock but not significantly by heat stress. This induction pattern is unlike those of other characterized dnaJ genes and may be indicative of a unique stress adaptation strategy by this commensal microorganism.
Genetic Characterization of the Bifidobacterium breve UCC 2003 hrcA Locus

PubMed Central

Ventura, Marco; Canchaya, Carlos; Bernini, Valentina; Del Casale, Antonio; Dellaglio, Franco; Neviani, Erasmo; Fitzgerald, Gerald F.; van Sinderen, Douwe

2005-01-01

The bacterial heat shock response is characterized by the elevated expression of a number of chaperone complexes and transcriptional regulators, including the DnaJ and the HrcA proteins. Genome analysis of Bifidobacterium breve UCC 2003 revealed a second copy of a dnaJ gene, named dnaJ2, which is flanked by the hrcA gene in a genetic constellation that appears to be unique to the actinobacteria. Phylogenetic analysis using 53 bacterial dnaJ sequences, including both dnaJ1 and dnaJ2 sequences, suggests that these genes have followed a different evolutionary development. Furthermore, the B. breve UCC 2003 dnaJ2 gene seems to be regulated in a manner that is different from that of the previously characterized dnaJ1 gene. The dnaJ2 gene, which was shown to be part of a 2.3-kb bicistronic operon with hrcA, was induced by osmotic shock but not significantly by heat stress. This induction pattern is unlike those of other characterized dnaJ genes and may be indicative of a unique stress adaptation strategy by this commensal microorganism. PMID:16332909
Fractal assembly of micrometre-scale DNA origami arrays with arbitrary patterns.

PubMed

Tikhomirov, Grigory; Petersen, Philip; Qian, Lulu

2017-12-06

Self-assembled DNA nanostructures enable nanometre-precise patterning that can be used to create programmable molecular machines and arrays of functional materials. DNA origami is particularly versatile in this context because each DNA strand in the origami nanostructure occupies a unique position and can serve as a uniquely addressable pixel. However, the scale of such structures has been limited to about 0.05 square micrometres, hindering applications that demand a larger layout and integration with more conventional patterning methods. Hierarchical multistage assembly of simple sets of tiles can in principle overcome this limitation, but so far has not been sufficiently robust to enable successful implementation of larger structures using DNA origami tiles. Here we show that by using simple local assembly rules that are modified and applied recursively throughout a hierarchical, multistage assembly process, a small and constant set of unique DNA strands can be used to create DNA origami arrays of increasing size and with arbitrary patterns. We illustrate this method, which we term 'fractal assembly', by producing DNA origami arrays with sizes of up to 0.5 square micrometres and with up to 8,704 pixels, allowing us to render images such as the Mona Lisa and a rooster. We find that self-assembly of the tiles into arrays is unaffected by changes in surface patterns on the tiles, and that the yield of the fractal assembly process corresponds to about 0.95^(m-1) for arrays containing m tiles. When used in conjunction with a software tool that we developed that converts an arbitrary pattern into DNA sequences and experimental protocols, our assembly method is readily accessible and will facilitate the construction of sophisticated materials and devices with sizes similar to that of a bacterium using DNA nanostructures.
Fractal assembly of micrometre-scale DNA origami arrays with arbitrary patterns

NASA Astrophysics Data System (ADS)

Tikhomirov, Grigory; Petersen, Philip; Qian, Lulu

2017-12-01

Self-assembled DNA nanostructures enable nanometre-precise patterning that can be used to create programmable molecular machines and arrays of functional materials. DNA origami is particularly versatile in this context because each DNA strand in the origami nanostructure occupies a unique position and can serve as a uniquely addressable pixel. However, the scale of such structures has been limited to about 0.05 square micrometres, hindering applications that demand a larger layout and integration with more conventional patterning methods. Hierarchical multistage assembly of simple sets of tiles can in principle overcome this limitation, but so far has not been sufficiently robust to enable successful implementation of larger structures using DNA origami tiles. Here we show that by using simple local assembly rules that are modified and applied recursively throughout a hierarchical, multistage assembly process, a small and constant set of unique DNA strands can be used to create DNA origami arrays of increasing size and with arbitrary patterns. We illustrate this method, which we term ‘fractal assembly’, by producing DNA origami arrays with sizes of up to 0.5 square micrometres and with up to 8,704 pixels, allowing us to render images such as the Mona Lisa and a rooster. We find that self-assembly of the tiles into arrays is unaffected by changes in surface patterns on the tiles, and that the yield of the fractal assembly process corresponds to about 0.95^(m-1) for arrays containing m tiles. When used in conjunction with a software tool that we developed that converts an arbitrary pattern into DNA sequences and experimental protocols, our assembly method is readily accessible and will facilitate the construction of sophisticated materials and devices with sizes similar to that of a bacterium using DNA nanostructures.
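The yield relation quoted in the abstract, roughly 0.95 raised to the power (m - 1) for an array of m tiles, can be evaluated for a few array sizes. The following is a minimal sketch; the function name and the example tile counts are illustrative assumptions, not taken from the paper.

```python
def fractal_assembly_yield(m: int, per_step: float = 0.95) -> float:
    """Approximate yield for an array of m tiles, using per_step ** (m - 1).

    The 0.95 base comes from the abstract; the function itself is only
    an illustration of the arithmetic.
    """
    if m < 1:
        raise ValueError("m must be at least 1")
    return per_step ** (m - 1)


# Hypothetical tile counts, chosen only to show how the yield falls with m.
for m in (1, 4, 16, 64):
    print(m, round(fractal_assembly_yield(m), 3))
```

Under this relation a single tile has a yield of 1, and each additional tile multiplies the yield by about 0.95.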
Diversity of the Arabidopsis mitochondrial genome occurs via nuclear-controlled recombination activity.

PubMed

Arrieta-Montiel, Maria P; Shedge, Vikas; Davila, Jaime; Christensen, Alan C; Mackenzie, Sally A

2009-12-01

The plant mitochondrial genome is recombinogenic, with DNA exchange activity controlled to a large extent by nuclear gene products. One nuclear gene, MSH1, appears to participate in suppressing recombination in Arabidopsis at every repeated sequence ranging in size from 108 to 556 bp. Present in a wide range of plant species, these mitochondrial repeats display evidence of successful asymmetric DNA exchange in Arabidopsis when MSH1 is disrupted. Recombination frequency appears to be influenced by repeat sequence homology and size, with larger size repeats corresponding to increased DNA exchange activity. The extensive mitochondrial genomic reorganization of the msh1 mutant produced altered mitochondrial transcription patterns. Comparison of mitochondrial genomes from the Arabidopsis ecotypes C24, Col-0, and Ler suggests that MSH1 activity accounts for most or all of the polymorphisms distinguishing these genomes, producing ecotype-specific stoichiometric changes in each line. Our observations suggest that MSH1 participates in mitochondrial genome evolution by influencing the lineage-specific pattern of mitochondrial genetic variation in higher plants.
Circadian clock protein KaiC forms ATP-dependent hexameric rings and binds DNA

PubMed Central

Mori, Tetsuya; Saveliev, Sergei V.; Xu, Yao; Stafford, Walter F.; Cox, Michael M.; Inman, Ross B.; Johnson, Carl H.

2002-01-01

KaiC from Synechococcus elongatus PCC 7942 (KaiC) is an essential circadian clock protein in cyanobacteria. Previous sequence analyses suggested its inclusion in the RecA/DnaB superfamily. A characteristic of the proteins of this superfamily is that they form homohexameric complexes that bind DNA. We show here that KaiC also forms ring complexes with a central pore that can be visualized by electron microscopy. A combination of analytical ultracentrifugation and chromatographic analyses demonstrates that these complexes are hexameric. The association of KaiC molecules into hexamers depends on the presence of ATP. The KaiC sequence does not include the obvious DNA-binding motifs found in RecA or DnaB. Nevertheless, KaiC binds forked DNA substrates. These data support the inclusion of KaiC into the RecA/DnaB superfamily and have important implications for enzymatic activity of KaiC in the circadian clock mechanism that regulates global changes in gene expression patterns. PMID:12477935
SPLASH: structural pattern localization analysis by sequential histograms.

PubMed

Califano, A

2000-04-01

The discovery of sparse amino acid patterns that match repeatedly in a set of protein sequences is an important problem in computational biology. Statistically significant patterns, that is, patterns that occur more frequently than expected, may identify regions that have been preserved by evolution and which may therefore play a key functional or structural role. Sparseness can be important because a handful of non-contiguous residues may play a key role, while others, in between, may be changed without significant loss of function or structure. Similar arguments may be applied to conserved DNA patterns. Available sparse pattern discovery algorithms are either inefficient or impose limitations on the type of patterns that can be discovered. This paper introduces a deterministic pattern discovery algorithm, called Splash, which can find sparse amino or nucleic acid patterns matching identically or similarly in a set of protein or DNA sequences. Sparse patterns of any length, up to the size of the input sequence, can be discovered without significant loss in performance. Splash is extremely efficient and embarrassingly parallel by nature. Large databases, such as a complete genome or the non-redundant SWISS-PROT database, can be processed in a few hours on a typical workstation. Alternatively, a protein family or superfamily, with low overall homology, can be analyzed to discover common functional or structural signatures. Some examples of biologically interesting motifs discovered by Splash are reported for the histone I and for the G-Protein Coupled Receptor families. Due to its efficiency, Splash can be used to systematically and exhaustively identify conserved regions in protein family sets. These can then be used to build accurate and sensitive PSSM or HMM models for sequence analysis. Splash is available to non-commercial research centers upon request, conditional on the signing of a test field agreement.
acal@us.ibm.com, Splash main page http://www.research.ibm.com/splash
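To make the idea of a sparse pattern concrete, the sketch below counts how many sequences in a set contain a pattern in which only some positions are fixed. This is not the Splash algorithm, which discovers such patterns; it only checks a given pattern by brute force. The '.' wildcard convention, the function name, and the toy sequences are all assumptions made for illustration.

```python
import re


def sparse_pattern_support(pattern: str, sequences: list[str]) -> int:
    """Count the sequences that contain the sparse pattern at least once.

    In `pattern`, '.' stands for any residue and every other character
    must match exactly (a convention assumed for this sketch).
    """
    # Escape literal residues so that only '.' behaves as a wildcard.
    regex = re.compile(
        "".join("." if ch == "." else re.escape(ch) for ch in pattern)
    )
    return sum(1 for seq in sequences if regex.search(seq))


# Toy DNA sequences, invented for the example.
toy_sequences = ["TTACGCAGT", "GGAGTCTGA", "ACCCCG", "TTTTTT"]
print(sparse_pattern_support("A..C.G", toy_sequences))
```

A pattern's support (the number of sequences it matches) is the kind of quantity that would be compared with its expected frequency to judge statistical significance.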
Structural insight into the specificity of the B3 DNA-binding domains provided by the co-crystal structure of the C-terminal fragment of BfiI restriction enzyme

PubMed Central

Golovenko, Dmitrij; Manakova, Elena; Zakrys, Linas; Zaremba, Mindaugas; Sasnauskas, Giedrius; Gražulis, Saulius; Siksnys, Virginijus

2014-01-01

The B3 DNA-binding domains (DBDs) of plant transcription factors (TF) and DBDs of EcoRII and BfiI restriction endonucleases (EcoRII-N and BfiI-C) share a common structural fold, classified as the DNA-binding pseudobarrel. The B3 DBDs in the plant TFs recognize a diverse set of target sequences. The only available co-crystal structure of the B3-like DBD is that of EcoRII-N (recognition sequence 5′-CCTGG-3′). In order to understand the structural and molecular mechanisms of specificity of B3 DBDs, we have solved the crystal structure of BfiI-C (recognition sequence 5′-ACTGGG-3′) complexed with 12-bp cognate oligoduplex. Structural comparison of BfiI-C–DNA and EcoRII-N–DNA complexes reveals a conserved DNA-binding mode and a conserved pattern of interactions with the phosphodiester backbone. The determinants of the target specificity are located in the loops that emanate from the conserved structural core. The BfiI-C–DNA structure presented here expands a range of templates for modeling of the DNA-bound complexes of the B3 family of plant TFs. PMID:24423868
Isolation and characterization of full-length putative alcohol dehydrogenase genes from Polygonum minus

NASA Astrophysics Data System (ADS)

Hamid, Nur Athirah Abd; Ismail, Ismanizan

2013-11-01

Polygonum minus, locally known as Kesum, is an aromatic herb that is high in secondary metabolite content. Alcohol dehydrogenase (ADH) is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde in the presence of NAD(P)(H) as a co-factor. The main focus of this research is to identify the ADH gene. Total RNA was extracted from leaves of P. minus treated with 150 μM jasmonic acid. The full-length cDNA sequence of ADH was isolated via rapid amplification of cDNA ends (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence, and PCR was performed on genomic DNA to determine the exon and intron organization. Two ADH sequences, designated PmADH1 and PmADH2, were successfully isolated. Both sequences have an ORF of 801 bp that encodes 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that the two sequences are highly similar in the ORF region but divergent in the 3' untranslated regions (UTRs). The amino acid sequences differ at residue 107: PmADH1 contains a Gly (G) residue, while PmADH2 contains a Cys (C) residue. The intron-exon organization of both sequences is also the same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain the "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are members of the short chain alcohol dehydrogenase family.
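The reported figures (an ORF of 801 bp encoding 266 amino acid residues) are consistent if the ORF length is taken to include the stop codon: 801 / 3 = 267 codons, one of which terminates translation. A minimal sketch of that arithmetic follows; the function name and the stop-codon assumption are illustrative and not stated in the abstract.

```python
def orf_protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of orf_bp nucleotides.

    Assumes the ORF length is a multiple of three and, by default,
    that it includes the terminal stop codon.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


print(orf_protein_length(801))
```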

Mitochondrial Genome Sequences of Nematocera (Lower Diptera): Evidence of Rearrangement following a Complete Genome Duplication in a Winter Crane Fly

PubMed Central

Beckenbach, Andrew T.

2012-01-01

The complete mitochondrial DNA sequences of eight representatives of lower Diptera, suborder Nematocera, along with nearly complete sequences from two other species, are presented. These taxa represent eight families not previously represented by complete mitochondrial DNA sequences. Most of the sequences retain the ancestral dipteran mitochondrial gene arrangement, while one sequence, that of the midge Arachnocampa flava (family Keroplatidae), has an inversion of the trnE gene. The most unusual result is the extensive rearrangement of the mitochondrial genome of a winter crane fly, Paracladura trichoptera (family Trichocera). The pattern of rearrangement indicates that the mechanism of rearrangement involved a tandem duplication of the entire mitochondrial genome, followed by random and nonrandom loss of one copy of each gene. Another winter crane fly retains the ancestral dipteran gene arrangement. A preliminary mitochondrial phylogeny of the Diptera is also presented. PMID:22155689
DNA barcoding of gypsy moths from China (Lepidoptera: Erebidae) reveals new haplotypes and divergence patterns within gypsy moth subspecies

Treesearch

Fang Chen; Youqing Luo; Melody A. Keena; Ying Wu; Peng Wu; Juan Shi

2015-01-01

The gypsy moth from Asia (two subspecies) is considered a greater threat to North America than European gypsy moth, because of a broader host range and females being capable of flight. Variation within and among gypsy moths from China (nine locations), one of the native countries of Asian gypsy moth, were compared using DNA barcode sequences (658 bp of mtDNA cytochrome...
DNA sequence-selective C8-linked pyrrolobenzodiazepine-heterocyclic polyamide conjugates show anti-tubercular-specific activities.

PubMed

Brucoli, Federico; Guzman, Juan D; Basher, Mohammad A; Evangelopoulos, Dimitrios; McMahon, Eleanor; Munshi, Tulika; McHugh, Timothy D; Fox, Keith R; Bhakta, Sanjib

2016-12-01

New chemotherapeutic agents with novel mechanisms of action are urgently needed to combat the tuberculosis pandemic. A library of 12 C8-linked pyrrolo[2,1-c][1,4]benzodiazepine (PBD)-heterocyclic polyamide conjugates (1-12) was evaluated for anti-tubercular activity and DNA sequence selectivity. The PBD conjugates were screened against slow-growing Mycobacterium bovis Bacillus Calmette-Guérin and M. tuberculosis H37Rv, and fast-growing Escherichia coli, Pseudomonas putida and Rhodococcus sp. RHA1 bacteria. DNase I footprinting and DNA thermal denaturation experiments were used to determine the molecules' DNA recognition properties. The PBD conjugates were highly selective for the mycobacterial strains and exhibited significant growth inhibitory activity against the pathogenic M. tuberculosis H37Rv, with compound 4 showing MIC values (MIC = 0.08 mg l^-1) similar to those of rifampin and isoniazid. DNase I footprinting results showed that the PBD conjugates with three heterocyclic moieties had enhanced sequence selectivity and produced larger footprints, with distinct cleavage patterns compared with the two-heterocyclic chain PBD conjugates. DNA melting experiments indicated a covalent binding of the PBD conjugates to two AT-rich DNA-duplexes containing either a central GGATCC or GTATAC sequence, and showed that the polyamide chains affect the interactions of the molecules with DNA. The PBD-C8 conjugates tested in this study have a remarkable anti-mycobacterial activity and can be further developed as DNA-targeted anti-tubercular drugs.
Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S—a novel motif scan algorithm with optional secondary structure constraints

PubMed Central

Niv, Masha Y.; Skrabanek, Lucy; Roberts, Richard J.; Scheraga, Harold A.; Weinstein, Harel

2008-01-01

Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CCGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering. PMID:17972284
Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints.

PubMed

Niv, Masha Y; Skrabanek, Lucy; Roberts, Richard J; Scheraga, Harold A; Weinstein, Harel

2008-05-01

Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CCGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering.
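The PD-(D/E)XK hallmark named in the abstract can be written as an ordinary regular expression. The sketch below is not Scan2S and ignores secondary structure constraints; it only shows how such a motif might be scanned with Python's standard `re` module. The spacer bounds between the two halves of the motif and the toy protein sequence are assumptions chosen for illustration.

```python
import re

# "PD", a variable spacer, then (D/E)-X-K. The 5 to 20 residue spacer
# is a hypothetical choice for this sketch, not a value from the paper.
MOTIF = re.compile(r"PD.{5,20}[DE].K")


def find_motif(protein: str) -> list[tuple[int, str]]:
    """Return (start index, matched text) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in MOTIF.finditer(protein)]


# Invented toy sequence: "PD", eight alanines, then "EVK".
toy_protein = "MST" + "PD" + "A" * 8 + "EVK" + "LL"
print(find_motif(toy_protein))
```

A plain regular expression of this kind matches on sequence alone, which is why the abstract's addition of optional secondary structure constraints is a meaningful extension.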
Highly Iterated Palindromic Sequences (HIPs) and Their Relationship to DNA Methyltransferases

PubMed Central

Elhai, Jeff

2015-01-01

The sequence GCGATCGC (Highly Iterated Palindrome, HIP1) is commonly found in high frequency in cyanobacterial genomes. An important clue to its function may be the presence of two orphan DNA methyltransferases that recognize internal sequences GATC and CGATCG. An examination of genomes from 97 cyanobacteria, both free-living and obligate symbionts, showed that there are exceptional cases in which HIP1 is at a low frequency or nearly absent. In some of these cases, it appears to have been replaced by a different GC-rich palindromic sequence, alternate HIPs. When HIP1 is at a high frequency, GATC- and CGATCG-specific methyltransferases are generally present in the genome. When an alternate HIP is at high frequency, a methyltransferase specific for that sequence is present. The pattern of 1-nt deviations from HIP1 sequences is biased towards the first and last nucleotides, i.e., those that distinguish CGATCG from HIP1. Taken together, the results point to a role of DNA methylation in the creation or functioning of HIP sites. A model is presented that postulates the existence of a GmeC-dependent mismatch repair system whose activity creates and maintains HIP sequences. PMID:25789551
Highly Iterated Palindromic Sequences (HIPs) and Their Relationship to DNA Methyltransferases.

PubMed

Elhai, Jeff

2015-03-17

The sequence GCGATCGC (Highly Iterated Palindrome, HIP1) is commonly found in high frequency in cyanobacterial genomes. An important clue to its function may be the presence of two orphan DNA methyltransferases that recognize internal sequences GATC and CGATCG. An examination of genomes from 97 cyanobacteria, both free-living and obligate symbionts, showed that there are exceptional cases in which HIP1 is at a low frequency or nearly absent. In some of these cases, it appears to have been replaced by a different GC-rich palindromic sequence, alternate HIPs. When HIP1 is at a high frequency, GATC- and CGATCG-specific methyltransferases are generally present in the genome. When an alternate HIP is at high frequency, a methyltransferase specific for that sequence is present. The pattern of 1-nt deviations from HIP1 sequences is biased towards the first and last nucleotides, i.e., those that distinguish CGATCG from HIP1. Taken together, the results point to a role of DNA methylation in the creation or functioning of HIP sites. A model is presented that postulates the existence of a GmeC-dependent mismatch repair system whose activity creates and maintains HIP sequences.
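Two properties mentioned in the abstract are easy to check computationally: that GCGATCGC is palindromic in the DNA sense (equal to its own reverse complement), and where single-nucleotide deviations from it fall. The following is a minimal sketch on an invented toy sequence; the function names and the toy genome are assumptions, and the paper's actual analysis pipeline is not described in the abstract.

```python
from collections import Counter

HIP1 = "GCGATCGC"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def deviation_positions(genome: str, site: str = HIP1) -> Counter:
    """Tally, by position within the site, windows that differ from it by one base."""
    counts: Counter = Counter()
    k = len(site)
    for i in range(len(genome) - k + 1):
        window = genome[i:i + k]
        mismatches = [j for j in range(k) if window[j] != site[j]]
        if len(mismatches) == 1:
            counts[mismatches[0]] += 1
    return counts


# Toy sequence: one exact HIP1 site, one first-base variant, one last-base variant.
toy_genome = "TT" + "GCGATCGC" + "TT" + "ACGATCGC" + "TT" + "GCGATCGT" + "TT"
print(reverse_complement(HIP1) == HIP1)
print(toy_genome.count(HIP1))
print(dict(deviation_positions(toy_genome)))
```

Positions are zero-based, so in an 8-nt site the first and last nucleotides are positions 0 and 7; these are the positions lying outside the internal CGATCG sequence.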
DNA copy number changes define spatial patterns of heterogeneity in colorectal cancer

PubMed Central

Mamlouk, Soulafa; Childs, Liam Harold; Aust, Daniela; Heim, Daniel; Melching, Friederike; Oliveira, Cristiano; Wolf, Thomas; Durek, Pawel; Schumacher, Dirk; Bläker, Hendrik; von Winterfeld, Moritz; Gastl, Bastian; Möhr, Kerstin; Menne, Andrea; Zeugner, Silke; Redmer, Torben; Lenze, Dido; Tierling, Sascha; Möbs, Markus; Weichert, Wilko; Folprecht, Gunnar; Blanc, Eric; Beule, Dieter; Schäfer, Reinhold; Morkel, Markus; Klauschen, Frederick; Leser, Ulf; Sers, Christine

2017-01-01

Genetic heterogeneity between and within tumours is a major factor determining cancer progression and therapy response. Here we examined DNA sequence and DNA copy-number heterogeneity in colorectal cancer (CRC) by targeted high-depth sequencing of 100 most frequently altered genes. In 97 samples, with primary tumours and matched metastases from 27 patients, we observe inter-tumour concordance for coding mutations; in contrast, gene copy numbers are highly discordant between primary tumours and metastases as validated by fluorescent in situ hybridization. To further investigate intra-tumour heterogeneity, we dissected a single tumour into 68 spatially defined samples and sequenced them separately. We identify evenly distributed coding mutations in APC and TP53 in all tumour areas, yet highly variable gene copy numbers in numerous genes. 3D morpho-molecular reconstruction reveals two clusters with divergent copy number aberrations along the proximal–distal axis indicating that DNA copy number variations are a major source of tumour heterogeneity in CRC. PMID:28120820
Spatial and temporal plasticity of chromatin during programmed DNA-reorganization in Stylonychia macronuclear development

PubMed Central

Postberg, Jan; Heyse, Katharina; Cremer, Marion; Cremer, Thomas; Lipps, Hans J

2008-01-01

Background: In this study we exploit the unique genome organization of ciliates to characterize the biological function of histone modification patterns and chromatin plasticity for the processing of specific DNA sequences during a nuclear differentiation process. Ciliates are single-cell eukaryotes containing two morphologically and functionally specialized types of nuclei, the somatic macronucleus and the germline micronucleus. In the course of sexual reproduction a new macronucleus develops from a micronuclear derivative. During this process specific DNA sequences are eliminated from the genome, while sequences that will be transcribed in the mature macronucleus are retained. Results: We show by immunofluorescence microscopy, Western analyses and chromatin immunoprecipitation (ChIP) experiments that each nuclear type establishes its specific histone modification signature. Our analyses reveal that the early macronuclear anlage adopts a permissive chromatin state immediately after the fusion of two heterochromatic germline micronuclei. As macronuclear development progresses, repressive histone modifications that specify sequences to be eliminated are introduced de novo. ChIP analyses demonstrate that permissive histone modifications are associated with sequences that will be retained in the new macronucleus. Furthermore, our data support the hypothesis that a PIWI-family protein is involved in a transnuclear cross-talk and in the RNAi-dependent control of developmental chromatin reorganization. Conclusion: Based on these data we present a comprehensive analysis of the spatial and temporal pattern of histone modifications during this nuclear differentiation process. Results obtained in this study may also be relevant for our understanding of chromatin plasticity during metazoan embryogenesis. PMID:19014664
Accurate Prediction of Inducible Transcription Factor Binding Intensities In Vivo

PubMed Central

Siepel, Adam; Lis, John T.

2012-01-01

DNA sequence and local chromatin landscape act jointly to determine transcription factor (TF) binding intensity profiles. To disentangle these influences, we developed an experimental approach, called protein/DNA binding followed by high-throughput sequencing (PB–seq), that allows the binding energy landscape to be characterized genome-wide in the absence of chromatin. We applied our methods to the Drosophila Heat Shock Factor (HSF), which inducibly binds a target DNA sequence element (HSE) following heat shock stress. PB–seq involves incubating sheared naked genomic DNA with recombinant HSF, partitioning the HSF–bound and HSF–free DNA, and then detecting HSF–bound DNA by high-throughput sequencing. We compared PB–seq binding profiles with ones observed in vivo by ChIP–seq and developed statistical models to predict the observed departures from idealized binding patterns based on covariates describing the local chromatin environment. We found that DNase I hypersensitivity and tetra-acetylation of H4 were the most influential covariates in predicting changes in HSF binding affinity. We also investigated the extent to which DNA accessibility, as measured by digital DNase I footprinting data, could be predicted from MNase–seq data and the ChIP–chip profiles for many histone modifications and TFs, and found GAGA element associated factor (GAF), tetra-acetylation of H4, and H4K16 acetylation to be the most predictive covariates. Lastly, we generated an unbiased model of HSF binding sequences, which revealed distinct biophysical properties of the HSF/HSE interaction and a previously unrecognized substructure within the HSE. These findings provide new insights into the interplay between the genomic sequence and the chromatin landscape in determining transcription factor binding intensity. PMID:22479205
Molecular population dynamics of DNA structures in a bcl-2 promoter sequence is regulated by small molecules and the transcription factor hnRNP LL

PubMed Central

Cui, Yunxi; Koirala, Deepak; Kang, HyunJin; Dhakal, Soma; Yangyuoru, Philip; Hurley, Laurence H.; Mao, Hanbin

2014-01-01

Minute differences in the free energy change of unfolding among structures in an oligonucleotide sequence can lead to a complex population equilibrium, which is rather challenging for ensemble techniques to decipher. Herein, we introduce a new method, molecular population dynamics (MPD), to describe the intricate equilibrium among non-B deoxyribonucleic acid (DNA) structures. Using mechanical unfolding in laser tweezers, we identified six DNA species in a cytosine (C)-rich bcl-2 promoter sequence. Population patterns of these species with and without a small molecule (IMC-76 or IMC-48) or the transcription factor hnRNP LL are compared to reveal the MPD of different species. With a pattern recognition algorithm, we found that IMC-48 and hnRNP LL share 80% similarity in stabilizing i-motifs with 60 s incubation. In contrast, IMC-76 demonstrates an opposite behavior, preferring flexible DNA hairpins. With 120–180 s incubation, IMC-48 and hnRNP LL destabilize i-motifs, which has been previously proposed to activate bcl-2 transcription. These results provide strong support, from the population equilibrium perspective, that small molecules and hnRNP LL can modulate bcl-2 transcription through interaction with i-motifs. The excellent agreement with biochemical results firmly validates the MPD analyses, which, we expect, can be widely applicable to investigate complex equilibrium of biomacromolecules. PMID:24609386
Analysis and Dynamics of the Chromosomal Complements of Wild Sparkling-Wine Yeast Strains

PubMed Central

Nadal, Dolors; Carro, David; Fernández-Larrea, Juan; Piña, Benjamin

1999-01-01

We isolated Saccharomyces cerevisiae yeast strains that are able to carry out the second fermentation of sparkling wine from spontaneously fermenting musts in El Penedès (Spain) by specifically designed selection protocols. All of them (26 strains) showed one of two very similar mitochondrial DNA (mtDNA) restriction patterns, whereas their karyotypes differed. These strains showed high rates of karyotype instability, which were dependent on both the medium and the strain, during vegetative growth. In all cases, the mtDNA restriction pattern was conserved in strains kept under the same conditions. Analysis of different repetitive sequences in their genomes suggested that ribosomal DNA repeats play an important role in the changes in size observed in chromosome XII, whereas SUC genes or Ty elements did not show amplification or transposition processes that could be related to rearrangements of the chromosomes showing these sequences. Karyotype changes also occurred in monosporidic diploid derivatives. We propose that these changes originated mainly from ectopic recombination between repeated sequences interspersed in the genome. None of the rearranged karyotypes provided a selective advantage strong enough to allow the strains to displace the parental strains. The nature and frequency of these changes suggest that they may play an important role in the establishment and maintenance of the genetic diversity observed in S. cerevisiae wild populations. PMID:10103269
Random-breakage mapping method applied to human DNA sequences

NASA Technical Reports Server (NTRS)

Lobrich, M.; Rydberg, B.; Cooper, P. K.; Chatterjee, A. (Principal Investigator)

1996-01-01

The random-breakage mapping method [Game et al. (1990) Nucleic Acids Res., 18, 4453-4461] was applied to DNA sequences in human fibroblasts. The methodology involves NotI restriction endonuclease digestion of DNA from irradiated cells, followed by pulsed-field gel electrophoresis, Southern blotting and hybridization with DNA probes recognizing the single copy sequences of interest. The Southern blots show a band for the unbroken restriction fragments and a smear below this band due to radiation induced random breaks. This smear pattern contains two discontinuities in intensity at positions that correspond to the distance of the hybridization site to each end of the restriction fragment. By analyzing the positions of those discontinuities we confirmed the previously mapped position of the probe DXS1327 within a NotI fragment on the X chromosome, thus demonstrating the validity of the technique. We were also able to position the probes D21S1 and D21S15 with respect to the ends of their corresponding NotI fragments on chromosome 21. A third chromosome 21 probe, D21S11, has previously been reported to be close to D21S1, although an uncertainty about a second possible location existed. Since both probes D21S1 and D21S11 hybridized to a single NotI fragment and yielded a similar smear pattern, this uncertainty is removed by the random-breakage mapping method.
Current and Emerging Technologies for the Analysis of the Genome-Wide and Locus-Specific DNA Methylation Patterns.

PubMed

Tost, Jörg

2016-01-01

DNA methylation is the most studied epigenetic modification, and altered DNA methylation patterns have been identified in cancer and more recently also in many other complex diseases. Furthermore, DNA methylation is influenced by a variety of environmental factors, and the analysis of DNA methylation patterns might allow deciphering previous exposure. Although a large number of techniques to study DNA methylation either genome-wide or at specific loci have been devised, they all are based on a limited number of principles for differentiating the methylation state, viz., methylation-specific/methylation-dependent restriction enzymes, antibodies or methyl-binding proteins, chemical-based enrichment, or bisulfite conversion. Second-generation sequencing has largely replaced microarrays as readout platform and is also becoming more popular for locus-specific DNA methylation analysis. In this chapter, the currently used methods for both genome-wide and locus-specific analysis of 5-methylcytosine as well as its oxidative derivatives, such as 5-hydroxymethylcytosine, are reviewed in detail, and the advantages and limitations of each approach are discussed. Furthermore, emerging technologies avoiding PCR amplification and allowing a direct readout of DNA methylation are summarized, together with novel applications, such as the detection of DNA methylation in single cells or in circulating cell-free DNA.
Land, language, and loci: mtDNA in Native Americans and the genetic history of Peru.

PubMed

Lewis, Cecil M; Tito, Raúl Y; Lizárraga, Beatriz; Stone, Anne C

2005-07-01

Despite a long history of complex societies and despite extensive present-day linguistic and ethnic diversity, relatively few populations in Peru have been sampled for population genetic investigations. In order to address questions about the relationships between South American populations and about the extent of correlation between genetic distance, language, and geography in the region, mitochondrial DNA (mtDNA) hypervariable region I sequences and mtDNA haplogroup markers were examined in 33 individuals from the state of Ancash, Peru. These sequences were compared to those from 19 American Indian populations using diversity estimates, AMOVA tests, mismatch distributions, a multidimensional scaling plot, and regressions. The results show correlations between genetics, linguistics, and geographical affinities, with stronger correlations between genetics and language. Additionally, the results suggest a pattern of differential gene flow and drift in western vs. eastern South America, supporting previous mtDNA and Y chromosome investigations. (c) 2004 Wiley-Liss, Inc
Patterns of DNA variation among three centromere satellite families in Arabidopsis halleri and A. lyrata.

PubMed

Kawabe, Akira; Charlesworth, Deborah

2007-02-01

We describe patterns of DNA variation among the three centromeric satellite families in Arabidopsis halleri and lyrata. The newly studied subspecies (A. halleri ssp. halleri and A. lyrata ssp. lyrata and petraea), like the previously studied A. halleri ssp. gemmifera and A. lyrata ssp. kawasakiana, have three different centromeric satellite families, the older pAa family (also present in A. arenosa) and two families, pAge1 and pAge2, that probably evolved more recently. Sequence variability is high in all three satellite families, and the pAa sequences do not cluster by their species of origin. Diversity in the pAge2 family is complex, and different from variation among copies of the other two families, showing clear evidence for exchange events among family members, especially in A. halleri ssp. halleri. In A. lyrata ssp. lyrata there is some evidence for recent rapid spread of pAge2 variants, suggesting selection favoring these sequences.
Reclassification of Lactobacillus kimchii and Lactobacillus bobalius as later subjective synonyms of Lactobacillus paralimentarius.

PubMed

Pang, Huili; Kitahara, Maki; Tan, Zhongfang; Wang, Yanping; Qin, Guangyong; Ohkuma, Moriya; Cai, Yimin

2012-10-01

Characterization and identification of strain CW 1 ( = JCM 17161) isolated from corn silage were performed. Strain CW 1 was a Gram-positive, catalase-negative and homofermentative rod that produced the DL-form of lactic acid. This strain exhibited more than 99.6% 16S rRNA gene sequence similarity and greater than 82% DNA-DNA reassociation with type strains of Lactobacillus kimchii, L. bobalius and L. paralimentarius. To clarify the taxonomic positions of these type strains, phenotypic characterization, 16S rRNA gene sequencing, ribotyping and DNA-DNA relatedness were examined. The three type strains displayed different L-arabinose, lactose, melibiose, melezitose, raffinose and N-acetyl-β-glucosaminidase fermentation patterns. Phylogenetic analysis showed that L. paralimentarius is a closer neighbour of L. kimchii and L. bobalius, sharing 99.5-99.9% 16S rRNA gene sequence similarity, which was confirmed by the high DNA-DNA relatedness (≥82%) between L. paralimentarius JCM 10415(T), L. bobalius JCM 16180(T) and L. kimchii JCM 10707(T). Therefore, it is proposed that L. kimchii and L. bobalius should be reclassified as later synonyms of L. paralimentarius.
Distinct patterns of alteration of myc genes associated with integration of human papillomavirus type 16 or type 45 DNA in two genital tumours.

PubMed

Sastre-Garau, X; Favre, M; Couturier, J; Orth, G

2000-08-01

We previously described two genital carcinomas (IC2, IC4) containing human papillomavirus type 16 (HPV-16)- or HPV-18-related sequences integrated in chromosomal bands containing the c-myc (8q24) or N-myc (2p24) gene, respectively. The c-myc gene was rearranged and amplified in IC2 cells without evidence of overexpression. The N-myc gene was amplified and highly transcribed in IC4 cells. Here, the sequence of an 8039 bp IC4 DNA fragment containing the integrated viral sequences and the cellular junctions is reported. A 3948 bp segment of the genome of HPV-45 encompassing the upstream regulatory region and the E6 and E7 ORFs was integrated into the untranslated part of N-myc exon 3, upstream of the N-myc polyadenylation signal. Both N-myc and HPV-45 sequences were amplified 10- to 20-fold. The 3' ends of the major N-myc transcript were mapped upstream of the 5' junction. A minor N-myc/HPV-45 fusion transcript was also identified, as well as two abundant transcripts from the HPV-45 E6-E7 region. Large amounts of N-myc protein were detected in IC4 cells. A major alteration of c-myc sequences in IC2 cells involved the insertion of a non-coding sequence into the second intron and their co-amplification with the third exon, without any evidence for the integration of HPV-16 sequences within or close to the gene. Different patterns of myc gene alterations may thus be associated with integration of HPV DNA in genital tumours, including the activation of the protooncogene via a mechanism of insertional mutagenesis and/or gene amplification.
Single-Cell RNA Sequencing of Glioblastoma Cells.

PubMed

Sen, Rajeev; Dolgalev, Igor; Bayin, N Sumru; Heguy, Adriana; Tsirigos, Aris; Placantonakis, Dimitris G

2018-01-01

Single-cell RNA sequencing (sc-RNASeq) is a recently developed technique used to evaluate the transcriptome of individual cells. As opposed to conventional RNASeq in which entire populations are sequenced in bulk, sc-RNASeq can be beneficial when trying to better understand gene expression patterns in markedly heterogeneous populations of cells or when trying to identify transcriptional signatures of rare cells that may be underrepresented when using conventional bulk RNASeq. In this method, we describe the generation and analysis of cDNA libraries from single patient-derived glioblastoma cells using the C1 Fluidigm system. The protocol details the use of the C1 integrated fluidics circuit (IFC) for capturing, imaging and lysing cells; performing reverse transcription; and generating cDNA libraries that are ready for sequencing and analysis.
Genomics and museum specimens.

PubMed

Nachman, Michael W

2013-12-01

Nearly 25 years ago, Allan Wilson and colleagues isolated DNA sequences from museum specimens of kangaroo rats (Dipodomys panamintinus) and compared these sequences with those from freshly collected animals (Thomas et al. 1990). The museum specimens had been collected up to 78 years earlier, so the two samples provided a direct temporal comparison of patterns of genetic variation. This was not the first time DNA sequences had been isolated from preserved material, but it was the first time it had been carried out with a population sample. Population geneticists often try to make inferences about the influence of historical processes such as selection, drift, mutation and migration on patterns of genetic variation in the present. The work of Wilson and colleagues was important in part because it suggested a way in which population geneticists could actually study genetic change in natural populations through time, much the same way that experimentalists can do with artificial populations in the laboratory. Indeed, the work of Thomas et al. (1990) spawned dozens of studies in which museum specimens were used to compare historical and present-day genetic diversity (reviewed in Wandeler et al. 2007). All of these studies, however, were limited by the same fundamental problem: old DNA is degraded into short fragments. As a consequence, these studies mostly involved PCR amplification of short templates, usually short stretches of mitochondrial DNA or microsatellites. In this issue, Bi et al. (2013) report a breakthrough that should open the door to studies of genomic variation in museum specimens. They used target enrichment (exon capture) and next-generation (Illumina) sequencing to compare patterns of genetic variation in historic and present-day population samples of alpine chipmunks (Tamias alpinus) (Fig. 1). The historic samples came from specimens collected in 1915, so the temporal span of this comparison is nearly 100 years. © 2013 John Wiley & Sons Ltd.

cWINNOWER algorithm for finding fuzzy DNA motifs

NASA Technical Reports Server (NTRS)

Liang, S.; Samanta, M. P.; Biegel, B. A.

2004-01-01

The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if a clique consisting of a sufficiently large number of mutated copies of the motif (i.e., the signals) is present in the DNA sequence. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum detectable clique size qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12,000 for (l, d) = (15, 4). Copyright Imperial College Press.
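The graph idea behind this abstract can be illustrated with a minimal Python sketch. It is not the cWINNOWER implementation: it only uses the fact that two signals derived from the same motif with at most d mutations each can differ from one another at no more than 2d positions, links such windows, and counts the three-member sub-cliques that contain each link. The consensus constraint described in the abstract is omitted, and the function names and the non-overlap rule are assumptions made for illustration.

```python
from itertools import combinations


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def build_signal_graph(seq, l, d):
    """Link non-overlapping windows of length l whose Hamming distance is <= 2d.

    Vertices are window start positions. Two copies of one motif, each
    carrying at most d mutations, always satisfy this distance bound.
    """
    starts = range(len(seq) - l + 1)
    neighbors = {i: set() for i in starts}
    for i, j in combinations(starts, 2):
        # Assumption for this sketch: only non-overlapping windows are linked.
        if j - i >= l and hamming(seq[i:i + l], seq[j:j + l]) <= 2 * d:
            neighbors[i].add(j)
            neighbors[j].add(i)
    return neighbors


def triangle_support(neighbors):
    """For every edge, count the common neighbours of its two endpoints,
    i.e. the three-member sub-cliques that contain the edge."""
    support = {}
    for i, nbrs in neighbors.items():
        for j in nbrs:
            if i < j:
                support[(i, j)] = len(nbrs & neighbors[j])
    return support


# Hypothetical toy sequence: three mutated copies of ACGTACGT (l = 8, d = 1)
# at positions 0, 16 and 32, separated by filler.
toy = "ACGTACGT" + "TTTTTTTT" + "ACGAACGT" + "GGGGGGGG" + "ACGTACCT"
graph = build_signal_graph(toy, l=8, d=1)
support = triangle_support(graph)
```

Edges with low support are the natural candidates for pruning in a winnowing-style procedure; how aggressively to prune is a design choice this sketch leaves open.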
cWINNOWER Algorithm for Finding Fuzzy DNA Motifs

NASA Technical Reports Server (NTRS)

Liang, Shoudan

2003-01-01

The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if multiple mutated copies of the motif (i.e., the signals) are present in the DNA sequence in sufficient abundance. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum number of detectable motifs qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12000 for (l,d) = (15,4).
The use of museum specimens with high-throughput DNA sequencers

PubMed Central

Burrell, Andrew S.; Disotell, Todd R.; Bergey, Christina M.

2015-01-01

Natural history collections have long been used by morphologists, anatomists, and taxonomists to probe the evolutionary process and describe biological diversity. These biological archives also offer great opportunities for genetic research in taxonomy, conservation, systematics, and population biology. They allow assays of past populations, including those of extinct species, giving context to present patterns of genetic variation and direct measures of evolutionary processes. Despite this potential, museum specimens are difficult to work with because natural postmortem processes and preservation methods fragment and damage DNA. These problems have restricted geneticists’ ability to use natural history collections primarily by limiting how much of the genome can be surveyed. Recent advances in DNA sequencing technology, however, have radically changed this, making truly genomic studies from museum specimens possible. We review the opportunities and drawbacks of the use of museum specimens, and suggest how to best execute projects when incorporating such samples. Several high-throughput (HT) sequencing methodologies, including whole genome shotgun sequencing, sequence capture, and restriction digests (demonstrated here), can be used with archived biomaterials. PMID:25532801
Altered imprinted gene expression and methylation patterns in mid-gestation aborted cloned porcine fetuses and placentas.

PubMed

Zhang, Xiaoyang; Wang, Dongxu; Han, Yang; Duan, Feifei; Lv, Qinyan; Li, Zhanjun

2014-11-01

To determine the expression patterns of imprinted genes and their methylation status in aborted cloned porcine fetuses and placentas. RNA and DNA were prepared from fetuses and placentas that were produced by SCNT and controls from artificial insemination. The expression of 18 imprinted genes was determined by quantitative real-time PCR (q-PCR). Bisulfite sequencing PCR (BSP) was conducted to determine the methylation status of PRE-1 short interspersed repetitive element (SINE), satellite DNA and H19 differentially methylated region 3 (DMR3). The weight, imprinted gene expression and genome-wide DNA methylation patterns were compared between the mid-gestation aborted and normal control samples. The results showed hypermethylation of PRE-1 and satellite sequences, the aberrant expression of imprinted genes, and the hypomethylation of H19 DMR3 occurred in mid-gestation aborted fetuses and placentas. Cloned pigs generated by somatic cell nuclear transfer (SCNT) showed a greater ratio of early abortion during mid-gestation than did normal controls because of the incomplete epigenetic reprogramming of the donor cells. Altered expression of imprinted genes and the hypermethylation profile of the repetitive regions (PRE-1 and satellite DNA) may be associated with defective development and early abortion of cloned pigs, emphasizing the importance of epigenetics during pregnancy and implications thereof for patient-specific embryonic stem cells for human therapeutic cloning and improvement of human assisted reproduction.
Phylogeography of the bark beetle Dendroctonus mexicanus Hopkins (Coleoptera: Curculionidae: Scolytinae)

Treesearch

Miguel A. Anducho-Reyes; Anthony I. Cognato; Jane L. Hayes; Gerardo. Zuniga

2008-01-01

Dendroctonus mexicanus is polyphagous within the Pinus genus and has a wide geographical distribution in Mexico and Guatemala. We examined the pattern of genetic variation across the range of this species to explore its demographic history and its phylogeographic pattern. Analysis of the mtDNA sequences of 173 individuals from...
Dasytricha dominance in Surti buffalo rumen revealed by 18S rRNA sequences and real-time PCR assay.

PubMed

Singh, K M; Tripathi, A K; Pandya, P R; Rank, D N; Kothari, R K; Joshi, C G

2011-09-01

The genetic diversity of protozoa in Surti buffalo rumen was studied by amplified ribosomal DNA restriction analysis, 18S rDNA sequence homology, phylogenetic analysis and real-time PCR. Three animals were fed a diet comprising green fodder Napier bajra 21 (Pennisetum purpureum), mature pasture grass (Dicanthium annulatum) and concentrate mixture (20% crude protein, 65% total digestible nutrients). A protozoa-specific primer (P-SSU-342f) and a eukarya-specific primer (Medlin B) were used to amplify a 1,360 bp fragment of DNA encoding protozoal small subunit (SSU) ribosomal RNA from rumen fluid. A total of 91 clones were examined, and 14 different 18S RNA sequences were identified based on PCR-RFLP pattern. Based on 18S rDNA database sequences, these 14 phylotypes were distributed into four genera and identified as Dasytricha (57 clones), Isotricha (14 clones), Ostracodinium (11 clones) and Polyplastron (9 clones). Phylogenetic analyses were also used to infer the makeup of protozoa communities in the rumen of Surti buffalo. Out of 14 sequences, 8 sequences (69 clones) clustered with the Dasytricha ruminantium-like clone and 4 sequences (13 clones) were also phylogenetically placed with the Isotricha prostoma-like clone. Moreover, 2 phylotypes (9 clones) were related to the Polyplastron multivesiculatum-like clone. In addition, the number of 18S rDNA gene copies of Dasytricha ruminantium (0.05% to ciliate protozoa) was higher than that of Entodinium sp. (2.0 × 10^5 vs. 1.3 × 10^4 per ml of ruminal fluid).
Decision Tree Algorithm-Generated Single-Nucleotide Polymorphism Barcodes of rbcL Genes for 38 Brassicaceae Species Tagging.

PubMed

Yang, Cheng-Hong; Wu, Kuo-Chuan; Chuang, Li-Yeh; Chang, Hsueh-Wei

2018-01-01

DNA barcode sequences are accumulating in large data sets. A barcode is generally a sequence larger than 1000 base pairs and generates a computational burden. Although the DNA barcode was originally envisioned as a straightforward species tag, the use of barcode sequences for identification is rarely emphasized currently. Single-nucleotide polymorphism (SNP) association studies suggest that SNPs may be an ideal target of feature selection to discriminate between different species. We hypothesize that SNP-based barcodes may be more effective than the full length of DNA barcode sequences for species discrimination. To address this issue, we tested a ribulose diphosphate carboxylase (rbcL) SNP barcoding (RSB) strategy using a decision tree algorithm. After alignment and trimming, 31 SNPs were discovered in the rbcL sequences from 38 Brassicaceae plant species. In the decision tree construction, these SNPs were computed to set up the decision rule to assign the sequences into 2 groups level by level. After algorithm processing, 37 nodes and 31 loci were required for discriminating 38 species. Finally, the sequence tags consisting of 31 rbcL SNP barcodes were identified for discriminating 38 Brassicaceae species based on the decision tree-selected SNP pattern using the RSB method. Taken together, this study provides the rationale that the SNP aspect of the DNA barcode for the rbcL gene is a useful and effective sequence for tagging 38 Brassicaceae species.
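The level-by-level two-group splitting described in this abstract can be sketched in Python as follows. This is an illustrative toy, not the RSB method itself: the splitting criterion (the locus giving the most balanced two-way split), the function names and the four made-up sequences are all assumptions. One property that does carry over is that a binary tree separating n species into single-species leaves has n - 1 split nodes, which is consistent with the 37 nodes reported for 38 species.

```python
from collections import Counter


def snp_columns(seqs):
    """Return the alignment columns at which the sequences differ."""
    length = len(next(iter(seqs.values())))
    return [i for i in range(length)
            if len({s[i] for s in seqs.values()}) > 1]


def build_tree(seqs, loci):
    """Recursively split species into two groups by the allele at one locus.

    A leaf is a species name; an internal node is the tuple
    (locus, allele, subtree_with_allele, subtree_without_allele).
    """
    if len(seqs) == 1:
        return next(iter(seqs))
    best = None
    for i in loci:
        counts = Counter(s[i] for s in seqs.values())
        if len(counts) < 2:
            continue  # this locus does not vary within the current group
        allele, n = counts.most_common(1)[0]
        balance = abs(2 * n - len(seqs))  # 0 means a perfectly even split
        if best is None or balance < best[0]:
            best = (balance, i, allele)
    if best is None:
        raise ValueError("remaining species cannot be told apart")
    _, locus, allele = best
    with_allele = {k: s for k, s in seqs.items() if s[locus] == allele}
    without = {k: s for k, s in seqs.items() if s[locus] != allele}
    return (locus, allele,
            build_tree(with_allele, loci), build_tree(without, loci))


def count_internal(tree):
    """Number of split nodes in the tree."""
    if isinstance(tree, str):
        return 0
    return 1 + count_internal(tree[2]) + count_internal(tree[3])


def classify(tree, seq):
    """Follow the split rules from the root until a species leaf is reached."""
    while not isinstance(tree, str):
        locus, allele, with_allele, without = tree
        tree = with_allele if seq[locus] == allele else without
    return tree


# Hypothetical aligned toy sequences (not rbcL data).
toy = {"sp_A": "ACGT", "sp_B": "ACGA", "sp_C": "TCGA", "sp_D": "TGGA"}
loci = snp_columns(toy)
tree = build_tree(toy, loci)
```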
Analysis of methylated patterns and quality-related genes in tobacco (Nicotiana tabacum) cultivars.

PubMed

Jiao, Junna; Jia, Yanlong; Lv, Zhuangwei; Sun, Chuanfei; Gao, Lijie; Yan, Xiaoxiao; Cui, Liusu; Tang, Zongxiang; Yan, Benju

2014-08-01

Methylation-sensitive amplified polymorphism was used in this study to investigate epigenetic information of four tobacco cultivars: Yunyan 85, NC89, K326, and Yunyan 87. The DNA fragments with methylated information were cloned by reamplified PCR and sequenced. The results of Blast alignments showed that the genes with methylation information included chitinase, nitrate reductase, chloroplast DNA, mitochondrial DNA, ornithine decarboxylase, ribulose carboxylase, and promoter sequences. Homologous comparison in three cloned gene sequences (nitrate reductase, ornithine decarboxylase, and ribulose decarboxylase) indicated that geographic factors had significant influence on the whole genome methylation. Introns also contained different information in different tobacco cultivars. These findings suggest that synthetic mechanisms for tobacco aromatic components could be affected by different environmental factors leading to variation of noncoding regions in the genome, which finally results in different fragrance and taste in different tobacco cultivars.
Minimalist Approach to Complexity: Templating the Assembly of DNA Tile Structures with Sequentially Grown Input Strands.

PubMed

Lau, Kai Lin; Sleiman, Hanadi F

2016-07-26

Given its highly predictable self-assembly properties, DNA has proven to be an excellent template toward the design of functional materials. Prominent examples include the remarkable complexity provided by DNA origami and single-stranded tile (SST) assemblies, which require hundreds of unique component strands. However, in many cases, the majority of the DNA assembly is purely structural, and only a small "working area" needs to be aperiodic. On the other hand, extended lattices formed by DNA tile motifs require only a few strands; but they suffer from lack of size control and limited periodic patterning. To overcome these limitations, we adopt a templation strategy, where an input strand of DNA dictates the size and patterning of resultant DNA tile structures. To prepare these templating input strands, a sequential growth technique developed in our lab is used, whereby extended DNA strands of defined sequence and length may be generated simply by controlling their order of addition. With these, we demonstrate the periodic patterning of size-controlled double-crossover (DX) and triple-crossover (TX) tile structures, as well as intentionally designed aperiodicity of a DX tile structure. As such, we are able to prepare size-controlled DNA structures featuring aperiodicity only where necessary with exceptional economy and efficiency.
DNA fingerprinting of Brassica juncea cultivars using microsatellite probes.

PubMed

Bhatia, S; Das, S; Jain, A; Lakshmikumaran, M

1995-09-01

The genetic variability in the Brassica juncea cultivars was detected by employing in-gel hybridization of restricted DNA to simple repetitive sequences such as (GATA)4, (GACA)4 and (CAC)5. The most informative probe/enzyme combination was (GATA)4/EcoRI, yielding highly polymorphic fingerprint patterns for the B. juncea cultivars. This technique was found to be dependable for establishing the variety specific patterns for most of the cultivars studied, a prerequisite for germplasm preservation. The results of the present study were compared with those reported in our earlier study in which random amplification of polymorphic DNA (RAPD) was used for assessing the genetic variability in the B. juncea cultivars.
A periodic pattern of SNPs in the human genome

PubMed Central

Madsen, Bo Eskerod; Villesen, Palle; Wiuf, Carsten

2007-01-01

By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing or alignment errors, for example, transposable elements (SINE, LINE, and LTR), tandem repeats, and large duplicated regions. However, we found that the pattern is almost entirely confined to what we define as “periodic DNA.” Periodic DNA is a genomic region with a high degree of periodicity in nucleotide usage. It turned out that periodic DNA is mainly small regions (average length 16.9 bp), widely distributed in the genome. Furthermore, periodic DNA has a 1.8 times higher SNP density than the rest of the genome and SNPs inside periodic DNA have a significantly higher genotyping error rate than SNPs outside periodic DNA. Our results suggest that not all SNPs in the human genome are created by independent single nucleotide mutations, and that care should be taken in analysis of SNPs from periodic DNA. The latter may have important consequences for SNP and association studies. PMID:17673700
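The kind of spacing tally this abstract relies on can be sketched in a few lines of Python. The sketch assumes that every pair of SNPs within 9 bp on one chromosome is counted (the abstract does not say whether only adjacent SNPs were used), and the function name and example coordinates are hypothetical.

```python
from collections import Counter


def spacing_histogram(positions, max_dist=9):
    """Count SNP pairs by the distance (in bp) separating them.

    positions: SNP coordinates on a single chromosome.
    Only pairs at most max_dist apart are counted.
    """
    pos = sorted(set(positions))
    hist = Counter()
    for i, p in enumerate(pos):
        for q in pos[i + 1:]:
            dist = q - p
            if dist > max_dist:
                break  # positions are sorted, so later ones are farther away
            hist[dist] += 1
    return hist


# Hypothetical coordinates, for illustration only.
hist = spacing_histogram([10, 12, 14, 20, 100])
```

Comparing such a histogram inside and outside candidate periodic regions would be one way to look for the even/odd imbalance the authors describe.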
Finding functional features in Saccharomyces genomes by phylogenetic footprinting.

PubMed

Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark

2003-07-04

The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
Detecting the borders between coding and non-coding DNA regions in prokaryotes based on recursive segmentation and nucleotide doublets statistics

PubMed Central

2012-01-01

Background: Detecting the borders between coding and non-coding regions is an essential step in genome annotation. Information entropy measures are useful for describing the signals in genome sequences. However, the accuracy of previous methods of finding borders based on entropy segmentation still needs to be improved. Methods: In this study, we first applied a new recursive entropic segmentation method on DNA sequences to get preliminary significant cuts. A 22-symbol alphabet is used to capture the differential composition of nucleotide doublets and stop codon patterns along three phases in both DNA strands. This process requires no prior training datasets. Results: Compared with the previous segmentation methods, the experimental results on three bacterial genomes, Rickettsia prowazekii, Borrelia burgdorferi and E. coli, show that our approach improves the accuracy for finding the borders between coding and non-coding regions in DNA sequences. Conclusions: This paper presents a new segmentation method in prokaryotes based on Jensen-Rényi divergence with a 22-symbol alphabet. For three bacterial genomes, compared with the A12_JR method, our method raised the accuracy of finding the borders between protein coding and non-coding regions in DNA sequences. PMID:23282225
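A single step of entropic segmentation with a Jensen-Rényi divergence can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the paper's method: it uses the plain four-letter nucleotide alphabet instead of the 22-symbol alphabet, a Rényi order of alpha = 0.5 chosen arbitrarily, length-proportional weights, and it finds only one cut rather than recursing with a significance test. All function names are hypothetical.

```python
import math
from collections import Counter

ALPHABET = "ACGT"  # simplification: the paper uses a 22-symbol alphabet


def freqs(seq):
    """Relative frequency of each nucleotide in a non-empty sequence."""
    counts = Counter(seq)
    return [counts[b] / len(seq) for b in ALPHABET]


def renyi_entropy(p, alpha=0.5):
    """Rényi entropy of order alpha (alpha != 1) of a distribution p."""
    return math.log(sum(x ** alpha for x in p if x > 0)) / (1.0 - alpha)


def jensen_renyi(left, right, alpha=0.5):
    """Entropy of the length-weighted mixture minus the weighted mean
    of the entropies of the two segments."""
    n = len(left) + len(right)
    w1, w2 = len(left) / n, len(right) / n
    p1, p2 = freqs(left), freqs(right)
    mix = [w1 * a + w2 * b for a, b in zip(p1, p2)]
    return renyi_entropy(mix, alpha) - (
        w1 * renyi_entropy(p1, alpha) + w2 * renyi_entropy(p2, alpha))


def best_cut(seq, alpha=0.5, margin=1):
    """Position that maximises the divergence between the two sides.

    margin keeps at least that many symbols on each side of the cut.
    """
    return max(range(margin, len(seq) - margin + 1),
               key=lambda i: jensen_renyi(seq[:i], seq[i:], alpha))


# Hypothetical toy sequence with an obvious composition change at position 20.
cut = best_cut("A" * 20 + "C" * 20)
```

A recursive version would apply `best_cut` again to each side of an accepted cut and stop when the divergence is no longer significant; the stopping rule is left out here.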
Brown and polar bear Y chromosomes reveal extensive male-biased gene flow within brother lineages.

PubMed

Bidon, Tobias; Janke, Axel; Fain, Steven R; Eiken, Hans Geir; Hagen, Snorre B; Saarma, Urmas; Hallström, Björn M; Lecomte, Nicolas; Hailer, Frank

2014-06-01

Brown and polar bears have become prominent examples in phylogeography, but previous phylogeographic studies relied largely on maternally inherited mitochondrial DNA (mtDNA) or were geographically restricted. The male-specific Y chromosome, a natural counterpart to mtDNA, has remained underexplored. Although this paternally inherited chromosome is indispensable for comprehensive analyses of phylogeographic patterns, technical difficulties and low variability have hampered its application in most mammals. We developed 13 novel Y-chromosomal sequence and microsatellite markers from the polar bear genome and screened these in a broad geographic sample of 130 brown and polar bears. We also analyzed a 390-kb-long Y-chromosomal scaffold using sequencing data from published male ursine genomes. Y chromosome evidence supports the emerging understanding that brown and polar bears started to diverge no later than the Middle Pleistocene. Contrary to mtDNA patterns, we found 1) brown and polar bears to be reciprocally monophyletic sister (or rather brother) lineages, without signals of introgression, 2) male-biased gene flow across continents and on phylogeographic time scales, and 3) male dispersal that links the Alaskan ABC islands population to mainland brown bears. Due to female philopatry, mtDNA provides a highly structured estimate of population differentiation, while male-biased gene flow is a homogenizing force for nuclear genetic variation. Our findings highlight the importance of analyzing both maternally and paternally inherited loci for a comprehensive view of phylogeographic history, and that mtDNA-based phylogeographic studies of many mammals should be reevaluated. Recent advances in sequencing technology render the analysis of Y-chromosomal variation feasible, even in nonmodel organisms. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved.
Predicting nuclear gene coalescence from mitochondrial data: the three-times rule.

PubMed

Palumbi, S R; Cipriano, F; Hare, M P

2001-05-01

Coalescence theory predicts when genetic drift at nuclear loci will result in fixation of sequence differences to produce monophyletic gene trees. However, the theory is difficult to apply to particular taxa because it hinges on genetically effective population size, which is generally unknown. Neutral theory also predicts that evolution of monophyly will be four times slower in nuclear than in mitochondrial genes primarily because genetic drift is slower at nuclear loci. Variation in mitochondrial DNA (mtDNA) within and between species has been studied extensively, but can these mtDNA data be used to predict coalescence in nuclear loci? Comparison of neutral theories of coalescence of mitochondrial and nuclear loci suggests a simple rule of thumb. The "three-times rule" states that, on average, most nuclear loci will be monophyletic when the branch length leading to the mtDNA sequences of a species is three times longer than the average mtDNA sequence diversity observed within that species. A test using mitochondrial and nuclear intron data from seven species of whales and dolphins suggests general agreement with predictions of the three-times rule. We define the coalescence ratio as the mitochondrial branch length for a species divided by intraspecific mtDNA diversity. We show that species with high coalescence ratios show nuclear monophyly, whereas species with low ratios have polyphyletic nuclear gene trees. As expected, species with intermediate coalescence ratios show a variety of patterns. Especially at very high or low coalescence ratios, the three-times rule predicts nuclear gene patterns that can help detect the action of selection. The three-times rule may be useful as an empirical benchmark for evaluating evolutionary processes occurring at multiple loci.
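The coalescence ratio defined in this abstract is simple arithmetic, shown here as a small Python sketch. The function names, the use of "greater than or equal to" at the threshold, and the example numbers are assumptions for illustration; the abstract itself presents the rule as an on-average rule of thumb, not a sharp cutoff.

```python
def coalescence_ratio(branch_length, within_diversity):
    """mtDNA branch length leading to a species divided by the average
    mtDNA sequence diversity within that species (same units for both)."""
    if within_diversity <= 0:
        raise ValueError("within-species diversity must be positive")
    return branch_length / within_diversity


def predicts_nuclear_monophyly(branch_length, within_diversity, threshold=3.0):
    """Apply the three-times rule: a ratio at or above the threshold
    suggests that most nuclear loci are expected to be monophyletic."""
    return coalescence_ratio(branch_length, within_diversity) >= threshold


# Hypothetical values, for illustration only.
high = predicts_nuclear_monophyly(0.045, 0.01)  # ratio of about 4.5
low = predicts_nuclear_monophyly(0.012, 0.01)   # ratio of about 1.2
```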
Variation in the DNA methylation pattern of expressed and nonexpressed genes in chicken.

PubMed

Cooper, D N; Errington, L H; Clayton, R M

1983-01-01

Using methyl-sensitive and -insensitive restriction enzymes, Hpa II and Msp I, the methylation status of various chicken genes was examined in different tissues and developmental stages. Tissue-specific differences in methylation were found for the delta-crystallin, beta-tubulin, G3PDH, rDNA, and actin genes but not for the histone genes. Developmental decreases in methylation were noted for the delta-crystallin and actin genes in chicken kidney between embryo and adult. Since most of the sequences examined were housekeeping genes, transcriptional differences are apparently not a necessary accompaniment to changes in DNA methylation at the CpG sites examined. The only exception is sperm DNA where the delta-crystallin, beta-tubulin, and actin genes are highly methylated and almost certainly not transcribed. However the G3PDH genes are no more highly methylated in sperm than in other somatic tissues. Many sequences homologous to the rDNA and histone probes used are unmethylated in all tissues examined including sperm, but a methylated rDNA subfraction is more heavily methylated in sperm than in other tissues. We speculate as to the significance of these differences in sperm DNA methylation in the light of possible requirements for early gene activation and the probable deleterious mutagenic effects of heavy methylation within coding sequences.
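The enzyme pair used in this study works because Hpa II and Msp I both recognize the sequence CCGG, while Hpa II does not cut when the internal cytosine is methylated. A minimal Python sketch of that comparison follows. It models only internal-cytosine methylation, it reports recognition-site positions rather than exact cleavage points, and the function names, input format and toy sequence are assumptions.

```python
def ccgg_sites(seq):
    """0-based start positions of every CCGG recognition site."""
    return [i for i in range(len(seq) - 3) if seq[i:i + 4] == "CCGG"]


def cut_sites(seq, methylated_c, enzyme):
    """Recognition sites that the given enzyme is expected to cut.

    methylated_c: set of 0-based positions of methylated cytosines.
    Only methylation of the internal C (site start + 1) is modelled.
    """
    sites = ccgg_sites(seq)
    if enzyme == "MspI":
        return sites  # not blocked by internal-C methylation
    if enzyme == "HpaII":
        return [i for i in sites if i + 1 not in methylated_c]
    raise ValueError("unknown enzyme: " + enzyme)


# Hypothetical toy sequence with two CCGG sites; the second is methylated.
toy = "AACCGGTTCCGGAA"
hpa = cut_sites(toy, {9}, "HpaII")
msp = cut_sites(toy, {9}, "MspI")
```

A site present in the Msp I list but absent from the Hpa II list is one that would be read as methylated in this kind of paired-digest comparison.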
Evolution of DNA Methylation across Insects

PubMed Central

Vogel, Kevin J.; Moore, Allen J.; Schmitz, Robert J.

2017-01-01

DNA methylation contributes to gene and transcriptional regulation in eukaryotes, and therefore has been hypothesized to facilitate the evolution of plastic traits such as sociality in insects. However, DNA methylation is sparsely studied in insects. Therefore, we documented patterns of DNA methylation across a wide diversity of insects. We predicted that underlying enzymatic machinery is concordant with patterns of DNA methylation. Finally, given the suggestion that DNA methylation facilitated social evolution in Hymenoptera, we tested the hypothesis that the DNA methylation system will be associated with presence/absence of sociality among other insect orders. We found DNA methylation to be widespread, detected in all orders examined except Diptera (flies). Whole genome bisulfite sequencing showed that orders differed in levels of DNA methylation. Hymenoptera (ants, bees, wasps and sawflies) had some of the lowest levels, including several potential losses. Blattodea (cockroaches and termites) show all possible patterns, including a potential loss of DNA methylation in a eusocial species whereas solitary species had the highest levels. Species with DNA methylation do not always possess the typical enzymatic machinery. We identified a gene duplication event in the maintenance DNA methyltransferase 1 (DNMT1) that is shared by some Hymenoptera, and paralogs have experienced divergent, nonneutral evolution. This diversity and nonneutral evolution of underlying machinery suggests alternative DNA methylation pathways may exist. Phylogenetically corrected comparisons revealed no evidence that supports evolutionary association between sociality and DNA methylation. Future functional studies will be required to advance our understanding of DNA methylation in insects. PMID:28025279
Genetic features of ancient West Siberian people of the Middle Ages, revealed by mitochondrial DNA haplogroup analysis.

PubMed

Sato, Takehiro; Razhev, Dmitry; Amano, Tetsuya; Masuda, Ryuichi

2011-08-01

In order to investigate the genetic features of ancient West Siberian people of the Middle Ages, we studied ancient DNA from bone remains excavated from two archeological sites in West Siberia: Saigatinsky 6 (eighth to eleventh centuries) and Zeleny Yar (thirteenth century). Polymerase chain reaction amplification and nucleotide sequencing of mitochondrial DNA (mtDNA) succeeded for 9 of 67 specimens examined, and the sequences were assigned to mtDNA haplogroups B4, C4, G2, H and U. This distribution pattern of mtDNA haplogroups in medieval West Siberian people was similar to those previously reported in modern populations living in West Siberia, such as the Mansi, Ket and Nganasan. Exact tests of population differentiation showed no significant differences between the medieval people and modern populations in West Siberia. The findings suggest that some medieval West Siberian people analyzed in the present study are included in direct ancestral lineages of modern populations native to West Siberia.
Identification of tissue-specific, abiotic stress-responsive gene expression patterns in wine grape (Vitis vinifera L.) based on curation and mining of large-scale EST data sets

PubMed Central

2011-01-01

Background Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their relative EST abundance in roots relative to other tissues. 
Conclusions The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods highlighting the need for deep-sequencing expression profiling methods. PMID:21592389
Designing and conducting in silico analysis for identifying of Echinococcus spp. with discrimination of novel haplotypes: an approach to better understanding of parasite taxonomic.

PubMed

Spotin, Adel; Gholami, Shirzad; Nasab, Abbas Najafi; Fallah, Esmaeil; Oskouei, Mahmoud Mahami; Semnani, Vahid; Shariatzadeh, Seyyed Ali; Shahbazi, Abbas

2015-04-01

The definitive identification of Echinococcus species is currently carried out by sequencing and phylogenetic strategies. However, polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) patterns are not broadly used because of the heterogeneity of the Echinococcus genome in different regions of the world. Therefore, a standardized pattern should be designed and validated locally in under-studied areas. In this investigation, an in silico map was designed and developed for eight Echinococcus spp. on the basis of regional sequences from Iran and the rest of the world. A total of 60 Echinococcus isolates were collected from the liver and lungs of 15 human, 15 sheep, 15 cattle, and 15 camel cases in Semnan province, Central Iran. DNA samples were extracted and examined by polymerase chain reaction of ribosomal DNA (rDNA) internal transcribed spacer 1 (ITS1) and by PCR-RFLP with the RsaI endonuclease. Moreover, 15 amplicons of cytochrome oxidase 1 (Cox1) were directly sequenced in order to identify the strains/haplotypes. PCR-RFLP and phylogenetic analyses firmly revealed the presence of the G1 and G6 genotypes, with heterogeneity (three novel haplotypes) in the Cox1 gene, although no other expected genotypes were found in the region. The findings show that identifying novel haplotypes, together with discriminating Echinococcus spp. through regional patterns, can unambiguously illustrate the real taxonomic status of the parasite in Central Iran.
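An in silico PCR-RFLP pattern of the kind described here amounts to locating restriction sites in an amplicon and listing the fragment lengths. The sketch below assumes a linear amplicon and uses the RsaI recognition site GTAC, which is cut in the middle (GT^AC) to leave blunt ends; the function name and the toy amplicon are hypothetical and do not come from the study.

```python
def digest_fragment_lengths(seq, site="GTAC", cut_offset=2):
    """Return fragment lengths after cutting a linear sequence at
    every occurrence of `site`, `cut_offset` bases into the site."""
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


# Toy amplicon with two RsaI sites (not a real ITS1 sequence).
amplicon = "AAAA" + "GTAC" + "CCCCC" + "GTAC" + "TT"
print(digest_fragment_lengths(amplicon))
```

Comparing the predicted fragment lengths across reference sequences for each species or genotype gives the expected banding pattern against which gel results can be checked.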

Contrasting population structure from nuclear intron sequences and mtDNA of humpback whales.

PubMed

Palumbi, S R; Baker, C S

1994-05-01

Powerful analyses of population structure require information from multiple genetic loci. To help develop a molecular toolbox for obtaining this information, we have designed universal oligonucleotide primers that span conserved intron-exon junctions in a wide variety of animal phyla. We test the utility of exon-primed, intron-crossing amplifications by analyzing the variability of actin intron sequences from humpback, blue, and bowhead whales and comparing the results with mitochondrial DNA (mtDNA) haplotype data. Humpback actin introns fall into two major clades that exist in different frequencies in different oceanic populations. It is surprising that Hawaii and California populations, which are very distinct in mtDNAs, are similar in actin intron alleles. This discrepancy between mtDNA and nuclear DNA results may be due either to differences in genetic drift in mitochondrial and nuclear genes or to preferential movement of males, which do not transmit mtDNA to offspring, between separate breeding grounds. Opposing mtDNA and nuclear DNA results can help clarify otherwise hidden patterns of structure in natural populations.
DNA demethylation activates genes in seed maternal integument development in rice (Oryza sativa L.).

PubMed

Wang, Yifeng; Lin, Haiyan; Tong, Xiaohong; Hou, Yuxuan; Chang, Yuxiao; Zhang, Jian

2017-11-01

DNA methylation is an important epigenetic modification that regulates various plant developmental processes. Rice seed integument determines the seed size. However, the role of DNA methylation in its development remains largely unknown. Here, we report the first dynamic DNA methylomic profiling of rice maternal integument before and after pollination by using a whole-genome bisulfite deep sequencing approach. Analysis of DNA methylation patterns identified 4238 differentially methylated regions underpinning 4112 differentially methylated genes, including GW2, DEP1, RGB1 and numerous other regulators participating in maternal integument development. Bisulfite Sanger sequencing and qRT-PCR of six differentially methylated genes revealed extensive occurrence of DNA hypomethylation triggered by double fertilization at IAP compared with IBP, suggesting that DNA demethylation might be a key mechanism to activate numerous maternal controlling genes. The results presented here not only greatly expand the rice methylome dataset, but also shed novel insight into the regulatory roles of DNA methylation in rice seed maternal integument development. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Identification and molecular characterization of a novel circular single-stranded DNA virus associated with yerba mate in Argentina.

PubMed

Bejerman, Nicolás; de Breuil, Soledad; Nome, Claudia

2018-06-06

A single-stranded DNA (ssDNA) virus was detected in yerba mate samples showing chlorotic linear patterns, chlorotic rings and vein yellowing. The full-genome sequences of six different isolates of this ssDNA circular virus were obtained, which share > 99% sequence identity with each other. The newly identified virus has been tentatively named yerba mate-associated circular DNA virus (YMaCV). The 2707 nt-long viral genome has two and three open reading frames on its complementary and virion-sense strands, respectively. The coat protein is more similar to that of mastreviruses (44% identity), whereas the replication-associated protein of YMaCV is more similar (49% identity) to that encoded by a recently described, unclassified ssDNA virus isolated on trees in Brazil. This is the first report of a circular DNA virus associated with yerba mate. Its unique genome organization and phylogenetic relationships indicate that YMaCV represents a distinct evolutionary lineage within the ssDNA viruses and therefore this virus should be classified as a member of a new species within an unassigned genus or family.
Ligation Bias in Illumina Next-Generation DNA Libraries: Implications for Sequencing Ancient Genomes

PubMed Central

Seguin-Orlando, Andaine; Schubert, Mikkel; Clary, Joel; Stagegaard, Julia; Alberdi, Maria T.; Prado, José Luis; Prieto, Alfredo; Willerslev, Eske; Orlando, Ludovic

2013-01-01

Ancient DNA extracts consist of a mixture of endogenous molecules and contaminant DNA templates, often originating from environmental microbes. These two populations of templates exhibit different chemical characteristics, with the former showing depurination and cytosine deamination by-products, resulting from post-mortem DNA damage. Such chemical modifications can interfere with the molecular tools used for building second-generation DNA libraries, and limit our ability to fully characterize the true complexity of ancient DNA extracts. In this study, we first use fresh DNA extracts to demonstrate that library preparation based on adapter ligation at AT-overhangs is biased against DNA templates starting with thymine residues, in contrast to blunt-end adapter ligation. We observe the same bias on fresh DNA extracts sheared on Bioruptor, Covaris and nebulizers. This contradicts previous reports suggesting that this bias could originate from the methods used for shearing DNA. This also suggests that AT-overhang adapter ligation efficiency is affected in a sequence-dependent manner and results in an uneven representation of different genomic contexts. We then show how this bias could affect the base composition of ancient DNA libraries prepared following AT-overhang ligation, mainly by limiting the ability to ligate DNA templates starting with thymines and therefore deaminated cytosines. This results in particular nucleotide misincorporation damage patterns, deviating from the signature generally expected for authenticating ancient sequence data. Consequently, we show that models adequate for estimating post-mortem DNA damage levels must be robust to the molecular tools used for building ancient DNA libraries. PMID:24205269
Characterization and isolation of a T-DNA tagged banana promoter active during in vitro culture and low temperature stress.

PubMed

Santos, Efrén; Remy, Serge; Thiry, Els; Windelinckx, Saskia; Swennen, Rony; Sági, László

2009-06-24

Next-generation transgenic plants will require a more precise regulation of transgene expression, preferably under the control of native promoters. A genome-wide T-DNA tagging strategy was therefore performed for the identification and characterization of novel banana promoters. Embryogenic cell suspensions of a plantain-type banana were transformed with a promoterless, codon-optimized luciferase (luc+) gene and low temperature-responsive luciferase activation was monitored in real time. Around 16,000 transgenic cell colonies were screened for baseline luciferase activity at room temperature 2 months after transformation. After discarding positive colonies, cultures were re-screened in real-time at 26 degrees C followed by a gradual decrease to 8 degrees C. The baseline activation frequency was 0.98%, while the frequency of low temperature-responsive luciferase activity was 0.61% in the same population of cell cultures. Transgenic colonies with luciferase activity responsive to low temperature were regenerated to plantlets and luciferase expression patterns monitored during different regeneration stages. Twenty four banana DNA sequences flanking the right T-DNA borders in seven independent lines were cloned via PCR walking. RT-PCR analysis in one line containing five inserts allowed the identification of the sequence that had activated luciferase expression under low temperature stress in a developmentally regulated manner. This activating sequence was fused to the uidA reporter gene and back-transformed into a commercial dessert banana cultivar, in which its original expression pattern was confirmed. This promoter tagging and real-time screening platform proved valuable for the identification of novel promoters and genes in banana and for monitoring expression patterns throughout in vitro development and low temperature treatment. Combination of PCR walking techniques was efficient for the isolation of candidate promoters even in a multicopy T-DNA line. 
Qualitative and quantitative GUS expression analyses of one tagged promoter in a commercial cultivar demonstrated a reproducible promoter activity pattern during in vitro culture. Thus, this promoter could be used during in vitro selection and generation of commercial transgenic plants.
Sphingomonas pituitosa sp. nov., an exopolysaccharide-producing bacterium that secretes an unusual type of sphingan.

PubMed

Denner, E B; Paukner, S; Kämpfer, P; Moore, E R; Abraham, W R; Busse, H J; Wanner, G; Lubitz, W

2001-05-01

Strain EDIVT, an exopolysaccharide-producing bacterium, was subjected to polyphasic characterization. The bacterium produced copious amounts of an extracellular polysaccharide, forming slimy, viscous, intensely yellow-pigmented colonies on Czapek-Dox (CZD) agar. The culture fluids of the liquid version of CZD medium were highly viscous after cultivation for 5 d. Cells of strain EDIVT were Gram-negative, catalase-positive, oxidase-negative, nonspore-forming, rod-shaped and motile. Comparisons of 16S rDNA gene sequences demonstrated that EDIVT clusters phylogenetically with the species of the genus Sphingomonas sensu stricto. The G+C content of the DNA (64.5 mol%), the presence of ubiquinone Q-10, the presence of 2-hydroxymyristic acid (14:0 2-OH) as the major hydroxylated fatty acid, the absence of 3-hydroxy fatty acids and the detection of sym-homospermidine as the major component in the polyamine pattern, together with the presence of sphingoglycolipid, supported this delineation. 16S rDNA sequence analysis indicated that strain EDIVT is most closely related (99.4% similarity) to Sphingomonas trueperi LMG 2142T. DNA-DNA hybridization showed that the level of relatedness to S. trueperi is only 45.5%. Further differences were apparent in the cellular fatty acid profile, the polar lipid pattern, the Fourier-transform infrared spectrum and whole-cell proteins and in a number of biochemical characteristics. On the basis of the estimated phylogenetic position derived from 16S rDNA sequence data, DNA-DNA reassociation and phenotypic differences, strain EDIVT (= CIP 106154T = DSM 13101T) was recognized as a new species of Sphingomonas, for which the name Sphingomonas pituitosa sp. nov. is proposed. A component analysis of the exopolysaccharide (named PS-EDIV) suggested that it represents a novel type of sphingan composed of glucose, rhamnose and an unidentified sugar. Glucuronic acid, which is commonly found in sphingans, was absent. 
The mean molecular mass of PS-EDIV was approximately 3 × 10^6 Da.
Simultaneously measuring multiple protein interactions and their correlations in a cell by Protein-interactome Footprinting

PubMed Central

Luo, Si-Wei; Liang, Zhi; Wu, Jia-Rui

2017-01-01

Quantitatively detecting correlations of multiple protein-protein interactions (PPIs) in vivo is a big challenge. Here we introduce a novel method, termed Protein-interactome Footprinting (PiF), to simultaneously measure multiple PPIs in one cell. The principle of PiF is that each target physical PPI in the interactome is simultaneously transcoded into a specific DNA sequence based on dimerization of the target proteins fused with DNA-binding domains. The interaction intensity of each target protein is quantified as the copy number of the specific DNA sequences bound by each fusion protein dimer. Using PiF, we quantitatively reveal dynamic patterns of PPIs and their correlation network in E. coli two-component systems. PMID:28338015
High-Resolution Analysis of Cytosine Methylation in Ancient DNA

PubMed Central

Cropley, Jennifer E.; Cooper, Alan; Suter, Catherine M.

2012-01-01

Epigenetic changes to gene expression can result in heritable phenotypic characteristics that are not encoded in the DNA itself, but rather by biochemical modifications to the DNA or associated chromatin proteins. Interposed between genes and environment, these epigenetic modifications can be influenced by environmental factors to affect phenotype for multiple generations. This raises the possibility that epigenetic states provide a substrate for natural selection, with the potential to participate in the rapid adaptation of species to changes in environment. Any direct test of this hypothesis would require the ability to measure epigenetic states over evolutionary timescales. Here we describe the first single-base resolution of cytosine methylation patterns in an ancient mammalian genome, by bisulphite allelic sequencing of loci from late Pleistocene Bison priscus remains. Retrotransposons and the differentially methylated regions of imprinted loci displayed methylation patterns identical to those derived from fresh bovine tissue, indicating that methylation patterns are preserved in the ancient DNA. Our findings establish the biochemical stability of methylated cytosines over extensive time frames, and provide the first direct evidence that cytosine methylation patterns are retained in DNA from ancient specimens. The ability to resolve cytosine methylation in ancient DNA provides a powerful means to study the role of epigenetics in evolution. PMID:22276161
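Bisulphite sequencing reads out methylation because bisulphite treatment converts unmethylated cytosine to uracil (read as T after amplification), while methylated cytosine stays C. A minimal sketch of calling CpG methylation from one read follows; it assumes the read is already aligned to the reference without gaps and on the same strand, and the function name and toy sequences are hypothetical.

```python
def call_cpg_methylation(reference, read):
    """Classify each reference CpG from an aligned bisulphite read.

    Returns a dict mapping the 0-based position of the CpG cytosine
    to 'methylated' (read shows C), 'unmethylated' (read shows T),
    or 'ambiguous' (any other base).
    """
    if len(reference) != len(read):
        raise ValueError("reference and read must be the same length")
    reference, read = reference.upper(), read.upper()
    calls = {}
    for i in range(len(reference) - 1):
        if reference[i:i + 2] == "CG":
            if read[i] == "C":
                calls[i] = "methylated"
            elif read[i] == "T":
                calls[i] = "unmethylated"
            else:
                calls[i] = "ambiguous"
    return calls


# Toy example: two CpGs, the first retained as C, the second converted to T.
print(call_cpg_methylation("ACGTTCGA", "ACGTTTGA"))
```

In ancient DNA, post-mortem cytosine deamination also produces C-to-T changes, so a real analysis would need to separate damage from bisulphite conversion; this sketch does not attempt that.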
Late-Quaternary biogeographic scenarios for the brown bear (Ursus arctos), a wild mammal model species

NASA Astrophysics Data System (ADS)

Davison, John; Ho, Simon Y. W.; Bray, Sarah C.; Korsten, Marju; Tammeleht, Egle; Hindrikson, Maris; Østbye, Kjartan; Østbye, Eivind; Lauritzen, Stein-Erik; Austin, Jeremy; Cooper, Alan; Saarma, Urmas

2011-02-01

This review provides an up-to-date synthesis of the matrilineal phylogeography of a uniquely well-studied Holarctic mammal, the brown bear. We extend current knowledge by presenting a DNA sequence derived from one of the earliest known fossils of a polar bear (dated to 115 000 years before present), a species that shares a paraphyletic mitochondrial association with brown bears. A molecular clock analysis of 140 mitochondrial DNA sequences, including our new polar bear sequence, provides novel insights into the times of origin for different brown bear clades. We propose a number of regional biogeographic scenarios based on genetic data, divergence time estimates and paleontological records. The case of the brown bear provides an example for researchers working with less well-studied taxa: it shows clearly that phylogeographic models based on patterns of modern genetic variation alone can be substantially improved by including data on historical patterns of genetic diversity in the form of ancient DNA sequences derived from accurately dated samples and by using an approach to divergence-time estimation that suits the data under analysis. Using such approaches it has been possible to (i) establish that the processes shaping modern genetic diversity in brown bears acted recently, within the last three glacial cycles; (ii) distinguish among hypotheses concerning species' responses to climatic oscillations in accordance with the lack of phylogeographic structure that existed in brown bears prior to the last glacial maximum (LGM); (iii) reassess theories linking monophyletic brown bear populations to particular LGM refuge areas; and (iv) identify vicariance events and track analogous patterns of migration by brown bears out of Eurasia to North America and Japan.
MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

PubMed

Ozaki, Haruka; Iwasaki, Wataru

2016-08-01

As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Dissecting enzyme function with microfluidic-based deep mutational scanning.

PubMed

Romero, Philip A; Tran, Tuan M; Abate, Adam R

2015-06-09

Natural enzymes are incredibly proficient catalysts, but engineering them to have new or improved functions is challenging due to the complexity of how an enzyme's sequence relates to its biochemical properties. Here, we present an ultrahigh-throughput method for mapping enzyme sequence-function relationships that combines droplet microfluidic screening with next-generation DNA sequencing. We apply our method to map the activity of millions of glycosidase sequence variants. Microfluidic-based deep mutational scanning provides a comprehensive and unbiased view of the enzyme function landscape. The mapping displays expected patterns of mutational tolerance and a strong correspondence to sequence variation within the enzyme family, but also reveals previously unreported sites that are crucial for glycosidase function. We modified the screening protocol to include a high-temperature incubation step, and the resulting thermotolerance landscape allowed the discovery of mutations that enhance enzyme thermostability. Droplet microfluidics provides a general platform for enzyme screening that, when combined with DNA-sequencing technologies, enables high-throughput mapping of enzyme sequence space.
Oligonucleotide Array for Identification and Detection of Pythium Species†

PubMed Central

Tambong, J. T.; de Cock, A. W. A. M.; Tinker, N. A.; Lévesque, C. A.

2006-01-01

A DNA array containing 172 oligonucleotides complementary to specific diagnostic regions of internal transcribed spacers (ITS) of more than 100 species was developed for identification and detection of Pythium species. All of the species studied, with the exception of Pythium ostracodes, exhibited a positive hybridization reaction with at least one corresponding species-specific oligonucleotide. Hybridization patterns were distinct for each species. The array hybridization patterns included cluster-specific oligonucleotides that facilitated the recognition of species, including new ones, belonging to groups such as those producing filamentous or globose sporangia. BLAST analyses against 500 publicly available Pythium sequences in GenBank confirmed that species-specific oligonucleotides were unique to all of the available strains of each species, of which there were numerous economically important ones. GenBank entries of newly described species that are not putative synonyms showed no homology to sequences of the spotted species-specific oligonucleotides, but most new species did match some of the cluster-specific oligonucleotides. Further verification of the specificity of the DNA array was done with 50 additional Pythium isolates obtained by soil dilution plating. The hybridization patterns obtained were consistent with the identification of these isolates based on morphology and ITS sequence analyses. In another blind test, total DNA of the same soil samples was amplified and hybridized on the array, and the results were compared to those of 130 Pythium isolates obtained by soil dilution plating and root baiting. The 13 species detected by the DNA array corresponded to the isolates obtained by a combination of soil dilution plating and baiting, except for one new species that was not represented on the array. We conclude that the reported DNA array is a reliable tool for identification and detection of the majority of Pythium species in environmental samples. 
Simultaneous detection and identification of multiple species of soilborne pathogens such as Pythium species could be a major step forward for epidemiological and ecological studies. PMID:16597974
Environmental DNA detection of rare and invasive fish species in two Great Lakes tributaries.

PubMed

Balasingham, Katherine D; Walter, Ryan P; Mandrak, Nicholas E; Heath, Daniel D

2018-01-01

The extraction and characterization of DNA from aquatic environmental samples offers an alternative, noninvasive approach for the detection of rare species. Environmental DNA, coupled with PCR and next-generation sequencing ("metabarcoding"), has proven to be very sensitive for the detection of rare aquatic species. Our study used a custom-designed group-specific primer set and next-generation sequencing for the detection of three species at risk (Eastern Sand Darter, Ammocrypta pellucida; Northern Madtom, Noturus stigmosus; and Silver Shiner, Notropis photogenis), one invasive species (Round Goby, Neogobius melanostomus) and an additional 78 native species from two large Great Lakes tributary rivers in southern Ontario, Canada: the Grand River and the Sydenham River. Of 82 fish species detected in both rivers using capture-based and eDNA methods, our eDNA method detected 86.2% and 72.0% of the fish species in the Grand River and the Sydenham River, respectively, which included our four target species. Our analyses also identified significant positive and negative species co-occurrence patterns between our target species and other identified species. Our results demonstrate that eDNA metabarcoding that targets the fish community as well as individual species of interest provides a better understanding of factors affecting the target species spatial distribution in an ecosystem than possible with only target species data. Additionally, eDNA is easily implemented as an initial survey tool, or alongside capture-based methods, for improved mapping of species distribution patterns. © 2017 John Wiley & Sons Ltd.
Modulation of cyclobutane thymine photodimer formation in T11-tracts in rotationally phased nucleosome core particles and DNA minicircles.

PubMed

Wang, Kesai; Taylor, John-Stephen A

2017-07-07

Cyclobutane pyrimidine dimers (CPDs) are DNA photoproducts linked to skin cancer, whose mutagenicity depends in part on their frequency of formation and deamination. Nucleosomes modulate CPD formation, favoring outside facing sites and disfavoring inward facing sites. A similar pattern of CPD formation in protein-free DNA loops suggests that DNA bending causes the modulation in nucleosomes. To systematically study the cause and effect of nucleosome structure on CPD formation and deamination, we have developed a circular permutation synthesis strategy for positioning a target sequence at different superhelix locations (SHLs) across a nucleosome in which the DNA has been rotationally phased with respect to the histone octamer by TG motifs. We have used this system to show that the nucleosome dramatically modulates CPD formation in a T11-tract that covers one full turn of the nucleosome helix at seven different SHLs, and that the position of maximum CPD formation at all locations is shifted to the 5′-side of that found in mixed-sequence nucleosomes. We also show that an 80-mer minicircle DNA using the same TG-motifs faithfully reproduces the CPD pattern in the nucleosome, indicating that it is a good model for protein-free rotationally phased bent DNA of the same curvature as in a nucleosome, and that bending is modulating CPD formation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Learning a weighted sequence model of the nucleosome core and linker yields more accurate predictions in Saccharomyces cerevisiae and Homo sapiens.

PubMed

Reynolds, Sheila M; Bilmes, Jeff A; Noble, William Stafford

2010-07-08

DNA in eukaryotes is packaged into a chromatin complex, the most basic element of which is the nucleosome. The precise positioning of the nucleosome cores allows for selective access to the DNA, and the mechanisms that control this positioning are important pieces of the gene expression puzzle. We describe a large-scale nucleosome pattern that jointly characterizes the nucleosome core and the adjacent linkers and is predominantly characterized by long-range oscillations in the mono-, di- and tri-nucleotide content of the DNA sequence, and we show that this pattern can be used to predict nucleosome positions in both Homo sapiens and Saccharomyces cerevisiae more accurately than previously published methods. Surprisingly, in both H. sapiens and S. cerevisiae, the most informative individual features are the mono-nucleotide patterns, although the inclusion of di- and tri-nucleotide features results in improved performance. Our approach combines a much longer pattern than has been previously used to predict nucleosome positioning from sequence (301 base pairs, centered at the position to be scored) with a novel discriminative classification approach that selectively weights the contributions from each of the input features. The resulting scores are relatively insensitive to local AT-content and can be used to accurately discriminate putative dyad positions from adjacent linker regions without requiring an additional dynamic programming step and without the attendant edge effects and assumptions about linker length modeling and overall nucleosome density. Our approach produces the best dyad-linker classification results published to date in H. sapiens, and outperforms two recently published models on a large set of S. cerevisiae nucleosome positions. Our results suggest that in both genomes, a comparable and relatively small fraction of nucleosomes are well-positioned and that these positions are predictable based on sequence alone. We believe that the bulk of the remaining nucleosomes follow a statistical positioning model.
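As a rough illustration of the 301 bp feature window mentioned in this abstract, the sketch below counts mono-, di- and tri-nucleotides in a window centered on a candidate position. The function name window_kmer_counts and the choice to return None near sequence ends are assumptions made for illustration. The published model's position-specific features and its discriminative weighting are not reproduced here.

```python
from collections import Counter


def window_kmer_counts(seq, center, half_width=150, ks=(1, 2, 3)):
    """Count k-mers in the window [center - half_width, center + half_width].

    With half_width=150 the window spans 301 bases. Returns None when the
    full window does not fit inside the sequence (an illustrative choice).
    """
    start, end = center - half_width, center + half_width + 1
    if start < 0 or end > len(seq):
        return None
    window = seq[start:end].upper()
    counts = Counter()
    for k in ks:
        # Slide a length-k frame across the window, one base at a time.
        for i in range(len(window) - k + 1):
            counts[window[i:i + k]] += 1
    return counts
```

A classifier would then take these counts (or position-resolved versions of them) as input features; how the features are weighted is left open in this sketch.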
Distinct pattern of TP53 mutations in human immunodeficiency virus-related head and neck squamous cell carcinoma.

PubMed

Gleber-Netto, Frederico O; Zhao, Mei; Trivedi, Sanchit; Wang, Jiping; Jasser, Samar; McDowell, Christina; Kadara, Humam; Zhang, Jiexin; Wang, Jing; William, William N; Lee, J Jack; Nguyen, Minh Ly; Pai, Sara I; Walline, Heather M; Shin, Dong M; Ferris, Robert L; Carey, Thomas E; Myers, Jeffrey N; Pickering, Curtis R

2018-01-01

Human immunodeficiency virus-infected individuals (HIVIIs) have a higher incidence of head and neck squamous cell carcinoma (HNSCC), and clinical and histopathological differences have been observed in their tumors in comparison with those of HNSCC patients without a human immunodeficiency virus (HIV) infection. The reasons for these differences are not clear, and molecular differences between HIV-related HNSCC and non-HIV-related HNSCC may exist. This study compared the mutational patterns of HIV-related HNSCC and non-HIV-related HNSCC. The DNA of 20 samples of HIV-related HNSCCs and 32 samples of non-HIV-related HNSCCs was sequenced. DNA libraries covering exons of 18 genes frequently mutated in HNSCC (AJUBA, CASP8, CCND1, CDKN2A, EGFR, FAT1, FBXW7, HLA-A, HRAS, KEAP1, NFE2L2, NOTCH1, NOTCH2, NSD1, PIK3CA, TGFBR2, TP53, and TP63) were prepared and sequenced on an Ion Personal Genome Machine sequencer. DNA sequencing data were analyzed with Ion Reporter software. The human papillomavirus (HPV) status of the tumor samples was assessed with in situ hybridization, the MassARRAY HPV multiplex polymerase chain reaction assay, and p16 immunostaining. Mutation calls were compared among the studied groups. HIV-related HNSCC revealed a distinct pattern of mutations in comparison with non-HIV-related HNSCC. TP53 mutation frequencies were significantly lower in HIV-related HNSCC. Mutations in HIV+ patients tended to be TpC>T nucleotide changes for all mutated genes but especially for TP53. HNSCC in HIVIIs presents a distinct pattern of genetic mutations, particularly in the TP53 gene. HIV-related HNSCC may have a distinct biology, and an effect of the HIV virus on the pathogenesis of these tumors should not be ruled out. Cancer 2018;124:84-94. © 2017 American Cancer Society.
Localized population divergence of vervet monkeys (Chlorocebus spp.) in South Africa: evidence from mtDNA

PubMed Central

Turner, Trudy R.; Coetzer, Willem G.; Schmitt, Christopher A.; Lorenz, Joseph G.; Freimer, Nelson B.; Grobler, J. Paul

2015-01-01

Objectives Vervet monkeys are common in most tree-rich areas of South Africa, but their absence from grassland and semi-desert areas of the country suggests potentially restricted and mosaic local population patterns that may have relevance to local phenotype patterns and selection. A portion of the mtDNA control region was sequenced to study patterns of genetic differentiation. Materials and Methods DNA was extracted and mtDNA sequences were obtained from 101 vervet monkeys at 15 localities which represent both an extensive (widely across the distribution range) and intensive (more than one troop at most of the localities) sampling strategy. Analyses utilized Arlequin 3.1, MEGA 6, BEAST v1.5.2 and Network V3.6.1. Results The dataset contained 26 distinct haplotypes, with six populations fixed for single haplotypes. Pairwise P-distance among population pairs showed significant differentiation among most population pairs, but with non-significant differences among populations within some regions. Populations were grouped into three broad clusters in a maximum likelihood phylogenetic tree and a haplotype network. These clusters correspond to (i) north-western, northern and north-eastern parts of the distribution range as well as the northern coastal belt; (ii) central areas of the country; and (iii) southern part of the Indian Ocean coastal belt, and adjacent inland areas. Discussion Apparent patterns of genetic structure correspond to current and past distribution of suitable habitat, geographic barriers to gene flow, geographic distance and female philopatry. However, further work on nuclear markers and other genomic data is necessary to confirm these results. PMID:26265297
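The pairwise P-distance mentioned in this abstract is the proportion of aligned sites at which two sequences differ. The sketch below is a minimal illustration, not the implementation used by Arlequin or MEGA. The function name p_distance and the decision to skip sites with gaps or N are assumptions.

```python
def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') or 'N' are skipped,
    which is one common convention and an assumption of this sketch.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    compared = differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        differing += x != y
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared
```

For example, two aligned four-base sequences that differ at one site have a P-distance of 0.25.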
Plastoglobule-Targeting Competence of a Putative Transit Peptide Sequence from Rice Phytoene Synthase 2 in Plastids.

PubMed

You, Min Kyoung; Kim, Jin Hwa; Lee, Yeo Jin; Jeong, Ye Sol; Ha, Sun-Hwa

2016-12-22

Plastoglobules (PGs) are thylakoid membrane microdomains within plastids that are known as specialized locations of carotenogenesis. Three rice phytoene synthase proteins (OsPSYs) involved in carotenoid biosynthesis have been identified. Here, the N-terminal 80-amino-acid portion of OsPSY2 (PTp) was demonstrated to be a chloroplast-targeting peptide by displaying cytosolic localization of OsPSY2(ΔPTp):mCherry in rice protoplast, in contrast to chloroplast localization of OsPSY2:mCherry in a punctate pattern. The peptide sequence of PTp was predicted to harbor two transmembrane domains eligible for a putative PG-targeting signal. To assess and enhance the PG-targeting ability of PTp, the original PTp DNA sequence (PTp) was modified to a synthetic DNA sequence (stPTp), which had 84.4% similarity to the original sequence. The motivation of this modification was to reduce the GC ratio from 75% to 65% and to disentangle the hairpin loop structures of PTp. These two DNA sequences were fused to the sequence of the synthetic green fluorescent protein (sGFP) and drove GFP expression with different efficiencies. In particular, the RNA and protein levels of stPTp-sGFP were slightly improved to 1.4-fold and 1.3-fold more than those of sGFP, respectively. The green fluorescent signals of their mature proteins were all observed as speckle-like patterns with slightly blurred stromal signals in chloroplasts. These discrete green speckles of PTp-sGFP and stPTp-sGFP corresponded exactly to the red fluorescent signal displayed by OsPSY2:mCherry in both etiolated and greening protoplasts and are presumed to correspond to distinct PGs. In conclusion, we identified PTp as a transit peptide sequence facilitating preferential translocation of foreign proteins to PGs, and developed an improved PTp sequence, stPTp, which is expected to be very useful for applications in plant biotechnologies requiring precise micro-compartmental localization in plastids.
In and out of the rRNA genes: characterization of Pokey elements in the sequenced Daphnia genome

PubMed Central

2013-01-01

Background Only a few transposable elements are known to exhibit site-specific insertion patterns, including the well-studied R-element retrotransposons that insert into specific sites within the multigene rDNA. The only known rDNA-specific DNA transposon, Pokey (superfamily: piggyBac) is found in the freshwater microcrustacean, Daphnia pulex. Here, we present a genome-wide analysis of Pokey based on the recently completed whole genome sequencing project for D. pulex. Results Phylogenetic analysis of Pokey elements recovered from the genome sequence revealed the presence of four lineages corresponding to two divergent autonomous families and two related lineages of non-autonomous miniature inverted repeat transposable elements (MITEs). The MITEs are also found at the same 28S rRNA gene insertion site as the Pokey elements, and appear to have arisen as deletion derivatives of autonomous elements. Several copies of the full-length Pokey elements may be capable of producing an active transposase. Surprisingly, both families of Pokey possess a series of 200 bp repeats upstream of the transposase that is derived from the rDNA intergenic spacer (IGS). The IGS sequences within the Pokey elements appear to be evolving in concert with the rDNA units. Finally, analysis of the insertion sites of Pokey elements outside of rDNA showed a target preference for sites similar to the specific sequence that is targeted within rDNA. Conclusions Based on the target site preference of Pokey elements and the concerted evolution of a segment of the element with the rDNA unit, we propose an evolutionary path by which the ancestors of Pokey elements have invaded the rDNA niche. We discuss how specificity for the rDNA unit may have evolved and how this specificity has played a role in the long-term survival of these elements in the subgenus Daphnia. PMID:24059783
Bacterial community composition in different sediments from the Eastern Mediterranean Sea: a comparison of four 16S ribosomal DNA clone libraries.

PubMed

Polymenakou, Paraskevi N; Bertilsson, Stefan; Tselepides, Anastasios; Stephanou, Euripides G

2005-10-01

The regional variability of sediment bacterial community composition and diversity was studied by comparative analysis of four large 16S ribosomal DNA (rDNA) clone libraries from sediments in different regions of the Eastern Mediterranean Sea (Thermaikos Gulf, Cretan Sea, and South Ionian Sea). Amplified rDNA restriction analysis of 664 clones from the libraries indicated that the rDNA richness and evenness was high: for example, a near-1:1 relationship among screened clones and number of unique restriction patterns when up to 190 clones were screened for each library. Phylogenetic analysis of 207 bacterial 16S rDNA sequences from the sediment libraries demonstrated that Gamma-, Delta-, and Alphaproteobacteria, Holophaga/Acidobacteria, Planctomycetales, Actinobacteria, Bacteroidetes, and Verrucomicrobia were represented in all four libraries. A few clones also grouped with the Betaproteobacteria, Nitrospirae, Spirochaetales, Chlamydiae, Firmicutes, and candidate division OP11. The abundance of sequences affiliated with Gammaproteobacteria was higher in libraries from shallow sediments in the Thermaikos Gulf (30 m) and the Cretan Sea (100 m) compared to the deeper South Ionian station (2790 m). Most sequences in the four sediment libraries clustered with uncultured 16S rDNA phylotypes from marine habitats, and many of the closest matches were clones from hydrocarbon seeps, benzene-mineralizing consortia, sulfate reducers, sulfur oxidizers, and ammonia oxidizers. LIBSHUFF statistics of 16S rDNA gene sequences from the four libraries revealed major differences, indicating either a very high richness in the sediment bacterial communities or considerable variability in bacterial community composition among regions, or both.

PatScanUI: an intuitive web interface for searching patterns in DNA and protein data.

PubMed

Blin, Kai; Wohlleben, Wolfgang; Weber, Tilmann

2018-05-02

Patterns in biological sequences frequently signify interesting features in the underlying molecule. Many tools exist to search for well-known patterns. Less support is available for exploratory analysis, where no well-defined patterns are known yet. PatScanUI (https://patscan.secondarymetabolites.org/) provides a highly interactive web interface to the powerful generic pattern search tool PatScan. The complex PatScan-patterns are created in a drag-and-drop aware interface allowing researchers to do rapid prototyping of the often complicated patterns useful to identifying features of interest.
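PatScan has its own pattern language, which is not reproduced here. As a much simpler stand-in for the idea of exploratory pattern search, the sketch below scans a DNA sequence for a motif written with standard IUPAC degenerate base codes. The function name find_motif and the use of Python regular expressions are assumptions for illustration only.

```python
import re

# Standard IUPAC nucleotide codes expressed as regular-expression classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_motif(seq, motif):
    """Return 0-based start positions of all matches, including overlapping ones."""
    body = "".join(IUPAC[c] for c in motif.upper())
    # A lookahead matches without consuming, so overlapping hits are reported.
    regex = re.compile("(?=" + body + ")")
    return [m.start() for m in regex.finditer(seq.upper())]
```

Searching "TATAAATATATA" for the motif "TATAWA" (W = A or T) would report starts at positions 0 and 6.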
Analysis of ELA-DQB exon 2 polymorphism in Argentine Creole horses by PCR-RFLP and PCR-SSCP.

PubMed

Villegas-Castagnasso, E E; Díaz, S; Giovambattista, G; Dulout, F N; Peral-García, P

2003-08-01

The second exon of equine leucocyte antigen (ELA)-DQB genes was amplified from genomic DNA of 32 Argentine Creole horses by PCR. Amplified DNA was analysed by PCR-restriction fragment length polymorphism (RFLP) and PCR-single-strand conformation polymorphism (SSCP). The PCR-RFLP analysis revealed two HaeIII patterns, four RsaI patterns, five MspI patterns and two HinfI patterns. EcoRI showed no variation in the analysed sample. Additional patterns that did not account for known exon 2 DNA sequences were observed, suggesting the existence of novel ELA-DQB alleles. PCR-SSCP analysis exhibited seven different band patterns, and the number of bands per animal ranged from four to nine. Both methods indicated that at least two DQB genes are present. The presence of more than two alleles in each animal showed that the primers employed in this work are not specific for a unique DQB locus. The improvement of this PCR-RFLP method should provide a simple and rapid technique for an accurate definition of ELA-DQB typing in horses.
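The restriction patterns described in this abstract arise from where each enzyme cuts the amplified fragment. The sketch below predicts fragment lengths for a linear amplicon using the well-known recognition sites of the five enzymes named above. It is an illustrative in-silico digest, not the authors' typing procedure. The function name fragment_lengths is an assumption, and any amplicon sequence passed to it is hypothetical.

```python
import re

# Recognition site (N = any base) and top-strand cut offset within the site.
# All five sites are palindromic, so scanning one strand finds every site.
ENZYMES = {
    "HaeIII": ("GGCC", 2),    # GG^CC
    "RsaI": ("GTAC", 2),      # GT^AC
    "MspI": ("CCGG", 1),      # C^CGG
    "HinfI": ("GANTC", 1),    # G^ANTC
    "EcoRI": ("GAATTC", 1),   # G^AATTC
}


def fragment_lengths(seq, enzyme):
    """Return fragment lengths after complete digestion of a linear sequence."""
    site, offset = ENZYMES[enzyme]
    pattern = re.compile("(?=" + site.replace("N", "[ACGT]") + ")")
    cuts = [m.start() + offset for m in pattern.finditer(seq.upper())]
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]
```

Two alleles that differ at a single recognition site give different fragment-length lists for that enzyme, which is what appears on a gel as distinct RFLP patterns.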
New insights into Trypanosoma cruzi evolution, genotyping and molecular diagnostics from satellite DNA sequence analysis.

PubMed

Ramírez, Juan C; Torres, Carolina; Curto, María de Los A; Schijman, Alejandro G

2017-12-01

Trypanosoma cruzi has been subdivided into seven Discrete Typing Units (DTUs), TcI-TcVI and Tcbat. Two major evolutionary models have been proposed to explain the origin of hybrid lineages, but while it is widely accepted that TcV and TcVI are the result of genetic exchange between TcII and TcIII strains, the origin of TcIII and TcIV is still a matter of debate. T. cruzi satellite DNA (SatDNA), comprised of 195 bp units organized in tandem repeats, from both TcV and TcVI stocks were found to have SatDNA copies type TcI and TcII; whereas contradictory results were observed for TcIII stocks and no TcIV sequence has been analyzed yet. Herein, we have gone deeper into this matter analyzing 335 distinct SatDNA sequences from 19 T. cruzi stocks representative of DTUs TcI-TcVI for phylogenetic inference. A Bayesian phylogenetic tree showed that all sequences were grouped in three major clusters, which corresponded to sequences from DTUs TcI/III, TcII and TcIV; whereas TcV and TcVI stocks had two sets of sequences distributed into TcI/III and TcII clusters. As expected, the lowest genetic distances were found between TcI and TcIII, and between TcV and TcVI sequences; whereas the highest ones were observed between TcII and TcI/III, and among TcIV sequences and those from the remaining DTUs. In addition, signature patterns associated with specific T. cruzi lineages were identified and new primers that improved SatDNA-based qPCR sensitivity were designed. Our findings support the theory that TcIII is not the result of a hybridization event between TcI and TcII, and that TcIV had an independent origin from the other DTUs, contributing to clarifying the evolutionary history of T. cruzi lineages. Moreover, this work opens the possibility of typing samples from Chagas disease patients with low parasitic loads and improving molecular diagnostic methods of T. cruzi infection based on SatDNA sequence amplification.
Physical locations of 5S and 18S-25S rDNA in Asian and American diploid Hordeum species with the I genome.

PubMed

Taketa, S; Ando, H; Takeda, K; von Bothmer, R

2001-05-01

The physical locations of 5S and 18S-25S rDNA sequences in 15 diploid Hordeum species with the I genome were examined by double-target in situ hybridization with pTa71 (18S-25S rDNA) and pTa794 (5S rDNA) clones as probes. All the three Asian species had a species-specific rDNA pattern. In 12 American species studied, eight different rDNA types were found. The type reported previously in H. chilense (the 'chilense' type) was observed in eight American species. The chilense type had double 5S rDNA sites - two sites on one chromosome arm separated by a short distance - and two pairs of major 18S-25S rDNA sites on two pairs of satellite chromosomes. The other seven types found in American species were similar to the chilense type and could be derived from the chilense type through deletion, reduction or addition of a rDNA site. Intraspecific polymorphisms were observed in three American species. The overall similarity in rDNA patterns among American species indicates the close relationships between North and South American species and their derivation from a single ancestral source. The differences in the distribution patterns of 5S and 18S-25S rDNA between Asian and American species suggest differentiation between the I genomes of Asian and American species. The 5S and 18S-25S rDNA sites are useful chromosome markers for delimiting Asian species, but have limited value as a taxonomic character in American species. On the basis of rDNA patterns, karyotype evolution and phylogeny of the I-genome diploid species are discussed.
Pneumocystis jirovecii multilocus genotyping profiles in patients from Portugal and Spain.

PubMed

Esteves, F; Montes-Cano, M A; de la Horra, C; Costa, M C; Calderón, E J; Antunes, F; Matos, O

2008-04-01

Pneumonia caused by the opportunistic organism Pneumocystis jirovecii is a clinically important infection affecting AIDS and other immunocompromised patients. The present study aimed to compare and characterise the frequency pattern of DNA sequences from the P. jirovecii mitochondrial large-subunit rRNA (mtLSU rRNA) gene, the dihydropteroate synthase (DHPS) gene and the internal transcribed spacer (ITS) regions of the nuclear rRNA operon in specimens from Lisbon (Portugal) and Seville (Spain). Total DNA was extracted and used for specific molecular sequence analysis of the three loci. In both populations, mtLSU rRNA gene analysis revealed an overall prevalence of genotype 1. In the Portuguese population, genotype 2 was the second most common, followed by genotype 3. Inversely, in the Spanish population, genotype 3 was the second most common, followed by genotype 2. The DHPS wild-type sequence was the genotype observed most frequently in both populations, and the DHPS genotype frequency pattern was identical to distribution patterns revealed in other European studies. ITS types showed a significant diversity in both populations because of the high sequence variability in these genomic regions. The most prevalent ITS type in the Portuguese population was Eg, followed by Cg. In contrast to other European studies, Bi was the most common ITS type in the Spanish samples, followed by Eg. A statistically significant association between mtLSU rRNA genotype 1 and ITS type Eg was revealed.
Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes

PubMed Central

Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu

2014-01-01

It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342
Assessing DNA Barcodes for Species Identification in North American Reptiles and Amphibians in Natural History Collections.

PubMed

Chambers, E Anne; Hebert, Paul D N

2016-01-01

High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (<2%) occurred in 12% of reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale.
Assessing DNA Barcodes for Species Identification in North American Reptiles and Amphibians in Natural History Collections

PubMed Central

Chambers, E. Anne; Hebert, Paul D. N.

2016-01-01

Background High rates of species discovery and loss have led to the urgent need for more rapid assessment of species diversity in the herpetofauna. DNA barcoding allows for the preliminary identification of species based on sequence divergence. Prior DNA barcoding work on reptiles and amphibians has revealed higher biodiversity counts than previously estimated due to cases of cryptic and undiscovered species. Past studies have provided DNA barcodes for just 14% of the North American herpetofauna, revealing the need for expanded coverage. Methodology/Principal Findings This study extends the DNA barcode reference library for North American herpetofauna, assesses the utility of this approach in aiding species delimitation, and examines the correspondence between current species boundaries and sequence clusters designated by the BIN system. Sequences were obtained from 730 specimens, representing 274 species (43%) from the North American herpetofauna. Mean intraspecific divergences were 1% and 3%, while average congeneric sequence divergences were 16% and 14% in amphibians and reptiles, respectively. BIN assignments corresponded with current species boundaries in 79% of amphibians, 100% of turtles, and 60% of squamates. Deep divergences (>2%) were noted in 35% of squamate and 16% of amphibian species, and low divergences (<2%) occurred in 12% of reptiles and 23% of amphibians, patterns reflected in BIN assignments. Sequence recovery declined with specimen age, and variation in recovery success was noted among collections. Within collections, barcodes effectively flagged seven mislabeled tissues, and barcode fragments were recovered from five formalin-fixed specimens. Conclusions/Significance This study demonstrates that DNA barcodes can effectively flag errors in museum collections, while BIN splits and merges reveal taxa belonging to deeply diverged or hybridizing lineages. This study is the first effort to compile a reference library of DNA barcodes for herpetofauna on a continental scale. PMID:27116180
Differential structural status of the RNA counterpart of an undecamer quasi-palindromic DNA sequence present in LCR of human β-globin gene cluster.

PubMed

Kaushik, Mahima; Kukreti, Shrikant

2015-01-01

Our previous work on structural polymorphism shown at a single nucleotide polymorphism (SNP) (A → G) site located in the HS4 region of the locus control region (LCR) of the β-globin gene has established a hairpin → duplex equilibrium corresponding to an A → B-like DNA transition (Kaushik M, Kukreti, R., Grover, D., Brahmachari, S.K. and Kukreti S. Nucleic Acids Res. 2003; Kaushik M, Kukreti S. Nucleic Acids Res. 2006). The G-allele of the A → G SNP has been shown to be significantly associated with the occurrence of β-thalassemia. Considering the significance of this 11-nt long quasi-palindromic sequence [5'-TGGGG(G/A)CCCCA; HP(G/A)11] of the β-globin gene LCR, we further explored the differential behavior of the same DNA sequence with its RNA counterpart, using various biophysical and biochemical techniques. In contrast to its DNA counterpart exhibiting an A → B structural transition and an equilibrium between duplex and hairpin forms, the studied RNA oligonucleotide sequence [5'-UGGGG(G/A)CCCCA; RHP(G/A)11] existed only in duplex form (A-conformation) and did not form a hairpin. The single residue difference from A to G led to the unusual thermal stability of the RNA structure formed by the studied sequence. Since naturally occurring mutations and various SNP sites may stabilize or destabilize the local DNA/RNA secondary structures, these structural transitions may affect the gene expression by a change in the protein-DNA recognition patterns.
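A sequence is palindromic in the molecular-biology sense when it equals its own reverse complement, and quasi-palindromic when it nearly does. The sketch below counts the positions at which a DNA sequence differs from its reverse complement, using the two alleles of the 11-nt sequence quoted above. The function names are assumptions for illustration, and the sketch says nothing about the hairpin or duplex structures studied in the paper.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def palindrome_mismatches(seq):
    """Count positions where a sequence differs from its reverse complement."""
    rc = reverse_complement(seq)
    return sum(x != y for x, y in zip(seq.upper(), rc))


# Both alleles differ from their reverse complement only at the central base.
for allele in ("TGGGGGCCCCA", "TGGGGACCCCA"):
    print(allele, reverse_complement(allele), palindrome_mismatches(allele))
```

An odd-length sequence can never be a perfect palindrome in this sense, because its central base would have to pair with itself; a single central mismatch is therefore the closest an 11-mer can get.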
Plasmodium falciparum Nucleosomes Exhibit Reduced Stability and Lost Sequence Dependent Nucleosome Positioning

PubMed Central

Silberhorn, Elisabeth; Schwartz, Uwe; Symelka, Anne; de Koning-Ward, Tania; Längst, Gernot

2016-01-01

The packaging and organization of genomic DNA into chromatin represents an additional regulatory layer of gene expression, with specific nucleosome positions that restrict the accessibility of regulatory DNA elements. The mechanisms that position nucleosomes in vivo are thought to depend on the biophysical properties of the histones, sequence patterns such as phased di-nucleotide repeats, and the architecture of the histone octamer that folds DNA in 1.65 tight turns. Comparative studies of human and P. falciparum histones reveal that the latter have a strongly reduced ability to recognize internal sequence-dependent nucleosome positioning signals. In contrast, the nucleosomes are positioned by AT-repeat sequences flanking nucleosomes in vivo and in vitro. Further, the strong sequence variations in the plasmodium histones, compared to mammalian histones, do not present adaptations to its AT-rich genome. Human and parasite histones bind with higher affinity to GC-rich DNA and with lower affinity to AT-rich DNA. However, the plasmodium nucleosomes are overall less stable, with increased temperature-induced mobility, decreased salt stability of the histones H2A and H2B and considerably reduced binding affinity to GC-rich DNA, as compared with the human nucleosomes. In addition, we show that plasmodium histone octamers form the shortest known nucleosome repeat length (155 bp) in vitro and in vivo. Our data suggest that the biochemical properties of the parasite histones are distinct from the typical characteristics of other eukaryotic histones and these properties reflect the increased accessibility of the P. falciparum genome. PMID:28033404
The barley EST DNA Replication and Repair Database (bEST-DRRD) as a tool for the identification of the genes involved in DNA replication and repair.

PubMed

Gruszka, Damian; Marzec, Marek; Szarejko, Iwona

2012-06-14

The high level of conservation of genes that regulate DNA replication and repair indicates that they may serve as a source of information on the origin and evolution of the species and makes them a reliable system for the identification of cross-species homologs. Studies that had been conducted to date shed light on the processes of DNA replication and repair in bacteria, yeast and mammals. However, there is still much to be learned about the process of DNA damage repair in plants. These studies, which were conducted mainly using bioinformatics tools, enabled the list of genes that participate in various pathways of DNA repair in Arabidopsis thaliana (L.) Heynh to be outlined; however, information regarding these mechanisms in crop plants is still very limited. A similar, functional approach is particularly difficult for a species whose complete genomic sequences are still unavailable. One of the solutions is to apply ESTs (Expressed Sequence Tags) as the basis for gene identification. For the construction of the barley EST DNA Replication and Repair Database (bEST-DRRD), presented here, the Arabidopsis nucleotide and protein sequences involved in DNA replication and repair were used to browse for and retrieve the deposited sequences, derived from four barley (Hordeum vulgare L.) sequence databases, including the "Barley Genome version 0.05" database (encompassing ca. 90% of barley coding sequences) and from two databases covering the complete genomes of two monocot models: Oryza sativa L. and Brachypodium distachyon L. in order to identify homologous genes. Sequences of the categorised Arabidopsis queries are used for browsing the repositories, which are located on the ViroBLAST platform. The bEST-DRRD is currently used in our project during the identification and validation of the barley genes involved in DNA repair. The presented database provides information about the Arabidopsis genes involved in DNA replication and repair, their expression patterns and models of protein interactions. It was designed and established to provide an open-access tool for the identification of monocot homologs of known Arabidopsis genes that are responsible for DNA-related processes. The barley genes identified in the project are currently being analysed to validate their function.
Uncommonly isolated clinical Pseudomonas: identification and phylogenetic assignation.

PubMed

Mulet, M; Gomila, M; Ramírez, A; Cardew, S; Moore, E R B; Lalucat, J; García-Valdés, E

2017-02-01

Fifty-two Pseudomonas strains that were difficult to identify at the species level in the phenotypic routine characterizations employed by clinical microbiology laboratories were selected for genotypic-based analysis. Species-level identifications were done initially by partial sequencing of the DNA-dependent RNA polymerase sub-unit D gene (rpoD). Two other gene sequences, for the small sub-unit ribosomal RNA (16S rRNA) and for DNA gyrase sub-unit B (gyrB), were added in a multilocus sequence analysis (MLSA) study to confirm the species identifications. These sequences were analyzed with a collection of reference sequences from the type strains of 161 Pseudomonas species within an in-house multilocus sequence analysis database. Whole-cell matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) analyses of these strains complemented the DNA sequence-based phylogenetic analyses and were observed to be in accordance with the results of the sequence data. Twenty-three out of 52 strains were assigned to 12 recognized species not commonly detected in clinical specimens and 29 (56 %) were considered representatives of at least ten putative new species. Most strains were distributed within the P. fluorescens and P. aeruginosa lineages. The value of rpoD sequences in species-level identifications for Pseudomonas is emphasized. Correct species identification of clinical strains is essential for establishing intrinsic antibiotic resistance patterns and improving treatment plans.
Molecular population dynamics of DNA structures in a bcl-2 promoter sequence is regulated by small molecules and the transcription factor hnRNP LL.

PubMed

Cui, Yunxi; Koirala, Deepak; Kang, HyunJin; Dhakal, Soma; Yangyuoru, Philip; Hurley, Laurence H; Mao, Hanbin

2014-05-01

Minute differences in the free energy change of unfolding among structures in an oligonucleotide sequence can lead to a complex population equilibrium, which is rather challenging for ensemble techniques to decipher. Herein, we introduce a new method, molecular population dynamics (MPD), to describe the intricate equilibrium among non-B deoxyribonucleic acid (DNA) structures. Using mechanical unfolding in laser tweezers, we identified six DNA species in a cytosine (C)-rich bcl-2 promoter sequence. Population patterns of these species with and without a small molecule (IMC-76 or IMC-48) or the transcription factor hnRNP LL are compared to reveal the MPD of different species. With a pattern recognition algorithm, we found that IMC-48 and hnRNP LL share 80% similarity in stabilizing i-motifs with 60 s incubation. In contrast, IMC-76 demonstrates the opposite behavior, preferring flexible DNA hairpins. With 120-180 s incubation, IMC-48 and hnRNP LL destabilize i-motifs, which has been previously proposed to activate bcl-2 transcription. These results provide strong support, from the population equilibrium perspective, that small molecules and hnRNP LL can modulate bcl-2 transcription through interaction with i-motifs. The excellent agreement with biochemical results firmly validates the MPD analyses, which, we expect, can be widely applicable to investigate complex equilibria of biomacromolecules. © 2014 The Author(s). Published by Oxford University Press [on behalf of Nucleic Acids Research].
Prasinoviruses reveal a complex evolutionary history and a patchy environmental distribution

NASA Astrophysics Data System (ADS)

Finke, J. F.; Suttle, C.

2016-02-01

Prasinophytes constitute a group of eukaryotic phytoplankton that has a global distribution and is a major component of coastal and oceanic communities. Members of this group are infected by large double-stranded DNA viruses that can be significant agents of mortality, and which show evidence of substantial horizontal transfer of genes from their hosts and other organisms. However, information on the genetic diversity of these viruses and their environmental distribution is limited. This study examines the genetic repertoire, phylogeny and environmental distribution of large double-stranded DNA viruses infecting Micromonas pusilla and other prasinophytes. The genomes of viruses infecting M. pusilla were sequenced and compared to those of viruses infecting other prasinophytes, revealing a relatively small set of core genes and a larger flexible pan genome. Comparing genomes among prasinoviruses highlights their variable genetic content and complex evolutionary history. While some of the pan genome is clearly host derived, many open reading frames are most similar to those found in other eukaryotes and bacteria. Gene content of the viruses is congruent with phylogenetic analysis of viral DNA polymerase sequences and indicates that two clades of M. pusilla viruses are less related to each other than to other prasinoviruses. Moreover, the environmental distribution of prasinovirus DNA polymerase sequences indicates a complex pattern of virus-host interactions in nature. Ultimately, these patterns are influenced by the genetic repertoire encoded by prasinoviruses, and the distribution of the hosts they infect.
Molecular characterization of a Toxocara variant from cats in Kuala Lumpur, Malaysia.

PubMed

Zhu, X Q; Jacobs, D E; Chilton, N B; Sani, R A; Cheng, N A; Gasser, R B

1998-08-01

The ascaridoid nematode of cats from Kuala Lumpur, Malaysia, previously identified morphologically as Toxocara canis, was characterized using a molecular approach. The nuclear ribosomal DNA (rDNA) region spanning the first internal transcribed spacer (ITS-1), the 5.8S gene and the second internal transcribed spacer (ITS-2) was amplified and sequenced. The sequences for the parasite from Malaysian cats were compared with those for T. canis and T. cati. The sequence data showed that this taxon was genetically more similar to T. cati than to T. canis in the ITS-1, 5.8S and ITS-2. Differences in the ITS-1 and ITS-2 sequences between the taxa (9.4-26.1%) were markedly higher than variation between samples within T. canis and T. cati (0-2.9%). The sequence data demonstrate that the parasite from Malaysian cats is neither T. canis nor T. cati and indicate that it is a distinct species. Based on these data, PCR-linked restriction fragment length polymorphism (RFLP) and single-strand conformation polymorphism (SSCP) methods were employed for the unequivocal differentiation of the Toxocara variant from T. canis and T. cati. These methods should provide valuable tools for studying the life-cycle, transmission pattern(s) and zoonotic potential of this parasite.
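The percentage differences quoted above (9.4-26.1% between taxa, 0-2.9% within) come from pairwise comparison of aligned sequences. A minimal Python sketch of such a comparison follows. It assumes the two sequences are already aligned to equal length and that gapped columns are skipped; the abstract does not say how gaps were treated, and the function name `percent_difference` is hypothetical.

```python
def percent_difference(seq_a, seq_b, gap="-"):
    """Percentage of differing bases between two pre-aligned sequences.

    Assumption: columns where either sequence has a gap are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # skip gapped alignment columns
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * mismatches / compared


# Toy aligned fragments (made-up data, not ITS sequences from the study).
print(percent_difference("ACGTACGTAC", "ACGTTCGTAA"))  # 2 of 10 sites differ -> 20.0
```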
Ribosomal DNA, tri- and bi-partite pericentromeres in the permanent translocation heterozygote Rhoeo spathacea.

PubMed

Golczyk, Hieronim; Hasterok, Robert; Szklarczyk, Marek

2010-12-01

High- and low-stringency FISH and base-specific fluorescence were performed on the permanent translocation heterozygote Rhoeo spathacea (2n = 12). Our results indicate that 45S rDNA arrays, rDNA-related sequences and other GC-rich DNA fraction(s) are located within the pericentromeric regions of all twelve chromosomes, usually colocalizing with the chromomycin A3-positive bands. Homogenization of the pericentromeric regions appears to result from the concerted spread of GC-rich sequences, with differential amplification likely. We found new 5S rDNA patterns, which suggest variability in the breakpoints and in the consequent chromosome reorganizations. It was found that the large 5S rDNA locus residing on each of the 8E and 9E arms consisted of two smaller loci. On each of the two chromosome arms 3b and 4b, in addition to the major subtelomeric 5S rDNA locus, a new minor locus was found interstitially about 40% along the arm length. The arrangement of cytogenetic landmarks and chromosome arm measurements are discussed with regard to genome repatterning in Rhoeo.
Resurrection of New Caledonian maskray Neotrygon trigonoides (Myliobatoidei: Dasyatidae) from synonymy with N. kuhlii, based on cytochrome-oxidase I gene sequences and spotting patterns.

PubMed

Borsa, Philippe; Arlyza, Irma S; Chen, Wei-Jen; Durand, Jean-Dominique; Meekan, Mark G; Shen, Kang-Ning

2013-04-01

The maskray from New Caledonia, Neotrygon trigonoides Castelnau, 1873, has been recently synonymized with the blue-spotted maskray, N. kuhlii (Müller and Henle, 1841), a species with wide Indo-West Pacific distribution, but the reasons for this are unclear. Blue-spotted maskray specimens were collected from the Indian Ocean (Tanzania, Sumatra) and the Coral Triangle (Indonesia, Taiwan, and West Papua), and N. trigonoides specimens were collected from New Caledonia (Coral-Sea). Their partial COI gene sequences were generated to expand the available DNA-barcode database on this species, which currently comprises homologous sequences from Ningaloo Reef, the Coral Triangle and the Great Barrier Reef (Coral-Sea). Spotting patterns were also compared across regions. Haplotypes from the Coral-Sea formed a haplogroup phylogenetically distinct from all other haplotypes sampled in the Indo-West Pacific. No clear-cut geographic composition relative to DNA-barcodes or spotting patterns was apparent in N. kuhlii samples across the Indian Ocean and the Coral Triangle. The New Caledonian maskray had spotting patterns markedly different from all the other samples. This, added to a substantial level of net nucleotide divergence (2.6%) with typical N. kuhlii justifies considering the New Caledonian maskray as a separate species, for which we propose to resurrect the name Neotrygon trigonoides. Copyright © 2013. Published by Elsevier SAS.
HUNT: launch of a full-length cDNA database from the Helix Research Institute.

PubMed

Yudate, H T; Suwa, M; Irie, R; Matsui, H; Nishikawa, T; Nakamura, Y; Yamaguchi, D; Peng, Z Z; Yamamoto, T; Nagai, K; Hayashi, K; Otsuki, T; Sugiyama, T; Ota, T; Suzuki, Y; Sugano, S; Isogai, T; Masuho, Y

2001-01-01

The Helix Research Institute (HRI) in Japan is releasing 4356 HUman Novel Transcripts and related information in the newly established HUNT database. The institute is a joint research project principally funded by the Japanese Ministry of International Trade and Industry, and the clones were sequenced in the governmental New Energy and Industrial Technology Development Organization (NEDO) Human cDNA Sequencing Project. The HUNT database contains an extensive amount of annotation from advanced analysis and represents an essential bioinformatics contribution towards understanding of the gene function. The HRI human cDNA clones were obtained from full-length enriched cDNA libraries constructed with the oligo-capping method and have resulted in novel full-length cDNA sequences. A large fraction has little similarity to any proteins of known function and to obtain clues about possible function we have developed original analysis procedures. Any putative function deduced here can be validated or refuted by complementary analysis results. The user can also extract information from specific categories like PROSITE patterns, PFAM domains, PSORT localization, transmembrane helices and clones with GENIUS structure assignments. The HUNT database can be accessed at http://www.hri.co.jp/HUNT.
A chromosome inversion near the KIT gene and the Tobiano spotting pattern in horses.

PubMed

Brooks, S A; Lear, T L; Adelson, D L; Bailey, E

2007-01-01

Tobiano is a white spotting pattern in horses caused by a dominant gene, Tobiano (TO). Here, we report TO associated with a large paracentric chromosome inversion on horse chromosome 3. DNA sequences flanking the inversion were identified and a PCR test was developed to detect the inversion. The inversion was only found in horses with the tobiano pattern, including horses with diverse genetic backgrounds, which indicated a common genetic origin thousands of years ago. The inversion does not interrupt any annotated genes, but begins approximately 100 kb downstream of the KIT gene. This inversion may disrupt regulatory sequences for the KIT gene and cause the white spotting pattern. Copyright (c) 2008 S. Karger AG, Basel.
Transposable elements in fish chromosomes: a study in the marine cobia species.

PubMed

Costa, G W W F; Cioffi, M B; Bertollo, L A C; Molina, W F

2013-01-01

Rachycentron canadum, a unique representative of the Rachycentridae family, has been the subject of considerable biotechnological interest due to its potential use in marine fish farming. This species has undergone extensive research concerning the location of genes and multigene families on its chromosomes. Although most of the genome of some organisms is composed of repeated DNA sequences, aspects of the origin and dispersion of these elements are still largely unknown. The physical mapping of repetitive sequences on the chromosomes of R. canadum proved to be relevant for evolutionary and applied purposes. Therefore, here, we present the mapping by fluorescence in situ hybridization of the transposable element (TE) Tol2, the non-LTR retrotransposons Rex1 and Rex3, together with the 18S and 5S rRNA genes in the chromosome of this species. The Tol2 TE, belonging to the family of hAT transposons, is homogeneously distributed in the euchromatic regions of the chromosomes but with huge colocalization with the 18S rDNA sites. The hybridization signals for Rex1 and Rex3 revealed a semi-arbitrary distribution pattern, presenting differentiated dispersion in euchromatic and heterochromatic regions. Rex1 elements are associated preferentially in heterochromatic regions, while Rex3 shows a scarce distribution in the euchromatic regions of the chromosomes. The colocalization of TEs with 18S and 5S rDNA revealed complex chromosomal regions of repetitive sequences. In addition, the nonpreferential distribution of Rex1 and Rex3 in all heterochromatic regions, as well as the preferential distribution of the Tol2 transposon associated with 18S rDNA sequences, reveals a distinct pattern of organization of TEs in the genome of this species. A heterogeneous chromosomal colonization of TEs may confer different evolutionary rates to the heterochromatic regions of this species.

Base changes in tumour DNA have the power to reveal the causes and evolution of cancer

DOE PAGES

Hollstein, M.; Alexandrov, L. B.; Wild, C. P.; ...

2016-06-06

Next-generation sequencing (NGS) technology has demonstrated that the cancer genomes are peppered with mutations. Although most somatic tumour mutations are unlikely to have any role in the cancer process per se, the spectra of DNA sequence changes in tumour mutation catalogues have the potential to identify the mutagens, and to reveal the mutagenic processes responsible for human cancer. Very recently, a novel approach for data mining of the vast compilations of tumour NGS data succeeded in separating and precisely defining at least 30 distinct patterns of sequence change hidden in mutation databases. At least half of these mutational signatures can be readily assigned to known human carcinogenic exposures or endogenous mechanisms of mutagenesis. A quantum leap in our knowledge of mutagenesis in human cancers has resulted, stimulating a flurry of research activity. We trace here the major findings leading first to the hypothesis that carcinogenic insults leave characteristic imprints on the DNA sequence of tumours, and culminating in empirical evidence from NGS data that well-defined carcinogen mutational signatures are indeed present in tumour genomic DNA from a variety of cancer types. The notion that tumour DNAs can divulge environmental sources of mutation is now a well-accepted fact. This approach to cancer aetiology has also incriminated various endogenous, enzyme-driven processes that increase the somatic mutation load in sporadic cancers. The tasks now confronting the field of molecular epidemiology are to assign mutagenic processes to orphan and newly discovered tumour mutation patterns, and to determine whether avoidable cancer risk factors influence signatures produced by endogenous enzymatic mechanisms. As a result, innovative research with experimental models and exploitation of the geographical heterogeneity in cancer incidence can address these challenges.
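As a rough illustration of what a "spectrum of DNA sequence changes" looks like as data, the Python sketch below tallies single-base substitutions into the six pyrimidine-referenced classes (C>A, C>G, C>T, T>A, T>C, T>G) that are conventionally used when summarising tumour mutation catalogues. This is only the first counting step, not the signature-extraction approach the abstract refers to; the function names and the toy mutation list are hypothetical.

```python
from collections import Counter

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}
CLASSES = ("C>A", "C>G", "C>T", "T>A", "T>C", "T>G")


def substitution_class(ref, alt):
    """Map a substitution to its pyrimidine-referenced class, e.g. G>T -> C>A."""
    ref, alt = ref.upper(), alt.upper()
    if ref not in COMPLEMENT or alt not in COMPLEMENT or ref == alt:
        raise ValueError(f"not a single-base substitution: {ref}>{alt}")
    if ref in "AG":  # purine reference: describe the change on the other strand
        ref, alt = COMPLEMENT[ref], COMPLEMENT[alt]
    return f"{ref}>{alt}"


def mutation_spectrum(mutations):
    """Return the fraction of mutations falling in each of the six classes."""
    counts = Counter(substitution_class(ref, alt) for ref, alt in mutations)
    total = sum(counts.values())
    if total == 0:
        return {cls: 0.0 for cls in CLASSES}
    return {cls: counts[cls] / total for cls in CLASSES}


# Made-up (reference base, mutated base) pairs for illustration only.
toy_mutations = [("G", "T"), ("C", "A"), ("C", "T"), ("A", "G")]
print(mutation_spectrum(toy_mutations))
```

Collapsing complementary changes onto the pyrimidine strand keeps the spectrum independent of which strand a mutation happened to be reported on.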
Base changes in tumour DNA have the power to reveal the causes and evolution of cancer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hollstein, M.; Alexandrov, L. B.; Wild, C. P.

Next-generation sequencing (NGS) technology has demonstrated that the cancer genomes are peppered with mutations. Although most somatic tumour mutations are unlikely to have any role in the cancer process per se, the spectra of DNA sequence changes in tumour mutation catalogues have the potential to identify the mutagens, and to reveal the mutagenic processes responsible for human cancer. Very recently, a novel approach for data mining of the vast compilations of tumour NGS data succeeded in separating and precisely defining at least 30 distinct patterns of sequence change hidden in mutation databases. At least half of these mutational signatures can be readily assigned to known human carcinogenic exposures or endogenous mechanisms of mutagenesis. A quantum leap in our knowledge of mutagenesis in human cancers has resulted, stimulating a flurry of research activity. We trace here the major findings leading first to the hypothesis that carcinogenic insults leave characteristic imprints on the DNA sequence of tumours, and culminating in empirical evidence from NGS data that well-defined carcinogen mutational signatures are indeed present in tumour genomic DNA from a variety of cancer types. The notion that tumour DNAs can divulge environmental sources of mutation is now a well-accepted fact. This approach to cancer aetiology has also incriminated various endogenous, enzyme-driven processes that increase the somatic mutation load in sporadic cancers. The tasks now confronting the field of molecular epidemiology are to assign mutagenic processes to orphan and newly discovered tumour mutation patterns, and to determine whether avoidable cancer risk factors influence signatures produced by endogenous enzymatic mechanisms. As a result, innovative research with experimental models and exploitation of the geographical heterogeneity in cancer incidence can address these challenges.
Genomic resources for songbird research and their use in characterizing gene expression during brain development

PubMed Central

Li, XiaoChing; Wang, Xiu-Jie; Tannenhauser, Jonathan; Podell, Sheila; Mukherjee, Piali; Hertel, Moritz; Biane, Jeremy; Masuda, Shoko; Nottebohm, Fernando; Gaasterland, Terry

2007-01-01

Vocal learning and neuronal replacement have been studied extensively in songbirds, but until recently, few molecular and genomic tools for songbird research existed. Here we describe new molecular/genomic resources developed in our laboratory. We made cDNA libraries from zebra finch (Taeniopygia guttata) brains at different developmental stages. A total of 11,000 cDNA clones from these libraries, representing 5,866 unique gene transcripts, were randomly picked and sequenced from the 3′ ends. A web-based database was established for clone tracking, sequence analysis, and functional annotations. Our cDNA libraries were not normalized. Sequencing ESTs without normalization produced many developmental stage-specific sequences, yielding insights into patterns of gene expression at different stages of brain development. In particular, the cDNA library made from brains at posthatching day 30–50, corresponding to the period of rapid song system development and song learning, has the most diverse and richest set of genes expressed. We also identified five microRNAs whose sequences are highly conserved between zebra finch and other species. We printed cDNA microarrays and profiled gene expression in the high vocal center of both adult male zebra finches and canaries (Serinus canaria). Genes differentially expressed in the high vocal center were identified from the microarray hybridization results. Selected genes were validated by in situ hybridization. Networks among the regulated genes were also identified. These resources provide songbird biologists with tools for genome annotation, comparative genomics, and microarray gene expression analysis. PMID:17426146
DNA Data Visualization (DDV): Software for Generating Web-Based Interfaces Supporting Navigation and Analysis of DNA Sequence Data of Entire Genomes.

PubMed

Neugebauer, Tomasz; Bordeleau, Eric; Burrus, Vincent; Brzezinski, Ryszard

2015-01-01

Data visualization methods are necessary during the exploration and analysis activities of an increasingly data-intensive scientific process. There are few existing visualization methods for raw nucleotide sequences of a whole genome or chromosome. Software for data visualization should allow the researchers to create accessible data visualization interfaces that can be exported and shared with others on the web. Herein, novel software developed for generating DNA data visualization interfaces is described. The software converts DNA data sets into images that are further processed as multi-scale images to be accessed through a web-based interface that supports zooming, panning and sequence fragment selection. Nucleotide composition frequencies and GC skew of a selected sequence segment can be obtained through the interface. The software was used to generate DNA data visualization of human and bacterial chromosomes. Examples of visually detectable features such as short and long direct repeats, long terminal repeats, mobile genetic elements, heterochromatic segments in microbial and human chromosomes, are presented. The software and its source code are available for download and further development. The visualization interfaces generated with the software allow for the immediate identification and observation of several types of sequence patterns in genomes of various sizes and origins. The visualization interfaces generated with the software are readily accessible through a web browser. This software is a useful research and teaching tool for genetics and structural genomics.
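The two per-segment statistics mentioned above, nucleotide composition frequencies and GC skew, are simple to compute. The Python sketch below is not taken from the DDV source code; it assumes the usual definition of GC skew, (G - C) / (G + C), and the function names are hypothetical.

```python
from collections import Counter


def nucleotide_frequencies(segment):
    """Relative frequency of A, C, G and T in a segment (other symbols ignored)."""
    counts = Counter(base for base in segment.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        return {base: 0.0 for base in "ACGT"}
    return {base: counts[base] / total for base in "ACGT"}


def gc_skew(segment):
    """GC skew, (G - C) / (G + C); returns 0.0 if the segment has no G or C."""
    seq = segment.upper()
    g = seq.count("G")
    c = seq.count("C")
    if g + c == 0:
        return 0.0
    return (g - c) / (g + c)


segment = "GGGCATAT"  # toy segment: 3 G, 1 C, 2 A, 2 T
print(nucleotide_frequencies(segment))
print(gc_skew(segment))  # (3 - 1) / (3 + 1) -> 0.5
```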
Quantum dot-based microfluidic biosensor for cancer detection

NASA Astrophysics Data System (ADS)

Ghrera, Aditya Sharma; Pandey, Chandra Mouli; Ali, Md. Azahar; Malhotra, Bansi Dhar

2015-05-01

We report results of studies relating to the fabrication of an impedimetric microfluidic-based nucleic acid sensor for quantification of DNA sequences specific to chronic myelogenous leukemia (CML). The sensor chip is prepared by patterning an indium-tin-oxide (ITO) coated glass substrate via a wet chemical etching method, followed by sealing with a polydimethylsiloxane (PDMS) microchannel for fluid control. The fabricated microfluidic chip, comprising a patterned ITO substrate, is modified by depositing cadmium selenide quantum dots (QCdSe) via the Langmuir-Blodgett technique. Further, the QCdSe surface has been functionalized with a specific DNA probe for CML detection. The probe DNA functionalized QCdSe integrated miniaturized system has been used to monitor target complementary DNA concentration by measuring the interfacial charge transfer resistance via hybridization. The presence of complementary DNA in buffer solution results in significantly decreased electro-conductivity of the interface due to the presence of a charge barrier for transport of the redox probe ions. The microfluidic DNA biosensor exhibits improved linearity in the concentration range of 10^-15 M to 10^-11 M.
Optical biosensing strategies for DNA methylation analysis.

PubMed

Nazmul Islam, Md; Yadav, Sharda; Hakimul Haque, Md; Munaz, Ahmed; Islam, Farhadul; Al Hossain, Md Shahriar; Gopalan, Vinod; Lam, Alfred K; Nguyen, Nam-Trung; Shiddiky, Muhammad J A

2017-06-15

DNA methylation is an epigenetic modification of DNA, where a methyl group is added at the fifth carbon of the cytosine base to form 5-methylcytosine (5mC) without altering the DNA sequence. It plays important roles in regulating many cellular processes by modulating the expression of key genes. Alteration in DNA methylation patterns becomes particularly important in the aetiology of different diseases including cancers. Abnormal methylation patterns could contribute to the pathogenesis of cancer either by silencing key tumor suppressor genes or by activating oncogenes. Thus, DNA methylation biosensing can help in the better understanding of cancer prognosis and diagnosis and aid the development of therapies. Over the last few decades, a plethora of optical detection techniques have been developed for analyzing DNA methylation using fluorescence, Raman spectroscopy, surface plasmon resonance (SPR), electrochemiluminescence and colorimetric readouts. This paper aims to comprehensively review the optical strategies for DNA methylation detection. We also present an overview of the remaining challenges of optical strategies that still need to be addressed, along with the lessons learnt while working with these techniques. Copyright © 2016 Elsevier B.V. All rights reserved.
The study of human Y chromosome variation through ancient DNA.

PubMed

Kivisild, Toomas

2017-05-01

High throughput sequencing methods have completely transformed the study of human Y chromosome variation by offering a genome-scale view on genetic variation retrieved from ancient human remains in the context of a growing number of high coverage whole Y chromosome sequence data from living populations from across the world. The ancient Y chromosome sequences are providing us with the first exciting glimpses into the past variation of the male-specific compartment of the genome and the opportunity to evaluate models based on previously made inferences from patterns of genetic variation in living populations. Analyses of the ancient Y chromosome sequences are challenging not only because of issues generally related to ancient DNA work, such as DNA damage-induced mutations and low content of endogenous DNA in most human remains, but also because of specific properties of the Y chromosome, such as its highly repetitive nature and high homology with the X chromosome. Shotgun sequencing of uniquely mapping regions of the Y chromosomes to sufficiently high coverage is still challenging and costly in poorly preserved samples. To increase the coverage of specific target SNPs, capture-based methods have been developed and used in recent years to generate Y chromosome sequence data from hundreds of prehistoric skeletal remains. Besides the prospect of testing directly how much genetic change in a given time period has accompanied changes in material culture, the sequencing of ancient Y chromosomes also allows us to better understand the rate at which mutations accumulate and get fixed over time. This review considers genome-scale evidence on ancient Y chromosome diversity that has recently started to accumulate in geographic areas favourable to DNA preservation. More specifically, the review focuses on examples of regional continuity and change of the Y chromosome haplogroups in North Eurasia and in the New World.
Human papillomavirus type 16 DNA in periungual squamous cell carcinomas

DOE Office of Scientific and Technical Information (OSTI.GOV)

Moy, R.L.; Eliezri, Y.D.; Bennett, R.G.

1989-05-12

Ten squamous cell carcinomas (in situ or invasive) of the fingernail region were analyzed for the presence of DNA sequences homologous to human papillomavirus (HPV) by dot blot hybridization. In most patients, the lesions were verrucae of long-term duration that were refractory to conventional treatment methods. Eight of the lesions contained HPV DNA sequences, and in six of these the sequences were related to HPV 16 as deduced from low-stringency nucleic acid hybridization followed by low- and high-stringency washes. Furthermore, the restriction endonuclease digestion pattern of DNA isolated from four of these lesions was diagnostic of episomal HPV 16. The high-frequency association of HPV 16 with periungual squamous cell carcinoma is similar to that reported for HPV 16 with squamous cell carcinomas on mucous membranes at other sites, notably the genital tract. The findings suggest that HPV 16 may play an important role in the development of squamous cell carcinomas of the finger, most notably those lesions that are chronic and located in the periungual area.
Molecular Microbial Analysis of Lactobacillus Strains Isolated from the Gut of Calves for Potential Probiotic Use

PubMed Central

Soto, Lorena P.; Frizzo, Laureano S.; Bertozzi, Ezequiel; Avataneo, Elizabeth; Sequeira, Gabriel J.; Rosmini, Marcelo R.

2010-01-01

The intestinal microbiota has an influence on the growth and health status of the hosts. This is of particular interest in animals reared using intensive farming practices. Hence, it is necessary to know more about complexity of the beneficial intestinal microbiota. The use of molecular methods has revolutionized microbial identification by improving its quality and effectiveness. The specific aim of the study was to analyze predominant species of Lactobacillus in intestinal microbial ecosystem of young calves. Forty-two lactic acid bacteria (LAB) isolated from intestinal tract of young calves were characterized by: Amplified Ribosomal DNA Restriction Analysis (ARDRA), by using Hae III, Msp I, and Hinf I restriction enzymes, and 16S rDNA gene sequencing. ARDRA screening revealed nine unique patterns among 42 isolates, with the same pattern for 29 of the isolates. Gene fragments of 16S rDNA of 19 strains representing different patterns were sequenced to confirm the identification of these species. These results confirmed that ARDRA is a good tool for identification and discrimination of bacterial species isolated from complex ecosystem and between closely related groups. This paper provides information about the LAB species predominant in intestinal tract of young calves that could provide beneficial effects when administered as probiotic. PMID:20445780
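The ARDRA screening described above groups isolates by the fragment patterns their amplified 16S rDNA gives with each enzyme. The Python sketch below imitates that in silico, using the standard recognition sites of the three enzymes named in the abstract (HaeIII GG^CC, MspI C^CGG, HinfI G^ANTC). It assumes a linear amplicon and ignores gel resolution limits; the function names and the toy sequence are hypothetical and are not data from the study.

```python
import re

# Recognition site and cut offset within the site (top strand); N = any base.
ENZYMES = {
    "HaeIII": ("GGCC", 2),   # GG^CC
    "MspI": ("CCGG", 1),     # C^CGG
    "HinfI": ("GANTC", 1),   # G^ANTC
}


def fragment_lengths(sequence, enzyme):
    """Fragment lengths (longest first) from digesting a linear sequence."""
    site, offset = ENZYMES[enzyme]
    # A lookahead lets overlapping occurrences of the site be found too.
    pattern = re.compile("(?=" + site.replace("N", "[ACGT]") + ")")
    seq = sequence.upper()
    cuts = [m.start() + offset for m in pattern.finditer(seq)]
    bounds = [0] + cuts + [len(seq)]
    return sorted((b - a for a, b in zip(bounds, bounds[1:])), reverse=True)


def ardra_pattern(sequence):
    """Combined pattern over all enzymes; isolates with equal patterns group together."""
    return tuple(tuple(fragment_lengths(sequence, e)) for e in sorted(ENZYMES))


amplicon = "AAAAGGCCTTTTTTCCGGAA"  # toy 20-base sequence, not real 16S rDNA
for name in sorted(ENZYMES):
    print(name, fragment_lengths(amplicon, name))
```

Isolates could then be grouped by using `ardra_pattern` as a dictionary key, with one representative per group sent for 16S rDNA sequencing, as the study did.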
Determination of ABO genotypes with DNA extracted from formalin-fixed, paraffin-embedded tissues.

PubMed

Yamada, M; Yamamoto, Y; Tanegashima, A; Kane, M; Ikehara, Y; Fukunaga, T; Nishi, K

1994-01-01

The gene encoding the specific glycosyltransferases which catalyze the conversion of the H antigen to A or B antigens shows a slight but distinct variation in its allelic nucleotide sequence and can be divided into 6 genotypes when digested with specific restriction enzymes. We extracted DNA from formalin-fixed, paraffin-embedded tissues using SDS/proteinase K treatment followed by phenol/chloroform extraction. The sequence of nucleotides for the A, B and O genes was amplified by the polymerase chain reaction (PCR). DNA fragments of 128 bp and 200 bp could be amplified in the second round of PCR, using an aliquot of the first round PCR product as template. Degraded DNA from paraffin blocks stored for up to 10.7 years could be successfully typed. The ABO genotype was deduced from the digestion patterns with an appropriate combination of restriction enzymes and was compatible with the phenotype obtained from the blood sample.
A rapid loss of stripes: the evolutionary history of the extinct quagga

PubMed Central

Leonard, Jennifer A; Rohland, Nadin; Glaberman, Scott; Fleischer, Robert C; Caccone, Adalgisa; Hofreiter, Michael

2005-01-01

Twenty years ago, the field of ancient DNA was launched with the publication of two short mitochondrial (mt) DNA sequences from a single quagga (Equus quagga) museum skin, an extinct South African equid (Higuchi et al. 1984 Nature 312, 282–284). This was the first extinct species from which genetic information was retrieved. The DNA sequences of the quagga showed that it was more closely related to zebras than to horses. However, quagga evolutionary history is far from clear. We have isolated DNA from eight quaggas and a plains zebra (subspecies or phenotype Equus burchelli burchelli). We show that the quagga displayed little genetic diversity and very recently diverged from the plains zebra, probably during the penultimate glacial maximum. This emphasizes the importance of Pleistocene climate changes for phylogeographic patterns in African as well as Holarctic fauna. PMID:17148190
Sex reversal in the mouse (Mus musculus) is caused by a recurrent nonreciprocal crossover involving the x and an aberrant y chromosome.

PubMed

Singh, L; Jones, K W

1982-02-01

Satellite DNA (Bkm) from the W sex-determining chromosome of snakes, which is related to sequences on the mouse Y chromosome, has been used to analyze the DNA and chromosomes of sex-reversed (Sxr) XXSxr male mice. Such mice exhibit a male-specific Southern blot Bkm hybridization pattern, consistent with the presence of Y-chromosome DNA. In situ hybridization of Bkm to chromosomes of XXSxr mice shows an aberrant concentration of related sequences on the distal terminus of a large mouse chromosome. The XYSxr carrier male, however, shows a pair of small chromosomes, which are presumed to be aberrant Y derivatives. Meiosis in the XYSxr mouse involves transfer of chromatin rich in Bkm-related DNA from the Y-Y1 complex to the X distal terminus. We suggest that this event is responsible for the transmission of the Sxr trait.
Biparental inheritance of organelles in Pelargonium: evidence for intergenomic recombination of mitochondrial DNA.

PubMed

Apitz, Janina; Weihe, Andreas; Pohlheim, Frank; Börner, Thomas

2013-02-01

While uniparental transmission of mtDNA is widespread and dominating in eukaryotes leaving mutation as the major source of genotypic diversity, recently, biparental inheritance of mitochondrial genes has been demonstrated in reciprocal crosses of Pelargonium zonale and P. inquinans. The thereby arising heteroplasmy carries the potential for recombination between mtDNAs of different descent, i.e. between the parental mitochondrial genomes. We have analyzed these Pelargonium hybrids for mitochondrial intergenomic recombination events by examining differences in DNA blot hybridization patterns of the mitochondrial genes atp1 and cob. Further investigation of these genes and their flanking regions using nucleotide sequence polymorphisms and PCR revealed DNA segments in the progeny, which contained both P. zonale and P. inquinans sequences suggesting an intergenomic recombination in hybrids of Pelargonium. This turns Pelargonium into an interesting subject for studies of recombination and evolutionary dynamics of mitochondrial genomes.
Burkholderia pseudomallei sequencing identifies genomic clades with distinct recombination, accessory, and epigenetic profiles

PubMed Central

Nandi, Tannistha; Holden, Matthew T.G.; Didelot, Xavier; Mehershahi, Kurosh; Boddey, Justin A.; Beacham, Ifor; Peak, Ian; Harting, John; Baybayan, Primo; Guo, Yan; Wang, Susana; How, Lee Chee; Sim, Bernice; Essex-Lopresti, Angela; Sarkar-Tyson, Mitali; Nelson, Michelle; Smither, Sophie; Ong, Catherine; Aw, Lay Tin; Hoon, Chua Hui; Michell, Stephen; Studholme, David J.; Titball, Richard; Chen, Swaine L.; Parkhill, Julian

2015-01-01

Burkholderia pseudomallei (Bp) is the causative agent of the infectious disease melioidosis. To investigate population diversity, recombination, and horizontal gene transfer in closely related Bp isolates, we performed whole-genome sequencing (WGS) on 106 clinical, animal, and environmental strains from a restricted Asian locale. Whole-genome phylogenies resolved multiple genomic clades of Bp, largely congruent with multilocus sequence typing (MLST). We discovered widespread recombination in the Bp core genome, involving hundreds of regions associated with multiple haplotypes. Highly recombinant regions exhibited functional enrichments that may contribute to virulence. We observed clade-specific patterns of recombination and accessory gene exchange, and provide evidence that this is likely due to ongoing recombination between clade members. Reciprocally, interclade exchanges were rarely observed, suggesting mechanisms restricting gene flow between clades. Interrogation of accessory elements revealed that each clade harbored a distinct complement of restriction-modification (RM) systems, predicted to cause clade-specific patterns of DNA methylation. Using methylome sequencing, we confirmed that representative strains from separate clades indeed exhibit distinct methylation profiles. Finally, using an E. coli system, we demonstrate that Bp RM systems can inhibit uptake of non-self DNA. Our data suggest that RM systems borne on mobile elements, besides preventing foreign DNA invasion, may also contribute to limiting exchanges of genetic material between individuals of the same species. Genomic clades may thus represent functional units of genetic isolation in Bp, modulating intraspecies genetic diversity. PMID:25236617
Molecular evolution of ependymin and the phylogenetic resolution of early divergences among euteleost fishes.

PubMed

Ortí, G; Meyer, A

1996-04-01

The rate and pattern of DNA evolution of ependymin, a single-copy gene coding for a highly expressed glycoprotein in the brain matrix of teleost fishes, is characterized and its phylogenetic utility for fish systematics is assessed. DNA sequences were determined from catfish, electric fish, and characiforms and compared with published ependymin sequences from cyprinids, salmon, pike, and herring. Among these groups, ependymin amino acid sequences were highly divergent (up to 60% sequence difference), but had surprisingly similar hydropathy profiles and invariant glycosylation sites, suggesting that functional properties of the proteins are conserved. Comparison of base composition at third codon positions and introns revealed AT-rich introns and GC-rich third codon positions, suggesting that the biased codon usage observed might not be due to mutational bias. Phylogenetic information content of third codon positions was surprisingly high and sufficient to recover the most basal nodes of the tree, in spite of the observation that pairwise distances (at third codon positions) were well above the presumed saturation level. This finding can be explained by the high proportion of phylogenetically informative nonsynonymous changes at third codon positions among these highly divergent proteins. Ependymin DNA sequences have established the first molecular evidence for the monophyly of a group containing salmonids and esociforms. In addition, ependymin suggests a sister group relationship of electric fish (Gymnotiformes) and Characiformes, constituting a significant departure from currently accepted classifications. However, relationships among characiform lineages were not completely resolved by ependymin sequences in spite of seemingly appropriate levels of variation among taxa and considerably low levels of homoplasy in the data (consistency index = 0.7). 
If the diversification of Characiformes took place in an "explosive" manner, over a relatively short period of time, this pattern should also be observed using other phylogenetic markers. Poor conservation of ependymin's primary structure hinders the design of efficient primers for PCR that could be used in wide-ranging fish systematic studies. However, alternative methods like PCR amplification from cDNA used here should provide promising comparative sequence data for the resolution of phylogenetic relationships among other basal lineages of teleost fishes.
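The base-composition comparison mentioned in this abstract (GC content at third codon positions versus introns) can be sketched as follows. This is a minimal illustration that assumes an in-frame coding sequence; the function names and example sequences are hypothetical and no ependymin data are reproduced here.

```python
def gc_fraction(seq):
    """Fraction of G or C among all bases of a sequence (e.g. an intron)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in ("G", "C") for base in seq) / len(seq)

def gc3(cds):
    """Fraction of G or C at third codon positions of an in-frame coding sequence."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # ignore a trailing partial codon
    thirds = cds[2:usable:3]           # every third base, starting at index 2
    if not thirds:
        raise ValueError("sequence shorter than one codon")
    return gc_fraction(thirds)

# Toy example (hypothetical): codons ATG GCC AAA TAA, third positions G, C, A, A.
print(gc3("ATGGCCAAATAA"))
print(gc_fraction("ATATATGC"))
```

A GC-rich value from gc3 alongside an AT-rich value from gc_fraction on the introns of the same gene would correspond to the contrast the authors report.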
Misconceptions on Missing Data in RAD-seq Phylogenetics with a Deep-scale Example from Flowering Plants.

PubMed

Eaton, Deren A R; Spriggs, Elizabeth L; Park, Brian; Donoghue, Michael J

2017-05-01

Restriction-site associated DNA (RAD) sequencing and related methods rely on the conservation of enzyme recognition sites to isolate homologous DNA fragments for sequencing, with the consequence that mutations disrupting these sites lead to missing information. There is thus a clear expectation for how missing data should be distributed, with fewer loci recovered between more distantly related samples. This observation has led to a related expectation: that RAD-seq data are insufficiently informative for resolving deeper scale phylogenetic relationships. Here we investigate the relationship between missing information among samples at the tips of a tree and information at edges within it. We re-analyze and review the distribution of missing data across ten RAD-seq data sets and carry out simulations to determine expected patterns of missing information. We also present new empirical results for the angiosperm clade Viburnum (Adoxaceae, with a crown age >50 Ma) for which we examine phylogenetic information at different depths in the tree and with varied sequencing effort. The total number of loci, the proportion that are shared, and phylogenetic informativeness varied dramatically across the examined RAD-seq data sets. Insufficient or uneven sequencing coverage accounted for similar proportions of missing data as dropout from mutation-disruption. Simulations reveal that mutation-disruption, which results in phylogenetically distributed missing data, can be distinguished from the more stochastic patterns of missing data caused by low sequencing coverage. In Viburnum, doubling sequencing coverage nearly doubled the number of parsimony informative sites, and increased by >10X the number of loci with data shared across >40 taxa. Our analysis leads to a set of practical recommendations for maximizing phylogenetic information in RAD-seq studies. 
[hierarchical redundancy; phylogenetic informativeness; quartet informativeness; Restriction-site associated DNA (RAD) sequencing; sequencing coverage; Viburnum.]
Mitochondrial DNA pattern of the fine shrimp Metapenaeus elegans (De Man, 1907) in the lagoon of Segara Anakan, Central Java, using Hind III

NASA Astrophysics Data System (ADS)

Nugraha, Fitra Arya Dwi; Holil, Kholifah; Kurniawan, Nia

2017-05-01

Ecological damage to the Lagoon of Segara Anakan, Central Java, together with large-scale and continuous exploitation, is threatening the sustainability of fine shrimp (Metapenaeus elegans) resources. Information on genetic resources is crucial for establishing long-term conservation programs and preserving germplasm quality. This study aims to evaluate the number and size of the fragments produced by digestion with the restriction enzyme Hind III. Seven individuals of Metapenaeus elegans from the Lagoon of Segara Anakan were examined using Hind III. Amplification of mitochondrial DNA yielded a 950 bp product, and digestion with Hind III generated four fragments of 114 bp, 200 bp, 250 bp, and 386 bp, which formed a monomorphic pattern. The restriction pattern suggested homozygosity of the alleles restricted with Hind III. Homozygosity indicates no variation in DNA sequence.
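A quick consistency check on the reported digest can be written as below. It assumes a linear PCR amplicon and a complete digestion, in which case the fragment sizes should add up to the amplicon length and n fragments imply n - 1 cut sites. The helper name is hypothetical.

```python
amplicon_bp = 950
fragments_bp = [114, 200, 250, 386]

def check_digest(amplicon, fragments):
    """Return (sizes_add_up, n_cut_sites) for a complete digest of a linear amplicon."""
    sizes_add_up = sum(fragments) == amplicon
    n_cut_sites = len(fragments) - 1
    return sizes_add_up, n_cut_sites

sizes_add_up, n_sites = check_digest(amplicon_bp, fragments_bp)
print(sizes_add_up, n_sites)
```

Under these assumptions the four fragments account for the full 950 bp and correspond to three Hind III sites in the amplified region.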
The Influence of Hydroxylation on Maintaining CpG Methylation Patterns: A Hidden Markov Model Approach.

PubMed

Giehr, Pascal; Kyriakopoulos, Charalampos; Ficz, Gabriella; Wolf, Verena; Walter, Jörn

2016-05-01

DNA methylation and demethylation are opposing processes that when in balance create stable patterns of epigenetic memory. The control of DNA methylation pattern formation by replication dependent and independent demethylation processes has been suggested to be influenced by Tet mediated oxidation of 5mC. Several alternative mechanisms have been proposed suggesting that 5hmC influences either replication dependent maintenance of DNA methylation or replication independent processes of active demethylation. Using high resolution hairpin oxidative bisulfite sequencing data, we precisely determine the amount of 5mC and 5hmC and model the contribution of 5hmC to processes of demethylation in mouse ESCs. We develop an extended hidden Markov model capable of accurately describing the regional contribution of 5hmC to demethylation dynamics. Our analysis shows that 5hmC has a strong impact on replication dependent demethylation, mainly by impairing methylation maintenance.
Identifying the bacterial community on the surface of Intralox belting in a meat boning room by culture-dependent and culture-independent 16S rDNA sequence analysis.

PubMed

Brightwell, Gale; Boerema, Jackie; Mills, John; Mowat, Eilidh; Pulford, David

2006-05-25

We examined the bacterial community present on an Intralox conveyor belt system in an operating lamb boning room by sequencing the 16S ribosomal DNA (rDNA) of bacteria extracted in the presence or absence of cultivation. RFLP patterns for 16S rDNA clone library and cultures were generated using HaeIII and MspI restriction endonucleases. 16S rDNA amplicons produced 8 distinct RFLP pattern groups. RFLP groups I-IV were represented in the clone library and RFLP groups I and V-VIII were represented amongst the cultured isolates. Partial DNA sequences from each RFLP group revealed that all group I, II and VIII representatives were Pseudomonas spp., group III were Sphingomonas spp., group IV clones were most similar to an uncultured alpha proteobacterium, group V was similar to a Serratia spp., group VI with an Alcaligenes spp., and group VII with Microbacterium spp. Sphingomonads were numerically dominant in the culture-independent clone library and along with the group IV alpha proteobacterium were not represented amongst the cultured isolates. Serratia, Alcaligenes and Microbacterium spp. were only represented with cultured isolates. Pseudomonads were detected by both culture-dependent (84% of isolates) and culture-independent (12.5% of clones) methods and their presence at high frequency does pose the risk of product spoilage if transferred onto meat stored under aerobic conditions. The detection of sphingomonads in large numbers by the culture-independent method demands further analysis because sphingomonads may represent a new source of meat spoilage that has not been previously recognised in the meat processing environment. The 16S rDNA collections generated by both methods were important at representing the diversity of the bacterial population associated with an Intralox conveyor belt system.
Molecular Cytogenetic Analysis of Deschampsia antarctica Desv. (Poaceae), Maritime Antarctic.

PubMed

Amosova, Alexandra V; Bolsheva, Nadezhda L; Samatadze, Tatiana E; Twardovska, Maryana O; Zoshchuk, Svyatoslav A; Andreev, Igor O; Badaeva, Ekaterina D; Kunakh, Viktor A; Muravenko, Olga V

2015-01-01

Deschampsia antarctica Desv. (Poaceae) (2n = 26) is one of the two vascular plants adapted to the harshest environment of the Antarctic. Although the species is a valuable model for study of environmental stress tolerance in plants, its karyotype is still poorly investigated. We conducted, for the first time, a comprehensive molecular cytogenetic analysis of D. antarctica collected on four islands of the Maritime Antarctic. D. antarctica karyotypes were studied by Giemsa C- and DAPI/C-banding, Ag-NOR staining, multicolour fluorescence in situ hybridization with repeated DNA probes (pTa71, pTa794, telomere repeats, pSc119.2, pAs1) and the GAA simple sequence repeat probe. We also performed sequential rapid in situ hybridization with genomic DNA of D. caespitosa. Two chromosome pairs bearing transcriptionally active 45S rDNA loci and five pairs with 5S rDNA sites were detected. A weak intercalary site of telomere repeats was revealed on the largest chromosome in addition to telomere hybridization signals at terminal positions. This fact indirectly confirms the hypothesis that chromosome fusion might have been the cause of the chromosome number in this species, which is unusual for cereals. Based on patterns of distribution of the examined molecular cytogenetic markers, all chromosomes in karyotypes were identified, and chromosome idiograms of D. antarctica were constructed. B chromosomes were found in most karyotypes of plants from Darboux Island. A mixoploid plant with mainly triploid cells bearing a Robertsonian rearrangement was detected among typical diploid specimens from Great Jalour Island. The karyotype variability found in D. antarctica is probably an expression of genome instability induced by environmental stress factors. The differences in C-banding patterns and in chromosome distribution of rDNA loci as well as homologous highly repeated DNA sequences detected between genomes of D. antarctica and its related species D. caespitosa indicate that genome reorganization involving coding and noncoding repeated DNA sequences occurred during the divergence of these species. PMID:26394331
Evolution of DNA Methylation across Insects.

PubMed

Bewick, Adam J; Vogel, Kevin J; Moore, Allen J; Schmitz, Robert J

2017-03-01

DNA methylation contributes to gene and transcriptional regulation in eukaryotes, and therefore has been hypothesized to facilitate the evolution of plastic traits such as sociality in insects. However, DNA methylation is sparsely studied in insects. Therefore, we documented patterns of DNA methylation across a wide diversity of insects. We predicted that underlying enzymatic machinery is concordant with patterns of DNA methylation. Finally, given the suggestion that DNA methylation facilitated social evolution in Hymenoptera, we tested the hypothesis that the DNA methylation system will be associated with presence/absence of sociality among other insect orders. We found DNA methylation to be widespread, detected in all orders examined except Diptera (flies). Whole genome bisulfite sequencing showed that orders differed in levels of DNA methylation. Hymenoptera (ants, bees, wasps and sawflies) had some of the lowest levels, including several potential losses. Blattodea (cockroaches and termites) showed all possible patterns, including a potential loss of DNA methylation in a eusocial species, whereas solitary species had the highest levels. Species with DNA methylation do not always possess the typical enzymatic machinery. We identified a gene duplication event in the maintenance DNA methyltransferase 1 (DNMT1) that is shared by some Hymenoptera, and paralogs have experienced divergent, nonneutral evolution. This diversity and nonneutral evolution of underlying machinery suggests alternative DNA methylation pathways may exist. Phylogenetically corrected comparisons revealed no evidence that supports evolutionary association between sociality and DNA methylation. Future functional studies will be required to advance our understanding of DNA methylation in insects.
Human papillomavirus hpv-16 DNA as an epitheliotropic virus that induces hyperproliferation in squamous penile tissue.

PubMed

Salazar, Edith L; Mercado, E; Calzada, L

2005-01-01

The prevalence of human papillomavirus HPV-16 DNA sequences in 57 penile carcinoma biopsies was examined using the polymerase chain reaction (PCR) with type-specific internal probes, employing HPV consensus primers from the L1 region. The cases comprised 39 typical squamous cell carcinomas and 18 specimens of different subtypes. PCR products were analyzed and HPV-16 DNA was detected in a high percentage of specimens. Thirty-eight biopsies were HPV-16 DNA positive. This determination was correlated with cellular differentiation and growth pattern. Our data corroborate that squamous cell carcinoma was invariably associated with HPV-16 DNA.
Bi-PROF

PubMed Central

Gries, Jasmin; Schumacher, Dirk; Arand, Julia; Lutsik, Pavlo; Markelova, Maria Rivera; Fichtner, Iduna; Walter, Jörn; Sers, Christine; Tierling, Sascha

2013-01-01

The use of next generation sequencing has expanded our view on whole mammalian methylome patterns. In particular, it provides genome-wide insight into local DNA methylation diversity at the single nucleotide level and enables the examination of single chromosome sequence sections at sufficient statistical power. We describe a bisulfite-based sequence profiling pipeline, Bi-PROF, which is based on the 454 GS-FLX Titanium technology and allows up to one million sequence stretches to be obtained at single base pair resolution without laborious subcloning. To illustrate the performance of the experimental workflow connected to a bioinformatics program pipeline (BiQ Analyzer HT), we present a test analysis set of 68 different epigenetic marker regions (amplicons) in five individual patient-derived xenograft tissue samples of colorectal cancer and one healthy colon epithelium sample as a control. After the 454 GS-FLX Titanium run, sequence read processing and sample decoding, the obtained alignments are quality controlled and statistically evaluated. Comprehensive methylation pattern interpretation (profiling), assessed by analyzing 10^2 to 10^4 sequence reads per amplicon, allows an unprecedented deep view of pattern formation and methylation marker heterogeneity in tissues affected by complex diseases like cancer. PMID:23803588
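The core of bisulfite read evaluation can be sketched in a few lines. The sketch is a simplified illustration and not the Bi-PROF or BiQ Analyzer HT implementation. It assumes forward-strand reads already aligned to the reference without indels (same length as the reference), with unmethylated C read as T and methylated C retained as C. All function names and sequences are hypothetical.

```python
def cpg_positions(ref):
    """Indices of the C in every CpG dinucleotide of the reference."""
    return [i for i in range(len(ref) - 1) if ref[i:i + 2] == "CG"]

def methylation_levels(ref, reads):
    """Per-CpG fraction of reads that retain C (i.e. are called methylated)."""
    levels = {}
    for pos in cpg_positions(ref):
        calls = [r[pos] for r in reads if r[pos] in ("C", "T")]
        levels[pos] = sum(c == "C" for c in calls) / len(calls) if calls else None
    return levels

def conversion_efficiency(ref, reads):
    """Fraction of non-CpG cytosines read as T, pooled over all reads."""
    cpg = set(cpg_positions(ref))
    non_cpg_c = [i for i, b in enumerate(ref) if b == "C" and i not in cpg]
    calls = [r[i] for r in reads for i in non_cpg_c if r[i] in ("C", "T")]
    return sum(c == "T" for c in calls) / len(calls) if calls else None

# Toy data (hypothetical): CpGs at indices 1 and 7, one non-CpG C at index 4.
ref = "ACGTCAACGT"
reads = ["ACGTTAATGT", "ACGTTAACGT", "ATGTTAATGT", "ACGTCAATGT"]
levels = methylation_levels(ref, reads)
efficiency = conversion_efficiency(ref, reads)
print(levels, efficiency)
```

With hundreds to thousands of reads per amplicon, the same per-position fractions become precise enough to compare methylation patterns between samples.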
Analysis of Chromatin Regulators Reveals Specific Features of Rice DNA Methylation Pathways.

PubMed

Tan, Feng; Zhou, Chao; Zhou, Qiangwei; Zhou, Shaoli; Yang, Wenjing; Zhao, Yu; Li, Guoliang; Zhou, Dao-Xiu

2016-07-01

Plant DNA methylation that occurs at CG, CHG, and CHH sites (H = A, C, or T) is a hallmark of the repression of repetitive sequences and transposable elements (TEs). The rice (Oryza sativa) genome contains about 40% repetitive sequence and TEs and displays specific patterns of genome-wide DNA methylation. The mechanism responsible for the specific methylation patterns is unclear. Here, we analyzed the function of OsDDM1 (Deficient in DNA Methylation 1) and OsDRM2 (Domains Rearranged Methylase 2) in genome-wide DNA methylation, TE repression, small RNA accumulation, and gene expression. We show that OsDDM1 is essential for high levels of methylation at CHG and, to a lesser extent, CG sites in heterochromatic regions and also is required for CHH methylation that is mainly located in the genic regions of the genome. In addition to a large number of TEs, loss of OsDDM1 leads to hypomethylation and up-regulation of many protein-coding genes, producing very severe growth phenotypes at the initial generation. Importantly, we show that OsDRM2 mutation results in a nearly complete loss of CHH methylation and derepression of mainly small TE-associated genes and that OsDDM1 is involved in facilitating OsDRM2-mediated CHH methylation. Thus, the function of OsDDM1 and OsDRM2 defines distinct DNA methylation pathways in the bulk of DNA methylation of the genome, which is possibly related to the dispersed heterochromatin across chromosomes in rice and suggests that DNA methylation mechanisms may vary among different plant species.
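The three sequence contexts named in this abstract (CG, CHG, and CHH, with H = A, C, or T) can be assigned to each cytosine from the two bases that follow it. The sketch below is a minimal illustration for the forward strand only; the function names and the example sequence are hypothetical.

```python
from collections import Counter

H = {"A", "C", "T"}  # IUPAC code H: any base except G

def classify_cytosine(seq, i):
    """Return 'CG', 'CHG', 'CHH', or None for the cytosine at index i (forward strand)."""
    if seq[i] != "C":
        return None
    first = seq[i + 1] if i + 1 < len(seq) else ""
    second = seq[i + 2] if i + 2 < len(seq) else ""
    if first == "G":
        return "CG"
    if first in H and second == "G":
        return "CHG"
    if first in H and second in H:
        return "CHH"
    return None  # too close to the 3' end, or followed by an ambiguous base

def context_counts(seq):
    """Count cytosines per context, skipping those that cannot be classified."""
    seq = seq.upper()
    contexts = (classify_cytosine(seq, i) for i in range(len(seq)))
    return Counter(c for c in contexts if c is not None)

# Toy sequence (hypothetical): one cytosine in each context, plus a terminal C.
counts = context_counts("ACGTCAGCTTC")
print(counts)
```

Analysing the reverse strand works the same way on the reverse complement of the sequence.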
Characterization of rpoB mutations in rifampin-resistant clinical Mycobacterium tuberculosis isolates from Kuwait and Dubai.

PubMed

Ahmad, Suhail; Mokaddas, Eiman; Fares, Esther

2002-11-01

Mutations conferring resistance to rifampin in rifampin-resistant clinical Mycobacterium tuberculosis isolates occur mostly in the 81 bp rifampin-resistance-determining region (RRDR) of the rpoB gene. In this study, 29 rifampin-resistant and 12 rifampin-susceptible clinical M. tuberculosis isolates were tested for characterization of mutations in the rpoB gene by line probe (INNO-LiPA Rif. TB) assay, and the results were confirmed and extended by DNA sequencing of the PCR-amplified target DNA. The line probe assay identified all 12 susceptible strains as rifampin-sensitive, and the DNA sequence of the RRDR in the amplified rpoB gene from two isolates matched perfectly with the wild-type sequence. The line probe assay identified 28 resistant isolates as rifampin-resistant, with specific detection of a mutation in 22 isolates, including one isolate that exhibited hetero-resistance, containing both the wild-type pattern and a specific mutation within the RRDR, while one of the rifampin-resistant strains was identified as rifampin-susceptible. DNA sequencing confirmed these results and, in addition, led to the specific detection of mutations in 5 rifampin-resistant isolates in which specific base changes within the RRDR could not be determined by the line probe assay. These analyses identified 8 different mutations within the RRDR of the rpoB gene, including one novel mutation (S522W) that has not been reported so far. The genotyping performed on the isolates carrying similar mutations showed that the majority of these isolates were unique, as they exhibited varying DNA banding patterns. Correlating the ethnic origin of the infected TB patients with the occurrence of specific mutations at three main codon positions (516, 526 and 531) in the rpoB gene showed that most isolates from patients of South Asian origin (11 of 15) contained mutations at codon 526, while the majority of isolates from patients of Middle Eastern origin (6 of 11) contained mutations at codon 531.
DNA adducts of ethylene dibromide: Aspects of formation and mutagenicity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cmarik, J.L.

1,2-Dibromoethane (ethylene dibromide, EDB), a potential human carcinogen, undergoes bioactivation by the pathway of glutathione (GSH) conjugation, which generates a reactive intermediate capable of alkylating DNA. The major DNA adduct formed is S-[2-(N7-guanyl)ethyl]GSH. This dissertation examined the bioactivation of EDB and the formation of DNA adducts. The selectivity of purified rat and human GSH S-transferases for EDB was examined in vitro. An assay was developed to measure the formation of S,S'-ethylene-bis(GSH). The α class of the GSH S-transferases was responsible for the majority of EDB-GSH conjugation with both the rat and human enzymes. Human tissue samples from a victim of EDB poisoning were analyzed for S-[2-(N7-guanyl)ethyl]GSH utilizing electrochemical detection. No adducts were detected in samples of brain, heart, or kidney. The pattern of alkylation of guanines in fragments of plasmid pBR322 DNA by S-(2-chloroethyl)GSH and related compounds was determined. Alkylation varied approximately ten-fold in intensity and was strongest in runs of guanines. Few differences were observed in the alkylation patterns generated by the different compounds tested. The spectrum of mutations caused by S-(2-chloroethyl)GSH was determined using an M13 bacteriophage forward mutation assay. The majority of mutations (70%) were G:C to A:T transitions. Participation of the N7-guanyl adduct in the mutagenic process is strongly implicated. The sequence selectivity of alkylation in the region of M13 sequenced in the mutation assay was determined. Comparison of the sequence selectivity with the mutation spectrum revealed no obligate relationship between the extent of adduct formation and the number of mutations which resulted at different sites. Sequence context appears to exert a strong influence on the processing of lesions. These studies strongly implicate S-[2-(N7-guanyl)ethyl]GSH as a mutagenic lesion formed by EDB.
Circadian clock protein KaiC forms ATP-dependent hexameric rings and binds DNA.

PubMed

Mori, Tetsuya; Saveliev, Sergei V; Xu, Yao; Stafford, Walter F; Cox, Michael M; Inman, Ross B; Johnson, Carl H

2002-12-24

KaiC from Synechococcus elongatus PCC 7942 (KaiC) is an essential circadian clock protein in cyanobacteria. Previous sequence analyses suggested its inclusion in the RecA/DnaB superfamily. A characteristic of the proteins of this superfamily is that they form homohexameric complexes that bind DNA. We show here that KaiC also forms ring complexes with a central pore that can be visualized by electron microscopy. A combination of analytical ultracentrifugation and chromatographic analyses demonstrates that these complexes are hexameric. The association of KaiC molecules into hexamers depends on the presence of ATP. The KaiC sequence does not include the obvious DNA-binding motifs found in RecA or DnaB. Nevertheless, KaiC binds forked DNA substrates. These data support the inclusion of KaiC into the RecA/DnaB superfamily and have important implications for enzymatic activity of KaiC in the circadian clock mechanism that regulates global changes in gene expression patterns.
Estimating the efficiency of fish cross-species cDNA microarray hybridization.

PubMed

Cohen, Raphael; Chalifa-Caspi, Vered; Williams, Timothy D; Auslander, Meirav; George, Stephen G; Chipman, James K; Tom, Moshe

2007-01-01

Using an available cross-species cDNA microarray is advantageous for examining multigene expression patterns in non-model organisms, saving the need for construction of species-specific arrays. The aim of the present study was to estimate relative efficiency of cross-species hybridizations across bony fishes, using bioinformatics tools. The methodology may serve also as a model for similar evaluations in other taxa. The theoretical evaluation was done by substituting comparative whole-transcriptome sequence similarity information into the thermodynamic hybridization equation. Complementary DNA sequence assemblages of nine fish species belonging to common families or suborders and distributed across the bony fish taxonomic branch were selected for transcriptome-wise comparisons. Actual cross-species hybridizations among fish of different taxonomic distances were used to validate and eventually to calibrate the theoretically computed relative efficiencies.
Evolutionary Patterns and Processes: Lessons from Ancient DNA.

PubMed

Leonardi, Michela; Librado, Pablo; Der Sarkissian, Clio; Schubert, Mikkel; Alfarhan, Ahmed H; Alquraishi, Saleh A; Al-Rasheid, Khaled A S; Gamba, Cristina; Willerslev, Eske; Orlando, Ludovic

2017-01-01

Ever since its emergence in 1984, the field of ancient DNA has struggled to overcome the challenges related to the decay of DNA molecules in the fossil record. With the recent development of high-throughput DNA sequencing technologies and molecular techniques tailored to ultra-damaged templates, it has now come of age, merging together approaches in phylogenomics, population genomics, epigenomics, and metagenomics. Leveraging on complete temporal sample series, ancient DNA provides direct access to the most important dimension in evolution—time, allowing a wealth of fundamental evolutionary processes to be addressed at unprecedented resolution. This review taps into the most recent findings in ancient DNA research to present analyses of ancient genomic and metagenomic data.
Evolutionary Patterns and Processes: Lessons from Ancient DNA

PubMed Central

Leonardi, Michela; Librado, Pablo; Der Sarkissian, Clio; Schubert, Mikkel; Alfarhan, Ahmed H.; Alquraishi, Saleh A.; Al-Rasheid, Khaled A. S.; Gamba, Cristina; Willerslev, Eske

2017-01-01

Abstract Ever since its emergence in 1984, the field of ancient DNA has struggled to overcome the challenges related to the decay of DNA molecules in the fossil record. With the recent development of high-throughput DNA sequencing technologies and molecular techniques tailored to ultra-damaged templates, it has now come of age, merging together approaches in phylogenomics, population genomics, epigenomics, and metagenomics. Leveraging on complete temporal sample series, ancient DNA provides direct access to the most important dimension in evolution—time, allowing a wealth of fundamental evolutionary processes to be addressed at unprecedented resolution. This review taps into the most recent findings in ancient DNA research to present analyses of ancient genomic and metagenomic data. PMID:28173586
Molecular diversity and distribution pattern of ciliates in sediments from deep-sea hydrothermal vents in the Okinawa Trough and adjacent sea areas

NASA Astrophysics Data System (ADS)

Zhao, Feng; Xu, Kuidong

2016-10-01

In comparison with the macrobenthos and prokaryotes, patterns of diversity and distribution of microbial eukaryotes in deep-sea hydrothermal vents are poorly known. The widely used high-throughput sequencing of 18S rDNA has revealed a high diversity of microeukaryotes yielded from both living organisms and buried DNA in marine sediments. More recently, cDNA surveys have been utilized to uncover the diversity of active organisms. However, neither method has been used to evaluate the diversity of ciliates in hydrothermal vents. By using high-throughput DNA and cDNA sequencing of 18S rDNA, we evaluated the molecular diversity of ciliates, a representative group of microbial eukaryotes, from the sediments of deep-sea hydrothermal vents in the Okinawa Trough and compared it with that of an adjacent deep-sea area about 15 km away and that of an offshore area of the Yellow Sea about 500 km away. The results of DNA sequencing showed that Spirotrichea and Oligohymenophorea were the most diverse and abundant groups in all three habitats. The proportion of sequences of Oligohymenophorea was the highest in the hydrothermal vents whereas Spirotrichea was the most diverse group at all three habitats. Plagiopyleans were found only in the hydrothermal vents but with low diversity and abundance. By contrast, the cDNA sequencing showed that Plagiopylea was the most diverse and most abundant group in the hydrothermal vents, followed by Spirotrichea in terms of diversity and Oligohymenophorea in terms of relative abundance. A novel group of ciliates, distinctly separate from the 12 known classes, was detected in the hydrothermal vents, indicating undescribed, possibly highly divergent ciliates may inhabit this environment.
Statistical analyses showed that (i) the three habitats differed significantly from one another in terms of diversity of both the rare and the total ciliate taxa, and (ii) the adjacent deep sea was more similar to the offshore area than to the hydrothermal vents. In terms of the diversity of abundant taxa, however, there was no significant difference between the hydrothermal vents and the adjacent deep sea, both of which differed significantly from the offshore area. As abundant ciliate taxa can be found in several sampling sites, they are likely adapted to large environmental variations, while rare taxa are found in specific habitats and thus are potentially more sensitive to varying environmental conditions.
CRISPR-Cas systems exploit viral DNA injection to establish and maintain adaptive immunity.

PubMed

Modell, Joshua W; Jiang, Wenyan; Marraffini, Luciano A

2017-04-06

Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas systems provide protection against viral and plasmid infection by capturing short DNA sequences from these invaders and integrating them into the CRISPR locus of the prokaryotic host. These sequences, known as spacers, are transcribed into short CRISPR RNA guides that specify the cleavage site of Cas nucleases in the genome of the invader. It is not known when spacer sequences are acquired during viral infection. Here, to investigate this, we tracked spacer acquisition in Staphylococcus aureus cells harbouring a type II CRISPR-Cas9 system after infection with the staphylococcal bacteriophage ϕ12. We found that new spacers were acquired immediately after infection preferentially from the cos site, the viral free DNA end that is first injected into the cell. Analysis of spacer acquisition after infection with mutant phages demonstrated that most spacers are acquired during DNA injection, but not during other stages of the viral cycle that produce free DNA ends, such as DNA replication or packaging. Finally, we showed that spacers acquired from early-injected genomic regions, which direct Cas9 cleavage of the viral DNA immediately after infection, provide better immunity than spacers acquired from late-injected regions. Our results reveal that CRISPR-Cas systems exploit the phage life cycle to generate a pattern of spacer acquisition that ensures a successful CRISPR immune response.
Evidence that steroid 5alpha-reductase isozyme genes are differentially methylated in human lymphocytes.

PubMed

Rodríguez-Dorantes, M; Lizano-Soberón, M; Camacho-Arroyo, I; Calzada-León, R; Morimoto, S; Téllez-Ascencio, N; Cerbón, M A

2002-03-01

The synthesis of dihydrotestosterone (DHT) is catalyzed by steroid 5alpha-reductase isozymes 1 and 2, and this function determines the development of the male phenotype during embryogenesis and the growth of androgen-sensitive tissues during puberty. The aim of this study was to determine the cytosine methylation status of the 5alpha-reductase isozyme type 1 and 2 genes in normal and in 5alpha-reductase deficient men. Genomic DNA was obtained from lymphocytes of both normal subjects and patients with primary 5alpha-reductase deficiency due to point mutations in the 5alpha-reductase 2 gene. Southern blot analysis of 5alpha-reductase types 1 and 2 genes from DNA samples digested with HpaII presented a different cytosine methylation pattern compared to that observed with its isoschizomer MspI, indicating that both genes are methylated in CCGG sequences. The analysis of the 5alpha-reductase 1 gene from DNA samples digested with Sau3AI and its isoschizomer MboI, which recognize methylation in GATC sequences, showed an identical methylation pattern. In contrast, the 5alpha-reductase 2 gene digested with Sau3AI presented a different methylation pattern to that of the samples digested with MboI, indicating that the steroid 5alpha-reductase 2 gene possesses methylated cytosines in GATC sequences. Analysis of exon 4 of the 5alpha-reductase 2 gene after metabisulfite PCR showed that normal and deficient subjects present a different methylation pattern, being more methylated in patients with a mutated 5alpha-reductase 2 gene. The overall results suggest that 5alpha-reductase genes 1 and 2 are differentially methylated in lymphocytes from normal and 5alpha-reductase deficient patients. Moreover, the extensive cytosine methylation pattern observed in exon 4 of the 5alpha-reductase 2 gene in deficient patients points to an increased rate of mutations in this gene.
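The HpaII/MspI comparison described in this record can be illustrated with a small Python sketch. It is not the authors' procedure: it assumes only the commonly documented behaviour that both enzymes cut C^CGG and that HpaII, unlike MspI, is blocked when the internal cytosine is methylated. The function name `digest_ccgg`, the toy sequence and the methylated position are hypothetical, and a real Southern blot would show only the fragments that hybridize to the probe.

```python
def digest_ccgg(dna: str, methylated: set[int], sensitive: bool) -> list[int]:
    """Return fragment lengths after cutting a linear sequence at C^CGG sites.

    methylated: 0-based indices of methylated cytosines (hypothetical input).
    sensitive:  True models HpaII (blocked by a methylated internal C),
                False models MspI (cuts regardless of that methylation).
    """
    dna = dna.upper()
    cuts = []
    start = dna.find("CCGG")
    while start != -1:
        internal_c = start + 1
        if not (sensitive and internal_c in methylated):
            cuts.append(start + 1)  # cut falls between the two cytosines
        start = dna.find("CCGG", start + 1)
    bounds = [0, *cuts, len(dna)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    toy = "AAAACCGGTTTTTTCCGGAA"   # made-up sequence with two CCGG sites
    methylated = {15}              # internal C of the second site
    print("MspI :", digest_ccgg(toy, methylated, sensitive=False))
    print("HpaII:", digest_ccgg(toy, methylated, sensitive=True))
```

A difference between the two fragment lists is the signal that a CCGG site carries a methylated internal cytosine; identical lists indicate no detectable methylation at those sites.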
Use of continuous/contiguous stacking hybridization as a diagnostic tool

DOEpatents

Mirzabekov, Andrei Darievich; Kirillov, Eugene Vladislavovich; Parinov, Sergei Valeryevich; Barski, Victor Evgenievich; Dubiley, Svetlana Alekseevna

2002-01-01

A method for detecting disease-associated alleles in patient genetic material is provided whereby a first group of oligonucleotide molecules, synthesized to complement base sequences of the disease-associated alleles, is immobilized on a predetermined position on a substrate, and then contacted with patient genetic material to form duplexes. The duplexes are then contacted with a second group of oligonucleotide molecules which are synthesized to extend the predetermined length of the oligonucleotide molecules of the first group, and where each of the oligonucleotide molecules of the second group are tagged and either incorporate universal bases or a mixture of guanine, cytosine, thymine, and adenine, or complementary nucleotide strands that are tagged with a different fluorochrome which radiates light at a predetermined wavelength. The treated substrate is then washed and the light patterns radiating therefrom are compared with predetermined light patterns of various diseases that were prepared on identical substrates. A method is also provided for determining the length of a repeat sequence in DNA or RNA, and also for determining the base sequence of unknown DNA or RNA.
Use of continuous/contiguous stacking hybridization as a diagnostic tool

DOEpatents

Mirzabekov, Andrei Darievich; Kirillov, Eugene Vladislavovich; Parinov, Sergei Valeryevich; Barski, Victor Evgenievich; Dubiley, Svetlana Alekseevna

2000-01-01

A method for detecting disease-associated alleles in patient genetic material is provided whereby a first group of oligonucleotide molecules, synthesized to complement base sequences of the disease-associated alleles, is immobilized on a predetermined position on a substrate, and then contacted with patient genetic material to form duplexes. The duplexes are then contacted with a second group of oligonucleotide molecules which are synthesized to extend the predetermined length of the oligonucleotide molecules of the first group, and where each of the oligonucleotide molecules of the second group are tagged and either incorporate universal bases or a mixture of guanine, cytosine, thymine, and adenine, or complementary nucleotide strands that are tagged with a different fluorochrome which radiates light at a predetermined wavelength. The treated substrate is then washed and the light patterns radiating therefrom are compared with predetermined light patterns of various diseases that were prepared on identical substrates. A method is also provided for determining the length of a repeat sequence in DNA or RNA, and also for determining the base sequence of unknown DNA or RNA.
HLA DNA Sequence Variation among Human Populations: Molecular Signatures of Demographic and Selective Events

PubMed Central

Buhler, Stéphane; Sanchez-Mazas, Alicia

2011-01-01

Molecular differences between HLA alleles vary up to 57 nucleotides within the peptide binding coding region of human Major Histocompatibility Complex (MHC) genes, but it is still unclear whether this variation results from a stochastic process or from selective constraints related to functional differences among HLA molecules. Although HLA alleles are generally treated as equidistant molecular units in population genetic studies, DNA sequence diversity among populations is also crucial to interpret the observed HLA polymorphism. In this study, we used a large dataset of 2,062 DNA sequences defined for the different HLA alleles to analyze nucleotide diversity of seven HLA genes in 23,500 individuals of about 200 populations spread worldwide. We first analyzed the HLA molecular structure and diversity of these populations in relation to geographic variation and we further investigated possible departures from selective neutrality through Tajima's tests and mismatch distributions. All results were compared to those obtained by classical approaches applied to HLA allele frequencies. Our study shows that the global patterns of HLA nucleotide diversity among populations are significantly correlated to geography, although in some specific cases the molecular information reveals unexpected genetic relationships. At all loci except HLA-DPB1, populations have accumulated a high proportion of very divergent alleles, suggesting an advantage of heterozygotes expressing molecularly distant HLA molecules (asymmetric overdominant selection model). However, both different intensities of selection and unequal levels of gene conversion may explain the heterogeneous mismatch distributions observed among the loci. Also, distinctive patterns of sequence divergence observed at the HLA-DPB1 locus suggest current neutrality but old selective pressures on this gene. 
We conclude that HLA DNA sequences advantageously complement HLA allele frequencies as a source of data used to explore the genetic history of human populations, and that their analysis allows a more thorough investigation of human MHC molecular evolution. PMID:21408106
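As an illustration of the neutrality test named in this record, the following Python sketch computes Tajima's D from a set of aligned sequences using the standard published formula. It is a minimal sketch under stated assumptions, not the software or settings used in the study: the function name `tajimas_d` and the toy alignments are hypothetical, gaps and ambiguous bases are not handled, and at least four sequences are required because the variance term is zero for smaller samples.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(seqs):
    """Tajima's D for equal-length aligned sequences; None if no site varies."""
    n = len(seqs)
    if n < 4 or len({len(s) for s in seqs}) != 1:
        raise ValueError("need at least 4 aligned sequences of equal length")

    # S: number of segregating (polymorphic) alignment columns.
    seg = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    if seg == 0:
        return None

    # pi: mean number of pairwise differences between sequences.
    pairs = list(combinations(seqs, 2))
    pi = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)

    # Constants of the standard formula (Tajima 1989).
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)

    return (pi - seg / a1) / sqrt(e1 * seg + e2 * seg * (seg - 1))


if __name__ == "__main__":
    # Made-up alignments: two divergent haplotypes at equal frequency,
    # then one rare haplotype carrying all the variation.
    print(tajimas_d(["AA", "AA", "TT", "TT"]))
    print(tajimas_d(["AA", "AA", "AA", "TT"]))
```

Positive values indicate an excess of intermediate-frequency variants, which is the direction expected when divergent alleles are maintained, whereas negative values indicate an excess of rare variants.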
Photoactivated bioconjugation between ortho-azidophenols and anilines: a facile approach to biomolecular photopatterning.

PubMed

El Muslemany, Kareem M; Twite, Amy A; ElSohly, Adel M; Obermeyer, Allie C; Mathies, Richard A; Francis, Matthew B

2014-09-10

Methods for the surface patterning of small molecules and biomolecules can yield useful platforms for drug screening, synthetic biology applications, diagnostics, and the immobilization of live cells. However, new techniques are needed to achieve the ease, feature sizes, reliability, and patterning speed necessary for widespread adoption. Herein, we report an easily accessible and operationally simple photoinitiated reaction that can achieve patterned bioconjugation in a highly chemoselective manner. The reaction involves the photolysis of 2-azidophenols to generate iminoquinone intermediates that couple rapidly to aniline groups. We demonstrate the broad functional group compatibility of this reaction for the modification of proteins, polymers, oligonucleotides, peptides, and small molecules. As a specific application, the reaction was adapted for the photolithographic patterning of azidophenol DNA on aniline glass substrates. The presence of the DNA was confirmed by the ability of the surface to capture living cells bearing the sequence complement on their cell walls or cytoplasmic membranes. Compared to other light-based DNA patterning methods, this reaction offers higher speed and does not require the use of a photoresist or other blocking material.
Evolutionary history of Wolbachia infections in the fire ant Solenopsis invicta

PubMed Central

Ahrens, Michael E; Shoemaker, Dewayne

2005-01-01

Background Wolbachia are endosymbiotic bacteria that commonly infect numerous arthropods. Despite their broad taxonomic distribution, the transmission patterns of these bacteria within and among host species are not well understood. We sequenced a portion of the wsp gene from the Wolbachia genome infecting 138 individuals from eleven geographically distributed native populations of the fire ant Solenopsis invicta. We then compared these wsp sequence data to patterns of mitochondrial DNA (mtDNA) variation of both infected and uninfected host individuals to infer the transmission patterns of Wolbachia in S. invicta. Results Three different Wolbachia (wsp) variants occur within S. invicta, all of which are identical to previously described strains in fire ants. A comparison of the distribution of Wolbachia variants within S. invicta to a phylogeny of mtDNA haplotypes suggests S. invicta has acquired Wolbachia infections on at least three independent occasions. One common Wolbachia variant in S. invicta (wSinvictaB) is associated with two divergent mtDNA haplotype clades. Further, within each of these clades, Wolbachia-infected and uninfected individuals possess virtually identical subsets of mtDNA haplotypes, including both putative derived and ancestral mtDNA haplotypes. The same pattern also holds for wSinvictaA, where at least one and as many as three invasions into S. invicta have occurred. These data suggest that the initial invasions of Wolbachia into host ant populations may be relatively ancient and have been followed by multiple secondary losses of Wolbachia in different infected lineages over time. Finally, our data also provide additional insights into the factors responsible for previously reported variation in Wolbachia prevalence among S. invicta populations. Conclusion The history of Wolbachia infections in S. invicta is rather complex and involves multiple invasions or horizontal transmission events of Wolbachia into this species. 
Although these Wolbachia infections apparently have been present for relatively long time periods, these data clearly indicate that Wolbachia infections frequently have been secondarily lost within different lineages. Importantly, the uncoupled transmission of the Wolbachia and mtDNA genomes suggests that the presumed effects of Wolbachia on mtDNA evolution within S. invicta are less severe than originally predicted. Thus, the common concern that use of mtDNA markers for studying the evolutionary history of insects is confounded by maternally inherited endosymbionts such as Wolbachia may be somewhat unwarranted in the case of S. invicta. PMID:15927071
Human T-cell leukemia virus type 1 Tax requires direct access to DNA for recruitment of CREB binding protein to the viral promoter.

PubMed

Lenzmeier, B A; Giebler, H A; Nyborg, J K

1998-02-01

Efficient human T-cell leukemia virus type 1 (HTLV-1) replication and viral gene expression are dependent upon the virally encoded oncoprotein Tax. To activate HTLV-1 transcription, Tax interacts with the cellular DNA binding protein cyclic AMP-responsive element binding protein (CREB) and recruits the coactivator CREB binding protein (CBP), forming a nucleoprotein complex on the three viral cyclic AMP-responsive elements (CREs) in the HTLV-1 promoter. Short stretches of dG-dC-rich (GC-rich) DNA, immediately flanking each of the viral CREs, are essential for Tax recruitment of CBP in vitro and Tax transactivation in vivo. Although the importance of the viral CRE-flanking sequences is well established, several studies have failed to identify an interaction between Tax and the DNA. The mechanistic role of the viral CRE-flanking sequences has therefore remained enigmatic. In this study, we used high resolution methidiumpropyl-EDTA iron(II) footprinting to show that Tax extended the CREB footprint into the GC-rich DNA flanking sequences of the viral CRE. The Tax-CREB footprint was enhanced but not extended by the KIX domain of CBP, suggesting that the coactivator increased the stability of the nucleoprotein complex. Conversely, the footprint pattern of CREB on a cellular CRE lacking GC-rich flanking sequences did not change in the presence of Tax or Tax plus KIX. The minor-groove DNA binding drug chromomycin A3 bound to the GC-rich flanking sequences and inhibited the association of Tax and the Tax-CBP complex without affecting CREB binding. Tax specifically cross-linked to the viral CRE in the 5'-flanking sequence, and this cross-link was blocked by chromomycin A3. Together, these data support a model where Tax interacts directly with both CREB and the minor-groove viral CRE-flanking sequences to form a high-affinity binding site for the recruitment of CBP to the HTLV-1 promoter.

Molecular cloning, sequence analysis and phylogeny of first caudata g-type lysozyme in axolotl (Ambystoma mexicanum).

PubMed

Yu, Haining; Gao, Jiuxiang; Lu, Yiling; Guang, Huijuan; Cai, Shasha; Zhang, Songyan; Wang, Yipeng

2013-11-01

Lysozymes are key proteins that play important roles in innate immune defense in many animal phyla by breaking down bacterial cell walls. In this study, we report the molecular cloning, sequence analysis and phylogeny of the first caudate amphibian g-lysozyme: a full-length spleen cDNA library from axolotl (Ambystoma mexicanum). A goose-type lysozyme (g-lysozyme) EST was identified and the full-length cDNA was obtained using RACE-PCR. The axolotl g-lysozyme sequence represents an open reading frame for a putative signal peptide and the mature protein composed of 184 amino acids. The calculated molecular mass and the theoretical isoelectric point (pI) of this mature protein are 21523.0 Da and 4.37, respectively. Expression of g-lysozyme mRNA is predominantly found in skin, with lower levels in spleen, liver, muscle, and lung. Phylogenetic analysis revealed that caudate amphibian g-lysozyme had a distinct evolutionary pattern, being juxtaposed not only with anuran amphibians but also with fish, birds and mammals. Although the first complete cDNA sequence for caudate amphibian g-lysozyme is reported in the present study, clones encoding axolotl's other functional immune molecules in the full-length cDNA library will have to be further sequenced to gain insight into the fundamental aspects of antibacterial mechanisms in caudates.
The recognition and modification sites for the bacterial type I restriction systems KpnAI, StySEAI, StySENI and StySGI

PubMed Central

Kasarjian, Julie K. A.; Hidaka, Masumi; Horiuchi, Takashi; Iida, Masatake; Ryu, Junichi

2004-01-01

Using an in vivo plasmid transformation method, we have determined the DNA sequences recognized by the KpnAI, StySEAI, StySENI and StySGI R-M systems from Klebsiella oxytoca strain M5a1, Salmonella eastbourne, Salmonella enteritidis and Salmonella gelsenkirchen, respectively. These type I restriction-modification systems were originally identified using traditional phage assay, and described here is the plasmid transformation test and computer program used to determine their DNA recognition sequences. For this test, we constructed two sets of plasmids, pL and pE, that contain phage lambda and Escherichia coli K-12 chromosomal DNA fragments, respectively. Further, using the methylation sensitivities of various known type II restriction enzymes, we identified the target adenines for methylation (listed in bold italics below as A or T in case of the complementary strand). The recognition sequence and methylation sites are GAA(6N)TGCC (KpnAI), ACA(6N)TYCA (StySEAI), CGA(6N)TACC (StySENI) and TAAC(7N)RTCG (StySGI). These DNA recognition sequences all have a typical type I bipartite pattern and represent three novel specificities and one isoschizomer (StySENI). For confirmation, oligonucleotides containing each of the predicted sequences were synthesized, cloned into plasmid pMECA and transformed into each strain, resulting in a large reduction in efficiency of transformation (EOT). PMID:15199175
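The bipartite recognition sequences listed in this record lend themselves to a short worked example. The Python sketch below converts each site, written as in the abstract, into a regular expression and scans a sequence for matches. It is not the computer program mentioned in the abstract, whose details are not given here: the helper names `site_to_regex` and `find_sites` and the toy sequence are hypothetical, and only the strand supplied is searched.

```python
import re

# IUPAC codes used in the four sites, mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}


def site_to_regex(site: str) -> str:
    """Convert a site such as 'GAA(6N)TGCC' into a regular expression."""
    parts = []
    for gap, base in re.findall(r"\((\d+)N\)|([A-Z])", site):
        # A '(6N)' token becomes a fixed-length spacer of any six bases.
        parts.append(f"[ACGT]{{{gap}}}" if gap else IUPAC[base])
    return "".join(parts)


def find_sites(site: str, dna: str) -> list[int]:
    """Return 0-based start positions of all matches, including overlaps."""
    pattern = re.compile(f"(?=({site_to_regex(site)}))")
    return [match.start() for match in pattern.finditer(dna.upper())]


# Recognition sequences as given in the abstract.
SITES = {
    "KpnAI": "GAA(6N)TGCC",
    "StySEAI": "ACA(6N)TYCA",
    "StySENI": "CGA(6N)TACC",
    "StySGI": "TAAC(7N)RTCG",
}

if __name__ == "__main__":
    toy = "TTGAAACGTACTGCCAA"  # made-up sequence containing one KpnAI site
    for name, site in SITES.items():
        print(name, find_sites(site, toy))
```

Because type I sites are asymmetric, a fuller search would also scan the reverse complement of the input; that step is omitted here to keep the sketch short.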
Successful enrichment and recovery of whole mitochondrial genomes from ancient human dental calculus.

PubMed

Ozga, Andrew T; Nieves-Colón, Maria A; Honap, Tanvi P; Sankaranarayanan, Krithivasan; Hofman, Courtney A; Milner, George R; Lewis, Cecil M; Stone, Anne C; Warinner, Christina

2016-06-01

Archaeological dental calculus is a rich source of host-associated biomolecules. Importantly, however, dental calculus is more accurately described as a calcified microbial biofilm than a host tissue. As such, concerns regarding destructive analysis of human remains may not apply as strongly to dental calculus, opening the possibility of obtaining human health and ancestry information from dental calculus in cases where destructive analysis of conventional skeletal remains is not permitted. Here we investigate the preservation of human mitochondrial DNA (mtDNA) in archaeological dental calculus and its potential for full mitochondrial genome (mitogenome) reconstruction in maternal lineage ancestry analysis. Extracted DNA from six individuals at the 700-year-old Norris Farms #36 cemetery in Illinois was enriched for mtDNA using in-solution capture techniques, followed by Illumina high-throughput sequencing. Full mitogenomes (7-34×) were successfully reconstructed from dental calculus for all six individuals, including three individuals who had previously tested negative for DNA preservation in bone using conventional PCR techniques. Mitochondrial haplogroup assignments were consistent with previously published findings, and additional comparative analysis of paired dental calculus and dentine from two individuals yielded equivalent haplotype results. All dental calculus samples exhibited damage patterns consistent with ancient DNA, and mitochondrial sequences were estimated to be 92-100% endogenous. DNA polymerase choice was found to impact error rates in downstream sequence analysis, but these effects can be mitigated by greater sequencing depth. Dental calculus is a viable alternative source of human DNA that can be used to reconstruct full mitogenomes from archaeological remains. Am J Phys Anthropol 160:220-228, 2016. © 2016 The Authors American Journal of Physical Anthropology Published by Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Successful enrichment and recovery of whole mitochondrial genomes from ancient human dental calculus

PubMed Central

Ozga, Andrew T.; Nieves‐Colón, Maria A.; Honap, Tanvi P.; Sankaranarayanan, Krithivasan; Hofman, Courtney A.; Milner, George R.; Lewis, Cecil M.; Stone, Anne C.

2016-01-01

ABSTRACT Objectives Archaeological dental calculus is a rich source of host‐associated biomolecules. Importantly, however, dental calculus is more accurately described as a calcified microbial biofilm than a host tissue. As such, concerns regarding destructive analysis of human remains may not apply as strongly to dental calculus, opening the possibility of obtaining human health and ancestry information from dental calculus in cases where destructive analysis of conventional skeletal remains is not permitted. Here we investigate the preservation of human mitochondrial DNA (mtDNA) in archaeological dental calculus and its potential for full mitochondrial genome (mitogenome) reconstruction in maternal lineage ancestry analysis. Materials and Methods Extracted DNA from six individuals at the 700‐year‐old Norris Farms #36 cemetery in Illinois was enriched for mtDNA using in‐solution capture techniques, followed by Illumina high‐throughput sequencing. Results Full mitogenomes (7–34×) were successfully reconstructed from dental calculus for all six individuals, including three individuals who had previously tested negative for DNA preservation in bone using conventional PCR techniques. Mitochondrial haplogroup assignments were consistent with previously published findings, and additional comparative analysis of paired dental calculus and dentine from two individuals yielded equivalent haplotype results. All dental calculus samples exhibited damage patterns consistent with ancient DNA, and mitochondrial sequences were estimated to be 92–100% endogenous. DNA polymerase choice was found to impact error rates in downstream sequence analysis, but these effects can be mitigated by greater sequencing depth. Discussion Dental calculus is a viable alternative source of human DNA that can be used to reconstruct full mitogenomes from archaeological remains. Am J Phys Anthropol 160:220–228, 2016. 
© 2016 The Authors American Journal of Physical Anthropology Published by Wiley Periodicals, Inc. PMID:26989998
The Epigenomic Landscape of Prokaryotes

DOE PAGES

Blow, Matthew J.; Clark, Tyson A.; Daum, Chris G.; ...

2016-02-12

DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities of 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.
The Epigenomic Landscape of Prokaryotes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Blow, Matthew J.; Clark, Tyson A.; Daum, Chris G.

DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities of 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.
Effects of DNA Methylation and Chromatin State on Rates of Molecular Evolution in Insects.

PubMed

Glastad, Karl M; Goodisman, Michael A D; Yi, Soojin V; Hunt, Brendan G

2015-12-04

Epigenetic information is widely appreciated for its role in gene regulation in eukaryotic organisms. However, epigenetic information can also influence genome evolution. Here, we investigate the effects of epigenetic information on gene sequence evolution in two disparate insects: the fly Drosophila melanogaster, which lacks substantial DNA methylation, and the ant Camponotus floridanus, which possesses a functional DNA methylation system. We found that DNA methylation was positively correlated with the synonymous substitution rate in C. floridanus, suggesting a key effect of DNA methylation on patterns of gene evolution. However, our data suggest the link between DNA methylation and elevated rates of synonymous substitution was explained, in large part, by the targeting of DNA methylation to genes with signatures of transcriptionally active chromatin, rather than the mutational effect of DNA methylation itself. This phenomenon may be explained by an elevated mutation rate for genes residing in transcriptionally active chromatin, or by increased structural constraints on genes in inactive chromatin. This result highlights the importance of chromatin structure as the primary epigenetic driver of genome evolution in insects. Overall, our study demonstrates how different epigenetic systems contribute to variation in the rates of coding sequence evolution. Copyright © 2016 Glastad et al.
A communal catalogue reveals Earth's multiscale microbial diversity.

PubMed

Thompson, Luke R; Sanders, Jon G; McDonald, Daniel; Amir, Amnon; Ladau, Joshua; Locey, Kenneth J; Prill, Robert J; Tripathi, Anupriya; Gibbons, Sean M; Ackermann, Gail; Navas-Molina, Jose A; Janssen, Stefan; Kopylova, Evguenia; Vázquez-Baeza, Yoshiki; González, Antonio; Morton, James T; Mirarab, Siavash; Zech Xu, Zhenjiang; Jiang, Lingjing; Haroon, Mohamed F; Kanbar, Jad; Zhu, Qiyun; Jin Song, Se; Kosciolek, Tomasz; Bokulich, Nicholas A; Lefler, Joshua; Brislawn, Colin J; Humphrey, Gregory; Owens, Sarah M; Hampton-Marcell, Jarrad; Berg-Lyons, Donna; McKenzie, Valerie; Fierer, Noah; Fuhrman, Jed A; Clauset, Aaron; Stevens, Rick L; Shade, Ashley; Pollard, Katherine S; Goodwin, Kelly D; Jansson, Janet K; Gilbert, Jack A; Knight, Rob

2017-11-23

Our growing awareness of the microbial world's importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth's microbial diversity.
Determining the Location of DNA Modification and Mutation Caused by UVB Light in Skin Cancer

DTIC Science & Technology

2013-09-01

we obtain cleavage patterns consistent with the administered UV dosage and that sequencing libraries generated for both yeast and human cells show...understanding the mutations they cause. Subject terms: UV DNA modification, HeLa cells, skin cancer...of mutations that are caused by UV light in cells and correlate them to modification frequencies. Understanding the initial chemical changes
Investigation of genetic divergence and polymorphism of nuclear DNA in species and populations of domestic and wild sheep

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mel'nikova, M.N.; Grechko, V.V.; Mednikov, B.M.

1995-08-01

Genetic divergence in repetitive sequences of nuclear DNA of wild and domestic sheep was studied by general restriction endonuclease mapping (i.e., the taxonoprint method). The PCR-RAPD method with one and two arbitrary primers was also used to analyze the nuclear DNA polymorphism in some other regions. The taxonoprint method, performed using six endonucleases, showed specificity and virtually complete similarity in the patterns of repetitive DNA sequences of two wild forms, argali and moufflon, and five domestic sheep breeds. Central Asian breeds, Kazakh fine-fleeced, karakul, ghissar, and eadeelbay, and an English breed, Lincoln, were examined. The results confirm the opinion that wild and domestic sheep may be considered one polytypic species. The PCR-RAPD method, with both one and two arbitrary primers, revealed that all the sheep breeds examined were more similar to argali than to moufflon. These results indicate that the domestication area of sheep was much broader than was earlier presumed. Alternatively, hybridization of domestic and wild forms could occasionally occur in the area of their coexistence. The amplification patterns of PCR-RAPD products are the most promising population genetic markers. 27 refs., 4 figs., 7 tabs.
Phylogeographical patterns of an alpine plant, Rhodiola dumulosa (Crassulaceae), inferred from chloroplast DNA sequences.

PubMed

Hou, Yan; Lou, Anru

2014-01-01

The phylogeographical patterns of Rhodiola dumulosa, an alpine plant species restrictedly growing in the crevices of rock piles, were investigated based on 4 fragments of the chloroplast genome. To cover the full distribution of R. dumulosa in China, 19 populations from 3 major disjunct distribution areas (northern, central, and northwestern China) were sampled. A total of 5881bp (after alignment) of chloroplast DNA (cpDNA) from 100 individuals were sequenced. The combined cpDNA data set yielded 36 haplotypes. The total genetic diversity of R. dumulosa was remarkably high (H(T) = 0.981). The interpopulation genetic differentiation was significantly large (F(ST) = 0.8537, P < 0.001), possibly due to the long-term isolation of the natural populations. N(ST) was significantly larger than G(ST) (P < 0.001), indicating the presence of phylogeographical structure among the R. dumulosa populations. We propose 2 migration steps to explain the current distribution of R. dumulosa in China. First, this species migrated from refugia in the Qinghai-Tibetan Plateau to northern areas via the intervening highlands when temperatures increased; second, the highland populations migrated toward the mountaintops when temperatures increased further because R. dumulosa is adapted to cold environments. During the second migration step, the common ancestral haplotypes may have been gradually lost.
A 21.7 kb DNA segment on the left arm of yeast chromosome XIV carries WHI3, GCR2, SPX18, SPX19, an homologue to the heat shock gene SSB1 and 8 new open reading frames of unknown function.

PubMed

Jonniaux, J L; Coster, F; Purnelle, B; Goffeau, A

1994-12-01

We report the amino acid sequence of 13 open reading frames (ORF > 299 bp) located on a 21.7 kb DNA segment from the left arm of chromosome XIV of Saccharomyces cerevisiae. Five open reading frames had been entirely or partially sequenced previously: WHI3, GCR2, SPX19, SPX18 and a heat shock gene similar to SSB1. The products of 8 other ORFs are new putative proteins among which N1394 is probably a membrane protein. N1346 contains a leucine zipper pattern and the corresponding ORF presents an HAP (global regulator of respiratory genes) upstream activating sequence in the promoting region. N1386 shares homologies with the DNA structure-specific recognition protein family SSRPs and the corresponding ORF is preceded by an MCB (MluI cell cycle box) upstream activating factor.
Molecular genetic analysis of ancient cattle bones excavated from archaeological sites in Jeju, Korea.

PubMed

Kim, Jae-Hwan; Oh, Ju-Hyung; Song, Ji-Hoon; Jeon, Jin-Tae; Han, Sang-Hyun; Jung, Yong-Hwan; Oh, Moon-You

2005-12-31

Ancient cattle bones were excavated from archaeological sites in Jeju, Korea. We used molecular genetic techniques to identify the species and establish its relationship to extant cattle breeds. Ancient DNA was extracted from four sources: a humerus (Gonae site, A.D. 700-800), two fragments of radius, and a tooth (Kwakji site, A.D. 0-900). The mitochondrial DNA (mtDNA) D-loop regions were cloned, sequenced, and compared with previously reported sequences of various cattle breeds (9 Asian, 8 European, and 3 African). The results revealed that these bones were of the breed, Bos taurus, and a phylogenetic tree indicated that the four cattle bones formed a monophyletic group with Jeju native black cattle. However, the patterns of sequence variation and reports from archaeological sites suggest that a few wild cattle, with a different maternal lineage, may have existed on Jeju Island. Our results will contribute to further studies of the origin of Jeju native cattle and the possible existence of local wild cattle.
Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population.

PubMed

Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao

2018-05-01

Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We analyzed 843,473 STR loci with two- to six-basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis-based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we also identified 218 suitable STR loci with four-basepair repeat units and 53 loci with five-basepair repeat units, for both short-read sequencing and PCR-based technologies, which would be candidates for actual forensic DNA typing in the Japanese population.
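As a rough illustration of what cataloging STR loci with two- to six-basepair repeat units involves, the following minimal sketch scans a sequence for perfect tandem repeats. The function name find_strs, the min_copies threshold, and the toy sequence are assumptions made for illustration; the study's own pipeline is not described in the abstract and will differ (for example, in handling imperfect repeats and read alignment).

```python
import re

def find_strs(seq, min_unit=2, max_unit=6, min_copies=3):
    """Return (start, unit, copies) for perfect tandem repeats in seq."""
    hits = []
    for unit_len in range(min_unit, max_unit + 1):
        # One unit of unit_len bases followed by at least (min_copies - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_copies - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are themselves repeats of a shorter unit (e.g. ATAT or AA).
            if any(unit == unit[:d] * (unit_len // d)
                   for d in range(1, unit_len) if unit_len % d == 0):
                continue
            hits.append((m.start(), unit, len(m.group(0)) // unit_len))
    return hits

# Hypothetical toy sequence containing four copies of GATA starting at index 2.
example_hits = find_strs("TTGATAGATAGATAGATACC")
```

Counting the copies of each unit per individual would then give the allele lengths that capillary electrophoresis measures indirectly through fragment size.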
TRX-LOGOS - a graphical tool to demonstrate DNA information content dependent upon backbone dynamics in addition to base sequence.

PubMed

Fortin, Connor H; Schulze, Katharina V; Babbitt, Gregory A

2015-01-01

It is now widely accepted that DNA sequences defining DNA-protein interactions functionally depend upon local biophysical features of the DNA backbone that are important in defining sites of binding interaction in the genome (e.g. DNA shape, charge and intrinsic dynamics). However, these physical features of the DNA polymer are not directly apparent when analyzing and viewing Shannon information content calculated at single nucleobases in a traditional sequence logo plot. Thus, sequence logo plots are severely limited in that they convey no explicit information regarding the structural dynamics of the DNA backbone, a feature often critical to binding specificity. We present TRX-LOGOS, an R software package and Perl wrapper code that interfaces with the JASPAR database for computational regulatory genomics. TRX-LOGOS extends the traditional sequence logo plot to include Shannon information content calculated with regard to the dinucleotide-based BI-BII conformation shifts in phosphate linkages on the DNA backbone, thereby adding a visual measure of intrinsic DNA flexibility that can be critical for many DNA-protein interactions. TRX-LOGOS is available as an R graphics module offered at both SourceForge and as a download supplement at this journal. To demonstrate the general utility of TRX logo plots, we first calculated the information content for 416 Saccharomyces cerevisiae transcription factor binding sites functionally confirmed in the Yeastract database and matched to previously published yeast genomic alignments. We discovered that flanking regions contain significantly more information content at phosphate linkages than can be observed at nucleobases. We also examined broader transcription factor classifications defined by the JASPAR database, and discovered that many general signatures of transcription factor binding are locally more information rich at the level of DNA backbone dynamics than nucleobase sequence.
We used TRX-LOGOS in combination with MEGA 6.0 software for molecular evolutionary genetics analysis to visually compare the human Forkhead box/FOX protein evolution to its binding site evolution. We also compared the DNA binding signatures of human TP53 tumor suppressor determined by two different laboratory methods (SELEX and ChIP-seq). Further analysis of the entire yeast genome, center aligned at the start codon, also revealed a distinct sequence-independent 3 bp periodic pattern in information content, present only in coding regions, and perhaps indicative of the non-random organization of the genetic code. TRX-LOGOS is useful in any situation in which important information content in DNA can be better visualized at the positions of phosphate linkages (i.e. dinucleotides) where the dynamic properties of the DNA backbone function to facilitate DNA-protein interaction.
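For orientation, the Shannon information content that a traditional sequence logo displays at each nucleobase position can be sketched as follows. This is the conventional per-column quantity (2 bits minus the column entropy, with no small-sample correction), not the TRX-LOGOS extension to phosphate linkages; the function name and the toy binding sites are assumptions for illustration.

```python
import math
from collections import Counter

def information_content(sites):
    """Per-column information (bits) for equal-length aligned DNA sites."""
    length = len(sites[0])
    bits = []
    for col in range(length):
        counts = Counter(site[col] for site in sites)
        total = sum(counts.values())
        # Shannon entropy of the base frequencies in this column.
        entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
        # Maximum entropy for four bases is log2(4) = 2 bits.
        bits.append(2.0 - entropy)
    return bits

# Hypothetical aligned sites: column 0 is fully conserved, column 3 is uniform.
example_bits = information_content(["ACGT", "ACGA", "ACCC", "AGTG"])
```

A fully conserved column yields 2 bits and a column with all four bases at equal frequency yields 0 bits, which is the scale on which logo letter heights are drawn.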
Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry.

PubMed

Asara, John M; Schweitzer, Mary H; Freimark, Lisa M; Phillips, Matthew; Cantley, Lewis C

2007-04-13

Fossilized bones from extinct taxa harbor the potential for obtaining protein or DNA sequences that could reveal evolutionary links to extant species. We used mass spectrometry to obtain protein sequences from bones of a 160,000- to 600,000-year-old extinct mastodon (Mammut americanum) and a 68-million-year-old dinosaur (Tyrannosaurus rex). The presence of T. rex sequences indicates that their peptide bonds were remarkably stable. Mass spectrometry can thus be used to determine unique sequences from ancient organisms from peptide fragmentation patterns, a valuable tool to study the evolution and adaptation of ancient taxa from which genomic sequences are unlikely to be obtained.
Nuclear and mitochondrial patterns of population structure in North Pacific false killer whales (Pseudorca crassidens).

PubMed

Martien, Karen K; Chivers, Susan J; Baird, Robin W; Archer, Frederick I; Gorgone, Antoinette M; Hancock-Hanser, Brittany L; Mattila, David; McSweeney, Daniel J; Oleson, Erin M; Palmer, Carol; Pease, Victoria L; Robertson, Kelly M; Schorr, Gregory S; Schultz, Mark B; Webster, Daniel L; Taylor, Barbara L

2014-01-01

False killer whales (Pseudorca crassidens) are large delphinids typically found in deep water far offshore. However, in the Hawaiian Archipelago, there are 2 resident island-associated populations of false killer whales, one in the waters around the main Hawaiian Islands (MHI) and one in the waters around the Northwestern Hawaiian Islands (NWHI). We use mitochondrial DNA (mtDNA) control region sequences and genotypes from 16 nuclear DNA (nucDNA) microsatellite loci from 206 individuals to examine levels of differentiation among the 2 island-associated populations and offshore animals from the central and eastern North Pacific. Both mtDNA and nucDNA exhibit highly significant differentiation between populations, confirming limited gene flow in both sexes. The mtDNA haplotypes exhibit a strong pattern of phylogeographic concordance, with island-associated populations sharing 3 closely related haplotypes not found elsewhere in the Pacific. However, nucDNA data suggest that NWHI animals are at least as differentiated from MHI animals as they are from offshore animals. The patterns of differentiation revealed by the 2 marker types suggest that the island-associated false killer whale populations likely share a common colonization history, but have limited contemporary gene flow. Published by Oxford University Press on behalf of the American Genetic Association 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.
SIPSim: A Modeling Toolkit to Predict Accuracy and Aid Design of DNA-SIP Experiments.

PubMed

Youngblut, Nicholas D; Barnett, Samuel E; Buckley, Daniel H

2018-01-01

DNA Stable isotope probing (DNA-SIP) is a powerful method that links identity to function within microbial communities. The combination of DNA-SIP with multiplexed high throughput DNA sequencing enables simultaneous mapping of in situ assimilation dynamics for thousands of microbial taxonomic units. Hence, high throughput sequencing enabled SIP has enormous potential to reveal patterns of carbon and nitrogen exchange within microbial food webs. There are several different methods for analyzing DNA-SIP data and despite the power of SIP experiments, it remains difficult to comprehensively evaluate method accuracy across a wide range of experimental parameters. We have developed a toolset (SIPSim) that simulates DNA-SIP data, and we use this toolset to systematically evaluate different methods for analyzing DNA-SIP data. Specifically, we employ SIPSim to evaluate the effects that key experimental parameters (e.g., level of isotopic enrichment, number of labeled taxa, relative abundance of labeled taxa, community richness, community evenness, and beta-diversity) have on the specificity, sensitivity, and balanced accuracy (defined as the product of specificity and sensitivity) of DNA-SIP analyses. Furthermore, SIPSim can predict analytical accuracy and power as a function of experimental design and community characteristics, and thus should be of great use in the design and interpretation of DNA-SIP experiments.
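The evaluation metrics named above can be computed from a confusion matrix of taxa called labeled or unlabeled, as in this minimal sketch. The function name and the counts are hypothetical. The abstract defines balanced accuracy as the product of specificity and sensitivity; the arithmetic mean, a more common convention, is returned alongside it for comparison.

```python
def classification_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity and two balanced-accuracy conventions."""
    sensitivity = tp / (tp + fn)   # fraction of truly labeled taxa that were detected
    specificity = tn / (tn + fp)   # fraction of unlabeled taxa correctly left uncalled
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        # Definition stated in the abstract above.
        "balanced_accuracy_product": sensitivity * specificity,
        # Common alternative definition, shown only for comparison.
        "balanced_accuracy_mean": (sensitivity + specificity) / 2,
    }

# Hypothetical counts: 10 labeled taxa (8 detected), 100 unlabeled taxa (10 false calls).
example_metrics = classification_metrics(tp=8, fp=10, tn=90, fn=2)
```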
SIPSim: A Modeling Toolkit to Predict Accuracy and Aid Design of DNA-SIP Experiments

PubMed Central

Youngblut, Nicholas D.; Barnett, Samuel E.; Buckley, Daniel H.

2018-01-01

DNA Stable isotope probing (DNA-SIP) is a powerful method that links identity to function within microbial communities. The combination of DNA-SIP with multiplexed high throughput DNA sequencing enables simultaneous mapping of in situ assimilation dynamics for thousands of microbial taxonomic units. Hence, high throughput sequencing enabled SIP has enormous potential to reveal patterns of carbon and nitrogen exchange within microbial food webs. There are several different methods for analyzing DNA-SIP data and despite the power of SIP experiments, it remains difficult to comprehensively evaluate method accuracy across a wide range of experimental parameters. We have developed a toolset (SIPSim) that simulates DNA-SIP data, and we use this toolset to systematically evaluate different methods for analyzing DNA-SIP data. Specifically, we employ SIPSim to evaluate the effects that key experimental parameters (e.g., level of isotopic enrichment, number of labeled taxa, relative abundance of labeled taxa, community richness, community evenness, and beta-diversity) have on the specificity, sensitivity, and balanced accuracy (defined as the product of specificity and sensitivity) of DNA-SIP analyses. Furthermore, SIPSim can predict analytical accuracy and power as a function of experimental design and community characteristics, and thus should be of great use in the design and interpretation of DNA-SIP experiments. PMID:29643843
Applying Agrep to r-NSA to solve multiple sequences approximate matching.

PubMed

Ni, Bing; Wong, Man-Hon; Lam, Chi-Fai David; Leung, Kwong-Sak

2014-01-01

This paper addresses the approximate matching problem in a database consisting of multiple DNA sequences, where the proposed approach applies Agrep to a new truncated suffix array, r-NSA. The construction time of the structure is linear to the database size, and the computations of indexing a substring in the structure are constant. The number of characters processed in applying Agrep is analysed theoretically, and the theoretical upper-bound can approximate closely the empirical number of characters, which is obtained through enumerating the characters in the actual structure built. Experiments are carried out using (synthetic) random DNA sequences, as well as (real) genome sequences including Hepatitis-B Virus and X-chromosome. Experimental results show that, compared to the straight-forward approach that applies Agrep to multiple sequences individually, the proposed approach solves the matching problem in much shorter time. The speed-up of our approach depends on the sequence patterns, and for highly similar homologous genome sequences, which are the common cases in real-life genomes, it can be up to several orders of magnitude.
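To make the approximate matching problem concrete, here is a minimal sketch of the straightforward baseline: scanning a single sequence for substrings within a given edit distance of the pattern. It uses the classic dynamic-programming formulation (Sellers' algorithm) rather than Agrep's bit-parallel method or the r-NSA index, neither of which is reproduced here; the function name and test strings are assumptions for illustration.

```python
def approximate_matches(pattern, text, max_errors):
    """Return end positions (exclusive) in text where some substring
    matches pattern with edit distance <= max_errors."""
    m = len(pattern)
    # column[i] holds the minimum edit distance between pattern[:i]
    # and any substring of text ending at the current text position.
    column = list(range(m + 1))
    ends = []
    for j, ch in enumerate(text, start=1):
        prev_diag = column[0]      # value of column[i-1] at the previous text position
        column[0] = 0              # a match may start at any text position
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == ch else 1
            new_val = min(prev_diag + cost,   # match or substitution
                          column[i] + 1,      # extra character in text
                          column[i - 1] + 1)  # missing pattern character
            prev_diag = column[i]
            column[i] = new_val
        if column[m] <= max_errors:
            ends.append(j)
    return ends

# Hypothetical example: exact occurrence of ACGT ending at position 6.
example_ends = approximate_matches("ACGT", "TTACGTTT", 0)
```

This baseline processes every character of every sequence, which is the cost that indexing shared substrings across highly similar sequences is meant to avoid.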

Population dynamics coded in DNA: genetic traces of the expansion of modern humans

NASA Astrophysics Data System (ADS)

Kimmel, Marek

1999-12-01

It has been proposed that modern humans evolved from a small ancestral population, which appeared several hundred thousand years ago in Africa. Descendants of the founder group migrated to Europe and then to Asia, not mixing with the pre-existing local populations but replacing them. Two demographic elements are present in this “out of Africa” hypothesis: numerical growth of the modern humans and their migration into Eurasia. Did these processes leave an imprint in our DNA? To address this question, we use the classical Fisher-Wright-Moran model of population genetics, assuming variable population size and two models of mutation: the infinite-sites model and the stepwise-mutation model. We use coalescence theory, which amounts to tracing the common ancestors of contemporary genes. We obtain mathematical formulae expressing the distribution of alleles given the changes of population size over time. In the framework of the infinite-sites model, simulations indicate that the pattern of past population size change leaves its signature on the pattern of DNA polymorphism. Application of the theory to the published mitochondrial DNA sequences indicates that the current mitochondrial DNA sequence variation is not inconsistent with the logistic growth of the modern human population. In the framework of the stepwise-mutation model, we demonstrate that a population bottleneck followed by growth in size causes an imbalance between allele-size variance and heterozygosity. We analyze a set of data on tetranucleotide repeats which reveals the existence of this imbalance. The pattern of imbalance is consistent with the bottleneck being most ancient in Africans, most recent in Asians and intermediate in Europeans. These findings are consistent with the “out of Africa” hypothesis, although by no means do they constitute its proof.
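One way to quantify the imbalance between allele-size variance and heterozygosity is to estimate the composite parameter θ = 4Nμ twice, once from each statistic, and compare the results. The sketch below assumes the standard equilibrium results for the single-step symmetric stepwise-mutation model, E[V] = θ/2 and homozygosity ≈ (1 + 2θ)^(-1/2); these formulas, the function names, and the toy repeat counts are assumptions for illustration and are not taken from the abstract.

```python
from collections import Counter

def theta_from_variance(allele_sizes):
    """Estimate theta from allele-size variance, assuming E[V] = theta / 2."""
    n = len(allele_sizes)
    mean = sum(allele_sizes) / n
    variance = sum((x - mean) ** 2 for x in allele_sizes) / (n - 1)
    return 2.0 * variance

def theta_from_homozygosity(allele_sizes):
    """Estimate theta from homozygosity P0, assuming P0 = (1 + 2*theta) ** -0.5."""
    n = len(allele_sizes)
    counts = Counter(allele_sizes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    p0 = (n * sum_sq - 1) / (n - 1)   # sample-size-corrected homozygosity
    if p0 <= 0:
        raise ValueError("homozygosity estimate is not positive; sample too diverse")
    return (1.0 / p0 ** 2 - 1.0) / 2.0

def imbalance_ratio(allele_sizes):
    """Ratio of the two estimates; close to 1 at mutation-drift equilibrium."""
    return theta_from_variance(allele_sizes) / theta_from_homozygosity(allele_sizes)

# Hypothetical repeat counts at one tetranucleotide locus.
example_ratio = imbalance_ratio([10, 10, 11, 11, 12, 12, 10, 11])
```

Under these assumptions, departures of the ratio from 1 indicate that the population is away from equilibrium, which is the kind of signal the abstract attributes to a bottleneck followed by growth.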
Wolbachia and DNA barcoding insects: patterns, potential, and problems.

PubMed

Smith, M Alex; Bertrand, Claudia; Crosby, Kate; Eveleigh, Eldon S; Fernandez-Triana, Jose; Fisher, Brian L; Gibbs, Jason; Hajibabaei, Mehrdad; Hallwachs, Winnie; Hind, Katharine; Hrcek, Jan; Huang, Da-Wei; Janda, Milan; Janzen, Daniel H; Li, Yanwei; Miller, Scott E; Packer, Laurence; Quicke, Donald; Ratnasingham, Sujeevan; Rodriguez, Josephine; Rougerie, Rodolphe; Shaw, Mark R; Sheffield, Cory; Stahlhut, Julie K; Steinke, Dirk; Whitfield, James; Wood, Monty; Zhou, Xin

2012-01-01

Wolbachia is a genus of bacterial endosymbionts that impacts the breeding systems of their hosts. Wolbachia can confuse the patterns of mitochondrial variation, including DNA barcodes, because it influences the pathways through which mitochondria are inherited. We examined the extent to which these endosymbionts are detected in routine DNA barcoding, assessed their impact upon the insect sequence divergence and identification accuracy, and considered the variation present in Wolbachia COI. Using both standard PCR assays (Wolbachia surface coding protein, wsp) and bacterial COI fragments, we found evidence of Wolbachia in insect total genomic extracts created for DNA barcoding library construction. When >2 million insect COI trace files were examined on the Barcode of Life Datasystem (BOLD), Wolbachia COI was present in 0.16% of the cases. It is possible to generate Wolbachia COI using standard insect primers; however, that amplicon was never confused with the COI of the host. Wolbachia alleles recovered were predominantly Supergroup A and were broadly distributed geographically and phylogenetically. We conclude that the presence of the Wolbachia DNA in total genomic extracts made from insects is unlikely to compromise the accuracy of the DNA barcode library; in fact, the ability to query this DNA library (the database and the extracts) for endosymbionts is one of the ancillary benefits of such a large scale endeavor, of which we provide several examples. It is our conclusion that regular assays for Wolbachia presence and type can, and should, be adopted by large scale insect barcoding initiatives. While COI is one of the five multi-locus sequence typing (MLST) genes used for categorizing Wolbachia, there is limited overlap with the eukaryotic DNA barcode region.
Within-genome evolution of REPINs: a new family of miniature mobile DNA in bacteria.

PubMed

Bertels, Frederic; Rainey, Paul B

2011-06-01

Repetitive sequences are a conserved feature of many bacterial genomes. While first reported almost thirty years ago, and frequently exploited for genotyping purposes, little is known about their origin, maintenance, or processes affecting the dynamics of within-genome evolution. Here, beginning with analysis of the diversity and abundance of short oligonucleotide sequences in the genome of Pseudomonas fluorescens SBW25, we show that over-represented short sequences define three distinct groups (GI, GII, and GIII) of repetitive extragenic palindromic (REP) sequences. Patterns of REP distribution suggest that closely linked REP sequences form a functional replicative unit: REP doublets are over-represented, randomly distributed in extragenic space, and more highly conserved than singlets. In addition, doublets are organized as inverted repeats, which together with intervening spacer sequences are predicted to form hairpin structures in ssDNA or mRNA. We refer to these newly defined entities as REPINs (REP doublets forming hairpins) and identify short reads from population sequencing that reveal putative transposition intermediates. The proximal relationship between GI, GII, and GIII REPINs and specific REP-associated tyrosine transposases (RAYTs), combined with features of the putative transposition intermediate, suggests a mechanism for within-genome dissemination. Analysis of the distribution of REPs in a range of RAYT-containing bacterial genomes, including Escherichia coli K-12 and Nostoc punctiforme, shows that REPINs are a widely distributed, but hitherto unrecognized, family of miniature non-autonomous mobile DNA.
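The starting point described above, finding over-represented short oligonucleotides, can be sketched with simple k-mer counting. The version below compares observed counts with the count expected under uniform base composition, which is a deliberate simplification; the function names, the threshold, and the toy sequence are assumptions for illustration, and the study's actual null model is not given in the abstract. A reverse-complement helper is included because inverted repeats pair a sequence with its reverse complement.

```python
from collections import Counter

def kmer_counts(seq, k):
    """Count every overlapping k-mer in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))

def reverse_complement(seq):
    """Reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def overrepresented(seq, k, min_ratio):
    """k-mers whose observed count is at least min_ratio times the count
    expected if all 4**k k-mers were equally likely (a simplifying assumption)."""
    counts = kmer_counts(seq, k)
    expected = (len(seq) - k + 1) / 4 ** k
    return {kmer: c for kmer, c in counts.items() if c / expected >= min_ratio}

# Hypothetical toy sequence in which ACG occurs three times.
example_hits = overrepresented("ACGACGACGT", 3, 20)
```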
Holes influence the mutation spectrum of human mitochondrial DNA

NASA Astrophysics Data System (ADS)

Villagran, Martha; Miller, John

Mutations drive evolution and disease, showing highly non-random patterns of variant frequency vs. nucleotide position. We use computational DNA hole spectroscopy [M.Y. Suarez-Villagran & J.H. Miller, Sci. Rep. 5, 13571 (2015)] to reveal sites of enhanced hole probability in selected regions of human mitochondrial DNA. A hole is a mobile site of positive charge created when an electron is removed, for example by radiation or contact with a mutagenic agent. The hole spectra are quantum mechanically computed using a two-stranded tight binding model of DNA. We observe significant correlation between spectra of hole probabilities and of genetic variation frequencies from the MITOMAP database. These results suggest that hole-enhanced mutation mechanisms exert a substantial, perhaps dominant, influence on mutation patterns in DNA. One example is where a trapped hole induces a hydrogen bond shift, known as tautomerization, which then triggers a base-pair mismatch during replication. Our results deepen overall understanding of sequence specific mutation rates, encompassing both hotspots and cold spots, which drive molecular evolution.
Spliced RNA of woodchuck hepatitis virus.

PubMed

Ogston, C W; Razman, D G

1992-07-01

Polymerase chain reaction was used to investigate RNA splicing in liver of woodchucks infected with woodchuck hepatitis virus (WHV). Two spliced species were detected, and the splice junctions were sequenced. The larger spliced RNA has an intron of 1300 nucleotides, and the smaller spliced sequence shows an additional downstream intron of 1104 nucleotides. We did not detect singly spliced sequences from which the smaller intron alone was removed. Control experiments showed that spliced sequences are present in both RNA and DNA in infected liver, showing that the viral reverse transcriptase can use spliced RNA as template. Spliced sequences were detected also in virion DNA prepared from serum. The upstream intron produces a reading frame that fuses the core to the polymerase polypeptide, while the downstream intron causes an inframe deletion in the polymerase open reading frame. Whereas the splicing patterns in WHV are superficially similar to those reported recently in hepatitis B virus, we detected no obvious homology in the coding capacity of spliced RNAs from these two viruses.
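The reading-frame consequences reported above follow from simple arithmetic on the intron lengths: removing a number of nucleotides divisible by three preserves the downstream frame, while any other length shifts it. The sketch below checks the two lengths given in the abstract; the function name is hypothetical, and the check assumes intron removal is the only change to the transcript.

```python
def frame_effect(intron_length):
    """Classify the effect of removing an intron on the downstream reading frame."""
    remainder = intron_length % 3
    if remainder == 0:
        return "in-frame deletion"
    return f"frame shifted by {remainder} nt"

# Intron lengths reported in the abstract.
upstream_effect = frame_effect(1300)    # 1300 = 3 * 433 + 1
downstream_effect = frame_effect(1104)  # 1104 = 3 * 368
```

The one-nucleotide shift for the 1300-nucleotide intron is consistent with sequences from two different reading frames (core and polymerase) being joined, and the zero remainder for the 1104-nucleotide intron is consistent with an in-frame deletion in the polymerase open reading frame.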
Chromosome rearrangements via template switching between diverged repeated sequences

PubMed Central

Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.

2014-01-01

Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035
The Targeted Sequencing of Alpha Satellite DNA in Cercopithecus pogonias Provides New Insight into the Diversity and Dynamics of Centromeric Repeats in Old World monkeys.

PubMed

Cacheux, Lauriane; Ponger, Loïc; Gerbault-Seureau, Michèle; Loll, François; Gey, Delphine; Richard, Florence Anne; Escudé, Christophe

2018-06-01

Alpha satellite is the major repeated DNA element of primate centromeres. Specific evolutionary mechanisms have led to a great diversity of sequence families with peculiar genomic organization and distribution, which have until now been studied mostly in great apes. Using high throughput sequencing of alpha satellite monomers obtained by enzymatic digestion followed by computational and cytogenetic analysis, we compare here the diversity and genomic distribution of alpha satellite DNA in two related Old World monkey species, Cercopithecus pogonias and Cercopithecus solatus, which are known to have diverged about seven million years ago. Two main families of monomers, called C1 and C2, are found in both species. A detailed analysis of our datasets revealed the existence of numerous subfamilies within the centromeric C1 family. Although the most abundant subfamily is conserved between both species, our FISH experiments clearly show that some subfamilies are specific for each species and that their distribution is restricted to a subset of chromosomes, thereby pointing to the existence of recurrent amplification/homogenization events. The pericentromeric C2 family is very abundant on the short arm of all acrocentric chromosomes in both species, pointing to specific mechanisms that lead to this distribution. Results obtained using two different restriction enzymes are fully consistent with a predominant monomeric organization of alpha satellite DNA which coexists with higher order organization patterns in the Cercopithecus pogonias genome. Our study suggests highly dynamic alpha satellite DNA in Cercopithecini, with recurrent appearance of new sequence variants and interchromosomal sequence transfer.
Genetic signs of multiple colonization events in Baltic ciscoes with radiation into sympatric spring- and autumn-spawners confined to early postglacial arrival

PubMed Central

Delling, Bo; Palm, Stefan; Palkopoulou, Eleftheria; Prestegaard, Tore

2014-01-01

Presence of sympatric populations may reflect local diversification or secondary contact of already distinct forms. The Baltic cisco (Coregonus albula) normally spawns in late autumn, but in a few lakes in Northern Europe sympatric autumn and spring- or winter-spawners have been described. So far, the evolutionary relationships and taxonomic status of these main life history forms have remained largely unclear. With microsatellites and mtDNA sequences, we analyzed extant and extinct spring- and autumn-spawners from a total of 23 Swedish localities, including sympatric populations. Published sequences from Baltic ciscoes in Germany and Finland, and Coregonus sardinella from North America were also included together with novel mtDNA sequences from Siberian C. sardinella. A clear genetic structure within Sweden was found that included two population assemblages markedly differentiated at microsatellites and apparently fixed for mtDNA haplotypes from two distinct clades. All sympatric Swedish populations belonged to the same assemblage, suggesting parallel evolution of spring-spawning rather than secondary contact. The pattern observed further suggests that postglacial immigration to Northern Europe occurred from at least two different refugia. Previous results showing that mtDNA in Baltic cisco is paraphyletic with respect to North American C. sardinella were confirmed. However, the inclusion of Siberian C. sardinella revealed a more complicated pattern, as these novel haplotypes were found within one of the two main C. albula clades and were clearly distinct from those in North American C. sardinella. The evolutionary history of Northern Hemisphere ciscoes thus seems to be more complex than previously recognized. PMID:25540695
Genetic signs of multiple colonization events in Baltic ciscoes with radiation into sympatric spring- and autumn-spawners confined to early postglacial arrival.

PubMed

Delling, Bo; Palm, Stefan; Palkopoulou, Eleftheria; Prestegaard, Tore

2014-11-01

Presence of sympatric populations may reflect local diversification or secondary contact of already distinct forms. The Baltic cisco (Coregonus albula) normally spawns in late autumn, but in a few lakes in Northern Europe sympatric autumn and spring- or winter-spawners have been described. So far, the evolutionary relationships and taxonomic status of these main life history forms have remained largely unclear. With microsatellites and mtDNA sequences, we analyzed extant and extinct spring- and autumn-spawners from a total of 23 Swedish localities, including sympatric populations. Published sequences from Baltic ciscoes in Germany and Finland, and Coregonus sardinella from North America were also included together with novel mtDNA sequences from Siberian C. sardinella. A clear genetic structure within Sweden was found that included two population assemblages markedly differentiated at microsatellites and apparently fixed for mtDNA haplotypes from two distinct clades. All sympatric Swedish populations belonged to the same assemblage, suggesting parallel evolution of spring-spawning rather than secondary contact. The pattern observed further suggests that postglacial immigration to Northern Europe occurred from at least two different refugia. Previous results showing that mtDNA in Baltic cisco is paraphyletic with respect to North American C. sardinella were confirmed. However, the inclusion of Siberian C. sardinella revealed a more complicated pattern, as these novel haplotypes were found within one of the two main C. albula clades and were clearly distinct from those in North American C. sardinella. The evolutionary history of Northern Hemisphere ciscoes thus seems to be more complex than previously recognized.
Evaluation of Two Highly-Multiplexed Custom Panels for Massively Parallel Semiconductor Sequencing on Paraffin DNA

PubMed Central

Kotoula, Vassiliki; Lyberopoulou, Aggeliki; Papadopoulou, Kyriaki; Charalambous, Elpida; Alexopoulou, Zoi; Gakou, Chryssa; Lakis, Sotiris; Tsolaki, Eleftheria; Lilakos, Konstantinos; Fountzilas, George

2015-01-01

Background—Aim Massively parallel sequencing (MPS) holds promise for expanding cancer translational research and diagnostics. As yet, it has been applied on paraffin DNA (FFPE) with commercially available highly multiplexed gene panels (100s of DNA targets), while custom panels of low multiplexing are used for re-sequencing. Here, we evaluated the performance of two highly multiplexed custom panels on FFPE DNA. Methods Two custom multiplex amplification panels (B, 373 amplicons; T, 286 amplicons) were coupled with semiconductor sequencing on DNA samples from FFPE breast tumors and matched peripheral blood samples (n samples: 316; n libraries: 332). The two panels shared 37% DNA targets (common or shifted amplicons). Panel performance was evaluated in paired sample groups and quartets of libraries, where possible. Results Amplicon read ratios yielded similar patterns per gene with the same panel in FFPE and blood samples; however, performance of common amplicons differed between panels (p<0.001). FFPE genotypes were compared for 1267 coding and non-coding variant replicates, 999 out of which (78.8%) were concordant in different paired sample combinations. Variant frequency was highly reproducible (Spearman’s rho 0.959). Repeatedly discordant variants were of high coverage / low frequency (p<0.001). Genotype concordance was (a) high, for intra-run duplicates with the same panel (mean±SD: 97.2±4.7, 95%CI: 94.8–99.7, p<0.001); (b) modest, when the same DNA was analyzed with different panels (mean±SD: 81.1±20.3, 95%CI: 66.1–95.1, p = 0.004); and (c) low, when different DNA samples from the same tumor were compared with the same panel (mean±SD: 59.9±24.0; 95%CI: 43.3–76.5; p = 0.282). Low coverage / low frequency variants were validated with Sanger sequencing even in samples with unfavourable DNA quality. Conclusions Custom MPS may yield novel information on genomic alterations, provided that data evaluation is adjusted to tumor tissue FFPE DNA. To this scope, eligibility of all amplicons along with variant coverage and frequency need to be assessed. PMID:26039550
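The overall concordance figure in this abstract (999 of 1267 replicates, 78.8%) can be checked with a few lines of Python. The sketch below is illustrative only: the function name `concordance_rate` and the genotype strings are assumptions for the example, not part of the authors' pipeline.

```python
def concordance_rate(calls_a, calls_b):
    """Fraction of positions where two replicate genotype call lists agree.

    Hypothetical helper: the abstract does not describe how calls were stored.
    """
    if len(calls_a) != len(calls_b) or not calls_a:
        raise ValueError("replicates must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(calls_a, calls_b))
    return matches / len(calls_a)


# Figures quoted in the abstract: 999 concordant out of 1267 variant replicates.
overall = 999 / 1267
print(f"{overall:.1%}")
```

Applied to two made-up replicate call lists that differ at one of four positions, `concordance_rate` would return 0.75.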
Automated hybridization/imaging device for fluorescent multiplex DNA sequencing

DOEpatents

Weiss, R.B.; Kimball, A.W.; Gesteland, R.F.; Ferguson, F.M.; Dunn, D.M.; Di Sera, L.J.; Cherry, J.L.

1995-11-28

A method is disclosed for automated multiplex sequencing of DNA with an integrated automated imaging hybridization chamber system. This system comprises an hybridization chamber device for mounting a membrane containing size-fractionated multiplex sequencing reaction products, apparatus for fluid delivery to the chamber device, imaging apparatus for light delivery to the membrane and image recording of fluorescence emanating from the membrane while in the chamber device, and programmable controller apparatus for controlling operation of the system. The multiplex reaction products are hybridized with a probe, then an enzyme (such as alkaline phosphatase) is bound to a binding moiety on the probe, and a fluorogenic substrate (such as a benzothiazole derivative) is introduced into the chamber device by the fluid delivery apparatus. The enzyme converts the fluorogenic substrate into a fluorescent product which, when illuminated in the chamber device with a beam of light from the imaging apparatus, excites fluorescence of the fluorescent product to produce a pattern of hybridization. The pattern of hybridization is imaged by a CCD camera component of the imaging apparatus to obtain a series of digital signals. These signals are converted by the controller apparatus into a string of nucleotides corresponding to the nucleotide sequence an automated sequence reader. The method and apparatus are also applicable to other membrane-based applications such as colony and plaque hybridization and Southern, Northern, and Western blots. 9 figs.
Automated hybridization/imaging device for fluorescent multiplex DNA sequencing

DOEpatents

Weiss, Robert B.; Kimball, Alvin W.; Gesteland, Raymond F.; Ferguson, F. Mark; Dunn, Diane M.; Di Sera, Leonard J.; Cherry, Joshua L.

1995-01-01

A method is disclosed for automated multiplex sequencing of DNA with an integrated automated imaging hybridization chamber system. This system comprises an hybridization chamber device for mounting a membrane containing size-fractionated multiplex sequencing reaction products, apparatus for fluid delivery to the chamber device, imaging apparatus for light delivery to the membrane and image recording of fluorescence emanating from the membrane while in the chamber device, and programmable controller apparatus for controlling operation of the system. The multiplex reaction products are hybridized with a probe, then an enzyme (such as alkaline phosphatase) is bound to a binding moiety on the probe, and a fluorogenic substrate (such as a benzothiazole derivative) is introduced into the chamber device by the fluid delivery apparatus. The enzyme converts the fluorogenic substrate into a fluorescent product which, when illuminated in the chamber device with a beam of light from the imaging apparatus, excites fluorescence of the fluorescent product to produce a pattern of hybridization. The pattern of hybridization is imaged by a CCD camera component of the imaging apparatus to obtain a series of digital signals. These signals are converted by the controller apparatus into a string of nucleotides corresponding to the nucleotide sequence an automated sequence reader. The method and apparatus are also applicable to other membrane-based applications such as colony and plaque hybridization and Southern, Northern, and Western blots.
A New Phylogeographic Pattern of Endemic Bufo bankorensis in Taiwan Island Is Attributed to the Genetic Variation of Populations

PubMed Central

Yu, Teng-Lang; Lin, Hung-Du; Weng, Ching-Feng

2014-01-01

Aim To comprehend the phylogeographic patterns of genetic variation in anurans at Taiwan Island, this study attempted to examine (1) the existence of various geological barriers (Central Mountain Ranges, CMRs); and (2) the genetic variation of Bufo bankorensis using mtDNA sequences among populations located in different regions of Taiwan, characterized by different climates and existing under extreme conditions when compared with available sequences of related species B. gargarizans of mainland China. Methodology/Principal Findings Phylogenetic analyses of the dataset with mitochondrial DNA (mtDNA) D-loop gene (348 bp) recovered a close relationship between B. bankorensis and B. gargarizans, identified three distinct lineages. Furthermore, the network of mtDNA D-loop gene (564 bp) amplified (279 individuals, 27 localities) from Taiwan Island indicated three divergent clades within B. bankorensis (Clade W, E and S), corresponding to the geography, thereby verifying the importance of the CMRs and Kaoping River drainage as major biogeographic barriers. Mismatch distribution analysis, neutrality tests and Bayesian skyline plots revealed that a significant population expansion occurred for the total population and Clade W, with horizons dated to approximately 0.08 and 0.07 Mya, respectively. These results suggest that the population expansion of Taiwan Island species B. bankorensis might have resulted from the release of available habitat in post-glacial periods, the genetic variation on mtDNA showing habitat selection, subsequent population dispersal, and co-distribution among clades. Conclusions The multiple origins (different clades) of B. bankorensis mtDNA sequences were first evident in this study. The divergent genetic clades found within B. bankorensis could be independent colonization by previously diverged lineages; inferring B. bankorensis originated from B. gargarizans of mainland China, then dispersal followed by isolation within Taiwan Island. Highly divergent clades between W and E of B. bankorensis imply that the CMRs serve as a genetic barrier and separated the whole island into the western and eastern phylogroups. PMID:24853679
[Applications of DNA methylation markers in forensic medicine].

PubMed

Zhao, Gui-sen; Yang, Qing-en

2005-02-01

DNA methylation is a post-replication modification that is predominantly found in cytosines of the dinucleotide sequence CpG. Epigenetic information is stored in the distribution of the modified base 5-methylcytosine. DNA methylation profiles represent a more chemically and biologically stable source of molecular diagnostic information than RNA or most proteins. Recent advances attest to the great promise of DNA methylation markers as powerful future tools in the clinic. In the past decade, DNA methylation analysis has been revolutionized by two technological advances--bisulphite modification of DNA and methylation-specific polymerase chain reaction (MSP). The methylation pattern of human genome is space-time specific, sex-specific, parent-of-origin specific and disease specific, providing us an alternative way to solve forensic problems.
Of mice and (Viking?) men: phylogeography of British and Irish house mice.

PubMed

Searle, Jeremy B; Jones, Catherine S; Gündüz, Islam; Scascitelli, Moira; Jones, Eleanor P; Herman, Jeremy S; Rambau, R Victor; Noble, Leslie R; Berry, R J; Giménez, Mabel D; Jóhannesdóttir, Fríoa

2009-01-22

The west European subspecies of house mouse (Mus musculus domesticus) has gained much of its current widespread distribution through commensalism with humans. This means that the phylogeography of M. m. domesticus should reflect patterns of human movements. We studied restriction fragment length polymorphism (RFLP) and DNA sequence variations in mouse mitochondrial (mt) DNA throughout the British Isles (328 mice from 105 localities, including previously published data). There is a major mtDNA lineage revealed by both RFLP and sequence analyses, which is restricted to the northern and western peripheries of the British Isles, and also occurs in Norway. This distribution of the 'Orkney' lineage fits well with the sphere of influence of the Norwegian Vikings and was probably generated through inadvertent transport by them. To form viable populations, house mice would have required large human settlements such as the Norwegian Vikings founded. The other parts of the British Isles (essentially most of mainland Britain) are characterized by house mice with different mtDNA sequences, some of which are also found in Germany, and which probably reflect both Iron Age movements of people and mice and earlier development of large human settlements. MtDNA studies on house mice have the potential to reveal novel aspects of human history.
Of mice and (Viking?) men: phylogeography of British and Irish house mice

PubMed Central

Searle, Jeremy B.; Jones, Catherine S.; Gündüz, İslam; Scascitelli, Moira; Jones, Eleanor P.; Herman, Jeremy S.; Rambau, R. Victor; Noble, Leslie R.; Berry, R.J.; Giménez, Mabel D.; Jóhannesdóttir, Fríða

2008-01-01

The west European subspecies of house mouse (Mus musculus domesticus) has gained much of its current widespread distribution through commensalism with humans. This means that the phylogeography of M. m. domesticus should reflect patterns of human movements. We studied restriction fragment length polymorphism (RFLP) and DNA sequence variations in mouse mitochondrial (mt) DNA throughout the British Isles (328 mice from 105 localities, including previously published data). There is a major mtDNA lineage revealed by both RFLP and sequence analyses, which is restricted to the northern and western peripheries of the British Isles, and also occurs in Norway. This distribution of the ‘Orkney’ lineage fits well with the sphere of influence of the Norwegian Vikings and was probably generated through inadvertent transport by them. To form viable populations, house mice would have required large human settlements such as the Norwegian Vikings founded. The other parts of the British Isles (essentially most of mainland Britain) are characterized by house mice with different mtDNA sequences, some of which are also found in Germany, and which probably reflect both Iron Age movements of people and mice and earlier development of large human settlements. MtDNA studies on house mice have the potential to reveal novel aspects of human history. PMID:18826939
DNA methylation dynamics during early plant life.

PubMed

Bouyer, Daniel; Kramdi, Amira; Kassam, Mohamed; Heese, Maren; Schnittger, Arp; Roudier, François; Colot, Vincent

2017-09-25

Cytosine methylation is crucial for gene regulation and silencing of transposable elements in mammals and plants. While this epigenetic mark is extensively reprogrammed in the germline and early embryos of mammals, the extent to which DNA methylation is reset between generations in plants remains largely unknown. Using Arabidopsis as a model, we uncovered distinct DNA methylation dynamics over transposable element sequences during the early stages of plant development. Specifically, transposable elements and their relics show invariably high methylation at CG sites but increasing methylation at CHG and CHH sites. This non-CG methylation culminates in mature embryos, where it reaches saturation for a large fraction of methylated CHH sites, compared to the typical 10-20% methylation level observed in seedlings or adult plants. Moreover, the increase in CHH methylation during embryogenesis matches the hypomethylated state in the early endosperm. Finally, we show that interfering with the embryo-to-seedling transition results in the persistence of high CHH methylation levels after germination, specifically over sequences that are targeted by the RNA-directed DNA methylation (RdDM) machinery. Our findings indicate the absence of extensive resetting of DNA methylation patterns during early plant life and point instead to an important role of RdDM in reinforcing DNA methylation of transposable element sequences in every cell of the mature embryo. Furthermore, we provide evidence that this elevated RdDM activity is a specific property of embryogenesis.
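The CG, CHG and CHH sequence contexts named in this abstract follow the usual convention in which H stands for A, C or T. A minimal Python sketch of how a cytosine could be assigned to one of these contexts is given below; the function names are hypothetical and the code only inspects the forward strand of a plain string, which is a simplification relative to any real methylation-calling pipeline.

```python
from collections import Counter

H_BASES = set("ACT")  # IUPAC code H: any base except G


def cytosine_context(seq, i):
    """Return 'CG', 'CHG' or 'CHH' for the cytosine at index i (5'->3'),
    or None if seq[i] is not C or the context cannot be determined."""
    seq = seq.upper()
    if i < 0 or i >= len(seq) or seq[i] != "C":
        return None
    first = seq[i + 1:i + 2]   # empty string near the sequence end
    second = seq[i + 2:i + 3]
    if first == "G":
        return "CG"
    if first in H_BASES and second == "G":
        return "CHG"
    if first in H_BASES and second in H_BASES:
        return "CHH"
    return None


def count_contexts(seq):
    """Tally cytosine contexts along the forward strand only."""
    contexts = (cytosine_context(seq, i) for i in range(len(seq)))
    return Counter(c for c in contexts if c is not None)


print(count_contexts("CGCAGCTTC"))
```

In the toy sequence, the last cytosine is left unclassified because fewer than two downstream bases are available.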
Amazonian phylogeography: mtDNA sequence variation in arboreal echimyid rodents (Caviomorpha).

PubMed

da Silva, M N; Patton, J L

1993-09-01

Patterns of evolutionary relationships among haplotype clades of sequences of the mitochondrial cytochrome b DNA gene are examined for five genera of arboreal rodents of the Caviomorph family Echimyidae from the Amazon Basin. Data are available for 798 bp of sequence from a total of 24 separate localities in Peru, Venezuela, Bolivia, and Brazil for Mesomys, Isothrix, Makalata, Dactylomys, and Echimys. Sequence divergence, corrected for multiple hits, is extensive, ranging from less than 1% for comparisons within populations to over 20% among geographic units within genera. Both the degree of differentiation and the geographic patterning of the variation suggest that more than one species composes the Amazonian distribution of the currently recognized Mesomys hispidus, Isothrix bistriata, Makalata didelphoides, and Dactylomys dactylinus. There is general concordance in the geographic range of haplotype clades for each of these taxa, and the overall level of differentiation within them is largely equivalent. These observations suggest that a common vicariant history underlies the respective diversification of each genus. However, estimated times of divergence based on the rate of third position transversion substitutions for the major clades within each genus typically range above 1 million years. Thus, allopatric isolation precipitating divergence must have been considerably earlier than the late Pleistocene forest fragmentation events commonly invoked for Amazonian biota.
Linkage Study Revealed Complex Haplotypes in a Multifamily due to Different Mutations in CAPN3 Gene in an Iranian Ethnic Group.

PubMed

Mojbafan, Marzieh; Tonekaboni, Seyed Hassan; Abiri, Maryam; Kianfar, Soudeh; Sarhadi, Ameneh; Nilipour, Yalda; Tavakkoly-Bazzaz, Javad; Zeinali, Sirous

2016-07-01

Calpainopathy is an autosomal recessive form of limb girdle muscular dystrophies which is caused by mutation in CAPN3 gene. In the present study, co-segregation of this disorder was analyzed with four short tandem repeat markers linked to the CAPN3 gene. Three apparently unrelated Iranian families with same ethnicity were investigated. Haplotype analysis and sequencing of the CAPN3 gene were performed. DNA sample from one of the patients was simultaneously sent for next-generation sequencing. DNA sequencing identified two mutations. It was seen as a homozygous c.2105C>T in exon 19 in one family, a homozygous novel mutation c.380G>A in exon 3 in another family, and a compound heterozygote form of these two mutations in the third family. Next-generation sequencing also confirmed our results. It was expected that, due to the rare nature of limb girdle muscular dystrophies, affected individuals from the same ethnic group share similar mutations. Haplotype analysis showed two different homozygote patterns in two families, yet a compound heterozygote pattern in the third family as seen in the mutation analysis. This study shows that haplotype analysis would help in determining presence of different founders.
Detection of cystic fibrosis mutations in a GeneChip™ assay format

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miyada, C.G.; Cronin, M.T.; Kim, S.M.

1994-09-01

We are developing assays for the detection of cystic fibrosis mutations based on DNA hybridization. A DNA sample is amplified by PCR, labeled by incorporating a fluorescein-tagged dNTP, enzymatically treated to produce smaller fragments and hybridized to a series of short (13-16 bases) oligonucleotides synthesized on a glass surface via photolithography. The hybrids are detected by epifluorescence and mutations are identified by the specific pattern of hybridization. In a GeneChip assay, the chip surface is composed of a series of subarrays, each being specific for a particular mutation. Each subarray is further subdivided into a series of probes (40 total), half based on the mutant sequence and the remainder based on the wild-type sequence. For each of the subarrays, there is a redundancy in the number of probes that should hybridize to either a wild-type or a mutant target. The multiple probe strategy provides sequence information for a short five base region overlapping the mutation site. In addition, homozygous wild-type and mutant as well as heterozygous samples are each identified by a specific pattern of hybridization. The small size of each probe feature (250 x 250 µm²) permits the inclusion of additional probes required to generate sequence information by hybridization.
Patterns of linkage disequilibrium in mitochondrial DNA of 16 ruminant populations.

PubMed

Slate, J; Phua, S H

2003-03-01

Mitochondrial DNA (mtDNA) is a widely employed molecular tool in phylogeography, in the inference of human evolutionary history, in dating the domestication of livestock and in forensic science. In humans and other vertebrates the popularity of mtDNA can be partially attributed to an assumption of strict maternal inheritance, such that there is no recombination between mitochondrial lineages. The recent demonstration that linkage disequilibrium (LD) declines as a function of distance between polymorphic sites in hominid mitochondrial genomes has been interpreted as evidence of recombination between mtDNA haplotypes, and hence nonclonal inheritance. However, critics of mtDNA recombination have suggested that this association is an artefact of an inappropriate measure of LD or of sequencing error, and subsequent studies of other populations have failed to replicate the initial finding. Here we report the analysis of 16 ruminant populations and present evidence that LD significantly declines with distance in five of them. A meta-analysis of the data indicates a nonsignificant trend of LD declining with distance. Most of the earlier criticisms of patterns between LD and distance in hominid mtDNA are not applicable to this data set. Our results suggest that either ruminant mtDNA is not strictly clonal or that compensatory selection has influenced patterns of variation at closely linked sites within the mitochondrial control region. The potential impact of these processes should be considered when using mtDNA as a tool in vertebrate population genetic, phylogenetic and forensic studies.
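To make the "LD versus distance" relationship discussed in this abstract concrete, the sketch below computes r², one commonly used LD statistic, for every pair of biallelic sites and pairs it with the distance between them. The choice of r², the '0'/'1' haplotype encoding, and the function names are assumptions for illustration; the abstract does not state which LD measure the authors used.

```python
from itertools import combinations


def r_squared(haplotypes, i, j):
    """r^2 between biallelic sites i and j.

    haplotypes: equal-length strings of '0'/'1' (assumed encoding).
    Returns None when either site is monomorphic.
    """
    n = len(haplotypes)
    p_a = sum(h[i] == "1" for h in haplotypes) / n
    p_b = sum(h[j] == "1" for h in haplotypes) / n
    p_ab = sum(h[i] == "1" and h[j] == "1" for h in haplotypes) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return None
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium D
    return d * d / denom


def ld_by_distance(haplotypes, positions):
    """List (distance in bp, r^2) for every pair of polymorphic sites."""
    pairs = []
    for i, j in combinations(range(len(positions)), 2):
        r2 = r_squared(haplotypes, i, j)
        if r2 is not None:
            pairs.append((abs(positions[j] - positions[i]), r2))
    return pairs


# Toy data: four haplotypes typed at three sites located at 10, 50 and 200 bp.
print(ld_by_distance(["110", "110", "001", "001"], [10, 50, 200]))
```

Under strictly clonal inheritance, no systematic decline of r² with distance would be expected; a test of that trend would then be run on the resulting (distance, r²) pairs.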
Methylation analysis of plasma cell-free DNA for breast cancer early detection using bisulfite next-generation sequencing.

PubMed

Li, Zibo; Guo, Xinwu; Tang, Lili; Peng, Limin; Chen, Ming; Luo, Xipeng; Wang, Shouman; Xiao, Zhi; Deng, Zhongping; Dai, Lizhong; Xia, Kun; Wang, Jun

2016-10-01

Circulating cell-free DNA (cfDNA) has been considered as a potential biomarker for non-invasive cancer detection. To evaluate the methylation levels of six candidate genes (EGFR, GREM1, PDGFRB, PPM1E, SOX17, and WRN) in plasma cfDNA as biomarkers for breast cancer early detection, quantitative analysis of the promoter methylation of these genes from 86 breast cancer patients and 67 healthy controls was performed by using microfluidic-PCR-based target enrichment and next-generation bisulfite sequencing technology. The predictive performance of different logistic models based on methylation status of candidate genes was investigated by means of the area under the ROC curve (AUC) and odds ratio (OR) analysis. Results revealed that EGFR, PPM1E, and 8 gene-specific CpG sites showed significantly hypermethylation in cancer patients' plasma and significantly associated with breast cancer (OR ranging from 2.51 to 9.88). The AUC values for these biomarkers were ranging from 0.66 to 0.75. Combinations of multiple hypermethylated genes or CpG sites substantially improved the predictive performance for breast cancer detection. Our study demonstrated the feasibility of quantitative measurement of candidate gene methylation in cfDNA by using microfluidic-PCR-based target enrichment and bisulfite next-generation sequencing, which is worthy of further validation and potentially benefits a broad range of applications in clinical oncology practice. Quantitative analysis of methylation pattern of plasma cfDNA by next-generation sequencing might be a valuable non-invasive tool for early detection of breast cancer.
Extensive sequence-influenced DNA methylation polymorphism in the human genome

PubMed Central

2010-01-01

Background Epigenetic polymorphisms are a potential source of human diversity, but their frequency and relationship to genetic polymorphisms are unclear. DNA methylation, an epigenetic mark that is a covalent modification of the DNA itself, plays an important role in the regulation of gene expression. Most studies of DNA methylation in mammalian cells have focused on CpG methylation present in CpG islands (areas of concentrated CpGs often found near promoters), but there are also interesting patterns of CpG methylation found outside of CpG islands. Results We compared DNA methylation patterns on both alleles between many pairs (and larger groups) of related and unrelated individuals. Direct observation and simulation experiments revealed that around 10% of common single nucleotide polymorphisms (SNPs) reside in regions with differences in the propensity for local DNA methylation between the two alleles. We further showed that for the most common form of SNP, a polymorphism at a CpG dinucleotide, the presence of the CpG at the SNP positively affected local DNA methylation in cis. Conclusions Taken together with the known effect of DNA methylation on mutation rate, our results suggest an interesting interdependence between genetics and epigenetics underlying diversity in the human genome. PMID:20497546
Investigating diversity and possible functions of G-quadruplexes in regulatory regions of maize genes

USDA-ARS's Scientific Manuscript database

G4-quadruplexes are reversible DNA structures that likely function in gene regulation, but exactly how they work is not known. G4 DNA can be predicted from sequence motifs such as the pattern G-G-G-N(1,7)-G-G-G-N(1,7)-G-G-G-N(1,7)-G-G-G-N(1,7). In the maize genome, G4 motifs were found to occupy ...
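The sequence motif quoted in this record translates directly into a regular expression. The Python sketch below is a literal transcription of the pattern as written above (four repeats of GGG followed by a loop of 1 to 7 bases); the function name is hypothetical, only the given strand is scanned, and overlapping hits are not reported. Note that many G4 predictors use a variant without the trailing loop, which is not what is shown here.

```python
import re

# G-G-G-N(1,7) repeated four times, exactly as quoted in the abstract.
G4_MOTIF = re.compile(r"(?:GGG[ACGT]{1,7}){4}")


def find_g4_motifs(seq):
    """Return (start, end, matched_text) for non-overlapping motif hits.

    Coordinates are 0-based, end-exclusive, on the strand supplied.
    """
    seq = seq.upper()
    return [(m.start(), m.end(), m.group()) for m in G4_MOTIF.finditer(seq)]


print(find_g4_motifs("TTGGGAGGGTCGGGATGGGCATT"))
```

To search the reverse strand as well, the same function could be applied to the reverse complement of the sequence.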
Detection of DNA "fingerprints" of cultivated rice by hybridization with a human minisatellite DNA probe.

PubMed

Dallas, J F

1988-09-01

A human minisatellite DNA probe detects several restriction fragment length polymorphisms in cultivars of Asian and African rice. Certain fragments appear to be inherited in a Mendelian fashion and may represent unlinked loci. The hybridization patterns appear to be cultivar-specific and largely unchanged after the regeneration of plants from tissue culture. The results suggest that these regions of the rice genome may be used to generate cultivar-specific DNA fingerprints. The demonstration of similarity between a human minisatellite sequence and polymorphic regions in the rice genome suggests that such regions also occur in the genomes of many other plant species.
Quantum dot-based microfluidic biosensor for cancer detection

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ghrera, Aditya Sharma; School of Engineering and Technology, ITM University, Gurgaon-122017; Pandey, Chandra Mouli

2015-05-11

We report results of the studies relating to fabrication of an impedimetric microfluidic–based nucleic acid sensor for quantification of DNA sequences specific to chronic myelogenous leukemia (CML). The sensor chip is prepared by patterning an indium–tin–oxide (ITO) coated glass substrate via wet chemical etching method followed by sealing with polydimethylsiloxane (PDMS) microchannel for fluid control. The fabricated microfluidic chip comprising of a patterned ITO substrate is modified by depositing cadmium selenide quantum dots (QCdSe) via Langmuir–Blodgett technique. Further, the QCdSe surface has been functionalized with specific DNA probe for CML detection. The probe DNA functionalized QCdSe integrated miniaturized system has been used to monitor target complementary DNA concentration by measuring the interfacial charge transfer resistance via hybridization. The presence of complementary DNA in buffer solution significantly results in decreased electro-conductivity of the interface due to presence of a charge barrier for transport of the redox probe ions. The microfluidic DNA biosensor exhibits improved linearity in the concentration range of 10⁻¹⁵ M to 10⁻¹¹ M.
DNA methylome of the 20-gigabase Norway spruce genome

PubMed Central

Ausin, Israel; Feng, Suhua; Yu, Chaowei; Liu, Wanlu; Kuo, Hsuan Yu; Jacobsen, Elise L.; Zhai, Jixian; Gallego-Bartolome, Javier; Wang, Lin; Egertsdotter, Ulrika; Street, Nathaniel R.; Jacobsen, Steven E.; Wang, Haifeng

2016-01-01

DNA methylation plays important roles in many biological processes, such as silencing of transposable elements, imprinting, and regulating gene expression. Many studies of DNA methylation have shown its essential roles in angiosperms (flowering plants). However, few studies have examined the roles and patterns of DNA methylation in gymnosperms. Here, we present genome-wide high coverage single-base resolution methylation maps of Norway spruce (Picea abies) from both needles and somatic embryogenesis culture cells via whole genome bisulfite sequencing. On average, DNA methylation levels of CG and CHG of Norway spruce were higher than most other plants studied. CHH methylation was found at a relatively low level; however, at least one copy of most of the RNA-directed DNA methylation pathway genes was found in Norway spruce, and CHH methylation was correlated with levels of siRNAs. In comparison with needles, somatic embryogenesis culture cells that are used for clonally propagating spruce trees showed lower levels of CG and CHG methylation but higher level of CHH methylation, suggesting that like in other species, these culture cells show abnormal methylation patterns. PMID:27911846
Developmentally programmed DNA splicing in Paramecium reveals short-distance crosstalk between DNA cleavage sites

PubMed Central

Gratias, Ariane; Lepère, Gersende; Garnier, Olivier; Rosa, Sarah; Duharcourt, Sandra; Malinsky, Sophie; Meyer, Eric; Bétermier, Mireille

2008-01-01

Somatic genome assembly in the ciliate Paramecium involves the precise excision of thousands of short internal eliminated sequences (IESs) that are scattered throughout the germline genome and often interrupt open reading frames. Excision is initiated by double-strand breaks centered on the TA dinucleotides that are conserved at each IES boundary, but the factors that drive cleavage site recognition remain unknown. A degenerate consensus was identified previously at IES ends and genetic analyses confirmed the participation of their nucleotide sequence in efficient excision. Even for wild-type IESs, however, variant excision patterns (excised or nonexcised) may be inherited maternally through sexual events, in a homology-dependent manner. We show here that this maternal epigenetic control interferes with the targeting of DNA breaks at IES ends. Furthermore, we demonstrate that a mutation in the TA at one end of an IES impairs DNA cleavage not only at the mutant end but also at the wild-type end. We conclude that crosstalk between both ends takes place prior to their cleavage and propose that the ability of an IES to adopt an excision-prone conformation depends on the combination of its nucleotide sequence and of additional determinants. PMID:18420657
16S-23S rDNA intergenic spacer region polymorphism of Lactococcus garvieae, Lactococcus raffinolactis and Lactococcus lactis as revealed by PCR and nucleotide sequence analysis.

PubMed

Blaiotta, Giuseppe; Pepe, Olimpia; Mauriello, Gianluigi; Villani, Francesco; Andolfi, Rosamaria; Moschetti, Giancarlo

2002-12-01

The intergenic spacer region (ISR) between the 16S and 23S rRNA genes was tested as a tool for differentiating lactococci commonly isolated in a dairy environment. 17 reference strains, representing 11 different species belonging to the genera Lactococcus, Streptococcus, Lactobacillus, Enterococcus and Leuconostoc, and 127 wild streptococcal strains isolated during the whole fermentation process of "Fior di Latte" cheese were analyzed. After 16S-23S rDNA ISR amplification by PCR, species or genus-specific patterns were obtained for most of the reference strains tested. Moreover, results obtained after nucleotide analysis show that the 16S-23S rDNA ISR sequences vary greatly, in size and sequence, among Lactococcus garvieae, Lactococcus raffinolactis, Lactococcus lactis as well as other streptococci from dairy environments. Because of the high degree of inter-specific polymorphism observed, 16S-23S rDNA ISR can be considered a good potential target for selecting species-specific molecular assays, such as PCR primer or probes, for a rapid and extremely reliable differentiation of dairy lactococcal isolates.
The evolutionary dynamics of the lion Panthera leo revealed by host and viral population genomics.

PubMed

Antunes, Agostinho; Troyer, Jennifer L; Roelke, Melody E; Pecon-Slattery, Jill; Packer, Craig; Winterbach, Christiaan; Winterbach, Hanlie; Hemson, Graham; Frank, Laurence; Stander, Philip; Siefert, Ludwig; Driciru, Margaret; Funston, Paul J; Alexander, Kathy A; Prager, Katherine C; Mills, Gus; Wildt, David; Bush, Mitch; O'Brien, Stephen J; Johnson, Warren E

2008-11-01

The lion Panthera leo is one of the world's most charismatic carnivores and is one of Africa's key predators. Here, we used a large dataset from 357 lions comprehending 1.13 megabases of sequence data and genotypes from 22 microsatellite loci to characterize its recent evolutionary history. Patterns of molecular genetic variation in multiple maternal (mtDNA), paternal (Y-chromosome), and biparental nuclear (nDNA) genetic markers were compared with patterns of sequence and subtype variation of the lion feline immunodeficiency virus (FIV(Ple)), a lentivirus analogous to human immunodeficiency virus (HIV). In spite of the ability of lions to disperse long distances, patterns of lion genetic diversity suggest substantial population subdivision (mtDNA Phi(ST) = 0.92; nDNA F(ST) = 0.18), and reduced gene flow, which, along with large differences in sero-prevalence of six distinct FIV(Ple) subtypes among lion populations, refute the hypothesis that African lions consist of a single panmictic population. Our results suggest that extant lion populations derive from several Pleistocene refugia in East and Southern Africa (approximately 324,000-169,000 years ago), which expanded during the Late Pleistocene (approximately 100,000 years ago) into Central and North Africa and into Asia. During the Pleistocene/Holocene transition (approximately 14,000-7,000 years), another expansion occurred from southern refugia northwards towards East Africa, causing population interbreeding. In particular, lion and FIV(Ple) variation affirms that the large, well-studied lion population occupying the greater Serengeti Ecosystem is derived from three distinct populations that admixed recently.
Vicariance, long-distance dispersal, and regional extinction-recolonization dynamics explain the disjunct circumpolar distribution of the arctic-alpine plant Silene acaulis.

PubMed

Gussarova, Galina; Allen, Geraldine A; Mikhaylova, Yulia; McCormick, Laurie J; Mirré, Virginia; Marr, Kendrick L; Hebda, Richard J; Brochmann, Christian

2015-10-01

Many arctic-alpine species have vast geographic ranges, but these may encompass substantial gaps whose origins are poorly understood. Here we address the phylogeographic history of Silene acaulis, a perennial cushion plant with a circumpolar distribution except for a large gap in Siberia. We assessed genetic variation in a range-wide sample of 103 populations using plastid DNA (pDNA) sequences and AFLPs (amplified fragment length polymorphisms). We constructed a haplotype network and performed Bayesian phylogenetic analyses based on plastid sequences. We visualized AFLP patterns using principal coordinate analysis, identified genetic groups using the program structure, and estimated genetic diversity and rarity indices by geographic region. The history of the main pDNA lineages was estimated to span several glaciations. AFLP data revealed a distinct division between Beringia/North America and Europe/East Greenland. These two regions shared only one of 17 pDNA haplotypes. Populations on opposite sides of the Siberian range gap (Ural Mountains and Chukotka) were genetically distinct and appear to have resulted from postglacial leading-edge colonizations. We inferred two refugia in North America (Beringia and the southern Rocky Mountains) and two in Europe (central-southern Europe and northern Europe/East Greenland). Patterns in the East Atlantic region suggested transoceanic long-distance dispersal events. Silene acaulis has a highly dynamic history characterized by vicariance, regional extinction, and recolonization, with persistence in at least four refugia. Long-distance dispersal explains patterns across the Atlantic Ocean, but we found no evidence of dispersal across the Siberian range gap. © 2015 Botanical Society of America.
The Evolutionary Dynamics of the Lion Panthera leo Revealed by Host and Viral Population Genomics

PubMed Central

Antunes, Agostinho; Troyer, Jennifer L.; Roelke, Melody E.; Pecon-Slattery, Jill; Packer, Craig; Winterbach, Christiaan; Winterbach, Hanlie; Hemson, Graham; Frank, Laurence; Stander, Philip; Siefert, Ludwig; Driciru, Margaret; Funston, Paul J.; Alexander, Kathy A.; Prager, Katherine C.; Mills, Gus; Wildt, David; Bush, Mitch; O'Brien, Stephen J.; Johnson, Warren E.

2008-01-01

The lion Panthera leo is one of the world's most charismatic carnivores and is one of Africa's key predators. Here, we used a large dataset from 357 lions comprehending 1.13 megabases of sequence data and genotypes from 22 microsatellite loci to characterize its recent evolutionary history. Patterns of molecular genetic variation in multiple maternal (mtDNA), paternal (Y-chromosome), and biparental nuclear (nDNA) genetic markers were compared with patterns of sequence and subtype variation of the lion feline immunodeficiency virus (FIVPle), a lentivirus analogous to human immunodeficiency virus (HIV). In spite of the ability of lions to disperse long distances, patterns of lion genetic diversity suggest substantial population subdivision (mtDNA ΦST = 0.92; nDNA FST = 0.18), and reduced gene flow, which, along with large differences in sero-prevalence of six distinct FIVPle subtypes among lion populations, refute the hypothesis that African lions consist of a single panmictic population. Our results suggest that extant lion populations derive from several Pleistocene refugia in East and Southern Africa (∼324,000–169,000 years ago), which expanded during the Late Pleistocene (∼100,000 years ago) into Central and North Africa and into Asia. During the Pleistocene/Holocene transition (∼14,000–7,000 years), another expansion occurred from southern refugia northwards towards East Africa, causing population interbreeding. In particular, lion and FIVPle variation affirms that the large, well-studied lion population occupying the greater Serengeti Ecosystem is derived from three distinct populations that admixed recently. PMID:18989457
A discrete artificial bee colony algorithm for detecting transcription factor binding sites in DNA sequences.

PubMed

Karaboga, D; Aslan, S

2016-04-27

The great majority of biological sequences share significant similarity with other sequences as a result of evolutionary processes, and identifying these sequence similarities is one of the most challenging problems in bioinformatics. In this paper, we present a discrete artificial bee colony (ABC) algorithm, which is inspired by the intelligent foraging behavior of real honey bees, for the detection of highly conserved residue patterns or motifs within sequences. Experimental studies on three different data sets showed that the proposed discrete model, by adhering to the fundamental scheme of the ABC algorithm, produced competitive or better results than other metaheuristic motif discovery techniques.
Partial bisulfite conversion for unique template sequencing

PubMed Central

Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael

2018-01-01

We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. PMID:29161423
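The idea of grouping reads by a shared conversion pattern can be sketched in Python under strong simplifying assumptions. This is not the muSeq software: the names `conversion_signature` and `cluster_by_signature` are hypothetical, every read is assumed to cover the same reference window at the same offset and length, only C-to-T conversions are considered (the G-to-A case on the opposite strand is omitted), and sequencing errors are ignored.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def conversion_signature(reference: str, read: str) -> Tuple[int, ...]:
    """Positions where the reference has C and the read shows T.

    Assumption: the read is already aligned to the reference with no gaps.
    """
    return tuple(
        i
        for i, (ref_base, read_base) in enumerate(zip(reference, read))
        if ref_base == "C" and read_base == "T"
    )


def cluster_by_signature(
    reference: str, reads: List[str]
) -> Dict[Tuple[int, ...], List[str]]:
    """Group reads whose C-to-T conversion positions are identical."""
    clusters: Dict[Tuple[int, ...], List[str]] = defaultdict(list)
    for read in reads:
        clusters[conversion_signature(reference, read)].append(read)
    return dict(clusters)


if __name__ == "__main__":
    reference = "ACCGTCA"
    reads = ["ATCGTCA", "ATCGTCA", "ACTGTTA", "ACCGTCA"]
    for signature, members in cluster_by_signature(reference, reads).items():
        print(signature, len(members))
```

In this toy setting the number of distinct signatures estimates the number of initial templates. Real data would need tolerance for sequencing error and for reads that overlap only partially, which exact-match grouping does not provide.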
Zygosaccharomyces kombuchaensis, a new ascosporogenous yeast from 'Kombucha tea'.

PubMed

Kurtzman, C P; Robnett, C J; Basehoar-Powers, E

2001-07-01

A new ascosporogenous yeast, Zygosaccharomyces kombuchaensis sp. n. (type strain NRRL YB-4811, CBS 8849), is described; it was isolated from Kombucha tea, a popular fermented tea-based beverage. The four known strains of the new species have identical nucleotide sequences in domain D1/D2 of 26S rDNA. Phylogenetic analysis of D1/D2 and 18S rDNA sequences places Z. kombuchaensis near Zygosaccharomyces lentus. The two species are indistinguishable on standard physiological tests used for yeast identification, but can be recognized from differences in restriction fragment length polymorphism patterns obtained by digestion of 18S-ITS1 amplicons with the restriction enzymes DdeI and MboI.
The anti-tumor drug bleomycin preferentially cleaves at the transcription start sites of actively transcribed genes in human cells.

PubMed

Murray, Vincent; Chen, Jon K; Galea, Anne M

2014-04-01

The genome-wide pattern of DNA cleavage at transcription start sites (TSSs) for the anti-tumor drug bleomycin was examined in human HeLa cells using next-generation DNA sequencing. It was found that actively transcribed genes were preferentially cleaved compared with non-transcribed genes. The 143,600 identified human TSSs were split into non-transcribed genes (82,596) and transcribed genes (61,004) for HeLa cells. These transcribed genes were further split into quintiles of 12,201 genes comprising the top 20, 20-40, 40-60, 60-80, and 80-100 % of expressed genes. The bleomycin cleavage pattern at highly transcribed gene TSSs was greatly enhanced compared with purified DNA and non-transcribed gene TSSs. The top 20 and 20-40 % quintiles had a very similar enhanced cleavage pattern, the 40-60 % quintile was intermediate, while the 60-80 and 80-100 % quintiles were close to the non-transcribed and purified DNA profiles. The pattern of bleomycin enhanced cleavage had peaks that were approximately 200 bp apart, and this indicated that bleomycin was identifying the presence of phased nucleosomes at TSSs. Hence bleomycin can be utilized to detect chromatin structures that are present at actively transcribed genes. In this study, for the first time, the pattern of DNA damage by a clinically utilized cancer chemotherapeutic agent was performed on a human genome-wide scale at the nucleotide level.
Autosomal-dominant Leber Congenital Amaurosis Caused by a Heterozygous CRX Mutation in a Father and Son.

PubMed

Arcot Sadagopan, Karthikeyan; Battista, Robert; Keep, Rosanne B; Capasso, Jenina E; Levin, Alex V

2015-06-01

Leber congenital amaurosis (LCA) is most often an autosomal recessive disorder. We report a father and son with autosomal dominant LCA due to a mutation in the CRX gene. DNA screening using an allele specific assay of 90 of the most common LCA-causing variations in the coding sequences of AIPL1, CEP290, CRB1, CRX, GUCY2D, RDH12 and RPE65 was performed on the father. Automated DNA sequencing of his son examining exon 3 of the CRX gene was subsequently performed. Both father and son have a heterozygous single base pair deletion of an adenine at codon 153 in the coding sequence of the CRX gene resulting in a frameshift mutation. Mutations involving the CRX gene may demonstrate an autosomal dominant inheritance pattern for LCA.
Identification of Y-Chromosome Sequences in Turner Syndrome.

PubMed

Silva-Grecco, Roseane Lopes da; Trovó-Marqui, Alessandra Bernadete; Sousa, Tiago Alves de; Croce, Lilian Da; Balarin, Marly Aparecida Spadotto

2016-05-01

To investigate the presence of Y-chromosome sequences and determine their frequency in patients with Turner syndrome. The study included 23 patients with Turner syndrome from Brazil, who gave written informed consent for participating in the study. Cytogenetic analyses were performed in peripheral blood lymphocytes, with 100 metaphases per patient. Genomic DNA was also extracted from peripheral blood lymphocytes, and gene sequences DYZ1, DYZ3, ZFY and SRY were amplified by Polymerase Chain Reaction. The cytogenetic analysis showed a 45,X karyotype in 9 patients (39.2 %) and a mosaic pattern in 14 (60.8 %). In 8.7 % (2 out of 23) of the patients, Y-chromosome sequences were found. This prevalence is very similar to those reported previously. The initial karyotype analysis of these patients did not reveal Y-chromosome material, but they were found positive for Y-specific sequences in the lymphocyte DNA analysis. The PCR technique showed that 2 (8.7 %) of the patients with Turner syndrome had Y-chromosome sequences, both presenting marker chromosomes on cytogenetic analysis.
Evidence for recombination of mitochondrial DNA in triploid crucian carp.

PubMed

Guo, Xinhong; Liu, Shaojun; Liu, Yun

2006-03-01

In this study, we report the complete mitochondrial DNA (mtDNA) sequences of the allotetraploid and triploid crucian carp and compare the complete mtDNA sequences between the triploid crucian carp and its female parent Japanese crucian carp and between the triploid crucian carp and its male parent allotetraploid. Our results indicate that the complete mtDNA nucleotide identity (98%) between the triploid crucian carp and its male parent allotetraploid was higher than that (93%) between the triploid crucian carp and its female parent Japanese crucian carp. Moreover, the presence of a pattern of identity and difference at synonymous sites of mitochondrial genomes between the triploid crucian carp and its parents provides direct evidence that triploid crucian carp possessed the recombination mtDNA fragment (12,759 bp) derived from the paternal fish. These results suggest that mtDNA recombination was derived from the fusion of the maternal and paternal mtDNAs. Compared with the haploid egg with one set of genome from the Japanese crucian carp, the diploid sperm with two sets of genomes from the allotetraploid could more easily make its mtDNA fuse with the mtDNA of the haploid egg. In addition, the triple hybrid nature of the triploid crucian carp probably allowed its better mtDNA recombination. In summary, our results provide the first evidence of mtDNA combination in polyploid fish.
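The abstract reports nucleotide identities (98% and 93%) without stating how they were calculated. As an illustration only, the sketch below shows one common way to compute percent identity from two sequences that have already been aligned to equal length, with `-` marking gaps. The function name `percent_identity` and the convention of counting a gap opposite a base as a mismatch are assumptions, not the authors' method.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of compared alignment columns in which both sequences agree.

    Assumptions: inputs are pre-aligned and of equal length; columns where
    both sequences have a gap are skipped; a gap opposite a base counts as
    a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for base_a, base_b in zip(aligned_a.upper(), aligned_b.upper()):
        if base_a == "-" and base_b == "-":
            continue  # column carries no information for this pair
        compared += 1
        if base_a == base_b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


if __name__ == "__main__":
    print(percent_identity("ACGTACGTAC", "ACGTACGTAT"))
```

Other conventions exist, such as ignoring all gapped columns, and they can shift the reported value, so a percentage is only comparable across studies when the convention is known.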
Localization of the 5S and 45S rDNA Sites and cpDNA Sequence Analysis in Species of the Quadrifaria Group of Paspalum (Poaceae, Paniceae)

PubMed Central

VAIO, MAGDALENA; SPERANZA, PABLO; VALLS, JOSÉ FRANCISCO; GUERRA, MARCELO; MAZZELLA, CRISTINA

2005-01-01

• Background and Aims The Quadrifaria group of Paspalum (Poaceae, Paniceae) comprises species native to the subtropical and temperate regions of South America. The purpose of this research was to characterize the I genomes in five species of this group and to establish phylogenetic relationships among them. • Methods Prometaphase chromatin condensation patterns, the physical location of 5S and 45S rDNA sites by fluorescence in situ hybridization (FISH), and sequences of five chloroplast non-coding regions were analysed. • Key Results The condensation patterns observed were highly conserved among diploid and tetraploid accessions studied and not influenced by the dyes used or by the FISH procedure, allowing the identification of almost all the chromosome pairs that carried the rDNA signals. The FISH analysis of 5S rDNA sites showed the same localization and a correspondence between the number of sites and ploidy level. In contrast, the distribution of 45S rDNA sites was variable. Two general patterns were observed with respect to the location of the 45S rDNA. The species and cytotypes Paspalum haumanii 2x, P. intermedium 2x, P. quadrifarium 4x and P. exaltatum 4x showed proximal sites on chromosome 8 and two to four distal sites in other chromosomes, while P. quarinii 4x and P. quadrifarium 2x showed only distal sites located on a variable number of small chromosomes and on the long arm of chromosome 1. The single most-parsimonious tree found from the phylogenetic analysis showed the Quadrifaria species partitioned in two clades, one of them includes P. haumanii 2x and P. intermedium 2x together with P. quadrifarium 4x and P. exaltatum 4x, while the other contains P. quadrifarium 2x and P. quarinii 4x. • Conclusions The subdivision found with FISH is consistent with the clades recovered with cpDNA data and both analyses suggest that the Quadrifaria group, as presently defined, is not monophyletic and its species belong in at least two clades. PMID:15911540

Deletion endpoint allele-specificity in the developmentally regulated elimination of an internal sequence (IES) in Paramecium.

PubMed Central

Dubrana, K; Le Mouël, A; Amar, L

1997-01-01

Ciliated protozoa undergo thousands of site-specific DNA deletion events during the programmed development of micronuclear genomes to macronuclear genomes. Two deletion elements, W1 and W2, were identified in the Paramecium primaurelia wild-type 156 strain. Here, we report the characterization of both elements in wild-type strain 168 and show that they display variant deletion patterns when compared with those of strain 156. The W1(168) element is defective for deletion. The W2(168) element is excised utilizing two alternative boundaries on one side; both are different from the boundary utilized to excise the W2(156) element. By crossing the 156 and 168 strains, we demonstrate that the definition of each deletion endpoint is controlled by cis-acting determinant(s) rather than by strain-specific trans-acting factor(s). Sequence comparison of all deleted DNA segments indicates that the 5'-TA-3' terminal sequence is strictly required at their ends. Furthermore, the identity of the first eight base pairs of these ends to a previously established consensus sequence correlates with the frequency of the corresponding deletion events. Our data imply the existence of an adaptive convergent evolution of these Paramecium deleted DNA segment end sequences. PMID:9171098
Detection and Phylogenetic Analysis of Wolbachia in the Asiatic Rice Leafroller, Cnaphalocrocis medinalis, in Chinese Populations

PubMed Central

Chai, Huan-Na; Du, Yu-Zhou; Qiu, Bao-Li; Zhai, Bao-Ping

2011-01-01

Wolbachia are a group of intracellular inherited endosymbiotic bacteria infecting a wide range of insects. In this study the infection status of Wolbachia (Rickettsiales: Rickettsiaceae) was measured in the Asiatic rice leafroller, Cnaphalocrocis medinalis (Guenée) (Lepidoptera: Pyralidae), from twenty locations in China by sequencing wsp, ftsZ and 16S rDNA genes. The results showed high infection rates of Wolbachia in C. medinalis populations. Wolbachia was detected in all geographically separate populations; the average infection rate was ∼62.5%, and the highest rates were 90% in Wenzhou and Yangzhou populations. The Wolbachia detected in different C. medinalis populations were 100% identical to each other when wsp, ftsZ, and 16S rDNA sequences were compared, with all sequences belonging to the Wolbachia B supergroup. Based on wsp, ftsZ and 16S rDNA sequences of Wolbachia, three phylogenetic trees of similar pattern emerged. This analysis indicated the possibility of inter-species and intra-species horizontal transmission of Wolbachia in different arthropods in related geographical regions. The migration route of C. medinalis in mainland China was also discussed since large differentiation had been found between the wsp sequences of Chinese and Thai populations. PMID:22233324
Fingerprinting of HLA class I genes for improved selection of unrelated bone marrow donors.

PubMed

Martinelli, G; Farabegoli, P; Buzzi, M; Panzica, G; Zaccaria, A; Bandini, G; Calori, E; Testoni, N; Rosti, G; Conte, R; Remiddi, C; Salvucci, M; De Vivo, A; Tura, S

1996-02-01

The degree of matching of HLA genes between the selected donor and recipient is an important aspect of the selection of unrelated donors for allogeneic bone marrow transplantation (UBMT). The most sensitive methods currently used are serological typing of HLA class I genes, mixed lymphocyte culture (MLC), IEF and molecular genotyping of HLA class II genes by direct sequencing of PCR products. Serological typing of class I antigens (A, B and C) fails to detect minor differences demonstrated by direct sequencing of DNA polymorphic regions. Molecular genotyping of HLA class I genes by DNA analysis is costly and work-intensive. To improve compatibility between donor and recipient, we have set up a new rapid and non-radioisotopic application of the 'fingerprinting PCR' technique for the analysis of the polymorphic second exon of the HLA class I A, B and C genes. This technique is based on the formation of specific patterns (PCR fingerprints) of homoduplexes and heteroduplexes between heterologous amplified DNA sequences. After an electrophoretic run on non-denaturing polyacrylamide gel, different HLA class I types give allele-specific banding patterns. HLA class I matching is performed, after the gel has been soaked in ethidium bromide or silver-stained, by visual comparison of patients' fingerprints with those of donors. Identity can be confirmed by mixing donor and recipient DNAs in an amplification cross-match. To assess the technique, 10 normal samples, 22 related allogeneic bone marrow transplanted pairs and 10 unrelated HLA-A and HLA-B serologically matched patient-donor pairs were analysed for HLA class I polymorphic regions. In all the related pairs and in 1/10 unrelated pairs, matched donor-recipient patterns were identified. This new application of PCR fingerprinting may confirm the HLA class I serological selection of unrelated marrow donors.
Salt Stress Induced Variation in DNA Methylation Pattern and Its Influence on Gene Expression in Contrasting Rice Genotypes

PubMed Central

Karan, Ratna; DeLeon, Teresa; Biradar, Hanamareddy; Subudhi, Prasanta K.

2012-01-01

Background Salinity is a major environmental factor limiting productivity of crop plants including rice in which wide range of natural variability exists. Although recent evidences implicate epigenetic mechanisms for modulating the gene expression in plants under environmental stresses, epigenetic changes and their functional consequences under salinity stress in rice are underexplored. DNA methylation is one of the epigenetic mechanisms regulating gene expression in plant’s responses to environmental stresses. Better understanding of epigenetic regulation of plant growth and response to environmental stresses may create novel heritable variation for crop improvement. Methodology/Principal Findings Methylation sensitive amplification polymorphism (MSAP) technique was used to assess the effect of salt stress on extent and patterns of DNA methylation in four genotypes of rice differing in the degree of salinity tolerance. Overall, the amount of DNA methylation was more in shoot compared to root and the contribution of fully methylated loci was always more than hemi-methylated loci. Sequencing of ten randomly selected MSAP fragments indicated gene-body specific DNA methylation of retrotransposons, stress responsive genes, and chromatin modification genes, distributed on different rice chromosomes. Bisulphite sequencing and quantitative RT-PCR analysis of selected MSAP loci showed that cytosine methylation changes under salinity as well as gene expression varied with genotypes and tissue types irrespective of the level of salinity tolerance of rice genotypes. Conclusions/Significance The gene body methylation may have an important role in regulating gene expression in organ and genotype specific manner under salinity stress. Association between salt tolerance and methylation changes observed in some cases suggested that many methylation changes are not “directed”. The natural genetic variation for salt tolerance observed in rice germplasm may be independent of the extent and pattern of DNA methylation which may have been induced by abiotic stress followed by accumulation through the natural selection process. PMID:22761959
Vaginal microbial flora analysis by next generation sequencing and microarrays; can microbes indicate vaginal origin in a forensic context?

PubMed

Benschop, Corina C G; Quaak, Frederike C A; Boon, Mathilde E; Sijen, Titia; Kuiper, Irene

2012-03-01

Forensic analysis of biological traces generally encompasses the investigation of both the person who contributed to the trace and the body site(s) from which the trace originates. For instance, for sexual assault cases, it can be beneficial to distinguish vaginal samples from skin or saliva samples. In this study, we explored the use of microbial flora to indicate vaginal origin. First, we explored the vaginal microbiome for a large set of clinical vaginal samples (n = 240) by next generation sequencing (n = 338,184 sequence reads) and found 1,619 different sequences. Next, we selected 389 candidate probes targeting genera or species and designed a microarray, with which we analysed a diverse set of samples; 43 DNA extracts from vaginal samples and 25 DNA extracts from samples from other body sites, including sites in close proximity of or in contact with the vagina. Finally, we used the microarray results and next generation sequencing dataset to assess the potential for a future approach that uses microbial markers to indicate vaginal origin. Since no candidate genera/species were found to positively identify all vaginal DNA extracts on their own, while excluding all non-vaginal DNA extracts, we deduce that a reliable statement about the cellular origin of a biological trace should be based on the detection of multiple species within various genera. Microarray analysis of a sample will then render a microbial flora pattern that is probably best analysed in a probabilistic approach.
DNA replication-timing analysis of human chromosome 22 at high resolution and different developmental states.

PubMed

White, Eric J; Emanuelsson, Olof; Scalzo, David; Royce, Thomas; Kosak, Steven; Oakeley, Edward J; Weissman, Sherman; Gerstein, Mark; Groudine, Mark; Snyder, Michael; Schübeler, Dirk

2004-12-21

Duplication of the genome during the S phase of the cell cycle does not occur simultaneously; rather, different sequences are replicated at different times. The replication timing of specific sequences can change during development; however, the determinants of this dynamic process are poorly understood. To gain insights into the contribution of developmental state, genomic sequence, and transcriptional activity to replication timing, we investigated the timing of DNA replication at high resolution along an entire human chromosome (chromosome 22) in two different cell types. The pattern of replication timing was correlated with respect to annotated genes, gene expression, novel transcribed regions of unknown function, sequence composition, and cytological features. We observed that chromosome 22 contains regions of early- and late-replicating domains of 100 kb to 2 Mb, many (but not all) of which are associated with previously described chromosomal bands. In both cell types, expressed sequences are replicated earlier than nontranscribed regions. However, several highly transcribed regions replicate late. Overall, the DNA replication-timing profiles of the two different cell types are remarkably similar, with only nine regions of difference observed. In one case, this difference reflects the differential expression of an annotated gene that resides in this region. Novel transcribed regions with low coding potential exhibit a strong propensity for early DNA replication. Although the cellular function of such transcripts is poorly understood, our results suggest that their activity is linked to the replication-timing program.
CaMV-35S promoter sequence-specific DNA methylation in lettuce.

PubMed

Okumura, Azusa; Shimada, Asahi; Yamasaki, Satoshi; Horino, Takuya; Iwata, Yuji; Koizumi, Nozomu; Nishihara, Masahiro; Mishiba, Kei-ichiro

2016-01-01

We found 35S promoter sequence-specific DNA methylation in lettuce. Additionally, transgenic lettuce plants having a modified 35S promoter lost methylation, suggesting the modified sequence is subjected to the methylation machinery. We previously reported that cauliflower mosaic virus 35S promoter-specific DNA methylation in transgenic gentian (Gentiana triflora × G. scabra) plants occurs irrespective of the copy number and the genomic location of T-DNA, and causes strong gene silencing. To confirm whether 35S-specific methylation can occur in other plant species, transgenic lettuce (Lactuca sativa L.) plants with a single copy of the 35S promoter-driven sGFP gene were produced and analyzed. Among 10 lines of transgenic plants, 3, 4, and 3 lines showed strong, weak, and no expression of sGFP mRNA, respectively. Bisulfite genomic sequencing of the 35S promoter region showed hypermethylation at CpG and CpWpG (where W is A or T) sites in 9 of 10 lines. Gentian-type de novo methylation pattern, consisting of methylated cytosines at CpHpH (where H is A, C, or T) sites, was also observed in the transgenic lettuce lines, suggesting that lettuce and gentian share similar methylation machinery. Four of five transgenic lettuce lines having a single copy of a modified 35S promoter, which was modified in the proposed core target of de novo methylation in gentian, exhibited 35S hypomethylation, indicating that the modified sequence may be the target of the 35S-specific methylation machinery.
Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

PubMed Central

Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

2012-01-01

The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697
Intra-isolate genome variation in arbuscular mycorrhizal fungi persists in the transcriptome.

PubMed

Boon, E; Zimmerman, E; Lang, B F; Hijri, M

2010-07-01

Arbuscular mycorrhizal fungi (AMF) are heterokaryotes with an unusual genetic makeup. Substantial genetic variation occurs among nuclei within a single mycelium or isolate. AMF reproduce through spores that contain varying fractions of this heterogeneous population of nuclei. It is not clear whether this genetic variation on the genome level actually contributes to the AMF phenotype. To investigate the extent to which polymorphisms in nuclear genes are transcribed, we analysed the intra-isolate genomic and cDNA sequence variation of two genes, the large subunit ribosomal RNA (LSU rDNA) of Glomus sp. DAOM-197198 (previously known as G. intraradices) and the POL1-like sequence (PLS) of Glomus etunicatum. For both genes, we find high sequence variation at the genome and transcriptome level. Reconstruction of LSU rDNA secondary structure shows that all variants are functional. Patterns of PLS sequence polymorphism indicate that there is one functional gene copy, PLS2, which is preferentially transcribed, and one gene copy, PLS1, which is a pseudogene. This is the first study that investigates AMF intra-isolate variation at the transcriptome level. In conclusion, it is possible that, in AMF, multiple nuclear genomes contribute to a single phenotype.
The GAGA protein of Drosophila is phosphorylated by CK2.

PubMed

Bonet, Carles; Fernández, Irene; Aran, Xavier; Bernués, Jordi; Giralt, Ernest; Azorín, Fernando

2005-08-19

The GAGA factor of Drosophila is a sequence-specific DNA-binding protein that contributes to multiple processes from the regulation of gene expression to the structural organisation of heterochromatin and chromatin remodelling. GAGA is known to interact with various other proteins (tramtrack, pipsqueak, batman and dSAP18) and protein complexes (PRC1, NURF and FACT). GAGA functions are likely regulated at the level of post-translational modifications. Little is known, however, about its actual pattern of modification. It was proposed that GAGA can be O-glycosylated. Here, we report that GAGA519 isoform is a phosphoprotein that is phosphorylated by CK2 at the region of the DNA-binding domain. Our results indicate that phosphorylation occurs at S388 and, to a lesser extent, at S378. These two residues are located in a region of the DNA-binding domain that makes no direct contact with DNA, being dispensable for sequence-specific recognition. Phosphorylation at these sites does not abolish DNA binding but reduces the affinity of the interaction. These results are discussed in the context of the various functions and interactions that GAGA supports.
Mechanism of DNA-binding enhancement by the human T-cell leukaemia virus transactivator Tax.

PubMed

Baranger, A M; Palmer, C R; Hamm, M K; Giebler, H A; Brauweiler, A; Nyborg, J K; Schepartz, A

1995-08-17

Tax protein activates transcription of the human T-cell leukaemia virus type I (HTLV-I) genome through three imperfect cyclic AMP-responsive element (CRE) target sites located within the viral promoter. Previous work has shown that Tax interacts with the bZIP element of proteins that bind the CRE target site to promote peptide dimerization, suggesting an association between Tax and bZIP coiled coil. Here we show that the site of interaction with Tax is not the coiled coil, but the basic segment. This interaction increases the stability of the GCN4 bZIP dimer by 1.7 kcal mol⁻¹ and the DNA affinity of the dimer by 1.9 kcal mol⁻¹. The differential effect of Tax on several bZIP-DNA complexes that differ in peptide sequence or DNA conformation suggests a model for Tax action based on stabilization of a distinct DNA-bound protein structure. This model may explain how Tax interacts with transcription factors of considerable sequence diversity to alter patterns of gene expression.
Distinct Trends of DNA Methylation Patterning in the Innate and Adaptive Immune Systems

PubMed Central

Schuyler, Ronald P.; Merkel, Angelika; Raineri, Emanuele; Altucci, Lucia; Vellenga, Edo; Martens, Joost H.A.; Pourfarzad, Farzin; Kuijpers, Taco W.; Burden, Frances; Farrow, Samantha; Downes, Kate; Ouwehand, Willem H.; Clarke, Laura; Datta, Avik; Lowy, Ernesto; Flicek, Paul; Frontini, Mattia; Stunnenberg, Hendrik G.; Martín-Subero, José I.; Gut, Ivo; Heath, Simon

2018-01-01

DNA methylation and the localization and post-translational modification of nucleosomes are interdependent factors that contribute to the generation of distinct phenotypes from genetically identical cells. With 112 whole-genome bisulfite sequencing datasets from the BLUEPRINT Epigenome Project, we analyzed the global development of DNA methylation patterns during lineage commitment and maturation of a range of immune system effector cells and the cancers that arise from them. We show clear trends in methylation patterns that are distinct in the innate and adaptive arms of the human immune system, both globally and in relation to consistently positioned nucleosomes. Most notable are a progressive loss of methylation in developing lymphocytes and the consistent occurrence of non-CG methylation in specific cell types. Cancer samples from the two lineages are further polarized, suggesting the involvement of distinct lineage-specific epigenetic mechanisms. We anticipate broad utility for this resource as a basis for further comparative epigenetic analyses. PMID:27851971
Distinct Trends of DNA Methylation Patterning in the Innate and Adaptive Immune Systems.

PubMed

Schuyler, Ronald P; Merkel, Angelika; Raineri, Emanuele; Altucci, Lucia; Vellenga, Edo; Martens, Joost H A; Pourfarzad, Farzin; Kuijpers, Taco W; Burden, Frances; Farrow, Samantha; Downes, Kate; Ouwehand, Willem H; Clarke, Laura; Datta, Avik; Lowy, Ernesto; Flicek, Paul; Frontini, Mattia; Stunnenberg, Hendrik G; Martín-Subero, José I; Gut, Ivo; Heath, Simon

2016-11-15

DNA methylation and the localization and post-translational modification of nucleosomes are interdependent factors that contribute to the generation of distinct phenotypes from genetically identical cells. With 112 whole-genome bisulfite sequencing datasets from the BLUEPRINT Epigenome Project, we analyzed the global development of DNA methylation patterns during lineage commitment and maturation of a range of immune system effector cells and the cancers that arise from them. We show clear trends in methylation patterns that are distinct in the innate and adaptive arms of the human immune system, both globally and in relation to consistently positioned nucleosomes. Most notable are a progressive loss of methylation in developing lymphocytes and the consistent occurrence of non-CG methylation in specific cell types. Cancer samples from the two lineages are further polarized, suggesting the involvement of distinct lineage-specific epigenetic mechanisms. We anticipate broad utility for this resource as a basis for further comparative epigenetic analyses. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Learning a Weighted Sequence Model of the Nucleosome Core and Linker Yields More Accurate Predictions in Saccharomyces cerevisiae and Homo sapiens

PubMed Central

Reynolds, Sheila M.; Bilmes, Jeff A.; Noble, William Stafford

2010-01-01

DNA in eukaryotes is packaged into a chromatin complex, the most basic element of which is the nucleosome. The precise positioning of the nucleosome cores allows for selective access to the DNA, and the mechanisms that control this positioning are important pieces of the gene expression puzzle. We describe a large-scale nucleosome pattern that jointly characterizes the nucleosome core and the adjacent linkers and is predominantly characterized by long-range oscillations in the mono, di- and tri-nucleotide content of the DNA sequence, and we show that this pattern can be used to predict nucleosome positions in both Homo sapiens and Saccharomyces cerevisiae more accurately than previously published methods. Surprisingly, in both H. sapiens and S. cerevisiae, the most informative individual features are the mono-nucleotide patterns, although the inclusion of di- and tri-nucleotide features results in improved performance. Our approach combines a much longer pattern than has been previously used to predict nucleosome positioning from sequence—301 base pairs, centered at the position to be scored—with a novel discriminative classification approach that selectively weights the contributions from each of the input features. The resulting scores are relatively insensitive to local AT-content and can be used to accurately discriminate putative dyad positions from adjacent linker regions without requiring an additional dynamic programming step and without the attendant edge effects and assumptions about linker length modeling and overall nucleosome density. Our approach produces the best dyad-linker classification results published to date in H. sapiens, and outperforms two recently published models on a large set of S. cerevisiae nucleosome positions. Our results suggest that in both genomes, a comparable and relatively small fraction of nucleosomes are well-positioned and that these positions are predictable based on sequence alone. 
We believe that the bulk of the remaining nucleosomes follow a statistical positioning model. PMID:20628623
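To make the input described in this abstract concrete, the sketch below extracts a 301-base-pair window centred on a candidate position and summarises its mono-, di- and tri-nucleotide content. It is a deliberately simplified illustration: the published model weights position-dependent features with a discriminative classifier, whereas this sketch computes only whole-window frequencies. The function name `window_features` and its parameters are assumptions, not part of the published method.

```python
from collections import Counter
from itertools import product


def window_features(seq, center, width=301):
    """Return mono-, di- and tri-nucleotide frequencies for the window
    of `width` bases centred on 0-based position `center`.

    The result maps each of the 4 + 16 + 64 = 84 possible k-mers over
    A, C, G, T to its frequency among the k-mers of that length in the
    window. Windows running off either end of seq raise ValueError.
    """
    half = width // 2
    if center - half < 0 or center + half >= len(seq):
        raise ValueError("window does not fit inside the sequence")
    window = seq[center - half:center + half + 1].upper()
    features = {}
    for k in (1, 2, 3):
        n_kmers = len(window) - k + 1
        counts = Counter(window[i:i + k] for i in range(n_kmers))
        for kmer in map("".join, product("ACGT", repeat=k)):
            features[kmer] = counts[kmer] / n_kmers
    return features


if __name__ == "__main__":
    toy = "ACGT" * 100
    feats = window_features(toy, center=200)
    print(len(feats), feats["A"], feats["AC"], feats["ACG"])
```

A feature vector of this kind could be passed to any standard classifier to separate dyad-centred windows from linker-centred ones; how well such a reduced representation performs is not something the abstract reports.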
Chloroplast genes as genetic markers for inferring patterns of change, maternal ancestry and phylogenetic relationships among Eleusine species

PubMed Central

Agrawal, Renuka; Agrawal, Nitin; Tandon, Rajesh; Raina, Soom Nath

2013-01-01

Assessment of phylogenetic relationships is an important component of any successful crop improvement programme, as wild relatives of the crop species often carry agronomically beneficial traits. Since its domestication in East Africa, Eleusine coracana (2n = 4x = 36), a species belonging to the genus Eleusine (x = 8, 9, 10), has held a prominent place in the semi-arid regions of India, Nepal and Africa. The patterns of variation between the cultivated and wild species reported so far and the interpretations based upon them have been considered primarily in terms of nuclear events. We analysed, for the first time, the phylogenetic relationship between finger millet (E. coracana) and its wild relatives by species-specific chloroplast deoxyribonucleic acid (cpDNA) polymerase chain reaction–restriction fragment length polymorphism (PCR–RFLP) and chloroplast simple sequence repeat (cpSSR) markers/sequences. Restriction fragment length polymorphism of the seven amplified chloroplast genes/intergenic spacers (trnK, psbD, psaA, trnH–trnK, trnL–trnF, 16S and trnS–psbC), nucleotide sequencing of the chloroplast trnK gene and chloroplast microsatellite polymorphism were analysed in all nine known species of Eleusine. The RFLP of all seven amplified chloroplast genes/intergenic spacers and trnK gene sequences in the diploid (2n = 16, 18, 20) and allotetraploid (2n = 36, 38) species resulted in well-resolved phylogenetic trees with high bootstrap values. Eleusine coracana, E. africana, E. tristachya, E. indica and E. kigeziensis did not show even a single change in restriction site. Eleusine intermedia and E. floccifolia were also shown to have identical cpDNA fragment patterns. The cpDNA diversity in Eleusine multiflora was found to be more extensive than that of the other eight species. The trnK gene sequence data complemented the results obtained by PCR–RFLP. The maternal lineage of all three allotetraploid species (AABB, AADD) was the same, with E. indica being the maternal diploid progenitor species. The markers specific to certain species were also identified. PMID:24790119
Selectivity by host plants affects the distribution of arbuscular mycorrhizal fungi: evidence from ITS rDNA sequence metadata.

PubMed

Yang, Haishui; Zang, Yanyan; Yuan, Yongge; Tang, Jianjun; Chen, Xin

2012-04-12

Arbuscular mycorrhizal fungi (AMF) can form obligate symbioses with the vast majority of land plants, and AMF distribution patterns have received increasing attention from researchers. At the local scale, the distribution of AMF is well documented. Studies at large scales, however, are limited because intensive sampling is difficult. Here, we used ITS rDNA sequence metadata obtained from public databases to study the distribution of AMF at continental and global scales. We also used these sequence metadata to investigate whether host plant is the main factor that affects the distribution of AMF at large scales. We defined 305 ITS virtual taxa (ITS-VTs) among all sequences of the Glomeromycota by using a comprehensive maximum likelihood phylogenetic analysis. Each host taxonomic order averaged about 53% specific ITS-VTs, and approximately 60% of the ITS-VTs were host specific. Those ITS-VTs with wide host range showed wide geographic distribution. Most ITS-VTs occurred in only one type of host functional group. The distributions of most ITS-VTs were limited across ecosystem, across continent, across biogeographical realm, and across climatic zone. Non-metric multidimensional scaling analysis (NMDS) showed that AMF community composition differed among functional groups of hosts, and among ecosystem, continent, biogeographical realm, and climatic zone. The Mantel test showed that AMF community composition was significantly correlated with plant community composition among ecosystem, among continent, among biogeographical realm, and among climatic zone. The structural equation modeling (SEM) showed that the effects of ecosystem, continent, biogeographical realm, and climatic zone were mainly indirect on AMF distribution, but plant had strongly direct effects on AMF. The distribution of AMF as indicated by ITS rDNA sequences showed a pattern of high endemism at large scales. 
This pattern indicates high specificity of AMF for host at different scales (plant taxonomic order and functional group) and high selectivity from host plants for AMF. The effects of ecosystemic, biogeographical, continental and climatic factors on AMF distribution might be mediated by host plants.
Chloroplast genes as genetic markers for inferring patterns of change, maternal ancestry and phylogenetic relationships among Eleusine species.

PubMed

Agrawal, Renuka; Agrawal, Nitin; Tandon, Rajesh; Raina, Soom Nath

2014-01-01

Assessment of phylogenetic relationships is an important component of any successful crop improvement programme, as wild relatives of the crop species often carry agronomically beneficial traits. Since its domestication in East Africa, Eleusine coracana (2n = 4x = 36), a species belonging to the genus Eleusine (x = 8, 9, 10), has held a prominent place in the semi-arid regions of India, Nepal and Africa. The patterns of variation between the cultivated and wild species reported so far and the interpretations based upon them have been considered primarily in terms of nuclear events. We analysed, for the first time, the phylogenetic relationship between finger millet (E. coracana) and its wild relatives by species-specific chloroplast deoxyribonucleic acid (cpDNA) polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and chloroplast simple sequence repeat (cpSSR) markers/sequences. Restriction fragment length polymorphism of the seven amplified chloroplast genes/intergenic spacers (trnK, psbD, psaA, trnH-trnK, trnL-trnF, 16S and trnS-psbC), nucleotide sequencing of the chloroplast trnK gene and chloroplast microsatellite polymorphism were analysed in all nine known species of Eleusine. The RFLP of all seven amplified chloroplast genes/intergenic spacers and trnK gene sequences in the diploid (2n = 16, 18, 20) and allotetraploid (2n = 36, 38) species resulted in well-resolved phylogenetic trees with high bootstrap values. Eleusine coracana, E. africana, E. tristachya, E. indica and E. kigeziensis did not show even a single change in restriction site. Eleusine intermedia and E. floccifolia were also shown to have identical cpDNA fragment patterns. The cpDNA diversity in Eleusine multiflora was found to be more extensive than that of the other eight species. The trnK gene sequence data complemented the results obtained by PCR-RFLP. The maternal lineage of all three allotetraploid species (AABB, AADD) was the same, with E. indica being the maternal diploid progenitor species. The markers specific to certain species were also identified.
Approach to determine the diversity of Legionella species by nested PCR-DGGE in aquatic environments.

PubMed

Huang, Wen-Chien; Tsai, Hsin-Chi; Tao, Chi-Wei; Chen, Jung-Sheng; Shih, Yi-Jia; Kao, Po-Min; Huang, Tung-Yi; Hsu, Bing-Mu

2017-01-01

In this study, we describe a nested PCR-DGGE strategy to detect Legionella communities in river water samples. In the first step, the nearly full-length 16S rRNA gene was amplified using bacterial primers. The amplicons were then used as DNA templates in a second PCR with Legionella-specific primers. A third round of amplification was conducted to obtain PCR fragments suitable for DGGE analysis. The total numbers of amplified genes were then observed in the DGGE bands of products obtained with primers specific for the diversity of Legionella species. The DGGE patterns thus have potential for high-throughput preliminary determination of Legionella species in aquatic environments before sequencing. Comparative DNA sequence analysis of excised unique DGGE bands revealed the identity of the Legionella community members, including a reference profile with two pathogenic Legionella species. In addition, only members of Legionella pneumophila and uncultured Legionella sp. were detected. The three-step nested PCR-DGGE approach is a useful method for studying the diversity of Legionella communities. The method is rapid and provides sequence information for phylogenetic analysis.
Approach to determine the diversity of Legionella species by nested PCR-DGGE in aquatic environments

PubMed Central

Huang, Wen-Chien; Tsai, Hsin-Chi; Tao, Chi-Wei; Chen, Jung-Sheng; Shih, Yi-Jia; Kao, Po-Min; Huang, Tung-Yi; Hsu, Bing-Mu

2017-01-01

In this study, we describe a nested PCR-DGGE strategy to detect Legionella communities in river water samples. In the first step, the nearly full-length 16S rRNA gene was amplified using bacterial primers. The amplicons were then used as DNA templates in a second PCR with Legionella-specific primers. A third round of amplification was conducted to obtain PCR fragments suitable for DGGE analysis. The total numbers of amplified genes were then observed in the DGGE bands of products obtained with primers specific for the diversity of Legionella species. The DGGE patterns thus have potential for high-throughput preliminary determination of Legionella species in aquatic environments before sequencing. Comparative DNA sequence analysis of excised unique DGGE bands revealed the identity of the Legionella community members, including a reference profile with two pathogenic Legionella species. In addition, only members of Legionella pneumophila and uncultured Legionella sp. were detected. The three-step nested PCR-DGGE approach is a useful method for studying the diversity of Legionella communities. The method is rapid and provides sequence information for phylogenetic analysis. PMID:28166249
Retention-error patterns in complex alphanumeric serial-recall tasks.

PubMed

Mathy, Fabien; Varré, Jean-Stéphane

2013-01-01

We propose a new method based on an algorithm usually dedicated to DNA sequence alignment in order to both reliably score short-term memory performance on immediate serial-recall tasks and analyse retention-error patterns. There can be considerable confusion on how performance on immediate serial list recall tasks is scored, especially when the to-be-remembered items are sampled with replacement. We discuss the utility of sequence-alignment algorithms to compare the stimuli to the participants' responses. The idea is that deletion, substitution, translocation, and insertion errors, which are typical in DNA, are also typical putative errors in short-term memory (respectively omission, confusion, permutation, and intrusion errors). We analyse four data sets in which alphanumeric lists included a few (or many) repetitions. After examining the method on two simple data sets, we show that sequence alignment offers 1) a compelling method for measuring capacity in terms of chunks when many regularities are introduced in the material (third data set) and 2) a reliable estimator of individual differences in short-term memory capacity. This study illustrates the difficulty of arriving at a good measure of short-term memory performance, and also attempts to characterise the primary factors underpinning remembering and forgetting.
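The idea of scoring recall by aligning the response against the stimulus can be sketched with a plain edit-distance alignment. This is a minimal illustration under stated assumptions, not the authors' algorithm: it uses unit costs, it labels gaps and mismatches with the memory-error terms from the abstract (omission, confusion, intrusion), and it does not model permutation (translocation) errors, which would need an extended recurrence. The function name `align_recall` is hypothetical.

```python
def align_recall(stimulus, response):
    """Globally align a recalled list against the presented list.

    Returns (edit_distance, counts), where counts tallies aligned
    positions as correct, omission (item missing from the response),
    confusion (item replaced by another) or intrusion (extra item).
    """
    n, m = len(stimulus), len(response)
    # dist[i][j] = minimum number of edits turning stimulus[:i] into response[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i
    for j in range(1, m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if stimulus[i - 1] == response[j - 1] else 1
            dist[i][j] = min(dist[i - 1][j - 1] + cost,  # match or confusion
                             dist[i - 1][j] + 1,         # omission
                             dist[i][j - 1] + 1)         # intrusion

    # Trace one optimal path back from the corner to label each step.
    counts = {"correct": 0, "omission": 0, "confusion": 0, "intrusion": 0}
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            cost = 0 if stimulus[i - 1] == response[j - 1] else 1
            if dist[i][j] == dist[i - 1][j - 1] + cost:
                counts["correct" if cost == 0 else "confusion"] += 1
                i, j = i - 1, j - 1
                continue
        if i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            counts["omission"] += 1
            i -= 1
        else:
            counts["intrusion"] += 1
            j -= 1
    return dist[n][m], counts


if __name__ == "__main__":
    print(align_recall("ABCDE", "ABDE"))
```

Unlike strict position-by-position scoring, an alignment of this kind still credits the items recalled after an omission, which is the property that makes it attractive for lists containing repeated items.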

Next generation sequencing analysis reveals a relationship between rDNA unit diversity and locus number in Nicotiana diploids

PubMed Central

2012-01-01

Background: Tandemly arranged nuclear ribosomal DNA (rDNA), encoding 18S, 5.8S and 26S ribosomal RNA (rRNA), exhibits concerted evolution, a pattern thought to result from the homogenisation of rDNA arrays. However, rDNA homogeneity at the single nucleotide polymorphism (SNP) level has not been detailed in organisms with more than a few hundred copies of the rDNA unit. Here we study rDNA complexity in species with arrays consisting of thousands of units. Methods: We examined homogeneity of genic (18S) and non-coding internally transcribed spacer (ITS1) regions of rDNA using Roche 454 and/or Illumina platforms in four angiosperm species, Nicotiana sylvestris, N. tomentosiformis, N. otophora and N. kawakamii. We compared the data with Southern blot hybridisation revealing the structure of intergenic spacer (IGS) sequences and with the number and distribution of rDNA loci. Results and Conclusions: In all four species the intragenomic homogeneity of the 18S gene was high; a single ribotype makes up over 90% of the genes. However, greater variation was observed in the ITS1 region, particularly in species with two or more rDNA loci, where >55% of rDNA units were a single ribotype, with the second most abundant variant accounting for >18% of units. IGS heterogeneity was high in all species. The increased number of ribotypes in ITS1 compared with 18S sequences may reflect rounds of incomplete homogenisation with strong selection for functional genic regions and relaxed selection on ITS1 variants. The relationship between the number of ITS1 ribotypes and the number of rDNA loci leads us to propose that rDNA evolution and complexity are influenced by locus number and/or amplification of orphaned rDNA units at new chromosomal locations. PMID:23259460
DNA mutation motifs in the genes associated with inherited diseases.

PubMed

Růžička, Michal; Kulhánek, Petr; Radová, Lenka; Čechová, Andrea; Špačková, Naďa; Fajkusová, Lenka; Réblová, Kamila

2017-01-01

Mutations in human genes can be responsible for inherited genetic disorders and cancer. Mutations can arise due to environmental factors or spontaneously. It has been shown that certain DNA sequences are more prone to mutate. These sites are termed hotspots and exhibit a higher mutation frequency than expected by chance. In contrast, DNA sequences with lower mutation frequencies than expected by chance are termed coldspots. Mutation hotspots are usually derived from a mutation spectrum, which reflects a particular population where the effect of a common ancestor plays a role. To detect coldspots/hotspots unaffected by population bias, we analysed the presence of germline mutations obtained from the HGMD database in the 5-nucleotide segments repeatedly occurring in genes associated with common inherited disorders, in particular, the PAH, LDLR, CFTR, F8, and F9 genes. Statistically significant sequences (mutational motifs) rarely associated with mutations (coldspots) and frequently associated with mutations (hotspots) exhibited characteristic sequence patterns, e.g. coldspots contained purine tracts while hotspots showed alternating purine-pyrimidine bases, often with the presence of a CpG dinucleotide. Using molecular dynamics simulations and free energy calculations, we analysed the global bending properties of two selected coldspots and two hotspots with a G/T mismatch. We observed that the coldspots were inherently more flexible than the hotspots. We assume that this property might be critical for effective mismatch repair as DNA with a mutation recognized by MutSα protein is noticeably bent.
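The counting step behind a hotspot/coldspot analysis of 5-nucleotide segments can be sketched as follows. This is an illustrative assumption about how such a table might be built, not the authors' procedure: the abstract does not say how a mutation is assigned to a segment, so this sketch counts an occurrence as mutated when any of its five positions carries a reported mutation, and it omits the statistical test entirely. The name `kmer_mutation_table` and the 0-based coordinate convention are hypothetical.

```python
from collections import Counter


def kmer_mutation_table(seq, mutated_positions, k=5):
    """Tabulate, for every k-mer in seq, how often it occurs and how many
    of those occurrences overlap at least one mutated position.

    mutated_positions holds 0-based indices into seq.
    Returns {kmer: (mutated_occurrences, total_occurrences)}.
    """
    seq = seq.upper()
    mutated = set(mutated_positions)
    total = Counter()
    hit = Counter()
    for start in range(len(seq) - k + 1):
        kmer = seq[start:start + k]
        total[kmer] += 1
        # Assumption: an occurrence is "mutated" if any base in it is mutated.
        if any(pos in mutated for pos in range(start, start + k)):
            hit[kmer] += 1
    return {kmer: (hit[kmer], total[kmer]) for kmer in total}


if __name__ == "__main__":
    table = kmer_mutation_table("ACGTACGTAC", mutated_positions=[2])
    for kmer, (mutated_count, total_count) in sorted(table.items()):
        print(kmer, mutated_count, total_count)
```

Segments whose mutated fraction is significantly above or below the expectation under a suitable null model would then be candidate hotspots or coldspots; the choice of that null model is left open here.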
Evolutionary insight on localization of 18S, 28S rDNA genes on homologous chromosomes in Primates genomes

PubMed Central

Mazzoleni, Sofia; Rovatsos, Michail; Schillaci, Odessa; Dumas, Francesca

2018-01-01

We explored the topology of 18S and 28S rDNA units by fluorescence in situ hybridization (FISH) in the karyotypes of thirteen species representatives from major groups of Primates and Tupaia minor (Günther, 1876) (Scandentia), in order to expand our knowledge of Primate genome reshuffling and to identify the possible dispersion mechanisms of rDNA sequences. We documented that rDNA probe signals were identified on one to six pairs of chromosomes, both acrocentric and metacentric ones. In addition, we examined the potential homology of chromosomes bearing rDNA genes across different species and in a wide phylogenetic perspective, based on the DAPI-inverted pattern and their synteny to human. Our analysis revealed an extensive variability in the topology of the rDNA signals across studied species. In some cases, closely related species show signals on homologous chromosomes, thus representing synapomorphies, while in other cases, signal was detected on distinct chromosomes, leading to species specific patterns. These results led us to support the hypothesis that different mechanisms are responsible for the distribution of the ribosomal DNA cluster in Primates. PMID:29416829
Genome-wide DNA methylation reprogramming in response to inorganic arsenic links inhibition of CTCF binding, DNMT expression and cellular transformation

NASA Astrophysics Data System (ADS)

Rea, Matthew; Eckstein, Meredith; Eleazer, Rebekah; Smith, Caroline; Fondufe-Mittendorf, Yvonne N.

2017-02-01

Chronic low dose inorganic arsenic (iAs) exposure leads to changes in gene expression and epithelial-to-mesenchymal transformation. During this transformation, cells adopt a fibroblast-like phenotype accompanied by profound gene expression changes. While many mechanisms have been implicated in this transformation, studies that focus on the role of epigenetic alterations in this process are just emerging. DNA methylation controls gene expression in physiologic and pathologic states. Several studies show alterations in DNA methylation patterns in iAs-mediated pathogenesis, but these studies focused on single genes. We present a comprehensive genome-wide DNA methylation analysis using methyl-sequencing to measure changes between normal and iAs-transformed cells. Additionally, these differential methylation changes correlated positively with changes in gene expression and alternative splicing. Interestingly, most of these differentially methylated genes function in cell adhesion and communication pathways. To gain insight into how genomic DNA methylation patterns are regulated during iAs-mediated carcinogenesis, we show that iAs probably targets CTCF binding at the promoter of DNA methyltransferases, regulating their expression. These findings reveal how CTCF binding regulates DNA methyltransferase to reprogram the methylome in response to an environmental toxin.
Genome-wide DNA methylation reprogramming in response to inorganic arsenic links inhibition of CTCF binding, DNMT expression and cellular transformation

PubMed Central

Rea, Matthew; Eckstein, Meredith; Eleazer, Rebekah; Smith, Caroline; Fondufe-Mittendorf, Yvonne N.

2017-01-01

Chronic low dose inorganic arsenic (iAs) exposure leads to changes in gene expression and epithelial-to-mesenchymal transformation. During this transformation, cells adopt a fibroblast-like phenotype accompanied by profound gene expression changes. While many mechanisms have been implicated in this transformation, studies that focus on the role of epigenetic alterations in this process are just emerging. DNA methylation controls gene expression in physiologic and pathologic states. Several studies show alterations in DNA methylation patterns in iAs-mediated pathogenesis, but these studies focused on single genes. We present a comprehensive genome-wide DNA methylation analysis using methyl-sequencing to measure changes between normal and iAs-transformed cells. Additionally, these differential methylation changes correlated positively with changes in gene expression and alternative splicing. Interestingly, most of these differentially methylated genes function in cell adhesion and communication pathways. To gain insight into how genomic DNA methylation patterns are regulated during iAs-mediated carcinogenesis, we show that iAs probably targets CTCF binding at the promoter of DNA methyltransferases, regulating their expression. These findings reveal how CTCF binding regulates DNA methyltransferase to reprogram the methylome in response to an environmental toxin. PMID:28150704
A new Heraclides swallowtail (Lepidoptera, Papilionidae) from North America is recognized by the pattern on its neck

PubMed Central

Shiraiwa, Kojiro; Cong, Qian; Grishin, Nick V.

2014-01-01

Heraclides rumiko Shiraiwa & Grishin, sp. n. is described from southwestern United States, Mexico, and Central America (type locality: USA, Texas, Duval County). It is closely allied to Heraclides cresphontes (Cramer, 1777) and the two species are sympatric in central Texas. The new species is diagnosed by male genitalia and exhibits a nearly 3% difference from Heraclides cresphontes in the COI DNA barcode sequence of mitochondrial DNA. The two Heraclides species can usually be told apart by the shape and size of yellow spots on the neck, by the wing shape, and the details of wing patterns. “Western Giant Swallowtail” is proposed as the English name for Heraclides rumiko. To stabilize nomenclature, neotype for Papilio cresphontes Cramer, 1777, an eastern United States species, is designated from Brooklyn, New York, USA; and lectotype for Papilio thoas Linnaeus, 1771 is designated from Suriname. We sequenced DNA barcodes and ID tags of nearly 400 Papilionini specimens completing coverage of all Heraclides species. Comparative analyses of DNA barcodes, genitalia, and facies suggest that Heraclides oviedo (Gundlach, 1866), reinstated status, is a species-level taxon rather than a subspecies of Heraclides thoas (Linnaeus, 1771); and Heraclides pallas (G. Gray, [1853]), reinstated status, with its subspecies Heraclides Papilio bajaensis (J. Brown & Faulkner, 1992), comb. n., and Heraclides anchicayaensis Constantino, Le Crom & Salazar, 2002, stat. n., are not conspecific with Heraclides astyalus (Godart, 1819). PMID:25610342
Influence of volcanic activity on the population genetic structure of Hawaiian Tetragnatha spiders: Fragmentation, rapid population growth and the potential for accelerated evolution

USGS Publications Warehouse

Vandergast, A.G.; Gillespie, R.G.; Roderick, G.K.

2004-01-01

Volcanic activity on the island of Hawaii results in a cyclical pattern of habitat destruction and fragmentation by lava, followed by habitat regeneration on newly formed substrates. While this pattern has been hypothesized to promote the diversification of Hawaiian lineages, there have been few attempts to link geological processes to measurable changes in population structure. We investigated the genetic structure of three species of Hawaiian spiders in forests fragmented by a 150-year-old lava flow on Mauna Loa Volcano, island of Hawaii: Tetragnatha quasimodo (forest and lava flow generalist), T. anuenue and T. brevignatha (forest specialists). To estimate fragmentation effects on population subdivision in each species, we examined variation in mitochondrial and nuclear genomes (DNA sequences and allozymes, respectively). Population subdivision was higher for forest specialists than for the generalist in fragments separated by lava. Patterns of mtDNA sequence evolution also revealed that forest specialists have undergone rapid expansion, while the generalist has experienced more gradual population growth. Results confirm that patterns of neutral genetic variation reflect patterns of volcanic activity in some Tetragnatha species. Our study further suggests that population subdivision and expansion can occur across small spatial and temporal scales, which may facilitate the rapid spread of new character states, leading to speciation as hypothesized by H. L. Carson 30 years ago.
Loss of maintenance DNA methylation results in abnormal DNA origin firing during DNA replication.

PubMed

Haruta, Mayumi; Shimada, Midori; Nishiyama, Atsuya; Johmura, Yoshikazu; Le Tallec, Benoît; Debatisse, Michelle; Nakanishi, Makoto

2016-01-22

The mammalian maintenance methyltransferase DNMT1 [DNA (cytosine-5-)-methyltransferase 1] mediates the inheritance of the DNA methylation pattern during replication. Previous studies have shown that depletion of DNMT1 causes a severe growth defect and apoptosis in differentiated cells. However, the detailed mechanisms behind this phenomenon remain poorly understood. Here we show that conditional ablation of Dnmt1 in murine embryonic fibroblasts (MEFs) resulted in an aberrant DNA replication program showing an accumulation of late-S phase replication and causing severely defective growth. Furthermore, we found that the catalytic activity and replication focus targeting sequence of DNMT1 are required for a proper DNA replication program. Taken together, our findings suggest that the maintenance of DNA methylation by DNMT1 plays a critical role in proper regulation of DNA replication in mammalian cells. Copyright © 2015 Elsevier Inc. All rights reserved.
Mechanism of DNA binding enhancement by hepatitis B virus protein pX.

PubMed

Palmer, C R; Gegnas, L D; Schepartz, A

1997-12-09

At least three hundred million people worldwide are infected with the hepatitis B virus (HBV), and epidemiological studies show a clear correlation between chronic HBV infection and the development of hepatocellular carcinoma. HBV encodes a protein, pX, which abducts the cellular transcriptional machinery in several ways including direct interactions with bZIP transcription factors. These interactions increase the DNA affinities of target bZIP proteins in a DNA sequence-dependent manner. Here we use a series of bZIP peptide models to explore the mechanism by which pX interacts with bZIP proteins. Our results suggest that pX increases bZIP.DNA stability by increasing the stability of the bZIP dimer as well as the affinity of the dimer for DNA. Additional experiments provide evidence for a mechanism in which pX recognizes the composite structure of the peptide.DNA complex, not simply the primary peptide sequence. These experiments provide a framework for understanding how pX alters the patterns of transcription within the nucleus. The similarities between the mechanism proposed for pX and the mechanism previously proposed for the human T-cell leukemia virus protein Tax are discussed.
Modeling the Lac repressor-operator assembly: The influence of DNA looping on Lac repressor conformation

PubMed Central

Swigon, David; Coleman, Bernard D.; Olson, Wilma K.

2006-01-01

Repression of transcription of the Escherichia coli Lac operon by the Lac repressor (LacR) is accompanied by the simultaneous binding of LacR to two operators and the formation of a DNA loop. A recently developed theory of sequence-dependent DNA elasticity enables one to relate the fine structure of the LacR–DNA complex to a wide range of heretofore-unconnected experimental observations. Here, that theory is used to calculate the configuration and free energy of the DNA loop as a function of its length and base-pair sequence, its linking number, and the end conditions imposed by the LacR tetramer. The tetramer can assume two types of conformations. Whereas a rigid V-shaped structure is observed in the crystal, EM images show extended forms in which two dimer subunits are flexibly joined. Upon comparing our computed loop configurations with published experimental observations of permanganate sensitivities, DNase I cutting patterns, and loop stabilities, we conclude that linear DNA segments of short-to-medium chain length (50–180 bp) give rise to loops with the extended form of LacR and that loops formed within negatively supercoiled plasmids induce the V-shaped structure. PMID:16785444
Enhancement of fluorescence quenching and exciplex formation in DNA major groove by double incorporation of modified fluorescent deoxyuridines.

PubMed

Tanaka, Makiko; Oguma, Kazuhiro; Saito, Yoshio; Saito, Isao

2012-06-15

5-(1-Naphthalenylethynyl)-2'-deoxyuridine ((N)U) and 5-[(4-cyano-1-naphthalenyl)ethynyl]-2'-deoxyuridine ((CN)U) were synthesized and incorporated into oligodeoxynucleotides. Fluorescence emissions of modified duplexes containing double (N)U were efficiently quenched depending upon the sequence pattern of the naphthalenes in DNA major groove, as compared to the duplex possessing single (N)U. When one of the naphthalene moieties has a cyano substituent, the exciplex emission from the chromophores in DNA major groove was observed at longer wavelength. Copyright © 2012 Elsevier Ltd. All rights reserved.
Genetic structuring of European anchovy (Engraulis encrasicolus) populations through mitochondrial DNA sequences.

PubMed

Keskin, Emre; Atar, Hasan Huseyin

2012-04-01

Mitochondrial DNA sequence variation in 655 bp fragments of the cytochrome oxidase c subunit I gene, known as the DNA barcode, of European anchovy (Engraulis encrasicolus) was evaluated by analyzing 1529 individuals representing 16 populations from the Black Sea, through the Marmara Sea and the Aegean Sea to the Mediterranean Sea. A total of 19 (2.9%) variable sites were found among individuals, and these defined 10 genetically diverged populations with an overall mean distance of 1.2%. The highest nucleotide divergence was found between samples of eastern Mediterranean and northern Aegean (2.2%). Evolutionary history analysis among 16 populations clustered the Mediterranean Sea clades in one main branch and the other clades in another branch. The diverging pattern of the European anchovy populations, which correlates with geographic dispersion, supports genetic structuring through the Black Sea-Marmara Sea-Aegean Sea-Mediterranean Sea quad.
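The variable-site count and pairwise distance reported in this abstract can be illustrated with a minimal Python sketch. The function names and the toy alignment are hypothetical and are not taken from the study; the only figures borrowed from the abstract are the 19 variable sites and the 655 bp fragment length.

```python
def variable_sites(aligned):
    """Return 0-based column indices at which the aligned sequences differ."""
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i in range(length) if len({seq[i] for seq in aligned}) > 1]


def p_distance(a, b):
    """Proportion of sites that differ between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


# Toy alignment (hypothetical data, not anchovy barcodes).
toy = ["ACGTACGT", "ACGTACGA", "ACCTACGT"]
sites = variable_sites(toy)               # columns 2 and 7 differ
pairwise = p_distance(toy[0], toy[1])     # 1 difference over 8 sites

# Share of variable sites quoted in the abstract: 19 of 655 positions.
reported_share = round(19 / 655 * 100, 1)  # 2.9 (percent)
```

Real barcode analyses would also handle gaps and ambiguous bases, which this sketch ignores.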
Screening of Israeli Holstein-Friesian cattle for restriction fragment length polymorphisms using homologous and heterologous deoxyribonucleic acid probes.

PubMed

Hallerman, E M; Nave, A; Soller, M; Beckmann, J S

1988-12-01

Genomic DNA of Israeli Holstein-Friesian dairy cattle was screened with a battery of 17 cloned or subcloned DNA probes in an attempt to document restriction fragment length polymorphisms at a number of genetic loci. Restriction fragment length polymorphisms were observed at the chymosin, oxytocin-neurophysin I, lutropin beta, keratin III, keratin VI, keratin VII, prolactin, and dihydrofolate reductase loci. Use of certain genomic DNA fragments as probes produced hybridization patterns indicative of satellite DNA at the respective loci. Means for distinguishing hybridizations to coding sequences for unique genes from those to satellite DNA were developed. Results of this study are discussed in terms of strategy for the systematic development of large numbers of bovine genomic polymorphisms.
Within-Genome Evolution of REPINs: a New Family of Miniature Mobile DNA in Bacteria

PubMed Central

Bertels, Frederic; Rainey, Paul B.

2011-01-01

Repetitive sequences are a conserved feature of many bacterial genomes. While first reported almost thirty years ago, and frequently exploited for genotyping purposes, little is known about their origin, maintenance, or processes affecting the dynamics of within-genome evolution. Here, beginning with analysis of the diversity and abundance of short oligonucleotide sequences in the genome of Pseudomonas fluorescens SBW25, we show that over-represented short sequences define three distinct groups (GI, GII, and GIII) of repetitive extragenic palindromic (REP) sequences. Patterns of REP distribution suggest that closely linked REP sequences form a functional replicative unit: REP doublets are over-represented, randomly distributed in extragenic space, and more highly conserved than singlets. In addition, doublets are organized as inverted repeats, which together with intervening spacer sequences are predicted to form hairpin structures in ssDNA or mRNA. We refer to these newly defined entities as REPINs (REP doublets forming hairpins) and identify short reads from population sequencing that reveal putative transposition intermediates. The proximal relationship between GI, GII, and GIII REPINs and specific REP-associated tyrosine transposases (RAYTs), combined with features of the putative transposition intermediate, suggests a mechanism for within-genome dissemination. Analysis of the distribution of REPs in a range of RAYT–containing bacterial genomes, including Escherichia coli K-12 and Nostoc punctiforme, show that REPINs are a widely distributed, but hitherto unrecognized, family of miniature non-autonomous mobile DNA. PMID:21698139
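The first step described above, finding over-represented short oligonucleotides, can be sketched in Python as follows. This is only an illustration under a loudly stated assumption: the expected count uses a uniform base composition (each k-mer equally likely), which is not necessarily the null model used by the authors. The function names and the toy sequence are hypothetical.

```python
from collections import Counter


def kmer_counts(genome, k):
    """Count every overlapping k-mer in the sequence."""
    return Counter(genome[i:i + k] for i in range(len(genome) - k + 1))


def overrepresented(genome, k, min_ratio):
    """Return k-mers whose observed/expected ratio is at least min_ratio.

    Assumption: expected count = number of windows / 4**k, i.e. uniform
    base composition. A real analysis would likely use a fitted background.
    """
    counts = kmer_counts(genome, k)
    expected = (len(genome) - k + 1) / 4 ** k
    return {kmer: n for kmer, n in counts.items() if n / expected >= min_ratio}


def reverse_complement(seq):
    """Reverse complement, useful for spotting inverted repeats."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


toy = "ACGTACGT"                      # hypothetical sequence
hits = overrepresented(toy, 2, 4.0)   # TA occurs once and falls below the cutoff
```

A k-mer that equals its own reverse complement (for example `reverse_complement("ACGT") == "ACGT"`) is palindromic in the DNA sense, which is the property the REP name refers to.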
Copy number variants calling for single cell sequencing data by multi-constrained optimization.

PubMed

Xu, Bo; Cai, Hongmin; Zhang, Changsheng; Yang, Xi; Han, Guoqiang

2016-08-01

Variations in DNA copy number carry important information on genome evolution and regulation of DNA replication in cancer cells. The rapid development of single-cell sequencing technology allows one to explore gene expression heterogeneity among single cells, thus providing important cancer cell evolution information. Single-cell DNA/RNA sequencing data usually have low genome coverage, which requires an extra step of amplification to accumulate enough samples. However, such amplification introduces large bias and makes bioinformatics analysis challenging. Accurately modeling the distribution of sequencing data and effectively suppressing the influence of the bias are the keys to successful variation analysis. Recent advances demonstrate the technical noises by amplification are more likely to follow negative binomial distribution, a special case of Poisson distribution. Thus, we tackle the problem of CNV detection by formulating it as a quadratic optimization problem involving two constraints, in which the underlying signals are corrupted by Poisson distributed noises. By imposing the constraints of sparsity and smoothness, the reconstructed read depth signals from single-cell sequencing data are anticipated to fit the CNV patterns more accurately. An efficient numerical solution based on the classical alternating direction minimization method (ADMM) is tailored to solve the proposed model. We demonstrate the advantages of the proposed method using both synthetic and empirical single-cell sequencing data. Our experimental results demonstrate that the proposed method achieves excellent performance and high promise of success with single-cell sequencing data. Crown Copyright © 2016. Published by Elsevier Ltd. All rights reserved.
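The idea of combining a quadratic data-fit term with sparsity and smoothness terms can be sketched in Python. The abstract does not give the exact model, so the penalized form below (an L1 term plus a total-variation term, as in a fused lasso) is an assumption chosen for illustration; it is not the authors' formulation, and no ADMM solver is included. All names and the toy signal are hypothetical.

```python
def cnv_objective(y, x, lam_sparse, lam_smooth):
    """Evaluate an illustrative CNV objective for a candidate signal x.

    y: observed (normalized) read-depth signal
    x: candidate reconstructed signal of the same length
    Assumed form: 0.5*||y - x||^2 + lam_sparse*||x||_1
                  + lam_smooth*sum|x[i+1] - x[i]|
    """
    fit = 0.5 * sum((yi - xi) ** 2 for yi, xi in zip(y, x))
    sparsity = sum(abs(xi) for xi in x)
    smoothness = sum(abs(b - a) for a, b in zip(x, x[1:]))
    return fit + lam_sparse * sparsity + lam_smooth * smoothness


# Toy signal: baseline 0 with one gained segment (hypothetical data).
observed = [0.1, -0.2, 1.1, 0.9, 1.0, 0.0]
piecewise = [0.0, 0.0, 1.0, 1.0, 1.0, 0.0]

score_piecewise = cnv_objective(observed, piecewise, 0.1, 0.1)
score_noisy = cnv_objective(observed, observed, 0.1, 0.1)
```

With these weights the piecewise-constant candidate scores lower than simply copying the noisy observations, which is the behaviour the sparsity and smoothness terms are meant to encourage.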
The paradox of HBV evolution as revealed from a 16th century mummy

PubMed Central

Duggan, Ana T.; Poinar, Debi; Poinar, Hendrik N.

2018-01-01

Hepatitis B virus (HBV) is a ubiquitous viral pathogen associated with large-scale morbidity and mortality in humans. However, there is considerable uncertainty over the time-scale of its origin and evolution. Initial shotgun data from a mid-16th century Italian child mummy, that was previously paleopathologically identified as having been infected with Variola virus (VARV, the agent of smallpox), showed no DNA reads for VARV yet did for hepatitis B virus (HBV). Previously, electron microscopy provided evidence for the presence of VARV in this sample, although similar analyses conducted here did not reveal any VARV particles. We attempted to enrich and sequence for both VARV and HBV DNA. Although we did not recover any reads identified as VARV, we were successful in reconstructing an HBV genome at 163.8X coverage. Strikingly, both the HBV sequence and that of the associated host mitochondrial DNA displayed a nearly identical cytosine deamination pattern near the termini of DNA fragments, characteristic of an ancient origin. In contrast, phylogenetic analyses revealed a close relationship between the putative ancient virus and contemporary HBV strains (of genotype D), at first suggesting contamination. In addressing this paradox we demonstrate that HBV evolution is characterized by a marked lack of temporal structure. This confounds attempts to use molecular clock-based methods to date the origin of this virus over the time-frame sampled so far, and means that phylogenetic measures alone cannot yet be used to determine HBV sequence authenticity. If genuine, this phylogenetic pattern indicates that the genotypes of HBV diversified long before the 16th century, and enables comparison of potential pathogenic similarities between modern and ancient HBV. These results have important implications for our understanding of the emergence and evolution of this common viral pathogen. PMID:29300782
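The cytosine deamination pattern mentioned above is usually summarized as the frequency of C-to-T mismatches by distance from the fragment's 5' end. A minimal Python sketch follows. It assumes gap-free (reference, read) pairs oriented 5' to 3', which is a simplification; the function name and the toy fragments are hypothetical and unrelated to the mummy data.

```python
def c_to_t_by_position(pairs, n_positions):
    """Fraction of reference C bases read as T at each 5' position.

    pairs: iterable of (reference, read) strings, aligned without gaps
           and starting at the 5' end of the fragment (assumption).
    Returns a list of length n_positions; None where no reference C was seen.
    """
    c_total = [0] * n_positions
    c_to_t = [0] * n_positions
    for ref, read in pairs:
        for i in range(min(n_positions, len(ref), len(read))):
            if ref[i] == "C":
                c_total[i] += 1
                if read[i] == "T":
                    c_to_t[i] += 1
    return [t / c if c else None for t, c in zip(c_to_t, c_total)]


# Toy fragments (hypothetical): damage concentrated at the first base.
toy_pairs = [("CACC", "TACC"), ("CCGA", "TCGA"), ("CGCA", "CGCA")]
profile = c_to_t_by_position(toy_pairs, 3)
```

An elevated value at the first positions that decays inward is the signature typically read as evidence of ancient DNA; real tools also track the complementary G-to-A signal at the 3' end.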
Ultrasensitive Quantification of Hepatitis B Virus A1762T/G1764A Mutant by a SimpleProbe PCR Using a Wild-Type-Selective PCR Blocker and a Primer-Blocker-Probe Partial-Overlap Approach

PubMed Central

Nie, Hui; Evans, Alison A.; London, W. Thomas; Block, Timothy M.; Ren, Xiangdong David

2011-01-01

Hepatitis B virus (HBV) carrying the A1762T/G1764A double mutation in the basal core promoter (BCP) region is associated with HBe antigen seroconversion and increased risk of liver cirrhosis and hepatocellular carcinoma (HCC). Quantification of the mutant viruses may help in predicting the risk of HCC. However, the viral genome tends to have nucleotide polymorphism, which makes it difficult to design hybridization-based assays including real-time PCR. Ultrasensitive quantification of the mutant viruses at the early developmental stage is even more challenging, as the mutant is masked by excessive amounts of the wild-type (WT) viruses. In this study, we developed a selective inhibitory PCR (siPCR) using a locked nucleic acid-based PCR blocker to selectively inhibit the amplification of the WT viral DNA but not the mutant DNA. At the end of siPCR, the proportion of the mutant could be increased by about 10,000-fold, making the mutant more readily detectable by downstream applications such as real-time PCR and DNA sequencing. We also describe a primer-probe partial overlap approach which significantly simplified the melting curve patterns and minimized the influence of viral genome polymorphism on assay accuracy. Analysis of 62 patient samples showed a complete match of the melting curve patterns with the sequencing results. More than 97% of HBV BCP sequences in the GenBank database can be correctly identified by the melting curve analysis. The combination of siPCR and the SimpleProbe real-time PCR enabled mutant quantification in the presence of a 100,000-fold excess of the WT DNA. PMID:21562108
Fast, Accurate and Automatic Ancient Nucleosome and Methylation Maps with epiPALEOMIX.

PubMed

Hanghøj, Kristian; Seguin-Orlando, Andaine; Schubert, Mikkel; Madsen, Tobias; Pedersen, Jakob Skou; Willerslev, Eske; Orlando, Ludovic

2016-12-01

The first epigenomes from archaic hominins (AH) and ancient anatomically modern humans (AMH) have recently been characterized, based, however, on a limited number of samples. The extent to which ancient genome-wide epigenetic landscapes can be reconstructed thus remains contentious. Here, we present epiPALEOMIX, an open-source and user-friendly pipeline that exploits post-mortem DNA degradation patterns to reconstruct ancient methylomes and nucleosome maps from shotgun and/or capture-enrichment data. Applying epiPALEOMIX to the sequence data underlying 35 ancient genomes including AMH, AH, equids and aurochs, we investigate the temporal, geographical and preservation range of ancient epigenetic signatures. We first assess the quality of inferred ancient epigenetic signatures within well-characterized genomic regions. We find that tissue-specific methylation signatures can be obtained across a wider range of DNA preparation types than previously thought, including when no particular experimental procedures have been used to remove deaminated cytosines prior to sequencing. We identify a large subset of samples for which DNA associated with nucleosomes is protected from post-mortem degradation, and nucleosome positioning patterns can be reconstructed. Finally, we describe parameters and conditions such as DNA damage levels and sequencing depth that limit the preservation of epigenetic signatures in ancient samples. When such conditions are met, we propose that epigenetic profiles of CTCF binding regions can be used to help data authentication. Our work, including epiPALEOMIX, opens for further investigations of ancient epigenomes through time especially aimed at tracking possible epigenetic changes during major evolutionary, environmental, socioeconomic, and cultural shifts. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
DNA-COMPACT: DNA COMpression Based on a Pattern-Aware Contextual Modeling Technique

PubMed Central

Li, Pinghao; Wang, Shuang; Kim, Jihoon; Xiong, Hongkai; Ohno-Machado, Lucila; Jiang, Xiaoqian

2013-01-01

Genome data are becoming increasingly important for modern medicine. As the rate of increase in DNA sequencing outstrips the rate of increase in disk storage capacity, the storage and transfer of large genome data are becoming important concerns for biomedical researchers. We propose a two-pass lossless genome compression algorithm, which highlights the synthesis of complementary contextual models, to improve the compression performance. The proposed framework could handle genome compression with and without reference sequences, and demonstrated performance advantages over the best existing algorithms. The method for reference-free compression led to bit rates of 1.720 and 1.838 bits per base for bacteria and yeast, which were approximately 3.7% and 2.6% better than the state-of-the-art algorithms. Regarding performance with reference, we tested on the first Korean personal genome sequence data set, and our proposed method demonstrated a 189-fold compression rate, reducing the raw file size from 2986.8 MB to 15.8 MB at a comparable decompression cost with existing algorithms. DNAcompact is freely available at https://sourceforge.net/projects/dnacompact/ for research purposes. PMID:24282536
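The compression figures quoted above can be checked with simple arithmetic. The Python sketch below uses hypothetical helper names; the file sizes and the 1.720 bits-per-base rate come from the abstract, while the 2 bits-per-base baseline is an assumption (the naive cost of encoding four nucleotides) and is not mentioned in the source.

```python
def compression_ratio(raw_size, compressed_size):
    """Fold reduction in size (both arguments in the same unit)."""
    return raw_size / compressed_size


def bits_per_base(compressed_bytes, n_bases):
    """Average number of bits spent per nucleotide."""
    return compressed_bytes * 8 / n_bases


# Reference-based result from the abstract: 2986.8 MB down to 15.8 MB.
fold = compression_ratio(2986.8, 15.8)   # about 189-fold

# Assumed baseline of 2 bits per base (A, C, G, T with no modeling).
saving_vs_naive = 1 - 1.720 / 2.0        # about 14% below the naive encoding

# Hypothetical example: 215 bytes for 1000 bases gives 1.72 bits per base.
example_rate = bits_per_base(215, 1000)
```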
Sequence-specific "gene signatures" can be obtained by PCR with single specific primers at low stringency.

PubMed Central

Pena, S D; Barreto, G; Vago, A R; De Marco, L; Reinach, F C; Dias Neto, E; Simpson, A J

1994-01-01

Low-stringency single specific primer PCR (LSSP-PCR) is an extremely simple PCR-based technique that detects single or multiple mutations in gene-sized DNA fragments. A purified DNA fragment is subjected to PCR using high concentrations of a single specific oligonucleotide primer, large amounts of Taq polymerase, and a very low annealing temperature. Under these conditions the primer hybridizes specifically to its complementary region and nonspecifically to multiple sites within the fragment, in a sequence-dependent manner, producing a heterogeneous set of reaction products resolvable by electrophoresis. The complex banding pattern obtained is significantly altered by even a single-base change and thus constitutes a unique "gene signature." Therefore LSSP-PCR will have almost unlimited application in all fields of genetics and molecular medicine where rapid and sensitive detection of mutations and sequence variations is important. The usefulness of LSSP-PCR is illustrated by applications in the study of mutants of smooth muscle myosin light chain, analysis of a family with X-linked nephrogenic diabetes insipidus, and identity testing using human mitochondrial DNA. PMID:8127912

B chromosome dynamics in Prochilodus costatus (Teleostei, Characiformes) and comparisons with supernumerary chromosome system in other Prochilodus species

PubMed Central

Melo, Silvana; Utsunomia, Ricardo; Penitente, Manolo; Sobrinho-Scudeler, Patrícia Elda; Porto-Foresti, Fábio; Oliveira, Claudio; Foresti, Fausto; Dergam, Jorge Abdala

2017-01-01

Within the genus Prochilodus Agassiz, 1829, five species are known to carry B chromosomes, i.e. chromosomes beyond the usual diploid number that have been traditionally considered as accessory for the genome. Chromosome microdissection and mapping of repetitive DNA sequences are effective tools to assess the DNA content and allow a better understanding about the origin and composition of these elements in an array of species. In this study, a novel characterization of B chromosomes in Prochilodus costatus Valenciennes, 1850 (2n=54) was reported for the first time and their sequence complementarity with the supernumerary chromosomes observed in Prochilodus lineatus (Valenciennes, 1836) and Prochilodus argenteus Agassiz, 1829 was investigated. The hybridization patterns obtained with chromosome painting using the micro B probe of P. costatus and the satDNA SATH1 mapping made it possible to assume homology of sequences between the B chromosomes of these congeneric species. Our results suggest that the origin of B chromosomes in the genus Prochilodus is a phylogenetically old event. PMID:28919971
Single-cell DNA methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics.

PubMed

Farlik, Matthias; Sheffield, Nathan C; Nuzzo, Angelo; Datlinger, Paul; Schönegger, Andreas; Klughammer, Johanna; Bock, Christoph

2015-03-03

Methods for single-cell genome and transcriptome sequencing have contributed to our understanding of cellular heterogeneity, whereas methods for single-cell epigenomics are much less established. Here, we describe a whole-genome bisulfite sequencing (WGBS) assay that enables DNA methylation mapping in very small cell populations (μWGBS) and single cells (scWGBS). Our assay is optimized for profiling many samples at low coverage, and we describe a bioinformatic method that analyzes collections of single-cell methylomes to infer cell-state dynamics. Using these technological advances, we studied epigenomic cell-state dynamics in three in vitro models of cellular differentiation and pluripotency, where we observed characteristic patterns of epigenome remodeling and cell-to-cell heterogeneity. The described method enables single-cell analysis of DNA methylation in a broad range of biological systems, including embryonic development, stem cell differentiation, and cancer. It can also be used to establish composite methylomes that account for cell-to-cell heterogeneity in complex tissue samples. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Genome-wide Mapping Reveals Conservation of Promoter DNA Methylation Following Chicken Domestication

PubMed Central

Li, Qinghe; Wang, Yuanyuan; Hu, Xiaoxiang; Zhao, Yaofeng; Li, Ning

2015-01-01

It is well known that environment influences DNA methylation; however, the extent of heritable DNA methylation variation following animal domestication remains largely unknown. Using meDIP-chip we mapped the promoter methylomes for 23,316 genes in muscle tissues of ancestral and domestic chickens. We systematically examined the variation of promoter DNA methylation in terms of different breeds, differentially expressed genes, SNPs and genes undergoing genetic selection sweeps. While considerable changes in DNA sequence and gene expression programs were prevalent, we found that the inter-strain DNA methylation patterns were highly conserved in promoter regions between the wild and domestic chicken breeds. Our data suggest a global preservation of DNA methylation between the wild and domestic chicken breeds at either a genome-wide or locus-specific scale in chick muscle tissues. PMID:25735894
ETS target genes: Identification of Egr1 as a target by RNA differential display and whole genome PCR techniques

PubMed Central

Robinson, Lois; Panayiotakis, Alexandra; Papas, Takis S.; Kola, Ismail; Seth, Arun

1997-01-01

ETS transcription factors play important roles in hematopoiesis, angiogenesis, and organogenesis during murine development. The ETS genes also have a role in neoplasia, for example in Ewing’s sarcomas and retrovirally induced cancers. The ETS genes encode transcription factors that bind to specific DNA sequences and activate transcription of various cellular and viral genes. To isolate novel ETS target genes, we used two approaches. In the first approach, we isolated genes by the RNA differential display technique. Previously, we have shown that the overexpression of ETS1 and ETS2 genes effects transformation of NIH 3T3 cells and specific transformants produce high levels of the ETS proteins. To isolate ETS1 and ETS2 responsive genes in these transformed cells, we prepared RNA from ETS1, ETS2 transformants, and normal NIH 3T3 cell lines and converted it into cDNA. This cDNA was amplified by PCR and displayed on sequencing gels. The differentially displayed bands were subcloned into plasmid vectors. By Northern blot analysis, several clones showed differential patterns of mRNA expression in the NIH 3T3-, ETS1-, and ETS2-expressing cell lines. Sixteen clones were analyzed by DNA sequence analysis, and 13 of them appeared to be unique because their DNA sequences did not match with any of the known genes present in the gene bank. Three known genes were found to be identical to the CArG box binding factor, phospholipase A2-activating protein, and early growth response 1 (Egr1) genes. In the second approach, to isolate ETS target promoters directly, we performed ETS1 binding with MboI-cleaved genomic DNA in the presence of a specific mAb followed by whole genome PCR. The immune complex-bound ETS binding sites containing DNA fragments were amplified and subcloned into pBluescript and subjected to DNA sequence and computer analysis. We found that, of a large number of clones isolated, 43 represented unique sequences not previously identified. Three clones turned out to contain regulatory sequences derived from human serglycin, preproapolipoprotein C II, and Egr1 genes. The ETS binding sites derived from these three regulatory sequences showed specific binding with recombinant ETS proteins. Of interest, Egr1 was identified by both of these techniques, suggesting strongly that it is indeed an ETS target gene. PMID:9207063
Clustered array of ochratoxin A biosynthetic genes in Aspergillus steynii and their expression patterns in permissive conditions.

PubMed

Gil-Serna, Jessica; Vázquez, Covadonga; González-Jaén, María Teresa; Patiño, Belén

2015-12-02

Aspergillus steynii is probably the most relevant species of section Circumdati producing ochratoxin A (OTA). This mycotoxin contaminates a wide range of commodities and is highly toxic for humans and animals. Little is known about the biosynthetic genes and their regulation in Aspergillus species. In this work, we identified and analysed three contiguous genes in A. steynii using 5'-RACE and genome walking approaches which predicted a cytochrome P450 monooxygenase (p450ste), a non-ribosomal peptide synthetase (nrpsste) and a polyketide synthase (pksste). These three genes were contiguous within a 20742 bp long genomic DNA fragment. Their corresponding cDNAs were sequenced and their expression was analysed in three A. steynii strains using real time RT-PCR specific assays in permissive conditions in in vitro cultures. OTA was also analysed in these cultures. Comparative analyses of predicted genomic, cDNA and amino acid sequences were performed with sequences of similar gene functions. All the results obtained in these analyses were consistent, point to the involvement of these three genes in OTA biosynthesis by A. steynii, and showed a co-ordinated expression pattern. This is the first time that a clustered organization of OTA biosynthetic genes has been reported in the genus Aspergillus. The results also suggested that this situation might be common in Aspergillus OTA-producing species and distinct from the one described for Penicillium species. Copyright © 2015 Elsevier B.V. All rights reserved.
TP53, PIK3CA, FBXW7 and KRAS Mutations in Esophageal Cancer Identified by Targeted Sequencing.

PubMed

Zheng, Huili; Wang, Yan; Tang, Chuanning; Jones, Lindsey; Ye, Hua; Zhang, Guangchun; Cao, Weihai; Li, Jingwen; Liu, Lifeng; Liu, Zhencong; Zhang, Chao; Lou, Feng; Liu, Zhiyuan; Li, Yangyang; Shi, Zhenfen; Zhang, Jingbo; Zhang, Dandan; Sun, Hong; Dong, Haichao; Dong, Zhishou; Guo, Baishuai; Yan, H E; Lu, Qingyu; Huang, Xue; Chen, Si-Yi

2016-01-01

Esophageal cancer (EC) is a common malignancy with significant morbidity and mortality. As individual cancers exhibit unique mutation patterns, identifying and characterizing gene mutations in EC that may serve as biomarkers might help predict patient outcome and guide treatment. Traditionally, personalized cancer DNA sequencing was impractical and expensive. Recent technological advancements have made targeted DNA sequencing more cost- and time-effective with reliable results. This technology may be useful for clinicians to direct patient treatment. The Ion PGM and AmpliSeq Cancer Panel was used to identify mutations at 737 hotspot loci of 45 cancer-related genes in 64 EC samples from Chinese patients. Frequent mutations were found in TP53 and less frequent mutations in PIK3CA, FBXW7 and KRAS. These results demonstrate that targeted sequencing can reliably identify mutations in individual tumors that make this technology a possibility for clinical use. Copyright© 2016, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.
Management of familial cancer: sequencing, surveillance and society.

PubMed

Samuel, Nardin; Villani, Anita; Fernandez, Conrad V; Malkin, David

2014-12-01

The clinical management of familial cancer begins with recognition of patterns of cancer occurrence suggestive of genetic susceptibility in a proband or pedigree, to enable subsequent investigation of the underlying DNA mutations. In this regard, next-generation sequencing of DNA continues to transform cancer diagnostics, by enabling screening for cancer-susceptibility genes in the context of known and emerging familial cancer syndromes. Increasingly, not only are candidate cancer genes sequenced, but also entire 'healthy' genomes are mapped in children with cancer and their family members. Although large-scale genomic analysis is considered intrinsic to the success of cancer research and discovery, a number of accompanying ethical and technical issues must be addressed before this approach can be adopted widely in personalized therapy. In this Perspectives article, we describe our views on how the emergence of new sequencing technologies and cancer surveillance strategies is altering the framework for the clinical management of hereditary cancer. Genetic counselling and disclosure issues are discussed, and strategies for approaching ethical dilemmas are proposed.
Activity of foreign proteins targeted within the bacteriophage T4 head and prohead: implications for packaged DNA structure.

PubMed

Mullaney, J M; Black, L W

1998-11-13

The phage-derived expression, packaging, and processing (PEPP) system was used to target foreign proteins into the bacteriophage capsid to probe the intracapsid environment and the structure of packaged DNA. Small proteins with minimal requirements for activity were selected, staphylococcal nuclease (SN) and green fluorescent protein (GFP). These proteins were targeted into the T4 head by means of IPIII (internal protein III) fusions or CTS (capsid targeting sequence) fusions. Additional evidence is provided that foreign proteins are targeted into T4 by the N-terminal ten amino acid residue consensus CTS of IPIII identified in previous work. Fusion proteins were produced within host bacteria by expression from plasmids or by production from recombinant phage carrying the fusion genes. Packaged fusion proteins CTS IPIII SN, CTS IPIII TSN, CTS IPIII GFP, CTS IPIII TGFP, and CTS GFP, where [symbol: see text] indicates a linkage peptide sequence Leu(Ile)-N-Glu cleaved by the T4 head morphogenetic proteinase gp21 during head maturation, are observed to exhibit intracapsid activity. SN activity within the head is demonstrated by loss of phage viability and by digested genomic DNA patterns visualized by gel electrophoresis when viable phage are incubated in Ca2+. Green fluorescent phage result immediately after packaging GFP produced at 30 °C and below, and continue to give green fluorescence under 470 nm light after CsCl purification. Non-fluorescent GFP-fusions are produced in bacteria at 37 °C, and phage packaged with these proteins achieve a fluorescent state after incubation for several months at 4 °C. GFP-packaged phage and proheads analyzed by fluorescence spectroscopy show that the mature head and the DNA-empty prohead package identical numbers of GFP-fusion proteins. Encapsidated GFP and SN can be injected into bacteria and rapidly exhibit intracellular activity.
In vivo SN digestion of encapsidated DNA gives an intriguing pattern of DNA fragments by gel analysis, predominantly a repeat pattern of 160 bp multiples, reminiscent of a nucleosome digestion ladder. This quasi-limit DNA digestion pattern, reached >100-fold more slowly than the loss of titer, is invariant over a range
Shotgun metagenomic data streams: surfing without fear

DOE Office of Scientific and Technical Information (OSTI.GOV)

Berendzen, Joel R

2010-12-06

Timely information about bio-threat prevalence, consequence, propagation, attribution, and mitigation is needed to support decision-making, both routinely and in a crisis. One DNA sequencer can stream 25 Gbp of information per day, but sampling strategies and analysis techniques are needed to turn raw sequencing power into actionable knowledge. Shotgun metagenomics can enable biosurveillance at the level of a single city, hospital, or airplane. Metagenomics characterizes viruses and bacteria from complex environments such as soil, air filters, or sewage. Unlike targeted-primer-based sequencing, shotgun methods are not blind to sequences that are truly novel, and they can measure absolute prevalence. Shotgun metagenomic sampling can be non-invasive, efficient, and inexpensive while being informative. We have developed analysis techniques for shotgun metagenomic sequencing that rely upon phylogenetic signature patterns. They work by indexing local sequence patterns in a manner similar to web search engines. Our methods are laptop-fast, and favorable scaling properties ensure they will be sustainable as sequencing methods grow. We show examples of application to soil metagenomic samples.
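The abstract above describes indexing local sequence patterns much as a web search engine indexes words. As a rough illustration only, the sketch below builds an inverted index from k-mers to reference names and counts index hits for a read. It is not the authors' method: the function names, the toy sequences, and the choice of k = 4 are all assumptions made for the example.

```python
from collections import defaultdict


def build_kmer_index(references, k):
    """Map every k-mer to the set of reference names that contain it."""
    index = defaultdict(set)
    for name, seq in references.items():
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].add(name)
    return index


def classify_read(read, index, k):
    """Count, per reference, how many k-mers of the read are found in the index."""
    hits = defaultdict(int)
    for i in range(len(read) - k + 1):
        # .get avoids creating empty entries for unseen k-mers
        for name in index.get(read[i:i + k], ()):
            hits[name] += 1
    return dict(hits)


# Hypothetical toy references; real signatures would come from curated genomes.
references = {
    "taxon_A": "ACGTACGTGGA",
    "taxon_B": "TTGACCATGCA",
}
index = build_kmer_index(references, k=4)
print(classify_read("ACGTGGA", index, k=4))
```

Lookup cost per read depends on the read length rather than on the number of references, which is the property that makes an index of this kind attractive for streaming data.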
A Dual-Mode Large-Arrayed CMOS ISFET Sensor for Accurate and High-Throughput pH Sensing in Biomedical Diagnosis.

PubMed

Huang, Xiwei; Yu, Hao; Liu, Xu; Jiang, Yu; Yan, Mei; Wu, Dongping

2015-09-01

The existing ISFET-based DNA sequencing detects hydrogen ions released during the polymerization of DNA strands on microbeads, which are scattered into microwell array above the ISFET sensor with unknown distribution. However, false pH detection happens at empty microwells due to crosstalk from neighboring microbeads. In this paper, a dual-mode CMOS ISFET sensor is proposed to have accurate pH detection toward DNA sequencing. Dual-mode sensing, optical and chemical modes, is realized by integrating a CMOS image sensor (CIS) with ISFET pH sensor, and is fabricated in a standard 0.18-μm CIS process. With accurate determination of microbead physical locations with CIS pixel by contact imaging, the dual-mode sensor can correlate local pH for one DNA slice at one location-determined microbead, which can result in improved pH detection accuracy. Moreover, toward a high-throughput DNA sequencing, a correlated-double-sampling readout that supports large array for both modes is deployed to reduce pixel-to-pixel nonuniformity such as threshold voltage mismatch. The proposed CMOS dual-mode sensor is experimentally examined to show a well correlated pH map and optical image for microbeads with a pH sensitivity of 26.2 mV/pH, a fixed pattern noise (FPN) reduction from 4% to 0.3%, and a readout speed of 1200 frames/s. A dual-mode CMOS ISFET sensor with suppressed FPN for accurate large-arrayed pH sensing is proposed and demonstrated with state-of-the-art measured results toward accurate and high-throughput DNA sequencing. The developed dual-mode CMOS ISFET sensor has great potential for future personal genome diagnostics with high accuracy and low cost.
Conditional poliovirus mutants made by random deletion mutagenesis of infectious cDNA.

PubMed Central

Kirkegaard, K; Nelsen, B

1990-01-01

Small deletions were introduced into DNA plasmids bearing cDNA copies of Mahoney type 1 poliovirus RNA. The procedure used was similar to that of P. Hearing and T. Shenk (J. Mol. Biol. 167:809-822, 1983), with modifications designed to introduce only one lesion randomly into each DNA molecule. Methods to map small deletions in either large DNA or RNA molecules were employed. Two poliovirus mutants, VP1-101 and VP1-102, were selected from mutagenized populations on the basis of their host range phenotype, showing a large reduction in the relative numbers of plaques on CV1 and HeLa cells compared with wild-type virus. The deletions borne by the mutant genomes were mapped to the region encoding the amino terminus of VP1. That these lesions were responsible for the mutant phenotypes was substantiated by reintroduction of the sequenced lesions into a wild-type poliovirus cDNA by deoxyoligonucleotide-directed mutagenesis. The deletion of nucleotides encoding amino acids 8 and 9 of VP1 was responsible for the VP1-101 phenotype; the VP1-102 defect was caused by the deletion of the sequences encoding the first four amino acids of VP1. The peptide sequence at the VP1-VP3 proteolytic cleavage site was altered from glutamine-glycine to glutamine-methionine in VP1-102; this apparently did not alter the proteolytic cleavage pattern. The biochemical defects resulting from these mutations are discussed in the accompanying report. PMID:2152811
Research on parallel algorithm for sequential pattern mining

NASA Astrophysics Data System (ADS)

Zhou, Lijuan; Qin, Bai; Wang, Yu; Hao, Zhongxiao

2008-03-01

Sequential pattern mining is the mining of frequent sequences related to time or other orders from the sequence database. Its initial motivation is to discover the laws of customer purchasing in a time section by finding the frequent sequences. In recent years, sequential pattern mining has become an important direction of data mining, and its application field has not been confined to the business database and has extended to new data sources such as the Web and advanced science fields such as DNA analysis. The data used in sequential pattern mining has two characteristics: large data volume and distributed storage. Most existing sequential pattern mining algorithms have not considered these characteristics together. According to the traits mentioned above and combining the parallel theory, this paper puts forward a new distributed parallel algorithm, SPP (Sequential Pattern Parallel). The algorithm abides by the principle of pattern reduction and utilizes the divide-and-conquer strategy for parallelization. The first parallel task is to construct frequent item sets applying frequent concept and search space partition theory, and the second task is to construct frequent sequences using the depth-first search method at each processor. The algorithm only needs to access the database twice and does not generate candidate sequences, which reduces the access time and improves the mining efficiency. Based on the random data generation procedure and different information structure designed, this paper simulated the SPP algorithm in a concrete parallel environment and implemented the AprioriAll algorithm. The experiments demonstrate that, compared with AprioriAll, the SPP algorithm had excellent speedup factor and efficiency.
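To make the idea of growing frequent sequences depth-first without generating candidates more concrete, here is a minimal single-process sketch in the style of prefix projection (PrefixSpan). It is not the SPP algorithm from the abstract and has no parallel component. The function name, the toy database, and the restriction to sequences of single items are assumptions made for illustration.

```python
from collections import defaultdict


def frequent_sequences(database, min_support):
    """Return {pattern: support} for every frequent subsequence.

    database: list of sequences (strings or lists of hashable, sortable items).
    A sequence supports a pattern if the pattern's items occur in order,
    not necessarily adjacent to each other.
    """
    results = {}

    def grow(prefix, projected):
        # projected holds (sequence id, start offset) for each sequence
        # that still contains the current prefix.
        support = defaultdict(set)
        first_pos = {}
        for sid, start in projected:
            seq = database[sid]
            for pos in range(start, len(seq)):
                item = seq[pos]
                if sid not in support[item]:
                    support[item].add(sid)
                    first_pos[(item, sid)] = pos
        for item, sids in sorted(support.items()):
            if len(sids) >= min_support:
                pattern = prefix + (item,)
                results[pattern] = len(sids)
                # Depth-first: extend this pattern before trying the next item.
                grow(pattern,
                     [(sid, first_pos[(item, sid)] + 1) for sid in sorted(sids)])

    grow((), [(sid, 0) for sid in range(len(database))])
    return results


# Hypothetical toy database of three short DNA-like sequences.
patterns = frequent_sequences(["ACGT", "ACT", "AGT"], min_support=2)
for pattern, support in patterns.items():
    print("".join(pattern), support)
```

Each recursive call only scans the suffixes that remain after the current prefix, so infrequent extensions are never enumerated. In a parallel setting, the top-level items would be one natural unit to hand to separate workers, although how SPP actually partitions its search space is not specified in the abstract.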
Principles of regulatory information conservation between mouse and human.

PubMed

Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; Wu, Weisheng; Cayting, Philip; Boyle, Alan P; Sundaram, Vasavi; Xing, Xiaoyun; Dogan, Nergiz; Li, Jingjing; Euskirchen, Ghia; Lin, Shin; Lin, Yiing; Visel, Axel; Kawli, Trupti; Yang, Xinqiong; Patacsil, Dorrelyn; Keller, Cheryl A; Giardine, Belinda; Kundaje, Anshul; Wang, Ting; Pennacchio, Len A; Weng, Zhiping; Hardison, Ross C; Snyder, Michael P

2014-11-20

To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.
Emerging technologies for studying DNA methylation for the molecular diagnosis of cancer

PubMed Central

Marzese, Diego M.; Hoon, Dave S.B.

2015-01-01

DNA methylation is an epigenetic mechanism that plays a key role in regulating gene expression and other functions. Although this modification is seen in different sequence contexts, the most frequently detected DNA methylation in mammals involves cytosine-guanine dinucleotides. Pathological alterations in DNA methylation patterns are described in a variety of human diseases, including cancer. Unlike genetic changes, DNA methylation is heavily influenced by subtle modifications in the cellular microenvironment. In all cancers, aberrant DNA methylation is involved in the alteration of a large number of oncological pathways with relevant theranostic utility. Several technologies for DNA methylation mapping were recently developed and successfully applied in cancer studies. The scope of these technologies varies from assessing a single cytosine-guanine locus to genome-wide distribution of DNA methylation. Here, we review the strengths and weaknesses of these approaches in the context of clinical utility for the molecular diagnosis of human cancers. PMID:25797072
Ancestral Polymorphisms and Sex-Biased Migration Shaped the Demographic History of Brown Bears and Polar Bears

PubMed Central

Nakagome, Shigeki; Mano, Shuhei; Hasegawa, Masami

2013-01-01

Recent studies have reported discordant gene trees in the evolution of brown bears and polar bears. Genealogical histories are different among independent nuclear loci and between biparentally inherited autosomal DNA (aDNA) and matrilineal mitochondrial DNA (mtDNA). Based on multi-locus genomic sequences from aDNA and mtDNA, we inferred the population demography of brown and polar bears and found that brown bears have 6 times (aDNA) or more than 14 times (mtDNA) larger population sizes than polar bears and that polar bear lineage is derived from within brown bear diversity. In brown bears, the effective population size ratio of mtDNA to aDNA was at least 0.62, which deviated from the expected value of 0.25, suggesting matriarchal population due to female philopatry and male-biased migration. These results emphasize that ancestral polymorphisms and sex-biased migration may have contributed to conflicting branching patterns in brown and polar bears across aDNA genes and mtDNA. PMID:24236053
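The expected value of 0.25 quoted above follows from textbook population genetics under idealized assumptions: equal numbers of breeding males and females, and mtDNA treated as haploid and strictly maternally inherited. The sketch below reproduces that arithmetic. It is a standard idealization rather than the model fitted in the paper, and the function names and census numbers are hypothetical.

```python
def autosomal_ne(n_males, n_females):
    """Effective size for autosomal loci with separate sexes."""
    return 4 * n_males * n_females / (n_males + n_females)


def mtdna_to_autosomal_ratio(n_males, n_females):
    """Ratio of mtDNA to autosomal effective population size.

    mtDNA is haploid and passed on by females only, so on the
    diploid-equivalent scale its effective size is n_females / 2.
    """
    return (n_females / 2) / autosomal_ne(n_males, n_females)


# Equal numbers of breeding males and females give the expected 0.25.
print(mtdna_to_autosomal_ratio(500, 500))
```

Any departure from these assumptions moves the ratio away from 0.25, which is why an estimate of at least 0.62 calls for an explanation such as the female philopatry and male-biased migration proposed by the authors.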
Ancestral polymorphisms and sex-biased migration shaped the demographic history of brown bears and polar bears.

PubMed

Nakagome, Shigeki; Mano, Shuhei; Hasegawa, Masami

2013-01-01

Recent studies have reported discordant gene trees in the evolution of brown bears and polar bears. Genealogical histories are different among independent nuclear loci and between biparentally inherited autosomal DNA (aDNA) and matrilineal mitochondrial DNA (mtDNA). Based on multi-locus genomic sequences from aDNA and mtDNA, we inferred the population demography of brown and polar bears and found that brown bears have 6 times (aDNA) or more than 14 times (mtDNA) larger population sizes than polar bears and that polar bear lineage is derived from within brown bear diversity. In brown bears, the effective population size ratio of mtDNA to aDNA was at least 0.62, which deviated from the expected value of 0.25, suggesting matriarchal population due to female philopatry and male-biased migration. These results emphasize that ancestral polymorphisms and sex-biased migration may have contributed to conflicting branching patterns in brown and polar bears across aDNA genes and mtDNA.
Scar-less multi-part DNA assembly design automation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hillson, Nathan J.

The present invention provides a method of designing an implementation of a DNA assembly. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding flanking homology sequences to each of the DNA oligos. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding optimized overhang sequences to each of the DNA oligos.
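As a rough illustration of step (3), the sketch below takes an ordered list of fragments and records, for each one, a flanking sequence copied from the end of its upstream neighbour, so that adjacent parts share an overlap and the joined product carries no extra bases. This is a simplified assumption about what such a plan could look like, not the patented method. The function names, the 4-base homology length, and the linear (non-circular) layout are all hypothetical.

```python
def plan_flanking_homology(fragments, homology_len=4):
    """For each fragment, in assembly order, record the flank to prepend."""
    plan = []
    for i, frag in enumerate(fragments):
        # The first fragment has no upstream neighbour in a linear assembly.
        left_flank = fragments[i - 1][-homology_len:] if i > 0 else ""
        plan.append({
            "fragment": frag,
            "left_flank": left_flank,
            "extended": left_flank + frag,
        })
    return plan


def assemble(plan):
    """Join the extended parts by merging each shared overlap exactly once."""
    product = plan[0]["extended"]
    for step in plan[1:]:
        overlap = step["left_flank"]
        if not product.endswith(overlap):
            raise ValueError("neighbouring parts do not share the planned overlap")
        product += step["extended"][len(overlap):]
    return product


# Hypothetical toy fragments; real designs use much longer homology regions.
fragments = ["ATGGCCAAG", "TTTCCGGA", "GGATCCTAA"]
plan = plan_flanking_homology(fragments)
print(assemble(plan))
```

Because each flank is merged back into the overlap it came from, the product equals the plain concatenation of the input fragments, which is what "scar-less" means in this context.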
Stress-induced rearrangement of Fusarium retrotransposon sequences.

PubMed

Anaya, N; Roncero, M I

1996-11-27

Rearrangement of the Fusarium oxysporum retrotransposon skippy was induced by growth in the presence of potassium chlorate. Three fungal strains, one sensitive to chlorate (Co60) and two resistant to chlorate and deficient for nitrate reductase (Co65 and Co94), were studied by Southern analysis of their genomic DNA. Polymorphism was detected in their hybridization banding pattern, relative to the wild type grown in the absence of chlorate, using various enzymes with or without restriction sites within the retrotransposon. Results were consistent with the assumption that three different events had occurred in strain Co60: genomic amplification of skippy yielding tandem arrays of the element, generation of new skippy sequences, and deletion of skippy sequences. Amplification of Co60 genomic DNA using the polymerase chain reaction and divergent primers derived from the retrotransposon generated a new band, corresponding to one long terminal repeat plus flanking sequences, that was not present in the wild-type strain. Molecular analysis of nitrate reductase-deficient mutants showed that generation and deletion of skippy sequences, but not genomic amplification in tandem repeats, had occurred in their genomes.
Genomic Organization of Repetitive DNA in Woodpeckers (Aves, Piciformes): Implications for Karyotype and ZW Sex Chromosome Differentiation

PubMed Central

Kretschmer, Rafael; Bertocchi, Natasha Avila; Degrandi, Tiago Marafiga; de Oliveira, Edivaldo Herculano Corrêa; Cioffi, Marcelo de Bello; Garnero, Analía del Valle; Gunski, Ricardo José

2017-01-01

Birds are characterized by a low proportion of repetitive DNA in their genome when compared to other vertebrates. Among birds, species belonging to the order Piciformes, such as woodpeckers, show a relatively higher amount of these sequences. The aim of this study was to analyze the distribution of different classes of repetitive DNA (including microsatellites, telomere sequences and 18S rDNA) in the karyotype of three Picidae species (Aves, Piciformes), Colaptes melanochloros (2n = 84), Colaptes campestris (2n = 84) and Melanerpes candidus (2n = 64), by means of fluorescence in situ hybridization. Clusters of 18S rDNA were found in one microchromosome pair in each of the three species, coinciding with a region of (CGG)10 sequence accumulation. Interstitial telomeric sequences were found in some macrochromosome pairs, indicating possible regions of fusions, which can be related to variation of diploid number in the family. Only one of the 11 different microsatellite sequences used did not produce any signals. Both species of genus Colaptes showed a similar distribution of microsatellite sequences, with some difference when compared to M. candidus. Microsatellites were found preferentially in the centromeric and telomeric regions of micro and macrochromosomes. However, some sequences produced patterns of interstitial bands in the Z chromosome, which corresponds to the largest element of the karyotype in all three species. This was not observed in the W chromosome of Colaptes melanochloros, which is heterochromatic in most of its length, but was not hybridized by any of the sequences used. These results highlight the importance of microsatellite sequences in differentiation of sex chromosomes, and the accumulation of these sequences is probably responsible for the enlargement of the Z chromosome. PMID:28081238
Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.

PubMed Central

Ananiev, E V; Phillips, R L; Rines, H W

1998-01-01

The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25,000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055

Multiplexed SNP typing of ancient DNA clarifies the origin of Andaman mtDNA haplogroups amongst South Asian tribal populations.

PubMed

Endicott, Phillip; Metspalu, Mait; Stringer, Chris; Macaulay, Vincent; Cooper, Alan; Sanchez, Juan J

2006-12-20

The issue of errors in genetic data sets is of growing concern, particularly in population genetics where whole genome mtDNA sequence data is coming under increased scrutiny. Multiplexed PCR reactions, combined with SNP typing, are currently under-exploited in this context, but have the potential to genotype whole populations rapidly and accurately, significantly reducing the amount of errors appearing in published data sets. To show the sensitivity of this technique for screening mtDNA genomic sequence data, 20 historic samples of the enigmatic Andaman Islanders and 12 modern samples from three Indian tribal populations (Chenchu, Lambadi and Lodha) were genotyped for 20 coding region sites after provisional haplogroup assignment with control region sequences. The genotype data from the historic samples significantly revise the topologies for the Andaman M31 and M32 mtDNA lineages by rectifying conflicts in published data sets. The new Indian data extend the distribution of the M31a lineage to South Asia, challenging previous interpretations of mtDNA phylogeography. This genetic connection between the ancestors of the Andamanese and South Asian tribal groups approximately 30 kya has important implications for the debate concerning migration routes and settlement patterns of humans leaving Africa during the late Pleistocene, and indicates the need for more detailed genotyping strategies. The methodology serves as a low-cost, high-throughput model for the production and authentication of data from modern or ancient DNA, and demonstrates the value of museum collections as important records of human genetic diversity.
Multiplexed SNP Typing of Ancient DNA Clarifies the Origin of Andaman mtDNA Haplogroups amongst South Asian Tribal Populations

PubMed Central

Endicott, Phillip; Metspalu, Mait; Stringer, Chris; Macaulay, Vincent; Cooper, Alan; Sanchez, Juan J.

2006-01-01

The issue of errors in genetic data sets is of growing concern, particularly in population genetics where whole genome mtDNA sequence data is coming under increased scrutiny. Multiplexed PCR reactions, combined with SNP typing, are currently under-exploited in this context, but have the potential to genotype whole populations rapidly and accurately, significantly reducing the amount of errors appearing in published data sets. To show the sensitivity of this technique for screening mtDNA genomic sequence data, 20 historic samples of the enigmatic Andaman Islanders and 12 modern samples from three Indian tribal populations (Chenchu, Lambadi and Lodha) were genotyped for 20 coding region sites after provisional haplogroup assignment with control region sequences. The genotype data from the historic samples significantly revise the topologies for the Andaman M31 and M32 mtDNA lineages by rectifying conflicts in published data sets. The new Indian data extend the distribution of the M31a lineage to South Asia, challenging previous interpretations of mtDNA phylogeography. This genetic connection between the ancestors of the Andamanese and South Asian tribal groups ∼30 kya has important implications for the debate concerning migration routes and settlement patterns of humans leaving Africa during the late Pleistocene, and indicates the need for more detailed genotyping strategies. The methodology serves as a low-cost, high-throughput model for the production and authentication of data from modern or ancient DNA, and demonstrates the value of museum collections as important records of human genetic diversity. PMID:17218991
Nanopore sensing of individual transcription factors bound to DNA

PubMed Central

Squires, Allison; Atas, Evrim; Meller, Amit

2015-01-01

Transcription factor (TF)-DNA interactions are the primary control point in regulation of gene expression. Characterization of these interactions is essential for understanding genetic regulation of biological systems and developing novel therapies to treat cellular malfunctions. Solid-state nanopores are a highly versatile class of single-molecule sensors that can provide rich information about local properties of long charged biopolymers using the current blockage patterns generated during analyte translocation, and provide a novel platform for characterization of TF-DNA interactions. The DNA-binding domain of the TF Early Growth Response Protein 1 (EGR1), a prototypical zinc finger protein known as zif268, is used as a model system for this study. zif268 adopts two distinct bound conformations corresponding to specific and nonspecific binding, according to the local DNA sequence. Here we implement a solid-state nanopore platform for direct, label- and tether-free single-molecule detection of zif268 bound to DNA. We demonstrate detection of single zif268 TFs bound to DNA according to current blockage sublevels and duration of translocation through the nanopore. We further show that the nanopore can detect and discriminate both specific and nonspecific binding conformations of zif268 on DNA via the distinct current blockage patterns corresponding to each of these two known binding modes. PMID:26109509
Nanopore sensing of individual transcription factors bound to DNA

NASA Astrophysics Data System (ADS)

Squires, Allison; Atas, Evrim; Meller, Amit

2015-06-01

Transcription factor (TF)-DNA interactions are the primary control point in regulation of gene expression. Characterization of these interactions is essential for understanding genetic regulation of biological systems and developing novel therapies to treat cellular malfunctions. Solid-state nanopores are a highly versatile class of single-molecule sensors that can provide rich information about local properties of long charged biopolymers using the current blockage patterns generated during analyte translocation, and provide a novel platform for characterization of TF-DNA interactions. The DNA-binding domain of the TF Early Growth Response Protein 1 (EGR1), a prototypical zinc finger protein known as zif268, is used as a model system for this study. zif268 adopts two distinct bound conformations corresponding to specific and nonspecific binding, according to the local DNA sequence. Here we implement a solid-state nanopore platform for direct, label- and tether-free single-molecule detection of zif268 bound to DNA. We demonstrate detection of single zif268 TFs bound to DNA according to current blockage sublevels and duration of translocation through the nanopore. We further show that the nanopore can detect and discriminate both specific and nonspecific binding conformations of zif268 on DNA via the distinct current blockage patterns corresponding to each of these two known binding modes.
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells.

PubMed

Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang

2018-01-01

Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. © 2018 Han et al.; Published by Cold Spring Harbor Laboratory Press.
SIDR: simultaneous isolation and parallel sequencing of genomic DNA and total RNA from single cells

PubMed Central

Han, Kyung Yeon; Kim, Kyu-Tae; Joung, Je-Gun; Son, Dae-Soon; Kim, Yeon Jeong; Jo, Areum; Jeon, Hyo-Jeong; Moon, Hui-Sung; Yoo, Chang Eun; Chung, Woosung; Eum, Hye Hyeon; Kim, Sangmin; Kim, Hong Kwan; Lee, Jeong Eon; Ahn, Myung-Ju; Lee, Hae-Ock; Park, Donghyun; Park, Woong-Yang

2018-01-01

Simultaneous sequencing of the genome and transcriptome at the single-cell level is a powerful tool for characterizing genomic and transcriptomic variation and revealing correlative relationships. However, it remains technically challenging to analyze both the genome and transcriptome in the same cell. Here, we report a novel method for simultaneous isolation of genomic DNA and total RNA (SIDR) from single cells, achieving high recovery rates with minimal cross-contamination, as is crucial for accurate description and integration of the single-cell genome and transcriptome. For reliable and efficient separation of genomic DNA and total RNA from single cells, the method uses hypotonic lysis to preserve nuclear lamina integrity and subsequently captures the cell lysate using antibody-conjugated magnetic microbeads. Evaluating the performance of this method using real-time PCR demonstrated that it efficiently recovered genomic DNA and total RNA. Thorough data quality assessments showed that DNA and RNA simultaneously fractionated by the SIDR method were suitable for genome and transcriptome sequencing analysis at the single-cell level. The integration of single-cell genome and transcriptome sequencing by SIDR (SIDR-seq) showed that genetic alterations, such as copy-number and single-nucleotide variations, were more accurately captured by single-cell SIDR-seq compared with conventional single-cell RNA-seq, although copy-number variations positively correlated with the corresponding gene expression levels. These results suggest that SIDR-seq is potentially a powerful tool to reveal genetic heterogeneity and phenotypic information inferred from gene expression patterns at the single-cell level. PMID:29208629
Isogenic mice exhibit sexually-dimorphic DNA methylation patterns across multiple tissues.

PubMed

McCormick, Helen; Young, Paul E; Hur, Suzy S J; Booher, Keith; Chung, Hunter; Cropley, Jennifer E; Giannoulatou, Eleni; Suter, Catherine M

2017-12-13

Cytosine methylation is a stable epigenetic modification of DNA that plays an important role in both normal physiology and disease. Most diseases exhibit some degree of sexual dimorphism, but the extent to which epigenetic states are influenced by sex is understudied and poorly understood. To address this deficit we studied DNA methylation patterns across multiple reduced representation bisulphite sequencing datasets (from liver, heart, brain, muscle and spleen) derived from isogenic male and female mice. DNA methylation patterns varied significantly from tissue to tissue, as expected, but they also varied between the sexes, with thousands of sexually dimorphic loci identified. The loci affected were largely autonomous to each tissue, even within tissues derived from the same germ layer. At most loci, differences between genders were driven by females exhibiting hypermethylation relative to males; a proportion of these differences were independent of the presence of testosterone in males. Loci harbouring gender differences were clustered in ontologies related to tissue function. Our findings suggest that gender is underwritten in the epigenome in a tissue-specific and potentially sex hormone-independent manner. Gender-specific epigenetic states are likely to have important implications for understanding sexually dimorphic phenotypes in health and disease.
Structural analysis of the rDNA intergenic spacer of Brassica nigra: evolutionary divergence of the spacers of the three diploid Brassica species.

PubMed

Bhatia, S; Singh Negi, M; Lakshmikumaran, M

1996-11-01

EcoRI restriction of the B. nigra rDNA recombinants, isolated from a lambda genomic library, showed that the 3.9-kb fragment corresponded to the Intergenic Spacer (IGS), which was sequenced and found to be 3,928 bp in size. Sequence and dot-matrix analyses showed that the organization of the B. nigra rDNA IGS was typical of most rDNA spacers, consisting of a central repetitive region and flanking unique sequences on either side. The repetitive region was composed of two repeat families, RF 'A' and RF 'B'. The B. nigra RF 'A' consisted of a tandem array of three full-length copies of a 106-bp sequence element. RF 'B' was composed of 66 tandemly repeated elements. Each 'B' element was only 21 bp in size and this is the smallest repeat unit identified in plant rDNA to date. The putative transcription initiation site (TIS) was identified as nucleotide position 3,110. Based on the sequence analysis it was suggested that the present organization of the repeat families was generated by successive cycles of deletions and amplifications and was being maintained by homogenization processes such as gene conversion and crossing-over. A detailed comparison of the rDNA IGS sequences of the three diploid Brassica species (namely, B. nigra, B. campestris, and B. oleracea) was carried out. First, comparisons revealed that B. campestris and B. oleracea were close to each other as the repeat families in both showed high sequence homology between each other. Second, the repeat elements in both the species were organized in an interspersed manner. Third, a 52-bp sequence, present just downstream of the repeats in B. campestris, was found to be identical to the B. oleracea repeats, thereby suggesting a common progenitor. On the other hand, in B. nigra no interspersion pattern of organization of repeats was observed. Further, the B. nigra RF 'A' was identified as distinct from the repeat families of B. campestris and B. oleracea. Based on this analysis, it was suggested that during speciation B. campestris and B. oleracea evolved in one lineage whereas B. nigra diverged into a separate lineage. The comparative analysis of the IGS helped in identifying not only conserved ancestral sequence motifs of possible functional significance such as promoters and enhancers, but also sequences which showed variation between the three diploid species and were therefore identified as species-specific sequences.
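As an illustration of the tandem-repeat organization described in the abstract above, the following is a minimal Python sketch that locates the longest run of back-to-back exact copies of a repeat unit and computes the length of a tandem array. The function names, the 8-bp unit, and the toy sequence are hypothetical and are not taken from the B. nigra IGS; real repeat copies usually diverge, so an exact-match scan like this would only be a first approximation.

```python
def longest_tandem_run(seq: str, unit: str) -> tuple[int, int]:
    """Return (start, copies) for the longest run of exact, adjacent copies of `unit`.

    Returns (-1, 0) if `unit` never occurs. Exact matching only (an assumption;
    diverged copies are not detected).
    """
    if not unit:
        raise ValueError("unit must be non-empty")
    k = len(unit)
    best_start, best_copies = -1, 0
    i = 0
    while i <= len(seq) - k:
        if seq.startswith(unit, i):
            # Extend the run one unit at a time.
            j, copies = i, 0
            while seq.startswith(unit, j):
                copies += 1
                j += k
            if copies > best_copies:
                best_start, best_copies = i, copies
            i = j  # resume scanning after the run
        else:
            i += 1
    return best_start, best_copies


def array_length(copies: int, unit_bp: int) -> int:
    """Total length in bp of a tandem array of full-length copies."""
    return copies * unit_bp


# Hypothetical toy data, not the real IGS sequence.
toy_unit = "ACGTTGCA"
toy_seq = "GGG" + toy_unit * 3 + "TTT"
print(longest_tandem_run(toy_seq, toy_unit))

# Array sizes implied by the copy numbers and unit sizes reported in the abstract.
print(array_length(3, 106))   # RF 'A': three copies of a 106-bp element
print(array_length(66, 21))   # RF 'B': 66 copies of a 21-bp element
```

Under these figures, RF 'A' would span 318 bp and RF 'B' 1,386 bp, assuming all copies are full length.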
Unequal rates of Y chromosome gene divergence during speciation of the family Ursidae.

PubMed

Nakagome, Shigeki; Pecon-Slattery, Jill; Masuda, Ryuichi

2008-07-01

Evolution of the bear family Ursidae is well investigated in terms of morphological, paleontological, and genetic features. However, several phylogenetic ambiguities occur within the subfamily Ursinae (the family Ursidae excluding the giant panda and spectacled bear), which may correlate with behavioral traits of female philopatry and male-biased dispersal which form the basis of the observed matriarchal population structure in these species. In the process of bear evolution, we investigate the premise that such behavioral traits may be reflected in patterns of variation among genes with different modes of inheritance: matrilineal mitochondrial DNA (mtDNA), patrilineal Y chromosome, biparentally inherited autosomes, and the X chromosome. In the present study, we sequenced 3 Y-linked genes (3,453 bp) and 4 X-linked genes (4,960 bp) and reanalyzed previously published sequences from autosome genes (2,347 bp) in ursid species to investigate differences in evolutionary rates associated with patterns of inheritance. The results describe topological incongruence between sex-linked genes and autosome genes and between nuclear DNA and mtDNA. In more ancestral branches within the bear phylogeny, Y-linked genes evolved faster than autosome and X-linked genes, consistent with expectations based on male-driven evolution. However, this pattern changes among branches leading to each species within the lineage of Ursinae whereby the evolutionary rates of Y-linked genes have fewer than expected substitutions. This inconsistency between more recent nodes of the bear phylogeny with more ancestral nodes may reflect the influences of sex-biased dispersal as well as molecular evolutionary characteristics of the Y chromosome, and stochastic events in species natural history, and phylogeography unique to ursine bears.
Screening and Characterization of RAPD Markers in Viscerotropic Leishmania Parasites

PubMed Central

Mkada–Driss, Imen; Talbi, Chiraz; Guerbouj, Souheila; Driss, Mehdi; Elamine, Elwaleed M.; Cupolillo, Elisa; Mukhtar, Moawia M.; Guizani, Ikram

2014-01-01

Visceral leishmaniasis (VL) is mainly due to the Leishmania donovani complex. VL is endemic in many countries worldwide including East Africa and the Mediterranean region where the epidemiology is complex. The taxonomy of these pathogens is controversial, but there is a correlation between their genetic diversity and geographical origin. Even with the steady increase in genome knowledge, RAPD is still a useful approach to identify and characterize novel DNA markers. Our aim was to identify and characterize polymorphic DNA markers in VL Leishmania parasites in diverse geographic regions using RAPD in order to constitute a pool of PCR targets having the potential to differentiate among the VL parasites. 100 different oligonucleotide decamers having arbitrary DNA sequences were screened for reproducible amplification and a selection of 28 was used to amplify DNA from 12 L. donovani, L. archibaldi and L. infantum strains having diverse origins. A total of 155 bands were amplified, of which 60.65% appeared polymorphic. 7 out of 28 primers provided monomorphic patterns. Phenetic analysis allowed clustering the parasites according to their geographical origin. Differentially amplified bands were selected; among them, 22 RAPD products were successfully cloned and sequenced. Bioinformatic analysis allowed mapping of the markers and analysis of sequences and priming sites. This study was complemented with Southern blot to confirm assignment of markers to the kDNA. The bioinformatic analysis identified 16 nuclear and 3 minicircle markers. Analysis of these markers highlighted polymorphisms at RAPD priming sites with mainly 5′ end transversions, and the presence of inter- and intra-taxonomic complex sequence and microsatellite variations; a bias in transitions over transversions and indels between the different sequences compared is observed, which is however less marked between L. infantum and L. donovani. The study delivers a pool of well-documented polymorphic DNA markers for developing molecular diagnostic assays to characterize and differentiate VL-causing agents. PMID:25313833
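The polymorphism percentage quoted in the abstract above can be illustrated with a short Python sketch that scores a presence/absence band matrix. The matrix below is hypothetical toy data, and the scoring rule (a band is polymorphic if it is present in some strains and absent in others) is a common convention assumed here, not a rule stated in the abstract. The count of 94 polymorphic bands is inferred from 60.65% of 155 and is not given in the source.

```python
def percent_polymorphic(band_matrix: list[list[int]]) -> float:
    """Percentage of bands that are polymorphic across strains.

    band_matrix[i][j] is 1 if band i is present in strain j, else 0.
    A band is counted as polymorphic when it is neither present in every
    strain nor absent from every strain (assumed convention).
    """
    if not band_matrix:
        raise ValueError("band_matrix must contain at least one band")
    polymorphic = sum(1 for row in band_matrix if 0 < sum(row) < len(row))
    return 100.0 * polymorphic / len(band_matrix)


# Hypothetical toy matrix: 4 bands scored in 3 strains.
toy = [
    [1, 1, 1],  # monomorphic (present everywhere)
    [1, 0, 1],  # polymorphic
    [0, 1, 0],  # polymorphic
    [1, 1, 1],  # monomorphic
]
print(percent_polymorphic(toy))

# Consistency check on the reported figure: 94 of 155 bands (94 is inferred).
print(round(100 * 94 / 155, 2))
```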
Active BRAF-V600E is the key player in generation of a sessile serrated polyp-specific DNA methylation profile

PubMed Central

Dehghanizadeh, Somaye; Khoddami, Vahid; Mosbruger, Timothy L.; Hammoud, Sue S.; Edes, Kornelia; Berry, Therese S.; Done, Michelle; Samowitz, Wade S.; DiSario, James A.; Luba, Daniel G.; Burt, Randall W.

2018-01-01

Background: Sessile serrated polyps (SSPs) have emerged as important precursors for a large number of sporadic colorectal cancers. They are difficult to detect during colonoscopy due to their flat shape and the excessive amounts of secreted mucin that cover the polyps. The underlying genetic and epigenetic basis for the emergence of SSPs is largely unknown with existing genetic studies confined to a limited number of oncogenes and tumor suppressors. A full characterization of the genetic and epigenetic landscape of SSPs would provide insight into their origin and potentially offer new biomarkers useful for detection of SSPs in stool samples. Methods: We used a combination of genome-wide mutation detection, exome sequencing and DNA methylation profiling (via methyl-array and whole-genome bisulfite sequencing) to analyze multiple samples of sessile serrated polyps and compared these to familial adenomatous polyps. Results: Our analysis revealed BRAF-V600E as the sole recurring somatic mutation in SSPs with no additional major genetic mutations detected. The occurrence of BRAF-V600E was coincident with a unique DNA methylation pattern revealing a set of DNA methylation markers showing significant (~3 to 30 fold) increase in their methylation levels, exclusively in SSP samples. These methylation patterns effectively distinguished sessile serrated polyps from adenomatous polyps and did so more effectively than parallel gene expression profiles. Conclusions: This study provides an important example of a single oncogenic mutation leading to reproducible global DNA methylation changes. These methylated markers are specific to SSPs and could be of important clinical relevance for the early diagnosis of SSPs using non-invasive approaches such as fecal DNA testing. PMID:29590112
Patterns of DNA barcode variation in Canadian marine molluscs.

PubMed

Layton, Kara K S; Martel, André L; Hebert, Paul D N

2014-01-01

Molluscs are the most diverse marine phylum and this high diversity has resulted in considerable taxonomic problems. Because the number of species in Canadian oceans remains uncertain, there is a need to incorporate molecular methods into species identifications. A 648 base pair segment of the cytochrome c oxidase subunit I gene has proven useful for the identification and discovery of species in many animal lineages. While the utility of DNA barcoding in molluscs has been demonstrated in other studies, this is the first effort to construct a DNA barcode registry for marine molluscs across such a large geographic area. This study examines patterns of DNA barcode variation in 227 species of Canadian marine molluscs. Intraspecific sequence divergences ranged from 0-26.4% and a barcode gap existed for most taxa. Eleven cases of relatively deep (>2%) intraspecific divergence were detected, suggesting the possible presence of overlooked species. Structural variation was detected in COI with indels found in 37 species, mostly bivalves. Some indels were present in divergent lineages, primarily in the region of the first external loop, suggesting certain areas are hotspots for change. Lastly, mean GC content varied substantially among orders (24.5%-46.5%), and showed a significant positive correlation with nearest neighbour distances. DNA barcoding is an effective tool for the identification of Canadian marine molluscs and for revealing possible cases of overlooked species. Some species with deep intraspecific divergence showed a biogeographic partition between lineages on the Atlantic, Arctic and Pacific coasts, suggesting the role of Pleistocene glaciations in the subdivision of their populations. Indels were prevalent in the barcode region of the COI gene in bivalves and gastropods. This study highlights the efficacy of DNA barcoding for providing insights into sequence variation across a broad taxonomic group on a large geographic scale.
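Two of the quantities reported in the abstract above, GC content and pairwise sequence divergence, can be sketched in a few lines of Python. This is a minimal illustration on hypothetical toy sequences; the divergence shown is an uncorrected p-distance that skips gapped sites, which is an assumption for the sketch and not necessarily the distance model used in the study.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among unambiguous A/C/G/T bases."""
    seq = seq.upper()
    acgt = sum(seq.count(b) for b in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (seq.count("G") + seq.count("C")) / acgt


def p_distance(a: str, b: str) -> float:
    """Uncorrected proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') or an N are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        if x != y:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Hypothetical toy sequences, far shorter than a 648-bp barcode.
print(gc_content("ATGC"))            # 0.5
print(p_distance("ACGT", "ACGA"))    # 0.25
```

With real barcodes, a value of `p_distance` above 0.02 within a named species would correspond to the ">2%" deep intraspecific divergence threshold mentioned in the abstract.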
The Chapel Hill hemophilia A dog colony exhibits a factor VIII gene inversion

PubMed Central

Lozier, Jay N.; Dutra, Amalia; Pak, Evgenia; Zhou, Nan; Zheng, Zhili; Nichols, Timothy C.; Bellinger, Dwight A.; Read, Marjorie; Morgan, Richard A.

2002-01-01

In the Chapel Hill colony of factor VIII-deficient dogs, abnormal sequence (ch8, for canine hemophilia 8, GenBank no. AF361485) follows exons 1–22 in the factor VIII transcript in place of exons 23–26. The canine hemophilia 8 locus (ch8) sequence was found in a 140-kb normal dog genomic DNA bacterial artificial chromosome (BAC) clone that was completely outside the factor VIII gene, but not in BAC clones containing the factor VIII gene. The BAC clone that contained ch8 also contained a homologue of F8A (factor 8 associated) sequence, which participates in a common inversion that causes severe hemophilia A in humans. Fluorescence in situ hybridization analysis indicated that exons 1–26 normally proceed sequentially from telomere to centromere at Xq28, and ch8 is telomeric to the factor VIII gene. The appearance of an “upstream” genomic sequence element (ch8) at the end of the aberrant factor VIII transcript suggested that an inversion of genomic DNA replaced factor VIII exons 22–26 with ch8. The F8A sequence appeared also in overlapping normal BAC clones containing factor VIII sequence. We hypothesized that homologous recombination between copies of canine F8A inside and outside the factor VIII gene had occurred, as in human hemophilia A. High-resolution fluorescent in situ hybridization on hemophilia A dog DNA revealed a pattern consistent with this inversion mechanism. We also identified a HindIII restriction fragment length polymorphism of F8A fragments that distinguished hemophilia A, carrier, and normal dogs' DNA. The Chapel Hill hemophilia A dog colony therefore replicates the factor VIII gene inversion commonly seen in humans with severe hemophilia A. PMID:12242334
Methylation-sensitive enrichment of minor DNA alleles using a double-strand DNA-specific nuclease.

PubMed

Liu, Yibin; Song, Chen; Ladas, Ioannis; Fitarelli-Kiehl, Mariana; Makrigiorgos, G Mike

2017-04-07

Aberrant methylation changes, often present in a minor allelic fraction in clinical samples such as plasma-circulating DNA (cfDNA), are potentially powerful prognostic and predictive biomarkers in human disease including cancer. We report on a novel, highly-multiplexed approach to facilitate analysis of clinically useful methylation changes in minor DNA populations. Methylation Specific Nuclease-assisted Minor-allele Enrichment (MS-NaME) employs a double-strand-specific DNA nuclease (DSN) to remove excess DNA with normal methylation patterns. The technique utilizes oligonucleotide probes that direct DSN activity to multiple targets in bisulfite-treated DNA simultaneously. Oligonucleotide probes targeting unmethylated sequences generate local double-stranded regions, resulting in digestion of unmethylated targets and leaving methylated targets intact; and vice versa. Subsequent amplification of the targeted regions results in enrichment of the targeted methylated or unmethylated minority-epigenetic-alleles. We validate MS-NaME by demonstrating enrichment of RARb2, ATM, MGMT and GSTP1 promoters in multiplexed MS-NaME reactions (177-plex) using dilutions of methylated/unmethylated DNA and in DNA from clinical lung cancer samples and matched normal tissue. MS-NaME is a highly scalable single-step approach performed at the genomic DNA level in solution that combines with most downstream detection technologies including Sanger sequencing, methylation-sensitive-high-resolution melting (MS-HRM) and methylation-specific-Taqman-based-digital-PCR (digital Methylight) to boost detection of low-level aberrant methylation-changes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
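The reason probes can discriminate methylated from unmethylated targets in bisulfite-treated DNA is that bisulfite treatment converts unmethylated cytosines to uracil (read as thymine after amplification) while methylated cytosines are retained. The Python sketch below models only that conversion on a single strand; the function name, the toy sequence, and the set of methylated positions are hypothetical, and the sketch does not model probe design or nuclease digestion.

```python
def bisulfite_convert(seq: str, methylated: set[int]) -> str:
    """In silico bisulfite conversion of one DNA strand.

    Cytosines at 0-based positions listed in `methylated` stay 'C';
    every other cytosine is written as 'T'. Non-C bases are unchanged.
    """
    out = []
    for i, base in enumerate(seq.upper()):
        if base == "C" and i not in methylated:
            out.append("T")
        else:
            out.append(base)
    return "".join(out)


# Hypothetical toy sequence with two CpG sites (C at positions 1 and 4).
toy = "ACGTCG"
print(bisulfite_convert(toy, methylated={1}))     # first CpG methylated
print(bisulfite_convert(toy, methylated=set()))   # fully unmethylated
```

After conversion the two alleles differ in sequence, so a probe complementary to one of them forms a perfect duplex only with that allele.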
Programmed self-assembly of DNA/RNA for biomedical applications

NASA Astrophysics Data System (ADS)

Wang, Pengfei

Three self-assembly strategies were utilized for assembly of novel functional DNA/RNA nanostructures. An RNA-DNA hybrid origami method was developed to fabricate nano-objects (ribbon, rectangle, and triangle) with precisely controlled geometry. Unlike conventional DNA origami, which uses a long single DNA strand as scaffold, a long single RNA strand was used instead, which was folded by short single DNA strands (staples) into prescribed objects through sequence-specific hybridization between RNA and DNA. Single-stranded tiles (SST) and RNA-DNA hybrid origami were utilized to fabricate a variety of barcode-like nanostructures with unique patterns by expanding a plain rectangle via introducing spacers (10-bp dsDNA segments) between parallel duplexes. Finally, complex 2D arrays and 3D polyhedra with multiple patterns within one structure were assembled from simple DNA motifs. Two demonstrations of biomedical applications of DNA nanotechnology were presented. Firstly, lambda-DNA was used as a template to direct the fabrication of multi-component magnetic nanoparticle chains. Nuclear magnetic relaxation (NMR) characterization showed superb magnetic relaxivity of the nanoparticle chains, which have large potential to be utilized as MRI contrast agents. Secondly, DNA nanotechnology was introduced into the conformational study of a routinely used catalytic DNAzyme, the RNA-cleaving 10-23 DNAzyme. The relative angle between the two flanking duplexes of the catalytic core was determined (94.8°), which may provide a clue to further understanding of the cleaving mechanism of this DNAzyme from a conformational perspective.
Synthesis of Messenger RNA and Chromosome Structure in the Cellular Slime Mold*

PubMed Central

Lodish, Harvey F.; Jacobson, Allan; Firtel, Richard; Alton, Tom; Tuchman, Jessica

1974-01-01

This paper summarizes our knowledge of the structure and biosynthesis of messenger RNA in the slime mold Dictyostelium discoideum, the arrangement of DNA sequences in the Dictyostelium chromosome, and the changes in the pattern of predominant polypeptides synthesized during Dictyostelium development. PMID:4531041
Genetic Variation in the Acorn Barnacle from Allozymes to Population Genomics

PubMed Central

Flight, Patrick A.; Rand, David M.

2012-01-01

Understanding the patterns of genetic variation within and among populations is a central problem in population and evolutionary genetics. We examine this question in the acorn barnacle, Semibalanus balanoides, in which the allozyme loci Mpi and Gpi have been implicated in balancing selection due to varying selective pressures at different spatial scales. We review the patterns of genetic variation at the Mpi locus, compare this to levels of population differentiation at mtDNA and microsatellites, and place these data in the context of genome-wide variation from high-throughput sequencing of population samples spanning the North Atlantic. Despite considerable geographic variation in the patterns of selection at the Mpi allozyme, this locus shows rather low levels of population differentiation at ecological and trans-oceanic scales (FST ∼ 5%). Pooled population sequencing was performed on samples from Rhode Island (RI), Maine (ME), and Southwold, England (UK). Analysis of more than 650 million reads identified approximately 335,000 high-quality SNPs in 19 million base pairs of the S. balanoides genome. Much variation is shared across the Atlantic, but there are significant examples of strong population differentiation among samples from RI, ME, and UK. An FST outlier screen of more than 22,000 contigs provided a genome-wide context for interpretation of earlier studies on allozymes, mtDNA, and microsatellites. FST values for allozymes, mtDNA and microsatellites are close to the genome-wide average for random SNPs, with the exception of the trans-Atlantic FST for mtDNA. The majority of FST outliers were unique between individual pairs of populations, but some genes show shared patterns of excess differentiation. These data indicate that gene flow is high, that selection is strong on a subset of genes, and that a variety of genes are experiencing diversifying selection at large spatial scales. This survey of polymorphism in S. balanoides provides a number of genomic tools that promise to make this a powerful model for ecological genomics of the rocky intertidal. PMID:22767487
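For readers unfamiliar with FST, the following minimal Python sketch computes it for one biallelic SNP from allele frequencies in two or more populations, using the textbook heterozygosity form FST = (HT − HS) / HT with equal population weights. This estimator, the function name, and the example frequencies are assumptions for illustration; the abstract does not say which estimator was applied to the pooled sequencing data.

```python
def fst_biallelic(freqs: list[float]) -> float:
    """FST for one biallelic SNP from per-population allele frequencies.

    HS is the mean expected heterozygosity within populations and HT the
    expected heterozygosity of the pooled population, with all populations
    weighted equally (an assumption of this sketch).
    """
    if len(freqs) < 2:
        raise ValueError("need at least two populations")
    if any(p < 0.0 or p > 1.0 for p in freqs):
        raise ValueError("allele frequencies must lie in [0, 1]")
    p_bar = sum(freqs) / len(freqs)
    h_t = 2.0 * p_bar * (1.0 - p_bar)
    if h_t == 0.0:
        return 0.0  # allele fixed everywhere: no variation to partition
    h_s = sum(2.0 * p * (1.0 - p) for p in freqs) / len(freqs)
    return (h_t - h_s) / h_t


# Hypothetical allele frequencies, not values from the study.
print(fst_biallelic([0.5, 0.5]))   # identical frequencies: no differentiation
print(fst_biallelic([0.0, 1.0]))   # fixed differences: complete differentiation
print(fst_biallelic([0.4, 0.6]))   # modest differentiation, about 0.04
```

On this scale, the FST of roughly 5% reported for the Mpi locus corresponds to a value near 0.05, similar in magnitude to the last toy case.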
Molecular analysis of RAPD DNA based markers: their potential use for the detection of genetic variability in jojoba (Simmondsia chinensis L Schneider).

PubMed

Amarger, V; Mercier, L

1995-01-01

We have applied the recently developed technique of random amplified polymorphic DNA (RAPD) for the discrimination between two jojoba clones at the genomic level. Among a set of 30 primers tested, a simple reproducible pattern with three distinct fragments for clone D and two distinct fragments for clone E was obtained with primer OPB08. Since RAPD products are the results of arbitrarily priming events and because a given primer can amplify a number of non-homologous sequences, we wondered whether or not RAPD bands, even those of similar size, were derived from different loci in the two clones. To answer this question, two complementary approaches were used: i) cloning and sequencing of the amplification products from clone E; and ii) complementary Southern analysis of RAPD gels using cloned or amplified fragments (directly recovered from agarose gels) as RFLP probes. The data reported here show that the RAPD reaction generates multiple amplified fragments. Some fragments, although resolved as a single band on agarose gels, contain different DNA species of the same size. Furthermore, it appears that the cloned RAPD products of known sequence that do not target repetitive DNA can be used as hybridization probes in RFLP to detect a polymorphism among individuals.
DNA damage and genetic methylation changes caused by Cd in Arabidopsis thaliana seedlings.

PubMed

Li, Zhaoling; Liu, Zhihong; Chen, Ruijuan; Li, Xiaojun; Tai, Peidong; Gong, Zongqiang; Jia, Chunyun; Liu, Wan

2015-09-01

Amplified fragment length polymorphism (AFLP) and methylation-sensitive amplification polymorphism (MSAP) techniques are sensitive to deoxyribonucleic acid (DNA) damage and genetic methylation, respectively. Using these 2 techniques, Arabidopsis thaliana cultured with 0 mg/L (control), 0.5 mg/L, 1.5 mg/L, and 5.0 mg/L Cd(2+) for 16 d was used to analyze the DNA damage and methylation changes as a result of cadmium (Cd). The DNA was amplified by 14 AFLP primer pairs and 13 MSAP primer combinations. In the AFLP experiment, 62 polymorphic sites were found in the patterns of 11 primer combinations and a total of 1116 fragments were obtained in these patterns. There were no polymorphic bands in the remaining 3 pairs. The proportions of polymorphic sites in the 0.5-mg/L Cd(2+) and 5.0-mg/L Cd(2+) treatments were significantly different. Seven polymorphic fragments were then separated and successfully sequenced, yielding 6 nucleobase substitutions and 1 nucleobase deletion. Similarly, in the MSAP experiment, the MSAP% and number of demethylated-type bands were unchanged after Cd treatment, but the number of methylated-type bands was increased significantly in the 5.0-mg/L Cd(2+) treatment group, a finding that may be associated with the AFLP results. The polymorphic bands were also sequenced and the functions of their homologous genes were determined. The DNA damage and methylation changes may be the primary cause of certain pathology changes as a result of Cd uptake in plants. © 2015 SETAC.
Analysis of the DNA-Binding Activities of the Arabidopsis R2R3-MYB Transcription Factor Family by One-Hybrid Experiments in Yeast

PubMed Central

Kelemen, Zsolt; Sebastian, Alvaro; Xu, Wenjia; Grain, Damaris; Salsac, Fabien; Avon, Alexandra; Berger, Nathalie; Tran, Joseph; Dubreucq, Bertrand; Lurin, Claire; Lepiniec, Loïc; Contreras-Moreira, Bruno; Dubos, Christian

2015-01-01

The control of growth and development of all living organisms is a complex and dynamic process that requires the harmonious expression of numerous genes. Gene expression is mainly controlled by the activity of sequence-specific DNA binding proteins called transcription factors (TFs). Amongst the various classes of eukaryotic TFs, the MYB superfamily is one of the largest and most diverse, and it has considerably expanded in the plant kingdom. R2R3-MYBs have been extensively studied over the last 15 years. However, DNA-binding specificity has been characterized for only a small subset of these proteins. Therefore, one of the remaining challenges is the exhaustive characterization of the DNA-binding specificity of all R2R3-MYB proteins. In this study, we have developed a library of Arabidopsis thaliana R2R3-MYB open reading frames, whose DNA-binding activities were assayed in vivo (yeast one-hybrid experiments) with a pool of selected cis-regulatory elements. Altogether, 1904 interactions were assayed, leading to the discovery of specific patterns of interactions between the various R2R3-MYB subgroups and their DNA target sequences and to the identification of key features that govern these interactions. The present work provides a comprehensive in vivo analysis of R2R3-MYB binding activities that should help in predicting new DNA motifs and identifying new putative target genes for each member of this very large family of TFs. In a broader perspective, the generated data will help to better understand how TFs interact with their target DNA sequences. PMID:26484765
