Characterization of HIV Transmission in South-East Austria
Kessler, Harald H.; Haas, Bernhard; Stelzl, Evelyn; Weninger, Karin; Little, Susan J.; Mehta, Sanjay R.
2016-01-01
To gain deeper insight into the epidemiology of HIV-1 transmission in South-East Austria we performed a retrospective analysis of 259 HIV-1 partial pol sequences obtained from unique individuals newly diagnosed with HIV infection in South-East Austria from 2008 through 2014. After quality filtering, putative transmission linkages were inferred when two sequences were ≤1.5% genetically different. Multiple linkages were resolved into putative transmission clusters. Further phylogenetic analyses were performed using BEAST v1.8.1. Finally, we investigated putative links between the 259 sequences from South-East Austria and all publicly available HIV polymerase sequences in the Los Alamos National Laboratory HIV sequence database. We found that 45.6% (118/259) of the sampled sequences were genetically linked with at least one other sequence from South-East Austria forming putative transmission clusters. Clustering individuals were more likely to be men who have sex with men (MSM; p<0.001), infected with subtype B (p<0.001) or subtype F (p = 0.02). Among clustered males who reported only heterosexual (HSX) sex as an HIV risk, 47% clustered closely with MSM (either as pairs or within larger MSM clusters). One hundred and seven of the 259 sequences (41.3%) from South-East Austria had at least one putative inferred linkage with sequences from a total of 69 other countries. In conclusion, analysis of HIV-1 sequences from newly diagnosed individuals residing in South-East Austria revealed a high degree of national and international clustering mainly within MSM. Interestingly, we found that a high number of heterosexual males clustered within MSM networks, suggesting either linkage between risk groups or misrepresentation of sexual risk behaviors by subjects. PMID:26967154
Characterization of HIV Transmission in South-East Austria.
Hoenigl, Martin; Chaillon, Antoine; Kessler, Harald H; Haas, Bernhard; Stelzl, Evelyn; Weninger, Karin; Little, Susan J; Mehta, Sanjay R
2016-01-01
To gain deeper insight into the epidemiology of HIV-1 transmission in South-East Austria we performed a retrospective analysis of 259 HIV-1 partial pol sequences obtained from unique individuals newly diagnosed with HIV infection in South-East Austria from 2008 through 2014. After quality filtering, putative transmission linkages were inferred when two sequences were ≤1.5% genetically different. Multiple linkages were resolved into putative transmission clusters. Further phylogenetic analyses were performed using BEAST v1.8.1. Finally, we investigated putative links between the 259 sequences from South-East Austria and all publicly available HIV polymerase sequences in the Los Alamos National Laboratory HIV sequence database. We found that 45.6% (118/259) of the sampled sequences were genetically linked with at least one other sequence from South-East Austria forming putative transmission clusters. Clustering individuals were more likely to be men who have sex with men (MSM; p<0.001), infected with subtype B (p<0.001) or subtype F (p = 0.02). Among clustered males who reported only heterosexual (HSX) sex as an HIV risk, 47% clustered closely with MSM (either as pairs or within larger MSM clusters). One hundred and seven of the 259 sequences (41.3%) from South-East Austria had at least one putative inferred linkage with sequences from a total of 69 other countries. In conclusion, analysis of HIV-1 sequences from newly diagnosed individuals residing in South-East Austria revealed a high degree of national and international clustering mainly within MSM. Interestingly, we found that a high number of heterosexual males clustered within MSM networks, suggesting either linkage between risk groups or misrepresentation of sexual risk behaviors by subjects.
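The linkage rule described in this abstract (link two sequences when they differ by at most 1.5%, then resolve multiple linkages into clusters) can be illustrated with a small sketch. This is not the authors' pipeline: the abstract does not name the distance model, so the plain proportion of differing sites (`p_distance`) is an assumption, and the function names and the union-find grouping are hypothetical choices made for illustration.

```python
from itertools import combinations

def p_distance(a, b):
    """Fraction of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)

def transmission_clusters(seqs, threshold=0.015):
    """Link sequences whose distance is <= threshold (assumed p-distance),
    then return the connected components that contain at least two sequences."""
    parent = {name: name for name in seqs}

    def find(x):
        # Follow parent pointers to the component root, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(seqs, 2):
        if p_distance(seqs[a], seqs[b]) <= threshold:
            parent[find(a)] = find(b)  # union the two components

    groups = {}
    for name in seqs:
        groups.setdefault(find(name), set()).add(name)
    return [g for g in groups.values() if len(g) > 1]
```

Because clusters are connected components, two sequences that are more than 1.5% apart can still share a cluster when a third sequence links them both.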
NASA Astrophysics Data System (ADS)
Rahman, Md. Habibur; Matin, M. A.; Salma, Umma
2017-12-01
The precipitation patterns of seventeen locations in Bangladesh from 1961 to 2014 were studied using a cluster analysis and metric multidimensional scaling. In doing so, the current research applies four major hierarchical clustering methods to precipitation in conjunction with different dissimilarity measures and metric multidimensional scaling. A variety of clustering algorithms were used to provide multiple clustering dendrograms for a mixture of distance measures. The dendrogram of pre-monsoon rainfall for the seventeen locations formed five clusters. The pre-monsoon precipitation data for the areas of Srimangal and Sylhet were located in two clusters across the combination of five dissimilarity measures and four hierarchical clustering algorithms. The single linkage algorithm with Euclidean and Manhattan distances, the average linkage algorithm with the Minkowski distance, and Ward's linkage algorithm provided similar results with regard to monsoon precipitation. The results of the post-monsoon and winter precipitation data are shown in different types of dendrograms with disparate combinations of sub-clusters. The schematic geometrical representations of the precipitation data using metric multidimensional scaling showed that the post-monsoon rainfall of Cox's Bazar was located far from those of the other locations. The results of a box-and-whisker plot, different clustering techniques, and metric multidimensional scaling indicated that the precipitation behaviour of Srimangal and Sylhet during the pre-monsoon season, Cox's Bazar and Sylhet during the monsoon season, Maijdi Court and Cox's Bazar during the post-monsoon season, and Cox's Bazar and Khulna during the winter differed from those at other locations in Bangladesh.
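To make the idea of pairing a linkage rule with a dissimilarity measure concrete, here is a minimal sketch of agglomerative clustering in which both are interchangeable. It covers only single, complete, and average linkage with Euclidean and Manhattan distances; Ward's linkage and the Minkowski distance mentioned in the abstract are omitted. All names are hypothetical and the naive search is meant for small inputs only.

```python
def euclidean(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5

def manhattan(u, v):
    return sum(abs(a - b) for a, b in zip(u, v))

# Each linkage reduces the list of all cross-cluster point distances to one number.
LINKAGES = {
    "single": min,
    "complete": max,
    "average": lambda ds: sum(ds) / len(ds),
}

def agglomerate(points, n_clusters, metric=euclidean, linkage="average"):
    """Repeatedly merge the two closest clusters until n_clusters remain.
    Returns the clusters as sorted lists of point indices."""
    combine = LINKAGES[linkage]
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                ds = [metric(points[a], points[b])
                      for a in clusters[i] for b in clusters[j]]
                d = combine(ds)
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]
```

On well-separated data every combination gives the same partition; the choices start to disagree when clusters are elongated or unevenly sized, which is the kind of sensitivity a study comparing several combinations is probing.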
Wolf, Antje; Kirschner, Karl N
2013-02-01
With improvements in computer speed and algorithm efficiency, MD simulations are sampling larger amounts of molecular and biomolecular conformations. Being able to qualitatively and quantitatively sift these conformations into meaningful groups is a difficult and important task, especially when considering the structure-activity paradigm. Here we present a study that combines two popular techniques, principal component (PC) analysis and clustering, for revealing major conformational changes that occur in molecular dynamics (MD) simulations. Specifically, we explored how clustering different PC subspaces affects the resulting clusters versus clustering the complete trajectory data. As a case example, we used the trajectory data from an explicitly solvated simulation of a bacteria's L11·23S ribosomal subdomain, which is a target of thiopeptide antibiotics. Clustering was performed, using K-means and average-linkage algorithms, on data involving the first two to the first five PC subspace dimensions. For the average-linkage algorithm we found that data-point membership, cluster shape, and cluster size depended on the selected PC subspace data. In contrast, K-means provided very consistent results regardless of the selected subspace. Since we present results on a single model system, generalization concerning the clustering of different PC subspaces of other molecular systems is currently premature. However, our hope is that this study illustrates a) the complexities in selecting the appropriate clustering algorithm, b) the complexities in interpreting and validating their results, and c) by combining PC analysis with subsequent clustering valuable dynamic and conformational information can be obtained.
Determining the trophic guilds of fishes and macroinvertebrates in a seagrass food web
Luczkovich, J.J.; Ward, G.P.; Johnson, J.C.; Christian, R.R.; Baird, D.; Neckles, H.; Rizzo, W.M.
2002-01-01
We established trophic guilds of macroinvertebrate and fish taxa using correspondence analysis and a hierarchical clustering strategy for a seagrass food web in winter in the northeastern Gulf of Mexico. To create the diet matrix, we characterized the trophic linkages of macroinvertebrate and fish taxa present in Halodule wrightii seagrass habitat areas within the St. Marks National Wildlife Refuge (Florida) using binary data, combining dietary links obtained from relevant literature for macroinvertebrates with stomach analysis of common fishes collected during January and February of 1994. Hierarchical average-linkage cluster analysis of the 73 taxa of fishes and macroinvertebrates in the diet matrix yielded 14 clusters with diet similarity ≥ 0.60. We then used correspondence analysis with three factors to jointly plot the coordinates of the consumers (identified by cluster membership) and of the 33 food sources. Correspondence analysis served as a visualization tool for assigning each taxon to one of eight trophic guilds: herbivores, detritivores, suspension feeders, omnivores, molluscivores, meiobenthos consumers, macrobenthos consumers, and piscivores. These trophic groups, cross-classified with major taxonomic groups, were further used to develop consumer compartments in a network analysis model of carbon flow in this seagrass ecosystem. The method presented here should greatly improve the development of future network models of food webs by providing an objective procedure for aggregating trophic groups.
Hahus, Ian; Migliaccio, Kati; Douglas-Mankin, Kyle; Klarenberg, Geraldine; Muñoz-Carpena, Rafael
2018-04-27
Hierarchical and partitional cluster analyses were used to compartmentalize Water Conservation Area 1, a managed wetland within the Arthur R. Marshall Loxahatchee National Wildlife Refuge in southeast Florida, USA, based on physical, biological, and climatic geospatial attributes. Single, complete, average, and Ward's linkages were tested during the hierarchical cluster analyses, with average linkage providing the best results. In general, the partitional method, partitioning around medoids, found clusters that were more evenly sized and more spatially aggregated than those resulting from the hierarchical analyses. However, hierarchical analysis appeared to be better suited to identify outlier regions that were significantly different from other areas. The clusters identified by geospatial attributes were similar to clusters developed for the interior marsh in a separate study using water quality attributes, suggesting that similar factors have influenced variations in both the set of physical, biological, and climatic attributes selected in this study and water quality parameters. However, geospatial data allowed further subdivision of several interior marsh clusters identified from the water quality data, potentially indicating zones with important differences in function. Identification of these zones can be useful to managers and modelers by informing the distribution of monitoring equipment and personnel as well as delineating regions that may respond similarly to future changes in management or climate.
Environmental Gradient Analysis, Ordination, and Classification in Environmental Impact Assessments.
1987-09-01
agglomerative clustering algorithms for mainframe computers: (1) the unweighted pair-group method that uses arithmetic averages (UPGMA), (2) the...hierarchical agglomerative unweighted pair-group method using arithmetic averages (UPGMA), which is also called average linkage clustering. This method was...dendrograms produced by weighted clustering (93). Sneath and Sokal (94), Romesburg (84), and Seber (90) also strongly recommend the UPGMA. A dendrogram
NASA Astrophysics Data System (ADS)
Chuan, Zun Liang; Ismail, Noriszura; Shinyie, Wendy Ling; Lit Ken, Tan; Fam, Soo-Fen; Senawi, Azlyna; Yusoff, Wan Nur Syahidah Wan
2018-04-01
Due to the limited availability of historical precipitation records, agglomerative hierarchical clustering algorithms are widely used to extrapolate information from gauged to ungauged precipitation catchments, yielding more reliable projections of extreme hydro-meteorological events such as extreme precipitation events. However, accurately identifying the optimum number of homogeneous precipitation catchments from the dendrogram produced by agglomerative hierarchical algorithms is very subjective. The main objective of this study is to propose an efficient regionalized algorithm to identify the homogeneous precipitation catchments for non-stationary precipitation time series. The homogeneous precipitation catchments are identified using an average linkage hierarchical clustering algorithm combined with multi-scale bootstrap resampling, with the uncentered correlation coefficient as the similarity measure. The regionalized homogeneous precipitation is consolidated using the K-sample Anderson-Darling non-parametric test. The analysis results show that the proposed regionalized algorithm performed better than the agglomerative hierarchical clustering algorithms proposed in previous studies.
2005-04-01
Bray-Curtis distance measure with an Unweighted Pair Group Method with Arithmetic Averages (UPGMA) linkage method to perform a cluster analysis of the...59 35 Comparison of reef condition indicators clustering by UPGMA analysis...Polyvinyl Chloride RBD Red-band Disease SACEX Supporting Arms Coordination Exercise SAV Submerged Aquatic Vegetation SD Standard Deviation UPGMA
Determining the trophic guilds of fishes and macroinvertebrates in a seagrass food web
Luczkovich, J.J.; Ward, G.P.; Johnson, J.C.; Christian, R.R.; Baird, D.; Neckles, H.; Rizzo, W.M.
2002-01-01
We established trophic guilds of macroinvertebrate and fish taxa using correspondence analysis and a hierarchical clustering strategy for a seagrass food web in winter in the northeastern Gulf of Mexico. To create the diet matrix, we characterized the trophic linkages of macroinvertebrate and fish taxa present in Halodule wrightii seagrass habitat areas within the St. Marks National Wildlife Refuge (Florida) using binary data, combining dietary links obtained from relevant literature for macroinvertebrates with stomach analysis of common fishes collected during January and February of 1994. Hierarchical average-linkage cluster analysis of the 73 taxa of fishes and macroinvertebrates in the diet matrix yielded 14 clusters with diet similarity greater than or equal to 0.60. We then used correspondence analysis with three factors to jointly plot the coordinates of the consumers (identified by cluster membership) and of the 33 food sources. Correspondence analysis served as a visualization tool for assigning each taxon to one of eight trophic guilds: herbivores, detritivores, suspension feeders, omnivores, molluscivores, meiobenthos consumers, macrobenthos consumers, and piscivores. These trophic groups, cross-classified with major taxonomic groups, were further used to develop consumer compartments in a network analysis model of carbon flow in this seagrass ecosystem. The method presented here should greatly improve the development of future network models of food webs by providing an objective procedure for aggregating trophic groups.
Structure and gene cluster of the O-antigen of Escherichia coli O54.
Naumenko, Olesya I; Guo, Xi; Senchenkova, Sof'ya N; Geng, Peng; Perepelov, Andrei V; Shashkov, Alexander S; Liu, Bin; Knirel, Yuriy A
2018-06-15
Mild acid hydrolysis of the lipopolysaccharide of Escherichia coli O54 afforded an O-polysaccharide, which was studied by sugar analysis, solvolysis with anhydrous trifluoroacetic acid, and ¹H and ¹³C NMR spectroscopy. Solvolysis cleaved predominantly the linkage of β-d-Ribf and, to a lesser extent, that of β-d-GlcpNAc, whereas the other linkages, including the linkage of α-l-Rhap, were stable under selected conditions (40 °C, 5 h). The following structure of the O-polysaccharide was established: →4)-α-d-GalpA-(1 → 2)-α-l-Rhap-(1 → 2)-β-d-Ribf-(1 → 4)-β-d-Galp-(1 → 3)-β-d-GlcpNAc-(1→. The O-antigen gene cluster of E. coli O54 was analyzed and found to be consistent in general with the O-polysaccharide structure established but there were two exceptions: i) in the cluster, there were genes for phosphoserine phosphatase and serine transferase, which have no apparent role in the O-polysaccharide synthesis, and ii) no ribofuranosyltransferase gene was present in the cluster. Both uncommon features are shared by some other enteric bacteria.
Efficient Record Linkage Algorithms Using Complete Linkage Clustering.
Mamun, Abdullah-Al; Aseltine, Robert; Rajasekaran, Sanguthevar
2016-01-01
Datasets from different agencies often contain records of the same individuals. Linking these datasets to identify all the records belonging to the same individuals is a crucial and challenging problem, especially given the large volumes of data. A large number of available algorithms for record linkage are prone to either time inefficiency or low accuracy in finding matches and non-matches among the records. In this paper we propose efficient as well as reliable sequential and parallel algorithms for the record linkage problem employing hierarchical clustering methods. We employ complete linkage hierarchical clustering algorithms to address this problem. In addition to hierarchical clustering, we also use two other techniques: elimination of duplicate records and blocking. Our algorithms use sorting as a sub-routine to identify identical copies of records. We have tested our algorithms on datasets with millions of synthetic records. Experimental results show that our algorithms achieve nearly 100% accuracy. Parallel implementations achieve almost linear speedups. Time complexities of these algorithms do not exceed those of previous best-known algorithms. Our proposed algorithms outperform previous best-known algorithms in terms of accuracy while consuming reasonable run times.
Efficient Record Linkage Algorithms Using Complete Linkage Clustering
Mamun, Abdullah-Al; Aseltine, Robert; Rajasekaran, Sanguthevar
2016-01-01
Datasets from different agencies often contain records of the same individuals. Linking these datasets to identify all the records belonging to the same individuals is a crucial and challenging problem, especially given the large volumes of data. A large number of available algorithms for record linkage are prone to either time inefficiency or low accuracy in finding matches and non-matches among the records. In this paper we propose efficient as well as reliable sequential and parallel algorithms for the record linkage problem employing hierarchical clustering methods. We employ complete linkage hierarchical clustering algorithms to address this problem. In addition to hierarchical clustering, we also use two other techniques: elimination of duplicate records and blocking. Our algorithms use sorting as a sub-routine to identify identical copies of records. We have tested our algorithms on datasets with millions of synthetic records. Experimental results show that our algorithms achieve nearly 100% accuracy. Parallel implementations achieve almost linear speedups. Time complexities of these algorithms do not exceed those of previous best-known algorithms. Our proposed algorithms outperform previous best-known algorithms in terms of accuracy while consuming reasonable run times. PMID:27124604
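The three ingredients named in this abstract (sorting to eliminate identical copies, blocking, and complete linkage clustering) can be combined in a small sketch. This is an illustration under stated assumptions, not the authors' algorithm: records are plain strings, the Levenshtein edit distance is an assumed similarity measure, the default blocking key (first character) and the `threshold` parameter are hypothetical, and no attention is paid to the efficiency or parallelism the paper is about.

```python
from itertools import groupby

def edit_distance(a, b):
    """Levenshtein distance using a rolling dynamic-programming row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution or match
        prev = cur
    return prev[-1]

def link_records(records, threshold=2, block_key=lambda r: r[0]):
    # Step 1: sort so identical copies become adjacent, then keep one of each.
    unique = [rec for rec, _ in groupby(sorted(records))]
    # Step 2: blocking; only records sharing a block key are ever compared.
    blocks = {}
    for rec in unique:
        blocks.setdefault(block_key(rec), []).append(rec)
    # Step 3: complete linkage inside each block; merge the closest pair of
    # clusters while their largest pairwise distance stays within threshold.
    result = []
    for members in blocks.values():
        clusters = [[m] for m in members]
        while len(clusters) > 1:
            d, i, j = min(
                (max(edit_distance(a, b) for a in clusters[i] for b in clusters[j]), i, j)
                for i in range(len(clusters))
                for j in range(i + 1, len(clusters))
            )
            if d > threshold:
                break
            clusters[i] += clusters[j]
            del clusters[j]
        result.extend(clusters)
    return result
```

Complete linkage guarantees that every pair of records inside a returned cluster is within `threshold` of each other, which is stricter than the chaining behaviour of single linkage.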
Polymorphisms and linkage analysis for ICAM-1 and the selectin gene cluster
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vora, D.K.; Rosenbloom, C.L.; Cottingham, R.W.
1994-06-01
Genetic polymorphisms in leukocyte and endothelial cell adhesion molecules may be important variables with regard to susceptibility to multifactorial disease processes that include an inflammatory component. For this reason, polymorphisms were sought for intercellular adhesion molecule-1 (ICAM-1; gene symbol ICAM1) and for the three genes in the selectin cluster, P-selectin, L-selectin, and E-selectin (gene symbols SELP, SELL, and SELE, respectively). Two amino acid polymorphisms were identified for ICAM-1; Gly or Arg at codon 241 and Lys or Glu at codon 469. Dinucleotide repeat polymorphisms were identified in the 3′-untranslated region for ICAM-1 and in intron 9 for P-selectin. Restriction fragment length polymorphisms were found using cDNAs for each of the three selectin genes as probes; E-selectin with BglII, P-selectin with ScaI, and L-selectin with HincII. Linkage analysis was performed for the selectin gene cluster and for ICAM-1 using the CEPH families; ICAM-1 is very tightly linked to the LDL receptor on chromosome 19, and the selectin cluster is linked to markers at chromosome 1q23. 41 refs., 2 tabs.
Genetic heterogeneity in families with non-epidermolytic palmar plantar keratosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Spurr, N.K.; Kelshell, D.P.; Stevens, H.
1994-09-01
Following reports of linkage close to the keratin gene cluster in families with tylosis and the detection of mutations in the keratin 9 gene cosegregating in families with epidermolytic palmar plantar keratoderma (EPPK, and EPPK associated with breast and ovarian cancer), we have identified families with three phenotypically distinct forms of non-epidermolytic keratosis with either punctate, diffuse or focal keratoderma, one with diffuse lesions and one with punctate and malignancies. Initially we typed these families with 17q markers close to the keratin gene cluster; this included a dinucleotide repeat marker within the keratin 9 gene. Two-point linkage analysis of the focal keratoderma family showed a positive lod score of 3.2 at a theta of 0 from the marker D17S855. The lod score for the diffuse family was -6.0 at a theta of 0.05 from the marker D17S776. The second focal keratoderma family showed a haplotype consistent with linkage to 17q close to the keratin gene cluster. A second keratin gene cluster has been mapped in humans on 12q, and we decided to test the unlinked diffuse and punctate keratoderma families with markers in that region. We used the markers: D12S87-D12S85-D12S368-D12S96-D12S90. Linkage analysis of the diffuse family gave a lod score of 3.1 at a theta of 0 from the marker D12S368. Currently studies are underway to look for mutations in specific keratin genes in the clusters on 17q and 12q that segregate with the observed phenotypes. The punctate keratoderma family gave lod scores of -3.9 at a theta of 0.55 with D17S855 and -6.0 at a theta of 0.05 with D12S90/D12S83. This would lead us to the conclusion that a separate susceptibility locus must exist for the punctate family associated with malignancy. Investigations of candidate regions are in progress.
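For readers unfamiliar with lod scores, the textbook two-point formula for phase-known meioses compares the likelihood of the data at a recombination fraction theta with the likelihood under free recombination (theta = 0.5). The sketch below implements only that simplified case; the abstract does not say how its lod scores were computed, and real pedigree analyses handle unknown phase and missing genotypes with dedicated software. The function name and arguments are hypothetical.

```python
import math

def lod(theta, recombinants, meioses):
    """Two-point lod score for phase-known meioses:
    log10 of L(theta) / L(0.5), with L(theta) = theta^r * (1 - theta)^(n - r)."""
    r, n = recombinants, meioses
    likelihood = theta ** r * (1 - theta) ** (n - r)
    if likelihood == 0:
        # e.g. theta = 0 with at least one observed recombinant
        return float("-inf")
    return math.log10(likelihood) - n * math.log10(0.5)
```

In this simplified setting, a family with no recombinants contributes log10(2), about 0.3, per informative meiosis at theta = 0, so ten such meioses give a lod score of roughly 3.0.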
NASA Astrophysics Data System (ADS)
Basalto, Nicolas; Bellotti, Roberto; de Carlo, Francesco; Facchi, Paolo; Pantaleo, Ester; Pascazio, Saverio
2008-10-01
A clustering algorithm based on the Hausdorff distance is analyzed and compared to the single, complete, and average linkage algorithms. The four clustering procedures are applied to a toy example and to the time series of financial data. The dendrograms are scrutinized and their features compared. The Hausdorff linkage relies on firm mathematical grounds and turns out to be very effective when one has to discriminate among complex structures.
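As a point of reference for the comparison described here, the standard Hausdorff distance between two finite point sets can be written next to the single, complete, and average linkage distances. This is a sketch of the usual definitions only; whether the paper's Hausdorff linkage matches this formulation exactly is an assumption, and the function names are hypothetical.

```python
def single(A, B, dist):
    """Smallest distance between any point of A and any point of B."""
    return min(dist(a, b) for a in A for b in B)

def complete(A, B, dist):
    """Largest distance between any point of A and any point of B."""
    return max(dist(a, b) for a in A for b in B)

def average(A, B, dist):
    """Mean of all cross-set distances."""
    return sum(dist(a, b) for a in A for b in B) / (len(A) * len(B))

def hausdorff(A, B, dist):
    """Largest distance from a point in one set to its nearest point in the other."""
    def directed(X, Y):
        return max(min(dist(x, y) for y in Y) for x in X)
    return max(directed(A, B), directed(B, A))
```

For A = [0, 1] and B = [2, 10] on the real line with absolute difference as the distance, the four measures give 1 (single), 10 (complete), 5.5 (average), and 9 (Hausdorff): the Hausdorff value is driven by the point 10, whose nearest neighbour in A is 9 away.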
Hierarchic Agglomerative Clustering Methods for Automatic Document Classification.
ERIC Educational Resources Information Center
Griffiths, Alan; And Others
1984-01-01
Considers classifications produced by application of single linkage, complete linkage, group average, and word clustering methods to Keen and Cranfield document test collections, and studies structure of hierarchies produced, extent to which methods distort input similarity matrices during classification generation, and retrieval effectiveness…
Designing Trend-Monitoring Sounds for Helicopters: Methodological Issues and an Application
ERIC Educational Resources Information Center
Edworthy, Judy; Hellier, Elizabeth; Aldrich, Kirsteen; Loxley, Sarah
2004-01-01
This article explores methodological issues in sonification and sound design arising from the design of helicopter monitoring sounds. Six monitoring sounds (each with 5 levels) were tested for similarity and meaning with 3 different techniques: hierarchical cluster analysis, linkage analysis, and multidimensional scaling. In Experiment 1,…
NASA Astrophysics Data System (ADS)
Crawford, I.; Ruske, S.; Topping, D. O.; Gallagher, M. W.
2015-07-01
In this paper we present improved methods for discriminating and quantifying Primary Biological Aerosol Particles (PBAP) by applying hierarchical agglomerative cluster analysis to multi-parameter ultra violet-light induced fluorescence (UV-LIF) spectrometer data. The methods employed in this study can be applied to data sets in excess of 1×10⁶ points on a desktop computer, allowing for each fluorescent particle in a dataset to be explicitly clustered. This reduces the potential for misattribution found in subsampling and comparative attribution methods used in previous approaches, improving our capacity to discriminate and quantify PBAP meta-classes. We evaluate the performance of several hierarchical agglomerative cluster analysis linkages and data normalisation methods using laboratory samples of known particle types and an ambient dataset. Fluorescent and non-fluorescent polystyrene latex spheres were sampled with a Wideband Integrated Bioaerosol Spectrometer (WIBS-4) where the optical size, asymmetry factor and fluorescent measurements were used as inputs to the analysis package. It was found that the Ward linkage with z-score or range normalisation performed best, correctly attributing 98 and 98.1 % of the data points respectively. The best performing methods were applied to the BEACHON-RoMBAS ambient dataset where it was found that the z-score and range normalisation methods yield similar results with each method producing clusters representative of fungal spores and bacterial aerosol, consistent with previous results. The z-score result was compared to clusters generated with previous approaches (WIBS AnalysiS Program, WASP) where we observe that the subsampling and comparative attribution method employed by WASP results in the overestimation of the fungal spore concentration by a factor of 1.5 and the underestimation of bacterial aerosol concentration by a factor of 5. We suggest that this is likely due to errors arising from misattribution due to poor centroid definition and failure to assign particles to a cluster as a result of the subsampling and comparative attribution method employed by WASP. The methods used here allow for the entire fluorescent population of particles to be analysed yielding an explicit cluster attribution for each particle, improving cluster centroid definition and our capacity to discriminate and quantify PBAP meta-classes compared to previous approaches.
Campa, Ana; Trabanco, Noemí; Ferreira, Juan José
2017-12-01
The correct identification of the anthracnose resistance systems present in the common bean cultivars AB136 and MDRK is important because both are included in the set of 12 differential cultivars proposed for use in classifying the races of the anthracnose causal agent, Colletotrichum lindemuthianum. In this work, the responses against seven C. lindemuthianum races were analyzed in a recombinant inbred line population derived from the cross AB136 × MDRK. A genetic linkage map of 100 molecular markers distributed across the 11 bean chromosomes was developed in this population to locate the gene or genes conferring resistance against each race, based on linkage analyses and χ² tests of independence. The identified anthracnose resistance genes were organized in clusters. Two clusters were found in AB136: one located on linkage group Pv07, which corresponds to the anthracnose resistance cluster Co-5, and the other located at the end of linkage group Pv11, which corresponds to the Co-2 cluster. The presence of resistance genes at the Co-5 cluster in AB136 was validated through an allelism test conducted in the F2 population TU × AB136. The presence of resistance genes at the Co-2 cluster in AB136 was validated through genetic dissection using the F2:3 population ABM3 × MDRK, in which it was directly mapped to a genomic position between 46.01 and 47.77 Mb of chromosome Pv11. In MDRK, two independent clusters were identified: one located on linkage group Pv01, corresponding to the Co-1 cluster, and the second located on LG Pv04, corresponding to the Co-3 cluster. This report enhances the understanding of the race-specific Phaseolus vulgaris-C. lindemuthianum interactions and will be useful in breeding programs.
Saeed, Mohammad
2017-05-01
Systemic lupus erythematosus (SLE) is a complex disorder. Genetic association studies of complex disorders suffer from the following three major issues: phenotypic heterogeneity, false positive (type I error), and false negative (type II error) results. Hence, genes with low to moderate effects are missed in standard analyses, especially after statistical corrections. OASIS is a novel linkage disequilibrium clustering algorithm that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders such as SLE. OASIS was applied to two SLE dbGAP GWAS datasets (6077 subjects; ∼0.75 million single-nucleotide polymorphisms). OASIS identified three known SLE genes viz. IFIH1, TNIP1, and CD44, not previously reported using these GWAS datasets. In addition, 22 novel loci for SLE were identified and the 5 SLE genes previously reported using these datasets were verified. OASIS methodology was validated using single-variant replication and gene-based analysis with GATES. This led to the verification of 60% of OASIS loci. New SLE genes that OASIS identified and were further verified include TNFAIP6, DNAJB3, TTF1, GRIN2B, MON2, LATS2, SNX6, RBFOX1, NCOA3, and CHAF1B. This study presents the OASIS algorithm, software, and the meta-analyses of two publicly available SLE GWAS datasets along with the novel SLE genes. Hence, OASIS is a novel linkage disequilibrium clustering method that can be universally applied to existing GWAS datasets for the identification of new genes.
NASA Astrophysics Data System (ADS)
Crawford, I.; Ruske, S.; Topping, D. O.; Gallagher, M. W.
2015-11-01
In this paper we present improved methods for discriminating and quantifying primary biological aerosol particles (PBAPs) by applying hierarchical agglomerative cluster analysis to multi-parameter ultraviolet-light-induced fluorescence (UV-LIF) spectrometer data. The methods employed in this study can be applied to data sets in excess of 1 × 10⁶ points on a desktop computer, allowing for each fluorescent particle in a data set to be explicitly clustered. This reduces the potential for misattribution found in subsampling and comparative attribution methods used in previous approaches, improving our capacity to discriminate and quantify PBAP meta-classes. We evaluate the performance of several hierarchical agglomerative cluster analysis linkages and data normalisation methods using laboratory samples of known particle types and an ambient data set. Fluorescent and non-fluorescent polystyrene latex spheres were sampled with a Wideband Integrated Bioaerosol Spectrometer (WIBS-4) where the optical size, asymmetry factor and fluorescent measurements were used as inputs to the analysis package. It was found that the Ward linkage with z-score or range normalisation performed best, correctly attributing 98 and 98.1 % of the data points respectively. The best-performing methods were applied to the BEACHON-RoMBAS (Bio-hydro-atmosphere interactions of Energy, Aerosols, Carbon, H2O, Organics and Nitrogen-Rocky Mountain Biogenic Aerosol Study) ambient data set, where it was found that the z-score and range normalisation methods yield similar results, with each method producing clusters representative of fungal spores and bacterial aerosol, consistent with previous results. The z-score result was compared to clusters generated with previous approaches (WIBS AnalysiS Program, WASP) where we observe that the subsampling and comparative attribution method employed by WASP results in the overestimation of the fungal spore concentration by a factor of 1.5 and the underestimation of bacterial aerosol concentration by a factor of 5. We suggest that this is likely due to errors arising from misattribution due to poor centroid definition and failure to assign particles to a cluster as a result of the subsampling and comparative attribution method employed by WASP. The methods used here allow for the entire fluorescent population of particles to be analysed, yielding an explicit cluster attribution for each particle and improving cluster centroid definition and our capacity to discriminate and quantify PBAP meta-classes compared to previous approaches.
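The two normalisation schemes named in this abstract have standard definitions, sketched below for a single measurement column. This shows the normalisation step only, not the Ward clustering or the authors' analysis package. The function names are hypothetical, and using the population standard deviation for the z-score is an assumption, since the abstract does not specify which estimator was used.

```python
from statistics import mean, pstdev

def z_score(column):
    """Centre on the mean and scale by the (population) standard deviation.
    Raises ZeroDivisionError for a constant column."""
    mu, sigma = mean(column), pstdev(column)
    return [(x - mu) / sigma for x in column]

def range_normalise(column):
    """Rescale linearly so the minimum maps to 0 and the maximum to 1.
    Raises ZeroDivisionError for a constant column."""
    lo, hi = min(column), max(column)
    return [(x - lo) / (hi - lo) for x in column]
```

Normalising each input (for example optical size, asymmetry factor, and each fluorescence channel) before clustering keeps a variable with a large numeric range from dominating the distance calculation.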
Lange, Ethan; Borresen, Anna-Lise; Chen, Xiaoguang; Chessa, Luciana; Chiplunkar, Sujata; Concannon, Patrick; Dandekar, Sugandha; Gerken, Steven; Lange, Kenneth; Liang, Teresa; McConville, Carmel; Polakow, Jeff; Porras, Oscar; Rotman, Galit; Sanal, Ozden; Sheikhavandi, Sepideh; Shiloh, Yosef; Sobel, Eric; Taylor, Malcolm; Telatar, Milhan; Teraoka, Sharon; Tolun, Aslihan; Udar, Nitin; Uhrhammer, Nancy; Vanagaite, Lina; Wang, Zhijun; Wapelhorst, Beth; Wright, Jocyndra; Yang, Huan-Ming; Yang, Lan; Ziv, Yael; Gatti, Richard A.
1995-01-01
We describe a 20-point linkage analysis map of chromosome 11q22-23 that is based on genotyping 249 families (59 CEPH and 190 A-T). Monte Carlo linkage analyses of 176 ataxia-telangiectasia (A-T) families localize the major A-T locus to the region between S1819(A4) and S1818(A2). When seven nonlinking families were excluded from subsequent analyses, a 2-lod support interval of ∼500 kb was identified between S1819(A4) and S1294. No recombinants were observed between A-T and markers S384, B7, S535, or S1294. Only 17 of the international consortium families have been assigned to complementation groups. The available evidence favors either a cluster of A-T genes on chromosome 11 or intragenic defects in a single gene. PMID:7611279
Hebbian self-organizing integrate-and-fire networks for data clustering.
Landis, Florian; Ott, Thomas; Stoop, Ruedi
2010-01-01
We propose a Hebbian learning-based data clustering algorithm using spiking neurons. The algorithm is capable of distinguishing between clusters and noisy background data and finds an arbitrary number of clusters of arbitrary shape. These properties render the approach particularly useful for visual scene segmentation into arbitrarily shaped homogeneous regions. We present several application examples, and in order to highlight the advantages and the weaknesses of our method, we systematically compare the results with those from standard methods such as the k-means and Ward's linkage clustering. The analysis demonstrates not only that the clustering ability of the proposed algorithm is more powerful than that of the two competing methods, but also that its time complexity is more modest than that of its strongest commonly used competitor.
Ransome, Yusuf; Dean, Lorraine T; Crawford, Natalie D; Metzger, David S; Blank, Michael B; Nunn, Amy S
2017-09-01
Place of residence has been associated with HIV transmission risks. Social capital, defined as features of social organization that improve efficiency of society by facilitating coordinated actions, often varies by neighborhood and is hypothesized to have protective effects on HIV care continuum outcomes. We examined whether the association between social capital and 2 HIV care continuum outcomes clustered geographically and whether sociocontextual mechanisms predict differences across clusters. Bivariate Local Moran's I evaluated geographical clustering in the association between social capital (participation in civic and social organizations, 2006, 2008, 2010) and the 5-year (2007-2011) prevalence of late HIV diagnosis and linkage to HIV care across Philadelphia, PA, census tracts (N = 378). Maps documented the clusters and multinomial regression assessed which sociocontextual mechanisms (eg, racial composition) predict differences across clusters. We identified 4 significant clusters (high social capital-high HIV/AIDS, low social capital-low HIV/AIDS, low social capital-high HIV/AIDS, and high social capital-low HIV/AIDS). Moran's I between social capital and late HIV diagnosis was I = 0.19 (z = 9.54, P < 0.001), and between social capital and linkage to HIV care it was I = 0.06 (z = 3.274, P = 0.002). In multivariable analysis, median household income predicted differences across clusters, particularly where social capital was lowest and HIV burden the highest, compared with clusters with high social capital and lowest HIV burden. The association between social participation and HIV care continuum outcomes clusters geographically in Philadelphia, PA. HIV prevention interventions should account for this phenomenon. Reducing geographic disparities will require interventions tailored to each continuum step and that address socioeconomic factors such as neighborhood median income.
Troggio, Michela; Surbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James
2013-01-01
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
Modularization of biochemical networks based on classification of Petri net t-invariants.
Grafahrend-Belau, Eva; Schreiber, Falk; Heiner, Monika; Sackmann, Andrea; Junker, Björn H; Grunwald, Stefanie; Speer, Astrid; Winder, Katja; Koch, Ina
2008-02-08
Structural analysis of biochemical networks is a growing field in bioinformatics and systems biology. The availability of an increasing amount of biological data from molecular biological networks promises a deeper understanding but confronts researchers with the problem of combinatorial explosion. The amount of qualitative network data is growing much faster than the amount of quantitative data, such as enzyme kinetics. In many cases it is even impossible to measure quantitative data because of limitations of experimental methods, or for ethical reasons. Thus, a huge amount of qualitative data, such as interaction data, is available, but it was not sufficiently used for modeling purposes, until now. New approaches have been developed, but the complexity of data often limits the application of many of the methods. Biochemical Petri nets make it possible to explore static and dynamic qualitative system properties. One Petri net approach is model validation based on the computation of the system's invariant properties, focusing on t-invariants. T-invariants correspond to subnetworks, which describe the basic system behavior. With increasing system complexity, the basic behavior can only be expressed by a huge number of t-invariants. According to our validation criteria for biochemical Petri nets, the necessary verification of the biological meaning, by interpreting each subnetwork (t-invariant) manually, is not possible anymore. Thus, an automated, biologically meaningful classification would be helpful in analyzing t-invariants, and supporting the understanding of the basic behavior of the considered biological system. Here, we introduce a new approach to automatically classify t-invariants to cope with network complexity. We apply clustering techniques such as UPGMA, Complete Linkage, Single Linkage, and Neighbor Joining in combination with different distance measures to get biologically meaningful clusters (t-clusters), which can be interpreted as modules.
To find the optimal number of t-clusters to consider for interpretation, the cluster validity measure, Silhouette Width, is applied. We considered two different case studies as examples: a small signal transduction pathway (pheromone response pathway in Saccharomyces cerevisiae) and a medium-sized gene regulatory network (gene regulation of Duchenne muscular dystrophy). We automatically classified the t-invariants into functionally distinct t-clusters, which could be interpreted biologically as functional modules in the network. We found differences in the suitability of the various distance measures as well as the clustering methods. In terms of a biologically meaningful classification of t-invariants, the best results are obtained using the Tanimoto distance measure. Considering clustering methods, the obtained results suggest that UPGMA and Complete Linkage are suitable for clustering t-invariants with respect to the biological interpretability. We propose a new approach for the biological classification of Petri net t-invariants based on cluster analysis. Due to the biologically meaningful data reduction and structuring of network processes, large sets of t-invariants can be evaluated, allowing for model validation of qualitative biochemical Petri nets. This approach can also be applied to elementary mode analysis.
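The two quantities named in this abstract, the Tanimoto distance and the Silhouette Width, can be sketched as follows. The sketch assumes each t-invariant is represented by its support, i.e. the set of transitions it contains; the paper's exact encoding may differ, and the function names are hypothetical.

```python
def tanimoto_distance(a, b):
    """1 - |A ∩ B| / |A ∪ B| for two t-invariants given as sets of transitions."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)

def mean_silhouette_width(items, labels, dist):
    """Average of s = (b - a) / max(a, b) over all items.

    a is the mean distance to the item's own cluster, b the smallest mean
    distance to any other cluster. Requires at least two distinct labels.
    """
    widths = []
    for i, item in enumerate(items):
        own = [dist(item, items[j]) for j in range(len(items))
               if j != i and labels[j] == labels[i]]
        if not own:  # singleton cluster: silhouette conventionally set to 0
            widths.append(0.0)
            continue
        a = sum(own) / len(own)
        b = min(
            sum(dist(item, items[j]) for j in range(len(items)) if labels[j] == other)
            / labels.count(other)
            for other in set(labels) if other != labels[i]
        )
        widths.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(widths) / len(widths)
```

To choose the number of t-clusters, one would cut the clustering tree at several levels, compute the mean silhouette width for each resulting labelling, and keep the level with the largest value.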
Modularization of biochemical networks based on classification of Petri net t-invariants
Grafahrend-Belau, Eva; Schreiber, Falk; Heiner, Monika; Sackmann, Andrea; Junker, Björn H; Grunwald, Stefanie; Speer, Astrid; Winder, Katja; Koch, Ina
2008-01-01
Background Structural analysis of biochemical networks is a growing field in bioinformatics and systems biology. The availability of an increasing amount of biological data from molecular biological networks promises a deeper understanding but confronts researchers with the problem of combinatorial explosion. The amount of qualitative network data is growing much faster than the amount of quantitative data, such as enzyme kinetics. In many cases it is even impossible to measure quantitative data because of limitations of experimental methods, or for ethical reasons. Thus, a huge amount of qualitative data, such as interaction data, is available, but it was not sufficiently used for modeling purposes, until now. New approaches have been developed, but the complexity of data often limits the application of many of the methods. Biochemical Petri nets make it possible to explore static and dynamic qualitative system properties. One Petri net approach is model validation based on the computation of the system's invariant properties, focusing on t-invariants. T-invariants correspond to subnetworks, which describe the basic system behavior. With increasing system complexity, the basic behavior can only be expressed by a huge number of t-invariants. According to our validation criteria for biochemical Petri nets, the necessary verification of the biological meaning, by interpreting each subnetwork (t-invariant) manually, is not possible anymore. Thus, an automated, biologically meaningful classification would be helpful in analyzing t-invariants, and supporting the understanding of the basic behavior of the considered biological system. Methods Here, we introduce a new approach to automatically classify t-invariants to cope with network complexity. 
We apply clustering techniques such as UPGMA, Complete Linkage, Single Linkage, and Neighbor Joining in combination with different distance measures to get biologically meaningful clusters (t-clusters), which can be interpreted as modules. To find the optimal number of t-clusters to consider for interpretation, the cluster validity measure, Silhouette Width, is applied. Results We considered two different case studies as examples: a small signal transduction pathway (pheromone response pathway in Saccharomyces cerevisiae) and a medium-sized gene regulatory network (gene regulation of Duchenne muscular dystrophy). We automatically classified the t-invariants into functionally distinct t-clusters, which could be interpreted biologically as functional modules in the network. We found differences in the suitability of the various distance measures as well as the clustering methods. In terms of a biologically meaningful classification of t-invariants, the best results are obtained using the Tanimoto distance measure. Considering clustering methods, the obtained results suggest that UPGMA and Complete Linkage are suitable for clustering t-invariants with respect to the biological interpretability. Conclusion We propose a new approach for the biological classification of Petri net t-invariants based on cluster analysis. Due to the biologically meaningful data reduction and structuring of network processes, large sets of t-invariants can be evaluated, allowing for model validation of qualitative biochemical Petri nets. This approach can also be applied to elementary mode analysis. PMID:18257938
ERIC Educational Resources Information Center
Soddell, J. A.; Seviour, R. J.
1985-01-01
Describes an exercise which uses a computer program (written for Commodore 64 microcomputers) that accepts data obtained from identifying bacteria, calculates similarity coefficients, and performs single linkage cluster analysis. Includes a program for simulating bacterial cultures for students who should not handle pathogenic microorganisms. (JN)
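The workflow described above (similarity coefficients from identification test results, then single linkage clustering) might look like the following modern sketch. The abstract does not say which coefficient the original Commodore 64 program used, so the simple matching coefficient here is an assumption, as are all names and the example threshold.

```python
def simple_matching(a, b):
    """Fraction of tests on which two isolates agree (both '+' or both '-')."""
    return sum(x == y for x, y in zip(a, b)) / len(a)

def single_linkage_groups(profiles, threshold):
    """Group isolates so that any two connected by a chain of pairwise
    similarities >= threshold end up together (single linkage at one level)."""
    names = list(profiles)
    parent = {n: n for n in names}  # union-find parent pointers

    def find(n):
        while parent[n] != n:
            n = parent[n]
        return n

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if simple_matching(profiles[a], profiles[b]) >= threshold:
                parent[find(a)] = find(b)

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values())
```

Single linkage chains: two isolates can share a group even if their direct similarity is below the threshold, as long as an intermediate isolate links them.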
Kemppainen, Petri; Knight, Christopher G; Sarma, Devojit K; Hlaing, Thaung; Prakash, Anil; Maung Maung, Yan Naung; Somboon, Pradya; Mahanta, Jagadish; Walton, Catherine
2015-09-01
Recent advances in sequencing allow population-genomic data to be generated for virtually any species. However, approaches to analyse such data lag behind the ability to generate it, particularly in nonmodel species. Linkage disequilibrium (LD, the nonrandom association of alleles from different loci) is a highly sensitive indicator of many evolutionary phenomena including chromosomal inversions, local adaptation and geographical structure. Here, we present linkage disequilibrium network analysis (LDna), which accesses information on LD shared between multiple loci genomewide. In LD networks, vertices represent loci, and connections between vertices represent the LD between them. We analysed such networks in two test cases: a new restriction-site-associated DNA sequence (RAD-seq) data set for Anopheles baimaii, a Southeast Asian malaria vector; and a well-characterized single nucleotide polymorphism (SNP) data set from 21 three-spined stickleback individuals. In each case, we readily identified five distinct LD network clusters (single-outlier clusters, SOCs), each comprising many loci connected by high LD. In A. baimaii, further population-genetic analyses supported the inference that each SOC corresponds to a large inversion, consistent with previous cytological studies. For sticklebacks, we inferred that each SOC was associated with a distinct evolutionary phenomenon: two chromosomal inversions, local adaptation, population-demographic history and geographic structure. LDna is thus a useful exploratory tool, able to give a global overview of LD associated with diverse evolutionary phenomena and identify loci potentially involved. LDna does not require a linkage map or reference genome, so it is applicable to any population-genomic data set, making it especially valuable for nonmodel species. © 2015 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
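The network representation described above (vertices are loci, connections carry the LD between them) can be illustrated with a small sketch. It is not the LDna implementation: it assumes phased biallelic haplotypes coded 0/1, uses r² as the LD measure, and keeps only edges above a hypothetical threshold `min_r2`.

```python
def r_squared(x, y):
    """Squared allelic correlation between two loci coded 0/1 per haplotype."""
    n = len(x)
    pa, pb = sum(x) / n, sum(y) / n
    pab = sum(a and b for a, b in zip(x, y)) / n  # frequency of the 1-1 haplotype
    denom = pa * (1 - pa) * pb * (1 - pb)
    return 0.0 if denom == 0 else (pab - pa * pb) ** 2 / denom

def ld_network(loci, min_r2):
    """Return the set of edges (locus_a, locus_b) whose r^2 is at least min_r2."""
    names = sorted(loci)
    return {
        (a, b)
        for i, a in enumerate(names)
        for b in names[i + 1:]
        if r_squared(loci[a], loci[b]) >= min_r2
    }
```

Groups of loci that remain connected at a high threshold are candidates for the kind of cluster the abstract associates with inversions or other phenomena generating strong LD.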
Genetic structure of Plasmodium falciparum populations across the Honduras-Nicaragua border.
Larrañaga, Nerea; Mejía, Rosa E; Hormaza, José I; Montoya, Alberto; Soto, Aida; Fontecha, Gustavo A
2013-10-04
The Caribbean coast of Central America remains an area of malaria transmission caused by Plasmodium falciparum despite the fact that morbidity has been reduced in recent years. Parasite populations in that region show interesting characteristics such as chloroquine susceptibility and low mortality rates. Genetic structure and diversity of P. falciparum populations in the Honduras-Nicaragua border were analysed in this study. Seven neutral microsatellite loci were analysed in 110 P. falciparum isolates from endemic areas of Honduras (n = 77) and Nicaragua (n = 33), mostly from the border region called the Moskitia. Several analyses concerning the genetic diversity, linkage disequilibrium, population structure, molecular variance, and haplotype clustering were conducted. There was a low level of genetic diversity in P. falciparum populations from Honduras and Nicaragua. Expected heterozygosity (H(e)) results were similarly low for both populations. A moderate differentiation was revealed by the F(ST) index between both populations, and two putative clusters were defined through a structure analysis. The main cluster grouped most of samples from Honduras and Nicaragua, while the second cluster was smaller and included all the samples from the Siuna community in Nicaragua. This result could partially explain the stronger linkage disequilibrium (LD) in the parasite population from that country. These findings are congruent with the decreasing rates of malaria endemicity in Central America.
An agglomerative hierarchical clustering approach to visualisation in Bayesian clustering problems
Dawson, Kevin J.; Belkhir, Khalid
2009-01-01
Clustering problems (including the clustering of individuals into outcrossing populations, hybrid generations, full-sib families and selfing lines) have recently received much attention in population genetics. In these clustering problems, the parameter of interest is a partition of the set of sampled individuals: the sample partition. In a fully Bayesian approach to clustering problems of this type, our knowledge about the sample partition is represented by a probability distribution on the space of possible sample partitions. Since the number of possible partitions grows very rapidly with the sample size, we cannot visualise this probability distribution in its entirety, unless the sample is very small. As a solution to this visualisation problem, we recommend using an agglomerative hierarchical clustering algorithm, which we call the exact linkage algorithm. This algorithm is a special case of the maximin clustering algorithm that we introduced previously. The exact linkage algorithm is now implemented in our software package Partition View. The exact linkage algorithm takes the posterior co-assignment probabilities as input, and yields as output a rooted binary tree or, more generally, a forest of such trees. Each node of this forest defines a set of individuals, and the node height is the posterior co-assignment probability of this set. This provides a useful visual representation of the uncertainty associated with the assignment of individuals to categories. It is also a useful starting point for a more detailed exploration of the posterior distribution in terms of the co-assignment probabilities. PMID:19337306
Goswami, Neela D; Schmitz, Michelle M; Sanchez, Travis; Dasgupta, Sharoda; Sullivan, Patrick; Cooper, Hannah; Rane, Deepali; Kelly, Jane; Del Rio, Carlos; Waller, Lance A
2016-05-01
Engagement in care is central to reducing mortality for HIV-infected persons and achieving the White House National AIDS Strategy of 80% viral suppression in the US by 2020. Where an HIV-infected person lives impacts his or her ability to achieve viral suppression. Reliable transportation access for healthcare may be a key determinant of this place-suppression relationship. ZIP code tabulation areas (ZCTAs) were the units of analysis. We used geospatial and ecologic analyses to examine spatial distributions of neighborhood-level variables (eg, transportation accessibility) and associations with: (1) community linkage to care, and (2) community viral suppression. Among Atlanta ZCTAs with data for newly diagnosed HIV cases (2006-2010), we used Moran I to evaluate spatial clustering and linear regression models to evaluate associations between neighborhood variables and outcomes. In 100 ZCTAs with 8413 newly diagnosed HIV-positive residents, a median of 60 HIV cases were diagnosed per ZCTA during the 5-year period. We found significant clustering of ZCTAs with low linkage to care and viral suppression (Moran I = 0.218, P < 0.05). In high-poverty ZCTAs, a 10% point increase in ZCTA-level household vehicle ownership was associated with a 4% point increase in linkage to care (P = 0.02, R = 0.16). In low-poverty ZCTAs, a 10% point increase in ZCTA-level household vehicle ownership was associated with a 30% point increase in ZCTA-level viral suppression (P = 0.01, R = 0.08). Correlations between transportation variables and community-level care linkage and viral suppression vary by area poverty level and provide opportunities for interventions beyond individual-level factors.
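For readers unfamiliar with the Moran I statistic used above to test for spatial clustering, the global form can be sketched in a few lines. The spatial weights matrix is an assumption: the abstract does not state how neighbouring ZCTAs were defined, so the sketch simply takes any square matrix `weights` (for example 1 for adjacent areas, 0 otherwise).

```python
def morans_i(values, weights):
    """Global Moran's I:
    I = (n / W) * sum_ij w_ij (x_i - mean)(x_j - mean) / sum_i (x_i - mean)^2
    where W is the sum of all weights."""
    n = len(values)
    xbar = sum(values) / n
    dev = [v - xbar for v in values]
    w_total = sum(sum(row) for row in weights)
    cross = sum(weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n))
    return (n / w_total) * cross / sum(d * d for d in dev)
```

Values above zero indicate that neighbouring areas tend to have similar values (clustering), and values below zero indicate that neighbours tend to differ. Assessing significance, as reported in the abstract, additionally requires a permutation or analytical test that is not shown here.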
Linkage of A-to-I RNA Editing in Metazoans and the Impact on Genome Evolution
Duan, Yuange; Dou, Shengqian; Zhang, Hong; Wu, Changcheng; Wu, Mingming
2018-01-01
Abstract The adenosine-to-inosine (A-to-I) RNA editomes have been systematically characterized in various metazoan species, and many editing sites were found in clusters. However, it remains unclear whether the clustered editing sites tend to be linked in the same RNA molecules or not. By adopting a method originally designed to detect linkage disequilibrium of DNA mutations, we examined the editomes of ten metazoan species and detected extensive linkage of editing in Drosophila and cephalopods. The prevalent linkages of editing in these two clades, many of which are conserved between closely related species and might be associated with the adaptive proteomic recoding, are maintained by natural selection at the cost of genome evolution. Nevertheless, in worms and humans, we only detected modest proportions of linked editing events, the majority of which were not conserved. Furthermore, the linkage of editing in coding regions of worms and humans might be overall deleterious, which drives the evolution of DNA sites to escape promiscuous editing. Altogether, our results suggest that the linkage landscape of A-to-I editing has evolved during metazoan evolution. This present study also suggests that linkage of editing should be considered in elucidating the functional consequences of RNA editing. PMID:29048557
Flory-Stockmayer analysis on reprocessable polymer networks
NASA Astrophysics Data System (ADS)
Li, Lingqiao; Chen, Xi; Jin, Kailong; Torkelson, John
Reprocessable polymer networks can undergo structural rearrangement through dynamic chemistries under proper conditions, making them promising candidates for recyclable crosslinked materials, e.g. tires. This research field has focused on various chemistries. However, an essential physical theory explaining the relationship between the abundance of dynamic linkages and reprocessability has been lacking. Based on the classical Flory-Stockmayer analysis of network gelation, we developed a similar analysis of reprocessable polymer networks to quantitatively predict the critical condition for reprocessability. Our theory indicates that it is unnecessary for all bonds to be dynamic to make the resulting network reprocessable. As long as there is no percolated permanent network in the system, the material can fully rearrange. To experimentally validate our theory, we used a thiol-epoxy network model system with various dynamic linkage compositions. The stress relaxation behavior of the resulting materials supports our theoretical prediction: only 50 % of linkages between crosslinks need to be dynamic for a tri-arm network to be reprocessable. Therefore, this analysis provides the first fundamental theoretical platform for designing and evaluating reprocessable polymer networks. We thank the McCormick Research Catalyst Award Fund and ISEN cluster fellowship (L. L.) for funding support.
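The 50 % figure quoted above is consistent with the textbook Flory-Stockmayer gel point. The sketch below is an illustration under stated assumptions, not the authors' derivation: it assumes a fully reacted network of identical f-arm crosslinks, equal reactivity, no loops, and permanent linkages placed at random, so that the permanent bonds percolate once their fraction exceeds 1 / (f - 1). The function names are hypothetical.

```python
def gel_point(f):
    """Classical Flory-Stockmayer critical bond fraction for f-functional junctions."""
    if f < 3:
        raise ValueError("a network needs junction functionality f >= 3")
    return 1.0 / (f - 1)

def min_dynamic_fraction(f):
    """Smallest fraction of dynamic linkages that keeps the permanent bonds
    at or below their percolation threshold (assumes full conversion)."""
    return 1.0 - gel_point(f)

print(min_dynamic_fraction(3))  # 0.5 for a tri-arm network
```

Under the same assumptions a four-arm network would need about two thirds of its linkages to be dynamic; that extrapolation comes from the classical formula, not from the abstract.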
Zhang, Yu; Yan, Haidong; Jiang, Xiaomei; Wang, Xiaoli; Huang, Linkai; Xu, Bin; Zhang, Xinquan; Zhang, Lexin
2016-01-01
To evaluate genetic variation, population structure, and the extent of linkage disequilibrium (LD), 134 switchgrass (Panicum virgatum L.) samples were analyzed with 51 markers, including 16 ISSRs, 20 SCoTs, and 15 EST-SSRs. In this study, a high level of genetic variation was observed in the switchgrass samples and they had an average Nei's gene diversity index (H) of 0.311. A total of 793 bands were obtained, of which 708 (89.28 %) were polymorphic. Using a parameter marker index (MI), the efficiency of the three types of markers (ISSR, SCoT, and EST-SSR) in the study were compared and we found that SCoT had a higher marker efficiency than the other two markers. The 134 switchgrass samples could be divided into two sub-populations based on STRUCTURE, UPGMA clustering, and principal coordinate analyses (PCA), and upland and lowland ecotypes could be separated by UPGMA clustering and PCA analyses. Linkage disequilibrium analysis revealed an average r² of 0.035 across all 51 markers, indicating a trend of higher LD in sub-population 2 than that in sub-population 1 (P < 0.01). The population structure revealed in this study will guide the design of future association studies using these switchgrass samples.
Verardi, A; Lucchini, V; Randi, E
2006-09-01
Occasional crossbreeding between free-ranging domestic dogs and wild wolves (Canis lupus) has been detected in some European countries by mitochondrial DNA sequencing and genotyping unlinked microsatellite loci. Maternal and unlinked genomic markers, however, might underestimate the extent of introgressive hybridization, and their impacts on the preservation of wild wolf gene pools. In this study, we genotyped 220 presumed Italian wolves, 85 dogs and 7 known hybrids at 16 microsatellites belonging to four different linkage groups (plus four unlinked microsatellites). Population clustering and individual assignments were performed using a Bayesian procedure implemented in structure 2.1, which models the gametic disequilibrium arising between linked loci during admixtures, aiming to trace hybridization events further back in time and infer the population of origin of chromosomal blocks. Results indicate that (i) linkage disequilibrium was higher in wolves than in dogs; (ii) 11 out of 220 wolves (5.0%) were likely admixed, a proportion that is significantly higher than one admixed genotype in 107 wolves found previously in a study using unlinked markers; (iii) posterior maximum-likelihood estimates of the recombination parameter r revealed that introgression in Italian wolves is not recent, but could have continued for the last 70 (+/- 20) generations, corresponding to approximately 140-210 years. Bayesian clustering showed that, despite some admixture, wolf and dog gene pools remain sharply distinct (the average proportions of membership to wolf and dog clusters were Q(w) = 0.95 and Q(d) = 0.98, respectively), suggesting that hybridization was not frequent, and that introgression in nature is counteracted by behavioural or selective constraints.
Application of agglomerative clustering for analyzing phylogenetically on bacterium of saliva
NASA Astrophysics Data System (ADS)
Bustamam, A.; Fitria, I.; Umam, K.
2017-07-01
Analyzing populations of Streptococcus bacteria is important because these species can cause dental caries, periodontal disease, halitosis (bad breath) and other problems. This paper discusses the phylogenetic relationships among Streptococcus bacteria in saliva using a phylogenetic tree built with agglomerative clustering methods. Starting from Streptococcus DNA sequences obtained from GenBank, feature extraction is performed on the DNA sequences. The extracted features form a matrix, which is normalized using min-max normalization, and genetic distances are then calculated using the Manhattan distance. The agglomerative clustering techniques considered are single linkage, complete linkage and average linkage. In the agglomerative algorithm, the number of groups starts at the number of individual species. The most similar species are grouped successively, with similarity decreasing at each step, until a single group is formed. The result of the grouping is a phylogenetic tree whose branches join at a given level of distance: the smaller the distance, the greater the similarity between species. The implementation uses R, an open source program.
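The pipeline outlined above (min-max normalization, Manhattan distance, then single, complete or average linkage) can be sketched as follows. The abstract states that the authors worked in R; this Python version is only an illustration, the feature vectors are assumed to be already extracted from the sequences, and all names are hypothetical.

```python
def min_max(rows):
    """Rescale each column to the range [0, 1]."""
    cols = list(zip(*rows))
    lo = [min(c) for c in cols]
    span = [(max(c) - min(c)) or 1.0 for c in cols]  # guard against constant columns
    return [[(x - l) / s for x, l, s in zip(r, lo, span)] for r in rows]

def manhattan(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))

LINKAGE = {
    "single": min,                            # closest pair between clusters
    "complete": max,                          # farthest pair between clusters
    "average": lambda d: sum(d) / len(d),     # mean over all cross pairs
}

def agglomerate(rows, method="average"):
    """Return the merge history [(members_a, members_b, height), ...],
    starting from one cluster per row and ending with a single cluster."""
    link = LINKAGE[method]
    clusters = [(i,) for i in range(len(rows))]
    merges = []
    while len(clusters) > 1:
        height, a, b = min(
            (link([manhattan(rows[i], rows[j]) for i in a for j in b]), a, b)
            for k, a in enumerate(clusters) for b in clusters[k + 1:]
        )
        clusters = [c for c in clusters if c not in (a, b)] + [tuple(sorted(a + b))]
        merges.append((a, b, height))
    return merges
```

The merge history is the information a dendrogram draws: each entry records which two groups joined and at what distance, so smaller heights correspond to more similar species.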
Genetic structure of Plasmodium falciparum populations across the Honduras-Nicaragua border
2013-01-01
Background The Caribbean coast of Central America remains an area of malaria transmission caused by Plasmodium falciparum despite the fact that morbidity has been reduced in recent years. Parasite populations in that region show interesting characteristics such as chloroquine susceptibility and low mortality rates. Genetic structure and diversity of P. falciparum populations in the Honduras-Nicaragua border were analysed in this study. Methods Seven neutral microsatellite loci were analysed in 110 P. falciparum isolates from endemic areas of Honduras (n = 77) and Nicaragua (n = 33), mostly from the border region called the Moskitia. Several analyses concerning the genetic diversity, linkage disequilibrium, population structure, molecular variance, and haplotype clustering were conducted. Results There was a low level of genetic diversity in P. falciparum populations from Honduras and Nicaragua. Expected heterozygosity (He) results were similarly low for both populations. A moderate differentiation was revealed by the FST index between both populations, and two putative clusters were defined through a structure analysis. The main cluster grouped most of samples from Honduras and Nicaragua, while the second cluster was smaller and included all the samples from the Siuna community in Nicaragua. This result could partially explain the stronger linkage disequilibrium (LD) in the parasite population from that country. These findings are congruent with the decreasing rates of malaria endemicity in Central America. PMID:24093629
Musmeci, Nicoló; Aste, Tomaso; Di Matteo, T
2015-01-01
We quantify the amount of information filtered by different hierarchical clustering methods on correlations between stock returns comparing the clustering structure with the underlying industrial activity classification. We apply, for the first time to financial data, a novel hierarchical clustering approach, the Directed Bubble Hierarchical Tree and we compare it with other methods including the Linkage and k-medoids. By taking the industrial sector classification of stocks as a benchmark partition, we evaluate how the different methods retrieve this classification. The results show that the Directed Bubble Hierarchical Tree can outperform other methods, being able to retrieve more information with fewer clusters. Moreover, we show that the economic information is hidden at different levels of the hierarchical structures depending on the clustering method. The dynamical analysis on a rolling window also reveals that the different methods show different degrees of sensitivity to events affecting financial markets, like crises. These results can be of interest for all the applications of clustering methods to portfolio optimization and risk hedging [corrected].
Musmeci, Nicoló; Aste, Tomaso; Di Matteo, T.
2015-01-01
We quantify the amount of information filtered by different hierarchical clustering methods on correlations between stock returns comparing the clustering structure with the underlying industrial activity classification. We apply, for the first time to financial data, a novel hierarchical clustering approach, the Directed Bubble Hierarchical Tree and we compare it with other methods including the Linkage and k-medoids. By taking the industrial sector classification of stocks as a benchmark partition, we evaluate how the different methods retrieve this classification. The results show that the Directed Bubble Hierarchical Tree can outperform other methods, being able to retrieve more information with fewer clusters. Moreover, we show that the economic information is hidden at different levels of the hierarchical structures depending on the clustering method. The dynamical analysis on a rolling window also reveals that the different methods show different degrees of sensitivity to events affecting financial markets, like crises. These results can be of interest for all the applications of clustering methods to portfolio optimization and risk hedging. PMID:25786703
Linkage of A-to-I RNA Editing in Metazoans and the Impact on Genome Evolution.
Duan, Yuange; Dou, Shengqian; Zhang, Hong; Wu, Changcheng; Wu, Mingming; Lu, Jian
2018-01-01
The adenosine-to-inosine (A-to-I) RNA editomes have been systematically characterized in various metazoan species, and many editing sites were found in clusters. However, it remains unclear whether the clustered editing sites tend to be linked in the same RNA molecules or not. By adopting a method originally designed to detect linkage disequilibrium of DNA mutations, we examined the editomes of ten metazoan species and detected extensive linkage of editing in Drosophila and cephalopods. The prevalent linkages of editing in these two clades, many of which are conserved between closely related species and might be associated with the adaptive proteomic recoding, are maintained by natural selection at the cost of genome evolution. Nevertheless, in worms and humans, we only detected modest proportions of linked editing events, the majority of which were not conserved. Furthermore, the linkage of editing in coding regions of worms and humans might be overall deleterious, which drives the evolution of DNA sites to escape promiscuous editing. Altogether, our results suggest that the linkage landscape of A-to-I editing has evolved during metazoan evolution. This present study also suggests that linkage of editing should be considered in elucidating the functional consequences of RNA editing. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
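The idea of borrowing a linkage disequilibrium measure for editing sites can be illustrated with the classical coefficients D and r², computed over reads that cover both sites. This is a minimal sketch under the assumption that each read is reduced to a pair of edited/unedited flags; it is not the authors' pipeline, and the function name and input format are hypothetical.

```python
def editing_linkage(reads):
    """Return (D, r2) for two editing sites.

    `reads` is a list of (edited_at_site_1, edited_at_site_2) booleans,
    one pair per read (or read pair) spanning both sites.
    """
    n = len(reads)
    if n == 0:
        raise ValueError("need at least one read covering both sites")
    p_a = sum(1 for a, _ in reads if a) / n         # editing level at site 1
    p_b = sum(1 for _, b in reads if b) / n         # editing level at site 2
    p_ab = sum(1 for a, b in reads if a and b) / n  # both edited on the same molecule
    d = p_ab - p_a * p_b                            # departure from independence
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    # r2 is undefined when either site is edited in all reads or in none.
    r2 = d * d / denom if denom > 0 else float("nan")
    return d, r2


# Hypothetical reads: the two sites are always edited together.
linked_reads = [(True, True)] * 5 + [(False, False)] * 5
d_linked, r2_linked = editing_linkage(linked_reads)
```

A positive D with r² close to 1 indicates that the two sites tend to be edited on the same RNA molecules, whereas r² near 0 is what independent editing would give.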
Effect of Linkage Disequilibrium on the Identification of Functional Variants
Thomas, Alun; Abel, Haley J; Di, Yanming; Faye, Laura L; Jin, Jing; Liu, Jin; Wu, Zheyan; Paterson, Andrew D
2011-01-01
We summarize the contributions of Group 9 of Genetic Analysis Workshop 17. This group addressed the problems of linkage disequilibrium and other longer range forms of allelic association when evaluating the effects of genotypes on phenotypes. Issues raised by long-range associations, whether a result of selection, stratification, possible technical errors, or chance, were less expected but proved to be important. Most contributors focused on regression methods of various types to illustrate problematic issues or to develop adaptations for dealing with high-density genotype assays. Study design was also considered, as was graphical modeling. Although no method emerged as uniformly successful, most succeeded in reducing false-positive results either by considering clusters of loci within genes or by applying smoothing metrics that required results from adjacent loci to be similar. Two unexpected results that questioned our assumptions of what is required to model linkage disequilibrium were observed. The first was that correlations between loci separated by large genetic distances can greatly inflate single-locus test statistics, and, whether the result of selection, stratification, possible technical errors, or chance, these correlations seem overabundant. The second unexpected result was that applying principal components analysis to genome-wide genotype data can apparently control not only for population structure but also for linkage disequilibrium. PMID:22128051
Dong, Skye T; Costa, Daniel S J; Butow, Phyllis N; Lovell, Melanie R; Agar, Meera; Velikova, Galina; Teckle, Paulos; Tong, Allison; Tebbutt, Niall C; Clarke, Stephen J; van der Hoek, Kim; King, Madeleine T; Fayers, Peter M
2016-01-01
Symptom clusters in advanced cancer can influence patient outcomes. There is large heterogeneity in the methods used to identify symptom clusters. To investigate the consistency of symptom cluster composition in advanced cancer patients using different statistical methodologies for all patients across five primary cancer sites, and to examine which clusters predict functional status, a global assessment of health and global quality of life. Principal component analysis and exploratory factor analysis (with different rotation and factor selection methods) and hierarchical cluster analysis (with different linkage and similarity measures) were used on a data set of 1562 advanced cancer patients who completed the European Organization for the Research and Treatment of Cancer Quality of Life Questionnaire-Core 30. Four clusters consistently formed for many of the methods and cancer sites: tense-worry-irritable-depressed (emotional cluster), fatigue-pain, nausea-vomiting, and concentration-memory (cognitive cluster). The emotional cluster was a stronger predictor of overall quality of life than the other clusters. Fatigue-pain was a stronger predictor of overall health than the other clusters. The cognitive cluster and fatigue-pain predicted physical functioning, role functioning, and social functioning. The four identified symptom clusters were consistent across statistical methods and cancer types, although there were some noteworthy differences. Statistical derivation of symptom clusters is in need of greater methodological guidance. A psychosocial pathway in the management of symptom clusters may improve quality of life. Biological mechanisms underpinning symptom clusters need to be delineated by future research. A framework for evidence-based screening, assessment, treatment, and follow-up of symptom clusters in advanced cancer is essential. Copyright © 2016 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Integrated genome sequence and linkage map of physic nut (Jatropha curcas L.), a biodiesel plant.
Wu, Pingzhi; Zhou, Changpin; Cheng, Shifeng; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Chen, Yanbo; Chen, Yan; Ni, Peixiang; Wang, Ying; Xu, Xun; Huang, Ying; Song, Chi; Wang, Zhiwen; Shi, Nan; Zhang, Xudong; Fang, Xiaohua; Yang, Qing; Jiang, Huawu; Chen, Yaping; Li, Meiru; Wang, Ying; Chen, Fan; Wang, Jun; Wu, Guojiang
2015-03-01
The family Euphorbiaceae includes some of the most efficient biomass accumulators. Whole genome sequencing and the development of genetic maps of these species are important components in molecular breeding and genetic improvement. Here we report the draft genome of physic nut (Jatropha curcas L.), a biodiesel plant. The assembled genome has a total length of 320.5 Mbp and contains 27,172 putative protein-coding genes. We established a linkage map containing 1208 markers and anchored the genome assembly (81.7%) to this map to produce 11 pseudochromosomes. After gene family clustering, 15,268 families were identified, of which 13,887 existed in the castor bean genome. Analysis of the genome highlighted specific expansion and contraction of a number of gene families during the evolution of this species, including the ribosome-inactivating proteins and oil biosynthesis pathway enzymes. The genomic sequence and linkage map provide a valuable resource not only for fundamental and applied research on physic nut but also for evolutionary and comparative genomics analysis, particularly in the Euphorbiaceae. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Troggio, Michela; Šurbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James
2013-01-01
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies. PMID:23826289
Using data mining to segment healthcare markets from patients' preference perspectives.
Liu, Sandra S; Chen, Jie
2009-01-01
This paper aims to provide an example of how to use data mining techniques to identify patient segments regarding preferences for healthcare attributes and their demographic characteristics. Data were derived from a number of individuals who received in-patient care at a health network in 2006. Data mining and conventional hierarchical clustering with average linkage and Pearson correlation procedures are employed and compared to show how each procedure best determines segmentation variables. Data mining tools identified three differentiable segments by means of cluster analysis. These three clusters have significantly different demographic profiles. The study reveals, when compared with traditional statistical methods, that data mining provides an efficient and effective tool for market segmentation. When there are numerous cluster variables involved, researchers and practitioners need to incorporate factor analysis for reducing variables to clearly and meaningfully understand clusters. Interests and applications in data mining are increasing in many businesses. However, this technology is seldom applied to healthcare customer experience management. The paper shows that efficient and effective application of data mining methods can aid the understanding of patient healthcare preferences.
NASA Astrophysics Data System (ADS)
Kantar, Ersin; Keskin, Mustafa; Deviren, Bayram
2012-04-01
We have analyzed the topology of 50 important Turkish companies for the period 2006-2010 using hierarchical methods (the minimal spanning tree (MST) and hierarchical tree (HT)). We investigated the statistical reliability of links between companies in the MST by using the bootstrap technique. We also used the average linkage cluster analysis (ALCA) technique to observe the cluster structures more clearly. The MST and HT are known as useful tools to perceive and detect global structure, taxonomy, and hierarchy in financial data. We obtained four clusters of companies according to their proximity. We also observed that the Banks and Holdings cluster always forms in the centre of the MSTs for the periods 2006-2007, 2008, and 2009-2010. The clusters match nicely with their common production activities or their strong interrelationship. The effects of the Automobile sector increased after the global financial crisis due to the temporary incentives provided by the Turkish government. We find that Turkish companies were not strongly affected by the global financial crisis.
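A minimal spanning tree of this kind is commonly built by converting return correlations into distances and running Kruskal's algorithm. The sketch below assumes the widely used transformation d = sqrt(2(1 - rho)); the abstract does not state which distance the authors used, and the company names and correlation values are hypothetical.

```python
from math import sqrt


def correlation_distance(rho):
    """Map a correlation in [-1, 1] to a distance in [0, 2] (assumed transformation)."""
    return sqrt(2.0 * (1.0 - rho))


def minimal_spanning_tree(names, corr):
    """Kruskal's algorithm on the complete graph of correlation distances."""
    n = len(names)
    edges = sorted(
        (correlation_distance(corr[i][j]), i, j)
        for i in range(n)
        for j in range(i + 1, n)
    )
    parent = list(range(n))

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for dist, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:  # the edge joins two components, so no cycle forms
            parent[root_i] = root_j
            tree.append((names[i], names[j], dist))
    return tree


# Hypothetical companies and correlation matrix.
names = ["BankA", "BankB", "HoldingC", "AutoD"]
corr = [
    [1.0, 0.8, 0.6, 0.2],
    [0.8, 1.0, 0.5, 0.3],
    [0.6, 0.5, 1.0, 0.4],
    [0.2, 0.3, 0.4, 1.0],
]
tree = minimal_spanning_tree(names, corr)
```

The tree keeps only the n - 1 strongest non-redundant links, which is why strongly correlated groups such as banks and holdings tend to appear as connected branches.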
DiMeglio, Laura M.; Yu, Hongrun; Davis, Thomas M.
2014-01-01
The genus Fragaria encompasses species at ploidy levels ranging from diploid to decaploid. The cultivated strawberry, Fragaria×ananassa, and its two immediate progenitors, F. chiloensis and F. virginiana, are octoploids. To elucidate the ancestries of these octoploid species, we performed a phylogenetic analysis using intron-containing sequences of the nuclear ADH-1 gene from 39 germplasm accessions representing nineteen Fragaria species and one outgroup species, Dasiphora fruticosa. All trees from Maximum Parsimony and Maximum Likelihood analyses showed two major clades, Clade A and Clade B. Each of the sampled octoploids contributed alleles to both major clades. All octoploid-derived alleles in Clade A clustered with alleles of diploid F. vesca, with the exception of one octoploid allele that clustered with the alleles of diploid F. mandshurica. All octoploid-derived alleles in clade B clustered with the alleles of only one diploid species, F. iinumae. When gaps encoded as binary characters were included in the Maximum Parsimony analysis, tree resolution was improved with the addition of six nodes, and the bootstrap support was generally higher, rising above the 50% threshold for an additional nine branches. These results, coupled with the congruence of the sequence data and the coded gap data, validate and encourage the employment of sequence sets containing gaps for phylogenetic analysis. Our phylogenetic conclusions, based upon sequence data from the ADH-1 gene located on F. vesca linkage group II, complement and generally agree with those obtained from analyses of protein-encoding genes GBSSI-2 and DHAR located on F. vesca linkage groups V and VII, respectively, but differ from a previous study that utilized rDNA sequences and did not detect the ancestral role of F. iinumae. PMID:25078607
NASA Astrophysics Data System (ADS)
Sneath, P. H. A.
A BASIC program is presented for significance tests to determine whether a dendrogram is derived from clustering of points that belong to a single multivariate normal distribution. The significance tests are based on statistics of the Kolmogorov-Smirnov type, obtained by comparing the observed cumulative graph of branch levels with a graph for the hypothesis of multivariate normality. The program also permits testing whether the dendrogram could be from a cluster of lower dimensionality due to character correlations. The program makes provision for three similarity coefficients, (1) Euclidean distances, (2) squared Euclidean distances, and (3) Simple Matching Coefficients, and for five cluster methods, (1) WPGMA, (2) UPGMA, (3) Single Linkage (or Minimum Spanning Trees), (4) Complete Linkage, and (5) Ward's Increase in Sums of Squares. The program is entitled DENBRAN.
The apolipoprotein E/CI/CII gene cluster and late-onset Alzheimer disease
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yu, Chang-En; Nemens, E.; Olson, J.M.
1994-04-01
The chromosome 19 apolipoprotein E/CI/CII gene cluster was examined for evidence of linkage to a familial Alzheimer disease (FAD) locus. The family groups studied were Volga German (VG), early-onset non-VG (ENVG; mean age at onset <60 years), and late-onset families. A genetic association was observed between apolipoprotein E (ApoE) allele E4 and FAD in late-onset families; the E4 allele frequency was .51 in affected subjects, .37 in at-risk subjects, .11 in spouses, and .19 in unrelated controls. The differences between the E4 frequencies in affected subjects versus controls and in at-risk subjects versus controls were highly significant. No association between the E4 allele and FAD was observed in the ENVG or VG groups. A statistically significant allelic association between E4 and AD was also observed in a group of unrelated subjects; the E4 frequency was .26 in affected subjects, versus .19 in controls (Z_SND = 2.20, P < .03). Evidence of linkage of ApoE and ApoCII to FAD was examined by maximum-likelihood methods, using three models and assuming autosomal dominant inheritance: (1) age-dependent penetrance, (2) extremely low (1%) penetrance, and (3) age-dependent penetrance corrected for sporadic Alzheimer disease (AD). For ApoCII in late-onset families, results for close linkage were negative, and only small positive lod-score-statistic (Z) values were obtained. For ApoE in late-onset kindreds, positive Z values were obtained when either allele frequencies from controls or allele frequencies from the families were used. When linkage disequilibrium was incorporated into the analysis, the Z values increased. For the ENVG group, results for ApoE and ApoCII were uniformly negative. Affected-pedigree-member analysis gave significant results for the late-onset kindreds, for ApoE, when control allele frequencies were used but not when allele frequencies were derived from the families. 58 refs., 6 tabs.
Autosomal Linkage Scan for Loci Predisposing to Comorbid Dependence on Multiple Substances
Yang, Bao-Zhu; Han, Shizhong; Kranzler, Henry R.; Farrer, Lindsay A.; Elston, Robert C.; Gelernter, Joel
2014-01-01
Multiple substance dependence (MSD) trait comorbidity is common, and MSD patients are often severely affected clinically. While shared genetic risks have been documented, so far there has been no published report using the linkage scan approach to survey risk loci for MSD as a phenotype. A total of 1,758 individuals in 739 families [384 African American (AA) and 355 European American (EA) families] ascertained via affected sib-pairs with cocaine or opioid or alcohol dependence were genotyped using an array-based linkage panel of single-nucleotide polymorphism markers. Fuzzy clustering analysis was conducted on individuals with alcohol, cannabis, cocaine, opioid, and nicotine dependence for AAs and EAs separately, and linkage scans were conducted for the output membership coefficients using Merlin-regression. In EAs, we observed an autosome-wide significant linkage signal on chromosome 4 (peak lod = 3.31 at 68.3 cM; empirical autosome-wide P = 0.038), and a suggestive linkage signal on chromosome 21 (peak lod = 2.37 at 19.4 cM). In AAs, four suggestive linkage peaks were observed: two peaks on chromosome 10 (lod = 2.66 at 96.7 cM and lod = 3.02 at 147.6 cM) and the other two on chromosomes 3 (lod = 2.81 at 145.5 cM) and 9 (lod = 1.93 at 146.8 cM). Three particularly promising candidate genes, GABRA4, GABRB1, and CLOCK, are located within or very close to the autosome-wide significant linkage region for EAs on chromosome 4. This is the first linkage evidence supporting existence of genetic loci influencing risk for several comorbid disorders simultaneously in two major US populations. PMID:22354695
Tran, Duong Thuy; Havard, Alys; Jorm, Louisa R
2017-07-11
Data cleaning is an important quality assurance in data linkage research studies. This paper presents the data cleaning and preparation process for a large-scale cross-jurisdictional Australian study (the Smoking MUMS Study) to evaluate the utilisation and safety of smoking cessation pharmacotherapies during pregnancy. Perinatal records for all deliveries (2003-2012) in the States of New South Wales (NSW) and Western Australia were linked to State-based data collections including hospital separation, emergency department and death data (mothers and babies) and congenital defect notifications (babies in NSW) by State-based data linkage units. A national data linkage unit linked pharmaceutical dispensing data for the mothers. All linkages were probabilistic. Twenty two steps assessed the uniqueness of records and consistency of items within and across data sources, resolved discrepancies in the linkages between units, and identified women having records in both States. State-based linkages yielded a cohort of 783,471 mothers and 1,232,440 babies. Likely false positive links relating to 3703 mothers were identified. Corrections of baby's date of birth and age, and parity were made for 43,578 records while 1996 records were flagged as duplicates. Checks for the uniqueness of the matches between State and national linkages detected 3404 ID clusters, suggestive of missed links in the State linkages, and identified 1986 women who had records in both States. Analysis of content data can identify inaccurate links that cannot be detected by data linkage units that have access to personal identifiers only. Perinatal researchers are encouraged to adopt the methods presented to ensure quality and consistency among studies using linked administrative data.
Assessing population genetic structure via the maximisation of genetic distance
2009-01-01
Background: The inference of the hidden structure of a population is an essential issue in population genetics. Recently, several methods have been proposed to infer population structure in population genetics.
Methods: In this study, a new method to infer the number of clusters and to assign individuals to the inferred populations is proposed. This approach does not make any assumption on Hardy-Weinberg and linkage equilibrium. The implemented criterion is the maximisation (via a simulated annealing algorithm) of the averaged genetic distance between a predefined number of clusters. The performance of this method is compared with two Bayesian approaches: STRUCTURE and BAPS, using simulated data and also a real human data set.
Results: The simulations show that with a reduced number of markers, BAPS overestimates the number of clusters and presents a reduced proportion of correct groupings. The accuracy of the new method is approximately the same as for STRUCTURE. Also, in Hardy-Weinberg and linkage disequilibrium cases, BAPS performs incorrectly. In these situations, STRUCTURE and the new method show an equivalent behaviour with respect to the number of inferred clusters, although the proportion of correct groupings is slightly better with the new method. Re-establishing equilibrium with the randomisation procedures improves the precision of the Bayesian approaches. All methods have a good precision for FST ≥ 0.03, but only STRUCTURE estimates the correct number of clusters for FST as low as 0.01. In situations with a high number of clusters or a more complex population structure, MGD performs better than STRUCTURE and BAPS. The results for a human data set analysed with the new method are congruent with the geographical regions previously found.
Conclusion: This new method used to infer the hidden structure in a population, based on the maximisation of the genetic distance and not taking into consideration any assumption about Hardy-Weinberg and linkage equilibrium, performs well under different simulated scenarios and with real data. Therefore, it could be a useful tool to determine genetically homogeneous groups, especially in those situations where the number of clusters is high, with complex population structure and where Hardy-Weinberg and/or linkage equilibrium are present. PMID:19900278
Lu, Yang Young; Chen, Ting; Fuhrman, Jed A; Sun, Fengzhu
2017-03-15
The advent of next-generation sequencing technologies enables researchers to sequence complex microbial communities directly from the environment. Because assembly typically produces only genome fragments, also known as contigs, instead of an entire genome, it is crucial to group them into operational taxonomic units (OTUs) for further taxonomic profiling and downstream functional analysis. OTU clustering is also referred to as binning. We present COCACOLA, a general framework to automatically bin contigs into OTUs based on sequence composition and coverage across multiple samples. The effectiveness of COCACOLA is demonstrated in both simulated and real datasets in comparison with state-of-the-art binning approaches such as CONCOCT, GroopM, MaxBin and MetaBAT. The superior performance of COCACOLA relies on two aspects. One is using L1 distance instead of Euclidean distance for better taxonomic identification during initialization. More importantly, COCACOLA takes advantage of both hard clustering and soft clustering by sparsity regularization. In addition, the COCACOLA framework seamlessly embraces customized knowledge to facilitate binning accuracy. In our study, we have investigated two types of additional knowledge, the co-alignment to reference genomes and linkage of contigs provided by paired-end reads, as well as the ensemble of both. We find that both co-alignment and linkage information further improve binning in the majority of cases. COCACOLA is scalable and faster than CONCOCT, GroopM, MaxBin and MetaBAT. The software is available at https://github.com/younglululu/COCACOLA. Contact: fsun@usc.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Developing Industry Linkages: Learning from Practice.
ERIC Educational Resources Information Center
Misko, Josie
Linkages between Australia's vocational education and training (VET) and technical and further education (TAFE) sectors and industry were examined through 13 case studies involving a variety of industrial sectors in South Australia, New South Wales, and Victoria. Special attention was paid to the processes established by school clusters to develop…
Generalising Ward's Method for Use with Manhattan Distances.
Strauss, Trudie; von Maltitz, Michael Johan
2017-01-01
The claim that Ward's linkage algorithm in hierarchical clustering is limited to use with Euclidean distances is investigated. In this paper, Ward's clustering algorithm is generalised for use with the l1 norm, that is, Manhattan distances. We argue that the generalisation of Ward's linkage method to incorporate Manhattan distances is theoretically sound and provide an example of where this method outperforms the method using Euclidean distances. As an application, we perform statistical analyses on languages using methods normally applied to biology and genetic classification. We aim to quantify differences in character traits between languages and use a statistical language signature based on relative bi-gram (sequence of two letters) frequencies to calculate a distance matrix between 32 Indo-European languages. We then use Ward's method of hierarchical clustering to classify the languages, using the Euclidean distance and the Manhattan distance. Results obtained from using the different distance metrics are compared to show that the characteristic property of Ward's algorithm, minimising intra-cluster variation and maximising inter-cluster variation, is not violated when using the Manhattan metric.
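The language signature described above, relative bi-gram frequencies compared under Manhattan and Euclidean distance, can be sketched as follows. This only illustrates the distance-matrix input to the clustering step, not Ward's method itself; the tokenisation (bi-grams counted within words, case-folded) and all function names are assumptions.

```python
from collections import Counter
from math import sqrt


def bigram_signature(text):
    """Relative frequencies of two-letter sequences, counted within words."""
    cleaned = "".join(c if c.isalpha() else " " for c in text.lower())
    counts = Counter(
        word[i:i + 2] for word in cleaned.split() for i in range(len(word) - 1)
    )
    total = sum(counts.values())
    if total == 0:
        raise ValueError("text contains no bi-grams")
    return {bigram: n / total for bigram, n in counts.items()}


def manhattan(sig_a, sig_b):
    """l1 distance between two signatures; missing bi-grams count as 0."""
    keys = set(sig_a) | set(sig_b)
    return sum(abs(sig_a.get(k, 0.0) - sig_b.get(k, 0.0)) for k in keys)


def euclidean(sig_a, sig_b):
    """l2 distance between two signatures; missing bi-grams count as 0."""
    keys = set(sig_a) | set(sig_b)
    return sqrt(sum((sig_a.get(k, 0.0) - sig_b.get(k, 0.0)) ** 2 for k in keys))


# Hypothetical snippets standing in for two language corpora.
english = bigram_signature("the quick brown fox jumps over the lazy dog")
german = bigram_signature("der schnelle braune fuchs springt ueber den faulen hund")
d_manhattan = manhattan(english, german)
d_euclidean = euclidean(english, german)
```

Because every signature sums to 1, the Manhattan distance between two signatures is bounded by 2, reached when the texts share no bi-grams at all.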
No clustering for linkage map based on low-copy and undermethylated microsatellites.
Zhou, Yi; Gwaze, David P; Reyes-Valdés, M Humberto; Bui, Thomas; Williams, Claire G
2003-10-01
Clustering has been reported for conifer genetic maps based on hypomethylated or low-copy molecular markers, resulting in uneven marker distribution. To test this, a framework genetic map was constructed from three types of microsatellites: low-copy, undermethylated, and genomic. These Pinus taeda L. microsatellites were mapped using a three-generation pedigree with 118 progeny. The microsatellites were highly informative; of the 32 markers in intercross configuration, 29 were segregating for three or four alleles in the progeny. The sex-averaged map placed 51 of the 95 markers in 15 linkage groups at LOD > 4.0. No clustering or uneven distribution across the genome was observed. The three types of P. taeda microsatellites were randomly dispersed within each linkage group. The 51 microsatellites covered a map distance of 795 cM, an average distance of 21.8 cM between markers, roughly half of the estimated total map length. The minimum and maximum distances between any two bins were 4.4 and 45.3 cM, respectively. These microsatellites provided anchor points for framework mapping for polymorphism in P. taeda and other closely related hard pines.
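The LOD threshold used to assign markers to linkage groups can be illustrated with the simplest two-point case, where recombinant and non-recombinant meioses can be counted directly. This is a sketch of the textbook phase-known formula only; an outbred three-generation pedigree like the one described requires a fuller likelihood, and the function name and example counts are hypothetical.

```python
from math import log10


def two_point_lod(recombinants, meioses, theta=None):
    """LOD = log10 L(theta) - log10 L(0.5) for phase-known, fully informative meioses.

    If theta is omitted, the maximum-likelihood estimate (capped at 0.5) is used.
    """
    if meioses <= 0 or not 0 <= recombinants <= meioses:
        raise ValueError("need 0 <= recombinants <= meioses and meioses > 0")
    if theta is None:
        theta = min(recombinants / meioses, 0.5)
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    non_recombinants = meioses - recombinants
    if theta == 0.0 and recombinants > 0:
        return float("-inf")  # any observed recombinant excludes theta = 0

    def log_likelihood(t):
        value = 0.0
        if recombinants:
            value += recombinants * log10(t)
        if non_recombinants:
            value += non_recombinants * log10(1.0 - t)
        return value

    return log_likelihood(theta) - log_likelihood(0.5)


# Hypothetical marker pair scored in 118 progeny with 10 recombinants.
lod = two_point_lod(10, 118)
same_group = lod > 4.0  # threshold quoted in the abstract
```

A LOD above 4.0 means the data are more than 10,000 times more likely under linkage at the estimated recombination fraction than under free recombination.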
Waldram, Alison; Dolan, Gayle; Ashton, Philip M; Jenkins, Claire; Dallman, Timothy J
2018-05-01
The unprecedented level of bacterial strain discrimination provided by whole genome sequencing (WGS) presents new challenges with respect to the utility and interpretation of the data. Whole genome sequences from 1445 isolates of Salmonella belonging to the most commonly identified serotypes in England and Wales isolated between April and August 2014 were analysed. Single linkage single nucleotide polymorphism thresholds at the 10, 5 and 0 level were explored for evidence of epidemiological links between clustered cases. Analysis of the WGS data organised 566 of the 1445 isolates into 32 clusters of five or more. A statistically significant epidemiological link was identified for 17 clusters. The clusters were associated with foreign travel (n = 8), consumption of Chinese takeaways (n = 4), chicken eaten at home (n = 2), and one each of the following; eating out, contact with another case in the home and contact with reptiles. In the same time frame, one cluster was detected using traditional outbreak detection methods. WGS can be used for the highly specific and highly sensitive detection of biologically related isolates when epidemiological links are obscured. Improvements in the collection of detailed, standardised exposure information would enhance cluster investigations. Copyright © 2017 Elsevier Ltd. All rights reserved.
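Single-linkage clustering at a SNP threshold can be sketched with a union-find structure: two isolates fall in the same cluster whenever a chain of pairwise links, each within the threshold, connects them. The sketch assumes pre-aligned, equal-length sequences compared position by position; the isolate names and sequences are hypothetical, and ambiguous bases are not handled.

```python
def snp_distance(seq_a, seq_b):
    """Number of positions at which two aligned sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(seq_a, seq_b) if x != y)


def single_linkage_clusters(isolates, threshold):
    """Group isolates linked, directly or through a chain, by <= threshold SNPs."""
    names = list(isolates)
    parent = {name: name for name in names}

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if snp_distance(isolates[a], isolates[b]) <= threshold:
                parent[find(a)] = find(b)  # merge the two clusters

    clusters = {}
    for name in names:
        clusters.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical aligned variant positions for four isolates.
isolates = {
    "iso1": "AAAAAAAAAA",
    "iso2": "AAAAAAAAAT",
    "iso3": "AAAAAAAATT",
    "iso4": "TTTTTTAAAA",
}
clusters_at_1 = single_linkage_clusters(isolates, threshold=1)
clusters_at_0 = single_linkage_clusters(isolates, threshold=0)
```

Here iso1 and iso3 differ by two SNPs yet share a cluster at threshold 1 because iso2 links them, which is the chaining behaviour that distinguishes single linkage from stricter criteria.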
Chassain, Benoît; Lemée, Ludovic; Didi, Jennifer; Thiberge, Jean-Michel; Brisse, Sylvain; Pons, Jean-Louis
2012-01-01
Staphylococcus lugdunensis is recognized as one of the major pathogenic species within the genus Staphylococcus, even though it belongs to the coagulase-negative group. A multilocus sequence typing (MLST) scheme was developed to study the genetic relationships and population structure of 87 S. lugdunensis isolates from various clinical and geographic sources by DNA sequence analysis of seven housekeeping genes (aroE, dat, ddl, gmk, ldh, recA, and yqiL). The number of alleles ranged from four (gmk and ldh) to nine (yqiL). Allelic profiles allowed the definition of 20 different sequence types (STs) and five clonal complexes. The 20 STs lacked correlation with geographic source. Isolates recovered from hematogenic infections (blood or osteoarticular isolates) or from skin and soft tissue infections did not cluster in separate lineages. Penicillin-resistant isolates clustered mainly in one clonal complex, unlike glycopeptide-tolerant isolates, which did not constitute a distinct subpopulation within S. lugdunensis. Phylogenies from the sequences of the seven individual housekeeping genes were congruent, indicating a predominantly mutational evolution of these genes. Quantitative analysis of the linkages between alleles from the seven loci revealed a significant linkage disequilibrium, thus confirming a clonal population structure for S. lugdunensis. This first MLST scheme for S. lugdunensis provides a new tool for investigating the macroepidemiology and phylogeny of this unusually virulent coagulase-negative Staphylococcus. PMID:22785196
Nurmi, Erika L; Dowd, Michael; Tadevosyan-Leyfer, Ovsanna; Haines, Jonathan L; Folstein, Susan E; Sutcliffe, James S
2003-07-01
Autism displays a remarkably high heritability but a complex genetic etiology. One approach to identifying susceptibility loci under these conditions is to define more homogeneous subsets of families on the basis of genetically relevant phenotypic or biological characteristics that vary from case to case. The authors performed a principal components analysis, using items from the Autism Diagnostic Interview, which resulted in six clusters of variables, five of which showed significant sib-sib correlation. The utility of these phenotypic subsets was tested in an exploratory genetic analysis of the autism candidate region on chromosome 15q11-q13. When the Collaborative Linkage Study of Autism sample was divided, on the basis of mean proband score for the "savant skills" cluster, the heterogeneity logarithm of the odds under a recessive model at D15S511, within the GABRB3 gene, increased from 0.6 to 2.6 in the subset of families in which probands had greater savant skills. These data are consistent with the genetic contribution of a 15q locus to autism susceptibility in a subset of affected individuals exhibiting savant skills. Similar types of skills have been noted in individuals with Prader-Willi syndrome, which results from deletions of this chromosomal region.
1988-01-01
We report the organization of the human genes encoding the complement components C4-binding protein (C4BP), C3b/C4b receptor (CR1), decay accelerating factor (DAF), and C3dg receptor (CR2) within the regulator of complement activation (RCA) gene cluster. Using pulsed field gel electrophoresis analysis these genes have been physically linked and aligned as CR1-CR2-DAF-C4BP in an 800-kb DNA segment. The very tight linkage between the CR1 and the C4BP loci, contrasted with the relative long DNA distance between these genes, suggests the existence of mechanisms interfering with recombination within the RCA gene cluster. PMID:2450163
Next-Generation Sequencing of Coccidioides immitis Isolated during Cluster Investigation
Engelthaler, David M.; Chiller, Tom; Schupp, James A.; Colvin, Joshua; Beckstrom-Sternberg, Stephen M.; Driebe, Elizabeth M.; Moses, Tracy; Tembe, Waibhav; Sinari, Shripad; Beckstrom-Sternberg, James S.; Christoforides, Alexis; Pearson, John V.; Carpten, John; Keim, Paul; Peterson, Ashley; Terashita, Dawn
2011-01-01
Next-generation sequencing enables use of whole-genome sequence typing (WGST) as a viable and discriminatory tool for genotyping and molecular epidemiologic analysis. We used WGST to confirm the linkage of a cluster of Coccidioides immitis isolates from 3 patients who received organ transplants from a single donor who later had positive test results for coccidioidomycosis. Isolates from the 3 patients were nearly genetically identical (a total of 3 single-nucleotide polymorphisms identified among them), thereby demonstrating direct descent of the 3 isolates from an original isolate. We used WGST to demonstrate the genotypic relatedness of C. immitis isolates that were also epidemiologically linked. Thus, WGST offers unique benefits to public health for investigation of clusters considered to be linked to a single source. PMID:21291593
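The idea of judging relatedness by counting single-nucleotide differences between isolates can be sketched as follows. This is a toy illustration on short, already aligned strings; the sequences are invented, and real whole-genome typing involves read mapping and variant filtering that are not shown here.

```python
from itertools import combinations
from typing import Dict, Tuple


def snp_distance(a: str, b: str) -> int:
    """Count aligned positions where both sequences have an unambiguous, different base."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for x, y in zip(a, b)
        if x != y and x in "ACGT" and y in "ACGT"  # skip N and gap characters
    )


def pairwise_snps(isolates: Dict[str, str]) -> Dict[Tuple[str, str], int]:
    """SNP distance for every pair of isolates, keyed by sorted name pair."""
    return {
        (m, n): snp_distance(isolates[m], isolates[n])
        for m, n in combinations(sorted(isolates), 2)
    }


# Invented toy alignment (not data from the study).
toy = {
    "patient_A": "ACGTACGTAC",
    "patient_B": "ACGTACGTAT",
    "patient_C": "ACGAACGTAC",
}
distances = pairwise_snps(toy)
print(distances)
```

Very small pairwise counts across a whole genome, as in the three transplant recipients, are what one would expect when isolates descend from a single recent source.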
Laursen, Jens; Milman, Nils; Pind, Niels; Pedersen, Henrik; Mulvad, Gert
2014-01-01
Meta-analysis of previous studies evaluating associations between content of elements sulphur (S), chlorine (Cl), potassium (K), iron (Fe), copper (Cu), zinc (Zn) and bromine (Br) in normal and cirrhotic autopsy liver tissue samples. Normal liver samples from 45 Greenlandic Inuit, median age 60 years and from 71 Danes, median age 61 years. Cirrhotic liver samples from 27 Danes, median age 71 years. Element content was measured using X-ray fluorescence spectrometry. Dual hierarchical clustering analysis, creating a dual dendrogram, one clustering element contents according to calculated similarities, one clustering elements according to correlation coefficients between the element contents, both using Euclidean distance and Ward's procedure. One dendrogram separated subjects into 7 clusters showing no differences in ethnicity, gender or age. The analysis discriminated between elements in normal and cirrhotic livers. The other dendrogram grouped the elements into four clusters: sulphur and chlorine; copper and bromine; potassium and zinc; iron. There were significant correlations between the elements in normal liver samples: S was associated with Cl, K, Br and Zn; Cl with S and Br; K with S, Br and Zn; Cu with Br; Zn with S and K; Br with S, Cl, K and Cu. Fe did not show significant associations with any other element. In contrast to simple statistical methods, which analyse the content of elements separately one by one, dual hierarchical clustering analysis incorporates all elements at the same time and can be used to examine the linkage and interplay between multiple elements in tissue samples. Copyright © 2013 Elsevier GmbH. All rights reserved.
HIV-1 transmission linkage in an HIV-1 prevention clinical trial
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leitner, Thomas; Campbell, Mary S; Mullins, James I
2009-01-01
HIV-1 sequencing has been used extensively in epidemiologic and forensic studies to investigate patterns of HIV-1 transmission. However, the criteria for establishing genetic linkage between HIV-1 strains in HIV-1 prevention trials have not been formalized. The Partners in Prevention HSV/HIV Transmission Study (ClinicalTrials.gov NCT00194519) enrolled 3408 HIV-1 serodiscordant heterosexual African couples to determine the efficacy of genital herpes suppression with acyclovir in reducing HIV-1 transmission. The trial analysis required laboratory confirmation of HIV-1 linkage between enrolled partners in couples in which seroconversion occurred. Here we describe the process and results from HIV-1 sequencing studies used to perform transmission linkage determination in this clinical trial. Consensus Sanger sequencing of env (C2-V3-C3) and gag (p17-p24) genes was performed on plasma HIV-1 RNA from both partners within 3 months of seroconversion; env single molecule or pyrosequencing was also performed in some cases. For linkage, we required monophyletic clustering between HIV-1 sequences in the transmitting and seroconverting partners, and developed a Bayesian algorithm using genetic distances to evaluate the posterior probability of linkage of participants' sequences. Adjudicators classified transmissions as linked, unlinked, or indeterminate. Among 151 seroconversion events, we found 108 (71.5%) linked, 40 (26.5%) unlinked, and 3 (2.0%) to have indeterminate transmissions. Nine (8.3%) were linked by consensus gag sequencing only and 8 (7.4%) required deep sequencing of env. In this first use of HIV-1 sequencing to establish endpoints in a large clinical trial, more than one-fourth of transmissions were unlinked to the enrolled partner, illustrating the relevance of these methods in the design of future HIV-1 prevention trials in serodiscordant couples. A hierarchy of sequencing techniques, analysis methods, and expert adjudication contributed to the linkage determination process.
Viral Linkage in HIV-1 Seroconverters and Their Partners in an HIV-1 Prevention Clinical Trial
Campbell, Mary S.; Mullins, James I.; Hughes, James P.; Celum, Connie; Wong, Kim G.; Raugi, Dana N.; Sorensen, Stefanie; Stoddard, Julia N.; Zhao, Hong; Deng, Wenjie; Kahle, Erin; Panteleeff, Dana; Baeten, Jared M.; McCutchan, Francine E.; Albert, Jan; Leitner, Thomas; Wald, Anna; Corey, Lawrence; Lingappa, Jairam R.
2011-01-01
Background Characterization of viruses in HIV-1 transmission pairs will help identify biological determinants of infectiousness and evaluate candidate interventions to reduce transmission. Although HIV-1 sequencing is frequently used to substantiate linkage between newly HIV-1 infected individuals and their sexual partners in epidemiologic and forensic studies, viral sequencing is seldom applied in HIV-1 prevention trials. The Partners in Prevention HSV/HIV Transmission Study (ClinicalTrials.gov #NCT00194519) was a prospective randomized placebo-controlled trial that enrolled serodiscordant heterosexual couples to determine the efficacy of genital herpes suppression in reducing HIV-1 transmission; as part of the study analysis, HIV-1 sequences were examined for genetic linkage between seroconverters and their enrolled partners. Methodology/Principal Findings We obtained partial consensus HIV-1 env and gag sequences from blood plasma for 151 transmission pairs and performed deep sequencing of env in some cases. We analyzed sequences with phylogenetic techniques and developed a Bayesian algorithm to evaluate the probability of linkage. For linkage, we required monophyletic clustering between enrolled partners' sequences and a Bayesian posterior probability of ≥50%. Adjudicators classified each seroconversion, finding 108 (71.5%) linked, 40 (26.5%) unlinked, and 3 (2.0%) indeterminate transmissions, with linkage determined by consensus env sequencing in 91 (84%). Male seroconverters had a higher frequency of unlinked transmissions than female seroconverters. The likelihood of transmission from the enrolled partner was related to time on study, with increasing numbers of unlinked transmissions occurring after longer observation periods. Finally, baseline viral load was found to be significantly higher among linked transmitters. 
Conclusions/Significance In this first use of HIV-1 sequencing to establish endpoints in a large clinical trial, more than one-fourth of transmissions were unlinked to the enrolled partner, illustrating the relevance of these methods in the design of future HIV-1 prevention trials in serodiscordant couples. A hierarchy of sequencing techniques, analysis methods, and expert adjudication contributed to the linkage determination process. PMID:21399681
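The linkage rule stated in this abstract (monophyletic clustering plus a Bayesian posterior probability of at least 50%) can be sketched as a two-hypothesis Bayes calculation. The likelihood values, the 0.5 prior, and the function names below are hypothetical; the abstract does not describe how the study's algorithm derived likelihoods from genetic distances, and final classifications in the trial were made by adjudicators rather than by a rule alone.

```python
def posterior_linked(lik_linked: float, lik_unlinked: float,
                     prior_linked: float = 0.5) -> float:
    """Posterior probability of linkage from two likelihoods via Bayes' rule."""
    numerator = lik_linked * prior_linked
    denominator = numerator + lik_unlinked * (1.0 - prior_linked)
    if denominator == 0:
        raise ValueError("both hypotheses have zero probability")
    return numerator / denominator


def meets_linkage_criteria(monophyletic: bool, posterior: float,
                           threshold: float = 0.5) -> bool:
    """Both conditions named in the abstract must hold."""
    return monophyletic and posterior >= threshold


# Hypothetical likelihoods of an observed genetic distance under each hypothesis.
post = posterior_linked(lik_linked=0.08, lik_unlinked=0.02)
print(round(post, 3), meets_linkage_criteria(True, post))

# Re-deriving the reported percentages from the reported counts.
counts = {"linked": 108, "unlinked": 40, "indeterminate": 3}
total = sum(counts.values())
percent = {k: round(100 * v / total, 1) for k, v in counts.items()}
print(percent)
```

The percentage check reproduces the figures quoted in the abstract from the counts alone: 71.5%, 26.5%, and 2.0% of 151 seroconversion events.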
Genome Scan Meta-Analysis of Schizophrenia and Bipolar Disorder, Part II: Schizophrenia
Lewis, Cathryn M.; Levinson, Douglas F.; Wise, Lesley H.; DeLisi, Lynn E.; Straub, Richard E.; Hovatta, Iiris; Williams, Nigel M.; Schwab, Sibylle G.; Pulver, Ann E.; Faraone, Stephen V.; Brzustowicz, Linda M.; Kaufmann, Charles A.; Garver, David L.; Gurling, Hugh M. D.; Lindholm, Eva; Coon, Hilary; Moises, Hans W.; Byerley, William; Shaw, Sarah H.; Mesen, Andrea; Sherrington, Robin; O’Neill, F. Anthony; Walsh, Dermot; Kendler, Kenneth S.; Ekelund, Jesper; Paunio, Tiina; Lönnqvist, Jouko; Peltonen, Leena; O’Donovan, Michael C.; Owen, Michael J.; Wildenauer, Dieter B.; Maier, Wolfgang; Nestadt, Gerald; Blouin, Jean-Louis; Antonarakis, Stylianos E.; Mowry, Bryan J.; Silverman, Jeremy M.; Crowe, Raymond R.; Cloninger, C. Robert; Tsuang, Ming T.; Malaspina, Dolores; Harkavy-Friedman, Jill M.; Svrakic, Dragan M.; Bassett, Anne S.; Holcomb, Jennifer; Kalsi, Gursharan; McQuillin, Andrew; Brynjolfson, Jon; Sigmundsson, Thordur; Petursson, Hannes; Jazin, Elena; Zoëga, Tomas; Helgason, Tomas
2003-01-01
Schizophrenia is a common disorder with high heritability and a 10-fold increase in risk to siblings of probands. Replication has been inconsistent for reports of significant genetic linkage. To assess evidence for linkage across studies, rank-based genome scan meta-analysis (GSMA) was applied to data from 20 schizophrenia genome scans. Each marker for each scan was assigned to 1 of 120 30-cM bins, with the bins ranked by linkage scores (1 = most significant) and the ranks averaged across studies (Ravg) and then weighted for sample size ($\sqrt{N[\text{affected cases}]}$). A permutation test was used to compute the probability of observing, by chance, each bin’s average rank (PAvgRnk) or of observing it for a bin with the same place (first, second, etc.) in the order of average ranks in each permutation (Pord). The GSMA produced significant genomewide evidence for linkage on chromosome 2q (PAvgRnk<.000417). Two aggregate criteria for linkage were also met (clusters of nominally significant P values that did not occur in 1,000 replicates of the entire data set with no linkage present): 12 consecutive bins with both PAvgRnk and Pord<.05, including regions of chromosomes 5q, 3p, 11q, 6p, 1q, 22q, 8p, 20q, and 14p, and 19 consecutive bins with Pord<.05, additionally including regions of chromosomes 16q, 18q, 10p, 15q, 6q, and 17q. There is greater consistency of linkage results across studies than has been previously recognized. The results suggest that some or all of these regions contain loci that increase susceptibility to schizophrenia in diverse populations. PMID:12802786
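The rank-averaging and permutation steps of the GSMA described above can be sketched on toy data. This is an unweighted simplification: it omits the sample-size weighting and the Pord statistic, uses 5 bins and 3 scans instead of 120 bins and 20 scans, and all scores are invented. The permutation step relies on the observation that, with no linkage, shuffling ranks within a scan makes a given bin's rank uniform on 1..n_bins.

```python
import random
from typing import List


def rank_bins(scores: List[float]) -> List[int]:
    """Rank bins within one scan; rank 1 = strongest linkage score."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0] * len(scores)
    for rank, bin_index in enumerate(order, start=1):
        ranks[bin_index] = rank
    return ranks


def average_ranks(rank_table: List[List[int]]) -> List[float]:
    """Average each bin's rank across scans (unweighted)."""
    n_scans = len(rank_table)
    return [sum(column) / n_scans for column in zip(*rank_table)]


def p_avg_rank(rank_table: List[List[int]], bin_index: int,
               n_perm: int = 2000, seed: int = 0) -> float:
    """Permutation estimate of P(average rank <= observed) with no linkage."""
    rng = random.Random(seed)
    n_scans = len(rank_table)
    n_bins = len(rank_table[0])
    observed = average_ranks(rank_table)[bin_index]
    hits = 0
    for _ in range(n_perm):
        # One random rank per scan stands in for shuffling that scan's ranks.
        total = sum(rng.randint(1, n_bins) for _ in range(n_scans))
        if total / n_scans <= observed:
            hits += 1
    return hits / n_perm


# Invented linkage scores: 3 scans x 5 bins.
scores = [
    [3.1, 0.2, 1.0, 0.5, 0.1],
    [2.0, 0.3, 2.5, 0.1, 0.4],
    [1.8, 0.9, 0.2, 0.1, 0.3],
]
rank_table = [rank_bins(s) for s in scores]
avg = average_ranks(rank_table)
p_first = p_avg_rank(rank_table, 0)
p_fourth = p_avg_rank(rank_table, 3)
print(rank_table, avg, p_first, p_fourth)
```

A bin that ranks near the top in most scans, like the first bin here, gets a small permutation probability even when no single scan is decisive, which is the property the abstract draws on.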
Barral, Sandra; Cheng, Rong; Reitz, Christiane; Vardarajan, Badri; Lee, Joseph; Kunkle, Brian; Beecham, Gary; Cantwell, Laura S; Pericak-Vance, Margaret A; Farrer, Lindsay A; Haines, Jonathan L; Goate, Alison M; Foroud, Tatiana; Boerwinkle, Eric; Schellenberg, Gerard D; Mayeux, Richard
2015-12-01
We performed linkage analyses in Caribbean Hispanic families with multiple late-onset Alzheimer's disease (LOAD) cases to identify regions that may contain disease causative variants. We selected 67 LOAD families to perform genome-wide linkage scan. Analysis of the linked regions was repeated using the entire sample of 282 families. Validated chromosomal regions were analyzed using joint linkage and association. We identified 26 regions linked to LOAD (HLOD ≥3.6). We validated 13 of the regions (HLOD ≥2.5) using the entire family sample. The strongest signal was at 11q12.3 (rs2232932: HLODmax = 4.7, Pjoint = 6.6 × 10(-6)), a locus located ∼2 Mb upstream of the membrane-spanning 4A gene cluster. We additionally identified a locus at 7p14.3 (rs10255835: HLODmax = 4.9, Pjoint = 1.2 × 10(-5)), a region harboring genes associated with the nervous system (GARS, GHRHR, and NEUROD6). Future sequencing efforts should focus on these regions because they may harbor familial LOAD causative mutations. Copyright © 2015 The Alzheimer's Association. Published by Elsevier Inc. All rights reserved.
Linkage Analysis in Autoimmune Addison's Disease: NFATC1 as a Potential Novel Susceptibility Locus.
Mitchell, Anna L; Bøe Wolff, Anette; MacArthur, Katie; Weaver, Jolanta U; Vaidya, Bijay; Erichsen, Martina M; Darlay, Rebecca; Husebye, Eystein S; Cordell, Heather J; Pearce, Simon H S
2015-01-01
Autoimmune Addison's disease (AAD) is a rare, highly heritable autoimmune endocrinopathy. It is possible that there may be some highly penetrant variants which confer disease susceptibility that have yet to be discovered. DNA samples from 23 multiplex AAD pedigrees from the UK and Norway (50 cases, 67 controls) were genotyped on the Affymetrix SNP 6.0 array. Linkage analysis was performed using Merlin. EMMAX was used to carry out a genome-wide association analysis comparing the familial AAD cases to 2706 UK WTCCC controls. To explore some of the linkage findings further, a replication study was performed by genotyping 64 SNPs in two of the four linked regions (chromosomes 7 and 18), on the Sequenom iPlex platform in three European AAD case-control cohorts (1097 cases, 1117 controls). The data were analysed using a meta-analysis approach. In a parametric analysis, applying a rare dominant model, loci on chromosomes 7, 9 and 18 had LOD scores >2.8. In a non-parametric analysis, a locus corresponding to the HLA region on chromosome 6, known to be associated with AAD, had a LOD score >3.0. In the genome-wide association analysis, a SNP cluster on chromosome 2 and a pair of SNPs on chromosome 6 were associated with AAD (P < 5 × 10⁻⁷). A meta-analysis of the replication study data demonstrated that three chromosome 18 SNPs were associated with AAD, including a non-synonymous variant in the NFATC1 gene. This linkage study has implicated a number of novel chromosomal regions in the pathogenesis of AAD in multiplex AAD families and adds further support to the role of HLA in AAD. The genome-wide association analysis has also identified a region of interest on chromosome 2. A replication study has demonstrated that the NFATC1 gene is worthy of future investigation; however, each of the regions identified requires further, systematic analysis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thein, S.L.; Weatherall, D.J.; Sampietro, M.
“Heterocellular hereditary persistence of fetal hemoglobin” (HPFH) is the term used to describe the genetically determined persistence of fetal hemoglobin (Hb F) production into adult life, in the absence of any related hematological disorder. Whereas some forms are caused by mutations in the β-globin gene cluster on chromosome 11, others segregate independently. While the latter are of particular interest with respect to the regulation of globin gene switching, it has not been possible to determine their chromosomal location, mainly because their mode of inheritance is not clear, but also because several other factors are known to modify Hb F production. The authors have examined a large Asian Indian pedigree which includes individuals with heterocellular HPFH associated with β-thalassemia and/or α-thalassemia. Segregation analysis was conducted on the HPFH trait FC, defined to be the percentage of Hb F-containing cells (F-cells), using the class D regressive model. The results provide evidence for the presence of a major gene, dominant or codominant, which controls the FC values with residual familial correlations. The major gene was detected when the effects of genetic modifiers, notably β-thalassemia and the XmnI-ᴳγ polymorphism, are accounted for in this analysis. Linkage with the β-globin gene cluster is excluded. The transmission of the FC values in this pedigree is informative enough to allow detection of linkage with an appropriate marker(s). The analytical approach outlined in this study, using simple regression to allow for genetic modifiers and thus allowing the mode of inheritance of a trait to be dissected out, may be useful as a model for segregation and linkage analyses of other complex phenotypes. 39 refs., 4 figs., 6 tabs.
Campa, Ana; Giraldez, Ramón; Ferreira, Juan José
2009-06-01
Resistance to nine races of the pathogenic fungus Colletotrichum lindemuthianum, causal agent of anthracnose, was evaluated in F(3) families derived from the cross between the anthracnose differential bean cultivars TU (resistant to races 3, 6, 7, 31, 38, 39, 102, and 449) and MDRK (resistant to races 449 and 1545). Molecular marker analyses were carried out in the F(2) individuals in order to map and characterize the anthracnose resistance genes or gene clusters present in these two differential cultivars. The results of the combined segregation indicate that at least three independent loci conferring resistance to anthracnose are present in TU. One of them, corresponding to the previously described anthracnose resistance locus Co-5, is located in linkage group B7, and is formed by a cluster of different genes conferring specific resistance to races 3, 6, 7, 31, 38, 39, 102, and 449. Evidence of intra-cluster recombination between these specific resistance genes was found. The second locus present in TU confers specific resistance to races 31 and 102, and the third locus confers specific resistance to race 102; the location of these two loci remains unknown. The resistance to race 1545 present in MDRK is due to two independent dominant genes. The results of the combined segregation of two F(4) families showing monogenic segregation for resistance to race 1545 indicate that one of these two genes is linked to marker OF10(530), located in linkage group B1, and corresponds to the previously described anthracnose resistance locus Co-1. The second gene conferring resistance to race 1545 in MDRK is linked to marker Pv-ctt001, located in linkage group B4, and corresponds to the Co-3/Co-9 cluster. The resistance to race 449 present in MDRK is conferred by a single gene, located in linkage group B4, probably included in the same Co-3/Co-9 cluster.
Conservation of gene linkage in dispersed vertebrate NK homeobox clusters.
Wotton, Karl R; Weierud, Frida K; Juárez-Morales, José L; Alvares, Lúcia E; Dietrich, Susanne; Lewis, Katharine E
2009-10-01
Nk homeobox genes are important regulators of many different developmental processes including muscle, heart, central nervous system and sensory organ development. They are thought to have arisen as part of the ANTP megacluster, which also gave rise to Hox and ParaHox genes, and at least some NK genes remain tightly linked in all animals examined so far. The protostome-deuterostome ancestor probably contained a cluster of nine Nk genes: (Msx)-(Nk4/tinman)-(Nk3/bagpipe)-(Lbx/ladybird)-(Tlx/c15)-(Nk7)-(Nk6/hgtx)-(Nk1/slouch)-(Nk5/Hmx). Of these genes, only NKX2.6-NKX3.1, LBX1-TLX1 and LBX2-TLX2 remain tightly linked in humans. However, it is currently unclear whether this is unique to the human genome as we do not know which of these Nk genes are clustered in other vertebrates. This makes it difficult to assess whether the remaining linkages are due to selective pressures or because chance rearrangements have "missed" certain genes. In this paper, we identify all of the paralogs of these ancestrally clustered NK genes in several distinct vertebrates. We demonstrate that tight linkages of Lbx1-Tlx1, Lbx2-Tlx2 and Nkx3.1-Nkx2.6 have been widely maintained in both the ray-finned and lobe-finned fish lineages. Moreover, the recently duplicated Hmx2-Hmx3 genes are also tightly linked. Finally, we show that Lbx1-Tlx1 and Hmx2-Hmx3 are flanked by highly conserved noncoding elements, suggesting that shared regulatory regions may have resulted in evolutionary pressure to maintain these linkages. Consistent with this, these pairs of genes have overlapping expression domains. In contrast, Lbx2-Tlx2 and Nkx3.1-Nkx2.6, which do not seem to be coexpressed, are also not associated with conserved noncoding sequences, suggesting that an alternative mechanism may be responsible for the continued clustering of these genes.
Bouzigon, Emmanuelle; Dizier, Marie-Hélène; Krähenbühl, Christine; Lemainque, Arnaud; Annesi-Maesano, Isabella; Betard, Christine; Bousquet, Jean; Charpin, Denis; Gormand, Frédéric; Guilloud-Bataille, Michel; Just, Jocelyne; Le Moual, Nicole; Maccario, Jean; Matran, Régis; Neukirch, Françoise; Oryszczyn, Marie-Pierre; Paty, Evelyne; Pin, Isabelle; Rosenberg-Bourgin, Myriam; Vervloet, Daniel; Kauffmann, Francine; Lathrop, Mark; Demenais, Florence
2004-12-15
A genome-wide scan for asthma phenotypes was conducted in the whole sample of 295 EGEA families selected through at least one asthmatic subject. In addition to asthma, seven phenotypes involved in the main asthma physiopathological pathways were considered: SPT (positive skin prick test response to at least one of 11 allergens), SPTQ score being the number of positive skin test responses to 11 allergens, Phadiatop (positive specific IgE response to a mixture of allergens), total IgE levels, eosinophils, bronchial responsiveness (BR) to methacholine challenge and %predicted FEV(1). Four regions showed evidence for linkage (P=0.001): 6q14 for %FEV(1), 12p13 for IgE, 17q22-q24 for SPT and 21q21 for both SPTQ and %FEV(1). Nine other regions indicated smaller linkage signals (0.001
Molecular epidemiological study of HIV-1 CRF01_AE transmission in Hong Kong.
Chen, J H K; Wong, K H; Li, P; Chan, K C; Lee, M P; Lam, H Y; Cheng, V C C; Yuen, K Y; Yam, W C
2009-08-15
The objective of this study was to investigate the transmission history of the HIV-1 CRF01_AE epidemics in Hong Kong between 1994 and 2007. A total of 465 HIV-1 CRF01_AE pol sequences were derived from an in-house or a commercial HIV-1 genotyping system. Phylogenies of CRF01_AE sequences were analyzed by the Bayesian coalescent method. CRF01_AE patient population included 363 males (78.1%) and 102 females (21.9%), whereas 65% (314 of 465) were local Chinese. Major transmission routes were heterosexual contact (63%), followed by intravenous drug use (IDU) (19%) and men having sex with men (MSM) (17%). From phylogenetic analysis, local CRF01_AE strains were from multiple origins with 3 separate transmission clusters identified. Cluster 1 consisted mainly of Chinese male IDUs and heterosexuals. Clusters 2 and 3 included mainly local Chinese MSM and non-Chinese Asian IDUs, respectively. Chinese reference isolates available from China (Fujian, Guangxi, or Liaoning) were clonally related to our transmission clusters, demonstrating the epidemiological linkage of CRF01_AE infections between Hong Kong and China. The 3 individual local transmission clusters were estimated to have initiated since late 1980s and late 1990s, causing subsequent epidemics in the early 2000s. This is the first comprehensive molecular epidemiological study of HIV-1 CRF01_AE in Hong Kong. It revealed that MSM contact is becoming a major route of local CRF01_AE transmission in Hong Kong. Epidemiological linkage of CRF01_AE between Hong Kong and China observed in this study indicates the importance of regular molecular epidemiological surveillance for the HIV-1 epidemic in our region.
A Study of the 5S Ribosomal RNAs of the Vibrionaceae
1984-01-01
codon (UAA, UAG, or UGA) TBE Tris-borate-EDTA buffer ug microgram, i.e., 10^-6 gram ul microliter, i.e., 10^-6 liter UPG unweighted pair-group UPGMA ... 5b. The UPGMA, or UPG average linkage, dendrogram resulting from the...cluster, and the V. damsela - V. anguillarum doublet are identical to that predicted by UPGMA analysis. C. CONSERVED AND HYPERVARIABLE REGIONS As
Parker, Heidi G.; Kukekova, Anna V.; Akey, Dayna T.; Goldstein, Orly; Kirkness, Ewen F.; Baysac, Kathleen C.; Mosher, Dana S.; Aguirre, Gustavo D.; Acland, Gregory M.; Ostrander, Elaine A.
2007-01-01
The features of modern dog breeds that increase the ease of mapping common diseases, such as reduced heterogeneity and extensive linkage disequilibrium, may also increase the difficulty associated with fine mapping and identifying causative mutations. One way to address this problem is by combining data from multiple breeds segregating the same trait after initial linkage has been determined. The multibreed approach increases the number of potentially informative recombination events and reduces the size of the critical haplotype by taking advantage of shortened linkage disequilibrium distances found across breeds. In order to identify breeds that likely share a trait inherited from the same ancestral source, we have used cluster analysis to divide 132 breeds of dog into five primary breed groups. We then use the multibreed approach to fine-map Collie eye anomaly (cea), a complex disorder of ocular development that was initially mapped to a 3.9-cM region on canine chromosome 37. Combined genotypes from affected individuals from four breeds of a single breed group significantly narrowed the candidate gene region to a 103-kb interval spanning only four genes. Sequence analysis revealed that all affected dogs share a homozygous deletion of 7.8 kb in the NHEJ1 gene. This intronic deletion spans a highly conserved binding domain to which several developmentally important proteins bind. This work both establishes that the primary cea mutation arose as a single disease allele in a common ancestor of herding breeds as well as highlights the value of comparative population analysis for refining regions of linkage. PMID:17916641
Principal Component and Linkage Analysis of Cardiovascular Risk Traits in the Norfolk Isolate
Cox, Hannah C.; Bellis, Claire; Lea, Rod A.; Quinlan, Sharon; Hughes, Roger; Dyer, Thomas; Charlesworth, Jac; Blangero, John; Griffiths, Lyn R.
2009-01-01
Objective(s) An individual's risk of developing cardiovascular disease (CVD) is influenced by genetic factors. This study focussed on mapping genetic loci for CVD-risk traits in a unique population isolate derived from Norfolk Island. Methods This investigation focussed on 377 individuals descended from the population founders. Principal component analysis was used to extract orthogonal components from 11 cardiovascular risk traits. Multipoint variance component methods implemented in SOLAR were used to assess genome-wide linkage to the derived factors. A total of 285 of the 377 related individuals were informative for linkage analysis. Results A total of 4 principal components accounting for 83% of the total variance were derived. Principal component 1 was loaded with body size indicators; principal component 2 with body size, cholesterol and triglyceride levels; principal component 3 with the blood pressures; and principal component 4 with LDL-cholesterol and total cholesterol levels. Suggestive evidence of linkage for principal component 2 (h2 = 0.35) was observed on chromosome 5q35 (LOD = 1.85; p = 0.0008). Peak regions on chromosome 10p11.2 (LOD = 1.27; p = 0.005) and 12q13 (LOD = 1.63; p = 0.003) were observed to segregate with principal components 1 (h2 = 0.33) and 4 (h2 = 0.42), respectively. Conclusion(s): This study investigated a number of CVD risk traits in a unique isolated population. Findings support the clustering of CVD risk traits and provide interesting evidence of a region on chromosome 5q35 segregating with weight, waist circumference, HDL-c and total triglyceride levels. PMID:19339786
Adsorption of small molecules on the [Zn-Zn]2+ linkage in zeolite. A DFT study of ferrierite
NASA Astrophysics Data System (ADS)
Benco, Lubomir
2017-02-01
In zeolites, monovalent Zn(I) forms sub-nano [Zn-Zn]2+ particles stabilized in rings of the zeolite framework, which exhibit interesting catalytic properties. This work reports on adsorption properties of [Zn-Zn]2+ particles in zeolite ferrierite investigated for a set of probing diatomic (N2, O2, H2, CO, NO) and triatomic (CO2, N2O, NO2, H2O) molecules using dispersion-corrected DFT. Three [Zn-Zn]2+ sites differing in location and stability are compared. On all sites, molecules form physisorbed clusters with the molecule bound on top of the Zn-Zn linkage. In physisorbed clusters, adsorption induces only slight changes in the bonding and geometry of the Zn-Zn linkage. Some molecules can form stable chemisorbed clusters in which the molecule is integrated between two Zn+ cations. The sandwich-like chemisorption causes pronounced changes of bonding and can lead to the transfer of electron density between the two Zn+ cations and to a change of the oxidation state. Knowledge of the bonding of small molecules can help in understanding the mechanism of conversion reactions catalyzed by sub-nano [Zn-Zn] particles.
The ties that bind: interorganizational linkages and physician-system alignment.
Alexander, J A; Waters, T M; Burns, L R; Shortell, S M; Gillies, R R; Budetti, P P; Zuckerman, H S
2001-07-01
To examine the association between the degree of alignment between physicians and health care systems, and interorganizational linkages between physician groups and health care systems. The study used a cross sectional, comparative analysis using a sample of 1,279 physicians practicing in loosely affiliated arrangements and 1,781 physicians in 61 groups closely affiliated with 14 vertically integrated health systems. Measures of physician alignment were based on multiitem scales validated in previous studies and derived from surveys sent to individual physicians. Measures of interorganizational linkages were specified at the institutional, administrative, and technical core levels of the physician group and were developed from surveys sent to the administrator of each of the 61 physician groups in the sample. Two stage Heckman models with fixed effects adjustments in the second stage were used to correct for sample selection and clustering respectively. After accounting for sample selection, fixed effects, and group and individual controls, physicians in groups with more valued practice service linkages display consistently higher alignment with systems than physicians in groups that have fewer such linkages. Results also suggest that centralized administrative control lowers physician-system alignment for selected measures of alignment. Governance interlocks exhibited only weak associations with alignment. Our findings suggest that alignment generally follows resource exchanges that promote value-added contributions to physicians and physician groups while preserving control and authority within the group.
Im, Chak Han; Park, Young-Hoon; Hammel, Kenneth E; Park, Bokyung; Kwon, Soon Wook; Ryu, Hojin; Ryu, Jae-San
2016-07-01
Breeding new strains with improved traits is a long-standing goal of mushroom breeders that can be expedited by marker-assisted selection (MAS). We constructed a genetic linkage map of Pleurotus eryngii based on segregation analysis of markers in postmeiotic monokaryons from KNR2312. In total, 256 loci comprising 226 simple sequence-repeat (SSR) markers, 2 mating-type factors, and 28 insertion/deletion (InDel) markers were mapped. The map consisted of 12 linkage groups (LGs) spanning 1047.8cM, with an average interval length of 4.09cM. Four independent populations (Pd3, Pd8, Pd14, and Pd15) derived from crossing between four monokaryons from KNR2532 as a tester strain and 98 monokaryons from KNR2312 were used to characterize quantitative trait loci (QTL) for nine traits such as yield, quality, cap color, and earliness. Using composite interval mapping (CIM), 71 QTLs explaining between 5.82% and 33.17% of the phenotypic variations were identified. Clusters of more than five QTLs for various traits were identified in three genomic regions, on LGs 1, 7 and 9. Regardless of the population, 6 of the 9 traits studied and 18 of the 71 QTLs found in this study were identified in the largest cluster, LG1, in the range from 65.4 to 110.4cM. The candidate genes for yield encoding transcription factor, signal transduction, mycelial growth and hydrolase are suggested by using manual and computational analysis of genome sequence corresponding to QTL region with the highest likelihood odds (LOD) for yield. The genetic map and the QTLs established in this study will help breeders and geneticists to develop selection markers for agronomically important characteristics of mushrooms and to identify the corresponding genes. Copyright © 2016 Elsevier Inc. All rights reserved.
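The map summary figures in this abstract can be checked with a few lines of arithmetic. The sketch below uses only numbers quoted in the abstract. The second calculation, which divides by the number of marker intervals (loci minus linkage groups), is an assumption added for comparison and is not stated by the authors.

```python
# Figures quoted in the abstract.
ssr_markers = 226
mating_type_factors = 2
indel_markers = 28
total_length_cM = 1047.8
n_linkage_groups = 12

# Total mapped loci: should match the 256 reported.
n_loci = ssr_markers + mating_type_factors + indel_markers

# Map length per locus; this reproduces the reported 4.09 cM.
cM_per_locus = total_length_cM / n_loci

# Alternative convention (assumption): each linkage group with k loci has k - 1 intervals.
n_intervals = n_loci - n_linkage_groups
cM_per_interval = total_length_cM / n_intervals

print(n_loci, round(cM_per_locus, 2), round(cM_per_interval, 2))
```

The reported average of 4.09 cM matches total length divided by the number of loci; dividing by the number of intervals instead would give a slightly larger value.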
Valentini, Giseli; Gonçalves-Vidigal, Maria Celeste; Hurtado-Gonzales, Oscar P; de Lima Castro, Sandra Aparecida; Cregan, Perry B; Song, Qijian; Pastor-Corrales, Marcial A
2017-08-01
Co-segregation analysis and high-throughput genotyping using SNP, SSR, and KASP markers demonstrated genetic linkage between Ur-14 and Co-3⁴/Phg-3 loci conferring resistance to the rust, anthracnose and angular leaf spot diseases of common bean. Rust, anthracnose, and angular leaf spot are major diseases of common bean in the Americas and Africa. The cultivar Ouro Negro has the Ur-14 gene that confers broad spectrum resistance to rust and the gene cluster Co-3⁴/Phg-3 containing two tightly linked genes conferring resistance to anthracnose and angular leaf spot, respectively. We used co-segregation analysis and high-throughput genotyping of 179 F2:3 families from the Rudá (susceptible) × Ouro Negro (resistant) cross, phenotyped separately with races of the rust and anthracnose pathogens. The results confirmed that Ur-14 and the Co-3⁴/Phg-3 cluster in Ouro Negro conferred resistance to rust and anthracnose, respectively, and that Ur-14 and the Co-3⁴/Phg-3 cluster were closely linked. Genotyping the F2:3 families, first with 5398 SNPs on the Illumina BeadChip BARCBEAN6K_3 and then with 15 SSR and eight KASP markers specifically designed for the candidate region containing Ur-14 and Co-3⁴/Phg-3, permitted the creation of a high-resolution genetic linkage map which revealed that Ur-14 was positioned at 2.2 cM from Co-3⁴/Phg-3 on the short arm of chromosome Pv04 of the common bean genome. Five flanking SSR markers were tightly linked at 0.1 and 0.2 cM from Ur-14, and two flanking KASP markers were tightly linked at 0.1 and 0.3 cM from Co-3⁴/Phg-3. Many other SSR, SNP, and KASP markers were also linked to these genes. These markers will be useful for the development of common bean cultivars combining the important Ur-14 and Co-3⁴/Phg-3 genes conferring resistance to three of the most destructive diseases of common bean.
Xia, Zhiqiang; Zhang, Shengkui; Wen, Mingfu; Lu, Cheng; Sun, Yufang; Zou, Meiling; Wang, Wenquan
2018-01-01
As an important biofuel plant, the demand for higher yield Jatropha curcas L. is rapidly increasing. However, genetic analysis of Jatropha and molecular breeding for higher yield have been hampered by the limited number of molecular markers available. An ultrahigh-density linkage map for a Jatropha mapping population of 153 individuals was constructed and covered 1380.58 cM of the Jatropha genome, with average marker density of 0.403 cM. The genetic linkage map consisted of 3422 SNP and indel markers, which clustered into 11 linkage groups. With this map, 13 repeatable QTLs (reQTLs) for fruit yield traits were identified. Ten reQTLs, qNF-1, qNF-2a, qNF-2b, qNF-2c, qNF-3, qNF-4, qNF-6, qNF-7a, qNF-7b and qNF-8, that control the number of fruits (NF) mapped to LGs 1, 2, 3, 4, 6, 7 and 8, whereas three reQTLs, qTWF-1, qTWF-2 and qTWF-3, that control the total weight of fruits (TWF) mapped to LGs 1, 2 and 3, respectively. It is interesting that there are two candidate critical genes, which may regulate Jatropha fruit yield. We also identified three pleiotropic reQTL pairs associated with both the NF and TWF traits. This study is the first to report an ultrahigh-density Jatropha genetic linkage map construction, and the markers used in this study showed great potential for QTL mapping. Thirteen fruit-yield reQTLs and two important candidate genes were identified based on this linkage map. This genetic linkage map will be a useful tool for the localization of other economically important QTLs and candidate genes for Jatropha.
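The marker density quoted in the Jatropha abstract can be checked with simple arithmetic. The sketch below assumes that "average marker density" means total map length divided by total marker count; the abstract does not state the formula, and the function name is illustrative only.

```python
def average_marker_spacing(map_length_cm: float, n_markers: int) -> float:
    """Average spacing in cM, taken here as map length / marker count.

    Assumption: the abstract does not define its density formula; this is
    the simplest definition that reproduces the quoted figure.
    """
    if n_markers <= 0:
        raise ValueError("n_markers must be positive")
    return map_length_cm / n_markers


# Map length and marker count as quoted in the abstract.
spacing = average_marker_spacing(1380.58, 3422)
print(round(spacing, 3))
```

Under that assumption the result rounds to the 0.403 cM reported in the abstract.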
Knutson, Stacy T.; Westwood, Brian M.; Leuthaeuser, Janelle B.; Turner, Brandon E.; Nguyendac, Don; Shea, Gabrielle; Kumar, Kiran; Hayden, Julia D.; Harper, Angela F.; Brown, Shoshana D.; Morris, John H.; Ferrin, Thomas E.; Babbitt, Patricia C.
2017-01-01
Abstract Protein function identification remains a significant problem. Solving this problem at the molecular functional level would allow mechanistic determinant identification—amino acids that distinguish details between functional families within a superfamily. Active site profiling was developed to identify mechanistic determinants. DASP and DASP2 were developed as tools to search sequence databases using active site profiling. Here, TuLIP (Two‐Level Iterative clustering Process) is introduced as an iterative, divisive clustering process that utilizes active site profiling to separate structurally characterized superfamily members into functionally relevant clusters. Underlying TuLIP is the observation that functionally relevant families (curated by Structure‐Function Linkage Database, SFLD) self‐identify in DASP2 searches; clusters containing multiple functional families do not. Each TuLIP iteration produces candidate clusters, each evaluated to determine if it self‐identifies using DASP2. If so, it is deemed a functionally relevant group. Divisive clustering continues until each structure is either a functionally relevant group member or a singlet. TuLIP is validated on enolase and glutathione transferase structures, superfamilies well‐curated by SFLD. Correlation is strong; small numbers of structures prevent statistically significant analysis. TuLIP‐identified enolase clusters are used in DASP2 GenBank searches to identify sequences sharing functional site features. Analysis shows a true positive rate of 96%, false negative rate of 4%, and maximum false positive rate of 4%. F‐measure and performance analysis on the enolase search results and comparison to GEMMA and SCI‐PHY demonstrate that TuLIP avoids the over‐division problem of these methods. Mechanistic determinants for enolase families are evaluated and shown to correlate well with literature results. PMID:28054422
Knutson, Stacy T; Westwood, Brian M; Leuthaeuser, Janelle B; Turner, Brandon E; Nguyendac, Don; Shea, Gabrielle; Kumar, Kiran; Hayden, Julia D; Harper, Angela F; Brown, Shoshana D; Morris, John H; Ferrin, Thomas E; Babbitt, Patricia C; Fetrow, Jacquelyn S
2017-04-01
Protein function identification remains a significant problem. Solving this problem at the molecular functional level would allow mechanistic determinant identification-amino acids that distinguish details between functional families within a superfamily. Active site profiling was developed to identify mechanistic determinants. DASP and DASP2 were developed as tools to search sequence databases using active site profiling. Here, TuLIP (Two-Level Iterative clustering Process) is introduced as an iterative, divisive clustering process that utilizes active site profiling to separate structurally characterized superfamily members into functionally relevant clusters. Underlying TuLIP is the observation that functionally relevant families (curated by Structure-Function Linkage Database, SFLD) self-identify in DASP2 searches; clusters containing multiple functional families do not. Each TuLIP iteration produces candidate clusters, each evaluated to determine if it self-identifies using DASP2. If so, it is deemed a functionally relevant group. Divisive clustering continues until each structure is either a functionally relevant group member or a singlet. TuLIP is validated on enolase and glutathione transferase structures, superfamilies well-curated by SFLD. Correlation is strong; small numbers of structures prevent statistically significant analysis. TuLIP-identified enolase clusters are used in DASP2 GenBank searches to identify sequences sharing functional site features. Analysis shows a true positive rate of 96%, false negative rate of 4%, and maximum false positive rate of 4%. F-measure and performance analysis on the enolase search results and comparison to GEMMA and SCI-PHY demonstrate that TuLIP avoids the over-division problem of these methods. Mechanistic determinants for enolase families are evaluated and shown to correlate well with literature results. © 2017 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Chavanas, Stéphane; Garner, Chad; Bodemer, Christine; Ali, Mohsin; Teillac, Dominique Hamel-; Wilkinson, John; Bonafé, Jean-Louis; Paradisi, Mauro; Kelsell, David P.; Ansai, Shin-ichi; Mitsuhashi, Yoshihiko; Larrègue, Marc; Leigh, Irene M.; Harper, John I.; Taïeb, Alain; Prost, Yves de; Cardon, Lon R.; Hovnanian, Alain
2000-01-01
Netherton syndrome (NS [MIM 256500]) is a rare and severe autosomal recessive disorder characterized by congenital ichthyosis, a specific hair-shaft defect (trichorrhexis invaginata), and atopic manifestations. Infants with this syndrome often fail to thrive; life-threatening complications result in high postnatal mortality. We report the assignment of the NS gene to chromosome 5q32, by linkage analysis and homozygosity mapping in 20 families affected with NS. Significant evidence for linkage (maximum multipoint LOD score 10.11) between markers D5S2017 and D5S413 was obtained, with no evidence for locus heterogeneity. Analysis of critical recombinants mapped the NS locus between markers D5S463 and D5S2013, within an <3.5-cM genetic interval. The NS locus is telomeric to the cytokine gene cluster in 5q31. The five known genes encoding casein kinase Iα, the α subunit of retinal rod cGMP phosphodiesterase, the regulator of mitotic-spindle assembly, adrenergic receptor β2, and the diastrophic dysplasia sulfate–transporter gene, as well as the 38 expressed-sequence tags mapped within the critical region, are not obvious candidates. Our study is the first step toward the positional cloning of the NS gene. This finding promises a better understanding of the molecular mechanisms that control epidermal differentiation and immunity. PMID:10712206
A genomewide screen for late-onset Alzheimer disease in a genetically isolated Dutch population.
Liu, Fan; Arias-Vásquez, Alejandro; Sleegers, Kristel; Aulchenko, Yurii S; Kayser, Manfred; Sanchez-Juan, Pascual; Feng, Bing-Jian; Bertoli-Avella, Aida M; van Swieten, John; Axenovich, Tatiana I; Heutink, Peter; van Broeckhoven, Christine; Oostra, Ben A; van Duijn, Cornelia M
2007-07-01
Alzheimer disease (AD) is the most common cause of dementia. We conducted a genome screen of 103 patients with late-onset AD who were ascertained as part of the Genetic Research in Isolated Populations (GRIP) program that is conducted in a recently isolated population from the southwestern area of The Netherlands. All patients and their 170 closely related relatives were genotyped using 402 microsatellite markers. Extensive genealogy information was collected, which resulted in an extremely large and complex pedigree of 4,645 members. The pedigree was split into 35 subpedigrees, to reduce the computational burden of linkage analysis. Simulations aiming to evaluate the effect of pedigree splitting on false-positive probabilities showed that a LOD score of 3.64 corresponds to 5% genomewide type I error. Multipoint analysis revealed four significant and one suggestive linkage peaks. The strongest evidence of linkage was found for chromosome 1q21 (heterogeneity LOD [HLOD]=5.20 at marker D1S498). Approximately 30 cM upstream of this locus, we found another peak at 1q25 (HLOD=4.0 at marker D1S218). These two loci are in a previously established linkage region. We also confirmed the AD locus at 10q22-24 (HLOD=4.15 at marker D10S185). There was significant evidence of linkage of AD to chromosome 3q22-24 (HLOD=4.44 at marker D3S1569). For chromosome 11q24-25, there was suggestive evidence of linkage (HLOD=3.29 at marker D11S1320). We next tested for association between cognitive function and 4,173 single-nucleotide polymorphisms in the linked regions in an independent sample consisting of 197 individuals from the GRIP region. After adjusting for multiple testing, we were able to detect significant associations for cognitive function in four of five AD-linked regions, including the new region on chromosome 3q22-24 and regions 1q25, 10q22-24, and 11q25. 
With use of cognitive function as an endophenotype of AD, our study indicates that the RGSL2, RALGPS2, and C1orf49 genes are the potential disease-causing genes at 1q25. Our analysis of chromosome 10q22-24 points to the HTR7, MPHOSPH1, and CYP2C cluster. This is the first genomewide screen that showed significant linkage to chromosome 3q23 markers. For this region, our analysis identified the NMNAT3 and CLSTN2 genes. Our findings confirm linkage to chromosome 11q25. We were unable to confirm SORL1; instead, our analysis points to the OPCML and HNT genes.
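For readers unfamiliar with LOD scores, the following is a deliberately simplified sketch of a two-point LOD score for phase-known meioses: the base-10 log of the likelihood at recombination fraction theta over the likelihood at theta = 0.5 (no linkage). It is not the multipoint heterogeneity LOD (HLOD) analysis on complex pedigrees used in the study, and the counts in the example are invented for illustration.

```python
import math


def two_point_lod(theta: float, n_meioses: int, n_recombinants: int) -> float:
    """LOD = log10( L(theta) / L(0.5) ) for phase-known, fully informative meioses."""
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must be in (0, 0.5]")
    k, n = n_recombinants, n_meioses
    # Binomial likelihood; the binomial coefficient cancels in the ratio.
    log_l_theta = k * math.log10(theta) + (n - k) * math.log10(1.0 - theta)
    log_l_null = n * math.log10(0.5)
    return log_l_theta - log_l_null


def max_lod(n_meioses: int, n_recombinants: int, steps: int = 50):
    """Grid search over theta in (0, 0.5]; returns (best LOD, best theta)."""
    grid = [0.5 * i / steps for i in range(1, steps + 1)]
    return max((two_point_lod(t, n_meioses, n_recombinants), t) for t in grid)


# Hypothetical counts, not taken from the study.
best_lod, best_theta = max_lod(n_meioses=20, n_recombinants=2)
print(f"max LOD {best_lod:.2f} at theta = {best_theta:.2f}")
```

A LOD of 3 corresponds to odds of 1000:1 in favour of linkage, which is why thresholds in this range (3.64 in the study, after simulation) are used to call significance.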
Evidence of linkage of HDL level variation to APOC3 in two samples with different ascertainment.
Gagnon, France; Jarvik, Gail P; Motulsky, Arno G; Deeb, Samir S; Brunzell, John D; Wijsman, Ellen M
2003-11-01
The APOA1-C3-A4-A5 gene complex encodes genes whose products are implicated in the metabolism of HDL and/or triglycerides. Although the relationship between polymorphisms in this gene cluster and dyslipidemias was first reported more than 15 years ago, association and linkage results have remained inconclusive. This is due, in part, to the oligogenic and multivariate nature of dyslipidemic phenotypes. Therefore, we investigate evidence of linkage of APOC3 and HDL using two samples of dyslipidemic pedigrees: familial combined hyperlipidemia (FCHL) and isolated low-HDL (ILHDL). We used a strategy that deals with several difficulties inherent in the study of complex traits: by using a Bayesian Markov Chain Monte Carlo (MCMC) approach we allow for oligogenic trait models, as well as simultaneous incorporation of covariates, in the context of multipoint analysis. By using this approach on extended pedigrees we provide evidence of linkage of APOC3 and HDL level variation in two samples with different ascertainment. In addition to APOC3, we estimate that two to three genes, each with a substantial effect on total variance, are responsible for HDL variation in both data sets. We also provide evidence, using the FCHL data set, for a pleiotropic effect between HDL, HDL3 and triglycerides at the APOC3 locus.
Zhao, Yunlei; Wang, Hongmei; Chen, Wei; Li, Yunhai
2014-01-01
Understanding the population structure and linkage disequilibrium in an association panel can effectively avoid spurious associations and improve the accuracy in association mapping. In this study, one hundred and fifty eight elite cotton (Gossypium hirsutum L.) germplasm from all over the world, which were genotyped with 212 whole genome-wide marker loci and phenotyped with a disease nursery and greenhouse screening method, were assayed for population structure, linkage disequilibrium, and association mapping of Verticillium wilt resistance. A total of 480 alleles ranging from 2 to 4 per locus were identified from all collections. Model-based analysis identified two groups (G1 and G2) and seven subgroups (G1a–c, G2a–d), and differentiation analysis showed that subgroups having a single origin or pedigree were apt to differentiate from those having a mixed origin. Only 8.12% of linked marker pairs showed significant LD (P<0.001) in this association panel. The LD level for linked markers was significantly higher than that for unlinked markers, suggesting that physical linkage strongly influences LD in this panel, and the LD level was elevated when the panel was classified into groups and subgroups. The LD decay analysis for several chromosomes showed that different chromosomes showed a notable change in LD decay distances for the same gene pool. Based on the disease nursery and greenhouse environment, 42 marker loci associated with Verticillium wilt resistance were identified through association mapping, which were widely distributed among 15 chromosomes. Among these, 10 marker loci were found to be consistent with previously identified QTLs and 32 were new unreported marker loci, and QTL clusters for Verticillium wilt resistance on Chr.16 were also confirmed in our study, which was consistent with the strong linkage in this chromosome. 
Our results would contribute to association mapping and supply the marker candidates for marker-assisted selection of Verticillium wilt resistance in cotton. PMID:24466016
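As a rough illustration of what pairwise linkage disequilibrium measures, the sketch below computes the common r² statistic between two biallelic loci from phased haplotypes. This is a simplification: the study used multi-allelic markers in a tetraploid crop and does not state which LD statistic was computed, so the 0/1 allele coding and the example data are assumptions made for illustration.

```python
def ld_r_squared(haplotypes):
    """r^2 between two biallelic loci.

    haplotypes: list of (a, b) tuples, one per phased haplotype,
    with alleles coded 0 or 1 at locus A and locus B.
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")
    p_a = sum(a for a, _ in haplotypes) / n          # frequency of allele 1 at A
    p_b = sum(b for _, b in haplotypes) / n          # frequency of allele 1 at B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b                             # disequilibrium coefficient D
    return d * d / denom


# Invented data: perfectly associated loci versus independent loci.
linked = [(0, 0), (0, 0), (1, 1), (1, 1)]
unlinked = [(0, 0), (0, 1), (1, 0), (1, 1)]
print(ld_r_squared(linked), ld_r_squared(unlinked))
```

Values near 1 indicate alleles that almost always travel together; values near 0 indicate independence, which is what most marker pairs in the panel showed.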
Quality Evaluation of Agricultural Distillates Using an Electronic Nose
Dymerski, Tomasz; Gębicki, Jacek; Wardencki, Waldemar; Namieśnik, Jacek
2013-01-01
The paper presents the application of an electronic nose instrument to fast evaluation of agricultural distillates differing in quality. The investigations were carried out using a prototype of electronic nose equipped with a set of six semiconductor sensors by FIGARO Co., an electronic circuit converting signal into digital form and a set of thermostats able to provide gradient temperature characteristics to a gas mixture. A volatile fraction of the agricultural distillate samples differing in quality was obtained by barbotage. Interpretation of the results involved three data analysis techniques: principal component analysis, single-linkage cluster analysis and cluster analysis with spheres method. The investigations prove the usefulness of the presented technique in the quality control of agricultural distillates. Optimum measurements conditions were also defined, including volumetric flow rate of carrier gas (15 L/h), thermostat temperature during the barbotage process (15 °C) and time of sensor signal acquisition from the onset of the barbotage process (60 s). PMID:24287525
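Single-linkage cluster analysis, one of the techniques named in the electronic-nose abstract, can be sketched in a few lines. Cutting a single-linkage hierarchy at a distance threshold is equivalent to finding connected components of the graph that joins every pair of points no farther apart than that threshold. The points and threshold below are invented and are not sensor data from the paper.

```python
import math
from itertools import combinations


def single_linkage_clusters(points, threshold):
    """Group point indices whose chain of nearest links never exceeds threshold."""
    parent = list(range(len(points)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(points)), 2):
        if math.dist(points[i], points[j]) <= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(points)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical two-dimensional feature vectors.
readings = [(0.0, 0.0), (0.5, 0.0), (1.0, 0.0), (5.0, 5.0), (5.4, 5.0)]
print(single_linkage_clusters(readings, threshold=0.6))
```

Points 0 and 2 end up in the same group although they are farther apart than the threshold, because point 1 links them. This chaining behaviour is characteristic of single linkage.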
Hsueh, Wen-Chi; He, Qimei; Willcox, D. Craig; Nievergelt, Caroline M.; Donlon, Timothy A.; Kwok, Pui-Yan; Suzuki, Makoto; Willcox, Bradley J.
2014-01-01
Isolated populations have advantages for genetic studies of longevity from decreased haplotype diversity and long-range linkage disequilibrium. This permits smaller sample sizes without loss of power, among other utilities. Little is known about the genome of the Okinawans, a potential population isolate, recognized for longevity. Therefore, we assessed genetic diversity, structure, and admixture in Okinawans, and compared this with Caucasians, Chinese, Japanese, and Africans from HapMap II, genotyped on the same Affymetrix GeneChip Human Mapping 500K array. Principal component analysis, haplotype coverage, and linkage disequilibrium decay revealed a distinct Okinawan genome—more homogeneity, less haplotype diversity, and longer range linkage disequilibrium. Population structure and admixture analyses utilizing 52 global reference populations from the Human Genome Diversity Cell Line Panel demonstrated that Okinawans clustered almost exclusively with East Asians. Sibling relative risk (λs) analysis revealed that siblings of Okinawan centenarians have 3.11 times (females) and 3.77 times (males) more likelihood of centenarianism. These findings suggest that Okinawans are genetically distinct and share several characteristics of a population isolate, which are prone to develop extreme phenotypes (eg, longevity) from genetic drift, natural selection, and population bottlenecks. These data support further exploration of genetic influence on longevity in the Okinawans. PMID:24444611
Genomic characterization of putative allergen genes in peach/almond and their synteny with apple
Chen, Lin; Zhang, Shuiming; Illa, Eudald; Song, Lijuan; Wu, Shandong; Howad, Werner; Arús, Pere; Weg, Eric van de; Chen, Kunsong; Gao, Zhongshan
2008-01-01
Background: Fruits from several species of the Rosaceae family are reported to cause allergic reactions in certain populations. The allergens identified belong to mainly four protein families: pathogenesis related 10 proteins, thaumatin-like proteins, lipid transfer proteins and profilins. These families of putative allergen genes in apple (Mal d 1 to 4) have been mapped on linkage maps and subsequent genetic study on allelic diversity and hypoallergenic traits has been carried out recently. In peach (Prunus persica), these allergen gene families are denoted as Pru p 1 to 4 and for almond (Prunus dulcis) Pru du 1 to 4. Genetic analysis using current molecular tools may be helpful to establish the cause of allergenicity differences observed among different peach cultivars. This study was to characterize putative peach allergen genes for their genomic sequences and linkage map positions, and to compare them with previously characterized homologous genes in apple (Malus domestica). Results: Eight Pru p/du 1 genes were identified, four of which were new. All the Pru p/du 1 genes were mapped in a single bin on the top of linkage group 1 (G1). Five Pru p/du 2 genes were mapped on four different linkage groups, two very similar Pru p/du 2.01 genes (A and B) were on G3, Pru p/du 2.02 on G7, Pru p/du 2.03 on G8 and Pru p/du 2.04 on G1. There were differences in the intron and exon structure in these Pru p/du 2 genes and in their amino acid composition. Three Pru p/du 3 genes (3.01–3.03) containing an intron and a mini exon of 10 nt were mapped in a cluster on G6. Two Pru p/du 4 genes (Pru p/du 4.01 and 4.02) were located on G1 and G7, respectively. The Pru p/du 1 cluster on G1 aligned to the Mal d 1 clusters on LG16; Pru p/du 2.01A and B on G3 to Mal d 2.01A and B on LG9; the Pru p/du 3 cluster on G6 to Mal d 3.01 on LG12; Pru p/du 4.01 on G1 to Mal d 4.03 on LG2; and Pru p/du 4.02 on G7 to Mal d 4.02 on LG2. 
Conclusion: A total of 18 putative peach/almond allergen genes have been mapped on five linkage groups. Their positions confirm the high macro-synteny between peach/almond and apple. The insight gained will help to identify key genes causing differences in allergenicity among different cultivars of peach and other Prunus species. PMID:19014629
Machine-learned cluster identification in high-dimensional data.
Ultsch, Alfred; Lötsch, Jörn
2017-02-01
High-dimensional biomedical data are frequently clustered to identify subgroup structures pointing at distinct disease subtypes. It is crucial that the used cluster algorithm works correctly. However, by imposing a predefined shape on the clusters, classical algorithms occasionally suggest a cluster structure in homogenously distributed data or assign data points to incorrect clusters. We analyzed whether this can be avoided by using emergent self-organizing feature maps (ESOM). Data sets with different degrees of complexity were submitted to ESOM analysis with large numbers of neurons, using an interactive R-based bioinformatics tool. On top of the trained ESOM the distance structure in the high dimensional feature space was visualized in the form of a so-called U-matrix. Clustering results were compared with those provided by classical common cluster algorithms including single linkage, Ward and k-means. Ward clustering imposed cluster structures on cluster-less "golf ball", "cuboid" and "S-shaped" data sets that contained no structure at all (random data). Ward clustering also imposed structures on permuted real world data sets. By contrast, the ESOM/U-matrix approach correctly found that these data contain no cluster structure. However, ESOM/U-matrix was correct in identifying clusters in biomedical data truly containing subgroups. It was always correct in cluster structure identification in further canonical artificial data. Using intentionally simple data sets, it is shown that popular clustering algorithms typically used for biomedical data sets may fail to cluster data correctly, suggesting that they are also likely to perform erroneously on high dimensional biomedical data. The present analyses emphasized that generally established classical hierarchical clustering algorithms carry a considerable tendency to produce erroneous results. 
By contrast, unsupervised machine-learned analysis of cluster structures, applied using the ESOM/U-matrix method, is a viable, unbiased method to identify true clusters in the high-dimensional space of complex data. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Linkage of the Nit1C gene cluster to bacterial cyanide assimilation as a nitrogen source.
Jones, Lauren B; Ghosh, Pallab; Lee, Jung-Hyun; Chou, Chia-Ni; Kunz, Daniel A
2018-05-21
A genetic linkage between a conserved gene cluster (Nit1C) and the ability of bacteria to utilize cyanide as the sole nitrogen source was demonstrated for nine different bacterial species. These included three strains whose cyanide nutritional ability has formerly been documented (Pseudomonas fluorescens Pf11764, Pseudomonas putida BCN3 and Klebsiella pneumoniae BCN33), and six not previously known to have this ability [Burkholderia (Paraburkholderia) xenovorans LB400, Paraburkholderia phymatum STM815, Paraburkholderia phytofirmans PsJN, Cupriavidus (Ralstonia) eutropha H16, Gluconoacetobacter diazotrophicus PA1 5 and Methylobacterium extorquens AM1]. For all bacteria, growth on or exposure to cyanide led to the induction of the canonical nitrilase (NitC) linked to the gene cluster, and in the case of Pf11764 in particular, transcript levels of cluster genes (nitBCDEFGH) were raised, and a nitC knock-out mutant failed to grow. Further studies demonstrated that the highly conserved nitB gene product was also significantly elevated. Collectively, these findings provide strong evidence for a genetic linkage between Nit1C and bacterial growth on cyanide, supporting use of the term cyanotrophy in describing what may represent a new nutritional paradigm in microbiology. A broader search of Nit1C genes in presently available genomes revealed its presence in 270 different bacteria, all contained within the domain Bacteria, including Gram-positive Firmicutes and Actinobacteria, and Gram-negative Proteobacteria and Cyanobacteria. Absence of the cluster in the Archaea is congruent with events that may have led to the inception of Nit1C occurring coincidentally with the first appearance of cyanogenic species on Earth, dating back 400-500 million years.
β-globin gene cluster haplotypes in ethnic minority populations of southwest China
Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing
2017-01-01
The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, haplotype 9 frequency was significantly high in the southwest populations, indicating their difference with other Chinese. The interpopulation differentiation of southwest Chinese minority populations is less than those in populations of northern China and other continents. Phylogenetic analysis shows that populations sharing same ethnic origin or language clustered to each other, indicating current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625
Boxman, Ingeborg L A; Verhoef, Linda; Vennema, Harry; Ngui, Siew-Lin; Friesema, Ingrid H M; Whiteside, Chris; Lees, David; Koopmans, Marion
2016-01-01
This report describes an outbreak investigation starting with two closely related suspected food-borne clusters of Dutch hepatitis A cases, nine primary cases in total, with an unknown source in the Netherlands. The hepatitis A virus (HAV) genotype IA sequences of both clusters were highly similar (459/460 nt) and were not reported earlier. Food questionnaires and a case-control study revealed an association with consumption of mussels. Analysis of mussel supply chains identified the most likely production area. International enquiries led to identification of a cluster of patients near this production area with identical HAV sequences with onsets predating the first Dutch cluster of cases. The most likely source for this cluster was a case who returned from an endemic area in Central America, and a subsequent household cluster from which treated domestic sewage was discharged into the suspected mussel production area. Notably, mussels from this area were also consumed by a separate case in the United Kingdom sharing an identical strain with the second Dutch cluster. In conclusion, a small number of patients in a non-endemic area led to geographically dispersed hepatitis A outbreaks with food as vehicle. This link would have gone unnoticed without sequence analyses and international collaboration.
[A network to promote health systems based on primary health care in the Region of the Americas].
Herrera Vázquez, María Magdalena; Rodríguez Avila, Nuria; Nebot Adell, Carme; Montenegro, Hernán
2007-05-01
To identify the relational components of an international network of organizations that provide technical and financial assistance to promote the development of health systems based on primary health care in the countries of the Region of the Americas; to analyze the linkages that would allow the collaborating partners of the Pan American Health Organization (PAHO) to work together on health issues; and to determine the basic theoretical elements that can help to develop action strategies that support advocacy efforts by a network. This was a qualitative and quantitative cross-sectional study based on identifying key informants and on analyzing social networks. Ethnographic and relational information from 46 international organizations was collected through a self-administered semistructured questionnaire. From 46 international health cooperation organizations, 29 decision makers from 29 organizations participated (63.0% response rate). The structure and the strength of the network were evaluated in terms of density, closeness, clustering, and centralization. The statistical analysis was done using computer programs that included UCINET, Pajek, and Microsoft Access. We found a structurally centralized theoretical network, whose nodes were clustered into four central subgroups linked by a shared vision. The leadership, influence, and political interests reflected the formal and technical-cooperation linkages, the formal support for health systems based on primary health care, and the flow of resources being more often technical ones than financial ones. The interorganizational relational components and the social-action ties that were identified could help in the development and consolidation of a thematic network for advocacy and for the management of technical and financial assistance that supports primary health care in the Americas. 
The linkages for joint action that were identified could advance international cooperation in developing health systems based on primary health care, once PAHO formulates clear implementation strategies and takes a leadership position in mobilizing financial resources and in creating informal and interpersonal linkages for action.
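Two of the network measures named in the PAHO abstract, density and centralization, have simple textbook definitions for an undirected graph without self-loops. The sketch below uses those definitions (density as the share of possible ties that are present, and Freeman's degree centralization); it does not reproduce UCINET or Pajek output, and the star-shaped example network is invented rather than taken from the study.

```python
def density(n_nodes, edges):
    """Share of possible undirected ties that are present."""
    return 2 * len(edges) / (n_nodes * (n_nodes - 1))


def degree_centralization(n_nodes, edges):
    """Freeman degree centralization: 0 for an even network, 1 for a star."""
    if n_nodes < 3:
        raise ValueError("need at least three nodes")
    degree = [0] * n_nodes
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    max_deg = max(degree)
    # Normalized by the maximum possible sum of differences, reached by a star.
    return sum(max_deg - d for d in degree) / ((n_nodes - 1) * (n_nodes - 2))


# Hypothetical network: organization 0 is tied to four others, which are
# not tied to each other.
star = [(0, 1), (0, 2), (0, 3), (0, 4)]
print(density(5, star), degree_centralization(5, star))
```

A network can be sparse (low density) yet highly centralized, which is the pattern a "structurally centralized" network describes.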
QTL analysis of frost damage in pea suggests different mechanisms involved in frost tolerance.
Klein, Anthony; Houtin, Hervé; Rond, Céline; Marget, Pascal; Jacquin, Françoise; Boucherot, Karen; Huart, Myriam; Rivière, Nathalie; Boutet, Gilles; Lejeune-Hénaut, Isabelle; Burstin, Judith
2014-06-01
Avoidance mechanisms and intrinsic resistance are complementary strategies to improve winter frost tolerance and yield potential in field pea. The development of the winter pea crop represents a major challenge to expand plant protein production in temperate areas. Breeding winter cultivars requires the combination of freezing tolerance as well as high seed productivity and quality. In this context, we investigated the genetic determinism of winter frost tolerance and assessed its genetic relationship with yield and developmental traits. Using a newly identified source of frost resistance, we developed a population of recombinant inbred lines and evaluated it in six environments in Dijon and Clermont-Ferrand between 2005 and 2010. We developed a genetic map comprising 679 markers distributed over seven linkage groups and covering 947.1 cM. One hundred sixty-one quantitative trait loci (QTL) explaining 9-71 % of the phenotypic variation were detected across the six environments for all traits measured. Two clusters of QTL mapped on linkage group III (LGIII) and one cluster on LGVI reveal the genetic links between phenology, morphology, yield-related traits and frost tolerance in winter pea. QTL clusters on LGIII highlighted major developmental gene loci (Hr and Le) and the QTL cluster on LGVI explained up to 71 % of the winter frost damage variation. This suggests that a specific architecture and flowering ideotype defines frost tolerance in winter pea. However, two consistent frost tolerance QTL on LGV were independent of phenology and morphology traits, showing that different protective mechanisms are involved in frost tolerance. Finally, these results suggest that frost tolerance can be bred independently of seed productivity and quality.
Application of a Taxonomical Structure for Classifying Goods Procured by the Federal Government
1991-12-01
between all pairs of objects. Also called a "tree" or "phenogram". UPGMA Clustering Method: (Unweighted pair-group method using weighted averages...clustering arrangement, specifically, the unweighted pair-group method using arithmetic averages (UPGMA) (more commonly known as the average linkage method
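As an illustration of the average linkage (UPGMA) method named in this excerpt, the following minimal Python sketch repeatedly merges the two closest clusters and recomputes distances as size-weighted averages. The function name `upgma`, the labels, and the toy distance matrix are assumptions made for this example and do not come from the report; merge heights are reported here as the average inter-cluster distance.

```python
def upgma(dist, labels):
    """Average linkage (UPGMA) on a symmetric distance matrix.

    Returns a list of merges as (members_a, members_b, height), where
    height is the average distance between the two merged clusters.
    """
    n = len(labels)
    clusters = {i: (labels[i],) for i in range(n)}
    # Distances between current clusters, keyed by (smaller_id, larger_id).
    d = {(i, j): dist[i][j] for i in range(n) for j in range(i + 1, n)}
    next_id = n
    merges = []
    while len(clusters) > 1:
        (a, b), height = min(d.items(), key=lambda kv: kv[1])
        size_a, size_b = len(clusters[a]), len(clusters[b])
        new = next_id
        next_id += 1
        for c in clusters:
            if c in (a, b):
                continue
            dac = d[tuple(sorted((a, c)))]
            dbc = d[tuple(sorted((b, c)))]
            # Size-weighted mean equals the mean over all member pairs.
            d[(c, new)] = (size_a * dac + size_b * dbc) / (size_a + size_b)
        # Drop every distance that involves one of the merged clusters.
        d = {k: v for k, v in d.items() if a not in k and b not in k}
        merges.append((clusters[a], clusters[b], height))
        clusters[new] = clusters.pop(a) + clusters.pop(b)
    return merges


# Toy example (made-up distances): A and B are close, C is far from both.
toy_dist = [[0, 2, 6],
            [2, 0, 8],
            [6, 8, 0]]
merges = upgma(toy_dist, ["A", "B", "C"])
for left, right, height in merges:
    print(left, right, height)
```

In this toy case A and B merge first at distance 2, and C then joins at the average of its distances to A and B, (6 + 8) / 2 = 7.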
Zeng, Rui; Smith, Erin; Barrientos, Antoni
2018-03-06
Mitoribosomes are specialized for the synthesis of hydrophobic membrane proteins encoded by mtDNA, all essential for oxidative phosphorylation. Despite their linkage to human mitochondrial diseases and the recent cryoelectron microscopy reconstruction of yeast and mammalian mitoribosomes, how they are assembled remains obscure. Here, we dissected the yeast mitoribosome large subunit (mtLSU) assembly process by systematic genomic deletion of 44 mtLSU proteins (MRPs). Analysis of the strain collection unveiled 37 proteins essential for functional mtLSU assembly, three of which are critical for mtLSU 21S rRNA stability. Hierarchical cluster analysis of mtLSU subassemblies accumulated in mutant strains revealed co-operative assembly of protein sets forming structural clusters and preassembled modules. It also indicated crucial roles for mitochondrion-specific membrane-binding MRPs in anchoring newly transcribed 21S rRNA to the inner membrane, where assembly proceeds. Our results define the yeast mtLSU assembly landscape in vivo and provide a foundation for studies of mitoribosome assembly across evolution. Copyright © 2018 Elsevier Inc. All rights reserved.
Savary, Serge; Delbac, Lionel; Rochas, Amélie; Taisant, Guillaume; Willocquet, Laetitia
2009-08-01
Dual epidemics are defined as epidemics developing on two or several plant organs in the course of a cropping season. Agricultural pathosystems where such epidemics develop are often very important, because the harvestable part is one of the organs affected. These epidemics also are often difficult to manage, because the linkage between epidemiological components occurring on different organs is poorly understood, and because prediction of the risk toward the harvestable organs is difficult. In the case of downy mildew (DM) and powdery mildew (PM) of grapevine, nonlinear modeling and logistic regression indicated nonlinearity in the foliage-cluster relationships. Nonlinear modeling enabled the parameterization of a transmission coefficient that numerically links the two components, leaves and clusters, in DM and PM epidemics. Logistic regression analysis yielded a series of probabilistic models that enabled predicting preset levels of cluster infection risks based on DM and PM severities on the foliage at successive crop stages. The usefulness of this framework for tactical decision-making for disease control is discussed.
Correlation and network analysis of global financial indices
NASA Astrophysics Data System (ADS)
Kumar, Sunil; Deo, Nivedita
2012-08-01
Random matrix theory (RMT) and network methods are applied to investigate the correlation and network properties of 20 financial indices. The results are compared before and during the financial crisis of 2008. In the RMT method, the components of eigenvectors corresponding to the second largest eigenvalue form two clusters of indices in the positive and negative directions. The components of these two clusters switch in opposite directions during the crisis. The network analysis uses the Fruchterman-Reingold layout to find clusters in the network of indices at different thresholds. At a threshold of 0.6, before the crisis, financial indices corresponding to the Americas, Europe, and Asia-Pacific form separate clusters. On the other hand, during the crisis at the same threshold, the American and European indices combine together to form a strongly linked cluster while the Asia-Pacific indices form a separate weakly linked cluster. If the value of the threshold is further increased to 0.9 then the European indices (France, Germany, and the United Kingdom) are found to be the most tightly linked indices. The structure of the minimum spanning tree of financial indices is more starlike before the crisis and it changes to become more chainlike during the crisis. The average linkage hierarchical clustering algorithm is used to find a clearer cluster structure in the network of financial indices. The cophenetic correlation coefficients are calculated and found to increase significantly, which indicates that the hierarchy increases during the financial crisis. These results show that there is substantial change in the structure of the organization of financial indices during a financial crisis.
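The threshold step described in this abstract can be sketched in Python: link two indices when their correlation reaches the threshold, then read off the connected groups. This sketch uses plain connected components rather than the Fruchterman-Reingold layout the authors used for visual inspection, and the function name `threshold_clusters`, the index labels, and the correlation values are all made up for illustration.

```python
def threshold_clusters(names, corr, threshold):
    """Connected components of the graph linking i and j when corr[i][j] >= threshold."""
    n = len(names)
    seen = set()
    clusters = []
    for start in range(n):
        if start in seen:
            continue
        stack = [start]
        seen.add(start)
        component = []
        while stack:
            i = stack.pop()
            component.append(names[i])
            for j in range(n):
                if j not in seen and corr[i][j] >= threshold:
                    seen.add(j)
                    stack.append(j)
        clusters.append(sorted(component))
    return clusters


# Hypothetical indices: two "American" (A1, A2), two "European" (E1, E2),
# one "Asia-Pacific" (P1). The correlations are toy values, not data.
names = ["A1", "A2", "E1", "E2", "P1"]
corr = [
    [1.00, 0.80, 0.50, 0.50, 0.20],
    [0.80, 1.00, 0.40, 0.45, 0.10],
    [0.50, 0.40, 1.00, 0.92, 0.30],
    [0.50, 0.45, 0.92, 1.00, 0.30],
    [0.20, 0.10, 0.30, 0.30, 1.00],
]
high = threshold_clusters(names, corr, 0.6)
low = threshold_clusters(names, corr, 0.45)
print(high)
print(low)
```

With these toy numbers the regional groups stay separate at 0.6, while lowering the threshold to 0.45 joins the first two groups into one cluster and leaves P1 on its own.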
A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L.) genome
2010-01-01
Background The construction of genetic linkage maps for cultivated peanut (Arachis hypogaea L.) has been and continues to be an important research goal to facilitate quantitative trait locus (QTL) analysis and gene tagging for use in marker-assisted selection in breeding. Even though a few maps have been developed, they were constructed using diploid or interspecific tetraploid populations. The most recently published intra-specific map was constructed from a cross of cultivated peanuts, in which only 135 simple sequence repeat (SSR) markers were sparsely distributed over 22 linkage groups. A more detailed linkage map with sufficient markers is necessary for QTL identification and marker-assisted selection. The objective of this study was to construct a genetic linkage map of cultivated peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Results Three recombinant inbred line (RIL) populations were constructed from three crosses with one common female parental line, Yueyou 13, a high yielding Spanish market type. The four parents were screened with 1044 primer pairs designed to amplify SSRs and 901 primer pairs produced clear PCR products. Of the 901 primer pairs, 146, 124 and 64 primer pairs (markers) were polymorphic in these populations, respectively, and were used in genotyping these RIL populations. Individual linkage maps were constructed from each of the three populations and a composite map based on 93 common loci was created using JoinMap. The composite linkage map consists of 22 composite linkage groups (LG) with 175 SSR markers (including 47 SSRs on the published AA genome maps), representing the 20 chromosomes of A. hypogaea. The total composite map length is 885.4 cM, with an average marker density of 5.8 cM. Segregation distortion in the 3 populations was 23.0%, 13.5% and 7.8% of the markers, respectively. These distorted loci tended to cluster on LG1, LG3, LG4 and LG5. Only 15 EST-SSR markers were mapped, due to low polymorphism. Comparison showed potential synteny, collinear order of some markers and conservation of collinear linkage groups among the maps and with the AA genome, although conservation was not complete. Conclusion A composite linkage map was constructed from three individual mapping populations with 175 SSR markers in 22 composite linkage groups. This composite genetic linkage map is among the first "true" tetraploid peanut maps produced. This map also includes 47 SSRs that have been used in the published AA genome maps, and could be used in comparative mapping studies. The primers described in this study are PCR-based markers, which are easy to share for genetic mapping in peanuts. All 1044 primer pairs are provided as additional files and the three RIL populations will be made available to the public upon request for quantitative trait loci (QTL) analysis and linkage map improvement. PMID:20105299
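The reported average marker density can be checked with a short calculation. The sketch below is one possible reading, not something the abstract states: it assumes density is expressed per interval between adjacent markers, so that each linkage group with m markers contributes m - 1 intervals. The variable names are illustrative.

```python
total_length_cm = 885.4   # total composite map length reported in the abstract
markers = 175             # SSR markers on the composite map
linkage_groups = 22       # composite linkage groups

# Length divided by the number of markers.
per_marker = total_length_cm / markers

# Length divided by the number of marker intervals (assumption: one fewer
# interval than markers within each linkage group).
per_interval = total_length_cm / (markers - linkage_groups)

print(round(per_marker, 1), round(per_interval, 1))
```

Under this assumption the per-interval figure rounds to the 5.8 cM given in the abstract, whereas dividing by the marker count alone gives about 5.1 cM.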
Larraya, Luis M.; Idareta, Eneko; Arana, Dani; Ritter, Enrique; Pisabarro, Antonio G.; Ramírez, Lucia
2002-01-01
Mycelium growth rate is a quantitative characteristic that exhibits continuous variation. This trait has applied interest, as growth rate is correlated with production yield and increased advantage against competitors. In this work, we studied growth rate variation in the edible basidiomycete Pleurotus ostreatus growing as monokaryotic or dikaryotic mycelium on Eger medium or on wheat straw. Our analysis resulted in identification of several genomic regions (quantitative trait loci [QTLs]) involved in the control of growth rate that can be mapped on the genetic linkage map of this fungus. In some cases monokaryotic and dikaryotic QTLs clustered at the same map position, indicating that there are principal genomic areas responsible for growth rate control. The availability of this linkage map of growth rate QTLs can help in the design of rational strain breeding programs based on genomic information. PMID:11872457
Computing the shape of brain networks using graph filtration and Gromov-Hausdorff metric.
Lee, Hyekyoung; Chung, Moo K; Kang, Hyejin; Kim, Boong-Nyun; Lee, Dong Soo
2011-01-01
The difference between networks has been often assessed by the difference of global topological measures such as the clustering coefficient, degree distribution and modularity. In this paper, we introduce a new framework for measuring the network difference using the Gromov-Hausdorff (GH) distance, which is often used in shape analysis. In order to apply the GH distance, we define the shape of the brain network by piecing together the patches of locally connected nearest neighbors using the graph filtration. The shape of the network is then transformed to an algebraic form called the single linkage matrix. The single linkage matrix is subsequently used in measuring network differences using the GH distance. As an illustration, we apply the proposed framework to compare the FDG-PET based functional brain networks out of 24 attention deficit hyperactivity disorder (ADHD) children, 26 autism spectrum disorder (ASD) children and 11 pediatric control subjects.
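A small Python sketch may help make the single linkage matrix concrete. It assumes the usual definition of single linkage distance between two nodes: the smallest value, over all connecting paths, of the largest edge weight on the path. The function name `single_linkage_matrix` and the toy weights are illustrative and are not taken from the paper.

```python
def single_linkage_matrix(dist):
    """Single linkage (minimax path) distances from a symmetric distance matrix.

    Uses a Floyd-Warshall style update in which the cost of a path is its
    largest edge and the best path is the one with the smallest such cost.
    """
    n = len(dist)
    s = [row[:] for row in dist]  # work on a copy
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via_k = max(s[i][k], s[k][j])
                if via_k < s[i][j]:
                    s[i][j] = via_k
    return s


# Toy network: nodes 0 and 2 are far apart directly (4) but are joined
# through node 1 by edges of weight 1 and 2.
toy = [[0, 1, 4],
       [1, 0, 2],
       [4, 2, 0]]
slm = single_linkage_matrix(toy)
print(slm)
```

Here the entry for nodes 0 and 2 drops from 4 to 2, the height at which the two nodes would join in a single linkage dendrogram.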
He, Shui-Lian; Yang, Yang; Morrell, Peter L; Yi, Ting-Shuang
2015-01-01
Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains; it was cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to its small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less.
Gerald, W. L.; Karam, J. D.
1984-01-01
The results of this study bear on the relationship between genetic linkage and control of interactions between the protein products of different cistrons. In T4 bacteriophage, genes 45 and 44 encode essential components of the phage DNA replication multiprotein complex. T4 gene 45 maps directly upstream of gene 44 relative to the overall direction of reading of this region of the phage chromosome, but it is not known whether these two genes are cotranscribed. It has been shown that a nonsense lesion of T4 gene 45 exerts a cis-dominant inhibitory effect on growth of a missense mutant of gene 44 but not on growth of phage carrying the wild-type gene 44 allele. In previous work, we confirmed these observations on polarity of the gene 45 mutation but detected no polar effects by this lesion on synthesis of either mutant or wild-type gene 44 protein. In the present study, we demonstrate that mRNA for gene 44 protein is separable by gel electrophoresis from gene 45-protein-encoding mRNA. That is, the two proteins are not synthesized from one polycistronic message, and the cis-dominant inhibitory effect of the gene 45 mutation on gene 44 function is probably expressed at a posttranslational stage. We propose that close genetic linkage, whether or not it provides shared transcriptional and translational regulatory signals for certain clusters of functionally related cistrons, may determine the intracellular compartmentalization for synthesis of proteins encoded by these clusters. In prokaryotes, such linkage-dependent compartmentation may minimize the diffusion distances between gene products that are synthesized at low levels and are destined to interact. PMID:6745641
Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing
2014-01-01
Background Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families. Results Approximately 6,000 SNPs were assigned to 29 linkage groups, utilising markers from known genomic locations as anchors. Linkage maps were then constructed for the four mapping parents separately. Overall map lengths were comparable between male and female parents, but the distribution of the SNPs showed sex-specific patterns with a greater degree of clustering of sire-segregating SNPs to single chromosome regions. The maps were integrated with the Atlantic salmon draft reference genome contigs, allowing the unique assignment of ~4,000 contigs to a linkage group. 112 genome contigs mapped to two or more linkage groups, highlighting regions of putative homeology within the salmon genome. A comparative genomics analysis with the stickleback reference genome identified putative genes closely linked to approximately half of the ordered SNPs and demonstrated blocks of orthology between the Atlantic salmon and stickleback genomes. A subset of 47 RAD-Seq SNPs were successfully validated using a high-throughput genotyping assay, with a correspondence of 97% between the two assays. Conclusions This Atlantic salmon RAD-Seq linkage map is a resource for salmonid genomics research as genotyping-by-sequencing becomes increasingly common. This is aided by the integration of the SbfI RAD-Seq SNPs with existing reference maps and the draft reference genome, as well as the identification of putative genes proximal to the SNPs. Differences in the distribution of recombination events between the sexes are evident, and regions of homeology have been identified which are reflective of the recent salmonid whole genome duplication. PMID:24571138
Kumar, Arvind; Rai, Lal Chand
2017-07-01
Soil quality is an important factor and is maintained by the microorganisms that inhabit the soil. Soil physicochemical characteristics determine the indigenous microbial population, and rice provides food security to a major part of the world's population. Therefore, this study aimed to assess the impact of physicochemical variables on bacterial community composition and diversity in conventional paddy fields, which could reflect a real picture of the bacterial communities operating in the paddy agro-ecosystem. To this end, soil physicochemical characterization and bacterial community composition and diversity analysis were carried out using a culture-independent PCR-DGGE method on twenty soils distributed across eight districts. Bacterial communities were grouped into three clusters based on UPGMA cluster analysis of DGGE banding patterns. The linkage of measured physicochemical variables with bacterial community composition was analyzed by canonical correspondence analysis (CCA). CCA ordination biplot results were similar to those of the UPGMA cluster analysis. High levels of species-environment correlations (0.989 and 0.959) were observed and the largest proportion of species data variability was explained by total organic carbon (TOC), available nitrogen, total nitrogen and pH. Thus, the results suggest that TOC and nitrogen are key regulators of bacterial community composition in conventional paddy fields. Further, high diversity indices and evenness values demonstrated heterogeneity and co-abundance of the bacterial communities.
Simultaneous Production of Anabaenopeptins and Namalides by the Cyanobacterium Nostoc sp. CENA543.
Shishido, Tânia K; Jokela, Jouni; Fewer, David P; Wahlsten, Matti; Fiore, Marli F; Sivonen, Kaarina
2017-11-17
Anabaenopeptins are a diverse group of cyclic peptides, which contain an unusual ureido linkage. Namalides are shorter structural homologues of anabaenopeptins, which also contain a ureido linkage. The biosynthetic origins of namalides are unknown despite a strong resemblance to anabaenopeptins. Here, we show that the cyanobacterium Nostoc sp. CENA543 produces new (nostamide B-E (2, 4, 5, and 6)) and known variants of anabaenopeptins (schizopeptin 791 (1) and anabaenopeptin 807 (3)). Surprisingly, Nostoc sp. CENA543 also produced namalide B (8) and the new namalides D (7), E (9), and F (10) in similar amounts to anabaenopeptins. Analysis of the complete Nostoc sp. CENA543 genome sequence indicates that both anabaenopeptins and namalides are produced by the same biosynthetic pathway through module skipping during biosynthesis. This unique process involves the skipping of two modules present in different nonribosomal peptide synthetases during the namalide biosynthesis. This skipping is an efficient mechanism since both anabaenopeptins and namalides are synthesized in similar amounts by Nostoc sp. CENA543. Consequently, gene skipping may be used to increase and possibly broaden the chemical diversity of related peptides produced by a single biosynthetic gene cluster. Genome mining demonstrated that the anabaenopeptin gene clusters are widespread in cyanobacteria and can also be found in tectomicrobia bacteria.
Xiao, Xin; Chen, Zaiming; Chen, Baoliang
2016-01-01
Biochar is increasingly gaining attention due to multifunctional roles in soil amelioration, pollution mitigation and carbon sequestration. It is a significant challenge to compare the reported results from world-wide labs regarding the structure and sorption of biochars derived from various precursors under different pyrolytic conditions due to a lack of a simple linkage. By combining the published works on various biochars, we established a quantitative relationship between H/C atomic ratio and pyrolytic temperature (T), aromatic structure, and sorption properties for naphthalene and phenanthrene. A reverse sigmoid shape between T and the H/C ratio was observed, which was independent of the precursors of biochars, including the ash contents. Linear correlations of Freundlich parameters (N, log Kf) and sorption amount (log Qe, log QA) with H/C ratios were found. A rectangle-like model was proposed to predict the aromatic cluster sizes of biochars from their H/C ratios, and then a good structure-sorption relationship was derived. These quantitative relationships indicate that the H/C atomic ratio is a universal linkage to predict pyrolytic temperatures, aromatic cluster sizes, and sorption characteristics. This study would guide the global study of biochars toward being comparable, and then the development of the structure-sorption relationships will benefit the structural design and environmental application of biochars. PMID:26940984
McNicol, L A; De, S P; Kaper, J B; West, P A; Colwell, R R
1983-01-01
A total of 165 strains of vibrios isolated from clinical and environmental sources in the United States, India, and Bangladesh, 11 reference cultures, and 4 duplicated cultures were compared in a numerical taxonomic study using 83 unit characters. Similarity between strains was computed by using the simple matching coefficient and the Jaccard coefficient. Strains were clustered by unweighted average linkage and single linkage algorithms. All methods gave similar cluster compositions. The estimated probability of error in the study was obtained from a comparison of the results of duplicated strains and was within acceptable limits. A total of 174 of the 180 organisms studied were divided into eight major clusters. Two clusters were identified as Vibrio cholerae, one as Vibrio mimicus, one as Vibrio parahaemolyticus, three as Vibrio species, and one as Aeromonas hydrophila. The V. mimicus cluster could be further divided into two subclusters, and the major V. cholerae group could be split into seven minor subclusters. Phenotypic traits routinely used to identify clinical isolates of V. cholerae can be used to identify environmental V. cholerae isolates. No distinction was found between strains of V. cholerae isolated from regions endemic for cholera and strains from nonendemic regions. PMID:6874901
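The two similarity coefficients named in this abstract can be written out in a few lines of Python. In the sketch below each strain is a list of 0/1 scores for the unit characters; the function names, the example profiles, and the convention of returning 0.0 when neither strain has any positive character are assumptions made for illustration.

```python
def simple_matching(a, b):
    """Fraction of characters on which two strains agree (1-1 and 0-0 both count)."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def jaccard(a, b):
    """Shared positives divided by characters positive in at least one strain."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same length")
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    # Assumed convention: two all-negative profiles get similarity 0.0.
    return both / either if either else 0.0


# Made-up profiles for two strains scored on six characters.
strain_a = [1, 1, 0, 0, 1, 0]
strain_b = [1, 0, 0, 0, 1, 1]
print(simple_matching(strain_a, strain_b))
print(jaccard(strain_a, strain_b))
```

The strains agree on four of six characters, so the simple matching coefficient is 4/6, while the Jaccard coefficient ignores the two shared negatives and gives 2/4 = 0.5.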
Gu, Yu; Zhao, Qian-Cheng; Sun, De-Ling; Song, Wen-Qin
2007-06-01
Nucleotide binding site (NBS) profiling, a new method, was used to map resistance gene analogues (RGAs) in cauliflower (Brassica oleracea var. botrytis). This method allows amplification and mapping of genetic markers anchored in the conserved NBS encoding domain of plant disease resistance genes. AFLP was also performed to construct the cauliflower intervarietal genetic map. The aim of constructing the genetic map was to identify potential molecular markers linked to important agronomic traits that would be particularly useful for developing and improving the species. Using 17 AFLP primer combinations and two degenerate primer/enzyme combinations, a total of 234 AFLP markers and 21 NBS markers were mapped in the F2 population derived from self-pollinating a single F1 plant of the cross AD White Flower x C-8. The markers were mapped in 9 major linkage groups spanning 668.4 cM, with an average distance of 2.9 cM between adjacent mapped markers. The AFLP markers were well distributed throughout the linkage groups. The linkage groups contained from 12 to 47 loci each and the distance between two consecutive loci ranged from 0 to 14.9 cM. NBS markers were mapped on 8 of the 9 linkage groups of the genetic map. Most of these markers were organized in clusters. This result demonstrates the feasibility of the NBS-profiling method for generating NBS markers for resistance loci in cauliflower. The clustering of the markers mapped in this study adds to the evidence that most of them could be real RGAs.
Shanker, Jayashree; Perumal, Ganapathy; Rao, Veena S; Khadrinarasimhiah, Natesha B; John, Shibu; Hebbagodi, Sridhara; Mukherjee, Manjari; Kakkar, Vijay V
2008-01-01
Background The APOA1-C3-A5 gene cluster plays an important role in the regulation of lipids. Asian Indians have an increased tendency for abnormal lipid levels and a high risk of Coronary Artery Disease (CAD). Therefore, the present study aimed to elucidate the relationship of four single nucleotide polymorphisms (SNPs) in the Apo11q cluster, namely the -75G>A and +83C>T SNPs in the APOA1 gene, the Sac1 SNP in the APOC3 gene and the S19W variant in the APOA5 gene, to plasma lipids and CAD in 190 affected sibling pairs (ASPs) belonging to Asian Indian families with a strong CAD history. Methods & results Genotyping and lipid assays were carried out using standard protocols. Plasma lipids showed a strong heritability (h2 48% – 70%; P < 0.0001). A subset of 77 ASPs with a positive Logarithm of Odds (LOD) score showed significant linkage to the CAD trait by multi-point analysis (LOD score 7.42, P < 0.001) and to the Sac1 (LOD score 4.49) and -75G>A (LOD score 2.77) SNPs by single-point analysis (P < 0.001). There was a significant proportion of mean allele sharing (pi) for the Sac1 (pi 0.59), -75G>A (pi 0.56) and +83C>T (pi 0.52) (P < 0.001) SNPs, respectively. QTL analysis showed suggestive evidence of linkage of the Sac1 SNP to Total Cholesterol (TC), High Density Lipoprotein-cholesterol (HDL-C) and Apolipoprotein B (ApoB) with LOD scores of 1.42, 1.72 and 1.19, respectively (P < 0.01). The Sac1 and -75G>A SNPs along with hypertension showed maximized correlations with TC, TG and ApoB by association analysis. Conclusion The APOC3-Sac1 SNP is an important genetic variant that is associated with CAD through its interaction with plasma lipids and other standard risk factors among Asian Indians. PMID:18801202
Nedeljkovic, Ivana; Terzikhan, Natalie; Vonk, Judith M; van der Plaat, Diana A; Lahousse, Lies; van Diemen, Cleo C; Hobbs, Brian D; Qiao, Dandi; Cho, Michael H; Brusselle, Guy G; Postma, Dirkje S; Boezen, H M; van Duijn, Cornelia M; Amin, Najaf
2018-01-01
Chronic obstructive pulmonary disease (COPD) is a complex and heritable disease, associated with multiple genetic variants. Specific familial types of COPD may be explained by rare variants, which have not been widely studied. We aimed to discover rare genetic variants underlying COPD through a genome-wide linkage scan. Affected-only analysis was performed using the 6K Illumina Linkage IV Panel in 142 cases clustered in 27 families from a genetic isolate, the Erasmus Rucphen Family (ERF) study. Potential causal variants were identified by searching for shared rare variants in the exome-sequence data of the affected members of the families contributing most to the linkage peak. The identified rare variants were then tested for association with COPD in a large meta-analysis of several cohorts. Significant evidence for linkage was observed on chromosomes 15q14-15q25 [logarithm of the odds (LOD) score = 5.52], 11p15.4-11q14.1 (LOD = 3.71) and 5q14.3-5q33.2 (LOD = 3.49). In the chromosome 15 peak, which harbors the known COPD locus for nicotinic receptors, and in the chromosome 5 peak we could not identify shared variants. In the chromosome 11 locus, we identified four rare (minor allele frequency (MAF) <0.02), predicted pathogenic, missense variants. These were shared among the affected family members. The identified variants localize to genes including neuroblast differentiation-associated protein (AHNAK), previously associated with blood biomarkers in COPD, phospholipase C Beta 3 (PLCB3), shown to increase airway hyper-responsiveness, solute carrier family 22-A11 (SLC22A11), involved in amino acid metabolism and ion transport, and metallothionein-like protein 5 (MTL5), involved in nicotinate and nicotinamide metabolism. Associations of SLC22A11 and MTL5 variants were confirmed in the meta-analysis of 9,888 cases and 27,060 controls. In conclusion, we have identified novel rare variants in plausible genes related to COPD. Future studies using large-sample whole-genome sequencing should confirm the associations at chromosome 11 and investigate the chromosome 15 and 5 linked regions.
Cai, Guowen; Cole, Shelley A; Freeland-Graves, Jeanne H; MacCluer, Jean W; Blangero, John; Comuzzie, Anthony G
2004-10-01
Metabolic syndrome refers to the clustering of disease conditions such as insulin resistance, hyperinsulinemia, dyslipidemia, hypertension, and obesity. To explore the genetic predispositions of this complex syndrome, we conducted a principal components analysis using data on 14 phenotypes related to the risk of developing metabolic syndrome. The subjects were 566 nondiabetic Mexican Americans, distributed in 41 extended families from the San Antonio Family Heart Study. The factor scores obtained from these 14 phenotypes were used in multipoint linkage analysis using SOLAR. Factors were identified that accounted for 73% of the total variance of the original variables: body size-adiposity, insulin-glucose, blood pressure, and lipid levels. Each factor exhibited evidence for either significant or suggestive linkage involving four factor-specific chromosomal regions relating to chromosomes 1, 3, 4, and 6. Significant evidence for linkage of the lipid factor was found on chromosome 4 near marker D4S403 (LOD = 3.52), where the cholecystokinin A receptor (CCKAR) and ADP-ribosyl cyclase 1 (CD38) genes are located. Suggestive evidence for linkage of the body size-adiposity factor to chromosome 1 near marker D1S1597 (LOD = 2.53) in the region containing the nuclear receptor subfamily 0, group B, member 2 gene (NR0B2) also was observed. The insulin-glucose and blood pressure factors were linked suggestively to regions on chromosome 3 near marker D3S1595 (LOD = 2.20) and on chromosome 6 near marker D6S1031 (LOD = 2.08), respectively. In summary, our findings suggest that the factor structures for the risk of metabolic syndrome are influenced by multiple distinct genes across the genome.
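For readers unfamiliar with LOD scores, the following Python sketch shows the textbook two-point case: the base-10 logarithm of the likelihood of the data at a recombination fraction theta relative to the likelihood under no linkage (theta = 0.5). It is only an illustration under simplifying assumptions (phase-known, fully informative meioses); the multipoint analysis in SOLAR used in this study is more involved, and the function name and counts below are hypothetical.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """LOD score for phase-known meioses at recombination fraction theta.

    LOD = log10( L(theta) / L(0.5) ), with
    L(theta) = theta**k * (1 - theta)**(n - k) for k recombinants in n meioses.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    k, n = recombinants, meioses
    likelihood_linked = theta ** k * (1.0 - theta) ** (n - k)
    likelihood_unlinked = 0.5 ** n
    if likelihood_linked == 0.0:
        return float("-inf")  # data are impossible at this theta
    return math.log10(likelihood_linked / likelihood_unlinked)


# Hypothetical counts: 2 recombinants among 20 informative meioses.
lod = two_point_lod(2, 20, 0.1)
odds = 10 ** lod  # a LOD of 3 corresponds to odds of 1000:1 in favour of linkage
print(round(lod, 2), round(odds))
```

Because the score is a base-10 logarithm of an odds ratio, each additional LOD unit multiplies the odds in favour of linkage by ten.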
2014-01-01
Background Bean anthracnose is caused by the fungus Colletotrichum lindemuthianum (Sacc. & Magnus) Lams.-Scrib. Resistance to C. lindemuthianum in common bean (Phaseolus vulgaris L.) generally follows a qualitative mode of inheritance. The pathogen shows extensive pathogenic variation and up to 20 anthracnose resistance loci (named Co-), conferring resistance to specific races, have been described. Anthracnose resistance has generally been investigated by analyzing a limited number of isolates or races in segregating populations. In this work, we analyzed the response against eleven C. lindemuthianum races in a recombinant inbred line (RIL) common bean population derived from the cross Xana × Cornell 49242 in which a saturated linkage map was previously developed. Results A systematic genetic analysis was carried out to dissect the complex resistance segregations observed, which included contingency analyses, subpopulations and genetic mapping. Twenty-two resistance genes were identified, some with a complementary mode of action. The Cornell 49242 genotype carries a complex cluster of resistance genes at the end of linkage group (LG) Pv11 corresponding to the previously described anthracnose resistance cluster Co-2. In this position, specific resistance genes to races 3, 6, 7, 19, 38, 39, 65, 357, 449 and 453 were identified, with one of them showing a complementary mode of action. In addition, Cornell 49242 had an independent gene on LG Pv09 showing a complementary mode of action for resistance to race 453. Resistance genes in genotype Xana were located on three regions involving LGs Pv01, Pv02 and Pv04. All resistance genes identified in Xana showed a complementary mode of action, except for two controlling resistance to races 65 and 73 located on LG Pv01, in the position of the previously described anthracnose resistance cluster Co-1. Conclusions Results shown herein reveal a complex and specific interaction between bean and fungus genotypes leading to anthracnose resistance. Organization of specific resistance genes in clusters including resistance genes with different modes of action (dominant and complementary genes) was also confirmed. Finally, new locations for anthracnose resistance genes were identified in LG Pv09. PMID:24779442
Pascual-García, Alberto; Abia, David; Ortiz, Angel R; Bastolla, Ugo
2009-03-01
Structural classifications of proteins assume the existence of the fold, which is an intrinsic equivalence class of protein domains. Here, we test in which conditions such an equivalence class is compatible with objective similarity measures. We base our analysis on the transitive property of the equivalence relationship, requiring that similarity of A with B and B with C implies that A and C are also similar. Divergent gene evolution leads us to expect that the transitive property should approximately hold. However, if protein domains are a combination of recurrent short polypeptide fragments, as proposed by several authors, then similarity of partial fragments may violate the transitive property, favouring the continuous view of the protein structure space. We propose a measure to quantify the violations of the transitive property when a clustering algorithm joins elements into clusters, and we find out that such violations present a well defined and detectable cross-over point, from an approximately transitive regime at high structure similarity to a regime with large transitivity violations and large differences in length at low similarity. We argue that protein structure space is discrete and hierarchic classification is justified up to this cross-over point, whereas at lower similarities the structure space is continuous and it should be represented as a network. We have tested the qualitative behaviour of this measure, varying all the choices involved in the automatic classification procedure, i.e., domain decomposition, alignment algorithm, similarity score, and clustering algorithm, and we have found out that this behaviour is quite robust. The final classification depends on the chosen algorithms. We used the values of the clustering coefficient and the transitivity violations to select the optimal choices among those that we tested. Interestingly, this criterion also favours the agreement between automatic and expert classifications. 
As a domain set, we have selected a consensus set of 2,890 domains decomposed very similarly in SCOP and CATH. As an alignment algorithm, we used a global version of MAMMOTH developed in our group, which is both rapid and accurate. As a similarity measure, we used the size-normalized contact overlap, and as a clustering algorithm, we used average linkage. The resulting automatic classification at the cross-over point was more consistent than expert ones with respect to the structure similarity measure, with 86% of the clusters corresponding to subsets of either SCOP or CATH superfamilies and fewer than 5% containing domains in distinct folds according to both SCOP and CATH. Almost 15% of SCOP superfamilies and 10% of CATH superfamilies were split, consistent with the notion of fold change in protein evolution. These results were qualitatively robust for all choices that we tested, although we did not try to use alignment algorithms developed by other groups. Folds defined in SCOP and CATH would be completely joined in the regime of large transitivity violations where clustering is more arbitrary. Consistently, the agreement between SCOP and CATH at fold level was lower than their agreement with the automatic classification obtained using as a clustering algorithm, respectively, average linkage (for SCOP) or single linkage (for CATH). The networks representing significant evolutionary and structural relationships between clusters beyond the cross-over point may allow us to perform evolutionary, structural, or functional analyses beyond the limits of classification schemes. These networks and the underlying clusters are available at http://ub.cbm.uam.es/research/ProtNet.php.
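The transitivity requirement discussed in this abstract (similarity of A with B and of B with C should imply similarity of A with C) can be illustrated with a minimal sketch. The function below counts triples in which exactly two of the three pairs pass a fixed similarity threshold. The threshold, the toy similarity matrix and the function name are illustrative assumptions; this is not the authors' violation measure, which is defined relative to the clustering procedure.

```python
from itertools import combinations

def count_transitivity_violations(sim, threshold):
    """Count triples where two pairs are similar but the third pair is not.

    sim is a symmetric matrix (list of lists) of pairwise similarities;
    two items are called similar when sim[i][j] >= threshold.
    """
    n = len(sim)
    violations = 0
    for i, j, k in combinations(range(n), 3):
        # Number of similar pairs among the three pairs of the triple.
        similar_pairs = sum(
            sim[a][b] >= threshold for a, b in ((i, j), (j, k), (i, k))
        )
        # Exactly two similar pairs means transitivity fails for this triple.
        if similar_pairs == 2:
            violations += 1
    return violations

# Hypothetical similarities: A~B and B~C are similar, A and C are not.
toy_sim = [
    [1.0, 0.8, 0.2],
    [0.8, 1.0, 0.7],
    [0.2, 0.7, 1.0],
]
```

Sweeping the threshold from high to low similarity on real data would be one way to look for the kind of cross-over point the abstract describes.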
Paintsil, Elijah; Verevochkin, Sergei V; Dukhovlinova, Elena; Niccolai, Linda; Barbour, Russell; White, Edward; Toussova, Olga V; Alexander, Louis; Kozlov, Andrei P; Heimer, Robert
2009-11-01
To understand the epidemiology and transmission patterns of hepatitis C virus (HCV), the predominant blood-borne pathogen infecting injection drug users (IDUs), in a part of the former Soviet Union. Cross-sectional respondent-driven sample of IDUs. St Petersburg, Russia. A total of 387 IDUs were recruited in late 2005 and throughout 2006. Participants were surveyed to collect demographic, medical and both general and dyad-specific drug injection and sexual behaviors. A blood sample was collected to detect antibodies to hepatitis C and to amplify viral RNA for molecular analysis. The molecular data, including genotypes, were analyzed spatially and linkage patterns were compared to the social linkages obtained by respondent-driven sampling (RDS) for chains of respondents and among the injection dyads. HCV infection was all but ubiquitous: 94.6% of IDUs were HCV-seropositive. Among the 209 viral sequences amplified, genotype 3a predominated (n = 119, 56.9%), followed by 1b (n = 61, 29.2%) and 1a (n = 25, 11.9%). There was no significant clustering of genotypes spatially. Neither genotypes nor closely related sequences were clustered within RDS chains. Analysis of HCV sequences from dyads failed to find associations of genotype or sequence homology within pairs. Genotyping reveals that there have been at least five unique introductions of HCV genotypes into the IDU community in St Petersburg. Analysis of prevalent infections does not appear to correlate with the social networks of IDUs, suggesting that simple approaches to link these networks to prevalent infections, rather than incident transmission, will not prove meaningful. On a more positive note, the majority of IDUs are infected with 3a genotype that is associated with sustained virological response to antiviral therapy.
Chang, Ni-Bin; Wimberly, Brent; Xuan, Zhemin
2012-03-01
This study presents an integrated k-means clustering and gravity model (IKCGM) for investigating the spatiotemporal patterns of nutrient and associated dissolved oxygen levels in Tampa Bay, Florida. By using a k-means clustering analysis to first partition the nutrient data into a user-specified number of subsets, it is possible to discover the spatiotemporal patterns of nutrient distribution in the bay and capture the inherent linkages of hydrodynamic and biogeochemical features. Such patterns may then be combined with a gravity model to link the nutrient source contribution from each coastal watershed to the generated clusters in the bay to aid in the source proportion analysis for environmental management. The clustering analysis was carried out based on 1 year (2008) water quality data composed of 55 sample stations throughout Tampa Bay collected by the Environmental Protection Commission of Hillsborough County. In addition, hydrological and river water quality data of the same year were acquired from the United States Geological Survey's National Water Information System to support the gravity modeling analysis. The results show that the k-means model with 8 clusters is the optimal choice, in which cluster 2 at Lower Tampa Bay had the minimum values of total nitrogen (TN) concentrations, chlorophyll a (Chl-a) concentrations, and ocean color values in every season as well as the minimum concentration of total phosphorus (TP) in three consecutive seasons in 2008. The datasets indicate that Lower Tampa Bay is an area with limited nutrient input throughout the year. Cluster 5, located in Middle Tampa Bay, displayed elevated TN concentrations, ocean color values, and Chl-a concentrations, suggesting that high values of colored dissolved organic matter are linked with some nutrient sources. The data presented by the gravity modeling analysis indicate that the Alafia River Basin is the major contributor of nutrients in terms of both TP and TN values in all seasons. 
With this new integration, improvements for environmental monitoring and assessment were achieved to advance our understanding of sea-land interactions and nutrient cycling in a critical coastal bay, the Gulf of Mexico. This journal is © The Royal Society of Chemistry 2012
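The k-means partitioning step mentioned in this abstract can be sketched with Lloyd's algorithm on one-dimensional data. This is a minimal illustration only: the study clustered multi-variable water quality data into 8 clusters, whereas the values, starting centers and function name below are hypothetical, and the gravity model is not reproduced.

```python
def kmeans_1d(values, centers, max_iter=100):
    """Lloyd's algorithm for one-dimensional data with given starting centers.

    Returns the final centers and the groups from the last assignment step.
    """
    centers = list(centers)
    groups = [[] for _ in centers]
    for _ in range(max_iter):
        # Assignment step: each value joins its nearest center.
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda c: abs(v - centers[c]))
            groups[nearest].append(v)
        # Update step: move each center to the mean of its group.
        new_centers = [
            sum(g) / len(g) if g else centers[i] for i, g in enumerate(groups)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, groups

# Hypothetical total nitrogen concentrations (mg/L) at six stations.
toy_tn = [0.4, 0.5, 0.6, 1.9, 2.0, 2.1]
toy_centers, toy_groups = kmeans_1d(toy_tn, centers=[0.0, 3.0])
```

In practice the number of clusters is a user choice, as the abstract notes, and would normally be compared across several values before settling on one.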
2012-01-01
Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. 
Conclusion This study will serve as a valuable genomic resource for tetraploid cotton genome assembly, for cloning genes related to superior agronomic traits, and for further comparative genomic analyses in Gossypium. PMID:23046547
Sung, Qing; Liu, Caiyun; Zhang, Guanyun; Zhang, Jian; Tung, Chen-Ho; Wang, Yifeng
2018-06-21
Novel 17-nuclear Zr-/Hf- oxide clusters ({Zr17} and {Hf17}) are isolated from aqueous systems. In the clusters, Zr/Hf ions are connected via μ3-O, μ3-OH and μ2-OH linkages into a pinwheel core which is wrapped with SO4(2-), HCOO- and aqua ligands. Octahedral hexanuclear Zr-/Hf- oxide clusters ({Zr6}oct and {Hf6}oct) are also isolated from the same hydrothermal system by decreasing the synthesis temperature. Structural analysis, synthetic conditions, vibrational spectra and ionic conductivity of the clusters are studied. Structural studies and synthesis inspection suggest that formation of {Zr6}oct and {Zr17} involves assembly of the same transferable building blocks, but the condensation degree and thermodynamic stability of the products increase with hydrothermal temperature. The role of {Zr6}oct and {Zr17} in the formation of ZrO2 nanocrystals is then discussed in the scenario of nonclassical nucleation theory. In addition, the Zr-oxide clusters exhibit ionic conductivity due to the mobility of protons. This study not only adds new members to the Zr-/Hf- oxide cluster family, but also establishes a connection from Zr4+ ions to ZrO2 in the hydrothermal preparation of zirconium oxide nanomaterials. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Towards cloning the WAS-gene locus: YAC-contigs and PFGE analysis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meindi, A.; Schindelhauer, D.; Hellebrand, H.
1994-09-01
Patients with X-linked recessive Wiskott-Aldrich syndrome (WAS) manifest eczema, thrombocytopenia and severe immunodeficiency. Mapping studies place the WAS gene locus between the markers TIMP and DXS255 which both have been shown to be recombinant with the disease locus. Linkage analysis in eight families including a large Swiss family showed tight linkage of the disease to the loci DXS255 and DXS1126 and exclusion of TIMP as well as polymorphic loci adjacent to the OATL1 pseudogene cluster (e.g., DXS6616). Physical mapping with established YAC contigs and a radiation hybrid encompassing the Xp11.22-11.3 region revealed the loci order TIMP-PFC-elk1-DXS1367-DXS6616-OATL1-(DXS11260DXS226)-C5-3-TGE-3, SYP and (DXS255-DXS146). The markers TIMP and C5-3 are contained on the same 1.6 Mb MluI-fragment. A novel expressed sequence (R1) could be placed between elk-1 and the PFC gene while the STS C5-3 could be localized adjacent to DXS1126. The gene cluster around DXS1126 could be connected with the TFE-3 and synaptophysin genes which map on the same 400 kb MluI fragment and two overlapping YACs. The minimum distance between SYP and DXS255 is 1.2 Mb; the maximum distance is 2.2 Mb. Expressed sequences which are obtained from a cosmid contig around DXS1126 and C5-3 are being used for mutation screening in WAS patients.
Worldwide clustering of the corruption perception
NASA Astrophysics Data System (ADS)
Paulus, Michal; Kristoufek, Ladislav
2015-06-01
We inspect a possible clustering structure of the corruption perception among 134 countries. Using the average linkage clustering, we uncover a well-defined hierarchy in the relationships among countries. Four main clusters are identified and they suggest that countries worldwide can be quite well separated according to their perception of corruption. Moreover, we find a strong connection between corruption levels and a stage of development inside the clusters. The ranking of countries according to their corruption perfectly copies the ranking according to the economic performance measured by the gross domestic product per capita of the member states. To the best of our knowledge, this study is the first one to present an application of hierarchical and clustering methods to the specific case of corruption.
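Average linkage clustering, the method named in this abstract, merges at each step the two clusters whose members have the smallest mean pairwise distance. The sketch below is a naive pure-Python version for small inputs. The distance matrix and function name are hypothetical, and nothing here reproduces the study's data or its choice of distance between countries.

```python
def average_linkage(dist, n_clusters):
    """Naive agglomerative clustering with average linkage (UPGMA).

    dist is a symmetric matrix of pairwise distances. Clusters are merged
    until n_clusters remain (n_clusters must be at least 1); each cluster
    is returned as a sorted list of item indices.
    """
    if n_clusters < 1:
        raise ValueError("n_clusters must be at least 1")
    clusters = [[i] for i in range(len(dist))]

    def mean_distance(a, b):
        # Average of all pairwise distances between members of a and b.
        return sum(dist[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Find the pair of clusters with the smallest average distance.
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                d = mean_distance(clusters[x], clusters[y])
                if best is None or d < best[0]:
                    best = (d, x, y)
        _, x, y = best
        merged = sorted(clusters[x] + clusters[y])
        clusters = [c for idx, c in enumerate(clusters) if idx not in (x, y)]
        clusters.append(merged)
    return sorted(clusters)

# Hypothetical distances between four countries' corruption-perception profiles.
toy_dist = [
    [0.0, 1.0, 6.0, 7.0],
    [1.0, 0.0, 5.0, 6.0],
    [6.0, 5.0, 0.0, 2.0],
    [7.0, 6.0, 2.0, 0.0],
]
```

Recording the merge distances at each step would give the hierarchy (dendrogram) from which a small number of main clusters can be read off.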
Okesola, Nonhlanhla; Tanser, Frank; Thiebaut, Rodolphe; Rekacewicz, Claire; Newell, Marie-Louise
2016-01-01
Background The 2015 WHO recommendation of antiretroviral therapy (ART) for all immediately following HIV diagnosis is partially based on the anticipated impact on HIV incidence in the surrounding population. We investigated this approach in a cluster-randomised trial in a high HIV prevalence setting in rural KwaZulu-Natal. We present findings from the first phase of the trial and report on uptake of home-based HIV testing, linkage to care, uptake of ART, and community attitudes about ART. Methods and Findings Between 9 March 2012 and 22 May 2014, five clusters in the intervention arm (immediate ART offered to all HIV-positive adults) and five clusters in the control arm (ART offered according to national guidelines, i.e., CD4 count ≤ 350 cells/μl) contributed to the first phase of the trial. Households were visited every 6 mo. Following informed consent and administration of a study questionnaire, each resident adult (≥16 y) was asked for a finger-prick blood sample, which was used to estimate HIV prevalence, and offered a rapid HIV test using a serial HIV testing algorithm. All HIV-positive adults were referred to the trial clinic in their cluster. Those not linked to care 3 mo after identification were contacted by a linkage-to-care team. Study procedures were not blinded. In all, 12,894 adults were registered as eligible for participation (5,790 in intervention arm; 7,104 in control arm), of whom 9,927 (77.0%) were contacted at least once during household visits. HIV status was ever ascertained for a total of 8,233/9,927 (82.9%), including 2,569 ascertained as HIV-positive (942 tested HIV-positive and 1,627 reported a known HIV-positive status). Of the 1,177 HIV-positive individuals not previously in care and followed for at least 6 mo in the trial, 559 (47.5%) visited their cluster trial clinic within 6 mo. In the intervention arm, 89% (194/218) initiated ART within 3 mo of their first clinic visit. 
In the control arm, 42.3% (83/196) had a CD4 count ≤ 350 cells/μl at first visit, of whom 92.8% initiated ART within 3 mo. Regarding attitudes about ART, 93% (8,802/9,460) of participants agreed with the statement that they would want to start ART as soon as possible if HIV-positive. Estimated baseline HIV prevalence was 30.5% (2,028/6,656) (95% CI 25.0%, 37.0%). HIV prevalence, uptake of home-based HIV testing, linkage to care within 6 mo, and initiation of ART within 3 mo in those with CD4 count ≤ 350 cells/μl did not differ significantly between the intervention and control clusters. Selection bias related to noncontact could not be entirely excluded. Conclusions Home-based HIV testing was well received in this rural population, although men were less easily contactable at home; immediate ART was acceptable, with good viral suppression and retention. However, only about half of HIV-positive people accessed care within 6 mo of being identified, with nearly two-thirds accessing care by 12 mo. The observed delay in linkage to care would limit the individual and public health ART benefits of universal testing and treatment in this population. Trial registration ClinicalTrials.gov NCT01509508 PMID:27504637
Iwuji, Collins C; Orne-Gliemann, Joanna; Larmarange, Joseph; Okesola, Nonhlanhla; Tanser, Frank; Thiebaut, Rodolphe; Rekacewicz, Claire; Newell, Marie-Louise; Dabis, Francois
2016-08-01
The 2015 WHO recommendation of antiretroviral therapy (ART) for all immediately following HIV diagnosis is partially based on the anticipated impact on HIV incidence in the surrounding population. We investigated this approach in a cluster-randomised trial in a high HIV prevalence setting in rural KwaZulu-Natal. We present findings from the first phase of the trial and report on uptake of home-based HIV testing, linkage to care, uptake of ART, and community attitudes about ART. Between 9 March 2012 and 22 May 2014, five clusters in the intervention arm (immediate ART offered to all HIV-positive adults) and five clusters in the control arm (ART offered according to national guidelines, i.e., CD4 count ≤ 350 cells/μl) contributed to the first phase of the trial. Households were visited every 6 mo. Following informed consent and administration of a study questionnaire, each resident adult (≥16 y) was asked for a finger-prick blood sample, which was used to estimate HIV prevalence, and offered a rapid HIV test using a serial HIV testing algorithm. All HIV-positive adults were referred to the trial clinic in their cluster. Those not linked to care 3 mo after identification were contacted by a linkage-to-care team. Study procedures were not blinded. In all, 12,894 adults were registered as eligible for participation (5,790 in intervention arm; 7,104 in control arm), of whom 9,927 (77.0%) were contacted at least once during household visits. HIV status was ever ascertained for a total of 8,233/9,927 (82.9%), including 2,569 ascertained as HIV-positive (942 tested HIV-positive and 1,627 reported a known HIV-positive status). Of the 1,177 HIV-positive individuals not previously in care and followed for at least 6 mo in the trial, 559 (47.5%) visited their cluster trial clinic within 6 mo. In the intervention arm, 89% (194/218) initiated ART within 3 mo of their first clinic visit. 
In the control arm, 42.3% (83/196) had a CD4 count ≤ 350 cells/μl at first visit, of whom 92.8% initiated ART within 3 mo. Regarding attitudes about ART, 93% (8,802/9,460) of participants agreed with the statement that they would want to start ART as soon as possible if HIV-positive. Estimated baseline HIV prevalence was 30.5% (2,028/6,656) (95% CI 25.0%, 37.0%). HIV prevalence, uptake of home-based HIV testing, linkage to care within 6 mo, and initiation of ART within 3 mo in those with CD4 count ≤ 350 cells/μl did not differ significantly between the intervention and control clusters. Selection bias related to noncontact could not be entirely excluded. Home-based HIV testing was well received in this rural population, although men were less easily contactable at home; immediate ART was acceptable, with good viral suppression and retention. However, only about half of HIV-positive people accessed care within 6 mo of being identified, with nearly two-thirds accessing care by 12 mo. The observed delay in linkage to care would limit the individual and public health ART benefits of universal testing and treatment in this population. ClinicalTrials.gov NCT01509508.
Cortical atrophy patterns in early Parkinson's disease patients using hierarchical cluster analysis.
Uribe, Carme; Segura, Barbara; Baggio, Hugo Cesar; Abos, Alexandra; Garcia-Diaz, Anna Isabel; Campabadal, Anna; Marti, Maria Jose; Valldeoriola, Francesc; Compta, Yaroslau; Tolosa, Eduard; Junque, Carme
2018-05-01
Cortical brain atrophy detectable with MRI in non-demented advanced Parkinson's disease (PD) is well characterized, but its presence in early disease stages is still under debate. We aimed to investigate cortical atrophy patterns in a large sample of early untreated PD patients using a hypothesis-free data-driven approach. Seventy-seven de novo PD patients and 50 controls from the Parkinson's Progression Marker Initiative database with T1-weighted images in a 3-tesla Siemens scanner were included in this study. Mean cortical thickness was extracted from 360 cortical areas defined by the Human Connectome Project Multi-Modal Parcellation version 1.0, and a hierarchical cluster analysis was performed using Ward's linkage method. A general linear model with cortical thickness data was then used to compare clustering groups using FreeSurfer software. We identified two patterns of cortical atrophy. Compared with controls, patients grouped in pattern 1 (n = 33) were characterized by cortical thinning in bilateral orbitofrontal, anterior cingulate, and lateral and medial anterior temporal gyri. Patients in pattern 2 (n = 44) showed cortical thinning in bilateral occipital gyrus, cuneus, superior parietal gyrus, and left postcentral gyrus, and they showed neuropsychological impairment in memory and other cognitive domains. Even in the early stages of PD, there is evidence of cortical brain atrophy. Neuroimaging clustering analysis is able to detect two subgroups of cortical thinning, one with mainly anterior atrophy, and the other with posterior predominance and worse cognitive performance. Copyright © 2018 Elsevier Ltd. All rights reserved.
Campa, Ana; Giraldez, Ramón; Ferreira, Juan José
2011-06-01
Resistance to the eight races (3, 7, 19, 31, 81, 449, 453, and 1545) of the pathogenic fungus Colletotrichum lindemuthianum (anthracnose) was evaluated in F(3) families derived from the cross between the anthracnose differential bean cultivars Kaboon and Michelite. Molecular marker analyses were carried out in the F(2) individuals in order to map and characterize the anthracnose resistance genes or gene clusters present in Kaboon. The analysis of the combined segregations indicates that the resistance present in Kaboon against these eight anthracnose races is determined by 13 different race-specific genes grouped in three clusters. One of these clusters, corresponding to locus Co-1 in linkage group (LG) 1, carries two dominant genes conferring specific resistance to races 81 and 1545, respectively, and a gene necessary (dominant complementary gene) for the specific resistance to race 31. A second cluster, corresponding to locus Co-3/9 in LG 4, carries six dominant genes conferring specific resistance to races 3, 7, 19, 449, 453, and 1545, respectively, and the second dominant complementary gene for the specific resistance to race 31. A third cluster of unknown location carries three dominant genes conferring specific resistance to races 449, 453, and 1545, respectively. This is the first time that two anthracnose resistance genes with a complementary mode of action have been mapped in common bean and their relationship with previously known Co- resistance genes established.
Broders, K D; Boraks, A; Sanchez, A M; Boland, G J
2012-01-01
The occurrence of multiple introduction events, or sudden emergence from a host jump, of forest pathogens may be an important factor in successful establishment in a novel environment or on a new host; however, few studies have focused on the introduction and emergence of fungal pathogens in forest ecosystems. While Ophiognomonia clavigignenti-juglandacearum (Oc-j), the butternut canker fungus, has caused range-wide mortality of butternut trees in North America since its first observation in 1967, the history of its emergence and spread across the United States and Canada remains unresolved. Using 17 single nucleotide polymorphic loci, we investigated the genetic population structure of 101 isolates of Oc-j from across North America. Clustering analysis revealed that the Oc-j population in North America is made up of three differentiated genetic clusters of isolates, and these genetic clusters were found to have a strong clonal structure. These results, in combination with the geographic distribution of the populations, suggest that Oc-j was introduced or has emerged in North America on more than one occasion, and these clonal lineages have since proliferated across much of the range of butternut. No evidence of genetic recombination was observed in the linkage analysis, and conservation of the distinct genetic clusters in regions where isolates from two or more genetic clusters are present, would indicate a very minimal or non-existent role of sexual recombination in populations of Oc-j in North America. PMID:23139872
Model-Based Linkage Analysis of a Quantitative Trait.
Song, Yeunjoo E; Song, Sunah; Schnell, Audrey H
2017-01-01
Linkage Analysis is a family-based method of analysis to examine whether any typed genetic markers cosegregate with a given trait, in this case a quantitative trait. If linkage exists, this is taken as evidence in support of a genetic basis for the trait. Historically, linkage analysis was performed using a binary disease trait, but has been extended to include quantitative disease measures. Quantitative traits are desirable as they provide more information than binary traits. Linkage analysis can be performed using single-marker methods (one marker at a time) or multipoint (using multiple markers simultaneously). In model-based linkage analysis the genetic model for the trait of interest is specified. There are many software options for performing linkage analysis. Here, we use the program package Statistical Analysis for Genetic Epidemiology (S.A.G.E.). S.A.G.E. was chosen because it also includes programs to perform data cleaning procedures and to generate and test genetic models for a quantitative trait, in addition to performing linkage analysis. We demonstrate in detail the process of running the program LODLINK to perform single-marker analysis, and MLOD to perform multipoint analysis using output from SEGREG, where SEGREG was used to determine the best fitting statistical model for the trait.
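As background to the LOD scores that single-marker linkage programs report, the sketch below computes a two-point LOD score in the simplest textbook setting: phase-known meioses in which recombinants can be counted directly. This is an assumption made for illustration. It is not what LODLINK or MLOD compute for a quantitative trait, where the likelihood comes from a fitted genetic model over whole pedigrees, and the function names are hypothetical.

```python
import math

def two_point_lod(theta, n_meioses, n_recombinants):
    """LOD score at recombination fraction theta for phase-known meioses.

    Compares the likelihood of the data at theta with the likelihood under
    free recombination (theta = 0.5), on a log10 scale.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    k = n_recombinants
    n = n_meioses
    likelihood = (theta ** k) * ((1.0 - theta) ** (n - k))
    null_likelihood = 0.5 ** n
    if likelihood == 0.0:
        # Observed recombinants are impossible at theta = 0.
        return float("-inf")
    return math.log10(likelihood / null_likelihood)

def max_lod(n_meioses, n_recombinants, steps=500):
    """Grid search for the theta in [0, 0.5] that maximises the LOD score."""
    grid = [0.5 * i / steps for i in range(steps + 1)]
    best_theta = max(
        grid, key=lambda t: two_point_lod(t, n_meioses, n_recombinants)
    )
    return best_theta, two_point_lod(best_theta, n_meioses, n_recombinants)
```

For example, `max_lod(10, 2)` searches for the recombination fraction that best explains 2 recombinants in 10 meioses; by construction the score at theta = 0.5 is zero.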
Topdar, N; Kundu, A; Sinha, M K; Sarkar, D; Das, M; Banerjee, S; Kar, C S; Satya, P; Balyan, H S; Mahapatra, B S; Gupta, P K
2013-01-01
We report the first complete microsatellite genetic map of jute (Corchorus olitorius L.; 2n = 2x = 14) using an F6 recombinant inbred population. Of the 403 microsatellite markers screened, 82 were mapped on the seven linkage groups (LGs) that covered a total genetic distance of 799.9 cM, with an average marker interval of 10.7 cM. LG5 had the longest and LG7 the shortest genetic lengths, whereas LG1 had the maximum and LG7 the minimum number of markers. Segregation distortion of microsatellite loci was high (61%), with the majority of them (76%) skewed towards the female parent. Genomewide non-parametric single-marker analysis in combination with multiple quantitative trait loci (QTL)-models (MQM) mapping detected 26 definitive QTLs for bast fibre quality, yield and yield-related traits. These were unevenly distributed on six LGs, as colocalized clusters, at genomic sectors marked by 15 microsatellite loci. LG1 was the QTL-richest map sector, with the densest colocalized clusters of QTLs governing fibre yield, yield-related traits and tensile strength. Expectedly, favorable QTLs were derived from the desirable parents, except for nearly all of those of fibre fineness, which might be due to the creation of new gene combinations. Our results will be a good starting point for further genome analyses in jute.
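The average marker interval quoted in this abstract can be checked with a short calculation. The sketch assumes the interval count is the number of markers minus the number of linkage groups (each group with m markers has m - 1 intervals); the abstract does not state how the figure was derived, so this convention and the function name are assumptions.

```python
def average_marker_interval(map_length_cm, n_markers, n_linkage_groups):
    """Mean spacing between adjacent markers on a linkage map.

    Each linkage group with m markers has m - 1 intervals, so the whole map
    has n_markers - n_linkage_groups intervals.
    """
    n_intervals = n_markers - n_linkage_groups
    if n_intervals <= 0:
        raise ValueError("need more markers than linkage groups")
    return map_length_cm / n_intervals

# Figures quoted in the abstract: 82 markers, 7 linkage groups, 799.9 cM.
jute_interval = average_marker_interval(799.9, 82, 7)
print(round(jute_interval, 1))  # 10.7
```

Dividing by the raw marker count instead (799.9 / 82) gives about 9.8 cM, so the reported 10.7 cM is consistent with the interval-based convention.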
Microseismic Event Relocation and Focal Mechanism Estimation Based on PageRank Linkage
NASA Astrophysics Data System (ADS)
Aguiar, A. C.; Myers, S. C.
2017-12-01
Microseismicity associated with enhanced geothermal systems (EGS) is key in understanding how subsurface stimulation can modify stress, fracture rock, and increase permeability. Large numbers of microseismic events are commonly associated with hydroshearing an EGS, making data mining methods useful in their analysis. We focus on PageRank, originally developed for Google's search engine, and subsequently adapted for use in seismology to detect low-frequency earthquakes by linking events directly and indirectly through cross-correlation (Aguiar and Beroza, 2014). We expand on this application by using PageRank to define signal-correlation topology for micro-earthquakes from the Newberry Volcano EGS in Central Oregon, which has been stimulated two times using high-pressure fluid injection. We create PageRank signal families from both data sets and compare these to the spatial and temporal proximity of associated earthquakes. PageRank families are relocated using differential travel times measured by waveform cross-correlation (CC) and the Bayesloc approach (Myers et al., 2007). Prior to relocation, events are loosely clustered, with events at a distance from the cluster. After relocation, event families are found to be tightly clustered. Indirect linkage of signals using PageRank is a reliable way to increase the number of events confidently determined to be similar, suggesting an efficient and effective grouping of earthquakes with similar physical characteristics (i.e., location, focal mechanism, stress drop). We further explore the possibility of using PageRank families to identify events with similar relative phase polarities and estimate focal mechanisms following the method of Shelly et al. (2016), where CC measurements are used to determine individual polarities within event clusters. Given a positive result, PageRank might be a useful tool in adaptive approaches to enhance production at well-instrumented geothermal sites. Prepared by LLNL under Contract DE-AC52-07NA27344. 
LLNL-ABS-722404.
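The idea of linking events through cross-correlation and ranking them with PageRank can be sketched as follows. Events are linked when their cross-correlation coefficient passes a threshold, and PageRank is then run by power iteration on the resulting undirected graph. The threshold, the unweighted links, the toy coefficients and the function names are all assumptions for illustration; this is not the implementation used in the work described above.

```python
def pagerank(adjacency, damping=0.85, tol=1e-10, max_iter=1000):
    """Power-iteration PageRank on an undirected, unweighted graph.

    adjacency maps each node to the set of nodes it is linked to.
    Nodes without links spread their rank uniformly over all nodes.
    """
    nodes = sorted(adjacency)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(max_iter):
        # Rank held by isolated nodes is redistributed evenly.
        dangling = sum(rank[v] for v in nodes if not adjacency[v])
        new_rank = {}
        for node in nodes:
            incoming = sum(
                rank[v] / len(adjacency[v]) for v in nodes if node in adjacency[v]
            )
            new_rank[node] = (1.0 - damping) / n + damping * (incoming + dangling / n)
        change = sum(abs(new_rank[v] - rank[v]) for v in nodes)
        rank = new_rank
        if change < tol:
            break
    return rank

def link_events(cc, threshold):
    """Link two events when their cross-correlation coefficient passes threshold."""
    n = len(cc)
    return {
        i: {j for j in range(n) if j != i and cc[i][j] >= threshold}
        for i in range(n)
    }

# Hypothetical cross-correlation coefficients for four events.
toy_cc = [
    [1.0, 0.9, 0.8, 0.1],
    [0.9, 1.0, 0.3, 0.2],
    [0.8, 0.3, 1.0, 0.1],
    [0.1, 0.2, 0.1, 1.0],
]
ranks = pagerank(link_events(toy_cc, threshold=0.7))
```

In this toy graph, event 0 correlates with both events 1 and 2 and so receives the highest rank; events 1 and 2 are linked to each other only indirectly, through event 0, which is the kind of indirect linkage the abstract refers to.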
Boyack, Kevin W.; Newman, David; Duhon, Russell J.; Klavans, Richard; Patek, Michael; Biberstine, Joseph R.; Schijvenaars, Bob; Skupin, André; Ma, Nianli; Börner, Katy
2011-01-01
Background We investigate the accuracy of different similarity approaches for clustering over two million biomedical documents. Clustering large sets of text documents is important for a variety of information needs and applications such as collection management and navigation, summary and analysis. The few comparisons of clustering results from different similarity approaches have focused on small literature sets and have given conflicting results. Our study was designed to seek a robust answer to the question of which similarity approach would generate the most coherent clusters of a biomedical literature set of over two million documents. Methodology We used a corpus of 2.15 million recent (2004-2008) records from MEDLINE, and generated nine different document-document similarity matrices from information extracted from their bibliographic records, including titles, abstracts and subject headings. The nine approaches were comprised of five different analytical techniques with two data sources. The five analytical techniques are cosine similarity using term frequency-inverse document frequency vectors (tf-idf cosine), latent semantic analysis (LSA), topic modeling, and two Poisson-based language models – BM25 and PMRA (PubMed Related Articles). The two data sources were a) MeSH subject headings, and b) words from titles and abstracts. Each similarity matrix was filtered to keep the top-n highest similarities per document and then clustered using a combination of graph layout and average-link clustering. Cluster results from the nine similarity approaches were compared using (1) within-cluster textual coherence based on the Jensen-Shannon divergence, and (2) two concentration measures based on grant-to-article linkages indexed in MEDLINE. 
Conclusions PubMed's own related article approach (PMRA) generated the most coherent and most concentrated cluster solution of the nine text-based similarity approaches tested, followed closely by the BM25 approach using titles and abstracts. Approaches using only MeSH subject headings were not competitive with those based on titles and abstracts. PMID:21437291
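The textual coherence measure named above rests on the Jensen-Shannon divergence between word distributions. A minimal sketch of that divergence follows; how the study aggregated it over documents and clusters is not stated in this abstract, so the helper names, the toy vocabulary, and the base-2 logarithm are assumptions.

```python
import math
from collections import Counter


def word_distribution(tokens, vocab):
    """Relative word frequencies over a fixed vocabulary (tokens must be non-empty)."""
    counts = Counter(tokens)
    total = sum(counts[w] for w in vocab)
    return [counts[w] / total for w in vocab]


def kl_divergence(p, m):
    """Kullback-Leibler divergence in bits; terms with p_i = 0 contribute 0."""
    return sum(pi * math.log2(pi / mi) for pi, mi in zip(p, m) if pi > 0)


def jensen_shannon(p, q):
    """Jensen-Shannon divergence: symmetric, and bounded by 1 bit in base 2."""
    m = [(a + b) / 2 for a, b in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


vocab = ["gene", "protein", "cluster", "trial"]
doc = word_distribution(["gene", "protein", "gene", "cluster"], vocab)
cluster = word_distribution(["gene", "protein", "cluster", "cluster", "trial"], vocab)
divergence = jensen_shannon(doc, cluster)
```

A document whose word distribution sits close to that of its cluster gives a small divergence, which is the sense in which a cluster can be called textually coherent.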
Optimality Measures for Monotone Equivariant Cluster Techniques.
1980-09-01
complete linkage, u-clustering (u = .3, .5, .7), uv-clustering (uv = (.2,.4), (.2,.6), (.4,.6)) as well as the UPGMA algorithm. The idea will be to...Table 15. Notice that these measures do indeed produce different verdicts. OP1 rates UPGMA as best with uv = (.2,.4) second. By OP2, UPGMA is best...By OP1, UPGMA and uv = (.4,.6) are tied for first place, while by OP2, UPGMA is best with uv = (.2,.6), uv = (.2,.4) and uv = (.4,.6) close behind
Goicoechea, P G; Herrán, A; Durand, J; Bodénès, C; Plomion, C; Kremer, A
2015-01-01
We analyzed the genetic mosaic of speciation in two hybridizing Mediterranean white oaks from the Iberian Peninsula (Quercus faginea Lamb. and Quercus pyrenaica Willd.). The two species show ecological divergence in flowering phenology, leaf morphology and composition, and in their basic or acidic soil preferences. Ninety expressed sequence tag-simple sequence repeats (EST-SSRs) and eight nuclear SSRs were genotyped in 96 trees from each species. Genotyping was designed in two steps. First, we used 69 markers evenly distributed over the 12 linkage groups (LGs) of the oak linkage map to confirm the species genetic identity of the sampled genotypes, and searched for differentiation outliers. Then, we genotyped 29 additional markers from the chromosome bins containing the outliers and repeated the multilocus scans. We found one or two additional outliers within four saturated bins, thus confirming that outliers are organized into clusters. Linkage disequilibrium (LD) was extensive, even for loosely linked and for independent markers. Consequently, score tests for association between two-marker haplotypes and the 'species trait' showed a broad genomic divergence, although substantial variation across the genome and within LGs was also observed. We discuss the influence of several confounding effects on neutrality tests and review the evolutionary processes leading to extensive LD. Finally, we examine how LD analyses within regions that contain outlier clusters and quantitative trait loci can help to identify regions of divergence and/or genomic hitchhiking in the light of predictions from ecological speciation theory. PMID:25515016
Hierarchical clustering of HPV genotype patterns in the ASCUS-LSIL triage study
Wentzensen, Nicolas; Wilson, Lauren E.; Wheeler, Cosette M.; Carreon, Joseph D.; Gravitt, Patti E.; Schiffman, Mark; Castle, Philip E.
2010-01-01
Anogenital cancers are associated with about 13 carcinogenic HPV types in a broader group that cause cervical intraepithelial neoplasia (CIN). Multiple concurrent cervical HPV infections are common, which complicates the attribution of HPV types to different grades of CIN. Here we report the analysis of HPV genotype patterns in the ASCUS-LSIL triage study using unsupervised hierarchical clustering. Women who underwent colposcopy at baseline (n = 2780) were grouped into 20 disease categories based on histology and cytology. Disease groups and HPV genotypes were clustered using complete linkage. Risk of 2-year cumulative CIN3+, viral load, colposcopic impression, and age were compared between disease groups and major clusters. Hierarchical clustering yielded four major disease clusters: Cluster 1 included all CIN3 histology with abnormal cytology; Cluster 2 included CIN3 histology with normal cytology and combinations with either CIN2 or high-grade squamous intraepithelial lesion (HSIL) cytology; Cluster 3 included older women with normal or low grade histology/cytology and low viral load; Cluster 4 included younger women with low grade histology/cytology, multiple infections, and the highest viral load. Three major groups of HPV genotypes were identified: Group 1 included only HPV16; Group 2 included nine carcinogenic types plus non-carcinogenic HPV53 and HPV66; and Group 3 included non-carcinogenic types plus carcinogenic HPV33 and HPV45. Clustering results suggested that colposcopy missed a prevalent precancer in many women with no biopsy/normal histology and HSIL. This result was confirmed by an elevated 2-year risk of CIN3+ in these groups. Our novel approach to study multiple genotype infections in cervical disease using unsupervised hierarchical clustering can address complex genotype distributions on a population level. PMID:20959485
Pilot personality and crew coordination - Implications for training and selection
NASA Technical Reports Server (NTRS)
Chidester, Thomas R.; Helmreich, Robert L.; Gregorich, Steven E.; Geis, Craig E.
1991-01-01
It is contended that past failures to find linkages between performance and personality were due to a combination of premature performance evaluation, inadequate statistical modeling, and/or the reliance on data gathered in contrived as opposed to realistic situations. The goal of the research presented is to isolate subgroups of pilots along performance-related personality dimensions and to document limits on the impact of crew coordination training between the groups. Three different profiles were identified through cluster analysis of personality scales that replicated across samples and predicted attitude change following training in crew coordination.
2012-01-01
Background The turbot (Scophthalmus maximus) is a relevant species in European aquaculture. The small turbot genome provides a source for genomics strategies to use in order to understand the genetic basis of productive traits, particularly those related to sex, growth and pathogen resistance. Genetic maps are essential genomic screening tools that make it possible to localize quantitative trait loci (QTL) and to identify candidate genes through comparative mapping. This information is the backbone for developing marker-assisted selection (MAS) programs in aquaculture. Expressed sequence tag (EST) resources have largely increased in turbot, thus supplying numerous type I markers suitable for extending the previous linkage map, which was mostly based on anonymous loci. The aim of this study was to construct a higher-resolution turbot genetic map using EST-linked markers, which will be useful for comparative mapping studies. Results A consensus gene-enriched genetic map of the turbot was constructed using 463 SNP and microsatellite markers in nine reference families. This map contains 438 markers, 180 EST-linked, clustered in 24 linkage groups. Linkage and comparative genomics evidence suggested additional linkage group fusions toward the consolidation of the turbot map according to karyotype information. The linkage map showed a total length of 1402.7 cM with low average intermarker distance (3.7 cM; ~2 Mb). A global 1.6:1 female-to-male recombination frequency (RF) ratio was observed, although largely variable among linkage groups and chromosome regions. Comparative sequence analysis revealed large macrosyntenic patterns against model teleost genomes, significant hits decreasing from stickleback (54%) to zebrafish (20%). Comparative mapping supported particular chromosome rearrangements within Acanthopterygii and helped to assign unallocated markers to specific turbot linkage groups.
Conclusions The new gene-enriched high-resolution turbot map represents a useful genomic tool for QTL identification, positional cloning strategies, and future genome assembling. This map showed large synteny conservation against model teleost genomes. Comparative genomics and data mining from landmarks will provide straightforward access to candidate genes, which will be the basis for genetic breeding programs and evolutionary studies in this species. PMID:22747677
Lochner, Christine; Hemmings, Sian M J; Kinnear, Craig J; Niehaus, Dana J H; Nel, Daniel G; Corfield, Valerie A; Moolman-Smook, Johanna C; Seedat, Soraya; Stein, Dan J
2005-01-01
Comorbidity of certain obsessive-compulsive spectrum disorders (OCSDs; such as Tourette's disorder) in obsessive-compulsive disorder (OCD) may serve to define important OCD subtypes characterized by differing phenomenology and neurobiological mechanisms. Comorbidity of the putative OCSDs in OCD has, however, not often been systematically investigated. The Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Axis I Disorders-Patient Version as well as a Structured Clinical Interview for Putative OCSDs (SCID-OCSD) were administered to 210 adult patients with OCD (N = 210, 102 men and 108 women; mean age, 35.7 +/- 13.3). A subset of Caucasian subjects (with OCD, n = 171; control subjects, n = 168), including subjects from the genetically homogeneous Afrikaner population (with OCD, n = 77; control subjects, n = 144), was genotyped for polymorphisms in genes involved in monoamine function. Because the items of the SCID-OCSD are binary (present/absent), a cluster analysis (Ward's method) using the items of SCID-OCSD was conducted. The association of identified clusters with demographic variables (age, gender), clinical variables (age of onset, obsessive-compulsive symptom severity and dimensions, level of insight, temperament/character, treatment response), and monoaminergic genotypes was examined. Cluster analysis of the OCSDs in our sample of patients with OCD identified 3 separate clusters at a 1.1 linkage distance level. The 3 clusters were named as follows: (1) "reward deficiency" (including trichotillomania, Tourette's disorder, pathological gambling, and hypersexual disorder), (2) "impulsivity" (including compulsive shopping, kleptomania, eating disorders, self-injury, and intermittent explosive disorder), and (3) "somatic" (including body dysmorphic disorder and hypochondriasis).
Several significant associations were found between cluster scores and other variables; for example, cluster I scores were associated with earlier age of onset of OCD and the presence of tics, cluster II scores were associated with female gender and childhood emotional abuse, and cluster III scores were associated with less insight and with somatic obsessions and compulsions. However, none of these clusters were associated with any particular genetic variant. Analysis of comorbid OCSDs in OCD suggested that these lie on a number of different dimensions. These dimensions are partially consistent with previous theoretical approaches taken toward classifying OCD spectrum disorders. The lack of genetic validation of these clusters in the present study may indicate the involvement of other, as yet untested, genes. Further genetic and cluster analyses of comorbid OCSDs in OCD may ultimately contribute to a better delineation of OCD endophenotypes.
Lin, Sheng-Hsiang; Liu, Chih-Min; Liu, Yu-Li; Fann, Cathy Shen-Jang; Hsiao, Po-Chang; Wu, Jer-Yuarn; Hung, Shuen-Iu; Chen, Chun-Houh; Wu, Han-Ming; Jou, Yuh-Shan; Liu, Shi K.; Hwang, Tzung J.; Hsieh, Ming H.; Chang, Chien-Ching; Yang, Wei-Chih; Lin, Jin-Jia; Chou, Frank Huang-Chih; Faraone, Stephen V.; Tsuang, Ming T.; Hwu, Hai-Gwo; Chen, Wei J.
2009-01-01
Chromosome 6p is one of the most commonly implicated regions in the genome-wide linkage scans of schizophrenia, whereas further association studies for markers in this region were inconsistent likely due to heterogeneity. This study aimed to identify more homogeneous subgroups of families for fine mapping on regions around markers D6S296 and D6S309 (both in 6p24.3) as well as D6S274 (in 6p22.3) by means of similarity in neurocognitive functioning. A total of 160 families of patients with schizophrenia comprising at least two affected siblings who had data for 8 neurocognitive test variables of the Continuous Performance Test (CPT) and the Wisconsin Card Sorting Test (WCST) were subjected to cluster analysis with data visualization using the test scores of both affected siblings. Family clusters derived were then used separately in family-based association tests for 64 single nucleotide polymorphisms covering the region of 6p24.3 and 6p22.3. Three clusters were derived from the family-based clustering, with deficit cluster 1 representing deficit on the CPT, deficit cluster 2 representing deficit on both the CPT and the WCST, and a third cluster of non-deficit. After adjustment using false discovery rate for multiple testing, SNP rs13873 and haplotype rs1225934-rs13873 on BMP6-TXNDC5 genes were significantly associated with schizophrenia for the deficit cluster 1 but not for the deficit cluster 2 or non-deficit cluster. Our results provide further evidence that the BMP6-TXNDC5 locus on 6p24.3 may play a role in the selective impairments on sustained attention of schizophrenia. PMID:19694819
USDA-ARS's Scientific Manuscript database
The Ouro Negro common bean cultivar contains the Co-34/Phg-3 gene cluster that confers resistance to the anthracnose (ANT) and angular leaf spot (ALS) pathogens. These genes are tightly linked on chromosome 4. Ouro Negro also has the Ur-14 rust resistance gene, reportedly in the vicinity of Co- 34; ...
Users' perception as a tool to improve urban beach planning and management.
Cervantes, Omar; Espejel, Ileana; Arellano, Evarista; Delhumeau, Sheila
2008-08-01
Four beaches that share physiographic characteristics (sandy, wide, and long) but differ in socioeconomic and cultural terms (three are located in northwestern Mexico and one in California, USA) were evaluated by beach users. Surveys (565) composed of 36 questions were handed out to beach users on weekends and holidays in 2005. The 25 questions that revealed the most information were selected by factor analysis and classified by cluster analysis. Beach users' preferences were assigned a value by comparing the present survey results with the characteristics of an "ideal" recreational urban beach. Cluster analysis separated three groups of questions: (a) services and infrastructure, (b) recreational activities, and (c) beach conditions. Cluster linkage distance (r=0.82, r=0.78, r=0.67) was used as a weight and multiplied by the value of beach descriptive factors. Mazatlán and Oceanside obtained the highest values because there are enough infrastructure and services; on the contrary, Ensenada and Rosarito were rated medium and low because infrastructure and services are lacking. The presently proposed method can contribute to improving current beach evaluations because the final score represents the beach users' evaluation of the quality of the beach. The weight considered in the present study marks the beach users' preferences among the studied beaches. Adding this weight to beach evaluation will contribute to more specific beach planning in which users' perception is considered.
Users' Perception as a Tool to Improve Urban Beach Planning and Management
NASA Astrophysics Data System (ADS)
Cervantes, Omar; Espejel, Ileana; Arellano, Evarista; Delhumeau, Sheila
2008-08-01
Four beaches that share physiographic characteristics (sandy, wide, and long) but differ in socioeconomic and cultural terms (three are located in northwestern Mexico and one in California, USA) were evaluated by beach users. Surveys (565) composed of 36 questions were handed out to beach users on weekends and holidays in 2005. The 25 questions that revealed the most information were selected by factor analysis and classified by cluster analysis. Beach users’ preferences were assigned a value by comparing the present survey results with the characteristics of an “ideal” recreational urban beach. Cluster analysis separated three groups of questions: (a) services and infrastructure, (b) recreational activities, and (c) beach conditions. Cluster linkage distance ( r = 0.82, r = 0.78, r = 0.67) was used as a weight and multiplied by the value of beach descriptive factors. Mazatlán and Oceanside obtained the highest values because there are enough infrastructure and services; on the contrary, Ensenada and Rosarito were rated medium and low because infrastructure and services are lacking. The presently proposed method can contribute to improving current beach evaluations because the final score represents the beach users’ evaluation of the quality of the beach. The weight considered in the present study marks the beach users’ preferences among the studied beaches. Adding this weight to beach evaluation will contribute to more specific beach planning in which users’ perception is considered.
Linkage and related analyses of Barrett's esophagus and its associated adenocarcinomas.
Sun, Xiangqing; Elston, Robert; Falk, Gary W; Grady, William M; Faulx, Ashley; Mittal, Sumeet K; Canto, Marcia I; Shaheen, Nicholas J; Wang, Jean S; Iyer, Prasad G; Abrams, Julian A; Willis, Joseph E; Guda, Kishore; Markowitz, Sanford; Barnholtz-Sloan, Jill S; Chandar, Apoorva; Brock, Wendy; Chak, Amitabh
2016-07-01
Familial aggregation and segregation analysis studies have provided evidence of a genetic basis for esophageal adenocarcinoma (EAC) and its premalignant precursor, Barrett's esophagus (BE). We aim to demonstrate the utility of linkage analysis to identify the genomic regions that might contain the genetic variants that predispose individuals to this complex trait (BE and EAC). We genotyped 144 individuals in 42 multiplex pedigrees chosen from 1000 singly ascertained BE/EAC pedigrees, and performed both model-based and model-free linkage analyses, using S.A.G.E. and other software. Segregation models were fitted, from the data on both the 42 pedigrees and the 1000 pedigrees, to determine parameters for performing model-based linkage analysis. Model-based and model-free linkage analyses were conducted in two sets of pedigrees: the 42 pedigrees and a subset of 18 pedigrees with female affected members that are expected to be more genetically homogeneous. Genome-wide associations were also tested in these families. Linkage analyses on the 42 pedigrees identified several regions consistently suggestive of linkage by different linkage analysis methods on chromosomes 2q31, 12q23, and 4p14. A linkage on 15q26 is the only consistent linkage region identified in the 18 female-affected pedigrees, in which the linkage signal is higher than in the 42 pedigrees. Other tentative linkage signals are also reported. Our linkage study of BE/EAC pedigrees identified linkage regions on chromosomes 2, 4, 12, and 15, with some reported associations located within our linkage peaks. Our linkage results can help prioritize association tests to delineate the genetic determinants underlying susceptibility to BE and EAC.
Huang, Ruili; Lin, Ja-An; Sedykh, Alexander; Zhao, Jinghua; Tice, Raymond R.; Paules, Richard S.; Xia, Menghang; Auerbach, Scott S.
2017-01-01
Cytotoxicity is a commonly used in vitro endpoint for evaluating chemical toxicity. In support of the U.S. Tox21 screening program, the cytotoxicity of ~10K chemicals was interrogated at 0, 8, 16, 24, 32, & 40 hours of exposure in a concentration dependent fashion in two cell lines (HEK293, HepG2) using two multiplexed, real-time assay technologies. One technology measures the metabolic activity of cells (i.e., cell viability, glo) while the other evaluates cell membrane integrity (i.e., cell death, flor). Using glo technology, more actives and greater temporal variations were seen in HEK293 cells, while results for the flor technology were more similar across the two cell types. Chemicals were grouped into classes based on their cytotoxicity kinetics profiles and these classes were evaluated for their associations with activity in the Tox21 nuclear receptor and stress response pathway assays. Some pathways, such as the activation of H2AX, were associated with the fast-responding cytotoxicity classes, while others, such as activation of TP53, were associated with the slow-responding cytotoxicity classes. By clustering pathways based on their degree of association to the different cytotoxicity kinetics labels, we identified clusters of pathways where active chemicals presented similar kinetics of cytotoxicity. Such linkages could be due to shared underlying biological processes between pathways, for example, activation of H2AX and heat shock factor. Others involving nuclear receptor activity are likely due to shared chemical structures rather than pathway level interactions. Based on the linkage between androgen receptor antagonism and Nrf2 activity, we surmise that a subclass of androgen receptor antagonists cause cytotoxicity via oxidative stress that is associated with Nrf2 activation. 
In summary, the real-time cytotoxicity screen provides informative chemical cytotoxicity kinetics data related to their cytotoxicity mechanisms, and with our analysis, it is possible to formulate mechanism-based hypotheses on the cytotoxic properties of the tested chemicals. PMID:28531190
NASA Astrophysics Data System (ADS)
Di, Nur Faraidah Muhammad; Satari, Siti Zanariah
2017-05-01
Outlier detection in linear data sets has been studied extensively, but only a small amount of work has been done on outlier detection in circular data. In this study, we propose a method for detecting multiple outliers in circular regression models based on a clustering algorithm. Clustering techniques use a distance measure to define the distance between data points. Here, we introduce a similarity distance based on the Euclidean distance for the circular model and obtain a cluster tree using the single linkage clustering algorithm. Then, a stopping rule for the cluster tree based on the mean direction and circular standard deviation of the tree height is proposed. We classify the cluster groups that exceed the stopping rule as potential outliers. Our aim is to demonstrate the effectiveness of the proposed algorithms with the similarity distances in detecting the outliers. It is found that the proposed methods perform well and are applicable to circular regression models.
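The procedure described above (a distance for circular observations, a single linkage cluster tree, and a stopping rule built from the mean direction and circular standard deviation of the tree heights) might be sketched as follows. The abstract does not give the exact distance or the multiplier in the stopping rule, so `distance`, the cut of mean direction plus `k` circular standard deviations, and the default `k = 3` are assumptions made for illustration only.

```python
import math


def circ_diff(a, b):
    """Shortest arc between two angles in radians, in [0, pi]."""
    d = abs(a - b) % (2 * math.pi)
    return math.pi - abs(math.pi - d)


def distance(p, q):
    """Assumed distance: Euclidean norm of the arc differences in (u, v)."""
    return math.hypot(circ_diff(p[0], q[0]), circ_diff(p[1], q[1]))


def single_linkage(points, max_height=math.inf):
    """Naive single linkage; stops once the next merge exceeds max_height."""
    clusters = [[i] for i in range(len(points))]
    heights = []
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = min(distance(points[i], points[j])
                        for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        if d > max_height:
            break
        merged = clusters.pop(b)      # b > a, so index a stays valid
        clusters[a].extend(merged)
        heights.append(d)
    return clusters, heights


def cut_height(heights, k=3.0):
    """Mean direction plus k circular standard deviations of the heights."""
    c = sum(math.cos(h) for h in heights) / len(heights)
    s = sum(math.sin(h) for h in heights) / len(heights)
    r = min(1.0, max(math.hypot(c, s), 1e-12))   # guard against rounding
    return math.atan2(s, c) + k * math.sqrt(-2.0 * math.log(r))


def detect_outliers(points, k=3.0):
    """Indices of points that do not join the largest cluster below the cut."""
    _, heights = single_linkage(points)
    clusters, _ = single_linkage(points, max_height=cut_height(heights, k))
    main = max(clusters, key=len)
    return sorted(i for c in clusters if c is not main for i in c)


# Twenty points following v = u + 0.05 plus one point far from that trend.
points = [(0.05 * i, 0.05 * i + 0.05) for i in range(20)] + [(0.5, 2.5)]
outliers = detect_outliers(points)
```

The naive merge loop is cubic in the number of points, which is acceptable for a small illustration but not for large data sets.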
Li, C T; Shi, C H; Wu, J G; Xu, H M; Zhang, H Z; Ren, Y L
2004-04-01
The selection of an appropriate sampling strategy and a clustering method is important in the construction of core collections based on predicted genotypic values in order to retain the greatest degree of genetic diversity of the initial collection. In this study, methods of developing rice core collections were evaluated based on the predicted genotypic values for 992 rice varieties with 13 quantitative traits. The genotypic values of the traits were predicted by the adjusted unbiased prediction (AUP) method. Based on the predicted genotypic values, Mahalanobis distances were calculated and employed to measure the genetic similarities among the rice varieties. Six hierarchical clustering methods, including the single linkage, median linkage, centroid, unweighted pair-group average, weighted pair-group average and flexible-beta methods, were combined with random, preferred and deviation sampling to develop 18 core collections of rice germplasm. The results show that the deviation sampling strategy in combination with the unweighted pair-group average method of hierarchical clustering retains the greatest degree of genetic diversities of the initial collection. The core collections sampled using predicted genotypic values had more genetic diversity than those based on phenotypic values.
Genome wide linkage disequilibrium and genetic structure in Sicilian dairy sheep breeds.
Mastrangelo, Salvatore; Di Gerlando, Rosalia; Tolone, Marco; Tortorici, Lina; Sardina, Maria Teresa; Portolano, Baldassare
2014-10-10
The recent availability of sheep genome-wide SNP panels makes it possible to provide background information concerning genome structure in domestic animals. The aim of this work was to investigate the patterns of linkage disequilibrium (LD), the genetic diversity and population structure in Valle del Belice, Comisana, and Pinzirita dairy sheep breeds using the Illumina Ovine SNP50K Genotyping array. Average r² between adjacent SNPs across all chromosomes was 0.155 ± 0.204 for Valle del Belice, 0.156 ± 0.208 for Comisana, and 0.128 ± 0.188 for Pinzirita breeds, and some variations in LD value across chromosomes were observed, in particular for Valle del Belice and Comisana breeds. Average values of r² estimated for all pairwise combinations of SNPs pooled over all autosomes were 0.058 ± 0.023 for Valle del Belice, 0.056 ± 0.021 for Comisana, and 0.037 ± 0.017 for Pinzirita breeds. The LD declined as a function of distance and average r² was lower than the values observed in other sheep breeds. Consistency of results among the several approaches used (Principal component analysis, Bayesian clustering, F_ST, Neighbor networks) showed that while Valle del Belice and Pinzirita breeds formed a unique cluster, Comisana breed showed the presence of substructure. In Valle del Belice breed, the high level of genetic differentiation within breed, the heterogeneous cluster in Admixture analysis, but at the same time the highest inbreeding coefficient, suggested that the breed had a wide genetic base with inbred individuals belonging to the same flock. The Sicilian breeds were characterized by low genetic differentiation and high level of admixture. Pinzirita breed displayed the highest genetic diversity (He, Ne) whereas the lowest value was found in Valle del Belice breed. This study has reported for the first time estimates of LD and genetic diversity from a genome-wide perspective in Sicilian dairy sheep breeds.
Our results indicate that breeds formed non-overlapping clusters and are clearly separated populations and that Comisana sheep breed does not constitute a homogenous population. The information generated from this study has important implications for the design and applications of association studies as well as for development of conservation and/or selection breeding programs.
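The r² statistic reported above is the squared correlation between alleles at two loci. A minimal sketch of the standard definition, r² = D² / (pA(1 − pA) pB(1 − pB)) with D = pAB − pA·pB, is given below. It assumes phased haplotypes coded 0/1, which is a simplification: array genotypes are unphased, and the software used in the study is not named in this abstract. The function names are hypothetical.

```python
def ld_r2(p_ab, p_a, p_b):
    """r^2 from the AB haplotype frequency and the two allele frequencies."""
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a locus is monomorphic")
    d = p_ab - p_a * p_b          # linkage disequilibrium coefficient D
    return d * d / denom


def ld_r2_from_haplotypes(haplotypes):
    """haplotypes: list of (allele at locus 1, allele at locus 2), coded 0/1."""
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n
    p_b = sum(b for _, b in haplotypes) / n
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    return ld_r2(p_ab, p_a, p_b)


# Alleles that always travel together give r^2 = 1; independent alleles give 0.
complete = ld_r2_from_haplotypes([(1, 1)] * 5 + [(0, 0)] * 5)
independent = ld_r2_from_haplotypes([(1, 1), (1, 0), (0, 1), (0, 0)])
```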
MicroRNA Gene Regulatory Networks in Peripheral Nerve Sheath Tumors
2013-09-01
3.0 hierarchical clustering of both the X and the Y-axis using Centroid linkage. The resulting clustered matrixes were visualized using Java Treeview...To score potential ceRNA interactions, the 54979 human interactions were loaded into a mySQL database and when the user selects a given mRNA all...on the fly using PHP interactions with mySQL in a similar fashion as previously described in our publicly available databases such as sarcoma
Ruzagira, Eugene; Grosskurth, Heiner; Kamali, Anatoli; Baisley, Kathy
2017-10-01
The aim of this study was to determine whether counselling provided subsequent to HIV testing and referral for care increases linkage to care among HIV-positive persons identified through home-based HIV counselling and testing (HBHCT) in Masaka, Uganda. The study was an open-label cluster-randomized trial. 28 rural communities were randomly allocated (1:1) to intervention (HBHCT, referral and counselling at one and two months) or control (HBHCT and referral only). HIV-positive care-naïve adults (≥18 years) were enrolled. To conceal participants' HIV status, one HIV-negative person was recruited for every three HIV-positive participants. Primary outcomes were linkage to care (clinic-verified registration for care) status at six months, and time to linkage. Primary analyses were intention-to-treat using random effects logistic regression or Cox regression with shared frailty, as appropriate. Three hundred and two (intervention, n = 149; control, n = 153) HIV-positive participants were enrolled. Except for travel time to the nearest HIV clinic, baseline participant characteristics were generally balanced between trial arms. Retention was similar across trial arms (92% overall). One hundred and twenty-seven (42.1%) participants linked to care: 76 (51.0%) in the intervention arm versus 51 (33.3%) in the control arm [odds ratio = 2.18, 95% confidence interval (CI) = 1.26-3.78; p = 0.008]. There was evidence of interaction between trial arm and follow-up time (p = 0.009). The probability of linkage to care did not differ between arms in the first two months of follow-up, but was subsequently higher in the intervention arm versus the control arm [hazard ratio = 4.87, 95% CI = 1.79-13.27, p = 0.002]. Counselling substantially increases linkage to care among HIV-positive adults identified through HBHCT and may enhance efforts to increase antiretroviral therapy coverage in sub-Saharan Africa. © 2017 The Authors.
Journal of the International AIDS Society published by John Wiley & Sons Ltd on behalf of the International AIDS Society.
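As a rough check on the counts reported above, the crude (unadjusted) odds ratio can be computed directly from the two arms. This is only a sketch: it ignores the community-level clustering that the trial's random effects logistic regression accounts for, so it gives about 2.08 rather than the reported 2.18. The function name and the Woolf (log-scale) confidence interval are choices made here, not taken from the paper.

```python
import math


def crude_odds_ratio(events_a, n_a, events_b, n_b, z=1.96):
    """Unadjusted odds ratio of arm A versus arm B with a Woolf interval."""
    a, b = events_a, n_a - events_a      # arm A: linked, not linked
    c, d = events_b, n_b - events_b      # arm B: linked, not linked
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# Intervention: 76 of 149 linked to care; control: 51 of 153.
odds_ratio, lower, upper = crude_odds_ratio(76, 149, 51, 153)
overall_linked = (76 + 51) / (149 + 153)   # proportion linked in both arms
```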
Cheong, Kit-Leong; Wu, Ding-Tao; Deng, Yong; Leong, Fong; Zhao, Jing; Zhang, Wen-Jie; Li, Shao-Ping
2016-11-20
The objective of this study was to qualify and quantify the specific polysaccharides in Panax spp. The analyses of specific polysaccharides were performed by using GC-MS, saccharide mapping and high performance size exclusion chromatography (HPSEC) coupled with multi angle laser light scattering (MALLS) and refractive index detector (RID). Results showed that compositional monosaccharides were the same in different species of Panax and composed of rhamnose, arabinose, galacturonic acid, mannose, glucose, and galactose. Saccharide mapping results showed that the glycosidic linkages in specific polysaccharides from Panax spp. were similar. Additionally, the contents of specific polysaccharides of P. ginseng, P. notoginseng and P. quinquefolium were 17.9-20.5 mg/g, 11.9-15.0 mg/g, and 9.9-13.3 mg/g, respectively. P. ginseng, P. notoginseng, and P. quinquefolium could be clustered into three groups using both hierarchical cluster analysis and principal component analysis. The results show great potential for characterization and content determination of specific polysaccharides in Panax spp. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genotypes and subgenotypes of hepatitis B virus circulating in an endemic area in Peru.
Ramírez-Soto, Max Carlos; Bracho, Maria Alma; González-Candelas, Fernando; Huichi-Atamari, Milagros
2018-01-01
Although hepatitis B virus (HBV) infection is still endemic in Abancay, Peru, two decades after vaccination against hepatitis B started in the area, little is known about the diversity and circulation of genotypes and subgenotypes of the virus. To identify the genotypes and subtypes of HBV circulating in Abancay, complete genome sequences of 11 treatment-naive HBV-infected patients were obtained, and phylogenetic analysis was conducted with these and additional sequences from GenBank. Genotyping revealed the presence of genotype F in all the samples from Abancay. Subgenotype F1b was dominant and only one isolate belonged to subgenotype F4, which represents the first description of this subgenotype in Peru. Phylogenetic analysis revealed that most subgenotype F1b isolates from Peru clustered in a subgroup along with two sequences from Argentina, whereas two clusters with two HBV/F1b sequences each were indicative of recent epidemiological linkage, but only one could be verified by independent data. These results suggest that the HBV subgenotype F1b seems to be the predominant subgenotype in Abancay, Peru.
Genomewide scan for gout in Taiwanese aborigines reveals linkage to chromosome 4q25.
Cheng, Li Shu-Chuan; Chiang, Shang-Lun; Tu, Hung-Pin; Chang, Shun-Jen; Wang, Tsu-Nai; Ko, Allen Min-Jen; Chakraborty, Ranajit; Ko, Ying-Chin
2004-09-01
Gout is a disorder of uric-acid metabolism. The Pacific Austronesian population, including Taiwanese aborigines, has a remarkably high prevalence of hyperuricemia and gout, which suggests a founder effect across the Pacific region. We report here a genomewide linkage study of 21 multiplex pedigrees with gout from an aboriginal tribe in Taiwan. From observations of familial clustering, early onset of gout, and clinically severe manifestations, we hypothesized that a major gene plays a role in this trait. Using 382 random polymorphic markers spread across 22 autosomes, we demonstrated a highly significant linkage for gout at marker D4S2623 on chromosome 4q25 (P=.0002 by nonparametric linkage [the NPL(all) statistic]; empirical P=.0006; LOD=4.3, P=4.4×10⁻⁶ by logistic regression). When alcohol consumption was included as a covariate in the model, the LOD score increased to 5.66 (P=1.3×10⁻⁶). Quantitative traits, including serum uric acid and creatinine, also showed a moderate linkage to this region. To our knowledge, this is the first genome-scan report to identify a genetic locus harboring a gout-susceptibility gene.
Liu, Zhanjiang; Karsi, Attila; Li, Ping; Cao, Dongfeng; Dunham, R
2003-01-01
Catfish is the major aquaculture species in the United States. The hybrid catfish produced by crossing channel catfish females with blue catfish males exhibit a number of desirable production traits, but their mass production has been difficult. To introduce desirable genes from blue catfish into channel catfish through introgression, a genetic linkage map is helpful. In this project, a genetic linkage map was constructed using amplified fragment length polymorphism (AFLP). A total of 607 AFLP markers were analyzed using 65 primer combinations and an interspecific backcross resource family. A total of 418 AFLP markers were assigned to 44 linkage groups. Among the remaining 189 markers, 101 were not used because of significant segregation distortion, 29 were unlinked, and 59 were eliminated because they span very large distances. The 418 AFLP markers covered 1593 cM Kosambi. The AFLP markers showed a high level of clustering that appears to be related to certain primer combinations. This linkage map will serve as the basis for mapping a greater number of markers to provide a map with high enough resolution for it to be useful for selective breeding programs using introgression. PMID:14573480
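Map lengths given in "cM Kosambi" come from converting recombination fractions with the Kosambi mapping function, d = ¼·ln((1+2r)/(1−2r)) Morgans. A minimal sketch of that conversion and its inverse follows; the function names are hypothetical and the example value of r is illustrative, not taken from the catfish data.

```python
import math


def kosambi_cm(r: float) -> float:
    """Map distance in centimorgans for a recombination fraction r (0 <= r < 0.5)."""
    if not 0.0 <= r < 0.5:
        raise ValueError("recombination fraction must satisfy 0 <= r < 0.5")
    # d (Morgans) = 1/4 * ln((1 + 2r) / (1 - 2r)); multiply by 100 for cM.
    return 25.0 * math.log((1.0 + 2.0 * r) / (1.0 - 2.0 * r))


def kosambi_r(d_cm: float) -> float:
    """Inverse Kosambi function: recombination fraction for a distance in cM."""
    return 0.5 * math.tanh(2.0 * d_cm / 100.0)


d = kosambi_cm(0.10)
print(f"r = 0.10 -> {d:.2f} cM; back-converted r = {kosambi_r(d):.3f}")
```

For small r the distance is close to 100·r cM; the correction grows as r approaches 0.5, which is why widely spaced markers add disproportionately to total map length.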
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eichenbaum-Voline, Sophie; Olivier, Michael; Jones, Emma L.
2002-09-15
Combined hyperlipidemia (CHL) is a common disorder of lipid metabolism that leads to an increased risk of cardiovascular disease. The lipid profile of CHL is characterised by high levels of atherogenic lipoproteins and low levels of high-density-lipoprotein-cholesterol. Apolipoprotein (APO) A5 is a newly discovered gene involved in lipid metabolism located within 30 kbp of the APOA1/C3/A4 gene cluster. Previous studies have indicated that sequence variants in this cluster are associated with increased plasma lipid levels. To establish whether variation at the APOA5 gene contributes to the transmission of CHL, we performed linkage and linkage disequilibrium (LD) tests on a large cohort of families (n=128) with familial CHL (FCHL). The linkage data produced evidence for linkage of the APOA1/C3/A4/A5 genomic interval to FCHL (NPL = 1.7, P = 0.042). The LD studies substantiated these data. Two independent rare alleles, APOA5c.56G and APOC3c.386G of this gene cluster were over-transmitted in FCHL (P = 0.004 and 0.007, respectively), and this was associated with a reduced transmission of the most common APOA1/C3/A4/A5 haplotype (frequency 0.4425) to affected subjects (P = 0.013). The APOA5c.56G allele was associated with increased plasma triglyceride levels in FCHL probands, whereas the second, and independent, APOC3c.386G allele was associated with increased plasma triglyceride levels in FCHL pedigree founders. Thus, this allele (or an allele in LD) may mark a quantitative trait associated with FCHL, as well as representing a disease susceptibility locus for the condition. This study establishes that sequence variation in the APOA1/C3/A4/A5 gene cluster contributes to the transmission of FCHL in a substantial proportion of affected families, and that these sequence variants may also contribute to the lipid abnormalities of the metabolic syndrome, which is present in up to 40 percent of persons with cardiovascular disease.
Lowe, K M; Walker, M A
2006-05-01
The first genetic linkage map of grape derived from rootstock parents was constructed using 188 progeny from a cross of Ramsey (Vitis champinii) x Riparia Gloire (V. riparia). Of 354 simple sequence repeat markers tested, 205 were polymorphic for at least one parent, and 57.6% were fully informative. Maps of Ramsey, Riparia Gloire, and the F1 population were created using JoinMap software, following a pseudotestcross strategy. The set of 205 SSRs allowed for the identification of all 19 Vitis linkage groups (2n=38), with a total combined map length of 1,304.7 cM, averaging 6.8 cM between markers. The maternal map consists of 172 markers aligned into 19 linkage groups (1,244.9 cM) while 126 markers on the paternal map cover 18 linkage groups (1,095.5 cM). The expected genome coverage is over 92%. Segregation distortion occurred in the Ramsey, Riparia Gloire, and consensus maps for 10, 13, and 16% of the markers, respectively. These distorted markers clustered primarily on the linkage groups 3, 5, 14 and 17. No genome-wide difference in recombination rate was observed between Ramsey and Riparia Gloire based on 315 common marker intervals. Fifty-four new Vitis-EST-derived SSR markers were mapped, and were distributed evenly across the genome on 16 of the 19 linkage groups. These dense linkage maps of two phenotypically diverse North American Vitis species are valuable tools for studying the genetics of many rootstock traits including nematode resistance, lime and salt tolerance, and ability to induce vigor.
An Information-Theoretic-Cluster Visualization for Self-Organizing Maps.
Brito da Silva, Leonardo Enzo; Wunsch, Donald C
2018-06-01
Improved data visualization will be a significant tool to enhance cluster analysis. In this paper, an information-theoretic-based method for cluster visualization using self-organizing maps (SOMs) is presented. The information-theoretic visualization (IT-vis) has the same structure as the unified distance matrix, but instead of depicting Euclidean distances between adjacent neurons, it displays the similarity between the distributions associated with adjacent neurons. Each SOM neuron has an associated subset of the data set whose cardinality controls the granularity of the IT-vis and with which the first- and second-order statistics are computed and used to estimate their probability density functions. These are used to calculate the similarity measure, based on Renyi's quadratic cross entropy and cross information potential (CIP). The introduced visualizations combine the low computational cost and kernel estimation properties of the representative CIP and the data structure representation of a single-linkage-based grouping algorithm to generate an enhanced SOM-based visualization. The visual quality of the IT-vis is assessed by comparing it with other visualization methods for several real-world and synthetic benchmark data sets. Thus, this paper also contains a significant literature survey. The experiments demonstrate the IT-vis cluster revealing capabilities, in which cluster boundaries are sharply captured. Additionally, the information-theoretic visualizations are used to perform clustering of the SOM. Compared with other methods, IT-vis of large SOMs yielded the best results in this paper, for which the quality of the final partitions was evaluated using external validity indices.
Abanyie, F; Harvey, R R; Harris, J R; Wiegand, R E; Gaul, L; Desvignes-Kendrick, M; Irvin, K; Williams, I; Hall, R L; Herwaldt, B; Gray, E B; Qvarnstrom, Y; Wise, M E; Cantu, V; Cantey, P T; Bosch, S; DA Silva, A J; Fields, A; Bishop, H; Wellman, A; Beal, J; Wilson, N; Fiore, A E; Tauxe, R; Lance, S; Slutsker, L; Parise, M
2015-12-01
The 2013 multistate outbreaks contributed to the largest annual number of reported US cases of cyclosporiasis since 1997. In this paper we focus on investigations in Texas. We defined an outbreak-associated case as laboratory-confirmed cyclosporiasis in a person with illness onset between 1 June and 31 August 2013, with no history of international travel in the previous 14 days. Epidemiological, environmental, and traceback investigations were conducted. Of the 631 cases reported in the multistate outbreaks, Texas reported the greatest number of cases, 270 (43%). More than 70 clusters were identified in Texas, four of which were further investigated. One restaurant-associated cluster of 25 case-patients was selected for a case-control study. Consumption of cilantro was most strongly associated with illness on meal date-matched analysis (matched odds ratio 19·8, 95% confidence interval 4·0-∞). All case-patients in the other three clusters investigated also ate cilantro. Traceback investigations converged on three suppliers in Puebla, Mexico. Cilantro was the vehicle of infection in the four clusters investigated; the temporal association of these clusters with the large overall increase in cyclosporiasis cases in Texas suggests cilantro was the vehicle of infection for many other cases. However, the paucity of epidemiological and traceback information does not allow for a conclusive determination; moreover, molecular epidemiological tools for cyclosporiasis that could provide more definitive linkage between case clusters are needed.
NASA Astrophysics Data System (ADS)
Nugroho, P.
2018-02-01
The existence of creative industries is inseparable from the underlying social construct which provides sources for creativity and innovation. The working of social capital in a society facilitates information exchange, knowledge transfer and technology acquisition within the industry through social networks. As a result, a socio-spatial divide exists in directing the growth of the creative industries. This paper aims to examine how such a socio-spatial divide contributes to the local creative industry development in Semarang and Kudus batik clusters. An explanatory sequential mixed methods approach, covering a quantitative approach followed by a qualitative approach, is chosen to understand better the interplay between tangible and intangible variables in the local batik clusters. Surveys on secondary data taken from the government statistics and reports, previous studies, and media exposures are completed in the former approach to identify the clustering pattern of the local batik industry and the local embeddedness factors which have shaped the existing business environment. In-depth interviews, content analysis, and field observations are engaged in the latter approach to explore reciprocal relationships between the elements of social capital and the local batik cluster development. The result demonstrates that particular social ties have determined the forms of spatial proximity manifested in forward and backward business linkages. Trust, shared norms, and inherited traditions are the key social capital attributes that lead to such a socio-spatial divide. Therefore, the intermediating roles of the bridging actors are necessary to encourage cooperation among the participating stakeholders for better cluster development.
Nance-Horan syndrome: linkage analysis in a family from The Netherlands.
Bergen, A A; ten Brink, J; Schuurman, E J; Bleeker-Wagemakers, E M
1994-05-01
Linkage analysis was carried out in a Dutch family with Nance-Horan (NH) syndrome. Close linkage without recombination between NH and the Xp loci DXS207, DXS43, and DXS365 (zmax = 3.23) was observed. Multipoint linkage analysis and the analysis of recombinations in multiple informative meioses suggest the genetic order Xcen-DMD (exon 49)-DXS451-(NH, DXS207, DXS365, DXS43)-(STS, DXF30)-Xpter. These data refine the localization of the NH locus on the distal Xp.
Distribution of lod scores in oligogenic linkage analysis.
Williams, J T; North, K E; Martin, L J; Comuzzie, A G; Göring, H H; Blangero, J
2001-01-01
In variance component oligogenic linkage analysis it can happen that the residual additive genetic variance bounds to zero when estimating the effect of the ith quantitative trait locus. Using quantitative trait Q1 from the Genetic Analysis Workshop 12 simulated general population data, we compare the observed lod scores from oligogenic linkage analysis with the empirical lod score distribution under a null model of no linkage. We find that zero residual additive genetic variance in the null model alters the usual distribution of the likelihood-ratio statistic.
Two Novel Glycoside Hydrolases Responsible for the Catabolism of Cyclobis-(1→6)-α-nigerosyl
Tagami, Takayoshi; Miyano, Eri; Sadahiro, Juri; Okuyama, Masayuki; Iwasaki, Tomohito; Kimura, Atsuo
2016-01-01
The actinobacterium Kribbella flavida NBRC 14399T produces cyclobis-(1→6)-α-nigerosyl (CNN), a cyclic glucotetraose with alternate α-(1→6)- and α-(1→3)-glucosidic linkages, from starch in the culture medium. We identified gene clusters associated with the production and intracellular catabolism of CNN in the K. flavida genome. One cluster encodes 6-α-glucosyltransferase and 3-α-isomaltosyltransferase, which are known to coproduce CNN from starch. The other cluster contains four genes annotated as a transcriptional regulator, sugar transporter, glycoside hydrolase family (GH) 31 protein (Kfla1895), and GH15 protein (Kfla1896). Kfla1895 hydrolyzed the α-(1→3)-glucosidic linkages of CNN and produced isomaltose via a possible linear tetrasaccharide. The initial rate of hydrolysis of CNN (11.6 s−1) was much higher than that of panose (0.242 s−1), and hydrolysis of isomaltotriose and nigerose was extremely low. Because Kfla1895 has a strong preference for the α-(1→3)-isomaltosyl moiety and effectively hydrolyzes the α-(1→3)-glucosidic linkage, it should be termed 1,3-α-isomaltosidase. Kfla1896 effectively hydrolyzed isomaltose with liberation of β-glucose, but displayed low or no activity toward CNN and the general GH15 enzyme substrates such as maltose, soluble starch, or dextran. The kcat/Km for isomaltose (4.81 ± 0.18 s−1 mM−1) was 6.9- and 19-fold higher than those for panose and isomaltotriose, respectively. These results indicate that Kfla1896 is a new GH15 enzyme with high substrate specificity for isomaltose, suggesting the enzyme should be designated an isomaltose glucohydrolase. This is the first report to identify a starch-utilization pathway that proceeds via CNN. PMID:27302067
Moen, Thomas; Sonesson, Anna K; Hayes, Ben; Lien, Sigbjørn; Munck, Hege; Meuwissen, Theo HE
2007-01-01
Background Infectious Salmon Anaemia (ISA) is a viral disease affecting farmed Atlantic salmon (Salmo salar) worldwide. The identification of Quantitative Trait Loci (QTL) affecting resistance to the disease could improve our understanding of the genetics underlying the trait and provide a means for Marker-Assisted Selection. We previously performed a genome scan on commercial Atlantic salmon families challenge tested for ISA resistance, identifying several putative QTL. In the present study, we set out to validate the strongest of these QTL in a larger family material coming from the same challenge test, and to determine the position of the QTL by interval mapping. We also wanted to explore different ways of performing QTL analysis within a survival analysis framework (i.e. using time-to-event data), and to compare results using survival analysis with results from analysis on the dichotomous trait 'affected/resistant'. Results The QTL, located on Atlantic salmon linkage group 8 (following SALMAP notation), was confirmed in the new data set. Its most likely position was at a marker cluster containing markers BHMS130, BHMS170 and BHMS553. Significant segregation distortion was observed in the same region, but was shown to be unrelated to the QTL. A maximum likelihood procedure for identifying QTL, based on the Cox proportional hazard model, was developed. QTL mapping was also done using the Haley-Knott method (affected/resistant data), and within a variance-component framework (affected/resistant data and time-to-event data). In all cases, analysis using affected/resistant data gave stronger evidence for a QTL than did analysis using time-to-event data. Conclusion A QTL for resistance to Infectious Salmon Anaemia in Atlantic salmon was validated in this study, and its more precise location on linkage group eight was determined. The QTL explained 6% of the phenotypic variation in resistance to the disease. 
The linkage group also displayed significant segregation distortion. Survival models proved in this case not to be more suitable than models based on the dichotomous trait 'affected/resistant' for analysing the data. PMID:17697344
Swarm v2: highly-scalable and high-resolution amplicon clustering.
Mahé, Frédéric; Rognes, Torbjørn; Quince, Christopher; de Vargas, Colomban; Dunthorn, Micah
2015-01-01
Previously we presented Swarm v1, a novel and open source amplicon clustering program that produced fine-scale molecular operational taxonomic units (OTUs), free of arbitrary global clustering thresholds and input-order dependency. Swarm v1 worked with an initial phase that used iterative single-linkage with a local clustering threshold (d), followed by a phase that used the internal abundance structures of clusters to break chained OTUs. Here we present Swarm v2, which has two important novel features: (1) a new algorithm for d = 1 that allows the computation time of the program to scale linearly with increasing amounts of data; and (2) the new fastidious option that reduces under-grouping by grafting low abundant OTUs (e.g., singletons and doubletons) onto larger ones. Swarm v2 also directly integrates the clustering and breaking phases, dereplicates sequencing reads with d = 0, outputs OTU representatives in fasta format, and plots individual OTUs as two-dimensional networks.
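The first phase described above, iterative single-linkage with a local threshold d, can be illustrated with a small sketch. This is a simplified illustration only, assuming plain edit distance as the difference measure; it does not implement Swarm's abundance-based breaking phase, its linear-time d = 1 algorithm, or the fastidious option, and the function names and reads are hypothetical.

```python
from collections import deque


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution or match
        prev = curr
    return prev[-1]


def single_linkage_clusters(amplicons, d=1):
    """Group amplicons so that each member is within d differences of another member."""
    unassigned = list(dict.fromkeys(amplicons))  # dereplicate, keep input order
    clusters = []
    while unassigned:
        seed = unassigned.pop(0)
        cluster, queue = [seed], deque([seed])
        while queue:
            current = queue.popleft()
            near = [s for s in unassigned if edit_distance(current, s) <= d]
            for s in near:
                unassigned.remove(s)
                cluster.append(s)
                queue.append(s)  # neighbours of neighbours join too (chaining)
        clusters.append(cluster)
    return clusters


reads = ["ACGT", "ACGA", "ACCA", "TTTT", "TTTA", "GGGGGG"]
print(single_linkage_clusters(reads, d=1))
```

In this toy input, "ACGT" and "ACCA" differ at two positions but still end up together because "ACGA" links them. That chaining effect is what a subsequent breaking phase based on abundance is meant to counteract.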
Niu, Yuze; Gao, Fengtao; Zhao, Yongwei; Zhang, Jing; Sun, Jian; Shao, Changwei; Liao, Xiaolin; Wang, Lei; Tian, Yongsheng; Chen, Songlin
2012-01-01
High-density genetic linkage maps were constructed for the Japanese flounder (Paralichthys olivaceus). A total of 1624 microsatellite markers were polymorphic in the reference family. Linkage analysis using JoinMap 4.0 resulted in the mapping of 1487 markers to 24 linkage groups, a result which was consistent with the 24 chromosomes seen in chromosome spreads. The female map was composed of 1257 markers, covering a total of 1663.8 cM with an average interval of 1.35 cM between markers. The male map consisted of 1224 markers, spanning 1726.5 cM, with an average interval of 1.44 cM. The genome length in the Japanese flounder was estimated to be 1730.3 cM for the females and 1798.0 cM for the males, a coverage of 96.2% for the female and 96.0% for the male map. The mean recombination at common intervals throughout the genome revealed a slight difference between sexes, i.e. 1.07 times higher in the male than female. High-density genetic linkage maps are very useful for marker-assisted selection (MAS) programs for economically valuable traits in this species and for further evolutionary studies in flatfish and vertebrate species. Furthermore, four quantitative trait loci (QTL) associated with growth traits were mapped on the genetic map. One QTL was identified for body weight on LG14f, which explained 14.85% of the total variation of the body weight. Three QTL were identified for body width on LG14f and LG14m, accounting for 16.75%, 13.62% and 13.65% of the total variation in body width, respectively. The additive effects were evident as negative values. There were four QTL for growth traits clustered on LG14, which should prove to be very useful for improving growth traits using molecular MAS. PMID:23209734
Price, Neil P J; Hartman, Trina M; Vermillion, Karl E
2015-07-21
The structural analysis of complex carbohydrates typically requires the assignment of three parameters: monosaccharide composition, the position of glycosidic linkages between monosaccharides, and the position and nature of noncarbohydrate substituents. The glycosidic linkage positions are often determined by permethylation analysis, but this can be complicated by high viscosity or poor solubility, resulting in under-methylation. This is a drawback because an under-methylated position may be misinterpreted as the erroneous site of a linkage or substituent. Here, we describe an alternative approach to linkage analysis that makes use of a nonreversible deuterium exchange of C-H protons on the carbohydrate backbone. The exchange reaction is conducted in deuterated water catalyzed by Raney nickel, and results in the selective exchange of C-H protons adjacent to free hydroxyl groups. Hence, the position of the residual C-H protons is indicative of the position of glycosidic linkages or other substituents and can be readily assigned by heteronuclear single quantum coherence-nuclear magnetic resonance (HSQC-NMR) or, following suitable derivatization, by gas chromatography-mass spectroscopy (GC/MS) analysis. Moreover, because the only changes to the parent sugar are proton/deuterium exchanges, the composition and linkage analysis can be determined in a single step.
Harischandra, Iresha Nilmini; Dassanayake, Ranil Samantha; De Silva, Bambaranda Gammacharige Don Nissanka Kolitha
2016-01-04
The disease re-emergence threat from the major malaria vector in Sri Lanka, Anopheles culicifacies, is currently increasing. To predict malaria vector dynamics, knowledge of population genetics and gene flow is required, but this information is unavailable for Sri Lanka. This study was carried out to determine the population structure of An. culicifacies E in Sri Lanka. Eight microsatellite markers were used to examine An. culicifacies E collected from six sites in Sri Lanka during 2010-2012. Standard population genetic tests and analyses, genetic differentiation, Hardy-Weinberg equilibrium, linkage disequilibrium, Bayesian cluster analysis, AMOVA, SAMOVA and isolation-by-distance were conducted using five polymorphic loci. Five microsatellite loci were highly polymorphic with high allelic richness. Hardy-Weinberg Equilibrium (HWE) was significantly rejected for four loci with positive F(IS) values in the pooled population (p < 0.0100). Three loci showed high deviations in all sites except Kataragama, which was in agreement with HWE for all loci except one locus (p < 0.0016). Observed heterozygosity was less than the expected values for all sites except Kataragama, where reported negative F(IS) values indicated a heterozygosity excess. Genetic differentiation was observed for all sampling site pairs and was not supported by the isolation by distance model. Bayesian clustering analysis identified the presence of three sympatric clusters (gene pools) in the studied population. Significant genetic differentiation was detected in cluster pairs with low gene flow and isolation by distance was not detected between clusters. Furthermore, the results suggested the presence of a barrier to gene flow that divided the populations into two parts with the central hill region of Sri Lanka as the dividing line. Three sympatric clusters were detected among An. culicifacies E specimens isolated in Sri Lanka. 
There was no effect of geographic distance on genetic differentiation and the central mountain ranges in Sri Lanka appeared to be a barrier to gene flow.
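The heterozygosity comparison in this abstract rests on the inbreeding coefficient F_IS = 1 − Ho/He, where Ho is the observed and He the expected heterozygosity at a locus. A minimal sketch for a single diploid locus follows, assuming the uncorrected estimate He = 1 − Σp² (published analyses often apply small-sample corrections); the function name and genotypes are hypothetical.

```python
from collections import Counter


def heterozygosity_stats(genotypes):
    """Return (Ho, He, F_IS) for one locus given (allele_a, allele_b) pairs."""
    n = len(genotypes)
    observed = sum(a != b for a, b in genotypes) / n
    counts = Counter(allele for pair in genotypes for allele in pair)
    total = 2 * n
    # Expected heterozygosity under Hardy-Weinberg: 1 - sum of squared allele frequencies.
    expected = 1.0 - sum((c / total) ** 2 for c in counts.values())
    if expected == 0.0:
        raise ValueError("locus is monomorphic; F_IS is undefined")
    return observed, expected, 1.0 - observed / expected


# Hypothetical microsatellite genotypes (allele sizes in bp).
deficit = [(120, 120), (120, 120), (124, 124), (124, 124)]
excess = [(120, 124)] * 4
print(heterozygosity_stats(deficit))
print(heterozygosity_stats(excess))
```

A positive F_IS corresponds to fewer heterozygotes than Hardy-Weinberg proportions predict, and a negative value to a heterozygote excess, matching the two situations the abstract contrasts.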
Liao, Minlei; Li, Yunfeng; Kianifard, Farid; Obi, Engels; Arcona, Stephen
2016-03-02
Cluster analysis (CA) is a frequently used applied statistical technique that helps to reveal hidden structures and "clusters" found in large data sets. However, this method has not been widely used in large healthcare claims databases where the distribution of expenditure data is commonly severely skewed. The purpose of this study was to identify cost change patterns of patients with end-stage renal disease (ESRD) who initiated hemodialysis (HD) by applying different clustering methods. A retrospective, cross-sectional, observational study was conducted using the Truven Health MarketScan® Research Databases. Patients aged ≥18 years with ≥2 ESRD diagnoses who initiated HD between 2008 and 2010 were included. The K-means CA method and hierarchical CA with various linkage methods were applied to all-cause costs within baseline (12-months pre-HD) and follow-up periods (12-months post-HD) to identify clusters. Demographic, clinical, and cost information was extracted from both periods, and then examined by cluster. A total of 18,380 patients were identified. Meaningful all-cause cost clusters were generated using K-means CA and hierarchical CA with either flexible beta or Ward's methods. Based on cluster sample sizes and change of cost patterns, the K-means CA method and 4 clusters were selected: Cluster 1: Average to High (n = 113); Cluster 2: Very High to High (n = 89); Cluster 3: Average to Average (n = 16,624); or Cluster 4: Increasing Costs, High at Both Points (n = 1554). Median cost changes in the 12-month pre-HD and post-HD periods increased from $185,070 to $884,605 for Cluster 1 (Average to High), decreased from $910,930 to $157,997 for Cluster 2 (Very High to High), were relatively stable and remained low from $15,168 to $13,026 for Cluster 3 (Average to Average), and increased from $57,909 to $193,140 for Cluster 4 (Increasing Costs, High at Both Points). 
Relatively stable costs after starting HD were associated with more stable comorbidity index scores between the pre- and post-HD periods, while increasing costs were associated with more sharply increasing comorbidity scores. The K-means CA method appeared to be the most appropriate in healthcare claims data with highly skewed cost information when taking into account both change of cost patterns and sample size in the smallest cluster.
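K-means clustering of paired pre- and post-period costs can be sketched in a few lines. The example below is a minimal illustration under stated assumptions, not the study's procedure: the cost values are invented, costs are log10-transformed to reduce skew (the abstract does not say whether a transformation was used), and centroids are seeded by a deterministic farthest-first rule chosen here only to make the run reproducible.

```python
import math


def farthest_first(points, k):
    """Deterministic seeding: start at the first point, then repeatedly add the farthest point."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(max(points, key=lambda p: min(math.dist(p, c) for c in centroids)))
    return centroids


def assign(points, centroids):
    """Label each point with the index of its nearest centroid."""
    return [min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c])) for p in points]


def kmeans(points, k, iterations=100):
    """Plain Lloyd's algorithm: alternate assignment and centroid update until stable."""
    centroids = farthest_first(points, k)
    for _ in range(iterations):
        labels = assign(points, centroids)
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            # Keep the old centroid if a cluster happens to be empty.
            updated.append(tuple(sum(x) / len(members) for x in zip(*members)) if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return assign(points, centroids), centroids


# Invented (pre-period, post-period) annual costs in dollars.
costs = [(15000, 13000), (16000, 14000), (14000, 12500),
         (185000, 880000), (190000, 900000),
         (900000, 160000), (920000, 150000)]
log_costs = [(math.log10(pre), math.log10(post)) for pre, post in costs]
labels, centroids = kmeans(log_costs, k=3)
print(labels)
```

With these toy data the low-and-stable, rising, and falling cost patterns separate into three groups. On real claims data the number of clusters and the seeding would need to be chosen and checked, for example against cluster sizes as the abstract describes.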
Bayesian linkage and segregation analysis: factoring the problem.
Matthysse, S
2000-01-01
Complex segregation analysis and linkage methods are mathematical techniques for the genetic dissection of complex diseases. They are used to delineate complex modes of familial transmission and to localize putative disease susceptibility loci to specific chromosomal locations. The computational problem of Bayesian linkage and segregation analysis is one of integration in high-dimensional spaces. In this paper, three available techniques for Bayesian linkage and segregation analysis are discussed: Markov Chain Monte Carlo (MCMC), importance sampling, and exact calculation. The contribution of each to the overall integration will be explicitly discussed.
Handbook of Occupational Programs. Task Linkage Project Publication No. 1.
ERIC Educational Resources Information Center
Georgia State Univ., Atlanta. School of Education.
To demonstrate the continuity between secondary and postsecondary occupational programs and the link between them and industrial manpower roles, this handbook cross references Georgia occupational educational programs and related job titles. Nineteen occupational clusters included in secondary schools are covered: agricultural power and mechanics;…
Genetic variants of TREML2 are associated with HLA-B27-positive ankylosing spondylitis.
Feng, Yuan; Hong, Yaqiang; Zhang, Xin; Cao, Chunwei; Yang, Xichao; Lai, Shujuan; Fan, Chunmei; Cheng, Feng; Yan, Mei; Li, Chaohua; Huang, Wan; Chen, Wei; Zhu, Ping; Zeng, Changqing
2018-08-20
Although ankylosing spondylitis (AS) is a common, highly heritable arthropathy, the precise genetic mechanism underlying the disease remains elusive. Here, we investigate the disease-causing mutations in a large AS family with distinguished complexity, consisting of 23 patients covering four generations and exhibiting a mixed HLA-B27 (+) and (-) status. Linkage analysis with 32 members using three methods and whole-exome sequencing analysis with three HLA-B27 (+) patients, one HLA-B27 (-) patient, and one healthy individual did not identify a mutation common to all of the patients, strongly suggesting the existence of genetic heterogeneity in this large pedigree. However, if only B27-positive patients were analyzed, the linkage analysis located a 22-Mb region harboring the HLA gene cluster in chromosome 6 (LOD = 4.2), and the subsequent exome analysis identified two non-synonymous mutations in the TREML2 and IP6K3 genes. These genes were resequenced among 370 sporadic AS patients and 487 healthy individuals. A significantly higher mutation frequency of TREML2 was observed in AS patients (1.51% versus 0.21%). The results obtained for the AS pedigree and sporadic patients suggest that mutation of TREML2 is a major factor leading to AS for HLA-B27 (+) members in this large family and that TREML2 is also a susceptibility gene promoting the development of ankylosing spondylitis in HLA-B27 (+) individuals. Copyright © 2018 Elsevier B.V. All rights reserved.
McGary, Kriston L; Slot, Jason C; Rokas, Antonis
2013-07-09
Genomic analyses have proliferated without being tied to tangible phenotypes. For example, although coordination of both gene expression and genetic linkage have been offered as genetic mechanisms for the frequently observed clustering of genes participating in fungal metabolic pathways, elucidation of the phenotype(s) favored by selection, resulting in cluster formation and maintenance, has not been forthcoming. We noted that the cause of certain well-studied human metabolic disorders is the accumulation of toxic intermediate compounds (ICs), which occurs when the product of an enzyme is not used as a substrate by a downstream neighbor in the metabolic network. This raises the hypothesis that the phenotype favored by selection to drive gene clustering is the mitigation of IC toxicity. To test this, we examined 100 diverse fungal genomes for the simplest type of cluster, gene pairs that are both metabolic neighbors and chromosomal neighbors immediately adjacent to each other, which we refer to as "double neighbor gene pairs" (DNGPs). Examination of the toxicity of their corresponding ICs shows that, compared with chromosomally nonadjacent metabolic neighbors, DNGPs are enriched for ICs that have acutely toxic LD50 doses or reactive functional groups. Furthermore, DNGPs are significantly more likely to be divergently oriented on the chromosome; remarkably, ∼40% of these DNGPs have ICs known to be toxic. We submit that the structure of synteny in metabolic pathways of fungi is a signature of selection for protection against the accumulation of toxic metabolic intermediates.
McGary, Kriston L.; Slot, Jason C.; Rokas, Antonis
2013-01-01
Genomic analyses have proliferated without being tied to tangible phenotypes. For example, although coordination of both gene expression and genetic linkage have been offered as genetic mechanisms for the frequently observed clustering of genes participating in fungal metabolic pathways, elucidation of the phenotype(s) favored by selection, resulting in cluster formation and maintenance, has not been forthcoming. We noted that the cause of certain well-studied human metabolic disorders is the accumulation of toxic intermediate compounds (ICs), which occurs when the product of an enzyme is not used as a substrate by a downstream neighbor in the metabolic network. This raises the hypothesis that the phenotype favored by selection to drive gene clustering is the mitigation of IC toxicity. To test this, we examined 100 diverse fungal genomes for the simplest type of cluster, gene pairs that are both metabolic neighbors and chromosomal neighbors immediately adjacent to each other, which we refer to as “double neighbor gene pairs” (DNGPs). Examination of the toxicity of their corresponding ICs shows that, compared with chromosomally nonadjacent metabolic neighbors, DNGPs are enriched for ICs that have acutely toxic LD50 doses or reactive functional groups. Furthermore, DNGPs are significantly more likely to be divergently oriented on the chromosome; remarkably, ∼40% of these DNGPs have ICs known to be toxic. We submit that the structure of synteny in metabolic pathways of fungi is a signature of selection for protection against the accumulation of toxic metabolic intermediates. PMID:23798424
Li, Zhihua; Du, Shaowu; Wu, Xintao
2004-08-09
Reaction of [MoOS₃]²⁻ and [WS₄]²⁻ with Cudtp (dtp = diethyl dithiophosphate) gave rise to the clusters [Bu₄N]₂[(MoOS₃)₄Cu₁₂(dtp)₆], 1, and [Et₄N][(WS₄Cu₄)(dtp)₃], 2, respectively. In cluster 1, the dtp⁻ ligands act as both monodentate and bidentate ligands that bridge between Cu atoms and link together a closed double-cubane-like [Mo₂O₂S₆Cu₆]²⁺ core and two incomplete cubane-like [MoOS₃Cu₃]⁺ units. In cluster 2, the [WS₄Cu₄]²⁺ fragments were connected via bidentate and doubly bridging dtp⁻ bridges to give a chain polymeric anion. Cluster 1 is the first example of a Mo/Cu/S cluster that contains a closed double-cubane-like structure. Compound 2 is also rare and the first W/Cu/S polymer with dtp⁻ linkages.
An inversion inv(4)(p12-p15.3) in autistic siblings implicates the 4p GABA receptor gene cluster.
Vincent, J B; Horike, S I; Choufani, S; Paterson, A D; Roberts, W; Szatmari, P; Weksberg, R; Fernandez, B; Scherer, S W
2006-05-01
We describe the case of two brothers diagnosed with autism who both carry a paracentric inversion of the short arm of chromosome 4 (46,XY, inv(4)(p12-p15.3)). We have determined that this inversion is inherited from an apparently unaffected mother and unaffected maternal grandfather. Using fluorescence in situ hybridisation analysis and Southern blot hybridisation we identified the breakpoints. The proximal breakpoint (4p12) maps to a region containing a cluster of gamma-aminobutyric acid A (GABA(A)) receptor genes, and directly interrupts the GABRG1 gene, the distal-most gene of the cluster. We also identified an insertion/deletion polymorphism for an approximately 2 kb LINE1 (L1) element that occurs within intron 7 of GABRG1. Our genotype analysis amongst autism families indicated that the L1 deletion allele did not show increased transmission to affected individuals. No linkage disequilibrium was evident between the L1 and single nucleotide polymorphisms in adjacent GABA(A) receptor genes on 4p, where a recent study has identified significant association with autism. Despite this, the identification of an inversion breakpoint disrupting GABRG1 provides solid support for the genetic involvement of the short arm of chromosome 4 in the genetic aetiology of autism, and for the hypothesis of disrupted GABA neurotransmission in autism.
Genetic Diversity of Plasmodium falciparum in Haiti: Insights from Microsatellite Markers
Carter, Tamar E.; Malloy, Halley; Existe, Alexandre; Memnon, Gladys; St. Victor, Yves; Okech, Bernard A.; Mulligan, Connie J.
2015-01-01
Hispaniola, comprising Haiti and the Dominican Republic, has been identified as a candidate for malaria elimination. However, incomplete surveillance data in Haiti hamper efforts to assess the impact of ongoing malaria control interventions. Characteristics of the genetic diversity of Plasmodium falciparum populations can be used to assess parasite transmission, which is information vital to evaluating malaria elimination efforts. Here we characterize the genetic diversity of P. falciparum samples collected from patients at seven sites in Haiti using 12 microsatellite markers previously employed in population genetic analyses of global P. falciparum populations. We measured multiplicity of infections, level of genetic diversity, degree of population geographic substructure, and linkage disequilibrium (defined as non-random association of alleles from different loci). For low transmission populations like Haiti, we expect to see few multiple infections, low levels of genetic diversity, high degree of population structure, and high linkage disequilibrium. In Haiti, we found low levels of multiple infections (12.9%), moderate to high levels of genetic diversity (mean number of alleles per locus = 4.9, heterozygosity = 0.61), low levels of population structure (highest pairwise Fst = 0.09 and no clustering in principal components analysis), and moderate linkage disequilibrium (ISA = 0.05, P<0.0001). In addition, population bottleneck analysis revealed no evidence for a reduction in the P. falciparum population size in Haiti. We conclude that the high level of genetic diversity and lack of evidence for a population bottleneck may suggest that Haiti’s P. falciparum population has been stable and discuss the implications of our results for understanding the impact of malaria control interventions. We also discuss the relevance of parasite population history and other host and vector factors when assessing transmission intensity from genetic diversity data. PMID:26462203
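The per-locus diversity statistics reported above (number of alleles, heterozygosity) can be illustrated with a minimal sketch. The abstract does not say which estimator was used; the sample-size-corrected form He = n/(n-1) × (1 - Σp²) is assumed here, and the allele sizes in `locus` are hypothetical, not data from the study.

```python
from collections import Counter


def expected_heterozygosity(alleles):
    """Sample-size-corrected expected heterozygosity for one locus.

    `alleles` is a list of observed allele calls (one per single-clone sample).
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two observations")
    counts = Counter(alleles)
    # Sum of squared allele frequencies (probability that two draws match).
    sum_p2 = sum((c / n) ** 2 for c in counts.values())
    return (n / (n - 1)) * (1.0 - sum_p2)


# Hypothetical microsatellite allele sizes (in base pairs) at one locus.
locus = [120, 120, 124, 128, 128, 128, 132, 136]
n_alleles = len(set(locus))
he = expected_heterozygosity(locus)
print(n_alleles, round(he, 3))
```

Averaging `n_alleles` and `he` over all genotyped loci would give summary values of the kind quoted in the abstract.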
Neiberg, Rebecca H; Aickin, Mikel; Grzywacz, Joseph G; Lang, Wei; Quandt, Sara A; Bell, Ronny A; Arcury, Thomas A
2011-04-01
There are widespread assumptions that a large proportion of American adults use a variety of complementary and alternative medicine (CAM) therapies. The goal of this study is to explore the clustering or linkages among CAM categories in the general population. Linkset analysis and data from the 2002 National Health Interview Survey (NHIS) were used to address two specific aims. First, the dominant linkages of CAM categories used by the same individual were delineated, and population estimates were generated of the percentage of American adults using different linksets of CAM categories. Second, it was determined whether dominant linkages of CAM modalities differ by age, gender, ethnicity, and education. Linkset analysis, a method of estimating co-occurrence beyond chance, was used on data from the 2002 NHIS (N = 29,862) to identify possible sets of CAM use. Most adults use CAM therapies from a single category. Approximately 20% of adults combined two CAM categories, with the combination of mind-body therapies and biologically based therapies estimated to be most common. Only 5% of adults use therapies representing three or more CAM categories. Combining therapies across multiple CAM categories was more common among those 46-64, women, whites, and those with a college education. The results of this study allow researchers to refine descriptions of CAM use in the adult population. Most adults do not use a wide assortment of CAM; most use therapies within a single CAM category. Sets of CAM use were found to differ by age, gender, ethnicity, and education in ways consistent with previous research.
Low Divergence of Clonorchis sinensis in China Based on Multilocus Analysis
Sun, Jiufeng; Huang, Yan; Huang, Huaiqiu; Liang, Pei; Wang, Xiaoyun; Mao, Qiang; Men, Jingtao; Chen, Wenjun; Deng, Chuanhuan; Zhou, Chenhui; Lv, Xiaoli; Zhou, Juanjuan; Zhang, Fan; Li, Ran; Tian, Yanli; Lei, Huali; Liang, Chi; Hu, Xuchu; Xu, Jin; Li, Xuerong; Yu, Xinbing
2013-01-01
Clonorchis sinensis, an ancient parasite that infects a number of piscivorous mammals, attracts significant public health interest due to zoonotic exposure risks in Asia. The available studies are insufficient to reflect the prevalence, geographic distribution, and intraspecific genetic diversity of C. sinensis in endemic areas. Here, a multilocus analysis based on eight genes (ITS1, act, tub, ef-1a, cox1, cox3, nad4 and nad5 [4.986 kb]) was employed to explore the intra-species genetic construction of C. sinensis in China. Two hundred and fifty-six C. sinensis isolates were obtained from environmental reservoirs from 17 provinces of China. A total of 254 recognized Multilocus Types (MSTs) showed high diversity among these isolates using multilocus analysis. The comparison analysis of nuclear and mitochondrial phylogeny supports separate clusters in a nuclear dendrogram. Genetic differentiation analysis of three clusters (A, B, and C) showed low divergence within populations. Most isolates from clusters B and C are geographically limited to central China, while cluster A is extraordinarily genetically diverse. Further genetic analyses between different geographic distributions, water bodies and hosts support the low population divergence. The latter haplotype analyses were consistent with the phylogenetic and genetic differentiation results. A recombination network based on concatenated sequences showed a concentrated linkage recombination population in cox1, cox3, nad4 and nad5, with spatial structuring in ITS1. Coupled with the history record and archaeological evidence of C. sinensis infection in mummified desiccated feces, these data point to an ancient origin of C. sinensis in China. In conclusion, we present a likely phylogenetic structure of the C. sinensis population in mainland China, highlighting its possible tendency for biogeographic expansion. Meanwhile, ITS1 was found to be an effective marker for tracking C. sinensis infection worldwide. 
Thus, the present study improves our understanding of the global epidemiology and evolution of C. sinensis. PMID:23825605
Sexual recombination is a signature of a persisting malaria epidemic in Peru
2011-01-01
Background The aim of this study was to consider the impact that multi-clone, complex infections have on a parasite population structure in a low transmission setting. In general, complexity of infection (minimum number of clones within an infection) and the overall population level diversity are expected to be minimal in low transmission settings. Additionally, the parasite population structure is predicted to be clonal, rather than sexual due to infrequent parasite inoculation and lack of recombination between genetically distinct clones. However, in this low transmission setting of the Peruvian Amazon, complex infections are becoming more frequent, in spite of decreasing infection prevalence. In this study, it was hypothesized that sexual recombination between distinct clonal lineages of Plasmodium falciparum parasites was altering the subpopulation structure and effectively maintaining the population-level diversity. Methods Fourteen microsatellite markers were chosen to describe the genetic diversity in 313 naturally occurring P. falciparum infections from the Peruvian Amazon. The population and subpopulation structure was characterized by measuring: clusteredness, expected heterozygosity (He), allelic richness, private allelic richness, and linkage disequilibrium. Next, microsatellite haplotypes and alleles were correlated with P. falciparum merozoite surface protein 1 Block 2 (Pfmsp1-B2) to examine the presence of recombinant microsatellite haplotypes. Results The parasite population structure consists of six genetically diverse subpopulations of clones, called "clusters". Clusters 1, 3, 4, and 6 have unique haplotypes that exceed 70% of the total number of clones within each cluster, while Clusters 2 and 5 have a lower proportion of unique haplotypes, but still exceed 46%. 
By measuring the He, allelic richness, and private allelic richness within each of the six subpopulations, relatively low levels of genetic diversity within each subpopulation (except Cluster 4) are observed. This indicated that the number of alleles, and not the combination of alleles, is limited. Next, the standard index of association (IAS) was measured, which revealed a significant decay in linkage disequilibrium (LD) associated with Cluster 6, which is indicative of independent assortment of alleles. This decay in LD is a signature of this subpopulation approaching linkage equilibrium by undergoing sexual recombination. To trace possible recombination events, the two most frequent microsatellite haplotypes observed over time (defined by either a K1 or Mad20) were selected as the progenitors and then potential recombinants were identified within the natural population. Conclusions Contrary to conventional low transmission models, this study provides evidence of a parasite population structure that is superficially defined by a clonal backbone. Sexual recombination does occur and even arguably is responsible for maintaining the substructure of this population. PMID:22039962
NASA Astrophysics Data System (ADS)
Conway, Declan; Dalin, Carole; Landman, Willem A.; Osborn, Timothy J.
2017-12-01
Hydropower comprises a significant and rapidly expanding proportion of electricity production in eastern and southern Africa. In both regions, hydropower is exposed to high levels of climate variability and regional climate linkages are strong, yet an understanding of spatial interdependences is lacking. Here we consider river basin configuration and define regions of coherent rainfall variability using cluster analysis to illustrate exposure to the risk of hydropower supply disruption of current (2015) and planned (2030) hydropower sites. Assuming completion of the dams planned, hydropower will become increasingly concentrated in the Nile (from 62% to 82% of total regional capacity) and Zambezi (from 73% to 85%) basins. By 2030, 70% and 59% of total hydropower capacity will be located in one cluster of rainfall variability in eastern and southern Africa, respectively, increasing the risk of concurrent climate-related electricity supply disruption in each region. Linking of nascent regional electricity sharing mechanisms could mitigate intraregional risk, although these mechanisms face considerable political and infrastructural challenges.
Kunkler, I H; Prescott, R J; Lee, R J; Brebner, J A; Cairns, J A; Fielding, R G; Bowman, A; Neades, G; Walls, A D F; Chetty, U; Dixon, J M; Smith, M E; Gardner, T W; Macnab, M; Swann, S; Maclean, J R
2007-11-01
The TELEMAM trial aimed to assess the clinical effectiveness and costs of telemedicine in conducting breast cancer multi-disciplinary meetings (MDTs). Over 12 months 473 MDT patient discussions in two district general hospitals (DGHs) were cluster randomised (2:1) to the intervention of telemedicine linkage to breast specialists in a cancer centre or to the control group of 'in-person' meetings. Primary endpoints were clinical effectiveness and costs. Economic analysis was based on a cost-minimisation approach. Levels of agreement of MDT members on a scale from 1 to 5 were high and similar in both the telemedicine and standard meetings for decision sharing (4.04 versus 4.17), consensus (4.06 versus 4.20) and confidence in the decision (4.16 versus 4.07). The threshold at which the telemedicine meetings became cheaper than standard MDTs was approximately 40 meetings per year. Telemedicine delivered breast cancer multi-disciplinary meetings have similar clinical effectiveness to standard 'in-person' meetings.
Can Network Linkage Effects Determine Return? Evidence from Chinese Stock Market
Qiao, Haishu; Xia, Yue; Li, Ying
2016-01-01
This study used the dynamic conditional correlations (DCC) method to identify the linkage effects of the Chinese stock market, and further detected the influence of network linkage effects on the magnitude of security returns across different industries. Applying two physics-derived techniques, the minimum spanning tree and the hierarchical tree, we analyzed the stock interdependence within the network of the China Securities Index (CSI) industry index basket. We observed that obvious linkage effects existed among stock networks. CII and CCE, CAG and ITH as well as COU, CHA and REI were confirmed as the core nodes in the three different networks respectively. We also investigated the stability of linkage effects by estimating the mean correlations and mean distances, as well as the normalized tree length of these indices. In addition, using the GMM model approach, we found inter-node influence within the stock network had a pronounced effect on stock returns. Our results generally suggested that there appeared to be greater clustering effect among the indexes belonging to related industrial sectors than those of diverse sectors, and network comovement was significantly affected by impactful real-world financial events. Besides, stocks that were more central within the network of stock market usually had higher returns for compensation because they endured greater exposure to correlation risk. PMID:27257816
Conformational and functional analysis of molecular dynamics trajectories by Self-Organising Maps
2011-01-01
Background Molecular dynamics (MD) simulations are powerful tools to investigate the conformational dynamics of proteins, which is often a critical element of their function. Identification of functionally relevant conformations is generally done by clustering the large ensemble of structures that are generated. Recently, Self-Organising Maps (SOMs) were reported to perform more accurately and provide more consistent results than traditional clustering algorithms in various data mining problems. We present a novel strategy to analyse and compare conformational ensembles of protein domains using a two-level approach that combines SOMs and hierarchical clustering. Results The conformational dynamics of the α-spectrin SH3 protein domain and six single mutants were analysed by MD simulations. The Cα's Cartesian coordinates of conformations sampled in the essential space were used as input data vectors for SOM training, then complete linkage clustering was performed on the SOM prototype vectors. A specific protocol to optimize a SOM for structural ensembles was proposed: the optimal SOM was selected by means of a Taguchi experimental design plan applied to different data sets, and the optimal sampling rate of the MD trajectory was selected. The proposed two-level approach was applied to single trajectories of the SH3 domain independently as well as to groups of them at the same time. The results demonstrated the potential of this approach in the analysis of large ensembles of molecular structures: the possibility of producing a topological mapping of the conformational space in a simple 2D visualisation, as well as of effectively highlighting differences in the conformational dynamics directly related to biological functions. Conclusions The use of a two-level approach combining SOMs and hierarchical clustering for conformational analysis of structural ensembles of proteins was proposed. 
It can easily be extended to other study cases and to conformational ensembles from other sources. PMID:21569575
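The second level of the two-level approach described above, complete linkage clustering of SOM prototype vectors, can be sketched as follows. This is a naive pure-Python illustration, not the authors' implementation: SOM training is omitted, the `prototypes` are hypothetical 2D points standing in for trained prototype vectors, and the stopping rule (a fixed number of clusters) is an assumption.

```python
import math


def complete_linkage(points, n_clusters):
    """Agglomerative clustering with complete linkage.

    Returns a list of clusters, each a list of indices into `points`.
    The distance between two clusters is the largest pairwise
    Euclidean distance between their members.
    """
    clusters = [[i] for i in range(len(points))]

    def cluster_distance(a, b):
        return max(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > n_clusters:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                d = cluster_distance(clusters[x], clusters[y])
                if best is None or d < best[0]:
                    best = (d, x, y)
        _, x, y = best
        # Merge the closest pair of clusters and drop the absorbed one.
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return clusters


# Hypothetical prototype vectors (stand-ins for trained SOM neurons).
prototypes = [(0, 0), (0, 1), (5, 5), (5, 6), (10, 0)]
groups = complete_linkage(prototypes, n_clusters=3)
print(groups)
```

Clustering the prototypes rather than every sampled conformation keeps the quadratic pairwise search small, which is one practical motivation for a two-level scheme.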
NASA Technical Reports Server (NTRS)
Slaby, Scott M.; Ewing, David W.; Zehe, Michael J.
1997-01-01
The AM1 semiempirical quantum chemical method was used to model the interaction of perfluoroethers with aluminum surfaces. Perfluorodimethoxymethane and perfluorodimethyl ether were studied interacting with aluminum surfaces, which were modeled by a five-atom cluster and a nine-atom cluster. Interactions were studied for edge (high index) sites and top (low index) sites of the clusters. Both dissociative binding and nondissociative binding were found, with dissociative binding being stronger. The two different ethers bound and dissociated on the clusters in different ways: perfluorodimethoxymethane through its oxygen atoms, but perfluorodimethyl ether through its fluorine atoms. The acetal linkage of perfluorodimethoxymethane was the key structural feature of this molecule in its binding and dissociation on the aluminum surface models. The high-index sites of the clusters caused the dissociation of both ethers. These results are consistent with the experimental observation that perfluorinated ethers decompose in contact with sputtered aluminum surfaces.
Numerical taxonomy and ecology of petroleum-degrading bacteria.
Austin, B; Calomiris, J J; Walker, J D; Colwell, R R
1977-01-01
A total of 99 strains of petroleum-degrading bacteria isolated from Chesapeake Bay water and sediment were identified by using numerical taxonomy procedures. The isolates, together with 33 reference cultures, were examined for 48 biochemical, cultural, morphological, and physiological characters. The data were analyzed by computer, using both the simple matching and the Jaccard coefficients. Clustering was achieved by the unweighted average linkage method. From the sorted similarity matrix and dendrogram, 14 phenetic groups, comprising 85 of the petroleum-degrading bacteria, were defined at the 80 to 85% similarity level. These groups were identified as actinomycetes (mycelial forms, four clusters), coryneforms, Enterobacteriaceae, Klebsiella aerogenes, Micrococcus spp. (two clusters), Nocardia species (two clusters), Pseudomonas spp. (two clusters), and Sphaerotilus natans. It is concluded that the degradation of petroleum is accomplished by a diverse range of bacterial taxa, some of which were isolated only at given sampling stations and, more specifically, from sediment collected at a given station. PMID:889329
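The two similarity coefficients named above differ only in how they treat shared negative characters. A minimal sketch, assuming each strain is scored as a binary vector (1 = character present); the two example strains are hypothetical:

```python
def simple_matching(a, b):
    """Fraction of characters on which two strains agree (1/1 or 0/0)."""
    if len(a) != len(b):
        raise ValueError("character vectors must have equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def jaccard(a, b):
    """Shared positives divided by characters positive in at least one strain.

    Joint absences (0/0) are ignored. Returns 0.0 when neither strain has
    any positive character (a convention chosen for this sketch).
    """
    if len(a) != len(b):
        raise ValueError("character vectors must have equal length")
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0


# Hypothetical test results for two strains over six characters.
strain_a = [1, 1, 0, 0, 1, 0]
strain_b = [1, 0, 0, 0, 1, 1]
s_sm = simple_matching(strain_a, strain_b)
s_j = jaccard(strain_a, strain_b)
print(round(s_sm, 3), s_j)
```

A full pairwise matrix of either coefficient is the usual input to average linkage clustering.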
Locia-Aguilar, G J; López-Saucedo, B; Deheza-Bautista, S; Salado-Beltrán, O V; Martínez-Sevilla, V M; Rangel-Villalobos, H
2018-03-31
Allele distribution and forensic parameters were estimated for 15 STR loci (AmpFlSTR Identifiler kit) in 251 Mexican-Mestizos from the state of Guerrero (South, Mexico). Genotype distribution was in agreement with Hardy-Weinberg expectations for all 15 STRs. Similarly, the linkage disequilibrium test demonstrated no association between pairs of loci. The power of exclusion and power of discrimination values were 99.999634444% and >99.99999999%, respectively. Genetic relationship analysis regarding Mestizo populations from the main geographic regions of Mexico suggests that the Center and the present South regions form one population cluster, separated from the Southeast and Northwest regions. Copyright © 2018 Elsevier B.V. All rights reserved.
Topology of the correlation networks among major currencies using hierarchical structure methods
NASA Astrophysics Data System (ADS)
Keskin, Mustafa; Deviren, Bayram; Kocakaplan, Yusuf
2011-02-01
We studied the topology of correlation networks among 34 major currencies using the concept of a minimal spanning tree and hierarchical tree for the full years of 2007-2008 when major economic turbulence occurred. We used the USD (US Dollar) and the TL (Turkish Lira) as numeraires in which the USD was the major currency and the TL was the minor currency. We derived a hierarchical organization and constructed minimal spanning trees (MSTs) and hierarchical trees (HTs) for the full years of 2007, 2008 and for the 2007-2008 period. We performed a technique to associate a value of reliability to the links of MSTs and HTs by using bootstrap replicas of data. We also used the average linkage cluster analysis for obtaining the hierarchical trees in the case of the TL as the numeraire. These trees are useful tools for understanding and detecting the global structure, taxonomy and hierarchy in financial data. We illustrated how the minimal spanning trees and their related hierarchical trees developed over a period of time. From these trees we identified different clusters of currencies according to their proximity and economic ties. The clustered structure of the currencies and the key currency in each cluster were obtained and we found that the clusters matched nicely with the geographical regions of corresponding countries in the world such as Asia or Europe. As expected the key currencies were generally those showing major economic activity.
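A minimal spanning tree of the kind described above can be sketched from a correlation matrix. The abstract does not state the distance metric; the commonly used transform d = sqrt(2(1 - ρ)) is assumed here, Kruskal's algorithm is one of several ways to build the tree, and the four currencies and their correlations are hypothetical.

```python
import math


def correlation_distance(rho):
    """Map a correlation coefficient to a distance (assumed metric)."""
    return math.sqrt(2.0 * (1.0 - rho))


def minimal_spanning_tree(labels, corr):
    """Kruskal's algorithm on the complete graph of correlation distances.

    Returns a list of (label_i, label_j, distance) edges.
    """
    n = len(labels)
    edges = sorted(
        (correlation_distance(corr[i][j]), i, j)
        for i in range(n)
        for j in range(i + 1, n)
    )
    parent = list(range(n))

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for d, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:  # adding this edge does not create a cycle
            parent[ri] = rj
            tree.append((labels[i], labels[j], d))
    return tree


# Hypothetical correlations of currency returns against a common numeraire.
labels = ["EUR", "CHF", "JPY", "AUD"]
corr = [
    [1.0, 0.9, 0.3, 0.5],
    [0.9, 1.0, 0.4, 0.2],
    [0.3, 0.4, 1.0, 0.1],
    [0.5, 0.2, 0.1, 1.0],
]
tree = minimal_spanning_tree(labels, corr)
for a, b, d in tree:
    print(a, b, round(d, 3))
```

The tree keeps only the n - 1 strongest non-redundant links, which is what makes clusters of closely tied currencies visible.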
Divis, Paul C. S.; Singh, Balbir; Anderios, Fread; Hisam, Shamilah; Matusop, Asmad; Kocken, Clemens H.; Assefa, Samuel A.; Duffy, Craig W.; Conway, David J.
2015-01-01
Human malaria parasite species were originally acquired from other primate hosts and subsequently became endemic, then spread throughout large parts of the world. A major zoonosis is now occurring with Plasmodium knowlesi from macaques in Southeast Asia, with a recent acceleration in numbers of reported cases particularly in Malaysia. To investigate the parasite population genetics, we developed sensitive and species-specific microsatellite genotyping protocols and applied these to analysis of samples from 10 sites covering a range of >1,600 km within which most cases have occurred. Genotypic analyses of 599 P. knowlesi infections (552 in humans and 47 in wild macaques) at 10 highly polymorphic loci provide radical new insights on the emergence. Parasites from sympatric long-tailed macaques (Macaca fascicularis) and pig-tailed macaques (M. nemestrina) were very highly differentiated (FST = 0.22, and K-means clustering confirmed two host-associated subpopulations). Approximately two thirds of human P. knowlesi infections were of the long-tailed macaque type (Cluster 1), and one third were of the pig-tailed-macaque type (Cluster 2), with relative proportions varying across the different sites. Among the samples from humans, there was significant indication of genetic isolation by geographical distance overall and within Cluster 1 alone. Across the different sites, the level of multi-locus linkage disequilibrium correlated with the degree of local admixture of the two different clusters. The widespread occurrence of both types of P. knowlesi in humans enhances the potential for parasite adaptation in this zoonotic system. PMID:26020959
Sobel, E.; Lange, K.
1996-01-01
The introduction of stochastic methods in pedigree analysis has enabled geneticists to tackle computations intractable by standard deterministic methods. Until now these stochastic techniques have worked by running a Markov chain on the set of genetic descent states of a pedigree. Each descent state specifies the paths of gene flow in the pedigree and the founder alleles dropped down each path. The current paper follows up on a suggestion by Elizabeth Thompson that genetic descent graphs offer a more appropriate space for executing a Markov chain. A descent graph specifies the paths of gene flow but not the particular founder alleles traveling down the paths. This paper explores algorithms for implementing Thompson's suggestion for codominant markers in the context of automatic haplotyping, estimating location scores, and computing gene-clustering statistics for robust linkage analysis. Realistic numerical examples demonstrate the feasibility of the algorithms. PMID:8651310
USDA-ARS's Scientific Manuscript database
The structural analysis of complex carbohydrates typically requires the assignment of three parameters: monosaccharide composition, the position of glycosidic linkages between monosaccharides, and the position and nature of non-carbohydrate substituents. The glycosidic linkage positions are often de...
Two genetic markers closely linked to adult polycystic kidney disease on chromosome 16.
Reeders, S T; Breuning, M H; Corney, G; Jeremiah, S J; Meera Khan, P; Davies, K E; Hopkinson, D A; Pearson, P L; Weatherall, D J
1986-01-01
The genetic locus for autosomal dominant adult polycystic kidney disease was recently assigned to chromosome 16 by the finding of genetic linkage to the alpha globin gene cluster. Further study showed that the phosphoglycolate phosphatase locus is also closely linked to both the locus for adult polycystic kidney disease and the alpha globin gene cluster. These findings have important implications for the prenatal and presymptomatic diagnosis of adult polycystic kidney disease and for a better understanding of its pathogenesis. PMID:3008903
Koizumi, A; Shoji, Y; Nozaki, J; Noguchi, A; E, X; Dakeishi, M; Ohura, T; Tsuyoshi, K; Yasuhiko, W; Manabe, M; Takasago, Y; Takada, G
2000-09-01
Lysinuric protein intolerance (LPI) is an autosomal recessive disease characterized by defective transport of the dibasic amino acids. Mutational analysis of LPI patients in the northern part of Japan revealed that six were homozygous for the R410X mutation and two others were compound heterozygotes of R410X and other unknown mutations. In the population epidemiology study in a local cluster in the northern part of Iwate, ten heterozygotes were found in 1190 newborn babies, leading to an estimated LPI incidence of 1/57,000. Polymorphism analysis revealed two major alleles, A and B, in intron 8. While the population frequency of allele A was 0.9 and that of allele B was 0.1 in the northern part of Japan, the R410X mutations were exclusively on allele B in 31 chromosomes, suggesting a founder effect. Genetic analysis in patients revealed strong linkage disequilibrium with D14S283 and TCRA, indicating that the R410X mutation occurred at least 130 generations ago (about 2600 years). The R410X mutation was shown to be useful as a molecular marker for screening LPI patients in the northern part of Japan. Copyright 2000 Wiley-Liss, Inc.
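The incidence figure quoted above is consistent with a simple Hardy-Weinberg calculation. The abstract does not describe how the estimate was derived, so the sketch below is an assumption: each of the ten heterozygous newborns is taken to carry one disease allele, and incidence is taken as the squared allele frequency.

```python
# Figures taken from the abstract.
heterozygotes = 10
newborns = 1190

# Each newborn contributes two chromosomes; each heterozygote carries one
# mutant allele, so the mutant allele frequency q is carriers / chromosomes.
allele_freq = heterozygotes / (2 * newborns)

# Under Hardy-Weinberg proportions, affected homozygotes occur at q squared.
incidence = allele_freq ** 2
one_in = 1 / incidence

# Generation time implied by "130 generations (about 2600 years)".
years_per_generation = 2600 / 130

print(f"q = {allele_freq:.5f}")
print(f"incidence = 1 in {one_in:,.0f}")
print(f"years per generation = {years_per_generation:.0f}")
```

The result, about 1 in 56,600, rounds to the 1/57,000 reported in the abstract.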
Nickel-catalyzed proton-deuterium exchange (HDX) for linkage analysis of complex carbohydrates
USDA-ARS's Scientific Manuscript database
The structural assignment of complex carbohydrates typically requires the analysis of at least three parameters: 1. composition; 2. linkage; and 3. substituents. These are often assigned on a small scale by gas chromatography/mass spectrometry (GC/MS). Linkage positions are determined by permethylat...
Localization of genes involved in the metabolic syndrome using multivariate linkage analysis.
Olswold, Curtis; de Andrade, Mariza
2003-12-31
There are no well accepted criteria for the diagnosis of the metabolic syndrome. However, the metabolic syndrome is identified clinically by the presence of three or more of these five variables: larger waist circumference, higher triglyceride levels, lower HDL-cholesterol concentrations, hypertension, and impaired fasting glucose. We use sets of two or three variables, which are available in the Framingham Heart Study data set, to localize genes responsible for this syndrome using multivariate quantitative linkage analysis. This analysis demonstrates the applicability of using multivariate linkage analysis and how its use increases the power to detect linkage when genes are involved in the same disease mechanism.
A guide to evaluating linkage quality for the analysis of linked data.
Harron, Katie L; Doidge, James C; Knight, Hannah E; Gilbert, Ruth E; Goldstein, Harvey; Cromwell, David A; van der Meulen, Jan H
2017-10-01
Linked datasets are an important resource for epidemiological and clinical studies, but linkage error can lead to biased results. For data security reasons, linkage of personal identifiers is often performed by a third party, making it difficult for researchers to assess the quality of the linked dataset in the context of specific research questions. This is compounded by a lack of guidance on how to determine the potential impact of linkage error. We describe how linkage quality can be evaluated and provide widely applicable guidance for both data providers and researchers. Using an illustrative example of a linked dataset of maternal and baby hospital records, we demonstrate three approaches for evaluating linkage quality: applying the linkage algorithm to a subset of gold standard data to quantify linkage error; comparing characteristics of linked and unlinked data to identify potential sources of bias; and evaluating the sensitivity of results to changes in the linkage procedure. These approaches can inform our understanding of the potential impact of linkage error and provide an opportunity to select the most appropriate linkage procedure for a specific analysis. Evaluating linkage quality in this way will improve the quality and transparency of epidemiological and clinical research using linked data. © The Author 2017. Published by Oxford University Press on behalf of the International Epidemiological Association.
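The first of the three approaches above, quantifying linkage error against a gold-standard subset, can be sketched with two simple rates. The function name, the rate definitions, and the mother/baby record identifiers are illustrative assumptions, not taken from the paper.

```python
def linkage_error_rates(predicted_links, true_links):
    """Compare linked record pairs with a gold-standard set of true pairs.

    Returns (false_match_rate, missed_match_rate):
      false_match_rate  = predicted links that are wrong / all predicted links
      missed_match_rate = true links not found / all true links
    """
    predicted = set(predicted_links)
    truth = set(true_links)
    false_matches = predicted - truth
    missed_matches = truth - predicted
    false_rate = len(false_matches) / len(predicted) if predicted else 0.0
    missed_rate = len(missed_matches) / len(truth) if truth else 0.0
    return false_rate, missed_rate


# Hypothetical (mother_id, baby_id) pairs.
gold_standard = {("m1", "b1"), ("m2", "b2"), ("m3", "b3"), ("m4", "b4")}
algorithm_output = {("m1", "b1"), ("m2", "b2"), ("m3", "b9")}

false_rate, missed_rate = linkage_error_rates(algorithm_output, gold_standard)
print(round(false_rate, 3), missed_rate)
```

Re-running the same comparison under alternative linkage settings gives a simple way to see how the two error rates trade off.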
Samadpour, M; Grimm, L M; Desai, B; Alfi, D; Ongerth, J E; Tarr, P I
1993-12-01
Genomic DNAs prepared from 168 isolates of Escherichia coli O157:H7 were analyzed for restriction fragment length polymorphisms on Southern blots probed with bacteriophage lambda DNA. The isolates analyzed included strains from a recent large multistate outbreak of E. coli O157:H7 infection associated with consumption of poorly cooked beef in restaurants, a day-care center cluster, and temporally and geographically unrelated isolates. E. coli O157:H7 isolates recovered from the incriminated meat and from 61 (96.8%) of 63 patients from Washington and Nevada possessed identical lambda restriction fragment length patterns. The lambda restriction fragment length polymorphisms observed in 11 (91.7%) of 12 day-care center patients were identical, but they differed from that of the strain associated with the multistate outbreak. E. coli O157:H7 from 42 patients temporally or geographically unrelated to either cluster of infection possessed unique and different lambda restriction fragment length patterns, except for paired isolates from three separate clusters of infection. These data demonstrate that the hybridization of DNA digests of E. coli O157:H7 with radiolabelled bacteriophage lambda DNA can be a useful, stable, and discriminatory epidemiologic tool for analyzing the linkage between strains of E. coli O157:H7.
Haplotype analysis of the apolipoprotein gene cluster on human chromosome 11
Olivier, Michael; Wang, Xujing; Cole, Regina; Gau, Brian; Kim, Jessica; Rubin, Edward M.; Pennacchio, Len A.
2009-01-01
Members of the apolipoprotein gene cluster (APOA1/C3/A4/A5) on human chromosome 11q23 play an important role in lipid metabolism. Polymorphisms in both APOA5 and APOC3 are strongly associated with plasma triglyceride concentrations. The close genomic locations of these two genes as well as their functional similarity have hindered efforts to define whether each gene independently influences human triglyceride concentrations. In this study, we examined the linkage disequilibrium and haplotype structure of 49 SNPs in a 150-kb region spanning the gene cluster. We identified a total of five common APOA5 haplotypes with a frequency of greater than 8% in samples of northern European origin. The APOA5 haplotype block did not extend past the 7 SNPs in the gene and was separated from the other apolipoprotein gene in the cluster by a region of significantly increased recombination. Furthermore, one previously identified triglyceride risk haplotype of APOA5 (APOA5*3) showed no association with three APOC3 SNPs previously associated with triglyceride concentrations, in contrast to the other risk haplotype (APOA5*2), which was associated with all three minor APOC3 SNP alleles. These results highlight the complex genetic relationship between APOA5 and APOC3 and support the notion that APOA5 represents an independent risk gene affecting plasma triglyceride concentrations in humans. PMID:15081120
Swarm v2: highly-scalable and high-resolution amplicon clustering
Quince, Christopher; de Vargas, Colomban; Dunthorn, Micah
2015-01-01
Previously we presented Swarm v1, a novel and open source amplicon clustering program that produced fine-scale molecular operational taxonomic units (OTUs), free of arbitrary global clustering thresholds and input-order dependency. Swarm v1 worked with an initial phase that used iterative single-linkage with a local clustering threshold (d), followed by a phase that used the internal abundance structures of clusters to break chained OTUs. Here we present Swarm v2, which has two important novel features: (1) a new algorithm for d = 1 that allows the computation time of the program to scale linearly with increasing amounts of data; and (2) the new fastidious option that reduces under-grouping by grafting low abundant OTUs (e.g., singletons and doubletons) onto larger ones. Swarm v2 also directly integrates the clustering and breaking phases, dereplicates sequencing reads with d = 0, outputs OTU representatives in fasta format, and plots individual OTUs as two-dimensional networks. PMID:26713226
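The iterative single-linkage phase with a local threshold d described above can be sketched as follows. This is not Swarm's implementation: it is a quadratic-time toy that assumes equal-length sequences compared by Hamming distance, and it omits the abundance-based breaking phase and the fastidious option. All names and the example reads are hypothetical.

```python
from collections import deque


def differences(a, b):
    """Number of mismatches between two equal-length sequences (Hamming distance).

    Sequences of different length are treated as farther apart than any threshold.
    """
    if len(a) != len(b):
        return float("inf")
    return sum(x != y for x, y in zip(a, b))


def local_single_linkage(abundance, d=1):
    """Grow clusters from the most abundant unassigned sequence, repeatedly adding
    any unassigned sequence within d differences of a current cluster member."""
    ordered = sorted(abundance, key=lambda s: (-abundance[s], s))
    assigned = set()
    clusters = []
    for seed in ordered:
        if seed in assigned:
            continue
        cluster, queue = [seed], deque([seed])
        assigned.add(seed)
        while queue:
            current = queue.popleft()
            for candidate in ordered:
                if candidate not in assigned and differences(current, candidate) <= d:
                    assigned.add(candidate)
                    cluster.append(candidate)
                    queue.append(candidate)
        clusters.append(cluster)
    return clusters


# Hypothetical dereplicated reads with their abundances.
reads = {"ACGT": 50, "ACGA": 5, "ACAA": 2, "TTTT": 9, "TTTA": 1}
clusters = local_single_linkage(reads, d=1)
```

Here "ACAA" is two differences from the seed "ACGT" but joins its cluster through "ACGA", which is the chaining behaviour that a local threshold permits and that a breaking step would then have to control.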
SECIMTools: a suite of metabolomics data analysis tools.
Kirpich, Alexander S; Ibarra, Miguel; Moskalenko, Oleksandr; Fear, Justin M; Gerken, Joseph; Mi, Xinlei; Ashrafi, Ali; Morse, Alison M; McIntyre, Lauren M
2018-04-20
Metabolomics has the promise to transform the area of personalized medicine with the rapid development of high throughput technology for untargeted analysis of metabolites. Open access, easy to use, analytic tools that are broadly accessible to the biological community need to be developed. While technology used in metabolomics varies, most metabolomics studies have a set of features identified. Galaxy is an open access platform that enables scientists at all levels to interact with big data. Galaxy promotes reproducibility by saving histories and enabling the sharing of workflows among scientists. SECIMTools (SouthEast Center for Integrated Metabolomics) is a set of Python applications that are available both as standalone tools and wrapped for use in Galaxy. The suite includes a comprehensive set of quality control metrics (retention time window evaluation and various peak evaluation tools), visualization techniques (hierarchical cluster heatmap, principal component analysis, modular modularity clustering), basic statistical analysis methods (partial least squares - discriminant analysis, analysis of variance, t-test, Kruskal-Wallis non-parametric test), advanced classification methods (random forest, support vector machines), and advanced variable selection tools (least absolute shrinkage and selection operator LASSO and Elastic Net). SECIMTools leverages the Galaxy platform and enables integrated workflows for metabolomics data analysis made from building blocks designed for easy use and interpretability. Standard data formats and a set of utilities allow arbitrary linkages between tools to encourage novel workflow designs. The Galaxy framework enables future data integration for metabolomics studies with other omics data.
Yuan, Congying; Wang, Meinan; Skinner, Danniel Z; See, Deven R; Xia, Chongjing; Guo, Xinhong; Chen, Xianming
2018-01-01
Puccinia striiformis f. sp. tritici, the wheat stripe rust pathogen, is a dikaryotic, biotrophic, and macrocyclic fungus. Genetic study of P. striiformis f. sp. tritici virulence was not possible until the recent discovery of Berberis spp. and Mahonia spp. as alternate hosts. To determine inheritance of virulence and map virulence genes, a segregating population of 119 isolates was developed by self-fertilizing P. striiformis f. sp. tritici isolate 08-220 (race PSTv-11) on barberry leaves under controlled greenhouse conditions. The progeny isolates were phenotyped on a set of 29 wheat lines with single genes for race-specific resistance and genotyped with simple sequence repeat (SSR) markers, single nucleotide polymorphism (SNP) markers derived from secreted protein genes, and SNP markers from genotyping-by-sequencing (GBS). Using the GBS technique, 10,163 polymorphic GBS-SNP markers were identified. Clustering and principal component analysis grouped these markers into six genetic groups, and a genetic map, consisting of six linkage groups, was constructed with 805 markers. The six clusters or linkage groups resulting from these analyses indicated a haploid chromosome number of six in P. striiformis f. sp. tritici. Through virulence testing of the progeny isolates, the parental isolate was found to be homozygous for the avirulence loci corresponding to resistance genes Yr5, Yr10, Yr15, Yr24, Yr32, YrSP, YrTr1, Yr45, and Yr53 and homozygous for the virulence locus corresponding to resistance gene Yr41. Segregation was observed for virulence phenotypes in response to the remaining 19 single-gene lines. A single dominant gene or two dominant genes with different nonallelic gene interactions were identified for each of the segregating virulence phenotypes. Of 27 dominant virulence genes identified, 17 were mapped to two chromosomes. Markers tightly linked to some of the virulence loci may facilitate further studies to clone these genes. 
The virulence genes and their inheritance information are useful for understanding the host-pathogen interactions and for selecting effective resistance genes or gene combinations for developing stripe rust resistant wheat cultivars.
Graves, T.A.; Farley, S.; Goldstein, M.I.; Servheen, C.
2007-01-01
We identified primary habitat and functional corridors across a landscape using Global Positioning System (GPS) collar locations of brown bears (Ursus arctos). After deriving density, speed, and angular deviation of movement, we classified landscape function for a group of animals with a cluster analysis. We described areas with high amounts of sinuous movement as primary habitat patches and areas with high amounts of very directional, fast movement as highly functional bear corridors. The time between bear locations and scale of analysis influenced the number and size of corridors identified. Bear locations should be collected at intervals ≤6 h to correctly identify travel corridors. Our corridor identification technique will help managers move beyond the theoretical discussion of corridors and linkage zones to active management of landscape features that will preserve connectivity. © 2007 Springer Science+Business Media, Inc.
Existence and significance of communities in the World Trade Web
NASA Astrophysics Data System (ADS)
Piccardi, Carlo; Tajoli, Lucia
2012-06-01
The World Trade Web (WTW), which models the international transactions among countries, is a fundamental tool for studying the economics of trade flows, their evolution over time, and their implications for a number of phenomena, including the propagation of economic shocks among countries. In this respect, the possible existence of communities is a key point, because it would imply that countries are organized in groups of preferential partners. In this paper, we use four approaches to analyze communities in the WTW between 1962 and 2008, based, respectively, on modularity optimization, cluster analysis, stability functions, and persistence probabilities. Overall, the four methods agree in finding no evidence of significant partitions. A few weak communities emerge from the analysis, but they do not represent secluded groups of countries, as intercommunity linkages are also strong, supporting the view of a truly globalized trading system.
Introduction to Vocations. High Tech Focus. Final Report 1984-85.
ERIC Educational Resources Information Center
Wayne Township Schools, NJ.
This report contains the materials that were developed during a project to make middle-grade students more aware of high tech careers through the following activities: (1) teacher and student visitations of community sites to explore high tech careers in 15 occupational clusters; (2) exploratory activities to facilitate linkages and articulation…
The impact of meteorology on ozone in Houston
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eder, B.K.; Davis, J.M.; Nychka, D.
1997-12-31
This paper compares the results from both a one-stage hierarchical clustering technique (average linkage) and a two-stage technique (average linkage then k-means) as part of an objective meteorological classification scheme designed to better elucidate ozone's dependence on meteorology in the Houston, Texas, area. When applied to twelve years of meteorological data (1981-1992), each technique identified seven statistically distinct meteorological regimes, the majority of which exhibited significantly different daily 1-hour maximum ozone (O3) concentrations. While both clustering approaches proved successful, the two-stage approach did appear superior in terms of better segregation of the mean O3 concentrations. Both approaches indicated that the largest mean daily one-hour maximum concentrations are associated with migrating anticyclones and not with the quasi-permanent Bermuda High that often dominates the southeastern United States during the summer. As a result, maximum ozone concentrations are just as likely during the months of April, May, September and October as they are during the summer months. These findings support and help explain the unique O3 climatology experienced by the Houston area.
Novel molecular subtypes of serous and endometrioid ovarian cancer linked to clinical outcome.
Tothill, Richard W; Tinker, Anna V; George, Joshy; Brown, Robert; Fox, Stephen B; Lade, Stephen; Johnson, Daryl S; Trivett, Melanie K; Etemadmoghadam, Dariush; Locandro, Bianca; Traficante, Nadia; Fereday, Sian; Hung, Jillian A; Chiew, Yoke-Eng; Haviv, Izhak; Gertig, Dorota; DeFazio, Anna; Bowtell, David D L
2008-08-15
The study aimed to identify novel molecular subtypes of ovarian cancer by gene expression profiling with linkage to clinical and pathologic features. Microarray gene expression profiling was done on 285 serous and endometrioid tumors of the ovary, peritoneum, and fallopian tube. K-means clustering was applied to identify robust molecular subtypes. Statistical analysis identified differentially expressed genes, pathways, and gene ontologies. Laser capture microdissection, pathology review, and immunohistochemistry validated the array-based findings. Patient survival within k-means groups was evaluated using Cox proportional hazards models. Class prediction validated k-means groups in an independent dataset. A semisupervised survival analysis of the array data was used to compare against unsupervised clustering results. Optimal clustering of array data identified six molecular subtypes. Two subtypes represented predominantly serous low malignant potential and low-grade endometrioid subtypes, respectively. The remaining four subtypes represented higher grade and advanced stage cancers of serous and endometrioid morphology. A novel subtype of high-grade serous cancers reflected a mesenchymal cell type, characterized by overexpression of N-cadherin and P-cadherin and low expression of differentiation markers, including CA125 and MUC1. A poor prognosis subtype was defined by a reactive stroma gene expression signature, correlating with extensive desmoplasia in such samples. A similar poor prognosis signature could be found using a semisupervised analysis. Each subtype displayed distinct levels and patterns of immune cell infiltration. Class prediction identified similar subtypes in an independent ovarian dataset with similar prognostic trends. Gene expression profiling identified molecular subtypes of ovarian cancer of biological and clinical importance.
Naumenko, Olesya I; Zheng, Han; Wang, Jianping; Senchenkova, Sof'ya N; Wang, Hong; Shashkov, Alexander S; Chizhov, Alexander O; Li, Qun; Knirel, Yuriy A; Xiong, Yanwen
2018-03-02
The O-specific polysaccharide (O-antigen) was obtained by mild acid degradation of the lipopolysaccharide of Escherichia albertii O5 (strain T150248) and studied by sugar analysis, selective cleavages of glycosidic linkages, and 1D and 2D 1H and 13C NMR spectroscopy. Partial solvolysis with anhydrous CF3CO2H and hydrolysis with 0.05 M CF3CO2H cleaved predominantly the glycosidic linkage of β-GalpNAc or β-Galf, respectively, whereas the linkages of α-GlcpNAc and β-Galp were stable. Mixtures of the corresponding tri- and tetra-saccharides thus obtained were studied by NMR spectroscopy and high-resolution ESI MS. The following new structure was established for the tetrasaccharide repeat (O-unit) of the O-polysaccharide: →4)-α-d-GlcpNAc-(1 → 4)-β-d-Galp6Ac-(1 → 6)-β-d-Galf-(1 → 3)-β-d-GalpNAc-(1→, where the degree of O-acetylation of d-Galp is ∼70%. The O-polysaccharide studied has a β-d-Galp-(1 → 6)-β-d-Galf-(1 → 3)-β-d-GalpNAc trisaccharide fragment in common with the O-polysaccharides of E. albertii O7, Escherichia coli O124 and O164, and Shigella dysenteriae type 3 studied earlier. The orf5-7 in the O-antigen gene cluster of E. albertii O5 are 47%, 78%, and 75% identical on the amino acid level to genes for predicted enzymes of E. albertii O7, including Galp-transferase wfeS, UDP-d-Galp mutase glf, and Galf-transferase wfeT, respectively, which are putatively involved with the synthesis of the shared trisaccharide fragment of the O-polysaccharides. The occurrence upstream of the O-antigen gene cluster of a 4-epimerase gene gnu for conversion of undecaprenyl diphosphate-linked d-GlcNAc (UndPP-d-GlcNAc) into UndPP-d-GalNAc indicates that d-GalNAc is the first monosaccharide of the O-unit, and hence the O-units are interlinked in the O-polysaccharide of E. albertii O5 by the β-d-GalpNAc-(1 → 4)-α-d-GlcpNAc linkage. Copyright © 2017. Published by Elsevier Ltd.
Welsh, Wayne N; Knudsen, Hannah K; Knight, Kevin; Ducharme, Lori; Pankow, Jennifer; Urbine, Terry; Lindsey, Adrienne; Abdel-Salam, Sami; Wood, Jennifer; Monico, Laura; Link, Nathan; Albizu-Garcia, Carmen; Friedmann, Peter D
2016-01-01
Weak coordination between community correctional agencies and community-based treatment providers is a major barrier to diffusion of medication-assisted treatment (MAT)--the inclusion of medications (e.g., methadone and buprenorphine) in combination with traditional counseling and behavioral therapies to treat substance use disorders. In a multisite cluster randomized trial, experimental sites (j = 10) received a 3-h MAT training plus a 12-month linkage intervention; control sites (j = 10) received the 3-h training alone. Hierarchical linear models showed that the intervention resulted in significant improvements in perceptions of interagency coordination among treatment providers, but not probation/parole agents. Implications for policy and practice are discussed.
Accommodating Chromosome Inversions in Linkage Analysis
Chen, Gary K.; Slaten, Erin; Ophoff, Roel A.; Lange, Kenneth
2006-01-01
This work develops a population-genetics model for polymorphic chromosome inversions. The model precisely describes how an inversion changes the nature of and approach to linkage equilibrium. The work also describes algorithms and software for allele-frequency estimation and linkage analysis in the presence of an inversion. The linkage algorithms implemented in the software package Mendel estimate recombination parameters and calculate the posterior probability that each pedigree member carries the inversion. Application of Mendel to eight Centre d'Étude du Polymorphisme Humain pedigrees in a region containing a common inversion on 8p23 illustrates its potential for providing more-precise estimates of the location of an unmapped marker or trait gene. Our expanded cytogenetic analysis of these families further identifies inversion carriers and increases the evidence of linkage. PMID:16826515
Yan, Haidong; Zhang, Yu; Zeng, Bing; Yin, Guohua; Zhang, Xinquan; Ji, Yang; Huang, Linkai; Jiang, Xiaomei; Liu, Xinchun; Peng, Yan; Ma, Xiao; Yan, Yanhong
2016-01-08
Orchardgrass (Dactylis glomerata L.) is a well-known perennial forage species; however, rust diseases have caused a noticeable reduction in the quality and production of orchardgrass. In this study, genetic diversity was assessed and the marker-trait associations for rust were examined using 18 EST-SSR and 21 SCoT markers in 75 orchardgrass accessions. A high level of genetic diversity was detected in orchardgrass with an average genetic diversity index of 0.369. For the EST-SSR and SCoT markers, 164 and 289 total bands were obtained, of which 148 (90.24%) and 272 (94.12%) were polymorphic, respectively. Results from an AMOVA analysis showed that more genetic variance existed within populations (87.57%) than among populations (12.43%). Using a parameter marker index, the efficiencies of the EST-SSR and SCoT markers were compared to show that SCoTs have higher marker efficiency (8.07) than EST-SSRs (4.82). The results of a UPGMA cluster analysis and a STRUCTURE analysis were both correlated with the geographic distribution of the orchardgrass accessions. Linkage disequilibrium analysis revealed an average r² of 0.1627 across all band pairs, indicating a high extent of linkage disequilibrium in the material. An association analysis between the rust trait and 410 bands from the EST-SSR and SCoT markers using TASSEL software revealed 20 band panels were associated with the rust trait in both 2011 and 2012. The 20 bands obtained from association analysis could be used in breeding programs for lineage selection to prevent great losses of orchardgrass caused by rust, and provide valuable information for further association mapping using this collection of orchardgrass.
Conclusion of LOD-score analysis for family data generated under two-locus models.
Dizier, M H; Babron, M C; Clerget-Darpoux, F
1996-06-01
The power to detect linkage by the LOD-score method is investigated here for diseases that depend on the effects of two genes. The classical strategy is, first, to detect a major-gene (MG) effect by segregation analysis and, second, to seek linkage with genetic markers by the LOD-score method using the MG parameters. We already showed that segregation analysis can lead to evidence for a MG effect for many two-locus models, with the estimates of the MG parameters being very different from those of the two genes involved in the disease. We show here that use of these MG parameter estimates in the LOD-score analysis may lead to a failure to detect linkage for some two-locus models. For these models, use of the sib-pair method gives a non-negligible increase of power to detect linkage. The linkage-homogeneity test among subsamples differing for the familial disease distribution provides evidence of parameter misspecification, when the MG parameters are used. Moreover, for most of the models, use of the MG parameters in LOD-score analysis leads to a large bias in estimation of the recombination fraction and sometimes also to a rejection of linkage for the true recombination fraction. A final important point is that strong evidence of an MG effect, obtained by segregation analysis, does not necessarily imply that linkage will be detected for at least one of the two genes, even with the true parameters and with a close informative marker.
A novel nonsense mutation in CRYBB1 associated with autosomal dominant congenital cataract
Yang, Juhua; Zhu, Yihua; Gu, Feng; He, Xiang; Cao, Zongfu; Li, Xuexi; Tong, Yi
2008-01-01
Purpose: To identify the molecular defect underlying an autosomal dominant congenital nuclear cataract in a Chinese family. Methods: Twenty-two members of a three-generation pedigree were recruited, clinical examinations were performed, and genomic DNA was extracted from peripheral blood leukocytes. All members were genotyped with polymorphic microsatellite markers adjacent to each of the known cataract-related genes. Linkage analysis was performed after genotyping. Candidate genes were screened for mutation using direct sequencing. Individuals were screened for presence of a mutation by restriction fragment length polymorphism (RFLP) analysis. Results: Linkage analysis identified a maximum LOD score of 3.31 (recombination fraction [θ]=0.0) with marker D22S1167 on chromosome 22, which flanks the β-crystallin gene cluster (CRYBB3, CRYBB2, CRYBB1, and CRYBA4). Sequencing the coding regions and the flanking intronic sequences of these four candidate genes identified a novel, heterozygous C→T transition in exon 6 of CRYBB1 in the affected individuals of the family. This single nucleotide change introduced a novel BfaI site and was predicted to result in a nonsense mutation at codon 223 that changed a phylogenetically conserved amino acid to a stop codon (p.Q223X). RFLP analysis confirmed that this mutation co-segregated with the disease phenotype in all available family members and was not found in 100 normal unrelated individuals from the same ethnic background. Conclusions: This study has identified a novel nonsense mutation in CRYBB1 (p.Q223X) associated with autosomal dominant congenital nuclear cataract. PMID:18432316
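To give a feel for how a two-point LOD score at a given recombination fraction arises, here is a minimal sketch for the simplest textbook case of phase-known, fully informative meioses. The abstract does not state how many informative meioses the pedigree contained or which software computed the score, so the function name and the count of 11 meioses below are purely illustrative assumptions; real pedigree analysis uses full likelihood calculations.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """LOD score for phase-known, fully informative meioses at recombination fraction theta."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    nonrecombinants = meioses - recombinants
    # Likelihood under linkage; Python evaluates 0.0 ** 0 as 1.0, which covers
    # theta = 0 when no recombinants are observed.
    likelihood_linked = (theta ** recombinants) * ((1.0 - theta) ** nonrecombinants)
    # Likelihood under free recombination (theta = 0.5).
    likelihood_unlinked = 0.5 ** meioses
    if likelihood_linked == 0.0:
        return float("-inf")  # an observed recombinant excludes theta = 0
    return math.log10(likelihood_linked / likelihood_unlinked)


# Hypothetical example: 11 informative meioses, none recombinant, evaluated at theta = 0.
lod = two_point_lod(recombinants=0, meioses=11, theta=0.0)
```

With no recombinants the score reduces to the number of meioses times log10(2), so each additional informative, non-recombinant meiosis adds about 0.3 to the LOD score.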
Two-trait-locus linkage analysis: A powerful strategy for mapping complex genetic traits
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schork, N.J.; Boehnke, M.; Terwilliger, J.D.
1993-11-01
Nearly all diseases mapped to date follow clear Mendelian, single-locus segregation patterns. In contrast, many common familial diseases such as diabetes, psoriasis, several forms of cancer, and schizophrenia are familial and appear to have a genetic component but do not exhibit simple Mendelian transmission. More complex models are required to explain the genetics of these important diseases. In this paper, the authors explore two-trait-locus, two-marker-locus linkage analysis in which two trait loci are mapped simultaneously to separate genetic markers. The authors compare the utility of this approach to standard one-trait-locus, one-marker-locus linkage analysis with and without allowance for heterogeneity. The authors also compare the utility of the two-trait-locus, two-marker-locus analysis to two-trait-locus, one-marker-locus linkage analysis. For common diseases, pedigrees are often bilineal, with disease genes entering via two or more unrelated pedigree members. Since such pedigrees often are avoided in linkage studies, the authors also investigate the relative information content of unilineal and bilineal pedigrees. For the dominant-or-recessive and threshold models that the authors consider, the authors find that two-trait-locus, two-marker-locus linkage analysis can provide substantially more linkage information, as measured by expected maximum lod score, than standard one-trait-locus, one-marker-locus methods, even allowing for heterogeneity, while, for a dominant-or-dominant generating model, one-locus models that allow for heterogeneity extract essentially as much information as the two-trait-locus methods. For these three models, the authors also find that bilineal pedigrees provide sufficient linkage information to warrant their inclusion in such studies. The authors discuss strategies for assessing the significance of the two linkages assumed in two-trait-locus, two-marker-locus models. 37 refs., 1 fig., 4 tabs.
Muleta, Kebede T; Bulli, Peter; Rynearson, Sheri; Chen, Xianming; Pumphrey, Michael
2017-01-01
Stripe rust, caused by Puccinia striiformis Westend. f. sp. tritici Erikss. (Pst) remains one of the most significant diseases of wheat worldwide. We investigated stripe rust resistance by genome-wide association analysis (GWAS) in 959 spring wheat accessions from the United States Department of Agriculture-Agricultural Research Service National Small Grains Collection, representing major global production environments. The panel was characterized for field resistance in multi-environment field trials and seedling resistance under greenhouse conditions. A genome-wide set of 5,619 informative SNP markers were used to examine the population structure, linkage disequilibrium and marker-trait associations in the germplasm panel. Based on model-based analysis of population structure and hierarchical Ward clustering algorithm, the accessions were clustered into two major subgroups. These subgroups were largely separated according to geographic origin and improvement status of the accessions. A significant correlation was observed between the population sub-clusters and response to stripe rust infection. We identified 11 and 7 genomic regions with significant associations with stripe rust resistance at adult plant and seedling stages, respectively, based on a false discovery rate multiple correction method. The regions harboring all, except three, of the QTL identified from the field and greenhouse studies overlap with positions of previously reported QTL. Further work should aim at validating the identified QTL using proper germplasm and populations to enhance their utility in marker assisted breeding. PMID:28591221
Cluster fusion-fission dynamics in the Singapore stock exchange
NASA Astrophysics Data System (ADS)
Teh, Boon Kin; Cheong, Siew Ann
2015-10-01
In this paper, we investigate how the cross-correlations between stocks in the Singapore stock exchange (SGX) evolve over 2008 and 2009 within overlapping one-month time windows. In particular, we examine how these cross-correlations change before, during, and after the Sep-Oct 2008 Lehman Brothers Crisis. To do this, we extend the complete-linkage hierarchical clustering algorithm to obtain robust clusters of stocks with stronger intracluster correlations and weaker intercluster correlations. After we identify the robust clusters in all time windows, we visualize how these change in the form of a fusion-fission diagram. Such a diagram depicts graphically how the cluster sizes evolve, the exchange of stocks between clusters, as well as how strongly the clusters mix. From the fusion-fission diagram, we see a giant cluster growing and disintegrating in the SGX, up till the Lehman Brothers Crisis in September 2008 and the market crashes of October 2008. After the Lehman Brothers Crisis, clusters in the SGX remain small for a few months before giant clusters emerge once again. In the aftermath of the crisis, we also find strong mixing of component stocks between clusters. As a result, the correlation between initially strongly-correlated pairs of stocks decays exponentially with an average lifetime of about a month. These observations impact strongly how portfolios and trading strategies should be formulated.
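As background for the clustering step, here is a minimal sketch of plain complete-linkage agglomerative clustering on correlation-derived distances. It does not reproduce the authors' extension for robust clusters; the distance 1 - rho, the stopping threshold, the stock labels, and the correlation values are all assumptions chosen for illustration.

```python
def complete_linkage(names, dist, threshold):
    """Agglomerative clustering where the distance between two clusters is the
    largest pairwise distance between their members (complete linkage).
    Merging stops once the closest pair of clusters is farther apart than threshold."""
    clusters = [[n] for n in names]

    def cluster_distance(a, b):
        return max(dist[frozenset((x, y))] for x in a for y in b)

    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                dij = cluster_distance(clusters[i], clusters[j])
                if best is None or dij < best[0]:
                    best = (dij, i, j)
        if best[0] > threshold:
            break
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


# Hypothetical correlations between four stocks (illustrative values only).
corr = {("A", "B"): 0.9, ("A", "C"): 0.8, ("B", "C"): 0.7,
        ("A", "D"): 0.1, ("B", "D"): 0.2, ("C", "D"): 0.0}
dist = {frozenset(pair): 1.0 - rho for pair, rho in corr.items()}
stock_clusters = complete_linkage(["A", "B", "C", "D"], dist, threshold=0.5)
```

Because complete linkage judges a merge by the weakest pairwise correlation, stock D stays apart from the strongly correlated group A, B, C; repeating such a clustering in each time window would give the cluster memberships that a fusion-fission diagram tracks.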
Ocular findings associated with a Cys39Arg mutation in the Norrie disease gene.
Joos, K M; Kimura, A E; Vandenburgh, K; Bartley, J A; Stone, E M
1994-12-01
To diagnose the carriers and noncarriers in a family affected with Norrie disease based on molecular analysis. Family members from three generations, including one affected patient, two obligate carriers, one carrier identified with linkage analysis, one noncarrier identified with linkage analysis, and one female family member with indeterminate carrier status, were examined clinically and electrophysiologically. Linkage analysis had previously failed to determine the carrier status of one female family member in the third generation. Blood samples were screened for mutations in the Norrie disease gene with single-strand conformation polymorphism analysis. The mutation was characterized by dideoxy-termination sequencing. Ophthalmoscopy and electroretinographic examination failed to detect the carrier state. The affected individuals and carriers in this family were found to have a transition from thymidine to cytosine in the first nucleotide of codon 39 of the Norrie disease gene, causing a cysteine-to-arginine mutation. Single-strand conformation polymorphism analysis identified a patient of indeterminate status (by linkage) to be a noncarrier of Norrie disease. Ophthalmoscopy and electroretinography could not identify carriers of this Norrie disease mutation. Single-strand conformation polymorphism analysis was more sensitive and specific than linkage analysis in identifying carriers in this family.
The relationship between carbon dioxide emission and economic growth: Hierarchical structure methods
NASA Astrophysics Data System (ADS)
Deviren, Seyma Akkaya; Deviren, Bayram
2016-06-01
Carbon dioxide (CO2) emission has an essential role in the current debate on sustainable development and environmental protection. CO2 emission is also directly linked with the use of energy, which plays a focal role both for production and consumption in the world economy. Therefore the relationship between CO2 emission and economic growth has significant implications for environmental and economic policies. In this study, within the scope of sociophysics, the topology, taxonomy and relationships among 33 countries, which have among the highest CO2 emission and economic growth values, are investigated by using hierarchical structure methods, such as the minimal spanning tree (MST) and hierarchical tree (HT), over the period of 1970-2010. The average linkage cluster analysis (ALCA) is also used to examine the cluster structure more clearly in HTs. According to their proximity, economic ties and economic growth, different clusters of countries are identified from the structural topologies of these trees. We have found that the high income & OECD countries are closely connected to each other and are isolated from the upper middle and lower middle income countries in the MSTs, which are obtained both for the CO2 emission and economic growth. Moreover, the high income & OECD clusters are homogeneous with respect to the economic activities and economic ties of the countries. It is also noted that the Group of Seven (G7) countries (CAN, ENG, FRA, GER, ITA, JPN, USA) are connected to each other and that these countries are located at the center of the MST for the results of CO2 emission. The same analysis may also be successfully applied to other environmental sources and different countries.
NASA Astrophysics Data System (ADS)
Lima, Carlos H. R.; AghaKouchak, Amir; Lall, Upmanu
2017-12-01
Floods are the main natural disaster in Brazil, causing substantial economic damage and loss of life. Studies suggest that some extreme floods result from a causal climate chain. Exceptional rain and floods are determined by large-scale anomalies and persistent patterns in the atmospheric and oceanic circulations, which influence the magnitude, extent, and duration of these extremes. Moreover, floods can result from different generating mechanisms. These factors contradict the assumptions of homogeneity, and often stationarity, in flood frequency analysis. Here we outline a methodological framework based on clustering using self-organizing maps (SOMs) that allows the linkage of large-scale processes to local-scale observations. The methodology is applied to flood data from several sites in the flood-prone Upper Paraná River basin (UPRB) in southern Brazil. The SOM clustering approach is employed to classify the 6-day rainfall field over the UPRB into four categories, which are then used to classify floods into four types based on the spatiotemporal dynamics of the rainfall field prior to the observed flood events. An analysis of the vertically integrated moisture fluxes, vorticity, and high-level atmospheric circulation revealed that these four clusters are related to known tropical and extratropical processes, including the South American low-level jet (SALLJ); extratropical cyclones; and the South Atlantic Convergence Zone (SACZ). Persistent anomalies in the sea surface temperature fields in the Pacific and Atlantic oceans are also found to be associated with these processes. Floods associated with each cluster present different patterns in terms of frequency, magnitude, spatial variability, scaling, and synchronization of events across the sites and subbasins. These insights suggest new directions for flood risk assessment, forecasting, and management.
Filliol, Ingrid; Motiwala, Alifiya S.; Cavatore, Magali; Qi, Weihong; Hazbón, Manzour Hernando; Bobadilla del Valle, Miriam; Fyfe, Janet; García-García, Lourdes; Rastogi, Nalin; Sola, Christophe; Zozio, Thierry; Guerrero, Marta Inírida; León, Clara Inés; Crabtree, Jonathan; Angiuoli, Sam; Eisenach, Kathleen D.; Durmaz, Riza; Joloba, Moses L.; Rendón, Adrian; Sifuentes-Osornio, José; Ponce de León, Alfredo; Cave, M. Donald; Fleischmann, Robert; Whittam, Thomas S.; Alland, David
2006-01-01
We analyzed a global collection of Mycobacterium tuberculosis strains using 212 single nucleotide polymorphism (SNP) markers. SNP nucleotide diversity was high (average across all SNPs, 0.19), and 96% of the SNP locus pairs were in complete linkage disequilibrium. Cluster analyses identified six deeply branching, phylogenetically distinct SNP cluster groups (SCGs) and five subgroups. The SCGs were strongly associated with the geographical origin of the M. tuberculosis samples and the birthplace of the human hosts. The most ancestral cluster (SCG-1) predominated in patients from the Indian subcontinent, while SCG-1 and another ancestral cluster (SCG-2) predominated in patients from East Asia, suggesting that M. tuberculosis first arose in the Indian subcontinent and spread worldwide through East Asia. Restricted SCG diversity and the prevalence of less ancestral SCGs in indigenous populations in Uganda and Mexico suggested a more recent introduction of M. tuberculosis into these regions. The East African Indian and Beijing spoligotypes were concordant with SCG-1 and SCG-2, respectively; X and Central Asian spoligotypes were also associated with one SCG or subgroup combination. Other clades had less consistent associations with SCGs. Mycobacterial interspersed repetitive unit (MIRU) analysis provided less robust phylogenetic information, and only 6 of the 12 MIRU microsatellite loci were highly differentiated between SCGs as measured by GST. Finally, an algorithm was devised to identify two minimal sets of either 45 or 6 SNPs that could be used in future investigations to enable global collaborations for studies on evolution, strain differentiation, and biological differences of M. tuberculosis. PMID:16385065
Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.
2015-01-01
High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569
Talmud, Philippa J; Hawe, Emma; Martin, Steve; Olivier, Michael; Miller, George J; Rubin, Edward M; Pennacchio, Len A; Humphries, Steve E
2002-11-15
Since triglycerides (TG) are a major independent risk factor for coronary heart disease, understanding their genetic and environmental determinants is of major importance. Mouse models indicate an inverse relationship between levels of the newly identified apolipoprotein AV (APOAV) and TG concentrations. We have examined the relative influence of human APOA5 variants on plasma lipids, compared to the impact of variation in APOC3 and APOA4 which lie in the same cluster. Single nucleotide polymorphisms (SNPs) in APOA5 (S19W, -1131T>C) and APOA4 (T347S, Q360H) and an APOA4/A5 intergenic T>C SNP were examined in a large study of healthy middle-aged men (n=2808). APOA5 19WW and -1131CC men had 52% and 40% higher TG (P<0.003) compared to common allele homozygotes, respectively, effects which were independent and additive. APOA4 347SS men had 23% lower TG compared to TT men (P<0.002). Haplotype analysis was carried out to identify TG-raising alleles and included, in addition, four previously genotyped APOC3 SNPs (-2845T>G, -482C>T, 1100C>T, and 3238C>G). The major TG-raising alleles were defined by APOA5 W19 and APOC3 -482T. This suggests that the TG-lowering effect of APOA4 S347 might merely reflect the strong negative linkage disequilibrium with the common alleles of these variants. Thus variation in APOA5 is associated with differences in TGs in healthy men, independent of those previously reported for APOC3, while association between APOA4 and TG reflects linkage disequilibrium with these sites. The molecular mechanisms for these effects remain to be determined.
Microseismic Event Grouping Based on PageRank Linkage at the Newberry Volcano Geothermal Site
NASA Astrophysics Data System (ADS)
Aguiar, A. C.; Myers, S. C.
2016-12-01
The Newberry Volcano DOE FORGE site in Central Oregon has been stimulated two times using high-pressure fluid injection to study the Enhanced Geothermal Systems (EGS) technology. Several hundred microseismic events were generated during the first stimulation in the fall of 2012. Initial locations of this microseismicity do not show well defined subsurface structure in part because event location uncertainties are large (Foulger and Julian, 2013). We focus on this stimulation to explore the spatial and temporal development of microseismicity, which is key to understanding how subsurface stimulation modifies stress, fractures rock, and increases permeability. We use PageRank, Google's initial search algorithm, to determine connectivity within the events (Aguiar and Beroza, 2014) and assess signal-correlation topology for the micro-earthquakes. We then use this information to create signal families and compare these to the spatial and temporal proximity of associated earthquakes. We relocate events within families (identified by PageRank linkage) using the Bayesloc approach (Myers et al., 2007). Preliminary relocations show tight spatial clustering of event families as well as evidence of events relocating to a different cluster than originally reported. We also find that signal similarity (linkage) at several stations, not just one or two, is needed in order to determine that events are in close proximity to one another. We show that indirect linkage of signals using PageRank is a reliable way to increase the number of events that are confidently determined to be similar to one another, which may lead to efficient and effective grouping of earthquakes with similar physical characteristics, such as focal mechanisms and stress drop. 
Our ultimate goal is to determine whether changes in the state of stress and/or changes in the generation of subsurface fracture networks can be detected using PageRank topology as well as aid in the event relocation to obtain more accurate subsurface structure. Prepared by LLNL under Contract DE-AC52-07NA27344. LLNL-ABS-699142.
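To make the idea of ranking events by signal connectivity concrete, here is a minimal power-iteration PageRank on an undirected similarity graph. This is a sketch only: the event names, the link structure, and the damping factor of 0.85 are assumptions for illustration, and the abstract does not describe how the authors build their graph from waveform correlations.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Power-iteration PageRank on an undirected graph.

    `links` maps each node to the set of nodes it is linked to. The sketch
    assumes every node has at least one link, so no rank is lost.
    """
    nodes = sorted(links)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        new_rank = {}
        for v in nodes:
            # Each neighbor u passes an equal share of its rank along each link.
            incoming = sum(rank[u] / len(links[u]) for u in links[v])
            new_rank[v] = (1.0 - damping) / n + damping * incoming
        rank = new_rank
    return rank


# Hypothetical events: B's waveform correlates with A, C and D, which do not
# correlate directly with one another, so they are linked only through B.
links = {
    "A": {"B"},
    "B": {"A", "C", "D"},
    "C": {"B"},
    "D": {"B"},
}
rank = pagerank(links)
for event in sorted(rank, key=rank.get, reverse=True):
    print(event, round(rank[event], 3))
```

In this toy graph the hub event B receives the highest rank, which is the sense in which indirect linkage can tie events A, C and D into one family.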
Identifying economics' place amongst academic disciplines: a science or a social science?
Hudson, John
2017-01-01
Different academic disciplines exhibit different styles, including styles in journal titles. Using data from the 2014 Research Excellence Framework (REF) in the UK we are able to identify the stylistic trends of different disciplines using 155,552 journal titles across all disciplines. Cluster analysis is then used to group the different disciplines together. The resulting identification fits the social sciences, the sciences and the arts and humanities reasonably well. Economics overall fits best with philosophy, but the linkage is weak. When we divide economics into papers published in theory, econometrics and the remaining journals, the first two link with mathematics and computer science, particularly econometrics, and thence the sciences. The rest of economics then links with business and thence the social sciences.
Genetic Population Structure Analysis in New Hampshire Reveals Eastern European Ancestry
Sloan, Chantel D.; Andrew, Angeline D.; Duell, Eric J.; Williams, Scott M.; Karagas, Margaret R.; Moore, Jason H.
2009-01-01
Genetic structure due to ancestry has been well documented among many divergent human populations. However, the ability to associate ancestry with genetic substructure without using supervised clustering has not been explored in more presumably homogeneous and admixed US populations. The goal of this study was to determine if genetic structure could be detected in a United States population from a single state where the individuals have mixed European ancestry. Using Bayesian clustering with a set of 960 single nucleotide polymorphisms (SNPs) we found evidence of population stratification in 864 individuals from New Hampshire that can be used to differentiate the population into six distinct genetic subgroups. We then correlated self-reported ancestry of the individuals with the Bayesian clustering results. Finnish and Russian/Polish/Lithuanian ancestries were most notably found to be associated with genetic substructure. The ancestral results were further explained and substantiated using New Hampshire census data from 1870 to 1930 when the largest waves of European immigrants came to the area. We also discerned distinct patterns of linkage disequilibrium (LD) between the genetic groups in the growth hormone receptor gene (GHR). To our knowledge, this is the first time such an investigation has uncovered a strong link between genetic structure and ancestry in what would otherwise be considered a homogenous US population. PMID:19738909
Booma, P. M.; Prabhakaran, S.; Dhanalakshmi, R.
2014-01-01
Microarray gene expression datasets have attracted great attention among molecular biologists, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A heuristic search for a standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rates of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using a proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify associations between the clusters. Moreover, GL-PCPHC applies a pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for the GL-PCPHC model with standard benchmark gene expression datasets extracted from the UCI repository and the GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality. PMID:25136661
The global transmission network of HIV-1.
Wertheim, Joel O; Leigh Brown, Andrew J; Hepler, N Lance; Mehta, Sanjay R; Richman, Douglas D; Smith, Davey M; Kosakovsky Pond, Sergei L
2014-01-15
Human immunodeficiency virus type 1 (HIV-1) is pandemic, but its contemporary global transmission network has not been characterized. A better understanding of the properties and dynamics of this network is essential for surveillance, prevention, and eventual eradication of HIV. Here, we apply a simple and computationally efficient network-based approach to all publicly available HIV polymerase sequences in the global database, revealing a contemporary picture of the spread of HIV-1 within and between countries. This approach automatically recovered well-characterized transmission clusters and extended other clusters thought to be contained within a single country across international borders. In addition, previously undescribed transmission clusters were discovered. Together, these clusters represent all known modes of HIV transmission. The extent of international linkage revealed by our comprehensive approach demonstrates the need to consider the global diversity of HIV, even when describing local epidemics. Finally, the speed of this method allows for near-real-time surveillance of the pandemic's progression.
Conclusion of LOD-score analysis for family data generated under two-locus models.
Dizier, M. H.; Babron, M. C.; Clerget-Darpoux, F.
1996-01-01
The power to detect linkage by the LOD-score method is investigated here for diseases that depend on the effects of two genes. The classical strategy is, first, to detect a major-gene (MG) effect by segregation analysis and, second, to seek linkage with genetic markers by the LOD-score method using the MG parameters. We already showed that segregation analysis can lead to evidence for an MG effect for many two-locus models, with the estimates of the MG parameters being very different from those of the two genes involved in the disease. We show here that use of these MG parameter estimates in the LOD-score analysis may lead to a failure to detect linkage for some two-locus models. For these models, use of the sib-pair method gives a non-negligible increase of power to detect linkage. The linkage-homogeneity test among subsamples differing for the familial disease distribution provides evidence of parameter misspecification when the MG parameters are used. Moreover, for most of the models, use of the MG parameters in LOD-score analysis leads to a large bias in estimation of the recombination fraction and sometimes also to a rejection of linkage for the true recombination fraction. A final important point is that strong evidence of an MG effect, obtained by segregation analysis, does not necessarily imply that linkage will be detected for at least one of the two genes, even with the true parameters and with a close informative marker. PMID:8651311
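For readers unfamiliar with the LOD-score method discussed in this abstract, the sketch below computes a two-point LOD score in the simplest textbook setting: phase-known, fully informative meioses, where the score is log10 of the likelihood at recombination fraction theta divided by the likelihood at theta = 0.5. This setting is an assumption made for illustration. The analysis in the abstract depends on disease-model (MG) parameters and pedigree likelihoods, which are not modeled here, and the counts used are hypothetical.

```python
import math


def lod_score(recombinants, meioses, theta):
    """Two-point LOD score for phase-known, fully informative meioses.

    LOD(theta) = log10[ theta^k * (1 - theta)^(n - k) / 0.5^n ],
    where k is the number of recombinants among n meioses.
    """
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must lie in (0, 0.5]")
    non_recombinants = meioses - recombinants
    log_l_theta = (recombinants * math.log10(theta)
                   + non_recombinants * math.log10(1.0 - theta))
    log_l_null = meioses * math.log10(0.5)  # no linkage: theta = 0.5
    return log_l_theta - log_l_null


def max_lod(recombinants, meioses):
    """Scan theta over 0.01..0.50 and return (best theta, maximum LOD)."""
    grid = [i / 100 for i in range(1, 51)]
    best = max(grid, key=lambda t: lod_score(recombinants, meioses, t))
    return best, lod_score(recombinants, meioses, best)


# Hypothetical data: 1 recombinant observed in 10 informative meioses.
theta_hat, z_max = max_lod(1, 10)
print(theta_hat, round(z_max, 2))
```

Because the score is maximized over theta, a likelihood built on the wrong model can shift both the maximum and its location, which is the bias in the recombination fraction that the abstract reports.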
Meta-analysis of 32 genome-wide linkage studies of schizophrenia
Ng, MYM; Levinson, DF; Faraone, SV; Suarez, BK; DeLisi, LE; Arinami, T; Riley, B; Paunio, T; Pulver, AE; Irmansyah; Holmans, PA; Escamilla, M; Wildenauer, DB; Williams, NM; Laurent, C; Mowry, BJ; Brzustowicz, LM; Maziade, M; Sklar, P; Garver, DL; Abecasis, GR; Lerer, B; Fallin, MD; Gurling, HMD; Gejman, PV; Lindholm, E; Moises, HW; Byerley, W; Wijsman, EM; Forabosco, P; Tsuang, MT; Hwu, H-G; Okazaki, Y; Kendler, KS; Wormley, B; Fanous, A; Walsh, D; O’Neill, FA; Peltonen, L; Nestadt, G; Lasseter, VK; Liang, KY; Papadimitriou, GM; Dikeos, DG; Schwab, SG; Owen, MJ; O’Donovan, MC; Norton, N; Hare, E; Raventos, H; Nicolini, H; Albus, M; Maier, W; Nimgaonkar, VL; Terenius, L; Mallet, J; Jay, M; Godard, S; Nertney, D; Alexander, M; Crowe, RR; Silverman, JM; Bassett, AS; Roy, M-A; Mérette, C; Pato, CN; Pato, MT; Roos, J Louw; Kohn, Y; Amann-Zalcenstein, D; Kalsi, G; McQuillin, A; Curtis, D; Brynjolfson, J; Sigmundsson, T; Petursson, H; Sanders, AR; Duan, J; Jazin, E; Myles-Worsley, M; Karayiorgou, M; Lewis, CM
2009-01-01
A genome scan meta-analysis (GSMA) was carried out on 32 independent genome-wide linkage scan analyses that included 3255 pedigrees with 7413 genotyped cases affected with schizophrenia (SCZ) or related disorders. The primary GSMA divided the autosomes into 120 bins, rank-ordered the bins within each study according to the most positive linkage result in each bin, summed these ranks (weighted for study size) for each bin across studies and determined the empirical probability of a given summed rank (PSR) by simulation. Suggestive evidence for linkage was observed in two single bins, on chromosomes 5q (142-168 Mb) and 2q (103-134 Mb). Genome-wide evidence for linkage was detected on chromosome 2q (119-152 Mb) when bin boundaries were shifted to the middle of the previous bins. The primary analysis met empirical criteria for ‘aggregate’ genome-wide significance, indicating that some or all of 10 bins are likely to contain loci linked to SCZ, including regions of chromosomes 1, 2q, 3q, 4q, 5q, 8p and 10q. In a secondary analysis of 22 studies of European-ancestry samples, suggestive evidence for linkage was observed on chromosome 8p (16-33 Mb). Although the newer genome-wide association methodology has greater power to detect weak associations to single common DNA sequence variants, linkage analysis can detect diverse genetic effects that segregate in families, including multiple rare variants within one locus or several weakly associated loci in the same region. Therefore, the regions supported by this meta-analysis deserve close attention in future studies. PMID:19349958
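The rank-and-sum procedure described in this abstract can be sketched in a few lines. The example below ranks bins within each study, sums weighted ranks per bin, and estimates the empirical probability of each summed rank by shuffling ranks within studies. The three toy studies, five bins, unit weights, and number of simulations are all hypothetical, and ties are broken arbitrarily rather than averaged.

```python
import random


def rank_bins(scores):
    """Rank bins within one study; the strongest linkage score gets the top rank."""
    order = sorted(range(len(scores)), key=lambda b: scores[b])
    ranks = [0] * len(scores)
    for position, b in enumerate(order, start=1):
        ranks[b] = position
    return ranks


def summed_ranks(studies, weights):
    """Weighted sum of within-study ranks for each bin across studies."""
    per_study = [rank_bins(s) for s in studies]
    n_bins = len(studies[0])
    return [sum(w * r[b] for w, r in zip(weights, per_study))
            for b in range(n_bins)]


def empirical_p(studies, weights, n_sim=2000, seed=0):
    """Empirical probability of a summed rank at least as large as observed.

    Under the null, ranks are assigned to bins at random within each study.
    """
    rng = random.Random(seed)
    observed = summed_ranks(studies, weights)
    n_bins = len(observed)
    exceed = [0] * n_bins
    for _ in range(n_sim):
        sim = [0.0] * n_bins
        for w in weights:
            perm = list(range(1, n_bins + 1))
            rng.shuffle(perm)
            for b in range(n_bins):
                sim[b] += w * perm[b]
        for b in range(n_bins):
            if sim[b] >= observed[b]:
                exceed[b] += 1
    return [count / n_sim for count in exceed]


# Hypothetical maximum linkage scores per bin for three studies (five bins each).
studies = [
    [0.1, 0.5, 3.0, 0.2, 0.0],
    [0.3, 0.2, 2.5, 1.0, 0.1],
    [0.0, 0.4, 2.8, 0.3, 0.9],
]
weights = [1, 1, 1]  # placeholder for weights based on study size

print(summed_ranks(studies, weights))
print(empirical_p(studies, weights))
```

Bin 2 ranks first in every toy study, so its summed rank is the maximum possible and its empirical probability is correspondingly small.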
Sillén, Anna; Brohede, Jesper; Forsell, Charlotte; Lilius, Lena; Andrade, Jorge; Odeberg, Jacob; Kimura, Toru; Winblad, Bengt; Graff, Caroline
2011-01-01
We have previously reported the results of an extended genome-wide scan of Swedish Alzheimer disease (AD)-affected families; in this paper, we analyzed a subset of these families with autopsy-confirmed AD. We report the fine-mapping, using both microsatellite markers and single-nucleotide polymorphisms (SNPs), in the observed maximum logarithm of the odds (LOD)-2 unit (LOD(max)-2) region under the identified linkage peak, linkage analysis of the fine-mapping data with additionally analyzed pedigrees, and association analysis of SNPs selected from candidate genes in the linked interval. The subset was made on the criterion of at least one autopsy-confirmed AD case per family, resulting in 24 families. Linkage analysis of a family subset having at least one autopsy-confirmed AD case showed a significant nonparametric single-point LOD score of 4.4 in 8q24. Fine-mapping under the linkage peak with 10 microsatellite markers yielded an increase in the multipoint (mpt) LOD score from 2.1 to 3.0. SNP genotyping was performed on 21 selected candidate transcripts of the LOD(max)-2 region. Both family-based association and linkage analysis were performed on extended material from 30 families, resulting in a suggestive linkage at peak marker rs6577853 (mpt LOD score = 2.4). The 8q24 region has been implicated to be involved in AD etiology. Copyright © 2011 S. Karger AG, Basel.
Young star clusters in the circumnuclear region of NGC 2110
DOE Office of Scientific and Technical Information (OSTI.GOV)
Durré, Mark; Mould, Jeremy, E-mail: mdurre@swin.edu.au
2014-03-20
High-resolution observations in the near infrared show star clusters around the active galactic nucleus (AGN) of the Seyfert 1 NGC 2110, along with a 90 × 35 pc bar of shocked gas material around its nucleus. These are seen for the first time in our imaging and gas kinematics of the central 100 pc with the Keck OSIRIS instrument with adaptive optics. Each of these clusters is two to three times brighter than the Arches cluster close to the center of the Milky Way. The core star formation rate is 0.3 M☉ yr⁻¹. The photoionized gas (He I) dynamics imply an enclosed mass of 3-4 × 10⁸ M☉. These observations demonstrate the physical linkage between AGN feedback, which triggers star formation in massive clusters, and the resulting stellar (and supernovae) winds, which cause the observed [Fe II] emission and feed the black hole.
Numerical taxonomy and ecology of petroleum-degrading bacteria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Austin, B.; Calomiris, J.J.; Walker, J.D.
1977-07-01
A total of 99 strains of petroleum-degrading bacteria isolated from Chesapeake Bay water and sediment were identified by using numerical taxonomy procedures. The isolates, together with 33 reference cultures, were examined for 48 biochemical, cultural, morphological, and physiological characters. The data were analyzed by computer, using both the simple matching and the Jaccard coefficients. Clustering was achieved by the unweighted average linkage method. From the sorted similarity matrix and dendrogram, 14 phenetic groups, comprising 85 of the petroleum-degrading bacteria, were defined at the 80 to 85% similarity level. These groups were identified as actinomycetes (mycelial forms, four clusters), coryneforms, Enterobacteriaceae, Klebsiella aerogenes, Micrococcus spp. (two clusters), Nocardia species (two clusters), Pseudomonas spp. (two clusters), and Sphaerotilus natans. It is concluded that the degradation of petroleum is accomplished by a diverse range of bacterial taxa, some of which were isolated only at given sampling stations and, more specifically, from sediment collected at a given station.
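The two similarity coefficients named in this abstract differ only in how they treat characters that are negative in both strains. The short sketch below computes both for a pair of binary character profiles. The six-character profiles are invented for illustration, whereas the study scored 48 characters per strain.

```python
def simple_matching(a, b):
    """Fraction of characters on which two strains agree (shared 0s count)."""
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def jaccard(a, b):
    """Shared positives divided by characters positive in either strain.

    Characters negative in both strains are ignored.
    """
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0


# Hypothetical test results (1 = positive, 0 = negative) for two strains.
strain_a = [1, 1, 0, 0, 1, 0]
strain_b = [1, 0, 0, 0, 1, 1]

print(round(simple_matching(strain_a, strain_b), 3))
print(jaccard(strain_a, strain_b))
```

A matrix of such pairwise similarities is the input to an average linkage clustering step, in which the similarity between two groups is the mean over all pairs of their members.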
Large-scale linkage analysis of 1302 affected relative pairs with rheumatoid arthritis
Hamshere, Marian L; Segurado, Ricardo; Moskvina, Valentina; Nikolov, Ivan; Glaser, Beate; Holmans, Peter A
2007-01-01
Rheumatoid arthritis is the most common systemic autoimmune disease and its etiology is believed to have both strong genetic and environmental components. We demonstrate the utility of including genetic and clinical phenotypes as covariates within a linkage analysis framework to search for rheumatoid arthritis susceptibility loci. The raw genotypes of 1302 affected relative pairs were combined from four large family-based samples (North American Rheumatoid Arthritis Consortium, United Kingdom, European Consortium on Rheumatoid Arthritis Families, and Canada). The familiality of the clinical phenotypes was assessed. The affected relative pairs were subjected to autosomal multipoint affected relative-pair linkage analysis. Covariates were included in the linkage analysis to take account of heterogeneity within the sample. Evidence of familiality was observed with age at onset (p << 0.001) and rheumatoid factor (RF) IgM (p << 0.001), but not definite erosions (p = 0.21). Genome-wide significant evidence for linkage was observed on chromosome 6. Genome-wide suggestive evidence for linkage was observed on chromosomes 13 and 20 when conditioning on age at onset, chromosome 15 conditional on gender, and chromosome 19 conditional on RF IgM after allowing for multiple testing of covariates. PMID:18466440
NASA Astrophysics Data System (ADS)
Stauffer, R. M.; Thompson, A. M.; Young, G. S.; Oltmans, S. J.; Johnson, B.
2016-12-01
Ozone (O3) climatologies are typically created by averaging ozonesonde profiles on a monthly or seasonal basis, either for specific regions or zonally. We demonstrate the advantages of using a statistical clustering technique, self-organizing maps (SOM), over this simple averaging, through analysis of more than 4500 sonde profiles taken from the long-term US sites at Boulder, CO; Huntsville, AL; Trinidad Head, CA; and Wallops Island, VA. First, we apply SOM to O3 mixing ratios from surface to 12 km amsl. At all four sites, profiles in SOM clusters exhibit similar tropopause height, 500 hPa height and temperature, and total and tropospheric column O3. Second, when profiles from each SOM cluster are compared to monthly O3 means, near-tropopause O3 in three of the clusters is double (over +100 ppbv) the climatological O3 mixing ratio. The three clusters include 13-16% of all profiles, mostly from winter and spring. Large mid-tropospheric deviations from monthly means are found in two highly-populated clusters that represent either distinctly polluted (summer) or clean O3 (fall-winter, high tropopause) profiles. Thus, SOM indeed appear to represent US O3 profile statistics better than conventional climatologies. In the case of Trinidad Head, SOM clusters of O3 profile data from the lower troposphere (surface-6 km amsl) can discriminate background vs polluted O3 and the meteorology associated with each. Two of nine O3 clusters exhibit thin layers (100s of m thick) of high O3, typically between 1 and 4 km. Comparisons between clusters and downwind, high-altitude surface O3 measurements display a marked impact of the elevated tropospheric O3. Days corresponding to the high O3 clusters exhibit hourly surface O3 anomalies at surface sites of +5-10 ppbv compared to a climatology; the anomalies can last up to four days. We also explore applications of SOM to tropical ozonesonde profiles, where tropospheric O3 variability is generally smaller.
Berger, W; van Duijnhoven, G; Pinckers, A; Smits, A; Ropers, H H; Cremers, F
1995-01-01
Linkage analysis has been performed in a large Dutch pedigree with X-linked recessive congenital stationary night blindness (CSNB) by utilizing 16 DNA markers from the proximal short arm of the human X chromosome (Xp21.1-11.2). Thirteen polymorphic markers are at least partially informative and have enabled pairwise and multipoint linkage analysis. For three loci, i.e. DXS228, the monoamine oxidase B gene and the Norrie disease gene (NDG), multipoint linkage studies have yielded maximum lod scores of > 3.0 at a recombination fraction of zero. Analysis of recombination events has enabled us to rule out the possibility that the underlying defect in this family is allelic to RP3; the gene defect could also be excluded from the proximal part of the region known to carry RP2. Linkage data are consistent with a possible involvement of the NDG but mutations in the open reading frame of this gene have not been found.
Hühn, M
1995-05-01
Some approaches to molecular marker-assisted linkage detection for a dominant disease-resistance trait based on a segregating F2 population are discussed. Analysis of two-point linkage is carried out by the traditional measure of maximum lod score. It depends on (1) the maximum-likelihood estimate of the recombination fraction between the marker and the disease-resistance gene locus, (2) the observed absolute frequencies, and (3) the unknown number of tested individuals. If one replaces the absolute frequencies by expressions depending on the unknown sample size and the maximum-likelihood estimate of recombination value, the conventional rule for significant linkage (maximum lod score exceeds a given linkage threshold) can be resolved for the sample size. For each sub-population used for linkage analysis [susceptible (= recessive) individuals, resistant (= dominant) individuals, complete F2] this approach gives a lower bound for the necessary number of individuals required for the detection of significant two-point linkage by the lod-score method.
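The idea of resolving the lod-score rule for the sample size can be illustrated in a simpler setting than the F2 design treated in the abstract above. The sketch below assumes a phase-known, backcross-type situation in which each individual can be scored directly as recombinant or non-recombinant, so the likelihood is binomial; this model, the function names, and the default threshold of 3.0 are assumptions made for illustration and do not reproduce the F2 formulas of the paper.

```python
import math


def _log10_likelihood(recombinants, n, theta):
    """log10 of theta^k * (1 - theta)^(n - k), treating 0 * log(0) as 0."""
    if recombinants > 0 and theta == 0.0:
        return float("-inf")
    value = 0.0
    if recombinants > 0:
        value += recombinants * math.log10(theta)
    if n - recombinants > 0:
        value += (n - recombinants) * math.log10(1.0 - theta)
    return value


def max_lod(recombinants, n):
    """Lod score evaluated at the maximum-likelihood estimate of theta."""
    theta_hat = min(recombinants / n, 0.5)  # theta is restricted to [0, 0.5]
    linked = _log10_likelihood(recombinants, n, theta_hat)
    unlinked = n * math.log10(0.5)  # free recombination, theta = 0.5
    return linked - unlinked


def min_sample_size(theta_hat, threshold=3.0):
    """Smallest n whose expected maximum lod score reaches the threshold.

    Replaces the observed count of recombinants by n * theta_hat, so the
    lod score becomes n times a per-individual contribution.
    """
    if not 0.0 <= theta_hat < 0.5:
        raise ValueError("theta_hat must lie in [0, 0.5)")
    per_individual = math.log10(2.0)
    if theta_hat > 0.0:
        per_individual += theta_hat * math.log10(theta_hat)
        per_individual += (1.0 - theta_hat) * math.log10(1.0 - theta_hat)
    return math.ceil(threshold / per_individual)


print(max_lod(0, 10), min_sample_size(0.0), min_sample_size(0.1))
```

Under these assumptions the required number of individuals grows quickly as the estimated recombination fraction approaches 0.5, because each individual then contributes almost nothing to the lod score.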
Prioritizing Tiger Conservation through Landscape Genetics and Habitat Linkages
Yumnam, Bibek; Jhala, Yadvendradev V.; Qureshi, Qamar; Maldonado, Jesus E.; Gopal, Rajesh; Saini, Swati; Srinivas, Y.; Fleischer, Robert C.
2014-01-01
Even with global support for tiger (Panthera tigris) conservation their survival is threatened by poaching, habitat loss and isolation. Currently about 3,000 wild tigers persist in small fragmented populations within seven percent of their historic range. Identifying and securing habitat linkages that connect source populations for maintaining landscape-level gene flow is an important long-term conservation strategy for endangered carnivores. However, habitat corridors that link regional tiger populations are often lost to development projects due to lack of objective evidence on their importance. Here, we use individual based genetic analysis in combination with landscape permeability models to identify and prioritize movement corridors across seven tiger populations within the Central Indian Landscape. By using a panel of 11 microsatellites we identified 169 individual tigers from 587 scat and 17 tissue samples. We detected four genetic clusters within Central India with limited gene flow among three of them. Bayesian and likelihood analyses identified 17 tigers as having recent immigrant ancestry. Spatially explicit tiger occupancy obtained from extensive landscape-scale surveys across 76,913 km² of forest habitat was found to be only 21,290 km². After accounting for detection bias, the covariates that best explained tiger occupancy were large, remote, dense forest patches; large ungulate abundance, and low human footprint. We used tiger occupancy probability to parameterize habitat permeability for modeling habitat linkages using least-cost and circuit theory pathway analyses. Pairwise genetic differences (FST) between populations were better explained by modeled linkage costs (r>0.5, p<0.05) compared to Euclidean distances, which was in consonance with observed habitat fragmentation. The results of our study highlight that many corridors may still be functional as there is evidence of contemporary migration.
Conservation efforts should provide legal status to corridors, use smart green infrastructure to mitigate development impacts, and restore habitats where connectivity has been lost. PMID:25393234
NASA Astrophysics Data System (ADS)
Yang, Rui; Li, Xiangyang; Zhang, Tong
2014-10-01
This paper uses two physics-derived techniques, the minimum spanning tree and the hierarchical tree, to investigate the networks formed by CITIC (China International Trust and Investment Corporation) industry indices in three periods from 2006 to 2013. The study demonstrates that obvious industry clustering effects exist in the networks, and Durable Consumer Goods, Industrial Products, Information Technology, Frequently Consumption and Financial Industry are the core nodes in the networks. We also use the rolling window technique to investigate the dynamic evolution of the networks' stability, by calculating the mean correlations and mean distances, as well as the variance of correlations and the distances of these indices. China's stock market is still immature and subject to administrative interventions. Therefore, through this analysis, regulators can focus on monitoring the core nodes to ensure the overall stability of the entire market, while investors can enhance their portfolio allocations or investment decision-making.
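A minimum spanning tree of the kind mentioned in the abstract above is usually built from a correlation matrix by first converting correlations to distances. The sketch below assumes the commonly used transformation d = sqrt(2(1 - rho)) and Prim's algorithm; the abstract does not state which distance or algorithm the authors used, and the four index labels and correlation values are hypothetical.

```python
import math


def correlation_distance(rho):
    """Map a correlation in [-1, 1] to a distance in [0, 2] (assumed metric)."""
    return math.sqrt(2.0 * (1.0 - rho))


def minimum_spanning_tree(labels, corr):
    """Prim's algorithm on the complete graph defined by a correlation matrix.

    Returns a list of (label_i, label_j, distance) edges, n - 1 in total.
    """
    n = len(labels)
    in_tree = {0}  # start from the first node; the tree weight does not depend on it
    edges = []
    while len(in_tree) < n:
        best = None
        for i in in_tree:
            for j in range(n):
                if j in in_tree:
                    continue
                d = correlation_distance(corr[i][j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        in_tree.add(j)
        edges.append((labels[i], labels[j], d))
    return edges


# Hypothetical correlations between four industry indices.
labels = ["A", "B", "C", "D"]
corr = [
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.8, 0.3],
    [0.2, 0.8, 1.0, 0.7],
    [0.1, 0.3, 0.7, 1.0],
]

tree = minimum_spanning_tree(labels, corr)
for a, b, d in tree:
    print(a, b, round(d, 3))
```

Nodes that end up with many tree edges are the natural candidates for "core nodes", since the tree keeps only the strongest correlation link that attaches each index to the rest of the network.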
Knight, Jo; North, Bernard V; Sham, Pak C; Curtis, David
2003-12-31
This paper presents a method of performing model-free LOD-score based linkage analysis on quantitative traits. It is implemented in the QMFLINK program. The method is used to perform a genome screen on the Framingham Heart Study data. A number of markers that show some support for linkage in our study coincide substantially with those implicated in other linkage studies of hypertension. Although the new method needs further testing on additional real and simulated data sets we can already say that it is straightforward to apply and may offer a useful complementary approach to previously available methods for the linkage analysis of quantitative traits. PMID:14975142
Structural origin underlying poor glass forming ability of Al metallic glass
NASA Astrophysics Data System (ADS)
Li, F.; Liu, X. J.; Hou, H. Y.; Chen, G.; Chen, G. L.
2011-07-01
We performed molecular dynamics simulations to study the glass formation and local atomic structure of rapidly quenched Al. Both potential energy and structural parameters indicate that the glass transition temperature of amorphous Al is as low as 300 K, which may lead to the poor thermal stability of the amorphous Al as it is prone to crystallize even at room temperature. Voronoi polyhedra analysis reveals that the most popular polyhedron is the deformed body-centered cubic (bcc) cluster characterized by the index < 0, 3, 6, 4 > in the amorphous Al, while the icosahedron with the index < 0, 0, 12, 0 > is always predominant in bulk metallic glass formers with excellent glass forming ability (GFA). Moreover, these deformed-bcc short-range orders can make up medium-range orders via the linkage of vertex-, edge-, face-, intercrossed-shared atoms, which are believed to more easily transform into face-centered cubic (fcc) Al nanocrystal compared with the icosahedral clusters in terms of the symmetrical similarity between bcc and fcc structures. This finding could unveil the structural origin of poor GFA of Al-based alloys.
Highly conserved non-coding elements on either side of SOX9 associated with Pierre Robin sequence.
Benko, Sabina; Fantes, Judy A; Amiel, Jeanne; Kleinjan, Dirk-Jan; Thomas, Sophie; Ramsay, Jacqueline; Jamshidi, Negar; Essafi, Abdelkader; Heaney, Simon; Gordon, Christopher T; McBride, David; Golzio, Christelle; Fisher, Malcolm; Perry, Paul; Abadie, Véronique; Ayuso, Carmen; Holder-Espinasse, Muriel; Kilpatrick, Nicky; Lees, Melissa M; Picard, Arnaud; Temple, I Karen; Thomas, Paul; Vazquez, Marie-Paule; Vekemans, Michel; Roest Crollius, Hugues; Hastie, Nicholas D; Munnich, Arnold; Etchevers, Heather C; Pelet, Anna; Farlie, Peter G; Fitzpatrick, David R; Lyonnet, Stanislas
2009-03-01
Pierre Robin sequence (PRS) is an important subgroup of cleft palate. We report several lines of evidence for the existence of a 17q24 locus underlying PRS, including linkage analysis results, a clustering of translocation breakpoints 1.06-1.23 Mb upstream of SOX9, and microdeletions both approximately 1.5 Mb centromeric and approximately 1.5 Mb telomeric of SOX9. We have also identified a heterozygous point mutation in an evolutionarily conserved region of DNA with in vitro and in vivo features of a developmental enhancer. This enhancer is centromeric to the breakpoint cluster and maps within one of the microdeletion regions. The mutation abrogates the in vitro enhancer function and alters binding of the transcription factor MSX1 as compared to the wild-type sequence. In the developing mouse mandible, the 3-Mb region bounded by the microdeletions shows a regionally specific chromatin decompaction in cells expressing Sox9. Some cases of PRS may thus result from developmental misexpression of SOX9 due to disruption of very-long-range cis-regulatory elements.
Beau De Rochars, Valery Madsen; Lednicky, John; White, Sarah; Loeb, Julia; Elbadry, Maha A; Telisma, Taina; Chavannes, Sonese; Anilis, Marie Gina; Cella, Eleonora; Ciccozzi, Massimo; Okech, Bernard A; Salemi, Marco; Morris, J Glenn
2017-01-11
Human coronavirus (HCoV) NL63 is recognized as a common cause of upper respiratory infections and influenza-like illness. In screening children with acute undifferentiated febrile illness in a school cohort in rural Haiti, we identified HCoV-NL63 in blood samples from four children. Cases clustered over an 11-day period; children did not have respiratory symptoms, but two had gastrointestinal complaints. On phylogenetic analysis, the Haitian HCoV-NL63 strains cluster together in a highly supported monophyletic clade linked most closely with recently reported strains from Malaysia; two respiratory HCoV-NL63 strains identified in north Florida in the same general period form a separate clade, albeit again with close linkages with the Malaysian strains. Our data highlight the variety of presentations that may be seen with HCoV-NL63, and underscore the apparent ease with which CoV strains move among countries, with our data consistent with recurrent introduction of strains into the Caribbean (Haiti and Florida) from Asia. © The American Society of Tropical Medicine and Hygiene.
Genetic Studies of Stuttering in a Founder Population
Wittke-Thompson, Jacqueline K.; Ambrose, Nicoline; Yairi, Ehud; Roe, Cheryl; Cook, Edwin H.; Ober, Carole; Cox, Nancy J.
2007-01-01
Genome-wide linkage and association analyses were conducted to identify genetic determinants of stuttering in a founder population in which 48 individuals affected with stuttering are connected in a single 232-person genealogy. A novel approach was devised to account for all necessary relationships to enable multipoint linkage analysis. Regions with nominal evidence for linkage were found on chromosomes 3 (P=0.013, 208.8 centiMorgans (cM)), 13 (P=0.012, 52.6 cM), and 15 (P=0.02, 100 cM). Regions with nominal evidence for association with stuttering that overlapped with a linkage signal are located on chromosomes 3 (P=0.0047, 195 cM), 9 (P=0.0067, 46.5 cM), and 13 (P=0.0055, 52.6 cM). We also conducted the first meta-analysis for stuttering using results from linkage studies in the Hutterites and The Illinois International Genetics of Stuttering Project and identified regions with nominal evidence for linkage on chromosomes 2 (P=0.013, 180–195 cM) and 5 (P=0.0051, 105–120 cM; P=0.015, 120–135 cM). None of the linkage signals detected in the Hutterite sample alone, or in the meta-analysis, meet genome-wide criteria for significance, although some of the stronger signals overlap linkage mapping signals previously reported for other speech and language disorders. PMID:17276504
A taxonomy of hospitals participating in Medicare accountable care organizations.
Bazzoli, Gloria J; Harless, David W; Chukmaitov, Askar S
2017-03-03
Medicare was an early innovator of accountable care organizations (ACOs), establishing the Medicare Shared Savings Program (MSSP) and Pioneer programs in 2012-2013. Existing research has documented that ACOs bring together an array of health providers with hospitals serving as important participants. Hospitals vary markedly in their service structure and organizational capabilities, and thus, one would expect hospital ACO participants to vary in these regards. Our research identifies hospital subgroups that share certain capabilities and competencies. Such research, in conjunction with existing ACO research, provides deeper understanding of the structure and operation of these organizations. Given that Medicare was an initiator of the ACO concept, our findings provide a baseline to track the evolution of ACO hospitals over time. Hierarchical clustering methods are used in separate analyses of MSSP and Pioneer ACO hospitals. Hospitals participating in ACOs with 2012-2013 start dates are identified through multiple sources. Study data come from the Centers for Medicare and Medicaid Services, American Hospital Association, and Health Information and Management Systems Society. Five-cluster solutions were developed separately for the MSSP and Pioneer hospital samples. Both the MSSP and Pioneer taxonomies had several clusters with high levels of health information technology capabilities. Also distinct clusters with strong physician linkages were present. We examined Pioneer ACO hospitals that subsequently left the program and found that they commonly had low levels of ambulatory care services or health information technology. Distinct subgroups of hospitals exist in both the MSSP and Pioneer programs, suggesting that individual hospitals serve different roles within an ACO. Health information technology and physician linkages appear to be particularly important features in ACO hospitals. 
ACOs need to consider not only geographic and service mix when selecting hospital participants but also their vertical integration features and management competencies.
Shatruk, Mikhail; Dragulescu-Andrasi, Alina; Chambers, Kristen E; Stoian, Sebastian A; Bominaar, Emile L; Achim, Catalina; Dunbar, Kim R
2007-05-16
Pentanuclear, cyanide-bridged clusters [M(tmphen)2]3[M'(CN)6]2 (M/M' = Zn/Cr (1), Zn/Fe (2), Fe/Fe (3), Fe/Co (4), and Fe/Cr (5); tmphen = 3,4,7,8-tetramethyl-1,10-phenanthroline) were prepared by combining [M'III(CN)6]3- anions with mononuclear complexes of MII ions with two capping tmphen ligands. The clusters consist of a trigonal bipyramidal (TBP) core with three MII ions in the equatorial positions and two M'III ions in the axial positions. Compounds 1-4 are isostructural and crystallize in the monoclinic space group P21/c. Complex 5 crystallizes in the enantiomorphic space group P3221. The magnetic properties of compounds 1 and 2 reflect the contributions of the individual [CrIII(CN)6]3- and [FeIII(CN)6]3- ions. The FeII ions in compounds 3 and 4 exhibit a gradual, temperature-induced spin transition between high spin (HS) and low spin (LS), as determined by the combination of Mössbauer spectroscopy, magnetic measurements, and single-crystal X-ray studies. The investigation of compound 5 by these methods and by IR spectroscopy indicates that cyanide linkage isomerism occurs during cluster formation. The magnetic behavior of 5 is determined by weak ferromagnetic coupling between the axial CrIII centers mediated by the equatorial diamagnetic FeII ions. Mössbauer spectra collected in the presence of a high applied field have allowed, for the first time, the direct experimental observation of uncompensated spin density at diamagnetic metal ions that bridge paramagnetic metal ions.
Ion Mobility Mass Spectrometry Analysis of Isomeric Disaccharide Precursor, Product and Cluster Ions
Li, Hongli; Bendiak, Brad; Siems, William F.; Gang, David R.; Hill, Herbert H.
2015-01-01
RATIONALE Carbohydrates are highly variable in structure owing to differences in their anomeric configurations, monomer stereochemistry, inter-residue linkage positions and general branching features. The separation of carbohydrate isomers poses a great challenge for current analytical techniques. METHODS The isomeric heterogeneity of disaccharide ions and monosaccharide-glycolaldehyde product ions was evaluated using electrospray traveling wave ion mobility mass spectrometry (Synapt G2 high definition mass spectrometer) in both positive and negative ion modes. RESULTS The separation of isomeric disaccharide ions was observed but not fully achieved based on their mobility profiles. The mobilities of isomeric product ions, the monosaccharide-glycolaldehydes, derived from different disaccharide isomers were measured. Multiple mobility peaks were observed for both monosaccharide-glycolaldehyde cations and anions, indicating that there was more than one structural configuration in the gas phase as verified by NMR in solution. More importantly, the mobility patterns for isomeric monosaccharide-glycolaldehyde product ions were different, which enabled partial characterization of their respective disaccharide ions. Abundant disaccharide cluster ions were also observed. The results showed that a majority of isomeric cluster ions had different drift times and, moreover, more than one mobility peak was detected for a number of specific cluster ions. CONCLUSIONS It is demonstrated that ion mobility mass spectrometry is an advantageous method to assess the isomeric heterogeneity of carbohydrate compounds. It is capable of differentiating different types of carbohydrate ions having identical m/z values as well as multiple structural configurations of single compounds. PMID:24591031
Zhang, Gaiyun; Zhang, Haibo; Li, Sumei; Xiao, Ji; Zhang, Guangtao; Zhu, Yiguang; Niu, Siwen; Ju, Jianhua
2012-01-01
Amicetin, an antibacterial and antiviral agent, belongs to a group of disaccharide nucleoside antibiotics featuring an α-(1→4)-glycoside bond in the disaccharide moiety. In this study, the amicetin biosynthesis gene cluster was cloned from Streptomyces vinaceusdrappus NRRL 2363 and localized on a 37-kb contiguous DNA region. Heterologous expression of the amicetin biosynthesis gene cluster in Streptomyces lividans TK64 resulted in the production of amicetin and its analogues, thereby confirming the identity of the ami gene cluster. In silico sequence analysis revealed that 21 genes were putatively involved in amicetin biosynthesis, including 3 for regulation and transportation, 10 for disaccharide biosynthesis, and 8 for the formation of the amicetin skeleton by the linkage of cytosine, p-aminobenzoic acid (PABA), and the terminal (+)-α-methylserine moieties. The inactivation of the benzoate coenzyme A (benzoate-CoA) ligase gene amiL and the N-acetyltransferase gene amiF led to two mutants that accumulated the same two compounds, cytosamine and 4-acetamido-3-hydroxybenzoic acid. These data indicated that AmiF functioned as an amide synthetase to link cytosine and PABA. The inactivation of amiR, encoding an acyl-CoA-acyl carrier protein transacylase, resulted in the production of plicacetin and norplicacetin, indicating AmiR to be responsible for attachment of the terminal methylserine moiety to form another amide bond. These findings implicated two alternative strategies for amide bond formation in amicetin biosynthesis. PMID:22267658
Linkage analysis of quantitative refraction and refractive errors in the Beaver Dam Eye Study.
Klein, Alison P; Duggal, Priya; Lee, Kristine E; Cheng, Ching-Yu; Klein, Ronald; Bailey-Wilson, Joan E; Klein, Barbara E K
2011-07-13
Refraction, as measured by spherical equivalent, is the need for an external lens to focus images on the retina. While genetic factors play an important role in the development of refractive errors, few susceptibility genes have been identified. However, several regions of linkage have been reported for myopia (2q, 4q, 7q, 12q, 17q, 18p, 22q, and Xq) and for quantitative refraction (1p, 3q, 4q, 7p, 8p, and 11p). To replicate previously identified linkage peaks and to identify novel loci that influence quantitative refraction and refractive errors, linkage analysis of spherical equivalent, myopia, and hyperopia in the Beaver Dam Eye Study was performed. Nonparametric, sibling-pair, genome-wide linkage analyses of refraction (spherical equivalent adjusted for age, education, and nuclear sclerosis), myopia and hyperopia in 834 sibling pairs within 486 extended pedigrees were performed. Suggestive evidence of linkage was found for hyperopia on chromosome 3, region q26 (empiric P = 5.34 × 10^-4), a region that had shown significant genome-wide evidence of linkage to refraction and some evidence of linkage to hyperopia. In addition, the analysis replicated previously reported genome-wide significant linkages to 22q11 of adjusted refraction and myopia (empiric P = 4.43 × 10^-3 and 1.48 × 10^-3, respectively) and to 7p15 of refraction (empiric P = 9.43 × 10^-4). Evidence was also found of linkage to refraction on 7q36 (empiric P = 2.32 × 10^-3), a region previously linked to high myopia. The findings provide further evidence that genes controlling refractive errors are located on 3q26, 7p15, 7q36, and 22q11.
Celedón, Juan C; Soto-Quiros, Manuel E; Avila, Lydiana; Lake, Stephen L; Liang, Catherine; Fournier, Eduardo; Spesny, Mitzi; Hersh, Craig P; Sylvia, Jody S; Hudson, Thomas J; Verner, Andrei; Klanderman, Barbara J; Freimer, Nelson B; Silverman, Edwin K; Weiss, Scott T
2007-01-01
Although asthma is a major public health problem in certain Hispanic subgroups in the United States and Latin America, only one genome scan for asthma has included Hispanic individuals. Because of small sample size, that study had limited statistical power to detect linkage to asthma and its intermediate phenotypes in Hispanic participants. To identify genomic regions that contain susceptibility genes for asthma and airway responsiveness in an isolated Hispanic population living in the Central Valley of Costa Rica, we conducted a genome-wide linkage analysis of asthma (n = 638) and airway responsiveness (n = 488) in members of eight large pedigrees of Costa Rican children with asthma. Nonparametric multipoint linkage analysis of asthma was conducted by the NPL-PAIR allele-sharing statistic, and variance component models were used for the multipoint linkage analysis of airway responsiveness as a quantitative phenotype. All linkage analyses were repeated after exclusion of the phenotypic data of former and current smokers. Chromosome 12q showed some evidence of linkage to asthma, particularly in nonsmokers (P < 0.01). Among nonsmokers, there was suggestive evidence of linkage to airway responsiveness on chromosome 12q24.31 (LOD = 2.33 at 146 cM). After genotyping 18 additional short-tandem repeat markers on chromosome 12q, there was significant evidence of linkage to airway responsiveness on chromosome 12q24.31 (LOD = 3.79 at 144 cM), with a relatively narrow 1.5-LOD unit support interval for the observed linkage peak (142-147 cM). Our results suggest that chromosome 12q24.31 contains a locus (or loci) that influence a critical intermediate phenotype of asthma (airway responsiveness) in Costa Ricans.
Curtis, David; Knight, Jo; Sham, Pak C
2005-09-01
Although LOD score methods have been applied to diseases with complex modes of inheritance, linkage analysis of quantitative traits has tended to rely on non-parametric methods based on regression or variance components analysis. Here, we describe a new method for LOD score analysis of quantitative traits which does not require specification of a mode of inheritance. The technique is derived from the MFLINK method for dichotomous traits. A range of plausible transmission models is constructed, constrained to yield the correct population mean and variance for the trait but differing with respect to the contribution to the variance due to the locus under consideration. Maximized LOD scores under homogeneity and admixture are calculated, as is a model-free LOD score which compares the maximized likelihoods under admixture assuming linkage and no linkage. These LOD scores have known asymptotic distributions and hence can be used to provide a statistical test for linkage. The method has been implemented in a program called QMFLINK. It was applied to data sets simulated using a variety of transmission models and to a measure of monoamine oxidase activity in 105 pedigrees from the Collaborative Study on the Genetics of Alcoholism. With the simulated data, the results showed that the new method could detect linkage well if the true allele frequency for the trait was close to that specified. However, it performed poorly on models in which the true allele frequency was much rarer. For the Collaborative Study on the Genetics of Alcoholism data set only a modest overlap was observed between the results obtained from the new method and those obtained when the same data were analysed previously using regression and variance components analysis. Of interest is that D17S250 produced a maximized LOD score under homogeneity and admixture of 2.6 but did not indicate linkage using the previous methods. 
However, this region did produce evidence for linkage in a separate data set, suggesting that QMFLINK may have been able to detect a true linkage which was not picked up by the other methods. The application of model-free LOD score analysis to quantitative traits is novel and deserves further evaluation of its merits and disadvantages relative to other methods.
Alcohol outlets and clusters of violence
2011-01-01
Background Alcohol-related violence continues to be a major public health problem in the United States. In particular, there is substantial evidence of an association between alcohol outlets and assault. However, because the specific geographic relationships between alcohol outlets and the distribution of violence remain obscured, it is important to identify the spatial linkages that may exist, enhancing public health efforts to curb both violence and morbidity. Methods The present study utilizes police-recorded data on simple and aggravated assaults in Cincinnati, Ohio. Addresses of alcohol outlets for Cincinnati, including all bars, alcohol-serving restaurants, and off-premise liquor and convenience stores were obtained from the Ohio Division of Liquor Control and geocoded for analysis. A combination of proximity analysis, spatial cluster detection approaches and a geographic information system were used to identify clusters of alcohol outlets and the distribution of violence around them. Results A brief review of the empirical work relating to alcohol outlet density and violence is provided, noting that the majority of this literature is cross-sectional and ecological in nature, yielding a somewhat haphazard and aggregate view of how outlet type(s) and neighborhood characteristics like social organization and land use are related to assaultive violence. The results of the statistical analysis for Cincinnati suggest that while alcohol outlets are not problematic per se, assaultive violence has a propensity to cluster around agglomerations of alcohol outlets. This spatial relationship varies by distance and is also related to the characteristics of the alcohol outlet agglomeration. Specifically, spatially dense distributions of outlets appear to be more prone to clusters of assaultive violence when compared to agglomerations with a lower density of outlets.
Conclusion With a more thorough understanding of the spatial relationships between alcohol outlets and the distribution of assaults, policymakers in urban areas can make more informed regulatory decisions regarding alcohol licenses. Further, this research suggests that public health officials and epidemiologists need to develop a better understanding of what actually occurs in and around alcohol outlets, determining what factors (whether outlet, neighborhood, or spatially related) help fuel their relationship with violence and other alcohol-related harm. PMID:21542932
Bodea, Corneliu A; Middleton, Frank A; Melhem, Nadine M; Klei, Lambertus; Song, Youeun; Tiobech, Josepha; Marumoto, Pearl; Yano, Victor; Faraone, Stephen V; Roeder, Kathryn; Myles-Worsley, Marina; Devlin, Bernie; Byerley, William
2017-02-01
To localize genetic variation affecting risk for psychotic disorders in the population of Palau, we genotyped DNA samples from 203 Palauan individuals diagnosed with psychotic disorders, broadly defined, and 125 control subjects using a genome-wide single nucleotide polymorphism array. Palau has unique features advantageous for this study: due to its population history, Palauans are substantially interrelated; affected individuals often, but not always, cluster in families; and we have essentially complete ascertainment of affected individuals. To localize risk variants to genomic regions, we evaluated long-shared haplotypes, ≥10 Mb, identifying clusters of affected individuals who share such haplotypes. This extensive sharing, typically identical by descent, was significantly greater in cases than population controls, even after controlling for relatedness. Several regions of the genome exhibited substantial excess of shared haplotypes for affected individuals, including 3p21, 3p12, 4q28, and 5q23-q31. Two of these regions, 4q28 and 5q23-q31, showed significant linkage by traditional LOD score analysis and could harbor variants of more sizeable risk for psychosis or a multiplicity of risk variants. The pattern of haplotype sharing in 4q28 highlights PCDH10, encoding a cadherin-related neuronal receptor, as possibly involved in risk.
Development of New Candidate Gene and EST-Based Molecular Markers for Gossypium Species
Buyyarapu, Ramesh; Kantety, Ramesh V.; Yu, John Z.; Saha, Sukumar; Sharma, Govind C.
2011-01-01
New sources of molecular markers accelerate efforts to improve cotton fiber traits and aid in developing high-density integrated genetic maps. We developed new markers based on candidate genes and G. arboreum EST sequences that were used for polymorphism detection followed by genetic and physical mapping. Nineteen gene-based markers were surveyed for polymorphism detection in 26 Gossypium species. Cluster analysis generated a phylogenetic tree with four major sub-clusters for 23 species while three species branched out individually. The CAP method enhanced the rate of polymorphism of candidate gene-based markers between G. hirsutum and G. barbadense. Two hundred A-genome based SSR markers were designed after datamining of G. arboreum EST sequences (Mississippi Gossypium arboreum EST-SSR: MGAES). Over 70% of MGAES markers successfully produced amplicons while 65 of them demonstrated polymorphism between the parents of the G. hirsutum and G. barbadense RIL population and formed 14 linkage groups. Chromosomal localization of both candidate gene-based and MGAES markers was assisted by euploid and hypoaneuploid CS-B analysis. Gene-based and MGAES markers were highly informative as they were designed from candidate genes and fiber transcriptome with a potential to be integrated into the existing cotton genetic and physical maps. PMID:22315588
Tripathi, G.; Rangaswamy, D.; Borkar, M.; Prasad, N.; Sharma, R. K.; Sankhwar, S. N.; Agrawal, S.
2015-01-01
We evaluated whether polymorphisms in the interleukin (IL-1) gene cluster (IL-1 alpha [IL-1A], IL-1 beta [IL-1B], and IL-1 receptor antagonist [IL-1RN]) are associated with end stage renal disease (ESRD). A total of 258 ESRD patients and 569 ethnicity matched controls were examined for the IL-1 gene cluster. These were genotyped for five single-nucleotide gene polymorphisms in the IL-1A, IL-1B and IL-1RN genes and a variable number of tandem repeats (VNTR) in the IL-1RN. The IL-1B − 3953 and IL-1RN + 8006 polymorphism frequencies were significantly different between the two groups. At IL-1B, the T allele of − 3953C/T was increased among ESRD (P = 0.0001). A logistic regression model demonstrated that two repeat (240 base pair [bp]) of the IL-1Ra VNTR polymorphism was associated with ESRD (P = 0.0001). The C/C/C/C/C/1 haplotype was more prevalent in ESRD (P = 0.007). No linkage disequilibrium (LD) was observed between six loci of the IL-1 gene. We further conducted a meta-analysis of existing studies and found that there is a strong association of IL-1 RN VNTR 86 bp repeat polymorphism with susceptibility to ESRD (odds ratio = 2.04, 95% confidence interval = 1.48-2.82; P = 0.000). IL-1B − 5887, +8006 and the IL-1RN VNTR polymorphisms have been implicated as potential risk factors for ESRD. The meta-analysis showed a strong association of IL-1RN 86 bp VNTR polymorphism with susceptibility to ESRD. PMID:25684870
Horne, Benjamin D; Malhotra, Alka; Camp, Nicola J
2003-01-01
Background High triglycerides (TG) and low high-density lipoprotein cholesterol (HDL-C) jointly increase coronary disease risk. We performed linkage analysis for TG/HDL-C ratio in the Framingham Heart Study data as a quantitative trait, using methods implemented in LINKAGE, GENEHUNTER (GH), MCLINK, and SOLAR. Results were compared to each other and to those from a previous evaluation using SOLAR for TG/HDL-C ratio on this sample. We also investigated linked pedigrees in each region using by-pedigree analysis. Results Fourteen regions with at least suggestive linkage evidence were identified, including some that may increase and some that may decrease coronary risk. Ten of the 14 regions were identified by more than one analysis, and several of these regions were not previously detected. The best regions identified for each method were on chromosomes 2 (LOD = 2.29, MCLINK), 5 (LOD = 2.65, GH), 7 (LOD = 2.67, SOLAR), and 22 (LOD = 3.37, LINKAGE). By-pedigree multi-point LOD values in MCLINK showed linked pedigrees for all five regions, ranging from 3 linked pedigrees (chromosome 5) to 14 linked pedigrees (chromosome 7), and suggested localizations of between 9 cM and 27 cM in size. Conclusion Reasonable concordance was found across analysis methods. No single method identified all regions, either by full sample LOD or with by-pedigree analysis. Concordance across methods appeared better at the pedigree level, with many regions showing by-pedigree support in MCLINK when no evidence was observed in the full sample. Thus, investigating by-pedigree linkage evidence may provide a useful tool for evaluating linkage regions. PMID:14975161
Horne, Benjamin D; Malhotra, Alka; Camp, Nicola J
2003-12-31
High triglycerides (TG) and low high-density lipoprotein cholesterol (HDL-C) jointly increase coronary disease risk. We performed linkage analysis for TG/HDL-C ratio in the Framingham Heart Study data as a quantitative trait, using methods implemented in LINKAGE, GENEHUNTER (GH), MCLINK, and SOLAR. Results were compared to each other and to those from a previous evaluation using SOLAR for TG/HDL-C ratio on this sample. We also investigated linked pedigrees in each region using by-pedigree analysis. Fourteen regions with at least suggestive linkage evidence were identified, including some that may increase and some that may decrease coronary risk. Ten of the 14 regions were identified by more than one analysis, and several of these regions were not previously detected. The best regions identified for each method were on chromosomes 2 (LOD = 2.29, MCLINK), 5 (LOD = 2.65, GH), 7 (LOD = 2.67, SOLAR), and 22 (LOD = 3.37, LINKAGE). By-pedigree multi-point LOD values in MCLINK showed linked pedigrees for all five regions, ranging from 3 linked pedigrees (chromosome 5) to 14 linked pedigrees (chromosome 7), and suggested localizations of between 9 cM and 27 cM in size. Reasonable concordance was found across analysis methods. No single method identified all regions, either by full sample LOD or with by-pedigree analysis. Concordance across methods appeared better at the pedigree level, with many regions showing by-pedigree support in MCLINK when no evidence was observed in the full sample. Thus, investigating by-pedigree linkage evidence may provide a useful tool for evaluating linkage regions.
Genetic linkage maps are valuable tools in evolutionary biology; however, their availability for wild populations is extremely limited. Fundulus heteroclitus (Atlantic killifish) is a non-migratory estuarine fish that exhibits high allelic and phenotypic diversity partitioned among subpopulations that reside in disparate environmental conditions. Although F. heteroclitus is an ideal candidate model organism for studying gene-environment interactions, its molecular toolbox is limited. We identified hundreds of novel microsatellites which, when combined with existing microsatellites and single nucleotide polymorphisms (SNPs), were used to construct the first genetic linkage map for this species. By integrating independent linkage maps from three genetic crosses, we developed a consensus map containing 24 linkage groups, consistent with the number of chromosomes reported for this species. These linkage groups span 2300 centimorgans (cM) of recombinant genomic space, intermediate in size relative to the current linkage maps for the teleosts, medaka and zebrafish. Comparisons between fish genomes support a high degree of synteny between the consensus F. heteroclitus linkage map and the medaka and (to a lesser extent) zebrafish physical genome assemblies. This dataset is associated with the following publication: Waits, E., J. Martinson, B. Rinner, S. Morris, D. Proestou, D. Champlin, and D. Nacci. Genetic linkage map and comparative genome analysis for the estuarine Atlanti
Hoffmann, Katrin; Planitz, Christian; Rüschendorf, Franz; Müller-Myhsok, Bertram; Stassen, Hans H; Lucke, Barbara; Mattheisen, Manuel; Stumvoll, Michael; Bochmann, Rolf; Zschornack, Martin; Wienker, Thomas F; Nürnberg, Peter; Reis, André; Luft, Friedrich C; Lindner, Tom H
2009-05-01
Genome-wide linkage studies and genome-wide association studies have not as yet identified major genes contributing to primary hypertension in the general population. This state-of-affairs suggests considerable heterogeneity with small contributing effects for primary hypertension, or other complex genetic traits, in outbred populations. Isolated populations, as recent data from Iceland and French Canada suggest, could offer a solution to this problem. We studied a Slavic isolate in Germany, the Sorbs, and genotyped 1040 polymorphic microsatellite markers in 87 multigeneration families. Our genome-wide linkage scan revealed a locus on chromosome 1p36.13 at D1S3669-D1S2826 (40.95 cM Marshfield coordinates; logarithm of the odds = 3.45, nominal P = 0.00003) that reached genome-wide significance (P = 0.004), indicating the increased power in isolated populations. The chromosome 1 locus maps to a region in which traits such as diabetes, hyperlipidemia, obesity and BMI cluster. Our results suggest that this locus contributes to the metabolic syndrome, and that further attention in this and other populations is warranted.
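The relationship between a LOD score and its nominal P value can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes the common large-sample approximation in which 2·ln(10)·LOD is treated as a one-sided chi-square statistic with 1 degree of freedom. The function name `lod_to_nominal_p` is hypothetical.

```python
import math


def lod_to_nominal_p(lod: float) -> float:
    """Approximate one-sided nominal p-value for a LOD score.

    Assumption: 2 * ln(10) * LOD behaves like a 50:50 mixture of a point
    mass at zero and a chi-square variable with 1 degree of freedom.
    """
    if lod <= 0:
        # A non-positive LOD carries no evidence for linkage.
        return 1.0
    chi2 = 2.0 * math.log(10) * lod
    # Survival function of chi-square(1 df) is erfc(sqrt(x / 2));
    # the factor 0.5 makes the test one-sided.
    return 0.5 * math.erfc(math.sqrt(chi2 / 2.0))


if __name__ == "__main__":
    print(f"LOD 3.45 -> nominal p ~ {lod_to_nominal_p(3.45):.1e}")
```

Under this approximation a LOD of 3.45 corresponds to a nominal P on the order of 0.00003, in line with the value quoted in the abstract; genome-wide significance additionally requires a correction for the number of positions tested, which this sketch does not attempt.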
Lilja, Heidi E; Soro, Aino; Ylitalo, Kati; Nuotio, Ilpo; Viikari, Jorma S A; Salomaa, Veikko; Vartiainen, Erkki; Taskinen, Marja-Riitta; Peltonen, Leena; Pajukanta, Päivi
2002-09-01
In patients with premature coronary heart disease, the most common lipoprotein abnormality is high-density lipoprotein (HDL) deficiency. To assess the genetic background of the low HDL-cholesterol trait, we performed a candidate gene study in 25 families with low HDL, collected from the genetically isolated population of Finland. We studied 21 genes encoding essential proteins involved in the HDL metabolism by genotyping intragenic and flanking markers for these genes. We found suggestive evidence for linkage in two candidate regions: Marker D1S2844, in the apolipoprotein A-II (APOA2) region, yielded a LOD score of 2.14 and marker D11S939 flanking the apolipoprotein A-I/C-III/A-IV gene cluster (APOA1C3A4) produced a LOD score of 1.69. Interestingly, we identified potential shared haplotypes in these two regions in a subset of low HDL families. These families also contributed to the obtained positive LOD scores, whereas the rest of the families produced negative LOD scores. None of the remaining candidate regions provided any evidence for linkage. Since only a limited number of loci were tested in this candidate gene study, these LOD scores suggest significant involvement of the APOA2 gene and the APOA1C3A4 gene cluster, or loci in their immediate vicinity, in the pathogenesis of low HDL.
Mihailovska, Eva; Raith, Marianne; Valencia, Rocio G.; Fischer, Irmgard; Banchaabouchi, Mumna Al; Herbst, Ruth; Wiche, Gerhard
2014-01-01
Mutations in the cytolinker protein plectin lead to grossly distorted morphology of neuromuscular junctions (NMJs) in patients suffering from epidermolysis bullosa simplex (EBS)-muscular dystrophy (MD) with myasthenic syndrome (MyS). Here we investigated whether plectin contributes to the structural integrity of NMJs by linking them to the postsynaptic intermediate filament (IF) network. Live imaging of acetylcholine receptors (AChRs) in cultured myotubes differentiated ex vivo from immortalized plectin-deficient myoblasts revealed them to be highly mobile and unable to coalesce into stable clusters, in contrast to wild-type cells. We found plectin isoform 1f (P1f) to bridge AChRs and IFs via direct interaction with the AChR-scaffolding protein rapsyn in an isoform-specific manner; forced expression of P1f in plectin-deficient cells rescued both compromised AChR clustering and IF network anchoring. In conditional plectin knockout mice with gene disruption in muscle precursor/satellite cells (Pax7-Cre/cKO), uncoupling of AChRs from IFs was shown to lead to loss of postsynaptic membrane infoldings and disorganization of the NMJ microenvironment, including its invasion by microtubules. In their phenotypic behavior, mutant mice closely mimicked EBS-MD-MyS patients, including impaired body balance, severe muscle weakness, and reduced life span. Our study demonstrates that linkage to desmin IF networks via plectin is crucial for formation and maintenance of AChR clusters, postsynaptic NMJ organization, and body locomotion. PMID:25318670
Ecological Consistency of SSU rRNA-Based Operational Taxonomic Units at a Global Scale
Schmidt, Thomas S. B.; Matias Rodrigues, João F.; von Mering, Christian
2014-01-01
Operational Taxonomic Units (OTUs), usually defined as clusters of similar 16S/18S rRNA sequences, are the most widely used basic diversity units in large-scale characterizations of microbial communities. However, it remains unclear how well the various proposed OTU clustering algorithms approximate ‘true’ microbial taxa. Here, we explore the ecological consistency of OTUs – based on the assumption that, like true microbial taxa, they should show measurable habitat preferences (niche conservatism). In a global and comprehensive survey of available microbial sequence data, we systematically parse sequence annotations to obtain broad ecological descriptions of sampling sites. Based on these, we observe that sequence-based microbial OTUs generally show high levels of ecological consistency. However, different OTU clustering methods result in marked differences in the strength of this signal. Assuming that ecological consistency can serve as an objective external benchmark for cluster quality, we conclude that hierarchical complete linkage clustering, which provided the most ecologically consistent partitions, should be the default choice for OTU clustering. To our knowledge, this is the first approach to assess cluster quality using an external, biologically meaningful parameter as a benchmark, on a global scale. PMID:24763141
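Hierarchical complete linkage clustering, the method the abstract recommends, can be sketched in a few lines. This is an illustrative toy, not the pipeline used in the study: the helper names, the mismatch-fraction distance on pre-aligned sequences, and the 10% threshold are all assumptions chosen to keep the example small.

```python
from itertools import combinations


def mismatch_fraction(a: str, b: str) -> float:
    """Fraction of differing positions between two equal-length aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def complete_linkage(items, dist, threshold):
    """Agglomerative clustering; returns clusters as lists of item indices.

    Two clusters are merged only if their *largest* pairwise distance is
    within the threshold, so every pair inside an OTU stays within it.
    """
    clusters = [[i] for i in range(len(items))]

    def cluster_dist(c1, c2):
        return max(dist(items[i], items[j]) for i in c1 for j in c2)

    while len(clusters) > 1:
        # Find the closest pair of clusters under the complete-linkage rule.
        (a, b), d = min(
            (((i, j), cluster_dist(clusters[i], clusters[j]))
             for i, j in combinations(range(len(clusters)), 2)),
            key=lambda pair: pair[1],
        )
        if d > threshold:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a is unaffected
    return clusters


if __name__ == "__main__":
    seqs = ["ACGTACGTAC", "ACGTACGTAA", "ACGTACGTTA", "TTTTTTTTTT"]
    print(complete_linkage(seqs, mismatch_fraction, threshold=0.1))
```

In the toy input the third sequence is within the threshold of the second but not of the first, so complete linkage keeps it out of their cluster; single linkage would have chained all three together.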
Triwitayakorn, Kanokporn; Chatkulkawin, Pornsupa; Kanjanawattanawong, Supanath; Sraphet, Supajit; Yoocha, Thippawan; Sangsrakru, Duangjai; Chanprasert, Juntima; Ngamphiw, Chumpol; Jomchai, Nukoon; Therawattanasuk, Kanikar; Tangphatsornruang, Sithichoke
2011-01-01
To obtain more information on the Hevea brasiliensis genome, we sequenced the transcriptome from the vegetative shoot apex yielding 2 311 497 reads. Clustering and assembly of the reads produced a total of 113 313 unique sequences, comprising 28 387 isotigs and 84 926 singletons. Also, 17 819 expressed sequence tag (EST)-simple sequence repeats (SSRs) were identified from the data set. To demonstrate the use of this EST resource for marker development, primers were designed for 430 of the EST-SSRs. Three hundred and twenty-three primer pairs were amplifiable in H. brasiliensis clones. Polymorphic information content values of 47 selected SSRs among 20 H. brasiliensis clones ranged from 0.13 to 0.71, with an average of 0.51. A dendrogram of genetic similarities between the 20 H. brasiliensis clones using these 47 EST-SSRs suggested two distinct groups that correlated well with clone pedigree. These novel EST-SSRs together with the published SSRs were used for the construction of an integrated parental linkage map of H. brasiliensis based on 81 lines of an F1 mapping population. The map consisted of 97 loci, comprising 37 novel EST-SSRs and 60 published SSRs, distributed on 23 linkage groups and covered 842.9 cM with a mean interval of 11.9 cM and ∼4 loci per linkage group. Although the number of linkage groups exceeds the haploid number (18), several markers shared between homologous linkage groups and the previous map indicated that the F1 map in this study is appropriate for further study in marker-assisted selection. PMID:22086998
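The polymorphic information content (PIC) values quoted above can be illustrated with a short calculation. The abstract does not state which formula was used; the sketch below assumes the widely used Botstein et al. definition, PIC = 1 − Σ pᵢ² − Σ_{i<j} 2 pᵢ² pⱼ², and the function name `pic` is hypothetical.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphic information content for one marker.

    Assumes the Botstein-style definition:
    PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    where p_i are the allele frequencies at the locus.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in allele_freqs)
    cross_term = sum(2 * (p * p) * (q * q)
                     for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - cross_term


if __name__ == "__main__":
    # Two equally frequent alleles give the maximum PIC for a biallelic marker.
    print(pic([0.5, 0.5]))
```

A marker with more alleles at balanced frequencies scores higher, which is why multiallelic SSRs can reach values such as the 0.71 maximum reported here while a biallelic marker cannot exceed 0.375.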
2013-01-01
Background Cucumber is an important vegetable crop that is susceptible to many pathogens, but no disease resistance (R) genes have been cloned. The availability of whole genome sequences provides an excellent opportunity for systematic identification and characterization of the nucleotide binding and leucine-rich repeat (NB-LRR) type R gene homolog (RGH) sequences in the genome. Cucumber has a very narrow genetic base making it difficult to construct high-density genetic maps. Development of a consensus map by synthesizing information from multiple segregating populations is a method of choice to increase marker density. As such, the objectives of the present study were to identify and characterize NB-LRR type RGHs, and to develop a high-density, integrated cucumber genetic-physical map anchored with RGH loci. Results From the Gy14 draft genome, 70 NB-containing RGHs were identified and characterized. Most RGHs were in clusters with uneven distribution across seven chromosomes. In silico analysis indicated that all 70 RGHs had EST support for gene expression. Phylogenetic analysis classified 58 RGHs into two clades: CNL and TNL. Comparative analysis revealed high-degree sequence homology and synteny in chromosomal locations of these RGH members between the cucumber and melon genomes. Fifty-four molecular markers were developed to delimit 67 of the 70 RGHs, which were integrated into a genetic map through linkage analysis. A 1,681-locus cucumber consensus map including 10 gene loci and spanning 730.0 cM in seven linkage groups was developed by integrating three component maps with a bin-mapping strategy. Physically, 308 scaffolds with 193.2 Mbp total DNA sequences were anchored onto this consensus map that covered 52.6% of the 367 Mbp cucumber genome. Conclusions Cucumber contains relatively few NB-LRR RGHs that are clustered and unevenly distributed in the genome. 
All RGHs seem to be transcribed and shared significant sequence homology and synteny with the melon genome suggesting conservation of these RGHs in the Cucumis lineage. The 1,681-locus consensus genetic-physical map developed and the RGHs identified and characterized herein are valuable genomics resources that may have many applications such as quantitative trait loci identification, map-based gene cloning, association mapping, marker-assisted selection, as well as assembly of a more complete cucumber genome. PMID:23531125
Gaunt, Tom R; Rodriguez, Santiago; Zapata, Carlos; Day, Ian NM
2006-01-01
Background Various software tools are available for the display of pairwise linkage disequilibrium across multiple single nucleotide polymorphisms. The HapMap project also presents these graphics within their website. However, these approaches are limited in their use of data from multiallelic markers and provide limited information in a graphical form. Results We have developed a software package (MIDAS – Multiallelic Interallelic Disequilibrium Analysis Software) for the estimation and graphical display of interallelic linkage disequilibrium. Linkage disequilibrium is analysed for each allelic combination (of one allele from each of two loci), between all pairwise combinations of any type of multiallelic loci in a contig (or any set) of many loci (including single nucleotide polymorphisms, microsatellites, minisatellites and haplotypes). Data are presented graphically in a novel and informative way, and can also be exported in tabular form for other analyses. This approach facilitates visualisation of patterns of linkage disequilibrium across genomic regions, analysis of the relationships between different alleles of multiallelic markers and inferences about patterns of evolution and selection. Conclusion MIDAS is a linkage disequilibrium analysis program with a comprehensive graphical user interface providing novel views of patterns of linkage disequilibrium between all types of multiallelic and biallelic markers. Availability Available from and PMID:16643648
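Interallelic linkage disequilibrium for one allelic combination can be sketched as below. This is not MIDAS code and does not reproduce its interface: it assumes phased haplotypes are already available (real tools typically have to estimate haplotype frequencies from unphased genotypes) and uses the standard coefficient D = p(AB) − p(A)·p(B) with Lewontin's normalised D′. The function name `allelic_ld` and the toy haplotypes are hypothetical.

```python
def allelic_ld(haplotypes, allele1, allele2):
    """Return (D, D') for one allele at locus 1 paired with one allele at locus 2.

    haplotypes: list of (allele_at_locus1, allele_at_locus2) tuples,
    assumed to be phased. Works for biallelic and multiallelic loci,
    since each allele is compared against all others pooled.
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")
    p = sum(1 for a, _ in haplotypes if a == allele1) / n
    q = sum(1 for _, b in haplotypes if b == allele2) / n
    p_joint = sum(1 for a, b in haplotypes
                  if a == allele1 and b == allele2) / n
    d = p_joint - p * q
    # D' rescales D by its maximum attainable magnitude given p and q.
    if d >= 0:
        d_max = min(p * (1 - q), (1 - p) * q)
    else:
        d_max = min(p * q, (1 - p) * (1 - q))
    d_prime = d / d_max if d_max > 0 else 0.0
    return d, d_prime


if __name__ == "__main__":
    # Invented toy sample of ten phased haplotypes.
    sample = ([("A", "1")] * 4 + [("a", "2")] * 4
              + [("A", "2")] + [("a", "1")])
    print(allelic_ld(sample, "A", "1"))
```

Looping this function over every allele pair of every pair of loci yields the kind of all-against-all table that the abstract describes displaying graphically.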
Autosomal Dominant Nonsyndromic Cleft Lip and Palate: Significant Evidence of Linkage at 18q21.1
Beiraghi, Soraya; Nath, Swapan K.; Gaines, Matthew; Mandhyan, Desh D.; Hutchings, David; Ratnamala, Uppala; McElreavey, Ken; Bartoloni, Lucia; Antonarakis, Gregory S.; Antonarakis, Stylianos E.; Radhakrishna, Uppala
2007-01-01
Nonsyndromic cleft lip with or without cleft palate (NSCL/P) is one of the most common congenital facial defects, with an incidence of 1 in 700–1,000 live births among individuals of European descent. Several linkage and association studies of NSCL/P have suggested numerous candidate genes and genomic regions. A genomewide linkage analysis of a large multigenerational family (UR410) with NSCL/P was performed using a single-nucleotide–polymorphism array. Nonparametric linkage (NPL) analysis provided significant evidence of linkage for marker rs728683 on chromosome 18q21.1 (NPL=43.33 and P=.000061; nonparametric LOD=3.97 and P=.00001). Parametric linkage analysis with a dominant mode of inheritance and reduced penetrance resulted in a maximum LOD score of 3.61 at position 47.4 Mb on chromosome 18q21.1. Haplotype analysis with informative crossovers defined a 5.7-Mb genomic region spanned by proximal marker rs1824683 (42,403,918 bp) and distal marker rs768206 (48,132,862 bp). Thus, a novel genomic region on 18q21.1 was identified that most likely harbors a high-risk variant for NSCL/P in this family; we propose to name this locus “OFC11” (orofacial cleft 11). PMID:17564975
Shao, Changwei; Niu, Yongchao; Rastas, Pasi; Liu, Yang; Xie, Zhiyuan; Li, Hengde; Wang, Lei; Jiang, Yong; Tai, Shuaishuai; Tian, Yongsheng; Sakamoto, Takashi; Chen, Songlin
2015-01-01
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1–8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species. PMID:25762582
Genetic studies of stuttering in a founder population.
Wittke-Thompson, Jacqueline K; Ambrose, Nicoline; Yairi, Ehud; Roe, Cheryl; Cook, Edwin H; Ober, Carole; Cox, Nancy J
2007-01-01
Genome-wide linkage and association analyses were conducted to identify genetic determinants of stuttering in a founder population in which 48 individuals affected with stuttering are connected in a single 232-person genealogy. A novel approach was devised to account for all necessary relationships to enable multipoint linkage analysis. Regions with nominal evidence for linkage were found on chromosomes 3 (P=0.013, 208.8 centiMorgans (cM)), 13 (P=0.012, 52.6 cM), and 15 (P=0.02, 100 cM). Regions with nominal evidence for association with stuttering that overlapped with a linkage signal are located on chromosomes 3 (P=0.0047, 195 cM), 9 (P=0.0067, 46.5 cM), and 13 (P=0.0055, 52.6 cM). We also conducted the first meta-analysis for stuttering using results from linkage studies in the Hutterites and The Illinois International Genetics of Stuttering Project and identified regions with nominal evidence for linkage on chromosomes 2 (P=0.013, 180-195 cM) and 5 (P=0.0051, 105-120 cM; P=0.015, 120-135 cM). None of the linkage signals detected in the Hutterite sample alone, or in the meta-analysis, meet genome-wide criteria for significance, although some of the stronger signals overlap linkage mapping signals previously reported for other speech and language disorders. After reading this article, the reader will be able to: (1) summarize information about the background of common disorders and methodology of genetic studies; (2) evaluate the role of genetics in stuttering; (3) discuss the value of using founder populations in genetic studies; (4) articulate the importance of combining several studies in a meta-analysis; (5) discuss the overlap of genetic signals identified in stuttering with other speech and language disorders.
Nance-Horan syndrome: localization within the region Xp21.1-Xp22.3 by linkage analysis.
Stambolian, D; Lewis, R A; Buetow, K; Bond, A; Nussbaum, R
1990-07-01
Nance-Horan Syndrome (NHS) or X-linked cataract-dental syndrome (MIM 302350) is a disease of unknown pathogenesis characterized by congenital cataracts and dental anomalies. We performed linkage analysis in three kindreds with NHS by using six RFLP markers between Xp11.3 and Xp22.3. Close linkage was found between NHS and polymorphic loci DXS43 (theta = 0 with lod score 2.89), DXS41 (theta = 0 with lod score 3.44), and DXS67 (theta = 0 with lod score 2.74), defined by probes pD2, p99-6, and pB24, respectively. Recombinations were found with the marker loci DXS84 (theta = .04 with lod score 4.13), DXS143 (theta = .06 with lod score 3.11) and DXS7 (theta = .09 with lod score 1.68). Multipoint linkage analysis determined the NHS locus to be linked completely to DXS41 (lod score = 7.07). Our linkage results, combined with analysis of Xp interstitial deletions, suggest that the NHS locus is located within or close to the Xp22.1-Xp22.2 region.
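The two-point lod scores and recombination fractions (theta) reported above follow the usual definition lod(θ) = log₁₀[L(θ) / L(0.5)]. A minimal sketch of that calculation is given below for the simplest textbook case of phase-known, fully informative meioses; this is an assumption made for illustration, since real pedigree likelihoods (as computed by linkage programs) are considerably more involved. The function name `two_point_lod` is hypothetical.

```python
import math


def two_point_lod(theta: float, n_meioses: int, n_recombinants: int) -> float:
    """Two-point lod score for phase-known, fully informative meioses.

    L(theta) = theta^k * (1 - theta)^(n - k), compared against the
    no-linkage likelihood L(0.5) = 0.5^n.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    if not 0 <= n_recombinants <= n_meioses:
        raise ValueError("recombinants must be between 0 and n_meioses")
    k, n = n_recombinants, n_meioses
    likelihood = (theta ** k) * ((1.0 - theta) ** (n - k))
    if likelihood == 0.0:
        # theta = 0 is impossible once any recombinant has been observed.
        return float("-inf")
    return math.log10(likelihood) - n * math.log10(0.5)


if __name__ == "__main__":
    # With no recombinants the lod is maximised at theta = 0
    # and equals n * log10(2).
    print(two_point_lod(0.0, 10, 0))
```

In this simplified setting each non-recombinant meiosis adds log₁₀(2) ≈ 0.3 to the lod at θ = 0, which gives a feel for how many informative meioses are needed to reach the conventional threshold of 3.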
Nance-Horan syndrome: localization within the region Xp21.1-Xp22.3 by linkage analysis.
Stambolian, D; Lewis, R A; Buetow, K; Bond, A; Nussbaum, R
1990-01-01
Nance-Horan Syndrome (NHS) or X-linked cataract-dental syndrome (MIM 302350) is a disease of unknown pathogenesis characterized by congenital cataracts and dental anomalies. We performed linkage analysis in three kindreds with NHS by using six RFLP markers between Xp11.3 and Xp22.3. Close linkage was found between NHS and polymorphic loci DXS43 (theta = 0 with lod score 2.89), DXS41 (theta = 0 with lod score 3.44), and DXS67 (theta = 0 with lod score 2.74), defined by probes pD2, p99-6, and pB24, respectively. Recombinations were found with the marker loci DXS84 (theta = .04 with lod score 4.13), DXS143 (theta = .06 with lod score 3.11) and DXS7 (theta = .09 with lod score 1.68). Multipoint linkage analysis determined the NHS locus to be linked completely to DXS41 (lod score = 7.07). Our linkage results, combined with analysis of Xp interstitial deletions, suggest that the NHS locus is located within or close to the Xp22.1-Xp22.2 region. PMID:1971992
Linkage localization of X-linked Charcot-Marie-Tooth disease
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bergoffen, J.; Trofatter, J.; Haines, J.L.
1993-02-01
Charcot-Marie-Tooth disease (CMT), also known as hereditary motor and sensory neuropathy, is a heterogeneous group of slowly progressive, degenerative disorders of peripheral nerve. X-linked CMT (CMTX) (McKusick 302800), a subdivision of type I, or demyelinating, CMT is an X-linked dominant condition with variable penetrance. Previous linkage analysis using RFLPs demonstrated linkage to markers on the proximal long and short arms of the X chromosome, with the more likely localization on the proximal long arm of the X chromosome. Available variable simple-sequence repeats (VSSRs) broaden the possibilities for linkage analysis. This paper presents new linkage data and recombination analysis derived from work with four VSSR markers - AR, PGKP1, DXS453, and DXYS1X - in addition to analysis using RFLP markers described elsewhere. These studies localize the CMTX gene to the proximal Xq segment between PGKP1 (Xq11.2-12) and DXS72 (Xq21.1), with a combined maximum multipoint lod score of 15.3 at DXS453 (theta = 0). 32 refs., 3 figs., 2 tabs.
Siggs, Owen M; Javadiyan, Shari; Sharma, Shiwani; Souzeau, Emmanuelle; Lower, Karen M; Taranath, Deepa A; Black, Jo; Pater, John; Willoughby, John G; Burdon, Kathryn P; Craig, Jamie E
2017-01-01
Congenital cataract is a rare but severe paediatric visual impediment, often caused by variants in one of several crystallin genes that produce the bulk of structural proteins in the lens. Here we describe a pedigree with autosomal dominant isolated congenital cataract and linkage to the crystallin gene cluster on chromosome 22. No rare single nucleotide variants or short indels were identified by exome sequencing, yet copy number variant analysis revealed a duplication spanning both CRYBB1 and CRYBA4. While the CRYBA4 duplication was complete, the CRYBB1 duplication was not, with the duplicated CRYBB1 product predicted to create a gain of function allele. This association suggests a new genetic mechanism for the development of isolated congenital cataract. PMID:28272538
Advances in spatial epidemiology and geographic information systems.
Kirby, Russell S; Delmelle, Eric; Eberth, Jan M
2017-01-01
The field of spatial epidemiology has evolved rapidly in the past 2 decades. This study serves as a brief introduction to spatial epidemiology and the use of geographic information systems in applied research in epidemiology. We highlight technical developments and opportunities to apply spatial analytic methods in epidemiologic research, focusing on methodologies involving geocoding, distance estimation, residential mobility, record linkage and data integration, spatial and spatio-temporal clustering, small area estimation, and Bayesian applications to disease mapping. The articles included in this issue incorporate many of these methods into their study designs and analytical frameworks. It is our hope that these studies will spur further development and utilization of spatial analysis and geographic information systems in epidemiologic research. Copyright © 2016 Elsevier Inc. All rights reserved.
Hierarchical structures of correlations networks among Turkey’s exports and imports by currencies
NASA Astrophysics Data System (ADS)
Kocakaplan, Yusuf; Deviren, Bayram; Keskin, Mustafa
2012-12-01
We have examined the hierarchical structures of correlations networks among Turkey’s exports and imports by currencies for the 1996-2010 period, using the concepts of the minimal spanning tree (MST) and hierarchical tree (HT), which depend on the concept of ultrametricity. These trees are useful tools for understanding and detecting the global structure, taxonomy and hierarchy in financial markets. We derived a hierarchical organization and built the MSTs and HTs for the 1996-2001 and 2002-2010 periods. We studied these two sub-periods because the Euro (EUR) came into use in 2001 and some countries have made their exports and imports with Turkey via the EUR since 2002, and also in order to test various time-windows and observe temporal evolution. We have carried out bootstrap analysis to associate a value of statistical reliability with the links of the MSTs and HTs. We have also used average linkage cluster analysis (ALCA) to observe the cluster structure more clearly. Moreover, we have obtained the bidimensional minimal spanning tree (BMST), because economic trade is a bidimensional problem. From the structural topologies of these trees, we have identified different clusters of currencies according to their proximity and economic ties. Our results show that some currencies are more important within the network, due to a tighter connection with other currencies. We have also found that these currencies play a key role for Turkey’s exports and imports and have important implications for the design of portfolio and investment strategies.
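As a rough illustration of how an MST can be built from a correlation matrix, here is a minimal sketch. It assumes the distance d = sqrt(2(1 - rho)) that is commonly used in this kind of network analysis; the abstract does not state which distance the authors used. The currency labels and correlation values are hypothetical, and the tree is built with a plain Kruskal procedure.

```python
import math

def correlation_distance(rho):
    # Assumed mapping from correlation to distance: d = sqrt(2 * (1 - rho)).
    return math.sqrt(2.0 * (1.0 - rho))

def minimal_spanning_tree(corr, labels):
    """Kruskal's algorithm on the complete graph of pairwise distances."""
    n = len(labels)
    edges = sorted(
        (correlation_distance(corr[i][j]), i, j)
        for i in range(n) for j in range(i + 1, n)
    )
    parent = list(range(n))  # union-find forest used to detect cycles

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    tree = []
    for d, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:  # keep the edge only if it joins two components
            parent[ri] = rj
            tree.append((labels[i], labels[j], d))
    return tree

# Hypothetical correlations between four currency series (illustration only).
labels = ["USD", "EUR", "GBP", "JPY"]
corr = [
    [1.0, 0.8, 0.5, 0.1],
    [0.8, 1.0, 0.6, 0.2],
    [0.5, 0.6, 1.0, 0.3],
    [0.1, 0.2, 0.3, 1.0],
]
tree = minimal_spanning_tree(corr, labels)
for a, b, d in tree:
    print(f"{a} - {b}: {d:.3f}")
```

The tree keeps n - 1 links for n currencies, so strongly correlated currencies end up adjacent while weak links are dropped.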
Self consistency grouping: a stringent clustering method
2012-01-01
Background Numerous types of clustering like single linkage and K-means have been widely studied and applied to a variety of scientific problems. However, the existing methods are not readily applicable for the problems that demand high stringency. Methods Our method, self consistency grouping, i.e. SCG, yields clusters whose members are closer in rank to each other than to any member outside the cluster. We do not define a distance metric; we use the best known distance metric and presume that it measures the correct distance. SCG does not impose any restriction on the size or the number of the clusters that it finds. The boundaries of clusters are determined by the inconsistencies in the ranks. In addition to the direct implementation that finds the complete structure of the (sub)clusters we implemented two faster versions. The fastest version is guaranteed to find only the clusters that are not subclusters of any other clusters and the other version yields the same output as the direct implementation but does so more efficiently. Results Our tests have demonstrated that SCG yields very few false positives. This was accomplished by introducing errors in the distance measurement. Clustering of protein domain representatives by structural similarity showed that SCG could recover homologous groups with high precision. Conclusions SCG has potential for finding biological relationships under stringent conditions. PMID:23320864
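One possible reading of the stringency criterion in this abstract can be sketched as a simple check. This is an interpretation rather than the authors' algorithm: a group is treated as self-consistent when, for every member, all other members are nearer than every non-member. The function name and the toy data are hypothetical.

```python
def is_self_consistent(dist, group):
    """Return True if every member ranks all fellow members ahead of all outsiders.

    dist  : square matrix (list of lists) of pairwise distances
    group : iterable of item indices forming the candidate cluster
    """
    members = set(group)
    outsiders = [k for k in range(len(dist)) if k not in members]
    if len(members) < 2 or not outsiders:
        return True  # nothing to contradict the grouping
    for i in members:
        farthest_inside = max(dist[i][j] for j in members if j != i)
        nearest_outside = min(dist[i][k] for k in outsiders)
        if farthest_inside >= nearest_outside:
            return False  # an outsider ranks at least as close as a member
    return True

# Toy data: five points on a line, distances are absolute differences.
positions = [0, 1, 2, 10, 11]
dist = [[abs(a - b) for b in positions] for a in positions]

print(is_self_consistent(dist, [0, 1, 2]))  # points 0, 1, 2
print(is_self_consistent(dist, [3, 4]))     # points 10, 11
print(is_self_consistent(dist, [1, 2, 3]))  # points 1, 2, 10
```

The check only compares ranks within each member's own distance list, so it places no constraint on the size or number of clusters.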
Linkage analysis of Norrie disease with an X-chromosomal ornithine aminotransferase locus.
Bateman, J B; Kojis, T L; Cantor, R M; Heinzmann, C; Ngo, J T; Spence, M A; Inana, G; Kivlin, J D; Curtis, D; Sparkes, R S
1993-01-01
Norrie disease is a rare disease of newborn males caused by prenatal or perinatal retinal detachment, which may be associated with mental retardation, psychosis, and/or hearing loss. DXS7 (L1.28) and MAO A and B loci have been linked to the ND locus on the short arm of the X chromosome. Sequences homologous to OAT also have been mapped to the short arm of the X chromosome. We performed linkage analyses between the ND locus and one of the OAT-like clusters of sequences on the X chromosome (OATL1), using a ScaI RFLP in a ND family, and increased the previously calculated lod score (z) to over 3 (3.38; theta = 0.05). Similarly, we calculated a lod score of 4.06 (theta = 0.01) between the OATL1 and DXS7 loci. Alone, the OATL1 ScaI RFLP system is expected to be informative in 48% of females. If this system were used in combination with the DXS7 TaqI polymorphism, 71% of females would be informative for at least one of the markers and 21% would be informative for both. Because the OATL1 ScaI RFLP is a relatively common polymorphism, this system should be useful for the identification of ND carriers and affected male fetuses and newborns.
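The informativeness figures quoted above can be cross-checked with inclusion-exclusion. In the sketch below, A denotes "informative for the OATL1 ScaI RFLP" and B "informative for the DXS7 TaqI polymorphism". The value obtained for P(B) is derived here from the quoted percentages and is not stated in the abstract.

```latex
\[
P(B) = P(A \cup B) + P(A \cap B) - P(A) = 0.71 + 0.21 - 0.48 = 0.44
\]
\[
\text{If the two markers were independent: } P(A)\,P(B) = 0.48 \times 0.44 \approx 0.21,
\qquad 1 - (1 - 0.48)(1 - 0.44) \approx 0.71
\]
```

Under that independence assumption, both reported figures (71% for at least one marker, 21% for both) are reproduced to two decimal places.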
Leung, Tommy W C; Mak, Darwin; Wong, K H; Wang, Y; Song, Y H; Tsang, D N C; Wong, C; Shao, Y M; Lim, W L
2008-07-01
We conducted a molecular epidemiological study on newly diagnosed human immunodeficiency virus type 1 (HIV-1)-infected patients in Hong Kong to identify the epidemiological linkage of HIV-1 infection in the locality. Reverse transcription polymerase chain reaction (RT-PCR) for HIV-1 was performed on newly diagnosed HIV-1-positive sera collected from January 2002 to December 2006. PCR products corresponding to the env C2V3V4 region and gag p17/p24 junction of the HIV-1 genome were sequenced. Phylogenetic analyses performed on the acquired nucleotide sequences revealed that CRF01_AE and subtype B were the two dominant HIV-1 subtypes. Analyses also demonstrated the presence of three emerging HIV-1 clusters among the subtype B sequences in Hong Kong. Each cluster possesses a unique cluster-specific amino acid signature for identification. The data show that one of the clusters (Cluster I) is rapidly expanding. In addition to the unique cluster-specific amino acid signature, the majority of sequences in Cluster I harbor a 6-amino acid insertion at the gag p17/p24 junction in a region that is thought to be closely associated with HIV-1 infectivity.
Woodbury-Smith, M; Bilder, D A; Morgan, J; Jerominski, L; Darlington, T; Dyer, T; Paterson, A D; Coon, H
2017-01-01
It has long been recognized that there is an association between enlarged head circumference (HC) and autism spectrum disorder (ASD), but the genetics of HC in ASD is not well understood. In order to investigate the genetic underpinning of HC in ASD, we undertook a genome-wide linkage study of HC followed by linkage signal targeted association among a sample of 67 extended pedigrees with ASD. HC measurements on members of 67 multiplex ASD extended pedigrees were used as a quantitative trait in a genome-wide linkage analysis. The Illumina 6K SNP linkage panel was used, and analyses were carried out using the SOLAR implemented variance components model. Loci identified in this way formed the target for subsequent association analysis using the Illumina OmniExpress chip and imputed genotypes. A modification of the qTDT was used as implemented in SOLAR. We identified a linkage signal spanning 6p21.31 to 6p22.2 (maximum LOD = 3.4). Although targeted association did not find evidence of association with any SNP overall, in one family with the strongest evidence of linkage, there was evidence for association (rs17586672, p = 1.72E-07). Although this region does not overlap with ASD linkage signals in these same samples, it has been associated with other psychiatric risk, including ADHD, developmental dyslexia, schizophrenia, specific language impairment, and juvenile bipolar disorder. The genome-wide significant linkage signal represents the first reported observation of a potential quantitative trait locus for HC in ASD and may be relevant in the context of complex multivariate risk likely leading to ASD.
Teaching Principles of Linkage and Gene Mapping with the Tomato.
ERIC Educational Resources Information Center
Hawk, James A.; And Others
1980-01-01
A three-point linkage system in tomatoes is used to explain concepts of gene mapping, linking and statistical analysis. The system is designed for teaching the effective use of statistics, and the power of genetic analysis from statistical analysis of phenotypic ratios. (Author/SA)
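A three-point testcross of the kind described here can be worked through in a few lines. The sketch below is generic: the loci A, B, C and the progeny counts are hypothetical and are not taken from the tomato system in this record. Each progeny class is a three-letter string of alleles, and the most frequent class is taken as a parental (non-recombinant) type.

```python
def recombination_fraction(counts, i, j):
    """Fraction of progeny that are recombinant between loci i and j."""
    parental = max(counts, key=counts.get)  # most frequent class is parental
    total = sum(counts.values())
    recombinant = sum(
        n for cls, n in counts.items()
        # recombinant if exactly one of the two loci carries the parental allele
        if (cls[i] == parental[i]) != (cls[j] == parental[j])
    )
    return recombinant / total

# Hypothetical testcross progeny counts (1000 plants in total).
counts = {
    "ABC": 400, "abc": 390,  # parental classes
    "Abc": 45, "aBC": 55,    # single crossovers between A and B
    "ABc": 50, "abC": 50,    # single crossovers between B and C
    "AbC": 5, "aBc": 5,      # double crossovers
}

rf_ab = recombination_fraction(counts, 0, 1)
rf_bc = recombination_fraction(counts, 1, 2)
rf_ac = recombination_fraction(counts, 0, 2)

# The pair with the largest fraction flanks the map, so the third locus lies between.
print(f"A-B: {100 * rf_ab:.1f} cM, B-C: {100 * rf_bc:.1f} cM, A-C: {100 * rf_ac:.1f} cM")
```

With these counts the A-C fraction is smaller than the sum of the two adjacent intervals, because double crossovers restore the parental arrangement of the outer loci and go uncounted.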
Robust LOD scores for variance component-based linkage analysis.
Blangero, J; Williams, J T; Almasy, L
2000-01-01
The variance component method is now widely used for linkage analysis of quantitative traits. Although this approach offers many advantages, the importance of the underlying assumption of multivariate normality of the trait distribution within pedigrees has not been studied extensively. Simulation studies have shown that traits with leptokurtic distributions yield linkage test statistics that exhibit excessive Type I error when analyzed naively. We derive analytical formulae relating the deviation from the expected asymptotic distribution of the lod score to the kurtosis and total heritability of the quantitative trait. A simple correction constant yields a robust lod score for any deviation from normality and for any pedigree structure, and effectively eliminates the problem of inflated Type I error due to misspecification of the underlying probability model in variance component-based linkage analysis.
Yokoyama, Eiji; Hashimoto, Ruiko; Etoh, Yoshiki; Ichihara, Sachiko; Horikawa, Kazumi; Uchimura, Masako
2011-01-01
The distribution of insertion sequence (IS) 629 among strains of enterohemorrhagic Escherichia coli serovar O157 (O157) was investigated and compared with the strain lineages defined by lineage specific polymorphism assay-6 (LSPA-6) to demonstrate the effectiveness of IS629 analysis for population genetics analysis. Using pulsed-field gel electrophoresis and variable-number tandem repeat typing, 140 strains producing both VT1 and VT2 and 98 strains producing only VT2 were selected from a total of 592 strains isolated from patients and asymptomatic carriers in Chiba Prefecture, Japan, during 2003-2008. By LSPA-6 analysis, six strains had atypical amplicon sizes in their Z5935 loci and five strains had atypical amplicon sizes in their arp-iclR intergenic regions. Sequence analyses of PCR amplified DNAs showed that five of the six loci used for LSPA-6 analysis had tandem repeats and the allele changes were due to changes in the number of tandem repeats. Subculturing and long-term incubation were found to have no detectable effect on the lineages defined by LSPA-6 analysis, demonstrating the robustness of LSPA-6 analysis. Minimum spanning tree reconstruction revealed that strains in lineages I, I/II, and II clustered on separate branches, indicating that the distribution of IS629 was biased among O157 strains in different lineages. Strains with LSPA-6 codes 231111, 211113, and 211114 had atypical amplicon sizes and clustered in the lineage I/II branch, and strains with LSPA-6 codes 212114, 221123, 221223, 222123, 222224, 242123, 252123, and 242222 had atypical amplicon sizes and clustered in lineage II branches. Linkage disequilibrium was observed in strains in every lineage when the standardized index of association was calculated using IS629 distribution data. Therefore, the distribution analysis of IS629 may be effective for population genetics analysis of O157 due to the biased IS629 distribution among strains in the three O157 lineages.
Copyright © 2010 Elsevier B.V. All rights reserved.
Jun-Jun Liu; Anna W. Schoettle; Richard A. Sniezko; Rona N. Sturrock; Arezoo Zamany; Holly Williams; Amanda Ha; Danelle Chan; Bob Danchok; Douglas P. Savin; Angelia Kegley
2016-01-01
Linkage of DNA markers with phenotypic traits provides essential information to dissect clustered genes with potential phenotypic contributions in a target genome region. Pinus flexilis E. James (limber pine) is a keystone five-needle pine species in mountain-top ecosystems of North America. White pine blister rust (WPBR), caused by a non-native fungal...
GHOST: global hepatitis outbreak and surveillance technology.
Longmire, Atkinson G; Sims, Seth; Rytsareva, Inna; Campo, David S; Skums, Pavel; Dimitrova, Zoya; Ramachandran, Sumathi; Medrzycki, Magdalena; Thai, Hong; Ganova-Raeva, Lilia; Lin, Yulin; Punkova, Lili T; Sue, Amanda; Mirabito, Massimo; Wang, Silver; Tracy, Robin; Bolet, Victor; Sukalac, Thom; Lynberg, Chris; Khudyakov, Yury
2017-12-06
Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections associated with unsafe injection practices, drug diversion, and other exposures to blood are difficult to detect and investigate. Effective HCV outbreak investigation requires comprehensive surveillance and robust case investigation. We previously developed and validated a methodology for the rapid and cost-effective identification of HCV transmission clusters. Global Hepatitis Outbreak and Surveillance Technology (GHOST) is a cloud-based system enabling users, regardless of computational expertise, to analyze and visualize transmission clusters in an independent, accurate and reproducible way. We present and explore performance of several GHOST implemented algorithms using next-generation sequencing data experimentally obtained from hypervariable region 1 of genetically related and unrelated HCV strains. GHOST processes data from an entire MiSeq run in approximately 3 h. A panel of seven specimens was used for preparation of six repeats of MiSeq libraries. Testing sequence data from these libraries by GHOST showed a consistent transmission linkage detection, testifying to high reproducibility of the system. Lack of linkage among genetically unrelated HCV strains and constant detection of genetic linkage between HCV strains from known transmission pairs and from follow-up specimens at different levels of MiSeq-read sampling indicate high specificity and sensitivity of GHOST in accurate detection of HCV transmission. GHOST enables automatic extraction of timely and relevant public health information suitable for guiding effective intervention measures. It is designed as a virtual diagnostic system intended for use in molecular surveillance and outbreak investigations rather than in research. 
The system produces accurate and reproducible information on HCV transmission clusters for all users, irrespective of their level of bioinformatics expertise. Improvement in molecular detection capacity will contribute to increasing the rate of transmission detection, thus providing opportunity for rapid, accurate and effective response to outbreaks of hepatitis C. Although GHOST was originally developed for hepatitis C surveillance, its modular structure is readily applicable to other infectious diseases. Worldwide availability of GHOST for the detection of HCV transmissions will foster deeper involvement of public health researchers and practitioners in hepatitis C outbreak investigation.
Medland, Sarah E; Loesch, Danuta Z; Mdzewski, Bogdan; Zhu, Gu; Montgomery, Grant W; Martin, Nicholas G
2007-01-01
The finger ridge count (a measure of pattern size) is one of the most heritable complex traits studied in humans and has been considered a model human polygenic trait in quantitative genetic analysis. Here, we report the results of the first genome-wide linkage scan for finger ridge count in a sample of 2,114 offspring from 922 nuclear families. Both univariate linkage to the absolute ridge count (a sum of all the ridge counts on all ten fingers), and multivariate linkage analyses of the counts on individual fingers, were conducted. The multivariate analyses yielded significant linkage to 5q14.1 (Logarithm of odds [LOD] = 3.34, pointwise-empirical p-value = 0.00025) that was predominantly driven by linkage to the ring, index, and middle fingers. The strongest univariate linkage was to 1q42.2 (LOD = 2.04, point-wise p-value = 0.002, genome-wide p-value = 0.29). In summary, the combination of univariate and multivariate results was more informative than simple univariate analyses alone. Patterns of quantitative trait loci factor loadings consistent with developmental fields were observed, and the simple pleiotropic model underlying the absolute ridge count was not sufficient to characterize the interrelationships between the ridge counts of individual fingers. PMID:17907812
Linkage Analysis of Quantitative Refraction and Refractive Errors in the Beaver Dam Eye Study
Duggal, Priya; Lee, Kristine E.; Cheng, Ching-Yu; Klein, Ronald; Bailey-Wilson, Joan E.; Klein, Barbara E. K.
2011-01-01
Purpose. Refraction, as measured by spherical equivalent, is the need for an external lens to focus images on the retina. While genetic factors play an important role in the development of refractive errors, few susceptibility genes have been identified. However, several regions of linkage have been reported for myopia (2q, 4q, 7q, 12q, 17q, 18p, 22q, and Xq) and for quantitative refraction (1p, 3q, 4q, 7p, 8p, and 11p). To replicate previously identified linkage peaks and to identify novel loci that influence quantitative refraction and refractive errors, linkage analysis of spherical equivalent, myopia, and hyperopia in the Beaver Dam Eye Study was performed. Methods. Nonparametric, sibling-pair, genome-wide linkage analyses of refraction (spherical equivalent adjusted for age, education, and nuclear sclerosis), myopia and hyperopia in 834 sibling pairs within 486 extended pedigrees were performed. Results. Suggestive evidence of linkage was found for hyperopia on chromosome 3, region q26 (empiric P = 5.34 × 10^-4), a region that had shown significant genome-wide evidence of linkage to refraction and some evidence of linkage to hyperopia. In addition, the analysis replicated previously reported genome-wide significant linkages to 22q11 of adjusted refraction and myopia (empiric P = 4.43 × 10^-3 and 1.48 × 10^-3, respectively) and to 7p15 of refraction (empiric P = 9.43 × 10^-4). Evidence was also found of linkage to refraction on 7q36 (empiric P = 2.32 × 10^-3), a region previously linked to high myopia. Conclusions. The findings provide further evidence that genes controlling refractive errors are located on 3q26, 7p15, 7q36, and 22q11. PMID:21571680
Di Gaspero, G; Cipriani, G; Adam-Blondon, A-F; Testolin, R
2007-05-01
Genetic maps functionally oriented towards disease resistance have been constructed in grapevine by simultaneous maximum-likelihood estimation of linkage among 502 markers, including microsatellites and resistance gene analogs (RGAs). Mapping material consisted of two pseudo-testcrosses, 'Chardonnay' x 'Bianca' and 'Cabernet Sauvignon' x '20/3', where the seed parents were Vitis vinifera genotypes and the male parents were Vitis hybrids carrying resistance to mildew diseases. Individual maps included 320-364 markers each. The simultaneous use of two mapping crosses made with two pairs of distantly related parents allowed mapping of as many as 91% of the markers tested. The integrated map included 420 Simple Sequence Repeat (SSR) markers that identified 536 SSR loci and 82 RGA markers that identified 173 RGA loci. This map consisted of 19 linkage groups (LGs) corresponding to the grape haploid chromosome number, had a total length of 1,676 cM and a mean distance between adjacent loci of 3.6 cM. Single-locus SSR markers were randomly distributed over the map (CD = 1.12). RGA markers were found in 18 of the 19 LGs but most of them (83%) were clustered on seven LGs, namely groups 3, 7, 9, 12, 13, 18 and 19. Several RGA clusters mapped to chromosomal regions where phenotypic traits of resistance to fungal diseases such as downy mildew and powdery mildew, bacterial diseases such as Pierce's disease, and pests such as dagger and root-knot nematode, were previously mapped in different segregating populations. The high number of RGA markers integrated into this new map will help find markers linked to genetic determinants of different pest and disease resistances in grape.
Chronic and Recurrent Otitis Media: A Genome Scan for Susceptibility Loci
Daly, Kathleen A.; Brown, W. Mark; Segade, Fernando; Bowden, Donald W.; Keats, Bronya J.; Lindgren, Bruce R.; Levine, Samuel C.; Rich, Stephen S.
2004-01-01
Otitis media (OM) is the most common childhood disease. Almost all children experience at least one episode, but morbidity is greatest in children who experience chronic/recurrent OM (COME/ROM). There is mounting evidence that COME/ROM clusters in families and exhibits substantial heritability. Subjects who had tympanostomy tube surgery for COME/ROM (probands) and their families were recruited for the present study, and an ear examination was performed, without knowledge of the subject’s history, to determine presence of OM sequelae. In addition, tympanometric testing was performed at three frequencies (226, 630 or 710, and 1,400 Hz) to detect abnormal middle-ear mechanics, and hearing was screened at 20 dB for the speech frequencies. Of these families, 121 had at least two individuals who had received the diagnosis of COME/ROM (364 affected and genotyped individuals), of whom 238 affected and informative relative pairs were used for analyses. Single-point nonparametric linkage analysis provided evidence of linkage of COME/ROM to chromosome 10q at marker D10S212 (LOD 3.78; P = 3.0×10^-5) and to chromosome 19q at marker D19S254 (LOD 2.61; P = 5.3×10^-4). Analyses conditional on support for linkage at chromosomes 10q and 19q resulted in a significant increase in LOD score support on chromosome 3p (between markers D3S4545 and D3S1259). These results suggest that risk of COME/ROM is determined by interactions between genes that reside in several candidate regions of the genome and are probably modulated by other environmental risk factors. PMID:15514890
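The abstract does not say how the pointwise P values were obtained from the LOD scores. One common convention, assumed here, treats 2 ln(10) × LOD as a chi-square statistic with 1 degree of freedom and reports its upper-tail probability. The sketch below applies that assumption to the two LOD scores quoted above; the function name is hypothetical.

```python
import math

def lod_to_pointwise_p(lod):
    """Upper-tail probability of a 1-df chi-square equal to 2 * ln(10) * LOD.

    For 1 degree of freedom, P(chi2 > x) = erfc(sqrt(x / 2)).
    """
    chi2 = 2.0 * math.log(10.0) * lod
    return math.erfc(math.sqrt(chi2 / 2.0))

# LOD scores quoted in the abstract for chromosomes 10q and 19q.
for marker, lod in [("D10S212", 3.78), ("D19S254", 2.61)]:
    print(f"{marker}: LOD {lod:.2f} -> P = {lod_to_pointwise_p(lod):.1e}")
```

Under this assumption the computed values agree with the reported P values to two significant figures. Some linkage software halves this probability to reflect the one-sided nature of the test, which would give values half as large.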
Linkage Analyses of Stimulant Dependence, Craving and Heavy Use in American Indians
Ehlers, Cindy L.; Gizer, Ian R.; Gilder, David A.; Wilhelmsen, Kirk C.
2011-01-01
Amphetamine-type substances are the second most widely used illicit drugs in the United States. There is evidence to suggest that stimulant use (cocaine and methamphetamine) has a heritable component, yet the areas of the genome underlying these use disorders are yet to be identified. This study’s aims were to map loci linked to stimulant dependence, heavy use, and craving in an American Indian community at high risk for substance dependence. DSM diagnosis of stimulant dependence, as well as indices of stimulant “craving” and “heavy use”, were obtained using the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA). Genotypes were determined for a panel of 791 micro-satellite polymorphisms in 381 members of multiplex families using SOLAR. Stimulant dependence, stimulant “craving” and “heavy stimulant use” were all found to be heritable. Analyses of multipoint variance component LOD scores failed to yield evidence of linkage for stimulant dependence. For the stimulant “craving” phenotype, linkage analysis revealed a locus that had a LOD score of 3.02 on chromosome 15q25.3-26.1 near the nicotinic receptor gene cluster. A LOD score of 2.05 was found at this same site for “heavy stimulant use”. Additional loci with LOD scores above 2.00 were found for stimulant “craving” on chromosomes 12p13.33-13.32 and 18q22.3. These results corroborate the importance of “craving” as a phenotype that is associated with regions on chromosomes 12, 15 and 18 that have been highlighted in prior segregation studies in this and other populations for substance dependence-related phenotypes. PMID:21812097
Gong, Wen-Bing; Li, Lei; Zhou, Yan; Bian, Yin-Bing; Kwan, Hoi-Shan; Cheung, Man-Kit; Xiao, Yang
2016-06-01
To provide a better understanding of the genetic architecture of fruiting body formation of Lentinula edodes, quantitative trait loci (QTLs) mapping was employed to uncover the loci underlying seven fruiting body-related traits (FBRTs). An improved L. edodes genetic linkage map, comprising 572 markers on 12 linkage groups with a total map length of 983.7 cM, was constructed by integrating 82 genomic sequence-based insertion-deletion (InDel) markers into a previously published map. We then detected a total of 62 QTLs for seven target traits across two segregating testcross populations, with individual QTLs contributing 5.5 %-30.2 % of the phenotypic variation. Fifty-three out of the 62 QTLs were clustered in six QTL hotspots, suggesting the existence of main genomic regions regulating the morphological characteristics of fruiting bodies in L. edodes. A stable QTL hotspot on MLG2, containing QTLs for all investigated traits, was identified in both testcross populations. QTLs for related traits were frequently co-located on the linkage groups, demonstrating the genetic basis for phenotypic correlation of traits. Meta-QTL (mQTL) analysis was performed and identified 16 mQTLs with refined positions and narrow confidence intervals (CIs). Nine genes, including those encoding MAP kinase, blue-light photoreceptor, riboflavin-aldehyde-forming enzyme and cyclopropane-fatty-acyl-phospholipid synthase, and cytochrome P450s, were likely to be candidate genes controlling the shape of fruiting bodies. The study has improved our understanding of the genetic architecture of fruiting body formation in L. edodes. To our knowledge, this is the first genome-wide QTL detection of FBRTs in L. edodes. The improved genetic map, InDel markers and QTL hotspot regions revealed here will assist considerably in the conduct of future genetic and breeding studies of L. edodes.
Rauscher, Gilda; Simko, Ivan
2013-01-22
Lettuce (Lactuca sativa L.) is the major crop from the group of leafy vegetables. Several types of molecular markers were developed that are effectively used in lettuce breeding and genetic studies. However, only a very limited number of microsatellite-based markers are publicly available. We have employed the method of enriched microsatellite libraries to develop 97 genomic SSR markers. Testing of newly developed markers on a set of 36 Lactuca accessions (33 L. sativa, and one each of L. serriola L., L. saligna L., and L. virosa L.) revealed that both the genetic heterozygosity (UHe = 0.56) and the number of loci per SSR (Na = 5.50) are significantly higher for genomic SSR markers than for previously developed EST-based SSR markers (UHe = 0.32, Na = 3.56). Fifty-four genomic SSR markers were placed on the molecular linkage map of lettuce. Distribution of markers in the genome appeared to be random, with the exception of a possible cluster on linkage group 6. Any combination of 32 genomic SSRs was able to distinguish genotypes of all 36 accessions. Fourteen of the newly developed SSR markers originate from fragments with high sequence similarity to resistance gene candidates (RGCs) and RGC pseudogenes. Analysis of molecular variance (AMOVA) of L. sativa accessions showed that approximately 3% of genetic diversity was within accessions, 79% among accessions, and 18% among horticultural types. The newly developed genomic SSR markers were added to the pool of previously developed EST-SSR markers. These two types of SSR-based markers provide useful tools for lettuce cultivar fingerprinting, development of integrated molecular linkage maps, and mapping of genes.
2013-01-01
Background Lettuce (Lactuca sativa L.) is the major crop from the group of leafy vegetables. Several types of molecular markers were developed that are effectively used in lettuce breeding and genetic studies. However, only a very limited number of microsatellite-based markers are publicly available. We have employed the method of enriched microsatellite libraries to develop 97 genomic SSR markers. Results Testing of newly developed markers on a set of 36 Lactuca accessions (33 L. sativa, and one each of L. serriola L., L. saligna L., and L. virosa L.) revealed that both the genetic heterozygosity (UHe = 0.56) and the number of loci per SSR (Na = 5.50) are significantly higher for genomic SSR markers than for previously developed EST-based SSR markers (UHe = 0.32, Na = 3.56). Fifty-four genomic SSR markers were placed on the molecular linkage map of lettuce. Distribution of markers in the genome appeared to be random, with the exception of a possible cluster on linkage group 6. Any combination of 32 genomic SSRs was able to distinguish genotypes of all 36 accessions. Fourteen of the newly developed SSR markers originate from fragments with high sequence similarity to resistance gene candidates (RGCs) and RGC pseudogenes. Analysis of molecular variance (AMOVA) of L. sativa accessions showed that approximately 3% of genetic diversity was within accessions, 79% among accessions, and 18% among horticultural types. Conclusions The newly developed genomic SSR markers were added to the pool of previously developed EST-SSR markers. These two types of SSR-based markers provide useful tools for lettuce cultivar fingerprinting, development of integrated molecular linkage maps, and mapping of genes. PMID:23339733
Panas, Robert M.
2016-06-23
This paper presents a new analytical method for predicting the large displacement behavior of flexural double parallelogram (DP) bearings with underconstraint eliminator (UE) linkages. This closed-form perturbative Euler analysis method is able, for the first time, to directly incorporate the elastomechanics of a discrete UE linkage, which is a hybrid flexure element that is linked to ground as well as both stages on the bearing. The models are used to understand a nested linkage UE design; however, the method is extensible to other UE linkages. Design rules and figures-of-merit are extracted from the analysis models, which provide powerful tools for accelerating the design process. The models, rules and figures-of-merit enable the rapid design of a UE for a desired large displacement behavior, as well as providing a means for determining the limits of UE and DP structures. This will aid in the adoption of UE linkages into DP bearings for precision mechanisms. Models are generated for a nested linkage UE design, and the performance of this DP with UE structure is compared to a DP-only bearing. As a result, the perturbative Euler analysis is shown to match existing theories for DP-only bearings with distributed compliance within ≈2%, and Finite Element Analysis for the DP with UE bearings within an average 10%.
Search for a schizophrenia susceptibility locus of human chromosome 22
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coon, H.; Hoff, M.; Holik, J.
1994-06-15
We used 10 highly informative DNA polymorphic markers and genetic linkage analysis to examine whether a gene locus predisposing to schizophrenia is located on chromosome 22, in 105 families with schizophrenia and schizoaffective disorder. The LOD score method, including analysis for heterogeneity, provided no conclusive evidence of linkage under a dominant, recessive, or penetrance free model of inheritance. Affected sib-pair analysis was inconclusive. Affected Pedigree Member (APM) analysis gave only suggestive evidence for linkage. Multipoint APM analysis, using 4 adjacent loci including D22S281 and IL2RB, a region of interest from the APM analysis, gave non-significant results for the three different weighting functions. 18 refs., 1 fig., 7 tabs.
González, Víctor M; Aventín, Núria; Centeno, Emilio; Puigdomènech, Pere
2014-12-17
Plant NBS-LRR resistance genes tend to be found in clusters, which have been shown to be hot spots of genome variability. In melon, half of the 81 predicted NBS-LRR genes group in nine clusters, and a 1 Mb region on linkage group V contains the highest density of R-genes and presence/absence gene polymorphisms found in the melon genome. This region is known to contain the locus of Vat, an agronomically important gene that confers resistance to aphids. However, the presence of duplications makes the sequencing and annotation of R-gene clusters difficult, usually resulting in multi-gapped sequences with higher than average errors. A 1-Mb sequence that contains the largest NBS-LRR gene cluster found in melon was improved using a strategy that combines Illumina paired-end mapping and PCR-based gap closing. Unknown sequence was decreased by 70% while about 3,000 SNPs and small indels were corrected. As a result, the annotations of 18 of a total of 23 NBS-LRR genes found in this region were modified, including additional coding sequences, amino acid changes, correction of splicing boundaries, or fusion of ORFs in common transcription units. A phylogeny analysis of the R-genes and their comparison with syntenic sequences in other cucurbits point to a pattern of local gene amplifications since the diversification of cucurbits from other families, and through speciation within the family. A candidate Vat gene is proposed based on the sequence similarity between a reported Vat gene from a Korean melon cultivar and a sequence fragment previously absent in the unrefined sequence. A sequence refinement strategy allowed substantial improvement of a 1 Mb fragment of the melon genome and the re-annotation of the largest cluster of NBS-LRR gene homologues found in melon. 
Analysis of the cluster revealed that resistance genes have been produced by sequence duplication in adjacent genome locations since the divergence of cucurbits from other close families, and through the process of speciation within the family. A candidate Vat gene was also identified using sequence previously unavailable, which demonstrates the advantages of genome assembly refinements when analyzing complex regions such as those containing clusters of highly similar genes.
Genotyping-by-sequencing enables linkage mapping in three octoploid cultivated strawberry families
Salinas, Natalia; Tennessen, Jacob A.; Zurn, Jason D.; Sargent, Daniel James; Hancock, James; Bassil, Nahla V.
2017-01-01
Genotyping-by-sequencing (GBS) was used to survey genome-wide single-nucleotide polymorphisms (SNPs) in three biparental strawberry (Fragaria × ananassa) populations with the goal of evaluating this technique in a species with a complex octoploid genome. GBS sequence data were aligned to the F. vesca ‘Fvb’ reference genome in order to call SNPs. Numbers of polymorphic SNPs per population ranged from 1,163 to 3,190. Linkage maps consisting of 30–65 linkage groups were produced from the SNP sets derived from each parent. The linkage groups covered 99% of the Fvb reference genome, with three to seven linkage groups from a given parent aligned to any particular chromosome. A phylogenetic analysis performed using the POLiMAPS pipeline revealed linkage groups that were most similar to ancestral species F. vesca for each chromosome. Linkage groups that were most similar to a second ancestral species, F. iinumae, were only resolved for Fvb 4. The quantity of missing data and heterogeneity in genome coverage inherent in GBS complicated the analysis, but POLiMAPS resolved F. × ananassa chromosomal regions derived from diploid ancestor F. vesca. PMID:28875078
Li, Bingshan; Leal, Suzanne M.
2008-01-01
Missing genotype data can increase false-positive evidence for linkage when either parametric or nonparametric analysis is carried out ignoring intermarker linkage disequilibrium (LD). Previously it was demonstrated by Huang et al. [1] that no bias occurs in this situation for affected sib-pairs with unrelated parents when either both parents are genotyped or genotype data is available for two additional unaffected siblings when parental genotypes are missing. However, this is not the case for autosomal recessive consanguineous pedigrees, where missing genotype data for any pedigree member within a consanguinity loop can increase false-positive evidence of linkage. False-positive evidence for linkage is further increased when cryptic consanguinity is present. The amount of false-positive evidence for linkage, and which family members aid in its reduction, is highly dependent on which family members are genotyped. When parental genotype data is available, the false-positive evidence for linkage is usually not as strong as when parental genotype data is unavailable. For a pedigree with an affected proband whose first-cousin parents have been genotyped, further reduction in the false-positive evidence of linkage can be obtained by including genotype data from additional affected siblings of the proband or genotype data from the proband's sibling-grandparents. When parental genotypes are unavailable, false-positive evidence for linkage can be reduced by including genotype data from either unaffected siblings of the proband or the proband's married-in-grandparents in the analysis. PMID:18073490
Genome-wide population structure and evolutionary history of the Frizarta dairy sheep.
Kominakis, A; Hager-Theodorides, A L; Saridaki, A; Antonakos, G; Tsiamis, G
2017-10-01
In the present study, we used genomic data, generated with a medium density single nucleotide polymorphisms (SNP) array, to acquire more information on the population structure and evolutionary history of the synthetic Frizarta dairy sheep. First, two typical measures of linkage disequilibrium (LD) were estimated at various physical distances that were then used to make inferences on the effective population size at key past time points. Population structure was also assessed by both multidimensional scaling analysis and k-means clustering on the distance matrix obtained from the animals' genomic relationships. Wright's fixation index FST was also employed to assess herds' genetic homogeneity and to indirectly estimate past migration rates. Wright's fixation index FIS and genomic inbreeding coefficients based on the genomic relationship matrix as well as on runs of homozygosity were also estimated. The Frizarta breed displays relatively low LD levels with r² and |D'| equal to 0.18 and 0.50, respectively, at an average inter-marker distance of 31 kb. Linkage disequilibrium decayed rapidly by distance and persisted over just a few thousand base pairs. Rate of LD decay (β) varied widely among the 26 autosomes with larger values estimated for shorter chromosomes (e.g. β=0.057, for OAR6) and smaller values for longer ones (e.g. β=0.022, for OAR2). The inferred effective population size at the beginning of the breed's formation was as high as 549, was then reduced to 463 in 1981 (end of the breed's formation) and further declined to 187, one generation ago. Multidimensional scaling analysis and k-means clustering suggested a genetically homogeneous population, FST estimates indicated relatively low genetic differentiation between herds, whereas a heat map of the animals' genomic kinship relationships revealed a stratified population, at a herd level. 
Estimates of genomic inbreeding coefficients suggested that most recent parental relatedness may have been a major determinant of the current effective population size. A SNP panel denser than 50k may be more beneficial when performing genome-wide association studies in the breed.
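The two LD measures named in the abstract above, r² and |D'|, can be computed for a pair of biallelic SNPs from haplotype and allele frequencies. The following is a minimal sketch using the standard textbook definitions; the function name `ld_measures` and the example frequencies are illustrative assumptions, not code or data from the Frizarta study.

```python
def ld_measures(p_ab, p_a, p_b):
    """Return (r2, abs_d_prime) for two biallelic loci.

    p_ab : frequency of the haplotype carrying allele A and allele B
    p_a  : frequency of allele A at the first locus
    p_b  : frequency of allele B at the second locus
    All names and inputs are hypothetical, for illustration only.
    """
    if not (0.0 < p_a < 1.0 and 0.0 < p_b < 1.0):
        raise ValueError("both loci must be polymorphic")

    # Raw disequilibrium coefficient D.
    d = p_ab - p_a * p_b

    # Squared correlation between alleles at the two loci.
    r2 = d * d / (p_a * (1.0 - p_a) * p_b * (1.0 - p_b))

    # D' rescales D by its maximum attainable magnitude given the allele
    # frequencies; the bound depends on the sign of D.
    if d >= 0:
        d_max = min(p_a * (1.0 - p_b), (1.0 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1.0 - p_a) * (1.0 - p_b))
    abs_d_prime = abs(d) / d_max

    return r2, abs_d_prime


# Made-up example: both alleles at frequency 0.5, AB haplotype at 0.4.
r2, abs_d_prime = ld_measures(0.4, 0.5, 0.5)
print(f"r2 = {r2:.2f}, |D'| = {abs_d_prime:.2f}")
```

Because |D'| is normalized by the largest value D can take for the given allele frequencies, it is typically at least as large as r² for the same pair of markers, which is consistent with the ordering of the two values reported in the abstract.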
Ulgen, Ayse; Han, Zhihua; Li, Wentian
2003-12-31
We address the question of whether statistical correlations among quantitative traits lead to correlation of linkage results of these traits. Five measured quantitative traits (total cholesterol, fasting glucose, HDL cholesterol, blood pressure, and triglycerides), and one derived quantitative trait (total cholesterol divided by the HDL cholesterol) are used for phenotype correlation studies. Four of them are used for linkage analysis. We show that although correlation among phenotypes partially reflects the correlation among linkage analysis results, the LOD-score correlations are on average low. The most significant peaks found by using different traits do not often overlap. Studying covariances at specific locations in LOD scores may provide clues for further bivariate linkage analyses.
DEBRIS DISKS OF MEMBERS OF THE BLANCO 1 OPEN CLUSTER
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stauffer, John R.; Noriega-Crespo, Alberto; Rebull, Luisa M.
2010-08-20
We have used the Spitzer Space Telescope to obtain Multiband Imaging Photometer for Spitzer (MIPS) 24 μm photometry for 37 members of the ~100 Myr old open cluster Blanco 1. For the brightest 25 of these stars (where we have 3σ uncertainties less than 15%), we find significant mid-IR excesses for eight stars, corresponding to a debris disk detection frequency of about 32%. The stars with excesses include two A stars, four F dwarfs, and two G dwarfs. The most significant linkage between 24 μm excess and any other stellar property for our Blanco 1 sample of stars is with binarity. Blanco 1 members that are photometric binaries show few or no detected 24 μm excesses whereas a quarter of the apparently single Blanco 1 members do have excesses. We have examined the MIPS data for two other clusters of similar age to Blanco 1: NGC 2547 and the Pleiades. The AFGK photometric binary star members of both of these clusters also show a much lower frequency of 24 μm excesses compared to stars that lie near the single-star main sequence. We provide a new determination of the relation between the V - K_s color and K_s - [24] color for main sequence photospheres based on Hyades members observed with MIPS. As a result of our analysis of the Hyades data, we identify three low mass Hyades members as candidates for having debris disks near the MIPS detection limit.
Tabb, Keri L.; Hellwege, Jacklyn N.; Palmer, Nicholette D.; Dimitrov, Latchezar; Sajuthi, Satria; Taylor, Kent D.; NG, Maggie C.Y.; Hawkins, Gregory A.; Chen, Yii-Der Ida; Brown, W. Mark; McWilliams, David; Williams, Adrienne; Lorenzo, Carlos; Norris, Jill M.; Long, Jirong; Rotter, Jerome I.; Curran, Joanne E.; Blangero, John; Wagenknecht, Lynne E.; Langefeld, Carl D.; Bowden, Donald W.
2017-01-01
Summary Family-based methods are a potentially powerful tool to identify trait-defining genetic variants in extended families, particularly when used to complement conventional association analysis. We utilized two-point linkage analysis and single variant association analysis to evaluate whole exome sequencing (WES) data from 1,205 Hispanic Americans (78 families) from the Insulin Resistance Atherosclerosis Family Study. WES identified 211,612 variants above the minor allele frequency threshold of ≥0.005. These variants were tested for linkage and/or association with 50 cardiometabolic traits after quality control checks. Two-point linkage analysis yielded 10,580,600 LOD scores with 1,148 LOD scores ≥3, 183 LOD scores ≥4, and 29 LOD scores ≥5. The maximal novel LOD score was 5.50 for rs2289043:T>C, in UNC5C with subcutaneous adipose tissue volume. Association analysis identified 13 variants attaining genome-wide significance (p<5×10⁻⁸), with the strongest association between rs651821:C>T in APOA5, and triglyceride levels (p=3.67×10⁻¹⁰). Overall, there was a 5.2-fold increase in the number of informative variants detected by WES compared to exome chip analysis in this population, nearly 30% of which were novel variants relative to dbSNP build 138. Thus, integration of results from two-point linkage and single-variant association analysis from WES data enabled identification of novel signals potentially contributing to cardiometabolic traits. PMID:28067407
Linkage studies in primary open angle glaucoma
DOE Office of Scientific and Technical Information (OSTI.GOV)
Avramopoulos, D.; Grigoriadu, M.; Kitsos, G.
1994-09-01
Glaucoma is a leading cause of blindness worldwide. The majority of glaucoma is associated with an open, normal appearing anterior chamber angle and is termed primary open angle glaucoma (POAG, MIM 137760). It is characterized by elevated intraocular pressure and onset in middle age or later. A subset of POAG with juvenile onset has recently been linked to chromosome 1q in two families with autosomal dominant inheritance. Eleven pedigrees with autosomal dominant POAG (non-juvenile-onset) have been identified in Epirus, Greece. In the present study DNA samples have been collected from 50 individuals from one large pedigree, including 12 affected individuals. Preliminary results of linkage analysis with chromosome 1 microsatellites using the computer program package LINKAGE Version 5.1 showed no linkage with the markers previously linked to juvenile-onset POAG. Further linkage analysis is being pursued, and the results will be presented.
No evidence for linkage between the X-chromosome marker DXS7 and schizophrenia
DOE Office of Scientific and Technical Information (OSTI.GOV)
Okoro, C.; Bell, R.; Sham, P.
DeLisi et al. have examined the X and Y chromosomes for linkage to schizophrenia in 126 small families and report a small positive LOD score for the marker DXS7, adjacent to the MAO locus at Xp11.4-11.3. Because of this, we have examined DXS7 for linkage to schizophrenia using 17 pedigrees in which male-to-male transmission of schizophrenia was absent. Alleles at DXS7 were genotyped using PCR, and LOD scores were calculated using five models of inheritance, including classical dominant, recessive and intermediate models. LOD scores were substantially negative for all models examined and analysis for linkage heterogeneity using the LOD2 method showed no significance. Analysis by the nonparametric affected sib-pair method likewise indicated no linkage. We conclude that DXS7 is not a major locus for schizophrenia in our collection of pedigrees. 29 refs., 1 tab.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, L.; Forsell, C.; Lilius, L.
1996-05-31
An association between the ε4 allele of the apolipoprotein E gene (APOE) and late-onset Alzheimer's disease (AD) was recently demonstrated. In order to confirm the association and to gauge the ability of standard genetic linkage methods to identify susceptibility genes, we investigated 15 Swedish late-onset AD families. We found an association of familial AD to the APOE ε4 allele (P = 0.01) but no indication of linkage to the APOE region using 2-point linkage analysis, and only weak evidence using the affected pedigree-member (APM) method. Our results confirm an APOE ε4 association with late-onset familial AD and indicate that susceptibility genes can easily be missed when using standard lod score and APM genetic linkage analysis. 19 refs., 1 fig., 4 tabs.
1980-11-01
Finster, E. Sinn, R. N. Grimes. N0001475--0305. Unclassified. TR-35...David C. Finster, Ekk Sinn, Russell N. Grimes, Department of Chemistry, University of Virginia, Charlottesville, Va. 22901. Prepared for publication in...a Commo-Metallacarborane. Synthesis and Structure of a Fluxi... Metal-Boron Cluster, [n5C 5 (CCB3)512HCo3(C13)4C4B8H7 David C. Finster, Ekk Sinn, and
Amin, Najaf; Hottenga, Jouke-Jan; Hansell, Narelle K; Janssens, A Cecile JW; de Moor, Marleen HM; Madden, Pamela AF; Zorkoltseva, Irina V; Penninx, Brenda W; Terracciano, Antonio; Uda, Manuela; Tanaka, Toshiko; Esko, Tonu; Realo, Anu; Ferrucci, Luigi; Luciano, Michelle; Davies, Gail; Metspalu, Andres; Abecasis, Goncalo R; Deary, Ian J; Raikkonen, Katri; Bierut, Laura J; Costa, Paul T; Saviouk, Viatcheslav; Zhu, Gu; Kirichenko, Anatoly V; Isaacs, Aaron; Aulchenko, Yurii S; Willemsen, Gonneke; Heath, Andrew C; Pergadia, Michele L; Medland, Sarah E; Axenovich, Tatiana I; de Geus, Eco; Montgomery, Grant W; Wright, Margaret J; Oostra, Ben A; Martin, Nicholas G; Boomsma, Dorret I; van Duijn, Cornelia M
2013-01-01
Personality traits are complex phenotypes related to psychosomatic health. Individually, various gene finding methods have not achieved much success in finding genetic variants associated with personality traits. We performed a meta-analysis of four genome-wide linkage scans (N=6149 subjects) of five basic personality traits assessed with the NEO Five-Factor Inventory. We compared the significant regions from the meta-analysis of linkage scans with the results of a meta-analysis of genome-wide association studies (GWAS) (N∼17 000). We found significant evidence of linkage of neuroticism to chromosome 3p14 (rs1490265, LOD=4.67) and to chromosome 19q13 (rs628604, LOD=3.55); of extraversion to 14q32 (ATGG002, LOD=3.3); and of agreeableness to 3p25 (rs709160, LOD=3.67) and to two adjacent regions on chromosome 15, including 15q13 (rs970408, LOD=4.07) and 15q14 (rs1055356, LOD=3.52) in the individual scans. In the meta-analysis, we found strong evidence of linkage of extraversion to 4q34, 9q34, 10q24 and 11q22, openness to 2p25, 3q26, 9p21, 11q24, 15q26 and 19q13 and agreeableness to 4q34 and 19p13. Significant evidence of association in the GWAS was detected between openness and rs677035 at 11q24 (P-value=2.6 × 10−06, KCNJ1). The findings of our linkage meta-analysis and those of the GWAS suggest that 11q24 is a susceptible locus for openness, with KCNJ1 as the possible candidate gene. PMID:23211697
Fundamental Investigations of Durability at a Polymer Electrolyte-Electrode Interface
2008-04-01
[Equation residue: (σ_before − σ_after)/σ_before] Cleavage of the side chain ether linkage (Fig. 3), which intrudes into the hydrophilic ionic cluster...directly correlated to peroxide yields measured Figure 3: ATR-FTIR spectrum of Nafion® 112 (H-form) indicating absorption bands obtained using...electrocatalyst-based fuel cell electrode (referred to as the sacrificial electrode) directly into the liquid electrolyte, in which oxygen reduction was
Han, Andrew W.; Sandy, Moriah; Fishman, Brian; Trindade-Silva, Amaro E.; Soares, Carlos A. G.; Distel, Daniel L.; Butler, Alison; Haygood, Margo G.
2013-01-01
Shipworms are marine bivalve mollusks (Family Teredinidae) that use wood for shelter and food. They harbor a group of closely related, yet phylogenetically distinct, bacterial endosymbionts in bacteriocytes located in the gills. This endosymbiotic community is believed to support the host's nutrition in multiple ways, through the production of cellulolytic enzymes and the fixation of nitrogen. The genome of the shipworm endosymbiont Teredinibacter turnerae T7901 was recently sequenced and, in addition to the potential for cellulolytic enzymes and diazotrophy, the genome also revealed a rich potential for secondary metabolites. With nine distinct biosynthetic gene clusters, nearly 7% of the genome is dedicated to secondary metabolites. Bioinformatic analyses predict that one of the gene clusters is responsible for the production of a catecholate siderophore. Here we describe this gene cluster in detail and present the siderophore product from this cluster. Genes similar to the entCEBA genes of enterobactin biosynthesis involved in the production and activation of dihydroxybenzoic acid (DHB) are present in this cluster, as well as a two-module non-ribosomal peptide synthetase (NRPS). A novel triscatecholate siderophore, turnerbactin, was isolated from the supernatant of iron-limited T. turnerae T7901 cultures. Turnerbactin is a trimer of N-(2,3-DHB)-L-Orn-L-Ser with the three monomeric units linked by Ser ester linkages. A monomer, dimer, dehydrated dimer, and dehydrated trimer of 2,3-DHB-L-Orn-L-Ser were also found in the supernatant. A link between the gene cluster and siderophore product was made by constructing an NRPS mutant, TtAH03. Siderophores could not be detected in cultures of TtAH03 by HPLC analysis and Fe-binding activity of culture supernatant was significantly reduced. Regulation of the pathway by iron is supported by identification of putative Fur box sequences and observation of increased Fe-binding activity under iron restriction. 
Evidence of a turnerbactin fragment was found in shipworm extracts, suggesting the production of turnerbactin in the symbiosis. PMID:24146831
Leu, Costin; de Kovel, Carolien G F; Zara, Federico; Striano, Pasquale; Pezzella, Marianna; Robbiano, Angela; Bianchi, Amedeo; Bisulli, Francesca; Coppola, Antonietta; Giallonardo, Anna Teresa; Beccaria, Francesca; Trenité, Dorothée Kasteleijn-Nolst; Lindhout, Dick; Gaus, Verena; Schmitz, Bettina; Janz, Dieter; Weber, Yvonne G; Becker, Felicitas; Lerche, Holger; Kleefuss-Lie, Ailing A; Hallman, Kerstin; Kunz, Wolfram S; Elger, Christian E; Muhle, Hiltrud; Stephani, Ulrich; Møller, Rikke S; Hjalgrim, Helle; Mullen, Saul; Scheffer, Ingrid E; Berkovic, Samuel F; Everett, Kate V; Gardiner, Mark R; Marini, Carla; Guerrini, Renzo; Lehesjoki, Anna-Elina; Siren, Auli; Nabbout, Rima; Baulac, Stephanie; Leguern, Eric; Serratosa, Jose M; Rosenow, Felix; Feucht, Martha; Unterberger, Iris; Covanis, Athanasios; Suls, Arvid; Weckhuysen, Sarah; Kaneva, Radka; Caglayan, Hande; Turkdogan, Dilsad; Baykan, Betul; Bebek, Nerses; Ozbek, Ugur; Hempelmann, Anne; Schulz, Herbert; Rüschendorf, Franz; Trucks, Holger; Nürnberg, Peter; Avanzini, Giuliano; Koeleman, Bobby P C; Sander, Thomas
2012-02-01
Genetic generalized epilepsies (GGEs) have a lifetime prevalence of 0.3% with heritability estimates of 80%. A considerable proportion of families with siblings affected by GGEs presumably display an oligogenic inheritance. The present genome-wide linkage meta-analysis aimed to map: (1) susceptibility loci shared by a broad spectrum of GGEs, and (2) seizure type-related genetic factors preferentially predisposing to either typical absence or myoclonic seizures, respectively. Meta-analysis of three genome-wide linkage datasets was carried out in 379 GGE-multiplex families of European ancestry including 982 relatives with GGEs. To dissect out seizure type-related susceptibility genes, two family subgroups were stratified comprising 235 families with predominantly genetic absence epilepsies (GAEs) and 118 families with an aggregation of juvenile myoclonic epilepsy (JME). To map shared and seizure type-related susceptibility loci, both nonparametric loci (NPL) and parametric linkage analyses were performed for a broad trait model (GGEs) in the entire set of GGE-multiplex families and a narrow trait model (typical absence or myoclonic seizures) in the subgroups of JME and GAE families. For the entire set of 379 GGE-multiplex families, linkage analysis revealed six loci achieving suggestive evidence for linkage at 1p36.22, 3p14.2, 5q34, 13q12.12, 13q31.3, and 19q13.42. The linkage finding at 5q34 was consistently supported by both NPL and parametric linkage results across all three family groups. A genome-wide significant nonparametric logarithm of odds score of 3.43 was obtained at 2q34 in 118 JME families. Significant parametric linkage to 13q31.3 was found in 235 GAE families assuming recessive inheritance (heterogeneity logarithm of odds = 5.02). Our linkage results support an oligogenic predisposition of familial GGE syndromes. 
The genetic risk factor at 5q34 confers risk to a broad spectrum of familial GGE syndromes, whereas susceptibility loci at 2q34 and 13q31.3 preferentially predispose to myoclonic seizures or absence seizures, respectively. Phenotype-genotype strategies applying narrow trait definitions in phenotypic homogeneous subgroups of families improve the prospects of disentangling the genetic basis of common familial GGE syndromes. Wiley Periodicals, Inc. © 2012 International League Against Epilepsy.
MytiBase: a knowledgebase of mussel (M. galloprovincialis) transcribed sequences
Venier, Paola; De Pittà, Cristiano; Bernante, Filippo; Varotto, Laura; De Nardi, Barbara; Bovo, Giuseppe; Roch, Philippe; Novoa, Beatriz; Figueras, Antonio; Pallavicini, Alberto; Lanfranchi, Gerolamo
2009-01-01
Background Although bivalves are among the most studied marine organisms due to their ecological role, economic importance and use in pollution biomonitoring, very little information is available on the genome sequences of mussels. This study reports the functional analysis of a large-scale Expressed Sequence Tag (EST) sequencing from different tissues of Mytilus galloprovincialis (the Mediterranean mussel) challenged with toxic pollutants, temperature and potentially pathogenic bacteria. Results We have constructed and sequenced seventeen cDNA libraries from different Mediterranean mussel tissues: gills, digestive gland, foot, anterior and posterior adductor muscle, mantle and haemocytes. A total of 24,939 clones were sequenced from these libraries generating 18,788 high-quality ESTs which were assembled into 2,446 overlapping clusters and 4,666 singletons resulting in a total of 7,112 non-redundant sequences. In particular, a high-quality normalized cDNA library (Nor01) was constructed as determined by the high rate of gene discovery (65.6%). Bioinformatic screening of the non-redundant M. galloprovincialis sequences identified 159 microsatellite-containing ESTs. Clusters, consensuses, related similarities and gene ontology searches have been organized in a dedicated, searchable database. Conclusion We defined the first species-specific catalogue of M. galloprovincialis ESTs including 7,112 unique transcribed sequences. Putative microsatellite markers were identified. This annotated catalogue represents a valuable platform for expression studies, marker validation and genetic linkage analysis for investigations in the biology of Mediterranean mussels. PMID:19203376
DOE Office of Scientific and Technical Information (OSTI.GOV)
Escamilla, M.A.; Reus, V.I.; Smith, L.B.
1996-05-31
Linkage disequilibrium (LD) analysis provides a powerful means for screening the genome to map the location of disease genes, such as those for bipolar disorder (BP). As described in this paper, the population of the Central Valley of Costa Rica, which is descended from a small number of founders, should be suitable for LD mapping; this assertion is supported by reconstruction of extended haplotypes shared by distantly related individuals in this population suffering low-frequency hearing loss (LFHL1), which has previously been mapped by linkage analysis. A sampling strategy is described for applying LD methods to map genes for BP, and clinical and demographic characteristics of an initially collected sample are discussed. This sample will provide a complement to a previously collected set of Costa Rican BP families which is under investigation using standard linkage analysis. 42 refs., 4 figs., 2 tabs.
Hellwege, Jacklyn N; Palmer, Nicholette D; Brown, W Mark; Ziegler, Julie T; An, S Sandy; Guo, Xiuqing; Chen, Y-D Ida; Taylor, Kent; Hawkins, Gregory A; Ng, Maggie C Y; Speliotes, Elizabeth K; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Wagenknecht, Lynne E; Langefeld, Carl D; Bowden, Donald W
2015-02-01
We previously identified a low-frequency (1.1%) coding variant (G45R; rs200573126) in the adiponectin gene (ADIPOQ) which was the basis for a multipoint microsatellite linkage signal (LOD = 8.2) for plasma adiponectin levels in Hispanic families. We have empirically evaluated the ability of data from targeted common variants, exome chip genotyping, and genome-wide association study data to detect linkage and association to adiponectin protein levels at this locus. Simple two-point linkage and association analyses were performed in 88 Hispanic families (1,150 individuals) using 10,958 SNPs on chromosome 3. Approaches were compared for their ability to map the functional variant, G45R, which was strongly linked (two-point LOD = 20.98) and powerfully associated (p value = 8.1 × 10⁻⁵⁰). Over 450 SNPs within a broad 61 Mb interval around rs200573126 showed nominal evidence of linkage (LOD > 3) but only four other SNPs in this region were associated with p values < 1.0 × 10⁻⁴. When G45R was accounted for, the maximum LOD score across the interval dropped to 4.39 and the best p value was 1.1 × 10⁻⁵. Linked and/or associated variants ranged in frequency (0.0018-0.50) and type (coding, non-coding) and had little detectable linkage disequilibrium with rs200573126 (r² < 0.20). In addition, the two-point linkage approach empirically outperformed multipoint microsatellite and multipoint SNP analysis. In the absence of data for rs200573126, family-based linkage analysis using a moderately dense SNP dataset, including both common and low-frequency variants, resulted in stronger evidence for an adiponectin locus than association data alone. Thus, linkage analysis can be a useful tool to facilitate identification of high-impact genetic variants.
Genome-wide analysis of the genetic regulation of gene expression in human neutrophils
Andiappan, Anand Kumar; Melchiotti, Rossella; Poh, Tuang Yeow; Nah, Michelle; Puan, Kia Joo; Vigano, Elena; Haase, Doreen; Yusof, Nurhashikin; San Luis, Boris; Lum, Josephine; Kumar, Dilip; Foo, Shihui; Zhuang, Li; Vasudev, Anusha; Irwanto, Astrid; Lee, Bernett; Nardin, Alessandra; Liu, Hong; Zhang, Furen; Connolly, John; Liu, Jianjun; Mortellaro, Alessandra; Wang, De Yun; Poidinger, Michael; Larbi, Anis; Zolezzi, Francesca; Rotzschke, Olaf
2015-01-01
Neutrophils are an abundant immune cell type involved in both antimicrobial defence and autoimmunity. The regulation of their gene expression, however, is still largely unknown. Here we report an eQTL study on isolated neutrophils from 114 healthy individuals of Chinese ethnicity, identifying 21,210 eQTLs on 832 unique genes. Unsupervised clustering analysis of these eQTLs confirms their role in inflammatory responses and immunological diseases but also indicates strong involvement in dermatological pathologies. One of the strongest eQTL identified (rs2058660) is also the tagSNP of a linkage block reported to affect leprosy and Crohn's disease in opposite directions. In a functional study, we can link the C allele with low expression of the β-chain of IL18-receptor (IL18RAP). In neutrophils, this results in a reduced responsiveness to IL-18, detected both on the RNA and protein level. Thus, the polymorphic regulation of human neutrophils can impact beneficial as well as pathological inflammatory responses. PMID:26259071
Potential linkage for schizophrenia on chromosome 22q12-q13: A replication study
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schwab, S.G.; Bondy, B.; Wildenauer, D.B.
1995-10-09
In an attempt to replicate a potential linkage on chromosome 22q12-q13.1 reported by Pulver et al., we have analyzed 4 microsatellite markers which span this chromosomal region, including the IL2RB locus, for linkage with schizophrenia in 30 families from Israel and Germany. Linkage analysis by pairwise lod score analysis as well as by multipoint analysis did not provide evidence for a single major gene locus. However, a lod score of Z_max = 0.612 was obtained for a dominant model of inheritance with the marker D22S304 at recombination fraction 0.2 by pairwise analysis. In addition, using a nonparametric method, sib pair analysis, a P value of 0.068 corresponding to a lod score of 0.48 was obtained for this marker. This finding, together with those of Pulver et al., is suggestive of a genetic factor in this region, predisposing for schizophrenia in a subset of families. Further studies using nonparametric methods should be conducted in order to clarify this point. 32 refs., 1 fig., 4 tabs.
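The correspondence quoted above between a P value of 0.068 and a lod score of 0.48 can be reproduced with a commonly used conversion. The sketch below assumes, and this is an assumption rather than something the abstract states, that the P value is one-sided, that it maps to a 1-degree-of-freedom chi-square statistic via the squared normal quantile, and that LOD = χ² / (2 ln 10). The function names `p_to_lod` and `lod_to_p` are hypothetical.

```python
from math import log, sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()        # standard normal, mean 0 and sd 1
_TWO_LN_10 = 2.0 * log(10.0)      # factor linking chi-square and LOD


def p_to_lod(p_one_sided):
    """Convert a one-sided pointwise p-value to an equivalent LOD score.

    Assumed convention: z = upper quantile of the standard normal,
    chi-square (1 df) = z**2, LOD = chi-square / (2 ln 10).
    """
    if not 0.0 < p_one_sided <= 0.5:
        raise ValueError("expected a one-sided p-value in (0, 0.5]")
    z = _STD_NORMAL.inv_cdf(1.0 - p_one_sided)
    return z * z / _TWO_LN_10


def lod_to_p(lod):
    """Inverse of p_to_lod under the same assumed convention."""
    if lod < 0.0:
        raise ValueError("LOD must be non-negative for this conversion")
    z = sqrt(_TWO_LN_10 * lod)
    return 1.0 - _STD_NORMAL.cdf(z)


# The sib pair result quoted in the abstract above.
print(f"{p_to_lod(0.068):.2f}")
```

Under the same convention, the classical LOD threshold of 3 corresponds to a pointwise one-sided p-value of roughly 0.0001, which is one way to see how far the reported result is from conventional significance.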
Whitwell, Jennifer L; Przybelski, Scott A; Weigand, Stephen D; Ivnik, Robert J; Vemuri, Prashanthi; Gunter, Jeffrey L; Senjem, Matthew L; Shiung, Maria M; Boeve, Bradley F; Knopman, David S; Parisi, Joseph E; Dickson, Dennis W; Petersen, Ronald C; Jack, Clifford R; Josephs, Keith A
2009-11-01
The behavioural variant of frontotemporal dementia is a progressive neurodegenerative syndrome characterized by changes in personality and behaviour. It is typically associated with frontal lobe atrophy, although patterns of atrophy are heterogeneous. The objective of this study was to examine case-by-case variability in patterns of grey matter atrophy in subjects with the behavioural variant of frontotemporal dementia and to investigate whether behavioural variant of frontotemporal dementia can be divided into distinct anatomical subtypes. Sixty-six subjects that fulfilled clinical criteria for a diagnosis of the behavioural variant of frontotemporal dementia with a volumetric magnetic resonance imaging scan were identified. Grey matter volumes were obtained for 26 regions of interest, covering frontal, temporal and parietal lobes, striatum, insula and supplemental motor area, using the automated anatomical labelling atlas. Regional volumes were divided by total grey matter volume. A hierarchical agglomerative cluster analysis using Ward's clustering linkage method was performed to cluster the behavioural variant of frontotemporal dementia subjects into different anatomical clusters. Voxel-based morphometry was used to assess patterns of grey matter loss in each identified cluster of subjects compared to an age and gender-matched control group at P < 0.05 (family-wise error corrected). We identified four potentially useful clusters with distinct patterns of grey matter loss, which we posit represent anatomical subtypes of the behavioural variant of frontotemporal dementia. Two of these subtypes were associated with temporal lobe volume loss, with one subtype showing loss restricted to temporal lobe regions (temporal-dominant subtype) and the other showing grey matter loss in the temporal lobes as well as frontal and parietal lobes (temporofrontoparietal subtype). 
Another two subtypes were characterized by a large amount of frontal lobe volume loss, with one subtype showing grey matter loss in the frontal lobes as well as loss of the temporal lobes (frontotemporal subtype) and the other subtype showing loss relatively restricted to the frontal lobes (frontal-dominant subtype). These four subtypes differed on clinical measures of executive function, episodic memory and confrontation naming. There were also associations between the four subtypes and genetic or pathological diagnoses which were obtained in 48% of the cohort. The clusters did not differ in behavioural severity as measured by the Neuropsychiatric Inventory, supporting the original classification of the behavioural variant of frontotemporal dementia in these subjects. Our findings suggest that the behavioural variant of frontotemporal dementia can therefore be subdivided into four different anatomical subtypes.
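The clustering step described in this abstract (regional volumes divided by total grey matter volume, then agglomerative clustering with Ward's linkage) can be sketched as follows. This is a minimal illustration on made-up numbers, not the authors' pipeline: the three "regions", the six "subjects" and the function name are hypothetical, and the real study used 26 regions and 66 subjects.

```python
def ward_clusters(rows, k):
    """Agglomerate rows (lists of floats) into k clusters with Ward's criterion.

    At each step the pair of clusters whose merge gives the smallest
    increase in within-cluster sum of squares is merged:
        cost = n_a * n_b / (n_a + n_b) * ||centroid_a - centroid_b||^2
    """
    clusters = [([i], list(r)) for i, r in enumerate(rows)]  # (members, centroid)
    while len(clusters) > k:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                (ma, ca), (mb, cb) = clusters[a], clusters[b]
                na, nb = len(ma), len(mb)
                d2 = sum((x - y) ** 2 for x, y in zip(ca, cb))
                cost = na * nb / (na + nb) * d2
                if best is None or cost < best[0]:
                    best = (cost, a, b)
        _, a, b = best
        (ma, ca), (mb, cb) = clusters[a], clusters[b]
        na, nb = len(ma), len(mb)
        centroid = [(na * x + nb * y) / (na + nb) for x, y in zip(ca, cb)]
        clusters = [c for i, c in enumerate(clusters) if i not in (a, b)]
        clusters.append((ma + mb, centroid))
    return sorted(sorted(members) for members, _ in clusters)


# Hypothetical volumes per subject: frontal, temporal, parietal, total grey matter.
raw = [
    [90, 140, 110, 600],
    [88, 138, 112, 590],
    [92, 142, 108, 610],
    [140, 85, 110, 600],
    [138, 88, 112, 595],
    [142, 83, 109, 605],
]
# Divide each regional volume by the subject's total grey matter volume.
normalized = [[v / row[3] for v in row[:3]] for row in raw]
subtypes = ward_clusters(normalized, k=2)
print(subtypes)
```

In this toy data the first three subjects have relatively low frontal volume and the last three relatively low temporal volume, so the two recovered clusters separate along that contrast.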
Case, Cheryl; Kandola, Kami; Chui, Linda; Li, Vincent; Nix, Nancy; Johnson, Rhonda
2013-01-01
Background: Tuberculosis (TB) is an important public health problem in the Northwest Territories (NWT), particularly among Canadian Aboriginal people. Objective: To analyse the transmission patterns of tuberculosis among the population living in the NWT, a territorial jurisdiction located within Northern Canada. Methods: This population-based retrospective study examined the DNA fingerprints of all laboratory confirmed cases of TB in the NWT, Canada, between 1990 and 2009. An isolate from each lab-confirmed case was genotyped using IS6110 Restriction Fragment Length Polymorphism. DNA patterns were assigned to each DNA fingerprint, and indistinguishable fingerprint patterns were assigned to a cluster. Social network analysis (SNA) was used to examine direct linkages among cases determined through conventional contact tracing (CCT), their DNA fingerprint and home community. Results: Of the 225 lab-confirmed cases identified, the study was limited to 195 subjects due to DNA fingerprinting data availability. The mean age of the cases was 43.8 years (±22.6) and 120 (61.5%) were male. The Dene (First Nations) encompassed 120 of the cases (87.7%), 8 cases (4.1%) were Inuit, 2 cases (1.0%) were Metis, 7 cases (3.6%) were Immigrants and 1 case had unknown ethnicity. One hundred and eighty six (95.4%) subjects were clustered, resulting in 8 clusters. Trend analysis showed significant relationships with the risk factors unemployment (p=0.020), geographic location (p≤0.001) and homelessness (p≤0.001). Other significant risk factors included excessive alcohol consumption, prior infection with Mycobacterium tuberculosis and prior contact with a case of TB. Conclusions: This study demonstrates how DNA fingerprinting and SNA can be additional epidemiological tools, along with the CCT method, to determine transmission patterns of TB. PMID:23671837
Environmetric data interpretation to assess the water quality of Maritsa River catchment.
Papazova, Petia; Simeonova, Pavlina
2013-01-01
Maritsa River is one of the largest rivers flowing on Bulgarian territory. The quality of its waters is of substantial importance for irrigation, industrial, recreation and domestic use. Besides, part of the river flows on Turkish territory, and the control and management of the Maritsa catchment is of mutual interest for the neighboring countries. Thus, performing interpretation and modeling of the river water quality is a major environmetric problem. Two multivariate statistical methods (cluster analysis, CA, and principal components analysis, PCA) were applied for model assessment of the water quality of Maritsa River on Bulgarian territory. The study used long-term monitoring data from 21 sampling sites characterized by 8 surface water quality indicators. The application of CA to the indicators results in 3 significant clusters showing the impact of biological, anthropogenic and eutrophication sources. For further assessment of the monitoring data, PCA was implemented, which identified, again, three latent factors confirming, in principle, the clustering output. The latent factors were conditionally named "biologic", "anthropogenic" and "eutrophication" source. Their identification coincides correctly with the location of real pollution sources along the Maritsa River catchment. The linkage of the sampling sites along the river flow by CA identified four special patterns separated by specific tracer levels: biological and anthropogenic major impact for pattern 1, eutrophication major impact for pattern 2, background levels for pattern 3 and eutrophication and agricultural major impact for pattern 4. The apportionment models of the pollution determined the contribution of each of the identified pollution factors to the total concentration of each of the water quality parameters. Thus, better risk management of the surface water quality is achieved at both the local and national level.
Iqbal, Muhammad Javed; Mamidi, Sujan; Ahsan, Rubina; Kianian, Shahryar F; Coyne, Clarice J; Hamama, Anwar A; Narina, Satya S; Bhardwaj, Harbans L
2012-08-01
White lupin (Lupinus albus L.) has been around since 300 B.C. and is recognized for its ability to grow on poor soils and application as green manure in addition to seed harvest. The seed has very high levels of protein (33-47 %) and oil (6-13 %). It also has many secondary metabolites that are potentially of nutraceutical value to animals and humans. Despite such great potential, lupin's role in modern agriculture began only in the twentieth century. Although a large collection of Lupinus germplasm accessions is available worldwide, rarely have they been genetically characterized. Additionally, scarce genomic resources in terms of recombinant populations and genome information have been generated for L. albus. With the advancement in association mapping methods, natural populations have the potential to replace recombinant populations in gene mapping and marker-trait associations. Therefore, we studied the genetic similarity, population structure and marker-trait association in a USDA germplasm collection for their current and future application in the improvement of this crop. A total of 122 PI (Plant Inventory) lines were screened with 18 AFLP primer pairs that generated 2,277 fragments. A subset of 892 polymorphic markers with MAF (minor allele frequency) >0.05 was used for association mapping. The cluster analysis failed to group accessions on the basis of their passport information, and a weak structure and low linkage disequilibrium (LD) were observed, indicating the usefulness of the collection for association mapping. Moreover, we were also able to identify two markers (p values of 1.53 × 10^-4 and 2.3 × 10^-4) that explained 22.69 and 20.5 % of seed weight variation, determined using R²_LR. The implications of lack of geographic clustering, population structure, low LD and the ability of AFLP to map seed weight trait using association mapping and the usefulness of the PI collections in breeding programs are discussed.
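The marker-filtering step mentioned above (keeping polymorphic markers with MAF > 0.05) can be illustrated with a small sketch. The abstract does not say how MAF was computed for the dominant AFLP markers, so the code below makes a simplifying assumption: each marker is scored as band present (1) or absent (0) per line, and the frequency of the rarer band state is used as the "minor allele frequency". Marker names and scores are hypothetical.

```python
def minor_state_frequency(scores):
    """Frequency of the rarer band state (1 = present, 0 = absent).

    Missing calls are given as None and ignored. Treating band states as
    alleles is a simplification for dominant markers such as AFLP.
    """
    called = [s for s in scores if s is not None]
    freq_present = sum(called) / len(called)
    return min(freq_present, 1.0 - freq_present)


def filter_markers(matrix, threshold=0.05):
    """Keep markers whose minor-state frequency exceeds the threshold."""
    return {
        name: scores
        for name, scores in matrix.items()
        if minor_state_frequency(scores) > threshold
    }


# Hypothetical scores for 20 lines at three markers.
markers = {
    "m1": [1, 0] * 10,           # balanced: frequency 0.5, kept
    "m2": [1] * 20,              # monomorphic: frequency 0.0, dropped
    "m3": [1, 1] + [0] * 18,     # rare band: frequency 0.1, kept
}
kept = filter_markers(markers)
print(sorted(kept))
```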
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whitehouse, W.P.; Rees, M.; Curtis, D.
1993-09-01
Evidence for a locus (EJM1) in the HLA region of chromosome 6p predisposing to idiopathic generalized epilepsy (IGE) in the families of patients with juvenile myoclonic epilepsy (JME) has been obtained in two previous studies of separately ascertained groups of kindreds. Linkage analysis has been undertaken in a third set of 25 families including a patient with JME and at least one first-degree relative with IGE. Family members were typed for eight polymorphic loci on chromosome 6p: F13A, D6889, D6S109, D6S105, D6S10, C4B, DQA1/A2, and TCTE1. Pairwise and multipoint linkage analysis was carried out assuming autosomal dominant and autosomal recessive inheritance and age-dependent high or low penetrance. No significant evidence in favor of linkage was obtained at any locus. Multipoint linkage analysis generated significant exclusion data (lod score < -2.0) at HLA and for a region 10-30 cM telomeric to HLA, the extent of which varied with the level of penetrance assumed. These observations indicate that genetic heterogeneity exists within this epilepsy phenotype. 39 refs., 4 figs., 2 tabs.
Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira
2014-01-01
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg., which is native to the Amazon rainforest, is the primary source of natural rubber. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular markers was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.
DOE Office of Scientific and Technical Information (OSTI.GOV)
May, M.; Schwartz, C.; Huston, S.
The Opitz GBBB syndrome (OS) is characterized in part by widely spaced inner ocular canthi and hypospadias. Recently, linkage analysis showed the gene for the X-linked form to be located in an 18 cM region spanning Xp22. We have now conducted linkage analysis in a family previously published as having the BBB syndrome and found tight linkage to DXS7104 (Z = 3.3, θ = 0.0). Our data narrow the candidate region to 4 cM and should facilitate the identification and characterization of one of the genes involved in midline development. 21 refs., 1 fig., 1 tab.
Reducing Information Overload in Large Seismic Data Sets
DOE Office of Scientific and Technical Information (OSTI.GOV)
HAMPTON,JEFFERY W.; YOUNG,CHRISTOPHER J.; MERCHANT,BION J.
2000-08-02
Event catalogs for seismic data can become very large. Furthermore, as researchers collect multiple catalogs and reconcile them into a single catalog that is stored in a relational database, the reconciled set becomes even larger. The sheer number of these events makes searching for relevant events to compare with events of interest problematic. Information overload in this form can lead to the data sets being under-utilized and/or used incorrectly or inconsistently. Thus, efforts have been initiated to research techniques and strategies for helping researchers to make better use of large data sets. In this paper, the authors present their efforts to do so in two ways: (1) the Event Search Engine, which is a waveform correlation tool and (2) some content analysis tools, which are a combination of custom-built and commercial off-the-shelf tools for accessing, managing, and querying seismic data stored in a relational database. The current Event Search Engine is based on a hierarchical clustering tool known as the dendrogram tool, which is written as a MatSeis graphical user interface. The dendrogram tool allows the user to build dendrogram diagrams for a set of waveforms by controlling phase windowing, down-sampling, filtering, enveloping, and the clustering method (e.g. single linkage, complete linkage, flexible method). It also allows the clustering to be based on two or more stations simultaneously, which is important to bridge gaps in the sparsely recorded event sets anticipated in such a large reconciled event set. Current efforts are focusing on tools to help the researcher winnow the clusters defined using the dendrogram tool down to the minimum optimal identification set. This will become critical as the number of reference events in the reconciled event set continually grows. The dendrogram tool is part of the MatSeis analysis package, which is available on the Nuclear Explosion Monitoring Research and Engineering Program Web Site. 
As part of the research into how to winnow the reference events in these large reconciled event sets, additional database query approaches have been developed to provide windows into these datasets. These custom built content analysis tools help identify dataset characteristics that can potentially aid in providing a basis for comparing similar reference events in these large reconciled event sets. Once these characteristics can be identified, algorithms can be developed to create and add to the reduced set of events used by the Event Search Engine. These content analysis tools have already been useful in providing information on station coverage of the referenced events and basic statistical information on events in the research datasets. The tools can also provide researchers with a quick way to find interesting and useful events within the research datasets. The tools could also be used as a means to review reference event datasets as part of a dataset delivery verification process. There has also been an effort to explore the usefulness of commercially available web-based software to help with this problem. The advantages of using off-the-shelf software applications, such as Oracle's WebDB, to manipulate, customize and manage research data are being investigated. These types of applications are being examined to provide access to large integrated data sets for regional seismic research in Asia. All of these software tools would provide the researcher with unprecedented power without having to learn the intricacies and complexities of relational database systems.
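To make the choice of clustering method in the dendrogram tool more concrete, the sketch below contrasts single and complete linkage on a correlation-based waveform distance. It is not MatSeis code: the distance (1 minus the zero-lag correlation coefficient), the stopping threshold, the function names and the synthetic phase-shifted sinusoids are all assumptions made for illustration.

```python
from math import pi, sin, sqrt


def correlation(x, y):
    """Zero-lag Pearson correlation coefficient of two equal-length traces."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def agglomerate(waveforms, threshold, linkage):
    """Merge clusters until the closest pair is farther apart than threshold.

    linkage=min gives single linkage, linkage=max gives complete linkage.
    """
    dist = [[1.0 - correlation(a, b) for b in waveforms] for a in waveforms]
    clusters = [[i] for i in range(len(waveforms))]
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = linkage(dist[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        if d > threshold:
            break
        merged = clusters[a] + clusters[b]
        clusters = [c for i, c in enumerate(clusters) if i not in (a, b)]
        clusters.append(merged)
    return sorted(sorted(c) for c in clusters)


# Synthetic "events": the same sinusoid with a gradually increasing phase shift.
phases = [0.0, 0.4, 0.9, 1.5]
waves = [[sin(2 * pi * 4 * i / 200 + p) for i in range(200)] for p in phases]

single = agglomerate(waves, threshold=0.2, linkage=min)
complete = agglomerate(waves, threshold=0.2, linkage=max)
print("single linkage:  ", single)
print("complete linkage:", complete)
```

With this input, single linkage chains all four waveforms into one cluster because each is similar to its neighbour, while complete linkage splits them into two pairs because the first and last waveforms are poorly correlated. This is the kind of difference an analyst would weigh when picking a linkage method for reference-event sets.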
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dunne, P.W.; Doody, R.S.; Epstein, H.F.
Children diagnosed with developmental dysphasia develop speech very late without exhibiting sensory or motor dysfunction, and when they do begin to speak their grammar is abnormal. A large three-generation British pedigree was recently identified in which 16 out of 30 members were diagnosed as dysphasic. Assuming a dominant mode of inheritance with homogeneous phenotypic expression and complete penetrance among affected members, we showed by simulation analysis that this pedigree has the power to detect linkage to marker loci with an average maximum LOD score of 3.67 at θ = 0.1. Given the absence of male-to-male transmission and a ratio of female to male affecteds (10/6) in this pedigree within the expected range for an X-linked dominant mode of inheritance, we decided to begin a genome-wide linkage analysis with microsatellite markers on the human X chromosome. Fifteen individuals (10 affected) from three generations were genotyped with 35 polymorphic STSs (Research Genetics) which were approximately uniformly distributed along the X chromosome. Two-point linkage was assessed using the MLINK and ILINK programs from the LINKAGE package. Markers DXS1223, DXS987, DXS996 and DXS1060 on Xp22 showed consistent linkage to the disease locus with a maximum LOD score of 0.86 at a distance of 22 cM for DXS1060. If further analysis with additional markers and additional family members confirms X-linkage, such a localization would provide support for Lehrke's hypothesis for X-linkage of major intellectual traits including verbal functioning.
Etienne, Kizee A.; Gillece, John; Hilsabeck, Remy; Schupp, Jim M.; Colman, Rebecca; Lockhart, Shawn R.; Gade, Lalitha; Thompson, Elizabeth H.; Sutton, Deanna A.; Neblett-Fanfair, Robyn; Park, Benjamin J.; Turabelidze, George; Keim, Paul; Brandt, Mary E.; Deak, Eszter; Engelthaler, David M.
2012-01-01
Case reports of Apophysomyces spp. in immunocompetent hosts have been a result of traumatic deep implantation of Apophysomyces spp. spore-contaminated soil or debris. On May 22, 2011 a tornado occurred in Joplin, MO, leaving 13 tornado victims with Apophysomyces trapeziformis infections as a result of lacerations from airborne material. We used whole genome sequence typing (WGST) for high-resolution phylogenetic SNP analysis of 17 outbreak Apophysomyces isolates and five additional temporally and spatially diverse Apophysomyces control isolates (three A. trapeziformis and two A. variabilis isolates). Whole genome SNP phylogenetic analysis revealed three clusters of genotypically related or identical A. trapeziformis isolates and multiple distinct isolates among the Joplin group; this indicated multiple genotypes from a single or multiple sources. Though no linkage between genotype and location of exposure was observed, WGST analysis determined that the Joplin isolates were more closely related to each other than to the control isolates, suggesting local population structure. Additionally, species delineation based on WGST demonstrated the need to reassess currently accepted taxonomic classifications of phylogenetic species within the genus Apophysomyces. PMID:23209631
USDA-ARS's Scientific Manuscript database
A genome-wide association study (GWAS) is the foremost strategy used for finding genes that control human diseases and agriculturally important traits, but it often reports false positives. In contrast, its complementary method, linkage analysis, provides direct genetic confirmation, but with limite...
Linkage analysis of systolic blood pressure: a score statistic and computer implementation
Wang, Kai; Peng, Yingwei
2003-01-01
A genome-wide linkage analysis was conducted on systolic blood pressure using a score statistic. The randomly selected Replicate 34 of the simulated data was used. The score statistic was applied to the sibships derived from the general pedigrees. An add-on R program to GENEHUNTER was developed for this analysis and is freely available. PMID:14975145
A power study of bivariate LOD score analysis of a complex trait and fear/discomfort with strangers
Ji, Fei; Lee, Dayoung; Mendell, Nancy Role
2005-01-01
Complex diseases are often reported along with disease-related traits (DRT). Sometimes investigators consider both disease and DRT phenotypes separately and sometimes they consider individuals as affected if they have either the disease or the DRT, or both. We propose instead to consider the joint distribution of the disease and the DRT and do a linkage analysis assuming a pleiotropic model. We evaluated our results through analysis of the simulated datasets provided by Genetic Analysis Workshop 14. We first conducted univariate linkage analysis of the simulated disease, Kofendrerd Personality Disorder and one of its simulated associated traits, phenotype b (fear/discomfort with strangers). Subsequently, we considered the bivariate phenotype, which combined the information on Kofendrerd Personality Disorder and fear/discomfort with strangers. We developed a program to perform bivariate linkage analysis using an extension to the Elston-Stewart peeling method of likelihood calculation. Using this program we considered the microsatellites within 30 cM of the gene pleiotropic for this simulated disease and DRT. Based on 100 simulations of 300 families we observed excellent power to detect linkage within 10 cM of the disease locus using the DRT and the bivariate trait. PMID:16451570
NASA Astrophysics Data System (ADS)
Chankrachang, M.; Limphirat, W.; Yongyingsakthavorn, P.; Nontakaew, U.; Tohsan, A.
2017-09-01
A study of sulfidic linkages formed in natural rubber (NR) latex medical gloves using X-ray Absorption Near Edge Structure (XANES) is presented in this paper. The NR latex compound was prepared by the prevulcanization method, that is, it was prevulcanized at room temperature for 24 hrs before use. After the 24 hrs of prevulcanization, latex film samples were obtained by a dipping process. The dipped films were then vulcanized at 110°C for 5 to 25 min. After the compound was prevulcanized for 24 hrs, polysulfidic linkages were the main linkages formed in the sample. After curing at 110°C for 5-25 min, however, the polysulfidic linkages tended to change into disulfidic linkages; in the sample cured for 25 min, disulfidic linkages were the main linkages. In terms of tensile strength, an increase in cure time from 5 to 10 min increased the tensile strength, but at a cure time of 25 min the tensile strength dropped slightly. The drop in tensile strength at cure times longer than 10 min can be ascribed to degradation of polysulfidic and disulfidic linkages during curing. XANES analysis was therefore found to be very useful for understanding the cure characteristics, and it can help to optimize the cure time and tensile properties of the product.
Rommelse, Nanda N.J.; Arias-Vásquez, Alejandro; Altink, Marieke E.; Buschgens, Cathelijne J.M.; Fliers, Ellen; Asherson, Philip; Faraone, Stephen V.; Buitelaar, Jan K.; Sergeant, Joseph A.; Oosterlaan, Jaap; Franke, Barbara
2008-01-01
ADHD linkage findings have not all been consistently replicated, suggesting that other approaches to linkage analysis in ADHD might be necessary, such as the use of (quantitative) endophenotypes (heritable traits associated with an increased risk for ADHD). Genome-wide linkage analyses were performed in the Dutch subsample of the International Multi-Center ADHD Genetics (IMAGE) study comprising 238 DSM-IV combined-type ADHD probands and their 112 affected and 195 nonaffected siblings. Eight candidate neuropsychological ADHD endophenotypes with heritabilities > 0.2 were used as quantitative traits. In addition, an overall component score of neuropsychological functioning was used. A total of 5407 autosomal single-nucleotide polymorphisms (SNPs) were used to run multipoint regression-based linkage analyses. Two significant genome-wide linkage signals were found, one for Motor Timing on chromosome 2q21.1 (LOD score: 3.944) and one for Digit Span on 13q12.11 (LOD score: 3.959). Ten suggestive linkage signals were found (LOD scores ≥ 2) on chromosomes 2p, 2q, 3p, 4q, 8q, 12p, 12q, 14q, and 17q. The suggestive linkage signal for the component score that was found at 2q14.3 (LOD score: 2.878) overlapped with the region significantly linked to Motor Timing. Endophenotype approaches may increase power to detect susceptibility loci in ADHD and possibly in other complex disorders. PMID:18599010
2009-12-10
sites of integrin-clustering that link the actin cytoskeleton to the extracellular matrix (ECM; (Burridge et al., 1988)). The primary functions of...Hall, 1992). Furthermore, in fibroblasts, focal adhesion kinase (FAK), a key FA signaling molecule, is necessary for mechanosensing (Geiger et al...promotes FAK activation through phosphorylation on Y397 and Y925, followed by FAK- dependent extracellular signal-regulated kinase (ERK) phosphorylation
Crown, William; Chang, Jessica; Olson, Melvin; Kahler, Kristijan; Swindle, Jason; Buzinec, Paul; Shah, Nilay; Borah, Bijan
2015-09-01
Missing data, particularly missing variables, can create serious analytic challenges in observational comparative effectiveness research studies. Statistical linkage of datasets is a potential method for incorporating missing variables. Prior studies have focused upon the bias introduced by imperfect linkage. This analysis uses a case study of hepatitis C patients to estimate the net effect of statistical linkage on bias, also accounting for the potential reduction in missing variable bias. The results show that statistical linkage can reduce bias while also enabling parameter estimates to be obtained for the formerly missing variables. The usefulness of statistical linkage will vary depending upon the strength of the correlations of the missing variables with the treatment variable, as well as the outcome variable of interest.
MaRaCluster: A Fragment Rarity Metric for Clustering Fragment Spectra in Shotgun Proteomics.
The, Matthew; Käll, Lukas
2016-03-04
Shotgun proteomics experiments generate large amounts of fragment spectra as primary data, normally with high redundancy between and within experiments. Here, we have devised a clustering technique to identify fragment spectra stemming from the same species of peptide. This is a powerful alternative method to traditional search engines for analyzing spectra, specifically useful for larger scale mass spectrometry studies. As an aid in this process, we propose a distance calculation relying on the rarity of experimental fragment peaks, following the intuition that peaks shared by only a few spectra offer more evidence than peaks shared by a large number of spectra. We used this distance calculation and a complete-linkage scheme to cluster data from a recent large-scale mass spectrometry-based study. The clusterings produced by our method have up to 40% more identified peptides for their consensus spectra compared to those produced by the previous state-of-the-art method. We see that our method would advance the construction of spectral libraries as well as serve as a tool for mining large sets of fragment spectra. The source code and Ubuntu binary packages are available at https://github.com/statisticalbiotechnology/maracluster (under an Apache 2.0 license).
NASA Astrophysics Data System (ADS)
Verma, Kanupriya; Viswanathan, K. S.; Majumder, Moumita; Sathyamurthy, N.
2017-11-01
The 1:1 dimer of borazine-acetylene has been studied for the first time, both experimentally and computationally. The borazine-acetylene dimer was trapped in Ar and N2 matrices, and studied using infrared spectroscopy. Our experiments clearly revealed two isomers of the borazine-acetylene complex, one in which the N-H of borazine interacted with the carbon of acetylene, and another in which the C-H of acetylene formed a hydrogen bond with a nitrogen atom of borazine. The formation of both isomers in the matrix was evidenced by shifts in the vibrational frequencies of the appropriate modes. Reassuringly, the experimental observations were corroborated by our computations using the second-order Møller-Plesset perturbation theoretic method and coupled-cluster singles, doubles and perturbative triples method in conjunction with different Dunning basis sets, which indicated both these isomers to be stable minima, with the N-H···C complex being the global minimum. Atoms-in-molecules and energy decomposition analysis were also carried out for the different isomers of the dimer. These studies reveal that replacing the three C-C linkages in benzene with three B-N linkages in borazine modifies the interaction in the dimer sufficiently to result in a different potential energy landscape for the borazine-acetylene system when compared with the benzene-acetylene system.
Linkage analysis of Norrie disease with an X-chromosomal ornithine aminotransferase locus.
Bateman, J B; Kojis, T L; Cantor, R M; Heinzmann, C; Ngo, J T; Spence, M A; Inana, G; Kivlin, J D; Curtis, D; Sparkes, R S
1993-01-01
Norrie disease is a rare disease of newborn males caused by prenatal or perinatal retinal detachment, which may be associated with mental retardation, psychosis, and/or hearing loss. DXS7 (L1.28) and MAO A and B loci have been linked to the ND locus on the short arm of the X chromosome. Sequences homologous to OAT also have been mapped to the short arm of the X chromosome. We performed linkage analyses between the ND locus and one of the OAT-like clusters of sequences on the X chromosome (OATL1), using a ScaI RFLP in a ND family, and increased the previously calculated lod score (z) to over 3 (3.38; theta = 0.05). Similarly, we calculated a lod score of 4.06 (theta = 0.01) between the OATL1 and DXS7 loci. Alone, the OATL1 ScaI RFLP system is expected to be informative in 48% of females. If this system were used in combination with the DXS7 TaqI polymorphism, 71% of females would be informative for at least one of the markers and 21% would be informative for both. Because the OATL1 ScaI RFLP is a relatively common polymorphism, this system should be useful for the identification of ND carriers and affected male fetuses and newborns. PMID:7908152
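The informativeness figures quoted above can be cross-checked with simple inclusion-exclusion arithmetic. The sketch below uses only the three percentages given in the abstract; the implied value for the DXS7 TaqI polymorphism alone is a derived quantity that the abstract does not state, and the independence comparison is an illustrative assumption.

```python
# Figures quoted in the abstract (fractions of females).
p_oatl1 = 0.48   # informative for the OATL1 ScaI RFLP alone
p_either = 0.71  # informative for at least one of the two markers
p_both = 0.21    # informative for both markers

# Inclusion-exclusion: P(A or B) = P(A) + P(B) - P(A and B), solved for P(B).
p_dxs7 = p_either - p_oatl1 + p_both

# If the two markers were informative independently, P(both) would be the product.
p_both_if_independent = p_oatl1 * p_dxs7

print(round(p_dxs7, 2), round(p_both_if_independent, 2))
```

Under these numbers the DXS7 marker alone would be informative in about 44% of females, and the product 0.48 × 0.44 is close to the quoted 21%, which is what one would expect if the two polymorphisms were treated as independent.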
Linkages and Interactions Analysis of Major Effect Drought Grain Yield QTLs in Rice.
Vikram, Prashant; Swamy, B P Mallikarjuna; Dixit, Shalabh; Trinidad, Jennylyn; Sta Cruz, Ma Teresa; Maturan, Paul C; Amante, Modesto; Kumar, Arvind
2016-01-01
Quantitative trait loci conferring high grain yield under drought in rice are important genomic resources for climate resilient breeding. Major and consistent drought grain yield QTLs usually co-locate with flowering and/or plant height QTLs, which could be due to either linkage or pleiotropy. Five mapping populations used for the identification of major and consistent drought grain yield QTLs underwent a multiple-trait, multiple-interval mapping test (MT-MIM) to estimate the significance of pleiotropy effects. Results pointed to possible linkages between the drought grain yield QTLs and co-locating flowering and/or plant height QTLs. Linkages of days to flowering and plant height were eliminated through a marker-assisted breeding approach. Drought grain yield QTLs also showed interaction effects with flowering QTLs. Drought responsiveness of the flowering locus on chromosome 3 (qDTY3.2) has been revealed through allelic analysis. Considering linkage and interaction effects associated with drought QTLs, a comprehensive marker-assisted breeding strategy was followed to develop rice genotypes with improved grain yield under drought stress.
2010-01-01
Background The biological dimensions of genes are manifold. These include genomic properties, (e.g., X/autosomal linkage, recombination) and functional properties (e.g., expression level, tissue specificity). Multiple properties, each generally of subtle influence individually, may affect the evolution of genes or merely be (auto-)correlates. Results of multidimensional analyses may reveal the relative importance of these properties on the evolution of genes, and therefore help evaluate whether these properties should be considered during analyses. While numerous properties are now considered during studies, most work still assumes the stereotypical solitary gene as commonly depicted in textbooks. Here, we investigate the Drosophila melanogaster genome to determine whether deviations from the stereotypical gene architecture correlate with other properties of genes. Results Deviations from the stereotypical gene architecture were classified as the following gene constellations: Overlapping genes were defined as those that overlap in the 5-prime, exonic, or intronic regions. Chromatin co-clustering genes were defined as genes that co-clustered within 20 kb of transcriptional territories. If this scheme is applied the stereotypical gene emerges as a rare occurrence (7.5%), slightly varied schemes yielded between ~1%-50%. Moreover, when following our scheme, paired-overlapping genes and chromatin co-clustering genes accounted for 50.1 and 42.4% of the genes analyzed, respectively. Gene constellation was a correlate of a number of functional and evolutionary properties of genes, but its statistical effect was ~1-2 orders of magnitude lower than the effects of recombination, chromosome linkage and protein function. 
Analysis of datasets on male reproductive proteins showed these were biased in their representation of gene constellations and evolutionary rate Ka/Ks estimates, but these biases did not overwhelm the biologically meaningful observation of high evolutionary rates of male reproductive genes. Conclusion Given the rarity of the solitary stereotypical gene, and the abundance of gene constellations that deviate from it, the presence of gene constellations, while once thought to be exceptional in large Eukaryote genomes, might have broader relevance to the understanding and study of the genome. However, according to our definition, while gene constellations can be significant correlates of functional properties of genes, they generally are weak correlates of the evolution of genes. Thus, the need for their consideration would depend on the context of studies. PMID:20497561
Testing for linkage disequilibrium in the New Zealand radiata pine breeding population
S. Kumar; Craig Echt; P.L. Wilcox; T.E. Richardson
2004-01-01
Linkage analysis is commonly used to find marker-trait associations within the full-sib families of forest tree and other species. Study of marker-trait associations at the population level is termed linkage-disequilibrium (LD) mapping. A female-tester design comprising 200 full-sib families generated by crossing 40 pollen parents with five female parents was used to...
Joanna Endter-Wada; Dale J. Blahna
2011-01-01
This article presents the "Linkages to Public Land" (LPL) Framework, a general but comprehensive data-gathering and analysis approach aimed at informing citizen and agency decision making about the social environment of public land. This social assessment and planning approach identifies and categorizes various types of linkages that people have to public...
McCulloch, Kathryn M.; McCranie, Emilianne K.; Smith, Jarrod A.; ...
2015-08-03
Orthosomycins are oligosaccharide antibiotics that include avilamycin, everninomicin, and hygromycin B and are hallmarked by a rigidifying interglycosidic spirocyclic ortho-δ-lactone (orthoester) linkage between at least one pair of carbohydrates. A subset of orthosomycins additionally contain a carbohydrate capped by a methylenedioxy bridge. The orthoester linkage is necessary for antibiotic activity but rarely observed in natural products. Orthoester linkage and methylenedioxy bridge biosynthesis require similar oxidative cyclizations adjacent to a sugar ring. In this paper, we have identified a conserved group of nonheme iron, α-ketoglutarate–dependent oxygenases likely responsible for this chemistry. High-resolution crystal structures of the EvdO1 and EvdO2 oxygenases of everninomicin biosynthesis, the AviO1 oxygenase of avilamycin biosynthesis, and HygX of hygromycin B biosynthesis show how these enzymes accommodate large substrates, a challenge that requires a variation in metal coordination in HygX. Excitingly, the ternary complex of HygX with cosubstrate α-ketoglutarate and putative product hygromycin B identified an orientation of one glycosidic linkage of hygromycin B consistent with metal-catalyzed hydrogen atom abstraction from substrate. These structural results are complemented by gene disruption of the oxygenases evdO1 and evdMO1 from the everninomicin biosynthetic cluster, which demonstrate that functional oxygenase activity is critical for antibiotic production. Finally, our data therefore support a role for these enzymes in the production of key features of the orthosomycin antibiotics.
Contemplating the plasmalemmal control center model
NASA Technical Reports Server (NTRS)
Pickard, B. G.
1994-01-01
An abundant epidermal mechanosensory calcium-selective ion channel appears able not only to detect mechanical stimuli such as those that initiate gravitropism but also to detect thermal, electrical, and various chemical stimuli. Because it responds to multimodal input with a second messenger output, this channel system seems likely to be an integrator that can engage in feedbacks with many other systems of the cell--and feedback is the hallmark of regulation. In general, the mechanical tension required for channel activation is likely transmitted from the relatively rigid cell wall to the plasma membrane system via linkage or adhesion sites that display antigenicities recognized by antibodies to animal beta-1 integrin, vitronectin, and fibronectin and which have mechanical connections to the cytoskeleton. Thus, functionally, leverage exerted against any given adhesion site will tend to control channels within a surrounding domain. Reactions initiated by passage of calcium ions through the channels could presumably be more effectively regulated if channels within the domains were somewhat clustered and if appropriate receptors, kinases, porters, pumps, and some key cytoskeletal anchoring sites were in turn clustered about them. Accumulating evidence suggests not only that activity of clusters of channels may contribute to control of cytoskeletal architecture and of regulatory protein function within their domain, but also that both a variety of regulatory proteins and components of the cortical cytoskeleton may contribute to control of channel activity. The emerging capabilities of electronic optical microscopy are well suited for resolving the spatial distributions of many of these cytoskeletal and regulatory molecules in living cells, and for following some of their behaviors as channels are stimulated to open and cytosolic calcium builds in their vicinity. 
Such microscopy, coupled with biochemical and physiological probing, should help to establish the nature of the feedback loops putatively controlled by the linkage sites and their channel domains.
ERIC Educational Resources Information Center
Wiseman, Alexander W.; Alromi, Naif
A cross-national analysis was conducted to identify contextual influences that shape policies regarding the school-to-work transition and education-work linkages. The study's theoretical framework included principles based on technical-rational perspectives and neo-institutional perspectives. The study tested the following hypotheses: (1) schools…
Keating, Dominic T; Sadlier, Denise M; Patricelli, Andrea; Smith, Sinead M; Walls, Dermot; Egan, Jim J; Doran, Peter P
2006-09-01
The molecular mechanisms of Idiopathic Pulmonary Fibrosis (IPF) remain elusive. Transforming Growth Factor beta 1(TGF-beta1) is a key effector cytokine in the development of lung fibrosis. We used microarray and computational biology strategies to identify genes whose expression is significantly altered in alveolar epithelial cells (A549) in response to TGF-beta1, IL-4 and IL-13 and Epstein Barr virus. A549 cells were exposed to 10 ng/ml TGF-beta1, IL-4 and IL-13 at serial time points. Total RNA was used for hybridisation to Affymetrix Human Genome U133A microarrays. Each in vitro time-point was studied in duplicate and an average RMA value computed. Expression data for each time point was compared to control and a signal log ratio of 0.6 or greater taken to identify significant differential regulation. Using normalised RMA values and unsupervised Average Linkage Hierarchical Cluster Analysis, a list of 312 extracellular matrix (ECM) proteins or modulators of matrix turnover was curated via Onto-Compare and Gene-Ontology (GO) databases for baited cluster analysis of ECM associated genes. Interrogation of the dataset using ontological classification focused cluster analysis revealed coordinate differential expression of a large cohort of extracellular matrix associated genes. Of this grouping members of the ADAM (A disintegrin and Metalloproteinase domain containing) family of genes were differentially expressed. ADAM gene expression was also identified in EBV infected A549 cells as well as IL-13 and IL-4 stimulated cells. We probed pathologenomic activities (activation and functional activity) of ADAM19 and ADAMTS9 using siRNA and collagen assays. Knockdown of these genes resulted in diminished production of collagen in A549 cells exposed to TGF-beta1, suggesting a potential role for these molecules in ECM accumulation in IPF.
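The thresholding step described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: the gene names and expression values are made up, the signal log ratio is taken as the difference of mean log2-scale RMA values, and applying the 0.6 cutoff to the absolute value (so that both up- and down-regulation are captured) is an assumption.

```python
SLR_CUTOFF = 0.6  # threshold quoted in the abstract

def mean(values):
    return sum(values) / len(values)

def signal_log_ratio(treated_reps, control_reps):
    """Difference of mean log2-scale (RMA) values: treated minus control."""
    return mean(treated_reps) - mean(control_reps)

def differentially_regulated(expr, cutoff=SLR_CUTOFF):
    """Return {gene: SLR} for genes whose |SLR| meets the cutoff (two-sided use is an assumption)."""
    hits = {}
    for gene, (treated, control) in expr.items():
        slr = signal_log_ratio(treated, control)
        if abs(slr) >= cutoff:
            hits[gene] = slr
    return hits

# Hypothetical duplicate log2 RMA values: (treated replicates, control replicates).
expr = {
    "GENE_A": ([9.1, 9.3], [8.0, 8.2]),  # clearly up-regulated
    "GENE_B": ([7.0, 7.2], [7.0, 7.1]),  # essentially unchanged
    "GENE_C": ([5.0, 5.2], [6.0, 6.0]),  # clearly down-regulated
}
hits = differentially_regulated(expr)

# On a log2 scale, a cutoff of 0.6 corresponds to roughly a 1.5-fold change.
fold_change_at_cutoff = 2 ** SLR_CUTOFF
print(sorted(hits), round(fold_change_at_cutoff, 2))
```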
Andersen, O M; Petersen, H H; Jacobsen, C; Moestrup, S K; Etzerodt, M; Andreasen, P A; Thøgersen, H C
2001-07-01
The low-density-lipoprotein-receptor (LDLR)-related protein (LRP) is composed of several classes of domains, including complement-type repeats (CR), which occur in clusters that contain binding sites for a multitude of different ligands. Each approximately 40-residue CR domain contains three conserved disulphide linkages and an octahedral Ca(2+) cage. LRP is a scavenging receptor for ligands from extracellular fluids, e.g. alpha(2)-macroglobulin (alpha(2)M)-proteinase complexes, lipoprotein-containing particles and serine proteinase-inhibitor complexes, like the complex between urokinase-type plasminogen activator (uPA) and the plasminogen activator inhibitor-1 (PAI-1). In the present study we analysed the interaction of the uPA-PAI-1 complex with an ensemble of fragments representing a complete overlapping set of two-domain fragments accounting for the ligand-binding cluster II (CR3-CR10) of LRP. By ligand blotting, solid-state competition analysis and surface-plasmon-resonance analysis, we demonstrate binding to multiple CR domains, but show a preferential interaction between the uPA-PAI-1 complex and a two-domain fragment comprising CR domains 5 and 6 of LRP. We demonstrate that surface-exposed aspartic acid and tryptophan residues at identical positions in the two homologous domains, CR5 and CR6 (Asp(958,CR5), Asp(999,CR6), Trp(953,CR5) and Trp(994,CR6)), are critical for the binding of the complex as well as for the binding of the receptor-associated protein (RAP) - the folding chaperone/escort protein required for transport of LRP to the cell surface. Accordingly, the present work provides (1) an identification of a preferred binding site within LRP CR cluster II; (2) evidence that the uPA-PAI-1 binding site involves residues from two adjacent protein domains; and (3) direct evidence identifying specific residues as important for the binding of uPA-PAI-1 as well as for the binding of RAP.
Linkage analysis of the Nail-patella syndrome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Campeau, E.; Watkins, D.; Rouleau, G.A.
1995-01-01
Nail-patella syndrome (NPS) is an autosomal dominant disorder characterized by dysplasia of nails and patella, decreased mobility of the elbow, iliac horns, and, in some cases, nephropathy. The disorder has been mapped to the long arm of chromosome 9, but the precise localization and identity of the NPS gene are unknown. Linkage analysis in three NPS families, using highly informative dinucleotide repeat polymorphisms on 9q33-q34, confirmed linkage of NPS to this chromosome. Recombinations were detected, by two-point linkage analysis, between NPS and the centromeric markers D9S60 and the gelsolin gene and the telomeric markers D9S64 and D9S66, in one of the families. Haplotype analysis suggested an additional recombination between NPS and the argininosuccinate synthetase (ASS) gene. These results localize the NPS gene to an interval on 9q34.1, distal to D9S60 and proximal to ASS, comprising a genetic distance of approximately 9 cM. This represents a significant refinement in the localization of the NPS gene. 25 refs., 2 figs., 1 tab.
Use of Multivariate Linkage Analysis for Dissection of a Complex Cognitive Trait
Marlow, Angela J.; Fisher, Simon E.; Francks, Clyde; MacPhie, I. Laurence; Cherny, Stacey S.; Richardson, Alex J.; Talcott, Joel B.; Stein, John F.; Monaco, Anthony P.; Cardon, Lon R.
2003-01-01
Replication of linkage results for complex traits has been exceedingly difficult, owing in part to the inability to measure the precise underlying phenotype, small sample sizes, genetic heterogeneity, and statistical methods employed in analysis. Often, in any particular study, multiple correlated traits have been collected, yet these have been analyzed independently or, at most, in bivariate analyses. Theoretical arguments suggest that full multivariate analysis of all available traits should offer more power to detect linkage; however, this has not yet been evaluated on a genomewide scale. Here, we conduct multivariate genomewide analyses of quantitative-trait loci that influence reading- and language-related measures in families affected with developmental dyslexia. The results of these analyses are substantially clearer than those of previous univariate analyses of the same data set, helping to resolve a number of key issues. These outcomes highlight the relevance of multivariate analysis for complex disorders for dissection of linkage results in correlated traits. The approach employed here may aid positional cloning of susceptibility genes in a wide spectrum of complex traits. PMID:12587094
Tests for linkage and association in nuclear families.
Martin, E R; Kaplan, N L; Weir, B S
1997-01-01
The transmission/disequilibrium test (TDT) originally was introduced to test for linkage between a genetic marker and a disease-susceptibility locus, in the presence of association. Recently, the TDT has been used to test for association in the presence of linkage. The motivation for this is that linkage analysis typically identifies large candidate regions, and further refinement is necessary before a search for the disease gene is begun, on the molecular level. Evidence of association and linkage may indicate which markers in the region are closest to a disease locus. As a test of linkage, transmissions from heterozygous parents to all of their affected children can be included in the TDT; however, the TDT is a valid χ² test of association only if transmissions to unrelated affected children are used in the analysis. If the sample contains independent nuclear families with multiple affected children, then one procedure that has been used to test for association is to select randomly a single affected child from each sibship and to apply the TDT to those data. As an alternative, we propose two statistics that use data from all of the affected children. The statistics give valid χ² tests of the null hypothesis of no association or no linkage and generally are more powerful than the TDT with a single, randomly chosen, affected child from each family. PMID:9311750
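For orientation, the standard TDT statistic for a biallelic marker can be computed as in the sketch below. It shows the classic test only, not the two new statistics proposed in this abstract, and the transmission counts are made up. Here b and c are the numbers of times heterozygous parents transmitted allele 1 and allele 2, respectively, to affected children; the statistic is compared with a chi-square distribution with one degree of freedom.

```python
import math

def tdt(b, c):
    """Classic TDT: (b - c)^2 / (b + c), with b and c the transmission counts
    of the two alleles from heterozygous parents to affected children."""
    if b + c == 0:
        raise ValueError("no informative transmissions")
    stat = (b - c) ** 2 / (b + c)
    # Upper-tail probability of a chi-square variable with 1 degree of freedom.
    p = math.erfc(math.sqrt(stat / 2.0))
    return stat, p

# Hypothetical counts: allele 1 transmitted 60 times, allele 2 transmitted 40 times.
stat, p_value = tdt(60, 40)
print(stat, round(p_value, 4))
```

As the abstract notes, this form is a valid test of association only when the counted transmissions go to unrelated affected children, for example one child drawn at random per family.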
Coupled Triboelectric Nanogenerator Networks for Efficient Water Wave Energy Harvesting.
Xu, Liang; Jiang, Tao; Lin, Pei; Shao, Jia Jia; He, Chuan; Zhong, Wei; Chen, Xiang Yu; Wang, Zhong Lin
2018-02-27
Water wave energy is a promising clean energy source, which is abundant but hard to scavenge economically. Triboelectric nanogenerator (TENG) networks provide an effective approach toward massive harvesting of water wave energy in oceans. In this work, a coupling design in TENG networks for such purposes is reported. The charge output of the rationally linked units is over 10 times of that without linkage. TENG networks of three different connecting methods are fabricated and show better performance for the ones with flexible connections. The network is based on an optimized ball-shell structured TENG unit with high responsivity to small agitations. The dynamic behavior of single and multiple TENG units is also investigated comprehensively to fully understand their performance in water. The study shows that a rational design on the linkage among the units could be an effective strategy for TENG clusters to operate collaboratively for reaching a higher performance.
NASA Astrophysics Data System (ADS)
Yang, Shuang; Wu, Wells W.; Shen, Rong-Fong; Bern, Marshall; Cipollo, John
2018-04-01
Mass spectrometric analysis of intact glycopeptides can reveal detailed information about glycosite, glycan structural features, and their heterogeneity. Sialyl glycopeptides can be positively, negatively, or neutrally charged depending on pH of their buffer solution and ionization conditions. To detect sialoglycopeptides, negative-ion mode mass spectrometry may be applied with a minimal loss of sialic acids, although the positively charged or neutral glycopeptides may be excluded. Alternatively, the sialyl glycopeptides can be identified using positive-ion mode analysis by doping a high concentration of sodium salts to the analytes. Although manipulation of unmodified sialoglycopeptides can be useful for analysis of samples, less than optimal ionization, facile loss of sialyl and unfavorable ionization of accompanying non-sialyl peptides make such strategies suboptimal. Currently available chemical derivatization methods, while stabilizing for sialic acid, mask sialic acid linkage configuration. Here, we report the development of a novel approach to neutralize sialic acids via sequential chemical modification that also reveals their linkage configuration, often an important determinant in biological function. This method utilizes several components to facilitate glycopeptide identification. These include the following: solid phase derivatization, enhanced ionization of sialoglycopeptides, differentiation of sialic acid linkage, and enrichment of the modified glycopeptides by hydrophilic interaction liquid chromatography. This technology can be used as a tool for quantitative analysis of protein sialylation in diseases with determination of sialic acid linkage configuration.
A Genomewide Linkage Scan of Cocaine Dependence and Major Depressive Episode in Two Populations
Yang, Bao-Zhu; Han, Shizhong; Kranzler, Henry R; Farrer, Lindsay A; Gelernter, Joel
2011-01-01
Cocaine dependence (CD) and major depressive episode (MDE) frequently co-occur with poorer treatment outcome and higher relapse risk. Shared genetic risk was affirmed; to date, there have been no reports of genomewide linkage scans (GWLSs) surveying the susceptibility regions for comorbid CD and MDE (CD–MDE). We aimed to identify chromosomal regions and candidate genes susceptible to CD, MDE, and CD–MDE in African Americans (AAs) and European Americans (EAs). A total of 1896 individuals were recruited from 384 AA and 355 EA families, each with at least a sibling-pair with CD and/or opioid dependence. Array-based genotyping of about 6000 single-nucleotide polymorphisms was completed for all individuals. Parametric and non-parametric genomewide linkage analyses were performed. We found a genomewide-significant linkage peak on chromosome 7 at 183.4 cM for non-parametric analysis of CD–MDE in AAs (lod=3.8, genomewide empirical p=0.016; point-wise p=0.00001). A nearly genomewide significant linkage was identified for CD–MDE in EAs on chromosome 5 at 14.3 cM (logarithm of odds (lod)=2.95, genomewide empirical p=0.055; point-wise p=0.00012). Parametric analysis corroborated the findings in these two regions and improved the support for the peak on chromosome 5 so that it reached genomewide significance (heterogeneity lod=3.28, genomewide empirical p=0.046; point-wise p=0.00053). This is the first GWLS for CD–MDE. The genomewide significant linkage regions on chromosomes 5 and 7 harbor four particularly promising candidate genes: SRD5A1, UBE3C, PTPRN2, and VIPR2. Replication of the linkage findings in other populations is warranted, as is a focused analysis of the genes located in the linkage regions implicated here. PMID:21849985
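The relationship between a lod score and a point-wise p-value can be approximated with a short calculation. The sketch below uses the common asymptotic conversion in which 2·ln(10)·lod is treated as a chi-square variable with one degree of freedom and the tail is halved for a one-sided test. This is an assumption for illustration; the abstract does not say how its point-wise p-values were computed, and the genomewide empirical p-values require simulation that is not reproduced here.

```python
import math

def lod_to_pointwise_p(lod):
    """One-sided asymptotic p-value, treating 2*ln(10)*lod as chi-square with 1 df."""
    chi2 = 2.0 * math.log(10.0) * lod
    return 0.5 * math.erfc(math.sqrt(chi2 / 2.0))

p_aa = lod_to_pointwise_p(3.8)   # non-parametric peak on chromosome 7 in AAs
p_ea = lod_to_pointwise_p(2.95)  # non-parametric peak on chromosome 5 in EAs
print(f"{p_aa:.1e} {p_ea:.1e}")
```

Both approximations fall in the same range as the point-wise values quoted for the non-parametric analyses (0.00001 and 0.00012). The conversion is not expected to match the heterogeneity lod from the parametric analysis, which follows a different null distribution.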
Genetic heterogeneity in Finnish hereditary prostate cancer using ordered subset analysis
Simpson, Claire L; Cropp, Cheryl D; Wahlfors, Tiina; George, Asha; Jones, MaryPat S; Harper, Ursula; Ponciano-Jackson, Damaris; Tammela, Teuvo; Schleutker, Johanna; Bailey-Wilson, Joan E
2013-01-01
Prostate cancer (PrCa) is the most common male cancer in developed countries and the second most common cause of cancer death after lung cancer. We recently reported a genome-wide linkage scan in 69 Finnish hereditary PrCa (HPC) families, which replicated the HPC9 locus on 17q21-q22 and identified a locus on 2q37. The aim of this study was to identify and to detect other loci linked to HPC. Here we used ordered subset analysis (OSA), conditioned on nonparametric linkage to these loci to detect other loci linked to HPC in subsets of families, but not the overall sample. We analyzed the families based on their evidence for linkage to chromosome 2, chromosome 17 and a maximum score using the strongest evidence of linkage from either of the two loci. Significant linkage to a 5-cM linkage interval with a peak OSA nonparametric allele-sharing LOD score of 4.876 on Xq26.3-q27 (ΔLOD=3.193, empirical P=0.009) was observed in a subset of 41 families weakly linked to 2q37, overlapping the HPCX1 locus. Two peaks that were novel to the analysis combining linkage evidence from both primary loci were identified; 18q12.1-q12.2 (OSA LOD=2.541, ΔLOD=1.651, P=0.03) and 22q11.1-q11.21 (OSA LOD=2.395, ΔLOD=2.36, P=0.006), which is close to HPC6. Using OSA allows us to find additional loci linked to HPC in subsets of families, and underlines the complex genetic heterogeneity of HPC even in highly aggregated families. PMID:22948022
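The core idea of ordered subset analysis can be sketched in a few lines: rank the families by a covariate, add them one at a time, and keep the subset with the largest summed LOD. The sketch below is a simplified illustration with made-up families; it evaluates a single map position and a single (ascending) ordering, and it omits the permutation step that yields the empirical p-value, so it should not be read as the procedure used in the study.

```python
def ordered_subset_lod(families):
    """families: list of (covariate, family LOD at the test position).
    Returns (best subset LOD, number of families in the subset, delta vs. all families)."""
    ranked = sorted(families, key=lambda f: f[0])  # ascending covariate
    best_lod, best_n, running = float("-inf"), 0, 0.0
    for n, (_, lod) in enumerate(ranked, start=1):
        running += lod
        if running > best_lod:
            best_lod, best_n = running, n
    overall = running  # summed LOD over every family
    return best_lod, best_n, best_lod - overall

# Hypothetical families: (evidence of linkage to a primary locus, LOD at the test locus).
families = [(0.1, 0.9), (0.2, 0.7), (0.4, 0.5), (1.5, -0.6), (2.0, -0.8)]
best_lod, n_families, delta_lod = ordered_subset_lod(families)
print(round(best_lod, 2), n_families, round(delta_lod, 2))
```

In this toy example the three families with the weakest evidence at the primary locus give the highest summed LOD, mirroring the kind of result reported above for the subset weakly linked to 2q37.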
Etain, Bruno; Mathieu, Flavie; Rietschel, Marcella; Maier, Wolfgang; Albus, Margot; Mckeon, Patrick; Roche, S.; Kealey, Carmel; Blackwood, Douglas; Muir, Walter; Bellivier, Franc; Henry, C.; Dina, Christian; Gallina, Sophie; Gurling, H.; Malafosse, Alain; Preisig, Martin; Ferrero, François; Cichon, Sven; Schumacher, J.; Ohlraun, Stéphanie; Borrmann-Hassenbach, M.; Propping, Peter; Abou Jamra, Rami; Schulze, Thomas G.; Marusic, Andrej; Dernovsek, Mojca Z.; Giros, Bruno; Bourgeron, Thomas; Lemainque, Arnaud; Bacq, Delphine; Betard, Christine; Charon, Céline; Nöthen, Markus M.; Lathrop, Mark; Leboyer, Marion
2006-01-01
Summary Preliminary studies suggested that age at onset (AAO) may help to define homogeneous bipolar affective disorder (BPAD) subtypes. This candidate symptom approach might be useful to identify vulnerability genes. Thus, the probability of detecting major disease-causing genes might be increased by focusing on families with early-onset BPAD type I probands. This study was conducted as part of the European Collaborative Study of Early Onset BPAD (France, Germany, Ireland, Scotland, Switzerland, England, Slovenia). We performed a genome-wide search with 384 microsatellite markers using non-parametric linkage analysis in 87 sib-pairs ascertained through an early-onset BPAD type I proband (age at onset of 21 years or below). Non-parametric multi-point analysis suggested eight regions of linkage with p-values <0.01 (2p21, 2q14.3, 3p14, 5q33, 7q36, 10q23, 16q23 and 20p12). The 3p14 region showed the most significant linkage (genome-wide p-value estimated over 10,000 simulated replicates of 0.015 [0.01–0.02]). After the genome-wide search, we performed additional linkage analyses with increased marker density using markers in four regions suggestive of linkage and having an information content lower than 75% (3p14, 10q23, 16q23 and 20p12). For these regions, the information content improved by about 10%. In chromosome 3, the non-parametric linkage score increased from 3.51 to 3.83. This study is the first to use early onset bipolar type I probands in an attempt to increase sample homogeneity. These preliminary findings require confirmation in independent panels of families. PMID:16534504
DOE Office of Scientific and Technical Information (OSTI.GOV)
Elmslie, F.V.; Williamson, M.P.; Rees, M.
1996-09-01
Linkage analysis in separately ascertained families of probands with juvenile myoclonic epilepsy (JME) has previously provided evidence both for and against the existence of a locus (designated "EJM1"), on chromosome 6p, predisposing to a trait defined as either clinical JME, its associated electroencephalographic abnormality, or idiopathic generalized epilepsy. Linkage analysis was performed in 19 families in which a proband and at least one first- or two second-degree relatives have clinical JME. Family members were typed for seven highly polymorphic microsatellite markers on chromosome 6p: D6S260, D6S276, D6S291, D6S271, D6S465, D6S257, and D6S254. Pairwise and multipoint linkage analysis was carried out under the assumptions of autosomal dominant inheritance at 70% and 50% penetrance and autosomal recessive inheritance at 70% and 50% penetrance. No significant evidence in favor of linkage to the clinical trait of JME was obtained for any locus. The region formally excluded (LOD score <-2) by using multipoint analysis varies depending on the assumptions made concerning inheritance parameters and the proportion of linked families, α, that is, the degree of locus heterogeneity. Further analysis either classifying all unaffected individuals as unknown or excluding a subset of four families in which pyknoleptic absence seizures were present in one or more individuals did not alter these conclusions. 24 refs., 4 figs., 1 tab.
Olsen, Aaron M; Westneat, Mark W
2016-12-01
Many musculoskeletal systems, including the skulls of birds, fishes, and some lizards consist of interconnected chains of mobile skeletal elements, analogous to linkage mechanisms used in engineering. Biomechanical studies have applied linkage models to a diversity of musculoskeletal systems, with previous applications primarily focusing on two-dimensional linkage geometries, bilaterally symmetrical pairs of planar linkages, or single four-bar linkages. Here, we present new, three-dimensional (3D), parallel linkage models of the skulls of birds and fishes and use these models (available as free kinematic simulation software), to investigate structure-function relationships in these systems. This new computational framework provides an accessible and integrated workflow for exploring the evolution of structure and function in complex musculoskeletal systems. Linkage simulations show that kinematic transmission, although a suitable functional metric for linkages with single rotating input and output links, can give misleading results when applied to linkages with substantial translational components or multiple output links. To take into account both linear and rotational displacement we define force mechanical advantage for a linkage (analogous to lever mechanical advantage) and apply this metric to measure transmission efficiency in the bird cranial mechanism. For linkages with multiple, expanding output points we propose a new functional metric, expansion advantage, to measure expansion amplification and apply this metric to the buccal expansion mechanism in fishes. Using the bird cranial linkage model, we quantify the inaccuracies that result from simplifying a 3D geometry into two dimensions. We also show that by combining single-chain linkages into parallel linkages, more links can be simulated while decreasing or maintaining the same number of input parameters. 
This generalized framework for linkage simulation and analysis can accommodate linkages of differing geometries and configurations, enabling novel interpretations of the mechanics of force transmission across a diversity of vertebrate feeding mechanisms and enhancing our understanding of musculoskeletal function and evolution. J. Morphol. 277:1570-1583, 2016. © 2016 Wiley Periodicals, Inc.
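Kinematic transmission, the ratio of output rotation to input rotation, is easiest to see in the simplified planar case that the abstract contrasts with its 3D models. The sketch below solves a single planar four-bar linkage by circle intersection; it is not the authors' simulation software, and the link lengths, function names and choice of the open (non-crossed) configuration are assumptions for illustration.

```python
import math

def output_angle(theta, a, b, c, d):
    """Planar four-bar with fixed pivots A=(0,0) and D=(d,0); input link a,
    coupler b, output link c. Returns the output link angle (radians) for
    input angle theta, using the open (non-crossed) configuration."""
    bx, by = a * math.cos(theta), a * math.sin(theta)  # input-coupler joint B
    dx, dy = d - bx, -by                               # vector from B to D
    dist = math.hypot(dx, dy)
    if dist == 0 or dist > b + c or dist < abs(b - c):
        raise ValueError("linkage cannot be assembled at this input angle")
    x = (b * b - c * c + dist * dist) / (2 * dist)     # offset from B along BD
    h = math.sqrt(max(0.0, b * b - x * x))             # half-chord length
    ux, uy = dx / dist, dy / dist
    cx = bx + x * ux - h * uy                          # coupler-output joint C
    cy = by + x * uy + h * ux
    return math.atan2(cy, cx - d)

def kinematic_transmission(theta0, theta1, a, b, c, d):
    """Output rotation divided by input rotation between two input angles.
    Angle wrap-around at +/- pi is not handled in this sketch."""
    out0 = output_angle(theta0, a, b, c, d)
    out1 = output_angle(theta1, a, b, c, d)
    return (out1 - out0) / (theta1 - theta0)

# A parallelogram linkage (a == c, b == d) passes rotation through unchanged.
kt = kinematic_transmission(math.radians(60), math.radians(90), a=1.0, b=2.0, c=1.0, d=2.0)
print(round(kt, 6))
```

Because this metric tracks only rotation of a single output link, it illustrates why the abstract proposes additional metrics for linkages with translational components or several output links.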
Farook, Vidya S.; Coletta, Dawn K.; Puppala, Sobha; Schneider, Jennifer; Chittoor, Geetha; Hu, Shirley L.; Winnier, Deidre A.; Norton, Luke; Dyer, Thomas D.; Arya, Rector; Cole, Shelley A.; Carless, Melanie; Göring, Harald H.; Almasy, Laura; Mahaney, Michael C.; Comuzzie, Anthony G.; Curran, Joanne E.; Blangero, John; Duggirala, Ravindranath; Lehman, Donna M.; Jenkinson, Christopher P.; DeFronzo, Ralph A.
2014-01-01
Objective Type 2 diabetes (T2DM) is a complex metabolic disease and is more prevalent in certain ethnic groups such as Mexican Americans. The goal of our study was to perform a genome-wide linkage analysis to localize T2DM susceptibility loci in Mexican Americans. Methods We used the phenotypic and genotypic data from 1,122 Mexican American individuals (307 families) who participated in the Veterans Administration Genetic Epidemiology Study (VAGES). Genome-wide linkage analysis was performed using the variance components approach. Data from two additional Mexican American family studies, the San Antonio Family Heart Study (SAFHS) and the San Antonio Family Diabetes/Gallbladder Study (SAFDGS), were combined with the VAGES data to test for improved linkage evidence. Results After adjusting for covariate effects, T2DM was found to be under significant genetic influence (h² = 0.62, P = 2.7 × 10⁻⁶). The strongest evidence for linkage of T2DM occurred between markers D9S1871 and D9S2169 on chromosome 9p24.2-p24.1 (LOD = 1.8). Given that we had previously reported suggestive evidence for linkage of T2DM at this region in SAFDGS, we found significant and increased linkage evidence (LOD = 4.3, empirical P = 1.0 × 10⁻⁵, genome-wide P = 1.6 × 10⁻³) for T2DM at the same chromosomal region when we performed genome-wide linkage analysis of the VAGES data combined with SAFHS and SAFDGS data. Conclusion Significant T2DM linkage evidence was found on chromosome 9p24 in Mexican Americans. Importantly, the chromosomal region of interest in this study overlaps with several recent genome-wide association studies (GWASs) involving T2DM related traits. Given its overlap with such findings and our own initial T2DM association findings in the 9p24 chromosomal region, high throughput sequencing of the linked chromosomal region could identify the potential causal T2DM genes. PMID:24060607
A strabismus susceptibility locus on chromosome 7p
Parikh, Vaishali; Shugart, Yin Yao; Doheny, Kimberly F.; Zhang, Jie; Li, Lan; Williams, John; Hayden, David; Craig, Brian; Capo, Hilda; Chamblee, Denise; Chen, Cathy; Collins, Mary; Dankner, Stuart; Fiergang, Dean; Guyton, David; Hunter, David; Hutcheon, Marcia; Keys, Marshall; Morrison, Nancy; Munoz, Michelle; Parks, Marshall; Plotsky, David; Protzko, Eugene; Repka, Michael X.; Sarubbi, Maria; Schnall, Bruce; Siatkowski, R. Michael; Traboulsi, Elias; Waeltermann, Joanne; Nathans, Jeremy
2003-01-01
Strabismus has been known to have a significant genetic component, but the mode of inheritance and the identity of the relevant genes have been enigmatic. This paper reports linkage analysis of nonsyndromic strabismus. The principal results of this study are: (i) the demonstrated feasibility of identifying and recruiting large families in which multiple members have (or had) strabismus; (ii) the linkage in one large family of a presumptive strabismus susceptibility locus to 7p22.1 with a multipoint logarithm of odds score of 4.51 under a model of recessive inheritance; and (iii) the failure to observe significant linkage to 7p in six other multiplex families, consistent with genetic heterogeneity among families. These findings suggest that it will be possible to localize and ultimately identify strabismus susceptibility genes by linkage analysis and mutation screening of candidate genes. PMID:14519848
Ge, Y; Li, X; Yang, X X; Cui, C S; Qu, S P
2015-05-22
Cucurbita maxima is one of the most widely cultivated vegetables in China and exhibits distinct morphological characteristics. In this study, genetic linkage analysis with 57 simple-sequence repeats, 21 amplified fragment length polymorphisms, 3 random-amplified polymorphic DNA, and one morphological marker revealed 20 genetic linkage groups of C. maxima covering a genetic distance of 991.5 cM with an average of 12.1 cM between adjacent markers. Genetic linkage analysis identified the simple-sequence repeat marker 'PU078072' 5.9 cM away from the locus 'Rc', which controls rind color. The genetic map in the present study will be useful for better mapping, tagging, and cloning of quantitative trait loci/gene(s) affecting economically important traits and for breeding new varieties of C. maxima through marker-assisted selection.
Genome-wide linkage and association analysis of cardiometabolic phenotypes in Hispanic Americans.
Hellwege, Jacklyn N; Palmer, Nicholette D; Dimitrov, Latchezar; Keaton, Jacob M; Tabb, Keri L; Sajuthi, Satria; Taylor, Kent D; Ng, Maggie C Y; Speliotes, Elizabeth K; Hawkins, Gregory A; Long, Jirong; Ida Chen, Yii-Der; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Langefeld, Carl D; Wagenknecht, Lynne E; Bowden, Donald W
2017-02-01
Linkage studies of complex genetic diseases have been largely replaced by genome-wide association studies, due in part to limited success in complex trait discovery. However, recent interest in rare and low-frequency variants motivates re-examination of family-based methods. In this study, we investigated the performance of two-point linkage analysis for over 1.6 million single-nucleotide polymorphisms (SNPs) combined with single variant association analysis to identify high impact variants, which are both strongly linked and associated with cardiometabolic traits in up to 1414 Hispanics from the Insulin Resistance Atherosclerosis Family Study (IRASFS). Evaluation of all 50 phenotypes yielded 83 557 000 LOD (logarithm of the odds) scores, with 9214 LOD scores ⩾3.0, 845 ⩾4.0 and 89 ⩾5.0, with a maximal LOD score of 6.49 (rs12956744 in the LAMA1 gene for tumor necrosis factor-α (TNFα) receptor 2). Twenty-seven variants were associated with P<0.005 as well as having an LOD score >4, including variants in the NFIB gene under a linkage peak with TNFα receptor 2 levels on chromosome 9. Linkage regions of interest included a broad peak (31 Mb) on chromosome 1q with acute insulin response (max LOD=5.37). This region was previously documented with type 2 diabetes in family-based studies, providing support for the validity of these results. Overall, we have demonstrated the utility of two-point linkage and association in comprehensive genome-wide array-based SNP genotypes.
Yan, Liying; Huang, Lei; Xu, Liya; Huang, Jin; Ma, Fei; Zhu, Xiaohui; Tang, Yaqiong; Liu, Mingshan; Lian, Ying; Liu, Ping; Li, Rong; Lu, Sijia; Tang, Fuchou; Qiao, Jie; Xie, X Sunney
2015-12-29
In vitro fertilization (IVF), preimplantation genetic diagnosis (PGD), and preimplantation genetic screening (PGS) help patients to select embryos free of monogenic diseases and aneuploidy (chromosome abnormality). Next-generation sequencing (NGS) methods, while experiencing a rapid cost reduction, have improved the precision of PGD/PGS. However, the precision of PGD has been limited by the false-positive and false-negative single-nucleotide variations (SNVs), which are not acceptable in IVF and can be circumvented by linkage analyses, such as short tandem repeats or karyomapping. It is noteworthy that existing methods of detecting SNV/copy number variation (CNV) and linkage analysis often require separate procedures for the same embryo. Here we report an NGS-based PGD/PGS procedure that can simultaneously detect a single-gene disorder and aneuploidy and is capable of linkage analysis in a cost-effective way. This method, called "mutated allele revealed by sequencing with aneuploidy and linkage analyses" (MARSALA), involves multiple annealing and looping-based amplification cycles (MALBAC) for single-cell whole-genome amplification. Aneuploidy is determined by CNVs, whereas SNVs associated with the monogenic diseases are detected by PCR amplification of the MALBAC product. The false-positive and -negative SNVs are avoided by an NGS-based linkage analysis. Two healthy babies, free of the monogenic diseases of their parents, were born after such embryo selection. The monogenic diseases originated from a single base mutation on the autosome and the X-chromosome of the disease-carrying father and mother, respectively.
Population Structure of Hispanics in the United States: The Multi-Ethnic Study of Atherosclerosis
Manichaikul, Ani; Palmas, Walter; Rodriguez, Carlos J.; Peralta, Carmen A.; Divers, Jasmin; Guo, Xiuqing; Chen, Wei-Min; Wong, Quenna; Williams, Kayleen; Kerr, Kathleen F.; Taylor, Kent D.; Tsai, Michael Y.; Goodarzi, Mark O.; Sale, Michèle M.; Diez-Roux, Ana V.; Rich, Stephen S.; Rotter, Jerome I.; Mychaleckyj, Josyf C.
2012-01-01
Using ∼60,000 SNPs selected for minimal linkage disequilibrium, we perform population structure analysis of 1,374 unrelated Hispanic individuals from the Multi-Ethnic Study of Atherosclerosis (MESA), with self-identification corresponding to Central America (n = 93), Cuba (n = 50), the Dominican Republic (n = 203), Mexico (n = 708), Puerto Rico (n = 192), and South America (n = 111). By projection of principal components (PCs) of ancestry to samples from the HapMap phase III and the Human Genome Diversity Panel (HGDP), we show the first two PCs quantify the Caucasian, African, and Native American origins, while the third and fourth PCs bring out an axis that aligns with known South-to-North geographic location of HGDP Native American samples and further separates MESA Mexican versus Central/South American samples along the same axis. Using k-means clustering computed from the first four PCs, we define four subgroups of the MESA Hispanic cohort that show close agreement with self-identification, labeling the clusters as primarily Dominican/Cuban, Mexican, Central/South American, and Puerto Rican. To demonstrate our recommendations for genetic analysis in the MESA Hispanic cohort, we present pooled and stratified association analysis of triglycerides for selected SNPs in the LPL and TRIB1 gene regions, previously reported in GWAS of triglycerides in Caucasians but as yet unconfirmed in Hispanic populations. We report statistically significant evidence for genetic association in both genes, and we further demonstrate the importance of considering population substructure and genetic heterogeneity in genetic association studies performed in the United States Hispanic population. PMID:22511882
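The k-means step described in this abstract can be illustrated with a short sketch. It is not the MESA analysis pipeline: the function name `kmeans`, the explicit starting centroids and the toy two-dimensional points are assumptions made for the example, whereas the study clustered individuals on their first four PCs into four groups.

```python
def kmeans(points, centroids, n_iter=100):
    """Plain Lloyd's k-means; `centroids` are caller-supplied starting values."""
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid
        # (squared Euclidean distance in PC space).
        labels = [
            min(
                range(len(centroids)),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])),
            )
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members;
        # an empty cluster keeps its previous centroid.
        updated = []
        for j, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                updated.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                updated.append(old)
        if updated == centroids:  # converged
            break
        centroids = updated
    return labels, centroids


# Hypothetical PC coordinates for four individuals (two PCs for brevity).
pcs = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
labels, centres = kmeans(pcs, centroids=[(0.0, 0.0), (10.0, 10.0)])
print(labels, centres)
```

In practice the cluster labels would then be cross-tabulated against self-identified origin to check agreement, as the abstract reports.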
Shao, Changwei; Niu, Yongchao; Rastas, Pasi; Liu, Yang; Xie, Zhiyuan; Li, Hengde; Wang, Lei; Jiang, Yong; Tai, Shuaishuai; Tian, Yongsheng; Sakamoto, Takashi; Chen, Songlin
2015-04-01
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1-8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
The effect of using genealogy-based haplotypes for genomic prediction
2013-01-01
Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. Results About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Conclusions Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy. PMID:23496971
The effect of using genealogy-based haplotypes for genomic prediction.
Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt
2013-03-06
Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.
López, Camilo E; Acosta, Iván F; Jara, Carlos; Pedraza, Fabio; Gaitán-Solís, Eliana; Gallego, Gerardo; Beebe, Steve; Tohme, Joe
2003-01-01
A polymerase chain reaction approach using degenerate primers that targeted the conserved domains of cloned plant disease resistance genes (R genes) was used to isolate a set of 15 resistance gene analogs (RGAs) from common bean (Phaseolus vulgaris). Eight different classes of RGAs were obtained from nucleotide binding site (NBS)-based primers and seven from previously undescribed Toll/Interleukin-1 receptor-like (TIR)-based primers. Putative amino acid sequences of RGAs were significantly similar to R genes and contained additional conserved motifs. The NBS-type RGAs were classified in two subgroups according to the expected final residue in the kinase-2 motif. Eleven RGAs were mapped at 19 loci on eight linkage groups of the common bean genetic map constructed at Centro Internacional de Agricultura Tropical. Genetic linkage was shown for eight RGAs with partial resistance to anthracnose, angular leaf spot (ALS) and Bean golden yellow mosaic virus (BGYMV). RGA1 and RGA2 were associated with resistance loci to anthracnose and BGYMV and were part of two clusters of R genes previously described. A new major cluster was detected by RGA7; it explained up to 63.9% of resistance to ALS and has a putative contribution to anthracnose resistance. These results show the usefulness of RGAs as candidate genes to detect and eventually isolate numerous R genes in common bean.
Genome scan for linkage to asthma using a linkage disequilibrium-lod score test.
Jiang, Y; Slager, S L; Huang, J
2001-01-01
We report a genome-wide linkage study of asthma on the German and Collaborative Study on the Genetics of Asthma (CSGA) data. Using a combined linkage and linkage disequilibrium test and the nonparametric linkage score, we identified 13 markers from the German data, 1 marker from the African American (CSGA) data, and 7 markers from the Caucasian (CSGA) data in which the p-values ranged between 0.0001 and 0.0100. From our analysis and taking into account previous published linkage studies of asthma, we suggest that three regions in chromosome 5 (around D5S418, D5S644, and D5S422), one region in chromosome 6 (around three neighboring markers D6S1281, D6S291, and D6S1019), one region in chromosome 11 (around D11S2362), and two regions in chromosome 12 (around D12S351 and D12S324) especially merit further investigation.
De Vos, Stephanie; Bossier, Peter; Van Stappen, Gilbert; Vercauteren, Ilse; Sorgeloos, Patrick; Vuylsteke, Marnik
2013-01-01
We report on the construction of sex-specific linkage maps, the identification of sex-linked markers and the genome size estimation for the brine shrimp Artemia franciscana. Overall, from the analysis of 433 AFLP markers segregating in a full-sib family of 112 individuals, we identified 21 male and 22 female linkage groups (2n = 42), covering 1,041 and 1,313 cM respectively. Fifteen putatively homologous linkage groups, including the sex linkage groups, were identified between the female and male linkage maps. Eight sex-linked AFLP marker alleles were inherited from the female parent, supporting the hypothesis of a WZ–ZZ sex-determining system. The haploid Artemia genome size was estimated to be 0.93 Gb by flow cytometry. The Artemia linkage maps produced here provide the basis for further fine mapping and exploration of the sex-determining region and are a possible marker resource for mapping genomic loci underlying phenotypic differences among Artemia species. PMID:23469207
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oetting, W.S.; Lee, H.K.; Flanders, D.J.
The use of short tandem repeat polymorphisms (STRPs) as marker loci for linkage analysis is becoming increasingly important due to their large numbers in the human genome and their high degree of polymorphism. Fluorescence-based detection of the STRP pattern with an automated DNA sequencer has improved the efficiency of this technique by eliminating the need for radioactivity and producing a digitized autoradiogram-like image that can be used for computer analysis. In an effort to simplify the procedure and to reduce the cost of fluorescence STRP analysis, we have developed a technique known as multiplexing STRPs with tailed primers (MSTP) using primers that have a 19-bp extension, identical to the sequence of an M13 sequencing primer, on the 5′ end of the forward primer in conjunction with multiplexing several primer pairs in a single polymerase chain reaction (PCR) amplification. The banding pattern is detected with the addition of the M13 primer-dye conjugate as the sole primer conjugated to the fluorescent dye, eliminating the need for direct conjugation of the infrared fluorescent dye to the STRP primers. The use of MSTP for linkage analysis greatly reduces the number of PCR reactions. Up to five primer pairs can be multiplexed together in the same reaction. At present, a set of 148 STRP markers spaced at an average genetic distance of 28 cM throughout the autosomal genome can be analyzed in 37 sets of multiplexed amplification reactions. We have automated the analysis of these patterns for linkage using software that both detects the STRP banding pattern and determines their sizes. This information can then be exported in a user-defined format from a database manager for linkage analysis. 15 refs., 2 figs., 4 tabs.
Genetic Alterations in Familial Breast Cancer: Mapping and Cloning Genes Other Than BRCA1
1997-09-01
predisposition to breast cancer in families. The gene PTEN was successfully cloned by this project, and simultaneously by others (for a different ...with germline translocations and breast cancer for the identification of tumor suppressor genes. ...would limit the statistical power of linkage analysis. Therefore, we decided to integrate linkage analysis with the analysis of germline chromosomal
Screening for Multiple Genes Influencing Dyslexia.
ERIC Educational Resources Information Center
Smith, Shelley D.; And Others
1991-01-01
Examines the "sib pair" method of linkage analysis designed to locate genes influencing dyslexia, which has several advantages over the "LOD" score method. Notes that the sib pair analysis was able to detect the same linkages as the LOD method, plus a possible third region. Confirms that the sib pair method is an effective means of screening. (RS)
Time-resolved metabolomics reveals metabolic modulation in rice foliage
Sato, Shigeru; Arita, Masanori; Soga, Tomoyoshi; Nishioka, Takaaki; Tomita, Masaru
2008-01-01
Background To elucidate the interaction of dynamics among modules that constitute biological systems, comprehensive datasets obtained from "omics" technologies have been used. In recent plant metabolomics approaches, the reconstruction of metabolic correlation networks has been attempted using statistical techniques. However, the results were unsatisfactory and effective data-mining techniques that apply appropriate comprehensive datasets are needed. Results Using capillary electrophoresis mass spectrometry (CE-MS) and capillary electrophoresis diode-array detection (CE-DAD), we analyzed the dynamic changes in the level of 56 basic metabolites in plant foliage (Oryza sativa L. ssp. japonica) at hourly intervals over a 24-hr period. Unsupervised clustering of comprehensive metabolic profiles using Kohonen's self-organizing map (SOM) allowed classification of the biochemical pathways activated by the light and dark cycle. The carbon and nitrogen (C/N) metabolism in both periods was also visualized as a phenotypic linkage map that connects network modules on the basis of traditional metabolic pathways rather than pairwise correlations among metabolites. The regulatory networks of C/N assimilation/dissimilation at each time point were consistent with previous works on plant metabolism. In response to environmental stress, glutathione and spermidine fluctuated synchronously with their regulatory targets. Adenine nucleosides and nicotinamide coenzymes were regulated by phosphorylation and dephosphorylation. We also demonstrated that SOM analysis was applicable to the estimation of unidentifiable metabolites in metabolome analysis. Hierarchical clustering of a correlation coefficient matrix could help identify the bottleneck enzymes that regulate metabolic networks. 
Conclusion Our results showed that our SOM analysis with appropriate metabolic time-courses effectively revealed the synchronous dynamics among metabolic modules and elucidated the underlying biochemical functions. The application of discrimination of unidentified metabolites and the identification of bottleneck enzymatic steps even to non-targeted comprehensive analysis promise to facilitate an understanding of large-scale interactions among components in biological systems. PMID:18564421
Integrative analysis of the Lake Simcoe watershed (Ontario, Canada) as a socio-ecological system.
Neumann, Alex; Kim, Dong-Kyun; Perhar, Gurbir; Arhonditsis, George B
2017-03-01
Striving for long-term sustainability in catchments dominated by human activities requires development of interdisciplinary research methods to account for the interplay between environmental concerns and socio-economic pressures. In this study, we present an integrative analysis of the Lake Simcoe watershed, Ontario, Canada, as viewed from the perspective of a socio-ecological system. Key features of our analysis are (i) the equally weighted consideration of environmental attributes with socioeconomic priorities and (ii) the identification of the minimal number of key socio-hydrological variables that should be included in a parsimonious watershed management framework, aiming to establish linkages between urbanization trends and nutrient export. Drawing parallels with the concept of Hydrological Response Units, we used Self-Organizing Mapping to delineate spatial organizations with similar socio-economic and environmental attributes, also referred to as Socio-Environmental Management Units (SEMUs). Our analysis provides evidence of two SEMUs with contrasting features, the "undisturbed" and "anthropogenically-influenced", within the Lake Simcoe watershed. The "undisturbed" cluster occupies approximately half of the Lake Simcoe catchment (45%) and is characterized by low landscape diversity and low average population density (<0.4 humans ha⁻¹). By contrast, the socio-environmental functional properties of the "anthropogenically-influenced" cluster highlight the likelihood of a stability loss in the long run, as inferred from the distinct signature of urbanization activities on the tributary nutrient export, and the loss of subwatershed sensitivity to natural mechanisms that may ameliorate the degradation patterns. 
Our study also examines how the SEMU concept can augment the contemporary integrated watershed management practices and provides directions in order to promote environmental programs for lake conservation and to increase public awareness and engagement in stewardship initiatives. Copyright © 2016 Elsevier Ltd. All rights reserved.
Sun, Jian; Matsumoto, Ken'ichiro; Tabata, Yuta; Kadoya, Ryosuke; Ooi, Toshihiko; Abe, Hideki; Taguchi, Seiichi
2015-11-01
Polyhydroxyalkanoate depolymerase derived from Variovorax sp. C34 (PhaZVs) was identified as the first enzyme that is capable of degrading isotactic P[67 mol% (R)-lactate(LA)-co-(R)-3-hydroxybutyrate(3HB)] [P(D-LA-co-D-3HB)]. This study aimed at analyzing the monomer sequence specificity of PhaZVs for hydrolyzing P(LA-co-3HB) in comparison with a P(3HB) depolymerase from Alcaligenes faecalis T1 (PhaZAf) that did not degrade the same copolymer. Degradation of P(LA-co-3HB) by PhaZVs generated the dimers 3HB-3HB, 3HB-LA, LA-3HB, and LA-LA, as well as the monomers, suggesting that PhaZVs cleaved the linkages between LA and 3HB units and between LA units. To provide direct evidence for the hydrolysis of these sequences, the synthetic methyl trimers, 3HB-3HB-3HB, LA-LA-3HB, LA-3HB-LA, and 3HB-LA-LA, were treated with the PhaZs. Unexpectedly, not only PhaZVs but also PhaZAf hydrolyzed all of these substrates; that is, PhaZAf also cleaved the LA-LA linkage. Considering that neither PhaZ degraded P[(R)-LA] (PDLA) homopolymer, the capability of the PhaZs to cleave the LA-LA linkage was presumed to depend on the length of the LA-clustering region in the polymer chain. To test this hypothesis, PDLA oligomers (6 to 40 mer) were subjected to the PhaZ assay, revealing that there was an inverse relationship between molecular weight of the substrates and their hydrolysis efficiency. Moreover, PhaZVs exhibited degrading activity toward significantly longer PDLA oligomers than PhaZAf did. Therefore, the capability of the PhaZs used here to cleave D-LA-based polymers containing the LA-clustering region was strongly associated with substrate length rather than with the monomer sequence specificity of the enzyme.
2011-01-01
Background Genetic interactions within hybrids influence their overall fitness. Understanding the details of these interactions can improve our understanding of speciation. One experimental approach is to investigate deviations from Mendelian expectations (segregation distortion) in the inheritance of mapped genetic markers. In this study, we used the copepod Tigriopus californicus, a species which exhibits high genetic divergence between populations and a general pattern of reduced fitness in F2 interpopulation hybrids. Previous studies have implicated both nuclear-cytoplasmic and nuclear-nuclear interactions in causing this fitness reduction. We identified and mapped population-diagnostic single nucleotide polymorphisms (SNPs) and used these to examine segregation distortion across the genome within F2 hybrids. Results We generated a linkage map which included 45 newly elucidated SNPs and 8 population-diagnostic microsatellites used in previous studies. The map, the first available for the Copepoda, was estimated to cover 75% of the genome and included markers on all 12 T. californicus chromosomes. We observed little segregation distortion in newly hatched F2 hybrid larvae (fewer than 10% of markers at p < 0.05), but strikingly higher distortion in F2 hybrid adult males (45% of markers at p < 0.05). Hence, segregation distortion was primarily caused by selection against particular genetic combinations which acted between hatching and maturity. Distorted markers were not distributed randomly across the genome but clustered on particular chromosomes. In contrast to other studies in this species we found little evidence for cytonuclear coadaptation. Instead, different linkage groups exhibited markedly different patterns of distortion, which appear to have been influenced by nuclear-nuclear epistatic interactions and may also reflect genetic load carried within the parental lines. Conclusion Adult male F2 hybrids between two populations of T. 
californicus exhibit dramatic segregation distortion across the genome. Distorted loci are clustered within specific linkage groups, and the direction of distortion differs between chromosomes. This segregation distortion is due to selection acting between hatching and adulthood. PMID:21639918
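Segregation distortion at a single codominant marker is commonly assessed with a chi-square goodness-of-fit test against the Mendelian 1:2:1 expectation for an F2 cross. The abstract does not state which test was used, so the sketch below is an assumed approach with made-up genotype counts, and `f2_distortion` is a hypothetical helper name. With 2 degrees of freedom the chi-square survival function is exactly exp(-x/2), so no statistics library is needed.

```python
import math


def f2_distortion(n_aa, n_ab, n_bb):
    """Chi-square test of F2 genotype counts against a 1:2:1 ratio (2 d.f.)."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotyped individual is required")
    observed = (n_aa, n_ab, n_bb)
    expected = (0.25 * n, 0.50 * n, 0.25 * n)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of the chi-square distribution with 2 d.f.
    p_value = math.exp(-chi2 / 2.0)
    return chi2, p_value


# Hypothetical counts: a deficit of one homozygote class among 100 adults.
chi2, p_value = f2_distortion(40, 50, 10)
print(f"chi2 = {chi2:.1f}, p = {p_value:.2e}")
```

A marker would be flagged as distorted when its p-value falls below the chosen threshold (the abstract reports markers at p < 0.05); a genome-wide scan would normally also correct for the number of markers tested.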
Quantifying landscape linkages among giant panda subpopulations in regional scale conservation.
Qi, Dunwu; Hu, Yibo; Gu, Xiaodong; Yang, Xuyi; Yang, Guang; Wei, Fuwen
2012-06-01
Understanding habitat requirements and identifying landscape linkages are essential for the survival of isolated populations of endangered species. Currently, some of the giant panda populations are isolated, which threatens their long-term survival, particularly in the Xiaoxiangling mountains. In the present study, we quantified niche requirements and then identified potential linkages of giant panda subpopulations in the most isolated region, using ecological niche factor analysis and a least-cost path model. Giant pandas preferred habitat with conifer forest and gentle slopes (>20 to ≤30°). Based on spatial distribution of suitable habitat, linkages were identified for the Yele subpopulation to 4 other subpopulations (Liziping, Matou, Xinmin and Wanba). Their lengths ranged from 15 to 54 km. The accumulated cost ranged from 693 to 3166 and conifer forest covered over 31%. However, a variety of features (e.g. major roads, human settlements and large unforested areas) might act as barriers along the linkages for giant panda dispersal. Our analysis quantified giant panda subpopulation connectivity to ensure long-term survival. © 2012 ISZS, Blackwell Publishing and IOZ/CAS.
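A least-cost path model treats the landscape as a raster of movement costs and finds the route between two locations with the smallest accumulated cost. The sketch below shows one generic way to do this, using Dijkstra's algorithm on a 4-connected grid. The grid values, the function name `least_cost_path` and the neighbourhood rule are illustrative assumptions, not the resistance surface or software used in the study.

```python
import heapq


def least_cost_path(cost, start, goal):
    """Dijkstra on a 4-connected raster; entering cell (r, c) costs cost[r][c]."""
    rows, cols = len(cost), len(cost[0])
    best = {start: 0}          # smallest accumulated cost found so far
    previous = {}              # back-pointers for path reconstruction
    heap = [(0, start)]
    while heap:
        acc, cell = heapq.heappop(heap)
        if cell == goal:
            break
        if acc > best[cell]:   # stale heap entry
            continue
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols:
                new_acc = acc + cost[nr][nc]
                if new_acc < best.get((nr, nc), float("inf")):
                    best[(nr, nc)] = new_acc
                    previous[(nr, nc)] = cell
                    heapq.heappush(heap, (new_acc, (nr, nc)))
    # Walk the back-pointers from goal to start.
    path = [goal]
    while path[-1] != start:
        path.append(previous[path[-1]])
    path.reverse()
    return best[goal], path


# Hypothetical resistance surface: 1 = suitable habitat, 9 = barrier-like cell.
resistance = [
    [1, 1, 9],
    [9, 1, 9],
    [9, 1, 1],
]
total_cost, route = least_cost_path(resistance, start=(0, 0), goal=(2, 2))
print(total_cost, route)
```

In a real analysis the resistance values would be derived from habitat suitability, and features such as roads or settlements would be given high costs so that candidate linkages avoid them where possible.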
Significant Linkage for Tourette Syndrome in a Large French Canadian Family
Mérette, Chantal; Brassard, Andrée; Potvin, Anne; Bouvier, Hélène; Rousseau, François; Émond, Claudia; Bissonnette, Luc; Roy, Marc-André; Maziade, Michel; Ott, Jurg; Caron, Chantal
2000-01-01
Family and twin studies provide strong evidence that genetic factors are involved in the transmission of Gilles de la Tourette syndrome (TS) and related psychiatric disorders. To detect the underlying susceptibility gene(s) for TS, we performed linkage analysis in one large French Canadian family (127 members) from the Charlevoix region, in which 20 family members were definitely affected by TS and 20 others showed related tic disorders. Using model-based linkage analysis, we observed a LOD score of 3.24 on chromosome 11 (11q23). This result was obtained in a multipoint approach involving marker D11S1377, the marker for which significant linkage disequilibrium with TS recently has been detected in an Afrikaner population. Altogether, 25 markers were studied, and, for level of significance, we derived a criterion that took into account the multiple testing arising from the use of three phenotype definitions and three modes of inheritance, a procedure that yielded a LOD score of 3.18. Hence, even after adjustment for multiple testing, the present study shows statistically significant evidence for genetic linkage with TS. PMID:10986045
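For readers unfamiliar with LOD scores, the sketch below computes the simplest case: a two-point LOD score for phase-known, fully informative meioses. This is an assumption made for illustration; the study itself used model-based multipoint analysis on a large pedigree, which requires dedicated linkage software and is not reproduced here. The function name and counts are hypothetical.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """LOD = log10( L(theta) / L(theta = 0.5) ) for phase-known meioses.

    recombinants -- number of recombinant meioses observed
    meioses      -- total number of informative meioses
    theta        -- recombination fraction being tested, 0 < theta <= 0.5
    """
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must lie in (0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must be between 0 and meioses")
    non_recombinants = meioses - recombinants
    log_l_theta = (recombinants * math.log10(theta)
                   + non_recombinants * math.log10(1.0 - theta))
    log_l_null = meioses * math.log10(0.5)
    return log_l_theta - log_l_null


# Hypothetical data: 1 recombinant in 15 informative meioses.
for theta in (0.01, 0.05, 0.10, 0.20):
    print(theta, round(two_point_lod(1, 15, theta), 3))
```

By convention a LOD score of 3 or more is taken as significant evidence for linkage, which is why the adjusted score of 3.18 reported above still clears the threshold.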
N'Diaye, Amidou; Haile, Jemanesh K; Cory, Aron T; Clarke, Fran R; Clarke, John M; Knox, Ron E; Pozniak, Curtis J
2017-01-01
Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. 
These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat.
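The increase in polymorphism information content (PIC) when SNPs are combined into haplotypes follows from the PIC formula itself. The sketch below uses the standard definition, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2, with hypothetical allele frequencies; it is not the authors' code.

```python
def pic(freqs):
    """Polymorphism information content for a locus with the given allele frequencies."""
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    squares = [p * p for p in freqs]
    homozygosity = sum(squares)
    cross_term = sum(
        2.0 * squares[i] * squares[j]
        for i in range(len(squares))
        for j in range(i + 1, len(squares))
    )
    return 1.0 - homozygosity - cross_term


# A biallelic SNP can never exceed PIC = 0.375 (reached at 0.5 / 0.5).
print(pic([0.5, 0.5]))
# A hypothetical haplotype block with four equally frequent haplotypes.
print(pic([0.25, 0.25, 0.25, 0.25]))
```

Because a haplotype block behaves like a multi-allelic locus, its PIC can exceed the 0.375 ceiling of a single biallelic SNP, consistent with the rise from 0.27 to 0.50 reported in the abstract.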
Littlejohn, Mathew D; Turner, Sally-Anne; Walker, Caroline G; Berry, Sarah D; Tiplady, Kathryn; Sherlock, Ric G; Sutherland, Greg; Swift, Simon; Garrick, Dorian; Lacy-Hulbert, S Jane; McDougall, Scott; Spelman, Richard J; Snell, Russell G; Hillerton, J Eric
2018-05-01
Inflammation of the mammary gland following bacterial infection, commonly known as mastitis, affects all mammalian species. Although the aetiology and epidemiology of mastitis in the dairy cow are well described, the genetic factors mediating resistance to mammary gland infection are not well known, due in part to the difficulty in obtaining robust phenotypic information from sufficiently large numbers of individuals. To address this problem, an experimental mammary gland infection study was undertaken, using a Friesian-Jersey cross breed F2 herd. A total of 604 animals received an intramammary infusion of Streptococcus uberis in one gland, and the clinical response over 13 milkings was used for linkage mapping and genome-wide association analysis. A quantitative trait locus (QTL) was detected on bovine chromosome 11 for clinical mastitis status using microsatellite and Affymetrix 10 K SNP markers, and exome and genome sequence data from the six F1 sires of the experimental animals were then used to examine this region in more detail. A total of 485 sequence variants were typed in the QTL interval, and association mapping using these and an additional 37 986 genome-wide markers from the Illumina SNP50 bovine SNP panel revealed association with markers encompassing the interleukin-1 gene cluster locus. This study highlights a region on bovine chromosome 11, consistent with earlier studies, as conferring resistance to experimentally induced mammary gland infection, and newly prioritises the IL1 gene cluster for further analysis in genetic resistance to mastitis.
Solar Effects on Global Climate Due to Cosmic Rays and Solar Energetic Particles
NASA Technical Reports Server (NTRS)
Turco, R. P.; Raeder, J.; DAuria, R.
2005-01-01
Although the work reported here does not directly connect solar variability with global climate change, this research establishes a plausible quantitative causative link between observed solar activity and apparently correlated variations in terrestrial climate parameters. Specifically, we have demonstrated that ion-mediated nucleation of atmospheric particles is a likely, and likely widespread, phenomenon that relates solar variability to changes in the microphysical properties of clouds. To investigate this relationship, we have constructed and applied a new model describing the formation and evolution of ionic clusters under a range of atmospheric conditions throughout the lower atmosphere. The activation of large ionic clusters into cloud nuclei is predicted to be favorable in the upper troposphere and mesosphere, and possibly in the lower stratosphere. The model developed under this grant needs to be extended to include additional cluster families, and should be incorporated into microphysical models to further test the cause-and-effect linkages that may ultimately explain key aspects of the connections between solar variability and climate.
Bashan, Anat; Yonath, Ada
2009-01-01
Crystallography of ribosomes, the universal cell nucleoprotein assemblies facilitating the translation of the genetic code into proteins, met with severe problems owing to their large size, complex structure, inherent flexibility and high conformational variability. For the case of the small ribosomal subunit, which caused extreme difficulties, post-crystallization treatment with minute amounts of a heteropolytungstate cluster allowed structure determination at atomic resolution. This cluster played a dual role in ribosomal crystallography: providing anomalous phasing power and dramatically increasing the resolution by stabilizing a selected functional conformation. Thus, four out of the fourteen clusters that bind to each of the crystallized small subunits are attached to a specific ribosomal protein in a fashion that may control a significant component of the subunit internal flexibility, by “gluing” symmetry-related subunits. Here we highlight basic issues in the relationship between metal ions and macromolecules and present common traits controlling the interactions between polymetalates and various macromolecules, which may be extended towards the exploitation of polymetalates for therapeutic treatment. PMID:19915655
Role of excess ligand and effect of thermal treatment in hybrid inorganic-organic EUV resists
NASA Astrophysics Data System (ADS)
Mattson, Eric C.; Rupich, Sara M.; Cabrera, Yasiel; Chabal, Yves J.
2018-03-01
The chemical structure and thermal reactivity of recently discovered inorganic-organic hybrid resist materials are characterized using a combination of in situ and ex situ infrared (IR) spectroscopy and x-ray photoemission spectroscopy (XPS). The materials are comprised of a small HfOx core capped with methacrylic acid ligands that form a combined hybrid cluster, HfMAA. The observed IR modes are consistent with the calculated modes predicted from the previously determined x-ray crystal structure of the HfMAA-12 cluster, but also contain extrinsic hydroxyl groups. We find that the water content of the films is dependent on the concentration of excess ligand added to the solution. The effect of environment used during post-application baking (PAB) is studied and correlated to changes in solubility of the films. In doing so, we find that hydroxylation of the clusters results in formation of additional Hf-O-Hf linkages upon heating, which in turn impacts the solubility of the films.
Dynamic analysis of six-bar mechanical press for deep drawing
NASA Astrophysics Data System (ADS)
Mitsi, S.; Tsiafis, I.; Bouzakis, K. D.
2017-02-01
This paper analyzes the dynamic behavior of a six-bar linkage used in mechanical presses for metal forming such as deep drawing. In the mechanism under study, a four-bar linkage is connected to a slider through an articulated binary link. The motion of the six-bar linkage is studied through kinematic analysis using an analytical method. Furthermore, using an iterative method and d'Alembert's principle, the joint forces and drive moment are evaluated taking joint friction into account. The simulation results obtained with a MATLAB program are validated by comparing the theoretical values of the input moment with those obtained from the law of conservation of energy.
Bailey, Richard I; Innocenti, Paolo; Morrow, Edward H; Friberg, Urban; Qvarnström, Anna
2011-02-28
The evolution of female choice mechanisms favouring males of their own kind is considered a crucial step during the early stages of speciation. However, although the genomics of mate choice may influence both the likelihood and speed of speciation, the identity and location of genes underlying assortative mating remain largely unknown. We used mate choice experiments and gene expression analysis of female Drosophila melanogaster to examine three key components influencing speciation. We show that the 1,498 genes in Zimbabwean female D. melanogaster whose expression levels differ when mating with more (Zimbabwean) versus less (Cosmopolitan strain) preferred males include many with high expression in the central nervous system and ovaries, are disproportionately X-linked and form a number of clusters with low recombination distance. Significant involvement of the brain and ovaries is consistent with the action of a combination of pre- and postcopulatory female choice mechanisms, while sex linkage and clustering of genes lead to high potential evolutionary rate and sheltering against the homogenizing effects of gene exchange between populations. Taken together our results imply favourable genomic conditions for the evolution of reproductive isolation through mate choice in Zimbabwean D. melanogaster and suggest that mate choice may, in general, act as an even more important engine of speciation than previously realized.
Microarray gene expression profiling using core biopsies of renal neoplasia
Rogers, Craig G.; Ditlev, Jonathon A.; Tan, Min-Han; Sugimura, Jun; Qian, Chao-Nan; Cooper, Jeff; Lane, Brian; Jewett, Michael A.; Kahnoski, Richard J.; Kort, Eric J.; Teh, Bin T.
2009-01-01
We investigate the feasibility of using microarray gene expression profiling technology to analyze core biopsies of renal tumors for classification of tumor histology. Core biopsies were obtained ex-vivo from 7 renal tumors—comprised of four histological subtypes—following radical nephrectomy using 18-gauge biopsy needles. RNA was isolated from these samples and, in the case of biopsy samples, amplified by in vitro transcription. Microarray analysis was then used to quantify the mRNA expression patterns in these samples relative to non-diseased renal tissue mRNA. Genes with significant variation across all non-biopsy tumor samples were identified, and the relationship between tumor and biopsy samples in terms of expression levels of these genes was then quantified in terms of Euclidean distance, and visualized by complete linkage clustering. Final pathologic assessment of kidney tumors demonstrated clear cell renal cell carcinoma (4), oncocytoma (1), angiomyolipoma (1) and adrenalcortical carcinoma (1). Five of the seven biopsy samples were most similar in terms of gene expression to the resected tumors from which they were derived in terms of Euclidean distance. All seven biopsies were assigned to the correct histological class by hierarchical clustering. We demonstrate the feasibility of gene expression profiling of core biopsies of renal tumors to classify tumor histology. PMID:19966938
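The clustering step described above (Euclidean distance between expression profiles, followed by complete linkage) can be sketched in plain Python. The sample names and two-gene expression values below are hypothetical, and the sketch stops at a requested number of clusters rather than drawing a dendrogram.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length expression vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def complete_linkage(profiles, n_clusters):
    """Agglomerative clustering; cluster distance is the largest pairwise distance."""
    clusters = [[name] for name in profiles]

    def cluster_distance(c1, c2):
        return max(euclidean(profiles[a], profiles[b]) for a in c1 for b in c2)

    while len(clusters) > n_clusters:
        # Find the pair of clusters with the smallest complete-linkage distance.
        i, j = min(
            ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
            key=lambda pair: cluster_distance(clusters[pair[0]], clusters[pair[1]]),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return sorted(sorted(c) for c in clusters)


# Hypothetical log-ratio expression values for two genes per sample.
profiles = {
    "tumor_A": [0.0, 0.1],
    "biopsy_A": [0.1, 0.0],
    "tumor_B": [5.0, 5.1],
    "biopsy_B": [5.2, 5.0],
}
print(complete_linkage(profiles, 2))
```

In this toy example each biopsy groups with the tumor it came from, which mirrors the kind of agreement the study reports between biopsy and resected tumor samples.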
Walker, Anne-Sophie; Gladieux, Pierre; Decognet, Véronique; Fermaud, Marc; Confais, Johann; Roudet, Jean; Bardin, Marc; Bout, Alexandre; Nicot, Philippe C; Poncet, Christine; Fournier, Elisabeth
2015-04-01
Understanding the causes of population subdivision is of fundamental importance, as studying barriers to gene flow between populations may reveal key aspects of the process of adaptive divergence and, for pathogens, may help in forecasting disease emergence and implementing sound management strategies. Here, we investigated population subdivision in the multihost fungus Botrytis cinerea based on comprehensive multiyear sampling on different hosts in three French regions. Analyses revealed a weak association between population structure and geography, but a clear differentiation according to the host plant of origin. This was consistent with adaptation to hosts, but the distribution of inferred genetic clusters and the frequency of admixed individuals indicated a lack of strict host specificity. Differentiation between individuals collected in the greenhouse (on Solanum) and outdoors (on Vitis and Rubus) was stronger than that observed between individuals from the two outdoor hosts, probably reflecting an additional isolating effect associated with the cropping system. Three genetic clusters coexisted on Vitis but did not persist over time. Linkage disequilibrium analysis indicated that outdoor populations were regularly recombining, whereas clonality was predominant in the greenhouse. Our findings open up new perspectives for disease control by managing plant debris in outdoor conditions and reinforcing prophylactic measures indoors. © 2014 Society for Applied Microbiology and John Wiley & Sons Ltd.
Wang, Chunfang; Jia, Guanqing; Zhi, Hui; Niu, Zhengang; Chai, Yang; Li, Wei; Wang, Yongfang; Li, Haiquan; Lu, Ping; Zhao, Baohua; Diao, Xianmin
2012-01-01
As an ancient cereal of great importance for dryland agriculture even today, foxtail millet (Setaria italica) is fast becoming a new plant genomic model crop. A genotypic analysis of 250 foxtail millet landraces, which represent 1% of foxtail millet germplasm kept in the Chinese National Gene Bank (CNGB), was conducted with 77 SSRs covering the foxtail millet genome. A high degree of molecular diversity among the landraces was found, with an average of 20.9 alleles per locus detected. STRUCTURE, neighbor-joining, and principal components analyses classify the accessions into three clusters (topmost hierarchy) and, ultimately, four conservative subgroups (substructuring within the topmost clusters) in total, which are in good accordance with eco-geographical distribution in China. The highest subpopulation diversity was identified in the accessions of Pop3 from the middle regions of the Yellow River, followed by accessions in Pop1 from the downstream regions of the Yellow River, suggesting that foxtail millet was domesticated in the Yellow River drainage area first and then spread to other parts of the country. Linkage disequilibrium (LD) decay of less than 20 cM of genetic distance in the foxtail millet landrace genome was observed, which suggests that it could be possible to achieve resolution down to the 20 cM level for association mapping. PMID:22870400
Guarnizo, Carlos E.; Paz, Andrea; Muñoz-Ortiz, Astrid; Flechas, Sandra V.; Méndez-Narváez, Javier; Crawford, Andrew J.
2015-01-01
Colombia hosts the second highest amphibian species diversity on Earth, yet its fauna remains poorly studied, especially using molecular genetic techniques. We present the results of the first wide-scale DNA barcoding survey of anurans of Colombia, focusing on a transect across the Eastern Cordillera. We surveyed 10 sites between the Magdalena Valley to the west and the eastern foothills of the Eastern Cordillera, sequencing portions of the mitochondrial 16S ribosomal RNA and cytochrome oxidase subunit 1 (CO1) genes for 235 individuals from 52 nominal species. We applied two barcode algorithms, Automatic Barcode Gap Discovery and Refined Single Linkage Analysis, to estimate the number of clusters or “unconfirmed candidate species” supported by DNA barcode data. Our survey included ~7% of the anuran species known from Colombia. While barcoding algorithms differed slightly in the number of clusters identified, between three and ten nominal species may be obscuring candidate species (in some cases, more than one cryptic species per nominal species). Our data suggest that the high elevations of the Eastern Cordillera and the low elevations of the Chicamocha canyon acted as geographic barriers in at least seven nominal species, promoting strong genetic divergences between populations associated with the Eastern Cordillera. PMID:26000447
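To make the notion of barcode clusters concrete, the sketch below groups aligned sequences by single-linkage clustering at a fixed uncorrected p-distance threshold. This is a deliberately simplified stand-in: it is not an implementation of Automatic Barcode Gap Discovery or Refined Single Linkage Analysis, both of which choose or refine the threshold from the data. The sequences, names and the 10% threshold are hypothetical.

```python
def p_distance(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def single_linkage_clusters(seqs, threshold):
    """Union-find clustering: sequences within `threshold` share a cluster."""
    names = list(seqs)
    parent = {name: name for name in names}

    def find(name):
        while parent[name] != name:
            parent[name] = parent[parent[name]]  # path compression
            name = parent[name]
        return name

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if p_distance(seqs[a], seqs[b]) <= threshold:
                parent[find(a)] = find(b)

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical 10-bp fragments; real barcodes are several hundred bp long.
seqs = {
    "s1": "ACGTACGTAC",
    "s2": "ACGTACGTAT",
    "s3": "TTGTACCTGA",
    "s4": "TTGTACCTGC",
}
print(single_linkage_clusters(seqs, threshold=0.1))
```

Each resulting cluster would correspond to one candidate species; a nominal species whose individuals fall into more than one cluster is the situation the abstract describes as potentially obscuring cryptic diversity.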
Buccheri, Maria A; Spina, Sonia; Ruberto, Concetta; Lombardo, Turi; Labie, Dominique; Ragusa, Angela
2013-01-01
Fetal hemoglobin (Hb F) is the principal ameliorating factor of β-thalassemia (β-thal) and sickle cell disease. Persistent production in adult life is a quantitative trait regulated by loci inside or outside the β-globin gene cluster. From genome-wide association studies, principal quantitative trait loci (QTL) (accounting for 50.0% of Hb F variability in different populations) have been identified in the BCL11A gene, HBS1L-MYB intergenic polymorphism and the β-globin gene cluster itself. In this study, we analyzed quantitative trait haplotypes in two Sicilian families with extremely mild β-thal and unusually high Hb F expression, in order to examine possible genetic background variations in a similar β-thalassemic phenotype. This study redefines the linkage disequilibrium blocks at these loci, but also shows slight differences between probands in haplotype combinations which could reflect different mechanisms of high Hb F production in patients with β-thal. We proposed a haplotype-based approach as a useful tool for the understanding of β-thal phenotype variation in patients with similar β-thalassemic backgrounds in an attempt to answer the recurring question of why patients with the same β-thalassemic genotype show different phenotypes.
Bian, Chao; Hu, Yinchang; Ravi, Vydianathan; Kuznetsova, Inna S.; Shen, Xueyan; Mu, Xidong; Sun, Ying; You, Xinxin; Li, Jia; Li, Xiaofeng; Qiu, Ying; Tay, Boon-Hui; Thevasagayam, Natascha May; Komissarov, Aleksey S.; Trifonov, Vladimir; Kabilov, Marsel; Tupikin, Alexey; Luo, Jianren; Liu, Yi; Song, Hongmei; Liu, Chao; Wang, Xuejie; Gu, Dangen; Yang, Yexin; Li, Wujiao; Polgar, Gianluca; Fan, Guangyi; Zeng, Peng; Zhang, He; Xiong, Zijun; Tang, Zhujing; Peng, Chao; Ruan, Zhiqiang; Yu, Hui; Chen, Jieming; Fan, Mingjun; Huang, Yu; Wang, Min; Zhao, Xiaomeng; Hu, Guojun; Yang, Huanming; Wang, Jian; Wang, Jun; Xu, Xun; Song, Linsheng; Xu, Gangchun; Xu, Pao; Xu, Junmin; O’Brien, Stephen J.; Orbán, László; Venkatesh, Byrappa; Shi, Qiong
2016-01-01
The Asian arowana (Scleropages formosus), one of the world’s most expensive cultivated ornamental fishes, is an endangered species. It represents an ancient lineage of teleosts: the Osteoglossomorpha. Here, we provide a high-quality chromosome-level reference genome of a female golden-variety arowana using a combination of deep shotgun sequencing and high-resolution linkage mapping. In addition, we have also generated two draft genome assemblies for the red and green varieties. Phylogenomic analysis supports a sister group relationship between Osteoglossomorpha (bonytongues) and Elopomorpha (eels and relatives), with the two clades together forming a sister group of Clupeocephala which includes all the remaining teleosts. The arowana genome retains the full complement of eight Hox clusters, unlike the African butterfly fish (Pantodon buchholzi), another bonytongue fish, which possesses only five Hox clusters. Differential gene expression among three varieties provides insights into the genetic basis of colour variation. A potential heterogametic sex chromosome is identified in the female arowana karyotype, suggesting that the sex is determined by a ZW/ZZ sex chromosomal system. The high-quality reference genome of the golden arowana and the draft assemblies of the red and green varieties are valuable resources for understanding the biology, adaptation and behaviour of Asian arowanas. PMID:27089831
Wang, Chunfang; Jia, Guanqing; Zhi, Hui; Niu, Zhengang; Chai, Yang; Li, Wei; Wang, Yongfang; Li, Haiquan; Lu, Ping; Zhao, Baohua; Diao, Xianmin
2012-07-01
As an ancient cereal of great importance for dryland agriculture even today, foxtail millet (Setaria italica) is fast becoming a new plant genomic model crop. A genotypic analysis of 250 foxtail millet landraces, which represent 1% of foxtail millet germplasm kept in the Chinese National Gene Bank (CNGB), was conducted with 77 SSRs covering the foxtail millet genome. A high degree of molecular diversity among the landraces was found, with an average of 20.9 alleles per locus detected. STRUCTURE, neighbor-joining, and principal components analyses classify the accessions into three clusters (topmost hierarchy) and, ultimately, four conservative subgroups (substructuring within the topmost clusters) in total, which are in good accordance with eco-geographical distribution in China. The highest subpopulation diversity was identified in the accessions of Pop3 from the middle regions of the Yellow River, followed by accessions in Pop1 from the downstream regions of the Yellow River, suggesting that foxtail millet was domesticated in the Yellow River drainage area first and then spread to other parts of the country. Linkage disequilibrium (LD) decay of less than 20 cM of genetic distance in the foxtail millet landrace genome was observed, which suggests that it could be possible to achieve resolution down to the 20 cM level for association mapping.
Naanyu, Violet; Vedanthan, Rajesh; Kamano, Jemima H; Rotich, Jackson K; Lagat, Kennedy K; Kiptoo, Peninah; Kofler, Claire; Mutai, Kennedy K; Bloomfield, Gerald S; Menya, Diana; Kimaiyo, Sylvester; Fuster, Valentin; Horowitz, Carol R; Inui, Thomas S
2016-03-01
Hypertension, the leading global risk factor for mortality, is characterized by low treatment and control rates in low- and middle-income countries. Poor linkage to hypertension care contributes to poor outcomes for patients. However, specific factors influencing linkage to hypertension care are not well known. To evaluate factors influencing linkage to hypertension care in rural western Kenya. Qualitative research study using a modified Health Belief Model that incorporates the impact of emotional and environmental factors on behavior. Mabaraza (traditional community assembly) participants (n = 242) responded to an open invitation to residents in their respective communities. Focus groups, formed by purposive sampling, consisted of hypertensive individuals, at-large community members, and community health workers (n = 169). We performed content analysis of the transcripts with NVivo 10 software, using both deductive and inductive codes. We used a two-round Delphi method to rank the barriers identified in the content analysis. We selected factors using triangulation of frequency of codes and themes from the transcripts, in addition to the results of the Delphi exercise. Sociodemographic characteristics of participants were summarized using descriptive statistics. We identified 27 barriers to linkage to hypertension care, grouped into individual (cognitive and emotional) and environmental factors. Cognitive factors included the asymptomatic nature of hypertension and limited information. Emotional factors included fear of being a burden to the family and fear of being screened for stigmatized diseases such as HIV. Environmental factors were divided into physical (e.g. distance), socioeconomic (e.g. poverty), and health system factors (e.g. popularity of alternative therapies). The Delphi results were generally consistent with the findings from the content analysis. 
Individual and environmental factors are barriers to linkage to hypertension care in rural western Kenya. Our analysis provides new insights and methodological approaches that may be relevant to other low-resource settings worldwide.
Dubé, M P; Mlodzienski, M A; Kibar, Z; Farlow, M R; Ebers, G; Harper, P; Kolodny, E H; Rouleau, G A; Figlewicz, D A
1997-03-01
Hereditary spastic paraplegia (HSP) is a degenerative disorder of the motor system, defined by progressive weakness and spasticity of the lower limbs. HSP may be inherited as an autosomal dominant (AD), autosomal recessive, or an X-linked trait. AD HSP is genetically heterogeneous, and three loci have been identified so far: SPG3 maps to chromosome 14q, SPG4 to 2p, and SPG4a to 15q. We have undertaken linkage analysis with 21 uncomplicated AD families to the three AD HSP loci. We report significant linkage for three of our families to the SPG4 locus and exclude several families by multipoint linkage. We used linkage information from several different research teams to evaluate the statistical probability of linkage to the SPG4 locus for uncomplicated AD HSP families and established the critical LOD-score value necessary for confirmation of linkage to the SPG4 locus from Bayesian statistics. In addition, we calculated the empirical P-values for the LOD scores obtained with all families with computer simulation methods. Power to detect significant linkage, as well as type I error probabilities, were evaluated. This combined analytical approach permitted conclusive linkage analyses on small to medium-size families, under the restrictions of genetic heterogeneity.
Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira
2014-01-01
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg., which is native to the Amazon rainforest, is the primary source of natural rubber. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular markers was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection. PMID:25048025
Nimmakayala, Padma; Abburi, Venkata L.; Saminathan, Thangasamy; Almeida, Aldo; Davenport, Brittany; Davidson, Joshua; Reddy, C. V. Chandra Mohan; Hankins, Gerald; Ebert, Andreas; Choi, Doil; Stommel, John; Reddy, Umesh K.
2016-01-01
Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to characterize population structure and species domestication of these two important incompatible cultivated pepper species. Estimated mean nucleotide diversity (π) and Tajima's D across various chromosomes revealed biased distribution toward negative values on all chromosomes (except for chromosome 4) in cultivated C. baccatum, indicating a population bottleneck during domestication of C. baccatum. In contrast, C. annuum chromosomes showed positive π and Tajima's D on all chromosomes except chromosome 8, which may be because of domestication at multiple sites contributing to wider genetic diversity. For C. baccatum, 13,129 SNPs were available, with minor allele frequency (MAF) ≥0.05; PCA of the SNPs revealed 283 C. baccatum accessions grouped into 3 distinct clusters, indicating strong population structure. The fixation index (FST) between domesticated C. annuum and C. baccatum was 0.78, which indicates genome-wide divergence. We conducted extensive linkage disequilibrium (LD) analysis of C. baccatum var. pendulum cultivars on all adjacent SNP pairs within a chromosome to identify regions of high and low LD interspersed with a genome-wide average LD block size of 99.1 kb. We characterized 1742 haplotypes containing 4420 SNPs (range 2–9 SNPs per haplotype). Genome-wide association study (GWAS) of peduncle length, a trait that differentiates wild and domesticated C. baccatum types, revealed 36 significantly associated genome-wide SNPs. Population structure, identity by state (IBS) and LD patterns across the genome will be of potential use for future GWAS of economically important traits in C. baccatum peppers. PMID:27857720
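Pairwise LD between adjacent SNPs is commonly summarised by the squared correlation r². The following minimal sketch computes r² from phased two-locus haplotypes coded 0/1; the function name and the toy haplotypes are hypothetical, and the software and LD statistic actually used in the study are not stated in the abstract.

```python
def ld_r2(haplotypes):
    """r^2 between two biallelic loci from phased haplotypes.

    haplotypes -- list of (allele_at_locus_A, allele_at_locus_B) pairs, each 0 or 1
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("at least one haplotype is required")
    p_a = sum(h[0] for h in haplotypes) / n          # frequency of allele 1 at locus A
    p_b = sum(h[1] for h in haplotypes) / n          # frequency of allele 1 at locus B
    p_ab = sum(1 for h in haplotypes if h[0] == 1 and h[1] == 1) / n
    d = p_ab - p_a * p_b                             # coefficient of disequilibrium D
    denom = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denom == 0.0:
        raise ValueError("r^2 is undefined when a locus is monomorphic")
    return d * d / denom


# Toy data: complete LD versus linkage equilibrium.
print(ld_r2([(1, 1), (1, 1), (0, 0), (0, 0)]))
print(ld_r2([(1, 1), (1, 0), (0, 1), (0, 0)]))
```

Runs of adjacent SNP pairs with high r² would then be merged into LD blocks, whose physical lengths give block-size summaries like the 99.1 kb average quoted above.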
Ma, G J; Song, Q J; Markell, S G; Qi, L L
2018-07-01
A novel rust resistance gene, R15, derived from the cultivated sunflower HA-R8 was assigned to linkage group 8 of the sunflower genome using a genotyping-by-sequencing approach. SNP markers closely linked to R15 were identified, facilitating marker-assisted selection of resistance genes. The rust virulence gene is co-evolving with the resistance gene in sunflower, leading to the emergence of new physiologic pathotypes. This presents a continuous threat to the sunflower crop, necessitating the development of resistant sunflower hybrids that provide more efficient, durable, and environmentally friendly host plant resistance. The inbred line HA-R8 carries a gene conferring resistance to all known races of the rust pathogen in North America and can be used as a broad-spectrum resistance resource. Based on phenotypic assessments of 140 F2 individuals derived from a cross of HA 89 with HA-R8, rust resistance in the population was found to be conferred by a single dominant gene (R15) originating from HA-R8. Genotypic analysis with the currently available SSR markers failed to find any association between rust resistance and any markers. Therefore, we used genotyping-by-sequencing (GBS) analysis to achieve better genomic coverage. The GBS data showed that R15 was located at the top end of linkage group (LG) 8. Saturation with 71 previously mapped SNP markers selected within this region further showed that it was located in a resistance gene cluster on LG8, and mapped to a 1.0-cM region between three co-segregating SNP markers SFW01920, SFW00128, and SFW05824 as well as the NSA_008457 SNP marker. These closely linked markers will facilitate marker-assisted selection and breeding in sunflower.
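A single dominant gene is usually inferred when the F2 population fits a 3:1 resistant-to-susceptible ratio. The sketch below shows a chi-square goodness-of-fit test for that ratio. The abstract reports 140 F2 individuals but not the phenotype counts, so the 108:32 split used here is purely hypothetical, as is the function name; the critical value 3.841 is the standard 5% cutoff for one degree of freedom.

```python
def chi_square_3_to_1(resistant: int, susceptible: int) -> float:
    """Chi-square statistic for a 3:1 (dominant:recessive) F2 segregation ratio."""
    total = resistant + susceptible
    expected_resistant = 0.75 * total
    expected_susceptible = 0.25 * total
    return ((resistant - expected_resistant) ** 2 / expected_resistant
            + (susceptible - expected_susceptible) ** 2 / expected_susceptible)


# Critical value of the chi-square distribution, 1 degree of freedom, alpha = 0.05.
CRITICAL_VALUE_DF1 = 3.841

# Hypothetical counts summing to 140; the real counts are not given in the abstract.
chi2 = chi_square_3_to_1(resistant=108, susceptible=32)
fits_single_dominant_gene = chi2 < CRITICAL_VALUE_DF1
```

For these illustrative counts the statistic is about 0.34, well below 3.841, so a 3:1 ratio would not be rejected.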
Wildlife connectivity approaches and best practices in U.S. state wildlife action plans.
Lacher, Iara; Wilkerson, Marit L
2014-02-01
As habitat loss and fragmentation threaten biodiversity on large geographic scales, creating and maintaining connectivity of wildlife populations is an increasingly common conservation objective. To assess the progress and success of large-scale connectivity planning, conservation researchers need a set of plans that cover large geographic areas and can be analyzed as a single data set. The state wildlife action plans (SWAPs) fulfill these requirements. We examined 50 SWAPs to determine the extent to which wildlife connectivity planning, via linkages, is emphasized nationally. We defined linkage as connective land that enables wildlife movement. For our content analysis, we identified and quantified 6 keywords and 7 content criteria that ranged in specificity and were related to linkages for wide-ranging terrestrial vertebrates and examined relations between content criteria and statewide data on focal wide-ranging species, spending, revenue, and conserved land. Our results reflect nationwide disparities in linkage conservation priorities and highlight the continued need for wildlife linkage planning. Only 30% or less of the 50 SWAPs fulfilled highly specific content criteria (e.g., identifying geographic areas for linkage placement or management). We found positive correlations between our content criteria and statewide data on percent conserved land, total focal species, and spending on parks and recreation. We supplemented our content analysis with interviews with 17 conservation professionals to gain specific information about state-specific context and future directions of linkage conservation. Based on our results, relevant literature, and interview responses, we suggest the following best practices for wildlife linkage conservation plans: collect ecologically meaningful background data; be specific; establish community-wide partnerships; and incorporate sociopolitical and socioeconomic information. © 2013 Society for Conservation Biology.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willems, P.; Vits, L.; Buntinx, I.
1993-11-01
Nonspecific X-linked mental retardation (MRX) includes several distinct entities with mental retardation but without additional distinguishing features. The MRX family reported here has been classified previously as MRX9. In this study, the authors performed linkage analysis of MRX9 with a panel of 43 polymorphic DNA markers dispersed over chromosome X. Two-point linkage analysis revealed lod scores of 3.52 and 3.82 at 0% recombination for OATL1 and MAOA, both located in Xp11.2-p11.4. Lod scores for linkage with PGK1P1, DXS106, and DXS132, all located in Xq11-q13, were 3.83, 3.82, and 3.52, respectively, all at 0% recombination. Multipoint linkage analysis showed two peaks with MAOA and DXS132/DXS106, respectively. Analysis of recombinational events indicated a position of the MRX9 gene between DXS164 and DXS453. These findings are compatible with a location of the MRX9 gene in the pericentromeric region of the X chromosome at Xp21-q13. 26 refs., 3 figs., 2 tabs.
Holmans, Peter; Zubenko, George S; Crowe, Raymond R; DePaulo, J Raymond; Scheftner, William A; Weissman, Myrna M; Zubenko, Wendy N; Boutelle, Sandra; Murphy-Eberenz, Kathleen; MacKinnon, Dean; McInnis, Melvin G; Marta, Diana H; Adams, Philip; Knowles, James A; Gladis, Madeleine; Thomas, Jo; Chellis, Jennifer; Miller, Erin; Levinson, Douglas F
2004-06-01
A genome scan was performed on the first phase sample of the Genetics of Recurrent Early-Onset Depression (GenRED) project. The sample consisted of 297 informative families containing 415 independent affected sibling pairs (ASPs), or, counting all possible pairs, 685 informative affected relative pairs (555 ASPs and 130 other pair types). Affected cases had recurrent major depressive disorder (MDD) with onset before age 31 years for probands or age 41 years for other affected relatives; the mean age at onset was 18.5 years, and the mean number of depressive episodes was 7.3. The Center for Inherited Disease Research genotyped 389 microsatellite markers (mean spacing of 9.3 cM). The primary linkage analysis considered allele sharing in all possible affected relative pairs with the use of the Z(lr) statistic computed by the ALLEGRO program. A secondary logistic regression analysis considered the effect of the sex of the pair as a covariate. Genomewide significant linkage was observed on chromosome 15q25.3-26.2 (Zlr=4.14, equivalent LOD = 3.73, empirical genomewide P=.023). The linkage was not sex specific. No other suggestive or significant results were observed in the primary analysis. The secondary analysis produced three regions of suggestive linkage, but these results should be interpreted cautiously because they depended primarily on the small subsample of 42 male-male pairs. Chromosome 15q25.3-26.2 deserves further study as a candidate region for susceptibility to MDD.
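The "equivalent LOD" quoted alongside the Zlr statistic follows from the standard conversion LOD = Zlr² / (2 ln 10). The sketch below applies it to the reported value; the function name is hypothetical, and the conversion is offered as an illustration of how the two numbers relate, not as a description of the ALLEGRO implementation.

```python
import math


def zlr_to_lod(zlr: float) -> float:
    """Convert a Zlr allele-sharing statistic to an equivalent LOD score."""
    return zlr ** 2 / (2.0 * math.log(10.0))


# Zlr value reported for chromosome 15q25.3-26.2 in the abstract.
lod_15q = zlr_to_lod(4.14)
```

The result is about 3.72, close to the reported 3.73; the small gap is presumably because the published Zlr is rounded to two decimals.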
Yu, Yang; Zhang, Xiaojun; Yuan, Jianbo; Wang, Quanchao; Li, Shihao; Huang, Hao; Li, Fuhua; Xiang, Jianhai
2017-06-01
The Pacific white shrimp Litopenaeus vannamei is a predominant aquaculture shrimp species in the world. Like many other animals, L. vannamei exhibits sexual dimorphism in growth traits. Mapping the sex-determining locus will help clarify the sex determination system and benefit the shrimp aquaculture industry through the production of mono-sex stocks. Based on the data used for high-density linkage map construction, linkage-mapping analysis was conducted. The sex determination region was mapped to linkage group (LG) 18. A large region from 0 to 21.205 cM in LG18 showed significant association with sex. However, none of the markers in this region showed complete association with sex in the other populations. An association analysis was therefore designed using the female parent, a pool of female progeny, the male parent, and a pool of male progeny. Markers were developed de novo, and those showing significant differences between the female and male pools were identified. Among them, three sex-associated markers, including one fully associated marker, were identified. Integration of the linkage and association analyses fine-mapped the sex determination region to a small region on LG18. The identified sex-associated marker can be used for sex detection in this species at the genetic level. The fine-mapped sex-determining region will contribute to mapping of the sex-determining gene and help clarify the sex determination system of L. vannamei.
[Linkage analysis of a family with familial hypertriglyceridemia].
Tang, Xin; Lin, Ying; Liu, Bing; Ma, Shi; Yang, Yang; Yang, Zheng-lin
2009-10-01
To perform linkage analysis and mutation screening in a Chinese family with familial hypertriglyceridemia (FHTG). Thirty-two family members, including 12 hypertriglyceridemia patients, participated in the study. Genotyping and haplotype analysis for 22 subjects were performed using short tandem repeat (STR) microsatellite polymorphism markers on 16 candidate genes and/or loci related to lipid metabolism. Two of the sixteen known candidate genes, APOA2 and USF1, were screened for mutations by direct DNA sequencing. No linkage was found between the candidate genes/loci APOA5, LIPI, RP1, APOC2, ABC1, LMF1, APOA1-APOC3-APOA4, LPL, APOB, CETP, LCAT, LDLR, and APOE and the phenotype in this family. The two-point LOD scores (θ = 0) were less than −1.0 for all the markers tested. Linkage analysis suggested linkage between the disease phenotype and STR marker D1S194 on chromosome 1q23.3-24.2, with a two-point maximum LOD score of 2.44 at θ = 0. Fine mapping indicated that the disease gene was localized to a 5.87 cM interval between D1S104 and D1S196. No disease-causing mutation was detected in the APOA2 and USF1 genes. The above-mentioned candidate genes were excluded as the disease-causing genes for this family. The results implied that there might be a novel gene/locus for FHTG on chromosome 1q23.3-1q24.2.
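The two-point LOD scores quoted at θ = 0 compare the likelihood of the data at a given recombination fraction with the likelihood under free recombination (θ = 0.5). A minimal sketch for the simplest textbook case follows; it assumes phase-known, fully informative meioses and a fully penetrant trait, which is far simpler than the pedigree likelihoods that linkage programs actually compute. The function name and the example counts are hypothetical.

```python
import math


def two_point_lod(theta: float, n_meioses: int, n_recombinants: int) -> float:
    """LOD score at recombination fraction theta for phase-known meioses.

    LOD(theta) = log10( theta^r * (1 - theta)^(n - r) / 0.5^n )
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    if not 0 <= n_recombinants <= n_meioses:
        raise ValueError("recombinants must lie between 0 and the number of meioses")
    likelihood = (theta ** n_recombinants
                  * (1.0 - theta) ** (n_meioses - n_recombinants))
    if likelihood == 0.0:
        # A recombinant observed at theta = 0 is impossible under the model.
        return float("-inf")
    return math.log10(likelihood) - n_meioses * math.log10(0.5)


# Hypothetical example: 10 informative meioses, none recombinant.
lod_no_recombinants = two_point_lod(theta=0.0, n_meioses=10, n_recombinants=0)
```

Under these assumptions each non-recombinant meiosis adds log10(2), about 0.30, to the LOD score at θ = 0, so ten such meioses give roughly 3.01, while a single observed recombinant drives the score at θ = 0 to negative infinity.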
Hinckley, Jesse D; Abbott, Diana; Burns, Trudy L; Heiman, Meadow; Shapiro, Amy D; Wang, Kai; Di Paola, Jorge
2013-01-01
We characterized a large Amish pedigree and, in 384 pedigree members, analyzed the genetic variance components with a covariate screen as well as genome-wide quantitative trait locus (QTL) linkage analysis of red blood cell count (RBC), hemoglobin (HB), hematocrit (HCT), mean corpuscular volume (MCV), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), red cell distribution width (RDW), platelet count (PLT), and white blood cell count (WBC) using SOLAR. Age and gender were found to be significant covariates in many CBC traits. We obtained significant heritability estimates for RBC, MCV, MCH, MCHC, RDW, PLT, and WBC. We report four candidate loci with logarithm of the odds (LOD) scores above 2.0: 6q25 (MCH), 9q33 (WBC), 10p12 (RDW), and 20q13 (MCV). We also report eleven candidate loci with LOD scores between 1.5 and <2.0. Bivariate linkage analysis of MCV and MCH on chromosome 20 resulted in a higher maximum LOD score of 3.14. Linkage signals on chromosomes 4q28, 6p22, 6q25, and 20q13 are concomitant with previously reported QTL. All other linkage signals reported herein represent novel evidence of candidate QTL. Interestingly, rs1800562, the most common causal variant of hereditary hemochromatosis in HFE (6p22), was associated with MCH and MCHC in this family. Linkage studies like the one presented here will allow investigators to focus the search for rare variants amidst the noise encountered in the large amounts of data generated by whole-genome sequencing. PMID:24058921
Refined genetic mapping of X-linked Charcot-Marie-Tooth neuropathy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fain, P.R.; Barker, D.F.; Chance, P.F.
1994-02-01
Genetic linkage studies were conducted in four multigenerational families with X-linked Charcot-Marie-Tooth disease (CMTX), using 12 highly polymorphic short-tandem-repeat markers for the pericentromeric region of the X chromosome. Pairwise linkage analysis with individual markers confirmed tight linkage of CMTX to the pericentromeric region in each family. Multipoint analyses strongly support the order DXS337-CMTX-DXS441-(DXS56, PGK1). 38 refs., 2 figs., 1 tab.
Kwitek-Black, A E; Carmi, R; Duyk, G M; Buetow, K H; Elbedour, K; Parvari, R; Yandava, C N; Stone, E M; Sheffield, V C
1993-12-01
Bardet-Biedl syndrome is an autosomal recessive disorder characterized by mental retardation, obesity, retinitis pigmentosa, polydactyly and hypogonadism. Other findings include hypertension, diabetes mellitus and renal and cardiovascular anomalies. We have performed a genome-wide search for linkage in a large inbred Bedouin family. Pairwise analysis established linkage with the locus D16S408 with no recombination and a lod score of 4.2. A multilocus lod score of 5.3 was observed. By demonstrating homozygosity, in all affected individuals, for the same allele of marker D16S408, further support for linkage is found, and the utility of homozygosity mapping using inbred families is demonstrated. In a second family, linkage was excluded at this locus, suggesting non-allelic genetic heterogeneity in this disorder.
Linking stressors and ecological responses
Gentile, J.H.; Solomon, K.R.; Butcher, J.B.; Harrass, M.; Landis, W.G.; Power, M.; Rattner, B.A.; Warren-Hicks, W.J.; Wenger, R.; Foran, Jeffery A.; Ferenc, Susan A.
1999-01-01
To characterize risk, it is necessary to quantify the linkages and interactions between chemical, physical and biological stressors and endpoints in the conceptual framework for ecological risk assessment (ERA). This can present challenges in a multiple stressor analysis, and it will not always be possible to develop a quantitative stressor-response profile. This review commences with a conceptual representation of the problem of developing a linkage analysis for multiple stressors and responses. The remainder of the review surveys a variety of mathematical and statistical methods (e.g., ranking methods, matrix models, multivariate dose-response for mixtures, indices, visualization, simulation modeling and decision-oriented methods) for accomplishing the linkage analysis for multiple stressors. Describing the relationships between multiple stressors and ecological effects is a critical component of 'effects assessment' in the ecological risk assessment framework.
Genetic linkage analysis of schizophrenia using chromosome 11q13-24 markers in Israeli pedigrees
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mulcrone, J.; Marchblanks, R.; Whatley, S.A.
It is generally agreed that there is a genetic component in the etiology of schizophrenia which may be tested by the application of linkage analysis to multiply-affected families. One genetic region of interest is the long arm of chromosome 11 because of previously reported associations of genetic variation in this region with schizophrenia, and because of the fact that it contains the locus for the dopamine D2 receptor gene. In this study we have examined the segregation of schizophrenia with microsatellite dinucleotide repeat DNA markers along chromosome 11q in 5 Israeli families multiply-affected for schizophrenia. The hypothesis of linkage under genetic homogeneity of causation was tested under a number of genetic models. Linkage analysis provided no evidence for significant causal mutations within the region bounded by INT and D11S420 on chromosome 11q. It is still possible, however, that a gene of major effect exists in this region, either with low penetrance or with heterogeneity. 32 refs., 2 figs., 4 tabs.
Linkage analysis in Usher syndrome type I (USH1) families from Spain.
Espinós, C; Nájera, C; Millán, J M; Ayuso, C; Baiget, M; Pérez-Garrigues, H; Rodrigo, O; Vilela, C; Beneyto, M
1998-01-01
Usher syndrome (USH) is an autosomal recessive hereditary disorder characterised by congenital sensorineural hearing loss and gradual visual impairment secondary to retinitis pigmentosa (RP). The disorder is clinically and genetically heterogeneous. With regard to Usher type I (USH1), several subtypes have been described, the most frequent being USH1B located on chromosome 11q13.5. Of 18 USH1 families studied by linkage analysis, 12 (67%) showed significant lod score values for locus D11S527 (Zmax=14.032, theta=0.000) situated on chromosome 11q. Our findings suggest considerable genetic heterogeneity in the Spanish USH1 population. It is important to note that one of our families linked to the USH1B locus shows interesting intrafamilial clinical variability. As regards the remaining six USH1 families, the linkage analysis did not provide conclusive data, although two of them show slight linkage to markers located on chromosome 3q (Zmax=1.880, theta=0.000 for D3S1279), the same location that had previously been assigned to some USH3 families. PMID:9610802
Kumawat, Giriraj; Raje, Ranjeet S; Bhutani, Shefali; Pal, Jitendra K; Mithra, Amitha S V C R; Gaikwad, Kishor; Sharma, Tilak R; Singh, Nagendra K
2012-10-08
Pigeonpea is an important grain legume of the semi-arid tropics and sub-tropical regions where it plays a crucial role in the food and nutritional security of the people. The average productivity of pigeonpea has remained very low and stagnant for over five decades due to lack of genomic information and intensive breeding efforts. Previous SSR-based linkage maps of pigeonpea used inter-specific crosses due to low inter-varietal polymorphism. Here our aim was to construct a high density intra-specific linkage map using genic-SNP markers for mapping of major quantitative trait loci (QTLs) for key agronomic traits, including plant height, number of primary and secondary branches, number of pods, days to flowering and days to maturity in pigeonpea. A population of 186 F2:3 lines derived from an intra-specific cross between inbred lines 'Pusa Dwarf' and 'HDM04-1' was used to construct a dense molecular linkage map of 296 genic SNP and SSR markers covering a total adjusted map length of 1520.22 cM for the 11 chromosomes of the pigeonpea genome. This is the first dense intra-specific linkage map of pigeonpea with the highest genome length coverage. Phenotypic data from the F2:3 families were used to identify thirteen QTLs for the six agronomic traits. The proportion of phenotypic variance explained by the individual QTLs ranged from 3.18% to 51.4%. Ten of these QTLs were clustered in just two genomic regions, indicating pleiotropic effects or close genetic linkage. In addition to the main effects, significant epistatic interaction effects were detected between the QTLs for number of pods per plant. A large amount of information on transcript sequences, SSR markers and draft genome sequence is now available for pigeonpea. However, there is need to develop high density linkage maps and identify genes/QTLs for important agronomic traits for practical breeding applications. This is the first report on identification of QTLs for plant type and maturity traits in pigeonpea. 
The QTLs identified in this study provide a strong foundation for further validation and fine mapping for use in pigeonpea improvement.
Genetic analysis of tolerance to boron toxicity in the legume Medicago truncatula.
Bogacki, Paul; Peck, David M; Nair, Ramakrishnan M; Howie, Jake; Oldach, Klaus H
2013-03-27
Medicago truncatula Gaertn. (barrel medic) is cultivated as a pasture legume for its high protein content and ability to improve soils through nitrogen fixation. Toxic concentrations of the micronutrient Boron (B) in agricultural soils hamper the production of cereal and leguminous crops. In cereals, the genetic analysis of B tolerance has led to the development of molecular selection tools to introgress and maintain the B tolerance trait in breeding lines. There is a comparable need for selection tools in legumes that grow on these toxic soils, often in rotation with cereals. Genetic variation for B tolerance in Medicago truncatula was utilised to generate two F2 populations from crosses between tolerant and intolerant parents. Phenotyping under B stress revealed a close correlation between B tolerance and biomass production and a segregation ratio explained by a single dominant locus. M. truncatula homologues of the Arabidopsis major intrinsic protein (MIP) gene AtNIP5;1 and the efflux-type transporter gene AtBOR1, both known for B transport, were identified and nearby molecular markers screened across F2 lines to verify linkage with the B-tolerant phenotype. Most (95%) of the phenotypic variation could be explained by the SSR markers h2_6e22a and h2_21b19a, which flank a cluster of five predicted MIP genes on chromosome 4. Three CAPS markers (MtBtol-1,-2,-3) were developed to dissect the region further. Expression analysis of the five predicted MIPs indicated that only MtNIP3 was expressed when leaf tissue and roots were assessed. MtNIP3 showed low and equal expression in the roots of tolerant and intolerant lines but a 4-fold higher expression level in the leaves of B-tolerant cultivars. The expression profile correlates closely with the B concentration measured in the leaves and roots of tolerant and intolerant plants. 
Whereas no significant difference in B concentration exists between roots of tolerant and intolerant plants, the B concentration in the leaves of tolerant plants is less than half that of intolerant plants, which further supports MtNIP3 as the best candidate for the tolerance trait-defining gene in Medicago truncatula. The close linkage of the MtNIP3 locus to B toxicity tolerance provides a source of molecular selection tools for pasture breeding programs. The economic importance of the locus warrants further investigation of the individual members of the MIP gene cluster in other pasture and grain legumes.
Genetic analysis of tolerance to Boron toxicity in the legume Medicago truncatula
2013-01-01
Background Medicago truncatula Gaertn. (barrel medic) is cultivated as a pasture legume for its high protein content and ability to improve soils through nitrogen fixation. Toxic concentrations of the micronutrient Boron (B) in agricultural soils hamper the production of cereal and leguminous crops. In cereals, the genetic analysis of B tolerance has led to the development of molecular selection tools to introgress and maintain the B tolerance trait in breeding lines. There is a comparable need for selection tools in legumes that grow on these toxic soils, often in rotation with cereals. Results Genetic variation for B tolerance in Medicago truncatula was utilised to generate two F2 populations from crosses between tolerant and intolerant parents. Phenotyping under B stress revealed a close correlation between B tolerance and biomass production and a segregation ratio explained by a single dominant locus. M. truncatula homologues of the Arabidopsis major intrinsic protein (MIP) gene AtNIP5;1 and the efflux-type transporter gene AtBOR1, both known for B transport, were identified and nearby molecular markers screened across F2 lines to verify linkage with the B-tolerant phenotype. Most (95%) of the phenotypic variation could be explained by the SSR markers h2_6e22a and h2_21b19a, which flank a cluster of five predicted MIP genes on chromosome 4. Three CAPS markers (MtBtol-1,-2,-3) were developed to dissect the region further. Expression analysis of the five predicted MIPs indicated that only MtNIP3 was expressed when leaf tissue and roots were assessed. MtNIP3 showed low and equal expression in the roots of tolerant and intolerant lines but a 4-fold higher expression level in the leaves of B-tolerant cultivars. The expression profile correlates closely with the B concentration measured in the leaves and roots of tolerant and intolerant plants. 
Whereas no significant difference in B concentration exists between roots of tolerant and intolerant plants, the B concentration in the leaves of tolerant plants is less than half that of intolerant plants, which further supports MtNIP3 as the best candidate for the tolerance trait-defining gene in Medicago truncatula. Conclusion The close linkage of the MtNIP3 locus to B toxicity tolerance provides a source of molecular selection tools for pasture breeding programs. The economic importance of the locus warrants further investigation of the individual members of the MIP gene cluster in other pasture and grain legumes. PMID:23531152
Zeng, Li-ping; Hu, Zheng-mao; Mu, Li-li; Mei, Gui-sen; Lu, Xiu-ling; Zheng, Yong-jun; Li, Pei-jian; Zhang, Ying-xue; Pan, Qian; Long, Zhi-gao; Dai, He-ping; Zhang, Zhuo-hua; Xia, Jia-hui; Zhao, Jing-ping; Xia, Kun
2011-06-01
To investigate the relationship between susceptibility loci in chromosomes 1q21-25 and 6p21-25 and schizophrenia subtypes in a Chinese population. A genomic scan and parametric and non-parametric analyses were performed on 242 individuals from 36 schizophrenia pedigrees, including 19 paranoid schizophrenia and 17 undifferentiated schizophrenia pedigrees, from Henan province of China using 5 microsatellite markers in the chromosome region 1q21-25 and 8 microsatellite markers in the chromosome region 6p21-25, which were candidate regions in previous studies. All affected subjects were diagnosed and typed according to the criteria of the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR; American Psychiatric Association, 2000). All subjects signed informed consent. In chromosome 1, parametric analysis of all 36 pedigrees under the dominant inheritance mode showed that the maximum multi-point heterogeneity logarithm of odds (HLOD) score was 1.33 (α = 0.38). In the non-parametric analysis, the single-point and multi-point nonparametric linkage (NPL) scores suggested linkage at D1S484, D1S2878, and D1S196. In the 19 paranoid schizophrenia pedigrees, linkage was not observed for any of the 5 markers. In the 17 undifferentiated schizophrenia pedigrees, the multi-point NPL score was 1.60 (P = 0.0367) at D1S484. The single-point NPL score was 1.95 (P = 0.0145) and the multi-point NPL score was 2.39 (P = 0.0041) at D1S2878. Additionally, the multi-point NPL score was 1.74 (P = 0.0255) at D1S196. These same three loci showed suggestive linkage during the integrative analysis of all 36 pedigrees. In chromosome 6, neither parametric linkage analysis under dominant and recessive inheritance nor non-parametric linkage analysis showed linkage for any of the 8 markers in all 36 pedigrees or in the 17 undifferentiated schizophrenia pedigrees. 
In the 19 paranoid schizophrenia pedigrees, parametric analysis showed that under the recessive inheritance mode the maximum single-point HLOD score was 1.26 (α = 0.40) and the multi-point HLOD score was 1.12 (α = 0.38) at D6S289 in chromosome region 6p23. In the non-parametric analysis, the single-point NPL score was 1.52 (P = 0.0402) and the multi-point NPL score was 1.92 (P = 0.0206) at D6S289. Susceptibility genes correlated with undifferentiated schizophrenia pedigrees at the D1S484, D1S2878, and D1S196 loci, and those correlated with paranoid schizophrenia pedigrees at the D6S289 locus, are likely present in chromosome regions 1q23.3 and 1q24.2 and chromosome region 6p23, respectively.
Barkley, Ruth Ann; Brown, Andrew C; Hanis, Craig L; Kardia, Sharon L; Turner, Stephen T; Boerwinkle, Eric
2003-07-01
The distribution of plasma lipoprotein[a] (Lp[a]) concentrations, a risk factor for cardiovascular disease, varies greatly among racial groups, with African Americans having values that are shifted toward higher levels than those of whites. The underlying cause of this heterogeneity is unknown, but a role for "trans-acting" factors has been hypothesized. This study used genetic linkage analysis to localize genetic factors influencing Lp[a] levels in African Americans that were absent in other populations; linkage results were analyzed separately in non-Hispanic whites, Hispanic whites, and African Americans. As expected, all three samples showed highly significant linkage at the approximate location of the LPA (apolipoprotein[a]) locus. The white populations also independently had regions of significant linkage on chromosome 19 (LOD 3.80) and suggestive linkage on chromosomes 12 (LOD 1.60), 14 (LOD 2.56), and 19 (LOD 2.52). No linkage evidence was found to support the hypothesis of another single gene with large effects specifically segregating in African Americans that may account for their elevated Lp[a] levels.
Genome-wide scans for microalbuminuria in Mexican Americans: the San Antonio Family Heart Study.
Arar, Nedal; Nath, Subrata; Thameem, Farook; Bauer, Richard; Voruganti, Saroja; Comuzzie, Anthony; Cole, Shelley; Blangero, John; MacCluer, Jean; Abboud, Hanna
2007-02-01
Microalbuminuria, defined as a urine albumin-to-creatinine ratio of 0.03 to 0.299 mg/mg, is a major risk factor for cardiovascular disease. Several genetic epidemiological studies have established that microalbuminuria clusters in families, suggesting a genetic predisposition. We estimated the heritability of microalbuminuria and performed a genome-wide linkage analysis to identify chromosomal regions influencing urine albumin-to-creatinine ratio in 486 Mexican Americans from 26 multiplex families. Significant heritability was demonstrated for urine albumin-to-creatinine ratio (h² = 24%, P < 0.003) after accounting for age, sex, body mass index, triglycerides, and hypertension. The genome scan revealed significant evidence of linkage of urine albumin-to-creatinine ratio to a region on chromosome 20q12 (LOD score of 3.5, P < 0.001) near marker D20S481. This region also exhibited a LOD score of 2.8 with diabetes status as a covariate and 3.0 with hypertension status as a covariate, suggesting that the effect of this locus on urine albumin-to-creatinine ratio is largely independent of diabetes and hypertension. The findings indicate that there is a gene or genes located on human chromosome 20q12 that may have functional relevance to albumin excretion in Mexican Americans. Identifying and understanding the role of the genes that determine albumin excretion would lead to the development of novel therapeutic strategies targeted at high-risk individuals in whom intensive preventive measures may be most beneficial.
Validation of an instrument to measure inter-organisational linkages in general practice.
Amoroso, Cheryl; Proudfoot, Judith; Bubner, Tanya; Jayasinghe, Upali W; Holton, Christine; Winstanley, Julie; Beilby, Justin; Harris, Mark F
2007-12-03
Linkages between general medical practices and external services are important for high quality chronic disease care. The purpose of this research is to describe the development, evaluation and use of a brief tool that measures the comprehensiveness and quality of a general practice's linkages with external providers for the management of patients with chronic disease. In this study, clinical linkages are defined as the communication, support, and referral arrangements between services for the care and assistance of patients with chronic disease. An interview to measure surgery-level (rather than individual clinician-level) clinical linkages was developed, piloted, reviewed, and evaluated with 97 Australian general practices. Two validated survey instruments were posted to patients, and a survey of locally available services was developed and posted to participating Divisions of General Practice (support organisations). Hypotheses regarding internal validity, association with local services, and patient satisfaction were tested using factor analysis, logistic regression and multilevel regression models. The resulting General Practice Clinical Linkages Interview (GP-CLI) is a nine-item tool with three underlying factors: referral and advice linkages, shared care and care planning linkages, and community access and awareness linkages. Local availability of chronic disease services has no effect on the comprehensiveness of services with which practices link; however, comprehensiveness of clinical linkages is associated with patient assessment of access, receptionist services, and continuity of care in their general practice. The GP-CLI, which possesses both internal and external validity, may be useful to researchers examining comparable health care systems for measuring the comprehensiveness and quality of linkages between general practices and related services. 
The tool can be used with large samples exploring the impact, outcomes, and facilitators of high quality clinical linkages in general practice.
Nelson, Matthew N.; Moolhuijzen, Paula M.; Boersma, Jeffrey G.; Chudy, Magdalena; Lesniewska, Karolina; Bellgard, Matthew; Oliver, Richard P.; Święcicki, Wojciech; Wolko, Bogdan; Cowling, Wallace A.; Ellwood, Simon R.
2010-01-01
We have developed a dense reference genetic map of Lupinus angustifolius (2n = 40) based on a set of 106 publicly available recombinant inbred lines derived from a cross between domesticated and wild parental lines. The map comprised 1090 loci in 20 linkage groups and three small clusters, drawing together data from several previous mapping publications plus almost 200 new markers, of which 63 were gene-based markers. A total of 171 mainly gene-based, sequence-tagged site loci served as bridging points for comparing the Lu. angustifolius genome with the genome sequence of the model legume, Lotus japonicus via BLASTn homology searching. Comparative analysis indicated that the genomes of Lu. angustifolius and Lo. japonicus are highly diverged structurally but with significant regions of conserved synteny including the region of the Lu. angustifolius genome containing the pod-shatter resistance gene, lentus. We discuss the potential of synteny analysis for identifying candidate genes for domestication traits in Lu. angustifolius and in improving our understanding of Fabaceae genome evolution. PMID:20133394
Rosenthal, Mariana; Johnson, Christopher J; Scoppa, Steve; Carter, Kris
2016-01-01
Investigations of suspected cancer clusters are resource intensive and rarely identify true clusters: among 428 publicly reported US investigations during 1990-2011, only 1 etiologic cluster was identified. In 2013, the Cancer Data Registry of Idaho (CDRI) was contacted regarding a suspected cancer cluster at a worksite (Cluster A) and among an occupational cohort (Cluster B). We investigated to determine whether these were true clusters. We derived investigation cohorts for Cluster A from facility-provided employee records and for Cluster B from professional licensing records. We used Registry Plus™ Link Plus to conduct probabilistic linkage of cohort members to the CDRI registry and completed matching through manual review by using LexisNexis®, Accurint®, and the Social Security Death Index. We calculated standardized incidence ratios (SIRs) using the MP-SIR session type in SEER*Stat and Idaho and US referent populations. For Cluster A, we identified 34 cancer cases during 9,689 person-years; compared with Idaho and US rates, 95 percent CIs for SIRs included 1.0 for 24 of 24 primary site categories. For Cluster B, we identified 78 cancer cases during 15,154 person-years; compared with Idaho rates, 95 percent CIs for SIRs included 1.0 for 23 of 24 primary site categories and were less than 1.0 for lung and bronchus cancers, and compared with US rates, 95 percent CIs for SIRs included 1.0 for 22 of 24 primary site categories and were less than 1.0 for lung and bronchus and colorectal cancers. We identified no statistically significant excess in cancer incidence in either cohort. SEER*Stat's MP-SIR is an efficient tool for performing SIR assessments, a Centers for Disease Control and Prevention/Council of State and Territorial Epidemiologists-recommended step when investigating suspected cancer clusters.
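As an illustration of the SIR assessment described in this abstract, the following is a minimal sketch of how a standardized incidence ratio and an approximate 95 percent confidence interval can be computed from an observed and an expected case count. It is not the SEER*Stat MP-SIR implementation. The function name is hypothetical, the interval uses Byar's approximation to the exact Poisson limits (an assumption, since the abstract does not state the interval method), and the expected count of 30.0 is invented for illustration because the abstract reports only observed cases and person-years.

```python
import math


def sir_with_byar_ci(observed, expected, z=1.96):
    """Return (SIR, lower, upper) for an observed/expected case count.

    The interval is Byar's approximation to the exact Poisson limits.
    `expected` is the number of cases predicted by applying referent
    (e.g. state or national) rates to the cohort's person-years.
    """
    if expected <= 0:
        raise ValueError("expected count must be positive")
    sir = observed / expected
    if observed == 0:
        lower = 0.0
    else:
        lower = observed * (
            1 - 1 / (9 * observed) - z / (3 * math.sqrt(observed))
        ) ** 3 / expected
    o1 = observed + 1
    upper = o1 * (1 - 1 / (9 * o1) + z / (3 * math.sqrt(o1))) ** 3 / expected
    return sir, lower, upper


# 34 observed cases is taken from the abstract (Cluster A);
# the expected count of 30.0 is a HYPOTHETICAL value for illustration.
sir, lower, upper = sir_with_byar_ci(observed=34, expected=30.0)
excess = lower > 1.0  # a statistically significant excess needs lower > 1
print(f"SIR={sir:.2f}, 95% CI=({lower:.2f}, {upper:.2f}), excess={excess}")
```

An interval that includes 1.0, as reported for most primary site categories in this investigation, means the observed count is compatible with the referent rates.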
Le Hellard, Stephanie; Lee, Andrew J; Underwood, Sarah; Thomson, Pippa A; Morris, Stewart W; Torrance, Helen S; Anderson, Susan M; Adams, Richard R; Navarro, Pau; Christoforou, Andrea; Houlihan, Lorna M; Detera-Wadleigh, Sevilla; Owen, Michael J; Asherson, Philip; Muir, Walter J; Blackwood, Douglas H R; Wray, Naomi R; Porteous, David J; Evans, Kathryn L
2007-03-15
Bipolar affective disorder (BPAD) and schizophrenia (SCZ) are common conditions. Their causes are unknown, but they include a substantial genetic component. Previously, we described significant linkage of BPAD to a chromosome 4p locus within a large pedigree (F22). Others subsequently have found evidence for linkage of BPAD and SCZ to this region. We constructed high-resolution haplotypes for four linked families, calculated logarithm of the odds (LOD) scores, and developed a novel method to assess the extent of allele sharing within genes between the families. We describe an increase in the F22 LOD score for this region. Definition and comparison of the linked haplotypes allowed us to prioritize two subregions of 3.8 and 4.4 Mb. Analysis of the extent of allele sharing within these subregions identified 200 kb that shows increased allele sharing between families. Linkage of BPAD to chromosome 4p has been strengthened. Haplotype analysis in the additional linked families refined the 20-Mb linkage region. Development of a novel allele-sharing method allowed us to bridge the gap between conventional linkage and association studies. Description of a 200-kb region of increased allele sharing prioritizes this region, which contains two functional candidate genes for BPAD (SLC2A9 and WDR1), for subsequent studies.
Uyei, Jennifer; Fiellin, David A; Buchelli, Marianne; Rodriguez-Santana, Ramon; Braithwaite, R Scott
2017-03-01
In the USA, an epidemic of opioid overdose deaths is occurring, many of which are from heroin. Combining naloxone distribution with linkage to addiction treatment or pre-exposure prophylaxis (PrEP) for HIV prevention through syringe service programmes has the potential to save lives and be cost-effective. We estimated the outcomes and cost-effectiveness of five alternative strategies: no additional intervention, naloxone distribution, naloxone distribution plus linkage to addiction treatment, naloxone distribution plus PrEP, and naloxone distribution plus linkage to addiction treatment and PrEP. We developed a decision analytical Markov model to simulate opioid overdose, HIV incidence, overdose-related deaths, and HIV-related deaths in people who inject drugs in Connecticut, USA. Model input parameters were derived from published sources. We compared each strategy with no intervention, as well as simultaneously considering all strategies. Sensitivity analysis was done for all variables. Linkage to addiction treatment was referral to an opioid treatment programme for methadone. Endpoints were survival, life expectancy, quality-adjusted life-years (QALYs), number and percentage of overdose deaths averted, number of HIV-related deaths averted, total costs (in 2015 US$) associated with each strategy, and incremental cost per QALY gained. In the base-case analysis, compared with no additional intervention, the naloxone distribution strategy yielded an incremental cost-effectiveness ratio (ICER) of $323 per QALY, and naloxone distribution plus linkage to addiction treatment was cost saving compared with no additional intervention (greater effectiveness and less expensive). 
The most efficient strategies (ie, those conferring the greatest health benefit for a particular budget) were naloxone distribution combined with linkage to addiction treatment (cost saving), and naloxone distribution combined with PrEP and linkage to addiction treatment (ICER $95 337 per QALY) at a willingness-to-pay threshold of $100 000. In probabilistic sensitivity analysis, the combination of naloxone distribution, PrEP, and linkage to addiction treatment was the optimal strategy in 37% of iterations and the combination of naloxone distribution and linkage to addiction treatment was the optimal strategy in 34% of iterations. Naloxone distribution through syringe service programmes is cost-effective compared with syringe distribution alone, but when combined with linkage to addiction treatment is cost saving compared with no additional services. A strategy that combines naloxone distribution, PrEP, and linkage to addiction treatment results in greater health benefits in people who inject drugs and is also cost-effective. Funding: State of Connecticut Department of Public Health and the National Institute of Mental Health. Copyright © 2017 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY-NC-ND license.
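The cost-effectiveness terms used in this abstract can be made concrete with a short sketch. The incremental cost-effectiveness ratio (ICER) is the difference in cost between two strategies divided by the difference in QALYs; a strategy that is both cheaper and more effective is "cost saving" and no ratio is reported. The function name and all input numbers below are hypothetical and are not outputs of the authors' Markov model; only the $100 000 willingness-to-pay threshold comes from the abstract.

```python
def classify_strategy(delta_cost, delta_qaly, wtp=100_000):
    """Classify a strategy against a comparator.

    delta_cost: incremental cost (strategy minus comparator)
    delta_qaly: incremental QALYs (strategy minus comparator)
    wtp: willingness-to-pay threshold per QALY
    Returns (label, icer); icer is None when no ratio is meaningful.
    """
    if delta_qaly > 0:
        if delta_cost <= 0:
            return "cost saving", None  # more effective and not more costly
        icer = delta_cost / delta_qaly
        label = "cost-effective" if icer <= wtp else "not cost-effective"
        return label, icer
    # From here on the strategy is no more effective than the comparator.
    if delta_cost >= 0:
        return "dominated", None  # no health gain and not cheaper
    if delta_qaly == 0:
        return "cheaper, equal effect", None
    # Cheaper but less effective: the ratio is savings per QALY forgone.
    return "cheaper, less effective", delta_cost / delta_qaly


# HYPOTHETICAL inputs chosen only to exercise the branches.
print(classify_strategy(delta_cost=323.0, delta_qaly=1.0))
print(classify_strategy(delta_cost=-500.0, delta_qaly=0.2))
print(classify_strategy(delta_cost=250_000.0, delta_qaly=2.0))
```

The first branch is the reason the abstract reports "cost saving" rather than a dollar figure for naloxone distribution plus linkage to addiction treatment.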
Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann
2017-01-01
Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping-by-sequencing method. In all, 68,584 SNPs were initially identified using a minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and a maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups, which allowed for functional SNP annotations. The large-scale SNP discovery obtained here allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs. PMID:28387238
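To make the LOD and recombination-fraction thresholds quoted in this abstract concrete, here is a minimal sketch of a two-point LOD score for the simplest textbook case: phase-known, fully informative meioses, where the likelihood at recombination fraction θ is θ^r (1 − θ)^(n − r). This is an assumed simplification and not the procedure used by the mapping software in the study; the function name and the example counts are hypothetical.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """LOD score at recombination fraction `theta` versus free
    recombination (theta = 0.5), for phase-known informative meioses."""
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie in [0, meioses]")
    nonrecombinants = meioses - recombinants

    def log10_likelihood(t):
        total = 0.0
        if recombinants:
            if t == 0.0:
                return float("-inf")  # recombinants are impossible at theta = 0
            total += recombinants * math.log10(t)
        if nonrecombinants:
            total += nonrecombinants * math.log10(1.0 - t)
        return total

    return log10_likelihood(theta) - log10_likelihood(0.5)


# HYPOTHETICAL marker pair: 2 recombinants among 20 informative meioses.
theta_hat = min(2 / 20, 0.5)  # maximum-likelihood estimate, capped at 0.5
lod = two_point_lod(2, 20, theta_hat)
grouped = lod >= 8 and theta_hat <= 0.35  # thresholds quoted in the abstract
print(f"theta={theta_hat:.2f}, LOD={lod:.2f}, same linkage group: {grouped}")
```

Under this simple model, completely linked markers gain log10(2), about 0.30 LOD units, per informative meiosis, so a LOD threshold of 8 requires at least 27 informative meioses.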
Design and analysis of an underactuated anthropomorphic finger for upper limb prosthetics.
Omarkulov, Nurdos; Telegenov, Kuat; Zeinullin, Maralbek; Begalinova, Ainur; Shintemirov, Almas
2015-01-01
This paper presents the design of a linkage-based finger mechanism that ensures an extended range of anthropomorphic gripping motions. The finger is designed using a path-point generation method based on the geometrical dimensions and motion of a typical human index finger. Following the design description and its kinematic analysis, an experimental evaluation of the finger's gripping performance is presented using a 3D-printed prototype. The finger underactuation is achieved by utilizing a mechanical linkage system consisting of two crossed four-bar linkage mechanisms. It is shown that the proposed finger design can be used to design a five-fingered anthropomorphic hand and has potential for the development of upper limb prostheses.
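For readers unfamiliar with four-bar linkages, the position analysis that underlies this kind of kinematic study can be sketched as a circle-circle intersection. The code below is a generic planar four-bar solver, not the authors' finger model: the function name, the link lengths, and the coordinate convention (fixed pivots at the origin and at (ground, 0)) are all assumptions made for illustration.

```python
import math


def fourbar_coupler_joint(crank, coupler, rocker, ground, theta):
    """Return the possible positions of the coupler-rocker joint B.

    Fixed pivots sit at (0, 0) and (ground, 0). The crank of length
    `crank` makes angle `theta` (radians) with the ground link.
    Returns [] if the linkage cannot be assembled at this angle,
    otherwise the two assembly branches as (x, y) tuples.
    """
    ax, ay = crank * math.cos(theta), crank * math.sin(theta)  # crank tip A
    dx, dy = ground - ax, -ay  # vector from A to the second fixed pivot
    dist = math.hypot(dx, dy)
    if dist == 0 or dist > coupler + rocker or dist < abs(coupler - rocker):
        return []
    # Intersect circle(A, coupler) with circle(second pivot, rocker).
    along = (coupler**2 - rocker**2 + dist**2) / (2 * dist)
    height = math.sqrt(max(coupler**2 - along**2, 0.0))
    ux, uy = dx / dist, dy / dist
    mx, my = ax + along * ux, ay + along * uy
    return [
        (mx - height * uy, my + height * ux),
        (mx + height * uy, my - height * ux),
    ]


# HYPOTHETICAL link lengths: a parallelogram linkage (crank = rocker,
# coupler = ground) with the crank pointing straight up.
for bx, by in fourbar_coupler_joint(1.0, 2.0, 1.0, 2.0, math.pi / 2):
    print(f"B = ({bx:.3f}, {by:.3f})")
```

The two returned points correspond to the two ways the same link lengths can be assembled; sweeping `theta` and keeping one branch traces the motion of that configuration.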
Significant Admixture Linkage Disequilibrium across 30 cM around the FY Locus in African Americans
Lautenberger, James A.; Stephens, J. Claiborne; O'Brien, Stephen J.; Smith, Michael W.
2000-01-01
Scientists, to understand the importance of allelic polymorphisms on phenotypes that are quantitative and environmentally interacting, are now turning to population-association screens, especially in instances in which pedigree analysis is difficult. Because association screens require linkage disequilibrium between markers and disease loci, maximizing the degree of linkage disequilibrium increases the chances of discovering functional gene-marker associations. One theoretically valid approach—mapping by admixture linkage disequilibrium (MALD), using recently admixed African Americans—is empirically evaluated here by measurement of marker associations with 15 short tandem repeats (STRs) and an insertion/deletion polymorphism of the AT3 locus in a 70-cM segment at 1q22-23, around the FY (Duffy) locus. The FY polymorphism (−46T→C) disrupts the GATA promoter motif, specifically blocking FY erythroid expression and has a nearly fixed allele-frequency difference between European Americans and native Africans that is likely a consequence of a selective advantage of FY−/− in malaria infections. Analysis of linkage disequilibrium around the FY gene has indicated that there is strong and consistent linkage disequilibrium between FY and three flanking loci (D1S303, SPTA1, and D1S484) spanning 8 cM. We observed significant linkage-disequilibrium signals over a 30-cM region from −4.4 to 16.3 cM (from D1S2777 to D1S196) for STRs and at 26.4 cM (AT3), which provided quantitative estimates of centimorgan limits, by MALD assessment in African American population-association analyses, of 5–10 cM. PMID:10712211
Genetic structure in four West African population groups
Adeyemo, Adebowale A; Chen, Guanjie; Chen, Yuanxiu; Rotimi, Charles
2005-01-01
Background: Africa contains the most genetically divergent group of continental populations and several studies have reported that African populations show a high degree of population stratification. In this regard, it is important to investigate the potential for population genetic structure or stratification in genetic epidemiology studies involving multiple African populations. The presence of genetic sub-structure, if not properly accounted for, has been reported to lead to spurious association between a putative risk allele and a disease. Within the context of the Africa America Diabetes Mellitus (AADM) Study (a genetic epidemiologic study of type 2 diabetes mellitus in West Africa), we have investigated population structure or stratification in four ethnic groups in two countries (Akan and Gaa-Adangbe from Ghana, Yoruba and Igbo from Nigeria) using data from 372 autosomal microsatellite loci typed in 493 unrelated persons (986 chromosomes). Results: There was no significant population genetic structure in the overall sample. The smallest probability is associated with an inferred cluster of 1 and little of the posterior probability is associated with a higher number of inferred clusters. The distribution of members of the sample to inferred clusters is consistent with this finding; roughly the same proportion of individuals from each group is assigned to each cluster with little variation between the ethnic groups. Analysis of molecular variance (AMOVA) showed that the between-population component of genetic variance is less than 0.1% in contrast to 99.91% for the within-population component. Pair-wise genetic distances between the four ethnic groups were also very similar. Nonetheless, the small between-population genetic variance was sufficient to distinguish the two Ghanaian groups from the two Nigerian groups. 
Conclusion: There was little evidence for significant population substructure in the four major West African ethnic groups represented in the AADM study sample. Ethnicity apparently did not introduce differential allele frequencies that may affect analysis and interpretation of linkage and association studies. These findings, although not entirely surprising given the geographical proximity of these groups, provide important insights into the genetic relationships between the ethnic groups studied and confirm previous results that showed close genetic relationship between most studied West African groups. PMID:15978124
Yang, Xueli; Gu, Dongfeng; He, Jiang; Hixson, James E.; Rao, Dabeeru C.; Lu, Fanghong; Mu, Jianjun; Jaquish, Cashell E.; Chen, Jing; Huang, Jianfeng; Shimmin, Lawrence C.; Rice, Treva K.; Chen, Jichun; Wu, Xigui; Liu, Depei; Kelly, Tanika N.
2014-01-01
Background: Blood pressure (BP) response to cold pressor test (CPT) is associated with increased risk of cardiovascular disease. We performed a genome-wide linkage scan and regional association analysis to identify genetic determinants of BP response to CPT. Methods and Results: A total of 1,961 Chinese participants completed the CPT. Multipoint quantitative trait linkage analysis was performed, followed by single-marker and gene-based analyses of variants in promising linkage regions (logarithm of odds, LOD ≥ 2). A suggestive linkage signal was identified for systolic BP (SBP) response to CPT at 20p13-20p12.3, with a maximum multipoint LOD score of 2.37. Based on regional association analysis with 1,351 SNPs in the linkage region, we found that marker rs2326373 at 20p13 was significantly associated with mean arterial pressure (MAP) responses to CPT (P = 8.8×10⁻⁶) after FDR adjustment for multiple comparisons. A similar trend was also observed for SBP response (P = 0.03) and DBP response (P = 4.6×10⁻⁵). Results of gene-based analyses showed that variants in genes MCM8 and SLC23A2 were associated with SBP response to CPT (P = 4.0×10⁻⁵ and 2.7×10⁻⁴, respectively), and variants in genes MCM8 and STK35 were associated with MAP response to CPT (P = 1.5×10⁻⁵ and 5.0×10⁻⁵, respectively). Conclusions: Within a suggestive linkage region on chromosome 20, we identified a novel variant associated with BP responses to CPT. We also found gene-based associations of MCM8, SLC23A2 and STK35 in this region. Further work is warranted to confirm these findings. Clinical Trial Registration: URL: http://www.clinicaltrials.gov; Unique identifier: NCT00721721. PMID:25028485
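The abstract refers to FDR adjustment across many SNP tests without naming the procedure. As a sketch only, and assuming the widely used Benjamini-Hochberg step-up method (which may differ from what the authors applied), adjusted P values can be computed as follows. The function name and the example P values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order.

    Each P value is scaled by m / rank (rank 1 = smallest), and the
    results are made monotone by taking a running minimum from the
    largest rank downwards.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, pvalues[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


# HYPOTHETICAL P values for four SNPs.
raw = [0.01, 0.04, 0.03, 0.005]
for p, q in zip(raw, benjamini_hochberg(raw)):
    print(f"raw P = {p:.3f}  ->  BH-adjusted = {q:.3f}")
```

A SNP is then declared significant when its adjusted value falls below the chosen false discovery rate, for example 0.05.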
Chromosome 14 and late-onset familial Alzheimer disease (FAD)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schellenberg, G.D.; Anderson, L.; Nemens, E.
1993-09-01
Familial Alzheimer disease (FAD) is genetically heterogeneous. Two loci responsible for early-onset FAD have been identified: the amyloid precursor protein gene on chromosome 21 and the as-yet-unidentified locus on chromosome 14. The genetics of late-onset FAD is unresolved. Maximum-likelihood, affected-pedigree-member (APM), and sib-pair analyses were used, in 49 families with a mean age at onset ≥60 years, to determine whether the chromosome 14 locus is responsible for late-onset FAD. The markers used were D14S53, D14S43, and D14S52. The LOD score method was used to test for linkage of late-onset FAD to the chromosome 14 markers, under three different models: age-dependent penetrance, an affected-only analysis, and age-dependent penetrance with allowance for possible age-dependent sporadic cases. No evidence for linkage was obtained under any of these conditions for the late-onset kindreds, and strong evidence against linkage (LOD score ≤ −2.0) to this region was obtained. Heterogeneity tests of the LOD score results for the combined group of families (early onset, Volga Germans, and late onset) favored the hypothesis of linkage to chromosome 14 with genetic heterogeneity. The positive results are primarily from early-onset families. APM analysis gave significant evidence for linkage of D14S43 and D14S52 to FAD in early-onset kindreds (P<.02). No evidence for linkage was found for the entire late-onset family group. Significant evidence for linkage to D14S52, however, was found for a subgroup of families of intermediate age at onset (mean age at onset ≥60 years and <70 years). These results indicate that the chromosome 14 locus is not responsible for Alzheimer disease in most late-onset FAD kindreds but could play a role in a subset of these kindreds. 37 refs., 1 fig., 6 tabs.
Weiser, Armin A; Thöns, Christian; Filter, Matthias; Falenski, Alexander; Appel, Bernd; Käsbohrer, Annemarie
2016-01-01
FoodChain-Lab is modular open-source software for trace-back and trace-forward analysis in food-borne disease outbreak investigations. Development of FoodChain-Lab has been driven by a need for appropriate software in several food-related outbreaks in Germany since 2011. The software allows integrated data management, data linkage, enrichment and visualization as well as interactive supply chain analyses. Identification of possible outbreak sources or vehicles is facilitated by calculation of tracing scores for food-handling stations (companies or persons) and food products under investigation. The software also supports consideration of station-specific cross-contamination, analysis of geographical relationships, and topological clustering of the tracing network structure. FoodChain-Lab has been applied successfully in previous outbreak investigations, for example during the 2011 EHEC outbreak and the 2013/14 European hepatitis A outbreak. The software is most useful in complex, multi-area outbreak investigations where epidemiological evidence may be insufficient to discriminate between multiple implicated food products. The automated analysis and visualization components would be of greater value if trading information on food ingredients and compound products was more easily available.
Filter, Matthias; Falenski, Alexander; Appel, Bernd; Käsbohrer, Annemarie
2016-01-01
FoodChain-Lab is modular open-source software for trace-back and trace-forward analysis in food-borne disease outbreak investigations. Development of FoodChain-Lab has been driven by a need for appropriate software in several food-related outbreaks in Germany since 2011. The software allows integrated data management, data linkage, enrichment and visualization as well as interactive supply chain analyses. Identification of possible outbreak sources or vehicles is facilitated by calculation of tracing scores for food-handling stations (companies or persons) and food products under investigation. The software also supports consideration of station-specific cross-contamination, analysis of geographical relationships, and topological clustering of the tracing network structure. FoodChain-Lab has been applied successfully in previous outbreak investigations, for example during the 2011 EHEC outbreak and the 2013/14 European hepatitis A outbreak. The software is most useful in complex, multi-area outbreak investigations where epidemiological evidence may be insufficient to discriminate between multiple implicated food products. The automated analysis and visualization components would be of greater value if trading information on food ingredients and compound products was more easily available. PMID:26985673
Jia, Guanqing; Shi, Shenkui; Wang, Chunfang; Niu, Zhengang; Chai, Yang; Zhi, Hui; Diao, Xianmin
2013-09-01
Green foxtail (Setaria viridis) is a new model plant for the genomic investigation of C4 photosynthesis biology. As the ancestor of foxtail millet (Setaria italica), an ancient cereal of great importance in arid regions of the world, green foxtail is crucial for the study of domestication and evolution of this ancient crop. In the present study, 288 green foxtail accessions, which were collected from all geographical regions of China, were analysed using 77 simple sequence repeats (SSRs) that cover the whole genome. A high degree of molecular diversity was detected in these accessions, with an average of 33.5 alleles per locus. Two clusters, which were inconsistent with the distribution of eco-geographical regions in China, were inferred from STRUCTURE, Neighbor-Joining, and principal component analysis, indicating a partially mixed distribution of Chinese green foxtails. The subpopulation with higher diversity consisted mainly of accessions collected from North China. A low level of linkage disequilibrium was observed in the green foxtail genome. Furthermore, a combined analysis of green foxtail and foxtail millet landraces was conducted, from which North China was inferred as the place of origin and domestication of foxtail millet.
X-linked infantile spinal muscular atrophy: clinical definition and molecular mapping.
Dressman, Devin; Ahearn, Mary Ellen; Yariz, Kemal O; Basterrecha, Hugo; Martínez, Francisco; Palau, Francesc; Barmada, M Michael; Clark, Robin Dawn; Meindl, Alfons; Wirth, Brunhilde; Hoffman, Eric P; Baumbach-Reardon, Lisa
2007-01-01
X-linked infantile spinal muscular atrophy (XL-SMA) is a rare disorder that presents with the clinical characteristics of hypotonia, areflexia, and multiple congenital contractures (arthrogryposis) associated with loss of anterior horn cells and death in infancy. We have previously reported a single family with XL-SMA that mapped to Xp11.3-q11.2. Here we report a further clinical description of XL-SMA plus an additional seven unrelated XL-SMA families from North America and Europe that show linkage data consistent with the same region. We first investigated linkage to the candidate disease gene region using microsatellite repeat markers. We further saturated the candidate disease gene region using polymorphic microsatellite repeat markers and single nucleotide polymorphisms in an effort to narrow the critical region. Two-point and multipoint linkage analyses were performed using the Allegro software package. Linkage analysis of all XL-SMA families displayed linkage consistent with the original XL-SMA region. The addition of new families and new markers has narrowed the disease gene interval for an XL-SMA locus to the region between SNP FLJ22843, near marker DXS8080, and SNP ARHGEF9, near DXS7132 (Xp11.3-Xq11.1).
Duffy, A; Turecki, G; Grof, P; Cavazzoni, P; Grof, E; Joober, R; Ahrens, B; Berghöfer, A; Müller-Oerlinghausen, B; Dvoráková, M; Libigerová, E; Vojtĕchovský, M; Zvolský, P; Nilsson, A; Licht, R W; Rasmussen, N A; Schou, M; Vestergaard, P; Holzinger, A; Schumann, C; Thau, K; Robertson, C; Rouleau, G A; Alda, M
2000-01-01
OBJECTIVE: To test for genetic linkage and association with GABAergic candidate genes in lithium-responsive bipolar disorder. DESIGN: Polymorphisms located in genes that code for the GABRA3, GABRA5 and GABRB3 subunits of the GABA-A receptor were investigated using association and linkage strategies. PARTICIPANTS: A total of 138 patients with bipolar I disorder with a clear response to lithium prophylaxis, selected from specialized lithium clinics in Canada and Europe that are part of the International Group for the Study of Lithium-Treated Patients, and 108 psychiatrically healthy controls. Families of 24 probands were suitable for linkage analysis. OUTCOME MEASURES: The association between the candidate genes and patients with bipolar disorder versus that of controls and genetic linkage within families. RESULTS: There was no significant association or linkage found between lithium-responsive bipolar disorder and the GABAergic candidate genes investigated. CONCLUSIONS: This study does not support a major role for the GABAergic candidate genes tested in lithium-responsive bipolar disorder. PMID:11022400
Action of transglucosidase from Aspergillus niger on maltoheptaose and [U-¹³C]maltose.
Ota, Masafumi; Okamoto, Takeshi; Wakabayashi, Hidehiko
2009-03-10
Oligosaccharides synthesized from a mixture of maltoheptaose and [U-¹³C]maltose with transglucosidase [EC 2.4.1.24] from Aspergillus niger were investigated. When the reaction mixture was incubated at 15 °C for 1 h, several types of oligosaccharides with DP (degree of polymerization) 2 to DP8 containing α-D-Glcp-(1→6)-maltoheptaose were detected by liquid chromatography-mass spectrometry (LC-MS) and methylation analysis. Most of these compounds consisted of α-(1→4) linkages in the main chain and α-(1→6) linkages at the non-reducing ends. However, when the reaction mixture was incubated for 96 h, most of these products were converted into oligosaccharides with DP2 to DP5 consisting of only α-(1→6) linkages. These results suggested that A. niger transglucosidase rapidly transferred glucosyl residues to maltooligosaccharides, gradually hydrolyzed both α-(1→4) linkages and α-(1→6) linkages at the non-reducing end, and transformed these into smaller molecules of mainly α-(1→6) linkages.
Vaughan, Laura Kelly; Wiener, Howard W.; Aslibekyan, Stella; Allison, David B.; Havel, Peter J.; Stanhope, Kimber L.; O’Brien, Diane M.; Hopkins, Scarlett E.; Lemas, Dominick J.; Boyer, Bert B.; Tiwari, Hemant K.
2015-01-01
Objective: To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup’ik people. Material and Methods: We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. Results: We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 and waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity-related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). Conclusions: This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup’ik people. PMID:25772781
Vaughan, Laura Kelly; Wiener, Howard W; Aslibekyan, Stella; Allison, David B; Havel, Peter J; Stanhope, Kimber L; O'Brien, Diane M; Hopkins, Scarlett E; Lemas, Dominick J; Boyer, Bert B; Tiwari, Hemant K
2015-06-01
To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup'ik people. We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup'ik people. Copyright © 2015 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hallmayer, J.; Pintado, E.; Lotspeich, L.
Approximately 2%-5% of autistic children show cytogenetic evidence of the fragile X syndrome. This report tests whether infantile autism in multiplex autism families arises from an unusual manifestation of the fragile X syndrome. This could arise either by expansion of the (CGG)n trinucleotide repeat in FMR-1 or from a mutation elsewhere in the gene. We studied 35 families that met stringent criteria for multiplex autism. Amplification of the trinucleotide repeat and analysis of methylation status were performed in 79 autistic children and in 31 of their unaffected siblings by Southern blot analysis. No examples of amplified repeats were seen in the autistic or control children or in their parents or grandparents. We next examined the hypothesis that there was a mutation elsewhere in the FMR-1 gene, by linkage analysis in 32 of these families. We tested four different dominant models and a recessive model. Linkage to FMR-1 could be excluded (lod score between -24 and -62) in all models by using probes DXS548, FRAXAC1, and FRAXAC2 and the CGG repeat itself. Tests for heterogeneity in this sample were negative, and the occurrence of positive lod scores in this data set could be attributed to chance. Analysis of the data by the affected-sib method also did not show evidence for linkage of any marker to autism. These results enable us to reject the hypothesis that multiplex autism arises from expansion of the (CGG)n trinucleotide repeat in FMR-1. Further, because the overall lod scores for all probes in all models tested were highly negative, linkage to FMR-1 can also be ruled out in multiplex autistic families. 35 refs., 2 figs., 5 tabs.
The power to detect linkage in complex disease by means of simple LOD-score analyses.
Greenberg, D A; Abreu, P; Hodge, S E
1998-01-01
Maximum-likelihood analysis (via LOD score) provides the most powerful method for finding linkage when the mode of inheritance (MOI) is known. However, because one must assume an MOI, the application of LOD-score analysis to complex disease has been questioned. Although it is known that one can legitimately maximize the maximum LOD score with respect to genetic parameters, this approach raises three concerns: (1) multiple testing, (2) effect on power to detect linkage, and (3) adequacy of the approximate MOI for the true MOI. We evaluated the power of LOD scores to detect linkage when the true MOI was complex but a LOD score analysis assumed simple models. We simulated data from 14 different genetic models, including dominant and recessive at high (80%) and low (20%) penetrances, intermediate models, and several additive two-locus models. We calculated LOD scores by assuming two simple models, dominant and recessive, each with 50% penetrance, then took the higher of the two LOD scores as the raw test statistic and corrected for multiple tests. We call this test statistic "MMLS-C." We found that the ELODs for MMLS-C are >=80% of the ELOD under the true model when the ELOD for the true model is >=3. Similarly, the power to reach a given LOD score was usually >=80% that of the true model, when the power under the true model was >=60%. These results underscore that a critical factor in LOD-score analysis is the MOI at the linked locus, not that of the disease or trait per se. Thus, a limited set of simple genetic models in LOD-score analysis can work well in testing for linkage. PMID:9718328
The power to detect linkage in complex disease by means of simple LOD-score analyses.
Greenberg, D A; Abreu, P; Hodge, S E
1998-09-01
Maximum-likelihood analysis (via LOD score) provides the most powerful method for finding linkage when the mode of inheritance (MOI) is known. However, because one must assume an MOI, the application of LOD-score analysis to complex disease has been questioned. Although it is known that one can legitimately maximize the maximum LOD score with respect to genetic parameters, this approach raises three concerns: (1) multiple testing, (2) effect on power to detect linkage, and (3) adequacy of the approximate MOI for the true MOI. We evaluated the power of LOD scores to detect linkage when the true MOI was complex but a LOD score analysis assumed simple models. We simulated data from 14 different genetic models, including dominant and recessive at high (80%) and low (20%) penetrances, intermediate models, and several additive two-locus models. We calculated LOD scores by assuming two simple models, dominant and recessive, each with 50% penetrance, then took the higher of the two LOD scores as the raw test statistic and corrected for multiple tests. We call this test statistic "MMLS-C." We found that the ELODs for MMLS-C are >=80% of the ELOD under the true model when the ELOD for the true model is >=3. Similarly, the power to reach a given LOD score was usually >=80% that of the true model, when the power under the true model was >=60%. These results underscore that a critical factor in LOD-score analysis is the MOI at the linked locus, not that of the disease or trait per se. Thus, a limited set of simple genetic models in LOD-score analysis can work well in testing for linkage.
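As an illustration of the LOD score that this abstract builds on, the following is a minimal sketch for the simplest textbook case only: phase-known, fully informative meioses, where the likelihood depends on a recombination fraction theta alone. The function names are hypothetical. It does not implement MMLS-C, which additionally evaluates pedigree likelihoods under assumed dominant and recessive models and corrects for multiple tests.

```python
import math


def lod_score(recombinants, meioses, theta):
    """Two-point LOD score for phase-known, fully informative meioses.

    LOD(theta) = log10[ theta^k * (1 - theta)^(n - k) / 0.5^n ],
    where k is the number of recombinants among n meioses.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie in [0, meioses]")
    non_recombinants = meioses - recombinants
    if theta == 0.0:
        # Any observed recombinant is impossible under complete linkage.
        if recombinants > 0:
            return float("-inf")
        log_likelihood = 0.0  # (1 - 0)^n = 1
    else:
        log_likelihood = (recombinants * math.log10(theta)
                          + non_recombinants * math.log10(1.0 - theta))
    # Subtract the log-likelihood under free recombination (theta = 0.5).
    return log_likelihood - meioses * math.log10(0.5)


def max_lod(recombinants, meioses):
    """Return (theta_hat, LOD at theta_hat), with theta_hat capped at 0.5."""
    if meioses <= 0:
        raise ValueError("meioses must be positive")
    theta_hat = min(recombinants / meioses, 0.5)
    return theta_hat, lod_score(recombinants, meioses, theta_hat)
```

For example, ten meioses with no recombinants give a maximum LOD of 10 x log10(2), about 3.01, at theta = 0, which is why roughly ten fully informative meioses are needed to reach the conventional threshold of 3.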
Genetic Candidate Variants in Two Multigenerational Families with Childhood Apraxia of Speech
Wijsman, Ellen M.; Nato, Alejandro Q.; Matsushita, Mark M.; Chapman, Kathy L.; Stanaway, Ian B.; Wolff, John; Oda, Kaori; Gabo, Virginia B.; Raskind, Wendy H.
2016-01-01
Childhood apraxia of speech (CAS) is a severe and socially debilitating form of speech sound disorder with suspected genetic involvement, but the genetic etiology is not yet well understood. Very few known or putative causal genes have been identified to date, e.g., FOXP2 and BCL11A. Building a knowledge base of the genetic etiology of CAS will make it possible to identify infants at genetic risk and motivate the development of effective very early intervention programs. We investigated the genetic etiology of CAS in two large multigenerational families with familial CAS. Complementary genomic methods included Markov chain Monte Carlo linkage analysis, copy-number analysis, identity-by-descent sharing, and exome sequencing with variant filtering. No overlaps in regions with positive evidence of linkage between the two families were found. In one family, linkage analysis detected two chromosomal regions of interest, 5p15.1-p14.1, and 17p13.1-q11.1, inherited separately from the two founders. Single-point linkage analysis of selected variants identified CDH18 as a primary gene of interest and additionally, MYO10, NIPBL, GLP2R, NCOR1, FLCN, SMCR8, NEK8, and ANKRD12, possibly with additive effects. Linkage analysis in the second family detected five regions with LOD scores approaching the highest values possible in the family. A gene of interest was C4orf21 (ZGRF1) on 4q25-q28.2. Evidence for previously described causal copy-number variations and validated or suspected genes was not found. Results are consistent with a heterogeneous CAS etiology, as is expected in many neurogenic disorders. Future studies will investigate genome variants in these and other families with CAS. PMID:27120335
An Autosomal Genetic Linkage Map of the Sheep Genome
Crawford, A. M.; Dodds, K. G.; Ede, A. J.; Pierson, C. A.; Montgomery, G. W.; Garmonsway, H. G.; Beattie, A. E.; Davies, K.; Maddox, J. F.; Kappes, S. W.; Stone, R. T.; Nguyen, T. C.; Penty, J. M.; Lord, E. A.; Broom, J. E.; Buitkamp, J.; Schwaiger, W.; Epplen, J. T.; Matthew, P.; Matthews, M. E.; Hulme, D. J.; Beh, K. J.; McGraw, R. A.; Beattie, C. W.
1995-01-01
We report the first extensive ovine genetic linkage map covering 2070 cM of the sheep genome. The map was generated from the linkage analysis of 246 polymorphic markers, in nine three-generation fullsib pedigrees, which make up the AgResearch International Mapping Flock. We have exploited many markers from cattle so that valuable comparisons between these two ruminant linkage maps can be made. The markers, used in the segregation analyses, comprised 86 anonymous microsatellite markers derived from the sheep genome, 126 anonymous microsatellites from cattle, one from deer, and 33 polymorphic markers of various types associated with known genes. The maximum number of informative meioses within the mapping flock was 222. The average number of informative meioses per marker was 140 (range 18-209). Linkage groups have been assigned to all 26 sheep autosomes. PMID:7498748
Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel
2013-01-01
A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree. PMID:24376720
Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel
2013-01-01
A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree.
DOT National Transportation Integrated Search
2009-12-01
Transportation agencies use a variety of metrics to document progress toward achieving specific goals and objectives. This guide, developed by Federal Highway Administration (FHWA) Planning and Environmental Linkages (PEL) program, is intended to hel...
Genes, age, and alcoholism: analysis of GAW14 data.
Apprey, Victor; Afful, Joseph; Harrell, Jules P; Taylor, Robert E; Bonney, George E
2005-12-30
A genetic analysis of age of onset of alcoholism was performed on the Collaborative Study on the Genetics of Alcoholism data released for Genetic Analysis Workshop 14. Our study illustrates an application of the log-normal age of onset model in our software Genetic Epidemiology Models (GEMs). The phenotype ALDX1 of alcoholism was studied. The analysis strategy was to first find the markers of the Affymetrix SNP dataset with significant association with age of onset, and then to perform linkage analysis on them. ALDX1 revealed strong evidence of linkage for marker tsc0041591 on chromosome 2 and suggestive linkage for marker tsc0894042 on chromosome 3. The largest separation in mean age of onset of ALDX1 was between male smokers who are carriers of the risk allele of tsc0041591 (19.76 years) and non-carriers (24.41 years). Hence, male smokers who are carriers of the risk allele of marker tsc0041591 on chromosome 2 have an average onset of ALDX1 almost 5 years earlier than non-carriers.
DMRfinder: efficiently identifying differentially methylated regions from MethylC-seq data.
Gaspar, John M; Hart, Ronald P
2017-11-29
DNA methylation is an epigenetic modification that is studied at a single-base resolution with bisulfite treatment followed by high-throughput sequencing. After alignment of the sequence reads to a reference genome, methylation counts are analyzed to determine genomic regions that are differentially methylated between two or more biological conditions. Even though a variety of software packages is available for different aspects of the bioinformatics analysis, they often produce results that are biased or demand excessive computational resources. DMRfinder is a novel computational pipeline that identifies differentially methylated regions efficiently. Following alignment, DMRfinder extracts methylation counts and performs a modified single-linkage clustering of methylation sites into genomic regions. It then compares methylation levels using beta-binomial hierarchical modeling and Wald tests. Among its innovative attributes are the analyses of novel methylation sites and methylation linkage, as well as the simultaneous statistical analysis of multiple sample groups. To demonstrate its efficiency, DMRfinder is benchmarked against other computational approaches using a large published dataset. Contrasting two replicates of the same sample yielded minimal genomic regions with DMRfinder, whereas two alternative software packages reported a substantial number of false positives. Further analyses of biological samples revealed fundamental differences between DMRfinder and another software package, despite the fact that they utilize the same underlying statistical basis. For each step, DMRfinder completed the analysis in a fraction of the time required by other software. Among the computational approaches for identifying differentially methylated regions from high-throughput bisulfite sequencing datasets, DMRfinder is the first that integrates all the post-alignment steps in a single package. 
Compared to other software, DMRfinder is extremely efficient and unbiased in this process. DMRfinder is free and open-source software, available on GitHub ( github.com/jsh58/DMRfinder ); it is written in Python and R, and is supported on Linux.
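The clustering step mentioned in this abstract can be pictured with plain one-dimensional single-linkage clustering: sites on one chromosome are sorted, and a new region starts whenever the gap to the previous site exceeds a threshold. The sketch below shows only that generic idea. The function name and the `max_gap` parameter are hypothetical, and the abstract does not describe what DMRfinder's "modified" variant changes, so this should not be read as the tool's own algorithm.

```python
def cluster_sites(positions, max_gap):
    """Group genomic positions into regions by 1-D single-linkage clustering.

    Two sites fall in the same region when they are connected by a chain of
    sites whose consecutive gaps are all <= max_gap (hypothetical parameter).
    """
    clusters = []
    for pos in sorted(positions):
        if clusters and pos - clusters[-1][-1] <= max_gap:
            # Close enough to the last site of the current region: extend it.
            clusters[-1].append(pos)
        else:
            # Gap too large (or first site): open a new region.
            clusters.append([pos])
    return clusters
```

With the illustrative positions `[100, 120, 500, 510, 530, 2000]` and `max_gap=50`, this yields three regions: `[100, 120]`, `[500, 510, 530]`, and `[2000]`.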
Costantini, Laura; Battilana, Juri; Lamaj, Flutura; Fanizza, Girolamo; Grando, Maria Stella
2008-01-01
Background The timing of grape ripening initiation, length of maturation period, berry size and seed content are target traits in viticulture. The availability of early and late ripening varieties is desirable for staggering harvest along growing season, expanding production towards periods when the fruit gets a higher value in the market and ensuring an optimal plant adaptation to climatic and geographic conditions. Berry size determines grape productivity; seedlessness is especially demanded in the table grape market and is negatively correlated to fruit size. These traits result from complex developmental processes modified by genetic, physiological and environmental factors. In order to elucidate their genetic determinism we carried out a quantitative analysis in a 163 individuals-F1 segregating progeny obtained by crossing two table grape cultivars. Results Molecular linkage maps covering most of the genome (2n = 38 for Vitis vinifera) were generated for each parent. Eighteen pairs of homologous groups were integrated into a consensus map spanning over 1426 cM with 341 markers (mainly microsatellite, AFLP and EST-derived markers) and an average map distance between loci of 4.2 cM. Segregating traits were evaluated in three growing seasons by recording flowering, veraison and ripening dates and by measuring berry size, seed number and weight. QTL (Quantitative Trait Loci) analysis was carried out based on single marker and interval mapping methods. QTLs were identified for all but one of the studied traits, a number of them steadily over more than one year. Clusters of QTLs for different characters were detected, suggesting linkage or pleiotropic effects of loci, as well as regions affecting specific traits. The most interesting QTLs were investigated at the gene level through a bioinformatic analysis of the underlying Pinot noir genomic sequence. Conclusion Our results revealed novel insights into the genetic control of relevant grapevine features. 
They provide a basis for performing marker-assisted selection and testing the role of specific genes in trait variation. PMID:18419811
McCaskie, Pamela A; Carter, Kim W; McCaskie, Simon R; Palmer, Lyle J
2005-01-01
We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes. PMID:16451612
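For readers unfamiliar with the pairwise LD statistics that such plots typically display, the following is a minimal sketch of the standard definitions of D, |D'| and r^2 computed from haplotype and allele frequencies. The function name is hypothetical, this is not JLIN's code, and real genotype data would first require haplotype frequency estimation, which is where missing data enters.

```python
def ld_measures(p_ab, p_a, p_b):
    """Return (D, |D'|, r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a, p_b: frequencies of allele A and allele B
    """
    denom = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denom == 0.0:
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b
    r_squared = d * d / denom
    # D is normalised by its largest attainable magnitude given the
    # allele frequencies; the bound depends on the sign of D.
    if d > 0:
        d_max = min(p_a * (1.0 - p_b), (1.0 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1.0 - p_a) * (1.0 - p_b))
    d_prime = abs(d) / d_max
    return d, d_prime, r_squared
```

When the two alleles always occur together (for example `p_ab = p_a = p_b = 0.5`), both |D'| and r^2 equal 1; when the loci are independent (`p_ab = p_a * p_b`), both equal 0.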
Meta-analysis of genome-wide linkage studies in BMI and obesity.
Saunders, Catherine L; Chiodini, Benedetta D; Sham, Pak; Lewis, Cathryn M; Abkevich, Victor; Adeyemo, Adebowale A; de Andrade, Mariza; Arya, Rector; Berenson, Gerald S; Blangero, John; Boehnke, Michael; Borecki, Ingrid B; Chagnon, Yvon C; Chen, Wei; Comuzzie, Anthony G; Deng, Hong-Wen; Duggirala, Ravindranath; Feitosa, Mary F; Froguel, Philippe; Hanson, Robert L; Hebebrand, Johannes; Huezo-Dias, Patricia; Kissebah, Ahmed H; Li, Weidong; Luke, Amy; Martin, Lisa J; Nash, Matthew; Ohman, Miina; Palmer, Lyle J; Peltonen, Leena; Perola, Markus; Price, R Arlen; Redline, Susan; Srinivasan, Sathanur R; Stern, Michael P; Stone, Steven; Stringham, Heather; Turner, Stephen; Wijmenga, Cisca; Collier, David A
2007-09-01
The objective was to provide an overall assessment of genetic linkage data of BMI and BMI-defined obesity using a nonparametric genome scan meta-analysis. We identified 37 published studies containing data on over 31,000 individuals from more than 10,000 families and obtained genome-wide logarithm of the odds (LOD) scores, non-parametric linkage (NPL) scores, or maximum likelihood scores (MLS). BMI was analyzed in a pooled set of all studies, as a subgroup of 10 studies that used BMI-defined obesity, and for subgroups ascertained through type 2 diabetes, hypertension, or subjects of European ancestry. Bins at chromosome 13q13.2-q33.1, 12q23-q24.3 achieved suggestive evidence of linkage to BMI in the pooled analysis and samples ascertained for hypertension. Nominal evidence of linkage to these regions and suggestive evidence for 11q13.3-22.3 were also observed for BMI-defined obesity. The FTO obesity gene locus at 16q12.2 also showed nominal evidence for linkage. However, overall distribution of summed rank p values <0.05 is not different from that expected by chance. The strongest evidence was obtained in the families ascertained for hypertension at 9q31.1-qter and 12p11.21-q23 (p < 0.01). Despite having substantial statistical power, we did not unequivocally implicate specific loci for BMI or obesity. This may be because genes influencing adiposity are of very small effect, with substantial genetic heterogeneity and variable dependence on environmental factors. However, the observation that the FTO gene maps to one of the highest ranking bins for obesity is interesting and, while not a validation of this approach, indicates that other potential loci identified in this study should be investigated further.
Evidence for bivariate linkage of obesity and HDL-C levels in the Framingham Heart Study.
Arya, Rector; Lehman, Donna; Hunt, Kelly J; Schneider, Jennifer; Almasy, Laura; Blangero, John; Stern, Michael P; Duggirala, Ravindranath
2003-12-31
Epidemiological studies have indicated that obesity and low high-density lipoprotein (HDL) levels are strong cardiovascular risk factors, and that these traits are inversely correlated. Despite the belief that these traits are correlated in part due to pleiotropy, knowledge on specific genes commonly affecting obesity and dyslipidemia is very limited. To address this issue, we first conducted univariate multipoint linkage analysis for body mass index (BMI) and HDL-C to identify loci influencing variation in these phenotypes using Framingham Heart Study data relating to 1702 subjects distributed across 330 pedigrees. Subsequently, we performed bivariate multipoint linkage analysis to detect common loci influencing covariation between these two traits. We scanned the genome and identified a major locus near marker D6S1009 influencing variation in BMI (LOD = 3.9) using the program SOLAR. We also identified a major locus for HDL-C near marker D2S1334 on chromosome 2 (LOD = 3.5) and another region near marker D6S1009 on chromosome 6 with suggestive evidence for linkage (LOD = 2.7). Since these two phenotypes have been independently mapped to the same region on chromosome 6q, we used the bivariate multipoint linkage approach using SOLAR. The bivariate linkage analysis of BMI and HDL-C implicated the genetic region near marker D6S1009 as harboring a major gene commonly influencing these phenotypes (bivariate LOD = 6.2; LODeq = 5.5) and appears to improve power to map the correlated traits to a region, precisely. We found substantial evidence for a quantitative trait locus with pleiotropic effects, which appears to influence both BMI and HDL-C phenotypes in the Framingham data.
Simpson, Claire L.; Wojciechowski, Robert; Ibay, Grace; Stambolian, Dwight
2011-01-01
Purpose Despite many years of research, most of the genetic factors contributing to myopia development remain unknown. Genetic studies have pointed to a strong inherited component, but although many candidate regions have been implicated, few genes have been positively identified. Methods We have previously reported 2 genomewide linkage scans in a population of 63 highly aggregated Ashkenazi Jewish families that identified a locus on chromosome 22. Here we used ordered subset analysis (OSA), conditioned on non-parametric linkage to chromosome 22 to detect other chromosomal regions which had evidence of linkage to myopia in subsets of the families, but not the overall sample. Results Strong evidence of linkage to a 19-cM linkage interval with a peak OSA nonparametric allele-sharing logarithm-of-odds (LOD) score of 3.14 on 20p12-q11.1 (ΔLOD=2.39, empirical p=0.029) was identified in a subset of 20 families that also exhibited strong evidence of linkage to chromosome 22. One other locus also presented with suggestive LOD scores >2.0 on chromosome 11p14-q14 and one locus on chromosome 6q22-q24 had an OSA LOD score=1.76 (ΔLOD=1.65, empirical p=0.02). Conclusions The chromosome 6 and 20 loci are entirely novel and appear linked in a subset of families whose myopia is known to be linked to chromosome 22. The chromosome 11 locus overlaps with the known Myopia-7 (MYP7, OMIM 609256) locus. Using ordered subset analysis allows us to find additional loci linked to myopia in subsets of families, and underlines the complex genetic heterogeneity of myopia even in highly aggregated families and genetically isolated populations such as the Ashkenazi Jews. PMID:21738393
NASA Astrophysics Data System (ADS)
Seanego, K. G.; Moyo, N. A. G.
Population growth in urban areas is putting pressure on sewage treatment plants. The improper treatment of sewage entering the aquatic ecosystems causes deterioration of the water quality of the receiving water body. The effect of sewage effluent on the Sand River was assessed. Eight sampling sites were selected; sites 1 and 2 were upstream of the sewage treatment plant along the urbanised area of Polokwane, whilst sites 3, 4, 5, 6, 7 and 8 were downstream. The physico-chemical parameters and coliform counts in the water samples were determined. The suitability of the water for irrigation was also determined. Hierarchical average linkage cluster analysis produced two clusters, grouping two sites above the sewage treatment works and six sites downstream of the sewage effluent discharge point. Principal component analysis (PCA) identified total nitrogen, total phosphorus, conductivity and salinity as the major factors contributing to the variability of the Sand River water quality. These factors are strongly associated with the downstream sites. Canonical correspondence analysis (CCA) indicated the macroinvertebrates, Chironomidae, Belostomatidae, Chaoborus and Hirudinea being strongly associated with nitrogen, phosphorus, conductivity and temperature. Escherichia coli levels in the Polokwane wastewater treatment works maturation ponds could potentially lead to contamination of the Polokwane aquifer. The Sodium Adsorption Ratio was between 1.5 and 3.0 and residual sodium carbonate was below 1.24 meq/l, indicating that the Sand River water is still suitable for irrigation. The total phosphorus concentrations fluctuated across the different sites. Total nitrogen concentrations showed a gradual decrease downstream from the point of discharge. This shows that the river still has a good self-purification capacity.
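The two irrigation indices quoted in this abstract follow standard definitions, sketched below under the assumption that all ion concentrations are expressed in meq/l. The function names are hypothetical, and the abstract does not report the underlying ion concentrations, so any input values a reader supplies are illustrative rather than data from the study.

```python
import math


def sodium_adsorption_ratio(na, ca, mg):
    """SAR = Na / sqrt((Ca + Mg) / 2), with all concentrations in meq/l."""
    return na / math.sqrt((ca + mg) / 2.0)


def residual_sodium_carbonate(co3, hco3, ca, mg):
    """RSC = (CO3 + HCO3) - (Ca + Mg), with all concentrations in meq/l."""
    return (co3 + hco3) - (ca + mg)
```

As a made-up example, water with Na = 4, Ca = 2 and Mg = 2 meq/l has SAR = 4 / sqrt(2), about 2.83, which would fall inside the 1.5 to 3.0 range reported above.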
Chak Han Im; Young-Hoon Park; Kenneth E. Hammel; Bokyung Park; Soon Wook Kwon; Hojin Ryu; Jae-San Ryu
2016-01-01
Breeding new strains with improved traits is a long-standing goal of mushroom breeders that can be expedited by marker-assisted selection (MAS). We constructed a genetic linkage map of Pleurotus eryngii based on segregation analysis of markers in postmeiotic monokaryons from KNR2312. In total, 256 loci comprising 226 simple sequence-repeat (SSR) markers, 2 mating-type...
A Genetic Linkage Map for Cattle
Bishop, M. D.; Kappes, S. M.; Keele, J. W.; Stone, R. T.; Sunden, SLF.; Hawkins, G. A.; Toldo, S. S.; Fries, R.; Grosz, M. D.; Yoo, J.; Beattie, C. W.
1994-01-01
We report the most extensive physically anchored linkage map for cattle produced to date. Three-hundred thirteen genetic markers ordered in 30 linkage groups, anchored to 24 autosomal chromosomes (n = 29), the X and Y chromosomes, four unanchored syntenic groups and two unassigned linkage groups spanning 2464 cM of the bovine genome are summarized. The map also assigns 19 type I loci to specific chromosomes and/or syntenic groups and four cosmid clones containing informative microsatellites to chromosomes 13, 25 and 29 anchoring syntenic groups U11, U7 and U8, respectively. This map provides the skeletal framework prerequisite to development of a comprehensive genetic map for cattle and analysis of economic trait loci (ETL). PMID:7908653
Global Occurrence of Archaeal amoA Genes in Terrestrial Hot Springs▿
Zhang, Chuanlun L.; Ye, Qi; Huang, Zhiyong; Li, WenJun; Chen, Jinquan; Song, Zhaoqi; Zhao, Weidong; Bagwell, Christopher; Inskeep, William P.; Ross, Christian; Gao, Lei; Wiegel, Juergen; Romanek, Christopher S.; Shock, Everett L.; Hedlund, Brian P.
2008-01-01
Despite the ubiquity of ammonium in geothermal environments and the thermodynamic favorability of aerobic ammonia oxidation, thermophilic ammonia-oxidizing microorganisms belonging to the crenarchaeota kingdom have only recently been described. In this study, we analyzed microbial mats and surface sediments from 21 hot spring samples (pH 3.4 to 9.0; temperature, 41 to 86°C) from the United States, China, and Russia and obtained 846 putative archaeal ammonia monooxygenase large-subunit (amoA) gene and transcript sequences, representing a total of 41 amoA operational taxonomic units (OTUs) at 2% identity. The amoA gene sequences were highly diverse, yet they clustered within two major clades of archaeal amoA sequences known from water columns, sediments, and soils: clusters A and B. Eighty-four percent (711/846) of the sequences belonged to cluster A, which is typically found in water columns and sediments, whereas 16% (135/846) belonged to cluster B, which is typically found in soils and sediments. Although a few amoA OTUs were present in several geothermal regions, most were specific to a single region. In addition, cluster A amoA genes formed geographic groups, while cluster B sequences did not group geographically. With the exception of only one hot spring, principal-component analysis and UPGMA (unweighted-pair group method using average linkages) based on the UniFrac metric derived from cluster A grouped the springs by location, regardless of temperature or bulk water pH, suggesting that geography may play a role in structuring communities of putative ammonia-oxidizing archaea (AOA). The amoA genes were distinct from those of low-temperature environments; in particular, pair-wise comparisons between hot spring amoA genes and those from sympatric soils showed less than 85% sequence identity, underscoring the distinctness of hot spring archaeal communities from those of the surrounding soil system. 
Reverse transcription-PCR showed that amoA genes were transcribed in situ in one spring and the transcripts were closely related to the amoA genes amplified from the same spring. Our study demonstrates the global occurrence of putative archaeal amoA genes in a wide variety of terrestrial hot springs and suggests that geography may play an important role in selecting different assemblages of AOA. PMID:18676703
Global occurrence of archaeal amoA genes in terrestrial hot springs.
Zhang, Chuanlun L; Ye, Qi; Huang, Zhiyong; Li, Wenjun; Chen, Jinquan; Song, Zhaoqi; Zhao, Weidong; Bagwell, Christopher; Inskeep, William P; Ross, Christian; Gao, Lei; Wiegel, Juergen; Romanek, Christopher S; Shock, Everett L; Hedlund, Brian P
2008-10-01
Despite the ubiquity of ammonium in geothermal environments and the thermodynamic favorability of aerobic ammonia oxidation, thermophilic ammonia-oxidizing microorganisms belonging to the crenarchaeota kingdom have only recently been described. In this study, we analyzed microbial mats and surface sediments from 21 hot spring samples (pH 3.4 to 9.0; temperature, 41 to 86 degrees C) from the United States, China, and Russia and obtained 846 putative archaeal ammonia monooxygenase large-subunit (amoA) gene and transcript sequences, representing a total of 41 amoA operational taxonomic units (OTUs) at 2% identity. The amoA gene sequences were highly diverse, yet they clustered within two major clades of archaeal amoA sequences known from water columns, sediments, and soils: clusters A and B. Eighty-four percent (711/846) of the sequences belonged to cluster A, which is typically found in water columns and sediments, whereas 16% (135/846) belonged to cluster B, which is typically found in soils and sediments. Although a few amoA OTUs were present in several geothermal regions, most were specific to a single region. In addition, cluster A amoA genes formed geographic groups, while cluster B sequences did not group geographically. With the exception of only one hot spring, principal-component analysis and UPGMA (unweighted-pair group method using average linkages) based on the UniFrac metric derived from cluster A grouped the springs by location, regardless of temperature or bulk water pH, suggesting that geography may play a role in structuring communities of putative ammonia-oxidizing archaea (AOA). The amoA genes were distinct from those of low-temperature environments; in particular, pair-wise comparisons between hot spring amoA genes and those from sympatric soils showed less than 85% sequence identity, underscoring the distinctness of hot spring archaeal communities from those of the surrounding soil system. 
Reverse transcription-PCR showed that amoA genes were transcribed in situ in one spring and the transcripts were closely related to the amoA genes amplified from the same spring. Our study demonstrates the global occurrence of putative archaeal amoA genes in a wide variety of terrestrial hot springs and suggests that geography may play an important role in selecting different assemblages of AOA.
Labhardt, Niklaus Daniel; Motlomelo, Masetsibi; Cerutti, Bernard; Pfeiffer, Karolin; Kamele, Mashaete; Hobbins, Michael A; Ehmer, Jochen
2014-12-01
The success of HIV programs relies on widely accessible HIV testing and counseling (HTC) services at health facilities as well as in the community. Home-based HTC (HB-HTC) is a popular community-based approach to reach persons who do not test at health facilities. Data comparing HB-HTC to other community-based HTC approaches are very limited. This trial compares HB-HTC to mobile clinic HTC (MC-HTC). The trial was powered to test the hypothesis of higher HTC uptake in HB-HTC campaigns than in MC-HTC campaigns. Twelve clusters were randomly allocated to HB-HTC or MC-HTC. The six clusters in the HB-HTC group received 30 1-d multi-disease campaigns (five villages per cluster) that delivered services by going door-to-door, whereas the six clusters in the MC-HTC group received campaigns involving community gatherings in the 30 villages with subsequent service provision in mobile clinics. Time allocation and human resources were standardized and equal in both groups. All individuals accessing the campaigns with unknown HIV status or whose last HIV test was >12 wk ago and was negative were eligible. All outcomes were assessed at the individual level. Statistical analysis used multivariable logistic regression. Odds ratios and p-values were adjusted for gender, age, and cluster effect. Out of 3,197 participants from the 12 clusters, 2,563 (80.2%) were eligible (HB-HTC: 1,171; MC-HTC: 1,392). The results for the primary outcomes were as follows. Overall HTC uptake was higher in the HB-HTC group than in the MC-HTC group (92.5% versus 86.7%; adjusted odds ratio [aOR]: 2.06; 95% CI: 1.18-3.60; p = 0.011). Among adolescents and adults ≥ 12 y, HTC uptake did not differ significantly between the two groups; however, in children <12 y, HTC uptake was higher in the HB-HTC arm (87.5% versus 58.7%; aOR: 4.91; 95% CI: 2.41-10.0; p<0.001).
Out of those who took up HTC, 114 (4.9%) tested HIV-positive, 39 (3.6%) in the HB-HTC arm and 75 (6.2%) in the MC-HTC arm (aOR: 0.64; 95% CI: 0.48-0.86; p = 0.002). Ten (25.6%) and 19 (25.3%) individuals in the HB-HTC and in the MC-HTC arms, respectively, linked to HIV care within 1 mo after testing positive. Findings for secondary outcomes were as follows: HB-HTC reached more first-time testers, particularly among adolescents and young adults, and had a higher proportion of men among participants. However, after adjusting for clustering, the difference in male participation was not significant anymore. Age distribution among participants and immunological and clinical stages among persons newly diagnosed HIV-positive did not differ significantly between the two groups. Major study limitations included the campaigns' restriction to weekdays and a relatively low HIV prevalence among participants, the latter indicating that both arms may have reached an underexposed population. This study demonstrates that both HB-HTC and MC-HTC can achieve high uptake of HTC. The choice between these two community-based strategies will depend on the objective of the activity: HB-HTC was better in reaching children, individuals who had never tested before, and men, while MC-HTC detected more new HIV infections. The low rate of linkage to care after a positive HIV test warrants future consideration of combining community-based HTC approaches with strategies to improve linkage to care for persons who test HIV-positive. ClinicalTrials.gov NCT01459120.
Genome-wide scan of IQ finds significant linkage to a quantitative trait locus on 2q.
Luciano, M; Wright, M J; Duffy, D L; Wainwright, M A; Zhu, G; Evans, D M; Geffen, G M; Montgomery, G W; Martin, N G
2006-01-01
A genome-wide linkage scan of 795 microsatellite markers (761 autosomal, 34 X chromosome) was performed on Multidimensional Aptitude Battery subtests and verbal, performance and full scale scores, the WAIS-R Digit Symbol subtest, and two word-recognition tests (Schonell Graded Word Reading Test, Cambridge Contextual Reading Test) highly predictive of IQ. The sample included 361 families comprising 2-5 siblings who ranged in age from 15.7 to 22.2 years; genotype, but not phenotype, data were available for 81% of parents. A variance components analysis which controlled for age and sex effects showed significant linkage for the Cambridge reading test and performance IQ to the same region on chromosome 2, with respective LOD scores of 4.15 and 3.68. Suggestive linkage (LOD score>2.2) for various measures was further supported on chromosomes 6, 7, 11, 14, 21 and 22. Where location of linkage peaks converged for IQ subtests within the same scale, the overall scale score provided increased evidence for linkage to that region over any individual subtest. Association studies of candidate genes, particularly those involved in neural transmission and development, will be directed to genes located under the linkage peaks identified in this study.
An Enhanced Linkage Map of the Sheep Genome Comprising More Than 1000 Loci
Maddox, Jillian F.; Davies, Kizanne P.; Crawford, Allan M.; Hulme, Dennis J.; Vaiman, Daniel; Cribiu, Edmond P.; Freking, Bradley A.; Beh, Ken J.; Cockett, Noelle E.; Kang, Nina; Riffkin, Christopher D.; Drinkwater, Roger; Moore, Stephen S.; Dodds, Ken G.; Lumsden, Joanne M.; van Stijn, Tracey C.; Phua, Sin H.; Adelson, David L.; Burkin, Heather R.; Broom, Judith E.; Buitkamp, Johannes; Cambridge, Lisa; Cushwa, William T.; Gerard, Emily; Galloway, Susan M.; Harrison, Blair; Hawken, Rachel J.; Hiendleder, Stefan; Henry, Hannah M.; Medrano, Juan F.; Paterson, Korena A.; Schibler, Laurent; Stone, Roger T.; van Hest, Beryl
2001-01-01
A medium-density linkage map of the ovine genome has been developed. Marker data for 550 new loci were generated and merged with the previous sheep linkage map. The new map comprises 1093 markers representing 1062 unique loci (941 anonymous loci, 121 genes) and spans 3500 cM (sex-averaged) for the autosomes and 132 cM (female) on the X chromosome. There is an average spacing of 3.4 cM between autosomal loci and 8.3 cM between highly polymorphic [polymorphic information content (PIC) ≥ 0.7] autosomal loci. The largest gap between markers is 32.5 cM, and the number of gaps of >20 cM between loci, or regions where loci are missing from chromosome ends, has been reduced from 40 in the previous map to 6. Five hundred and seventy-three of the loci can be ordered on a framework map with odds of >1000 : 1. The sheep linkage map contains strong links to both the cattle and goat maps. Five hundred and seventy-two of the loci positioned on the sheep linkage map have also been mapped by linkage analysis in cattle, and 209 of the loci mapped on the sheep linkage map have also been placed on the goat linkage map. Inspection of ruminant linkage maps indicates that the genomic coverage by the current sheep linkage map is comparable to that of the available cattle maps. The sheep map provides a valuable resource to the international sheep, cattle, and goat gene mapping community. PMID:11435411
Jalali, Ali; Aldinger, Kimberly A.; Chary, Ajit; Mclone, David G.; Bowman, Robin M.; Le, Luan Cong; Jardine, Phillip; Newbury-Ecob, Ruth; Mallick, Andrew; Jafari, Nadereh; Russell, Eric J.; Curran, John; Nguyen, Pam; Ouahchi, Karim; Lee, Charles; Dobyns, William B.; Millen, Kathleen J.; Pina-Neto, Joao M.; Kessler, John A.; Bassuk, Alexander G.
2010-01-01
We previously reported a Vietnamese-American family with isolated autosomal dominant occipital cephalocele. Upon further neuroimaging studies, we have recharacterized this condition as autosomal dominant Dandy-Walker with occipital cephalocele (ADDWOC). A similar ADDWOC family from Brazil was also recently described. To determine the genetic etiology of ADDWOC, we performed genome-wide linkage analysis on members of the Vietnamese-American and Brazilian pedigrees. Linkage analysis of the Vietnamese-American family identified the ADDWOC causative locus on chromosome 2q36.1 with a multipoint parametric LOD score of 3.3, while haplotype analysis refined the locus to 1.1 Mb. Sequencing of the five known genes in this locus did not identify any protein-altering mutations. However, a terminal deletion of chromosome 2 in a patient with an isolated case of Dandy-Walker malformation also encompassed the 2q36.1 chromosomal region. The Brazilian pedigree did not show linkage to this 2q36.1 region. Taken together, these results demonstrate a locus for ADDWOC on 2q36.1 and also suggest locus heterogeneity for ADDWOC. PMID:18204864
Identification of a herpes simplex labialis susceptibility region on human chromosome 21.
Hobbs, Maurine R; Jones, Brandt B; Otterud, Brith E; Leppert, Mark; Kriesel, John D
2008-02-01
Most of the United States population is infected with either herpes simplex virus type 1 (HSV-1), herpes simplex virus type 2, or both. Reactivations of HSV-1 infection cause herpes simplex labialis (HSL; cold sores or fever blisters), which is the most common recurring viral infection in humans. To investigate the possibility of a human genetic component conferring resistance or susceptibility to cold sores (i.e., a HSL susceptibility gene), we conducted a genetic linkage analysis that included serotyping and phenotyping 421 individuals from 39 families enrolled in the Utah Genetic Reference Project. Linkage analysis identified a 2.5-Mb nonrecombinant region of interest on the long arm of human chromosome 21, with a multipoint logarithm of odds score of 3.9 noted near marker abmc65 (D21S409). Nonparametric linkage analysis of the data also provided strong evidence for linkage (P = .0005). This region of human chromosome 21 contains 6 candidate genes for herpes susceptibility. The development of frequent cold sores is associated with a region on the long arm of human chromosome 21. This region contains several candidate genes that could influence the frequency of outbreaks of HSL.
Muraoka, Azusa; Inokuchi, Yoshiya; Hammer, Nathan I; Shin, Joong-Won; Johnson, Mark A; Nagata, Takashi
2009-08-06
The [(CO2)n(H2O)]- cluster anions are studied using infrared photodissociation (IPD) spectroscopy in the 2800-3800 cm⁻¹ range. The observed IPD spectra display a drastic change in the vibrational band features at n = 4, indicating a sharp discontinuity in the structural evolution of the monohydrated cluster anions. The n = 2 and 3 spectra are composed of a series of sharp bands around 3600 cm⁻¹, which are assignable to the stretching vibrations of H2O bound to C2O4- in a double ionic hydrogen-bonding (DIHB) configuration, as was previously discussed (J. Chem. Phys. 2005, 122, 094303). In the n ≥ 4 spectrum, a pair of intense bands additionally appears at approximately 3300 cm⁻¹. With the aid of ab initio calculations at the MP2/6-31+G* level, the 3300 cm⁻¹ bands are assigned to the bending overtone and the hydrogen-bonded OH vibration of H2O bound to CO2- via a single O-H...O linkage. Thus, the structures of [(CO2)n(H2O)]- evolve with cluster size such that DIHB to C2O4- is favored in the smaller clusters with n = 2 and 3 whereas CO2- is preferentially stabilized via the formation of a single ionic hydrogen-bonding (SIHB) configuration in the larger clusters with n ≥ 4.
Evaluation of identifier field agreement in linked neonatal records.
Hall, E S; Marsolo, K; Greenberg, J M
2017-08-01
To better address barriers arising from missing and unreliable identifiers in neonatal medical records, we evaluated agreement and discordance among traditional and non-traditional linkage fields within a linked neonatal data set. The retrospective, descriptive analysis represents infants born from 2013 to 2015. We linked children's hospital neonatal physician billing records to newborn medical records originating from an academic delivery hospital and evaluated rates of agreement, discordance and missingness for a set of 12 identifier field pairs used in the linkage algorithm. We linked 7293 of 7404 physician billing records (98.5%), all of which were deemed valid upon manual review. Linked records contained a mean of 9.1 matching and 1.6 non-matching identifier pairs. Only 4.8% had complete agreement among all 12 identifier pairs. Our approach to selection of linkage variables and data formatting preparatory to linkage have generalizability, which may inform future neonatal and perinatal record linkage efforts.
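The per-record tally of matching, non-matching and missing identifier pairs described above can be sketched as follows. This is a minimal illustration, not the authors' linkage algorithm: the field names, the normalization (trim and lowercase) and the function name compare_identifiers are all assumptions made for the example.

```python
def compare_identifiers(rec_a, rec_b, fields):
    """Count matching, non-matching and missing identifier pairs for two linked records.

    rec_a and rec_b are dicts; fields lists the identifier names to compare.
    Field names and the normalization rule are hypothetical.
    """
    counts = {"match": 0, "non_match": 0, "missing": 0}
    for field in fields:
        a, b = rec_a.get(field), rec_b.get(field)
        # A pair is "missing" when either side is absent or empty.
        if a in (None, "") or b in (None, ""):
            counts["missing"] += 1
        elif str(a).strip().lower() == str(b).strip().lower():
            counts["match"] += 1
        else:
            counts["non_match"] += 1
    return counts


# Hypothetical billing record and delivery-hospital record for one infant.
billing = {"last_name": "Smith", "dob": "2014-03-02", "mrn": ""}
delivery = {"last_name": "smith ", "dob": "2014-03-03", "mrn": "123"}
result = compare_identifiers(billing, delivery, ["last_name", "dob", "mrn"])
```

Averaging these counts over all linked records would give figures of the kind reported in the abstract (mean matching and non-matching pairs per record).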
An autosomal genetic linkage map of the sheep genome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Crawford, A.M.; Ede, A.J.; Pierson, C.A.
1995-06-01
We report the first extensive ovine genetic linkage map covering 2070 cM of the sheep genome. The map was generated from the linkage analysis of 246 polymorphic markers, in nine three-generation full-sib pedigrees, which make up the AgResearch International Mapping Flock. We have exploited many markers from cattle so that valuable comparisons between these two ruminant linkage maps can be made. The markers, used in the segregation analyses, comprised 86 anonymous microsatellite markers derived from the sheep genome, 126 anonymous microsatellites from cattle, one from deer, and 33 polymorphic markers of various types associated with known genes. The maximum number of informative meioses within the mapping flock was 22. The average number of informative meioses per marker was 140 (range 18-209). Linkage groups have been assigned to all 26 sheep autosomes. 102 refs., 8 figs., 5 tabs.
Loewenstein, Yaniv; Portugaly, Elon; Fromer, Menachem; Linial, Michal
2008-07-01
UPGMA (average linking) is probably the most popular algorithm for hierarchical data clustering, especially in computational biology. However, UPGMA requires the entire dissimilarity matrix in memory. Due to this prohibitive requirement, UPGMA is not scalable to very large datasets. We present a novel class of memory-constrained UPGMA (MC-UPGMA) algorithms. Given any practical memory size constraint, this framework guarantees the correct clustering solution without explicitly requiring all dissimilarities in memory. The algorithms are general and are applicable to any dataset. We present a data-dependent characterization of hardness and clustering efficiency. The presented concepts are applicable to any agglomerative clustering formulation. We apply our algorithm to the entire collection of protein sequences, to automatically build a comprehensive evolutionary-driven hierarchy of proteins from sequence alone. The newly created tree captures protein families better than state-of-the-art large-scale methods such as CluSTr, ProtoNet4 or single-linkage clustering. We demonstrate that leveraging the entire mass embodied in all sequence similarities allows significant improvement over current protein family clusterings, which are unable to directly tackle the sheer mass of this data. Furthermore, we argue that non-metric constraints are an inherent complexity of the sequence space and should not be overlooked. The robustness of UPGMA allows significant improvement, especially for multidomain proteins, and for large or divergent families. A comprehensive tree built from all UniProt sequence similarities, together with navigation and classification tools will be made available as part of the ProtoNet service. A C++ implementation of the algorithm is available on request.
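For orientation, plain UPGMA can be sketched in a few lines. The version below is the naive textbook algorithm, not the memory-constrained MC-UPGMA of the abstract: it holds every pairwise dissimilarity in one dictionary, which is exactly the requirement the authors set out to remove. The function name, the input layout and the toy distances are assumptions for illustration.

```python
from itertools import combinations


def upgma(labels, dist):
    """Naive UPGMA (average linkage) clustering.

    labels: list of leaf names.
    dist:   dict mapping (label_a, label_b) to a dissimilarity, one entry per pair.
    Returns (tree, heights): a nested tuple and the list of merge heights.
    """
    sizes = {label: 1 for label in labels}            # cluster -> number of leaves
    d = {frozenset(pair): value for pair, value in dist.items()}
    heights = []
    while len(sizes) > 1:
        # Pick the closest pair of current clusters.
        a, b = min(combinations(sizes, 2), key=lambda p: d[frozenset(p)])
        heights.append(d[frozenset((a, b))] / 2)
        na, nb = sizes.pop(a), sizes.pop(b)
        merged = (a, b)
        # Distance to every other cluster is the size-weighted average.
        for c in sizes:
            d[frozenset((merged, c))] = (
                na * d[frozenset((a, c))] + nb * d[frozenset((b, c))]
            ) / (na + nb)
        sizes[merged] = na + nb
    (tree,) = sizes
    return tree, heights


# Toy dissimilarities for four sequences (made-up values).
toy = {("A", "B"): 2, ("A", "C"): 6, ("B", "C"): 6,
       ("A", "D"): 10, ("B", "D"): 10, ("C", "D"): 10}
tree, heights = upgma(["A", "B", "C", "D"], toy)
```

Both the dictionary and the pair search grow quadratically with the number of items, which illustrates why the full-matrix formulation does not scale to very large sequence collections.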
Evolution of Chemical Diversity in Echinocandin Lipopeptide Antifungal Metabolites
Yue, Qun; Chen, Li; Zhang, Xiaoling; Li, Kuan; Sun, Jingzu; Liu, Xingzhong
2015-01-01
The echinocandins are a class of antifungal drugs that includes caspofungin, micafungin, and anidulafungin. Gene clusters encoding most of the structural complexity of the echinocandins provided a framework for hypotheses about the evolutionary history and chemical logic of echinocandin biosynthesis. Gene orthologs among echinocandin-producing fungi were identified. Pathway genes, including the nonribosomal peptide synthetases (NRPSs), were analyzed phylogenetically to address the hypothesis that these pathways represent descent from a common ancestor. The clusters share cooperative gene contents and linkages among the different strains. Individual pathway genes analyzed in the context of similar genes formed unique echinocandin-exclusive phylogenetic lineages. The echinocandin NRPSs, along with the NRPS from the inp gene cluster in Aspergillus nidulans and its orthologs, comprise a novel lineage among fungal NRPSs. NRPS adenylation domains from different species exhibited a one-to-one correspondence between modules and amino acid specificity that is consistent with models of tandem duplication and subfunctionalization. Pathway gene trees and Ascomycota phylogenies are congruent and consistent with the hypothesis that the echinocandin gene clusters have a common origin. The disjunct Eurotiomycete-Leotiomycete distribution appears to be consistent with a scenario of vertical descent accompanied by incomplete lineage sorting and loss of the clusters from most lineages of the Ascomycota. We present evidence for a single evolutionary origin of the echinocandin family of gene clusters and a progression of structural diversification in two fungal classes that diverged approximately 290 to 390 million years ago. Lineage-specific gene cluster evolution driven by selection of new chemotypes contributed to diversification of the molecular functionalities. PMID:26024901
Nagano, Soichiro; Shirasawa, Kenta; Hirakawa, Hideki; Maeda, Fumi; Ishikawa, Masami; Isobe, Sachiko N
2017-05-12
The strawberry, Fragaria × ananassa, is an allo-octoploid (2n = 8x = 56) and outcrossing species. Although it is the most widely consumed berry crop in the world, its complex genome structure has hindered its genetic and genomic analysis, and thus discrimination of subgenome-specific loci among the homoeologous chromosomes is needed. In the present study, we identified candidate subgenome-specific single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) loci, and constructed a linkage map using an S1 mapping population of the cultivar 'Reikou' with an IStraw90 Axiom® SNP array and previously published SSR markers. The 'Reikou' linkage map consisted of 11,574 loci (11,002 SNPs and 572 SSR loci) spanning 2816.5 cM of 31 linkage groups. The 11,574 loci were located on 4738 unique positions (bins) on the linkage map. Of the mapped loci, 8999 (8588 SNPs and 411 SSR loci) showed a 1:2:1 segregation ratio of AA:AB:BB allele, which suggested the possibility of deriving loci from candidate subgenome-specific sequences. In addition, 2575 loci (2414 SNPs and 161 SSR loci) showed a 3:1 segregation of AB:BB allele, indicating they were derived from homoeologous genomic sequences. Comparative analysis of the homoeologous linkage groups revealed differences in genome structure among the subgenomes. Our results suggest that candidate subgenome-specific loci are randomly located across the genomes, and that there are small- to large-scale structural variations among the subgenomes. The mapped SNPs and SSR loci on the linkage map are expected to be seed points for the construction of pseudomolecules in the octoploid strawberry.
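Whether a locus fits a 1:2:1 or a 3:1 segregation ratio is commonly checked with a chi-square goodness-of-fit test. The sketch below shows that standard test; the abstract does not state which test or threshold the authors used, and the genotype counts are invented for the example.

```python
import math


def segregation_chi2(observed, ratio):
    """Chi-square goodness-of-fit statistic for genotype counts against a ratio."""
    total = sum(observed)
    expected = [total * r / sum(ratio) for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def chi2_p_value(stat, df):
    """Upper-tail chi-square probability, using closed forms for df = 1 and df = 2."""
    if df == 1:
        return math.erfc(math.sqrt(stat / 2))
    if df == 2:
        return math.exp(-stat / 2)
    raise ValueError("only df = 1 or df = 2 are handled in this sketch")


# Hypothetical counts of AA, AB, BB genotypes at one locus (1:2:1 expected, df = 2).
stat_121 = segregation_chi2([30, 50, 20], [1, 2, 1])
p_121 = chi2_p_value(stat_121, df=2)

# Hypothetical counts of AB, BB genotype classes at another locus (3:1 expected, df = 1).
stat_31 = segregation_chi2([80, 20], [3, 1])
p_31 = chi2_p_value(stat_31, df=1)
```

A large p-value means the counts are compatible with the tested ratio; a locus would be assigned to whichever ratio it does not significantly depart from.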
Jasinska, A.J.; Service, S.; Jawaheer, D.; DeYoung, J.; Levinson, M.; Zhang, Z.; Kremeyer, B.; Muller, H.; Aldana, I.; Garcia, J.; Restrepo, G.; Lopez, C.; Palacio, C.; Duque, C.; Parra, M.; Vega, J.; Ortiz, D.; Bedoya, G.; Mathews, C.; Davanzo, P.; Fournier, E.; Bejarano, J.; Ramirez, M.; Ortiz, C. Araya; Araya, X.; Molina, J.; Sabatti, C.; Reus, V.; Ospina, J.; Macaya, G.; Ruiz-Linares, A.; Freimer, N.B.
2016-01-01
We previously reported linkage of bipolar disorder to 5q33-q34 in families from two closely related population isolates, the Central Valley of Costa Rica (CVCR) and Antioquia, Colombia (CO). Here we present follow-up results from fine-scale mapping in large CVCR and CO families segregating severe bipolar disorder, BP-I, and in 343 population trios/duos from CVCR and CO. Employing densely spaced SNPs to fine map the prior linkage peak region increases linkage evidence and clarifies the position of the putative BP-I locus. We performed two-point linkage analysis with 1134 SNPs in an approximately 9 Mb region between markers D5S410 and D5S422. Combining pedigrees from CVCR and CO yields a LOD score of 4.9 at SNP rs10035961. Two other SNPs (rs7721142 and rs1422795) within the same 94 kb region also displayed LOD scores greater than 4. This linkage peak coincides with our prior microsatellite results and suggests a narrowed BP-I susceptibility region in these families. To investigate if the locus implicated in the familial form of BP-I also contributes to disease risk in the population, we followed up the family results with association analysis in duo and trio samples, obtaining signals within 2 Mb of the peak linkage signal in the pedigrees; rs12523547 and rs267015 (P = 0.00004 and 0.00016, respectively) in the CO sample and rs244960 in the CVCR sample and the combined sample, with P = 0.00032 and 0.00016, respectively. It remains unclear whether these association results reflect the same locus contributing to BP susceptibility within the extended pedigrees. PMID:19319892
Poon, Art F. Y.; Gustafson, Réka; Daly, Patricia; Zerr, Laura; Demlow, S. Ellen; Wong, Jason; Woods, Conan K; Hogg, Robert S.; Krajden, Mel; Moore, David; Kendall, Perry; Montaner, Julio S. G.; Harrigan, P. Richard
2016-01-01
Background: Due to the rapid evolution of HIV, infections with similar genetic sequences are likely to be related by recent transmission events. Clusters of related infections can represent subpopulations with high rates of HIV transmission. Here we describe the implementation of an automated “near real-time” system using clustering analysis of routinely collected HIV resistance genotypes to monitor and characterize HIV transmission hotspots in British Columbia (BC). Methods: A monitoring system was implemented on the BC Drug Treatment Database, which currently holds over 32000 anonymized HIV genotypes for nearly 9000 residents of BC living with HIV. On average, five to six new HIV genotypes are deposited in the database every day, which triggers an automated re-analysis of the entire database. Clusters of five or more individuals were extracted on the basis of short phylogenetic distances between their respective HIV sequences. Monthly reports on the growth and characteristics of clusters were generated by the system and distributed to public health officers. Findings: In June 2014, the monitoring system detected the expansion of a cluster by 11 new cases over three months, including eight cases with transmitted drug resistance. This cluster generally comprised young men who have sex with men. The subsequent report precipitated an enhanced public health follow-up to ensure linkage to care and treatment initiation in the affected subpopulation. Of the nine cases associated with this follow-up, all had already been linked to care and five cases had started treatment. Subsequent to the follow-up, three additional cases started treatment and the majority of cases achieved suppressed viral loads. Over the following 12 months, 12 new cases were detected in this cluster with a marked reduction in the onward transmission of drug resistance.
Interpretation: Our findings demonstrate the first application of an automated phylogenetic system monitoring a clinical database to detect a recent HIV outbreak and support the ensuing public health response. By making secondary use of routinely collected HIV genotypes, this approach is cost-effective, attains near real-time monitoring of new cases, and can be implemented in all settings where HIV genotyping is the standard of care. Funding: This work was supported by the BC Centre for Excellence in HIV/AIDS and by grants from the Canadian Institutes for Health Research (CIHR HOP-111406, HOP-107544), the Genome BC, Genome Canada and CIHR Partnership in Genomics and Personalized Health (Large-Scale Applied Research Project HIV142 contract to PRH, JSGM, and AFYP), and by the US National Institute on Drug Abuse (1-R01-DA036307-01, 5-R01-031055-02, R01-DA021525-06, and R01-DA011591). PMID:27126490
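The cluster-extraction step (groups of five or more individuals joined by short distances) can be illustrated with a simple connected-components pass. This is only a sketch under stated assumptions: the abstract does not give the distance cutoff or the exact clustering rule, so the threshold value, the input layout and the function name extract_clusters are hypothetical.

```python
def extract_clusters(distances, threshold, min_size=5):
    """Group individuals connected by pairwise distances at or below a cutoff.

    distances: dict mapping (id_a, id_b) to a pairwise distance.
    threshold: hypothetical cutoff; the source does not report the value used.
    Returns the connected components with at least min_size members.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), dist in distances.items():
        root_a, root_b = find(a), find(b)
        if dist <= threshold and root_a != root_b:
            parent[root_a] = root_b

    groups = {}
    for individual in list(parent):
        groups.setdefault(find(individual), set()).add(individual)
    return [members for members in groups.values() if len(members) >= min_size]


# Made-up distances: a chain of five close sequences plus a separate close pair.
toy_distances = {
    ("p1", "p2"): 0.01, ("p2", "p3"): 0.01, ("p3", "p4"): 0.01,
    ("p4", "p5"): 0.01, ("p5", "p6"): 0.20, ("p6", "p7"): 0.01,
}
clusters = extract_clusters(toy_distances, threshold=0.02)
```

Re-running such a function after each batch of new genotypes and comparing cluster sizes between runs is one way a monitoring system could flag growing clusters.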
On computation of p-values in parametric linkage analysis.
Kurbasic, Azra; Hössjer, Ola
2004-01-01
Parametric linkage analysis is usually used to find chromosomal regions linked to a disease (phenotype) that is described with a specific genetic model. This is done by investigating the relations between the disease and genetic markers, that is, well-characterized loci of known position with a clear Mendelian mode of inheritance. Assume we have found an interesting region on a chromosome that we suspect is linked to the disease. Then we want to test the hypothesis of no linkage versus the alternative one of linkage. As a measure we use the maximal lod score $Z_{\max}$. It is well known that the maximal lod score has asymptotically a $(2\ln 10)^{-1}\left(\tfrac{1}{2}\chi^2_0 + \tfrac{1}{2}\chi^2_1\right)$ distribution under the null hypothesis of no linkage when only one point (one marker) on the chromosome is studied. In this paper, we show, both by simulations and theoretical arguments, that the null hypothesis distribution of $Z_{\max}$ has no simple form when more than one marker is used (multipoint analysis). In fact, the distribution of $Z_{\max}$ depends on the number of families, their structure, the assumed genetic model, marker denseness, and marker informativity. This means that a constant critical limit of $Z_{\max}$ leads to tests associated with different significance levels. Because of the above-mentioned problems, from the statistical point of view the maximal lod score should be supplemented by a p-value when results are reported. Copyright (c) 2004 S. Karger AG, Basel.
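For the single-marker case, the asymptotic mixture quoted above gives a closed-form p-value: for z > 0, P(Zmax ≥ z) = ½ P(χ² with 1 df ≥ 2 ln(10) z), and the 1-df chi-square tail equals erfc(sqrt(x/2)). The sketch below applies only that single-point approximation; as the abstract stresses, it is not valid for multipoint analysis. The function name is an assumption.

```python
import math


def single_marker_p_value(z_max):
    """Asymptotic pointwise p-value for a maximal lod score at one marker.

    Uses the 50:50 mixture of a point mass at zero and a 1-df chi-square,
    scaled by 1 / (2 ln 10). Not applicable to multipoint lod scores.
    """
    if z_max <= 0:
        # Half of the null mass sits at zero, so any non-positive score gives p = 1.
        return 1.0
    return 0.5 * math.erfc(math.sqrt(z_max * math.log(10)))


# The classical lod threshold of 3 corresponds to a pointwise p of about 1e-4.
p_lod3 = single_marker_p_value(3.0)
```

Reporting this value next to the lod score follows the abstract's recommendation for the one-marker case; multipoint results would need simulation-based p-values instead.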
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sander, A.; Schmelzle, R.; Murray, J.C.
1995-01-01
Van der Woude syndrome (VWS) is an autosomal dominant craniofacial disorder characterized by lip pits, clefting of the primary or secondary palate, and hypodontia. The gene has been localized, by RFLP-based linkage studies, to region 1q32-41 between D1S65-REN and D1S65-TGFB2. In this study we report the linkage analysis of 15 VWS families, using 18 microsatellite markers. Multipoint linkage analysis places the gene, with significant odds of 2,344:1, in a 4.1-cM interval flanked by D1S245 and D1S414. Two-point linkage analysis demonstrates close linkage of VWS with D1S205 (lod score [Z] = 24.41 at θ = .00) and with D1S491 (Z = 21.23 at θ = .00). The results revise the previous assignment of the VWS locus and show in an integrated map of the region 1q32-42 that the VWS gene resides more distally than previously suggested. When information about heterozygosity of the closely linked marker D1S491 in the affected members of the VWS family with a microdeletion is taken into account, the VWS critical region can be further narrowed, to the 3.6-cM interval between D1S491 and D1S414. 38 refs., 3 figs., 2 tabs.
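A two-point lod score of the kind quoted above compares the likelihood of the data at a recombination fraction θ with the likelihood at θ = 0.5 (no linkage). The sketch below covers only the simplest textbook case, assuming phase-known, fully informative meioses that can each be scored as recombinant or non-recombinant; real pedigree analyses such as the one in the abstract use full likelihood software. The counts in the example are invented.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """Two-point lod score Z(theta) = log10[L(theta) / L(0.5)].

    Assumes phase-known, fully informative meioses, so that
    L(theta) = theta**k * (1 - theta)**(n - k) with k recombinants out of n.
    """
    k, n = recombinants, meioses
    if not 0 <= k <= n:
        raise ValueError("recombinants must lie between 0 and meioses")
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    if theta == 0.0 and k > 0:
        # A single observed recombinant rules out complete linkage.
        return float("-inf")
    log_likelihood = 0.0
    if k:
        log_likelihood += k * math.log10(theta)
    if n - k:
        log_likelihood += (n - k) * math.log10(1.0 - theta)
    return log_likelihood - n * math.log10(0.5)


# Hypothetical data: no recombinants in 10 meioses, evaluated at theta = 0.
z_no_recombinants = two_point_lod(0, 10, 0.0)
# Hypothetical data: one recombinant in 10 meioses, evaluated at theta = 0.1.
z_one_recombinant = two_point_lod(1, 10, 0.1)
```

With no recombinants each informative meiosis adds log10(2), about 0.3, to the score at θ = 0, which shows why a maximum at θ = .00 implies that no recombination was observed between marker and trait.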
Caucasian Families Exhibit Significant Linkage of Myopia to Chromosome 11p.
Musolf, Anthony M; Simpson, Claire L; Moiz, Bilal A; Long, Kyle A; Portas, Laura; Murgia, Federico; Ciner, Elise B; Stambolian, Dwight; Bailey-Wilson, Joan E
2017-07-01
Myopia is a common visual disorder caused by eye overgrowth, resulting in blurry vision. It affects one in four Americans, and its prevalence is increasing. The genetic mechanisms that underpin myopia are not completely understood. Here, we use genotype data and linkage analyses to identify high-risk genetic loci that are significantly linked to myopia. Individuals from 56 Caucasian families with a history of myopia were genotyped on an exome-based array, and the single nucleotide polymorphism (SNP) data were merged with microsatellite genotype data. Refractive error measures on the samples were converted into binary phenotypes consisting of affected, unaffected, or unknown myopia status. Parametric linkage analyses assuming an autosomal dominant model with 90% penetrance and 10% phenocopy rate were performed. Single variant two-point analyses yielded three significantly linked SNPs at 11p14.1 and 11p11.2; a further 45 SNPs at 11p were found to be suggestive. No other chromosome had any significant SNPs or more than seven suggestive linkages. Two of the significant SNPs were located in BBOX1-AS1 and one in the intergenic region between ORA47 and TRIM49B. Collapsed haplotype pattern two-point analysis and multipoint analyses also yielded multiple suggestively linked genes at 11p. Multipoint analysis also identified suggestive evidence of linkage on 20q13. We identified three genome-wide significant linked variants on 11p for myopia in Caucasians. Although the novel specific signals still need to be replicated, 11p is a promising region that has been identified by other linkage studies with a number of potentially interesting candidate genes. We hope that the identification of these regions on 11p as potential causal regions for myopia will lead to more focus on these regions and possibly to replication of our specific linkage peaks in other studies.
We further plan targeted sequencing on 11p for our most highly linked families to more clearly understand the source of the linkage in this region.
Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design
Goonetilleke, Shashi N.; March, Timothy J.; Wirthensohn, Michelle G.; Arús, Pere; Walker, Amanda R.; Mather, Diane E.
2017-01-01
In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond (Prunus dulcis Mill. D. A. Webb), application of a double pseudotestcross mapping approach to the F1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars “Nonpareil” and “Lauranne.” Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provide subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond. PMID:29141988
Wide-cross whole-genome radiation hybrid mapping of cotton (Gossypium hirsutum L.).
Gao, Wenxiang; Chen, Z Jeffrey; Yu, John Z; Raska, Dwaine; Kohel, Russell J; Womack, James E; Stelly, David M
2004-01-01
We report the development and characterization of a "wide-cross whole-genome radiation hybrid" (WWRH) panel from cotton (Gossypium hirsutum L.). Chromosomes were segmented by gamma-irradiation of G. hirsutum (n = 26) pollen, and segmented chromosomes were rescued after in vivo fertilization of G. barbadense egg cells (n = 26). A 5-krad gamma-ray WWRH mapping panel (N = 93) was constructed and genotyped at 102 SSR loci. SSR marker retention frequencies were higher than those for animal systems and marker retention patterns were informative. Using the program RHMAP, 52 of 102 SSR markers were mapped into 16 syntenic groups. Linkage group 9 (LG 9) SSR markers BNL0625 and BNL2805 had been colocalized by linkage analysis, but their order was resolved by differential retention among WWRH plants. Two linkage groups, LG 13 and LG 9, were combined into one syntenic group, and the chromosome 1 linkage group marker BNL4053 was reassigned to chromosome 9. Analyses of cytogenetic stocks supported synteny of LG 9 and LG 13 and localized them to the short arm of chromosome 17. They also supported reassignment of marker BNL4053 to the long arm of chromosome 9. A WWRH map of the syntenic group composed of linkage groups 9 and 13 was constructed by maximum-likelihood analysis under the general retention model. The results demonstrate not only the feasibility of WWRH panel construction and mapping, but also complementarity to traditional linkage mapping and cytogenetic methods. PMID:15280245
Li, Faji; Wen, Weie; He, Zhonghu; Liu, Jindong; Jin, Hui; Cao, Shuanghe; Geng, Hongwei; Yan, Jun; Zhang, Pingzhi; Wan, Yingxiu; Xia, Xianchun
2018-06-01
We identified 21 new and stable QTL, and 11 QTL clusters for yield-related traits in three bread wheat populations using the wheat 90 K SNP assay. Identification of quantitative trait loci (QTL) for yield-related traits and closely linked molecular markers is important in order to identify gene/QTL for marker-assisted selection (MAS) in wheat breeding. The objectives of the present study were to identify QTL for yield-related traits and dissect the relationships among different traits in three wheat recombinant inbred line (RIL) populations derived from crosses Doumai × Shi 4185 (D × S), Gaocheng 8901 × Zhoumai 16 (G × Z) and Linmai 2 × Zhong 892 (L × Z). Using the available high-density linkage maps previously constructed with the wheat 90 K iSelect single nucleotide polymorphism (SNP) array, 65, 46 and 53 QTL for 12 traits were identified in the three RIL populations, respectively. Among them, 34, 23 and 27 were likely to be new QTL. Eighteen common QTL were detected across two or three populations. Eleven QTL clusters harboring multiple QTL were detected in different populations, and the interval 15.5-32.3 cM around the Rht-B1 locus on chromosome 4BS harboring 20 QTL is an important region determining grain yield (GY). Thousand-kernel weight (TKW) is significantly affected by kernel width and plant height (PH), whereas flag leaf width can be used to select lines with large kernel number per spike. Eleven candidate genes were identified, including eight cloned genes for kernel, heading date (HD) and PH-related traits as well as predicted genes for TKW, spike length and HD. The closest SNP markers of stable QTL or QTL clusters can be used for MAS in wheat breeding using kompetitive allele-specific PCR or semi-thermal asymmetric reverse PCR assays for improvement of GY.
Mdladla, K; Dzomba, E F; Huson, H J; Muchadeyi, F C
2016-08-01
The sustainability of goat farming in marginal areas of southern Africa depends on local breeds that are adapted to specific agro-ecological conditions. Unimproved non-descript goats are the main genetic resources used for the development of commercial meat-type breeds of South Africa. Little is known about genetic diversity and the genetics of adaptation of these indigenous goat populations. This study investigated the genetic diversity, population structure and breed relations, linkage disequilibrium, effective population size and persistence of gametic phase in goat populations of South Africa. Three locally developed meat-type breeds of the Boer (n = 33), Savanna (n = 31), Kalahari Red (n = 40), a feral breed of Tankwa (n = 25) and unimproved non-descript village ecotypes (n = 110) from four goat-producing provinces of the Eastern Cape, KwaZulu-Natal, Limpopo and North West were assessed using the Illumina Goat 50K SNP Bead Chip assay. The proportion of SNPs with minor allele frequencies >0.05 ranged from 84.22% in the Tankwa to 97.58% in the Xhosa ecotype, with a mean of 0.32 ± 0.13 across populations. Principal components analysis, admixture and pairwise FST identified Tankwa as a genetically distinct population and supported clustering of the populations according to their historical origins. Genome-wide FST identified 101 markers potentially under positive selection in the Tankwa. Average linkage disequilibrium was highest in the Tankwa (r² = 0.25 ± 0.26) and lowest in the village ecotypes (r² range = 0.09 ± 0.12 to 0.11 ± 0.14). We observed an effective population size of <150 for all populations 13 generations ago. The estimated correlations for all breed pairs were lower than 0.80 at marker distances >100 kb with the exception of those in Savanna and Tswana populations.
This study highlights the high level of genetic diversity in South African indigenous goats as well as the utility of the genome-wide SNP marker panels in genetic studies of these populations. © 2016 Stichting International Foundation for Animal Genetics.
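As a rough illustration of the linkage disequilibrium statistic reported in the abstract above, r² between two biallelic SNPs can be computed from one haplotype frequency and the two allele frequencies. The sketch below is a minimal Python version that assumes phased haplotype frequencies are available; the function name `ld_r2` and its arguments are hypothetical and do not come from the study.

```python
def ld_r2(p_ab, p_a, p_b):
    """Squared correlation (r^2) between alleles A and B at two biallelic loci.

    p_ab: frequency of the haplotype carrying both A and B (assumed phased data)
    p_a, p_b: frequencies of allele A and allele B
    """
    d = p_ab - p_a * p_b                       # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)  # product of allele variances
    if denom == 0:
        raise ValueError("both loci must be polymorphic")
    return d * d / denom


# Complete association: allele A always travels with allele B.
print(ld_r2(0.5, 0.5, 0.5))   # 1.0
# Independence: the haplotype frequency equals the product of allele frequencies.
print(ld_r2(0.25, 0.5, 0.5))  # 0.0
```

Averaging such values over SNP pairs within a distance bin would give a per-population summary of the kind quoted above, although the binning used in the study is not stated here.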
Authentication of Piper betle L. folium and quantification of their antifungal-activity.
Wirasuta, I Made Agus Gelgel; Srinadi, I Gusti Ayu Made; Dwidasmara, Ida Bagus Gede; Ardiyanti, Ni Luh Putu Putri; Trisnadewi, I Gusti Ayu Arya; Paramita, Ni Luh Putu Vidya
2017-07-01
The TLC profiles of intra- and inter-day precision for Piper betle L. (PBL) folium methanol extract were studied for peak marker recognition and identification. The numerical chromatographic parameters (NCPs) of the peak markers, hierarchical clustering analysis (HCA) and principal component analysis (PCA) were applied to distinguish the PBL folium extract from folium extracts of other Piper species and to ensure the antifungal activity quality of the PBL essential oil. The spotted extract was developed with a mobile phase of toluene:ethyl acetate (93:7, v/v). The eluted plate was viewed with the TLC-Visualizer and scanned under absorption and fluorescence detection modes, and for each sample the in-situ UV spectra were recorded between 190 and 400 nm. The NCP profiles of the intra- and inter-day precision results offered multi-dimensional chromatogram fingerprints for better marker peak pattern recognition and identification. The r-value fingerprint data series generated with this method allowed more precise discrimination of PBL from other Piper species than the marker peak area fingerprint method. The cosine pair comparison was a simple method for the authentication of two different fingerprints. Ward linkage clustering and the pair cross-correlation comparison were better chemometric methods for determining the consistency of peak area ratios between fingerprints. The first-component PCA loading values of the peak marker area fingerprints correlated linearly with both the bio-marker concentration and the antifungal activity. This relationship could be used to control quality and pharmacological potency. This simple method was developed for the authentication and quantification of herbal medicine.
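The cosine pair comparison mentioned above can be sketched in a few lines. This is a minimal Python illustration that assumes each fingerprint is a vector of marker peak values aligned peak by peak; the function name `cosine_similarity` and the example vectors are hypothetical and are not data from the study.

```python
import math


def cosine_similarity(fp_a, fp_b):
    """Cosine of the angle between two equally long fingerprint vectors."""
    if len(fp_a) != len(fp_b):
        raise ValueError("fingerprints must list the same marker peaks")
    dot = sum(x * y for x, y in zip(fp_a, fp_b))
    norm_a = math.sqrt(sum(x * x for x in fp_a))
    norm_b = math.sqrt(sum(y * y for y in fp_b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("a fingerprint with all-zero peaks has no direction")
    return dot / (norm_a * norm_b)


# Hypothetical peak areas: the second sample is the first at twice the loading,
# so the peak pattern is identical and the similarity is close to 1.
reference = [1.0, 2.0, 3.0]
sample = [2.0, 4.0, 6.0]
print(cosine_similarity(reference, sample))
```

Because the cosine ignores overall scale, it compares the pattern of peaks rather than the amount of extract applied, which is one reason it suits a quick pairwise check.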
Elsheikha, H. M.; Schott, H. C.; Mansfield, L. S.
2006-01-01
Sarcocystis neurona causes serious neurological disease in horses and other vertebrates in the Americas. Based on epidemiological data, this parasite has recently emerged. Here, the genetic diversity of Sarcocystis neurona was evaluated using the amplified fragment length polymorphism (AFLP) method. Fifteen S. neurona taxa from different regions collected over the last 10 years were used; six isolates were from clinically diseased horses, eight isolates were from wild-caught opossums (Didelphis virginiana), and one isolate was from a cowbird (Molothrus ater). Additionally, four outgroup taxa were also fingerprinted. Nine primer pairs were used to generate AFLP patterns, with a total number of amplified fragments ranging from 30 to 60, depending on the isolate and primers tested. Based on the presence/absence of amplified AFLP fragments and pairwise similarity values, all the S. neurona isolates tested were clustered in one monophyletic group. No significant correlation could be found between genomic similarity and host origin of the S. neurona isolates. AFLP revealed significant intraspecific genetic variations, and S. neurona appeared as a highly variable species. Furthermore, linkage disequilibrium analysis suggested that S. neurona populations within Michigan have an intermediate type of population structure that includes characteristics of both clonal and panmictic population structures. AFLP is a reliable molecular technique that has provided one of the most informative approaches to ascertain phylogenetic relationships in S. neurona and its closest relatives, allowing them to be clustered by relative similarity using band matching and unweighted pair group method with arithmetic mean analysis, which may be applicable to other related protozoal species. PMID:16714575
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wirtz, M.K.; Acott, T.S.; Samples, J.R.
1994-09-01
The gene for one form of juvenile glaucoma has been mapped to chromosome 1q21-q31. This raises the possibility that primary open-angle glaucoma (POAG) also maps to this region if the same defective gene causes both diseases. To address this question, linkage analysis was performed on a large POAG kindred. Blood samples or skin biopsies were obtained from 40 members of this family. Individuals were diagnosed as having POAG if they met two or more of the following criteria: (1) visual field defects compatible with glaucoma on automated perimetry; (2) optic nerve head and/or nerve fiber layer analysis compatible with glaucomatous damage; (3) high intraocular pressures (> 20 mm Hg). Patients were considered glaucoma suspects if they met only one criterion. These individuals were excluded from the analysis. Of the 40 members, seven were diagnosed with POAG; four were termed suspects. The earliest age of onset was 38 years, while the average age of onset was 65 years. We performed two-point and multipoint linkage analysis using five markers that encompass the region 1q21-q31: D1S194, D1S210, D1S212, D1S191 and LAMB2. Two-point lod scores excluded tight linkage with all markers except D1S212 (maximum lod score of 1.07 at theta = 0.0). In the multipoint analysis, including D1S210-D1S212-LAMB2 and POAG, the entire 11 cM region spanned by these markers was excluded for linkage with POAG; that is, lod scores were < -2.0. In conclusion, POAG in this family does not map to chromosome 1q21-q31, and thus the family carries a gene that is distinct from the juvenile glaucoma gene.
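To make the lod score thresholds in the abstract above concrete, the following Python sketch computes a two-point lod score for the simplest textbook case: fully informative, phase-known meioses in which recombinants can be counted directly. This is an assumption made for illustration only; the study analyzed a real pedigree, where likelihoods are computed by dedicated linkage software. The function name `two_point_lod` is hypothetical.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """Lod score log10[L(theta) / L(0.5)] for counted, phase-known meioses."""
    k, n = recombinants, meioses
    if not 0 <= k <= n:
        raise ValueError("recombinants must lie between 0 and meioses")
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie between 0 and 0.5")
    if theta == 0.0:
        # Any observed recombinant is impossible under complete linkage.
        return float("-inf") if k > 0 else n * math.log10(2)
    log_l_theta = k * math.log10(theta) + (n - k) * math.log10(1 - theta)
    log_l_null = n * math.log10(0.5)  # free recombination
    return log_l_theta - log_l_null


# Ten meioses with no recombinants give the maximum score at theta = 0.
print(round(two_point_lod(0, 10, 0.0), 2))   # 3.01
# Four recombinants in ten meioses are strong evidence against tight linkage.
print(two_point_lod(4, 10, 0.01) < -2.0)     # True
```

A score below -2 at a given recombination fraction is the conventional exclusion criterion, which is the sense in which the 11 cM region above is described as excluded.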
Hiras, Jennifer; Wu, Yu-Wei; Eichorst, Stephanie A.; ...
2015-09-01
Recent studies have expanded the phylum Chlorobi, demonstrating that the green sulfur bacteria (GSB), the original cultured representatives of the phylum, are a part of a larger lineage whose members have more diverse metabolic capabilities that overlap with members of the phylum Bacteroidetes. The 16S rRNA gene of an uncultivated clone, OPB56, distantly related to the phyla Chlorobi and Bacteroidetes, was recovered from Obsidian Pool in Yellowstone National Park; however, the detailed phylogeny and function of OPB56 and related clones have remained unknown. Culturing of thermophilic bacterial consortia from compost by adaptation to grow on ionic-liquid pretreated switchgrass provided a consortium in which one of the most abundant members, NICIL-2, clustered with OPB56-related clones. Phylogenetic analysis using the full-length 16S rRNA gene from NICIL-2 demonstrated that it was part of a monophyletic clade, referred to as OPB56, distinct from the Bacteroidetes and Chlorobi. A near complete draft genome (>95% complete) was recovered from metagenomic data from the culture adapted to grow on ionic-liquid pretreated switchgrass using an automated binning algorithm, and this genome was used for marker gene-based phylogenetic analysis and metabolic reconstruction. Six additional genomes related to NICIL-2 were reconstructed from metagenomic data sets obtained from thermal springs at Yellowstone National Park and Nevada Great Boiling Spring. In contrast to the 16S rRNA gene phylogenetic analysis, protein phylogenetic analysis was most consistent with the clustering of the Chlorobea, Ignavibacteria and OPB56 into a single phylum level clade. Metabolic reconstruction of NICIL-2 demonstrated a close linkage with the class Ignavibacteria and the family Rhodothermaceae, a deeply branching Bacteroidetes lineage.
The combined phylogenetic and functional analysis of the NICIL-2 genome has refined the membership in the phylum Chlorobi and emphasized the close evolutionary and metabolic relationship between the phyla Chlorobi and the Bacteroidetes.
Hiras, Jennifer; Wu, Yu-Wei; Eichorst, Stephanie A; Simmons, Blake A; Singer, Steven W
2016-04-01
Recent studies have expanded the phylum Chlorobi, demonstrating that the green sulfur bacteria (GSB), the original cultured representatives of the phylum, are a part of a broader lineage whose members have more diverse metabolic capabilities that overlap with members of the phylum Bacteroidetes. The 16S rRNA gene of an uncultivated clone, OPB56, distantly related to the phyla Chlorobi and Bacteroidetes, was recovered from Obsidian Pool in Yellowstone National Park; however, the detailed phylogeny and function of OPB56 and related clones have remained unknown. Culturing of thermophilic bacterial consortia from compost by adaptation to grow on ionic-liquid pretreated switchgrass provided a consortium in which one of the most abundant members, NICIL-2, clustered with OPB56-related clones. Phylogenetic analysis using the full-length 16S rRNA gene from NICIL-2 demonstrated that it was part of a monophyletic clade, referred to as OPB56, distinct from the Bacteroidetes and Chlorobi. A near complete draft genome (>95% complete) was recovered from metagenomic data from the culture adapted to grow on ionic-liquid pretreated switchgrass using an automated binning algorithm, and this genome was used for marker gene-based phylogenetic analysis and metabolic reconstruction. Six additional genomes related to NICIL-2 were reconstructed from metagenomic data sets obtained from thermal springs at Yellowstone National Park and Nevada Great Boiling Spring. In contrast to the 16S rRNA gene phylogenetic analysis, protein phylogenetic analysis was most consistent with the clustering of the Chlorobea, Ignavibacteria and OPB56 into a single phylum level clade. Metabolic reconstruction of NICIL-2 demonstrated a close linkage with the class Ignavibacteria and the family Rhodothermaceae, a deeply branching Bacteroidetes lineage. 
The combined phylogenetic and functional analysis of the NICIL-2 genome has refined the membership in the phylum Chlorobi and emphasized the close evolutionary and metabolic relationship between the phyla Chlorobi and the Bacteroidetes.
Wolf, Elizabeth; Herbeck, Joshua T; Van Rompaey, Stephen; Kitahata, Mari; Thomas, Katherine; Pepper, Gregory; Frenkel, Lisa
2017-04-01
HIV-1 incidence among youth, especially men who have sex with men (MSM), is increasing in the United States. We aimed to better understand the patterns of adolescent HIV-1 acquisition, to help guide future prevention interventions. We conducted a study combining epidemiologic and HIV-1 pol sequence data from a retrospective cohort of HIV-infected adults and adolescents in Seattle, WA between 2000 and 2013. Adolescents were defined as 13-24 years of age at the time of first HIV-1 care. Maximum-likelihood phylogenetic trees were reconstructed to identify putative viral transmission clusters of two or more individuals, followed by multivariable regression tests of associations between clustering and demographic and clinical parameters. The dataset included 3,102 sequences from 1,953 individuals; 72 putative transmission clusters were identified, representing 168 individuals (8.6%). MSM and MSM/intravenous drug use (IDU) were positively associated with clustering, with aOR 3.18 (95% CI: 1.34-7.55) and 2.59 (95% CI: 1.04-6.49), respectively. African American race was negatively associated with clustering (aOR 0.54 95% CI: 0.32-0.91). Twenty-five clusters contained one adolescent and five clusters contained two adolescents. Other individuals who clustered with adolescents were predominantly male (95%), white (85%), and either MSM (66%) or MSM/IDU (16%), with a greater mean age (34 years vs. 22 years; p < .01). In this Seattle cohort, HIV-1 transmission linkages were identified between white male adolescents and older MSM adults. Interventions aimed at age-discrepant pairs may reduce HIV-1 infections in adolescent males.
Improved Gravitation Field Algorithm and Its Application in Hierarchical Clustering
Zheng, Ming; Sun, Ying; Liu, Gui-xia; Zhou, You; Zhou, Chun-guang
2012-01-01
Background: The gravitation field algorithm (GFA) is a new optimization algorithm based on an imitation of natural phenomena. GFA performs well in searching for both the global minimum and multiple minima in computational biology. However, GFA needs to be made more efficient and to be modified before it can be applied to some discrete data problems in systems biology. Method: An improved GFA, called IGFA, is proposed in this paper. Two parts were improved in IGFA. The first is the rule of random division, which is a reasonable strategy and shortens the running time. The other is the rotation factor, which can improve the accuracy of IGFA. To apply IGFA to hierarchical clustering, the initial part and the movement operator were modified. Results: Two kinds of experiments were used to test IGFA, and IGFA was also applied to hierarchical clustering. The global minimum experiment compared IGFA with GFA, GA (genetic algorithm) and SA (simulated annealing). The multi-minima experiment compared IGFA with GFA. The results of the two experiments demonstrated the efficiency of IGFA: it is better than GFA in both accuracy and running time. For hierarchical clustering, IGFA was used to optimize the smallest distance between gene pairs, and the results were compared with those of GA, SA, single-linkage clustering and UPGMA. The efficiency of IGFA was confirmed. PMID:23173043
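For readers unfamiliar with the two baseline methods named in the abstract above, single-linkage clustering and UPGMA differ only in how the distance between two clusters is defined. The Python sketch below is a naive agglomerative procedure written for illustration; it is not the IGFA method and makes no claim about the paper's implementation. The function name `agglomerate` and the toy distances are hypothetical.

```python
from itertools import combinations


def agglomerate(dist, n, linkage="single"):
    """Naive agglomerative clustering of items 0..n-1.

    dist maps frozenset({i, j}) to the distance between items i and j.
    linkage is "single" (closest pair) or "average" (UPGMA).
    Returns the merges in order as (members_a, members_b, distance).
    """
    clusters = [frozenset([i]) for i in range(n)]
    merges = []

    def between(a, b):
        pair_d = [dist[frozenset((i, j))] for i in a for j in b]
        if linkage == "single":
            return min(pair_d)
        return sum(pair_d) / len(pair_d)  # mean over all cross-cluster pairs

    while len(clusters) > 1:
        # Merge the two clusters that are closest under the chosen linkage.
        a, b = min(combinations(clusters, 2), key=lambda p: between(*p))
        merges.append((sorted(a), sorted(b), between(a, b)))
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return merges


# Toy example: four items placed at positions 0, 1, 3 and 7 on a line.
pts = [0, 1, 3, 7]
dist = {frozenset((i, j)): abs(pts[i] - pts[j])
        for i in range(4) for j in range(i + 1, 4)}
print([m[2] for m in agglomerate(dist, 4, "single")])   # [1, 2, 4]
print(agglomerate(dist, 4, "average")[1][2])            # 2.5
```

The merge order is the same for both criteria in this toy case, but the merge heights differ, which is what changes the shape of the resulting dendrogram.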
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haines, J.L.; Worster, T.; Ter-Minassian, M.
The loci for juvenile (CLN3) and infantile (CLN1) neuronal ceroid lipofuscinosis (NCL) types have been mapped by genetic linkage analysis to chromosome arms 16p and 1p, respectively. The late-infantile defect CLN2 has not yet been mapped, although linkage analysis with tightly linked markers excludes it from both the JNCL and INCL loci. We have initiated a genome-wide search for the LNCL gene, taking advantage of the large collection of highly polymorphic markers that has been developed through the Human Genome Initiative. The high degree of heterozygosity of these markers makes it possible to carry out successful linkage analysis in small nuclear families, such as found in LNCL. Our current collection of LNCL pedigrees includes 19 US families and 11 Costa Rican families. To date, we have completed typing with over 50 markers on chromosomes 2, 9, 13, and 18-22. The results of this analysis formally exclude about 10% of the human genome as the location of the LNCL gene. 14 refs., 3 tabs.
Robustness of linkage strategy that leads to large-scale cooperation.
Inaba, Misato; Takahashi, Nobuyuki; Ohtsuki, Hisashi
2016-11-21
One of the best-known models used to characterize cooperation among unrelated individuals is the social dilemma (SD). However, there is no consensus about how to solve the SD by itself. Since SDs are often embedded in other social interactions, including indirect reciprocity games (IR), humans can coordinate their behaviors across multiple games. Such coordination is called 'linkage'. Recently, linkage has been considered a promising solution to SDs, since excluding SD defectors (i.e. those who defected in SD) from indirectly reciprocal relationships functions as a costless sanction. A previous study performed mathematical modeling and revealed that a linkage strategy, which cooperates in SD and engages in the Standing strategy in IR based on the recipients' behaviors in both SD and IR, was an ESS against a non-linkage strategy which defects in SD and engages in the Standing strategy in IR based on recipients' behaviors only in IR (Panchanathan and Boyd, 2004). In order to investigate the robustness of the linkage strategy, we devised a non-linkage strategy that cooperates in SD but does not link the two games. First, we conducted a mathematical analysis and demonstrated that the linkage strategy was not an ESS against this cooperating non-linkage strategy. Second, we conducted a series of agent-based computer simulations to examine how the strategies perform in situations in which various types of errors can occur. Our results showed that the linkage strategy was an ESS only when there are implementation errors in SD. However, the equilibrium of the linkage strategy was unstable when there are perception errors. Since we know that humans are not free from perception errors in their social life, future studies will need to show how perception errors can be overcome in order to provide support for the conclusion that linkage is a plausible solution to SDs. Copyright © 2016 Elsevier Ltd. All rights reserved.
Nested association mapping of stem rust resistance in wheat using genotyping by sequencing
USDA-ARS's Scientific Manuscript database
Nested association mapping is an approach to map trait loci in which families within populations are interconnected by a common parent. By implementing joint-linkage association analysis, this approach is able to map causative loci with higher power and resolution compared to biparental linkage mapp...
Joint QTL linkage mapping for multiple-cross mating design sharing one common parent
USDA-ARS's Scientific Manuscript database
Nested association mapping (NAM) is a novel genetic mating design that combines the advantages of linkage analysis and association mapping. This design provides opportunities to study the inheritance of complex traits, but also requires more advanced statistical methods. In this paper, we present th...
University-Industry Linkages in Developing Countries: Perceived Effect on Innovation
ERIC Educational Resources Information Center
Vaaland, Terje I.; Ishengoma, Esther
2016-01-01
Purpose: The purpose of this paper is to assess the perceptions of both universities and the resource-extractive companies on the influence of university-industry linkages (UILs) on innovation in a developing country. Design/Methodology/Approach: A total of 404 respondents were interviewed. Descriptive analysis and multinomial logistic regression…
Further evidence for the increased power of LOD scores compared with nonparametric methods.
Durner, M; Vieland, V J; Greenberg, D A
1999-01-01
In genetic analysis of diseases in which the underlying model is unknown, "model free" methods, such as affected sib pair (ASP) tests, are often preferred over LOD-score methods, although LOD-score methods under the correct or even approximately correct model are more powerful than ASP tests. However, there might be circumstances in which nonparametric methods will outperform LOD-score methods. Recently, Dizier et al. reported that, in some complex two-locus (2L) models, LOD-score methods with segregation analysis-derived parameters had less power to detect linkage than ASP tests. We investigated whether these particular models, in fact, represent a situation in which ASP tests are more powerful than LOD scores. We simulated data according to the parameters specified by Dizier et al. and analyzed the data by using (a) a single-locus (SL) LOD-score analysis performed twice, under a simple dominant and a recessive mode of inheritance (MOI), (b) ASP methods, and (c) nonparametric linkage (NPL) analysis. We show that SL analysis performed twice and corrected for the type I-error increase due to multiple testing yields almost as much linkage information as does an analysis under the correct 2L model and is more powerful than either the ASP method or the NPL method. We demonstrate that, even for complex genetic models, the most important condition for linkage analysis is that the assumed MOI at the disease locus being tested is approximately correct, not that the inheritance of the disease per se is correctly specified. In the analysis by Dizier et al., segregation analysis led to estimates of dominance parameters that were grossly misspecified for the locus tested in those models in which ASP tests appeared to be more powerful than LOD-score analyses.
The web graph of a tourism system
NASA Astrophysics Data System (ADS)
Baggio, Rodolfo
2007-06-01
The website network of a tourism destination is examined. Network theoretic metrics are used to gauge the static and dynamic characteristics of the webspace. The topology of the network is found partly similar to the one exhibited by similar systems. However, some differences are found, mainly due to the relatively poor connectivity and clusterisation of the network. These results are interpreted by considering the formation mechanisms and the connotation of the linkages between websites. Clustering and assortativity coefficients are proposed as quantitative estimations of the degree of collaboration and cooperation among destination stakeholders.
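The clustering coefficient proposed above as a proxy for collaboration can be illustrated with a short Python sketch. It treats the website network as an undirected graph stored as an adjacency dictionary, which is a simplifying assumption (hyperlinks are directed), and it does not cover the assortativity coefficient. The function names and the toy graph are hypothetical.

```python
from itertools import combinations


def local_clustering(adj, node):
    """Fraction of a node's neighbour pairs that are themselves linked."""
    nbrs = adj[node]
    k = len(nbrs)
    if k < 2:
        return 0.0  # no neighbour pairs to close into triangles
    links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
    return 2 * links / (k * (k - 1))


def average_clustering(adj):
    """Mean of the local clustering coefficients over all nodes."""
    return sum(local_clustering(adj, n) for n in adj) / len(adj)


# Hypothetical webspace: three mutually linked sites plus one peripheral site.
web = {
    "hotel": {"museum", "board"},
    "museum": {"hotel", "board"},
    "board": {"hotel", "museum", "guide"},
    "guide": {"board"},
}
print(local_clustering(web, "hotel"))   # 1.0
print(round(average_clustering(web), 3))  # 0.583
```

A low average value, as reported for the destination studied, means that sites linked to a common neighbour rarely link to each other.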
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dixon, M.J.; Dixon, J.; Houseal, T.
Treacher Collins syndrome (TCOF1) is an autosomal dominant disorder of craniofacial development, the features of which include conductive hearing loss and cleft palate. The TCOF1 locus has been localized to chromosome 5q32-33.2. In the present study the authors have used the combined techniques of genetic linkage analysis and fluorescence in situ hybridization (FISH) to more accurately define the TCOF1 critical region. Cosmids IG90 and SPARC, which map to distal 5q, encompass two and one hypervariable microsatellite markers, respectively. The heterozygosity values of these three markers range from .72 to .81. Twenty-two unrelated TCOF1 families have been analyzed for linkage to these markers. There is strong evidence demonstrating linkage to all three markers, the strongest support for positive linkage being provided by haplotyping those markers at the locus encompassed by the cosmid IG90 (Z_max = 19.65; θ = .010). FISH to metaphase chromosomes and interphase nuclei established that IG90 lies centromeric to SPARC. This information combined with the data generated by genetic linkage analysis demonstrated that the TCOF1 locus is closely flanked proximally by IG90 and distally by SPARC. 30 refs., 2 figs., 4 tabs.
Familial isolated hyperparathyroidism is linked to a 1.7 Mb region on chromosome 2p13.3–14
Warner, J; Nyholt, D R; Busfield, F; Epstein, M; Burgess, J; Stranks, S; Hill, P; Perry‐Keene, D; Learoyd, D; Robinson, B; Teh, B T; Prins, J B; Cardinal, J W
2006-01-01
Background: Familial isolated hyperparathyroidism (FIHP) is an autosomal dominantly inherited form of primary hyperparathyroidism. Although comprising only about 1% of cases of primary hyperparathyroidism, identification and functional analysis of a causative gene for FIHP is likely to advance our understanding of parathyroid physiology and pathophysiology. Methods: A genome‐wide screen of DNA from seven pedigrees with FIHP was undertaken in order to identify a region of genetic linkage with the disorder. Results: Multipoint linkage analysis identified a region of suggestive linkage (LOD score 2.68) on chromosome 2. Fine mapping with the addition of three other families revealed significant linkage adjacent to D2S2368 (maximum multipoint LOD score 3.43). Recombination events defined a 1.7 Mb region of linkage between D2S2368 and D2S358 in nine pedigrees. Sequencing of the two most likely candidate genes in this region, however, did not identify a gene for FIHP. Conclusions: We conclude that a causative gene for FIHP lies within this interval on chromosome 2. This is a major step towards eventual precise identification of a gene for FIHP, likely to be a key component in the genetic regulation of calcium homeostasis. PMID:16525030
Wood degradation under UV irradiation: A lignin characterization.
Cogulet, Antoine; Blanchet, Pierre; Landry, Véronic
2016-05-01
The photodegradation of white spruce by artificial ageing was studied by several techniques: colourimetry, FTIR-ATR and FT-Raman spectroscopy. Samples were exposed to a xenon lamp for 2000 h. Two distinct colour changes were found by colourimetric analysis: yellowing and silvering. These colour modifications indicate the formation of chromophoric structures, which supports previous FTIR-ATR experiments. The degradation of lignin generates the first chromophoric group, responsible for yellowing, and is followed by the appearance of surface-layer cellulose. New carbonyl compounds conjugated with a double bond at 1615 cm⁻¹ are probably the second chromophoric group. The crystallinity index was also calculated and showed an increase in cellulose crystallinity due to the prior degradation of amorphous cellulose. The FT-Raman analysis confirms the sensitivity of wood to photodegradation, but the most remarkable result is the increase of fluorescence as a function of time. In softwood lignin, the compound able to produce fluorescence is a free-rotating 5-5' linkage of a biphenyl structure. In the native state these linkages are not free to rotate, so this phenomenon indicates the release of the 5-5' linkage of the lignin structure by cleavage of both α carbon linkages (Norrish type I reaction). These data also confirm the photosensitivity of the α and β carbons in lignin and the resistance of 5-5' linkages. Copyright © 2016 Elsevier B.V. All rights reserved.
Yu, Yang; Zhang, Xiaojun; Yuan, Jianbo; Li, Fuhua; Chen, Xiaohan; Zhao, Yongzhen; Huang, Long; Zheng, Hongkun; Xiang, Jianhai
2015-01-01
The Pacific white shrimp Litopenaeus vannamei is the dominant crustacean species in global seafood mariculture. Understanding the genome and genetic architecture is useful for deciphering complex traits and accelerating the breeding program in shrimp. In this study, a genome survey was conducted and a high-density linkage map was constructed using a next-generation sequencing approach. The genome survey was used to identify preliminary genome characteristics and to generate a rough reference for linkage map construction. De novo SNP discovery resulted in 25,140 polymorphic markers. A total of 6,359 high-quality markers were selected for linkage map construction based on marker coverage among individuals and read depths. For the linkage map, a total of 6,146 markers spanning 4,271.43 cM were mapped to 44 sex-averaged linkage groups, with an average marker distance of 0.7 cM. An integration analysis linked 5,885 genome scaffolds and 1,504 BAC clones to the linkage map. Based on the high-density linkage map, several QTLs for body weight and body length were detected. This high-density genetic linkage map reveals basic genomic architecture and will be useful for comparative genomics research, genome assembly and genetic improvement of L. vannamei and other penaeid shrimp species. PMID:26503227
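The average marker distance quoted above can be checked from the other figures in the abstract. The Python sketch below assumes the average is the total map length divided by the number of marker intervals, where each linkage group with m markers contributes m - 1 intervals; the abstract does not state which convention was used, and the function name `mean_marker_spacing` is hypothetical.

```python
def mean_marker_spacing(total_cm, n_markers, n_groups):
    """Mean distance in cM between adjacent markers on a linkage map."""
    intervals = n_markers - n_groups  # one fewer interval than markers per group
    if intervals <= 0:
        raise ValueError("need more markers than linkage groups")
    return total_cm / intervals


# Figures from the abstract: 6,146 markers, 4,271.43 cM, 44 linkage groups.
spacing = mean_marker_spacing(4271.43, 6146, 44)
print(round(spacing, 1))  # 0.7
```

Dividing by the raw marker count instead also rounds to 0.7 cM, so the reported figure is consistent under either convention.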
Natural Variation of Epstein-Barr Virus Genes, Proteins, and Primary MicroRNA.
Correia, Samantha; Palser, Anne; Elgueta Karstegl, Claudio; Middeldorp, Jaap M; Ramayanti, Octavia; Cohen, Jeffrey I; Hildesheim, Allan; Fellner, Maria Dolores; Wiels, Joelle; White, Robert E; Kellam, Paul; Farrell, Paul J
2017-08-01
Viral gene sequences from an enlarged set of about 200 Epstein-Barr virus (EBV) strains, including many primary isolates, have been used to investigate variation in key viral genetic regions, particularly LMP1, Zp, gp350, EBNA1, and the BART microRNA (miRNA) cluster 2. Determination of type 1 and type 2 EBV in saliva samples from people from a wide range of geographic and ethnic backgrounds demonstrates a small percentage of healthy white Caucasian British people carrying predominantly type 2 EBV. Linkage of Zp and gp350 variants to type 2 EBV is likely to be due to their genes being adjacent to the EBNA3 locus, which is one of the major determinants of the type 1/type 2 distinction. A novel classification of EBNA1 DNA binding domains, named QCIGP, results from phylogeny analysis of their protein sequences but is not linked to the type 1/type 2 classification. The BART cluster 2 miRNA region is classified into three major variants through single-nucleotide polymorphisms (SNPs) in the primary miRNA outside the mature miRNA sequences. These SNPs can result in altered levels of expression of some miRNAs from the BART variant frequently present in Chinese and Indonesian nasopharyngeal carcinoma (NPC) samples. The EBV genetic variants identified here provide a basis for future, more directed analysis of association of specific EBV variations with EBV biology and EBV-associated diseases. IMPORTANCE Incidence of diseases associated with EBV varies greatly in different parts of the world. Thus, relationships between EBV genome sequence variation and health, disease, geography, and ethnicity of the host may be important for understanding the role of EBV in diseases and for development of an effective EBV vaccine. This paper provides the most comprehensive analysis so far of variation in specific EBV genes relevant to these diseases and proposed EBV vaccines. 
By focusing on variation in LMP1, Zp, gp350, EBNA1, and the BART miRNA cluster 2, new relationships with the known type 1/type 2 strains are demonstrated, and a novel classification of EBNA1 and the BART miRNAs is proposed. Copyright © 2017 Correia et al.
Vikram, Prashant; Swamy, B. P. Mallikarjuna; Dixit, Shalabh; Singh, Renu; Singh, Bikram P.; Miro, Berta; Kohli, Ajay; Henry, Amelia; Singh, N. K.; Kumar, Arvind
2015-01-01
Green Revolution (GR) rice varieties are high yielding but typically drought sensitive. This is partly due to the tight linkage between the loci governing plant height and drought tolerance. This linkage is illustrated here through characterization of qDTY1.1, a QTL for grain yield under drought that co-segregates with the GR gene sd1 for semi-dwarf plant height. We report that the loss of the qDTY1.1 allele during the GR was due to its tight linkage in repulsion with the sd1 allele. Other drought-yield QTLs (qDTY) also showed tight linkage with traits rejected in GR varieties. Genetic diversity analysis for 11 different qDTY regions grouped GR varieties separately from traditional drought-tolerant varieties, and showed lower frequency of drought tolerance alleles. The increased understanding and breaking of the linkage between drought tolerance and undesirable traits has led to the development of high-yielding drought-tolerant dwarf lines with positive qDTY alleles and provides new hope for extending the benefits of the GR to drought-prone rice-growing regions. PMID:26458744
Eberts, Rebecca L.; Wissel, Bjorn; Simpson, Gavin L.; Crawford, Stephen S.; Stott, Wendylee; Hanner, Robert H.; Manzon, Richard G.; Wilson, Joanna Y.; Boreham, Douglas R.; Somers, Christopher M.
2017-01-01
Lake Whitefish Coregonus clupeaformis is the most commercially valuable species in Lake Huron. The fishery for this species has historically been managed based on 25 management units (17 in Canada, 8 in the USA). However, congruence between the contemporary population structure of Lake Whitefish and management units is poorly understood. We used stable isotopes of carbon (δ13C) and nitrogen (δ15N), food web markers that reflect patterns in resource use (i.e., prey, location, habitat), to assess the population structure of spawning-phase Lake Whitefish collected from 32 sites (1,474 fish) across Lake Huron. We found large isotopic variation among fish from different sites (ranges: δ13C = 10.2‰, δ15N = 5.5‰) and variable niche size and levels of overlap (standard ellipse area = 1.0–4.3‰²). Lake Huron contained spawning-phase fish from four major isotopic clusters largely defined by extensive variation in δ13C, and the isotopic composition of fish sampled was spatially structured both within and between lake basins. Based on cluster compositions, we identified six putative regional groups, some of which represented sites of high diversity (three to four clusters) and others with less (one to two clusters). Analysis of isotopic values from Lake Whitefish collected from summer feeding locations and baseline prey items showed similar isotopic variation and established spatial linkage between spawning-phase and summer fish. Our results show that summer feeding location contributes strongly to the isotopic structure we observed in spawning-phase fish. One of the regional groups we identified in northern Georgian Bay is highly distinct based on isotopic composition and possibly ecologically unique within Lake Huron. Our findings are congruent with several previous studies using different markers (genetics, mark–recapture), and we conclude that current management units are generally too small and numerous to reflect the population structure of Lake Whitefish in Lake Huron.
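The standard ellipse area used above as a measure of niche size can be illustrated with a short calculation. This is a minimal sketch, assuming the common definition in which the area equals π times the square root of the determinant of the sample covariance matrix of the two isotope values. The abstract does not state which estimator or small-sample correction the authors applied, and the function name and the example values are hypothetical.

```python
import math


def standard_ellipse_area(d13c, d15n):
    """Standard ellipse area (permil squared) of paired isotope values.

    Assumes SEA = pi * sqrt(det(S)), where S is the 2x2 sample
    covariance matrix of the delta-13C and delta-15N values.
    """
    n = len(d13c)
    if n != len(d15n) or n < 3:
        raise ValueError("need at least three paired observations")
    mean_c = sum(d13c) / n
    mean_n = sum(d15n) / n
    # Sample (n - 1) variances and covariance.
    var_c = sum((x - mean_c) ** 2 for x in d13c) / (n - 1)
    var_n = sum((y - mean_n) ** 2 for y in d15n) / (n - 1)
    cov = sum((x - mean_c) * (y - mean_n) for x, y in zip(d13c, d15n)) / (n - 1)
    det = var_c * var_n - cov ** 2
    # Guard against tiny negative values from floating-point rounding.
    return math.pi * math.sqrt(max(det, 0.0))


# Made-up values for illustration only, not data from the study.
example_d13c = [-24.1, -22.8, -25.3, -23.0, -21.9]
example_d15n = [9.8, 10.4, 9.1, 11.0, 10.2]
print(f"SEA = {standard_ellipse_area(example_d13c, example_d15n):.2f} permil^2")
```

Because the area depends on both variances and their covariance, a site whose fish vary widely in δ13C but little in δ15N can still have a small ellipse.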