Temporal scaling and spatial statistical analyses of groundwater level fluctuations
NASA Astrophysics Data System (ADS)
Sun, H.; Yuan, L., Sr.; Zhang, Y.
2017-12-01
Natural dynamics such as groundwater level fluctuations can exhibit multifractionality and/or multifractality due likely to multi-scale aquifer heterogeneity and controlling factors, whose statistics requires efficient quantification methods. This study explores multifractionality and non-Gaussian properties in groundwater dynamics expressed by time series of daily level fluctuation at three wells located in the lower Mississippi valley, after removing the seasonal cycle in the temporal scaling and spatial statistical analysis. First, using the time-scale multifractional analysis, a systematic statistical method is developed to analyze groundwater level fluctuations quantified by the time-scale local Hurst exponent (TS-LHE). Results show that the TS-LHE does not remain constant, implying the fractal-scaling behavior changing with time and location. Hence, we can distinguish the potentially location-dependent scaling feature, which may characterize the hydrology dynamic system. Second, spatial statistical analysis shows that the increment of groundwater level fluctuations exhibits a heavy tailed, non-Gaussian distribution, which can be better quantified by a Lévy stable distribution. Monte Carlo simulations of the fluctuation process also show that the linear fractional stable motion model can well depict the transient dynamics (i.e., fractal non-Gaussian property) of groundwater level, while fractional Brownian motion is inadequate to describe natural processes with anomalous dynamics. Analysis of temporal scaling and spatial statistics therefore may provide useful information and quantification to understand further the nature of complex dynamics in hydrology.
DEIVA: a web application for interactive visual analysis of differential gene expression profiles.
Harshbarger, Jayson; Kratz, Anton; Carninci, Piero
2017-01-07
Differential gene expression (DGE) analysis is a technique to identify statistically significant differences in RNA abundance for genes or arbitrary features between different biological states. The result of a DGE test is typically further analyzed using statistical software, spreadsheets or custom ad hoc algorithms. We identified a need for a web-based system to share DGE statistical test results, and locate and identify genes in DGE statistical test results with a very low barrier of entry. We have developed DEIVA, a free and open source, browser-based single page application (SPA) with a strong emphasis on being user friendly that enables locating and identifying single or multiple genes in an immediate, interactive, and intuitive manner. By design, DEIVA scales with very large numbers of users and datasets. Compared to existing software, DEIVA offers a unique combination of design decisions that enable inspection and analysis of DGE statistical test results with an emphasis on ease of use.
Statistical parsimony networks and species assemblages in Cephalotrichid nemerteans (nemertea).
Chen, Haixia; Strand, Malin; Norenburg, Jon L; Sun, Shichun; Kajihara, Hiroshi; Chernyshev, Alexey V; Maslakova, Svetlana A; Sundberg, Per
2010-09-21
It has been suggested that statistical parsimony network analysis could be used to get an indication of species represented in a set of nucleotide data, and the approach has been used to discuss species boundaries in some taxa. Based on 635 base pairs of the mitochondrial protein-coding gene cytochrome c oxidase I (COI), we analyzed 152 nemertean specimens using statistical parsimony network analysis with the connection probability set to 95%. The analysis revealed 15 distinct networks together with seven singletons. Statistical parsimony yielded three networks supporting the species status of Cephalothrix rufifrons, C. major and C. spiralis as they currently have been delineated by morphological characters and geographical location. Many other networks contained haplotypes from nearby geographical locations. Cladistic structure by maximum likelihood analysis overall supported the network analysis, but indicated a false positive result where subnetworks should have been connected into one network/species. This probably is caused by undersampling of the intraspecific haplotype diversity. Statistical parsimony network analysis provides a rapid and useful tool for detecting possible undescribed/cryptic species among cephalotrichid nemerteans based on COI gene. It should be combined with phylogenetic analysis to get indications of false positive results, i.e., subnetworks that would have been connected with more extensive haplotype sampling.
Gómez, Miguel A; Lorenzo, Alberto; Barakat, Rubén; Ortega, Enrique; Palao, José M
2008-02-01
The aim of the present study was to identify game-related statistics that differentiate winning and losing teams according to game location. The sample included 306 games of the 2004-2005 regular season of the Spanish professional men's league (ACB League). The independent variables were game location (home or away) and game result (win or loss). The game-related statistics registered were free throws (successful and unsuccessful), 2- and 3-point field goals (successful and unsuccessful), offensive and defensive rebounds, blocks, assists, fouls, steals, and turnovers. Descriptive and inferential analyses were done (one-way analysis of variance and discriminate analysis). The multivariate analysis showed that winning teams differ from losing teams in defensive rebounds (SC = .42) and in assists (SC = .38). Similarly, winning teams differ from losing teams when they play at home in defensive rebounds (SC = .40) and in assists (SC = .41). On the other hand, winning teams differ from losing teams when they play away in defensive rebounds (SC = .44), assists (SC = .30), successful 2-point field goals (SC = .31), and unsuccessful 3-point field goals (SC = -.35). Defensive rebounds and assists were the only game-related statistics common to all three analyses.
NASA Technical Reports Server (NTRS)
Keegan, W. B.
1974-01-01
In order to produce cost effective environmental test programs, the test specifications must be realistic and to be useful, they must be available early in the life of a program. This paper describes a method for achieving such specifications for subsystems by utilizing the results of a statistical analysis of data acquired at subsystem mounting locations during system level environmental tests. The paper describes the details of this statistical analysis. The resultant recommended levels are a function of the subsystems' mounting location in the spacecraft. Methods of determining this mounting 'zone' are described. Recommendations are then made as to which of the various problem areas encountered should be pursued further.
A national streamflow network gap analysis
Kiang, Julie E.; Stewart, David W.; Archfield, Stacey A.; Osborne, Emily B.; Eng, Ken
2013-01-01
The U.S. Geological Survey (USGS) conducted a gap analysis to evaluate how well the USGS streamgage network meets a variety of needs, focusing on the ability to calculate various statistics at locations that have streamgages (gaged) and that do not have streamgages (ungaged). This report presents the results of analysis to determine where there are gaps in the network of gaged locations, how accurately desired statistics can be calculated with a given length of record, and whether the current network allows for estimation of these statistics at ungaged locations. The analysis indicated that there is variability across the Nation’s streamflow data-collection network in terms of the spatial and temporal coverage of streamgages. In general, the Eastern United States has better coverage than the Western United States. The arid Southwestern United States, Alaska, and Hawaii were observed to have the poorest spatial coverage, using the dataset assembled for this study. Except in Hawaii, these areas also tended to have short streamflow records. Differences in hydrology lead to differences in the uncertainty of statistics calculated in different regions of the country. Arid and semiarid areas of the Central and Southwestern United States generally exhibited the highest levels of interannual variability in flow, leading to larger uncertainty in flow statistics. At ungaged locations, information can be transferred from nearby streamgages if there is sufficient similarity between the gaged watersheds and the ungaged watersheds of interest. Areas where streamgages exhibit high correlation are most likely to be suitable for this type of information transfer. The areas with the most highly correlated streamgages appear to coincide with mountainous areas of the United States. Lower correlations are found in the Central United States and coastal areas of the Southeastern United States. Information transfer from gaged basins to ungaged basins is also most likely to be successful when basin attributes show high similarity. At the scale of the analysis completed in this study, the attributes of basins upstream of USGS streamgages cover the full range of basin attributes observed at potential locations of interest fairly well. Some exceptions included very high or very low elevation areas and very arid areas.
A new statistical PCA-ICA algorithm for location of R-peaks in ECG.
Chawla, M P S; Verma, H K; Kumar, Vinod
2008-09-16
The success of ICA to separate the independent components from the mixture depends on the properties of the electrocardiogram (ECG) recordings. This paper discusses some of the conditions of independent component analysis (ICA) that could affect the reliability of the separation and evaluation of issues related to the properties of the signals and number of sources. Principal component analysis (PCA) scatter plots are plotted to indicate the diagnostic features in the presence and absence of base-line wander in interpreting the ECG signals. In this analysis, a newly developed statistical algorithm by authors, based on the use of combined PCA-ICA for two correlated channels of 12-channel ECG data is proposed. ICA technique has been successfully implemented in identifying and removal of noise and artifacts from ECG signals. Cleaned ECG signals are obtained using statistical measures like kurtosis and variance of variance after ICA processing. This analysis also paper deals with the detection of QRS complexes in electrocardiograms using combined PCA-ICA algorithm. The efficacy of the combined PCA-ICA algorithm lies in the fact that the location of the R-peaks is bounded from above and below by the location of the cross-over points, hence none of the peaks are ignored or missed.
Potential of IMU Sensors in Performance Analysis of Professional Alpine Skiers
Yu, Gwangjae; Jang, Young Jae; Kim, Jinhyeok; Kim, Jin Hae; Kim, Hye Young; Kim, Kitae; Panday, Siddhartha Bikram
2016-01-01
In this paper, we present an analysis to identify a sensor location for an inertial measurement unit (IMU) on the body of a skier and propose the best location to capture turn motions for training. We also validate the manner in which the data from the IMU sensor on the proposed location can characterize ski turns and performance with a series of statistical analyses, including a comparison with data collected from foot pressure sensors. The goal of the study is to logically identify the ideal location on the skier’s body to attach the IMU sensor and the best use of the data collected for the skier. The statistical analyses and the hierarchical clustering method indicate that the pelvis is the best location for attachment of an IMU, and numerical validation shows that the data collected from this location can effectively estimate the performance and characteristics of the skier. Moreover, placement of the sensor at this location does not distract the skier’s motion, and the sensor can be easily attached and detached. The findings of this study can be used for the development of a wearable device for the routine training of professional skiers. PMID:27043579
Spatial Differentiation of Landscape Values in the Murray River Region of Victoria, Australia
NASA Astrophysics Data System (ADS)
Zhu, Xuan; Pfueller, Sharron; Whitelaw, Paul; Winter, Caroline
2010-05-01
This research advances the understanding of the location of perceived landscape values through a statistically based approach to spatial analysis of value densities. Survey data were obtained from a sample of people living in and using the Murray River region, Australia, where declining environmental quality prompted a reevaluation of its conservation status. When densities of 12 perceived landscape values were mapped using geographic information systems (GIS), valued places clustered along the entire river bank and in associated National/State Parks and reserves. While simple density mapping revealed high value densities in various locations, it did not indicate what density of a landscape value could be regarded as a statistically significant hotspot or distinguish whether overlapping areas of high density for different values indicate identical or adjacent locations. A spatial statistic Getis-Ord Gi* was used to indicate statistically significant spatial clusters of high value densities or “hotspots”. Of 251 hotspots, 40% were for single non-use values, primarily spiritual, therapeutic or intrinsic. Four hotspots had 11 landscape values. Two, lacking economic value, were located in ecologically important river red gum forests and two, lacking wilderness value, were near the major towns of Echuca-Moama and Albury-Wodonga. Hotspots for eight values showed statistically significant associations with another value. There were high associations between learning and heritage values while economic and biological diversity values showed moderate associations with several other direct and indirect use values. This approach may improve confidence in the interpretation of spatial analysis of landscape values by enhancing understanding of value relationships.
Statistical analysis of heavy truck loads using Wisconsin weigh-in-motion data
DOT National Transportation Integrated Search
2009-09-01
This study involved statistical evaluation of heavy truck loads that were recorded in 2007 using Weigh-In-Motion : stations located throughout the State of Wisconsin. The heaviest 5% of all trucks in each class and axle groupings were : selected for ...
Modeling fixation locations using spatial point processes.
Barthelmé, Simon; Trukenbrod, Hans; Engbert, Ralf; Wichmann, Felix
2013-10-01
Whenever eye movements are measured, a central part of the analysis has to do with where subjects fixate and why they fixated where they fixated. To a first approximation, a set of fixations can be viewed as a set of points in space; this implies that fixations are spatial data and that the analysis of fixation locations can be beneficially thought of as a spatial statistics problem. We argue that thinking of fixation locations as arising from point processes is a very fruitful framework for eye-movement data, helping turn qualitative questions into quantitative ones. We provide a tutorial introduction to some of the main ideas of the field of spatial statistics, focusing especially on spatial Poisson processes. We show how point processes help relate image properties to fixation locations. In particular we show how point processes naturally express the idea that image features' predictability for fixations may vary from one image to another. We review other methods of analysis used in the literature, show how they relate to point process theory, and argue that thinking in terms of point processes substantially extends the range of analyses that can be performed and clarify their interpretation.
ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data.
Carter, Kim W; Francis, Richard W; Carter, K W; Francis, R W; Bresnahan, M; Gissler, M; Grønborg, T K; Gross, R; Gunnes, N; Hammond, G; Hornig, M; Hultman, C M; Huttunen, J; Langridge, A; Leonard, H; Newman, S; Parner, E T; Petersson, G; Reichenberg, A; Sandin, S; Schendel, D E; Schalkwyk, L; Sourander, A; Steadman, C; Stoltenberg, C; Suominen, A; Surén, P; Susser, E; Sylvester Vethanayagam, A; Yusof, Z
2016-04-01
Research studies exploring the determinants of disease require sufficient statistical power to detect meaningful effects. Sample size is often increased through centralized pooling of disparately located datasets, though ethical, privacy and data ownership issues can often hamper this process. Methods that facilitate the sharing of research data that are sympathetic with these issues and which allow flexible and detailed statistical analyses are therefore in critical need. We have created a software platform for the Virtual Pooling and Analysis of Research data (ViPAR), which employs free and open source methods to provide researchers with a web-based platform to analyse datasets housed in disparate locations. Database federation permits controlled access to remotely located datasets from a central location. The Secure Shell protocol allows data to be securely exchanged between devices over an insecure network. ViPAR combines these free technologies into a solution that facilitates 'virtual pooling' where data can be temporarily pooled into computer memory and made available for analysis without the need for permanent central storage. Within the ViPAR infrastructure, remote sites manage their own harmonized research dataset in a database hosted at their site, while a central server hosts the data federation component and a secure analysis portal. When an analysis is initiated, requested data are retrieved from each remote site and virtually pooled at the central site. The data are then analysed by statistical software and, on completion, results of the analysis are returned to the user and the virtually pooled data are removed from memory. ViPAR is a secure, flexible and powerful analysis platform built on open source technology that is currently in use by large international consortia, and is made publicly available at [http://bioinformatics.childhealthresearch.org.au/software/vipar/]. © The Author 2015. Published by Oxford University Press on behalf of the International Epidemiological Association.
NASA Technical Reports Server (NTRS)
Butler, C. M.; Hogge, J. E.
1978-01-01
Air quality sampling was conducted. Data for air quality parameters, recorded on written forms, punched cards or magnetic tape, are available for 1972 through 1975. Computer software was developed to (1) calculate several daily statistical measures of location, (2) plot time histories of data or the calculated daily statistics, (3) calculate simple correlation coefficients, and (4) plot scatter diagrams. Computer software was developed for processing air quality data to include time series analysis and goodness of fit tests. Computer software was developed to (1) calculate a larger number of daily statistical measures of location, and a number of daily monthly and yearly measures of location, dispersion, skewness and kurtosis, (2) decompose the extended time series model and (3) perform some goodness of fit tests. The computer program is described, documented and illustrated by examples. Recommendations are made for continuation of the development of research on processing air quality data.
Location error uncertainties - an advanced using of probabilistic inverse theory
NASA Astrophysics Data System (ADS)
Debski, Wojciech
2016-04-01
The spatial location of sources of seismic waves is one of the first tasks when transient waves from natural (uncontrolled) sources are analyzed in many branches of physics, including seismology, oceanology, to name a few. Source activity and its spatial variability in time, the geometry of recording network, the complexity and heterogeneity of wave velocity distribution are all factors influencing the performance of location algorithms and accuracy of the achieved results. While estimating of the earthquake foci location is relatively simple a quantitative estimation of the location accuracy is really a challenging task even if the probabilistic inverse method is used because it requires knowledge of statistics of observational, modelling, and apriori uncertainties. In this presentation we addressed this task when statistics of observational and/or modeling errors are unknown. This common situation requires introduction of apriori constraints on the likelihood (misfit) function which significantly influence the estimated errors. Based on the results of an analysis of 120 seismic events from the Rudna copper mine operating in southwestern Poland we illustrate an approach based on an analysis of Shanon's entropy calculated for the aposteriori distribution. We show that this meta-characteristic of the aposteriori distribution carries some information on uncertainties of the solution found.
NASA Astrophysics Data System (ADS)
Senkbeil, J. C.; Brommer, D. M.; Comstock, I. J.; Loyd, T.
2012-07-01
Extratropical cyclones (ETCs) in the southern United States are often overlooked when compared with tropical cyclones in the region and ETCs in the northern United States. Although southern ETCs are significant weather events, there is currently not an operational scheme used for identifying and discussing these nameless storms. In this research, we classified 84 ETCs (1970-2009). We manually identified five distinct formation regions and seven unique ETC types using statistical classification. Statistical classification employed the use of principal components analysis and two methods of cluster analysis. Both manual and statistical storm types generally showed positive (negative) relationships with El Niño (La Niña). Manual storm types displayed precipitation swaths consistent with discrete storm tracks which further legitimizes the existence of multiple modes of southern ETCs. Statistical storm types also displayed unique precipitation intensity swaths, but these swaths were less indicative of track location. It is hoped that by classifying southern ETCs into types, that forecasters, hydrologists, and broadcast meteorologists might be able to better anticipate projected amounts of precipitation at their locations.
Defining the ecological hydrology of Taiwan Rivers using multivariate statistical methods
NASA Astrophysics Data System (ADS)
Chang, Fi-John; Wu, Tzu-Ching; Tsai, Wen-Ping; Herricks, Edwin E.
2009-09-01
SummaryThe identification and verification of ecohydrologic flow indicators has found new support as the importance of ecological flow regimes is recognized in modern water resources management, particularly in river restoration and reservoir management. An ecohydrologic indicator system reflecting the unique characteristics of Taiwan's water resources and hydrology has been developed, the Taiwan ecohydrological indicator system (TEIS). A major challenge for the water resources community is using the TEIS to provide environmental flow rules that improve existing water resources management. This paper examines data from the extensive network of flow monitoring stations in Taiwan using TEIS statistics to define and refine environmental flow options in Taiwan. Multivariate statistical methods were used to examine TEIS statistics for 102 stations representing the geographic and land use diversity of Taiwan. The Pearson correlation coefficient showed high multicollinearity between the TEIS statistics. Watersheds were separated into upper and lower-watershed locations. An analysis of variance indicated significant differences between upstream, more natural, and downstream, more developed, locations in the same basin with hydrologic indicator redundancy in flow change and magnitude statistics. Issues of multicollinearity were examined using a Principal Component Analysis (PCA) with the first three components related to general flow and high/low flow statistics, frequency and time statistics, and quantity statistics. These principle components would explain about 85% of the total variation. A major conclusion is that managers must be aware of differences among basins, as well as differences within basins that will require careful selection of management procedures to achieve needed flow regimes.
Langholz, Bryan; Thomas, Duncan C.; Stovall, Marilyn; Smith, Susan A.; Boice, John D.; Shore, Roy E.; Bernstein, Leslie; Lynch, Charles F.; Zhang, Xinbo; Bernstein, Jonine L.
2009-01-01
Summary Methods for the analysis of individually matched case-control studies with location-specific radiation dose and tumor location information are described. These include likelihood methods for analyses that just use cases with precise location of tumor information and methods that also include cases with imprecise tumor location information. The theory establishes that each of these likelihood based methods estimates the same radiation rate ratio parameters, within the context of the appropriate model for location and subject level covariate effects. The underlying assumptions are characterized and the potential strengths and limitations of each method are described. The methods are illustrated and compared using the WECARE study of radiation and asynchronous contralateral breast cancer. PMID:18647297
ERIC Educational Resources Information Center
Theobald, Rebecca
2005-01-01
The influence of location as exemplified by neighbourhood factors and school characteristics on primary education is examined in the context of the school choice movement of the last two decades. The analysis incorporates statistical information about schools and population data from Census 2000 describing neighbourhoods and schools in one…
NASA Astrophysics Data System (ADS)
Debski, Wojciech
2015-06-01
The spatial location of sources of seismic waves is one of the first tasks when transient waves from natural (uncontrolled) sources are analysed in many branches of physics, including seismology, oceanology, to name a few. Source activity and its spatial variability in time, the geometry of recording network, the complexity and heterogeneity of wave velocity distribution are all factors influencing the performance of location algorithms and accuracy of the achieved results. Although estimating of the earthquake foci location is relatively simple, a quantitative estimation of the location accuracy is really a challenging task even if the probabilistic inverse method is used because it requires knowledge of statistics of observational, modelling and a priori uncertainties. In this paper, we addressed this task when statistics of observational and/or modelling errors are unknown. This common situation requires introduction of a priori constraints on the likelihood (misfit) function which significantly influence the estimated errors. Based on the results of an analysis of 120 seismic events from the Rudna copper mine operating in southwestern Poland, we propose an approach based on an analysis of Shanon's entropy calculated for the a posteriori distribution. We show that this meta-characteristic of the a posteriori distribution carries some information on uncertainties of the solution found.
Catalog of earthquake hypocenters at Alaskan volcanoes: January 1 through December 31, 2006
Dixon, James P.; Stihler, Scott D.; Power, John A.; Searcy, Cheryl
2008-01-01
Between January 1 and December 31, 2006, AVO located 8,666 earthquakes of which 7,783 occurred on or near the 33 volcanoes monitored within Alaska. Monitoring highlights in 2006 include: an eruption of Augustine Volcano, a volcanic-tectonic earthquake swarm at Mount Martin, elevated seismicity and volcanic unrest at Fourpeaked Mountain, and elevated seismicity and low-level tremor at Mount Veniaminof and Korovin Volcano. A new seismic subnetwork was installed on Fourpeaked Mountain. This catalog includes: (1) descriptions and locations of seismic instrumentation deployed in the field during 2006, (2) a description of earthquake detection, recording, analysis, and data archival systems, (3) a description of seismic velocity models used for earthquake locations, (4) a summary of earthquakes located in 2006, and (5) an accompanying UNIX tar-file with a summary of earthquake origin times, hypocenters, magnitudes, phase arrival times, location quality statistics, daily station usage statistics, and all files used to determine the earthquake locations in 2006.
The Accuracy of GBM GRB Localizations
NASA Astrophysics Data System (ADS)
Briggs, Michael Stephen; Connaughton, V.; Meegan, C.; Hurley, K.
2010-03-01
We report an study of the accuracy of GBM GRB localizations, analyzing three types of localizations: those produced automatically by the GBM Flight Software on board GBM, those produced automatically with ground software in near real time, and localizations produced with human guidance. The two types of automatic locations are distributed in near real-time via GCN Notices; the human-guided locations are distributed on timescale of many minutes or hours using GCN Circulars. This work uses a Bayesian analysis that models the distribution of the GBM total location error by comparing GBM locations to more accurate locations obtained with other instruments. Reference locations are obtained from Swift, Super-AGILE, the LAT, and with the IPN. We model the GBM total location errors as having systematic errors in addition to the statistical errors and use the Bayesian analysis to constrain the systematic errors.
Indoor Location Sensing with Invariant Wi-Fi Received Signal Strength Fingerprinting
Husen, Mohd Nizam; Lee, Sukhan
2016-01-01
A method of location fingerprinting based on the Wi-Fi received signal strength (RSS) in an indoor environment is presented. The method aims to overcome the RSS instability due to varying channel disturbances in time by introducing the concept of invariant RSS statistics. The invariant RSS statistics represent here the RSS distributions collected at individual calibration locations under minimal random spatiotemporal disturbances in time. The invariant RSS statistics thus collected serve as the reference pattern classes for fingerprinting. Fingerprinting is carried out at an unknown location by identifying the reference pattern class that maximally supports the spontaneous RSS sensed from individual Wi-Fi sources. A design guideline is also presented as a rule of thumb for estimating the number of Wi-Fi signal sources required to be available for any given number of calibration locations under a certain level of random spatiotemporal disturbances. Experimental results show that the proposed method not only provides 17% higher success rate than conventional ones but also removes the need for recalibration. Furthermore, the resolution is shown finer by 40% with the execution time more than an order of magnitude faster than the conventional methods. These results are also backed up by theoretical analysis. PMID:27845711
Indoor Location Sensing with Invariant Wi-Fi Received Signal Strength Fingerprinting.
Husen, Mohd Nizam; Lee, Sukhan
2016-11-11
A method of location fingerprinting based on the Wi-Fi received signal strength (RSS) in an indoor environment is presented. The method aims to overcome the RSS instability due to varying channel disturbances in time by introducing the concept of invariant RSS statistics. The invariant RSS statistics represent here the RSS distributions collected at individual calibration locations under minimal random spatiotemporal disturbances in time. The invariant RSS statistics thus collected serve as the reference pattern classes for fingerprinting. Fingerprinting is carried out at an unknown location by identifying the reference pattern class that maximally supports the spontaneous RSS sensed from individual Wi-Fi sources. A design guideline is also presented as a rule of thumb for estimating the number of Wi-Fi signal sources required to be available for any given number of calibration locations under a certain level of random spatiotemporal disturbances. Experimental results show that the proposed method not only provides 17% higher success rate than conventional ones but also removes the need for recalibration. Furthermore, the resolution is shown finer by 40% with the execution time more than an order of magnitude faster than the conventional methods. These results are also backed up by theoretical analysis.
Varekar, Vikas; Karmakar, Subhankar; Jha, Ramakar
2016-02-01
The design of surface water quality sampling location is a crucial decision-making process for rationalization of monitoring network. The quantity, quality, and types of available dataset (watershed characteristics and water quality data) may affect the selection of appropriate design methodology. The modified Sanders approach and multivariate statistical techniques [particularly factor analysis (FA)/principal component analysis (PCA)] are well-accepted and widely used techniques for design of sampling locations. However, their performance may vary significantly with quantity, quality, and types of available dataset. In this paper, an attempt has been made to evaluate performance of these techniques by accounting the effect of seasonal variation, under a situation of limited water quality data but extensive watershed characteristics information, as continuous and consistent river water quality data is usually difficult to obtain, whereas watershed information may be made available through application of geospatial techniques. A case study of Kali River, Western Uttar Pradesh, India, is selected for the analysis. The monitoring was carried out at 16 sampling locations. The discrete and diffuse pollution loads at different sampling sites were estimated and accounted using modified Sanders approach, whereas the monitored physical and chemical water quality parameters were utilized as inputs for FA/PCA. The designed optimum number of sampling locations for monsoon and non-monsoon seasons by modified Sanders approach are eight and seven while that for FA/PCA are eleven and nine, respectively. Less variation in the number and locations of designed sampling sites were obtained by both techniques, which shows stability of results. A geospatial analysis has also been carried out to check the significance of designed sampling location with respect to river basin characteristics and land use of the study area. Both methods are equally efficient; however, modified Sanders approach outperforms FA/PCA when limited water quality and extensive watershed information is available. The available water quality dataset is limited and FA/PCA-based approach fails to identify monitoring locations with higher variation, as these multivariate statistical approaches are data-driven. The priority/hierarchy and number of sampling sites designed by modified Sanders approach are well justified by the land use practices and observed river basin characteristics of the study area.
AMMI adjustment for statistical analysis of an international wheat yield trial.
Crossa, J; Fox, P N; Pfeiffer, W H; Rajaram, S; Gauch, H G
1991-01-01
Multilocation trials are important for the CIMMYT Bread Wheat Program in producing high-yielding, adapted lines for a wide range of environments. This study investigated procedures for improving predictive success of a yield trial, grouping environments and genotypes into homogeneous subsets, and determining the yield stability of 18 CIMMYT bread wheats evaluated at 25 locations. Additive Main effects and Multiplicative Interaction (AMMI) analysis gave more precise estimates of genotypic yields within locations than means across replicates. This precision facilitated formation by cluster analysis of more cohesive groups of genotypes and locations for biological interpretation of interactions than occurred with unadjusted means. Locations were clustered into two subsets for which genotypes with positive interactions manifested in high, stable yields were identified. The analyses highlighted superior selections with both broad and specific adaptation.
PV System Component Fault and Failure Compilation and Analysis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klise, Geoffrey Taylor; Lavrova, Olga; Gooding, Renee Lynne
This report describes data collection and analysis of solar photovoltaic (PV) equipment events, which consist of faults and fa ilures that occur during the normal operation of a distributed PV system or PV power plant. We present summary statistics from locations w here maintenance data is being collected at various intervals, as well as reliability statistics gathered from that da ta, consisting of fault/failure distributions and repair distributions for a wide range of PV equipment types.
Investigation of trends in flooding in the Tug Fork basin of Kentucky, Virginia, and West Virginia
Hirsch, Robert M.; Scott, Arthur G.; Wyant, Timothy
1982-01-01
Statistical analysis indicates that the average size of annual-flood peaks of the Tug Fork (Ky., Va., and W. Va.) has been increasing. However, additional statistical analysis does not indicate that the flood levels that were exceeded typically once or twice a year in the period 1947-79 are any more likely to be exceeded now than in 1947. Possible trends in streamchannel size also are investigated at three locations. No discernible trends in channel size are noted. Further statistical analysis of the trend in the size of annual-flood peaks shows that much of the annual variation is related to local rainfall and to the 'natural' hydrologic response in a relatively undisturbed subbasin. However, some statistical indication of trend persists after accounting for these natural factors, though it is of borderline statistical significance. Further study in the basin may relate flood magnitudes to both rainfall and to land use.
de Agostino Biella Passos, Vivian; de Carvalho Carrara, Cleide Felício; da Silva Dalben, Gisele; Costa, Beatriz; Gomide, Marcia Ribeiro
2014-03-01
To evaluate the prevalence of fistulas after palate repair and analyze their location and association with possible causal factors. Retrospective analysis of patient records and evaluation of preoperative initial photographs. Tertiary craniofacial center. Five hundred eighty-nine individuals with complete unilateral cleft lip and palate that underwent palate repair at the age of 12 to 36 months by the von Langenbeck technique, in a single stage, by the plastic surgery team of the hospital, from January 2003 to July 2007. The cleft width was visually classified by a single examiner as narrow, regular, or wide. The following regions of the palate were considered for the location: anterior, medium, transition (between hard and soft palate), and soft palate. Descriptive statistics and analysis of association between the occurrence of fistula and the different parameters were evaluated. Palatal fistulas were observed in 27% of the sample, with a greater proportion at the anterior region (37.11%). The chi-square statistical test revealed statistically significant association (P ≤ .05) between the fistulas and initial cleft width (P = .0003), intraoperative problems (P = .0037), and postoperative problems (P = .00002). The prevalence of palatal fistula was similar to mean values reported in the literature. Analysis of causal factors showed a positive association between palatal fistulas with wide and regular initial cleft width and intraoperative and postoperative problems. The anterior region presented the greatest occurrence of fistulas.
Sobel, E.; Lange, K.
1996-01-01
The introduction of stochastic methods in pedigree analysis has enabled geneticists to tackle computations intractable by standard deterministic methods. Until now these stochastic techniques have worked by running a Markov chain on the set of genetic descent states of a pedigree. Each descent state specifies the paths of gene flow in the pedigree and the founder alleles dropped down each path. The current paper follows up on a suggestion by Elizabeth Thompson that genetic descent graphs offer a more appropriate space for executing a Markov chain. A descent graph specifies the paths of gene flow but not the particular founder alleles traveling down the paths. This paper explores algorithms for implementing Thompson's suggestion for codominant markers in the context of automatic haplotyping, estimating location scores, and computing gene-clustering statistics for robust linkage analysis. Realistic numerical examples demonstrate the feasibility of the algorithms. PMID:8651310
Southard, Rodney E.
2013-01-01
The weather and precipitation patterns in Missouri vary considerably from year to year. In 2008, the statewide average rainfall was 57.34 inches and in 2012, the statewide average rainfall was 30.64 inches. This variability in precipitation and resulting streamflow in Missouri underlies the necessity for water managers and users to have reliable streamflow statistics and a means to compute select statistics at ungaged locations for a better understanding of water availability. Knowledge of surface-water availability is dependent on the streamflow data that have been collected and analyzed by the U.S. Geological Survey for more than 100 years at approximately 350 streamgages throughout Missouri. The U.S. Geological Survey, in cooperation with the Missouri Department of Natural Resources, computed streamflow statistics at streamgages through the 2010 water year, defined periods of drought and defined methods to estimate streamflow statistics at ungaged locations, and developed regional regression equations to compute selected streamflow statistics at ungaged locations. Streamflow statistics and flow durations were computed for 532 streamgages in Missouri and in neighboring States of Missouri. For streamgages with more than 10 years of record, Kendall’s tau was computed to evaluate for trends in streamflow data. If trends were detected, the variable length method was used to define the period of no trend. Water years were removed from the dataset from the beginning of the record for a streamgage until no trend was detected. Low-flow frequency statistics were then computed for the entire period of record and for the period of no trend if 10 or more years of record were available for each analysis. Three methods are presented for computing selected streamflow statistics at ungaged locations. The first method uses power curve equations developed for 28 selected streams in Missouri and neighboring States that have multiple streamgages on the same streams. Statistical estimates on one of these streams can be calculated at an ungaged location that has a drainage area that is between 40 percent of the drainage area of the farthest upstream streamgage and within 150 percent of the drainage area of the farthest downstream streamgage along the stream of interest. The second method may be used on any stream with a streamgage that has operated for 10 years or longer and for which anthropogenic effects have not changed the low-flow characteristics at the ungaged location since collection of the streamflow data. A ratio of drainage area of the stream at the ungaged location to the drainage area of the stream at the streamgage was computed to estimate the statistic at the ungaged location. The range of applicability is between 40- and 150-percent of the drainage area of the streamgage, and the ungaged location must be located on the same stream as the streamgage. The third method uses regional regression equations to estimate selected low-flow frequency statistics for unregulated streams in Missouri. This report presents regression equations to estimate frequency statistics for the 10-year recurrence interval and for the N-day durations of 1, 2, 3, 7, 10, 30, and 60 days. Basin and climatic characteristics were computed using geographic information system software and digital geospatial data. A total of 35 characteristics were computed for use in preliminary statewide and regional regression analyses based on existing digital geospatial data and previous studies. Spatial analyses for geographical bias in the predictive accuracy of the regional regression equations defined three low-flow regions with the State representing the three major physiographic provinces in Missouri. Region 1 includes the Central Lowlands, Region 2 includes the Ozark Plateaus, and Region 3 includes the Mississippi Alluvial Plain. A total of 207 streamgages were used in the regression analyses for the regional equations. Of the 207 U.S. Geological Survey streamgages, 77 were located in Region 1, 120 were located in Region 2, and 10 were located in Region 3. Streamgages located outside of Missouri were selected to extend the range of data used for the independent variables in the regression analyses. Streamgages included in the regression analyses had 10 or more years of record and were considered to be affected minimally by anthropogenic activities or trends. Regional regression analyses identified three characteristics as statistically significant for the development of regional equations. For Region 1, drainage area, longest flow path, and streamflow-variability index were statistically significant. The range in the standard error of estimate for Region 1 is 79.6 to 94.2 percent. For Region 2, drainage area and streamflow variability index were statistically significant, and the range in the standard error of estimate is 48.2 to 72.1 percent. For Region 3, drainage area and streamflow-variability index also were statistically significant with a range in the standard error of estimate of 48.1 to 96.2 percent. Limitations on the use of estimating low-flow frequency statistics at ungaged locations are dependent on the method used. The first method outlined for use in Missouri, power curve equations, were developed to estimate the selected statistics for ungaged locations on 28 selected streams with multiple streamgages located on the same stream. A second method uses a drainage-area ratio to compute statistics at an ungaged location using data from a single streamgage on the same stream with 10 or more years of record. Ungaged locations on these streams may use the ratio of the drainage area at an ungaged location to the drainage area at a streamgage location to scale the selected statistic value from the streamgage location to the ungaged location. This method can be used if the drainage area of the ungaged location is within 40 to 150 percent of the streamgage drainage area. The third method is the use of the regional regression equations. The limits for the use of these equations are based on the ranges of the characteristics used as independent variables and that streams must be affected minimally by anthropogenic activities.
Spatial statistical analysis of tree deaths using airborne digital imagery
NASA Astrophysics Data System (ADS)
Chang, Ya-Mei; Baddeley, Adrian; Wallace, Jeremy; Canci, Michael
2013-04-01
High resolution digital airborne imagery offers unprecedented opportunities for observation and monitoring of vegetation, providing the potential to identify, locate and track individual vegetation objects over time. Analytical tools are required to quantify relevant information. In this paper, locations of trees over a large area of native woodland vegetation were identified using morphological image analysis techniques. Methods of spatial point process statistics were then applied to estimate the spatially-varying tree death risk, and to show that it is significantly non-uniform. [Tree deaths over the area were detected in our previous work (Wallace et al., 2008).] The study area is a major source of ground water for the city of Perth, and the work was motivated by the need to understand and quantify vegetation changes in the context of water extraction and drying climate. The influence of hydrological variables on tree death risk was investigated using spatial statistics (graphical exploratory methods, spatial point pattern modelling and diagnostics).
Toward statistical modeling of saccadic eye-movement and visual saliency.
Sun, Xiaoshuai; Yao, Hongxun; Ji, Rongrong; Liu, Xian-Ming
2014-11-01
In this paper, we present a unified statistical framework for modeling both saccadic eye movements and visual saliency. By analyzing the statistical properties of human eye fixations on natural images, we found that human attention is sparsely distributed and usually deployed to locations with abundant structural information. This observations inspired us to model saccadic behavior and visual saliency based on super-Gaussian component (SGC) analysis. Our model sequentially obtains SGC using projection pursuit, and generates eye movements by selecting the location with maximum SGC response. Besides human saccadic behavior simulation, we also demonstrated our superior effectiveness and robustness over state-of-the-arts by carrying out dense experiments on synthetic patterns and human eye fixation benchmarks. Multiple key issues in saliency modeling research, such as individual differences, the effects of scale and blur, are explored in this paper. Based on extensive qualitative and quantitative experimental results, we show promising potentials of statistical approaches for human behavior research.
A note on statistical analysis of shape through triangulation of landmarks
Rao, C. Radhakrishna
2000-01-01
In an earlier paper, the author jointly with S. Suryawanshi proposed statistical analysis of shape through triangulation of landmarks on objects. It was observed that the angles of the triangles are invariant to scaling, location, and rotation of objects. No distinction was made between an object and its reflection. The present paper provides the methodology of shape discrimination when reflection is also taken into account and makes suggestions for modifications to be made when some of the landmarks are collinear. PMID:10737780
Zhang, Kui; Wiener, Howard; Beasley, Mark; George, Varghese; Amos, Christopher I; Allison, David B
2006-08-01
Individual genome scans for quantitative trait loci (QTL) mapping often suffer from low statistical power and imprecise estimates of QTL location and effect. This lack of precision yields large confidence intervals for QTL location, which are problematic for subsequent fine mapping and positional cloning. In prioritizing areas for follow-up after an initial genome scan and in evaluating the credibility of apparent linkage signals, investigators typically examine the results of other genome scans of the same phenotype and informally update their beliefs about which linkage signals in their scan most merit confidence and follow-up via a subjective-intuitive integration approach. A method that acknowledges the wisdom of this general paradigm but formally borrows information from other scans to increase confidence in objectivity would be a benefit. We developed an empirical Bayes analytic method to integrate information from multiple genome scans. The linkage statistic obtained from a single genome scan study is updated by incorporating statistics from other genome scans as prior information. This technique does not require that all studies have an identical marker map or a common estimated QTL effect. The updated linkage statistic can then be used for the estimation of QTL location and effect. We evaluate the performance of our method by using extensive simulations based on actual marker spacing and allele frequencies from available data. Results indicate that the empirical Bayes method can account for between-study heterogeneity, estimate the QTL location and effect more precisely, and provide narrower confidence intervals than results from any single individual study. We also compared the empirical Bayes method with a method originally developed for meta-analysis (a closely related but distinct purpose). In the face of marked heterogeneity among studies, the empirical Bayes method outperforms the comparator.
NASA Astrophysics Data System (ADS)
Srivastava, S. K., Sr.; Sharma, D. A.; Sachdeva, K.
2017-12-01
Indo-Gangetic plains of India experience severe fog conditions during the peak winter months of December and January every year. In this paper an attempt has been to analyze the spatial and temporal variability of winter fog over Indo-Gangetic plains. Further, an attempt has also been made to configure an efficient meso-scale numerical weather prediction model using different parameterization schemes and develop a forecasting tool for prediction of fog during winter months over Indo-Gangetic plains. The study revealed that an alarming increasing positive trend of fog frequency prevails over many locations of IGP. Hot spot and cluster analysis were conducted to identify the high fog prone zones using GIS and inferential statistical tools respectively. Hot spots on an average experiences fog on 68.27% days, it is followed by moderate and cold spots with 48.03% and 21.79% respectively. The study proposes a new FASP (Fog Analysis, sensitivity and prediction) Model for overall analysis and prediction of fog at a particular location and period over IGP. In the first phase of this model long term climatological fog data of a location is analyzed to determine its characteristics and prevailing trend using various advanced statistical techniques. During a second phase a sensitivity test is conducted with different combination of parameterization schemes to determine the most suitable combination for fog simulation over a particular location and period and in the third and final phase, first ARIMA model is used to predict the number of fog days in future . Thereafter, Numerical model is used to predict the various meteorological parameters favourable for fog forecast. Finally, Hybrid model is used for fog forecast over the study location. The results of the FASP model are validated with actual ground based fog data using statistical tools. Forecast Fog-gram generated using hybrid model during Jan 2017 shows highly encouraging results for fog occurrence/Non occurrence between 25 hrs to 72 hours forecast. The model predicted the fog occurrences/Non occurrence with more than 85 % accuracy over most of the locations across the study area. The minimum visibility departure is within 500 m on 90% occasions over the central IGP and within 1000m on more than 80 % occasions over most of the locations across Indo-Gangetic plains.
Pradhan, Biswajeet; Chaudhari, Amruta; Adinarayana, J; Buchroithner, Manfred F
2012-01-01
In this paper, an attempt has been made to assess, prognosis and observe dynamism of soil erosion by universal soil loss equation (USLE) method at Penang Island, Malaysia. Multi-source (map-, space- and ground-based) datasets were used to obtain both static and dynamic factors of USLE, and an integrated analysis was carried out in raster format of GIS. A landslide location map was generated on the basis of image elements interpretation from aerial photos, satellite data and field observations and was used to validate soil erosion intensity in the study area. Further, a statistical-based frequency ratio analysis was carried out in the study area for correlation purposes. The results of the statistical correlation showed a satisfactory agreement between the prepared USLE-based soil erosion map and landslide events/locations, and are directly proportional to each other. Prognosis analysis on soil erosion helps the user agencies/decision makers to design proper conservation planning program to reduce soil erosion. Temporal statistics on soil erosion in these dynamic and rapid developments in Penang Island indicate the co-existence and balance of ecosystem.
NASA Technical Reports Server (NTRS)
Potter, Christopher
2015-01-01
Results from Landsat satellite image times series analysis since 1983 of this study area showed gradual, statistically significant increases in the normalized difference vegetation index (NDVI) in more than 90% of the (predominantly second-growth) evergreen forest locations sampled.
Online Statistical Modeling (Regression Analysis) for Independent Responses
NASA Astrophysics Data System (ADS)
Made Tirta, I.; Anggraeni, Dian; Pandutama, Martinus
2017-06-01
Regression analysis (statistical analmodelling) are among statistical methods which are frequently needed in analyzing quantitative data, especially to model relationship between response and explanatory variables. Nowadays, statistical models have been developed into various directions to model various type and complex relationship of data. Rich varieties of advanced and recent statistical modelling are mostly available on open source software (one of them is R). However, these advanced statistical modelling, are not very friendly to novice R users, since they are based on programming script or command line interface. Our research aims to developed web interface (based on R and shiny), so that most recent and advanced statistical modelling are readily available, accessible and applicable on web. We have previously made interface in the form of e-tutorial for several modern and advanced statistical modelling on R especially for independent responses (including linear models/LM, generalized linier models/GLM, generalized additive model/GAM and generalized additive model for location scale and shape/GAMLSS). In this research we unified them in the form of data analysis, including model using Computer Intensive Statistics (Bootstrap and Markov Chain Monte Carlo/ MCMC). All are readily accessible on our online Virtual Statistics Laboratory. The web (interface) make the statistical modeling becomes easier to apply and easier to compare them in order to find the most appropriate model for the data.
Experimental and environmental factors affect spurious detection of ecological thresholds
Daily, Jonathan P.; Hitt, Nathaniel P.; Smith, David; Snyder, Craig D.
2012-01-01
Threshold detection methods are increasingly popular for assessing nonlinear responses to environmental change, but their statistical performance remains poorly understood. We simulated linear change in stream benthic macroinvertebrate communities and evaluated the performance of commonly used threshold detection methods based on model fitting (piecewise quantile regression [PQR]), data partitioning (nonparametric change point analysis [NCPA]), and a hybrid approach (significant zero crossings [SiZer]). We demonstrated that false detection of ecological thresholds (type I errors) and inferences on threshold locations are influenced by sample size, rate of linear change, and frequency of observations across the environmental gradient (i.e., sample-environment distribution, SED). However, the relative importance of these factors varied among statistical methods and between inference types. False detection rates were influenced primarily by user-selected parameters for PQR (τ) and SiZer (bandwidth) and secondarily by sample size (for PQR) and SED (for SiZer). In contrast, the location of reported thresholds was influenced primarily by SED. Bootstrapped confidence intervals for NCPA threshold locations revealed strong correspondence to SED. We conclude that the choice of statistical methods for threshold detection should be matched to experimental and environmental constraints to minimize false detection rates and avoid spurious inferences regarding threshold location.
Statistical modeling of space shuttle environmental data
NASA Technical Reports Server (NTRS)
Tubbs, J. D.; Brewer, D. W.
1983-01-01
Statistical models which use a class of bivariate gamma distribution are examined. Topics discussed include: (1) the ratio of positively correlated gamma varieties; (2) a method to determine if unequal shape parameters are necessary in bivariate gamma distribution; (3) differential equations for modal location of a family of bivariate gamma distribution; and (4) analysis of some wind gust data using the analytical results developed for modeling application.
Federal Register 2010, 2011, 2012, 2013, 2014
2010-11-30
... Based on Customary Charges In Sec. 447.271(a), DHHS is adding an introductory phrase to read ``Except as... hospital that is located outside of a Core-Based Statistical Area (for Medicaid) and outside a Metropolitan Statistical Area for Medicare) and has fewer than 100 beds. DHHS is not preparing an analysis for section 1102...
GIS Tools For Improving Pedestrian & Bicycle Safety
DOT National Transportation Integrated Search
2000-07-01
Geographic Information System (GIS) software turns statistical data, such as accidents, and geographic data, such as roads and crash locations, into meaningful information for spatial analysis and mapping. In this project, GIS-based analytical techni...
NASA Technical Reports Server (NTRS)
Hyde, G.
1976-01-01
The 13/18 GHz COMSAT Propagation Experiment (CPE) was performed to measure attenuation caused by hydrometeors along slant paths from transmitting terminals on the ground to the ATS-6 satellite. The effectiveness of site diversity in overcoming this impairment was also studied. Problems encountered in assembling a valid data base of rain induced attenuation data for statistical analysis are considered. The procedures used to obtain the various statistics are then outlined. The graphs and tables of statistical data for the 15 dual frequency (13 and 18 GHz) site diversity locations are discussed. Cumulative rain rate statistics for the Fayetteville and Boston sites based on point rainfall data collected are presented along with extrapolations of the attenuation and point rainfall data.
ERIC Educational Resources Information Center
Moore, Andrea Lisa
2013-01-01
Toxic Release Inventory facilities are among the many environmental hazards shown to create environmental inequities in the United States. This project examined four factors associated with Toxic Release Inventory, specifically, manufacturing facility location at multiple spatial scales using spatial analysis techniques (i.e., O-ring statistic and…
Machine learning to analyze images of shocked materials for precise and accurate measurements
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dresselhaus-Cooper, Leora; Howard, Marylesa; Hock, Margaret C.
A supervised machine learning algorithm, called locally adaptive discriminant analysis (LADA), has been developed to locate boundaries between identifiable image features that have varying intensities. LADA is an adaptation of image segmentation, which includes techniques that find the positions of image features (classes) using statistical intensity distributions for each class in the image. In order to place a pixel in the proper class, LADA considers the intensity at that pixel and the distribution of intensities in local (nearby) pixels. This paper presents the use of LADA to provide, with statistical uncertainties, the positions and shapes of features within ultrafast imagesmore » of shock waves. We demonstrate the ability to locate image features including crystals, density changes associated with shock waves, and material jetting caused by shock waves. This algorithm can analyze images that exhibit a wide range of physical phenomena because it does not rely on comparison to a model. LADA enables analysis of images from shock physics with statistical rigor independent of underlying models or simulations.« less
Tian, Zengshan; Xu, Kunjie; Yu, Xiang
2014-01-01
This paper studies the statistical errors for the fingerprint-based RADAR neighbor matching localization with the linearly calibrated reference points (RPs) in logarithmic received signal strength (RSS) varying Wi-Fi environment. To the best of our knowledge, little comprehensive analysis work has appeared on the error performance of neighbor matching localization with respect to the deployment of RPs. However, in order to achieve the efficient and reliable location-based services (LBSs) as well as the ubiquitous context-awareness in Wi-Fi environment, much attention has to be paid to the highly accurate and cost-efficient localization systems. To this end, the statistical errors by the widely used neighbor matching localization are significantly discussed in this paper to examine the inherent mathematical relations between the localization errors and the locations of RPs by using a basic linear logarithmic strength varying model. Furthermore, based on the mathematical demonstrations and some testing results, the closed-form solutions to the statistical errors by RADAR neighbor matching localization can be an effective tool to explore alternative deployment of fingerprint-based neighbor matching localization systems in the future. PMID:24683349
Zhou, Mu; Tian, Zengshan; Xu, Kunjie; Yu, Xiang; Wu, Haibo
2014-01-01
This paper studies the statistical errors for the fingerprint-based RADAR neighbor matching localization with the linearly calibrated reference points (RPs) in logarithmic received signal strength (RSS) varying Wi-Fi environment. To the best of our knowledge, little comprehensive analysis work has appeared on the error performance of neighbor matching localization with respect to the deployment of RPs. However, in order to achieve the efficient and reliable location-based services (LBSs) as well as the ubiquitous context-awareness in Wi-Fi environment, much attention has to be paid to the highly accurate and cost-efficient localization systems. To this end, the statistical errors by the widely used neighbor matching localization are significantly discussed in this paper to examine the inherent mathematical relations between the localization errors and the locations of RPs by using a basic linear logarithmic strength varying model. Furthermore, based on the mathematical demonstrations and some testing results, the closed-form solutions to the statistical errors by RADAR neighbor matching localization can be an effective tool to explore alternative deployment of fingerprint-based neighbor matching localization systems in the future.
Analysis of Acoustic Emission Parameters from Corrosion of AST Bottom Plate in Field Testing
NASA Astrophysics Data System (ADS)
Jomdecha, C.; Jirarungsatian, C.; Suwansin, W.
Field testing of aboveground storage tank (AST) to monitor corrosion of the bottom plate is presented in this chapter. AE testing data of the ten AST with different sizes, materials, and products were employed to monitor the bottom plate condition. AE sensors of 30 and 150 kHz were used to monitor the corrosion activity of up to 24 channels including guard sensors. Acoustic emission (AE) parameters were analyzed to explore the AE parameter patterns of occurring corrosion compared to the laboratory results. Amplitude, count, duration, and energy were main parameters of analysis. Pattern recognition technique with statistical was implemented to eliminate the electrical and environmental noises. The results showed the specific AE patterns of corrosion activities related to the empirical results. In addition, plane algorithm was utilized to locate the significant AE events from corrosion. Both results of parameter patterns and AE event locations can be used to interpret and locate the corrosion activities. Finally, basic statistical grading technique was used to evaluate the bottom plate condition of the AST.
Statistical wind analysis for near-space applications
NASA Astrophysics Data System (ADS)
Roney, Jason A.
2007-09-01
Statistical wind models were developed based on the existing observational wind data for near-space altitudes between 60 000 and 100 000 ft (18 30 km) above ground level (AGL) at two locations, Akon, OH, USA, and White Sands, NM, USA. These two sites are envisioned as playing a crucial role in the first flights of high-altitude airships. The analysis shown in this paper has not been previously applied to this region of the stratosphere for such an application. Standard statistics were compiled for these data such as mean, median, maximum wind speed, and standard deviation, and the data were modeled with Weibull distributions. These statistics indicated, on a yearly average, there is a lull or a “knee” in the wind between 65 000 and 72 000 ft AGL (20 22 km). From the standard statistics, trends at both locations indicated substantial seasonal variation in the mean wind speed at these heights. The yearly and monthly statistical modeling indicated that Weibull distributions were a reasonable model for the data. Forecasts and hindcasts were done by using a Weibull model based on 2004 data and comparing the model with the 2003 and 2005 data. The 2004 distribution was also a reasonable model for these years. Lastly, the Weibull distribution and cumulative function were used to predict the 50%, 95%, and 99% winds, which are directly related to the expected power requirements of a near-space station-keeping airship. These values indicated that using only the standard deviation of the mean may underestimate the operational conditions.
A Meta-Analysis of Referential Communication Studies: A Computer Readable Literature Review.
ERIC Educational Resources Information Center
Dickson, W. Patrick; Moskoff, Mary
A computer-assisted analysis of studies on referential communication (giving directions/explanations) located 66 reports involving 80 experiments, 114 referential tasks, and over 6,200 individuals. The studies were entered into a statistical software package system (SPSS) and analyzed for characteristics of the subjects and experimental designs,…
A Meta-Analysis of Writing Instruction for Students in the Elementary Grades
ERIC Educational Resources Information Center
Graham, Steve; McKeown, Debra; Kiuhara, Sharlene; Harris, Karen R.
2012-01-01
In an effort to identify effective instructional practices for teaching writing to elementary grade students, we conducted a meta-analysis of the writing intervention literature, focusing our efforts on true and quasi-experiments. We located 115 documents that included the statistics for computing an effect size (ES). We calculated an average…
Effects of the water level on the flow topology over the Bolund island
NASA Astrophysics Data System (ADS)
Cuerva-Tejero, A.; Yeow, T. S.; Gallego-Castillo, C.; Lopez-Garcia, O.
2014-06-01
We have analyzed the influence of the actual height of Bolund island above water level on different full-scale statistics of the velocity field over the peninsula. Our analysis is focused on the database of 10-minute statistics provided by Risø-DTU for the Bolund Blind Experiment. We have considered 10-minut.e periods with near-neutral atmospheric conditions, mean wind speed values in the interval [5,20] m/s, and westerly wind directions. As expected, statistics such as speed-up, normalized increase of turbulent kinetic energy and probability of recirculating flow show a large dependence on the emerged height of the island for the locations close to the escarpment. For the published ensemble mean values of speed-up and normalized increase of turbulent kinetic energy in these locations, we propose that some ammount of uncertainty could be explained as a deterministic dependence of the flow field statistics upon the actual height of the Bolund island above the sea level.
López-Carr, David; Pricope, Narcisa G.; Aukema, Juliann E.; Jankowska, Marta M.; Funk, Christopher C.; Husak, Gregory J.; Michaelsen, Joel C.
2014-01-01
We present an integrative measure of exposure and sensitivity components of vulnerability to climatic and demographic change for the African continent in order to identify “hot spots” of high potential population vulnerability. Getis-Ord Gi* spatial clustering analyses reveal statistically significant locations of spatio-temporal precipitation decline coinciding with high population density and increase. Statistically significant areas are evident, particularly across central, southern, and eastern Africa. The highly populated Lake Victoria basin emerges as a particularly salient hot spot. People located in the regions highlighted in this analysis suffer exceptionally high exposure to negative climate change impacts (as populations increase on lands with decreasing rainfall). Results may help inform further hot spot mapping and related research on demographic vulnerabilities to climate change. Results may also inform more suitable geographical targeting of policy interventions across the continent.
a Comparative Analysis of Five Cropland Datasets in Africa
NASA Astrophysics Data System (ADS)
Wei, Y.; Lu, M.; Wu, W.
2018-04-01
The food security, particularly in Africa, is a challenge to be resolved. The cropland area and spatial distribution obtained from remote sensing imagery are vital information. In this paper, according to cropland area and spatial location, we compare five global cropland datasets including CCI Land Cover, GlobCover, MODIS Collection 5, GlobeLand30 and Unified Cropland in circa 2010 of Africa in terms of cropland area and spatial location. The accuracy of cropland area calculated from five datasets was analyzed compared with statistic data. Based on validation samples, the accuracies of spatial location for the five cropland products were assessed by error matrix. The results show that GlobeLand30 has the best fitness with the statistics, followed by MODIS Collection 5 and Unified Cropland, GlobCover and CCI Land Cover have the lower accuracies. For the accuracy of spatial location of cropland, GlobeLand30 reaches the highest accuracy, followed by Unified Cropland, MODIS Collection 5 and GlobCover, CCI Land Cover has the lowest accuracy. The spatial location accuracy of five datasets in the Csa with suitable farming condition is generally higher than in the Bsk.
Statistical shape analysis using 3D Poisson equation--A quantitatively validated approach.
Gao, Yi; Bouix, Sylvain
2016-05-01
Statistical shape analysis has been an important area of research with applications in biology, anatomy, neuroscience, agriculture, paleontology, etc. Unfortunately, the proposed methods are rarely quantitatively evaluated, and as shown in recent studies, when they are evaluated, significant discrepancies exist in their outputs. In this work, we concentrate on the problem of finding the consistent location of deformation between two population of shapes. We propose a new shape analysis algorithm along with a framework to perform a quantitative evaluation of its performance. Specifically, the algorithm constructs a Signed Poisson Map (SPoM) by solving two Poisson equations on the volumetric shapes of arbitrary topology, and statistical analysis is then carried out on the SPoMs. The method is quantitatively evaluated on synthetic shapes and applied on real shape data sets in brain structures. Copyright © 2016 Elsevier B.V. All rights reserved.
Tobin, Karin; Rudolph, Jonathan; Latkin, Carl
2018-01-01
Background Although studies that characterize the risk environment by linking contextual factors with individual-level data have advanced infectious disease and substance use research, there are opportunities to refine how we define relevant neighborhood exposures; this can in turn reduce the potential for exposure misclassification. For example, for those who do not inject at home, injection risk behaviors may be more influenced by the environment where they inject than where they live. Similarly, among those who spend more time away from home, a measure that accounts for different neighborhood exposures by weighting each unique location proportional to the percentage of time spent there may be more correlated with health behaviors than one’s residential environment. Objective This study aimed to develop a Web-based application that interacts with Google Maps application program interfaces (APIs) to collect contextually relevant locations and the amount of time spent in each. Our analysis examined the extent of overlap across different location types and compared different approaches for classifying neighborhood exposure. Methods Between May 2014 and March 2017, 547 participants enrolled in a Baltimore HIV care and prevention study completed an interviewer-administered Web-based survey that collected information about where participants were recruited, worked, lived, socialized, injected drugs, and spent most of their time. For each location, participants gave an address or intersection which they confirmed using Google Map and Street views. Geographic coordinates (and hours spent in each location) were joined to neighborhood indicators by Community Statistical Area (CSA). We computed a weighted exposure based on the proportion of time spent in each unique location. We compared neighborhood exposures based on each of the different location types with one another and the weighted exposure using analysis of variance with Bonferroni corrections to account for multiple comparisons. Results Participants reported spending the most time at home, followed by the location where they injected drugs. Injection locations overlapped most frequently with locations where people reported socializing and living or sleeping. The least time was spent in the locations where participants reported earning money and being recruited for the study; these locations were also the least likely to overlap with other location types. We observed statistically significant differences in neighborhood exposures according to the approach used. Overall, people reported earning money in higher-income neighborhoods and being recruited for the study and injecting in neighborhoods with more violent crime, abandoned houses, and poverty. Conclusions This analysis revealed statistically significant differences in neighborhood exposures when defined by different locations or weighted based on exposure time. Future analyses are needed to determine which exposure measures are most strongly associated with health and risk behaviors and to explore whether associations between individual-level behaviors and neighborhood exposures are modified by exposure times. PMID:29351899
A global compilation of coral sea-level benchmarks: Implications and new challenges
NASA Astrophysics Data System (ADS)
Medina-Elizalde, Martín
2013-01-01
I present a quality-controlled compilation of sea-level data from U-Th dated corals, encompassing 30 studies of 13 locations around the world. The compilation contains relative sea level (RSL) data from each location based on both conventional and open-system U-Th ages. I have applied a commonly used age quality control criterion based on the initial 234U/238U activity ratios of corals in order to select reliable ages and to reconstruct sea level histories for the last 150,000 yr. This analysis reveals scatter of RSL estimates among coeval coral benchmarks both within individual locations and between locations, particularly during Marine Isotope Stage (MIS) 5a and the glacial inception following the last interglacial. The character of data scatter during these time intervals imply that uncertainties still exist regarding tectonics, glacio-isostacy, U-series dating, and/or coral position. To elucidate robust underlying patterns, with confidence limits, I performed a Monte Carlo-style statistical analysis of the compiled coral data considering appropriate age and sea-level uncertainties. By its nature, such an analysis has the tendency to smooth/obscure millennial-scale (and finer) details that may be important in individual datasets, and favour the major underlying patterns that are supported by all datasets. This statistical analysis is thus functional to illustrate major trends that are statistically robust ('what we know'), trends that are suggested but still are supported by few data ('what we might know, subject to addition of more supporting data and improved corrections'), and which patterns/data are clear outliers ('unlikely to be realistic given the rest of the global data and possibly needing further adjustments'). Prior to the last glacial maximum and with the possible exception of the 130-120 ka period, available coral data generally have insufficient temporal resolution and unexplained scatter, which hinders identification of a well-defined pattern with usefully narrow confidence limits. This analysis thus provides a framework that objectively identifies critical targets for new data collection, improved corrections, and integration of coral data with independent, stratigraphically continuous methods of sea-level reconstruction.
42 CFR 485.610 - Condition of participation: Status and location.
Code of Federal Regulations, 2010 CFR
2010-10-01
... requirements: (i) The CAH is located outside any area that is a Metropolitan Statistical Area, as defined by... Statistical Area, as defined by the Office of Management and Budget, but is being treated as being located in... this section and is located in a county that, in FY 2004, was not part of a Metropolitan Statistical...
Some regularity on how to locate electrodes for higher fECG SNRs
NASA Astrophysics Data System (ADS)
Zhang, Jie-Min; Huang, Xiao-Lin; Guan, Qun; Liu, Tie-Bing; Li, Ping; Zhao, Ying; Liu, Hong-Xing
2015-03-01
The electrocardiogram (ECG) recorded from the abdominal surface of a pregnant woman is a composite of maternal ECG, fetal ECG (fECG) and other noises, while only the fECG component is always needed by us. With different locations of electrode pairs on the maternal abdominal surface to measure fECGs, the signal-to-noise ratios (SNRs) of the recorded abdominal ECGs are also correspondingly different. Some regularity on how to locate electrodes to obtain higher fECG SNRs is needed practically. In this paper, 343 groups of abdominal ECG records were acquired from 78 pregnant women with different electrode pairs locating, and an appropriate extended research database is formed. Then the regularity on fECG SNRs corresponding to different electrode pairs locating was studied. Based on statistical analysis, it is shown that the fECG SNRs are significantly higher in certain locations than others. Reasonable explanation is also provided to the statistical result using the theories of the fetal cardiac electrical axis and the signal phase delay. Project supported by the National Natural Science Foundation of China (Grant No. 61271079) and the Supporting Plan Project of Jiangsu Province, China (Grant No. BE2010720).
Method for Identifying Probable Archaeological Sites from Remotely Sensed Data
NASA Technical Reports Server (NTRS)
Tilton, James C.; Comer, Douglas C.; Priebe, Carey E.; Sussman, Daniel
2011-01-01
Archaeological sites are being compromised or destroyed at a catastrophic rate in most regions of the world. The best solution to this problem is for archaeologists to find and study these sites before they are compromised or destroyed. One way to facilitate the necessary rapid, wide area surveys needed to find these archaeological sites is through the generation of maps of probable archaeological sites from remotely sensed data. We describe an approach for identifying probable locations of archaeological sites over a wide area based on detecting subtle anomalies in vegetative cover through a statistically based analysis of remotely sensed data from multiple sources. We further developed this approach under a recent NASA ROSES Space Archaeology Program project. Under this project we refined and elaborated this statistical analysis to compensate for potential slight miss-registrations between the remote sensing data sources and the archaeological site location data. We also explored data quantization approaches (required by the statistical analysis approach), and we identified a superior data quantization approached based on a unique image segmentation approach. In our presentation we will summarize our refined approach and demonstrate the effectiveness of the overall approach with test data from Santa Catalina Island off the southern California coast. Finally, we discuss our future plans for further improving our approach.
Zhou, Mu; Xu, Yu Bin; Ma, Lin; Tian, Shuo
2012-01-01
The expected errors of RADAR sensor networks with linear probabilistic location fingerprints inside buildings with varying Wi-Fi Gaussian strength are discussed. As far as we know, the statistical errors of equal and unequal-weighted RADAR networks have been suggested as a better way to evaluate the behavior of different system parameters and the deployment of reference points (RPs). However, up to now, there is still not enough related work on the relations between the statistical errors, system parameters, number and interval of the RPs, let alone calculating the correlated analytical expressions of concern. Therefore, in response to this compelling problem, under a simple linear distribution model, much attention will be paid to the mathematical relations of the linear expected errors, number of neighbors, number and interval of RPs, parameters in logarithmic attenuation model and variations of radio signal strength (RSS) at the test point (TP) with the purpose of constructing more practical and reliable RADAR location sensor networks (RLSNs) and also guaranteeing the accuracy requirements for the location based services in future ubiquitous context-awareness environments. Moreover, the numerical results and some real experimental evaluations of the error theories addressed in this paper will also be presented for our future extended analysis. PMID:22737027
Zhou, Mu; Xu, Yu Bin; Ma, Lin; Tian, Shuo
2012-01-01
The expected errors of RADAR sensor networks with linear probabilistic location fingerprints inside buildings with varying Wi-Fi Gaussian strength are discussed. As far as we know, the statistical errors of equal and unequal-weighted RADAR networks have been suggested as a better way to evaluate the behavior of different system parameters and the deployment of reference points (RPs). However, up to now, there is still not enough related work on the relations between the statistical errors, system parameters, number and interval of the RPs, let alone calculating the correlated analytical expressions of concern. Therefore, in response to this compelling problem, under a simple linear distribution model, much attention will be paid to the mathematical relations of the linear expected errors, number of neighbors, number and interval of RPs, parameters in logarithmic attenuation model and variations of radio signal strength (RSS) at the test point (TP) with the purpose of constructing more practical and reliable RADAR location sensor networks (RLSNs) and also guaranteeing the accuracy requirements for the location based services in future ubiquitous context-awareness environments. Moreover, the numerical results and some real experimental evaluations of the error theories addressed in this paper will also be presented for our future extended analysis.
Siordia, Carlos; Saenz, Joseph; Tom, Sarah E.
2014-01-01
Type II diabetes is a growing health problem in the United States. Understanding geographic variation in diabetes prevalence will inform where resources for management and prevention should be allocated. Investigations of the correlates of diabetes prevalence have largely ignored how spatial nonstationarity might play a role in the macro-level distribution of diabetes. This paper introduces the reader to the concept of spatial nonstationarity—variance in statistical relationships as a function of geographical location. Since spatial nonstationarity means different predictors can have varying effects on model outcomes, we make use of a geographically weighed regression to calculate correlates of diabetes as a function of geographic location. By doing so, we demonstrate an exploratory example in which the diabetes-poverty macro-level statistical relationship varies as a function of location. In particular, we provide evidence that when predicting macro-level diabetes prevalence, poverty is not always positively associated with diabetes PMID:25414731
Siordia, Carlos; Saenz, Joseph; Tom, Sarah E
2012-01-01
Type II diabetes is a growing health problem in the United States. Understanding geographic variation in diabetes prevalence will inform where resources for management and prevention should be allocated. Investigations of the correlates of diabetes prevalence have largely ignored how spatial nonstationarity might play a role in the macro-level distribution of diabetes. This paper introduces the reader to the concept of spatial nonstationarity-variance in statistical relationships as a function of geographical location. Since spatial nonstationarity means different predictors can have varying effects on model outcomes, we make use of a geographically weighed regression to calculate correlates of diabetes as a function of geographic location. By doing so, we demonstrate an exploratory example in which the diabetes-poverty macro-level statistical relationship varies as a function of location. In particular, we provide evidence that when predicting macro-level diabetes prevalence, poverty is not always positively associated with diabetes.
NASA Astrophysics Data System (ADS)
Das, Shreya; Nag, S. K.
2017-05-01
Multivariate statistical techniques, cluster and principal component analysis were applied to the data on groundwater quality of Suri I and II Blocks of Birbhum District, West Bengal, India, to extract principal factors corresponding to the different sources of variation in the hydrochemistry as well as the main controls on the hydrochemistry. For this, bore well water samples have been collected in two phases, during Post-monsoon (November 2012) and Pre-monsoon (April 2013) from 26 sampling locations spread homogeneously over the two blocks. Excess fluoride in groundwater has been reported at two locations both in post- and in pre-monsoon sessions, with a rise observed in pre-monsoon. Localized presence of excess iron has also been observed during both sessions. The water is found to be mildly alkaline in post-monsoon but slightly acidic at some locations during pre-monsoon. Correlation and cluster analysis studies demonstrate that fluoride shares a moderately positive correlation with pH in post-monsoon and a very strong one with carbonate in pre-monsoon indicating dominance of rock water interaction and ion exchange activity in the study area. Certain locations in the study area have been reported with less than 0.6 mg/l fluoride in groundwater, leading to possibility of occurrence of severe dental caries especially in children. Low values of sulfate and phosphate in water indicate a meager chance of contamination of groundwater due to anthropogenic factors.
An automated multi-scale network-based scheme for detection and location of seismic sources
NASA Astrophysics Data System (ADS)
Poiata, N.; Aden-Antoniow, F.; Satriano, C.; Bernard, P.; Vilotte, J. P.; Obara, K.
2017-12-01
We present a recently developed method - BackTrackBB (Poiata et al. 2016) - allowing to image energy radiation from different seismic sources (e.g., earthquakes, LFEs, tremors) in different tectonic environments using continuous seismic records. The method exploits multi-scale frequency-selective coherence in the wave field, recorded by regional seismic networks or local arrays. The detection and location scheme is based on space-time reconstruction of the seismic sources through an imaging function built from the sum of station-pair time-delay likelihood functions, projected onto theoretical 3D time-delay grids. This imaging function is interpreted as the location likelihood of the seismic source. A signal pre-processing step constructs a multi-band statistical representation of the non stationary signal, i.e. time series, by means of higher-order statistics or energy envelope characteristic functions. Such signal-processing is designed to detect in time signal transients - of different scales and a priori unknown predominant frequency - potentially associated with a variety of sources (e.g., earthquakes, LFE, tremors), and to improve the performance and the robustness of the detection-and-location location step. The initial detection-location, based on a single phase analysis with the P- or S-phase only, can then be improved recursively in a station selection scheme. This scheme - exploiting the 3-component records - makes use of P- and S-phase characteristic functions, extracted after a polarization analysis of the event waveforms, and combines the single phase imaging functions with the S-P differential imaging functions. The performance of the method is demonstrated here in different tectonic environments: (1) analysis of the one year long precursory phase of 2014 Iquique earthquake in Chile; (2) detection and location of tectonic tremor sources and low-frequency earthquakes during the multiple episodes of tectonic tremor activity in southwestern Japan.
NASA Astrophysics Data System (ADS)
Ohyanagi, S.; Dileonardo, C.
2013-12-01
As a natural phenomenon earthquake occurrence is difficult to predict. Statistical analysis of earthquake data was performed using candlestick chart and Bollinger Band methods. These statistical methods, commonly used in the financial world to analyze market trends were tested against earthquake data. Earthquakes above Mw 4.0 located on shore of Sanriku (37.75°N ~ 41.00°N, 143.00°E ~ 144.50°E) from February 1973 to May 2013 were selected for analysis. Two specific patterns in earthquake occurrence were recognized through the analysis. One is a spread of candlestick prior to the occurrence of events greater than Mw 6.0. A second pattern shows convergence in the Bollinger Band, which implies a positive or negative change in the trend of earthquakes. Both patterns match general models for the buildup and release of strain through the earthquake cycle, and agree with both the characteristics of the candlestick chart and Bollinger Band analysis. These results show there is a high correlation between patterns in earthquake occurrence and trend analysis by these two statistical methods. The results of this study agree with the appropriateness of the application of these financial analysis methods to the analysis of earthquake occurrence.
Perry, Charles A.; Wolock, David M.; Artman, Joshua C.
2004-01-01
Streamflow statistics of flow duration and peak-discharge frequency were estimated for 4,771 individual locations on streams listed on the 1999 Kansas Surface Water Register. These statistics included the flow-duration values of 90, 75, 50, 25, and 10 percent, as well as the mean flow value. Peak-discharge frequency values were estimated for the 2-, 5-, 10-, 25-, 50-, and 100-year floods. Least-squares multiple regression techniques were used, along with Tobit analyses, to develop equations for estimating flow-duration values of 90, 75, 50, 25, and 10 percent and the mean flow for uncontrolled flow stream locations. The contributing-drainage areas of 149 U.S. Geological Survey streamflow-gaging stations in Kansas and parts of surrounding States that had flow uncontrolled by Federal reservoirs and used in the regression analyses ranged from 2.06 to 12,004 square miles. Logarithmic transformations of climatic and basin data were performed to yield the best linear relation for developing equations to compute flow durations and mean flow. In the regression analyses, the significant climatic and basin characteristics, in order of importance, were contributing-drainage area, mean annual precipitation, mean basin permeability, and mean basin slope. The analyses yielded a model standard error of prediction range of 0.43 logarithmic units for the 90-percent duration analysis to 0.15 logarithmic units for the 10-percent duration analysis. The model standard error of prediction was 0.14 logarithmic units for the mean flow. Regression equations used to estimate peak-discharge frequency values were obtained from a previous report, and estimates for the 2-, 5-, 10-, 25-, 50-, and 100-year floods were determined for this report. The regression equations and an interpolation procedure were used to compute flow durations, mean flow, and estimates of peak-discharge frequency for locations along uncontrolled flow streams on the 1999 Kansas Surface Water Register. Flow durations, mean flow, and peak-discharge frequency values determined at available gaging stations were used to interpolate the regression-estimated flows for the stream locations where available. Streamflow statistics for locations that had uncontrolled flow were interpolated using data from gaging stations weighted according to the drainage area and the bias between the regression-estimated and gaged flow information. On controlled reaches of Kansas streams, the streamflow statistics were interpolated between gaging stations using only gaged data weighted by drainage area.
Rockfalls in the Duratón canyon, central Spain: Inventory and statistical analysis
NASA Astrophysics Data System (ADS)
Tanarro, Luis M.; Muñoz, Julio
2012-10-01
This paper presents an initial analysis of the rockfall processes affecting the walls of the canyon of the River Duratón. This 34 km long meandering canyon in the basin of the River Duero in central Spain (41°18' N, 3°45' W) has evolved in a large-scale outcrop of Late Cretaceous calcareous rocks (dolomite and limestone) deformed into a series of asymmetrical folds. Its vertical scarps range from 80 to 100 m; its width varies from 150 to 300 m; and its floor is between 30 and 50 m wide. The research consisted of drawing up an inventory of rockfalls from a field survey and mapping the fallen blocks deposited on the basal talus or on the canyon floor, which in turn allowed the original location of each block on the scarps to be identified and located on the orthophotos available. A Digital Elevation Model (DEM) was produced using a Geographic Information System (GIS) and maps made of the aspects and slopes. The aspect of each rockfall data point was determined, and this initial database was completed with other significant parameters (location on the valley side, relationship with the tectonic structure and relative age). An approximate delimitation was also produced of the potential rockfall source area, by reclassifying the slopes according to morphometric criteria. The result is a geomorphic rockfall inventory map, showing the distribution of the rockfalls and a basic statistical analysis to allow a preliminary evaluation of the rockfall characteristics in relation to both their topoclimatic location (aspect) and their structural location (with or counter to the dip of the strata) and to the current geomorphic dynamic through a study of recent scars on the scarps. Recent rockfalls have also been related to the meteorological conditions in which they occurred.
Alania, M; De Backer, A; Lobato, I; Krause, F F; Van Dyck, D; Rosenauer, A; Van Aert, S
2017-10-01
In this paper, we investigate how precise atoms of a small nanocluster can ultimately be located in three dimensions (3D) from a tilt series of images acquired using annular dark field (ADF) scanning transmission electron microscopy (STEM). Therefore, we derive an expression for the statistical precision with which the 3D atomic position coordinates can be estimated in a quantitative analysis. Evaluating this statistical precision as a function of the microscope settings also allows us to derive the optimal experimental design. In this manner, the optimal angular tilt range, required electron dose, optimal detector angles, and number of projection images can be determined. Copyright © 2016 Elsevier B.V. All rights reserved.
Detection of short-term response of the low ionosphere on gamma ray bursts
NASA Astrophysics Data System (ADS)
Nina, Aleksandra; Simić, Saša.; Srećković, Vladimir A.; Popović, Luka Č.
2015-10-01
In this paper, we study the possibility of detection of short-term terrestrial lower ionospheric response to gamma ray bursts (GRBs) using a statistical analysis of perturbations of six very low or low-frequency (VLF/LF) radio signals emitted by transmitters located worldwide and recorded by VLF/LF receiver located in Belgrade (Serbia). We consider a sample of 54 short-lasting GRBs (shorter than 1 min) detected by the Swift satellite during the period 2009-2012. We find that a statistically significant perturbation can be present in the low ionosphere, and reactions on GRBs may be observed immediately after the beginning of the GRB event or with a time delay of 60 s-90 s.
Game Location and Team Quality Effects on Performance Profiles in Professional Soccer
Lago-Peñas, Carlos; Lago-Ballesteros, Joaquin
2011-01-01
Home advantage in team sports has an important role in determining the outcome of a game. The aim of the present study was to identify the soccer game- related statistics that best discriminate home and visiting teams according to the team quality. The sample included all 380 games of the Spanish professional men’s league. The independent variables were game location (home or away) and the team quality. Teams were classified into four groups according to their final ranking at the end of the league. The game-related statistics registered were divided into three groups: (i) variables related to goals scored; (ii) variables related to offense and (iii) variables related to defense. A univariate (t-test and Mann-Whitney U) and multivariate (discriminant analysis) analysis of data was done. Results showed that home teams have significantly higher means for goal scored, total shots, shots on goal, attacking moves, box moves, crosses, offsides committed, assists, passes made, successful passes, dribbles made, successful dribbles, ball possession, and gains of possession, while visiting teams presented higher means for losses of possession and yellow cards. In addition, the findings of the current study confirm that game location and team quality are important in determining technical and tactical performances in matches. Teams described as superior and those described as inferior did not experience the same home advantage. Future research should consider the influence of other confounding variables such as weather conditions, game status and team form. Key points Home teams have significantly higher figures for attack indicators probably due to facilities familiarity and crowd effects. The teams’ game-related statistics profile varied according to game location and team quality. Teams described as superior and those described as inferior did not experience the same home advantage. PMID:24150619
Garcia, Luís Filipe; de Oliveira, Luís Caldas; de Matos, David Martins
2016-01-01
This study compared the performance of two statistical location-aware pictogram prediction mechanisms, with an all-purpose (All) pictogram prediction mechanism, having no location knowledge. The All approach had a unique language model under all locations. One of the location-aware alternatives, the location-specific (Spec) approach, made use of specific language models for pictogram prediction in each location of interest. The other location-aware approach resulted from combining the Spec and the All approaches, and was designated the mixed approach (Mix). In this approach, the language models acquired knowledge from all locations, but a higher relevance was assigned to the vocabulary from the associated location. Results from simulations showed that the Mix and Spec approaches could only outperform the baseline in a statistically significant way if pictogram users reuse more than 50% and 75% of their sentences, respectively. Under low sentence reuse conditions there were no statistically significant differences between the location-aware approaches and the All approach. Under these conditions, the Mix approach performed better than the Spec approach in a statistically significant way.
Kashif, Amer S; Lotz, Thomas F; Heeren, Adrianus M W; Chase, James G
2013-11-01
It is estimated that every year, 1 × 10(6) women are diagnosed with breast cancer, and more than 410,000 die annually worldwide. Digital Image Elasto Tomography (DIET) is a new noninvasive breast cancer screening modality that induces mechanical vibrations in the breast and images its surface motion with digital cameras to detect changes in stiffness. This research develops a new automated approach for diagnosing breast cancer using DIET based on a modal analysis model. The first and second natural frequency of silicone phantom breasts is analyzed. Separate modal analysis is performed for each region of the phantom to estimate the modal parameters using imaged motion data over several input frequencies. Statistical methods are used to assess the likelihood of a frequency shift, which can indicate tumor location. Phantoms with 5, 10, and 20 mm stiff inclusions are tested, as well as a homogeneous (healthy) phantom. Inclusions are located at four locations with different depth. The second natural frequency proves to be a reliable metric with the potential to clearly distinguish lesion like inclusions of different stiffness, as well as providing an approximate location for the tumor like inclusions. The 10 and 20 mm inclusions are always detected regardless of depth. The 5 mm inclusions are only detected near the surface. The homogeneous phantom always yields a negative result, as expected. Detection is based on a statistical likelihood analysis to determine the presence of significantly different frequency response over the phantom, which is a novel approach to this problem. The overall results show promise and justify proof of concept trials with human subjects.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vallée, Jacques P., E-mail: jacques.vallee@nrc-cnrc.gc.ca
2014-07-01
From the Sun's location in the Galactic disk, different arm tracers (CO, H I, hot dust, etc.) have been employed to locate a tangent to each spiral arm. Using all various and different observed spiral arm tracers (as published elsewhere), we embark on a new goal, namely the statistical analysis of these published data (data mining) to statistically compute the mean location of each spiral arm tracer. We show for a typical arm cross-cut, a separation of 400 pc between the mid-arm and the dust lane (at the inner edge of the arm, toward the Galactic center). Are some armsmore » major and others minor? Separating arms into two sets, as suggested by some, we find the same arm widths between the two sets. Our interpretation is that we live in a multiple (four-arm) spiral (logarithmic) pattern (around a pitch angle of 12°) for the stars and gas in the Milky Way, with a sizable interarm separation (around 3 kpc) at the Sun's location and the same arm width for each arm (near 400 pc from mid-arm to dust lane).« less
Koprivica, Mladen; Neskovic, Natasa; Neskovic, Aleksandar; Paunovic, George
2014-01-01
As a result of dense installations of public mobile base station, additional electromagnetic radiation occurs in the living environment. In order to determine the level of radio-frequency radiation generated by base stations, extensive electromagnetic field strength measurements were carried out for 664 base station locations. Base station locations were classified into three categories: indoor, masts and locations with installations on buildings. Having in mind the large percentage (47 %) of sites with antenna masts, a detailed analysis of this location category was performed, and the measurement results were presented. It was concluded that the total electric field strength in the vicinity of base station antenna masts in no case exceeded 10 V m(-1), which is quite below the International Commission on Non-Ionizing Radiation Protection reference levels. At horizontal distances >50 m from the mast bottom, the median and maximum values were <1 and 2 V m(-1), respectively.
Analysis of Parasite and Other Skewed Counts
Alexander, Neal
2012-01-01
Objective To review methods for the statistical analysis of parasite and other skewed count data. Methods Statistical methods for skewed count data are described and compared, with reference to those used over a ten year period of Tropical Medicine and International Health. Two parasitological datasets are used for illustration. Results Ninety papers were identified, 89 with descriptive and 60 with inferential analysis. A lack of clarity is noted in identifying measures of location, in particular the Williams and geometric mean. The different measures are compared, emphasizing the legitimacy of the arithmetic mean for skewed data. In the published papers, the t test and related methods were often used on untransformed data, which is likely to be invalid. Several approaches to inferential analysis are described, emphasizing 1) non-parametric methods, while noting that they are not simply comparisons of medians, and 2) generalized linear modelling, in particular with the negative binomial distribution. Additional methods, such as the bootstrap, with potential for greater use are described. Conclusions Clarity is recommended when describing transformations and measures of location. It is suggested that non-parametric methods and generalized linear models are likely to be sufficient for most analyses. PMID:22943299
Langan, Dean; Higgins, Julian P T; Gregory, Walter; Sutton, Alexander J
2012-05-01
We aim to illustrate the potential impact of a new study on a meta-analysis, which gives an indication of the robustness of the meta-analysis. A number of augmentations are proposed to one of the most widely used of graphical displays, the funnel plot. Namely, 1) statistical significance contours, which define regions of the funnel plot in which a new study would have to be located to change the statistical significance of the meta-analysis; and 2) heterogeneity contours, which show how a new study would affect the extent of heterogeneity in a given meta-analysis. Several other features are also described, and the use of multiple features simultaneously is considered. The statistical significance contours suggest that one additional study, no matter how large, may have a very limited impact on the statistical significance of a meta-analysis. The heterogeneity contours illustrate that one outlying study can increase the level of heterogeneity dramatically. The additional features of the funnel plot have applications including 1) informing sample size calculations for the design of future studies eligible for inclusion in the meta-analysis; and 2) informing the updating prioritization of a portfolio of meta-analyses such as those prepared by the Cochrane Collaboration. Copyright © 2012 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Ridgley, James Alexander, Jr.
This dissertation is an exploratory quantitative analysis of various independent variables to determine their effect on the professional longevity (years of service) of high school science teachers in the state of Florida for the academic years 2011-2012 to 2013-2014. Data are collected from the Florida Department of Education, National Center for Education Statistics, and the National Assessment of Educational Progress databases. The following research hypotheses are examined: H1 - There are statistically significant differences in Level 1 (teacher variables) that influence the professional longevity of a high school science teacher in Florida. H2 - There are statistically significant differences in Level 2 (school variables) that influence the professional longevity of a high school science teacher in Florida. H3 - There are statistically significant differences in Level 3 (district variables) that influence the professional longevity of a high school science teacher in Florida. H4 - When tested in a hierarchical multiple regression, there are statistically significant differences in Level 1, Level 2, or Level 3 that influence the professional longevity of a high school science teacher in Florida. The professional longevity of a Floridian high school science teacher is the dependent variable. The independent variables are: (Level 1) a teacher's sex, age, ethnicity, earned degree, salary, number of schools taught in, migration count, and various years of service in different areas of education; (Level 2) a school's geographic location, residential population density, average class size, charter status, and SES; and (Level 3) a school district's average SES and average spending per pupil. Statistical analyses of exploratory MLRs and a HMR are used to support the research hypotheses. The final results of the HMR analysis show a teacher's age, salary, earned degree (unknown, associate, and doctorate), and ethnicity (Hispanic and Native Hawaiian/Pacific Islander); a school's charter status; and a school district's average SES are all significant predictors of a Florida high school science teacher's professional longevity. Although statistically significant in the initial exploratory MLR analyses, a teacher's ethnicity (Asian and Black), a school's geographic location (city and rural), and a school's SES are not statistically significant in the final HMR model.
Liumbruno, Giancarlo Maria; Panetta, Valentina; Bonini, Rosaria; Chianese, Rosa; Fiorin, Francesco; Lupi, Maria Antonietta; Tomasini, Ivana; Grazzini, Giuliano
2011-01-01
Introduction The aim of the survey described in this article was to determine decisional and strategic factors useful for redefining minimum structural, technological and organisational requisites for transfusion structures, as well as for the production of guidelines for accreditation of transfusion structures by the National Blood Centre. Materials and methods A structured questionnaire containing 65 questions was sent to all Transfusion Services in Italy. The questions covered: management of the quality system, accreditation, conformity with professional standards, structural and technological requisites, as well as potential to supply transfusion medicine-related health care services. All the questionnaires returned underwent statistical analysis. Results Replies were received from 64.7% of the Transfusion Services. Thirty-nine percent of these had an ISO 9001 certificate, with marked differences according to geographical location; location-related differences were also present for responses to other questions and were confirmed by multivariate statistical analysis. Over half of the Transfusion Services (53.6%) had blood donation sites run by donor associations. The statistical analysis revealed only one statistically significant difference between these donation sites: those connected to certified Transfusion Services were more likely themselves to have ISO 9001 certification than those connected to services who did not have such certification. Conclusions The data collected in this survey are representative of the Italian national transfusion system. A re-definition of the authorisation and accreditation requisites for transfusion activities must take into account European and national legislation when determining these requisites in order to facilitate their effective applicability, promote their efficient fulfilment and enhance the development of homogeneous and transparent quality systems. PMID:21839026
Loha, Eskindir; Lindtjørn, Bernt
2010-06-16
Malaria transmission is complex and is believed to be associated with local climate changes. However, simple attempts to extrapolate malaria incidence rates from averaged regional meteorological conditions have proven unsuccessful. Therefore, the objective of this study was to determine if variations in specific meteorological factors are able to consistently predict P. falciparum malaria incidence at different locations in south Ethiopia. Retrospective data from 42 locations were collected including P. falciparum malaria incidence for the period of 1998-2007 and meteorological variables such as monthly rainfall (all locations), temperature (17 locations), and relative humidity (three locations). Thirty-five data sets qualified for the analysis. Ljung-Box Q statistics was used for model diagnosis, and R squared or stationary R squared was taken as goodness of fit measure. Time series modelling was carried out using Transfer Function (TF) models and univariate auto-regressive integrated moving average (ARIMA) when there was no significant predictor meteorological variable. Of 35 models, five were discarded because of the significant value of Ljung-Box Q statistics. Past P. falciparum malaria incidence alone (17 locations) or when coupled with meteorological variables (four locations) was able to predict P. falciparum malaria incidence within statistical significance. All seasonal AIRMA orders were from locations at altitudes above 1742 m. Monthly rainfall, minimum and maximum temperature was able to predict incidence at four, five and two locations, respectively. In contrast, relative humidity was not able to predict P. falciparum malaria incidence. The R squared values for the models ranged from 16% to 97%, with the exception of one model which had a negative value. Models with seasonal ARIMA orders were found to perform better. However, the models for predicting P. falciparum malaria incidence varied from location to location, and among lagged effects, data transformation forms, ARIMA and TF orders. This study describes P. falciparum malaria incidence models linked with meteorological data. Variability in the models was principally attributed to regional differences, and a single model was not found that fits all locations. Past P. falciparum malaria incidence appeared to be a superior predictor than meteorology. Future efforts in malaria modelling may benefit from inclusion of non-meteorological factors.
Bias correction of bounded location errors in presence-only data
Hefley, Trevor J.; Brost, Brian M.; Hooten, Mevin B.
2017-01-01
Location error occurs when the true location is different than the reported location. Because habitat characteristics at the true location may be different than those at the reported location, ignoring location error may lead to unreliable inference concerning species–habitat relationships.We explain how a transformation known in the spatial statistics literature as a change of support (COS) can be used to correct for location errors when the true locations are points with unknown coordinates contained within arbitrary shaped polygons.We illustrate the flexibility of the COS by modelling the resource selection of Whooping Cranes (Grus americana) using citizen contributed records with locations that were reported with error. We also illustrate the COS with a simulation experiment.In our analysis of Whooping Crane resource selection, we found that location error can result in up to a five-fold change in coefficient estimates. Our simulation study shows that location error can result in coefficient estimates that have the wrong sign, but a COS can efficiently correct for the bias.
Shields, Timothy; Pinchoff, Jessie; Lubinda, Jailos; Hamapumbu, Harry; Searle, Kelly; Kobayashi, Tamaki; Thuma, Philip E; Moss, William J; Curriero, Frank C
2016-05-31
Satellite imagery is increasingly available at high spatial resolution and can be used for various purposes in public health research and programme implementation. Comparing a census generated from two satellite images of the same region in rural southern Zambia obtained four and a half years apart identified patterns of household locations and change over time. The length of time that a satellite image-based census is accurate determines its utility. Households were enumerated manually from satellite images obtained in 2006 and 2011 of the same area. Spatial statistics were used to describe clustering, cluster detection, and spatial variation in the location of households. A total of 3821 household locations were enumerated in 2006 and 4256 in 2011, a net change of 435 houses (11.4% increase). Comparison of the images indicated that 971 (25.4%) structures were added and 536 (14.0%) removed. Further analysis suggested similar household clustering in the two images and no substantial difference in concentration of households across the study area. Cluster detection analysis identified a small area where significantly more household structures were removed than expected; however, the amount of change was of limited practical significance. These findings suggest that random sampling of households for study participation would not induce geographic bias if based on a 4.5-year-old image in this region. Application of spatial statistical methods provides insights into the population distribution changes between two time periods and can be helpful in assessing the accuracy of satellite imagery.
Spatial-temporal analysis of building surface temperatures in Hung Hom
NASA Astrophysics Data System (ADS)
Zeng, Ying; Shen, Yueqian
2015-12-01
This thesis presents a study on spatial-temporal analysis of building surface temperatures in Hung Hom. Observations were collected from Aug 2013 to Oct 2013 at a 30-min interval, using iButton sensors (N=20) covering twelve locations in Hung Hom. And thermal images were captured in PolyU from 05 Aug 2013 to 06 Aug 2013. A linear regression model of iButton and thermal records is established to calibrate temperature data. A 3D modeling system is developed based on Visual Studio 2010 development platform, using ArcEngine10.0 component, Microsoft Access 2010 database and C# programming language. The system realizes processing data, spatial analysis, compound query and 3D face temperature rendering and so on. After statistical analyses, building face azimuths are found to have a statistically significant relationship with sun azimuths at peak time. And seasonal building temperature changing also corresponds to the sun angle and sun azimuth variations. Building materials are found to have a significant effect on building surface temperatures. Buildings with lower albedo materials tend to have higher temperatures and larger thermal conductivity material have significant diurnal variations. For the geographical locations, the peripheral faces of campus have higher temperatures than the inner faces during day time and buildings located at the southeast are cooler than the western. Furthermore, human activity is found to have a strong relationship with building surface temperatures through weekday and weekend comparison.
NASA Technical Reports Server (NTRS)
Goldhirsh, J.
1982-01-01
The first absolute rain fade distribution method described establishes absolute fade statistics at a given site by means of a sampled radar data base. The second method extrapolates absolute fade statistics from one location to another, given simultaneously measured fade and rain rate statistics at the former. Both methods employ similar conditional fade statistic concepts and long term rain rate distributions. Probability deviations in the 2-19% range, with an 11% average, were obtained upon comparison of measured and predicted levels at given attenuations. The extrapolation of fade distributions to other locations at 28 GHz showed very good agreement with measured data at three sites located in the continental temperate region.
Markov chain Monte Carlo linkage analysis: effect of bin width on the probability of linkage.
Slager, S L; Juo, S H; Durner, M; Hodge, S E
2001-01-01
We analyzed part of the Genetic Analysis Workshop (GAW) 12 simulated data using Monte Carlo Markov chain (MCMC) methods that are implemented in the computer program Loki. The MCMC method reports the "probability of linkage" (PL) across the chromosomal regions of interest. The point of maximum PL can then be taken as a "location estimate" for the location of the quantitative trait locus (QTL). However, Loki does not provide a formal statistical test of linkage. In this paper, we explore how the bin width used in the calculations affects the max PL and the location estimate. We analyzed age at onset (AO) and quantitative trait number 5, Q5, from 26 replicates of the general simulated data in one region where we knew a major gene, MG5, is located. For each trait, we found the max PL and the corresponding location estimate, using four different bin widths. We found that bin width, as expected, does affect the max PL and the location estimate, and we recommend that users of Loki explore how their results vary with different bin widths.
Statistical analysis of the 70 meter antenna surface distortions
NASA Technical Reports Server (NTRS)
Kiedron, K.; Chian, C. T.; Chuang, K. L.
1987-01-01
Statistical analysis of surface distortions of the 70 meter NASA/JPL antenna, located at Goldstone, was performed. The purpose of this analysis is to verify whether deviations due to gravity loading can be treated as quasi-random variables with normal distribution. Histograms of the RF pathlength error distribution for several antenna elevation positions were generated. The results indicate that the deviations from the ideal antenna surface are not normally distributed. The observed density distribution for all antenna elevation angles is taller and narrower than the normal density, which results in large positive values of kurtosis and a significant amount of skewness. The skewness of the distribution changes from positive to negative as the antenna elevation changes from zenith to horizon.
NASA Astrophysics Data System (ADS)
Barette, Florian; Poppe, Sam; Smets, Benoît; Benbakkar, Mhammed; Kervyn, Matthieu
2017-10-01
We present an integrated, spatially-explicit database of existing geochemical major-element analyses available from (post-) colonial scientific reports, PhD Theses and international publications for the Virunga Volcanic Province, located in the western branch of the East African Rift System. This volcanic province is characterised by alkaline volcanism, including silica-undersaturated, alkaline and potassic lavas. The database contains a total of 908 geochemical analyses of eruptive rocks for the entire volcanic province with a localisation for most samples. A preliminary analysis of the overall consistency of the database, using statistical techniques on sets of geochemical analyses with contrasted analytical methods or dates, demonstrates that the database is consistent. We applied a principal component analysis and cluster analysis on whole-rock major element compositions included in the database to study the spatial variation of the chemical composition of eruptive products in the Virunga Volcanic Province. These statistical analyses identify spatially distributed clusters of eruptive products. The known geochemical contrasts are highlighted by the spatial analysis, such as the unique geochemical signature of Nyiragongo lavas compared to other Virunga lavas, the geochemical heterogeneity of the Bulengo area, and the trachyte flows of Karisimbi volcano. Most importantly, we identified separate clusters of eruptive products which originate from primitive magmatic sources. These lavas of primitive composition are preferentially located along NE-SW inherited rift structures, often at distance from the central Virunga volcanoes. Our results illustrate the relevance of a spatial analysis on integrated geochemical data for a volcanic province, as a complement to classical petrological investigations. This approach indeed helps to characterise geochemical variations within a complex of magmatic systems and to identify specific petrologic and geochemical investigations that should be tackled within a study area.
A statistical approach to the brittle fracture of a multi-phase solid
NASA Technical Reports Server (NTRS)
Liu, W. K.; Lua, Y. I.; Belytschko, T.
1991-01-01
A stochastic damage model is proposed to quantify the inherent statistical distribution of the fracture toughness of a brittle, multi-phase solid. The model, based on the macrocrack-microcrack interaction, incorporates uncertainties in locations and orientations of microcracks. Due to the high concentration of microcracks near the macro-tip, a higher order analysis based on traction boundary integral equations is formulated first for an arbitrary array of cracks. The effects of uncertainties in locations and orientations of microcracks at a macro-tip are analyzed quantitatively by using the boundary integral equations method in conjunction with the computer simulation of the random microcrack array. The short range interactions resulting from surrounding microcracks closet to the main crack tip are investigated. The effects of microcrack density parameter are also explored in the present study. The validity of the present model is demonstrated by comparing its statistical output with the Neville distribution function, which gives correct fits to sets of experimental data from multi-phase solids.
Differences in Pain Location, Intensity and Quality by Pain Pattern in Outpatients with Cancer
Ngamkham, Srisuda; Holden, Janean E.; Wilkie, Diana J.
2013-01-01
Pain pattern represents how the individual’s pain changes temporally with activities or other factors. Understanding pain pattern is important for appropriate timing of pain interventions, but researchers have studied less the temporal aspects of cancer pain than pain location, intensity, and quality parameters. The study purpose was to explore differences in pain location, intensity, and quality by pattern groups in outpatients with cancer. We conducted a comparative, secondary data analysis of data collected from 1994 to 2007. 762 outpatients with cancer had completed the 0-to-10 Pain Intensity Number Scale and the McGill Pain Questionnaire to measure pain location, quality and pattern. From all possible combinations of the three types of pain patterns, we created seven pain pattern groups. The distribution of pain pattern was: pattern-1 (27%); pattern-2 (24%); pattern-3 (8%); pattern-4 (12%); pattern-5 (3%); pattern-6 (18%); and pattern-7 (8%). A statistically significant higher proportion of patients with continuous pain patterns (pattern 1, 4, 5, and 7) reported pain location in two or more sites. Patients with pattern 1, 4, and 7 reported statistically significant, higher worst pain mean scores than patients with pattern 2, 3, and 6 (not continuous descriptors). Patients with pattern7 reported statistically significant, higher mean scores (pain rating index-sensory and total number of words selected) than patients with pattern1, 2, 3, 4, and 6. Using pain pattern groups may help clinicians to understand temporal changes in cancer pain and to provide more effective pain management by recognizing the high risk if the pain is continuous. PMID:21512345
The Learning Organization Model across Vocational and Academic Teacher Groups
ERIC Educational Resources Information Center
Park, Joo Ho; Rojewski, Jay W.
2006-01-01
Multiple-group confirmatory factor analysis was used to investigate factorial invariance between vocational and academic teacher groups on a measure of the learning organization concept. Participants were 488 full-time teachers of public trade industry-technical and business schools located within Seoul, South Korea. Statistically significant…
Determinants of Student Wastage in Higher Education.
ERIC Educational Resources Information Center
Johnes, Jill
1990-01-01
Statistical analysis of a sample of the 1979 entry cohort to Lancaster University indicates that the likelihood of non-completion is determined by various characteristics including the student's academic ability, gender, marital status, work experience prior to university, school background, and location of home in relation to university.…
3D Self-Localisation From Angle of Arrival Measurements
2009-04-01
systems can provide precise position information. However, there are situations where GPS is not adequate such as indoor, underwater, extraterrestrial or...Transactions on Pattern Analysis and Machine Intelligence , Vol. 22, No. 6, June 2000, pp 610-622. 7. Torrieri, D.J., "Statistical Theory of Passive Location
Global Statistical Learning in a Visual Search Task
ERIC Educational Resources Information Center
Jones, John L.; Kaschak, Michael P.
2012-01-01
Locating a target in a visual search task is facilitated when the target location is repeated on successive trials. Global statistical properties also influence visual search, but have often been confounded with local regularities (i.e., target location repetition). In two experiments, target locations were not repeated for four successive trials,…
Torres, Hianne Miranda de; Arruda, Julyanna Jacinto de; Silva-Filho, João Manoel da; Faria, Danielle Lago Bruno de; Nascimento, Monikelly Carmo Chagas; Torres, Érica Miranda de
2017-01-01
The anatomical characteristics of permanent maxillary canines were evaluated through visual examination, periapical radiography, and cone beam computed tomography (CBCT), and measurements obtained from the images and directly on the teeth were compared. Fifty extracted human maxillary canines were classified according to the side of the mouth. The direction of root curvature and location of the apical foramen were also verified. Periapical radiographs and CBCTs of the specimens were obtained. The number of root canals was verified. Tooth length and the mesiodistal and buccopalatal widths of the root were measured directly on the specimens as well as on the radiographs and CBCTs. Data were analyzed by chi-square testing and analysis of variance (α = 0.05). All teeth-26 (52%) from the right side of the dental arch and 24 (48%) from the left-had only 1 main canal each. The apical foramen was located exactly in the root apex in 34 teeth (68%). Root curvature toward the distal side was observed in the apical third in 23 teeth (46%). There were no statistically significant differences between the canines' arch side and either the foramen location (P = 0.104) or the root curvature (P = 0.215). No statistically significant differences were found in measurements of tooth length (P = 0.669), mesiodistal root width (P = 0.517), or buccopalatal root width (P = 0.672) obtained from specimens and images. Both CBCTs and periapical radiographs were reliable for determining the tooth length, mesiodistal root width, and buccopalatal root width of maxillary canines and produced statistically similar measurements.
Lee, Do-Youl; Kim, Se-Hoon; Suh, Jung-Keun; Cho, Tai-Hyoung; Chung, Yong-Gu
2012-09-01
This study was designed to investigate the correlation between insertion depth of artificial disc and postoperative kyphotic deformity after Prodisc-C total disc replacement surgery, and the range of artificial disc insertion depth which is effective in preventing postoperative whole cervical or segmental kyphotic deformity. A retrospective radiological analysis was performed in 50 patients who had undergone single level total disc replacement surgery. Records were reviewed to obtain demographic data. Preoperative and postoperative radiographs were assessed to determine C2-7 Cobb's angle and segmental angle and to investigate postoperative kyphotic deformity. A formula was introduced to calculate insertion depth of Prodisc-C artificial disc. Statistical analysis was performed to search the correlation between insertion depth of Prodisc-C artificial disc and postoperative kyphotic deformity, and to estimate insertion depth of Prodisc-C artificial disc to prevent postoperative kyphotic deformity. In this study no significant statistical correlation was observed between insertion depth of Prodisc-C artificial disc and postoperative kyphotic deformity regarding C2-7 Cobb's angle. Statistical correlation between insertion depth of Prodisc-C artificial disc and postoperative kyphotic deformity was observed regarding segmental angle (p<0.05). It failed to estimate proper insertion depth of Prodisc-C artificial disc effective in preventing postoperative kyphotic deformity. Postoperative segmental kyphotic deformity is associated with insertion depth of Prodisc-C artificial disc. Anterior located artificial disc leads to lordotic segmental angle and posterior located artificial disc leads to kyphotic segmental angle postoperatively. But C2-7 Cobb's angle is not affected by artificial disc location after the surgery.
Sando, Roy; Chase, Katherine J.
2017-03-23
A common statistical procedure for estimating streamflow statistics at ungaged locations is to develop a relational model between streamflow and drainage basin characteristics at gaged locations using least squares regression analysis; however, least squares regression methods are parametric and make constraining assumptions about the data distribution. The random forest regression method provides an alternative nonparametric method for estimating streamflow characteristics at ungaged sites and requires that the data meet fewer statistical conditions than least squares regression methods.Random forest regression analysis was used to develop predictive models for 89 streamflow characteristics using Precipitation-Runoff Modeling System simulated streamflow data and drainage basin characteristics at 179 sites in central and eastern Montana. The predictive models were developed from streamflow data simulated for current (baseline, water years 1982–99) conditions and three future periods (water years 2021–38, 2046–63, and 2071–88) under three different climate-change scenarios. These predictive models were then used to predict streamflow characteristics for baseline conditions and three future periods at 1,707 fish sampling sites in central and eastern Montana. The average root mean square error for all predictive models was about 50 percent. When streamflow predictions at 23 fish sampling sites were compared to nearby locations with simulated data, the mean relative percent difference was about 43 percent. When predictions were compared to streamflow data recorded at 21 U.S. Geological Survey streamflow-gaging stations outside of the calibration basins, the average mean absolute percent error was about 73 percent.
Local indicators of geocoding accuracy (LIGA): theory and application
Jacquez, Geoffrey M; Rommel, Robert
2009-01-01
Background Although sources of positional error in geographic locations (e.g. geocoding error) used for describing and modeling spatial patterns are widely acknowledged, research on how such error impacts the statistical results has been limited. In this paper we explore techniques for quantifying the perturbability of spatial weights to different specifications of positional error. Results We find that a family of curves describes the relationship between perturbability and positional error, and use these curves to evaluate sensitivity of alternative spatial weight specifications to positional error both globally (when all locations are considered simultaneously) and locally (to identify those locations that would benefit most from increased geocoding accuracy). We evaluate the approach in simulation studies, and demonstrate it using a case-control study of bladder cancer in south-eastern Michigan. Conclusion Three results are significant. First, the shape of the probability distributions of positional error (e.g. circular, elliptical, cross) has little impact on the perturbability of spatial weights, which instead depends on the mean positional error. Second, our methodology allows researchers to evaluate the sensitivity of spatial statistics to positional accuracy for specific geographies. This has substantial practical implications since it makes possible routine sensitivity analysis of spatial statistics to positional error arising in geocoded street addresses, global positioning systems, LIDAR and other geographic data. Third, those locations with high perturbability (most sensitive to positional error) and high leverage (that contribute the most to the spatial weight being considered) will benefit the most from increased positional accuracy. These are rapidly identified using a new visualization tool we call the LIGA scatterplot. Herein lies a paradox for spatial analysis: For a given level of positional error increasing sample density to more accurately follow the underlying population distribution increases perturbability and introduces error into the spatial weights matrix. In some studies positional error may not impact the statistical results, and in others it might invalidate the results. We therefore must understand the relationships between positional accuracy and the perturbability of the spatial weights in order to have confidence in a study's results. PMID:19863795
Ensink, Elliot; Sinha, Jessica; Sinha, Arkadeep; Tang, Huiyuan; Calderone, Heather M; Hostetter, Galen; Winter, Jordan; Cherba, David; Brand, Randall E; Allen, Peter J; Sempere, Lorenzo F; Haab, Brian B
2015-10-06
Experiments involving the high-throughput quantification of image data require algorithms for automation. A challenge in the development of such algorithms is to properly interpret signals over a broad range of image characteristics, without the need for manual adjustment of parameters. Here we present a new approach for locating signals in image data, called Segment and Fit Thresholding (SFT). The method assesses statistical characteristics of small segments of the image and determines the best-fit trends between the statistics. Based on the relationships, SFT identifies segments belonging to background regions; analyzes the background to determine optimal thresholds; and analyzes all segments to identify signal pixels. We optimized the initial settings for locating background and signal in antibody microarray and immunofluorescence data and found that SFT performed well over multiple, diverse image characteristics without readjustment of settings. When used for the automated analysis of multicolor, tissue-microarray images, SFT correctly found the overlap of markers with known subcellular localization, and it performed better than a fixed threshold and Otsu's method for selected images. SFT promises to advance the goal of full automation in image analysis.
Ensink, Elliot; Sinha, Jessica; Sinha, Arkadeep; Tang, Huiyuan; Calderone, Heather M.; Hostetter, Galen; Winter, Jordan; Cherba, David; Brand, Randall E.; Allen, Peter J.; Sempere, Lorenzo F.; Haab, Brian B.
2016-01-01
Certain experiments involve the high-throughput quantification of image data, thus requiring algorithms for automation. A challenge in the development of such algorithms is to properly interpret signals over a broad range of image characteristics, without the need for manual adjustment of parameters. Here we present a new approach for locating signals in image data, called Segment and Fit Thresholding (SFT). The method assesses statistical characteristics of small segments of the image and determines the best-fit trends between the statistics. Based on the relationships, SFT identifies segments belonging to background regions; analyzes the background to determine optimal thresholds; and analyzes all segments to identify signal pixels. We optimized the initial settings for locating background and signal in antibody microarray and immunofluorescence data and found that SFT performed well over multiple, diverse image characteristics without readjustment of settings. When used for the automated analysis of multi-color, tissue-microarray images, SFT correctly found the overlap of markers with known subcellular localization, and it performed better than a fixed threshold and Otsu’s method for selected images. SFT promises to advance the goal of full automation in image analysis. PMID:26339978
Bingemann, Dieter; Allen, Rachel M.
2012-01-01
We describe a statistical method to analyze dual-channel photon arrival trajectories from single molecule spectroscopy model-free to identify break points in the intensity ratio. Photons are binned with a short bin size to calculate the logarithm of the intensity ratio for each bin. Stochastic photon counting noise leads to a near-normal distribution of this logarithm and the standard student t-test is used to find statistically significant changes in this quantity. In stochastic simulations we determine the significance threshold for the t-test’s p-value at a given level of confidence. We test the method’s sensitivity and accuracy indicating that the analysis reliably locates break points with significant changes in the intensity ratio with little or no error in realistic trajectories with large numbers of small change points, while still identifying a large fraction of the frequent break points with small intensity changes. Based on these results we present an approach to estimate confidence intervals for the identified break point locations and recommend a bin size to choose for the analysis. The method proves powerful and reliable in the analysis of simulated and actual data of single molecule reorientation in a glassy matrix. PMID:22837704
Irvine, Kathryn M.; Manlove, Kezia; Hollimon, Cynthia
2012-01-01
An important consideration for long term monitoring programs is determining the required sampling effort to detect trends in specific ecological indicators of interest. To enhance the Greater Yellowstone Inventory and Monitoring Network’s water resources protocol(s) (O’Ney 2006 and O’Ney et al. 2009 [under review]), we developed a set of tools to: (1) determine the statistical power for detecting trends of varying magnitude in a specified water quality parameter over different lengths of sampling (years) and different within-year collection frequencies (monthly or seasonal sampling) at particular locations using historical data, and (2) perform periodic trend analyses for water quality parameters while addressing seasonality and flow weighting. A power analysis for trend detection is a statistical procedure used to estimate the probability of rejecting the hypothesis of no trend when in fact there is a trend, within a specific modeling framework. In this report, we base our power estimates on using the seasonal Kendall test (Helsel and Hirsch 2002) for detecting trend in water quality parameters measured at fixed locations over multiple years. We also present procedures (R-scripts) for conducting a periodic trend analysis using the seasonal Kendall test with and without flow adjustment. This report provides the R-scripts developed for power and trend analysis, tutorials, and the associated tables and graphs. The purpose of this report is to provide practical information for monitoring network staff on how to use these statistical tools for water quality monitoring data sets.
2014-04-01
WRF ) model is a numerical weather prediction system designed for operational forecasting and atmospheric research. This report examined WRF model... WRF , weather research and forecasting, atmospheric effects 16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF ABSTRACT SAR 18. NUMBER OF...and Forecasting ( WRF ) model. The authors would also like to thank Ms. Sherry Larson, STS Systems Integration, LLC, ARL Technical Publishing Branch
A random-sum Wilcoxon statistic and its application to analysis of ROC and LROC data.
Tang, Liansheng Larry; Balakrishnan, N
2011-01-01
The Wilcoxon-Mann-Whitney statistic is commonly used for a distribution-free comparison of two groups. One requirement for its use is that the sample sizes of the two groups are fixed. This is violated in some of the applications such as medical imaging studies and diagnostic marker studies; in the former, the violation occurs since the number of correctly localized abnormal images is random, while in the latter the violation is due to some subjects not having observable measurements. For this reason, we propose here a random-sum Wilcoxon statistic for comparing two groups in the presence of ties, and derive its variance as well as its asymptotic distribution for large sample sizes. The proposed statistic includes the regular Wilcoxon rank-sum statistic. Finally, we apply the proposed statistic for summarizing location response operating characteristic data from a liver computed tomography study, and also for summarizing diagnostic accuracy of biomarker data.
NASA Astrophysics Data System (ADS)
Colone, L.; Hovgaard, M. K.; Glavind, L.; Brincker, R.
2018-07-01
A method for mass change detection on wind turbine blades using natural frequencies is presented. The approach is based on two statistical tests. The first test decides if there is a significant mass change and the second test is a statistical group classification based on Linear Discriminant Analysis. The frequencies are identified by means of Operational Modal Analysis using natural excitation. Based on the assumption of Gaussianity of the frequencies, a multi-class statistical model is developed by combining finite element model sensitivities in 10 classes of change location on the blade, the smallest area being 1/5 of the span. The method is experimentally validated for a full scale wind turbine blade in a test setup and loaded by natural wind. Mass change from natural causes was imitated with sand bags and the algorithm was observed to perform well with an experimental detection rate of 1, localization rate of 0.88 and mass estimation rate of 0.72.
Stress shadows - a controversial topic
NASA Astrophysics Data System (ADS)
Lasocki, Stanislaw; Karakostas, Vassilis G.; Papadimitriou, Eletheria E.; Orlecka-Sikora, Beata
2010-05-01
The spatial correlation between the positive Coulomb stress changes and the subsequent seismic activity has been firmly confirmed in many recent studies. If, however, the static stress transfer is a consistent expression of interaction between earthquakes one should also observe a decrease of the activity in the zones of negative stress changes. Instead, the existence of stress shadows is poorly evidenced and may be questioned. We tested the influence of the static stress changes associated with the coseismic slip of the 1995 Mw6.5 Kozani-Grevena (Greece) earthquake on locations of its aftershocks. The study was based on a detailed slip model for the main shock and accurate locations and reliable fault plane solutions of an adequate number of the aftershocks. We developed a statistical testing method, which tested whether the proportions of aftershocks located inside areas determined by a selected criterion on the static stress change could be attained if there were no effect of the stress change due to the main shock on aftershock locations. The areas of stress change were determined at the focus of every aftershock. The distribution of test statistic was constructed with the use of a two-dimensional nonparametric, kernel density estimator of the reference epicenter distribution. The tests highly confidently indicated a rise in probability to locate aftershocks inside areas of positive static stress change, which supported the hypothesis on the triggering effect in these areas. Furthermore, it was evidenced that a larger stress increase caused a stronger triggering effect. The analysis, however, did not evidence the existence of stress shadows inside areas of negative stress change. Contrary to expectations, the tests indicated a significant increase of the probability of event location in the areas of a stress decrease of more than or equal to 5.0 and 10.0 bar. It turned out that for areas of larger absolute stress change this probability increased regardless of the sign of the change though distinctly more in areas of positive than of negative change. In the case of seismicity accompanying underground mining exploitation the coseismic stress changes expressed in terms of the Coulomb failure function are at least of one order smaller than those for earthquakes. Furthermore, they are only a small component of the total stress field variations in mining rockmass, which are mainly controlled by the mining process. Nevertheless, our studies of the induced seismicity in the Rudna mine in the Legnica-Głogow Copper District in Poland showed that the influence of the Coulomb stress changes on locations of subsequent events was statistically significant. We analyzed series of seismic events quantifying the triggering and inhibiting effect by the proportion of events in the series whose locations were consistent with the stress increased and stress decreased zones, respectively. It was found out that more than 60 per-cent of the analyzed seismic events occurred in areas where stress was enhanced due to the occurrence of previous events. The significance of this result was determined by comparing it with 2000 results of the same analysis carried out on the random permutations of the original series of events. The test indicated that the locations in positive stress changes areas were preferred statistically significantly when the stress changes exceeded 0.05 bar. However, no statistically significant inhibiting effect of negative static stress changes, within the considered range of these changes, was ascertained. Here we present details of these two studies and discuss possible reasons behind the negative conclusions on the existence of stress shadows.
Wright, David K.; MacEachern, Scott; Lee, Jaeyong
2014-01-01
The locations of diy-geδ-bay (DGB) sites in the Mandara Mountains, northern Cameroon are hypothesized to occur as a function of their ability to see and be seen from points on the surrounding landscape. A series of geostatistical, two-way and Bayesian logistic regression analyses were performed to test two hypotheses related to the intervisibility of the sites to one another and their visual prominence on the landscape. We determine that the intervisibility of the sites to one another is highly statistically significant when compared to 10 stratified-random permutations of DGB sites. Bayesian logistic regression additionally demonstrates that the visibility of the sites to points on the surrounding landscape is statistically significant. The location of sites appears to have also been selected on the basis of lower slope than random permutations of sites. Using statistical measures, many of which are not commonly employed in archaeological research, to evaluate aspects of visibility on the landscape, we conclude that the placement of DGB sites improved their conspicuousness for enhanced ritual, social cooperation and/or competition purposes. PMID:25383883
Seabed mapping and characterization of sediment variability using the usSEABED data base
Goff, J.A.; Jenkins, C.J.; Jeffress, Williams S.
2008-01-01
We present a methodology for statistical analysis of randomly located marine sediment point data, and apply it to the US continental shelf portions of usSEABED mean grain size records. The usSEABED database, like many modern, large environmental datasets, is heterogeneous and interdisciplinary. We statistically test the database as a source of mean grain size data, and from it provide a first examination of regional seafloor sediment variability across the entire US continental shelf. Data derived from laboratory analyses ("extracted") and from word-based descriptions ("parsed") are treated separately, and they are compared statistically and deterministically. Data records are selected for spatial analysis by their location within sample regions: polygonal areas defined in ArcGIS chosen by geography, water depth, and data sufficiency. We derive isotropic, binned semivariograms from the data, and invert these for estimates of noise variance, field variance, and decorrelation distance. The highly erratic nature of the semivariograms is a result both of the random locations of the data and of the high level of data uncertainty (noise). This decorrelates the data covariance matrix for the inversion, and largely prevents robust estimation of the fractal dimension. Our comparison of the extracted and parsed mean grain size data demonstrates important differences between the two. In particular, extracted measurements generally produce finer mean grain sizes, lower noise variance, and lower field variance than parsed values. Such relationships can be used to derive a regionally dependent conversion factor between the two. Our analysis of sample regions on the US continental shelf revealed considerable geographic variability in the estimated statistical parameters of field variance and decorrelation distance. Some regional relationships are evident, and overall there is a tendency for field variance to be higher where the average mean grain size is finer grained. Surprisingly, parsed and extracted noise magnitudes correlate with each other, which may indicate that some portion of the data variability that we identify as "noise" is caused by real grain size variability at very short scales. Our analyses demonstrate that by applying a bias-correction proxy, usSEABED data can be used to generate reliable interpolated maps of regional mean grain size and sediment character.
Robustness of S1 statistic with Hodges-Lehmann for skewed distributions
NASA Astrophysics Data System (ADS)
Ahad, Nor Aishah; Yahaya, Sharipah Soaad Syed; Yin, Lee Ping
2016-10-01
Analysis of variance (ANOVA) is a common use parametric method to test the differences in means for more than two groups when the populations are normally distributed. ANOVA is highly inefficient under the influence of non- normal and heteroscedastic settings. When the assumptions are violated, researchers are looking for alternative such as Kruskal-Wallis under nonparametric or robust method. This study focused on flexible method, S1 statistic for comparing groups using median as the location estimator. S1 statistic was modified by substituting the median with Hodges-Lehmann and the default scale estimator with the variance of Hodges-Lehmann and MADn to produce two different test statistics for comparing groups. Bootstrap method was used for testing the hypotheses since the sampling distributions of these modified S1 statistics are unknown. The performance of the proposed statistic in terms of Type I error was measured and compared against the original S1 statistic, ANOVA and Kruskal-Wallis. The propose procedures show improvement compared to the original statistic especially under extremely skewed distribution.
Toward the modeling of land use change: A spatial analysis using remote sensing and historical data
NASA Technical Reports Server (NTRS)
Honea, R. B.
1976-01-01
It was hypothesized that the chronological observation of land use change could be shown to follow a predictable pattern and these patterns could be correlated with other statistical data to develop transition probabilities suitable for modeling purposes. A literature review and preliminary research, however, indicated a totally stochastic approach was not practical for simulating land use change and thus a more deterministic approach was adopted. The approach used assumes the determinants of the land use conversion process are found in the market place, where land transactions among buyers and sellers occur. Only one side of the market transaction process is studied, however, namely, the purchaser's desires in securing an ideal or suitable site. The problem was to identify the ideal qualities, quantities or attributes desired in an industrial site (or housing development), and to formulate a general algorithmic statement capable of identifying potential development sites. Research procedures involved developing a list of variables previously noted in the literature to be related to site selection and streamlining the list to a set suitable for statistical testing. A sample of 157 industries which have located (or relocated) in the 16-county Knoxville metropolitan region since 1950 was selected for industrial location analysis. Using NASA color infrared photography and Tennessee Valley Authority historical aerial photography, data were collected on the spatial characteristics of each industrial location event. These data were then subjected to factor analysis to determine the interrelations of variables.
Integrating SAS and GIS software to improve habitat-use estimates from radiotelemetry data
Kenow, K.P.; Wright, R.G.; Samuel, M.D.; Rasmussen, P.W.
2001-01-01
Radiotelemetry has been used commonly to remotely determine habitat use by a variety of wildlife species. However, habitat misclassification can occur because the true location of a radiomarked animal can only be estimated. Analytical methods that provide improved estimates of habitat use from radiotelemetry location data using a subsampling approach have been proposed previously. We developed software, based on these methods, to conduct improved habitat-use analyses. A Statistical Analysis System (SAS)-executable file generates a random subsample of points from the error distribution of an estimated animal location and formats the output into ARC/INFO-compatible coordinate and attribute files. An associated ARC/INFO Arc Macro Language (AML) creates a coverage of the random points, determines the habitat type at each random point from an existing habitat coverage, sums the number of subsample points by habitat type for each location, and outputs tile results in ASCII format. The proportion and precision of habitat types used is calculated from the subsample of points generated for each radiotelemetry location. We illustrate the method and software by analysis of radiotelemetry data for a female wild turkey (Meleagris gallopavo).
Kuhn-Tucker optimization based reliability analysis for probabilistic finite elements
NASA Technical Reports Server (NTRS)
Liu, W. K.; Besterfield, G.; Lawrence, M.; Belytschko, T.
1988-01-01
The fusion of probability finite element method (PFEM) and reliability analysis for fracture mechanics is considered. Reliability analysis with specific application to fracture mechanics is presented, and computational procedures are discussed. Explicit expressions for the optimization procedure with regard to fracture mechanics are given. The results show the PFEM is a very powerful tool in determining the second-moment statistics. The method can determine the probability of failure or fracture subject to randomness in load, material properties and crack length, orientation, and location.
Chang, Pao-Erh Paul; Yang, Jen-Chih Rena; Den, Walter; Wu, Chang-Fu
2014-09-01
Emissions of volatile organic compounds (VOCs) are most frequent environmental nuisance complaints in urban areas, especially where industrial districts are nearby. Unfortunately, identifying the responsible emission sources of VOCs is essentially a difficult task. In this study, we proposed a dynamic approach to gradually confine the location of potential VOC emission sources in an industrial complex, by combining multi-path open-path Fourier transform infrared spectrometry (OP-FTIR) measurement and the statistical method of principal component analysis (PCA). Close-cell FTIR was further used to verify the VOC emission source by measuring emitted VOCs from selected exhaust stacks at factories in the confined areas. Multiple open-path monitoring lines were deployed during a 3-month monitoring campaign in a complex industrial district. The emission patterns were identified and locations of emissions were confined by the wind data collected simultaneously. N,N-Dimethyl formamide (DMF), 2-butanone, toluene, and ethyl acetate with mean concentrations of 80.0 ± 1.8, 34.5 ± 0.8, 103.7 ± 2.8, and 26.6 ± 0.7 ppbv, respectively, were identified as the major VOC mixture at all times of the day around the receptor site. As the toxic air pollutant, the concentrations of DMF in air samples were found exceeding the ambient standard despite the path-average effect of OP-FTIR upon concentration levels. The PCA data identified three major emission sources, including PU coating, chemical packaging, and lithographic printing industries. Applying instrumental measurement and statistical modeling, this study has established a systematic approach for locating emission sources. Statistical modeling (PCA) plays an important role in reducing dimensionality of a large measured dataset and identifying underlying emission sources. Instrumental measurement, however, helps verify the outcomes of the statistical modeling. The field study has demonstrated the feasibility of using multi-path OP-FTIR measurement. The wind data incorporating with the statistical modeling (PCA) may successfully identify the major emission source in a complex industrial district.
Rodríguez-Arias, Miquel Angel; Rodó, Xavier
2004-03-01
Here we describe a practical, step-by-step primer to scale-dependent correlation (SDC) analysis. The analysis of transitory processes is an important but often neglected topic in ecological studies because only a few statistical techniques appear to detect temporary features accurately enough. We introduce here the SDC analysis, a statistical and graphical method to study transitory processes at any temporal or spatial scale. SDC analysis, thanks to the combination of conventional procedures and simple well-known statistical techniques, becomes an improved time-domain analogue of wavelet analysis. We use several simple synthetic series to describe the method, a more complex example, full of transitory features, to compare SDC and wavelet analysis, and finally we analyze some selected ecological series to illustrate the methodology. The SDC analysis of time series of copepod abundances in the North Sea indicates that ENSO primarily is the main climatic driver of short-term changes in population dynamics. SDC also uncovers some long-term, unexpected features in the population. Similarly, the SDC analysis of Nicholson's blowflies data locates where the proposed models fail and provides new insights about the mechanism that drives the apparent vanishing of the population cycle during the second half of the series.
2010-01-01
Background Malaria transmission is complex and is believed to be associated with local climate changes. However, simple attempts to extrapolate malaria incidence rates from averaged regional meteorological conditions have proven unsuccessful. Therefore, the objective of this study was to determine if variations in specific meteorological factors are able to consistently predict P. falciparum malaria incidence at different locations in south Ethiopia. Methods Retrospective data from 42 locations were collected including P. falciparum malaria incidence for the period of 1998-2007 and meteorological variables such as monthly rainfall (all locations), temperature (17 locations), and relative humidity (three locations). Thirty-five data sets qualified for the analysis. Ljung-Box Q statistics was used for model diagnosis, and R squared or stationary R squared was taken as goodness of fit measure. Time series modelling was carried out using Transfer Function (TF) models and univariate auto-regressive integrated moving average (ARIMA) when there was no significant predictor meteorological variable. Results Of 35 models, five were discarded because of the significant value of Ljung-Box Q statistics. Past P. falciparum malaria incidence alone (17 locations) or when coupled with meteorological variables (four locations) was able to predict P. falciparum malaria incidence within statistical significance. All seasonal AIRMA orders were from locations at altitudes above 1742 m. Monthly rainfall, minimum and maximum temperature was able to predict incidence at four, five and two locations, respectively. In contrast, relative humidity was not able to predict P. falciparum malaria incidence. The R squared values for the models ranged from 16% to 97%, with the exception of one model which had a negative value. Models with seasonal ARIMA orders were found to perform better. However, the models for predicting P. falciparum malaria incidence varied from location to location, and among lagged effects, data transformation forms, ARIMA and TF orders. Conclusions This study describes P. falciparum malaria incidence models linked with meteorological data. Variability in the models was principally attributed to regional differences, and a single model was not found that fits all locations. Past P. falciparum malaria incidence appeared to be a superior predictor than meteorology. Future efforts in malaria modelling may benefit from inclusion of non-meteorological factors. PMID:20553590
NASA Technical Reports Server (NTRS)
Manning, Robert M.
1990-01-01
A static and dynamic rain-attenuation model is presented which describes the statistics of attenuation on an arbitrarily specified satellite link for any location for which there are long-term rainfall statistics. The model may be used in the design of the optimal stochastic control algorithms to mitigate the effects of attenuation and maintain link reliability. A rain-statistics data base is compiled, which makes it possible to apply the model to any location in the continental U.S. with a resolution of 0-5 degrees in latitude and longitude. The model predictions are compared with experimental observations, showing good agreement.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ladd-Lively, Jennifer L
2014-01-01
The objective of this work was to determine the feasibility of using on-line multivariate statistical process control (MSPC) for safeguards applications in natural uranium conversion plants. Multivariate statistical process control is commonly used throughout industry for the detection of faults. For safeguards applications in uranium conversion plants, faults could include the diversion of intermediate products such as uranium dioxide, uranium tetrafluoride, and uranium hexafluoride. This study was limited to a 100 metric ton of uranium (MTU) per year natural uranium conversion plant (NUCP) using the wet solvent extraction method for the purification of uranium ore concentrate. A key component inmore » the multivariate statistical methodology is the Principal Component Analysis (PCA) approach for the analysis of data, development of the base case model, and evaluation of future operations. The PCA approach was implemented through the use of singular value decomposition of the data matrix where the data matrix represents normal operation of the plant. Component mole balances were used to model each of the process units in the NUCP. However, this approach could be applied to any data set. The monitoring framework developed in this research could be used to determine whether or not a diversion of material has occurred at an NUCP as part of an International Atomic Energy Agency (IAEA) safeguards system. This approach can be used to identify the key monitoring locations, as well as locations where monitoring is unimportant. Detection limits at the key monitoring locations can also be established using this technique. Several faulty scenarios were developed to test the monitoring framework after the base case or normal operating conditions of the PCA model were established. In all of the scenarios, the monitoring framework was able to detect the fault. Overall this study was successful at meeting the stated objective.« less
Seasonality of Kawasaki Disease: A Global Perspective
Burns, Jane C.; Herzog, Lauren; Fabri, Olivia; Tremoulet, Adriana H.; Rodó, Xavier; Uehara, Ritei; Burgner, David; Bainto, Emelia; Pierce, David; Tyree, Mary; Cayan, Daniel
2013-01-01
Background Understanding global seasonal patterns of Kawasaki disease (KD) may provide insight into the etiology of this vasculitis that is now the most common cause of acquired heart disease in children in developed countries worldwide. Methods Data from 1970-2012 from 25 countries distributed over the globe were analyzed for seasonality. The number of KD cases from each location was normalized to minimize the influence of greater numbers from certain locations. The presence of seasonal variation of KD at the individual locations was evaluated using three different tests: time series modeling, spectral analysis, and a Monte Carlo technique. Results A defined seasonal structure emerged demonstrating broad coherence in fluctuations in KD cases across the Northern Hemisphere extra-tropical latitudes. In the extra-tropical latitudes of the Northern Hemisphere, KD case numbers were highest in January through March and approximately 40% higher than in the months of lowest case numbers from August through October. Datasets were much sparser in the tropics and the Southern Hemisphere extra-tropics and statistical significance of the seasonality tests was weak, but suggested a maximum in May through June, with approximately 30% higher number of cases than in the least active months of February, March and October. The seasonal pattern in the Northern Hemisphere extra-tropics was consistent across the first and second halves of the sample period. Conclusion Using the first global KD time series, analysis of sites located in the Northern Hemisphere extra-tropics revealed statistically significant and consistent seasonal fluctuations in KD case numbers with high numbers in winter and low numbers in late summer and fall. Neither the tropics nor the Southern Hemisphere extra-tropics registered a statistically significant aggregate seasonal cycle. These data suggest a seasonal exposure to a KD agent that operates over large geographic regions and is concentrated during winter months in the Northern Hemisphere extra-tropics. PMID:24058585
Seasonality of Kawasaki disease: a global perspective.
Burns, Jane C; Herzog, Lauren; Fabri, Olivia; Tremoulet, Adriana H; Rodó, Xavier; Uehara, Ritei; Burgner, David; Bainto, Emelia; Pierce, David; Tyree, Mary; Cayan, Daniel
2013-01-01
Understanding global seasonal patterns of Kawasaki disease (KD) may provide insight into the etiology of this vasculitis that is now the most common cause of acquired heart disease in children in developed countries worldwide. Data from 1970-2012 from 25 countries distributed over the globe were analyzed for seasonality. The number of KD cases from each location was normalized to minimize the influence of greater numbers from certain locations. The presence of seasonal variation of KD at the individual locations was evaluated using three different tests: time series modeling, spectral analysis, and a Monte Carlo technique. A defined seasonal structure emerged demonstrating broad coherence in fluctuations in KD cases across the Northern Hemisphere extra-tropical latitudes. In the extra-tropical latitudes of the Northern Hemisphere, KD case numbers were highest in January through March and approximately 40% higher than in the months of lowest case numbers from August through October. Datasets were much sparser in the tropics and the Southern Hemisphere extra-tropics and statistical significance of the seasonality tests was weak, but suggested a maximum in May through June, with approximately 30% higher number of cases than in the least active months of February, March and October. The seasonal pattern in the Northern Hemisphere extra-tropics was consistent across the first and second halves of the sample period. Using the first global KD time series, analysis of sites located in the Northern Hemisphere extra-tropics revealed statistically significant and consistent seasonal fluctuations in KD case numbers with high numbers in winter and low numbers in late summer and fall. Neither the tropics nor the Southern Hemisphere extra-tropics registered a statistically significant aggregate seasonal cycle. These data suggest a seasonal exposure to a KD agent that operates over large geographic regions and is concentrated during winter months in the Northern Hemisphere extra-tropics.
NASA Astrophysics Data System (ADS)
Matsunaga, Kazunari; Seki, Kanako; Brain, David A.; Hara, Takuya; Masunaga, Kei; Mcfadden, James P.; Halekas, Jasper S.; Mitchell, David L.; Mazelle, Christian; Espley, J. R.; Gruesbeck, Jacob; Jakosky, Bruce M.
2017-09-01
Direct interaction between the solar wind (SW) and the Martian upper atmosphere forms a characteristic region, called the induced magnetosphere between the magnetosheath and the ionosphere. Since the SW deceleration due to increasing mass loading by heavy ions plays an important role in the induced magnetosphere formation, the ion composition is also expected to change around the induced magnetosphere boundary (IMB). Here we report on relations of the IMB, the ion composition boundary (ICB), and the pressure balance boundary based on a statistical analysis of about 8 months of simultaneous ion, electron, and magnetic field observations by Mars Atmosphere and Volatile EvolutioN (MAVEN) mission. We chose the period when MAVEN observed the SW directly near its apoapsis to investigate their dependence on SW parameters. Results show that IMBs almost coincide with ICBs on the dayside and locations of all three boundaries are affected by the SW dynamic pressure. A remarkable feature is that all boundaries tend to locate at higher altitudes in the southern hemisphere than in the northern hemisphere on the nightside. This clear geographical asymmetry is permanently seen regardless of locations of the strong crustal B fields in the southern hemisphere, while the boundary locations become higher when the crustal B fields locate on the dayside. On the nightside, IMBs usually locate at higher altitude than ICBs. However, ICBs are likely to be located above IMBs in the nightside, southern, and downward ESW hemisphere when the strong crustal B fields locate on the dayside.
Managing Student Loan Default Risk: Evidence from a Privately Guaranteed Portfolio.
ERIC Educational Resources Information Center
Monteverde, Kirk
2000-01-01
Application of the statistical techniques of survival analysis and credit scoring to private education loans extended to law students found a pronounced seasoning effect for such loans and the robust predictive power of credit bureau scoring of borrowers. Other predictors of default included school-of-attendance, school's geographic location, and…
Descriptive Analysis of Student Ratings
ERIC Educational Resources Information Center
Marasini, Donata; Quatto, Piero
2011-01-01
Let X be a statistical variable representing student ratings of University teaching. It is natural to assume for X an ordinal scale consisting of k categories (in ascending order of satisfaction). At first glance, student ratings can be summarized by a location index (such as the mode or the median of X) associated with a convenient measure of…
Saturation analysis of ChIP-seq data for reproducible identification of binding peaks
Hansen, Peter; Hecht, Jochen; Ibrahim, Daniel M.; Krannich, Alexander; Truss, Matthias; Robinson, Peter N.
2015-01-01
Chromatin immunoprecipitation coupled with next-generation sequencing (ChIP-seq) is a powerful technology to identify the genome-wide locations of transcription factors and other DNA binding proteins. Computational ChIP-seq peak calling infers the location of protein–DNA interactions based on various measures of enrichment of sequence reads. In this work, we introduce an algorithm, Q, that uses an assessment of the quadratic enrichment of reads to center candidate peaks followed by statistical analysis of saturation of candidate peaks by 5′ ends of reads. We show that our method not only is substantially faster than several competing methods but also demonstrates statistically significant advantages with respect to reproducibility of results and in its ability to identify peaks with reproducible binding site motifs. We show that Q has superior performance in the delineation of double RNAPII and H3K4me3 peaks surrounding transcription start sites related to a better ability to resolve individual peaks. The method is implemented in C+l+ and is freely available under an open source license. PMID:26163319
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gilbert, R O; Essington, E H; Brady, D N
Statistical design and analysis activities for the Nevada Applied Ecology Group (NAEG) during 1976 are briefly outlined. This is followed by a description of soil data collected thus far at nuclear study sites. Radionuclide concentrations in surface soil collected along a transect from ground zero (GZ) along the main fallout pattern are given for Nuclear Site (NS) 201. Concentrations in soil collected at 315 locations on a grid system at 200 foot spacings are also given for this site. The /sup 241/Am to /sup 137/Cs ratios change over NS 201 depending on location relative to GZ. They range from lessmore » than one where /sup 241/Am is at low levels, to more than fifty where /sup 241/Am levels are high (near GZ). The estimated median /sup 239/ /sup 240/Pu to /sup 241/Am ratio is 11 and appears to be relatively constant over the area (the 95 percent lower and upper limits on the true median ratio are about 8 and 14).« less
NASA Technical Reports Server (NTRS)
Shepherd, J. Marshall
2004-01-01
The study employs a 108-year precipitation data record to identify statistically significant anomalies in rainfall downwind of the Phoenix urban region. The analysis reveals that during the monsoon season locations northeastern suburbs and exurbs of the Phoenix metropolitan area have experienced statistically significant increases in mean precipitation of 12 to 14 percent from a pre-urban (1895-1949) to post-urban (1950-2003) period. Mean and median post-urban precipitation totals in the anomaly region are significantly greater, in the statistical sense, than regions west of the city and in nearby mountainous regions of similar or greater topography. Further analysis of satellite-based rainfall totals for the summer of 2003 also reveal the existence of the anomaly region during a severe drought period. The anomaly can not simply be attributed to maximum topographic relief and is hypothesize to be related to urban-topographic interactions.
Omoumi, Patrick; Babel, Hugo; Jolles, Brigitte M; Favre, Julien
2018-04-16
To test, through tridimensional analysis, whether (1) cartilage thickness at the posterior aspect of femoral condyles differs in knees with medial femorotibial osteoarthritis (OA) compared to non-OA knees; (2) the location of the thickest cartilage at the posterior aspect of femoral condyles differs between OA and non-OA knees. CT arthrograms of knees without radiographic OA (n = 30) and with severe medial femorotibial OA (n = 30) were selected retrospectively from patients over 50 years of age. The groups did not differ in gender, age and femoral size. CT arthrograms were segmented to measure the mean cartilage thickness, the maximal cartilage thickness and its location in a region of interest at the posterior aspect of condyles. For the medial condyle, mean and maximum cartilage thicknesses were statistically significantly higher in OA knees compared to non-OA knees [1.66 vs 1.46 mm (p = 0.03) and 2.56 vs 2.14 mm (p = 0.003), respectively]. The thickest cartilage was located in the half most medial aspect of the posterior medial condyle for both groups, without significant difference between groups. For the lateral condyle, no statistically significant difference between non-OA and OA knees was found (p ≥ 0.17). Cartilage at the posterior aspect of the medial condyle, but not the lateral condyle, is statistically significantly thicker in advanced medial femorotibial OA knees compared to non-OA knees. The thickest cartilage was located in the half most medial aspect of the posterior medial condyle. These results will serve as the basis for future research to determine the histobiological processes involved in this thicker cartilage. Advances in knowledge: This study, through a quantitative tridimensional approach, shows that cartilage at the posterior aspect of the medial condyles is thicker in severe femorotibial osteoarthritic knees compared to non-OA knees. In the posterior aspect of the medial condyle, the thickest cartilage is located in the vicinity of the center of the half most medial aspect of the posterior medial condyle. These results will serve as the basis for future research to determine the histobiological processes involved in this thicker cartilage.
Chen, Wen; Zhou, Fangjing; Hall, Brian J; Wang, Yu; Latkin, Carl; Ling, Li; Tucker, Joseph D
2016-01-01
Objectives To assess associations between residences location, risky sexual behaviours and sexually transmitted diseases (STDs) among adults living in Guangzhou, China. Methods Data were obtained from 751 Chinese adults aged 18–59 years in Guangzhou, China, using stratified random sampling by using spatial epidemiological methods. Face-to-face household interviews were conducted to collect self-report data on risky sexual behaviours and diagnosed STDs. Kulldorff’s spatial scan statistic was implemented to identify and detect spatial distribution and clusters of risky sexual behaviours and STDs. The presence and location of statistically significant clusters were mapped in the study areas using ArcGIS software. Results The prevalence of self-reported risky sexual behaviours was between 5.1% and 50.0%. The self-reported lifetime prevalence of diagnosed STDs was 7.06%. Anal intercourse clustered in an area located along the border within the rural–urban continuum (p=0.001). High rate clusters for alcohol or other drugs using before sex (p=0.008) and migrants who lived in Guangzhou <1 year (p=0.007) overlapped this cluster. Excess cases for unprotected sex (p=0.031) overlapped the cluster for college students (p<0.001). Five of nine (55.6%) students who had sexual experience during the last 12 months located in the cluster of unprotected sex. Conclusions Short-term migrants and college students reported greater risky sexual behaviours. Programmes to increase safer sex within these communities to reduce the risk of STDs are warranted in Guangzhou. Spatial analysis identified geographical clusters of risky sexual behaviours, which is critical for optimising surveillance and targeting control measures for these locations in the future. PMID:26843400
Missouri StreamStats—A water-resources web application
Ellis, Jarrett T.
2018-01-31
The U.S. Geological Survey (USGS) maintains and operates more than 8,200 continuous streamgages nationwide. Types of data that may be collected, computed, and stored for streamgages include streamgage height (water-surface elevation), streamflow, and water quality. The streamflow data allow scientists and engineers to calculate streamflow statistics, such as the 1-percent annual exceedance probability flood (also known as the 100-year flood), the mean flow, and the 7-day, 10-year low flow, which are used by managers to make informed water resource management decisions, at each streamgage location. Researchers, regulators, and managers also commonly need physical characteristics (basin characteristics) that describe the unique properties of a basin. Common uses for streamflow statistics and basin characteristics include hydraulic design, water-supply management, water-use appropriations, and flood-plain mapping for establishing flood-insurance rates and land-use zones. The USGS periodically publishes reports that update the values of basin characteristics and streamflow statistics at selected gaged locations (locations with streamgages), but these studies usually only update a subset of streamgages, making data retrieval difficult. Additionally, streamflow statistics and basin characteristics are most often needed at ungaged locations (locations without streamgages) for which published streamflow statistics and basin characteristics do not exist. Missouri StreamStats is a web-based geographic information system that was created by the USGS in cooperation with the Missouri Department of Natural Resources to provide users with access to an assortment of tools that are useful for water-resources planning and management. StreamStats allows users to easily obtain the most recent published streamflow statistics and basin characteristics for streamgage locations and to automatically calculate selected basin characteristics and estimate streamflow statistics at ungaged locations.
NASA Astrophysics Data System (ADS)
Abd-el-Malek, Mina; Abdelsalam, Ahmed K.; Hassan, Ola E.
2017-09-01
Robustness, low running cost and reduced maintenance lead Induction Motors (IMs) to pioneerly penetrate the industrial drive system fields. Broken rotor bars (BRBs) can be considered as an important fault that needs to be early assessed to minimize the maintenance cost and labor time. The majority of recent BRBs' fault diagnostic techniques focus on differentiating between healthy and faulty rotor cage. In this paper, a new technique is proposed for detecting the location of the broken bar in the rotor. The proposed technique relies on monitoring certain statistical parameters estimated from the analysis of the start-up stator current envelope. The envelope of the signal is obtained using Hilbert Transformation (HT). The proposed technique offers non-invasive, fast computational and accurate location diagnostic process. Various simulation scenarios are presented that validate the effectiveness of the proposed technique.
Luo, Li; Zhu, Yun
2012-01-01
Abstract The genome-wide association studies (GWAS) designed for next-generation sequencing data involve testing association of genomic variants, including common, low frequency, and rare variants. The current strategies for association studies are well developed for identifying association of common variants with the common diseases, but may be ill-suited when large amounts of allelic heterogeneity are present in sequence data. Recently, group tests that analyze their collective frequency differences between cases and controls shift the current variant-by-variant analysis paradigm for GWAS of common variants to the collective test of multiple variants in the association analysis of rare variants. However, group tests ignore differences in genetic effects among SNPs at different genomic locations. As an alternative to group tests, we developed a novel genome-information content-based statistics for testing association of the entire allele frequency spectrum of genomic variation with the diseases. To evaluate the performance of the proposed statistics, we use large-scale simulations based on whole genome low coverage pilot data in the 1000 Genomes Project to calculate the type 1 error rates and power of seven alternative statistics: a genome-information content-based statistic, the generalized T2, collapsing method, multivariate and collapsing (CMC) method, individual χ2 test, weighted-sum statistic, and variable threshold statistic. Finally, we apply the seven statistics to published resequencing dataset from ANGPTL3, ANGPTL4, ANGPTL5, and ANGPTL6 genes in the Dallas Heart Study. We report that the genome-information content-based statistic has significantly improved type 1 error rates and higher power than the other six statistics in both simulated and empirical datasets. PMID:22651812
Luo, Li; Zhu, Yun; Xiong, Momiao
2012-06-01
The genome-wide association studies (GWAS) designed for next-generation sequencing data involve testing association of genomic variants, including common, low frequency, and rare variants. The current strategies for association studies are well developed for identifying association of common variants with the common diseases, but may be ill-suited when large amounts of allelic heterogeneity are present in sequence data. Recently, group tests that analyze their collective frequency differences between cases and controls shift the current variant-by-variant analysis paradigm for GWAS of common variants to the collective test of multiple variants in the association analysis of rare variants. However, group tests ignore differences in genetic effects among SNPs at different genomic locations. As an alternative to group tests, we developed a novel genome-information content-based statistics for testing association of the entire allele frequency spectrum of genomic variation with the diseases. To evaluate the performance of the proposed statistics, we use large-scale simulations based on whole genome low coverage pilot data in the 1000 Genomes Project to calculate the type 1 error rates and power of seven alternative statistics: a genome-information content-based statistic, the generalized T(2), collapsing method, multivariate and collapsing (CMC) method, individual χ(2) test, weighted-sum statistic, and variable threshold statistic. Finally, we apply the seven statistics to published resequencing dataset from ANGPTL3, ANGPTL4, ANGPTL5, and ANGPTL6 genes in the Dallas Heart Study. We report that the genome-information content-based statistic has significantly improved type 1 error rates and higher power than the other six statistics in both simulated and empirical datasets.
An Automated Method for Navigation Assessment for Earth Survey Sensors Using Island Targets
NASA Technical Reports Server (NTRS)
Patt, F. S.; Woodward, R. H.; Gregg, W. W.
1997-01-01
An automated method has been developed for performing navigation assessment on satellite-based Earth sensor data. The method utilizes islands as targets which can be readily located in the sensor data and identified with reference locations. The essential elements are an algorithm for classifying the sensor data according to source, a reference catalogue of island locations, and a robust pattern-matching algorithm for island identification. The algorithms were developed and tested for the Sea-viewing Wide Field-of-view Sensor (SeaWiFS), an ocean colour sensor. This method will allow navigation error statistics to be automatically generated for large numbers of points, supporting analysis over large spatial and temporal ranges.
Automated navigation assessment for earth survey sensors using island targets
NASA Technical Reports Server (NTRS)
Patt, Frederick S.; Woodward, Robert H.; Gregg, Watson W.
1997-01-01
An automated method has been developed for performing navigation assessment on satellite-based Earth sensor data. The method utilizes islands as targets which can be readily located in the sensor data and identified with reference locations. The essential elements are an algorithm for classifying the sensor data according to source, a reference catalog of island locations, and a robust pattern-matching algorithm for island identification. The algorithms were developed and tested for the Sea-viewing Wide Field-of-view Sensor (SeaWiFS), an ocean color sensor. This method will allow navigation error statistics to be automatically generated for large numbers of points, supporting analysis over large spatial and temporal ranges.
NASA Astrophysics Data System (ADS)
Loftus, K.; Saar, S. H.
2017-12-01
NOAA's Space Weather Prediction Center publishes the current definitive public soft X-ray flare catalog, derived using data from the X-ray Sensor (XRS) on the Geostationary Operational Environmental Satellites (GOES) series. However, this flare list has shortcomings for use in scientific analysis. Its detection algorithm has drawbacks (missing smaller flux events and poorly characterizing complex ones), and its event timing is imprecise (peak and end times are frequently marked incorrectly, and hence peak fluxes are underestimated). It also lacks explicit and regular spatial location data. We present a new database, "The Where of the Flare" catalog, which improves upon the precision of NOAA's current version, with more consistent and accurate spatial locations, timings, and peak fluxes. Our catalog also offers several new parameters per flare (e.g. background flux, integrated flux). We use data from the GOES Solar X-ray Imager (SXI) for spatial flare locating. Our detection algorithm is more sensitive to smaller flux events close to the background level and more precisely marks flare start/peak/end times so that integrated flux can be accurately calculated. It also decomposes complex events (with multiple overlapping flares) by constituent peaks. The catalog dates from the operation of the first SXI instrument in 2003 until the present. We give an overview of the detection algorithm's design, review the catalog's features, and discuss preliminary statistical analyses of light curve morphology, complex event decomposition, and integrated flux distribution. The Where of the Flare catalog will be useful in studying X-ray flare statistics and correlating X-ray flare properties with other observations. This work was supported by Contract #8100002705 from Lockheed-Martin to SAO in support of the science of NASA's IRIS mission.
Survival Model for Foot and Leg High Rate Axial Impact Injury Data.
Bailey, Ann M; McMurry, Timothy L; Poplin, Gerald S; Salzar, Robert S; Crandall, Jeff R
2015-01-01
Understanding how lower extremity injuries from automotive intrusion and underbody blast (UBB) differ is of key importance when determining whether automotive injury criteria can be applied to blast rate scenarios. This article provides a review of existing injury risk analyses and outlines an approach to improve injury prediction for an expanded range of loading rates. This analysis will address issues with existing injury risk functions including inaccuracies due to inertial and potential viscous resistance at higher loading rates. This survival analysis attempts to minimize these errors by considering injury location statistics and a predictor variable selection process dependent upon failure mechanisms of bone. Distribution of foot/ankle/leg injuries induced by axial impact loading at rates characteristic of UBB as well as automotive intrusion was studied and calcaneus injuries were found to be the most common injury; thus, footplate force was chosen as the main predictor variable because of its proximity to injury location to prevent inaccuracies associated with inertial differences due to loading rate. A survival analysis was then performed with age, sex, dorsiflexion angle, and mass as covariates. This statistical analysis uses data from previous axial postmortem human surrogate (PMHS) component leg tests to provide perspectives on how proximal boundary conditions and loading rate affect injury probability in the foot/ankle/leg (n = 82). Tibia force-at-fracture proved to be up to 20% inaccurate in previous analyses because of viscous resistance and inertial effects within the data set used, suggesting that previous injury criteria are accurate only for specific rates of loading and boundary conditions. The statistical model presented in this article predicts 50% probability of injury for a plantar force of 10.2 kN for a 50th percentile male with a neutral ankle position. Force rate was found to be an insignificant covariate because of the limited range of loading rate differences within the data set; however, compensation for inertial effects caused by measuring the force-at-fracture in a location closer to expected injury location improved the model's predictive capabilities for the entire data set. This study provides better injury prediction capabilities for both automotive and blast rates because of reduced sensitivity to inertial effects and tibia-fibula load sharing. Further, a framework is provided for future injury criteria generation for high rate loading scenarios. This analysis also suggests key improvements to be made to existing anthropomorphic test device (ATD) lower extremities to provide accurate injury prediction for high rate applications such as UBB.
Comparison of two trajectory based models for locating particle sources for two rural New York sites
NASA Astrophysics Data System (ADS)
Zhou, Liming; Hopke, Philip K.; Liu, Wei
Two back trajectory-based statistical models, simplified quantitative transport bias analysis and residence-time weighted concentrations (RTWC) have been compared for their capabilities of identifying likely locations of source emissions contributing to observed particle concentrations at Potsdam and Stockton, New York. Quantitative transport bias analysis (QTBA) attempts to take into account the distribution of concentrations around the directions of the back trajectories. In full QTBA approach, deposition processes (wet and dry) are also considered. Simplified QTBA omits the consideration of deposition. It is best used with multiple site data. Similarly the RTWC approach uses concentrations measured at different sites along with the back trajectories to distribute the concentration contributions across the spatial domain of the trajectories. In this study, these models are used in combination with the source contribution values obtained by the previous positive matrix factorization analysis of particle composition data from Potsdam and Stockton. The six common sources for the two sites, sulfate, soil, zinc smelter, nitrate, wood smoke and copper smelter were analyzed. The results of the two methods are consistent and locate large and clearly defined sources well. RTWC approach can find more minor sources but may also give unrealistic estimations of the source locations.
Bonetti, Jennifer; Quarino, Lawrence
2014-05-01
This study has shown that the combination of simple techniques with the use of multivariate statistics offers the potential for the comparative analysis of soil samples. Five samples were obtained from each of twelve state parks across New Jersey in both the summer and fall seasons. Each sample was examined using particle-size distribution, pH analysis in both water and 1 M CaCl2 , and a loss on ignition technique. Data from each of the techniques were combined, and principal component analysis (PCA) and canonical discriminant analysis (CDA) were used for multivariate data transformation. Samples from different locations could be visually differentiated from one another using these multivariate plots. Hold-one-out cross-validation analysis showed error rates as low as 3.33%. Ten blind study samples were analyzed resulting in no misclassifications using Mahalanobis distance calculations and visual examinations of multivariate plots. Seasonal variation was minimal between corresponding samples, suggesting potential success in forensic applications. © 2014 American Academy of Forensic Sciences.
Bigdely-Shamlo, Nima; Mullen, Tim; Kreutz-Delgado, Kenneth; Makeig, Scott
2013-01-01
A crucial question for the analysis of multi-subject and/or multi-session electroencephalographic (EEG) data is how to combine information across multiple recordings from different subjects and/or sessions, each associated with its own set of source processes and scalp projections. Here we introduce a novel statistical method for characterizing the spatial consistency of EEG dynamics across a set of data records. Measure Projection Analysis (MPA) first finds voxels in a common template brain space at which a given dynamic measure is consistent across nearby source locations, then computes local-mean EEG measure values for this voxel subspace using a statistical model of source localization error and between-subject anatomical variation. Finally, clustering the mean measure voxel values in this locally consistent brain subspace finds brain spatial domains exhibiting distinguishable measure features and provides 3-D maps plus statistical significance estimates for each EEG measure of interest. Applied to sufficient high-quality data, the scalp projections of many maximally independent component (IC) processes contributing to recorded high-density EEG data closely match the projection of a single equivalent dipole located in or near brain cortex. We demonstrate the application of MPA to a multi-subject EEG study decomposed using independent component analysis (ICA), compare the results to k-means IC clustering in EEGLAB (sccn.ucsd.edu/eeglab), and use surrogate data to test MPA robustness. A Measure Projection Toolbox (MPT) plug-in for EEGLAB is available for download (sccn.ucsd.edu/wiki/MPT). Together, MPA and ICA allow use of EEG as a 3-D cortical imaging modality with near-cm scale spatial resolution. PMID:23370059
[Statistical analysis of articles in "Chinese journal of applied physiology" from 1999 to 2008].
Du, Fei; Fang, Tao; Ge, Xue-ming; Jin, Peng; Zhang, Xiao-hong; Sun, Jin-li
2010-05-01
To evaluate the academic level and influence of "Chinese Journal of Applied Physiology" through statistical analysis for the fund sponsored articles published in the recent ten years. The articles of "Chinese Journal of Applied Physiology" from 1999 to 2008 were investigated. The number and the percentage of the fund sponsored articles, the fund organization and the author region were quantitatively analyzed by using the literature metrology method. The number of the fund sponsored articles increased unceasingly. The ratio of the fund from local government significantly enhanced in the latter five years. Most of the articles were from institutes located at Beijing, Zhejiang and Tianjin. "Chinese Journal of Applied Physiology" has a fine academic level and social influence.
Ahmad, Sheikh Saeed; Aziz, Neelam; Butt, Amna; Shabbir, Rabia; Erum, Summra
2015-09-01
One of the features of medical geography that has made it so useful in health research is statistical spatial analysis, which enables the quantification and qualification of health events. The main objective of this research was to study the spatial distribution patterns of malaria in Rawalpindi district using spatial statistical techniques to identify the hot spots and the possible risk factor. Spatial statistical analyses were done in ArcGIS, and satellite images for land use classification were processed in ERDAS Imagine. Four hundred and fifty water samples were also collected from the study area to identify the presence or absence of any microbial contamination. The results of this study indicated that malaria incidence varied according to geographical location, with eco-climatic condition and showing significant positive spatial autocorrelation. Hotspots or location of clusters were identified using Getis-Ord Gi* statistic. Significant clustering of malaria incidence occurred in rural central part of the study area including Gujar Khan, Kaller Syedan, and some part of Kahuta and Rawalpindi Tehsil. Ordinary least square (OLS) regression analysis was conducted to analyze the relationship of risk factors with the disease cases. Relationship of different land cover with the disease cases indicated that malaria was more related with agriculture, low vegetation, and water class. Temporal variation of malaria cases showed significant positive association with the meteorological variables including average monthly rainfall and temperature. The results of the study further suggested that water supply and sewage system and solid waste collection system needs a serious attention to prevent any outbreak in the study area.
Across-cohort QC analyses of GWAS summary statistics from complex traits.
Chen, Guo-Bo; Lee, Sang Hong; Robinson, Matthew R; Trzaskowski, Maciej; Zhu, Zhi-Xiang; Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Kutalik, Zoltán; Loos, Ruth J F; Frayling, Timothy M; Hirschhorn, Joel N; Yang, Jian; Wray, Naomi R; Visscher, Peter M
2016-01-01
Genome-wide association studies (GWASs) have been successful in discovering SNP trait associations for many quantitative traits and common diseases. Typically, the effect sizes of SNP alleles are very small and this requires large genome-wide association meta-analyses (GWAMAs) to maximize statistical power. A trend towards ever-larger GWAMA is likely to continue, yet dealing with summary statistics from hundreds of cohorts increases logistical and quality control problems, including unknown sample overlap, and these can lead to both false positive and false negative findings. In this study, we propose four metrics and visualization tools for GWAMA, using summary statistics from cohort-level GWASs. We propose methods to examine the concordance between demographic information, and summary statistics and methods to investigate sample overlap. (I) We use the population genetics F st statistic to verify the genetic origin of each cohort and their geographic location, and demonstrate using GWAMA data from the GIANT Consortium that geographic locations of cohorts can be recovered and outlier cohorts can be detected. (II) We conduct principal component analysis based on reported allele frequencies, and are able to recover the ancestral information for each cohort. (III) We propose a new statistic that uses the reported allelic effect sizes and their standard errors to identify significant sample overlap or heterogeneity between pairs of cohorts. (IV) To quantify unknown sample overlap across all pairs of cohorts, we propose a method that uses randomly generated genetic predictors that does not require the sharing of individual-level genotype data and does not breach individual privacy.
Across-cohort QC analyses of GWAS summary statistics from complex traits
Chen, Guo-Bo; Lee, Sang Hong; Robinson, Matthew R; Trzaskowski, Maciej; Zhu, Zhi-Xiang; Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Kutalik, Zoltán; Loos, Ruth J F; Frayling, Timothy M; Hirschhorn, Joel N; Yang, Jian; Wray, Naomi R; Visscher, Peter M
2017-01-01
Genome-wide association studies (GWASs) have been successful in discovering SNP trait associations for many quantitative traits and common diseases. Typically, the effect sizes of SNP alleles are very small and this requires large genome-wide association meta-analyses (GWAMAs) to maximize statistical power. A trend towards ever-larger GWAMA is likely to continue, yet dealing with summary statistics from hundreds of cohorts increases logistical and quality control problems, including unknown sample overlap, and these can lead to both false positive and false negative findings. In this study, we propose four metrics and visualization tools for GWAMA, using summary statistics from cohort-level GWASs. We propose methods to examine the concordance between demographic information, and summary statistics and methods to investigate sample overlap. (I) We use the population genetics Fst statistic to verify the genetic origin of each cohort and their geographic location, and demonstrate using GWAMA data from the GIANT Consortium that geographic locations of cohorts can be recovered and outlier cohorts can be detected. (II) We conduct principal component analysis based on reported allele frequencies, and are able to recover the ancestral information for each cohort. (III) We propose a new statistic that uses the reported allelic effect sizes and their standard errors to identify significant sample overlap or heterogeneity between pairs of cohorts. (IV) To quantify unknown sample overlap across all pairs of cohorts, we propose a method that uses randomly generated genetic predictors that does not require the sharing of individual-level genotype data and does not breach individual privacy. PMID:27552965
Chhabra, Anmol; Quinn, Andrea; Ries, Amanda
2018-01-01
Accurate history collection is integral to medication reconciliation. Studies support pharmacy involvement in the process, but assessment of global time spent is limited. The authors hypothesized the location of a medication-focused interview would impact time spent. The objective was to compare time spent by pharmacists and nurses based on the location of a medication-focused interview. Time spent by the interviewing pharmacist, admitting nurse, and centralized pharmacist verifying admission orders was collected. Patient groups were based on whether the interview was conducted in the emergency department (ED) or medical floor. The primary end point was a composite of the 3 time points. Secondary end points were individual time components and number and types of transcription discrepancies identified during medical floor interviews. Pharmacists and nurses spent an average of ten fewer minutes per ED patient versus a medical floor patient ( P = .028). Secondary end points were not statistically significant. Transcription discrepancies were identified at a rate of 1 in 4 medications. Post hoc analysis revealed the time spent by pharmacists and nurses was 2.4 minutes shorter per medication when interviewed in the ED ( P < .001). The primary outcome was statistically and clinically significant. Limitations included inability to blind and lack of cost-saving analysis. Pharmacist involvement in ED medication reconciliation leads to time savings during the admission process.
Tzavidis, Nikos; Salvati, Nicola; Schmid, Timo; Flouri, Eirini; Midouhas, Emily
2016-02-01
Multilevel modelling is a popular approach for longitudinal data analysis. Statistical models conventionally target a parameter at the centre of a distribution. However, when the distribution of the data is asymmetric, modelling other location parameters, e.g. percentiles, may be more informative. We present a new approach, M -quantile random-effects regression, for modelling multilevel data. The proposed method is used for modelling location parameters of the distribution of the strengths and difficulties questionnaire scores of children in England who participate in the Millennium Cohort Study. Quantile mixed models are also considered. The analyses offer insights to child psychologists about the differential effects of risk factors on children's outcomes.
Geospatial Characterization of Fluvial Wood Arrangement in a Semi-confined Alluvial River
NASA Astrophysics Data System (ADS)
Martin, D. J.; Harden, C. P.; Pavlowsky, R. T.
2014-12-01
Large woody debris (LWD) has become universally recognized as an integral component of fluvial systems, and as a result, has become increasingly common as a river restoration tool. However, "natural" processes of wood recruitment and the subsequent arrangement of LWD within the river network are poorly understood. This research used a suite of spatial statistics to investigate longitudinal arrangement patterns of LWD in a low-gradient, Midwestern river. First, a large-scale GPS inventory of LWD, performed on the Big River in the eastern Missouri Ozarks, resulted in over 4,000 logged positions of LWD along seven river segments that covered nearly 100 km of the 237 km river system. A global Moran's I analysis indicates that LWD density is spatially autocorrelated and displays a clustering tendency within all seven river segments (P-value range = 0.000 to 0.054). A local Moran's I analysis identified specific locations along the segments where clustering occurs and revealed that, on average, clusters of LWD density (high or low) spanned 400 m. Spectral analyses revealed that, in some segments, LWD density is spatially periodic. Two segments displayed strong periodicity, while the remaining segments displayed varying degrees of noisiness. Periodicity showed a positive association with gravel bar spacing and meander wavelength, although there were insufficient data to statistically confirm the relationship. A wavelet analysis was then performed to investigate periodicity relative to location along the segment. The wavelet analysis identified significant (α = 0.05) periodicity at discrete locations along each of the segments. Those reaches yielding strong periodicity showed stronger relationships between LWD density and the geomorphic/riparian independent variables tested. Analyses consistently identified valley width and sinuosity as being associated with LWD density. The results of these analyses contribute a new perspective on the longitudinal distribution of LWD in a river system, which should help identify physical and/or riparian control mechanisms of LWD arrangement and support the development of models of LWD arrangement. Additionally, the spatial statistical tools presented here have shown to be valuable for identifying longitudinal patterns in river system components.
Yongqiang Liu
2003-01-01
It was suggested in a recent statistical correlation analysis that predictability of monthly-seasonal precipitation could be improved by using coupled singular value decomposition (SVD) pattems between soil moisture and precipitation instead of their values at individual locations. This study provides predictive evidence for this suggestion by comparing skills of two...
This SOP describes the methods and procedures for two types of QA procedures: spot checks of hand entered data, and QA procedures for co-located and split samples. The spot checks were used to determine whether the error rate goal for the input of hand entered data was being att...
William H. McWilliams; Stanford L. Arner; Charles J. Barnett
1997-01-01
The USDA Forest Service's Forest Inventory and Analysis (FIA) program and the Forest Health Monitoring (FHM) program maintain networks of sample locations providing coarse-scale information that characterize general indicators of forest health. Tree mortality is the primary FIA variable for analyzing forest health. Recent FIA inventories of New York, Pennsylvania...
Geographic analysis of forest health indicators using spatial scan statistics
John W. Coulston; Kurt H. Riitters
2003-01-01
Forest health analysts seek to define the location, extent, and magnitude of changes in forest ecosystems, to explain the observed changes when possible, and to draw attention to the unexplained changes for further investigation. The data come from a variety of sources including satellite images, field plot measurements, and low-altitude aerial surveys. Indicators...
Raidullah, Ebadullah; Francis, Maria L.
2014-01-01
Objectives: This study aimed to evaluate the accuracy of Root ZX in determining working length in presence of normal saline, 0.2% chlorhexidine and 2.5% of sodium hypochlorite. Material and Methods: Sixty extracted, single rooted, single canal human teeth were used. Teeth were decoronated at CEJ and actual canal length determined. Then working length measurements were obtained with Root ZX in presence of normal saline 0.9%, 0.2% chlorhexidine and 2.5% NaOCl. The working length obtained with Root ZX were compared with actual canal length and subjected to statistical analysis. Results: No statistical significant difference was found between actual canal length and Root ZX measurements in presence of normal saline and 0.2% chlorhexidine. Highly statistical difference was found between actual canal length and Root ZX measurements in presence of 2.5% of NaOCl, however all the measurements were within the clinically acceptable range of ±0.5mm. Conclusion: The accuracy of EL measurement of Root ZX within±0.5 mm of AL was consistently high in the presence of 0.2% chlorhexidine, normal saline and 2.5% sodium hypochlorite. Clinical significance: This study signifies the efficacy of ROOT ZX (Third generation apex locator) as a dependable aid in endodontic working length. Key words:Electronic apex locator, working length, root ZX accuracy, intracanal irrigating solutions. PMID:24596634
Accounting for measurement error: a critical but often overlooked process.
Harris, Edward F; Smith, Richard N
2009-12-01
Due to instrument imprecision and human inconsistencies, measurements are not free of error. Technical error of measurement (TEM) is the variability encountered between dimensions when the same specimens are measured at multiple sessions. A goal of a data collection regimen is to minimise TEM. The few studies that actually quantify TEM, regardless of discipline, report that it is substantial and can affect results and inferences. This paper reviews some statistical approaches for identifying and controlling TEM. Statistically, TEM is part of the residual ('unexplained') variance in a statistical test, so accounting for TEM, which requires repeated measurements, enhances the chances of finding a statistically significant difference if one exists. The aim of this paper was to review and discuss common statistical designs relating to types of error and statistical approaches to error accountability. This paper addresses issues of landmark location, validity, technical and systematic error, analysis of variance, scaled measures and correlation coefficients in order to guide the reader towards correct identification of true experimental differences. Researchers commonly infer characteristics about populations from comparatively restricted study samples. Most inferences are statistical and, aside from concerns about adequate accounting for known sources of variation with the research design, an important source of variability is measurement error. Variability in locating landmarks that define variables is obvious in odontometrics, cephalometrics and anthropometry, but the same concerns about measurement accuracy and precision extend to all disciplines. With increasing accessibility to computer-assisted methods of data collection, the ease of incorporating repeated measures into statistical designs has improved. Accounting for this technical source of variation increases the chance of finding biologically true differences when they exist.
NASA Astrophysics Data System (ADS)
Ghannadpour, Seyyed Saeed; Hezarkhani, Ardeshir
2016-03-01
The U-statistic method is one of the most important structural methods to separate the anomaly from the background. It considers the location of samples and carries out the statistical analysis of the data without judging from a geochemical point of view and tries to separate subpopulations and determine anomalous areas. In the present study, to use U-statistic method in three-dimensional (3D) condition, U-statistic is applied on the grade of two ideal test examples, by considering sample Z values (elevation). So far, this is the first time that this method has been applied on a 3D condition. To evaluate the performance of 3D U-statistic method and in order to compare U-statistic with one non-structural method, the method of threshold assessment based on median and standard deviation (MSD method) is applied on the two example tests. Results show that the samples indicated by U-statistic method as anomalous are more regular and involve less dispersion than those indicated by the MSD method. So that, according to the location of anomalous samples, denser areas of them can be determined as promising zones. Moreover, results show that at a threshold of U = 0, the total error of misclassification for U-statistic method is much smaller than the total error of criteria of bar {x}+n× s. Finally, 3D model of two test examples for separating anomaly from background using 3D U-statistic method is provided. The source code for a software program, which was developed in the MATLAB programming language in order to perform the calculations of the 3D U-spatial statistic method, is additionally provided. This software is compatible with all the geochemical varieties and can be used in similar exploration projects.
How to inhibit a distractor location? Statistical learning versus active, top-down suppression.
Wang, Benchi; Theeuwes, Jan
2018-05-01
Recently, Wang and Theeuwes (Journal of Experimental Psychology: Human Perception and Performance, 44(1), 13-17, 2018a) demonstrated the role of lingering selection biases in an additional singleton search task in which the distractor singleton appeared much more often in one location than in all other locations. For this location, there was less capture and selection efficiency was reduced. It was argued that statistical learning induces plasticity within the spatial priority map such that particular locations that are high likely to contain a distractor are suppressed relative to all other locations. The current study replicated these findings regarding statistical learning (Experiment 1) and investigated whether similar effects can be obtained by cueing the distractor location in a top-down way on a trial-by-trial basis. The results show that top-down cueing of the distractor location with long (1,500 ms; Experiment 2) and short stimulus-onset symmetries (SOAs) (600 ms; Experiment 3) does not result in suppression: The amount of capture nor the efficiency of selection was affected by the cue. If anything, we found an attentional benefit (instead of the suppression) for the short SOA. We argue that through statistical learning, weights within the attentional priority map are changed such that one location containing a salient distractor is suppressed relative to all other locations. Our cueing experiments show that this effect cannot be accomplished by active, top-down suppression. Consequences for recent theories of distractor suppression are discussed.
ERIC Educational Resources Information Center
Ministerio de Educacion, Guatemala City (Guatemala). Oficina de Planeamiento Integral de la Educacion.
This booklet presents statistics concerning primary education in Guatemala. The first section covers enrollment, considering such factors as type of school and location. Other sections provide statistics on teachers, their locations, the number of schools, enrollment in terms of students repeating grades or leaving school, students advancing out…
Rescaled earthquake recurrence time statistics: application to microrepeaters
NASA Astrophysics Data System (ADS)
Goltz, Christian; Turcotte, Donald L.; Abaimov, Sergey G.; Nadeau, Robert M.; Uchida, Naoki; Matsuzawa, Toru
2009-01-01
Slip on major faults primarily occurs during `characteristic' earthquakes. The recurrence statistics of characteristic earthquakes play an important role in seismic hazard assessment. A major problem in determining applicable statistics is the short sequences of characteristic earthquakes that are available worldwide. In this paper, we introduce a rescaling technique in which sequences can be superimposed to establish larger numbers of data points. We consider the Weibull and log-normal distributions, in both cases we rescale the data using means and standard deviations. We test our approach utilizing sequences of microrepeaters, micro-earthquakes which recur in the same location on a fault. It seems plausible to regard these earthquakes as a miniature version of the classic characteristic earthquakes. Microrepeaters are much more frequent than major earthquakes, leading to longer sequences for analysis. In this paper, we present results for the analysis of recurrence times for several microrepeater sequences from Parkfield, CA as well as NE Japan. We find that, once the respective sequence can be considered to be of sufficient stationarity, the statistics can be well fitted by either a Weibull or a log-normal distribution. We clearly demonstrate this fact by our technique of rescaled combination. We conclude that the recurrence statistics of the microrepeater sequences we consider are similar to the recurrence statistics of characteristic earthquakes on major faults.
Statistical Analysis of 30 Years Rainfall Data: A Case Study
NASA Astrophysics Data System (ADS)
Arvind, G.; Ashok Kumar, P.; Girish Karthi, S.; Suribabu, C. R.
2017-07-01
Rainfall is a prime input for various engineering design such as hydraulic structures, bridges and culverts, canals, storm water sewer and road drainage system. The detailed statistical analysis of each region is essential to estimate the relevant input value for design and analysis of engineering structures and also for crop planning. A rain gauge station located closely in Trichy district is selected for statistical analysis where agriculture is the prime occupation. The daily rainfall data for a period of 30 years is used to understand normal rainfall, deficit rainfall, Excess rainfall and Seasonal rainfall of the selected circle headquarters. Further various plotting position formulae available is used to evaluate return period of monthly, seasonally and annual rainfall. This analysis will provide useful information for water resources planner, farmers and urban engineers to assess the availability of water and create the storage accordingly. The mean, standard deviation and coefficient of variation of monthly and annual rainfall was calculated to check the rainfall variability. From the calculated results, the rainfall pattern is found to be erratic. The best fit probability distribution was identified based on the minimum deviation between actual and estimated values. The scientific results and the analysis paved the way to determine the proper onset and withdrawal of monsoon results which were used for land preparation and sowing.
NASA Astrophysics Data System (ADS)
Chakraborthy, Parthasarathi; Chattopadhyay, Surajit
2013-02-01
Endeavor of the present paper is to investigate the statistical properties of the total ozone concentration time series over Arosa, Switzerland (9.68°E, 46.78°N). For this purpose, different statistical data analysis procedures have been employed for analyzing the mean monthly total ozone concentration data, collected over a period of 40 years (1932-1971), at the above location. Based on the computations on the available data set, the study reports different degrees of variations in different months. The month of July is reported as the month of lowest variability. April and May are found to be the most correlated months with respect to total ozone concentration.
Statistical analysis of the calibration procedure for personnel radiation measurement instruments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bush, W.J.; Bengston, S.J.; Kalbeitzer, F.L.
1980-11-01
Thermoluminescent analyzer (TLA) calibration procedures were used to estimate personnel radiation exposure levels at the Idaho National Engineering Laboratory (INEL). A statistical analysis is presented herein based on data collected over a six month period in 1979 on four TLA's located in the Department of Energy (DOE) Radiological and Environmental Sciences Laboratory at the INEL. The data were collected according to the day-to-day procedure in effect at that time. Both gamma and beta radiation models are developed. Observed TLA readings of thermoluminescent dosimeters are correlated with known radiation levels. This correlation is then used to predict unknown radiation doses frommore » future analyzer readings of personnel thermoluminescent dosimeters. The statistical techniques applied in this analysis include weighted linear regression, estimation of systematic and random error variances, prediction interval estimation using Scheffe's theory of calibration, the estimation of the ratio of the means of two normal bivariate distributed random variables and their corresponding confidence limits according to Kendall and Stuart, tests of normality, experimental design, a comparison between instruments, and quality control.« less
A Principal Component Analysis/Fuzzy Comprehensive Evaluation for Rockburst Potential in Kimberlite
NASA Astrophysics Data System (ADS)
Pu, Yuanyuan; Apel, Derek; Xu, Huawei
2018-02-01
Kimberlite is an igneous rock which sometimes bears diamonds. Most of the diamonds mined in the world today are found in kimberlite ores. Burst potential in kimberlite has not been investigated, because kimberlite is mostly mined using open-pit mining, which poses very little threat of rock bursting. However, as the mining depth keeps increasing, the mines convert to underground mining methods, which can pose a threat of rock bursting in kimberlite. This paper focuses on the burst potential of kimberlite at a diamond mine in northern Canada. A combined model with the methods of principal component analysis (PCA) and fuzzy comprehensive evaluation (FCE) is developed to process data from 12 different locations in kimberlite pipes. Based on calculated 12 fuzzy evaluation vectors, 8 locations show a moderate burst potential, 2 locations show no burst potential, and 2 locations show strong and violent burst potential, respectively. Using statistical principles, a Mahalanobis distance is adopted to build a comprehensive fuzzy evaluation vector for the whole mine and the final evaluation for burst potential is moderate, which is verified by a practical rockbursting situation at mine site.
Educational debt: does it have an influence on initial job location and specialty choice?
Snyder, Jennifer; Nehrenz, Guy; Danielsen, Randy; Pedersen, Donald
2014-01-01
This study applied a quantitative design and analyzed the impact of educational debt on initial specialty and location choices for physician assistant (PA) graduates in Indiana. PAs who graduated between January 1, 2000, and December 31, 2010, and actively practice in Indiana were surveyed. Descriptive statistics and chi-square analyses were performed to determine whether any significant relationships existed among practice specialty, location, and gender. 157 participants (33%) responded to the survey and were considered in the final analysis. Males were more likely than females to be influenced by debt in choosing their specialty and the location of their initial job. A majority of PAs would have reconsidered rural practice if they had received federal and or state loan forgiveness for educational debt. This study provides evidence that debt may influence practice specialty and location choice. Further studies are needed to determine how gender might account for decisions to practice in certain specialties and location.
Indelicato, Serena; Bongiorno, David; Tuzzolino, Nicola; Mannino, Maria Rosaria; Muscarella, Rosalia; Fradella, Pasquale; Gargano, Maria Elena; Nicosia, Salvatore; Ceraulo, Leopoldo
2018-03-14
Multivariate analysis was performed on a large data set of groundwater and leachate samples collected during 9 years of operation of the Bellolampo municipal solid waste landfill (located above Palermo, Italy). The aim was to obtain the most likely correlations among the data. The analysis results are presented. Groundwater samples were collected in the period 2004-2013, whereas the leachate analysis refers to the period 2006-2013. For groundwater, statistical data evaluation revealed notable differences among the samples taken from the numerous wells located around the landfill. Characteristic parameters revealed by principal component analysis (PCA) were more deeply investigated, and corresponding thematic maps were drawn. The composition of the leachate was also thoroughly investigated. Several chemical macro-descriptors were calculated, and the results are presented. A comparison of PCA results for the leachate and groundwater data clearly reveals that the groundwater's main components substantially differ from those of the leachate. This outcome strongly suggests excluding leachate permeation through the multiple landfill lining.
Nauru Island Effect Detection Data Set
Long, Chuck
2010-07-15
During Nauru99 it was noted that the island was producing small clouds that advected over the ARM site. The Nauru Island Effect Study was run for 1.5 years and the methodology developed to detect the occurrence. Nauru ACRF downwelling SW, wind direction, and air temperature data are used, along with downwelling SW data from Licor radiometers located on the southern end of the island near the airport landing strip. A statistical analysis and comparison of data from the two locations is used to detect the likely occurrence of an island influence on the Nauru ACRF site data
Koprivica, Mladen; Slavkovic, Vladimir; Neskovic, Natasa; Neskovic, Aleksandar
2016-03-01
As a result of dense deployment of public mobile base stations, additional electromagnetic (EM) radiation occurs in the modern human environment. At the same time, public concern about the exposure to EM radiation emitted by such sources has increased. In order to determine the level of radio frequency radiation generated by base stations, extensive EM field strength measurements were carried out for 664 base station locations, from which 276 locations refer to the case of base stations with antenna system installed on buildings. Having in mind the large percentage (42 %) of locations with installations on buildings, as well as the inevitable presence of people in their vicinity, a detailed analysis of this location category was performed. Measurement results showed that the maximum recorded value of total electric field strength has exceeded International Commission on Non-Ionizing Radiation Protection general public exposure reference levels at 2.5 % of locations and Serbian national reference levels at 15.6 % of locations. It should be emphasised that the values exceeding the reference levels were observed only outdoor, while in indoor total electric field strength in no case exceeded the defined reference levels. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Oregon ground-water quality and its relation to hydrogeological factors; a statistical approach
Miller, T.L.; Gonthier, J.B.
1984-01-01
An appraisal of Oregon ground-water quality was made using existing data accessible through the U.S. Geological Survey computer system. The data available for about 1,000 sites were separated by aquifer units and hydrologic units. Selected statistical moments were described for 19 constituents including major ions. About 96 percent of all sites in the data base were sampled only once. The sample data were classified by aquifer unit and hydrologic unit and analysis of variance was run to determine if significant differences exist between the units within each of these two classifications for the same 19 constituents on which statistical moments were determined. Results of the analysis of variance indicated both classification variables performed about the same, but aquifer unit did provide more separation for some constituents. Samples from the Rogue River basin were classified by location within the flow system and type of flow system. The samples were then analyzed using analysis of variance on 14 constituents to determine if there were significant differences between subsets classified by flow path. Results of this analysis were not definitive, but classification as to the type of flow system did indicate potential for segregating water-quality data into distinct subsets. (USGS)
Water quality analysis of the Rapur area, Andhra Pradesh, South India using multivariate techniques
NASA Astrophysics Data System (ADS)
Nagaraju, A.; Sreedhar, Y.; Thejaswi, A.; Sayadi, Mohammad Hossein
2017-10-01
The groundwater samples from Rapur area were collected from different sites to evaluate the major ion chemistry. The large number of data can lead to difficulties in the integration, interpretation, and representation of the results. Two multivariate statistical methods, hierarchical cluster analysis (HCA) and factor analysis (FA), were applied to evaluate their usefulness to classify and identify geochemical processes controlling groundwater geochemistry. Four statistically significant clusters were obtained from 30 sampling stations. This has resulted two important clusters viz., cluster 1 (pH, Si, CO3, Mg, SO4, Ca, K, HCO3, alkalinity, Na, Na + K, Cl, and hardness) and cluster 2 (EC and TDS) which are released to the study area from different sources. The application of different multivariate statistical techniques, such as principal component analysis (PCA), assists in the interpretation of complex data matrices for a better understanding of water quality of a study area. From PCA, it is clear that the first factor (factor 1), accounted for 36.2% of the total variance, was high positive loading in EC, Mg, Cl, TDS, and hardness. Based on the PCA scores, four significant cluster groups of sampling locations were detected on the basis of similarity of their water quality.
Changing viewer perspectives reveals constraints to implicit visual statistical learning.
Jiang, Yuhong V; Swallow, Khena M
2014-10-07
Statistical learning-learning environmental regularities to guide behavior-likely plays an important role in natural human behavior. One potential use is in search for valuable items. Because visual statistical learning can be acquired quickly and without intention or awareness, it could optimize search and thereby conserve energy. For this to be true, however, visual statistical learning needs to be viewpoint invariant, facilitating search even when people walk around. To test whether implicit visual statistical learning of spatial information is viewpoint independent, we asked participants to perform a visual search task from variable locations around a monitor placed flat on a stand. Unbeknownst to participants, the target was more often in some locations than others. In contrast to previous research on stationary observers, visual statistical learning failed to produce a search advantage for targets in high-probable regions that were stable within the environment but variable relative to the viewer. This failure was observed even when conditions for spatial updating were optimized. However, learning was successful when the rich locations were referenced relative to the viewer. We conclude that changing viewer perspective disrupts implicit learning of the target's location probability. This form of learning shows limited integration with spatial updating or spatiotopic representations. © 2014 ARVO.
Mechanical properties of silicate glasses exposed to a low-Earth orbit
NASA Technical Reports Server (NTRS)
Wiedlocher, David E.; Tucker, Dennis S.; Nichols, Ron; Kinser, Donald L.
1992-01-01
The effects of a 5.8 year exposure to low earth orbit environment upon the mechanical properties of commercial optical fused silica, low iron soda-lime-silica, Pyrex 7740, Vycor 7913, BK-7, and the glass ceramic Zerodur were examined. Mechanical testing employed the ASTM-F-394 piston on 3-ball method in a liquid nitrogen environment. Samples were exposed on the Long Duration Exposure Facility (LDEF) in two locations. Impacts were observed on all specimens except Vycor. Weibull analysis as well as a standard statistical evaluation were conducted. The Weibull analysis revealed no differences between control samples and the two exposed samples. We thus concluded that radiation components of the Earth orbital environment did not degrade the mechanical strength of the samples examined within the limits of experimental error. The upper bound of strength degradation for meteorite impacted samples based upon statistical analysis and observation was 50 percent.
Urban Transmission of American Cutaneous Leishmaniasis in Argentina: Spatial Analysis Study
Gil, José F.; Nasser, Julio R.; Cajal, Silvana P.; Juarez, Marisa; Acosta, Norma; Cimino, Rubén O.; Diosque, Patricio; Krolewiecki, Alejandro J.
2010-01-01
We used kernel density and scan statistics to examine the spatial distribution of cases of pediatric and adult American cutaneous leishmaniasis in an urban disease-endemic area in Salta Province, Argentina. Spatial analysis was used for the whole population and stratified by women > 14 years of age (n = 159), men > 14 years of age (n = 667), and children < 15 years of age (n = 213). Although kernel density for adults encompassed nearly the entire city, distribution in children was most prevalent in the peripheral areas of the city. Scan statistic analysis for adult males, adult females, and children found 11, 2, and 8 clusters, respectively. Clusters for children had the highest odds ratios (P < 0.05) and were located in proximity of plantations and secondary vegetation. The data from this study provide further evidence of the potential urban transmission of American cutaneous leishmaniasis in northern Argentina. PMID:20207869
Trend-surface analysis of morphometric parameters: A case study in southeastern Brazil
NASA Astrophysics Data System (ADS)
Grohmann, Carlos Henrique
2005-10-01
Trend-surface analysis was carried out on data from morphometric parameters isobase and hydraulic gradient. The study area, located in the eastern border of Quadrilátero Ferrífero, southeastern Brazil, presents four main geomorphological units, one characterized by fluvial dissection, two of mountainous relief, with a scarp of hundreds of meters of fall between them, and a flat plateau in the central portion of the fluvially dissected terrains. Morphometric maps were evaluated in GRASS-GIS and statistics were made on R statistical language, using the spatial package. Analysis of variance (ANOVA) was made to test the significance of each surface and the significance of increasing polynomial degree. The best results were achieved with sixth-order surface for isobase and second-order surface for hydraulic gradient. Shape and orientation of residual maps contours for selected trends were compared with structures inferred from several morphometric maps, and a good correlation is present.
ERIC Educational Resources Information Center
Landsberger, Betty H.
To locate possible causes for the gender and race differences observed in adolescent health status, an analysis was made of the relationship between the scores of a national sample of 12- to 17-year-old adolescents on selected items of the National Center for Health Statistics' Health Examination Survey. Thirty survey items indicating social…
Environmental Studies: Mathematical, Computational and Statistical Analyses
1993-03-03
mathematical analysis addresses the seasonally and longitudinally averaged circulation which is under the influence of a steady forcing located asymmetrically...employed, as has been suggested for some situations. A general discussion of how interfacial phenomena influence both the original contamination process...describing the large-scale advective and dispersive behaviour of contaminants transported by groundwater and the uncertainty associated with field-scale
Parallel Climate Data Assimilation PSAS Package Achieves 18 GFLOPs on 512-Node Intel Paragon
NASA Technical Reports Server (NTRS)
Ding, H. Q.; Chan, C.; Gennery, D. B.; Ferraro, R. D.
1995-01-01
Several algorithms were added to the Physical-space Statistical Analysis System (PSAS) from Goddard, which assimilates observational weather data by correcting for different levels of uncertainty about the data and different locations for mobile observation platforms. The new algorithms and use of the 512-node Intel Paragon allowed a hundred-fold decrease in processing time.
A Heat Vulnerability Index and Adaptation Solutions for Pittsburgh, Pennsylvania.
Bradford, Kathryn; Abrahams, Leslie; Hegglin, Miriam; Klima, Kelly
2015-10-06
With increasing evidence of global warming, many cities have focused attention on response plans to address their populations' vulnerabilities. Despite expected increased frequency and intensity of heat waves, the health impacts of such events in urban areas can be minimized with careful policy and economic investments. We focus on Pittsburgh, Pennsylvania and ask two questions. First, what are the top factors contributing to heat vulnerability and how do these characteristics manifest geospatially throughout Pittsburgh? Second, assuming the City wishes to deploy additional cooling centers, what placement will optimally address the vulnerability of the at risk populations? We use national census data, ArcGIS geospatial modeling, and statistical analysis to determine a range of heat vulnerability indices and optimal cooling center placement. We find that while different studies use different data and statistical calculations, all methods tested locate additional cooling centers at the confluence of the three rivers (Downtown), the northeast side of Pittsburgh (Shadyside/Highland Park), and the southeast side of Pittsburgh (Squirrel Hill). This suggests that for Pittsburgh, a researcher could apply the same factor analysis procedure to compare data sets for different locations and times; factor analyses for heat vulnerability are more robust than previously thought.
Trend analysis of annual precipitation of Mauritius for the period 1981-2010
NASA Astrophysics Data System (ADS)
Raja, Nussaïbah B.; Aydin, Olgu
2018-04-01
This study researched the precipitation variability across 53 meteorological stations in Mauritius and different subregions of the island, over a 30-year study period (1981-2010). Time series was investigated for each 5-year interval and also for the whole study period. Non-parametric Mann-Kendall and Spearman's rho statistical tests were used to detect trends in annual precipitation. A mix of positive (increasing) and negative (decreasing) trends was highlighted for the 5-year interval analysis. The statistical tests nevertheless agreed on the overall trend for Mauritius and the subregions. Most regions showed a decrease in precipitation during the period 1996-2000. This is attributed to the 1998-2000 drought period which was brought about by a moderate La Niña event. In general, an increase in precipitation levels was observed across the country during the study period. This increase is the result of an increase in extreme precipitation events in the region. On the other hand, two subregions, both located in the highlands, experienced a decline in precipitation levels. Since most of the reservoirs in Mauritius are located in these two subregions, this implies serious consequences for water availability in the country if existing storage capacities are kept.
A Heat Vulnerability Index and Adaptation Solutions for Pittsburgh, Pennsylvania
NASA Astrophysics Data System (ADS)
Klima, K.; Abrahams, L.; Bradford, K.; Hegglin, M.
2015-12-01
With increasing evidence of global warming, many cities have focused attention on response plans to address their populations' vulnerabilities. Despite expected increased frequency and intensity of heat waves, the health impacts of such events in urban areas can be minimized with careful policy and economic investments. We focus on Pittsburgh, Pennsylvania and ask two questions. First, what are the top factors contributing to heat vulnerability and how do these characteristics manifest geospatially throughout Pittsburgh? Second, assuming the City wishes to deploy additional cooling centers, what placement will optimally address the vulnerability of the at risk populations? We use national census data, ArcGIS geospatial modeling, and statistical analysis to determine a range of heat vulnerability indices and optimal cooling center placement. We find that while different studies use different data and statistical calculations, all methods tested locate additional cooling centers at the confluence of the three rivers (Downtown), the northeast side of Pittsburgh (Shadyside/ Highland Park), and the southeast side of Pittsburgh (Squirrel Hill). This suggests that for Pittsburgh, a researcher could apply the same factor analysis procedure to compare datasets for different locations and times; factor analyses for heat vulnerability are more robust than previously thought.
NMR-based metabolomic analysis of spatial variation in soft corals.
He, Qing; Sun, Ruiqi; Liu, Huijuan; Geng, Zhufeng; Chen, Dawei; Li, Yinping; Han, Jiao; Lin, Wenhan; Du, Shushan; Deng, Zhiwei
2014-03-28
Soft corals are common marine organisms that inhabit tropical and subtropical oceans. They are shown to be rich source of secondary metabolites with biological activities. In this work, soft corals from two geographical locations were investigated using ¹H-NMR spectroscopy coupled with multivariate statistical analysis at the metabolic level. A partial least-squares discriminant analysis showed clear separation among extracts of soft corals grown in Sanya Bay and Weizhou Island. The specific markers that contributed to discrimination between soft corals in two origins belonged to terpenes, sterols and N-containing compounds. The satisfied precision of classification obtained indicates this approach using combined ¹H-NMR and chemometrics is effective to discriminate soft corals collected in different geographical locations. The results revealed that metabolites of soft corals evidently depended on living environmental condition, which would provide valuable information for further relevant coastal marine environment evaluation.
NASA Astrophysics Data System (ADS)
Sanchez, J. L.; Osipowicz, T.; Tang, S. M.; Tay, T. S.; Win, T. T.
1997-07-01
The trace element concentrations found in geological samples can shed light on the formation process. In the case of gemstones, which might be of artificial or natural origin, there is also considerable interest in the development of methods that provide identification of the origin of a sample. For rubies, trace element concentrations present in natural samples were shown previously to be significant indicators of the region of origin [S.M. Tang et al., Appl. Spectr. 42 (1988) 44, and 43 (1989) 219]. Here we report the results of micro-PIXE analyses of trace element (Ti, V, Cr, Fe, Cu and Ga) concentrations of a large set ( n = 130) of natural rough rubies from nine locations in Myanmar (Burma). The resulting concentrations are subjected to statistical analysis. Six of the nine groups form clusters when the data base is evaluated using tree clustering and principal component analysis.
Demonstration of Wavelet Techniques in the Spectral Analysis of Bypass Transition Data
NASA Technical Reports Server (NTRS)
Lewalle, Jacques; Ashpis, David E.; Sohn, Ki-Hyeon
1997-01-01
A number of wavelet-based techniques for the analysis of experimental data are developed and illustrated. A multiscale analysis based on the Mexican hat wavelet is demonstrated as a tool for acquiring physical and quantitative information not obtainable by standard signal analysis methods. Experimental data for the analysis came from simultaneous hot-wire velocity traces in a bypass transition of the boundary layer on a heated flat plate. A pair of traces (two components of velocity) at one location was excerpted. A number of ensemble and conditional statistics related to dominant time scales for energy and momentum transport were calculated. The analysis revealed a lack of energy-dominant time scales inside turbulent spots but identified transport-dominant scales inside spots that account for the largest part of the Reynolds stress. Momentum transport was much more intermittent than were energetic fluctuations. This work is the first step in a continuing study of the spatial evolution of these scale-related statistics, the goal being to apply the multiscale analysis results to improve the modeling of transitional and turbulent industrial flows.
Soltani, Shahla; Asghari Moghaddam, Asghar; Barzegar, Rahim; Kazemian, Naeimeh; Tziritis, Evangelos
2017-08-18
Kordkandi-Duzduzan plain is one of the fertile plains of East Azarbaijan Province, NW of Iran. Groundwater is an important resource for drinking and agricultural purposes due to the lack of surface water resources in the region. The main objectives of the present study are to identify the hydrogeochemical processes and the potential sources of major, minor, and trace metals and metalloids such as Cr, Mn, Cd, Fe, Al, and As by using joint hydrogeochemical techniques and multivariate statistical analysis and to evaluate groundwater quality deterioration with the use of PoS environmental index. To achieve these objectives, 23 groundwater samples were collected in September 2015. Piper diagram shows that the mixed Ca-Mg-Cl is the dominant groundwater type, and some of the samples have Ca-HCO 3 , Ca-Cl, and Na-Cl types. Multivariate statistical analyses indicate that weathering and dissolution of different rocks and minerals, e.g., silicates, gypsum, and halite, ion exchange, and agricultural activities influence the hydrogeochemistry of the study area. The cluster analysis divides the samples into two distinct clusters which are completely different in EC (and its dependent variables such as Na + , K + , Ca 2+ , Mg 2+ , SO 4 2- , and Cl - ), Cd, and Cr variables according to the ANOVA statistical test. Based on the median values, the concentrations of pH, NO 3 - , SiO 2 , and As in cluster 1 are elevated compared with those of cluster 2, while their maximum values occur in cluster 2. According to the PoS index, the dominant parameter that controls quality deterioration is As, with 60% of contribution. Samples of lowest PoS values are located in the southern and northern parts (recharge area) while samples of the highest values are located in the discharge area and the eastern part.
Occurrence of Phlebitis: A Systematic Review and Meta-analysis.
Chang, Wen P; Peng, Yu X
Peripheral venous catheters (PVCs) are commonly used in clinical practice. However, varying degrees of phlebitis often occur in patients receiving intravenous injections. The relevant literature suggests that phlebitis occurrence is highly associated with the catheter gauge, insertion site, and catheterization duration. Nevertheless, no meta-analysis has been performed on the influence of these three factors on the occurrence of phlebitis. The objective of this study was to determine whether any significant differences exist in the occurrence of phlebitis between catheters of 20 gauge or smaller and those larger than 20 gauge, between catheters inserted in the antecubital fossa and those inserted in other locations on the upper limbs, or between catheters inserted for more than 96 hours and those inserted for 96 hours or less. Using a systematic approach, we searched for literature published between 2006 and 2017 in the Cumulative Index to Nursing and Allied Health Literature (CINAHL), PubMed, ProQuest, and Cochrane Library databases. We used Comprehensive Meta-analysis Version 2 to perform our meta-analysis. After the screening and review processes, we identified 17 studies that met our selection conditions. Among these studies, 14 contained complete data for meta-analysis. These studies involved 4,343 patients and 5,846 PVCs. Regarding the overall effect size in the meta-analysis, the results of the forest plot comparing catheters of 20 gauge or smaller and those larger than 20 gauge presented a risk ratio (RR) of 0.88 (95% confidence interval [0.67, 1.17], p = .380), indicating no statistically significant difference in the occurrence of phlebitis between catheters of the aforementioned gauges. The results of the forest plot comparing catheters inserted in the antecubital fossa and those inserted in other locations on the upper limbs presented an RR of 1.05 (95% confidence interval [0.82, 1.34], p = .696), indicating no statistically significant difference in the occurrence of phlebitis between catheters inserted in the aforementioned locations. The results of the forest plot comparing catheters inserted for more than 96 hours and those inserted for 96 hours or less presented an RR of 1.13 (95% confidence interval [0.49, 2.61], p = .779), indicating no statistically significant difference in the occurrence of phlebitis between catheters inserted for the aforementioned durations. The empirical results of this meta-analysis can serve as a reference for hospital management for selecting the PVC gauge, insertion site, and catheterization duration. In addition to the three factors that we analyzed, whether any other factors influence the occurrence of phlebitis in patients with catheter implantation is worth investigating in future research.
Statistical functions and relevant correlation coefficients of clearness index
NASA Astrophysics Data System (ADS)
Pavanello, Diego; Zaaiman, Willem; Colli, Alessandra; Heiser, John; Smith, Scott
2015-08-01
This article presents a statistical analysis of the sky conditions, during years from 2010 to 2012, for three different locations: the Joint Research Centre site in Ispra (Italy, European Solar Test Installation - ESTI laboratories), the site of National Renewable Energy Laboratory in Golden (Colorado, USA) and the site of Brookhaven National Laboratories in Upton (New York, USA). The key parameter is the clearness index kT, a dimensionless expression of the global irradiance impinging upon a horizontal surface at a given instant of time. In the first part, the sky conditions are characterized using daily averages, giving a general overview of the three sites. In the second part the analysis is performed using data sets with a short-term resolution of 1 sample per minute, demonstrating remarkable properties of the statistical distributions of the clearness index, reinforced by a proof using fuzzy logic methods. Successively some time-dependent correlations between different meteorological variables are presented in terms of Pearson and Spearman correlation coefficients, and introducing a new one.
Using independent component analysis for electrical impedance tomography
NASA Astrophysics Data System (ADS)
Yan, Peimin; Mo, Yulong
2004-05-01
Independent component analysis (ICA) is a way to resolve signals into independent components based on the statistical characteristics of the signals. It is a method for factoring probability densities of measured signals into a set of densities that are as statistically independent as possible under the assumptions of a linear model. Electrical impedance tomography (EIT) is used to detect variations of the electric conductivity of the human body. Because there are variations of the conductivity distributions inside the body, EIT presents multi-channel data. In order to get all information contained in different location of tissue it is necessary to image the individual conductivity distribution. In this paper we consider to apply ICA to EIT on the signal subspace (individual conductivity distribution). Using ICA the signal subspace will then be decomposed into statistically independent components. The individual conductivity distribution can be reconstructed by the sensitivity theorem in this paper. Compute simulations show that the full information contained in the multi-conductivity distribution will be obtained by this method.
Azad Aminjan, Maboud; Moaddab, Seyyed Reza; Hosseini Ravandi, Mohammad; Kazemi Haki, Behzad
2015-10-01
Nowadays in the world, tuberculosis is the second largest killer of adults after HIV. Due to the location of presidios that is mostly located in hazardous zones soldiers and army personnel are considered high risk, therefore we decided to determine the prevalence of tuberculosis status in this group of people. This was a cross-sectional descriptive research that studied the prevalence of pulmonary tuberculosis in soldiers and military personnel in the last 15 years in tuberculosis and lung disease research center at Tabriz University of Medical Sciences. The statistical population consisted of all the soldiers and military personnel. The detection method in this study was based on microscopic examination following Ziehl-Neelsen Stain and in Leuven Stein Johnson culturing. Descriptive statistics was used for statistical analysis and statistical values less than 0.05 were considered significant. By review information in this center since the 1988-2013 with 72 military personnel suffering from tuberculosis, it was revealed that among them 30 women, 42 men, 14 soldiers, 29 family members, and 29 military personnel are pointed. A significant correlation was found between TB rates among military personnel and their families. Although in recent years, the national statistics indicate a decline of tuberculosis, but the results of our study showed that TB is still a serious disease that must comply with the first symptoms of tuberculosis in military personnel and their families that should be diagnosed as soon as possible.
An analysis of student performance benchmarks in dental hygiene via distance education.
Olmsted, Jodi L
2010-01-01
Three graduate programs, 35 undergraduate programs and 12 dental hygiene degree completion programs in the United States use varying forms of Distance Learning (DL). Relying heavily on DL leaves an unanswered question: Is learner performance on standard benchmark assessments impacted when using technology as a delivery system? A 10 year, longitudinal examination looked for student performance differences in a Distance Education (DE) dental hygiene program. The purpose of this research was to determine if there was a difference in performance between learners taught in a traditional classroom as compared to their counterparts taking classes through an alternative delivery system. A longitudinal, ex post facto design was used. Two hundred and sixty-six subject records were examined. Seventy-seven individuals (29%) were lost through attrition over 10 years. One hundred and eighty-nine records were used as the study sample, 117 individuals were located face-to-face and 72 were at a distance. Independent variables included time and location, while the dependent variables included course grades, grade point average (GPA) and the National Board of Dental Hygiene Examination (NBDHE). Three research questions were asked: Were there statistically significant differences in learner performance on the National Board of Dental Hygiene Examination (NBDHE)? Were there statistically significant differences in learner performance when considering GPAs? Did statistically significant differences in performance exist relating to individual course grades? T-tests were used for data analysis in answering the research questions. From a cumulative perspective, no statistically significant differences were apparent for the NBDHE and GPAs or for individual courses. Interactive Television (ITV), the synchronous DL system examined, was considered effective for delivering education to learners if similar performance outcomes were the evaluation criteria.
Mi, Jia; Li, Jie; Zhang, Qinglu; Wang, Xing; Liu, Hongyu; Cao, Yanlu; Liu, Xiaoyan; Sun, Xiao; Shang, Mengmeng; Liu, Qing
2016-01-01
Abstract The purpose of the study was to establish a mathematical model for correlating the combination of ultrasonography and noncontrast helical computerized tomography (NCHCT) with the total energy of Holmium laser lithotripsy. In this study, from March 2013 to February 2014, 180 patients with single urinary calculus were examined using ultrasonography and NCHCT before Holmium laser lithotripsy. The calculus location and size, acoustic shadowing (AS) level, twinkling artifact intensity (TAI), and CT value were all documented. The total energy of lithotripsy (TEL) and the calculus composition were also recorded postoperatively. Data were analyzed using Spearman's rank correlation coefficient, with the SPSS 17.0 software package. Multiple linear regression was also used for further statistical analysis. A significant difference in the TEL was observed between renal calculi and ureteral calculi (r = –0.565, P < 0.001), and there was a strong correlation between the calculus size and the TEL (r = 0.675, P < 0.001). The difference in the TEL between the calculi with and without AS was highly significant (r = 0.325, P < 0.001). The CT value of the calculi was significantly correlated with the TEL (r = 0.386, P < 0.001). A correlation between the TAI and TEL was also observed (r = 0.391, P < 0.001). Multiple linear regression analysis revealed that the location, size, and TAI of the calculi were related to the TEL, and the location and size were statistically significant predictors (adjusted r2 = 0.498, P < 0.001). A mathematical model correlating the combination of ultrasonography and NCHCT with TEL was established; this model may provide a foundation to guide the use of energy in Holmium laser lithotripsy. The TEL can be estimated by the location, size, and TAI of the calculus. PMID:27930563
Statistical Analysis And Treatment Of Accident Black Spots: A Case Study Of Nandyal Mandal
NASA Astrophysics Data System (ADS)
Sudharshan Reddy, B.; Vishnu Vardhan Reddy, L.; Sreenivasa Reddy, G., Dr
2017-08-01
Background: Increased, economic activity raised the consumption levels of the people across the country. This created scope for increase in travel and transportation. The increase in the vehicles since last 10 years has put lot of pressure on the existing roads and ultimately resulting in road accidents. Nandyal Mandal is located in the Kurnool district of Andhra Pradesh and well developed in both agricultural and industrial sectors after Kurnool. 567 accidents occurred in the last seven years at 143 locations shows the severity of the accidents in the Nandyal Mandal. There is a need to carry out some work in the Nandyal Mandal to improve the accidents black spots for reducing the accidents. Methods: Last seven years (2010-2016) of accident data collected from Police Stations. Weighted Severity Index (WSI), a scientific method is used for identifying the accident black spots. Statistical analysis has carried out for the collected data using Chi-Square Test to determine the independence of accidents with other attributes. Chi-Square Goodness of fit test conducted for test whether the accidents are occurring by chance or following any pattern. Results: WSI values are determined for the 143 locations. The Locations with high WSI are treated as accident black spots. Five black spots are taken for field study. After field observations and interaction with the public, some improvements are suggested for improving the accident black spots. There is no relationship between the severity of accidents and the other attributes like month, season, day, hours in day and the age group except type of vehicle. Road accidents are distributed throughout the Year, Month and Season. Road accidents are not distributed throughout the day.
Ghafoorian, Mohsen; Karssemeijer, Nico; Heskes, Tom; van Uden, Inge W M; Sanchez, Clara I; Litjens, Geert; de Leeuw, Frank-Erik; van Ginneken, Bram; Marchiori, Elena; Platel, Bram
2017-07-11
The anatomical location of imaging features is of crucial importance for accurate diagnosis in many medical tasks. Convolutional neural networks (CNN) have had huge successes in computer vision, but they lack the natural ability to incorporate the anatomical location in their decision making process, hindering success in some medical image analysis tasks. In this paper, to integrate the anatomical location information into the network, we propose several deep CNN architectures that consider multi-scale patches or take explicit location features while training. We apply and compare the proposed architectures for segmentation of white matter hyperintensities in brain MR images on a large dataset. As a result, we observe that the CNNs that incorporate location information substantially outperform a conventional segmentation method with handcrafted features as well as CNNs that do not integrate location information. On a test set of 50 scans, the best configuration of our networks obtained a Dice score of 0.792, compared to 0.805 for an independent human observer. Performance levels of the machine and the independent human observer were not statistically significantly different (p-value = 0.06).
Statistical Characterization of School Bus Drive Cycles Collected via Onboard Logging Systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Duran, A.; Walkowicz, K.
In an effort to characterize the dynamics typical of school bus operation, National Renewable Energy Laboratory (NREL) researchers set out to gather in-use duty cycle data from school bus fleets operating across the country. Employing a combination of Isaac Instruments GPS/CAN data loggers in conjunction with existing onboard telemetric systems resulted in the capture of operating information for more than 200 individual vehicles in three geographically unique domestic locations. In total, over 1,500 individual operational route shifts from Washington, New York, and Colorado were collected. Upon completing the collection of in-use field data using either NREL-installed data acquisition devices ormore » existing onboard telemetry systems, large-scale duty-cycle statistical analyses were performed to examine underlying vehicle dynamics trends within the data and to explore vehicle operation variations between fleet locations. Based on the results of these analyses, high, low, and average vehicle dynamics requirements were determined, resulting in the selection of representative standard chassis dynamometer test cycles for each condition. In this paper, the methodology and accompanying results of the large-scale duty-cycle statistical analysis are presented, including graphical and tabular representations of a number of relationships between key duty-cycle metrics observed within the larger data set. In addition to presenting the results of this analysis, conclusions are drawn and presented regarding potential applications of advanced vehicle technology as it relates specifically to school buses.« less
A statistical study of sporadic sodium layer observed by Sodium lidar at Hefei (31.8° N, 117.3° E)
NASA Astrophysics Data System (ADS)
Dou, X.-K.; Xue, X.-H.; Chen, T.-D.; Wan, W.-X.; Cheng, X.-W.; Li, T.; Chen, C.; Qiu, S.; Chen, Z.-Y.
2009-06-01
Sodium lidar observations of sporadic sodium layers (SSLs) during the past 3 years at a mid-latitude location (Hefei, China, 31.8° N, 117.3° E) are reported in this paper. From 64 SSL events detected in about 900 h of observation, an SSL occurrence rate of 1 event every 14 h at our location was obtained. This result, combined with previous studies, reveals that the SSL occurrence can be relatively frequent at some mid-latitude locations. Statistical analysis of main parameters for the 64 SSL events was performed. By examining the corresponding data from an ionosonde, a considerable correlation was found with a Pearson coefficient of 0.66 between seasonal variations of SSL and those of sporadic E (Es) during nighttime, which was in line with the research by Nagasawa and Abo (1995). From comparison between observations from the University of Science and Technology of China (USTC) lidar and from Wuhan Institute of Physics and Mathematics (WIPM) lidar (Wuhan, China, 31° N, 114° E), the minimum horizontal range for some events was estimated to be over 500 km.
A scoping review of spatial cluster analysis techniques for point-event data.
Fritz, Charles E; Schuurman, Nadine; Robertson, Colin; Lear, Scott
2013-05-01
Spatial cluster analysis is a uniquely interdisciplinary endeavour, and so it is important to communicate and disseminate ideas, innovations, best practices and challenges across practitioners, applied epidemiology researchers and spatial statisticians. In this research we conducted a scoping review to systematically search peer-reviewed journal databases for research that has employed spatial cluster analysis methods on individual-level, address location, or x and y coordinate derived data. To illustrate the thematic issues raised by our results, methods were tested using a dataset where known clusters existed. Point pattern methods, spatial clustering and cluster detection tests, and a locally weighted spatial regression model were most commonly used for individual-level, address location data (n = 29). The spatial scan statistic was the most popular method for address location data (n = 19). Six themes were identified relating to the application of spatial cluster analysis methods and subsequent analyses, which we recommend researchers to consider; exploratory analysis, visualization, spatial resolution, aetiology, scale and spatial weights. It is our intention that researchers seeking direction for using spatial cluster analysis methods, consider the caveats and strengths of each approach, but also explore the numerous other methods available for this type of analysis. Applied spatial epidemiology researchers and practitioners should give special consideration to applying multiple tests to a dataset. Future research should focus on developing frameworks for selecting appropriate methods and the corresponding spatial weighting schemes.
Site Suitability Analysis for Beekeeping via Analythical Hyrearchy Process, Konya Example
NASA Astrophysics Data System (ADS)
Sarı, F.; Ceylan, D. A.
2017-11-01
Over the past decade, the importance of the beekeeping activities has been emphasized in the field of biodiversity, ecosystems, agriculture and human health. Thus, efficient management and deciding correct beekeeping activities seems essential to maintain and improve productivity and efficiency. Due to this importance, considering the economic contributions to the rural area, the need for suitability analysis concept has been revealed. At this point, Multi Criteria Decision Analysis (MCDA) and Geographical Information Systems (GIS) integration provides efficient solutions to the complex structure of decision- making process for beekeeping activities. In this study, site suitability analysis via Analytical Hierarchy Process (AHP) was carried out for Konya city in Turkey. Slope, elevation, aspect, distance to water resources, roads and settlements, precipitation and flora criteria are included to determine suitability. The requirements, expectations and limitations of beekeeping activities are specified with the participation of experts and stakeholders. The final suitability map were validated with existing 117 beekeeping locations and Turkish Statistical Institute 2016 beekeeping statistics for Konya province.
Catalog of earthquake hypocenters at Alaskan volcanoes: January 1 through December 31, 2002
Dixon, James P.; Stihler, Scott D.; Power, John A.; Tytgat, Guy; Moran, Seth C.; Sánchez, John; Estes, Steve; McNutt, Stephen R.; Paskievitch, John
2003-01-01
The Alaska Volcano Observatory (AVO), a cooperative program of the U.S. Geological Survey, the Geophysical Institute of the University of Alaska Fairbanks, and the Alaska Division of Geological and Geophysical Surveys, has maintained seismic monitoring networks at historically active volcanoes in Alaska since 1988 (Power and others, 1993; Jolly and others, 1996; Jolly and others, 2001; Dixon and others, 2002). The primary objectives of this program are the seismic monitoring of active, potentially hazardous, Alaskan volcanoes and the investigation of seismic processes associated with active volcanism. This catalog presents the basic seismic data and changes in the seismic monitoring program for the period January 1, 2002 through December 31, 2002. Appendix G contains a list of publications pertaining to seismicity of Alaskan volcanoes based on these and previously recorded data. The AVO seismic network was used to monitor twenty-four volcanoes in real time in 2002. These include Mount Wrangell, Mount Spurr, Redoubt Volcano, Iliamna Volcano, Augustine Volcano, Katmai Volcanic Group (Snowy Mountain, Mount Griggs, Mount Katmai, Novarupta, Trident Volcano, Mount Mageik, Mount Martin), Aniakchak Crater, Mount Veniaminof, Pavlof Volcano, Mount Dutton, Isanotski Peaks, Shishaldin Volcano, Fisher Caldera, Westdahl Peak, Akutan Peak, Makushin Volcano, Great Sitkin Volcano, and Kanaga Volcano (Figure 1). Monitoring highlights in 2002 include an earthquake swarm at Great Sitkin Volcano in May-June; an earthquake swarm near Snowy Mountain in July-September; low frequency (1-3 Hz) tremor and long-period events at Mount Veniaminof in September-October and in December; and continuing volcanogenic seismic swarms at Shishaldin Volcano throughout the year. Instrumentation and data acquisition highlights in 2002 were the installation of a subnetwork on Okmok Volcano, the establishment of telemetry for the Mount Veniaminof subnetwork, and the change in the data acquisition system to an EARTHWORM detection system. AVO located 7430 earthquakes during 2002 in the vicinity of the monitored volcanoes. This catalog includes: (1) a description of instruments deployed in the field and their locations; (2) a description of earthquake detection, recording, analysis, and data archival systems; (3) a description of velocity models used for earthquake locations; (4) a summary of earthquakes located in 2002; and (5) an accompanying UNIX tar-file with a summary of earthquake origin times, hypocenters, magnitudes, and location quality statistics; daily station usage statistics; and all HYPOELLIPSE files used to determine the earthquake locations in 2002.The AVO seismic network was used to monitor twenty-four volcanoes in real time in 2002. These include Mount Wrangell, Mount Spurr, Redoubt Volcano, Iliamna Volcano, Augustine Volcano, Katmai Volcanic Group (Snowy Mountain, Mount Griggs, Mount Katmai, Novarupta, Trident Volcano, Mount Mageik, Mount Martin), Aniakchak Crater, Mount Veniaminof, Pavlof Volcano, Mount Dutton, Isanotski Peaks, Shishaldin Volcano, Fisher Caldera, Westdahl Peak, Akutan Peak, Makushin Volcano, Great Sitkin Volcano, and Kanaga Volcano (Figure 1). Monitoring highlights in 2002 include an earthquake swarm at Great Sitkin Volcano in May-June; an earthquake swarm near Snowy Mountain in July-September; low frequency (1-3 Hz) tremor and long-period events at Mount Veniaminof in September-October and in December; and continuing volcanogenic seismic swarms at Shishaldin Volcano throughout the year. Instrumentation and data acquisition highlights in 2002 were the installation of a subnetwork on Okmok Volcano, the establishment of telemetry for the Mount Veniaminof subnetwork, and the change in the data acquisition system to an EARTHWORM detection system. AVO located 7430 earthquakes during 2002 in the vicinity of the monitored volcanoes.This catalog includes: (1) a description of instruments deployed in the field and their locations; (2) a description of earthquake detection, recording, analysis, and data archival systems; (3) a description of velocity models used for earthquake locations; (4) a summary of earthquakes located in 2002; and (5) an accompanying UNIX tar-file with a summary of earthquake origin times, hypocenters, magnitudes, and location quality statistics; daily station usage statistics; and all HYPOELLIPSE files used to determine the earthquake locations in 2002.
da Costa Lobato, Tarcísio; Hauser-Davis, Rachel Ann; de Oliveira, Terezinha Ferreira; Maciel, Marinalva Cardoso; Tavares, Maria Regina Madruga; da Silveira, Antônio Morais; Saraiva, Augusto Cesar Fonseca
2015-02-15
The Amazon area has been increasingly suffering from anthropogenic impacts, especially due to the construction of hydroelectric power plant reservoirs. The analysis and categorization of the trophic status of these reservoirs are of interest to indicate man-made changes in the environment. In this context, the present study aimed to categorize the trophic status of a hydroelectric power plant reservoir located in the Brazilian Amazon by constructing a novel Water Quality Index (WQI) and Trophic State Index (TSI) for the reservoir using major ion concentrations and physico-chemical water parameters determined in the area and taking into account the sampling locations and the local hydrological regimes. After applying statistical analyses (factor analysis and cluster analysis) and establishing a rule base of a fuzzy system to these indicators, the results obtained by the proposed method were then compared to the generally applied Carlson and a modified Lamparelli trophic state index (TSI), specific for trophic regions. The categorization of the trophic status by the proposed fuzzy method was shown to be more reliable, since it takes into account the specificities of the study area, while the Carlson and Lamparelli TSI do not, and, thus, tend to over or underestimate the trophic status of these ecosystems. The statistical techniques proposed and applied in the present study, are, therefore, relevant in cases of environmental management and policy decision-making processes, aiding in the identification of the ecological status of water bodies. With this, it is possible to identify which factors should be further investigated and/or adjusted in order to attempt the recovery of degraded water bodies. Copyright © 2014 Elsevier B.V. All rights reserved.
Borowska, Alicja; Szwaczkowski, Tomasz; Kamiński, Stanisław; Hering, Dorota M; Kordan, Władysław; Lecewicz, Marek
2018-05-01
Use of information theory can be an alternative statistical approach to detect genome regions and candidate genes that are associated with livestock traits. The aim of this study was to verify the validity of the SNPs effects on some semen quality variables of bulls using entropy analysis. Records from 288 Holstein-Friesian bulls from one AI station were included. The following semen quality variables were analyzed: CASA kinematic variables of sperm (total motility, average path velocity, straight line velocity, curvilinear velocity, amplitude of lateral head displacement, beat cross frequency, straightness, linearity), sperm membrane integrity (plazmolema, mitochondrial function), sperm ATP content. Molecular data included 48,192 SNPs. After filtering (call rate = 0.95 and MAF = 0.05), 34,794 SNPs were included in the entropy analysis. The entropy and conditional entropy were estimated for each SNP. Conditional entropy quantifies the remaining uncertainty about values of the variable with the knowledge of SNP. The most informative SNPs for each variable were determined. The computations were performed using the R statistical package. A majority of the loci had relatively small contributions. The most informative SNPs for all variables were mainly located on chromosomes: 3, 4, 5 and 16. The results from the study indicate that important genome regions and candidate genes that determine semen quality variables in bulls are located on a number of chromosomes. Some detected clusters of SNPs were located in RNA (U6 and 5S_rRNA) for all the variables for which analysis occurred. Associations between PARK2 as well GALNT13 genes and some semen characteristics were also detected. Copyright © 2018 Elsevier B.V. All rights reserved.
Statistical Analysis of Tsunami Variability
NASA Astrophysics Data System (ADS)
Zolezzi, Francesca; Del Giudice, Tania; Traverso, Chiara; Valfrè, Giulio; Poggi, Pamela; Parker, Eric J.
2010-05-01
The purpose of this paper was to investigate statistical variability of seismically generated tsunami impact. The specific goal of the work was to evaluate the variability in tsunami wave run-up due to uncertainty in fault rupture parameters (source effects) and to the effects of local bathymetry at an individual location (site effects). This knowledge is critical to development of methodologies for probabilistic tsunami hazard assessment. Two types of variability were considered: • Inter-event; • Intra-event. Generally, inter-event variability refers to the differences of tsunami run-up at a given location for a number of different earthquake events. The focus of the current study was to evaluate the variability of tsunami run-up at a given point for a given magnitude earthquake. In this case, the variability is expected to arise from lack of knowledge regarding the specific details of the fault rupture "source" parameters. As sufficient field observations are not available to resolve this question, numerical modelling was used to generate run-up data. A scenario magnitude 8 earthquake in the Hellenic Arc was modelled. This is similar to the event thought to have caused the infamous 1303 tsunami. The tsunami wave run-up was computed at 4020 locations along the Egyptian coast between longitudes 28.7° E and 33.8° E. Specific source parameters (e.g. fault rupture length and displacement) were varied, and the effects on wave height were determined. A Monte Carlo approach considering the statistical distribution of the underlying parameters was used to evaluate the variability in wave height at locations along the coast. The results were evaluated in terms of the coefficient of variation of the simulated wave run-up (standard deviation divided by mean value) for each location. The coefficient of variation along the coast was between 0.14 and 3.11, with an average value of 0.67. The variation was higher in areas of irregular coast. This level of variability is similar to that seen in ground motion attenuation correlations used for seismic hazard assessment. The second issue was intra-event variability. This refers to the differences in tsunami wave run-up along a section of coast during a single event. Intra-event variability investigated directly considering field observations. The tsunami events used in the statistical evaluation were selected on the basis of the completeness and reliability of the available data. Tsunami considered for the analysis included the recent and well surveyed tsunami of Boxing Day 2004 (Great Indian Ocean Tsunami), Java 2006, Okushiri 1993, Kocaeli 1999, Messina 1908 and a case study of several historic events in Hawaii. Basic statistical analysis was performed on the field observations from these tsunamis. For events with very wide survey regions, the run-up heights have been grouped in order to maintain a homogeneous distance from the source. Where more than one survey was available for a given event, the original datasets were maintained separately to avoid combination of non-homogeneous data. The observed run-up measurements were used to evaluate the minimum, maximum, average, standard deviation and coefficient of variation for each data set. The minimum coefficient of variation was 0.12 measured for the 2004 Boxing Day tsunami at Nias Island (7 data) while the maximum is 0.98 for the Okushiri 1993 event (93 data). The average coefficient of variation is of the order of 0.45.
An operational wave forecasting system for the east coast of India
NASA Astrophysics Data System (ADS)
Sandhya, K. G.; Murty, P. L. N.; Deshmukh, Aditya N.; Balakrishnan Nair, T. M.; Shenoi, S. S. C.
2018-03-01
Demand for operational ocean state forecasting is increasing, owing to the ever-increasing marine activities in the context of blue economy. In the present study, an operational wave forecasting system for the east coast of India is proposed using unstructured Simulating WAves Nearshore model (UNSWAN). This modelling system uses very high resolution mesh near the Indian east coast and coarse resolution offshore, and thus avoids the necessity of nesting with a global wave model. The model is forced with European Centre for Medium-Range Weather Forecasts (ECMWF) winds and simulates wave parameters and wave spectra for the next 3 days. The spatial pictures of satellite data overlaid on simulated wave height show that the model is capable of simulating the significant wave heights and their gradients realistically. Spectral validation has been done using the available data to prove the reliability of the model. To further evaluate the model performance, the wave forecast for the entire year 2014 is evaluated against buoy measurements over the region at 4 waverider buoy locations. Seasonal analysis of significant wave height (Hs) at the four locations showed that the correlation between the modelled and observed was the highest (in the range 0.78-0.96) during the post-monsoon season. The variability of Hs was also the highest during this season at all locations. The error statistics showed clear seasonal and geographical location dependence. The root mean square error at Visakhapatnam was the same (0.25) for all seasons, but it was the smallest for pre-monsoon season (0.12 m and 0.17 m) for Puducherry and Gopalpur. The wind sea component showed higher variability compared to the corresponding swell component in all locations and for all seasons. The variability was picked by the model to a reasonable level in most of the cases. The results of statistical analysis show that the modelling system is suitable for use in the operational scenario.
Heavy metals in water of the San Pedro River in Chihuahua, Mexico and its potential health risk.
Gutiérrez, Roberto L; Rubio-Arias, Hector; Quintana, Ray; Ortega, Juan Angel; Gutierrez, Melida
2008-06-01
The objective of this study was to determine the seasonal and downstream water quality variations of the San Pedro River in Chihuahua, Mexico. Water samples were collected monthly from October 2005 to August 2006 in triplicate, totaling 165 water samples. The five sampling locations were: below the Francisco I. Madero dam (LP); between Rosales and Delicias (RD); Meoqui (M); El Torreon (ET), and Julimes (LJ). The levels of As, Be, Ca, Cd, Co, Cu, Cr, Fe, Li, Mg, Mn, Mo, Ni, Pb, Sb, Se, Sr, Ti, Ta, V and Zn were measured using an Inductively Coupled Plasma- Optical Emission Spectrometry (ICP-OES) Perkin Elmer 2100. In addition, temperature, pH, electrical conductivity and total and fecal coliformes were determined. The statistical analysis considered a factorial treatment design; where factor A was the location point and factor B was sampling date. In addition, a multivariate technique looking for principal components was performed. The results indicated that some samples exceeded Mexican standards for As, Be, Ca, Cd, Co, Cr, Fe, Mn, Ni, Pb, Sb, Se, Sr and Zn. The As level must be considered for a red flag to the communities along the Rio San Pedro because both the monthly average level (0.10 mg L-1) and location (0.10 mg L-1) exceeded the Mexican and International norms. The multivariate analysis showed a predominant aggregation at the LP location, meaning that there was a predominance of As, Sr, Fe and Li. At the rest of the locations the elements did not present a tendency for aggregation. Statistics applied to sampling month showed that December, January, March and April were aggregated in a negative quadrant of component 1 indicating a predominance of V, Ni, Be, Fe and As. Overall, the results confirmed that this stretch of the San Pedro River is contaminated with heavy metals and other contaminants that might affect human health as well as the health of the ecosystem.
Heavy metals in water of the San Pedro River in Chihuahua, Mexico and its potential health risk
Gutiérrez, Roberto L.; Rubio-Arias, Hector; Quintana, Ray; Ortega, Juan Angel; Gutierrez, Melida
2008-01-01
The objective of this study was to determine the seasonal and downstream water quality variations of the San Pedro River in Chihuahua, Mexico. Water samples were collected monthly from October 2005 to August 2006 in triplicate, totaling 165 water samples. The five sampling locations were: below the Francisco I. Madero dam (LP); between Rosales and Delicias (RD); Meoqui (M); El Torreon (ET), and Julimes (LJ). The levels of As, Be, Ca, Cd, Co, Cu, Cr, Fe, Li, Mg, Mn, Mo, Ni, Pb, Sb, Se, Sr, Ti, Ta, V and Zn were measured using an Inductively Coupled Plasma-Optical Emission Spectrometry (ICP-OES) Perkin Elmer 2100. In addition, temperature, pH, electrical conductivity and total and fecal coliformes were determined. The statistical analysis considered a factorial treatment design; where factor A was the location point and factor B was sampling date. In addition, a multivariate technique looking for principal components was performed. The results indicated that some samples exceeded Mexican standards for As, Be, Ca, Cd, Co, Cr, Fe, Mn, Ni, Pb, Sb, Se, Sr and Zn. The As level must be considered for a red flag to the communities along the Rio San Pedro because both the monthly average level (0.10 mg L−1) and location (0.10 mg L−1) exceeded the Mexican and International norms. The multivariate analysis showed a predominant aggregation at the LP location, meaning that there was a predominance of As, Sr, Fe and Li. At the rest of the locations the elements did not present a tendency for aggregation. Statistics applied to sampling month showed that December, January, March and April were aggregated in a negative quadrant of component 1 indicating a predominance of V, Ni, Be, Fe and As. Overall, the results confirmed that this stretch of the San Pedro River is contaminated with heavy metals and other contaminants that might affect human health as well as the health of the ecosystem. PMID:18678922
Sainz, A; Ruiz, F
2006-03-01
A spatial and temporal analysis (period 1990-2003) of 15 sampling points distributed along the southwestern Spanish coast permits to delimitate the influence area of the extremely polluted discharges coming from the Tinto-Odiel system in the bottom sediments of the adjacent littoral area. As, Cu, Pb and Zn are the main heavy metals transported by the freshwater runoffs toward the shallow shelf and present very high negative (r < -0.7) and significant (p < 0.001) correlations with the distance to the estuarine mouth. The statistical analysis (index of geoaccumulation, Pearson correlation matrix, cluster analysis) of their concentrations in the littoral sediments located between the Guadiana and Guadalquivir mouths delimitates three zones: (a) Zone 1 (from the estuarine mouth to 6 km to the east), characterized by moderate to strongly polluted bottom sediments and main responsible of the mean annual variations of the former heavy metals in the area studied; (b) Zone 2 (from 21.2 km to the west to 29 km to the east), characterized by moderate pollution levels; and (c) Zone 3, located near the Guadiana and Guadalquivir mouths, with very low As-Cu-Pb contents and unpolluted to moderately levels of Zn due to urban sewages or the presence of local low mobility areas for this element.
CT Image Sequence Analysis for Object Recognition - A Rule-Based 3-D Computer Vision System
Dongping Zhu; Richard W. Conners; Daniel L. Schmoldt; Philip A. Araman
1991-01-01
Research is now underway to create a vision system for hardwood log inspection using a knowledge-based approach. In this paper, we present a rule-based, 3-D vision system for locating and identifying wood defects using topological, geometric, and statistical attributes. A number of different features can be derived from the 3-D input scenes. These features and evidence...
ERIC Educational Resources Information Center
Ba, Harouna; Meade, Terri; Pierson, Elizabeth; Ferguson, Camille; Roy, Amanda; Williams, Hakim
2009-01-01
Forrest County Agricultural High School (FCAHS) is located in Brooklyn, a small rural town in southern Mississippi and part of the Hattiesburg Metropolitan Statistical Area. Unlike the other schools that participated in the Cisco 21S initiative, FCAHS is not part of a larger school district. Therefore, the unit of analysis throughout this summary…
O'Neel, Shad; Larsen, Christopher F.; Rupert, Natalia; Hansen, Roger
2010-01-01
Since the installation of the Alaska Regional Seismic Network in the 1970s, data analysts have noted nontectonic seismic events thought to be related to glacier dynamics. While loose associations with the glaciers of the St. Elias Mountains have been made, no detailed study of the source locations has been undertaken. We performed a two-step investigation surrounding these events, beginning with manual locations that guided an automated detection and event sifting routine. Results from the manual investigation highlight characteristics of the seismic waveforms including single-peaked (narrowband) spectra, emergent onsets, lack of distinct phase arrivals, and a predominant cluster of locations near the calving termini of several neighboring tidewater glaciers. Through these locations, comparison with previous work, analyses of waveform characteristics, frequency-magnitude statistics and temporal patterns in seismicity, we suggest calving as a source for the seismicity. Statistical properties and time series analysis of the event catalog suggest a scale-invariant process that has no single or simple forcing. These results support the idea that calving is often a response to short-lived or localized stress perturbations. Our results demonstrate the utility of passive seismic instrumentation to monitor relative changes in the rate and magnitude of iceberg calving at tidewater glaciers that may be volatile or susceptible to ensuing rapid retreat, especially when existing seismic infrastructure can be used.
GEOGRAPIC INFORMATION SYSTEMS IN DETERMINING ROAD TRAFFIC CRASH ANALYSIS IN IBADAN, NIGERIA.
Rukewe, A; Taiwo, O J; Fatiregun, A A; Afuwape, O O; Alonge, T O
2014-01-01
Road traffic accidents are frequent in this environment, hence the need to determine the place of geographic information systems in the documentation of road traffic accidents. To investigate and document the variations in crash frequencies by types and across different road types in Ibadan, Nigeria. Road traffic accident data between January and June 2011 were obtained from the University College Hospital Emergency Department's trauma registry. All the traffic accidents were categorized into motor vehicular, motorbike and pedestrian crashes. Georeferencing of accident locations mentioned by patients was done using a combination of Google Earth and ArcGIS software. Nearest neighbor statistic, Moran's-I, Getis-Ord statistics, Student T-test, and ANOVA were used in investigating the spatial dynamics in crashes. Out of 600 locations recorded, 492 (82.0%) locations were correctly georeferenced. Crashes were clustered in space with motorbike crashes showing greatest clustering. There was significant difference in crashes between dual and non-dual carriage roads (P = 0.0001), but none between the inner city and the periphery (p = 0.115). However, significant variations also exist among the three categories analyzed (p = 0.004) and across the eleven Local Government Areas (P = 0.017). This study showed that the use of Geographic Information System can help in understanding variations in road traffic accident occurrence, while at the same time identifying locations and neighborhoods with unusually higher accidents frequency.
Risk factors for hydrocephalus and neurological deficit in children born with an encephalocele.
Da Silva, Stephanie L; Jeelani, Yasser; Dang, Ha; Krieger, Mark D; McComb, J Gordon
2015-04-01
There is a known association of hydrocephalus with encephaloceles. Risk factors for hydrocephalus and neurological deficit were ascertained in a series of patients born with an encephalocele. A retrospective analysis was undertaken of patients treated for encephaloceles at Children's Hospital Los Angeles between 1994 and 2012. The following factors were evaluated for their prognostic value: age at presentation, sex, location of encephalocele, size, contents, microcephaly, presence of hydrocephalus, CSF leak, associated cranial anomalies, and neurological outcome. Seventy children were identified, including 38 girls and 32 boys. The median age at presentation was 2 months. The mean follow-up duration was 3.7 years. Encephalocele location was classified as anterior (n = 14) or posterior (n = 56) to the coronal suture. The average maximum encephalocele diameter was 4 cm (range 0.5-23 cm). Forty-seven encephaloceles contained neural tissue. Eight infants presented at birth with CSF leaking from the encephalocele, with 1 being infected. Six patients presented with hydrocephalus, while 11 developed progressive hydrocephalus postoperatively. On univariate analysis, the presence of neural tissue, cranial anomalies, encephalocele size of at least 2 cm, seizure disorder, and microcephaly were each positively associated with hydrocephalus. On multivariate logistic regression modeling, the single prognostic factor for hydrocephalus of borderline statistical significance was the presence of neural tissue (odds ratio [OR] = 5.8, 95% confidence interval [CI] = 0.8-74.0). Fourteen patients had severe developmental delay, 28 had mild/moderate delay, and 28 were neurologically normal. On univariate analysis, the presence of cranial anomalies, larger size of encephalocele, hydrocephalus, and microcephaly were positively associated with neurological deficit. In the multivariable model, the only statistically significant prognostic factor for neurological deficit was presence of hydrocephalus (OR 17.2, 95% CI 1.7-infinity). In multivariate models, the presence of neural tissue was borderline significantly associated with hydrocephalus and the presence of hydrocephalus was significantly associated with neurological deficit. The location of the encephalocele did not have a statistically significant association with incidence of hydrocephalus or neurological deficit. In contrast to modestly good/fair neurological outcome in children with an encephalocele without hydrocephalus, the presence of hydrocephalus resulted in a far worse neurological outcome.
NASA Astrophysics Data System (ADS)
Yu, Junliang; Froning, Dieter; Reimer, Uwe; Lehnert, Werner
2018-06-01
The lattice Boltzmann method is adopted to simulate the three dimensional dynamic process of liquid water breaking through the gas diffusion layer (GDL) in the polymer electrolyte membrane fuel cell. 22 micro-structures of Toray GDL are built based on a stochastic geometry model. It is found that more than one breakthrough locations are formed randomly on the GDL surface. Breakthrough location distance (BLD) are analyzed statistically in two ways. The distribution is evaluated statistically by the Lilliefors test. It is concluded that the BLD can be described by the normal distribution with certain statistic characteristics. Information of the shortest neighbor breakthrough location distance can be the input modeling setups on the cell-scale simulations in the field of fuel cell simulation.
Spatio-temporal analysis of annual rainfall in Crete, Greece
NASA Astrophysics Data System (ADS)
Varouchakis, Emmanouil A.; Corzo, Gerald A.; Karatzas, George P.; Kotsopoulou, Anastasia
2018-03-01
Analysis of rainfall data from the island of Crete, Greece was performed to identify key hydrological years and return periods as well as to analyze the inter-annual behavior of the rainfall variability during the period 1981-2014. The rainfall spatial distribution was also examined in detail to identify vulnerable areas of the island. Data analysis using statistical tools and spectral analysis were applied to investigate and interpret the temporal course of the available rainfall data set. In addition, spatial analysis techniques were applied and compared to determine the rainfall spatial distribution on the island of Crete. The analysis presented that in contrast to Regional Climate Model estimations, rainfall rates have not decreased, while return periods vary depending on seasonality and geographic location. A small but statistical significant increasing trend was detected in the inter-annual rainfall variations as well as a significant rainfall cycle almost every 8 years. In addition, statistically significant correlation of the island's rainfall variability with the North Atlantic Oscillation is identified for the examined period. On the other hand, regression kriging method combining surface elevation as secondary information improved the estimation of the annual rainfall spatial variability on the island of Crete by 70% compared to ordinary kriging. The rainfall spatial and temporal trends on the island of Crete have variable characteristics that depend on the geographical area and on the hydrological period.
NASA Astrophysics Data System (ADS)
Ye, M.; Pacheco Castro, R. B.; Pacheco Avila, J.; Cabrera Sansores, A.
2014-12-01
The karstic aquifer of Yucatan is a vulnerable and complex system. The first fifteen meters of this aquifer have been polluted, due to this the protection of this resource is important because is the only source of potable water of the entire State. Through the assessment of groundwater quality we can gain some knowledge about the main processes governing water chemistry as well as spatial patterns which are important to establish protection zones. In this work multivariate statistical techniques are used to assess the groundwater quality of the supply wells (30 to 40 meters deep) in the hidrogeologic region of the Ring of Cenotes, located in Yucatan, Mexico. Cluster analysis and principal component analysis are applied in groundwater chemistry data of the study area. Results of principal component analysis show that the main sources of variation in the data are due sea water intrusion and the interaction of the water with the carbonate rocks of the system and some pollution processes. The cluster analysis shows that the data can be divided in four clusters. The spatial distribution of the clusters seems to be random, but is consistent with sea water intrusion and pollution with nitrates. The overall results show that multivariate statistical analysis can be successfully applied in the groundwater quality assessment of this karstic aquifer.
A demographic analysis of vertical root fractures.
Cohen, Stephen; Berman, Louis H; Blanco, Lucia; Bakland, Leif; Kim, Jay S
2006-12-01
Teeth with vertical root fractures (VRFs) have complete or incomplete fractures that extends through the enamel, dentin and pulp, down the long axis of the tooth. Several different variables were investigated and statistically evaluated as to their correlation with the presence of VRFs. Specifically analyzed were gender, tooth location, age, radiographic and clinical findings, bruxism, and pulpal status. The data were collected from three different endodontists, from three different geographic locations, comprising a total of 227 teeth. Although VRFs may occur in conjunction with any of the parameters investigated, only certain factors were found to occur in a significant number of cases. The results indicate that VRFs are statistically more prevalent in mandibular molars and maxillary premolars. They are associated with periradicular bone loss, pain to percussion, extensive restorations, and seem to occur more often in females and older patients. However, VRFs are not necessarily related to periapical bone loss, a widening of the periodontal ligament space, associated periodontal pockets, a sinus tract, particular pulpal status, or bruxism.
Loading and dilution: arsenic, sodium and nutrients in a section of the River Tisza, Hungary
NASA Astrophysics Data System (ADS)
Türk, Gábor; Prokisch, József; Simon, Edina; Szabó, Szilárd
2015-11-01
We aimed to reveal the risk of arsenic in a Hungarian river (the Tisza) at the mouth of a polluted canal. Four sampling sites were involved in this work and samples were collected on a weekly basis for arsenic and sodium, and on a monthly basis for nutrients. Significant differences were found concerning each studied component between the sampling locations of the River Tisza. Statistical analysis also revealed that the values of the upper and lower river tracts did not differ significantly. Thus, water carried by the canal is being diluted before it reaches the farthest sampling location.
Yigzaw, Kassaye Yitbarek; Michalas, Antonis; Bellika, Johan Gustav
2017-01-03
Techniques have been developed to compute statistics on distributed datasets without revealing private information except the statistical results. However, duplicate records in a distributed dataset may lead to incorrect statistical results. Therefore, to increase the accuracy of the statistical analysis of a distributed dataset, secure deduplication is an important preprocessing step. We designed a secure protocol for the deduplication of horizontally partitioned datasets with deterministic record linkage algorithms. We provided a formal security analysis of the protocol in the presence of semi-honest adversaries. The protocol was implemented and deployed across three microbiology laboratories located in Norway, and we ran experiments on the datasets in which the number of records for each laboratory varied. Experiments were also performed on simulated microbiology datasets and data custodians connected through a local area network. The security analysis demonstrated that the protocol protects the privacy of individuals and data custodians under a semi-honest adversarial model. More precisely, the protocol remains secure with the collusion of up to N - 2 corrupt data custodians. The total runtime for the protocol scales linearly with the addition of data custodians and records. One million simulated records distributed across 20 data custodians were deduplicated within 45 s. The experimental results showed that the protocol is more efficient and scalable than previous protocols for the same problem. The proposed deduplication protocol is efficient and scalable for practical uses while protecting the privacy of patients and data custodians.
Workflow Management for Complex HEP Analyses
NASA Astrophysics Data System (ADS)
Erdmann, M.; Fischer, R.; Rieger, M.; von Cube, R. F.
2017-10-01
We present the novel Analysis Workflow Management (AWM) that provides users with the tools and competences of professional large scale workflow systems, e.g. Apache’s Airavata[1]. The approach presents a paradigm shift from executing parts of the analysis to defining the analysis. Within AWM an analysis consists of steps. For example, a step defines to run a certain executable for multiple files of an input data collection. Each call to the executable for one of those input files can be submitted to the desired run location, which could be the local computer or a remote batch system. An integrated software manager enables automated user installation of dependencies in the working directory at the run location. Each execution of a step item creates one report for bookkeeping purposes containing error codes and output data or file references. Required files, e.g. created by previous steps, are retrieved automatically. Since data storage and run locations are exchangeable from the steps perspective, computing resources can be used opportunistically. A visualization of the workflow as a graph of the steps in the web browser provides a high-level view on the analysis. The workflow system is developed and tested alongside of a ttbb cross section measurement where, for instance, the event selection is represented by one step and a Bayesian statistical inference is performed by another. The clear interface and dependencies between steps enables a make-like execution of the whole analysis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Min-Joo; Park, So-Hyun; Research Institute of Biomedical Engineering, The Catholic University of Korea, Seoul
2013-10-01
The partial-breast irradiation (PBI) technique, an alternative to whole-breast irradiation, is a beam delivery method that uses a limited range of treatment volume. The present study was designed to determine the optimal PBI treatment modalities for 8 different tumor locations. Treatment planning was performed on computed tomography (CT) data sets of 6 patients who had received lumpectomy treatments. Tumor locations were classified into 8 subsections according to breast quadrant and depth. Three-dimensional conformal radiation therapy (3D-CRT), electron beam therapy (ET), and helical tomotherapy (H-TOMO) were utilized to evaluate the dosimetric effect for each tumor location. Conformation number (CN), radical dosemore » homogeneity index (rDHI), and dose delivered to healthy tissue were estimated. The Kruskal-Wallis, Mann-Whitney U, and Bonferroni tests were used for statistical analysis. The ET approach showed good sparing effects and acceptable target coverage for the lower inner quadrant—superficial (LIQ-S) and lower inner quadrant—deep (LIQ-D) locations. The H-TOMO method was the least effective technique as no evaluation index achieved superiority for all tumor locations except CN. The ET method is advisable for treating LIQ-S and LIQ-D tumors, as opposed to 3D-CRT or H-TOMO, because of acceptable target coverage and much lower dose applied to surrounding tissue.« less
Langenderfer, Joseph E; Rullkoetter, Paul J; Mell, Amy G; Laz, Peter J
2009-04-01
An accurate assessment of shoulder kinematics is useful for understanding healthy normal and pathological mechanics. Small variability in identifying and locating anatomical landmarks (ALs) has potential to affect reported shoulder kinematics. The objectives of this study were to quantify the effect of landmark location variability on scapular and humeral kinematic descriptions for multiple subjects using probabilistic analysis methods, and to evaluate the consistency in results across multiple subjects. Data from 11 healthy subjects performing humeral elevation in the scapular plane were used to calculate Euler angles describing humeral and scapular kinematics. Probabilistic analyses were performed for each subject to simulate uncertainty in the locations of 13 upper-extremity ALs. For standard deviations of 4 mm in landmark location, the analysis predicted Euler angle envelopes between the 1 and 99 percentile bounds of up to 16.6 degrees . While absolute kinematics varied with the subject, the average 1-99% kinematic ranges for the motion were consistent across subjects and sensitivity factors showed no statistically significant differences between subjects. The description of humeral kinematics was most sensitive to the location of landmarks on the thorax, while landmarks on the scapula had the greatest effect on the description of scapular elevation. The findings of this study can provide a better understanding of kinematic variability, which can aid in making accurate clinical diagnoses and refining kinematic measurement techniques.
NASA Astrophysics Data System (ADS)
Verdhora Ry, Rexha; Septyana, T.; Widiyantoro, S.; Nugraha, A. D.; Ardjuna, A.
2017-04-01
Microseismic monitoring and constraining its hypocenters in and around hydrocarbon reservoirs provides insight into induced deformation related to hydraulic fracturing. In this study, we used data from a single vertical array of sensors in a borehole, providing measures of arrival times and polarizations. Microseismic events are located using 1-D velocity models and arrival times of P- and S-waves. However, in the case of all the sensors being deployed in a near-vertical borehole, there is a high ambiguity in the source location. Herein, we applied a procedure using azimuth of P-wave particle motion to constrain and improve the source location. We used a dataset acquired during 1-day of fracture stimulation at a CBM field in Indonesia. We applied five steps of location procedure to investigate microseismic events induced by these hydraulic fracturing activities. First, arrival times for 1584 candidate events were manually picked. Then we refined the arrival times using energy ratio method to obtain high consistency picking. Using these arrival times, we estimated back-azimuth using P-wave polarization analysis. We also added the combination of polarities analysis to remove 180° ambiguity. In the end, we determined hypocenter locations using grid-search method that guided in the back-azimuth trace area to minimize the misfit function of arrival times. We have successfully removed the ambiguity and produced a good solution for hypocenter locations as indicated statistically by small RMS. Most of the events clusters highlight coherent structures around the treatment well site and revealed faults. The same procedure can be applied to various other cases such as microseismic monitoring in the field of geothermal and shale gas/oil exploration, also CCS (Carbon Capture and Storage) development.
Locational Issues in New Apprenticeships. Australian Apprenticeships.
ERIC Educational Resources Information Center
Dumbrell, T.; Finnegan, W.; de Montfort, R.
A study examined geographical distribution of Australian apprenticeship commencements (ACs) in the context of various labor force and population statistics by industry, location of jobs by industry, and youth population. Apprenticeship and traineeship statistics between 1995-98 were examined to demonstrate differences in development of the system…
NASA Astrophysics Data System (ADS)
Roy, P. K.; Pal, S.; Banerjee, G.; Biswas Roy, M.; Ray, D.; Majumder, A.
2014-12-01
River is considered as one of the main sources of freshwater all over the world. Hence analysis and maintenance of this water resource is globally considered a matter of major concern. This paper deals with the assessment of surface water quality of the Ichamati river using multivariate statistical techniques. Eight distinct surface water quality observation stations were located and samples were collected. For the samples collected statistical techniques were applied to the physico-chemical parameters and depth of siltation. In this paper cluster analysis is done to determine the relations between surface water quality and siltation depth of river Ichamati. Multiple regressions and mathematical equation modeling have been done to characterize surface water quality of Ichamati river on the basis of physico-chemical parameters. It was found that surface water quality of the downstream river was different from the water quality of the upstream. The analysis of the water quality parameters of the Ichamati river clearly indicate high pollution load on the river water which can be accounted to agricultural discharge, tidal effect and soil erosion. The results further reveal that with the increase in depth of siltation, water quality degraded.
Li, Pan; Xiao, Zhitao; Braciak, Todd A; Ou, Qingjian; Chen, Gong; Oduncu, Fuat S
2017-01-01
The progression of colorectal cancer (CRC) may differ depending on the location of the tumor and the age of onset of the disease. Previous studies also suggested that the molecular basis of CRC varies with tumor location, which could affect the clinical management of patients. Therefore, we performed survival analysis looking at different age groups and mismatch repair status (MMR) of CRC patients according to primary tumor location in an attempt to identify subgroups of CRC that might help in the prognosis of disease. A group of 2233 patients operated on to remove their CRC tumors were analyzed (521 with right colon cancer, 740 with left colon cancer and 972 with rectal cancer). The expression of four MMR genes was assessed by immunohistochemistry (IHC), independent of clinical criteria. From the data collected, a predictive model for overall survival (OS) could be constructed for some associations of tumor location and age of onset using Kaplan-Meier, logistic and Cox regression analysis. When tumor location was considered as the lone factor, we found no statistical difference in overall survival (OS) between right cancer (68%), left cancer (67%) or rectal cancer tumor locations (71%) (HR: 1.17, 95%CI (confidence interval): 0.97-1.43, P = 0.057). When age of onset was considered, middle age (40-59 years) and older (60-85 years) patients were found to have higher OS than younger onset cancer (20-39 years) patients (69% vs 71% vs 59%, HR: 1.07, 95% confidence interval (CI): 0.91-1.25, P = 0.008). When both age of onset and tumor location were considered in combination as disease factors, we found that the subgroup of patients with left colon cancer from middle age (69%) and older (67%) aged patients had higher OS than younger (54%) patients (HR: 0.89, 95%CI: 0.68-1.16, P = 0.048). However in patients with right colon cancers, we found no statistical difference is OS between younger, middle age or older grouped patients (60% vs 71% vs 67%, HR: 0.84, 95% CI: 0.61-1.16, P = 0.194). With regard to rectal located cancers, we found that younger (62%) and middle age (68) patients had lower OS than older (77%) patients (HR:1.46, 95%CI: 1.13-1.88, P = 0.004). The rates of deficient MMR (dMMR) was 10.4%. We found no statistical difference in OS stratified by tumor locations. However, right colon cancer patients with dMMR (86%) had higher OS than those with proficient MMR (pMMR) (63%) (HR: 3.01, 95% CI: 1.82-4.97, P<0.001). Left colon cancer patients with dMMR (76%) also had higher OS than those with pMMR (66%) (HR: 1.67, 95% CI: 0.95-2.92, P = 0.01). Oppositely, rectal cancer patients with dMMR (60%) had lower OS than those pMMR (68%) (HR: 0.77, 95% CI: 0.51-1.17, P = 0.04). These data demonstrate that primary tumor location can be an important factor when considered along with age of onset for the prognosis of CRC. Primary tumor location is also an important factor to evaluate the predictive effect of MMR status for the prognosis of CRC.
Chandrasekaran, A; Ravisankar, R; Harikrishnan, N; Satapathy, K K; Prasad, M V R; Kanagasabapathy, K V
2015-02-25
Anthropogenic activities increase the accumulation of heavy metals in the soil environment. Soil pollution significantly reduces environmental quality and affects the human health. In the present study soil samples were collected at different locations of Yelagiri Hills, Tamilnadu, India for heavy metal analysis. The samples were analyzed for twelve selected heavy metals (Mg, Al, K, Ca, Ti, Fe, V, Cr, Mn, Co, Ni and Zn) using energy dispersive X-ray fluorescence (EDXRF) spectroscopy. Heavy metals concentration in soil were investigated using enrichment factor (EF), geo-accumulation index (Igeo), contamination factor (CF) and pollution load index (PLI) to determine metal accumulation, distribution and its pollution status. Heavy metal toxicity risk was assessed using soil quality guidelines (SQGs) given by target and intervention values of Dutch soil standards. The concentration of Ni, Co, Zn, Cr, Mn, Fe, Ti, K, Al, Mg were mainly controlled by natural sources. Multivariate statistical methods such as correlation matrix, principal component analysis and cluster analysis were applied for the identification of heavy metal sources (anthropogenic/natural origin). Geo-statistical methods such as kirging identified hot spots of metal contamination in road areas influenced mainly by presence of natural rocks. Copyright © 2014 Elsevier B.V. All rights reserved.
Booth, Brian G; Keijsers, Noël L W; Sijbers, Jan; Huysmans, Toon
2018-05-03
Pedobarography produces large sets of plantar pressure samples that are routinely subsampled (e.g. using regions of interest) or aggregated (e.g. center of pressure trajectories, peak pressure images) in order to simplify statistical analysis and provide intuitive clinical measures. We hypothesize that these data reductions discard gait information that can be used to differentiate between groups or conditions. To test the hypothesis of null information loss, we created an implementation of statistical parametric mapping (SPM) for dynamic plantar pressure datasets (i.e. plantar pressure videos). Our SPM software framework brings all plantar pressure videos into anatomical and temporal correspondence, then performs statistical tests at each sampling location in space and time. Novelly, we introduce non-linear temporal registration into the framework in order to normalize for timing differences within the stance phase. We refer to our software framework as STAPP: spatiotemporal analysis of plantar pressure measurements. Using STAPP, we tested our hypothesis on plantar pressure videos from 33 healthy subjects walking at different speeds. As walking speed increased, STAPP was able to identify significant decreases in plantar pressure at mid-stance from the heel through the lateral forefoot. The extent of these plantar pressure decreases has not previously been observed using existing plantar pressure analysis techniques. We therefore conclude that the subsampling of plantar pressure videos - a task which led to the discarding of gait information in our study - can be avoided using STAPP. Copyright © 2018 Elsevier B.V. All rights reserved.
Spectral Discrete Probability Density Function of Measured Wind Turbine Noise in the Far Field
Ashtiani, Payam; Denison, Adelaide
2015-01-01
Of interest is the spectral character of wind turbine noise at typical residential set-back distances. In this paper, a spectral statistical analysis has been applied to immission measurements conducted at three locations. This method provides discrete probability density functions for the Turbine ONLY component of the measured noise. This analysis is completed for one-third octave sound levels, at integer wind speeds, and is compared to existing metrics for measuring acoustic comfort as well as previous discussions on low-frequency noise sources. PMID:25905097
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peterson, Elena S.; McCue, Lee Ann; Rutledge, Alexandra C.
2012-04-25
Visual Exploration and Statistics to Promote Annotation (VESPA) is an interactive visual analysis software tool that facilitates the discovery of structural mis-annotations in prokaryotic genomes. VESPA integrates high-throughput peptide-centric proteomics data and oligo-centric or RNA-Seq transcriptomics data into a genomic context. The data may be interrogated via visual analysis across multiple levels of genomic resolution, linked searches, exports and interaction with BLAST to rapidly identify location of interest within the genome and evaluate potential mis-annotations.
Spatial analysis on future housing markets: economic development and housing implications.
Liu, Xin; Wang, Lizhe
2014-01-01
A coupled projection method combining formal modelling and other statistical techniques was developed to delineate the relationship between economic and social drivers for net new housing allocations. Using the example of employment growth in Tyne and Wear, UK, until 2016, the empirical analysis yields housing projections at the macro- and microspatial levels (e.g., region to subregion to elected ward levels). The results have important implications for the strategic planning of locations for housing and employment, demonstrating both intuitively and quantitatively how local economic developments affect housing demand.
Spatial Analysis on Future Housing Markets: Economic Development and Housing Implications
Liu, Xin; Wang, Lizhe
2014-01-01
A coupled projection method combining formal modelling and other statistical techniques was developed to delineate the relationship between economic and social drivers for net new housing allocations. Using the example of employment growth in Tyne and Wear, UK, until 2016, the empirical analysis yields housing projections at the macro- and microspatial levels (e.g., region to subregion to elected ward levels). The results have important implications for the strategic planning of locations for housing and employment, demonstrating both intuitively and quantitatively how local economic developments affect housing demand. PMID:24892097
Gotvald, Anthony J.
2017-01-13
The U.S. Geological Survey, in cooperation with the Georgia Department of Natural Resources, Environmental Protection Division, developed regional regression equations for estimating selected low-flow frequency and mean annual flow statistics for ungaged streams in north Georgia that are not substantially affected by regulation, diversions, or urbanization. Selected low-flow frequency statistics and basin characteristics for 56 streamgage locations within north Georgia and 75 miles beyond the State’s borders in Alabama, Tennessee, North Carolina, and South Carolina were combined to form the final dataset used in the regional regression analysis. Because some of the streamgages in the study recorded zero flow, the final regression equations were developed using weighted left-censored regression analysis to analyze the flow data in an unbiased manner, with weights based on the number of years of record. The set of equations includes the annual minimum 1- and 7-day average streamflow with the 10-year recurrence interval (referred to as 1Q10 and 7Q10), monthly 7Q10, and mean annual flow. The final regional regression equations are functions of drainage area, mean annual precipitation, and relief ratio for the selected low-flow frequency statistics and drainage area and mean annual precipitation for mean annual flow. The average standard error of estimate was 13.7 percent for the mean annual flow regression equation and ranged from 26.1 to 91.6 percent for the selected low-flow frequency equations.The equations, which are based on data from streams with little to no flow alterations, can be used to provide estimates of the natural flows for selected ungaged stream locations in the area of Georgia north of the Fall Line. The regression equations are not to be used to estimate flows for streams that have been altered by the effects of major dams, surface-water withdrawals, groundwater withdrawals (pumping wells), diversions, or wastewater discharges. The regression equations should be used only for ungaged sites with drainage areas between 1.67 and 576 square miles, mean annual precipitation between 47.6 and 81.6 inches, and relief ratios between 0.146 and 0.607; these are the ranges of the explanatory variables used to develop the equations. An attempt was made to develop regional regression equations for the area of Georgia south of the Fall Line by using the same approach used during this study for north Georgia; however, the equations resulted with high average standard errors of estimates and poorly predicted flows below 0.5 cubic foot per second, which may be attributed to the karst topography common in that area.The final regression equations developed from this study are planned to be incorporated into the U.S. Geological Survey StreamStats program. StreamStats is a Web-based geographic information system that provides users with access to an assortment of analytical tools useful for water-resources planning and management, and for engineering design applications, such as the design of bridges. The StreamStats program provides streamflow statistics and basin characteristics for U.S. Geological Survey streamgage locations and ungaged sites of interest. StreamStats also can compute basin characteristics and provide estimates of streamflow statistics for ungaged sites when users select the location of a site along any stream in Georgia.
Statistical analysis of 4 types of neck whiplash injuries based on classical meridian theory.
Chen, Yemeng; Zhao, Yan; Xue, Xiaolin; Li, Hui; Wu, Xiuyan; Zhang, Qunce; Zheng, Xin; Wang, Tianfang
2015-01-01
As one component of the Chinese medicine meridian system, the meridian sinew (Jingjin, (see text), tendino-musculo) is specially described as being for acupuncture treatment of the musculoskeletal system because of its dynamic attributes and tender point correlations. In recent decades, the therapeutic importance of the sinew meridian has become revalued in clinical application. Based on this theory, the authors have established therapeutic strategies of acupuncture treatment in Whiplash-Associated Disorders (WAD) by categorizing four types of neck symptom presentations. The advantage of this new system is to make it much easier for the clinician to find effective acupuncture points. This study attempts to prove the significance of the proposed therapeutic strategies by analyzing data collected from a clinical survey of various WAD using non-supervised statistical methods, such as correlation analysis, factor analysis, and cluster analysis. The clinical survey data have successfully verified discrete characteristics of four neck syndromes, based upon the range of motion (ROM) and tender point location findings. A summary of the relationships among the symptoms of the four neck syndromes has shown the correlation coefficient as having a statistical significance (P < 0.01 or P < 0.05), especially with regard to ROM. Furthermore, factor and cluster analyses resulted in a total of 11 categories of general symptoms, which implies syndrome factors are more related to the Liver, as originally described in classical theory. The hypothesis of meridian sinew syndromes in WAD is clearly supported by the statistical analysis of the clinical trials. This new discovery should be beneficial in improving therapeutic outcomes.
The Swiss-Army-Knife Approach to the Nearly Automatic Analysis for Microearthquake Sequences.
NASA Astrophysics Data System (ADS)
Kraft, T.; Simon, V.; Tormann, T.; Diehl, T.; Herrmann, M.
2017-12-01
Many Swiss earthquake sequence have been studied using relative location techniques, which often allowed to constrain the active fault planes and shed light on the tectonic processes that drove the seismicity. Yet, in the majority of cases the number of located earthquakes was too small to infer the details of the space-time evolution of the sequences, or their statistical properties. Therefore, it has mostly been impossible to resolve clear patterns in the seismicity of individual sequences, which are needed to improve our understanding of the mechanisms behind them. Here we present a nearly automatic workflow that combines well-established seismological analysis techniques and allows to significantly improve the completeness of detected and located earthquakes of a sequence. We start from the manually timed routine catalog of the Swiss Seismological Service (SED), which contains the larger events of a sequence. From these well-analyzed earthquakes we dynamically assemble a template set and perform a matched filter analysis on the station with: the best SNR for the sequence; and a recording history of at least 10-15 years, our typical analysis period. This usually allows us to detect events several orders of magnitude below the SED catalog detection threshold. The waveform similarity of the events is then further exploited to derive accurate and consistent magnitudes. The enhanced catalog is then analyzed statistically to derive high-resolution time-lines of the a- and b-value and consequently the occurrence probability of larger events. Many of the detected events are strong enough to be located using double-differences. No further manual interaction is needed; we simply time-shift the arrival-time pattern of the detecting template to the associated detection. Waveform similarity assures a good approximation of the expected arrival-times, which we use to calculate event-pair arrival-time differences by cross correlation. After a SNR and cycle-skipping quality check these are directly fed into hypoDD. Using this procedure we usually improve the number of well-relocated events by a factor 2-5. We demonstrate the successful application of the workflow at the example of natural sequences in Switzerland and present first results of the advanced analysis the was possible with the enhanced catalogs.
Dependency of high coastal water level and river discharge at the global scale
NASA Astrophysics Data System (ADS)
Ward, P.; Couasnon, A.; Haigh, I. D.; Muis, S.; Veldkamp, T.; Winsemius, H.; Wahl, T.
2017-12-01
It is widely recognized that floods cause huge socioeconomic impacts. From 1980-2013, global flood losses exceeded $1 trillion, with 220,000 fatalities. These impacts are particularly hard felt in low-lying densely populated deltas and estuaries, whose location at the coast-land interface makes them naturally prone to flooding. When river and coastal floods coincide, their impacts in these deltas and estuaries are often worse than when they occur in isolation. Such floods are examples of so-called `compound events'. In this contribution, we present the first global scale analysis of the statistical dependency of high coastal water levels (and the storm surge component alone) and river discharge. We show that there is statistical dependency between these components at more than half of the stations examined. We also show time-lags in the highest correlation between peak discharges and coastal water levels. Finally, we assess the probability of the simultaneous occurrence of design discharge and design coastal water levels, assuming both independence and statistical dependence. For those stations where we identified statistical dependency, the probability is between 1 and 5 times greater, when the dependence structure is accounted for. This information is essential for understanding the likelihood of compound flood events occurring at locations around the world as well as for accurate flood risk assessments and effective flood risk management. The research was carried out by analysing the statistical dependency between observed coastal water levels (and the storm surge component) from GESLA-2 and river discharge using gauged data from GRDC stations all around the world. The dependence structure was examined using copula functions.
Zhang, Jingjing; Dennis, Todd E.
2015-01-01
We present a simple framework for classifying mutually exclusive behavioural states within the geospatial lifelines of animals. This method involves use of three sequentially applied statistical procedures: (1) behavioural change point analysis to partition movement trajectories into discrete bouts of same-state behaviours, based on abrupt changes in the spatio-temporal autocorrelation structure of movement parameters; (2) hierarchical multivariate cluster analysis to determine the number of different behavioural states; and (3) k-means clustering to classify inferred bouts of same-state location observations into behavioural modes. We demonstrate application of the method by analysing synthetic trajectories of known ‘artificial behaviours’ comprised of different correlated random walks, as well as real foraging trajectories of little penguins (Eudyptula minor) obtained by global-positioning-system telemetry. Our results show that the modelling procedure correctly classified 92.5% of all individual location observations in the synthetic trajectories, demonstrating reasonable ability to successfully discriminate behavioural modes. Most individual little penguins were found to exhibit three unique behavioural states (resting, commuting/active searching, area-restricted foraging), with variation in the timing and locations of observations apparently related to ambient light, bathymetry, and proximity to coastlines and river mouths. Addition of k-means clustering extends the utility of behavioural change point analysis, by providing a simple means through which the behaviours inferred for the location observations comprising individual movement trajectories can be objectively classified. PMID:25922935
COVARIATE-ADAPTIVE CLUSTERING OF EXPOSURES FOR AIR POLLUTION EPIDEMIOLOGY COHORTS*
Keller, Joshua P.; Drton, Mathias; Larson, Timothy; Kaufman, Joel D.; Sandler, Dale P.; Szpiro, Adam A.
2017-01-01
Cohort studies in air pollution epidemiology aim to establish associations between health outcomes and air pollution exposures. Statistical analysis of such associations is complicated by the multivariate nature of the pollutant exposure data as well as the spatial misalignment that arises from the fact that exposure data are collected at regulatory monitoring network locations distinct from cohort locations. We present a novel clustering approach for addressing this challenge. Specifically, we present a method that uses geographic covariate information to cluster multi-pollutant observations and predict cluster membership at cohort locations. Our predictive k-means procedure identifies centers using a mixture model and is followed by multi-class spatial prediction. In simulations, we demonstrate that predictive k-means can reduce misclassification error by over 50% compared to ordinary k-means, with minimal loss in cluster representativeness. The improved prediction accuracy results in large gains of 30% or more in power for detecting effect modification by cluster in a simulated health analysis. In an analysis of the NIEHS Sister Study cohort using predictive k-means, we find that the association between systolic blood pressure (SBP) and long-term fine particulate matter (PM2.5) exposure varies significantly between different clusters of PM2.5 component profiles. Our cluster-based analysis shows that for subjects assigned to a cluster located in the Midwestern U.S., a 10 μg/m3 difference in exposure is associated with 4.37 mmHg (95% CI, 2.38, 6.35) higher SBP. PMID:28572869
Zhang, Jingjing; O'Reilly, Kathleen M; Perry, George L W; Taylor, Graeme A; Dennis, Todd E
2015-01-01
We present a simple framework for classifying mutually exclusive behavioural states within the geospatial lifelines of animals. This method involves use of three sequentially applied statistical procedures: (1) behavioural change point analysis to partition movement trajectories into discrete bouts of same-state behaviours, based on abrupt changes in the spatio-temporal autocorrelation structure of movement parameters; (2) hierarchical multivariate cluster analysis to determine the number of different behavioural states; and (3) k-means clustering to classify inferred bouts of same-state location observations into behavioural modes. We demonstrate application of the method by analysing synthetic trajectories of known 'artificial behaviours' comprised of different correlated random walks, as well as real foraging trajectories of little penguins (Eudyptula minor) obtained by global-positioning-system telemetry. Our results show that the modelling procedure correctly classified 92.5% of all individual location observations in the synthetic trajectories, demonstrating reasonable ability to successfully discriminate behavioural modes. Most individual little penguins were found to exhibit three unique behavioural states (resting, commuting/active searching, area-restricted foraging), with variation in the timing and locations of observations apparently related to ambient light, bathymetry, and proximity to coastlines and river mouths. Addition of k-means clustering extends the utility of behavioural change point analysis, by providing a simple means through which the behaviours inferred for the location observations comprising individual movement trajectories can be objectively classified.
NASA Astrophysics Data System (ADS)
Varouchakis, Emmanouil; Kourgialas, Nektarios; Karatzas, George; Giannakis, Georgios; Lilli, Maria; Nikolaidis, Nikolaos
2014-05-01
Riverbank erosion affects the river morphology and the local habitat and results in riparian land loss, damage to property and infrastructures, ultimately weakening flood defences. An important issue concerning riverbank erosion is the identification of the areas vulnerable to erosion, as it allows for predicting changes and assists with stream management and restoration. One way to predict the vulnerable to erosion areas is to determine the erosion probability by identifying the underlying relations between riverbank erosion and the geomorphological and/or hydrological variables that prevent or stimulate erosion. A statistical model for evaluating the probability of erosion based on a series of independent local variables and by using logistic regression is developed in this work. The main variables affecting erosion are vegetation index (stability), the presence or absence of meanders, bank material (classification), stream power, bank height, river bank slope, riverbed slope, cross section width and water velocities (Luppi et al. 2009). In statistics, logistic regression is a type of regression analysis used for predicting the outcome of a categorical dependent variable, e.g. binary response, based on one or more predictor variables (continuous or categorical). The probabilities of the possible outcomes are modelled as a function of independent variables using a logistic function. Logistic regression measures the relationship between a categorical dependent variable and, usually, one or several continuous independent variables by converting the dependent variable to probability scores. Then, a logistic regression is formed, which predicts success or failure of a given binary variable (e.g. 1 = "presence of erosion" and 0 = "no erosion") for any value of the independent variables. The regression coefficients are estimated by using maximum likelihood estimation. The erosion occurrence probability can be calculated in conjunction with the model deviance regarding the independent variables tested (Atkinson et al. 2003). The developed statistical model is applied to the Koiliaris River Basin in the island of Crete, Greece. The aim is to determine the probability of erosion along the Koiliaris' riverbanks considering a series of independent geomorphological and/or hydrological variables. Data for the river bank slope and for the river cross section width are available at ten locations along the river. The riverbank has indications of erosion at six of the ten locations while four has remained stable. Based on a recent work, measurements for the two independent variables and data regarding bank stability are available at eight different locations along the river. These locations were used as validation points for the proposed statistical model. The results show a very close agreement between the observed erosion indications and the statistical model as the probability of erosion was accurately predicted at seven out of the eight locations. The next step is to apply the model at more locations along the riverbanks. In November 2013, stakes were inserted at selected locations in order to be able to identify the presence or absence of erosion after the winter period. In April 2014 the presence or absence of erosion will be identified and the model results will be compared to the field data. Our intent is to extend the model by increasing the number of independent variables in order to indentify the key factors favouring erosion along the Koiliaris River. We aim at developing an easy to use statistical tool that will provide a quantified measure of the erosion probability along the riverbanks, which could consequently be used to prevent erosion and flooding events. Atkinson, P. M., German, S. E., Sear, D. A. and Clark, M. J. 2003. Exploring the relations between riverbank erosion and geomorphological controls using geographically weighted logistic regression. Geographical Analysis, 35 (1), 58-82. Luppi, L., Rinaldi, M., Teruggi, L. B., Darby, S. E. and Nardi, L. 2009. Monitoring and numerical modelling of riverbank erosion processes: A case study along the Cecina River (central Italy). Earth Surface Processes and Landforms, 34 (4), 530-546. Acknowledgements This work is part of an on-going THALES project (CYBERSENSORS - High Frequency Monitoring System for Integrated Water Resources Management of Rivers). The project has been co-financed by the European Union (European Social Fund - ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding Program: THALES. Investing in knowledge society through the European Social Fund.
GARNET--gene set analysis with exploration of annotation relations.
Rho, Kyoohyoung; Kim, Bumjin; Jang, Youngjun; Lee, Sanghyun; Bae, Taejeong; Seo, Jihae; Seo, Chaehwa; Lee, Jihyun; Kang, Hyunjung; Yu, Ungsik; Kim, Sunghoon; Lee, Sanghyuk; Kim, Wan Kyu
2011-02-15
Gene set analysis is a powerful method of deducing biological meaning for an a priori defined set of genes. Numerous tools have been developed to test statistical enrichment or depletion in specific pathways or gene ontology (GO) terms. Major difficulties towards biological interpretation are integrating diverse types of annotation categories and exploring the relationships between annotation terms of similar information. GARNET (Gene Annotation Relationship NEtwork Tools) is an integrative platform for gene set analysis with many novel features. It includes tools for retrieval of genes from annotation database, statistical analysis & visualization of annotation relationships, and managing gene sets. In an effort to allow access to a full spectrum of amassed biological knowledge, we have integrated a variety of annotation data that include the GO, domain, disease, drug, chromosomal location, and custom-defined annotations. Diverse types of molecular networks (pathways, transcription and microRNA regulations, protein-protein interaction) are also included. The pair-wise relationship between annotation gene sets was calculated using kappa statistics. GARNET consists of three modules--gene set manager, gene set analysis and gene set retrieval, which are tightly integrated to provide virtually automatic analysis for gene sets. A dedicated viewer for annotation network has been developed to facilitate exploration of the related annotations. GARNET (gene annotation relationship network tools) is an integrative platform for diverse types of gene set analysis, where complex relationships among gene annotations can be easily explored with an intuitive network visualization tool (http://garnet.isysbio.org/ or http://ercsb.ewha.ac.kr/garnet/).
Wind speed statistics for Goldstone, California, anemometer sites
NASA Technical Reports Server (NTRS)
Berg, M.; Levy, R.; Mcginness, H.; Strain, D.
1981-01-01
An exploratory wind survey at an antenna complex was summarized statistically for application to future windmill designs. Data were collected at six locations from a total of 10 anemometers. Statistics include means, standard deviations, cubes, pattern factors, correlation coefficients, and exponents for power law profile of wind speed. Curves presented include: mean monthly wind speeds, moving averages, and diurnal variation patterns. It is concluded that three of the locations have sufficiently strong winds to justify consideration for windmill sites.
ERIC Educational Resources Information Center
Felstead, Alan; Jewson, Nick; Phizacklea, Annie; Walters, Sally
The patterns, extent, and problems of working at home in the United Kingdom were examined through a multivariate analysis of data from the Labour Force Survey, which has questioned respondents about the location of their workplace since 1992. The numbers of people working "mainly" at home increased from 345,920 (1.5%) in 1981 to 680,612…
Patricia K. Lebow; Charles G. Carll
2010-01-01
A statistical analysis was performed that identified time trends in the Scheffer Index value for 167 locations in the conterminous United States over the period 1969-2008. Year-to-year variation in Index values was found to be larger than year-to-year variation in most other weather parameters. Despite the substantial yearly variation, regression equations, with time (...
McSwain, Kristen Bukowski; Strickland, A.G.
2010-01-01
Groundwater conditions in Brunswick County, North Carolina, have been monitored continuously since 2000 through the operation and maintenance of groundwater-level observation wells in the surficial, Castle Hayne, and Peedee aquifers of the North Atlantic Coastal Plain aquifer system. Groundwater-resource conditions for the Brunswick County area were evaluated by relating the normal range (25th to 75th percentile) monthly mean groundwater-level and precipitation data for water years 2001 to 2008 to median monthly mean groundwater levels and monthly sum of daily precipitation for water year 2008. Summaries of precipitation and groundwater conditions for the Brunswick County area and hydrographs and statistics of continuous groundwater levels collected during the 2008 water year are presented in this report. Groundwater levels varied by aquifer and geographic location within Brunswick County, but were influenced by drought conditions and groundwater withdrawals. Water levels were normal in two of the eight observation wells and below normal in the remaining six wells. Seasonal Kendall trend analysis performed on more than 9 years of monthly mean groundwater-level data collected in an observation well located within the Brunswick County well field indicated there is a strong downward trend, with water levels declining at a rate of about 2.2 feet per year.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hernandez, T.; Poquioma, W.
1997-08-01
This study presents the results of an integrated reservoir study of the Eocene B-Inferior/VLG-3659, Area 7, Ceuta filed. This field located in the Maracaibo Lake in the western side of Venezuela. The objective was to evaluating the feasibility to implement a secondary recovery project by means of water flooding. Core information was used for this study (194 ft), PVT analysis, RFI, build-up and statistic`s pressure analysis, modem logs and production history data. Using geostatistical techniques (Kriging) it was defined a low uncertainty geological model that was validated by means of a black oil simulator (Eclipse). The results showed a goodmore » comparison of historical pressure of the reservoir against those obtained from the model, without the need of {open_quotes}history matching{close_quotes}. It means without modifying neither the initial rock properties nor reservoir fluids. The results of this study recommended drilling in two new locations, also the reactivation of four producing wells and water flooding under peripherical array by means of four injection wells, with the recovery of an additional 30.2 MMSTB. The economical evaluation shows an internal return rate of 31.4%.« less
NASA Astrophysics Data System (ADS)
Bugała, Artur; Bednarek, Karol; Kasprzyk, Leszek; Tomczewski, Andrzej
2017-10-01
The paper presents the most representative - from the three-year measurement time period - characteristics of daily and monthly electricity production from a photovoltaic conversion using modules installed in a fixed and 2-axis tracking construction. Results are presented for selected summer, autumn, spring and winter days. Analyzed measuring stand is located on the roof of the Faculty of Electrical Engineering Poznan University of Technology building. The basic parameters of the statistical analysis like mean value, standard deviation, skewness, kurtosis, median, range, or coefficient of variation were used. It was found that the asymmetry factor can be useful in the analysis of the daily electricity production from a photovoltaic conversion. In order to determine the repeatability of monthly electricity production, occurring between the summer, and summer and winter months, a non-parametric Mann-Whitney U test was used as a statistical solution. In order to analyze the repeatability of daily peak hours, describing the largest value of the hourly electricity production, a non-parametric Kruskal-Wallis test was applied as an extension of the Mann-Whitney U test. Based on the analysis of the electric energy distribution from a prepared monitoring system it was found that traditional forecasting methods of the electricity production from a photovoltaic conversion, like multiple regression models, should not be the preferred methods of the analysis.
Diana, Barbara; Zurloni, Valentino; Elia, Massimiliano; Cavalera, Cesare M; Jonsson, Gudberg K; Anguera, M Teresa
2017-01-01
The influence of game location on performance has been widely examined in sport contexts. Concerning soccer, game-location affects positively the secondary and tertiary level of performance; however, there are fewer evidences about its effect on game structure (primary level of performance). This study aimed to detect the effect of game location on a primary level of performance in soccer. In particular, the objective was to reveal the hidden structures underlying the attack actions, in both home and away matches played by a top club (Serie A 2012/2013-First Leg). The methodological approach was based on systematic observation, supported by digital recordings and T-pattern analysis. Data were analyzed with THEME 6.0 software. A quantitative analysis, with nonparametric Mann-Whitney test and descriptive statistics, was carried out to test the hypotheses. A qualitative analysis on complex patterns was performed to get in-depth information on the game structure. This study showed that game tactics were significantly different, with home matches characterized by a more structured and varied game than away matches. In particular, a higher number of different patterns, with a higher level of complexity and including more unique behaviors was detected in home matches than in the away ones. No significant differences were found in the number of events coded per game between the two conditions. THEME software, and the corresponding T-pattern detection algorithm, enhance research opportunities by going further than frequency-based analyses, making this method an effective tool in supporting sport performance analysis and training.
Diana, Barbara; Zurloni, Valentino; Elia, Massimiliano; Cavalera, Cesare M.; Jonsson, Gudberg K.; Anguera, M. Teresa
2017-01-01
The influence of game location on performance has been widely examined in sport contexts. Concerning soccer, game-location affects positively the secondary and tertiary level of performance; however, there are fewer evidences about its effect on game structure (primary level of performance). This study aimed to detect the effect of game location on a primary level of performance in soccer. In particular, the objective was to reveal the hidden structures underlying the attack actions, in both home and away matches played by a top club (Serie A 2012/2013—First Leg). The methodological approach was based on systematic observation, supported by digital recordings and T-pattern analysis. Data were analyzed with THEME 6.0 software. A quantitative analysis, with nonparametric Mann–Whitney test and descriptive statistics, was carried out to test the hypotheses. A qualitative analysis on complex patterns was performed to get in-depth information on the game structure. This study showed that game tactics were significantly different, with home matches characterized by a more structured and varied game than away matches. In particular, a higher number of different patterns, with a higher level of complexity and including more unique behaviors was detected in home matches than in the away ones. No significant differences were found in the number of events coded per game between the two conditions. THEME software, and the corresponding T-pattern detection algorithm, enhance research opportunities by going further than frequency-based analyses, making this method an effective tool in supporting sport performance analysis and training. PMID:28878712
Wu, Johnny C; Gardner, David P; Ozer, Stuart; Gutell, Robin R; Ren, Pengyu
2009-08-28
The accurate prediction of the secondary and tertiary structure of an RNA with different folding algorithms is dependent on several factors, including the energy functions. However, an RNA higher-order structure cannot be predicted accurately from its sequence based on a limited set of energy parameters. The inter- and intramolecular forces between this RNA and other small molecules and macromolecules, in addition to other factors in the cell such as pH, ionic strength, and temperature, influence the complex dynamics associated with transition of a single stranded RNA to its secondary and tertiary structure. Since all of the factors that affect the formation of an RNAs 3D structure cannot be determined experimentally, statistically derived potential energy has been used in the prediction of protein structure. In the current work, we evaluate the statistical free energy of various secondary structure motifs, including base-pair stacks, hairpin loops, and internal loops, using their statistical frequency obtained from the comparative analysis of more than 50,000 RNA sequences stored in the RNA Comparative Analysis Database (rCAD) at the Comparative RNA Web (CRW) Site. Statistical energy was computed from the structural statistics for several datasets. While the statistical energy for a base-pair stack correlates with experimentally derived free energy values, suggesting a Boltzmann-like distribution, variation is observed between different molecules and their location on the phylogenetic tree of life. Our statistical energy values calculated for several structural elements were utilized in the Mfold RNA-folding algorithm. The combined statistical energy values for base-pair stacks, hairpins and internal loop flanks result in a significant improvement in the accuracy of secondary structure prediction; the hairpin flanks contribute the most.
NASA Astrophysics Data System (ADS)
Nguyen, A.; Mueller, C.; Brooks, A. N.; Kislik, E. A.; Baney, O. N.; Ramirez, C.; Schmidt, C.; Torres-Perez, J. L.
2014-12-01
The Sierra Nevada is experiencing changes in hydrologic regimes, such as decreases in snowmelt and peak runoff, which affect forest health and the availability of water resources. Currently, the USDA Forest Service Region 5 is undergoing Forest Plan revisions to include climate change impacts into mitigation and adaptation strategies. However, there are few processes in place to conduct quantitative assessments of forest conditions in relation to mountain hydrology, while easily and effectively delivering that information to forest managers. To assist the USDA Forest Service, this study is the final phase of a three-term project to create a Decision Support System (DSS) to allow ease of access to historical and forecasted hydrologic, climatic, and terrestrial conditions for the entire Sierra Nevada. This data is featured within three components of the DSS: the Mapping Viewer, Statistical Analysis Portal, and Geospatial Data Gateway. Utilizing ArcGIS Online, the Sierra DSS Mapping Viewer enables users to visually analyze and locate areas of interest. Once the areas of interest are targeted, the Statistical Analysis Portal provides subbasin level statistics for each variable over time by utilizing a recently developed web-based data analysis and visualization tool called Plotly. This tool allows users to generate graphs and conduct statistical analyses for the Sierra Nevada without the need to download the dataset of interest. For more comprehensive analysis, users are also able to download datasets via the Geospatial Data Gateway. The third phase of this project focused on Python-based data processing, the adaptation of the multiple capabilities of ArcGIS Online and Plotly, and the integration of the three Sierra DSS components within a website designed specifically for the USDA Forest Service.
Equilibrium statistical-thermal models in high-energy physics
NASA Astrophysics Data System (ADS)
Tawfik, Abdel Nasser
2014-05-01
We review some recent highlights from the applications of statistical-thermal models to different experimental measurements and lattice QCD thermodynamics that have been made during the last decade. We start with a short review of the historical milestones on the path of constructing statistical-thermal models for heavy-ion physics. We discovered that Heinz Koppe formulated in 1948, an almost complete recipe for the statistical-thermal models. In 1950, Enrico Fermi generalized this statistical approach, in which he started with a general cross-section formula and inserted into it, the simplifying assumptions about the matrix element of the interaction process that likely reflects many features of the high-energy reactions dominated by density in the phase space of final states. In 1964, Hagedorn systematically analyzed the high-energy phenomena using all tools of statistical physics and introduced the concept of limiting temperature based on the statistical bootstrap model. It turns to be quite often that many-particle systems can be studied with the help of statistical-thermal methods. The analysis of yield multiplicities in high-energy collisions gives an overwhelming evidence for the chemical equilibrium in the final state. The strange particles might be an exception, as they are suppressed at lower beam energies. However, their relative yields fulfill statistical equilibrium, as well. We review the equilibrium statistical-thermal models for particle production, fluctuations and collective flow in heavy-ion experiments. We also review their reproduction of the lattice QCD thermodynamics at vanishing and finite chemical potential. During the last decade, five conditions have been suggested to describe the universal behavior of the chemical freeze-out parameters. The higher order moments of multiplicity have been discussed. They offer deep insights about particle production and to critical fluctuations. Therefore, we use them to describe the freeze-out parameters and suggest the location of the QCD critical endpoint. Various extensions have been proposed in order to take into consideration the possible deviations of the ideal hadron gas. We highlight various types of interactions, dissipative properties and location-dependences (spatial rapidity). Furthermore, we review three models combining hadronic with partonic phases; quasi-particle model, linear sigma model with Polyakov potentials and compressible bag model.
Cundell, A M; Bean, R; Massimore, L; Maier, C
1998-01-01
To determine the relationship between the sampling time of the environmental monitoring, i.e., viable counts, in aseptic filling areas and the microbial count and frequency of alerts for air, surface and personnel microbial monitoring, statistical analyses were conducted on 1) the frequency of alerts versus the time of day for routine environmental sampling conducted in calendar year 1994, and 2) environmental monitoring data collected at 30-minute intervals during routine aseptic filling operations over two separate days in four different clean rooms with multiple shifts and equipment set-ups at a parenteral manufacturing facility. Statistical analyses showed, except for one floor location that had significantly higher number of counts but no alert or action level samplings in the first two hours of operation, there was no relationship between the number of counts and the time of sampling. Further studies over a 30-day period at the floor location showed no relationship between time of sampling and microbial counts. The conclusion reached in the study was that there is no worst case time for environmental monitoring at that facility and that sampling any time during the aseptic filling operation will give a satisfactory measure of the microbial cleanliness in the clean room during the set-up and aseptic filling operation.
Code System for Performance Assessment Ground-water Analysis for Low-level Nuclear Waste.
DOE Office of Scientific and Technical Information (OSTI.GOV)
MATTHEW,; KOZAK, W.
1994-02-09
Version 00 The PAGAN code system is a part of the performance assessment methodology developed for use by the U. S. Nuclear Regulatory Commission in evaluating license applications for low-level waste disposal facilities. In this methodology, PAGAN is used as one candidate approach for analysis of the ground-water pathway. PAGAN, Version 1.1 has the capability to model the source term, vadose-zone transport, and aquifer transport of radionuclides from a waste disposal unit. It combines the two codes SURFACE and DISPERSE which are used as semi-analytical solutions to the convective-dispersion equation. This system uses menu driven input/out for implementing a simplemore » ground-water transport analysis and incorporates statistical uncertainty functions for handling data uncertainties. The output from PAGAN includes a time- and location-dependent radionuclide concentration at a well in the aquifer, or a time- and location-dependent radionuclide flux into a surface-water body.« less
Patirana, A.; Hatcher, S.A.; Friesen, Vicki L.
2002-01-01
Population decline in red-legged kittiwakes (Rissa brevirostris) over recent decades has necessitated the collection of information on the distribution of genetic variation within and among colonies for implementation of suitable management policies. Here we present a preliminary study of the extent of genetic structuring and gene flow among the three principal breeding locations of red-legged kittiwakes using the hypervariable Domain I of the mitochondrial control region. Genetic variation was high relative to other species of seabirds, and was similar among locations. Analysis of molecular variance indicated that population genetic structure was statistically significant, and nested clade analysis suggested that kittiwakes breeding on Bering Island maybe genetically isolated from those elsewhere. However, phylogeographic structure was weak. Although this analysis involved only a single locus and a small number of samples, it suggests that red-legged kittiwakes probably constitute a single evolutionary significant unit; the possibility that they constitute two management units requires further investigation.
Issues in Quantitative Analysis of Ultraviolet Imager (UV) Data: Airglow
NASA Technical Reports Server (NTRS)
Germany, G. A.; Richards, P. G.; Spann, J. F.; Brittnacher, M. J.; Parks, G. K.
1999-01-01
The GGS Ultraviolet Imager (UVI) has proven to be especially valuable in correlative substorm, auroral morphology, and extended statistical studies of the auroral regions. Such studies are based on knowledge of the location, spatial, and temporal behavior of auroral emissions. More quantitative studies, based on absolute radiometric intensities from UVI images, require a more intimate knowledge of the instrument behavior and data processing requirements and are inherently more difficult than studies based on relative knowledge of the oval location. In this study, UVI airglow observations are analyzed and compared with model predictions to illustrate issues that arise in quantitative analysis of UVI images. These issues include instrument calibration, long term changes in sensitivity, and imager flat field response as well as proper background correction. Airglow emissions are chosen for this study because of their relatively straightforward modeling requirements and because of their implications for thermospheric compositional studies. The analysis issues discussed here, however, are identical to those faced in quantitative auroral studies.
Detection of crossover time scales in multifractal detrended fluctuation analysis
NASA Astrophysics Data System (ADS)
Ge, Erjia; Leung, Yee
2013-04-01
Fractal is employed in this paper as a scale-based method for the identification of the scaling behavior of time series. Many spatial and temporal processes exhibiting complex multi(mono)-scaling behaviors are fractals. One of the important concepts in fractals is crossover time scale(s) that separates distinct regimes having different fractal scaling behaviors. A common method is multifractal detrended fluctuation analysis (MF-DFA). The detection of crossover time scale(s) is, however, relatively subjective since it has been made without rigorous statistical procedures and has generally been determined by eye balling or subjective observation. Crossover time scales such determined may be spurious and problematic. It may not reflect the genuine underlying scaling behavior of a time series. The purpose of this paper is to propose a statistical procedure to model complex fractal scaling behaviors and reliably identify the crossover time scales under MF-DFA. The scaling-identification regression model, grounded on a solid statistical foundation, is first proposed to describe multi-scaling behaviors of fractals. Through the regression analysis and statistical inference, we can (1) identify the crossover time scales that cannot be detected by eye-balling observation, (2) determine the number and locations of the genuine crossover time scales, (3) give confidence intervals for the crossover time scales, and (4) establish the statistically significant regression model depicting the underlying scaling behavior of a time series. To substantive our argument, the regression model is applied to analyze the multi-scaling behaviors of avian-influenza outbreaks, water consumption, daily mean temperature, and rainfall of Hong Kong. Through the proposed model, we can have a deeper understanding of fractals in general and a statistical approach to identify multi-scaling behavior under MF-DFA in particular.
NASA Astrophysics Data System (ADS)
Kawzenuk, B.; Sellars, S. L.; Nguyen, P.; Ralph, F. M.; Sorooshian, S.
2017-12-01
The CONNected objECT (CONNECT) algorithm is applied to Integrated Water Vapor Transport (IVT) data from the NASA's Modern-Era Retrospective Analysis for Research and Applications - Version 2 reanalysis product for the period 1980 to 2016 to study water vapor transport globally. The algorithm generates life-cycle records as statistical objects for the time and space location of the evolving strong vapor transport events. Global statistics are presented and used to investigate how climate variability impacts the events' location and frequency. Results show distinct water vapor object frequency and seasonal peaks during NH and SH Winter. Moreover, a positive linear trend in the annual number of objects is reported, increasing by 3.58 objects year-over-year (with 95% confidence, +/- 1.39). In addition, we show five distinct regions where these events typically exist (southeastern United States, eastern China, South Pacific south of 25°S, eastern South America and off the southern tip of South Africa), and where they rarely exist (eastern South Pacific Ocean and central southern Atlantic Ocean between 5°N-25°S). In addition, the event frequency and geographical location are also shown to be related to the Arctic Oscillation, Pacific North American Pattern, and the Quasi-Biennial Oscillation.
Modality-Driven Classification and Visualization of Ensemble Variance
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bensema, Kevin; Gosink, Luke; Obermaier, Harald
Advances in computational power now enable domain scientists to address conceptual and parametric uncertainty by running simulations multiple times in order to sufficiently sample the uncertain input space. While this approach helps address conceptual and parametric uncertainties, the ensemble datasets produced by this technique present a special challenge to visualization researchers as the ensemble dataset records a distribution of possible values for each location in the domain. Contemporary visualization approaches that rely solely on summary statistics (e.g., mean and variance) cannot convey the detailed information encoded in ensemble distributions that are paramount to ensemble analysis; summary statistics provide no informationmore » about modality classification and modality persistence. To address this problem, we propose a novel technique that classifies high-variance locations based on the modality of the distribution of ensemble predictions. Additionally, we develop a set of confidence metrics to inform the end-user of the quality of fit between the distribution at a given location and its assigned class. We apply a similar method to time-varying ensembles to illustrate the relationship between peak variance and bimodal or multimodal behavior. These classification schemes enable a deeper understanding of the behavior of the ensemble members by distinguishing between distributions that can be described by a single tendency and distributions which reflect divergent trends in the ensemble.« less
NASA Astrophysics Data System (ADS)
Fathy, Adel; Ghamry, Essam
2017-03-01
Though the Equatorial Ionospheric Anomaly (EIA) is represented by two crests within ±15° latitude, a single crest is also observed in the entire ionosphere. Few studies have addressed single crest phenomena. A statistical study of 2237 single crest phenomenon from the in situ electron density measurements of Swarm A satellite was investigated during December 2013-December 2015. Our analysis focused on local time, seasonal, and both geographic and geomagnetic latitudinal variations. Our results show the following observations: 1 - The maximum number of events peaks mainly in the dayside region around 0800-1200 LT and these occur mainly within the magnetic equator. 2 - The maximum amplitude of the single crests take place most prominently during equinoxes. 3 - The majority of single crests occur in the northern hemisphere. 4 - The seasonal distribution of the events shows that the summer events are located further from the magnetic equator in the northern hemisphere and shift their locations into the southern hemisphere in winter, while spring events are centered along the magnetic equator. 5 - Dayside single crest events appear close to the magnetic equator and more centered on the equator in winter season. 6 - Dawn, night and dusk side events reverse their location from northern hemisphere in summer to southern hemisphere in winter.
Assessment of Three Flood Hazard Mapping Methods: A Case Study of Perlis
NASA Astrophysics Data System (ADS)
Azizat, Nazirah; Omar, Wan Mohd Sabki Wan
2018-03-01
Flood is a common natural disaster and also affect the all state in Malaysia. Regarding to Drainage and Irrigation Department (DID) in 2007, about 29, 270 km2 or 9 percent of region of the country is prone to flooding. Flood can be such devastating catastrophic which can effected to people, economy and environment. Flood hazard mapping can be used is an important part in flood assessment to define those high risk area prone to flooding. The purposes of this study are to prepare a flood hazard mapping in Perlis and to evaluate flood hazard using frequency ratio, statistical index and Poisson method. The six factors affecting the occurrence of flood including elevation, distance from the drainage network, rainfall, soil texture, geology and erosion were created using ArcGIS 10.1 software. Flood location map in this study has been generated based on flooded area in year 2010 from DID. These parameters and flood location map were analysed to prepare flood hazard mapping in representing the probability of flood area. The results of the analysis were verified using flood location data in year 2013, 2014, 2015. The comparison result showed statistical index method is better in prediction of flood area rather than frequency ratio and Poisson method.
Krawczyk, Christopher; Gradziel, Pat; Geraghty, Estella M.
2014-01-01
Objectives. We used a geographic information system and cluster analyses to determine locations in need of enhanced Special Supplemental Nutrition Program for Women, Infants, and Children (WIC) Program services. Methods. We linked documented births in the 2010 California Birth Statistical Master File with the 2010 data from the WIC Integrated Statewide Information System. Analyses focused on the density of pregnant women who were eligible for but not receiving WIC services in California’s 7049 census tracts. We used incremental spatial autocorrelation and hot spot analyses to identify clusters of WIC-eligible nonparticipants. Results. We detected clusters of census tracts with higher-than-expected densities, compared with the state mean density of WIC-eligible nonparticipants, in 21 of 58 (36.2%) California counties (P < .05). In subsequent county-level analyses, we located neighborhood-level clusters of higher-than-expected densities of eligible nonparticipants in Sacramento, San Francisco, Fresno, and Los Angeles Counties (P < .05). Conclusions. Hot spot analyses provided a rigorous and objective approach to determine the locations of statistically significant clusters of WIC-eligible nonparticipants. Results helped inform WIC program and funding decisions, including the opening of new WIC centers, and offered a novel approach for targeting public health services. PMID:24354821
Selected low-flow frequency statistics for continuous-record streamgage locations in Maryland, 2010
Doheny, Edward J.; Banks, William S.L.
2010-01-01
According to a 2008 report by the Governor's Advisory Committee on the Management and Protection of the State's Water Resources, Maryland's population grew by 35 percent between 1970 and 2000, and is expected to increase by an additional 27 percent between 2000 and 2030. Because domestic water demand generally increases in proportion to population growth, Maryland will be facing increased pressure on water resources over the next 20 years. Water-resources decisions should be based on sound, comprehensive, long-term data and low-flow frequency statistics from all available streamgage locations with unregulated streamflow and adequate record lengths. To provide the Maryland Department of the Environment with tools for making future water-resources decisions, the U.S. Geological Survey initiated a study in October 2009 to compute low-flow frequency statistics for selected streamgage locations in Maryland with 10 or more years of continuous streamflow records. This report presents low-flow frequency statistics for 114 continuous-record streamgage locations in Maryland. The computed statistics presented for each streamgage location include the mean 7-, 14-, and 30-consecutive day minimum daily low-flow dischages for recurrence intervals of 2, 10, and 20 years, and are based on approved streamflow records that include a minimum of 10 complete climatic years of record as of June 2010. Descriptive information for each of these streamgage locations, including the station number, station name, latitude, longitude, county, physiographic province, and drainage area, also is presented. The statistics are planned for incorporation into StreamStats, which is a U.S. Geological Survey Web application for obtaining stream information, and is being used by water-resource managers and decision makers in Maryland to address water-supply planning and management, water-use appropriation and permitting, wastewater and industrial discharge permitting, and setting minimum required streamflows to protect freshwater biota and ecosystems.
Characteristics of the umbilical artery velocity waveform as function of measurement site.
Ruissen, C J; von Drongelen, M M; Hoogland, H J; Jager, W; Hoeks, A P
1990-01-01
In 30 uncomplicated singleton pregnancies, varying in duration between 24 and 40 weeks, the variability of the flow velocity waveform (FVW) along the course of the umbilical artery was investigated. Blood flow velocities were recorded at 4 locations in the vessel: within the fetal abdomen, 0-5 cm from the origin of the umbilical cord, in the free-floating part, and 0-5 cm from its insertion in the placenta. From the Doppler signals recorded, the pulsatility index (PI) and a parameter for the frequency distribution index (FDI) were calculated. PI values differed among the locations, but no unequivocal tendency could be demonstrated. Statistical analysis, including multiple regression analysis for maternal and menstrual age and fetal heart rate, showed no significant difference in PI and FDI values for any of the 4 locations. It can be concluded that in uncomplicated pregnancies, possible changes in FVW (quantified by PI) along the course of the umbilical artery have no clinical relevance. Therefore, standardization for the sampling site when measuring PI in this vessel seems to be unnecessary.
Ziegeweid, Jeffrey R.; Lorenz, David L.; Sanocki, Chris A.; Czuba, Christiana R.
2015-12-24
Equations developed in this study apply only to stream locations where flows are not substantially affected by regulation, diversion, or urbanization. All equations presented in this study will be incorporated into StreamStats, a web-based geographic information system tool developed by the U.S. Geological Survey. StreamStats allows users to obtain streamflow statistics, basin characteristics, and other information for user-selected locations on streams through an interactive map.
Button, C; Dicks, M; Haines, R; Barker, R; Davids, K
2011-08-01
Previous research on gaze behaviour in sport has typically reported summary fixation statistics thereby largely ignoring the temporal sequencing of gaze. In the present study on penalty kicking in soccer, our aim was to apply a Markov chain modelling method to eye movement data obtained from goalkeepers. Building on the discrete analysis of gaze employed by Dicks et al. (Atten Percept Psychophys 72(3):706-720, 2010b), we wanted to statistically model the relative probabilities of the goalkeeper's gaze being directed to different locations throughout the penalty taker's approach (Dicks et al. in Atten Percept Psychophys 72(3):706-720, 2010b). Examination of gaze behaviours under in situ and video-simulation task constraints reveals differences in information pickup for perception and action (Attention, Perception and Psychophysics 72(3), 706-720). The probabilities of fixating anatomical locations of the penalty taker were high under simulated movement response conditions. In contrast, when actually required to intercept kicks, the goalkeepers initially favoured watching the penalty taker's head but then rapidly shifted focus directly to the ball for approximately the final second prior to foot-ball contact. The increased spatio-temporal demands of in situ interceptive actions over laboratory-based simulated actions lead to different visual search strategies being used. When eye movement data are modelled as time series, it is possible to discern subtle but important behavioural characteristics that are less apparent with discrete summary statistics alone.
Spatial and Temporal Emergence Pattern of Lyme Disease in Virginia
Li, Jie; Kolivras, Korine N.; Hong, Yili; Duan, Yuanyuan; Seukep, Sara E.; Prisley, Stephen P.; Campbell, James B.; Gaines, David N.
2014-01-01
The emergence of infectious diseases over the past several decades has highlighted the need to better understand epidemics and prepare for the spread of diseases into new areas. As these diseases expand their geographic range, cases are recorded at different geographic locations over time, making the analysis and prediction of this expansion complicated. In this study, we analyze spatial patterns of the disease using a statistical smoothing analysis based on areal (census tract level) count data of Lyme disease cases in Virginia from 1998 to 2011. We also use space and space–time scan statistics to reveal the presence of clusters in the spatial and spatiotemporal distribution of Lyme disease. Our results confirm and quantify the continued emergence of Lyme disease to the south and west in states along the eastern coast of the United States. The results also highlight areas where education and surveillance needs are highest. PMID:25331806
Jamshidi-Zanjani, Ahmad; Saeedi, Mohsen
2017-07-01
Vertical distribution of metals (Cu, Zn, Cr, Fe, Mn, Pb, Ni, Cd, and Li) in four sediment core samples (C 1 , C 2 , C 3 , and C 4 ) from Anzali international wetland located southwest of the Caspian Sea was examined. Background concentration of each metal was calculated according to different statistical approaches. The results of multivariate statistical analysis showed that Fe and Mn might have significant role in the fate of Ni and Zn in sediment core samples. Different sediment quality indexes were utilized to assess metal pollution in sediment cores. Moreover, a new sediment quality index named aggregative toxicity index (ATI) based on sediment quality guidelines (SQGs) was developed to assess the degree of metal toxicity in an aggregative manner. The increasing pattern of metal pollution and their toxicity degree in upper layers of core samples indicated increasing effects of anthropogenic sources in the study area.
Abbott, M.; Einerson, J.; Schuster, P.; Susong, D.; Taylor, Howard E.; ,
2004-01-01
Snow sampling and analysis methods which produce accurate and ultra-low measurements of trace elements and common ion concentration in southeastern Idaho snow, were developed. Snow samples were collected over two winters to assess trace elements and common ion concentrations in air pollutant fallout across the southeastern Idaho. The area apportionment of apportionment of fallout concentrations measured at downwind location were investigated using pattern recognition and multivariate statistical technical techniques. Results show a high level of contribution from phosphates processing facilities located outside Pocatello in the southern portion of the Eastern Snake River Plain, and no obvious source area profiles other than at Pocatello.
NASA Astrophysics Data System (ADS)
Yan, Rui; Parrot, Michel; Pinçon, Jean-Louis
2017-12-01
In this paper, we present the result of a statistical study performed on the ionospheric ion density variations above areas of seismic activity. The ion density was observed by the low altitude satellite DEMETER between 2004 and 2010. In the statistical analysis a superposed epoch method is used where the observed ionospheric ion density close to the epicenters both in space and in time is compared to background values recorded at the same location and in the same conditions. Data associated with aftershocks have been carefully removed from the database to prevent spurious effects on the statistics. It is shown that, during nighttime, anomalous ionospheric perturbations related to earthquakes with magnitudes larger than 5 are evidenced. At the time of these perturbations the background ion fluctuation departs from a normal distribution. They occur up to 200 km from the epicenters and mainly 5 days before the earthquakes. As expected, an ion density perturbation occurring just after the earthquakes and close to the epicenters is also evidenced.
Environmental Health Practice: Statistically Based Performance Measurement
Enander, Richard T.; Gagnon, Ronald N.; Hanumara, R. Choudary; Park, Eugene; Armstrong, Thomas; Gute, David M.
2007-01-01
Objectives. State environmental and health protection agencies have traditionally relied on a facility-by-facility inspection-enforcement paradigm to achieve compliance with government regulations. We evaluated the effectiveness of a new approach that uses a self-certification random sampling design. Methods. Comprehensive environmental and occupational health data from a 3-year statewide industry self-certification initiative were collected from representative automotive refinishing facilities located in Rhode Island. Statistical comparisons between baseline and postintervention data facilitated a quantitative evaluation of statewide performance. Results. The analysis of field data collected from 82 randomly selected automotive refinishing facilities showed statistically significant improvements (P<.05, Fisher exact test) in 4 major performance categories: occupational health and safety, air pollution control, hazardous waste management, and wastewater discharge. Statistical significance was also shown when a modified Bonferroni adjustment for multiple comparisons was performed. Conclusions. Our findings suggest that the new self-certification approach to environmental and worker protection is effective and can be used as an adjunct to further enhance state and federal enforcement programs. PMID:17267709
Puertas, E Benjamín; Rivera, Tamara Y
2016-11-01
To 1) describe patterns of specialty choice; 2) investigate relationships between career selection and selected demographic indicators; and 3) identify salary perception, factors that influence career choice in primary care, and factors that influence desired location of future medical practice. The study used a mixed-methods approach that included a cross-sectional questionnaire survey applied to 234 last-year medical students in Honduras (September 2014), and semi-structured interviews with eight key informants (October 2014). Statistical analysis included chi-square and factor analysis. An alpha level of 0.05 was used to determine significance. In the qualitative analysis, several codes were associated with each other, and five major themes emerged. Primary care careers were the preferred choice for 8.1% of students, who preferred urban settings for future practice location. The perceived salary of specialties other than primary care was significantly higher than those of general practitioners, family practitioners, and pediatricians (P < 0.001). Participants considered "making a difference," income, teaching, prestige, and challenging work the most important factors influencing career choice. Practice in ambulatory settings was significantly associated with a preference for primary care specialties (P = < 0.05). Logistic regression analysis found that factors related to patient-based care were statistically significant for selecting primary care (P = 0.006). The qualitative analysis further endorsed the survey findings, identifying additional factors that influence career choice (future work option; availability of residency positions; and social factors, including violence). Rationales behind preference of a specialty appeared to be based on a combination of ambition and prestige, and on personal and altruistic considerations. Most factors that influence primary care career choice are similar to those found in the literature. There are several factors distinctive to medical students in Honduras-most of them barriers to primary care career choice.
Analysis of trends in water-quality data for water conservation area 3A, the Everglades, Florida
Mattraw, H.C.; Scheidt, D.J.; Federico, A.C.
1987-01-01
Rainfall and water quality data bases from the South Florida Water Management District were used to evaluate water quality trends at 10 locations near or in Water Conservation Area 3A in The Everglades. The Seasonal Kendall test was applied to specific conductance, orthophosphate-phosphorus, nitrate-nitrogen, total Kjeldahl nitrogen, and total nitrogen regression residuals for the period 1978-82. Residuals of orthophosphate and nitrate quadratic models, based on antecedent 7-day rainfall at inflow gate S-11B, were the only two constituent-structure pairs that showed apparent significant (p < 0.05) increases in constituent concentrations. Elimination of regression models with distinct residual patterns and data outlines resulted in 17 statistically significant station water quality combinations for trend analysis. No water quality trends were observed. The 1979 Memorandum of Agreement outlining the water quality monitoring program between the Everglades National Park and the U.S. Army Corps of Engineers stressed collection four times a year at three stations, and extensive coverage of water quality properties. Trend analysis and other rigorous statistical evaluation programs are better suited to data monitoring programs that include more frequent sampling and that are organized in a water quality data management system. Pronounced areal differences in water quality suggest that a water quality monitoring system for Shark River Slough in Everglades National Park include collection locations near the source of inflow to Water Conservation Area 3A. (Author 's abstract)
Application of Multivariate Statistical Analysis to Biomarkers in Se-Turkey Crude Oils
NASA Astrophysics Data System (ADS)
Gürgey, K.; Canbolat, S.
2017-11-01
Twenty-four crude oil samples were collected from the 24 oil fields distributed in different districts of SE-Turkey. API and Sulphur content (%), Stable Carbon Isotope, Gas Chromatography (GC), and Gas Chromatography-Mass Spectrometry (GC-MS) data were used to construct a geochemical data matrix. The aim of this study is to examine the genetic grouping or correlations in the crude oil samples, hence the number of source rocks present in the SE-Turkey. To achieve these aims, two of the multivariate statistical analysis techniques (Principle Component Analysis [PCA] and Cluster Analysis were applied to data matrix of 24 samples and 8 source specific biomarker variables/parameters. The results showed that there are 3 genetically different oil groups: Batman-Nusaybin Oils, Adıyaman-Kozluk Oils and Diyarbakir Oils, in addition to a one mixed group. These groupings imply that at least, three different source rocks are present in South-Eastern (SE) Turkey. Grouping of the crude oil samples appears to be consistent with the geographic locations of the oils fields, subsurface stratigraphy as well as geology of the area.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Merrill, D.W.; Selvin, S.; Close, E.R.
In studying geographic disease distributions, one normally compares rates of arbitrarily defined geographic subareas (e.g. census tracts), thereby sacrificing the geographic detail of the original data. The sparser the data, the larger the subareas must be in order to calculate stable rates. This dilemma is avoided with the technique of Density Equalizing Map Projections (DEMP). Boundaries of geographic subregions are adjusted to equalize population density over the entire study area. Case locations plotted on the transformed map should have a uniform distribution if the underlying disease-rates are constant. On the transformed map, the statistical analysis of the observed distribution ismore » greatly simplified. Even for sparse distributions, the statistical significance of a supposed disease cluster can be reliably calculated. The present report describes the first successful application of the DEMP technique to a sizeable ``real-world`` data set of epidemiologic interest. An improved DEMP algorithm [GUSE93, CLOS94] was applied to a data set previously analyzed with conventional techniques [SATA90, REYN91]. The results from the DEMP analysis and a conventional analysis are compared.« less
Modeling the Dynamic Interrelations between Mobility, Utility, and Land Asking Price
NASA Astrophysics Data System (ADS)
Hidayat, E.; Rudiarto, I.; Siegert, F.; Vries, W. D.
2018-02-01
Limited and insufficient information about the dynamic interrelation among mobility, utility, and land price is the main reason to conduct this research. Several studies, with several approaches, and several variables have been conducted so far in order to model the land price. However, most of these models appear to generate primarily static land prices. Thus, a research is required to compare, design, and validate different models which calculate and/or compare the inter-relational changes of mobility, utility, and land price. The applied method is a combination of analysis of literature review, expert interview, and statistical analysis. The result is newly improved mathematical model which have been validated and is suitable for the case study location. This improved model consists of 12 appropriate variables. This model can be implemented in the Salatiga city as the case study location in order to arrange better land use planning to mitigate the uncontrolled urban growth.
ISRNA: an integrative online toolkit for short reads from high-throughput sequencing data.
Luo, Guan-Zheng; Yang, Wei; Ma, Ying-Ke; Wang, Xiu-Jie
2014-02-01
Integrative Short Reads NAvigator (ISRNA) is an online toolkit for analyzing high-throughput small RNA sequencing data. Besides the high-speed genome mapping function, ISRNA provides statistics for genomic location, length distribution and nucleotide composition bias analysis of sequence reads. Number of reads mapped to known microRNAs and other classes of short non-coding RNAs, coverage of short reads on genes, expression abundance of sequence reads as well as some other analysis functions are also supported. The versatile search functions enable users to select sequence reads according to their sub-sequences, expression abundance, genomic location, relationship to genes, etc. A specialized genome browser is integrated to visualize the genomic distribution of short reads. ISRNA also supports management and comparison among multiple datasets. ISRNA is implemented in Java/C++/Perl/MySQL and can be freely accessed at http://omicslab.genetics.ac.cn/ISRNA/.
NASA Astrophysics Data System (ADS)
Jensen, Matilde Bisballe; Utriainen, Tuuli Maria; Steinert, Martin
2018-01-01
This paper presents the experienced difficulties of students participating in the multidisciplinary, remote collaborating engineering design course challenge-based innovation at CERN. This is with the aim to identify learning barriers and improve future learning experiences. We statistically analyse the rated differences between distinct design activities, educational background and remote vs. co-located collaboration. The analysis is based on a quantitative and qualitative questionnaire (N = 37). Our analysis found significant ranking differences between remote and co-located activities. This questions whether the remote factor might be a barrier for the originally intended learning goals. Further a correlation between analytical and converging design phases was identified. Hence, future facilitators are suggested to help students in the transition from one design phase to the next rather than only teaching methods in the individual design phases. Finally, we discuss how educators address the identified learning barriers when designing future courses including multidisciplinary or remote collaboration.
Spatial decision support system to evaluate crop residue energy potential by anaerobic digestion.
Escalante, Humberto; Castro, Liliana; Gauthier-Maradei, Paola; Rodríguez De La Vega, Reynel
2016-11-01
Implementing anaerobic digestion (AD) in energy production from crop residues requires development of decision tools to assess its feasibility and sustainability. A spatial decision support system (SDSS) was constructed to assist decision makers to select appropriate feedstock according to biomethanation potential, identify the most suitable location for biogas facilities, determine optimum plant capacity and supply chain, and evaluate associated risks and costs. SDSS involves a spatially explicit analysis, fuzzy multi-criteria analysis, and statistical and optimization models. The tool was validated on seven crop residues located in Santander, Colombia. For example, fique bagasse generates about 0.21millionm(3)CH4year(-1) (0.329m(3)CH4kg(-1) volatile solids) with a minimum profitable plant of about 2000tonyear(-1) and an internal rate of return of 10.5%. SDSS can be applied to evaluate other biomass resources, availability periods, and co-digestion potential. Copyright © 2016. Published by Elsevier Ltd.
Statistical lamb wave localization based on extreme value theory
NASA Astrophysics Data System (ADS)
Harley, Joel B.
2018-04-01
Guided wave localization methods based on delay-and-sum imaging, matched field processing, and other techniques have been designed and researched to create images that locate and describe structural damage. The maximum value of these images typically represent an estimated damage location. Yet, it is often unclear if this maximum value, or any other value in the image, is a statistically significant indicator of damage. Furthermore, there are currently few, if any, approaches to assess the statistical significance of guided wave localization images. As a result, we present statistical delay-and-sum and statistical matched field processing localization methods to create statistically significant images of damage. Our framework uses constant rate of false alarm statistics and extreme value theory to detect damage with little prior information. We demonstrate our methods with in situ guided wave data from an aluminum plate to detect two 0.75 cm diameter holes. Our results show an expected improvement in statistical significance as the number of sensors increase. With seventeen sensors, both methods successfully detect damage with statistical significance.
Calibrating the Difficulty of an Assessment Tool: The Blooming of a Statistics Examination
ERIC Educational Resources Information Center
Dunham, Bruce; Yapa, Gaitri; Yu, Eugenia
2015-01-01
Bloom's taxonomy is proposed as a tool by which to assess the level of complexity of assessment tasks in statistics. Guidelines are provided for how to locate tasks at each level of the taxonomy, along with descriptions and examples of suggested test questions. Through the "Blooming" of an examination--that is, locating its constituent…
A consistent framework for Horton regression statistics that leads to a modified Hack's law
Furey, P.R.; Troutman, B.M.
2008-01-01
A statistical framework is introduced that resolves important problems with the interpretation and use of traditional Horton regression statistics. The framework is based on a univariate regression model that leads to an alternative expression for Horton ratio, connects Horton regression statistics to distributional simple scaling, and improves the accuracy in estimating Horton plot parameters. The model is used to examine data for drainage area A and mainstream length L from two groups of basins located in different physiographic settings. Results show that confidence intervals for the Horton plot regression statistics are quite wide. Nonetheless, an analysis of covariance shows that regression intercepts, but not regression slopes, can be used to distinguish between basin groups. The univariate model is generalized to include n > 1 dependent variables. For the case where the dependent variables represent ln A and ln L, the generalized model performs somewhat better at distinguishing between basin groups than two separate univariate models. The generalized model leads to a modification of Hack's law where L depends on both A and Strahler order ??. Data show that ?? plays a statistically significant role in the modified Hack's law expression. ?? 2008 Elsevier B.V.
Statistical Analysis of Zebrafish Locomotor Response.
Liu, Yiwen; Carmer, Robert; Zhang, Gaonan; Venkatraman, Prahatha; Brown, Skye Ashton; Pang, Chi-Pui; Zhang, Mingzhi; Ma, Ping; Leung, Yuk Fai
2015-01-01
Zebrafish larvae display rich locomotor behaviour upon external stimulation. The movement can be simultaneously tracked from many larvae arranged in multi-well plates. The resulting time-series locomotor data have been used to reveal new insights into neurobiology and pharmacology. However, the data are of large scale, and the corresponding locomotor behavior is affected by multiple factors. These issues pose a statistical challenge for comparing larval activities. To address this gap, this study has analyzed a visually-driven locomotor behaviour named the visual motor response (VMR) by the Hotelling's T-squared test. This test is congruent with comparing locomotor profiles from a time period. Different wild-type (WT) strains were compared using the test, which shows that they responded differently to light change at different developmental stages. The performance of this test was evaluated by a power analysis, which shows that the test was sensitive for detecting differences between experimental groups with sample numbers that were commonly used in various studies. In addition, this study investigated the effects of various factors that might affect the VMR by multivariate analysis of variance (MANOVA). The results indicate that the larval activity was generally affected by stage, light stimulus, their interaction, and location in the plate. Nonetheless, different factors affected larval activity differently over time, as indicated by a dynamical analysis of the activity at each second. Intriguingly, this analysis also shows that biological and technical repeats had negligible effect on larval activity. This finding is consistent with that from the Hotelling's T-squared test, and suggests that experimental repeats can be combined to enhance statistical power. Together, these investigations have established a statistical framework for analyzing VMR data, a framework that should be generally applicable to other locomotor data with similar structure.
Statistical Analysis of Zebrafish Locomotor Response
Zhang, Gaonan; Venkatraman, Prahatha; Brown, Skye Ashton; Pang, Chi-Pui; Zhang, Mingzhi; Ma, Ping; Leung, Yuk Fai
2015-01-01
Zebrafish larvae display rich locomotor behaviour upon external stimulation. The movement can be simultaneously tracked from many larvae arranged in multi-well plates. The resulting time-series locomotor data have been used to reveal new insights into neurobiology and pharmacology. However, the data are of large scale, and the corresponding locomotor behavior is affected by multiple factors. These issues pose a statistical challenge for comparing larval activities. To address this gap, this study has analyzed a visually-driven locomotor behaviour named the visual motor response (VMR) by the Hotelling’s T-squared test. This test is congruent with comparing locomotor profiles from a time period. Different wild-type (WT) strains were compared using the test, which shows that they responded differently to light change at different developmental stages. The performance of this test was evaluated by a power analysis, which shows that the test was sensitive for detecting differences between experimental groups with sample numbers that were commonly used in various studies. In addition, this study investigated the effects of various factors that might affect the VMR by multivariate analysis of variance (MANOVA). The results indicate that the larval activity was generally affected by stage, light stimulus, their interaction, and location in the plate. Nonetheless, different factors affected larval activity differently over time, as indicated by a dynamical analysis of the activity at each second. Intriguingly, this analysis also shows that biological and technical repeats had negligible effect on larval activity. This finding is consistent with that from the Hotelling’s T-squared test, and suggests that experimental repeats can be combined to enhance statistical power. Together, these investigations have established a statistical framework for analyzing VMR data, a framework that should be generally applicable to other locomotor data with similar structure. PMID:26437184
Comparison of climate related changes in two Arctic fjords, Hornsund and Porsanger
NASA Astrophysics Data System (ADS)
Aniskiewicz, Paulina; Stramska, Małgorzata
2017-04-01
In the Arctic zone the climate change is amplified in comparison to globally averaged trends, and the observed trends are variable spatially. Our research is focused on two Artic fjords: Porsanger and Horsund. Porsanger fjord is located in the coastal waters of the Barents Sea. Hornsund is one of fjords located in the western part of Svalbard archipelago. In this presentation we have used data provided by the Norwegian Meteorological Institute for three meteorological stations. Two of them are located in the Porsanger fjord (Lakselv - in the inner part, Honningsvåg - in the outer zone). The third station provides data from the Hornsund fjord. Using these data we have estimated the 33-year trends (1983-2015) of air temperature and relative humidity in each station using linear regression analysis (statistically significant at 95In the inner part of the Porsanger fjord (Lakselv) the multiyear trend of increasing annual mean air temperature has been estimated at 0.006°C per year. The monthly trends were statistically significant in May, September and November. The strongest seasonal warming has been observed in spring and autumn. The trends of increasing annual mean humidity was about 0.2In Hornsund the air temperature trend (0.2°C per year) is significantly larger than in Porsanger. The trends of air temperature were statistically significant for eight months (except March, April, June and July) and three seasons (besides spring). The trends of relative humidity were not statistically significant. Thanks to this research we can discuss how atmospheric conditions and climate related trends change in time and seasons of the year in two different Arctic regions. The project has been financed from the funds of the Leading National Research Centre (KNOW) received by the Centre for Polar Studies for the period 2014-2018. This work was also funded by the Norway Grants (NCBR contract No. 201985, project NORDFLUX). Partial support comes from the Institute of Oceanology (IO PAN).
Geostatistics and GIS: tools for characterizing environmental contamination.
Henshaw, Shannon L; Curriero, Frank C; Shields, Timothy M; Glass, Gregory E; Strickland, Paul T; Breysse, Patrick N
2004-08-01
Geostatistics is a set of statistical techniques used in the analysis of georeferenced data that can be applied to environmental contamination and remediation studies. In this study, the 1,1-dichloro-2,2-bis(p-chlorophenyl)ethylene (DDE) contamination at a Superfund site in western Maryland is evaluated. Concern about the site and its future clean up has triggered interest within the community because residential development surrounds the area. Spatial statistical methods, of which geostatistics is a subset, are becoming increasingly popular, in part due to the availability of geographic information system (GIS) software in a variety of application packages. In this article, the joint use of ArcGIS software and the R statistical computing environment are demonstrated as an approach for comprehensive geostatistical analyses. The spatial regression method, kriging, is used to provide predictions of DDE levels at unsampled locations both within the site and the surrounding areas where residential development is ongoing.
Liu, Yuewei; Chen, Weihong
2012-02-01
As a nonparametric method, the Kruskal-Wallis test is widely used to compare three or more independent groups when an ordinal or interval level of data is available, especially when the assumptions of analysis of variance (ANOVA) are not met. If the Kruskal-Wallis statistic is statistically significant, Nemenyi test is an alternative method for further pairwise multiple comparisons to locate the source of significance. Unfortunately, most popular statistical packages do not integrate the Nemenyi test, which is not easy to be calculated by hand. We described the theory and applications of the Kruskal-Wallis and Nemenyi tests, and presented a flexible SAS macro to implement the two tests. The SAS macro was demonstrated by two examples from our cohort study in occupational epidemiology. It provides a useful tool for SAS users to test the differences among three or more independent groups using a nonparametric method.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whicker, Jeffrey Jay; Gillis, Jessica Mcdonnel; Ruedig, Elizabeth
This report summarizes the sampling design used, associated statistical assumptions, as well as general guidelines for conducting post-sampling data analysis. Sampling plan components presented here include how many sampling locations to choose and where within the sampling area to collect those samples. The type of medium to sample (i.e., soil, groundwater, etc.) and how to analyze the samples (in-situ, fixed laboratory, etc.) are addressed in other sections of the sampling plan.
NASA Technical Reports Server (NTRS)
Shumka, A.; Sollock, S. G.
1981-01-01
This paper represents the first comprehensive survey of the Mount Laguna Photovoltaic Installation. The novel techniques used for performing the field tests have been effective in locating and characterizing defective modules. A comparative analysis on the two types of modules used in the array indicates that they have significantly different failure rates, different distributions in degradational space and very different failure modes. A life cycle model is presented to explain a multimodal distribution observed for one module type. A statistical model is constructed and it is shown to be in good agreement with the field data.
Harrigan, George G; Harrison, Jay M
2012-01-01
New transgenic (GM) crops are subjected to extensive safety assessments that include compositional comparisons with conventional counterparts as a cornerstone of the process. The influence of germplasm, location, environment, and agronomic treatments on compositional variability is, however, often obscured in these pair-wise comparisons. Furthermore, classical statistical significance testing can often provide an incomplete and over-simplified summary of highly responsive variables such as crop composition. In order to more clearly describe the influence of the numerous sources of compositional variation we present an introduction to two alternative but complementary approaches to data analysis and interpretation. These include i) exploratory data analysis (EDA) with its emphasis on visualization and graphics-based approaches and ii) Bayesian statistical methodology that provides easily interpretable and meaningful evaluations of data in terms of probability distributions. The EDA case-studies include analyses of herbicide-tolerant GM soybean and insect-protected GM maize and soybean. Bayesian approaches are presented in an analysis of herbicide-tolerant GM soybean. Advantages of these approaches over classical frequentist significance testing include the more direct interpretation of results in terms of probabilities pertaining to quantities of interest and no confusion over the application of corrections for multiple comparisons. It is concluded that a standardized framework for these methodologies could provide specific advantages through enhanced clarity of presentation and interpretation in comparative assessments of crop composition.
A Probabilistic Atlas of Diffuse WHO Grade II Glioma Locations in the Brain
Baumann, Cédric; Zouaoui, Sonia; Yordanova, Yordanka; Blonski, Marie; Rigau, Valérie; Chemouny, Stéphane; Taillandier, Luc; Bauchet, Luc; Duffau, Hugues; Paragios, Nikos
2016-01-01
Diffuse WHO grade II gliomas are diffusively infiltrative brain tumors characterized by an unavoidable anaplastic transformation. Their management is strongly dependent on their location in the brain due to interactions with functional regions and potential differences in molecular biology. In this paper, we present the construction of a probabilistic atlas mapping the preferential locations of diffuse WHO grade II gliomas in the brain. This is carried out through a sparse graph whose nodes correspond to clusters of tumors clustered together based on their spatial proximity. The interest of such an atlas is illustrated via two applications. The first one correlates tumor location with the patient’s age via a statistical analysis, highlighting the interest of the atlas for studying the origins and behavior of the tumors. The second exploits the fact that the tumors have preferential locations for automatic segmentation. Through a coupled decomposed Markov Random Field model, the atlas guides the segmentation process, and characterizes which preferential location the tumor belongs to and consequently which behavior it could be associated to. Leave-one-out cross validation experiments on a large database highlight the robustness of the graph, and yield promising segmentation results. PMID:26751577
Statistical Characteristics of Wrong-Way Driving Crashes on Illinois Freeways.
Zhou, Huaguo; Zhao, Jiguang; Pour-Rouholamin, Mahdi; Tobias, Priscilla A
2015-01-01
Driving the wrong way on freeways, namely wrong-way driving (WWD), has been found to be a major concern for more than 6 decades. The purpose of this study was to identify characteristics of this type of crash as well as to rank the locations/interchanges according to their vulnerability to WWD entries. The WWD crash data on Illinois freeways were statistically analyzed for a 6-year time period (2004 to 2009) from 3 aspects: crash, vehicle, and person. The temporal distributions, geographical distributions, roadway characteristics, and crash locations were analyzed for WWD crashes. The driver demographic information, physical condition, and injury severity were analyzed for wrong-way drivers. The vehicle characteristics, vehicle operation, and collision results were analyzed for WWD vehicles. A method was brought about to identify wrong-way entry points that was then used to develop a relative-importance technique and rank different interchange types in terms of potential WWD incidents. The findings revealed that a large proportion of WWD crashes occurred during the weekend from midnight to 5 a.m. Approximately 80% of WWD crashes were located in urban areas and nearly 70% of wrong-way vehicles were passenger cars. Approximately 58% of wrong-way drivers were driving under the influence (DUI). Of those, nearly 50% were confirmed to be impaired by alcohol, about 4% were impaired by drugs, and more than 3% had been drinking. The analysis of interchange ranking found that compressed diamond interchanges, single point diamond interchanges (SPDIs), partial cloverleaf interchanges, and freeway feeders had the highest wrong-way crash rates (wrong-way crashes per 100 interchanges per year). The findings of this study call for more attention to WWD crashes from different aspects such as driver age group, time of day, day of week, and DUI drivers. Based on the analysis results of WWD distance, the study explained why a 5-mile radius of WWD crash location should be studied for WWD fatal crashes with unknown entry points.
A wavelet-based statistical analysis of FMRI data: I. motivation and data distribution modeling.
Dinov, Ivo D; Boscardin, John W; Mega, Michael S; Sowell, Elizabeth L; Toga, Arthur W
2005-01-01
We propose a new method for statistical analysis of functional magnetic resonance imaging (fMRI) data. The discrete wavelet transformation is employed as a tool for efficient and robust signal representation. We use structural magnetic resonance imaging (MRI) and fMRI to empirically estimate the distribution of the wavelet coefficients of the data both across individuals and spatial locations. An anatomical subvolume probabilistic atlas is used to tessellate the structural and functional signals into smaller regions each of which is processed separately. A frequency-adaptive wavelet shrinkage scheme is employed to obtain essentially optimal estimations of the signals in the wavelet space. The empirical distributions of the signals on all the regions are computed in a compressed wavelet space. These are modeled by heavy-tail distributions because their histograms exhibit slower tail decay than the Gaussian. We discovered that the Cauchy, Bessel K Forms, and Pareto distributions provide the most accurate asymptotic models for the distribution of the wavelet coefficients of the data. Finally, we propose a new model for statistical analysis of functional MRI data using this atlas-based wavelet space representation. In the second part of our investigation, we will apply this technique to analyze a large fMRI dataset involving repeated presentation of sensory-motor response stimuli in young, elderly, and demented subjects.
The Role of Discrete Global Grid Systems in the Global Statistical Geospatial Framework
NASA Astrophysics Data System (ADS)
Purss, M. B. J.; Peterson, P.; Minchin, S. A.; Bermudez, L. E.
2016-12-01
The United Nations Committee of Experts on Global Geospatial Information Management (UN-GGIM) has proposed the development of a Global Statistical Geospatial Framework (GSGF) as a mechanism for the establishment of common analytical systems that enable the integration of statistical and geospatial information. Conventional coordinate reference systems address the globe with a continuous field of points suitable for repeatable navigation and analytical geometry. While this continuous field is represented on a computer in a digitized and discrete fashion by tuples of fixed-precision floating point values, it is a non-trivial exercise to relate point observations spatially referenced in this way to areal coverages on the surface of the Earth. The GSGF states the need to move to gridded data delivery and the importance of using common geographies and geocoding. The challenges associated with meeting these goals are not new and there has been a significant effort within the geospatial community to develop nested gridding standards to tackle these issues over many years. These efforts have recently culminated in the development of a Discrete Global Grid Systems (DGGS) standard which has been developed under the auspices of Open Geospatial Consortium (OGC). DGGS provide a fixed areal based geospatial reference frame for the persistent location of measured Earth observations, feature interpretations, and modelled predictions. DGGS address the entire planet by partitioning it into a discrete hierarchical tessellation of progressively finer resolution cells, which are referenced by a unique index that facilitates rapid computation, query and analysis. The geometry and location of the cell is the principle aspect of a DGGS. Data integration, decomposition, and aggregation is optimised in the DGGS hierarchical structure and can be exploited for efficient multi-source data processing, storage, discovery, transmission, visualization, computation, analysis, and modelling. During the 6th Session of the UN-GGIM in August 2016 the role of DGGS in the context of the GSGF was formally acknowledged. This paper proposes to highlight the synergies and role of DGGS in the Global Statistical Geospatial Framework and to show examples of the use of DGGS to combine geospatial statistics with traditional geoscientific data.
A study of various methods for calculating locations of lightning events
NASA Technical Reports Server (NTRS)
Cannon, John R.
1995-01-01
This article reports on the results of numerical experiments on finding the location of lightning events using different numerical methods. The methods include linear least squares, nonlinear least squares, statistical estimations, cluster analysis and angular filters and combinations of such techniques. The experiments involved investigations of methods for excluding fake solutions which are solutions that appear to be reasonable but are in fact several kilometers distant from the actual location. Some of the conclusions derived from the study are that bad data produces fakes, that no fool-proof method of excluding fakes was found, that a short base-line interferometer under development at Kennedy Space Center to measure the direction cosines of an event shows promise as a filter for excluding fakes. The experiments generated a number of open questions, some of which are discussed at the end of the report.
Policy compliance of smokers on a tobacco-free university campus.
Russette, Helen C; Harris, Kari Jo; Schuldberg, David; Green, Linda
2014-01-01
To explore factors influencing compliance with campus tobacco policies and strategies to increase compliance. Sixty tobacco smokers (April 2012). A 22-item intercept-interview with closed- and open-ended questions was conducted with smokers in adjacent compliant and noncompliant areas at 1 university with a 100% tobacco ban. Data were analyzed using descriptive statistics and content analysis. Most reported that the smoking policy was not enforced. Noncompliant smokers had less knowledge of locations where tobacco use was permitted and were more likely to identify their smoking location as compliant and had knowingly violated the policy. Choice of location to smoke was related to convenience and a desire to follow the policy. Smokers recommended consequences for noncompliance and structures that accommodated smoking to increase adherence to the tobacco ban. Additional education, environmental, and contingency strategies are needed to increase compliance with the policy banning tobacco use on this campus.
Koprivica, Mladen; Petrić, Majda; Nešković, Nataša; Nešković, Aleksandar
2016-01-01
To determine the level of radiofrequency radiation generated by base stations of Global System for Mobile Communications and Universal Mobile Telecommunication System, extensive electromagnetic field strength measurements were carried out in the vicinity of 664 base station locations. These were classified into three categories: indoor, masts, and locations with installations on buildings. Although microcell base stations with antennas installed indoors typically emit less power than outdoor macrocell base stations, the fact that people can be found close to antennas requires exposure originating from these base stations to be carefully considered. Measurement results showed that maximum recorded value of electric field strength exceeded International Commission on Non-Ionizing Radiation Protection reference levels at 7% of indoor base station locations. At the same time, this percentage was much lower in the case of masts and installations on buildings (0% and 2.5%, respectively). © 2015 Wiley Periodicals, Inc.
Large-scale Atmospheric Transport Processes
NASA Technical Reports Server (NTRS)
Plumb, R. Alan
2004-01-01
Continuing earlier work, we continued an investigation of the seasonal behavior of the edges of the stratospheric surf zone. These edges form a barrier between the rapidly mixed surf zone and the relatively isolated tropics. In collaboration with Dr Lynn Sparling at GSFC, we used a statistical analysis of HALOE and CLAES trace gas data from UARS to identify and locate these edges during each UARS observing period. We found that the edges on both sides of the equator are present all year (a fact that is important for conceptual models of stratospheric transport), though that on the summer side of the equator is much less sharp than the winter edge. The edges migrate seasonally into the summer hemisphere. Their location also shows influence of the QBO, together with the SAO at higher altitudes. Comparisons with effective diffusivities, and the edge locations, suggest that the edge is sustained by surf zone entrainment during winter, but by the residual circulation during summer.
Jamjoom, Faris Z; Kim, Do-Gyoon; Lee, Damian J; McGlumphy, Edwin A; Yilmaz, Burak
2018-02-05
Effects of length and location of the edentulous area on the accuracy of prosthetic treatment plan incorporation into cone-beam computed tomography (CBCT) scans has not been investigated. To evaluate the effect of length and location of the edentulous area on the accuracy of prosthetic treatment plan incorporation into CBCT scans using different methods. Direct digital scans of a completely dentate master model with removable radiopaque teeth were made using an intraoral scanner, and digital scans of stone duplicates of the master model were made using a laboratory scanner. Specific teeth were removed to simulate different clinical situations and their CBCT scans were made. Surface scans were registered onto the CBCT scans. Radiographic templates for each clinical situation were also fabricated and used during CBCT scans of the master models. Using metrology software, three-dimensional (3D) deviation was measured on standard tesselation language (STL) files created from the CBCT scans against an STL file of the master model created from a CBCT scan. Statistical analysis was done using the MIXED procedure in a statistical software and Tukey HSD test (α =.05). The interaction between location and method was significant (P = .009). Location had no significant effect on registration methods (P > .05), but on the radiographic templates (P = .011). Length of the edentulous area did not have any significant effect (P > .05). Accuracy of digital image registration methods was similar and higher than that of radiographic templates in all clinical situations. Tooth-bound radiographic templates were significantly more accurate than the free-end templates. The results of this study suggest using image registration instead of radiographic templates when planning dental implants, particularly in free-end situations. © 2018 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Firouznia, Kavous, E-mail: k_firouznia@yahoo.com; Ghanaati, Hossein; Sanaati, Mina
The purpose of this study was to evaluate whether the size, location, or number of fibroids affects therapeutic efficacy or complications of uterine artery embolization (UAE). Patients with symptomatic uterine fibroids (n = 101) were treated by selective bilateral UAE using 500- to 710-{mu}m polyvinyl alcohol (PVA) particles. Baseline measures of clinical symptoms, sonography, and MRI taken before the procedure were compared to those taken 1, 3, 6, and 12 months later. Complications and outcomes were analyzed for associations with fibroid size, location, and number. Reductions in mean fibroid volume were similar in patients with single (66.6 {+-} 21.5%) andmore » multiple (67.4 {+-} 25.0%) fibroids (p-value = 0.83). Menstrual improvement occurred in patients with single (93.3%) and multiple (72.2%) fibroids (p = 0.18). Changes in submucosal and other fibroids were not significantly different between the two groups (p's > 0.56). Linear regression analysis between primary fibroid volume as independent variable and percentage reduction of fibroid volume after 1 year yielded an R{sup 2} of 0.083 and the model coefficient was not statistically significant (p = 0.072). Multivariate regression models revealed no statistically or clinically significant coefficients or odds ratios for three independent variables (primary fibroid size, total number, and fibroid location) and all outcome variables (percent reduction of uterus and fibroid volumes in 1 year, improvement of clinical symptoms [menstrual, bulk related, and urinary] in 1 year, and complications after UAE). In conclusion, neither the success rate nor the probability of complications was affected by the primary fibroid size, location, or total number of fibroids.« less
NASA Astrophysics Data System (ADS)
Wang, Yuming; Chen, Caixia; Gui, Bin; Shen, Chenglong; Ye, Pinzhong; Wang, S.
2011-04-01
How to properly understand coronal mass ejections (CMEs) viewed in white light coronagraphs is crucial to many relative researches in solar and space physics. The issue is now particularly addressed in this paper through studying the source locations of all the 1078 Large Angle and Spectrometric Coronagraph (LASCO) CMEs listed in Coordinated Data Analysis Workshop (CDAW) CME catalog during 1997-1998 and their correlation with CMEs' apparent parameters. By manually checking LASCO and Extreme Ultraviolet Imaging Telescope (EIT) movies of these CMEs, we find that, except 231 CMEs whose source locations cannot be identified due to poor data, there are 288 CMEs with location identified on the frontside solar disk, 234 CMEs appearing above solar limb, and 325 CMEs without evident eruptive signatures in the field of view of EIT. On the basis of the statistical results of CMEs' source locations, there are four physical issues: (1) the missing rate of CMEs by SOHO LASCO and EIT, (2) the mass of CMEs, (3) the causes of halo CMEs, and (4) the deflections of CMEs in the corona, are exhaustively analyzed. It is found that (1) about 32% frontside CMEs cannot be recognized by SOHO, (2) the brightness of a CME at any heliocentric distance is roughly positively correlated with its speed, and the CME mass derived from the brightness is probably overestimated, (3) both projection effect and violent eruption are the major causes of halo CMEs, and especially for limb halo CMEs the latter is the primary one, and (4) most CMEs deflected toward equator near the solar minimum; these deflections can be classified into three types: the asymmetrical expansion, the nonradial ejection, and the deflected propagation.
NASA Astrophysics Data System (ADS)
Baram, S.; Ronen, Z.; Kurtzman, D.; Peeters, A.; Dahan, O.
2013-12-01
Land cultivation and dairy waste lagoons are considered to be nonpoint and point sources of groundwater contamination by chloride (Cl-) and nitrate (NO3-). The objective of this work is to introduce a methodology to assess the past and future impacts of such agricultural activities on regional groundwater quality. The method is based on mass balances and on spatial statistical analysis of Cl- and NO3-concentration distributions in the saturated and unsaturated zones. The method enables quantitative analysis of the relation between the locations of pollution point sources and the spatial variability in Cl- and NO3- concentrations in groundwater. The method was applied to the Beer-Tuvia region, Israel, where intensive dairy farming along with land cultivation has been practiced for over 50 years above the local phreatic aquifer. Mass balance calculations accounted for the various groundwater recharge and abstraction sources and sinks in the entire region. The mass balances showed that leachates from lagoons and the cultivated land have contributed 6.0 and 89.4 % of the total mass of Cl- added to the aquifer and 12.6 and 77.4 % of the total mass of NO3-. The chemical composition of the aquifer and vadose zone water suggested that irrigated agricultural activity in the region is the main contributor of Cl- and NO3- to the groundwater. A low spatial correlation between the Cl- and NO3- concentrations in the groundwater and the on-land location of the dairy farms strengthened this assumption, despite the dairy waste lagoon being a point source for groundwater contamination by Cl- and NO3-. Results demonstrate that analyzing vadose zone and groundwater data by spatial statistical analysis methods can significantly contribute to the understanding of the relations between groundwater contaminating sources, and to assessing appropriate remediation steps.
Statistical Distribution Analysis of Lineated Bands on Europa
NASA Astrophysics Data System (ADS)
Chen, T.; Phillips, C. B.; Pappalardo, R. T.
2016-12-01
Tina Chen, Cynthia B. Phillips, Robert T. Pappalardo Europa's surface is covered with intriguing linear and disrupted features, including lineated bands that range in scale and size. Previous studies have shown the possibility of an icy shell at the surface that may be concealing a liquid ocean with the potential to harboring life (Pappalardo et al., 1999). Utilizing the high-resolution imaging data from the Galileo spacecraft, we examined bands through a morphometric and morphologic approach. Greeley et al. (2000) and Procktor et al. (2002) have defined bands as wide, hummocky to lineated features that have distinctive surface texture and albedo compared to its surrounding terrain. We took morphometric measurements of lineated bands to find correlations in properties such as size, location, and orientation, and to shed light on formation models. We will present our measurements of over 100 bands on Europa that was mapped on the USGS Europa Global Mosaic Base Map (2002). We also conducted a statistical analysis to understand the distribution of lineated bands globally, and whether the widths of the bands differ by location. Our preliminary analysis from our statistical distribution evaluation, combined with the morphometric measurements, supports a uniform ice shell thickness for Europa rather than one that varies geographically. References: Greeley, Ronald, et al. "Geologic mapping of Europa." Journal of Geophysical Research: Planets 105.E9 (2000): 22559-22578.; Pappalardo, R. T., et al. "Does Europa have a subsurface ocean? Evaluation of the geological evidence." Journal of Geophysical Research: Planets 104.E10 (1999): 24015-24055.; Prockter, Louise M., et al. "Morphology of Europan bands at high resolution: A mid-ocean ridge-type rift mechanism." Journal of Geophysical Research: Planets 107.E5 (2002).; U.S. Geological Survey, 2002, Controlled photomosaic map of Europa, Je 15M CMN: U.S. Geological Survey Geologic Investigations Series I-2757, available at http://pubs.usgs.gov/imap/i2757/
Barbero, Marina M. D.; Oliveira, Henrique N.; de Camargo, Gregório M. F.; Fernandes Júnior, Gerardo A.; Aspilcueta-Borquis, Rusbel R.; Souza, Fabio R. P.; Boligon, Arione A.; Melo, Thaise P.; Regatieri, Inaê C.; Feitosa, Fabieli L. B.; Fonseca, Larissa F. S.; Magalhães, Ana F. B.; Costa, Raphael B.; Albuquerque, Lucia G.
2018-01-01
Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNP) located in the region of the candidate genes at a distance of up to 5 kb from the boundaries of each gene, were selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on r2 statistics. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A gene (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin genes (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs. PMID:29293544
Takada, Luciana; Barbero, Marina M D; Oliveira, Henrique N; de Camargo, Gregório M F; Fernandes Júnior, Gerardo A; Aspilcueta-Borquis, Rusbel R; Souza, Fabio R P; Boligon, Arione A; Melo, Thaise P; Regatieri, Inaê C; Feitosa, Fabieli L B; Fonseca, Larissa F S; Magalhães, Ana F B; Costa, Raphael B; Albuquerque, Lucia G
2018-01-01
Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNP) located in the region of the candidate genes at a distance of up to 5 kb from the boundaries of each gene, were selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on r2 statistics. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A gene (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin genes (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs.
Diaconescu, Andrei; Alexandrescu, Sorin; Ionel, Zenaida; Zlate, Cristian; Grigorie, Razvan; Brasoveanu, Vladislav; Hrehoret, Doina; Ciurea, Silviu; Botea, Florin; Tomescu, Dana; Droc, Gabriela; Croitoru, Adina; Herlea, Vlad; Boros, Mirela; Grasu, Mugur; Dumitru, Radu; Toma, Mihai; Ionescu, Mihnea; Vasilescu, Catalin; Popescu, Irinel
2017-01-01
Background: The benefit of hepatic resection in case of concomitant colorectal hepatic and extrahepatic metastases (CHEHMs) is still debatable. The purpose of this study is to assess the results of resection of hepatic and extrahepatic metastases in patients with CHEHMs in a high-volume center for both hepatobiliary and colorectal surgery and to identify prognostic factors that correlate with longer survival in these patients. It was performed a retrospective analysis of 678 consecutive patients with liver resection for colorectal cancer metastases operated in a single Centre between April 1996 and March 2016. Among these, 73 patients presented CHEHMs. Univariate analysis was performed to identify the risk factors for overall survival (OS) in these patients. Results: There were 20 CHMs located at the lymphatic node level, 20 at the peritoneal level, 12 at the ovary and lung level, 12 presenting as local relapses and 9 other sites. 53 curative resections (R0) were performed. The difference in overall survival between the CHEHMs group and the CHMs group is statistically significant for the entire groups (p 0.0001), as well as in patients who underwent R0 resection (p 0.0001). In CHEHMs group, the OS was statistically significant higher in patients who underwent R0 resection vs. those with R1/R2 resection (p=0.004). Three variables were identified as prognostic factors for poor OS following univariate analysis: 4 or more hepatic metastases, major hepatectomy and the performance of operation during first period of the study (1996 - 2004). There was a tendency toward better OS in patients with ovarian or pulmonary location of extrahepatic disease, although the difference was not statistically significant. In patients with concomitant hepatic and extrahepatic metastases, complete resection of metastatic burden significantly prolong survival. The patients with up to 4 liver metastases, resectable by minor hepatectomy benefit the most from this aggressive onco-surgical management. Celsius.
Is "Safety-in-numbers" theory applies to the pattern of pedestrian accidents in Seoul, South Korea.
NASA Astrophysics Data System (ADS)
Choi, Y.; Yoon, H.
2016-12-01
Every year, about 1.25 million people die of vehicle-related accidents, among which half are pedestrians with higher vulnerability: pedestrian, cyclists and motorcyclist (World Health Organization, 2016). This urges city governments in the world to strive for pedestrian safety and to apply diverse theories to transportation planning and design. The common belief is that the number of pedestrian accidents is directly and positively associated with the volume of pedestrian, however, another hypothesis, called "safety-in-numbers" effect, tells an opposite story in that accident rates declines with increase of the volume of pedestrian. In this study, we examine first, whether the safety-in-numbers theory applies to the pattern of pedestrian accidents in Seoul, and second, further investigate environmental factors that are associated with the pedestrian safety. On the first count, we use geospatial statistical analyses of the multi-year pedestrian accident data collected by Korea Road Traffic Authority (KoRoad) and the pedestrian volume data collected by SK Telecom (SKT). With Kernel Density Estimation and Bivariate Local Moran's I, we identify spatial clustering of pedestrian accidents in the city, and examine whether those locations match with concentrations of pedestrian volume. On the second count, we use statistical analysis, tobit, poisson and negative binomial regression to investigate relationships between pedestrian volume and number of pedestrian accident for the two types of geographic areas by the results of the aforementioned analysis; Area 1- locations of high volume of pedestrian with high number of accident, Area 2- locations of high volume of pedestrian with low number of accident. For environmental factors potentially explaining pedestrian accidents, we include land use composition, number of traffic lanes, crosswalk presence, pedestrian signal, traffic island and sidewalk width in our analysis. This research will be valuable in city governments' decision making with planning guidelines and political protocols for making safer pedestrian environment.
Analysis of video-recorded images to determine linear and angular dimensions in the growing horse.
Hunt, W F; Thomas, V G; Stiefel, W
1999-09-01
Studies of growth and conformation require statistical methods that are not applicable to subjective conformation standards used by breeders and trainers. A new system was developed to provide an objective approach for both science and industry, based on analysis of video images to measure aspects of conformation that were represented by angles or lengths. A studio crush was developed in which video images of horses of different sizes were taken after bone protuberances, located by palpation, were marked with white paper stickers. Screen pixel coordinates of calibration marks, bone markers and points on horse outlines were digitised from captured images and corrected for aspect ratio and 'fish-eye' lens effects. Calculations from the corrected coordinates produced linear dimensions and angular dimensions useful for comparison of horses for conformation and experimental purposes. The precision achieved by the method in determining linear and angular dimensions was examined through systematically determining variance for isolated steps of the procedure. Angles of the front limbs viewed from in front were determined with a standard deviation of 2-5 degrees and effects of viewing angle were detectable statistically. The height of the rump and wither were determined with precision closely related to the limitations encountered in locating a point on a screen, which was greater for markers applied to the skin than for points at the edge of the image. Parameters determined from markers applied to the skin were, however, more variable (because their relation to bone position was affected by movement), but still provided a means by which a number of aspects of size and conformation can be determined objectively for many horses during growth. Sufficient precision was achieved to detect statistically relatively small effects on calculated parameters of camera height position.
Modeling Predictors of Duties Not Including Flying Status.
Tvaryanas, Anthony P; Griffith, Converse
2018-01-01
The purpose of this study was to reuse available datasets to conduct an analysis of potential predictors of U.S. Air Force aircrew nonavailability in terms of being in "duties not to include flying" (DNIF) status. This study was a retrospective cohort analysis of U.S. Air Force aircrew on active duty during the period from 2003-2012. Predictor variables included age, Air Force Specialty Code (AFSC), clinic location, diagnosis, gender, pay grade, and service component. The response variable was DNIF duration. Nonparametric methods were used for the exploratory analysis and parametric methods were used for model building and statistical inference. Out of a set of 783 potential predictor variables, 339 variables were identified from the nonparametric exploratory analysis for inclusion in the parametric analysis. Of these, 54 variables had significant associations with DNIF duration in the final model fitted to the validation data set. The predicted results of this model for DNIF duration had a correlation of 0.45 with the actual number of DNIF days. Predictor variables included age, 6 AFSCs, 7 clinic locations, and 40 primary diagnosis categories. Specific demographic (i.e., age), occupational (i.e., AFSC), and health (i.e., clinic location and primary diagnosis category) DNIF drivers were identified. Subsequent research should focus on the application of primary, secondary, and tertiary prevention measures to ameliorate the potential impact of these DNIF drivers where possible.Tvaryanas AP, Griffith C Jr. Modeling predictors of duties not including flying status. Aerosp Med Hum Perform. 2018; 89(1):52-57.
Rodrigues-Pinto, E; Pereira, P; Coelho, R; Andrade, P; Ribeiro, A; Lopes, S; Moutinho-Ribeiro, P; Macedo, G
2017-02-01
Self-expanding metal stents (SEMS) are the treatment of choice for advanced esophageal cancers. Literature is scarce on risk factors predictors for adverse events after SEMS placement. Assess risk factors for adverse events after SEMS placement in advanced esophageal cancer and evaluate survival after SEMS placement. Cross-sectional study of patients with advanced esophageal cancer referred for SEMS placement, during a period of 3 years. Ninety-seven patients with advanced esophageal cancer placed SEMS. Adverse events were more common when tumors were located at the level of the distal esophagus/cardia (47% vs 23%, P = 0.011, OR 3.1), with statistical significance being kept in the multivariate analysis (OR 3.1, P = 0.018). Time until adverse events was lower in the tumors located at the level of the distal esophagus/cardia (P = 0.036). Survival was higher in patients who placed SEMS with curative intent (327 days [126-528] vs. 119 days [91-147], P = 0.002) and in patients submitted subsequently to surgery compared with those who did just chemo/radiotherapy or who did not do further treatment (563 days [378-748] vs. 154 days [133-175] vs. 46 days [20-72], P < 0.001). Subsequent treatment kept statistical significance in the multivariate analysis (HR 3.4, P < 0.001). SEMS allow palliation of dysphagia in advanced esophageal cancer and are associated with an increased out-of-hospital survival, as long as there are conditions for further treatments. Tumors located at the level of the distal esophagus/cardia are associated with a greater number of adverse events, which also occur earlier. © 2016 International Society for Diseases of the Esophagus.
Sensitivity of bud burst in key tree species in the UK to recent climate variability and change
NASA Astrophysics Data System (ADS)
Abernethy, Rachel; Cook, Sally; Hemming, Deborah; McCarthy, Mark
2017-04-01
Analysing the relationship between the changing climate of the UK and the spatial and temporal distribution of spring bud burst plays an important role in understanding ecosystem functionality and predicting future phenological trends. The location and timing of bud burst of eleven species of trees alongside climatic factors such as, temperature, precipitation and hours of sunshine (photoperiod) were used to investigate: i. which species' bud burst timing experiences the greatest impact from a changing climate, ii. which climatic factor has the greatest influence on the timing of bud burst, and iii. whether the location of bud burst is influenced by climate variability. Winter heatwave duration was also analysed as part of an investigation into the relationship between temperature trends of a specific winter period and the following spring events. Geographic Information Systems (GIS) and statistical analysis tools were used to visualise spatial patterns and to analyse the phenological and climate data through regression and analysis of variance (ANOVA) tests. Where there were areas that showed a strong positive or negative relationship between phenology and climate, satellite imagery was used to calculate a Normalised Difference Vegetation Index (NDVI) and a Leaf Area Index (LAI) to further investigate the relationships found. It was expected that in the north of the UK, where bud burst tends to occur later in the year than in the south, that the bud bursts would begin to occur earlier due to increasing temperatures and increased hours of sunshine. However, initial results show that for some species, the bud burst timing tends to remain or become later in the year. Interesting results will be found when investigating the statistical significance between the changing location of the bud bursts and each climatic factor.
Population connectivity of the plating coral Agaricia lamarcki from southwest Puerto Rico
NASA Astrophysics Data System (ADS)
Hammerman, Nicholas M.; Rivera-Vicens, Ramon E.; Galaska, Matthew P.; Weil, Ernesto; Appledoorn, Richard S.; Alfaro, Monica; Schizas, Nikolaos V.
2018-03-01
Identifying genetic connectivity and discrete population boundaries is an important objective for management of declining Caribbean reef-building corals. A double digest restriction-associated DNA sequencing protocol was utilized to generate 321 single nucleotide polymorphisms to estimate patterns of horizontal and vertical gene flow in the brooding Caribbean plate coral, Agaricia lamarcki. Individual colonies ( n = 59) were sampled from eight locations throughout southwestern Puerto Rico from six shallow ( 10-20 m) and two mesophotic habitats ( 30-40 m). Descriptive summary statistics (fixation index, F ST), analysis of molecular variance, and analysis through landscape and ecological associations and discriminant analysis of principal components estimated high population connectivity with subtle subpopulation structure among all sampling localities.
Archfield, Stacey A.; Pugliese, Alessio; Castellarin, Attilio; Skøien, Jon O.; Kiang, Julie E.
2013-01-01
In the United States, estimation of flood frequency quantiles at ungauged locations has been largely based on regional regression techniques that relate measurable catchment descriptors to flood quantiles. More recently, spatial interpolation techniques of point data have been shown to be effective for predicting streamflow statistics (i.e., flood flows and low-flow indices) in ungauged catchments. Literature reports successful applications of two techniques, canonical kriging, CK (or physiographical-space-based interpolation, PSBI), and topological kriging, TK (or top-kriging). CK performs the spatial interpolation of the streamflow statistic of interest in the two-dimensional space of catchment descriptors. TK predicts the streamflow statistic along river networks taking both the catchment area and nested nature of catchments into account. It is of interest to understand how these spatial interpolation methods compare with generalized least squares (GLS) regression, one of the most common approaches to estimate flood quantiles at ungauged locations. By means of a leave-one-out cross-validation procedure, the performance of CK and TK was compared to GLS regression equations developed for the prediction of 10, 50, 100 and 500 yr floods for 61 streamgauges in the southeast United States. TK substantially outperforms GLS and CK for the study area, particularly for large catchments. The performance of TK over GLS highlights an important distinction between the treatments of spatial correlation when using regression-based or spatial interpolation methods to estimate flood quantiles at ungauged locations. The analysis also shows that coupling TK with CK slightly improves the performance of TK; however, the improvement is marginal when compared to the improvement in performance over GLS.
Tiedeman, Claire; Ely, D. Matthew; Hill, Mary C.; O'Brien, Grady M.
2004-01-01
We develop a new observation‐prediction (OPR) statistic for evaluating the importance of system state observations to model predictions. The OPR statistic measures the change in prediction uncertainty produced when an observation is added to or removed from an existing monitoring network, and it can be used to guide refinement and enhancement of the network. Prediction uncertainty is approximated using a first‐order second‐moment method. We apply the OPR statistic to a model of the Death Valley regional groundwater flow system (DVRFS) to evaluate the importance of existing and potential hydraulic head observations to predicted advective transport paths in the saturated zone underlying Yucca Mountain and underground testing areas on the Nevada Test Site. Important existing observations tend to be far from the predicted paths, and many unimportant observations are in areas of high observation density. These results can be used to select locations at which increased observation accuracy would be beneficial and locations that could be removed from the network. Important potential observations are mostly in areas of high hydraulic gradient far from the paths. Results for both existing and potential observations are related to the flow system dynamics and coarse parameter zonation in the DVRFS model. If system properties in different locations are as similar as the zonation assumes, then the OPR results illustrate a data collection opportunity whereby observations in distant, high‐gradient areas can provide information about properties in flatter‐gradient areas near the paths. If this similarity is suspect, then the analysis produces a different type of data collection opportunity involving testing of model assumptions critical to the OPR results.
Germany Country Analysis Brief
2016-01-01
Germany was the largest energy consumer in Europe and the seventh-largest energy consumer in the world in 2015, according to BP Statistical Review of World Energy. It was also the fourth-largest economy in the world by nominal gross domestic product (GDP) after the United States, China, and Japan in 2015. Its size and location give it considerable influence over the European Union’s energy sector. However, Germany must rely on imports to meet the majority of its energy demand.
NASA Astrophysics Data System (ADS)
Gallin, Louis-Jonardan; Farges, Thomas; Marchiano, Régis; Coulouvrat, François; Defer, Eric; Rison, William; Schulz, Wolfgang; Nuret, Mathieu
2016-04-01
In the framework of the European Hydrological Cycle in the Mediterranean Experiment project, a field campaign devoted to the study of electrical activity during storms took place in the south of France in 2012. An acoustic station composed of four microphones and four microbarometers was deployed within the coverage of a Lightning Mapping Array network. On the 26 October 2012, a thunderstorm passed just over the acoustic station. Fifty-six natural thunder events, due to cloud-to-ground and intracloud flashes, were recorded. This paper studies the acoustic reconstruction, in the low frequency range from 1 to 40 Hz, of the recorded flashes and their comparison with detections from electromagnetic networks. Concurrent detections from the European Cooperation for Lightning Detection lightning location system were also used. Some case studies show clearly that acoustic signal from thunder comes from the return stroke but also from the horizontal discharges which occur inside the clouds. The huge amount of observation data leads to a statistical analysis of lightning discharges acoustically recorded. Especially, the distributions of altitudes of reconstructed acoustic detections are explored in detail. The impact of the distance to the source on these distributions is established. The capacity of the acoustic method to describe precisely the lower part of nearby cloud-to-ground discharges, where the Lightning Mapping Array network is not effective, is also highlighted.
Multispectral determination of soil moisture-2. [Guymon, Oklahoma and Dalhart, Texas
NASA Technical Reports Server (NTRS)
Estes, J. E.; Simonett, D. S. (Principal Investigator); Hajic, E. J.; Hilton, B. M.; Lees, R. D.
1982-01-01
Soil moisture data obtained using scatterometers, modular multispectral scanners and passive microwave radiometers were revised and grouped into four field cover types for statistical anaysis. Guymon data are grouped as alfalfa, bare, milo with rows perpendicular to the field view, and milo viewed parallel to the field of view. Dalhart data are grouped as bare combo, stubble, disked stubble, and corn field. Summary graphs combine selected analyses to compare the effects of field cover. The analysis for each of the cover types is presented in tables and graphs. Other tables show elementary statistics, correlation matrices, and single variable regressions. Selected eigenvectors and factor analyses are included and the highest correlating sensor typs for each location are summarized.
Computational pathology: Exploring the spatial dimension of tumor ecology.
Nawaz, Sidra; Yuan, Yinyin
2016-09-28
Tumors are evolving ecosystems where cancer subclones and the microenvironment interact. This is analogous to interaction dynamics between species in their natural habitats, which is a prime area of study in ecology. Spatial statistics are frequently used in ecological studies to infer complex relations including predator-prey, resource dependency and co-evolution. Recently, the emerging field of computational pathology has enabled high-throughput spatial analysis by using image processing to identify different cell types and their locations within histological tumor samples. We discuss how these data may be analyzed with spatial statistics used in ecology to reveal patterns and advance our understanding of ecological interactions occurring among cancer cells and their microenvironment. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
24 CFR 1710.13 - Metropolitan Statistical Area (MSA) exemption.
Code of Federal Regulations, 2011 CFR
2011-04-01
... 24 Housing and Urban Development 5 2011-04-01 2011-04-01 false Metropolitan Statistical Area (MSA... Requirements § 1710.13 Metropolitan Statistical Area (MSA) exemption. (a) Eligibility requirements. The sale of... since April 28, 1969. (2) The lot is located within a Metropolitan Statistical Area (MSA) as defined by...
24 CFR 1710.13 - Metropolitan Statistical Area (MSA) exemption.
Code of Federal Regulations, 2010 CFR
2010-04-01
... 24 Housing and Urban Development 5 2010-04-01 2010-04-01 false Metropolitan Statistical Area (MSA... Requirements § 1710.13 Metropolitan Statistical Area (MSA) exemption. (a) Eligibility requirements. The sale of... since April 28, 1969. (2) The lot is located within a Metropolitan Statistical Area (MSA) as defined by...
Universal Recurrence Time Statistics of Characteristic Earthquakes
NASA Astrophysics Data System (ADS)
Goltz, C.; Turcotte, D. L.; Abaimov, S.; Nadeau, R. M.
2006-12-01
Characteristic earthquakes are defined to occur quasi-periodically on major faults. Do recurrence time statistics of such earthquakes follow a particular statistical distribution? If so, which one? The answer is fundamental and has important implications for hazard assessment. The problem cannot be solved by comparing the goodness of statistical fits as the available sequences are too short. The Parkfield sequence of M ≍ 6 earthquakes, one of the most extensive reliable data sets available, has grown to merely seven events with the last earthquake in 2004, for example. Recently, however, advances in seismological monitoring and improved processing methods have unveiled so-called micro-repeaters, micro-earthquakes which recur exactly in the same location on a fault. It seems plausible to regard these earthquakes as a miniature version of the classic characteristic earthquakes. Micro-repeaters are much more frequent than major earthquakes, leading to longer sequences for analysis. Due to their recent discovery, however, available sequences contain less than 20 events at present. In this paper we present results for the analysis of recurrence times for several micro-repeater sequences from Parkfield and adjacent regions. To improve the statistical significance of our findings, we combine several sequences into one by rescaling the individual sets by their respective mean recurrence intervals and Weibull exponents. This novel approach of rescaled combination yields the most extensive data set possible. We find that the resulting statistics can be fitted well by an exponential distribution, confirming the universal applicability of the Weibull distribution to characteristic earthquakes. A similar result is obtained from rescaled combination, however, with regard to the lognormal distribution.
Purcell, Jeremy J.; Rapp, Brenda
2013-01-01
Previous research has shown that damage to the neural substrates of orthographic processing can lead to functional reorganization during reading (Tsapkini et al., 2011); in this research we ask if the same is true for spelling. To examine the functional reorganization of spelling networks we present a novel three-stage Individual Peak Probability Comparison (IPPC) analysis approach for comparing the activation patterns obtained during fMRI of spelling in a single brain-damaged individual with dysgraphia to those obtained in a set of non-impaired control participants. The first analysis stage characterizes the convergence in activations across non-impaired control participants by applying a technique typically used for characterizing activations across studies: Activation Likelihood Estimate (ALE) (Turkeltaub et al., 2002). This method was used to identify locations that have a high likelihood of yielding activation peaks in the non-impaired participants. The second stage provides a characterization of the degree to which the brain-damaged individual's activations correspond to the group pattern identified in Stage 1. This involves performing a Mahalanobis distance statistics analysis (Tsapkini et al., 2011) that compares each of a control group's peak activation locations to the nearest peak generated by the brain-damaged individual. The third stage evaluates the extent to which the brain-damaged individual's peaks are atypical relative to the range of individual variation among the control participants. This IPPC analysis allows for a quantifiable, statistically sound method for comparing an individual's activation pattern to the patterns observed in a control group and, thus, provides a valuable tool for identifying functional reorganization in a brain-damaged individual with impaired spelling. Furthermore, this approach can be applied more generally to compare any individual's activation pattern with that of a set of other individuals. PMID:24399981
Prediction of River Flooding using Geospatial and Statistical Analysis in New York, USA and Kent, UK
NASA Astrophysics Data System (ADS)
Marsellos, A.; Tsakiri, K.; Smith, M.
2014-12-01
Flooding in the rivers normally occurs during periods of excessive precipitation (i.e. New York, USA; Kent, UK) or ice jams during the winter period (New York, USA). For the prediction and mapping of the river flooding, it is necessary to evaluate the spatial distribution of the water (volume) in the river as well as study the interaction between the climatic and hydrological variables. Two study areas have been analyzed; one in Mohawk River, New York and one in Kent, United Kingdom (UK). A high resolution Digital Elevation Model (DEM) of the Mohawk River, New York has been used for a GIS flooding simulation to determine the maximum elevation value of the water that cannot continue to be restricted in the trunk stream and as a result flooding in the river may be triggered. The Flooding Trigger Level (FTL) is determined by incremental volumetric and surface calculations from Triangulated Irregular Network (TIN) with the use of GIS software and LiDAR data. The prediction of flooding in the river can also be improved by the statistical analysis of the hydrological and climatic variables in Mohawk River and Kent, UK. A methodology of time series analysis has been applied for the decomposition of the hydrological (water flow and ground water data) and climatic data in both locations. The KZ (Kolmogorov-Zurbenko) filter is used for the decomposition of the time series into the long, seasonal, and short term components. The explanation of the long term component of the water flow using the climatic variables has been improved up to 90% for both locations. Similar analysis has been performed for the prediction of the seasonal and short term component. This methodology can be applied for flooding of the rivers in multiple sites.
Young, Robin L; Weinberg, Janice; Vieira, Verónica; Ozonoff, Al; Webster, Thomas F
2010-07-19
A common, important problem in spatial epidemiology is measuring and identifying variation in disease risk across a study region. In application of statistical methods, the problem has two parts. First, spatial variation in risk must be detected across the study region and, second, areas of increased or decreased risk must be correctly identified. The location of such areas may give clues to environmental sources of exposure and disease etiology. One statistical method applicable in spatial epidemiologic settings is a generalized additive model (GAM) which can be applied with a bivariate LOESS smoother to account for geographic location as a possible predictor of disease status. A natural hypothesis when applying this method is whether residential location of subjects is associated with the outcome, i.e. is the smoothing term necessary? Permutation tests are a reasonable hypothesis testing method and provide adequate power under a simple alternative hypothesis. These tests have yet to be compared to other spatial statistics. This research uses simulated point data generated under three alternative hypotheses to evaluate the properties of the permutation methods and compare them to the popular spatial scan statistic in a case-control setting. Case 1 was a single circular cluster centered in a circular study region. The spatial scan statistic had the highest power though the GAM method estimates did not fall far behind. Case 2 was a single point source located at the center of a circular cluster and Case 3 was a line source at the center of the horizontal axis of a square study region. Each had linearly decreasing logodds with distance from the point. The GAM methods outperformed the scan statistic in Cases 2 and 3. Comparing sensitivity, measured as the proportion of the exposure source correctly identified as high or low risk, the GAM methods outperformed the scan statistic in all three Cases. The GAM permutation testing methods provide a regression-based alternative to the spatial scan statistic. Across all hypotheses examined in this research, the GAM methods had competing or greater power estimates and sensitivities exceeding that of the spatial scan statistic.
2010-01-01
Background A common, important problem in spatial epidemiology is measuring and identifying variation in disease risk across a study region. In application of statistical methods, the problem has two parts. First, spatial variation in risk must be detected across the study region and, second, areas of increased or decreased risk must be correctly identified. The location of such areas may give clues to environmental sources of exposure and disease etiology. One statistical method applicable in spatial epidemiologic settings is a generalized additive model (GAM) which can be applied with a bivariate LOESS smoother to account for geographic location as a possible predictor of disease status. A natural hypothesis when applying this method is whether residential location of subjects is associated with the outcome, i.e. is the smoothing term necessary? Permutation tests are a reasonable hypothesis testing method and provide adequate power under a simple alternative hypothesis. These tests have yet to be compared to other spatial statistics. Results This research uses simulated point data generated under three alternative hypotheses to evaluate the properties of the permutation methods and compare them to the popular spatial scan statistic in a case-control setting. Case 1 was a single circular cluster centered in a circular study region. The spatial scan statistic had the highest power though the GAM method estimates did not fall far behind. Case 2 was a single point source located at the center of a circular cluster and Case 3 was a line source at the center of the horizontal axis of a square study region. Each had linearly decreasing logodds with distance from the point. The GAM methods outperformed the scan statistic in Cases 2 and 3. Comparing sensitivity, measured as the proportion of the exposure source correctly identified as high or low risk, the GAM methods outperformed the scan statistic in all three Cases. Conclusions The GAM permutation testing methods provide a regression-based alternative to the spatial scan statistic. Across all hypotheses examined in this research, the GAM methods had competing or greater power estimates and sensitivities exceeding that of the spatial scan statistic. PMID:20642827
NASA Astrophysics Data System (ADS)
Magnusdottir, G.; Bain, C.; Smyth, P.; Stern, H.; Knapp, K.
2010-12-01
A team of multidisciplinary scientists at the University of California Irvine has developed a novel spatial-temporal statistical model to detect the presence/absence of the ITCZ in high-resolution instantaneous satellite data. The Markov random field (MRF) statistical model is briefly introduced and compared to other automatic methods such as thresholding. The statistical model emulates human identification of the ITCZ as an envelope of convective activity (as seen in different fields) plus produces the same results given the same data, which may not be the case for human analysis. The MRF statistical model uses satellite data at a given location as well as information from its neighboring points (in time and space) to decide whether the given point is classified as ITCZ or non-ITCZ. Two different labels of ITCZ occurrence are produced. IR-only labels result from running the model with 3-hourly infrared data available for a 30 yr period, 1980--2009. Data-all labels result from running the model with additional satellite data (visible and total precipitable water), available from 1995--2008. IR-only labels detect less area of ITCZ than Data-all labels, especially where the ITCZ is shallower. Yet, qualitatively, the results for the two sets of labels are similar. Here, we focus on results from the IR-only labels over the east Pacific for the past 30 summer half-years (May to October). The IR data are from the HURSAT Basin data of NOAA’s National Climatic Data Center, which are derived from ISCCP B1 data. The data were collected from radiometers on different geostationary satellites. The IR channel data were recalibrated to reduce inter-satellite differences. The seasonal distribution of the ITCZ through the summer half year is presented, showing typical location and extent. The ITCZ is mostly confined to the eastern Pacific in May, and becomes more zonally distributed towards September and October each year. Northward and westward shifts in the location of the ITCZ occur in line with the seasonal cycle and warm sea surface temperatures. The ITCZ is quite variable on interannual time scales and highly correlated with ENSO variability. When we removed the ENSO signal from labels, interannual variability remained high. The resulting IR-only labels, showed no evidence of a trend in location, nor evidence of a trend in area for the 30 yr period. However, a trend in cloudiness within labels is observed and will be discussed.
Air Quality Forecasting through Different Statistical and Artificial Intelligence Techniques
NASA Astrophysics Data System (ADS)
Mishra, D.; Goyal, P.
2014-12-01
Urban air pollution forecasting has emerged as an acute problem in recent years because there are sever environmental degradation due to increase in harmful air pollutants in the ambient atmosphere. In this study, there are different types of statistical as well as artificial intelligence techniques are used for forecasting and analysis of air pollution over Delhi urban area. These techniques are principle component analysis (PCA), multiple linear regression (MLR) and artificial neural network (ANN) and the forecasting are observed in good agreement with the observed concentrations through Central Pollution Control Board (CPCB) at different locations in Delhi. But such methods suffers from disadvantages like they provide limited accuracy as they are unable to predict the extreme points i.e. the pollution maximum and minimum cut-offs cannot be determined using such approach. Also, such methods are inefficient approach for better output forecasting. But with the advancement in technology and research, an alternative to the above traditional methods has been proposed i.e. the coupling of statistical techniques with artificial Intelligence (AI) can be used for forecasting purposes. The coupling of PCA, ANN and fuzzy logic is used for forecasting of air pollutant over Delhi urban area. The statistical measures e.g., correlation coefficient (R), normalized mean square error (NMSE), fractional bias (FB) and index of agreement (IOA) of the proposed model are observed in better agreement with the all other models. Hence, the coupling of statistical and artificial intelligence can be use for the forecasting of air pollutant over urban area.
Edema is not a reliable diagnostic sign to exclude small brain metastases.
Schneider, Tanja; Kuhne, Jan Felix; Bittrich, Paul; Schroeder, Julian; Magnus, Tim; Mohme, Malte; Grosser, Malte; Schoen, Gerhard; Fiehler, Jens; Siemonsen, Susanne
2017-01-01
No prior systematic study on the extent of vasogenic edema (VE) in patients with brain metastases (BM) exists. Here, we aim to determine 1) the general volumetric relationship between BM and VE, 2) a threshold diameter above which a BM shows VE, and 3) the influence of the primary tumor and location of the BM in order to improve diagnostic processes and understanding of edema formation. This single center, retrospective study includes 173 untreated patients with histologically proven BM. Semi-manual segmentation of 1416 BM on contrast-enhanced T1-weighted images and of 865 VE on fluid-attenuated inversion recovery/T2-weighted images was conducted. Statistical analyses were performed using a paired-samples t-test, linear regression/generalized mixed-effects model, and receiver-operating characteristic (ROC) curve controlling for the possible effect of non-uniformly distributed metastases among patients. For BM with non-confluent edema (n = 545), there was a statistically significant positive correlation between the volumes of the BM and the VE (P < 0.001). The optimal threshold for edema formation was a diameter of 9.4 mm for all BM. The primary tumors as interaction term in multivariate analysis had a significant influence on VE formation whereas location had not. Hence VE development is dependent on the volume of the underlying BM and the site of the primary neoplasm, but not from the location of the BM.
Ogura, I; Kaneda, T; Sasaki, Y; Buch, K; Sakai, O
2015-06-01
Temporal bone fracture after mandibular trauma is thought to be rare, and its prevalence has not been reported in the literature. The purpose of this study was to investigate the prevalence of temporal bone fractures in patients with mandibular fractures and the relationship between temporal bone fractures and the mandibular fracture location using multidetector-row computed tomography (MDCT). A prospective study was performed in 201 patients with mandibular fractures who underwent 64-MDCT scans. The mandibular fracture locations were classified as median, paramedian, angle, and condylar types. Statistical analysis for the relationship between prevalence of temporal bone fractures and mandibular fracture locations was performed using χ(2) test with Fisher's exact test. A P-value < 0.05 was considered statistically significant. The percentage of cases with temporal bone fracture was 3.0 % of all patients with mandibular fractures and 19.0 % of those with multiple mandibular fractures of paramedian and condylar type. There was a significant relationship between the incidence of temporal bone fracture and the paramedian- and condylar-type mandibular fracture (P = 0.001). Multiple mandibular fractures of paramedian and condylar type may be a stronger indicator for temporal bone fractures. This study suggests that patients with mandibular fracture, especially the paramedian and condylar type, should be examined for coexisting temporal bone fracture using MDCT.
Field comparison of analytical results from discrete-depth ground water samplers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zemo, D.A.; Delfino, T.A.; Gallinatti, J.D.
1995-07-01
Discrete-depth ground water samplers are used during environmental screening investigations to collect ground water samples in lieu of installing and sampling monitoring wells. Two of the most commonly used samplers are the BAT Enviroprobe and the QED HydroPunch I, which rely on differing sample collection mechanics. Although these devices have been on the market for several years, it was unknown what, if any, effect the differences would have on analytical results for ground water samples containing low to moderate concentrations of chlorinated volatile organic compounds (VOCs). This study investigated whether the discrete-depth ground water sampler used introduces statistically significant differencesmore » in analytical results. The goal was to provide a technical basis for allowing the two devices to be used interchangeably during screening investigations. Because this study was based on field samples, it included several sources of potential variability. It was necessary to separate differences due to sampler type from variability due to sampling location, sample handling, and laboratory analytical error. To statistically evaluate these sources of variability, the experiment was arranged in a nested design. Sixteen ground water samples were collected from eight random locations within a 15-foot by 15-foot grid. The grid was located in an area where shallow ground water was believed to be uniformly affected by VOCs. The data were evaluated using analysis of variance.« less
Jaynes, M.L.
1994-01-01
Hydrologic, water-quality, and meteorologic data were collected from January 1993 through March 1994 as part of a water-quality investigation of the Upper Catawba River Basin, North Carolina. Specific objectives of the investigation were to characterize the water quality of Rhodhiss Lake, Lake Hickory, and three tributary streams, and to calibrate hydrodynamic water-quality models for the two reservoirs. Sampling locations included 11 sites in Rhodhiss Lake, 14 sites in Lake Hickory, and 3 tributary sites. Tributary sites were located at Lower Creek upstream from Rhodhiss Lake and at Upper Little River and Middle Little River upstream from Lake Hickory. During 21 sampling visits, specific conductance, pH, water temperature, dissolved-oxygen concentration, and water transparency were measured at all sampling locations. Water samples were collected for analysis of biochemical oxygen demand, fecal coliform bacteria, hardness, alkalinity, total and volatile suspended solids, suspended sediment, nutrients, total organic carbon, chlorophyll, iron, calcium, and magnesium from three sites in each reservoir and from the three tributary sites. Chemical and particle-size analyses of bottom material from Rhodhiss Lake and Lake Hickory were performed once during the study. At selected locations, automated instruments recorded water level, streamflow, water temperature, solar radiation, and air temperature at 15-minute intervals throughout the study. Hydrologic data presented in the report include monthly water-level statistics and daily mean values of discharge. Diagrams, tables, and statistical summaries of water-quality data are provided. Meteorologic data in the report include monthly precipitation, and daily mean values of solar radiation and air temperature.
Donation return time at fixed and mobile donation sites
Carey, Patricia M.; High, Patrick M.; Schlumpf, Karen S.; Johnson, Bryce R.; Mast, Alan E.; Rios, Jorge A.; Simon, Toby L.; Wilkinson, Susan L.
2013-01-01
BACKGROUND This study investigated the effect of blood donation environment, fixed or mobile with differing sponsor types, on donation return time. STUDY DESIGN AND METHODS Data from 2006 through 2009 at six US blood centers participating in the Retrovirus Epidemiology Donor Study-II (REDS-II) were used for analysis. Descriptive statistics stratified by whole blood (WB), plateletpheresis (PP), and double red blood cell (R2) donations were obtained for fixed and mobile locations, including median number of donations and median interdonation interval. A survival analysis estimated median return time at fixed and mobile sites, while controlling for censored return times, demographics, blood center, and mandatory recovery times. RESULTS Two-thirds (67.9%) of WB donations were made at mobile sites, 97.4% of PP donations were made at fixed sites, and R2 donations were equally distributed between fixed and mobile locations. For donations at fixed sites only or alternating between fixed and mobile sites, the highest median numbers of donations were nine and eight, respectively, and the shortest model-adjusted median return times (controlling for mandatory eligibility times of 56 and 112 days) were 36 and 30 days for WB and R2 donations, respectively. For PP donations, the shortest model-adjusted median return time was 23 days at a fixed location and the longest was 693 days at community locations. CONCLUSION WB, PP, and R2 donors with the shortest time between donations were associated with fixed locations and those alternating between fixed and mobile locations, even after controlling for differing mandatory recovery times for the different blood donation procedures. PMID:21745215
NASA Astrophysics Data System (ADS)
Marks, Jamar Terry
The purpose of this quasi-experimental, nonequivalent pretest-posttest control group design study was to determine if any differences existed in upper elementary school students' science academic achievement when instructed using an 8-week integrated science and English language arts literacy supplemental instructional intervention in conjunction with traditional science classroom instruction as compared to when instructed using solely traditional science classroom instruction. The targeted sample population consisted of fourth-grade students enrolled in a public elementary school located in the southeastern region of the United States. The convenience sample size consisted of 115 fourth-grade students enrolled in science classes. The pretest and posttest academic achievement data collected consisted of the science segment from the Spring 2015, and Spring 2016 state standardized assessments. Pretest and posttest academic achievement data were analyzed using an ANCOVA statistical procedure to test for differences, and the researcher reported the results of the statistical analysis. The results of the study show no significant difference in science academic achievement between treatment and control groups. An interpretation of the results and recommendations for future research were provided by the researcher upon completion of the statistical analysis.
Insights into Corona Formation through Statistical Analyses
NASA Technical Reports Server (NTRS)
Glaze, L. S.; Stofan, E. R.; Smrekar, S. E.; Baloga, S. M.
2002-01-01
Statistical analysis of an expanded database of coronae on Venus indicates that the populations of Type 1 (with fracture annuli) and 2 (without fracture annuli) corona diameters are statistically indistinguishable, and therefore we have no basis for assuming different formation mechanisms. Analysis of the topography and diameters of coronae shows that coronae that are depressions, rimmed depressions, and domes tend to be significantly smaller than those that are plateaus, rimmed plateaus, or domes with surrounding rims. This is consistent with the model of Smrekar and Stofan and inconsistent with predictions of the spreading drop model of Koch and Manga. The diameter range for domes, the initial stage of corona formation, provides a broad constraint on the buoyancy of corona-forming plumes. Coronae are only slightly more likely to be topographically raised than depressions, with Type 1 coronae most frequently occurring as rimmed depressions and Type 2 coronae most frequently occuring with flat interiors and raised rims. Most Type 1 coronae are located along chasmata systems or fracture belts, while Type 2 coronas are found predominantly as isolated features in the plains. Coronae at hotspot rises tend to be significantly larger than coronae in other settings, consistent with a hotter upper mantle at hotspot rises and their active state.
Spatial Statistics for Tumor Cell Counting and Classification
NASA Astrophysics Data System (ADS)
Wirjadi, Oliver; Kim, Yoo-Jin; Breuel, Thomas
To count and classify cells in histological sections is a standard task in histology. One example is the grading of meningiomas, benign tumors of the meninges, which requires to assess the fraction of proliferating cells in an image. As this process is very time consuming when performed manually, automation is required. To address such problems, we propose a novel application of Markov point process methods in computer vision, leading to algorithms for computing the locations of circular objects in images. In contrast to previous algorithms using such spatial statistics methods in image analysis, the present one is fully trainable. This is achieved by combining point process methods with statistical classifiers. Using simulated data, the method proposed in this paper will be shown to be more accurate and more robust to noise than standard image processing methods. On the publicly available SIMCEP benchmark for cell image analysis algorithms, the cell count performance of the present paper is significantly more accurate than results published elsewhere, especially when cells form dense clusters. Furthermore, the proposed system performs as well as a state-of-the-art algorithm for the computer-aided histological grading of meningiomas when combined with a simple k-nearest neighbor classifier for identifying proliferating cells.
NASA Technical Reports Server (NTRS)
Ray, Terrill W.; Anderson, Don L.
1994-01-01
There is increasing use of statistical correlations between geophysical fields and between geochemical and geophysical fields in attempts to understand how the Earth works. Typically, such correlations have been based on spherical harmonic expansions. The expression of functions on the sphere as spherical harmonic series has many pitfalls, especially if the data are nonuniformly and/or sparsely sampled. Many of the difficulties involved in the use of spherical harmonic expansion techniques can be avoided through the use of spatial domain correlations, but this introduces other complications, such as the choice of a sampling lattice. Additionally, many geophysical and geochemical fields fail to satisfy the assumptions of standard statistical significance tests. This is especially problematic when the data values to be correlated with a geophysical field were collected at sample locations which themselves correlate with that field. This paper examines many correlations which have been claimed in the past between geochemistry and mantle tomography and between hotspot, ridge, and slab locations and tomography using both spherical harmonic coefficient correlations and spatial domain correlations. No conclusively significant correlations are found between isotopic geochemistry and mantle tomography. The Crough and Jurdy (short) hotspot location list shows statistically significant correlation with lowermost mantle tomography for degree 2 of the spherical harmonic expansion, but there are no statistically significant correlations in the spatial case. The Vogt (long) hotspot location list does not correlate with tomography anywhere in the mantle using either technique. Both hotspot lists show a strong correlation between hotspot locations and geoid highs when spatially correlated, but no correlations are revealed by spherical harmonic techniques. Ridge locations do not show any statistically significant correlations with tomography, slab locations, or the geoid; the strongest correlation is with lowermost mantle tomography, which is probably spurious. The most striking correlations are between mantle tomography and post-Pangean subducted slabs. The integrated locations of slabs correlate strongly with fast areas near the transition zone and the core-mantle boundary and with slow regions from 1022-1248 km depth. This seems to be consistent with the 'avalanching' downwellings which have been indicated by models of the mantle which include an endothermic phase transition at the 670-km discontinuity, although this is not a unique interpretation. Taken as a whole, these results suggest that slabs and associated cold downwellings are the dominant feature of mantle convection. Hotspot locations are no better correlated with lower mantle tomography than are ridge locations.
Continuous EEG signal analysis for asynchronous BCI application.
Hsu, Wei-Yen
2011-08-01
In this study, we propose a two-stage recognition system for continuous analysis of electroencephalogram (EEG) signals. An independent component analysis (ICA) and correlation coefficient are used to automatically eliminate the electrooculography (EOG) artifacts. Based on the continuous wavelet transform (CWT) and Student's two-sample t-statistics, active segment selection then detects the location of active segment in the time-frequency domain. Next, multiresolution fractal feature vectors (MFFVs) are extracted with the proposed modified fractal dimension from wavelet data. Finally, the support vector machine (SVM) is adopted for the robust classification of MFFVs. The EEG signals are continuously analyzed in 1-s segments, and every 0.5 second moves forward to simulate asynchronous BCI works in the two-stage recognition architecture. The segment is first recognized as lifted or not in the first stage, and then is classified as left or right finger lifting at stage two if the segment is recognized as lifting in the first stage. Several statistical analyses are used to evaluate the performance of the proposed system. The results indicate that it is a promising system in the applications of asynchronous BCI work.
NASA Technical Reports Server (NTRS)
Bremner, P. G.; Blelloch, P. A.; Hutchings, A.; Shah, P.; Streett, C. L.; Larsen, C. E.
2011-01-01
This paper describes the measurement and analysis of surface fluctuating pressure level (FPL) data and vibration data from a plume impingement aero-acoustic and vibration (PIAAV) test to validate NASA s physics-based modeling methods for prediction of panel vibration in the near field of a hot supersonic rocket plume. For this test - reported more fully in a companion paper by Osterholt & Knox at 26th Aerospace Testing Seminar, 2011 - the flexible panel was located 2.4 nozzle diameters from the plume centerline and 4.3 nozzle diameters downstream from the nozzle exit. The FPL loading is analyzed in terms of its auto spectrum, its cross spectrum, its spatial correlation parameters and its statistical properties. The panel vibration data is used to estimate the in-situ damping under plume FPL loading conditions and to validate both finite element analysis (FEA) and statistical energy analysis (SEA) methods for prediction of panel response. An assessment is also made of the effects of non-linearity in the panel elasticity.
NASA Astrophysics Data System (ADS)
Hu, Chongqing; Li, Aihua; Zhao, Xingyang
2011-02-01
This paper proposes a multivariate statistical analysis approach to processing the instantaneous engine speed signal for the purpose of locating multiple misfire events in internal combustion engines. The state of each cylinder is described with a characteristic vector extracted from the instantaneous engine speed signal following a three-step procedure. These characteristic vectors are considered as the values of various procedure parameters of an engine cycle. Therefore, determination of occurrence of misfire events and identification of misfiring cylinders can be accomplished by a principal component analysis (PCA) based pattern recognition methodology. The proposed algorithm can be implemented easily in practice because the threshold can be defined adaptively without the information of operating conditions. Besides, the effect of torsional vibration on the engine speed waveform is interpreted as the presence of super powerful cylinder, which is also isolated by the algorithm. The misfiring cylinder and the super powerful cylinder are often adjacent in the firing sequence, thus missing detections and false alarms can be avoided effectively by checking the relationship between the cylinders.
Bossier, Han; Seurinck, Ruth; Kühn, Simone; Banaschewski, Tobias; Barker, Gareth J.; Bokde, Arun L. W.; Martinot, Jean-Luc; Lemaitre, Herve; Paus, Tomáš; Millenet, Sabina; Moerkerke, Beatrijs
2018-01-01
Given the increasing amount of neuroimaging studies, there is a growing need to summarize published results. Coordinate-based meta-analyses use the locations of statistically significant local maxima with possibly the associated effect sizes to aggregate studies. In this paper, we investigate the influence of key characteristics of a coordinate-based meta-analysis on (1) the balance between false and true positives and (2) the activation reliability of the outcome from a coordinate-based meta-analysis. More particularly, we consider the influence of the chosen group level model at the study level [fixed effects, ordinary least squares (OLS), or mixed effects models], the type of coordinate-based meta-analysis [Activation Likelihood Estimation (ALE) that only uses peak locations, fixed effects, and random effects meta-analysis that take into account both peak location and height] and the amount of studies included in the analysis (from 10 to 35). To do this, we apply a resampling scheme on a large dataset (N = 1,400) to create a test condition and compare this with an independent evaluation condition. The test condition corresponds to subsampling participants into studies and combine these using meta-analyses. The evaluation condition corresponds to a high-powered group analysis. We observe the best performance when using mixed effects models in individual studies combined with a random effects meta-analysis. Moreover the performance increases with the number of studies included in the meta-analysis. When peak height is not taken into consideration, we show that the popular ALE procedure is a good alternative in terms of the balance between type I and II errors. However, it requires more studies compared to other procedures in terms of activation reliability. Finally, we discuss the differences, interpretations, and limitations of our results. PMID:29403344
NASA Astrophysics Data System (ADS)
Krasting, John P.
Snowfall is an important feature of the Earth's climate system that has the ability to influence both the natural world and human activity. This dissertation examines past and future changes in snowfall related to increasing concentrations of anthropogenic greenhouse gases. Snowfall observations for North America, derived snowfall products for the Northern Hemisphere, and simulations performed with 13 coupled atmosphere-ocean global climate models are analyzed. The analysis of the spatial pattern of simulated annual trends on a grid point basis from 1951 to 1999 indicates that a transition zone exists above 60° N latitude across the Northern Hemisphere that separates negative trends in annual snowfall in the mid-latitudes and positive trends at higher latitudes. Regional analysis of observed annual snowfall indicates that statistically significant trends are found in western North America, Japan, and southern Russia. A majority of the observed historical trends in annual snowfall elsewhere in the Northern Hemisphere, however, are not statistically significant and this result is consistent with model simulations. Projections of future snowfall indicate the presence of a similar transition zone between negative and positive snowfall trends that corresponds with the area between the -10 to -15°C isotherms of the multi-model mean temperature of the late twentieth century in each of the fall, winter, and spring seasons. Redistributions of snowfall throughout the entire snow season are likely -- even in locations where there is little change in annual snowfall. Changes in the fraction of precipitation falling as snow contribute to decreases in snowfall across most Northern Hemisphere regions, while changes in precipitation typically contribute to increases in snowfall. Snowfall events less than or equal to 5 cm are found to decrease in the future across most of the Northern Hemisphere, while snowfall events greater than or equal to 20 cm increase in some locations, such as northern Quebec. A signal-to-noise analysis reveals that the projected changes in snowfall are likely to become apparent during the twenty-first century for most locations in the Northern Hemisphere.
Influence of environmental statistics on inhibition of saccadic return
Farrell, Simon; Ludwig, Casimir J. H.; Ellis, Lucy A.; Gilchrist, Iain D.
2009-01-01
Initiating an eye movement is slowed if the saccade is directed to a location that has been fixated in the recent past. We show that this inhibitory effect is modulated by the temporal statistics of the environment: If a return location is likely to become behaviorally relevant, inhibition of return is absent. By fitting an accumulator model of saccadic decision-making, we show that the inhibitory effect and the sensitivity to local statistics can be dissociated in their effects on the rate of accumulation of evidence, and the threshold controlling the amount of evidence needed to generate a saccade. PMID:20080778
EEG source analysis of data from paralysed subjects
NASA Astrophysics Data System (ADS)
Carabali, Carmen A.; Willoughby, John O.; Fitzgibbon, Sean P.; Grummett, Tyler; Lewis, Trent; DeLosAngeles, Dylan; Pope, Kenneth J.
2015-12-01
One of the limitations of Encephalography (EEG) data is its quality, as it is usually contaminated with electric signal from muscle. This research intends to study results of two EEG source analysis methods applied to scalp recordings taken in paralysis and in normal conditions during the performance of a cognitive task. The aim is to determinate which types of analysis are appropriate for dealing with EEG data containing myogenic components. The data used are the scalp recordings of six subjects in normal conditions and during paralysis while performing different cognitive tasks including the oddball task which is the object of this research. The data were pre-processed by filtering it and correcting artefact, then, epochs of one second long for targets and distractors were extracted. Distributed source analysis was performed in BESA Research 6.0, using its results and information from the literature, 9 ideal locations for source dipoles were identified. The nine dipoles were used to perform discrete source analysis, fitting them to the averaged epochs for obtaining source waveforms. The results were statistically analysed comparing the outcomes before and after the subjects were paralysed. Finally, frequency analysis was performed for better explain the results. The findings were that distributed source analysis could produce confounded results for EEG contaminated with myogenic signals, conversely, statistical analysis of the results from discrete source analysis showed that this method could help for dealing with EEG data contaminated with muscle electrical signal.
NASA Astrophysics Data System (ADS)
Shauly, Eitan; Parag, Allon; Khmaisy, Hafez; Krispil, Uri; Adan, Ofer; Levi, Shimon; Latinski, Sergey; Schwarzband, Ishai; Rotstein, Israel
2011-04-01
A fully automated system for process variability analysis of high density standard cell was developed. The system consists of layout analysis with device mapping: device type, location, configuration and more. The mapping step was created by a simple DRC run-set. This database was then used as an input for choosing locations for SEM images and for specific layout parameter extraction, used by SPICE simulation. This method was used to analyze large arrays of standard cell blocks, manufactured using Tower TS013LV (Low Voltage for high-speed applications) Platforms. Variability of different physical parameters like and like Lgate, Line-width-roughness and more as well as of electrical parameters like drive current (Ion), off current (Ioff) were calculated and statistically analyzed, in order to understand the variability root cause. Comparison between transistors having the same W/L but with different layout configurations and different layout environments (around the transistor) was made in terms of performances as well as process variability. We successfully defined "robust" and "less-robust" transistors configurations, and updated guidelines for Design-for-Manufacturing (DfM).
Analysis of Information Content in High-Spectral Resolution Sounders using Subset Selection Analysis
NASA Technical Reports Server (NTRS)
Velez-Reyes, Miguel; Joiner, Joanna
1998-01-01
In this paper, we summarize the results of the sensitivity analysis and data reduction carried out to determine the information content of AIRS and IASI channels. The analysis and data reduction was based on the use of subset selection techniques developed in the linear algebra and statistical community to study linear dependencies in high dimensional data sets. We applied the subset selection method to study dependency among channels by studying the dependency among their weighting functions. Also, we applied the technique to study the information provided by the different levels in which the atmosphere is discretized for retrievals and analysis. Results from the method correlate well with intuition in many respects and point out to possible modifications for band selection in sensor design and number and location of levels in the analysis process.
NASA Technical Reports Server (NTRS)
Manning, Robert M.
1986-01-01
A rain attenuation prediction model is described for use in calculating satellite communication link availability for any specific location in the world that is characterized by an extended record of rainfall. Such a formalism is necessary for the accurate assessment of such availability predictions in the case of the small user-terminal concept of the Advanced Communication Technology Satellite (ACTS) Project. The model employs the theory of extreme value statistics to generate the necessary statistical rainrate parameters from rain data in the form compiled by the National Weather Service. These location dependent rain statistics are then applied to a rain attenuation model to obtain a yearly prediction of the occurrence of attenuation on any satellite link at that location. The predictions of this model are compared to those of the Crane Two-Component Rain Model and some empirical data and found to be very good. The model is then used to calculate rain attenuation statistics at 59 locations in the United States (including Alaska and Hawaii) for the 20 GHz downlinks and 30 GHz uplinks of the proposed ACTS system. The flexibility of this modeling formalism is such that it allows a complete and unified treatment of the temporal aspects of rain attenuation that leads to the design of an optimum stochastic power control algorithm, the purpose of which is to efficiently counter such rain fades on a satellite link.
Ladd, David E.; Law, George S.
2007-01-01
The U.S. Geological Survey (USGS) provides streamflow and other stream-related information needed to protect people and property from floods, to plan and manage water resources, and to protect water quality in the streams. Streamflow statistics provided by the USGS, such as the 100-year flood and the 7-day 10-year low flow, frequently are used by engineers, land managers, biologists, and many others to help guide decisions in their everyday work. In addition to streamflow statistics, resource managers often need to know the physical and climatic characteristics (basin characteristics) of the drainage basins for locations of interest to help them understand the mechanisms that control water availability and water quality at these locations. StreamStats is a Web-enabled geographic information system (GIS) application that makes it easy for users to obtain streamflow statistics, basin characteristics, and other information for USGS data-collection stations and for ungaged sites of interest. If a user selects the location of a data-collection station, StreamStats will provide previously published information for the station from a database. If a user selects a location where no data are available (an ungaged site), StreamStats will run a GIS program to delineate a drainage basin boundary, measure basin characteristics, and estimate streamflow statistics based on USGS streamflow prediction methods. A user can download a GIS feature class of the drainage basin boundary with attributes including the measured basin characteristics and streamflow estimates.
NASA Technical Reports Server (NTRS)
Slobin, S. D.; Piazzolla, S.
2002-01-01
Cloud opacity is one of the main atmospheric physical phenomena that can jeopardize the successful completion of an optical link between a spacecraft and a ground station. Hence, the site location chosen for a telescope used for optical communications must rely on knowledge of weather and cloud cover statistics for the geographical area where the telescope itself is located.
Similarity in Bilateral Isolated Internal Orbital Fractures.
Chen, Hung-Chang; Cox, Jacob T; Sanyal, Abanti; Mahoney, Nicholas R
2018-04-13
In evaluating patients sustaining bilateral isolated internal orbital fractures, the authors have observed both similar fracture locations and also similar expansion of orbital volumes. In this study, we aim to investigate if there is a propensity for the 2 orbits to fracture in symmetrically similar patterns when sustaining similar trauma. A retrospective chart review was performed studying all cases at our institution of bilateral isolated internal orbital fractures involving the medial wall and/or the floor at the time of presentation. The similarity of the bilateral fracture locations was evaluated using the Fisher's exact test. The bilateral expanded orbital volumes were analyzed using the Wilcoxon signed-rank test to assess for orbital volume similarity. Twenty-four patients with bilateral internal orbital fractures were analyzed for fracture location similarity. Seventeen patients (70.8%) had 100% concordance in the orbital subregion fractured, and the association between the right and the left orbital fracture subregion locations was statistically significant (P < 0.0001). Fifteen patients were analyzed for orbital volume similarity. The average orbital cavity volume was 31.2 ± 3.8 cm on the right and 32.0 ± 3.7 cm on the left. There was a statistically significant difference between right and left orbital cavity volumes (P = 0.0026). The data from this study suggest that an individual who suffers isolated bilateral internal orbital fractures has a statistically significant similarity in the location of their orbital fractures. However, there does not appear to be statistically significant similarity in the expansion of the orbital volumes in these patients.
Distribution of water quality parameters in Dhemaji district, Assam (India).
Buragohain, Mridul; Bhuyan, Bhabajit; Sarma, H P
2010-07-01
The primary objective of this study is to present a statistically significant water quality database of Dhemaji district, Assam (India) with special reference to pH, fluoride, nitrate, arsenic, iron, sodium and potassium. 25 water samples collected from different locations of five development blocks in Dhemaji district have been studied separately. The implications presented are based on statistical analyses of the raw data. Normal distribution statistics and reliability analysis (correlation and covariance matrix) have been employed to find out the distribution pattern, localisation of data, and other related information. Statistical observations show that all the parameters under investigation exhibit non uniform distribution with a long asymmetric tail either on the right or left side of the median. The width of the third quartile was consistently found to be more than the second quartile for each parameter. Differences among mean, mode and median, significant skewness and kurtosis value indicate that the distribution of various water quality parameters in the study area is widely off normal. Thus, the intrinsic water quality is not encouraging due to unsymmetrical distribution of various water quality parameters in the study area.
Texture as a basis for acoustic classification of substrate in the nearshore region
NASA Astrophysics Data System (ADS)
Dennison, A.; Wattrus, N. J.
2016-12-01
Segmentation and classification of substrate type from two locations in Lake Superior, are predicted using multivariate statistical processing of textural measures derived from shallow-water, high-resolution multibeam bathymetric data. During a multibeam sonar survey, both bathymetric and backscatter data are collected. It is well documented that the statistical characteristic of a sonar backscatter mosaic is dependent on substrate type. While classifying the bottom-type on the basis on backscatter alone can accurately predict and map bottom-type, it lacks the ability to resolve and capture fine textural details, an important factor in many habitat mapping studies. Statistical processing can capture the pertinent details about the bottom-type that are rich in textural information. Further multivariate statistical processing can then isolate characteristic features, and provide the basis for an accurate classification scheme. Preliminary results from an analysis of bathymetric data and ground-truth samples collected from the Amnicon River, Superior, Wisconsin, and the Lester River, Duluth, Minnesota, demonstrate the ability to process and develop a novel classification scheme of the bottom type in two geomorphologically distinct areas.
Empirical Reference Distributions for Networks of Different Size
Smith, Anna; Calder, Catherine A.; Browning, Christopher R.
2016-01-01
Network analysis has become an increasingly prevalent research tool across a vast range of scientific fields. Here, we focus on the particular issue of comparing network statistics, i.e. graph-level measures of network structural features, across multiple networks that differ in size. Although “normalized” versions of some network statistics exist, we demonstrate via simulation why direct comparison is often inappropriate. We consider normalizing network statistics relative to a simple fully parameterized reference distribution and demonstrate via simulation how this is an improvement over direct comparison, but still sometimes problematic. We propose a new adjustment method based on a reference distribution constructed as a mixture model of random graphs which reflect the dependence structure exhibited in the observed networks. We show that using simple Bernoulli models as mixture components in this reference distribution can provide adjusted network statistics that are relatively comparable across different network sizes but still describe interesting features of networks, and that this can be accomplished at relatively low computational expense. Finally, we apply this methodology to a collection of ecological networks derived from the Los Angeles Family and Neighborhood Survey activity location data. PMID:27721556
Keenan, Michael R; Smentkowski, Vincent S; Ulfig, Robert M; Oltman, Edward; Larson, David J; Kelly, Thomas F
2011-06-01
We demonstrate for the first time that multivariate statistical analysis techniques can be applied to atom probe tomography data to estimate the chemical composition of a sample at the full spatial resolution of the atom probe in three dimensions. Whereas the raw atom probe data provide the specific identity of an atom at a precise location, the multivariate results can be interpreted in terms of the probabilities that an atom representing a particular chemical phase is situated there. When aggregated to the size scale of a single atom (∼0.2 nm), atom probe spectral-image datasets are huge and extremely sparse. In fact, the average spectrum will have somewhat less than one total count per spectrum due to imperfect detection efficiency. These conditions, under which the variance in the data is completely dominated by counting noise, test the limits of multivariate analysis, and an extensive discussion of how to extract the chemical information is presented. Efficient numerical approaches to performing principal component analysis (PCA) on these datasets, which may number hundreds of millions of individual spectra, are put forward, and it is shown that PCA can be computed in a few seconds on a typical laptop computer.
NASA Astrophysics Data System (ADS)
Munawar, Iqra
2016-07-01
Crime mapping is a dynamic process. It can be used to assist all stages of the problem solving process. Mapping crime can help police protect citizens more effectively. The decision to utilize a certain type of map or design element may change based on the purpose of a map, the audience or the available data. If the purpose of the crime analysis map is to assist in the identification of a particular problem, selected data may be mapped to identify patterns of activity that have been previously undetected. The main objective of this research was to study the spatial distribution patterns of the four common crimes i.e Narcotics, Arms, Burglary and Robbery in Gujranwala City using spatial statistical techniques to identify the hotspots. Hotspots or location of clusters were identified using Getis-Ord Gi* Statistic. Crime analysis mapping can be used to conduct a comprehensive spatial analysis of the problem. Graphic presentations of such findings provide a powerful medium to communicate conditions, patterns and trends thus creating an avenue for analysts to bring about significant policy changes. Moreover Crime mapping also helps in the reduction of crime rate.
Influence of the carrying vehicle in the aerospatial survey of natural radioactivity
NASA Technical Reports Server (NTRS)
Dejesusparada, N. (Principal Investigator); Martin, I. M.
1981-01-01
The importance of the choice of the carrying vehicle in aerial surveys of natural radioactivity, particularly in the location of uraniferous regions, is discussed. The results of observations depend on the exposure time, that is, the velocity and altitude the carrying vehicle can attain. Overflights of the same region using identical instrumentation but in two different types of aircraft were performed. A detailed statistical analysis of the measurements obtained during these flights demonstrates the precision of localization achievable by this method.
2007-06-01
or JTF air mobility operations (AFDC, 2000). As stated in the following definition, the NAMS integrates the primary functions of airlift, air...control, and communications (C3), logistics support, and aerial port functions . The goal of the en route is to minimize delays for AMC mission...process. The resulting data was used to perform a statistical analysis of AMC off-station aircraft logistic support records for AMC’s six primary
Dynamic Modeling and Testing of MSRR-1 for Use in Microgravity Environments Analysis
NASA Technical Reports Server (NTRS)
Gattis, Christy; LaVerde, Bruce; Howell, Mike; Phelps, Lisa H. (Technical Monitor)
2001-01-01
Delicate microgravity science is unlikely to succeed on the International Space Station if vibratory and transient disturbers corrupt the environment. An analytical approach to compute the on-orbit acceleration environment at science experiment locations within a standard payload rack resulting from these disturbers is presented. This approach has been grounded by correlation and comparison to test verified transfer functions. The method combines the results of finite element and statistical energy analysis using tested damping and modal characteristics to provide a reasonable approximation of the total root-mean-square (RMS) acceleration spectra at the interface to microgravity science experiment hardware.
History of water quality parameters - a study on the Sinos River/Brazil.
Konzen, G B; Figueiredo, J A S; Quevedo, D M
2015-05-01
Water is increasingly becoming a valuable resource, constituting one of the central themes of environmental, economic and social discussions. The Sinos River, located in southern Brazil, is the main river from the Sinos River Basin, representing a source of drinking water supply for a highly populated region. Considering its size and importance, it becomes necessary to conduct a study to follow up the water quality of this river, which is considered by some experts as one of the most polluted rivers in Brazil. As for this study, its great importance lies in the historical analysis of indicators. In this sense, we sought to develop aspects related to the management of water resources by performing a historical analysis of the Water Quality Index (WQI) of the Sinos River, using statistical methods. With regard to the methodological procedures, it should be pointed out that this study performs a time analysis of monitoring data on parameters related to a punctual measurement that is variable in time, using statistical tools. The data used refer to analyses of the water quality of the Sinos River (WQI) from the State Environmental Protection Agency Henrique Luiz Roessler (Fundação Estadual de Proteção Ambiental Henrique Luiz Roessler, FEPAM) covering the period between 2000 and 2008, as well as to a theoretical analysis focusing on the management of water resources. The study of WQI and its parameters by statistical analysis has shown to be effective, ensuring its effectiveness as a tool for the management of water resources. The descriptive analysis of the WQI and its parameters showed that the water quality of the Sinos River is concerning low, which reaffirms that it is one of the most polluted rivers in Brazil. It should be highlighted that there was an overall difficulty in obtaining data with the appropriate periodicity, as well as a long complete series, which limited the conduction of statistical studies such as the present one.
Frans, Lonna M.; Helsel, Dennis R.
2005-01-01
Trends in nitrate concentrations in water from 474 wells in 17 subregions in the Columbia Basin Ground Water Management Area (GWMA) in three counties in eastern Washington were evaluated using a variety of statistical techniques, including the Friedman test and the Kendall test. The Kendall test was modified from its typical 'seasonal' version into a 'regional' version by using well locations in place of seasons. No statistically significant trends in nitrate concentrations were identified in samples from wells in the GWMA, the three counties, or the 17 subregions from 1998 to 2002 when all data were included in the analysis. For wells in which nitrate concentrations were greater than 10 milligrams per liter (mg/L), however, a significant downward trend of -0.4 mg/L per year was observed between 1998 and 2002 for the GWMA as a whole, as well as for Adams County (-0.35 mg/L per year) and for Franklin County (-0.46 mg/L per year). Trend analysis for a smaller but longer-term 51-well dataset in Franklin County found a statistically significant upward trend in nitrate concentrations of 0.1 mg/L per year between 1986 and 2003. The largest increase of nitrate concentrations occurred between 1986 and 1991. No statistically significant differences were observed in this dataset between 1998 and 2003 indicating that the increase in nitrate concentrations has leveled off.
Lead Determination and Heterogeneity Analysis in Soil from a Former Firing Range
NASA Astrophysics Data System (ADS)
Urrutia-Goyes, Ricardo; Argyraki, Ariadne; Ornelas-Soto, Nancy
2017-07-01
Public places can have an unknown past of pollutants deposition. The exposition to such contaminants can create environmental and health issues. The characterization of a former firing range in Athens, Greece will allow its monitoring and encourage its remediation. This study is focused on Pb contamination in the site due to its presence in ammunition. A dense sampling design with 91 location (10 m apart) was used to determine the spatial distribution of the element in the surface soil of the study area. Duplicates samples were also collected one meter apart from 8 random locations to estimate the heterogeneity of the site. Elemental concentrations were measured using a portable XRF device after simple sample homogenization in the field. Robust Analysis of Variance showed that the contributions to the total variance were 11% from sampling, 1% analytical, and 88% geochemical; reflecting the suitability of the technique. Moreover, the extended random uncertainty relative to the mean concentration was 91.5%; confirming the high heterogeneity of the site. Statistical analysis defined a very high contamination in the area yielding to suggest the need for more in-depth analysis of other contaminants and possible health risks.
Phung, Dung; Huang, Cunrui; Rutherford, Shannon; Dwirahmadi, Febi; Chu, Cordia; Wang, Xiaoming; Nguyen, Minh; Nguyen, Nga Huy; Do, Cuong Manh; Nguyen, Trung Hieu; Dinh, Tuan Anh Diep
2015-05-01
The present study is an evaluation of temporal/spatial variations of surface water quality using multivariate statistical techniques, comprising cluster analysis (CA), principal component analysis (PCA), factor analysis (FA) and discriminant analysis (DA). Eleven water quality parameters were monitored at 38 different sites in Can Tho City, a Mekong Delta area of Vietnam from 2008 to 2012. Hierarchical cluster analysis grouped the 38 sampling sites into three clusters, representing mixed urban-rural areas, agricultural areas and industrial zone. FA/PCA resulted in three latent factors for the entire research location, three for cluster 1, four for cluster 2, and four for cluster 3 explaining 60, 60.2, 80.9, and 70% of the total variance in the respective water quality. The varifactors from FA indicated that the parameters responsible for water quality variations are related to erosion from disturbed land or inflow of effluent from sewage plants and industry, discharges from wastewater treatment plants and domestic wastewater, agricultural activities and industrial effluents, and contamination by sewage waste with faecal coliform bacteria through sewer and septic systems. Discriminant analysis (DA) revealed that nephelometric turbidity units (NTU), chemical oxygen demand (COD) and NH₃ are the discriminating parameters in space, affording 67% correct assignation in spatial analysis; pH and NO₂ are the discriminating parameters according to season, assigning approximately 60% of cases correctly. The findings suggest a possible revised sampling strategy that can reduce the number of sampling sites and the indicator parameters responsible for large variations in water quality. This study demonstrates the usefulness of multivariate statistical techniques for evaluation of temporal/spatial variations in water quality assessment and management.
Sea water quality assessment of Prince Islands' beaches in Istanbul.
Ilter Turkdogan Aydinol, F; Kanat, Gurdal; Bayhan, Hurrem
2012-01-01
In this study, seawater samples were subjected to microbiological and physicochemical analysis (water temperature, pH, Secchi disc depth and ammonia) in the Prince Islands which are located in Marmara Sea, being one of the most popular swimming areas in Istanbul. The monitoring program of the study has been carried out in the summer for 6 weeks at eight stations around the Prince Islands. Measured total coliform values were between 5 ± 2 and 26 ± 55 and faecal coliform values were between 4 ± 2 and 24 ± 50 in the monitoring stations. A statistical study has been conducted to find the relationship between total and faecal coliform concentrations, and t tests were applied. There was no significant difference in each location of the Islands, except one location. The results were evaluated by comparing with national and EU bathing water standards. Results of the study show that deep sea discharges and sea currents contribute dilution of coliform concentration in a positive way, and locations near coastal zones of the islands have acceptable values which are required by the regulations.
Progress in Turbulence Detection via GNSS Occultation Data
NASA Technical Reports Server (NTRS)
Cornman, L. B.; Goodrich, R. K.; Axelrad, P.; Barlow, E.
2012-01-01
The increased availability of radio occultation (RO) data offers the ability to detect and study turbulence in the Earth's atmosphere. An analysis of how RO data can be used to determine the strength and location of turbulent regions is presented. This includes the derivation of a model for the power spectrum of the log-amplitude and phase fluctuations of the permittivity (or index of refraction) field. The bulk of the paper is then concerned with the estimation of the model parameters. Parameter estimators are introduced and some of their statistical properties are studied. These estimators are then applied to simulated log-amplitude RO signals. This includes the analysis of global statistics derived from a large number of realizations, as well as case studies that illustrate various specific aspects of the problem. Improvements to the basic estimation methods are discussed, and their beneficial properties are illustrated. The estimation techniques are then applied to real occultation data. Only two cases are presented, but they illustrate some of the salient features inherent in real data.
NASA Technical Reports Server (NTRS)
He, Yuning
2015-01-01
The behavior of complex aerospace systems is governed by numerous parameters. For safety analysis it is important to understand how the system behaves with respect to these parameter values. In particular, understanding the boundaries between safe and unsafe regions is of major importance. In this paper, we describe a hierarchical Bayesian statistical modeling approach for the online detection and characterization of such boundaries. Our method for classification with active learning uses a particle filter-based model and a boundary-aware metric for best performance. From a library of candidate shapes incorporated with domain expert knowledge, the location and parameters of the boundaries are estimated using advanced Bayesian modeling techniques. The results of our boundary analysis are then provided in a form understandable by the domain expert. We illustrate our approach using a simulation model of a NASA neuro-adaptive flight control system, as well as a system for the detection of separation violations in the terminal airspace.
Mapping probabilities of extreme continental water storage changes from space gravimetry
NASA Astrophysics Data System (ADS)
Kusche, J.; Eicker, A.; Forootan, E.; Springer, A.; Longuevergne, L.
2016-08-01
Using data from the Gravity Recovery And Climate Experiment (GRACE) mission, we derive statistically robust "hot spot" regions of high probability of peak anomalous—i.e., with respect to the seasonal cycle—water storage (of up to 0.7 m one-in-five-year return level) and flux (up to 0.14 m/month). Analysis of, and comparison with, up to 32 years of ERA-Interim reanalysis fields reveals generally good agreement of these hot spot regions to GRACE results and that most exceptions are located in the tropics. However, a simulation experiment reveals that differences observed by GRACE are statistically significant, and further error analysis suggests that by around the year 2020, it will be possible to detect temporal changes in the frequency of extreme total fluxes (i.e., combined effects of mainly precipitation and floods) for at least 10-20% of the continental area, assuming that we have a continuation of GRACE by its follow-up GRACE Follow-On (GRACE-FO) mission.
NASA Astrophysics Data System (ADS)
Alahmadi, F.; Rahman, N. A.; Abdulrazzak, M.
2014-09-01
Rainfall frequency analysis is an essential tool for the design of water related infrastructure. It can be used to predict future flood magnitudes for a given magnitude and frequency of extreme rainfall events. This study analyses the application of rainfall partial duration series (PDS) in the vast growing urban Madinah city located in the western part of Saudi Arabia. Different statistical distributions were applied (i.e. Normal, Log Normal, Extreme Value type I, Generalized Extreme Value, Pearson Type III, Log Pearson Type III) and their distribution parameters were estimated using L-moments methods. Also, different selection criteria models are applied, e.g. Akaike Information Criterion (AIC), Corrected Akaike Information Criterion (AICc), Bayesian Information Criterion (BIC) and Anderson-Darling Criterion (ADC). The analysis indicated the advantage of Generalized Extreme Value as the best fit statistical distribution for Madinah partial duration daily rainfall series. The outcome of such an evaluation can contribute toward better design criteria for flood management, especially flood protection measures.
Effect of environment and genotype on commercial maize hybrids using LC/MS-based metabolomics.
Baniasadi, Hamid; Vlahakis, Chris; Hazebroek, Jan; Zhong, Cathy; Asiago, Vincent
2014-02-12
We recently applied gas chromatography coupled to time-of-flight mass spectrometry (GC/TOF-MS) and multivariate statistical analysis to measure biological variation of many metabolites due to environment and genotype in forage and grain samples collected from 50 genetically diverse nongenetically modified (non-GM) DuPont Pioneer commercial maize hybrids grown at six North American locations. In the present study, the metabolome coverage was extended using a core subset of these grain and forage samples employing ultra high pressure liquid chromatography (uHPLC) mass spectrometry (LC/MS). A total of 286 and 857 metabolites were detected in grain and forage samples, respectively, using LC/MS. Multivariate statistical analysis was utilized to compare and correlate the metabolite profiles. Environment had a greater effect on the metabolome than genetic background. The results of this study support and extend previously published insights into the environmental and genetic associated perturbations to the metabolome that are not associated with transgenic modification.
NASA Technical Reports Server (NTRS)
Manning, Robert M.
1996-01-01
The purpose of the propagation studies within the ACTS Project Office is to acquire 20 and 30 GHz rain fade statistics using the ACTS beacon links received at the NGS (NASA Ground Station) in Cleveland. Other than the raw, statistically unprocessed rain fade events that occur in real time, relevant rain fade statistics derived from such events are the cumulative rain fade statistics as well as fade duration statistics (beyond given fade thresholds) over monthly and yearly time intervals. Concurrent with the data logging exercise, monthly maximum rainfall levels recorded at the US Weather Service at Hopkins Airport are appended to the database to facilitate comparison of observed fade statistics with those predicted by the ACTS Rain Attenuation Model. Also, the raw fade data will be in a format, complete with documentation, for use by other investigators who require realistic fade event evolution in time for simulation purposes or further analysis for comparisons with other rain fade prediction models, etc. The raw time series data from the 20 and 30 GHz beacon signals is purged of non relevant data intervals where no rain fading has occurred. All other data intervals which contain rain fade events are archived with the accompanying time stamps. The definition of just what constitutes a rain fade event will be discussed later. The archived data serves two purposes. First, all rain fade event data is recombined into a contiguous data series every month and every year; this will represent an uninterrupted record of the actual (i.e., not statistically processed) temporal evolution of rain fade at 20 and 30 GHz at the location of the NGS. The second purpose of the data in such a format is to enable a statistical analysis of prevailing propagation parameters such as cumulative distributions of attenuation on a monthly and yearly basis as well as fade duration probabilities below given fade thresholds, also on a monthly and yearly basis. In addition, various subsidiary statistics such as attenuation rate probabilities are derived. The purged raw rain fade data as well as the results of the analyzed data will be made available for use by parties in the private sector upon their request. The process which will be followed in this dissemination is outlined in this paper.
Jácome, Gabriel; Valarezo, Carla; Yoo, Changkyoo
2018-03-30
Pollution and the eutrophication process are increasing in lake Yahuarcocha and constant water quality monitoring is essential for a better understanding of the patterns occurring in this ecosystem. In this study, key sensor locations were determined using spatial and temporal analyses combined with geographical information systems (GIS) to assess the influence of weather features, anthropogenic activities, and other non-point pollution sources. A water quality monitoring network was established to obtain data on 14 physicochemical and microbiological parameters at each of seven sample sites over a period of 13 months. A spatial and temporal statistical approach using pattern recognition techniques, such as cluster analysis (CA) and discriminant analysis (DA), was employed to classify and identify the most important water quality parameters in the lake. The original monitoring network was reduced to four optimal sensor locations based on a fuzzy overlay of the interpolations of concentration variations of the most important parameters.
Critically evaluating the theory and performance of Bayesian analysis of macroevolutionary mixtures
Moore, Brian R.; Höhna, Sebastian; May, Michael R.; Rannala, Bruce; Huelsenbeck, John P.
2016-01-01
Bayesian analysis of macroevolutionary mixtures (BAMM) has recently taken the study of lineage diversification by storm. BAMM estimates the diversification-rate parameters (speciation and extinction) for every branch of a study phylogeny and infers the number and location of diversification-rate shifts across branches of a tree. Our evaluation of BAMM reveals two major theoretical errors: (i) the likelihood function (which estimates the model parameters from the data) is incorrect, and (ii) the compound Poisson process prior model (which describes the prior distribution of diversification-rate shifts across branches) is incoherent. Using simulation, we demonstrate that these theoretical issues cause statistical pathologies; posterior estimates of the number of diversification-rate shifts are strongly influenced by the assumed prior, and estimates of diversification-rate parameters are unreliable. Moreover, the inability to correctly compute the likelihood or to correctly specify the prior for rate-variable trees precludes the use of Bayesian approaches for testing hypotheses regarding the number and location of diversification-rate shifts using BAMM. PMID:27512038
Optimizing human activity patterns using global sensitivity analysis.
Fairchild, Geoffrey; Hickmann, Kyle S; Mniszewski, Susan M; Del Valle, Sara Y; Hyman, James M
2014-12-01
Implementing realistic activity patterns for a population is crucial for modeling, for example, disease spread, supply and demand, and disaster response. Using the dynamic activity simulation engine, DASim, we generate schedules for a population that capture regular (e.g., working, eating, and sleeping) and irregular activities (e.g., shopping or going to the doctor). We use the sample entropy (SampEn) statistic to quantify a schedule's regularity for a population. We show how to tune an activity's regularity by adjusting SampEn, thereby making it possible to realistically design activities when creating a schedule. The tuning process sets up a computationally intractable high-dimensional optimization problem. To reduce the computational demand, we use Bayesian Gaussian process regression to compute global sensitivity indices and identify the parameters that have the greatest effect on the variance of SampEn. We use the harmony search (HS) global optimization algorithm to locate global optima. Our results show that HS combined with global sensitivity analysis can efficiently tune the SampEn statistic with few search iterations. We demonstrate how global sensitivity analysis can guide statistical emulation and global optimization algorithms to efficiently tune activities and generate realistic activity patterns. Though our tuning methods are applied to dynamic activity schedule generation, they are general and represent a significant step in the direction of automated tuning and optimization of high-dimensional computer simulations.
Optimizing human activity patterns using global sensitivity analysis
Hickmann, Kyle S.; Mniszewski, Susan M.; Del Valle, Sara Y.; Hyman, James M.
2014-01-01
Implementing realistic activity patterns for a population is crucial for modeling, for example, disease spread, supply and demand, and disaster response. Using the dynamic activity simulation engine, DASim, we generate schedules for a population that capture regular (e.g., working, eating, and sleeping) and irregular activities (e.g., shopping or going to the doctor). We use the sample entropy (SampEn) statistic to quantify a schedule’s regularity for a population. We show how to tune an activity’s regularity by adjusting SampEn, thereby making it possible to realistically design activities when creating a schedule. The tuning process sets up a computationally intractable high-dimensional optimization problem. To reduce the computational demand, we use Bayesian Gaussian process regression to compute global sensitivity indices and identify the parameters that have the greatest effect on the variance of SampEn. We use the harmony search (HS) global optimization algorithm to locate global optima. Our results show that HS combined with global sensitivity analysis can efficiently tune the SampEn statistic with few search iterations. We demonstrate how global sensitivity analysis can guide statistical emulation and global optimization algorithms to efficiently tune activities and generate realistic activity patterns. Though our tuning methods are applied to dynamic activity schedule generation, they are general and represent a significant step in the direction of automated tuning and optimization of high-dimensional computer simulations. PMID:25580080
Optimizing human activity patterns using global sensitivity analysis
Fairchild, Geoffrey; Hickmann, Kyle S.; Mniszewski, Susan M.; ...
2013-12-10
Implementing realistic activity patterns for a population is crucial for modeling, for example, disease spread, supply and demand, and disaster response. Using the dynamic activity simulation engine, DASim, we generate schedules for a population that capture regular (e.g., working, eating, and sleeping) and irregular activities (e.g., shopping or going to the doctor). We use the sample entropy (SampEn) statistic to quantify a schedule’s regularity for a population. We show how to tune an activity’s regularity by adjusting SampEn, thereby making it possible to realistically design activities when creating a schedule. The tuning process sets up a computationally intractable high-dimensional optimizationmore » problem. To reduce the computational demand, we use Bayesian Gaussian process regression to compute global sensitivity indices and identify the parameters that have the greatest effect on the variance of SampEn. Here we use the harmony search (HS) global optimization algorithm to locate global optima. Our results show that HS combined with global sensitivity analysis can efficiently tune the SampEn statistic with few search iterations. We demonstrate how global sensitivity analysis can guide statistical emulation and global optimization algorithms to efficiently tune activities and generate realistic activity patterns. Finally, though our tuning methods are applied to dynamic activity schedule generation, they are general and represent a significant step in the direction of automated tuning and optimization of high-dimensional computer simulations.« less
Pugh, Aaron L.
2014-01-01
Users of streamflow information often require streamflow statistics and basin characteristics at various locations along a stream. The USGS periodically calculates and publishes streamflow statistics and basin characteristics for streamflowgaging stations and partial-record stations, but these data commonly are scattered among many reports that may or may not be readily available to the public. The USGS also provides and periodically updates regional analyses of streamflow statistics that include regression equations and other prediction methods for estimating statistics for ungaged and unregulated streams across the State. Use of these regional predictions for a stream can be complex and often requires the user to determine a number of basin characteristics that may require interpretation. Basin characteristics may include drainage area, classifiers for physical properties, climatic characteristics, and other inputs. Obtaining these input values for gaged and ungaged locations traditionally has been time consuming, subjective, and can lead to inconsistent results.
User’s guide for the Delaware River Basin Streamflow Estimator Tool (DRB-SET)
Stuckey, Marla H.; Ulrich, James E.
2016-06-09
IntroductionThe Delaware River Basin Streamflow Estimator Tool (DRB-SET) is a tool for the simulation of streamflow at a daily time step for an ungaged stream location in the Delaware River Basin. DRB-SET was developed by the U.S. Geological Survey (USGS) and funded through WaterSMART as part of the National Water Census, a USGS research program on national water availability and use that develops new water accounting tools and assesses water availability at the regional and national scales. DRB-SET relates probability exceedances at a gaged location to those at an ungaged stream location. Once the ungaged stream location has been identified by the user, an appropriate streamgage is automatically selected in DRB-SET using streamflow correlation (map correlation method). Alternately, the user can manually select a different streamgage or use the closest streamgage. A report file is generated documenting the reference streamgage and ungaged stream location information, basin characteristics, any warnings, baseline (minimally altered) and altered (affected by regulation, diversion, mining, or other anthropogenic activities) daily mean streamflow, and the mean and median streamflow. The estimated daily flows for the ungaged stream location can be easily exported as a text file that can be used as input into a statistical software package to determine additional streamflow statistics, such as flow duration exceedance or streamflow frequency statistics.
RADSS: an integration of GIS, spatial statistics, and network service for regional data mining
NASA Astrophysics Data System (ADS)
Hu, Haitang; Bao, Shuming; Lin, Hui; Zhu, Qing
2005-10-01
Regional data mining, which aims at the discovery of knowledge about spatial patterns, clusters or association between regions, has widely applications nowadays in social science, such as sociology, economics, epidemiology, crime, and so on. Many applications in the regional or other social sciences are more concerned with the spatial relationship, rather than the precise geographical location. Based on the spatial continuity rule derived from Tobler's first law of geography: observations at two sites tend to be more similar to each other if the sites are close together than if far apart, spatial statistics, as an important means for spatial data mining, allow the users to extract the interesting and useful information like spatial pattern, spatial structure, spatial association, spatial outlier and spatial interaction, from the vast amount of spatial data or non-spatial data. Therefore, by integrating with the spatial statistical methods, the geographical information systems will become more powerful in gaining further insights into the nature of spatial structure of regional system, and help the researchers to be more careful when selecting appropriate models. However, the lack of such tools holds back the application of spatial data analysis techniques and development of new methods and models (e.g., spatio-temporal models). Herein, we make an attempt to develop such an integrated software and apply it into the complex system analysis for the Poyang Lake Basin. This paper presents a framework for integrating GIS, spatial statistics and network service in regional data mining, as well as their implementation. After discussing the spatial statistics methods involved in regional complex system analysis, we introduce RADSS (Regional Analysis and Decision Support System), our new regional data mining tool, by integrating GIS, spatial statistics and network service. RADSS includes the functions of spatial data visualization, exploratory spatial data analysis, and spatial statistics. The tool also includes some fundamental spatial and non-spatial database in regional population and environment, which can be updated by external database via CD or network. Utilizing this data mining and exploratory analytical tool, the users can easily and quickly analyse the huge mount of the interrelated regional data, and better understand the spatial patterns and trends of the regional development, so as to make a credible and scientific decision. Moreover, it can be used as an educational tool for spatial data analysis and environmental studies. In this paper, we also present a case study on Poyang Lake Basin as an application of the tool and spatial data mining in complex environmental studies. At last, several concluding remarks are discussed.
Turbulence Statistics of a Buoyant Jet in a Stratified Environment
NASA Astrophysics Data System (ADS)
McCleney, Amy Brooke
Using non-intrusive optical diagnostics, turbulence statistics for a round, incompressible, buoyant, and vertical jet discharging freely into a stably linear stratified environment is studied and compared to a reference case of a neutrally buoyant jet in a uniform environment. This is part of a validation campaign for computational fluid dynamics (CFD). Buoyancy forces are known to significantly affect the jet evolution in a stratified environment. Despite their ubiquity in numerous natural and man-made flows, available data in these jets are limited, which constrain our understanding of the underlying physical processes. In particular, there is a dearth of velocity field data, which makes it challenging to validate numerical codes, currently used for modeling these important flows. Herein, jet near- and far-field behaviors are obtained with a combination of planar laser induced fluorescence (PLIF) and multi-scale time-resolved particle image velocimetry (TR-PIV) for Reynolds number up to 20,000. Deploying non-intrusive optical diagnostics in a variable density environment is challenging in liquids. The refractive index is strongly affected by the density, which introduces optical aberrations and occlusions that prevent the resolution of the flow. One solution consists of using index matched fluids with different densities. Here a pair of water solutions - isopropanol and NaCl - are identified that satisfy these requirements. In fact, they provide a density difference up to 5%, which is the largest reported for such fluid pairs. Additionally, by design, the kinematic viscosities of the solutions are identical. This greatly simplifies the analysis and subsequent simulations of the data. The spectral and temperature dependence of the solutions are fully characterized. In the near-field, shear layer roll-up is analyzed and characterized as a function of initial velocity profile. In the far-field, turbulence statistics are reported for two different scales, one capturing the entire jet at near Taylor microscale resolution, and the other, thanks to the careful refractive index matching of the liquids, resolving the Taylor scale at near Kolmogorov scale resolution. This is accomplished using a combination of TR-PIV and long-distance micro-PIV. The turbulence statistics obtained at various downstream locations and magnifications are obtained for density differences of 0%, 1%, and 3%. To validate the experimental methodology and provide a reference case for validation, the effect of initial velocity profile on the neutrally buoyant jet in the self-preserving regime is studied at two Reynolds numbers of 10,000 and 20,000. For the neutrally buoyant jet, it is found that independent of initial conditions the jet follows a self-similar behavior in the far-field; however, the spreading rate is strongly dependent on initial velocity profile. High magnification analysis at the small turbulent length scales shows a flow field where the mean statistics compare well to the larger field of view case. Investigation of the near-field shows the jet is strongly influenced by buoyancy, where an increase in vortex ring formation frequency and number of pairings occur. The buoyant jet with a 1% density difference shows an alteration of the centerline velocity decay, but the radial distribution of the mean axial velocity collapses well at all measurement locations. Jet formation dramatically changes for a buoyant jet with a 3% density difference, where the jet reaches a terminal height and spreads out horizontally at its neutral buoyancy location. Analysis of both the mean axial velocity and strain rates show the jet is no longer self-similar; for example, the mean centerline velocity does not decay uniformly as the jet develops. The centerline strain rates at this density difference also show trends which are strongly influenced by the altered centerline velocity. The overall centerline analysis shows that turbulence suppression occurs as a result of the stratification for both the 1% and 3% density difference. Analysis on the kinetic energy budget shows that the mean convection, production, transportation, and dissipation of energy is altered from stratification. High resolution data of the jet enable flow structures to be captured in the neutrally buoyant region of the flow. Vortices of different sizes are identified. Longer data sets are necessary to perform a statistical analysis of their distribution and to compare them to homogeneous environment case. This multi-scale analysis shows potential for studying energy transfer between length scales.
Obi, James; Ibidunni, Ayodotun Stephen; Tolulope, Atolagbe; Olokundun, Maxwell Ayodele; Amaihian, Augusta Bosede; Borishade, Taiye Tairat; Fred, Peter
2018-06-01
The focus of this research was to present a data article on the contribution of SMEs to economic development in a transiting economy. Descriptive research design was adopted in this study. Data were obtained from 600 respondents in 60 small-scale enterprises located in different parts of the country (20 small-scale enterprises located in Lagos State, 20 in Anambra State and 20 in Kano State of Nigeria respectively). Data analysis was carried out using tables and percentages and the null hypotheses of the study was tested using chi-square ( X 2 ) inferential statistical model at 5% level of significance. The findings revealed that there is a significant relationship between the operation of small and medium-scale enterprises and economic growth in developing nations.
The persistent signature of tropical cyclones in ambient seismic noise
NASA Astrophysics Data System (ADS)
Gualtieri, Lucia; Camargo, Suzana J.; Pascale, Salvatore; Pons, Flavio M. E.; Ekström, Göran
2018-02-01
The spectrum of ambient seismic noise shows strong signals associated with tropical cyclones, yet a detailed understanding of these signals and the relationship between them and the storms is currently lacking. Through the analysis of more than a decade of seismic data recorded at several stations located in and adjacent to the northwest Pacific Ocean, here we show that there is a persistent and frequency-dependent signature of tropical cyclones in ambient seismic noise that depends on characteristics of the storm and on the detailed location of the station relative to the storm. An adaptive statistical model shows that the spectral amplitude of ambient seismic noise, and notably of the short-period secondary microseisms, has a strong relationship with tropical cyclone intensity and can be employed to extract information on the tropical cyclones.
Estimating procedure times for surgeries by determining location parameters for the lognormal model.
Spangler, William E; Strum, David P; Vargas, Luis G; May, Jerrold H
2004-05-01
We present an empirical study of methods for estimating the location parameter of the lognormal distribution. Our results identify the best order statistic to use, and indicate that using the best order statistic instead of the median may lead to less frequent incorrect rejection of the lognormal model, more accurate critical value estimates, and higher goodness-of-fit. Using simulation data, we constructed and compared two models for identifying the best order statistic, one based on conventional nonlinear regression and the other using a data mining/machine learning technique. Better surgical procedure time estimates may lead to improved surgical operations.
NASA Astrophysics Data System (ADS)
Brereton, Carol A.; Johnson, Matthew R.
2012-05-01
Fugitive pollutant sources from the oil and gas industry are typically quite difficult to find within industrial plants and refineries, yet they are a significant contributor of global greenhouse gas emissions. A novel approach for locating fugitive emission sources using computationally efficient trajectory statistical methods (TSM) has been investigated in detailed proof-of-concept simulations. Four TSMs were examined in a variety of source emissions scenarios developed using transient CFD simulations on the simplified geometry of an actual gas plant: potential source contribution function (PSCF), concentration weighted trajectory (CWT), residence time weighted concentration (RTWC), and quantitative transport bias analysis (QTBA). Quantitative comparisons were made using a correlation measure based on search area from the source(s). PSCF, CWT and RTWC could all distinguish areas near major sources from the surroundings. QTBA successfully located sources in only some cases, even when provided with a large data set. RTWC, given sufficient domain trajectory coverage, distinguished source areas best, but otherwise could produce false source predictions. Using RTWC in conjunction with CWT could overcome this issue as well as reduce sensitivity to noise in the data. The results demonstrate that TSMs are a promising approach for identifying fugitive emissions sources within complex facility geometries.
Prostate segmentation in MR images using discriminant boundary features.
Yang, Meijuan; Li, Xuelong; Turkbey, Baris; Choyke, Peter L; Yan, Pingkun
2013-02-01
Segmentation of the prostate in magnetic resonance image has become more in need for its assistance to diagnosis and surgical planning of prostate carcinoma. Due to the natural variability of anatomical structures, statistical shape model has been widely applied in medical image segmentation. Robust and distinctive local features are critical for statistical shape model to achieve accurate segmentation results. The scale invariant feature transformation (SIFT) has been employed to capture the information of the local patch surrounding the boundary. However, when SIFT feature being used for segmentation, the scale and variance are not specified with the location of the point of interest. To deal with it, the discriminant analysis in machine learning is introduced to measure the distinctiveness of the learned SIFT features for each landmark directly and to make the scale and variance adaptive to the locations. As the gray values and gradients vary significantly over the boundary of the prostate, separate appearance descriptors are built for each landmark and then optimized. After that, a two stage coarse-to-fine segmentation approach is carried out by incorporating the local shape variations. Finally, the experiments on prostate segmentation from MR image are conducted to verify the efficiency of the proposed algorithms.
Measuring patients' satisfaction with pharmaceutical services at a public hospital in Qatar.
Khudair, Imran Fahmi; Raza, Syed Asif
2013-01-01
The aim of this paper is to study pharmacy service impact on patient satisfaction and to determine what factors saliently link with pharmaceutical service performance at Hamad General Hospital. A patient satisfaction questionnaire was designed using the literature and consultation with Hamad General Hospital medical experts. The questionnaire contained 22 items that focused on five influencing factors: promptness; attitude; supply; location; medication education; and respondent demographic aspects. A total of 220 respondents completed the questionnaire. An exploratory factor analysis was used to group items and a structural equation model was developed to test causality between five factors along with their influence on patient satisfaction. The study establishes statistical evidence that patient satisfaction is positively influenced by service promptness, pharmacist attitude, medication counseling, pharmacy location and waiting area. Several socio-demographic characteristics have statistically different effect on satisfaction, notably: gender; marital status; health status; age; educational level; and ethnicity. However, medication supply did not influence patient satisfaction. Pharmaceutical services are recognized as an essential healthcare-system component. Their impact on customer satisfaction has been investigated in many countries; however, there is no such study in Qatar. The findings identify pharmaceutical service performance indicators and provide guidelines to improve Qatari pharmaceutical services.
Huang, Shuguang; Yeo, Adeline A; Li, Shuyu Dan
2007-10-01
The Kolmogorov-Smirnov (K-S) test is a statistical method often used for comparing two distributions. In high-throughput screening (HTS) studies, such distributions usually arise from the phenotype of independent cell populations. However, the K-S test has been criticized for being overly sensitive in applications, and it often detects a statistically significant difference that is not biologically meaningful. One major reason is that there is a common phenomenon in HTS studies that systematic drifting exists among the distributions due to reasons such as instrument variation, plate edge effect, accidental difference in sample handling, etc. In particular, in high-content cellular imaging experiments, the location shift could be dramatic since some compounds themselves are fluorescent. This oversensitivity of the K-S test is particularly overpowered in cellular assays where the sample sizes are very big (usually several thousands). In this paper, a modified K-S test is proposed to deal with the nonspecific location-shift problem in HTS studies. Specifically, we propose that the distributions are "normalized" by density curve alignment before the K-S test is conducted. In applications to simulation data and real experimental data, the results show that the proposed method has improved specificity.
Fazenda, Bruno; Scarre, Chris; Till, Rupert; Pasalodos, Raquel Jiménez; Guerra, Manuel Rojo; Tejedor, Cristina; Peredo, Roberto Ontañón; Watson, Aaron; Wyatt, Simon; Benito, Carlos García; Drinkall, Helen; Foulds, Frederick
2017-09-01
During the 1980 s, acoustic studies of Upper Palaeolithic imagery in French caves-using the technology then available-suggested a relationship between acoustic response and the location of visual motifs. This paper presents an investigation, using modern acoustic measurement techniques, into such relationships within the caves of La Garma, Las Chimeneas, La Pasiega, El Castillo, and Tito Bustillo in Northern Spain. It addresses methodological issues concerning acoustic measurement at enclosed archaeological sites and outlines a general framework for extraction of acoustic features that may be used to support archaeological hypotheses. The analysis explores possible associations between the position of visual motifs (which may be up to 40 000 yrs old) and localized acoustic responses. Results suggest that motifs, in general, and lines and dots, in particular, are statistically more likely to be found in places where reverberation is moderate and where the low frequency acoustic response has evidence of resonant behavior. The work presented suggests that an association of the location of Palaeolithic motifs with acoustic features is a statistically weak but tenable hypothesis, and that an appreciation of sound could have influenced behavior among Palaeolithic societies of this region.
Non-localization and localization ROC analyses using clinically based scoring
NASA Astrophysics Data System (ADS)
Paquerault, Sophie; Samuelson, Frank W.; Myers, Kyle J.; Smith, Robert C.
2009-02-01
We are investigating the potential for differences in study conclusions when assessing the estimated impact of a computer-aided detection (CAD) system on readers' performance. The data utilized in this investigation were derived from a multi-reader multi-case observer study involving one hundred mammographic background images to which fixed-size and fixed-intensity Gaussian signals were added, generating a low- and high-intensity signal sets. The study setting allowed CAD assessment in two situations: when CAD sensitivity was 1) superior or 2) lower than the average reader. Seven readers were asked to review each set in the unaided and CAD-aided reading modes, mark and rate their findings. Using this data, we studied the effect on study conclusion of three clinically-based receiver operating characteristic (ROC) scoring definitions. These scoring definitions included both location-specific and non-location-specific rules. The results showed agreement in the estimated impact of CAD on the overall reader performance. In the study setting where CAD sensitivity is superior to the average reader, the mean difference in AUC between the CAD-aided read and unaided read was 0.049 (95%CIs: -0.027; 0.130) for the image scoring definition that is based on non-location-specific rules, and 0.104 (95%CIs: 0.036; 0.174) and 0.090 (95%CIs: 0.031; 0.155) for image scoring definitions that are based on location-specific rules. The increases in AUC were statistically significant for the location-specific scoring definitions. It was further observed that the variance on these estimates was reduced when using the location-specific scoring definitions compared to that using a non-location-specific scoring definition. In the study setting where CAD sensitivity is equivalent or lower than the average reader, the mean differences in AUC are slightly above 0.01 for all image scoring definitions. These increases in AUC were not statistical significant for any of the image scoring definitions. The results on the variance analysis differed from those observed in the other study setting. This investigation furthers our understanding of the relationships between non-localization-specific and localization-specific ROC assessment methodologies and their relevance to clinical practice.
Gridded Calibration of Ensemble Wind Vector Forecasts Using Ensemble Model Output Statistics
NASA Astrophysics Data System (ADS)
Lazarus, S. M.; Holman, B. P.; Splitt, M. E.
2017-12-01
A computationally efficient method is developed that performs gridded post processing of ensemble wind vector forecasts. An expansive set of idealized WRF model simulations are generated to provide physically consistent high resolution winds over a coastal domain characterized by an intricate land / water mask. Ensemble model output statistics (EMOS) is used to calibrate the ensemble wind vector forecasts at observation locations. The local EMOS predictive parameters (mean and variance) are then spread throughout the grid utilizing flow-dependent statistical relationships extracted from the downscaled WRF winds. Using data withdrawal and 28 east central Florida stations, the method is applied to one year of 24 h wind forecasts from the Global Ensemble Forecast System (GEFS). Compared to the raw GEFS, the approach improves both the deterministic and probabilistic forecast skill. Analysis of multivariate rank histograms indicate the post processed forecasts are calibrated. Two downscaling case studies are presented, a quiescent easterly flow event and a frontal passage. Strengths and weaknesses of the approach are presented and discussed.
Bruni, Aline Thaís; Velho, Jesus Antonio; Ferreira, Arthur Serra Lopes; Tasso, Maria Júlia; Ferrari, Raíssa Santos; Yoshida, Ricardo Luís; Dias, Marcos Salvador; Leite, Vitor Barbanti Pereira
2014-08-01
This study uses statistical techniques to evaluate reports on suicide scenes; it utilizes 80 reports from different locations in Brazil, randomly collected from both federal and state jurisdictions. We aimed to assess a heterogeneous group of cases in order to obtain an overall perspective of the problem. We evaluated variables regarding the characteristics of the crime scene, such as the detected traces (blood, instruments and clothes) that were found and we addressed the methodology employed by the experts. A qualitative approach using basic statistics revealed a wide distribution as to how the issue was addressed in the documents. We examined a quantitative approach involving an empirical equation and we used multivariate procedures to validate the quantitative methodology proposed for this empirical equation. The methodology successfully identified the main differences in the information presented in the reports, showing that there is no standardized method of analyzing evidences. Copyright © 2014 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
Topological Cacti: Visualizing Contour-based Statistics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weber, Gunther H.; Bremer, Peer-Timo; Pascucci, Valerio
2011-05-26
Contours, the connected components of level sets, play an important role in understanding the global structure of a scalar field. In particular their nestingbehavior and topology-often represented in form of a contour tree-have been used extensively for visualization and analysis. However, traditional contour trees onlyencode structural properties like number of contours or the nesting of contours, but little quantitative information such as volume or other statistics. Here we use thesegmentation implied by a contour tree to compute a large number of per-contour (interval) based statistics of both the function defining the contour tree as well asother co-located functions. We introducemore » a new visual metaphor for contour trees, called topological cacti, that extends the traditional toporrery display of acontour tree to display additional quantitative information as width of the cactus trunk and length of its spikes. We apply the new technique to scalar fields ofvarying dimension and different measures to demonstrate the effectiveness of the approach.« less
Non-arbitrage in financial markets: A Bayesian approach for verification
NASA Astrophysics Data System (ADS)
Cerezetti, F. V.; Stern, Julio Michael
2012-10-01
The concept of non-arbitrage plays an essential role in finance theory. Under certain regularity conditions, the Fundamental Theorem of Asset Pricing states that, in non-arbitrage markets, prices of financial instruments are martingale processes. In this theoretical framework, the analysis of the statistical distributions of financial assets can assist in understanding how participants behave in the markets, and may or may not engender arbitrage conditions. Assuming an underlying Variance Gamma statistical model, this study aims to test, using the FBST - Full Bayesian Significance Test, if there is a relevant price difference between essentially the same financial asset traded at two distinct locations. Specifically, we investigate and compare the behavior of call options on the BOVESPA Index traded at (a) the Equities Segment and (b) the Derivatives Segment of BM&FBovespa. Our results seem to point out significant statistical differences. To what extent this evidence is actually the expression of perennial arbitrage opportunities is still an open question.
NASA Astrophysics Data System (ADS)
Thomas, J. N.; Huard, J.; Masci, F.
2017-02-01
There are many reports on the occurrence of anomalous changes in the ionosphere prior to large earthquakes. However, whether or not these changes are reliable precursors that could be useful for earthquake prediction is controversial within the scientific community. To test a possible statistical relationship between ionospheric disturbances and earthquakes, we compare changes in the total electron content (TEC) of the ionosphere with occurrences of M ≥ 6.0 earthquakes globally for 2000-2014. We use TEC data from the global ionosphere map (GIM) and an earthquake list declustered for aftershocks. For each earthquake, we look for anomalous changes in GIM-TEC within 2.5° latitude and 5.0° longitude of the earthquake location (the spatial resolution of GIM-TEC). Our analysis has not found any statistically significant changes in GIM-TEC prior to earthquakes. Thus, we have found no evidence that would suggest that monitoring changes in GIM-TEC might be useful for predicting earthquakes.
Direct statistical modeling and its implications for predictive mapping in mining exploration
NASA Astrophysics Data System (ADS)
Sterligov, Boris; Gumiaux, Charles; Barbanson, Luc; Chen, Yan; Cassard, Daniel; Cherkasov, Sergey; Zolotaya, Ludmila
2010-05-01
Recent advances in geosciences make more and more multidisciplinary data available for mining exploration. This allowed developing methodologies for computing forecast ore maps from the statistical combination of such different input parameters, all based on an inverse problem theory. Numerous statistical methods (e.g. algebraic method, weight of evidence, Siris method, etc) with varying degrees of complexity in their development and implementation, have been proposed and/or adapted for ore geology purposes. In literature, such approaches are often presented through applications on natural examples and the results obtained can present specificities due to local characteristics. Moreover, though crucial for statistical computations, "minimum requirements" needed for input parameters (number of minimum data points, spatial distribution of objects, etc) are often only poorly expressed. From these, problems often arise when one has to choose between one and the other method for her/his specific question. In this study, a direct statistical modeling approach is developed in order to i) evaluate the constraints on the input parameters and ii) test the validity of different existing inversion methods. The approach particularly focused on the analysis of spatial relationships between location of points and various objects (e.g. polygons and /or polylines) which is particularly well adapted to constrain the influence of intrusive bodies - such as a granite - and faults or ductile shear-zones on spatial location of ore deposits (point objects). The method is designed in a way to insure a-dimensionality with respect to scale. In this approach, both spatial distribution and topology of objects (polygons and polylines) can be parametrized by the user (e.g. density of objects, length, surface, orientation, clustering). Then, the distance of points with respect to a given type of objects (polygons or polylines) is given using a probability distribution. The location of points is computed assuming either independency or different grades of dependency between the two probability distributions. The results show that i)polygons surface mean value, polylines length mean value, the number of objects and their clustering are critical and ii) the validity of the different tested inversion methods strongly depends on the relative importance and on the dependency between the parameters used. In addition, this combined approach of direct and inverse modeling offers an opportunity to test the robustness of the inferred distribution point laws with respect to the quality of the input data set.
A study protocol to evaluate the relationship between outdoor air pollution and pregnancy outcomes
2010-01-01
Background The present study protocol is designed to assess the relationship between outdoor air pollution and low birth weight and preterm births outcomes performing a semi-ecological analysis. Semi-ecological design studies are widely used to assess effects of air pollution in humans. In this type of analysis, health outcomes and covariates are measured in individuals and exposure assignments are usually based on air quality monitor stations. Therefore, estimating individual exposures are one of the major challenges when investigating these relationships with a semi-ecologic design. Methods/Design Semi-ecologic study consisting of a retrospective cohort study with ecologic assignment of exposure is applied. Health outcomes and covariates are collected at Primary Health Care Center. Data from pregnant registry, clinical record and specific questionnaire administered orally to the mothers of children born in period 2007-2010 in Portuguese Alentejo Litoral region, are collected by the research team. Outdoor air pollution data are collected with a lichen diversity biomonitoring program, and individual pregnancy exposures are assessed with spatial geostatistical simulation, which provides the basis for uncertainty analysis of individual exposures. Awareness of outdoor air pollution uncertainty will improve validity of individual exposures assignments for further statistical analysis with multivariate regression models. Discussion Exposure misclassification is an issue of concern in semi-ecological design. In this study, personal exposures are assigned to each pregnant using geocoded addresses data. A stochastic simulation method is applied to lichen diversity values index measured at biomonitoring survey locations, in order to assess spatial uncertainty of lichen diversity value index at each geocoded address. These methods assume a model for spatial autocorrelation of exposure and provide a distribution of exposures in each study location. We believe that variability of simulated exposure values at geocoded addresses will improve knowledge on variability of exposures, improving therefore validity of individual exposures to input in posterior statistical analysis. PMID:20950449
A study protocol to evaluate the relationship between outdoor air pollution and pregnancy outcomes.
Ribeiro, Manuel C; Pereira, Maria J; Soares, Amílcar; Branquinho, Cristina; Augusto, Sofia; Llop, Esteve; Fonseca, Susana; Nave, Joaquim G; Tavares, António B; Dias, Carlos M; Silva, Ana; Selemane, Ismael; de Toro, Joaquin; Santos, Mário J; Santos, Fernanda
2010-10-15
The present study protocol is designed to assess the relationship between outdoor air pollution and low birth weight and preterm births outcomes performing a semi-ecological analysis. Semi-ecological design studies are widely used to assess effects of air pollution in humans. In this type of analysis, health outcomes and covariates are measured in individuals and exposure assignments are usually based on air quality monitor stations. Therefore, estimating individual exposures are one of the major challenges when investigating these relationships with a semi-ecologic design. Semi-ecologic study consisting of a retrospective cohort study with ecologic assignment of exposure is applied. Health outcomes and covariates are collected at Primary Health Care Center. Data from pregnant registry, clinical record and specific questionnaire administered orally to the mothers of children born in period 2007-2010 in Portuguese Alentejo Litoral region, are collected by the research team. Outdoor air pollution data are collected with a lichen diversity biomonitoring program, and individual pregnancy exposures are assessed with spatial geostatistical simulation, which provides the basis for uncertainty analysis of individual exposures. Awareness of outdoor air pollution uncertainty will improve validity of individual exposures assignments for further statistical analysis with multivariate regression models. Exposure misclassification is an issue of concern in semi-ecological design. In this study, personal exposures are assigned to each pregnant using geocoded addresses data. A stochastic simulation method is applied to lichen diversity values index measured at biomonitoring survey locations, in order to assess spatial uncertainty of lichen diversity value index at each geocoded address. These methods assume a model for spatial autocorrelation of exposure and provide a distribution of exposures in each study location. We believe that variability of simulated exposure values at geocoded addresses will improve knowledge on variability of exposures, improving therefore validity of individual exposures to input in posterior statistical analysis.
NASA Technical Reports Server (NTRS)
Matney, Mark
2011-01-01
A number of statistical tools have been developed over the years for assessing the risk of reentering objects to human populations. These tools make use of the characteristics (e.g., mass, material, shape, size) of debris that are predicted by aerothermal models to survive reentry. The statistical tools use this information to compute the probability that one or more of the surviving debris might hit a person on the ground and cause one or more casualties. The statistical portion of the analysis relies on a number of assumptions about how the debris footprint and the human population are distributed in latitude and longitude, and how to use that information to arrive at realistic risk numbers. Because this information is used in making policy and engineering decisions, it is important that these assumptions be tested using empirical data. This study uses the latest database of known uncontrolled reentry locations measured by the United States Department of Defense. The predicted ground footprint distributions of these objects are based on the theory that their orbits behave basically like simple Kepler orbits. However, there are a number of factors in the final stages of reentry - including the effects of gravitational harmonics, the effects of the Earth s equatorial bulge on the atmosphere, and the rotation of the Earth and atmosphere - that could cause them to diverge from simple Kepler orbit behavior and possibly change the probability of reentering over a given location. In this paper, the measured latitude and longitude distributions of these objects are directly compared with the predicted distributions, providing a fundamental empirical test of the model assumptions.
NASA Astrophysics Data System (ADS)
Ermann, Michael; Johnson, Marty E.; Harrison, Byron W.
2002-11-01
By adding a second room to a concert hall, and designing doors to control the sonic transparency between the two rooms, designers can create a new, coupled acoustic. Concert halls use coupling to achieve a variable, longer, and distinct reverberant quality for their musicians and listeners. For this study, a coupled-volume concert hall based on an existing performing arts center is conceived and computer modeled. It has a fixed geometric volume, form, and primary-room sound absorption. Ray-tracing software simulates impulse responses, varying both aperture size and secondary-room sound-absorption level, across a grid of receiver (listener) locations. The results are compared with statistical analysis that suggests a highly sensitive relationship between the double-sloped condition and the architecture of the space. This line of study aims to quantitatively and spatially correlate the double-sloped condition with (1) aperture size exposing the chamber, (2) sound absorptance in the coupled volume, and (3) listener location.
NASA Astrophysics Data System (ADS)
Ermann, Michael; Johnson, Marty E.; Harrison, Byron W.
2003-04-01
By adding a second room to a concert hall, and designing doors to control the sonic transparency between the two rooms, designers can create a new, coupled acoustic. Concert halls use coupling to achieve a variable, longer and distinct reverberant quality for their musicians and listeners. For this study, a coupled-volume concert hall based on an existing performing arts center is conceived and computer-modeled. It has a fixed geometric volume, form and primary-room sound absorption. Ray-tracing software simulates impulse responses, varying both aperture size and secondary-room sound absorption level, across a grid of receiver (listener) locations. The results are compared with statistical analysis that suggests a highly sensitive relationship between the double-sloped condition and the architecture of the space. This line of study aims to quantitatively and spatially correlate the double-sloped condition with (1) aperture size exposing the chamber, (2) sound absorptance in the coupled volume, and (3) listener location.
Nguyen, Xuan-Vy; Tran, Minh-Hue; Le, Trong-Dung; Papenbrock, Jutta
2017-12-01
Seagrasses beds are vulnerable ecosystems. Human-induced disturbances, including heavy metal pollution, cause losses in seagrass beds. Assessment of the heavy metal concentration in seagrass meadows is an urgent need in order to protect and sustain these ecosystems. The concentration of eight trace metals in the surface sediment was observed from six seagrass beds at Khanh Hoa's coast, Vietnam. Three pollution indices and statistical analysis were used to evaluate the levels of contamination with these elements. This report on heavy metals within seagrass beds in Vietnam shows that, based on enrichment factors, only one location revealed moderately severe enrichment of Cu. Geo-accumulation indices fall in the uncontaminated class at all locations whereas for the ecological risk factor, values of Cu at My Giang and of Pb at Thuy Trieu were in a moderate risk class. Hence, two of eight locations may be exposed to high Cu and Pb.
Correlation approach to identify coding regions in DNA sequences
NASA Technical Reports Server (NTRS)
Ossadnik, S. M.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.
1994-01-01
Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.
Geo-located Twitter as proxy for global mobility patterns.
Hawelka, Bartosz; Sitko, Izabela; Beinat, Euro; Sobolevsky, Stanislav; Kazakopoulos, Pavlos; Ratti, Carlo
2014-05-27
Pervasive presence of location-sharing services made it possible for researchers to gain an unprecedented access to the direct records of human activity in space and time. This article analyses geo-located Twitter messages in order to uncover global patterns of human mobility. Based on a dataset of almost a billion tweets recorded in 2012, we estimate the volume of international travelers by country of residence. Mobility profiles of different nations were examined based on such characteristics as mobility rate, radius of gyration, diversity of destinations, and inflow-outflow balance. Temporal patterns disclose the universally valid seasons of increased international mobility and the particular character of international travels of different nations. Our analysis of the community structure of the Twitter mobility network reveals spatially cohesive regions that follow the regional division of the world. We validate our result using global tourism statistics and mobility models provided by other authors and argue that Twitter is exceptionally useful for understanding and quantifying global mobility patterns.
Uncertainty analysis for the steady-state flows in a dual throat nozzle
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Q.-Y.; Gottlieb, David; Hesthaven, Jan S.
2005-03-20
It is well known that the steady state of an isentropic flow in a dual-throat nozzle with equal throat areas is not unique. In particular there is a possibility that the flow contains a shock wave, whose location is determined solely by the initial condition. In this paper, we consider cases with uncertainty in this initial condition and use generalized polynomial chaos methods to study the steady-state solutions for stochastic initial conditions. Special interest is given to the statistics of the shock location. The polynomial chaos (PC) expansion modes are shown to be smooth functions of the spatial variable x,more » although each solution realization is discontinuous in the spatial variable x. When the variance of the initial condition is small, the probability density function of the shock location is computed with high accuracy. Otherwise, many terms are needed in the PC expansion to produce reasonable results due to the slow convergence of the PC expansion, caused by non-smoothness in random space.« less
Xiao, Hanqiong; Li, Wei; Ma, Ruixia; Gong, Zhengpeng; Shi, Haibo; Li, Huawei; Chen, Bing; Jiang, Ye; Dai, Chunfu
2015-06-01
To describe tne regional different factors which impact on early cochlear implantation in prelingual deaf children between eastern and western regions of China. The charts of 113 children who received the cochlear implantation after 24 months old were reviewed and analyzed. Forty-five of them came from the eastern region (Jiangsu, Zhejiang or Shanghai) while 68 of them came from the western region (Ningxia or Guizhou). Parental interviews were conducted to collect information regarding the factors that impact on early cochlear implantation. Result:Based on the univariate logistic regression analysis, the odds ratio (OR) value of universal newborn hearing screening (UNHS) was 5. 481, which indicated the correlation of UNHS with early cochlear implantation is significant. There was statistical difference between the 2 groups (P<0. 01). For the financial burden, the OR value was 3. 521(strong correlation) and there was statistical difference between the 2 groups (P<0. 01). For the communication barriers and community location, the OR value was 0. 566 and 1. 128 respectively, and there was no statistical difference between the 2 groups (P>0. 05). The multivariate analysis indicated that the UNHS and financial burden are statistically different between the eastern and western regions (P=0. 00 and 0. 040 respectively). The UNHS and financial burden are statistically different between the eastern reinforced in the western region. In addition, the government and society should provide powerful policy and more financial support in the western region of China. The innovation of management system is also helpful to the early cochlear implantation.
Alikari, Victoria; Sachlas, Athanasios; Giatrakou, Stavroula; Stathoulis, John; Fradelos, Evagelos; Theofilou, Paraskevi; Lavdaniti, Maria; Zyga, Sofia
2017-01-01
An important factor which influences the quality of life of patients with arthritis is the fatigue they experience. The purpose of this study was to assess the relationship between fatigue and quality of life among patients with osteoarthritis and rheumatoid arthritis. Between January 2015 and March 2015, 179 patients with osteoarthritis and rheumatoid arthritis completed the Fatigue Assessment Scale and the Missoula-VITAS Quality of Life Index-15 (MVQoLI-15). The study was conducted in Rehabilitation Centers located in the area of Peloponnese, Greece. Data related to sociodemographic characteristics and their individual medical histories were recorded. Statistical analysis was performed using the IBM SPSS Statistics version 19. The analysis did not reveal statistically significant correlation between fatigue and quality of life neither in the total sample nor among patients with osteoarthritis (r = -0.159; p = 0.126) or rheumatoid arthritis. However, there was a statistically significant relationship between some aspects of fatigue and dimensions of quality of life. Osteoarthritis patients had statistically significant lower MVQoLI-15 score than rheumatoid arthritis patients (13.73 ± 1.811 vs 14.61 ± 1.734) and lower FAS score than rheumatoid patients (26.14 ± 3.668 vs 29.94 ± 3.377) (p-value < 0.001). The finding that different aspects of fatigue may affect dimensions of quality of life may help health care professionals by proposing the early treatment of fatigue in order to gain benefits for quality of life.
Visual wetness perception based on image color statistics.
Sawayama, Masataka; Adelson, Edward H; Nishida, Shin'ya
2017-05-01
Color vision provides humans and animals with the abilities to discriminate colors based on the wavelength composition of light and to determine the location and identity of objects of interest in cluttered scenes (e.g., ripe fruit among foliage). However, we argue that color vision can inform us about much more than color alone. Since a trichromatic image carries more information about the optical properties of a scene than a monochromatic image does, color can help us recognize complex material qualities. Here we show that human vision uses color statistics of an image for the perception of an ecologically important surface condition (i.e., wetness). Psychophysical experiments showed that overall enhancement of chromatic saturation, combined with a luminance tone change that increases the darkness and glossiness of the image, tended to make dry scenes look wetter. Theoretical analysis along with image analysis of real objects indicated that our image transformation, which we call the wetness enhancing transformation, is consistent with actual optical changes produced by surface wetting. Furthermore, we found that the wetness enhancing transformation operator was more effective for the images with many colors (large hue entropy) than for those with few colors (small hue entropy). The hue entropy may be used to separate surface wetness from other surface states having similar optical properties. While surface wetness and surface color might seem to be independent, there are higher order color statistics that can influence wetness judgments, in accord with the ecological statistics. The present findings indicate that the visual system uses color image statistics in an elegant way to help estimate the complex physical status of a scene.
Bryan, Rebecca; Nair, Prasanth B; Taylor, Mark
2009-09-18
Interpatient variability is often overlooked in orthopaedic computational studies due to the substantial challenges involved in sourcing and generating large numbers of bone models. A statistical model of the whole femur incorporating both geometric and material property variation was developed as a potential solution to this problem. The statistical model was constructed using principal component analysis, applied to 21 individual computer tomography scans. To test the ability of the statistical model to generate realistic, unique, finite element (FE) femur models it was used as a source of 1000 femurs to drive a study on femoral neck fracture risk. The study simulated the impact of an oblique fall to the side, a scenario known to account for a large proportion of hip fractures in the elderly and have a lower fracture load than alternative loading approaches. FE model generation, application of subject specific loading and boundary conditions, FE processing and post processing of the solutions were completed automatically. The generated models were within the bounds of the training data used to create the statistical model with a high mesh quality, able to be used directly by the FE solver without remeshing. The results indicated that 28 of the 1000 femurs were at highest risk of fracture. Closer analysis revealed the percentage of cortical bone in the proximal femur to be a crucial differentiator between the failed and non-failed groups. The likely fracture location was indicated to be intertrochantic. Comparison to previous computational, clinical and experimental work revealed support for these findings.
Martini, Paolo; Risso, Davide; Sales, Gabriele; Romualdi, Chiara; Lanfranchi, Gerolamo; Cagnin, Stefano
2011-04-11
In the last decades, microarray technology has spread, leading to a dramatic increase of publicly available datasets. The first statistical tools developed were focused on the identification of significant differentially expressed genes. Later, researchers moved toward the systematic integration of gene expression profiles with additional biological information, such as chromosomal location, ontological annotations or sequence features. The analysis of gene expression linked to physical location of genes on chromosomes allows the identification of transcriptionally imbalanced regions, while, Gene Set Analysis focuses on the detection of coordinated changes in transcriptional levels among sets of biologically related genes. In this field, meta-analysis offers the possibility to compare different studies, addressing the same biological question to fully exploit public gene expression datasets. We describe STEPath, a method that starts from gene expression profiles and integrates the analysis of imbalanced region as an a priori step before performing gene set analysis. The application of STEPath in individual studies produced gene set scores weighted by chromosomal activation. As a final step, we propose a way to compare these scores across different studies (meta-analysis) on related biological issues. One complication with meta-analysis is batch effects, which occur because molecular measurements are affected by laboratory conditions, reagent lots and personnel differences. Major problems occur when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. We evaluated the power of combining chromosome mapping and gene set enrichment analysis, performing the analysis on a dataset of leukaemia (example of individual study) and on a dataset of skeletal muscle diseases (meta-analysis approach). In leukaemia, we identified the Hox gene set, a gene set closely related to the pathology that other algorithms of gene set analysis do not identify, while the meta-analysis approach on muscular disease discriminates between related pathologies and correlates similar ones from different studies. STEPath is a new method that integrates gene expression profiles, genomic co-expressed regions and the information about the biological function of genes. The usage of the STEPath-computed gene set scores overcomes batch effects in the meta-analysis approaches allowing the direct comparison of different pathologies and different studies on a gene set activation level.
Stochastic rainfall synthesis for urban applications using different regionalization methods
NASA Astrophysics Data System (ADS)
Callau Poduje, A. C.; Leimbach, S.; Haberlandt, U.
2017-12-01
The proper design and efficient operation of urban drainage systems require long and continuous rainfall series in a high temporal resolution. Unfortunately, these time series are usually available in a few locations and it is therefore suitable to develop a stochastic precipitation model to generate rainfall in locations without observations. The model presented is based on an alternating renewal process and involves an external and an internal structure. The members of these structures are described by probability distributions which are site specific. Different regionalization methods based on site descriptors are presented which are used for estimating the distributions for locations without observations. Regional frequency analysis, multiple linear regressions and a vine-copula method are applied for this purpose. An area located in the north-west of Germany is used to compare the different methods and involves a total of 81 stations with 5 min rainfall records. The site descriptors include information available for the whole region: position, topography and hydrometeorologic characteristics which are estimated from long term observations. The methods are compared directly by cross validation of different rainfall statistics. Given that the model is stochastic the evaluation is performed based on ensembles of many long synthetic time series which are compared with observed ones. The performance is as well indirectly evaluated by setting up a fictional urban hydrological system to test the capability of the different methods regarding flooding and overflow characteristics. The results show a good representation of the seasonal variability and good performance in reproducing the sample statistics of the rainfall characteristics. The copula based method shows to be the most robust of the three methods. Advantages and disadvantages of the different methods are presented and discussed.
New Approaches to Robust Confidence Intervals for Location: A Simulation Study.
1984-06-01
obtain a denominator for the test statistic. Those statistics based on location estimates derived from Hampel’s redescending influence function or v...defined an influence function for a test in terms of the behavior of its P-values when the data are sampled from a model distribution modified by point...proposal could be used for interval estimation as well as hypothesis testing, the extension is immediate. Once an influence function has been defined
Obtaining Streamflow Statistics for Massachusetts Streams on the World Wide Web
Ries, Kernell G.; Steeves, Peter A.; Freeman, Aleda; Singh, Raj
2000-01-01
A World Wide Web application has been developed to make it easy to obtain streamflow statistics for user-selected locations on Massachusetts streams. The Web application, named STREAMSTATS (available at http://water.usgs.gov/osw/streamstats/massachusetts.html ), can provide peak-flow frequency, low-flow frequency, and flow-duration statistics for most streams in Massachusetts. These statistics describe the magnitude (how much), frequency (how often), and duration (how long) of flow in a stream. The U.S. Geological Survey (USGS) has published streamflow statistics, such as the 100-year peak flow, the 7-day, 10-year low flow, and flow-duration statistics, for its data-collection stations in numerous reports. Federal, State, and local agencies need these statistics to plan and manage use of water resources and to regulate activities in and around streams. Engineering and environmental consulting firms, utilities, industry, and others use the statistics to design and operate water-supply systems, hydropower facilities, industrial facilities, wastewater treatment facilities, and roads, bridges, and other structures. Until now, streamflow statistics for data-collection stations have often been difficult to obtain because they are scattered among many reports, some of which are not readily available to the public. In addition, streamflow statistics are often needed for locations where no data are available. STREAMSTATS helps solve these problems. STREAMSTATS was developed jointly by the USGS and MassGIS, the State Geographic Information Systems (GIS) agency, in cooperation with the Massachusetts Departments of Environmental Management and Environmental Protection. The application consists of three major components: (1) a user interface that displays maps and allows users to select stream locations for which they want streamflow statistics (fig. 1), (2) a data base of previously published streamflow statistics and descriptive information for 725 USGS data-collection stations, and (3) an automated procedure that determines characteristics of the land-surface area (basin) that drains to the stream and inserts those characteristics into equations that estimate the streamflow statistics. Each of these components is described and guidance for using STREAMSTATS is provided below.
Al-Khalid, Hamad; Alaskari, Ayman; Oraby, Samy
2011-01-01
Hardness homogeneity of the commonly used structural ferrous and nonferrous engineering materials is of vital importance in the design stage, therefore, reliable information regarding material properties homogeneity should be validated and any deviation should be addressed. In the current study the hardness variation, over wide spectrum radial locations of some ferrous and nonferrous structural engineering materials, was investigated. Measurements were performed over both faces (cross-section) of each stock bar according to a pre-specified stratified design, ensuring the coverage of the entire area both in radial and circumferential directions. Additionally the credibility of the apparatus and measuring procedures were examined through a statistically based calibration process of the hardness reference block. Statistical and response surface graphical analysis are used to examine the nature, adequacy and significance of the measured hardness values. Calibration of the apparatus reference block proved the reliability of the measuring system, where no strong evidence was found against the stochastic nature of hardness measures over the various stratified locations. Also, outlier elimination procedures were proved to be beneficial only at fewer measured points. Hardness measurements showed a dispersion domain that is within the acceptable confidence interval. For AISI 4140 and AISI 1020 steels, hardness is found to have a slight decrease trend as the diameter is reduced, while an opposite behavior is observed for AA 6082 aluminum alloy. However, no definite significant behavior was noticed regarding the effect of the sector sequence (circumferential direction). PMID:28817030
Al-Khalid, Hamad; Alaskari, Ayman; Oraby, Samy
2011-12-23
Hardness homogeneity of the commonly used structural ferrous and nonferrous engineering materials is of vital importance in the design stage, therefore, reliable information regarding material properties homogeneity should be validated and any deviation should be addressed. In the current study the hardness variation, over wide spectrum radial locations of some ferrous and nonferrous structural engineering materials, was investigated. Measurements were performed over both faces (cross-section) of each stock bar according to a pre-specified stratified design, ensuring the coverage of the entire area both in radial and circumferential directions. Additionally the credibility of the apparatus and measuring procedures were examined through a statistically based calibration process of the hardness reference block. Statistical and response surface graphical analysis are used to examine the nature, adequacy and significance of the measured hardness values. Calibration of the apparatus reference block proved the reliability of the measuring system, where no strong evidence was found against the stochastic nature of hardness measures over the various stratified locations. Also, outlier elimination procedures were proved to be beneficial only at fewer measured points. Hardness measurements showed a dispersion domain that is within the acceptable confidence interval. For AISI 4140 and AISI 1020 steels, hardness is found to have a slight decrease trend as the diameter is reduced, while an opposite behavior is observed for AA 6082 aluminum alloy. However, no definite significant behavior was noticed regarding the effect of the sector sequence (circumferential direction).
A new ionospheric storm scale based on TEC and foF2 statistics
NASA Astrophysics Data System (ADS)
Nishioka, Michi; Tsugawa, Takuya; Jin, Hidekatsu; Ishii, Mamoru
2017-01-01
In this paper, we propose the I-scale, a new ionospheric storm scale for general users in various regions in the world. With the I-scale, ionospheric storms can be classified at any season, local time, and location. Since the ionospheric condition largely depends on many factors such as solar irradiance, energy input from the magnetosphere, and lower atmospheric activity, it had been difficult to scale ionospheric storms, which are mainly caused by solar and geomagnetic activities. In this study, statistical analysis was carried out for total electron content (TEC) and F2 layer critical frequency (foF2) in Japan for 18 years from 1997 to 2014. Seasonal, local time, and latitudinal dependences of TEC and foF2 variabilities are excluded by normalizing each percentage variation using their statistical standard deviations. The I-scale is defined by setting thresholds to the normalized numbers to seven categories: I0, IP1, IP2, IP3, IN1, IN2, and IN3. I0 represents a quiet state, and IP1 (IN1), IP2 (IN2), and IP3 (IN3) represent moderate, strong, and severe positive (negative) storms, respectively. The proposed I-scale can be used for other locations, such as polar and equatorial regions. It is considered that the proposed I-scale can be a standardized scale to help the users to assess the impact of space weather on their systems.
2012-01-01
Background The debate over physicians’ geographical distribution has attracted the attention of the economic and public health literature over the last forty years. Nonetheless, it is still to date unclear what influences physicians’ location, and whether foreign physicians contribute to fill the geographical gaps left by national doctors in any given country. The present research sets out to investigate the current distribution of national and international physicians in Portugal, with the objective to understand its determinants and provide an evidence base for policy-makers to identify policies to influence it. Methods A cross-sectional study of physicians currently registered in Portugal was conducted to describe the population and explore the association of physician residence patterns with relevant personal and municipality characteristics. Data from the Portuguese Medical Council on physicians’ residence and characteristics were analysed, as well as data from the National Institute of Statistics on municipalities’ population, living standards and health care network. Descriptive statistics, chi-square tests, negative binomial and logistic regression modelling were applied to determine: (a) municipality characteristics predicting Portuguese and International physicians’ geographical distribution, and; (b) doctors’ characteristics that could increase the odds of residing outside the country’s metropolitan areas. Results There were 39,473 physicians in Portugal in 2008, 51.1% of whom male, and 40.2% between 41 and 55 years of age. They were predominantly Portuguese (90.5%), with Spanish, Brazilian and African nationalities also represented. Population, Population’s Purchasing Power, Nurses per capita and Municipality Development Index (MDI) were the municipality characteristics displaying the strongest association with national physicians’ location. For foreign physicians, the MDI was not statistically significant, while municipalities’ foreign population applying for residence appeared to be an additional positive factor in their location decisions. In general, being foreigner and male resulted to be the physician characteristics increasing the odds of residing outside the metropolitan areas. However, among the internationals, older doctors were more likely to reside outside metropolitan areas. Being Spanish or Brazilian (but not of African origin) was found to increase the odds of being based outside the Lisbon and Oporto metropolitan areas. Conclusions The present study showed the relevance of studying one country’s physician population to understand the factors driving national and international doctors’ location decisions. A more nuanced understanding of national and foreign doctors’ location appears to be needed to design more effective policies to reduce the imbalance of medical services across geographical areas. PMID:22748122
Russo, Giuliano; Ferrinho, Paulo; de Sousa, Bruno; Conceição, Cláudia
2012-07-02
The debate over physicians' geographical distribution has attracted the attention of the economic and public health literature over the last forty years. Nonetheless, it is still to date unclear what influences physicians' location, and whether foreign physicians contribute to fill the geographical gaps left by national doctors in any given country. The present research sets out to investigate the current distribution of national and international physicians in Portugal, with the objective to understand its determinants and provide an evidence base for policy-makers to identify policies to influence it. A cross-sectional study of physicians currently registered in Portugal was conducted to describe the population and explore the association of physician residence patterns with relevant personal and municipality characteristics. Data from the Portuguese Medical Council on physicians' residence and characteristics were analysed, as well as data from the National Institute of Statistics on municipalities' population, living standards and health care network. Descriptive statistics, chi-square tests, negative binomial and logistic regression modelling were applied to determine: (a) municipality characteristics predicting Portuguese and International physicians' geographical distribution, and; (b) doctors' characteristics that could increase the odds of residing outside the country's metropolitan areas. There were 39,473 physicians in Portugal in 2008, 51.1% of whom male, and 40.2% between 41 and 55 years of age. They were predominantly Portuguese (90.5%), with Spanish, Brazilian and African nationalities also represented. Population, Population's Purchasing Power, Nurses per capita and Municipality Development Index (MDI) were the municipality characteristics displaying the strongest association with national physicians' location. For foreign physicians, the MDI was not statistically significant, while municipalities' foreign population applying for residence appeared to be an additional positive factor in their location decisions. In general, being foreigner and male resulted to be the physician characteristics increasing the odds of residing outside the metropolitan areas. However, among the internationals, older doctors were more likely to reside outside metropolitan areas. Being Spanish or Brazilian (but not of African origin) was found to increase the odds of being based outside the Lisbon and Oporto metropolitan areas. The present study showed the relevance of studying one country's physician population to understand the factors driving national and international doctors' location decisions. A more nuanced understanding of national and foreign doctors' location appears to be needed to design more effective policies to reduce the imbalance of medical services across geographical areas.
Salvatore, Stefania; Bramness, Jørgen Gustav; Reid, Malcolm J; Thomas, Kevin Victor; Harman, Christopher; Røislien, Jo
2015-01-01
Wastewater-based epidemiology (WBE) is a new methodology for estimating the drug load in a population. Simple summary statistics and specification tests have typically been used to analyze WBE data, comparing differences between weekday and weekend loads. Such standard statistical methods may, however, overlook important nuanced information in the data. In this study, we apply functional data analysis (FDA) to WBE data and compare the results to those obtained from more traditional summary measures. We analysed temporal WBE data from 42 European cities, using sewage samples collected daily for one week in March 2013. For each city, the main temporal features of two selected drugs were extracted using functional principal component (FPC) analysis, along with simpler measures such as the area under the curve (AUC). The individual cities' scores on each of the temporal FPCs were then used as outcome variables in multiple linear regression analysis with various city and country characteristics as predictors. The results were compared to those of functional analysis of variance (FANOVA). The three first FPCs explained more than 99% of the temporal variation. The first component (FPC1) represented the level of the drug load, while the second and third temporal components represented the level and the timing of a weekend peak. AUC was highly correlated with FPC1, but other temporal characteristic were not captured by the simple summary measures. FANOVA was less flexible than the FPCA-based regression, and even showed concordance results. Geographical location was the main predictor for the general level of the drug load. FDA of WBE data extracts more detailed information about drug load patterns during the week which are not identified by more traditional statistical methods. Results also suggest that regression based on FPC results is a valuable addition to FANOVA for estimating associations between temporal patterns and covariate information.
The Utility of Robust Means in Statistics
ERIC Educational Resources Information Center
Goodwyn, Fara
2012-01-01
Location estimates calculated from heuristic data were examined using traditional and robust statistical methods. The current paper demonstrates the impact outliers have on the sample mean and proposes robust methods to control for outliers in sample data. Traditional methods fail because they rely on the statistical assumptions of normality and…
Statistical characterization of the Sub-Auroral Polarization Stream (SAPS)
NASA Astrophysics Data System (ADS)
Kunduri, B.; Baker, J. B.; Ruohoniemi, J. M.; Erickson, P. J.; Coster, A. J.; Oksavik, K.
2017-12-01
The Sub-Auroral Polarization Stream (SAPS) is a narrow region of westward directed plasma convection typically observed in the dusk-midnight sector equatorward of the main auroral oval. SAPS plays an important role in mid-latitude space weather dynamics and has a controlling influence on the evolution of large-scale plasma features, such as Storm Enhanced Density (SED) plumes. In this study, data from North American mid-latitude SuperDARN radars collected between January 2011 and December 2014 have been used to compile a database of SAPS events for statistical analysis. We examine the dependence of SAPS velocity magnitude and direction on geomagnetic activity and magnetic local time. The lowest speed limit and electric fields observed during SAPS are discussed and histograms of SAPS velocities for different Dst bins and MLAT-MLT locations are presented. We find significant differences in SAPS characteristics between periods of low and high geomagnetic activity, suggesting that SAPS are driven by different mechanisms during storm and non-storm conditions. To further explore this possibility, we have characterized the SAPS location and peak speed relative to the ionospheric trough specified by GPS Total Electron Content (TEC) data from the MIT Haystack Madrigal database. A particular emphasis is placed on identifying the extent to which the location, structure, and depth of the trough may play a controlling influence on SAPS speeds during storm and non-storm periods. The results are interpreted in terms of the current paradigm for active thermosphere-ionosphere feedback being an important component of SAPS physics.
Association of gene polymorphisms in ABO blood group chromosomal regions and menstrual disorders
SU, YONG; KONG, GUI-LIAN; SU, YA-LI; ZHOU, YAN; LV, LI-FANG; WANG, QIONG; HUANG, BAO-PING; ZHENG, RUI-ZHI; LI, QUAN-ZHONG; YUAN, HUI-JUAN; ZHAO, ZHI-GANG
2015-01-01
This study aimed to investigate whether single nucleotide polymorphisms (SNPs) located near the gene of the ABO blood group play an important role in the genetic aetiology of menstrual disorders (MDs). Polymerase chain reaction-ligase detection reaction technology was used to detect eight SNPs near the ABO gene location on the chromosomes in 250 cases of MD and 250 cases of normal menstruation. The differences in the distribution of each genotype, as well as the allele frequency in the normal and control groups, were analysed using Pearson's χ2 test to search for disease-associated loci. SHEsis software was used to analyse the linkage disequilibrium and haplotype frequencies and to inspect the correlation between haplotypes and the disease. Compared with the control group, the experimental group exhibited statistically significant differences in the genotype distribution frequencies of the rs657152 locus of the ABO blood group gene and the rs17250673 locus of the tumour necrosis factor cofactor 2 (TRAF2) gene, which is located downstream of the ABO gene. The allele distribution frequencies of rs657152 and rs495828 loci in the ABO blood group gene exhibited significant differences between the groups. Dominant and recessive genetic model analysis of each locus revealed that the experimental group exhibited statistically significant differences from the control group in the genotype distribution frequencies of rs657152 and rs495828 loci, respectively. These results indicate that the ABO blood group gene and TRAF2 gene may be a cause of MDs. PMID:26136981
Schares, G; Langenmayer, M C; Majzoub-Altweck, M; Scharr, J C; Gentile, A; Maksimov, A; Schares, S; Conraths, F J; Gollnick, N S
2016-01-30
Bovine besnoitiosis is caused by Besnoitia besnoiti, an apicomplexan parasite closely related to Toxoplasma gondii and Neospora caninum. In the acute stage of besnoitiosis, cattle suffer from pyrexia, swollen lymph nodes, anorexia and subcutaneous edema. In the chronic stage, tissue cysts are formed in a variety of tissues including the skin. Knowledge about the distribution of tissue cysts of different parts of the skin of infected animals is scarce. Four chronically infected cattle were euthanized and skin samples were taken from a total of 77 standardized cutaneous locations per animal. Portions of the dermis were taken, from which DNA was extracted and examined by real-time PCR. Cycle of transition (Ct) values reflecting the amount of parasite DNA in the samples were determined. For statistical analysis, samples were attributed to 11 larger skin regions ('OuterHindlegDistal', 'Rump, ForelegMiddle', 'NoseFrontEars', 'CheekEye', 'SideLowerPart', 'ForelegDistal', 'SideUpperPart', 'LegsInner', 'VentralHeadNeck', 'DorsalNeckWithersBackTail'). While all samples revealed a positive result in three female cattle, only 63.6% (49/77) of the samples of a bull showed positive results. For statistical analysis, a Ct value of 45 was assumed for samples with a negative result. The dams showed median Ct values of 16.1, 17.5 and 19.4, while in skin samples of the bull a median Ct value of 37.6 was observed. To determine the differences in DNA concentrations between different locations of the skin of the animals, a relative Ct (relCt) was determined by subtracting for each animal indv the MedianCtindv from each sample Ct. Analyses of the relCt values showed that the highest relative parasite DNA concentrations were observed in the categories 'OuterHindlegDistal', 'Rump', 'ForelegMiddle' and 'NoseFrontEars'. The relCt values in these categories differed statistically significantly from those determined for the categories 'VentralHeadNeck' and 'DorsalNeckWithersBackTail'. The analysis showed clear differences in the distribution and the detectability of parasite DNA in the skin of cattle infected with B. besnoiti. In all four animals, samples from the 'Rump' region (Regio fermoris) showed high parasite DNA concentrations. Because this region is also easily accessible for veterinarians, this skin location appears to be optimal for taking skin biopsies for detection or isolation of B. besnoiti. Copyright © 2015 Elsevier B.V. All rights reserved.
Cary, L.E.
1989-01-01
Selected water-quality data from two streamflow-gaging stations on the Powder River, Montana and Wyoming, were statistically analyzed for trends using the seasonal Kendall test. Data for water years 1952-63 and 1975-85 from the Powder River near Locate, Montana, and water years 1967-68 and 1976-85 from the Powder River at Sussex, Wyoming, were analyzed. Data for the earlier period near Locate were discharge-weighted monthly mean values, whereas data for the late period near Locate and at Sussex were from periodic samples. For data from water years 1952-63 near Locate, increasing trends were detected in sodium and sodium-adsorption ratio; no trends were detected in specific conductance, hardness, non-carbonate hardness, alkalinity, dissolved solids, or sulfate. For data from water years 1975-85 near Locate, increasing trends were detected in specific conductance, sodium, sodium-adsorption ratio, and chloride; no trends were detected in hardness, noncarbonate hardness, alkalinity, dissolved solids, calcium, magnesium, potassium, or sulfate. At Sussex (water years 1967-68 and 1976-85), increasing trends were detected in sodium, sodium-adsorption ratio, and chloride, and a decreasing trend was detected in sulfate. No trends were detected in specific conductance, alkalinity, or dissolved solids. When the 1967-68 data were deleted and the analysis repeated for the 1976-85 data, only sodium-adsorption ratio displayed a significant (increasing) trend. Because the study was exploratory, causes and effects were not considered. The results might have been affected by sample size, number of seasons, heterogeneity, significance level, serial correlation, and data adjustment for changes in discharge. (USGS)
Design of a Web-tool for diagnostic clinical trials handling medical imaging research.
Baltasar Sánchez, Alicia; González-Sistal, Angel
2011-04-01
New clinical studies in medicine are based on patients and controls using different imaging diagnostic modalities. Medical information systems are not designed for clinical trials employing clinical imaging. Although commercial software and communication systems focus on storage of image data, they are not suitable for storage and mining of new types of quantitative data. We sought to design a Web-tool to support diagnostic clinical trials involving different experts and hospitals or research centres. The image analysis of this project is based on skeletal X-ray imaging. It involves a computerised image method using quantitative analysis of regions of interest in healthy bone and skeletal metastases. The database is implemented with ASP.NET 3.5 and C# technologies for our Web-based application. For data storage, we chose MySQL v.5.0, one of the most popular open source databases. User logins were necessary, and access to patient data was logged for auditing. For security, all data transmissions were carried over encrypted connections. This Web-tool is available to users scattered at different locations; it allows an efficient organisation and storage of data (case report form) and images and allows each user to know precisely what his task is. The advantages of our Web-tool are as follows: (1) sustainability is guaranteed; (2) network locations for collection of data are secured; (3) all clinical information is stored together with the original images and the results derived from processed images and statistical analysis that enable us to perform retrospective studies; (4) changes are easily incorporated because of the modular architecture; and (5) assessment of trial data collected at different sites is centralised to reduce statistical variance.
Santos, Hellen-Bandeira-de-Pontes; dos Santos, Thayana-Karla-Guerra; Paz, Alexandre-Rolim; Cavalcanti, Yuri-Wanderley; Nonaka, Cassiano-Francisco-Weege; Godoy, Gustavo-Pina; Alves, Pollianna-Muniz
2016-03-01
In recent years have been observed an increased incidence of OSCC in young individuals. Based on this, the aim this study was to describe the clinical characteristics of all cases of OSCC in younger patients, diagnosed in two oncology referral hospitals, at the northeast region of Brazil within a 12-year period. Data regarding general characteristics of patients (age, gender and tobacco and/or alcohol habits) and information about the lesions (tumor location, size, regional lymph node metastasis, distant metastasis and clinical stage) were submitted to descriptive and inferential analysis. Statistical analysis included Chi-square and Fisher's exact tests (P<0.05). Out of 2311 registered cases of OSCC, 76 (3.3%) corresponded to OSCC in patients under 45 years old. Most of them were male (n=62, 81.6%) and tobacco and/or alcohol users (n=40, 52.8%). The most frequent site was the tongue (n=31, 40.8%), with predominance of cases classified at advanced clinical stage (III and IV, n = 46, 60.5%). The advanced stage of OSCC (III and IV) was statistically associated with male gender (P=0.035), lower education level (P=0.007), intraoral sites (P<0.001), presence of pain symptomatology (P=0.006), and consumption of tobacco and/or alcohol (P=0.001). The profile of OSCC in young patients resembles to the commonly characteristics reported for overall population. The late diagnosis in young patients usually results in poor prognosis, associated with gender, harmful habits and tumor location. Although prevalence is low, stimulus to prevention and to early diagnosis should be addressed to young individuals exposed to risk factors.
Li, Siyue; Zhang, Quanfa
2010-04-15
A data matrix (4032 observations), obtained during a 2-year monitoring period (2005-2006) from 42 sites in the upper Han River is subjected to various multivariate statistical techniques including cluster analysis, principal component analysis (PCA), factor analysis (FA), correlation analysis and analysis of variance to determine the spatial characterization of dissolved trace elements and heavy metals. Our results indicate that waters in the upper Han River are primarily polluted by Al, As, Cd, Pb, Sb and Se, and the potential pollutants include Ba, Cr, Hg, Mn and Ni. Spatial distribution of trace metals indicates the polluted sections mainly concentrate in the Danjiang, Danjiangkou Reservoir catchment and Hanzhong Plain, and the most contaminated river is in the Hanzhong Plain. Q-model clustering depends on geographical location of sampling sites and groups the 42 sampling sites into four clusters, i.e., Danjiang, Danjiangkou Reservoir region (lower catchment), upper catchment and one river in headwaters pertaining to water quality. The headwaters, Danjiang and lower catchment, and upper catchment correspond to very high polluted, moderate polluted and relatively low polluted regions, respectively. Additionally, PCA/FA and correlation analysis demonstrates that Al, Cd, Mn, Ni, Fe, Si and Sr are controlled by natural sources, whereas the other metals appear to be primarily controlled by anthropogenic origins though geogenic source contributing to them. 2009 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willse, Alan R.; Belcher, Ann; Preti, George
2005-04-15
Gas chromatography (GC), combined with mass spectrometry (MS) detection, is a powerful analytical technique that can be used to separate, quantify, and identify volatile compounds in complex mixtures. This paper examines the application of GC-MS in a comparative experiment to identify volatiles that differ in concentration between two groups. A complex mixture might comprise several hundred or even thousands of volatile compounds. Because their number and location in a chromatogram generally are unknown, and because components overlap in populous chromatograms, the statistical problems offer significant challenges beyond traditional two-group screening procedures. We describe a statistical procedure to compare two-dimensional GC-MSmore » profiles between groups, which entails (1) signal processing: baseline correction and peak detection in single ion chromatograms; (2) aligning chromatograms in time; (3) normalizing differences in overall signal intensities; and (4) detecting chromatographic regions that differ between groups. Compared to existing approaches, the proposed method is robust to errors made at earlier stages of analysis, such as missed peaks or slightly misaligned chromatograms. To illustrate the method, we identify differences in GC-MS chromatograms of ether-extracted urine collected from two nearly identical inbred groups of mice, to investigate the relationship between odor and genetics of the major histocompatibility complex.« less
Statistical analysis of kinetic energy entrainment in a model wind turbine array boundary layer
NASA Astrophysics Data System (ADS)
Cal, Raul Bayoan; Hamilton, Nicholas; Kang, Hyung-Suk; Meneveau, Charles
2012-11-01
For large wind farms, kinetic energy must be entrained from the flow above the wind turbines to replenish wakes and enable power extraction in the array. Various statistical features of turbulence causing vertical entrainment of mean-flow kinetic energy are studied using hot-wire velocimetry data taken in a model wind farm in a scaled wind tunnel experiment. Conditional statistics and spectral decompositions are employed to characterize the most relevant turbulent flow structures and determine their length-scales. Sweep and ejection events are shown to be the largest contributors to the vertical kinetic energy flux, although their relative contribution depends upon the location in the wake. Sweeps are shown to be dominant in the region above the wind turbine array. A spectral analysis of the data shows that large scales of the flow, about the size of the rotor diameter in length or larger, dominate the vertical entrainment. The flow is more incoherent below the array, causing decreased vertical fluxes there. The results show that improving the rate of vertical kinetic energy entrainment into wind turbine arrays is a standing challenge and would require modifying the large-scale structures of the flow. This work was funded in part by the National Science Foundation (CBET-0730922, CBET-1133800 and CBET-0953053).
Unconscious analyses of visual scenes based on feature conjunctions.
Tachibana, Ryosuke; Noguchi, Yasuki
2015-06-01
To efficiently process a cluttered scene, the visual system analyzes statistical properties or regularities of visual elements embedded in the scene. It is controversial, however, whether those scene analyses could also work for stimuli unconsciously perceived. Here we show that our brain performs the unconscious scene analyses not only using a single featural cue (e.g., orientation) but also based on conjunctions of multiple visual features (e.g., combinations of color and orientation information). Subjects foveally viewed a stimulus array (duration: 50 ms) where 4 types of bars (red-horizontal, red-vertical, green-horizontal, and green-vertical) were intermixed. Although a conscious perception of those bars was inhibited by a subsequent mask stimulus, the brain correctly analyzed the information about color, orientation, and color-orientation conjunctions of those invisible bars. The information of those features was then used for the unconscious configuration analysis (statistical processing) of the central bars, which induced a perceptual bias and illusory feature binding in visible stimuli at peripheral locations. While statistical analyses and feature binding are normally 2 key functions of the visual system to construct coherent percepts of visual scenes, our results show that a high-level analysis combining those 2 functions is correctly performed by unconscious computations in the brain. (c) 2015 APA, all rights reserved).
Spatial diffusion of influenza outbreak-related climate factors in Chiang Mai Province, Thailand.
Nakapan, Supachai; Tripathi, Nitin Kumar; Tipdecho, Taravudh; Souris, Marc
2012-10-24
Influenza is one of the most important leading causes of respiratory illness in the countries located in the tropical areas of South East Asia and Thailand. In this study the climate factors associated with influenza incidence in Chiang Mai Province, Northern Thailand, were investigated. Identification of factors responsible for influenza outbreaks and the mapping of potential risk areas in Chiang Mai are long overdue. This work examines the association between yearly climate patterns between 2001 and 2008 and influenza outbreaks in the Chiang Mai Province. The climatic factors included the amount of rainfall, percent of rainy days, relative humidity, maximum, minimum temperatures and temperature difference. The study develops a statistical analysis to quantitatively assess the relationship between climate and influenza outbreaks and then evaluate its suitability for predicting influenza outbreaks. A multiple linear regression technique was used to fit the statistical model. The Inverse Distance Weighted (IDW) interpolation and Geographic Information System (GIS) techniques were used in mapping the spatial diffusion of influenza risk zones. The results show that there is a significance correlation between influenza outbreaks and climate factors for the majority of the studied area. A statistical analysis was conducted to assess the validity of the model comparing model outputs and actual outbreaks.
T-Ping, Cheng; Weckx, Luc Louis Maurice
2008-01-01
The data base of ENT care in the Brazilian public health system (Sistema Unico de Saude - SUS) will help organize public health programs. The following items were investigated in patients aged up to 17 years attended in public health system outpatient units in the city of Mariana, in the ENT screening unit, UNIFESP-EPM, and in CISMISEL: 1) The main otorhinolaryngological diagnoses; 2) The most frequently required exams, drugs, and surgical procedures and their indications; 3) The jobs of parents; the number of siblings; and 4) A statistical analysis and comparison of data in each location. We undertook a prospective study and a statistical analysis of variables that were gathered during the first visit. The age, the parents' salary, the number of siblings aged below 18 years, the presence of rhinitis, ears diseases, the exams, drugs and otological surgeries that were indicated were all statistically significant. The most common diagnosis was mouth breathing. The most common surgery was adenotonsillectomy. The most frequently requested exam was a lateral cranial radiograph. The number of unemployed parents, their poor salaries, and the number of siblings make it difficult for these patients to be treated in any facility other than the public heath system.
Hydrostatic paradox: experimental verification of pressure equilibrium
NASA Astrophysics Data System (ADS)
Kodejška, Č.; Ganci, S.; Říha, J.; Sedláčková, H.
2017-11-01
This work is focused on the experimental verification of the balance between the atmospheric pressure acting on the sheet of paper, which encloses the cylinder completely or partially filled with water from below, where the hydrostatic pressure of the water column acts against the atmospheric pressure. First of all this paper solves a theoretical analysis of the problem, which is based, firstly, on the equation for isothermal process and, secondly, on the equality of pressures inside and outside the cylinder. From the measured values the confirmation of the theoretical quadratic dependence of the air pressure inside the cylinder on the level of the liquid in the cylinder is obtained, the maximum change in the volume of air within the cylinder occurs for the height of the water column L of one half of the total height of the vessel H. The measurements were made for different diameters of the cylinder and with plates made of different materials located at the bottom of the cylinder to prevent liquid from flowing out of the cylinder. The measured values were subjected to statistical analysis, which demonstrated the validity of the zero hypothesis, i.e. that the measured values are not statistically significantly different from the theoretically calculated ones at the statistical significance level α = 0.05.
NASA Astrophysics Data System (ADS)
Sutton, Virginia Kay
This paper examines statistical issues associated with estimating paths of juvenile salmon through the intakes of Kaplan turbines. Passive sensors, hydrophones, detecting signals from ultrasonic transmitters implanted in individual fish released into the preturbine region were used to obtain the information to estimate fish paths through the intake. Aim and location of the sensors affects the spatial region in which the transmitters can be detected, and formulas relating this region to sensor aiming directions are derived. Cramer-Rao lower bounds for the variance of estimators of fish location are used to optimize placement of each sensor. Finally, a statistical methodology is developed for analyzing angular data collected from optimally placed sensors.
Ulgen, Ayse; Han, Zhihua; Li, Wentian
2003-12-31
We address the question of whether statistical correlations among quantitative traits lead to correlation of linkage results of these traits. Five measured quantitative traits (total cholesterol, fasting glucose, HDL cholesterol, blood pressure, and triglycerides), and one derived quantitative trait (total cholesterol divided by the HDL cholesterol) are used for phenotype correlation studies. Four of them are used for linkage analysis. We show that although correlation among phenotypes partially reflects the correlation among linkage analysis results, the LOD-score correlations are on average low. The most significant peaks found by using different traits do not often overlap. Studying covariances at specific locations in LOD scores may provide clues for further bivariate linkage analyses.
Solano, Rubén; Gómez-Barroso, Diana; Simón, Fernando; Lafuente, Sarah; Simón, Pere; Rius, Cristina; Gorrindo, Pilar; Toledo, Diana; Caylà, Joan A
2014-05-01
A retrospective, space-time study of whooping cough cases reported to the Public Health Agency of Barcelona, Spain between the years 2000 and 2011 is presented. It is based on 633 individual whooping cough cases and the 2006 population census from the Spanish National Statistics Institute, stratified by age and sex at the census tract level. Cluster identification was attempted using space-time scan statistic assuming a Poisson distribution and restricting temporal extent to 7 days and spatial distance to 500 m. Statistical calculations were performed with Stata 11 and SatScan and mapping was performed with ArcGis 10.0. Only clusters showing statistical significance (P <0.05) were mapped. The most likely cluster identified included five census tracts located in three neighbourhoods in central Barcelona during the week from 17 to 23 August 2011. This cluster included five cases compared with the expected level of 0.0021 (relative risk = 2436, P <0.001). In addition, 11 secondary significant space-time clusters were detected with secondary clusters occurring at different times and localizations. Spatial statistics is felt to be useful by complementing epidemiological surveillance systems through visualizing excess in the number of cases in space and time and thus increase the possibility of identifying outbreaks not reported by the surveillance system.
Lu, Qiongshi; Li, Boyang; Ou, Derek; Erlendsdottir, Margret; Powles, Ryan L; Jiang, Tony; Hu, Yiming; Chang, David; Jin, Chentian; Dai, Wei; He, Qidu; Liu, Zefeng; Mukherjee, Shubhabrata; Crane, Paul K; Zhao, Hongyu
2017-12-07
Despite the success of large-scale genome-wide association studies (GWASs) on complex traits, our understanding of their genetic architecture is far from complete. Jointly modeling multiple traits' genetic profiles has provided insights into the shared genetic basis of many complex traits. However, large-scale inference sets a high bar for both statistical power and biological interpretability. Here we introduce a principled framework to estimate annotation-stratified genetic covariance between traits using GWAS summary statistics. Through theoretical and numerical analyses, we demonstrate that our method provides accurate covariance estimates, thereby enabling researchers to dissect both the shared and distinct genetic architecture across traits to better understand their etiologies. Among 50 complex traits with publicly accessible GWAS summary statistics (N total ≈ 4.5 million), we identified more than 170 pairs with statistically significant genetic covariance. In particular, we found strong genetic covariance between late-onset Alzheimer disease (LOAD) and amyotrophic lateral sclerosis (ALS), two major neurodegenerative diseases, in single-nucleotide polymorphisms (SNPs) with high minor allele frequencies and in SNPs located in the predicted functional genome. Joint analysis of LOAD, ALS, and other traits highlights LOAD's correlation with cognitive traits and hints at an autoimmune component for ALS. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Channel fading for mobile satellite communications using spread spectrum signaling and TDRSS
NASA Technical Reports Server (NTRS)
Jenkins, Jeffrey D.; Fan, Yiping; Osborne, William P.
1995-01-01
This paper will present some preliminary results from a propagation experiment which employed NASA's TDRSS and an 8 MHz chip rate spread spectrum signal. Channel fade statistics were measured and analyzed in 21 representative geographical locations covering urban/suburban, open plain, and forested areas. Cumulative distribution Functions (CDF's) of 12 individual locations are presented and classified based on location. Representative CDF's from each of these three types of terrain are summarized. These results are discussed, and the fade depths exceeded 10 percent of the time in three types of environments are tabulated. The spread spectrum fade statistics for tree-lined roads are compared with the Empirical Roadside Shadowing Model.
AGOR 28: SIO Shipyard Representative Bi-Weekly Progress Report
2013-07-18
ADA DBL)/ADA t/s Lkr Md-2 (CG) 6 Sally Ride Gray Water Tank – Tank pressurized and boundaries inspected for leaks with ABS in...SPARES (VRS) LISTINGS, STATISTICS, AND LOCATIONS (DI-051 (VRS) for Pl 72428-03 Gray Water Discharge Pump)(R/ASR) 570/0 AGOR27 A051 STD Report...VENDOR RECOMMENDED SPARES (VRS) LISTINGS, STATISTICS, AND LOCATIONS ( DI-051 (VRS) for PL 72428-08 Hot Water Circ Pump)(R/ASR) 574/0 AGOR27 A055 TM
Veder, Barbara; Pope, Stan; Mani, Michèle; Beaudoin, Kelly; Ritchie, Janice
2014-01-01
Access to technologically mediated information and services under the umbrella of mental and physical health has become increasingly available to clients via Internet modalities, according to a recent study. In May 2010, video counseling was added to the counseling services offered through the Employee and Family Assistance Program at Shepell·fgi as a pilot project with a full operational launch in September 2011. The objective of this study was to conduct a retrospective post launch examination of the video counseling service through an analysis of the reported clinical outcomes of video and in-person counseling modalities. A chronological sample of 68 video counseling (VC) cases and 68 in-person (IP) cases were collected from a pool of client clinical files closed in 2012. To minimize the variables impacting the study and maintain as much clinical continuity as possible, the IP and the VC clients must have attended clinical sessions with any one of six counselors who provided both the VC and the IP services. The study compared the two counseling modalities along the following data points (see glossary of terms): (1) client demographic profiles (eg, age, gender, whether the sessions involved individuals or conjoint sessions with couples or families, etc), (2) presenting issue, (3) average session hours, (4) client rating of session helpfulness, (5) rates of goal completion, (6) client withdrawal rates, (7) no show and late cancellation rates, and (8) pre/post client self-assessment. Specific to VC, we examined client geographic location. Data analysis demonstrates that the VC and the IP showed a similar representation of presenting issues with nearly identical outcomes for client ratings of session helpfulness, rates of goal completion, pre/post client self-assessment, average session duration, and client geographic location. There were no statistically significant differences in the rates of withdrawal from counseling, no shows, and late cancellations between the VC and the IP counseling. The statistical analysis of the data was done on SPSS statistical software using 2-sample and pairwise comparison t tests at a 95% level of significance. Based on the study, VC and IP show similar outcomes in terms of client rating of session and goal attainment.
Evaluation of uncertainty in the adjustment of fundamental constants
NASA Astrophysics Data System (ADS)
Bodnar, Olha; Elster, Clemens; Fischer, Joachim; Possolo, Antonio; Toman, Blaza
2016-02-01
Combining multiple measurement results for the same quantity is an important task in metrology and in many other areas. Examples include the determination of fundamental constants, the calculation of reference values in interlaboratory comparisons, or the meta-analysis of clinical studies. However, neither the GUM nor its supplements give any guidance for this task. Various approaches are applied such as weighted least-squares in conjunction with the Birge ratio or random effects models. While the former approach, which is based on a location-scale model, is particularly popular in metrology, the latter represents a standard tool used in statistics for meta-analysis. We investigate the reliability and robustness of the location-scale model and the random effects model with particular focus on resulting coverage or credible intervals. The interval estimates are obtained by adopting a Bayesian point of view in conjunction with a non-informative prior that is determined by a currently favored principle for selecting non-informative priors. Both approaches are compared by applying them to simulated data as well as to data for the Planck constant and the Newtonian constant of gravitation. Our results suggest that the proposed Bayesian inference based on the random effects model is more reliable and less sensitive to model misspecifications than the approach based on the location-scale model.
Malewski, David F; Ream, Aimrie; Gaither, Caroline A
2015-01-01
Patient satisfaction with pharmaceutical care can be a strong predictor of medication and other health-related outcomes. Less understood is the role that location of pharmacies in urban or suburban environments plays in patient satisfaction with pharmacy and pharmacist services. The purpose of this study was to serve as a pilot examining urban and suburban community pharmacy populations for similarities and differences in patient satisfaction. Community pharmacy patients were asked to self-administer a 30-question patient satisfaction survey. Fifteen questions addressed their relationship with the pharmacist, 10 questions addressed satisfaction and accessibility of the pharmacy, and five questions addressed financial concerns. Five urban and five suburban pharmacies agreed to participate. Data analysis included descriptive statistics and chi-square analysis. Most patients reported high levels of satisfaction. Satisfaction with pharmacist relationship and service was 70% or higher with no significant differences between locations. There were significant differences between the urban and suburban patients regarding accessibility of pharmacy services, customer service and some patient/pharmacist trust issues. The significant differences between patient satisfaction in the suburban and urban populations warrant a larger study with more community pharmacies in other urban, suburban and rural locations to better understand and validate study findings. Copyright © 2015 Elsevier Inc. All rights reserved.
Spatial analysis of relative humidity during ungauged periods in a mountainous region
NASA Astrophysics Data System (ADS)
Um, Myoung-Jin; Kim, Yeonjoo
2017-08-01
Although atmospheric humidity influences environmental and agricultural conditions, thereby influencing plant growth, human health, and air pollution, efforts to develop spatial maps of atmospheric humidity using statistical approaches have thus far been limited. This study therefore aims to develop statistical approaches for inferring the spatial distribution of relative humidity (RH) for a mountainous island, for which data are not uniformly available across the region. A multiple regression analysis based on various mathematical models was used to identify the optimal model for estimating monthly RH by incorporating not only temperature but also location and elevation. Based on the regression analysis, we extended the monthly RH data from weather stations to cover the ungauged periods when no RH observations were available. Then, two different types of station-based data, the observational data and the data extended via the regression model, were used to form grid-based data with a resolution of 100 m. The grid-based data that used the extended station-based data captured the increasing RH trend along an elevation gradient. Furthermore, annual RH values averaged over the regions were examined. Decreasing temporal trends were found in most cases, with magnitudes varying based on the season and region.
Groundwater flow and hydrogeochemical evolution in the Jianghan Plain, central China
NASA Astrophysics Data System (ADS)
Gan, Yiqun; Zhao, Ke; Deng, Yamin; Liang, Xing; Ma, Teng; Wang, Yanxin
2018-05-01
Hydrogeochemical analysis and multivariate statistics were applied to identify flow patterns and major processes controlling the hydrogeochemistry of groundwater in the Jianghan Plain, which is located in central Yangtze River Basin (central China) and characterized by intensive surface-water/groundwater interaction. Although HCO3-Ca-(Mg) type water predominated in the study area, the 457 (21 surface water and 436 groundwater) samples were effectively classified into five clusters by hierarchical cluster analysis. The hydrochemical variations among these clusters were governed by three factors from factor analysis. Major components (e.g., Ca, Mg and HCO3) in surface water and groundwater originated from carbonate and silicate weathering (factor 1). Redox conditions (factor 2) influenced the geogenic Fe and As contamination in shallow confined groundwater. Anthropogenic activities (factor 3) primarily caused high levels of Cl and SO4 in surface water and phreatic groundwater. Furthermore, the factor score 1 of samples in the shallow confined aquifer gradually increased along the flow paths. This study demonstrates that enhanced information on hydrochemistry in complex groundwater flow systems, by multivariate statistical methods, improves the understanding of groundwater flow and hydrogeochemical evolution due to natural and anthropogenic impacts.
Statistical Downscaling of WRF-Chem Model: An Air Quality Analysis over Bogota, Colombia
NASA Astrophysics Data System (ADS)
Kumar, Anikender; Rojas, Nestor
2015-04-01
Statistical downscaling is a technique that is used to extract high-resolution information from regional scale variables produced by coarse resolution models such as Chemical Transport Models (CTMs). The fully coupled WRF-Chem (Weather Research and Forecasting with Chemistry) model is used to simulate air quality over Bogota. Bogota is a tropical Andean megacity located over a high-altitude plateau in the middle of very complex terrain. The WRF-Chem model was adopted for simulating the hourly ozone concentrations. The computational domains were chosen of 120x120x32, 121x121x32 and 121x121x32 grid points with horizontal resolutions of 27, 9 and 3 km respectively. The model was initialized with real boundary conditions using NCAR-NCEP's Final Analysis (FNL) and a 1ox1o (~111 km x 111 km) resolution. Boundary conditions were updated every 6 hours using reanalysis data. The emission rates were obtained from global inventories, namely the REanalysis of the TROpospheric (RETRO) chemical composition and the Emission Database for Global Atmospheric Research (EDGAR). Multiple linear regression and artificial neural network techniques are used to downscale the model output at each monitoring stations. The results confirm that the statistically downscaled outputs reduce simulated errors by up to 25%. This study provides a general overview of statistical downscaling of chemical transport models and can constitute a reference for future air quality modeling exercises over Bogota and other Colombian cities.
Assessment of water quality parameters using multivariate analysis for Klang River basin, Malaysia.
Mohamed, Ibrahim; Othman, Faridah; Ibrahim, Adriana I N; Alaa-Eldin, M E; Yunus, Rossita M
2015-01-01
This case study uses several univariate and multivariate statistical techniques to evaluate and interpret a water quality data set obtained from the Klang River basin located within the state of Selangor and the Federal Territory of Kuala Lumpur, Malaysia. The river drains an area of 1,288 km(2), from the steep mountain rainforests of the main Central Range along Peninsular Malaysia to the river mouth in Port Klang, into the Straits of Malacca. Water quality was monitored at 20 stations, nine of which are situated along the main river and 11 along six tributaries. Data was collected from 1997 to 2007 for seven parameters used to evaluate the status of the water quality, namely dissolved oxygen, biochemical oxygen demand, chemical oxygen demand, suspended solids, ammoniacal nitrogen, pH, and temperature. The data were first investigated using descriptive statistical tools, followed by two practical multivariate analyses that reduced the data dimensions for better interpretation. The analyses employed were factor analysis and principal component analysis, which explain 60 and 81.6% of the total variation in the data, respectively. We found that the resulting latent variables from the factor analysis are interpretable and beneficial for describing the water quality in the Klang River. This study presents the usefulness of several statistical methods in evaluating and interpreting water quality data for the purpose of monitoring the effectiveness of water resource management. The results should provide more straightforward data interpretation as well as valuable insight for managers to conceive optimum action plans for controlling pollution in river water.
Matiatos, Ioannis
2016-01-15
Nitrate (NO3) is one of the most common contaminants in aquatic environments and groundwater. Nitrate concentrations and environmental isotope data (δ(15)N-NO3 and δ(18)O-NO3) from groundwater of Asopos basin, which has different land-use types, i.e., a large number of industries (e.g., textile, metal processing, food, fertilizers, paint), urban and agricultural areas and livestock breeding facilities, were analyzed to identify the nitrate sources of water contamination and N-biogeochemical transformations. A Bayesian isotope mixing model (SIAR) and multivariate statistical analysis of hydrochemical data were used to estimate the proportional contribution of different NO3 sources and to identify the dominant factors controlling the nitrate content of the groundwater in the region. The comparison of SIAR and Principal Component Analysis showed that wastes originating from urban and industrial zones of the basin are mainly responsible for nitrate contamination of groundwater in these areas. Agricultural fertilizers and manure likely contribute to groundwater contamination away from urban fabric and industrial land-use areas. Soil contribution to nitrate contamination due to organic matter is higher in the south-western part of the area far from the industries and the urban settlements. The present study aims to highlight the use of environmental isotopes combined with multivariate statistical analysis in locating sources of nitrate contamination in groundwater leading to a more effective planning of environmental measures and remediation strategies in river basins and water bodies as defined by the European Water Frame Directive (Directive 2000/60/EC).
MNE software for processing MEG and EEG data
Gramfort, A.; Luessi, M.; Larson, E.; Engemann, D.; Strohmeier, D.; Brodbeck, C.; Parkkonen, L.; Hämäläinen, M.
2013-01-01
Magnetoencephalography and electroencephalography (M/EEG) measure the weak electromagnetic signals originating from neural currents in the brain. Using these signals to characterize and locate brain activity is a challenging task, as evidenced by several decades of methodological contributions. MNE, whose name stems from its capability to compute cortically-constrained minimum-norm current estimates from M/EEG data, is a software package that provides comprehensive analysis tools and workflows including preprocessing, source estimation, time–frequency analysis, statistical analysis, and several methods to estimate functional connectivity between distributed brain regions. The present paper gives detailed information about the MNE package and describes typical use cases while also warning about potential caveats in analysis. The MNE package is a collaborative effort of multiple institutes striving to implement and share best methods and to facilitate distribution of analysis pipelines to advance reproducibility of research. Full documentation is available at http://martinos.org/mne. PMID:24161808
Saiki, Jun; Holcombe, Alex O
2012-03-06
Sudden change of every object in a display is typically conspicuous. We find however that in the presence of a secondary task, with a display of moving dots, it can be difficult to detect a sudden change in color of all the dots. A field of 200 dots, half red and half green, half moving rightward and half moving leftward, gave the appearance of two surfaces. When all 200 dots simultaneously switched color between red and green, performance in detecting the switch was very poor. A key display characteristic was that the color proportions on each surface (summary statistics) were not affected by the color switch. When the color switch is accompanied by a change in these summary statistics, people perform well in detecting the switch, suggesting that the secondary task does not disrupt the availability of this statistical information. These findings suggest that when the change is missed, the old and new colors were represented, but the color-location pattern (binding of colors to locations) was not represented or not compared. Even after extended viewing, changes to the individual color-location pattern are not available, suggesting that the feeling of seeing these details is misleading.
NASA Astrophysics Data System (ADS)
Mfumu Kihumba, Antoine; Vanclooster, Marnik
2013-04-01
Drinking water in Kinshasa, the capital of the Democratic Republic of Congo, is provided by extracting groundwater from the local aquifer, particularly in peripheral areas. The exploited groundwater body is mainly unconfined and located within a continuous detrital aquifer, primarily composed of sedimentary formations. However, the aquifer is subjected to an increasing threat of anthropogenic pollution pressure. Understanding the detailed origin of this pollution pressure is important for sustainable drinking water management in Kinshasa. The present study aims to explain the observed nitrate pollution problem, nitrate being considered as a good tracer for other pollution threats. The analysis is made in terms of physical attributes that are readily available using a statistical modelling approach. For the nitrate data, use was made of a historical groundwater quality assessment study, for which the data were re-analysed. The physical attributes are related to the topography, land use, geology and hydrogeology of the region. Prior to the statistical modelling, intrinsic and specific vulnerability for nitrate pollution was assessed. This vulnerability assessment showed that the alluvium area in the northern part of the region is the most vulnerable area. This area consists of urban land use with poor sanitation. Re-analysis of the nitrate pollution data demonstrated that the spatial variability of nitrate concentrations in the groundwater body is high, and coherent with the fragmented land use of the region and the intrinsic and specific vulnerability maps. For the statistical modeling use was made of multiple regression and regression tree analysis. The results demonstrated the significant impact of land use variables on the Kinshasa groundwater nitrate pollution and the need for a detailed delineation of groundwater capture zones around the monitoring stations. Key words: Groundwater , Isotopic, Kinshasa, Modelling, Pollution, Physico-chemical.
Ghosal, Kavita; Pandey, Naren; Bhattacharya, Swati Gupta
2015-01-01
Pollen grains released by plants are dispersed into the air and can become trapped in human nasal mucosa, causing immediate release of allergens triggering severe Type 1 hypersensitivity reactions in susceptible allergic patients. Recent epidemiologic data show that 11-12% of people suffer from this type of disorders in India. Hence, it is important to examine whether pollen grains have a role in dissipating respiratory problems, including allergy and astma, in a subtropical suburban city. Meteorological data were collected for a period of two years, together with aerobiological sampling with a Burkard sampler. A pollen calendar was prepared for the city. A health survey and the hospitalization rate of local people for the above problems were documented following statistical analysis between pollen counts and the data from the two above-mentioned sources. Skin Prick Test and Indirect ELISA were performer for the identification of allergenic pollen grains. Bio-monitoring results showed that a total of 36 species of pollen grains were located in the air of the study area, where their presence is controlled by many important meteorological parameters proved from SPSS statistical analysis and by their blooming periods. Statistical analysis showed that there is a high positive correlation of monthly pollen counts with the data from the survey and hospital. Biochemical tests revealed the allergic nature of pollen grains of many local species found in the sampler. Bio-monitoring, together with statistical and biochemical results, leave no doubt about the role of pollen as a bio-pollutant. General knowledge about pollen allergy and specific allergenic pollen grains of a particular locality could be a good step towards better health for the cosmopolitan suburban city.
NASA Astrophysics Data System (ADS)
Guimarães Nobre, Gabriela; Arnbjerg-Nielsen, Karsten; Rosbjerg, Dan; Madsen, Henrik
2016-04-01
Traditionally, flood risk assessment studies have been carried out from a univariate frequency analysis perspective. However, statistical dependence between hydrological variables, such as extreme rainfall and extreme sea surge, is plausible to exist, since both variables to some extent are driven by common meteorological conditions. Aiming to overcome this limitation, multivariate statistical techniques has the potential to combine different sources of flooding in the investigation. The aim of this study was to apply a range of statistical methodologies for analyzing combined extreme hydrological variables that can lead to coastal and urban flooding. The study area is the Elwood Catchment, which is a highly urbanized catchment located in the city of Port Phillip, Melbourne, Australia. The first part of the investigation dealt with the marginal extreme value distributions. Two approaches to extract extreme value series were applied (Annual Maximum and Partial Duration Series), and different probability distribution functions were fit to the observed sample. Results obtained by using the Generalized Pareto distribution demonstrate the ability of the Pareto family to model the extreme events. Advancing into multivariate extreme value analysis, first an investigation regarding the asymptotic properties of extremal dependence was carried out. As a weak positive asymptotic dependence between the bivariate extreme pairs was found, the Conditional method proposed by Heffernan and Tawn (2004) was chosen. This approach is suitable to model bivariate extreme values, which are relatively unlikely to occur together. The results show that the probability of an extreme sea surge occurring during a one-hour intensity extreme precipitation event (or vice versa) can be twice as great as what would occur when assuming independent events. Therefore, presuming independence between these two variables would result in severe underestimation of the flooding risk in the study area.
Substorm injection boundaries. [magnetospheric electric field model
NASA Technical Reports Server (NTRS)
Mcilwain, C. E.
1974-01-01
An improved magnetospheric electric field model is used to compute the initial locations of particles injected by several substorms. Trajectories are traced from the time of their encounter with the ATS-5 satellite backwards to the onset time given by ground-based magnetometers. A spiral shaped inner boundary of injection is found which is quite similar to that found by a statistical analysis. This injection boundary is shown to move in an energy dependent fashion which can explain the soft energy spectra observed at the inner edge of the electrons plasma sheet.
Impacts of mining on water and soil.
Warhate, S R; Yenkie, M K N; Pokale, W K
2007-04-01
Out of seven coal mines situated in Wardha River Valley located at Wani (Dist. Yavatmal), five open caste coal mines are run by Western Coal Field Ltd, India. The results of 25 water and 19 soil samples (including one over burden) from Nilapur, Bramhani, Kolera, Gowari, Pimpari and Aheri for their pH, TDS, hardness, alkalinity, fluoride, chloride, nitrite, nitrate, phosphate, sulfate, cadmium, lead, zinc, copper, nickel, arsenic, manganese, sodium and potassium are studied in the present work. Statistical analysis and graphical presentation of the results are discussed in this paper.
Applications of geostatistics and Markov models for logo recognition
NASA Astrophysics Data System (ADS)
Pham, Tuan
2003-01-01
Spatial covariances based on geostatistics are extracted as representative features of logo or trademark images. These spatial covariances are different from other statistical features for image analysis in that the structural information of an image is independent of the pixel locations and represented in terms of spatial series. We then design a classifier in the sense of hidden Markov models to make use of these geostatistical sequential data to recognize the logos. High recognition rates are obtained from testing the method against a public-domain logo database.
Modulation of spatial attention by goals, statistical learning, and monetary reward.
Jiang, Yuhong V; Sha, Li Z; Remington, Roger W
2015-10-01
This study documented the relative strength of task goals, visual statistical learning, and monetary reward in guiding spatial attention. Using a difficult T-among-L search task, we cued spatial attention to one visual quadrant by (i) instructing people to prioritize it (goal-driven attention), (ii) placing the target frequently there (location probability learning), or (iii) associating that quadrant with greater monetary gain (reward-based attention). Results showed that successful goal-driven attention exerted the strongest influence on search RT. Incidental location probability learning yielded a smaller though still robust effect. Incidental reward learning produced negligible guidance for spatial attention. The 95 % confidence intervals of the three effects were largely nonoverlapping. To understand these results, we simulated the role of location repetition priming in probability cuing and reward learning. Repetition priming underestimated the strength of location probability cuing, suggesting that probability cuing involved long-term statistical learning of how to shift attention. Repetition priming provided a reasonable account for the negligible effect of reward on spatial attention. We propose a multiple-systems view of spatial attention that includes task goals, search habit, and priming as primary drivers of top-down attention.
Modulation of spatial attention by goals, statistical learning, and monetary reward
Sha, Li Z.; Remington, Roger W.
2015-01-01
This study documented the relative strength of task goals, visual statistical learning, and monetary reward in guiding spatial attention. Using a difficult T-among-L search task, we cued spatial attention to one visual quadrant by (i) instructing people to prioritize it (goal-driven attention), (ii) placing the target frequently there (location probability learning), or (iii) associating that quadrant with greater monetary gain (reward-based attention). Results showed that successful goal-driven attention exerted the strongest influence on search RT. Incidental location probability learning yielded a smaller though still robust effect. Incidental reward learning produced negligible guidance for spatial attention. The 95 % confidence intervals of the three effects were largely nonoverlapping. To understand these results, we simulated the role of location repetition priming in probability cuing and reward learning. Repetition priming underestimated the strength of location probability cuing, suggesting that probability cuing involved long-term statistical learning of how to shift attention. Repetition priming provided a reasonable account for the negligible effect of reward on spatial attention. We propose a multiple-systems view of spatial attention that includes task goals, search habit, and priming as primary drivers of top-down attention. PMID:26105657
Spatial epidemiology of suspected clinical leptospirosis in Sri Lanka.
Robertson, C; Nelson, T A; Stephen, C
2012-04-01
Leptospirosis is one of the most widespread zoonoses in the world. A large outbreak of suspected human leptospirosis began in Sri Lanka during 2008. This study investigated spatial variables associated with suspected leptospirosis risk during endemic and outbreak periods. Data were obtained for monthly numbers of reported cases of suspected clinical leptospirosis for 2005-2009 for all of Sri Lanka. Space-time scan statistics were combined with regression modelling to test associations during endemic and outbreak periods. The cross-correlation function was used to test association between rainfall and leptospirosis at four locations. During the endemic period (2005-2007), leptospirosis risk was positively associated with shorter average distance to rivers and with higher percentage of agriculture made up of farms <0·20 hectares. Temporal correlation analysis of suspected leptospirosis cases and rainfall revealed a 2-month lag in rainfall-case association during the baseline period. Outbreak locations in 2008 were characterized by shorter distance to rivers and higher population density. The analysis suggests the possibility of household transmission in densely populated semi-urban villages as a defining characteristic of the outbreak. The role of rainfall in the outbreak remains to be investigated, although analysis here suggests a more complex relationship than simple correlation.
Wavelet Transform Based Higher Order Statistical Analysis of Wind and Wave Time Histories
NASA Astrophysics Data System (ADS)
Habib Huseni, Gulamhusenwala; Balaji, Ramakrishnan
2017-10-01
Wind, blowing on the surface of the ocean, imparts the energy to generate the waves. Understanding the wind-wave interactions is essential for an oceanographer. This study involves higher order spectral analyses of wind speeds and significant wave height time histories, extracted from European Centre for Medium-Range Weather Forecast database at an offshore location off Mumbai coast, through continuous wavelet transform. The time histories were divided by the seasons; pre-monsoon, monsoon, post-monsoon and winter and the analysis were carried out to the individual data sets, to assess the effect of various seasons on the wind-wave interactions. The analysis revealed that the frequency coupling of wind speeds and wave heights of various seasons. The details of data, analysing technique and results are presented in this paper.
Three Dimensional CFD Analysis of the GTX Combustor
NASA Technical Reports Server (NTRS)
Steffen, C. J., Jr.; Bond, R. B.; Edwards, J. R.
2002-01-01
The annular combustor geometry of a combined-cycle engine has been analyzed with three-dimensional computational fluid dynamics. Both subsonic combustion and supersonic combustion flowfields have been simulated. The subsonic combustion analysis was executed in conjunction with a direct-connect test rig. Two cold-flow and one hot-flow results are presented. The simulations compare favorably with the test data for the two cold flow calculations; the hot-flow data was not yet available. The hot-flow simulation indicates that the conventional ejector-ramjet cycle would not provide adequate mixing at the conditions tested. The supersonic combustion ramjet flowfield was simulated with frozen chemistry model. A five-parameter test matrix was specified, according to statistical design-of-experiments theory. Twenty-seven separate simulations were used to assemble surrogate models for combustor mixing efficiency and total pressure recovery. ScramJet injector design parameters (injector angle, location, and fuel split) as well as mission variables (total fuel massflow and freestream Mach number) were included in the analysis. A promising injector design has been identified that provides good mixing characteristics with low total pressure losses. The surrogate models can be used to develop performance maps of different injector designs. Several complex three-way variable interactions appear within the dataset that are not adequately resolved with the current statistical analysis.
Coccioni, Rodolfo; Frontalini, Fabrizio; Marsili, Andrea; Mana, Davide
2009-01-01
Living benthic foraminiferal assemblages were studied in surface samples collected from the lagoon of Venice (Italy) in order to investigate the relationship between these sensitive microorganisms and trace element pollution. Geochemical analysis of sediments shows that the lagoon is affected by trace element pollution (Cd, Cu, Ni, Pb, Zn and Hg) with the highest concentrations in its inner part, which corresponds to the Porto Marghera industrial area. The biocenosis are largely dominated by Ammonia tepida, Haynesina germanica and Cribroelphidium oceanensis and, subordinately, by Aubignyna perlucida, Ammonia parkinsoniana and Bolivina striatula. Biotic and abiotic factors were statistically analyzed with multivariate technique of cluster analysis and principal component analysis. The statistical analysis reveals a strong relationship between trace elements (in particular Mn, Pb and Hg) and the occurrence of abnormalities in foraminiferal tests. Remarkably, greater proportions of abnormal specimens are usually found at stations located close to the heaviest polluted industrial zone of Porto Marghera. This paper shows that benthic foraminifera can be used as useful and relatively speedy and inexpensive bio-indicators in monitoring the health quality of the lagoon of Venice. It also provides a basis for future investigations aimed at unraveling the benthic foraminiferal response to human-induced pollution in marine and transitional marine environments.
2012-01-01
Background Quantitative trait loci (QTL) detection on a huge amount of phenotypes, like eQTL detection on transcriptomic data, can be dramatically impaired by the statistical properties of interval mapping methods. One of these major outcomes is the high number of QTL detected at marker locations. The present study aims at identifying and specifying the sources of this bias, in particular in the case of analysis of data issued from outbred populations. Analytical developments were carried out in a backcross situation in order to specify the bias and to propose an algorithm to control it. The outbred population context was studied through simulated data sets in a wide range of situations. The likelihood ratio test was firstly analyzed under the "one QTL" hypothesis in a backcross population. Designs of sib families were then simulated and analyzed using the QTL Map software. On the basis of the theoretical results in backcross, parameters such as the population size, the density of the genetic map, the QTL effect and the true location of the QTL, were taken into account under the "no QTL" and the "one QTL" hypotheses. A combination of two non parametric tests - the Kolmogorov-Smirnov test and the Mann-Whitney-Wilcoxon test - was used in order to identify the parameters that affected the bias and to specify how much they influenced the estimation of QTL location. Results A theoretical expression of the bias of the estimated QTL location was obtained for a backcross type population. We demonstrated a common source of bias under the "no QTL" and the "one QTL" hypotheses and qualified the possible influence of several parameters. Simulation studies confirmed that the bias exists in outbred populations under both the hypotheses of "no QTL" and "one QTL" on a linkage group. The QTL location was systematically closer to marker locations than expected, particularly in the case of low QTL effect, small population size or low density of markers, i.e. designs with low power. Practical recommendations for experimental designs for QTL detection in outbred populations are given on the basis of this bias quantification. Furthermore, an original algorithm is proposed to adjust the location of a QTL, obtained with interval mapping, which co located with a marker. Conclusions Therefore, one should be attentive when one QTL is mapped at the location of one marker, especially under low power conditions. PMID:22520935
Lefebvre, Alexandre; Rochefort, Gael Y.; Santos, Frédéric; Le Denmat, Dominique; Salmon, Benjamin; Pétillon, Jean-Marc
2016-01-01
Over the last decade, biomedical 3D-imaging tools have gained widespread use in the analysis of prehistoric bone artefacts. While initial attempts to characterise the major categories used in osseous industry (i.e. bone, antler, and dentine/ivory) have been successful, the taxonomic determination of prehistoric artefacts remains to be investigated. The distinction between reindeer and red deer antler can be challenging, particularly in cases of anthropic and/or taphonomic modifications. In addition to the range of destructive physicochemical identification methods available (mass spectrometry, isotopic ratio, and DNA analysis), X-ray micro-tomography (micro-CT) provides convincing non-destructive 3D images and analyses. This paper presents the experimental protocol (sample scans, image processing, and statistical analysis) we have developed in order to identify modern and archaeological antler collections (from Isturitz, France). This original method is based on bone microstructure analysis combined with advanced statistical support vector machine (SVM) classifiers. A combination of six microarchitecture biomarkers (bone volume fraction, trabecular number, trabecular separation, trabecular thickness, trabecular bone pattern factor, and structure model index) were screened using micro-CT in order to characterise internal alveolar structure. Overall, reindeer alveoli presented a tighter mesh than red deer alveoli, and statistical analysis allowed us to distinguish archaeological antler by species with an accuracy of 96%, regardless of anatomical location on the antler. In conclusion, micro-CT combined with SVM classifiers proves to be a promising additional non-destructive method for antler identification, suitable for archaeological artefacts whose degree of human modification and cultural heritage or scientific value has previously made it impossible (tools, ornaments, etc.). PMID:26901355
Methods for estimating flow-duration and annual mean-flow statistics for ungaged streams in Oklahoma
Esralew, Rachel A.; Smith, S. Jerrod
2010-01-01
Flow statistics can be used to provide decision makers with surface-water information needed for activities such as water-supply permitting, flow regulation, and other water rights issues. Flow statistics could be needed at any location along a stream. Most often, streamflow statistics are needed at ungaged sites, where no flow data are available to compute the statistics. Methods are presented in this report for estimating flow-duration and annual mean-flow statistics for ungaged streams in Oklahoma. Flow statistics included the (1) annual (period of record), (2) seasonal (summer-autumn and winter-spring), and (3) 12 monthly duration statistics, including the 20th, 50th, 80th, 90th, and 95th percentile flow exceedances, and the annual mean-flow (mean of daily flows for the period of record). Flow statistics were calculated from daily streamflow information collected from 235 streamflow-gaging stations throughout Oklahoma and areas in adjacent states. A drainage-area ratio method is the preferred method for estimating flow statistics at an ungaged location that is on a stream near a gage. The method generally is reliable only if the drainage-area ratio of the two sites is between 0.5 and 1.5. Regression equations that relate flow statistics to drainage-basin characteristics were developed for the purpose of estimating selected flow-duration and annual mean-flow statistics for ungaged streams that are not near gaging stations on the same stream. Regression equations were developed from flow statistics and drainage-basin characteristics for 113 unregulated gaging stations. Separate regression equations were developed by using U.S. Geological Survey streamflow-gaging stations in regions with similar drainage-basin characteristics. These equations can increase the accuracy of regression equations used for estimating flow-duration and annual mean-flow statistics at ungaged stream locations in Oklahoma. Streamflow-gaging stations were grouped by selected drainage-basin characteristics by using a k-means cluster analysis. Three regions were identified for Oklahoma on the basis of the clustering of gaging stations and a manual delineation of distinguishable hydrologic and geologic boundaries: Region 1 (western Oklahoma excluding the Oklahoma and Texas Panhandles), Region 2 (north- and south-central Oklahoma), and Region 3 (eastern and central Oklahoma). A total of 228 regression equations (225 flow-duration regressions and three annual mean-flow regressions) were developed using ordinary least-squares and left-censored (Tobit) multiple-regression techniques. These equations can be used to estimate 75 flow-duration statistics and annual mean-flow for ungaged streams in the three regions. Drainage-basin characteristics that were statistically significant independent variables in the regression analyses were (1) contributing drainage area; (2) station elevation; (3) mean drainage-basin elevation; (4) channel slope; (5) percentage of forested canopy; (6) mean drainage-basin hillslope; (7) soil permeability; and (8) mean annual, seasonal, and monthly precipitation. The accuracy of flow-duration regression equations generally decreased from high-flow exceedance (low-exceedance probability) to low-flow exceedance (high-exceedance probability) . This decrease may have happened because a greater uncertainty exists for low-flow estimates and low-flow is largely affected by localized geology that was not quantified by the drainage-basin characteristics selected. The standard errors of estimate of regression equations for Region 1 (western Oklahoma) were substantially larger than those standard errors for other regions, especially for low-flow exceedances. These errors may be a result of greater variability in low flow because of increased irrigation activities in this region. Regression equations may not be reliable for sites where the drainage-basin characteristics are outside the range of values of independent vari
Spatiotemporal patterns of ERP based on combined ICA-LORETA analysis
NASA Astrophysics Data System (ADS)
Zhang, Jiacai; Guo, Taomei; Xu, Yaqin; Zhao, Xiaojie; Yao, Li
2007-03-01
In contrast to the FMRI methods widely used up to now, this method try to understand more profoundly how the brain systems work under sentence processing task map accurately the spatiotemporal patterns of activity of the large neuronal populations in the human brain from the analysis of ERP data recorded on the brain scalp. In this study, an event-related brain potential (ERP) paradigm to record the on-line responses to the processing of sentences is chosen as an example. In order to give attention to both utilizing the ERPs' temporal resolution of milliseconds and overcoming the insensibility of cerebral location ERP sources, we separate these sources in space and time based on a combined method of independent component analysis (ICA) and low-resolution tomography (LORETA) algorithms. ICA blindly separate the input ERP data into a sum of temporally independent and spatially fixed components arising from distinct or overlapping brain or extra-brain sources. And then the spatial maps associated with each ICA component are analyzed, with use of LORETA to uniquely locate its cerebral sources throughout the full brain according to the assumption that neighboring neurons are simultaneously and synchronously activated. Our results show that the cerebral computation mechanism underlies content words reading is mediated by the orchestrated activity of several spatially distributed brain sources located in the temporal, frontal, and parietal areas, and activate at distinct time intervals and are grouped into different statistically independent components. Thus ICA-LORETA analysis provides an encouraging and effective method to study brain dynamics from ERP.
Polar azimuth diversity HF propagation experiment
NASA Astrophysics Data System (ADS)
Baker, Kurt A.; Haines, D. M.; Weijers, Bertus
1986-03-01
Presented are the results of an HF Azimuth Diversity Propagation Experiment conducted by RADC over several paths, transauroral and polar, separated in azimuth by 30, 70, and 100 degrees, as part of the RADC Adaptive HF Propagation Program. The data presented give the occurrence of several ionospheric characteristics important to the operation of HF networks in a disturbed environment. The analysis was performed on data collected during the four seasonal periods to obtain statistical samples representative of each season under slightly disturbed as well as quiet conditions. The system used to collect the data was a network of three chirpsounder transmitters and one receiver, each sweeping over a frequency range of 2 to 30 MHz, once every five minutes. The transmitters were located at Ava, N.Y., Grand Forks, N. Dak., and Barter Island, Alaska. The receiving system was located at Thule Air Base, Greenland.
On the choice of statistical models for estimating occurrence and extinction from animal surveys
Dorazio, R.M.
2007-01-01
In surveys of natural animal populations the number of animals that are present and available to be detected at a sample location is often low, resulting in few or no detections. Low detection frequencies are especially common in surveys of imperiled species; however, the choice of sampling method and protocol also may influence the size of the population that is vulnerable to detection. In these circumstances, probabilities of animal occurrence and extinction will generally be estimated more accurately if the models used in data analysis account for differences in abundance among sample locations and for the dependence between site-specific abundance and detection. Simulation experiments are used to illustrate conditions wherein these types of models can be expected to outperform alternative estimators of population site occupancy and extinction. ?? 2007 by the Ecological Society of America.
Bayesian Spatial Design of Optimal Deep Tubewell Locations in Matlab, Bangladesh.
Warren, Joshua L; Perez-Heydrich, Carolina; Yunus, Mohammad
2013-09-01
We introduce a method for statistically identifying the optimal locations of deep tubewells (dtws) to be installed in Matlab, Bangladesh. Dtw installations serve to mitigate exposure to naturally occurring arsenic found at groundwater depths less than 200 meters, a serious environmental health threat for the population of Bangladesh. We introduce an objective function, which incorporates both arsenic level and nearest town population size, to identify optimal locations for dtw placement. Assuming complete knowledge of the arsenic surface, we then demonstrate how minimizing the objective function over a domain favors dtws placed in areas with high arsenic values and close to largely populated regions. Given only a partial realization of the arsenic surface over a domain, we use a Bayesian spatial statistical model to predict the full arsenic surface and estimate the optimal dtw locations. The uncertainty associated with these estimated locations is correctly characterized as well. The new method is applied to a dataset from a village in Matlab and the estimated optimal locations are analyzed along with their respective 95% credible regions.
Diabetic foot ulcer incidence in relation to plantar pressure magnitude and measurement location.
Ledoux, William R; Shofer, Jane B; Cowley, Matthew S; Ahroni, Jessie H; Cohen, Victoria; Boyko, Edward J
2013-01-01
We prospectively examined the relationship between site-specific peak plantar pressure (PPP) and ulcer risk. Researchers have previously reported associations between diabetic foot ulcer and elevated plantar foot pressure, but the effect of location-specific pressures has not been studied. Diabetic subjects (n=591) were enrolled from a single VA hospital. Five measurements of in-shoe plantar pressure were collected using F-Scan. Pressures were measured at 8 areas: heel, lateral midfoot, medial midfoot, first metatarsal, second through fourth metatarsal, fifth metatarsal, hallux, and other toes. The relationship between incident plantar foot ulcer and PPP or pressure-time integral (PTI) was assessed using Cox regression. During follow-up (2.4years), 47 subjects developed plantar ulcers (10 heel, 12 metatarsal, 19 hallux, 6 other). Overall mean PPP was higher for ulcer subjects (219 vs. 194kPa), but the relationship differed by site (the metatarsals with ulcers had higher pressure, while the opposite was true for the hallux and heel). A statistical analysis was not performed on the means, but hazard ratios from a Cox survival analysis were nonsignificant for PPP across all sites and when adjusted for location. However, when the metatarsals were considered separately, higher baseline PPP was significantly associated with greater ulcer risk; at other sites, this relationship was nonsignificant. Hazard ratios for all PTI data were nonsignificant. Location must be considered when assessing the relationship between PPP and plantar ulceration. © 2013.
Zanesco, Caroline; Só, Marcus Vinicius Reis; Schmidt, Sabrina; Fontanella, Vania Regina Camargo; Grazziotin-Soares, Renata; Barletta, Fernando Branco
2017-03-01
This study aimed to evaluate apical transportation (AT), centering ratio (CR), and volume increase (VI) produced after instrumentation of mesiobuccal canals of maxillary molars with hand files, rotary, and reciprocating instruments using micro-computed tomographic (micro-CT) imaging and to demonstrate the ability of digital subtraction radiography (DSR) to evaluate AT. Forty-five canals were randomly assigned to either group K, manual K-files; PTN, ProTaper Next (Dentsply Maillefer, Ballaigues, Switzerland); or Rec, Reciproc (n = 15 for each group) for preparation. Master apical files were #25, X2 (#25/06), and R25 (#25/08), respectively. Micro-CT imaging was used to measure AT (mm) and CR (mm) at 3 different locations (1, 4, and 7 mm from the apex). VI (mm 3 ) was measured for each root third and for the whole canal. DSR (mesiodistal and buccolingual projections) was used to measure AT at 1 mm from the apex. AT and CR values were statistically similar across the groups at 1, 4, and 7 mm. AT results obtained for the different locations were similar within each group; CR, in turn, showed statistically lower values at 1 mm. VI was statistically similar in all groups. Both DSR and micro-CT imaging showed that AT always occurred on the outside of canal curvature. The highest mean value obtained for AT was 0.215 mm. AT, CR, and VI were similar for the K, PTN, and Rec groups. AT results were clinically irrelevant. DSR was as effective as micro-CT imaging in AT analysis and could be considered as an alternative method for assessing this outcome. Copyright © 2016 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Imaging predictors of poststroke depression: methodological factors in voxel-based analysis
Gozzi, Sophia A; Wood, Amanda G; Chen, Jian; Vaddadi, Krishnarao; Phan, Thanh G
2014-01-01
Objective The purpose of this study was to explore the relationship between lesion location and poststroke depression using statistical parametric mapping. Methods First episode patients with stroke were assessed within 12 days and at 1-month poststroke. Patients with an a priori defined cut-off score of 11 on the Hospital Anxiety and Depression Scale (HADS) at follow-up were further assessed using the Mini-International Neuropsychiatric Interview (MINI) to confirm a clinical diagnosis of major or minor depression in accordance with Diagnostic and Statistical Manual-IV (DSM-IV) inclusion criteria. Participants were included if they were aged 18–85 years, proficient in English and eligible for MRI. Patients were excluded if they had a confounding diagnosis such as major depressive disorder at the time of admission, a neurodegenerative disease, epilepsy or an imminently life-threatening comorbid illness, subarachnoid or subdural stroke, a second episode of stroke before follow-up and/or a serious impairment of consciousness or language. Infarcts observed on MRI scans were manually segmented into binary images, linearly registered into a common stereotaxic coordinate space. Using statistical parametric mapping, we compared infarct patterns in patients with stroke with and without depression. Results 27% (15/55 patients) met criteria for depression at follow-up. Mean infarct volume was 19±53 mL and National Institute of Health Stroke Scale (NIHSS) at Time 1 (within 12 days of stroke) was 4±4, indicating a sample of mild strokes. No voxels or clusters were significant after a multiple comparison correction was applied (p>0.05). Examination of infarct maps showed that there was minimal overlap of infarct location between patients, thus invalidating the voxel comparison analysis. Conclusions This study provided inconclusive evidence for the association between infarcts in a specific region and poststroke depression. PMID:25001395
Gupta, A K; Nag, Subhankar; Mukhopadhyay, U K
2006-04-01
In this study, the relationship between inhalable particulate (PM(10)), fine particulate (PM(2.5)), coarse particles (PM(2.5 - 10)) and meteorological parameters such as temperature, relative humidity, solar radiation, wind speed were statistically analyzed and modelled for urban area of Kolkata during winter months of 2003-2004. Ambient air quality was monitored with a sampling frequency of twenty-four hours at three monitoring sites located near traffic intersections and in an industrial area. The monitoring sites were located 3-5 m above ground near highly trafficked and congested areas. The 24 h average PM(10) and PM(2.5) samples were collected using Thermo-Andersen high volume samplers and exposed filter papers were extracted and analysed for benzene soluble organic fraction. The ratios between PM(2.5) and PM(10) were found to be in the range of 0.6 to 0.92 and the highest ratio was found in the most polluted urban site. Statistical analysis has shown a strong positive correlation between PM(10) and PM(2.5) and inverse correlation was observed between particulate matter (PM(10) and PM(2.5)) and wind speed. Statistical analysis of air quality data shows that PM(10) and PM(2.5) are showing poor correlation with temperature, relative humidity and solar radiation. Regression equations for PM(10) and PM(2.5) and meteorological parameters were developed. The organic fraction of particulate matter soluble in benzene is an indication of poly aromatic hydrocarbon (PAH) concentration present in particulate matter. The relationship between the benzene soluble organic fraction (BSOF) of inhalable particulate (PM(10)) and fine particulate (PM(2.5)) were analysed for urban area of Kolkata. Significant positive correlation was observed between benzene soluble organic fraction of PM(10) (BSM10) and benzene soluble organic fraction of PM(2.5) (BSM2.5). Regression equations for BSM10 and BSM2.5 were developed.
Integrated environmental monitoring and multivariate data analysis-A case study.
Eide, Ingvar; Westad, Frank; Nilssen, Ingunn; de Freitas, Felipe Sales; Dos Santos, Natalia Gomes; Dos Santos, Francisco; Cabral, Marcelo Montenegro; Bicego, Marcia Caruso; Figueira, Rubens; Johnsen, Ståle
2017-03-01
The present article describes integration of environmental monitoring and discharge data and interpretation using multivariate statistics, principal component analysis (PCA), and partial least squares (PLS) regression. The monitoring was carried out at the Peregrino oil field off the coast of Brazil. One sensor platform and 3 sediment traps were placed on the seabed. The sensors measured current speed and direction, turbidity, temperature, and conductivity. The sediment trap samples were used to determine suspended particulate matter that was characterized with respect to a number of chemical parameters (26 alkanes, 16 PAHs, N, C, calcium carbonate, and Ba). Data on discharges of drill cuttings and water-based drilling fluid were provided on a daily basis. The monitoring was carried out during 7 campaigns from June 2010 to October 2012, each lasting 2 to 3 months due to the capacity of the sediment traps. The data from the campaigns were preprocessed, combined, and interpreted using multivariate statistics. No systematic difference could be observed between campaigns or traps despite the fact that the first campaign was carried out before drilling, and 1 of 3 sediment traps was located in an area not expected to be influenced by the discharges. There was a strong covariation between suspended particulate matter and total N and organic C suggesting that the majority of the sediment samples had a natural and biogenic origin. Furthermore, the multivariate regression showed no correlation between discharges of drill cuttings and sediment trap or turbidity data taking current speed and direction into consideration. Because of this lack of correlation with discharges from the drilling location, a more detailed evaluation of chemical indicators providing information about origin was carried out in addition to numerical modeling of dispersion and deposition. The chemical indicators and the modeling of dispersion and deposition support the conclusions from the multivariate statistics. Integr Environ Assess Manag 2017;13:387-395. © 2016 SETAC. © 2016 SETAC.
NASA Astrophysics Data System (ADS)
Gasc, J.; Brantut, N.; Schubnel, A.; Brunet, F.; Mueller, H.
2008-12-01
We have monitored from in-situ X-ray diffraction coupled to Acoustic Emission (AE) imaging, the behavior of a fine grained synthetic calcite aggregate, at 0.66 GPa and for temperatures ranging from ambient to 1200° C. The powder sample was placed in a boron-epoxy assembly with an 8 mm edge-length and loaded in the MAX80 cubic multi-anvil press installed on the German synchrotron (HASYLAB-DESY, Hamburg). AE were recorded using five piezoceramic transducers (5 MHz eigen frequency) glued on each of the five WC anvils (4 side anvils and upper one). Full waveforms were acquired using an eight channel digital oscilloscope and located using the software Insite (ASC Ltd). Beyond 600° C, calcite grains started growing as evidenced by huge changes in the relative intensity of the diffraction lines. This is correlated to a sudden burst of AE which all located within the sample volume. These AE may indicate that stress relaxation, going on as intra-crystalline plasticity mechanisms were activated, released enough acoustic energy to be recorded and located. Although the diffraction data showed that grain growth continued beyond 800° C, the acoustic activity progressively decreased to below the sensitivity of our recording device (i.e. the triggering level). However, at temperature higher than 1000° C, a large number of AE were recorded again ( 2000 events). AE location revealed that the AE front progressed inwards the sample. The complete loss of diffraction signal and the post-mortem recovery of small amounts of CaO suggest that the second AE burst may be related to calcite melting/decarbonation. Perspectives include thorough microstructural analysis of the samples using electron microscopies (SEM and TEM) as well as a statistical and mechanical analysis of the acoustic data.
Classification and Space-Time Analysis of Precipitation Events in Manizales, Caldas, Colombia.
NASA Astrophysics Data System (ADS)
Suarez Hincapie, J. N.; Vélez, J.; Romo Melo, L.; Chang, P.
2015-12-01
Manizales is a mid-mountain Andean city located near the Nevado del Ruiz volcano in west-central Colombia, this location exposes it to earthquakes, floods, landslides and volcanic eruptions. It is located in the intertropical convergence zone (ITCZ) and presents a climate with a bimodal rainfall regime (Cortés, 2010). Its mean annual rainfall is 2000 mm, one may observe precipitation 70% of the days over a year. This rain which favors the formation of large masses of clouds and the presence of macroclimatic phenomenon as "El Niño South Oscillation", has historically caused great impacts in the region (Vélez et al, 2012). For example the geographical location coupled with rain events results in a high risk of landslides in the city. Manizales has a hydrometeorological network of 40 stations that measure and transmit data of up to eight climate variables. Some of these stations keep 10 years of historical data. However, until now this information has not been used for space-time classification of precipitation events, nor has the meteorological variables that influence them been thoroughly researched. The purpose of this study was to classify historical events of rain in an urban area of Manizales and investigate patterns of atmospheric behavior that influence or trigger such events. Classification of events was performed by calculating the "n" index of the heavy rainfall, describing the behavior of precipitation as a function of time throughout the event (Monjo, 2009). The analysis of meteorological variables was performed using statistical quantification over variable time periods before each event. The proposed classification allowed for an analysis of the evolution of rainfall events. Specially, it helped to look for the influence of different meteorological variables triggering rainfall events in hazardous areas as the city of Manizales.
Statistical analyses and characteristics of volcanic tremor on Stromboli Volcano (Italy)
NASA Astrophysics Data System (ADS)
Falsaperla, S.; Langer, H.; Spampinato, S.
A study of volcanic tremor on Stromboli is carried out on the basis of data recorded daily between 1993 and 1995 by a permanent seismic station (STR) located 1.8km away from the active craters. We also consider the signal of a second station (TF1), which operated for a shorter time span. Changes in the spectral tremor characteristics can be related to modifications in volcanic activity, particularly to lava effusions and explosive sequences. Statistical analyses were carried out on a set of spectra calculated daily from seismic signals where explosion quakes were present or excluded. Principal component analysis and cluster analysis were applied to identify different classes of spectra. Three clusters of spectra are associated with two different states of volcanic activity. One cluster corresponds to a state of low to moderate activity, whereas the two other clusters are present during phases with a high magma column as inferred from the occurrence of lava fountains or effusions. We therefore conclude that variations in volcanic activity at Stromboli are usually linked to changes in the spectral characteristics of volcanic tremor. Site effects are evident when comparing the spectra calculated from signals synchronously recorded at STR and TF1. However, some major spectral peaks at both stations may reflect source properties. Statistical considerations and polarization analysis are in favor of a prevailing presence of P-waves in the tremor signal along with a position of the source northwest of the craters and at shallow depth.
Three-dimensional trend mapping from wire-line logs
Doveton, J.H.; Ke-an, Z.
1985-01-01
Mapping of lithofacies and porosities of stratigraphic units is complicated because these properties vary in three dimensions. The method of moments was proposed by Krumbein and Libby (1957) as a technique to aid in resolving this problem. Moments are easily computed from wireline logs and are simple statistics which summarize vertical variation in a log trace. Combinations of moment maps have proved useful in understanding vertical and lateral changes in lithology of sedimentary rock units. Although moments have meaning both as statistical descriptors and as mechanical properties, they also define polynomial curves which approximate lithologic changes as a function of depth. These polynomials can be fitted by least-squares methods, partitioning major trends in rock properties from finescale fluctuations. Analysis of variance yields the degree of fit of any polynomial and measures the proportion of vertical variability expressed by any moment or combination of moments. In addition, polynomial curves can be differentiated to determine depths at which pronounced expressions of facies occur and to determine the locations of boundaries between major lithologic subdivisions. Moments can be estimated at any location in an area by interpolating from log moments at control wells. A matrix algebra operation then converts moment estimates to coefficients of a polynomial function which describes a continuous curve of lithologic variation with depth. If this procedure is applied to a grid of geographic locations, the result is a model of variability in three dimensions. Resolution of the model is determined largely by number of moments used in its generation. The method is illustrated with an analysis of lithofacies in the Simpson Group of south-central Kansas; the three-dimensional model is shown as cross sections and slice maps. In this study, the gamma-ray log is used as a measure of shaliness of the unit. However, the method is general and can be applied, for example, to suites of neutron, density, or sonic logs to produce three-dimensional models of porosity in reservoir rocks. ?? 1985 Plenum Publishing Corporation.
Geng, Runzhe; Wang, Xiaoyan; Sharpley, Andrew N.; Meng, Fande
2015-01-01
Best management practices (BMPs) for agricultural diffuse pollution control are implemented at the field or small-watershed scale. However, the benefits of BMP implementation on receiving water quality at multiple spatial is an ongoing challenge. In this paper, we introduce an integrated approach that combines risk assessment (i.e., Phosphorus (P) index), model simulation techniques (Hydrological Simulation Program–FORTRAN), and a BMP placement tool at various scales to identify the optimal location for implementing multiple BMPs and estimate BMP effectiveness after implementation. A statistically significant decrease in nutrient discharge from watersheds is proposed to evaluate the effectiveness of BMPs, strategically targeted within watersheds. Specifically, we estimate two types of cost-effectiveness curves (total pollution reduction and proportion of watersheds improved) for four allocation approaches. Selection of a ‘‘best approach” depends on the relative importance of the two types of effectiveness, which involves a value judgment based on the random/aggregated degree of BMP distribution among and within sub-watersheds. A statistical optimization framework is developed and evaluated in Chaohe River Watershed located in the northern mountain area of Beijing. Results show that BMP implementation significantly (p >0.001) decrease P loss from the watershed. Remedial strategies where BMPs were targeted to areas of high risk of P loss, deceased P loads compared with strategies where BMPs were randomly located across watersheds. Sensitivity analysis indicated that aggregated BMP placement in particular watershed is the most cost-effective scenario to decrease P loss. The optimization approach outlined in this paper is a spatially hierarchical method for targeting nonpoint source controls across a range of scales from field to farm, to watersheds, to regions. Further, model estimates showed targeting at multiple scales is necessary to optimize program efficiency. The integrated model approach described that selects and places BMPs at varying levels of implementation, provides a new theoretical basis and technical guidance for diffuse pollution management in agricultural watersheds. PMID:26313561
Forester, James D; Im, Hae Kyung; Rathouz, Paul J
2009-12-01
Patterns of resource selection by animal populations emerge as a result of the behavior of many individuals. Statistical models that describe these population-level patterns of habitat use can miss important interactions between individual animals and characteristics of their local environment; however, identifying these interactions is difficult. One approach to this problem is to incorporate models of individual movement into resource selection models. To do this, we propose a model for step selection functions (SSF) that is composed of a resource-independent movement kernel and a resource selection function (RSF). We show that standard case-control logistic regression may be used to fit the SSF; however, the sampling scheme used to generate control points (i.e., the definition of availability) must be accommodated. We used three sampling schemes to analyze simulated movement data and found that ignoring sampling and the resource-independent movement kernel yielded biased estimates of selection. The level of bias depended on the method used to generate control locations, the strength of selection, and the spatial scale of the resource map. Using empirical or parametric methods to sample control locations produced biased estimates under stronger selection; however, we show that the addition of a distance function to the analysis substantially reduced that bias. Assuming a uniform availability within a fixed buffer yielded strongly biased selection estimates that could be corrected by including the distance function but remained inefficient relative to the empirical and parametric sampling methods. As a case study, we used location data collected from elk in Yellowstone National Park, USA, to show that selection and bias may be temporally variable. Because under constant selection the amount of bias depends on the scale at which a resource is distributed in the landscape, we suggest that distance always be included as a covariate in SSF analyses. This approach to modeling resource selection is easily implemented using common statistical tools and promises to provide deeper insight into the movement ecology of animals.
Geng, Runzhe; Wang, Xiaoyan; Sharpley, Andrew N; Meng, Fande
2015-01-01
Best management practices (BMPs) for agricultural diffuse pollution control are implemented at the field or small-watershed scale. However, the benefits of BMP implementation on receiving water quality at multiple spatial is an ongoing challenge. In this paper, we introduce an integrated approach that combines risk assessment (i.e., Phosphorus (P) index), model simulation techniques (Hydrological Simulation Program-FORTRAN), and a BMP placement tool at various scales to identify the optimal location for implementing multiple BMPs and estimate BMP effectiveness after implementation. A statistically significant decrease in nutrient discharge from watersheds is proposed to evaluate the effectiveness of BMPs, strategically targeted within watersheds. Specifically, we estimate two types of cost-effectiveness curves (total pollution reduction and proportion of watersheds improved) for four allocation approaches. Selection of a ''best approach" depends on the relative importance of the two types of effectiveness, which involves a value judgment based on the random/aggregated degree of BMP distribution among and within sub-watersheds. A statistical optimization framework is developed and evaluated in Chaohe River Watershed located in the northern mountain area of Beijing. Results show that BMP implementation significantly (p >0.001) decrease P loss from the watershed. Remedial strategies where BMPs were targeted to areas of high risk of P loss, deceased P loads compared with strategies where BMPs were randomly located across watersheds. Sensitivity analysis indicated that aggregated BMP placement in particular watershed is the most cost-effective scenario to decrease P loss. The optimization approach outlined in this paper is a spatially hierarchical method for targeting nonpoint source controls across a range of scales from field to farm, to watersheds, to regions. Further, model estimates showed targeting at multiple scales is necessary to optimize program efficiency. The integrated model approach described that selects and places BMPs at varying levels of implementation, provides a new theoretical basis and technical guidance for diffuse pollution management in agricultural watersheds.
NASA Astrophysics Data System (ADS)
Ander, Louise; Lark, Murray; Smedley, Pauline; Watts, Michael; Hamilton, Elliott; Fletcher, Tony; Crabbe, Helen; Close, Rebecca; Studden, Mike; Leonardi, Giovanni
2015-04-01
Random sampling design is optimal in order to be able to assess outcomes, such as the mean of a given variable across an area. However, this optimal sampling design may be compromised to an unknown extent by unavoidable real-world factors: the extent to which the study design can still be considered random, and the influence this may have on the choice of appropriate statistical data analysis is examined in this work. We take a study which relied on voluntary participation for the sampling of private water tap chemical composition in England, UK. This study was designed and implemented as a categorical, randomised study. The local geological classes were grouped into 10 types, which were considered to be most important in likely effects on groundwater chemistry (the source of all the tap waters sampled). Locations of the users of private water supplies were made available to the study group from the Local Authority in the area. These were then assigned, based on location, to geological groups 1 to 10 and randomised within each group. However, the permission to collect samples then required active, voluntary participation by householders and thus, unlike many environmental studies, could not always follow the initial sample design. Impediments to participation ranged from 'willing but not available' during the designated sampling period, to a lack of response to requests to sample (assumed to be wholly unwilling or unable to participate). Additionally, a small number of unplanned samples were collected via new participants making themselves known to the sampling teams, during the sampling period. Here we examine the impact this has on the 'random' nature of the resulting data distribution, by comparison with the non-participating known supplies. We consider the implications this has on choice of statistical analysis methods to predict values and uncertainty at un-sampled locations.
NASA Astrophysics Data System (ADS)
Fine, I.; Thomson, R.; Chadwick, W. W., Jr.; Davis, E. E.; Fox, C. G.
2016-12-01
We have used a set of high-resolution bottom pressure recorder (BPR) time series collected at Axial Seamount on the Juan de Fuca Ridge beginning in 1986 to examine tsunami waves of seismological origin in the northeast Pacific. These data are a combination of autonomous, internally-recording battery-powered instruments and cabled instruments on the OOI Cabled Array. Of the total of 120 tsunami events catalogued for the coasts of Japan, Alaska, western North America and Hawaii, we found evidence for 38 events in the Axial Seamount BPR records. Many of these tsunamis were not observed along the adjacent west coast of the USA and Canada because of the much higher noise level of coastal locations and the lack of digital tide gauge data prior to 2000. We have also identified several tsunamis of apparent seismological origin that were observed at coastal stations but not at the deep ocean site. Careful analysis of these observations suggests that they were likely of meteorological origin. Analysis of the pressure measurements from Axial Seamount, along with BPR measurements from a nearby ODP CORK (Ocean Drilling Program Circulation Obviation Retrofit Kit) borehole and DART (Deep-ocean Assessment and Reporting of Tsunamis) locations, reveals features of deep-ocean tsunamis that are markedly different from features observed at coastal locations. Results also show that the energy of deep-ocean tsunamis can differ significantly among the three sets of stations despite their close spatial spacing and that this difference is strongly dependent on the direction of the incoming tsunami waves. These deep-ocean observations provide the most comprehensive statistics possible for tsunamis in the Pacific Ocean over the past 30 years. New insight into the distribution of tsunami amplitudes and wave energy derived from the deep-ocean sites should prove useful for long-term tsunami prediction and mitigation for coastal communities along the west coast of the USA and Canada.
STATISTICAL ANALYSIS OF TANK 18F FLOOR SAMPLE RESULTS
DOE Office of Scientific and Technical Information (OSTI.GOV)
Harris, S.
2010-09-02
Representative sampling has been completed for characterization of the residual material on the floor of Tank 18F as per the statistical sampling plan developed by Shine [1]. Samples from eight locations have been obtained from the tank floor and two of the samples were archived as a contingency. Six samples, referred to in this report as the current scrape samples, have been submitted to and analyzed by SRNL [2]. This report contains the statistical analysis of the floor sample analytical results to determine if further data are needed to reduce uncertainty. Included are comparisons with the prior Mantis samples resultsmore » [3] to determine if they can be pooled with the current scrape samples to estimate the upper 95% confidence limits (UCL{sub 95%}) for concentration. Statistical analysis revealed that the Mantis and current scrape sample results are not compatible. Therefore, the Mantis sample results were not used to support the quantification of analytes in the residual material. Significant spatial variability among the current sample results was not found. Constituent concentrations were similar between the North and South hemispheres as well as between the inner and outer regions of the tank floor. The current scrape sample results from all six samples fall within their 3-sigma limits. In view of the results from numerous statistical tests, the data were pooled from all six current scrape samples. As such, an adequate sample size was provided for quantification of the residual material on the floor of Tank 18F. The uncertainty is quantified in this report by an upper 95% confidence limit (UCL{sub 95%}) on each analyte concentration. The uncertainty in analyte concentration was calculated as a function of the number of samples, the average, and the standard deviation of the analytical results. The UCL{sub 95%} was based entirely on the six current scrape sample results (each averaged across three analytical determinations).« less
STATISTICAL ANALYSIS OF TANK 19F FLOOR SAMPLE RESULTS
DOE Office of Scientific and Technical Information (OSTI.GOV)
Harris, S.
2010-09-02
Representative sampling has been completed for characterization of the residual material on the floor of Tank 19F as per the statistical sampling plan developed by Harris and Shine. Samples from eight locations have been obtained from the tank floor and two of the samples were archived as a contingency. Six samples, referred to in this report as the current scrape samples, have been submitted to and analyzed by SRNL. This report contains the statistical analysis of the floor sample analytical results to determine if further data are needed to reduce uncertainty. Included are comparisons with the prior Mantis samples resultsmore » to determine if they can be pooled with the current scrape samples to estimate the upper 95% confidence limits (UCL95%) for concentration. Statistical analysis revealed that the Mantis and current scrape sample results are not compatible. Therefore, the Mantis sample results were not used to support the quantification of analytes in the residual material. Significant spatial variability among the current scrape sample results was not found. Constituent concentrations were similar between the North and South hemispheres as well as between the inner and outer regions of the tank floor. The current scrape sample results from all six samples fall within their 3-sigma limits. In view of the results from numerous statistical tests, the data were pooled from all six current scrape samples. As such, an adequate sample size was provided for quantification of the residual material on the floor of Tank 19F. The uncertainty is quantified in this report by an UCL95% on each analyte concentration. The uncertainty in analyte concentration was calculated as a function of the number of samples, the average, and the standard deviation of the analytical results. The UCL95% was based entirely on the six current scrape sample results (each averaged across three analytical determinations).« less
NASA Astrophysics Data System (ADS)
Bierstedt, Svenja E.; Hünicke, Birgit; Zorita, Eduardo; Ludwig, Juliane
2017-07-01
We statistically analyse the relationship between the structure of migrating dunes in the southern Baltic and the driving wind conditions over the past 26 years, with the long-term aim of using migrating dunes as a proxy for past wind conditions at an interannual resolution. The present analysis is based on the dune record derived from geo-radar measurements by Ludwig et al. (2017). The dune system is located at the Baltic Sea coast of Poland and is migrating from west to east along the coast. The dunes present layers with different thicknesses that can be assigned to absolute dates at interannual timescales and put in relation to seasonal wind conditions. To statistically analyse this record and calibrate it as a wind proxy, we used a gridded regional meteorological reanalysis data set (coastDat2) covering recent decades. The identified link between the dune annual layers and wind conditions was additionally supported by the co-variability between dune layers and observed sea level variations in the southern Baltic Sea. We include precipitation and temperature into our analysis, in addition to wind, to learn more about the dependency between these three atmospheric factors and their common influence on the dune system. We set up a statistical linear model based on the correlation between the frequency of days with specific wind conditions in a given season and dune migration velocities derived for that season. To some extent, the dune records can be seen as analogous to tree-ring width records, and hence we use a proxy validation method usually applied in dendrochronology, cross-validation with the leave-one-out method, when the observational record is short. The revealed correlations between the wind record from the reanalysis and the wind record derived from the dune structure is in the range between 0.28 and 0.63, yielding similar statistical validation skill as dendroclimatological records.
Geographic Hotspots of Critical National Infrastructure.
Thacker, Scott; Barr, Stuart; Pant, Raghav; Hall, Jim W; Alderson, David
2017-12-01
Failure of critical national infrastructures can result in major disruptions to society and the economy. Understanding the criticality of individual assets and the geographic areas in which they are located is essential for targeting investments to reduce risks and enhance system resilience. Within this study we provide new insights into the criticality of real-life critical infrastructure networks by integrating high-resolution data on infrastructure location, connectivity, interdependence, and usage. We propose a metric of infrastructure criticality in terms of the number of users who may be directly or indirectly disrupted by the failure of physically interdependent infrastructures. Kernel density estimation is used to integrate spatially discrete criticality values associated with individual infrastructure assets, producing a continuous surface from which statistically significant infrastructure criticality hotspots are identified. We develop a comprehensive and unique national-scale demonstration for England and Wales that utilizes previously unavailable data from the energy, transport, water, waste, and digital communications sectors. The testing of 200,000 failure scenarios identifies that hotspots are typically located around the periphery of urban areas where there are large facilities upon which many users depend or where several critical infrastructures are concentrated in one location. © 2017 Society for Risk Analysis.
Generalisability in economic evaluation studies in healthcare: a review and case studies.
Sculpher, M J; Pang, F S; Manca, A; Drummond, M F; Golder, S; Urdahl, H; Davies, L M; Eastwood, A
2004-12-01
To review, and to develop further, the methods used to assess and to increase the generalisability of economic evaluation studies. Electronic databases. Methodological studies relating to economic evaluation in healthcare were searched. This included electronic searches of a range of databases, including PREMEDLINE, MEDLINE, EMBASE and EconLit, and manual searches of key journals. The case studies of a decision analytic model involved highlighting specific features of previously published economic studies related to generalisability and location-related variability. The case-study involving the secondary analysis of cost-effectiveness analyses was based on the secondary analysis of three economic studies using data from randomised trials. The factor most frequently cited as generating variability in economic results between locations was the unit costs associated with particular resources. In the context of studies based on the analysis of patient-level data, regression analysis has been advocated as a means of looking at variability in economic results across locations. These methods have generally accepted that some components of resource use and outcomes are exchangeable across locations. Recent studies have also explored, in cost-effectiveness analysis, the use of tests of heterogeneity similar to those used in clinical evaluation in trials. The decision analytic model has been the main means by which cost-effectiveness has been adapted from trial to non-trial locations. Most models have focused on changes to the cost side of the analysis, but it is clear that the effectiveness side may also need to be adapted between locations. There have been weaknesses in some aspects of the reporting in applied cost-effectiveness studies. These may limit decision-makers' ability to judge the relevance of a study to their specific situations. The case study demonstrated the potential value of multilevel modelling (MLM). Where clustering exists by location (e.g. centre or country), MLM can facilitate correct estimates of the uncertainty in cost-effectiveness results, and also a means of estimating location-specific cost-effectiveness. The review of applied economic studies based on decision analytic models showed that few studies were explicit about their target decision-maker(s)/jurisdictions. The studies in the review generally made more effort to ensure that their cost inputs were specific to their target jurisdiction than their effectiveness parameters. Standard sensitivity analysis was the main way of dealing with uncertainty in the models, although few studies looked explicitly at variability between locations. The modelling case study illustrated how effectiveness and cost data can be made location-specific. In particular, on the effectiveness side, the example showed the separation of location-specific baseline events and pooled estimates of relative treatment effect, where the latter are assumed exchangeable across locations. A large number of factors are mentioned in the literature that might be expected to generate variation in the cost-effectiveness of healthcare interventions across locations. Several papers have demonstrated differences in the volume and cost of resource use between locations, but few studies have looked at variability in outcomes. In applied trial-based cost-effectiveness studies, few studies provide sufficient evidence for decision-makers to establish the relevance or to adjust the results of the study to their location of interest. Very few studies utilised statistical methods formally to assess the variability in results between locations. In applied economic studies based on decision models, most studies either stated their target decision-maker/jurisdiction or provided sufficient information from which this could be inferred. There was a greater tendency to ensure that cost inputs were specific to the target jurisdiction than clinical parameters. Methods to assess generalisability and variability in economic evaluation studies have been discussed extensively in the literature relating to both trial-based and modelling studies. Regression-based methods are likely to offer a systematic approach to quantifying variability in patient-level data. In particular, MLM has the potential to facilitate estimates of cost-effectiveness, which both reflect the variation in costs and outcomes between locations and also enable the consistency of cost-effectiveness estimates between locations to be assessed directly. Decision analytic models will retain an important role in adapting the results of cost-effectiveness studies between locations. Recommendations for further research include: the development of methods of evidence synthesis which model the exchangeability of data across locations and allow for the additional uncertainty in this process; assessment of alternative approaches to specifying multilevel models to the analysis of cost-effectiveness data alongside multilocation randomised trials; identification of a range of appropriate covariates relating to locations (e.g. hospitals) in multilevel models; and further assessment of the role of econometric methods (e.g. selection models) for cost-effectiveness analysis alongside observational datasets, and to increase the generalisability of randomised trials.
Catalog of earthquake hypocenters at Alaskan volcanoes: January 1 through December 31, 2003
Dixon, James P.; Stihler, Scott D.; Power, John A.; Tytgat, Guy; Moran, Seth C.; Sanchez, John J.; McNutt, Stephen R.; Estes, Steve; Paskievitch, John
2004-01-01
The Alaska Volcano Observatory (AVO), a cooperative program of the U.S. Geological Survey, the Geophysical Institute of the University of Alaska Fairbanks, and the Alaska Division of Geological and Geophysical Surveys, has maintained seismic monitoring networks at historically active volcanoes in Alaska since 1988. The primary objectives of this program are the near real time seismic monitoring of active, potentially hazardous, Alaskan volcanoes and the investigation of seismic processes associated with active volcanism. This catalog presents the calculated earthquake hypocenter and phase arrival data, and changes in the seismic monitoring program for the period January 1 through December 31, 2003.The AVO seismograph network was used to monitor the seismic activity at twenty-seven volcanoes within Alaska in 2003. These include Mount Wrangell, Mount Spurr, Redoubt Volcano, Iliamna Volcano, Augustine Volcano, Katmai volcanic cluster (Snowy Mountain, Mount Griggs, Mount Katmai, Novarupta, Trident Volcano, Mount Mageik, Mount Martin), Aniakchak Crater, Mount Veniaminof, Pavlof Volcano, Mount Dutton, Isanotski Peaks, Shishaldin Volcano, Fisher Caldera, Westdahl Peak, Akutan Peak, Makushin Volcano, Okmok Caldera, Great Sitkin Volcano, Kanaga Volcano, Tanaga Volcano, and Mount Gareloi. Monitoring highlights in 2003 include: continuing elevated seismicity at Mount Veniaminof in January-April (volcanic unrest began in August 2002), volcanogenic seismic swarms at Shishaldin Volcano throughout the year, and low-level tremor at Okmok Caldera throughout the year. Instrumentation and data acquisition highlights in 2003 were the installation of subnetworks on Tanaga and Gareloi Islands, the installation of broadband installations on Akutan Volcano and Okmok Caldera, and the establishment of telemetry for the Okmok Caldera subnetwork. AVO located 3911 earthquakes in 2003.This catalog includes: (1) a description of instruments deployed in the field and their locations; (2) a description of earthquake detection, recording, analysis, and data archival systems; (3) a description of velocity models used for earthquake locations; (4) a summary of earthquakes located in 2003; and (5) an accompanying UNIX tar-file with a summary of earthquake origin times, hypocenters, magnitudes, phase arrival times, and location quality statistics; daily station usage statistics; and all HYPOELLIPSE files used to determine the earthquake locations in 2003.
Sioux City Riverbank Filtration Study
NASA Astrophysics Data System (ADS)
Mach, R.; Condon, J.; Johnson, J.
2003-04-01
The City of Sioux City (City) obtains a large percentage of their drinking water supply from both a horizontal collector well system and vertical wells located adjacent to the Missouri River. These wells are set in either the Missouri Alluvium or the Dakota Sandstone aquifer. Several of the collector well laterals extend out beneath the Missouri River, with the laterals being over twenty feet below the river channel bottom. Due to concerns regarding ground water under direct surface water influence, the Iowa Department of Natural Resources (IDNR) required the City to expand their water treatment process to deal with potential surface water contaminant issues. With the extensive cost of these plant upgrades, the City and Olsson Associates (OA) approached the IDNR requesting approval for assessing the degree of natural riverbank filtration for water treatment. If this natural process could be ascertained, the level of treatment from the plant could be reduced. The objective of this study was to quantify the degree of surface water (i.e. Missouri River) filtration due to the underlying Missouri River sediments. Several series of microscopic particulate analysis where conducted, along with tracking of turbidity, temperature, bacteria and a full scale particle count study. Six particle sizes from six sampling points were assessed over a nine-month period that spanned summer, fall and spring weather periods. The project was set up in two phases and utilized industry accepted statistical analyses to identify particle data trends. The first phase consisted of twice daily sample collection from the Missouri River and the collector well system for a one-month period. Statistical analysis of the data indicated reducing the sampling frequency and sampling locations would yield justifiable data while significantly reducing sampling and analysis costs. The IDNR approved this modification, and phase II included sampling and analysis under this reduced plant for an eight-month period. Final statistical analyses of the nine months of data indicate up to a four-log particle reduction occurs through river bank filtration. Consequently, Missouri River sediments within the City's well field are very effective in water filtration. This information was submitted to the IDNR for review and approval. Subsequently, the IDNR approved 4.0 log removal for Giardia and 3.5 log removal for Cryptosporidium through the riverbank and treatment plant. The City and IDNR have agreed on subrogate parameters for monitoring purposes.
ERIC Educational Resources Information Center
Ministerio de Educacion Nacional, Bogota (Colombia). Instituto Colombiano de Pedagogia.
This document provides statistical data on the distribution and education of teaching personnel working the elementary schools of Cordoba, Colombia, between 1958 and 1967. The statistics cover the number of men and women, public and private schools, urban and rural location, and the amount of education of the teachers. For overall statistics in…
ERIC Educational Resources Information Center
Ministerio de Educacion Nacional, Bogota (Colombia). Instituto Colombiano de Pedagogia.
This document provides statistical data on the distribution and education of teaching personnel working in the elementary schools of Narino, Colombia, between 1958 and 1967. The statistics cover the number of men and women, public and private schools, urban and rural location, and the amount of education of the teachers. For overall statistics in…
ERIC Educational Resources Information Center
Ministerio de Educacion Nacional, Bogota (Colombia). Instituto Colombiano de Pedagogia.
This document provides statistical data on the distribution and education of teaching personnel working in the elementary schools of Cauca, Colombia, between 1958 and 1967. The statistics cover the number of men and women, public and private schools, urban and rural location, and the amount of education of the teachers. For overall statistics in…
ERIC Educational Resources Information Center
Ministerio de Educacion Nacional, Bogota (Colombia). Instituto Colombiano de Pedagogia.
This document provides statistical data on the distribution and education of teaching personnel working in the elementary schools of Caldas, Colombia, between 1958 and 1967. The statistics cover the number of men and women, public and private schools, urban and rural location, and the amount of education of the teachers. For overall statistics in…
ERIC Educational Resources Information Center
Ministerio de Educacion Nacional, Bogota (Colombia). Instituto Colombiano de Pedagogia.
This document provides statistical data on the distribution and education of teaching personnel working in the elementary schools of Boyaca, Colombia, between 1958 and 1967. The statistics cover the number of men and women, public and private schools, urban and rural location, and the amount of education of the teachers. For overall statistics in…
ERIC Educational Resources Information Center
Ministerio de Educacion Nacional, Bogota (Colombia). Instituto Colombiano de Pedagogia.
This document provides statistical data on the distribution and education of teaching personnel working in the elementary schools of Huila, Colombia, between 1958 and 1967. The statistics cover the number of men and women, public and private schools, urban and rural location, and the amount of education of the teachers. For overall statistics in…
Pascual Huerta, Javier; Alarcón García, Juan María
2007-06-01
The study was aimed to investigate plantar fascia thickness at different locations in healthy asymptomatic subjects and its relationship to the following variables: weight, height, sex and age. The study evaluates 96 feet of healthy asymptomatic volunteers. The plantar fascia thickness was measured at four different locations: 1cm proximal to the insertion of the plantar fascia, at the insertion of the plantar fascia on the calcaneus and separate out 1 cm + 2 cm distal to the insertion. A 10 MHz linear-array transducer was used. There were statistically significant differences in plantar fascia thickness at the four different locations (p<0.001) although no differences in PF thickness were found between the two distal from insertion locations (1 and 2 cm). Multiple regression analysis showed sex as independent predictor of plantar fascia thickness at 1cm proximal to the insertion. At origin and 1cm distal to insertion weight was an independent predictor of plantar fascia thickness. There are differences of thickness at different locations of plantar fascia measured by ultrasonography. Thickness at 1cm proximal to the insertion is influenced by sex and thickness at origin and at 1cm distal to the insertion has a direct relationship with body weight. This could be attributed to the overloading effect that weight has on plantar fascia in healthy symptomatic subjects at these two locations. Height and age did not seem to influence as independent variables in plantar fascia thickness among non-painful subjects.
DOT National Transportation Integrated Search
1997-05-01
The Directory was created to assist transportation data users, policy makers, planners, researchers, information specialists and others in locating statistical contacts and transportation profiles in different countries. It lists, by continent, 1,925...
Public health information and statistics dissemination efforts for Indonesia on the Internet.
Hanani, Febiana; Kobayashi, Takashi; Jo, Eitetsu; Nakajima, Sawako; Oyama, Hiroshi
2011-01-01
To elucidate current issues related to health statistics dissemination efforts on the Internet in Indonesia and to propose a new dissemination website as a solution. A cross-sectional survey was conducted. Sources of statistics were identified using link relationship and Google™ search. Menu used to locate statistics, mode of presentation and means of access to statistics, and available statistics were assessed for each site. Assessment results were used to derive design specification; a prototype system was developed and evaluated with usability test. 49 sources were identified on 18 governmental, 8 international and 5 non-government websites. Of 49 menus identified, 33% used non-intuitive titles and lead to inefficient search. 69% of them were on government websites. Of 31 websites, only 39% and 23% used graph/chart and map for presentation. Further, only 32%, 39% and 19% provided query, export and print feature. While >50% sources reported morbidity, risk factor and service provision statistics, <40% sources reported health resource and mortality statistics. Statistics portal website was developed using Joomla!™ content management system. Usability test demonstrated its potential to improve data accessibility. In this study, government's efforts to disseminate statistics in Indonesia are supported by non-governmental and international organizations and existing their information may not be very useful because it is: a) not widely distributed, b) difficult to locate, and c) not effectively communicated. Actions are needed to ensure information usability, and one of such actions is the development of statistics portal website.
Site comparison for optical visibility statistics in southern California
NASA Technical Reports Server (NTRS)
Cowles, K.
1991-01-01
Negotiations are under way to locate an atmospheric visibility monitoring (AVM) observatory at Mount Lemmon, just north of Tucson, Arizona. Two more observatories will be located in the southwestern U.S. The observatories are being employed to improve a weather model for deep-space-to-ground optical communications. This article explains the factors considered in choosing a location and recommends Table Mountain Observatory as the location for another AVM facility.
Arora, Sukeshi Patel; Ketchum, Norma S; Michalek, Joel; Gelfond, Jonathon; Mahalingam, Devalingam
2017-04-22
Location of the primary tumor is prognostic and predictive of efficacy with VEGF-inhibitors (I) versus EGFR-I given first-line to metastatic colorectal cancer (mCRC) patients. However, little is known regarding the effect of location on prognosis and prediction in refractory mCRC. We assessed the efficacy of VEGF-I and EGFR-I in regards to location of the primary tumor in patients with refractory mCRC enrolled in early phase studies. A historical cohort analysis of mCRC patients, including 44 phase I trials our institution, from March 2004 to September 2012. Median Progression free survival (mPFS) and overall survival (mOS) were estimated from Kaplan-Meier curves and groups were statistically compared with the log-rank test. One hundred thirty-nine patients with a median age 59 (33-81). 73.9% received 3+ lines of therapy. All KRAS wild-type patients had received prior EGFR-I. right 20.9%, left 61.9%, and transverse 4.3%. For survival analysis, transverse CRC were included with right. Of the 112 patients, mOS was left (N = 80) 6.6 months versus right (N = 32) 5.9 months, P = 0.18. mPFS was left (n = 86) 2.0 months versus right (N = 35) 2.0 months, P = 0.76. In subgroup analysis, survival was significant for KRAS wild-type patients with left-sided mCRC had mOS of 6.2 months with other agents versus 9.4 months with EGFR-I (P = 0.03). In phase 1 clinical trials, although location alone was not prognostic in heavily pretreated patients, left-sided mCRC had improved survival with EGFR-I. Despite progression on EGFR-I, left-sided KRAS wild mCRC patients should be considered for phase 1 studies of agents targeting growth factor pathways.
ERIC Educational Resources Information Center
Ministerio de Educacion Nacional, Bogota (Colombia). Instituto Colombiano de Pedagogia.
This document provides statistical data on the distribution and education of teacher personnel working in Colombian elementary schools between 1940 and 1968. The statistics cover the number of men and women, public and private schools, urban and rural location, and the amount of education of teachers. (VM)
Do Introductory Statistics Courses in the United States Improve Students' Attitudes?
ERIC Educational Resources Information Center
Schau, Candace; Emmioglu, Esma
2012-01-01
We examined the attitudes of about 2200 students enrolled in 101 sections of post-secondary introductory statistics service courses located across the United States. Using the "Survey of Attitudes Toward Statistics-36," we assessed students' attitudes when they entered and left their courses, as well as changes in attitudes across their courses.…
Mapping probabilities of extreme continental water storage changes from space gravimetry
NASA Astrophysics Data System (ADS)
Kusche, J.; Eicker, A.; Forootan, E.; Springer, A.; Longuevergne, L.
2016-12-01
Using data from the Gravity Recovery and Climate Experiment (GRACE) mission, we derive statistically robust 'hotspot' regions of high probability of peak anomalous - i.e. with respect to the seasonal cycle - water storage (of up to 0.7 m one-in-five-year return level) and flux (up to 0.14 m/mon). Analysis of, and comparison with, up to 32 years of ERA-Interim reanalysis fields reveals generally good agreement of these hotspot regions to GRACE results, and that most exceptions are located in the Tropics. However, a simulation experiment reveals that differences observed by GRACE are statistically significant, and further error analysis suggests that by around the year 2020 it will be possible to detect temporal changes in the frequency of extreme total fluxes (i.e. combined effects of mainly precipitation and floods) for at least 10-20% of the continental area, assuming that we have a continuation of GRACE by its follow-up GRACE-FO. J. Kusche et al. (2016): Mapping probabilities of extreme continental water storage changes from space gravimetry, Geophysical Research Letters, accepted online, doi:10.1002/2016GL069538
Mena, Carlos; Sepúlveda, Cesar; Fuentes, Eduardo; Ormazábal, Yony; Palomo, Iván
2018-05-07
Cardiovascular diseases (CVDs) are the primary cause of death and disability in de world, and the detection of populations at risk as well as localization of vulnerable areas is essential for adequate epidemiological management. Techniques developed for spatial analysis, among them geographical information systems and spatial statistics, such as cluster detection and spatial correlation, are useful for the study of the distribution of the CVDs. These techniques, enabling recognition of events at different geographical levels of study (e.g., rural, deprived neighbourhoods, etc.), make it possible to relate CVDs to factors present in the immediate environment. The systemic literature presented here shows that this group of diseases is clustered with regard to incidence, mortality and hospitalization as well as obesity, smoking, increased glycated haemoglobin levels, hypertension physical activity and age. In addition, acquired variables such as income, residency (rural or urban) and education, contribute to CVD clustering. Both local cluster detection and spatial regression techniques give statistical weight to the findings providing valuable information that can influence response mechanisms in the health services by indicating locations in need of intervention and assignment of available resources.
Rapid Exploitation and Analysis of Documents
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buttler, D J; Andrzejewski, D; Stevens, K D
Analysts are overwhelmed with information. They have large archives of historical data, both structured and unstructured, and continuous streams of relevant messages and documents that they need to match to current tasks, digest, and incorporate into their analysis. The purpose of the READ project is to develop technologies to make it easier to catalog, classify, and locate relevant information. We approached this task from multiple angles. First, we tackle the issue of processing large quantities of information in reasonable time. Second, we provide mechanisms that allow users to customize their queries based on latent topics exposed from corpus statistics. Third,more » we assist users in organizing query results, adding localized expert structure over results. Forth, we use word sense disambiguation techniques to increase the precision of matching user generated keyword lists with terms and concepts in the corpus. Fifth, we enhance co-occurrence statistics with latent topic attribution, to aid entity relationship discovery. Finally we quantitatively analyze the quality of three popular latent modeling techniques to examine under which circumstances each is useful.« less
The role of climatic variables in winter cereal yields: a retrospective analysis.
Luo, Qunying; Wen, Li
2015-02-01
This study examined the effects of observed climate including [CO2] on winter cereal [winter wheat (Triticum aestivum), barley (Hordeum vulgare) and oat (Avena sativa)] yields by adopting robust statistical analysis/modelling approaches (i.e. autoregressive fractionally integrated moving average, generalised addition model) based on long time series of historical climate data and cereal yield data at three locations (Moree, Dubbo and Wagga Wagga) in New South Wales, Australia. Research results show that (1) growing season rainfall was significantly, positively and non-linearly correlated with crop yield at all locations considered; (2) [CO2] was significantly, positively and non-linearly correlated with crop yields in all cases except wheat and barley yields at Wagga Wagga; (3) growing season maximum temperature was significantly, negatively and non-linearly correlated with crop yields at Dubbo and Moree (except for barley); and (4) radiation was only significantly correlated with oat yield at Wagga Wagga. This information will help to identify appropriate management adaptation options in dealing with the risk and in taking the opportunities of climate change.
NASA Astrophysics Data System (ADS)
Chakraborty, Jayajit; Green, Donna
2014-04-01
This study presents the first national level quantitative environmental justice assessment of industrial air pollution in Australia. Specifically, our analysis links the spatial distribution of sites and emissions associated with industrial pollution sources derived from the National Pollution Inventory, to Indigenous status and social disadvantage characteristics of communities derived from Australian Bureau of Statistics indicators. Our results reveal a clear national pattern of environmental injustice based on the locations of industrial pollution sources, as well as volume, and toxicity of air pollution released at these locations. Communities with the highest number of polluting sites, emission volume, and toxicity-weighted air emissions indicate significantly greater proportions of Indigenous population and higher levels of socio-economic disadvantage. The quantities and toxicities of industrial air pollution are particularly higher in communities with the lowest levels of educational attainment and occupational status. These findings emphasize the need for more detailed analysis in specific regions and communities where socially disadvantaged groups are disproportionately impacted by industrial air pollution. Our empirical findings also underscore the growing necessity to incorporate environmental justice considerations in environmental planning and policy-making in Australia.
Kumi-Kyereme, Akwasi; Amo-Adjei, Joshua
2013-06-17
This study compares ownership of health insurance among Ghanaian women with respect to wealth status and spatial location. We explore the overarching research question by employing geographic and proxy means targeting through interactive analysis of wealth status and spatial issues. The paper draws on the 2008 Ghana Demographic and Health Survey. Bivariate descriptive analysis coupled with binary logistic regression estimation technique was used to analyse the data. By wealth status, the likelihood of purchasing insurance was significantly higher among respondents from the middle, richer and richest households compared to the poorest (reference category) and these differences widened more profoundly in the Northern areas after interacting wealth with zone of residence. Among women at the bottom of household wealth (poorest and poorer), there were no statistically significant differences in insurance subscription in all the areas. The results underscore the relevance of geographic and proxy means targeting in identifying populations who may be need of special interventions as part of the efforts to increase enrolment as well as means of social protection against the vulnerable.
NASA Astrophysics Data System (ADS)
Wang, Jinman; Wang, Hongdan; Cao, Yingui; Bai, Zhongke; Qin, Qian
2016-02-01
Vegetation plays an important role in improving and restoring fragile ecological environments. In the Antaibao opencast coal mine, located in a loess area, the eco-environment has been substantially disturbed by mining activities, and the relationship between the vegetation and environmental factors is not very clear. Therefore, it is crucial to understand the effects of soil and topographic factors on vegetation restoration to improve the fragile ecosystems of damaged land. An investigation of the soil, topography and vegetation in 50 reclamation sample plots in Shanxi Pingshuo Antaibao opencast coal mine dumps was performed. Statistical analyses in this study included one-way ANOVA and significance testing using SPSS 20.0, and multivariate techniques of detrended correspondence analysis (DCA) and redundancy analysis (RDA) using CANOCO 4.5. The RDA revealed the environmental factors that affected vegetation restoration. Various vegetation and soil variables were significantly correlated. The available K and rock content were good explanatory variables, and they were positively correlated with tree volume. The effects of the soil factors on vegetation restoration were higher than those of the topographic factors.
Wang, Jinman; Wang, Hongdan; Cao, Yingui; Bai, Zhongke; Qin, Qian
2016-01-01
Vegetation plays an important role in improving and restoring fragile ecological environments. In the Antaibao opencast coal mine, located in a loess area, the eco-environment has been substantially disturbed by mining activities, and the relationship between the vegetation and environmental factors is not very clear. Therefore, it is crucial to understand the effects of soil and topographic factors on vegetation restoration to improve the fragile ecosystems of damaged land. An investigation of the soil, topography and vegetation in 50 reclamation sample plots in Shanxi Pingshuo Antaibao opencast coal mine dumps was performed. Statistical analyses in this study included one-way ANOVA and significance testing using SPSS 20.0, and multivariate techniques of detrended correspondence analysis (DCA) and redundancy analysis (RDA) using CANOCO 4.5. The RDA revealed the environmental factors that affected vegetation restoration. Various vegetation and soil variables were significantly correlated. The available K and rock content were good explanatory variables, and they were positively correlated with tree volume. The effects of the soil factors on vegetation restoration were higher than those of the topographic factors. PMID:26916152
Wang, Jinman; Wang, Hongdan; Cao, Yingui; Bai, Zhongke; Qin, Qian
2016-02-26
Vegetation plays an important role in improving and restoring fragile ecological environments. In the Antaibao opencast coal mine, located in a loess area, the eco-environment has been substantially disturbed by mining activities, and the relationship between the vegetation and environmental factors is not very clear. Therefore, it is crucial to understand the effects of soil and topographic factors on vegetation restoration to improve the fragile ecosystems of damaged land. An investigation of the soil, topography and vegetation in 50 reclamation sample plots in Shanxi Pingshuo Antaibao opencast coal mine dumps was performed. Statistical analyses in this study included one-way ANOVA and significance testing using SPSS 20.0, and multivariate techniques of detrended correspondence analysis (DCA) and redundancy analysis (RDA) using CANOCO 4.5. The RDA revealed the environmental factors that affected vegetation restoration. Various vegetation and soil variables were significantly correlated. The available K and rock content were good explanatory variables, and they were positively correlated with tree volume. The effects of the soil factors on vegetation restoration were higher than those of the topographic factors.
2013-01-01
Background This study compares ownership of health insurance among Ghanaian women with respect to wealth status and spatial location. We explore the overarching research question by employing geographic and proxy means targeting through interactive analysis of wealth status and spatial issues. Methods The paper draws on the 2008 Ghana Demographic and Health Survey. Bivariate descriptive analysis coupled with binary logistic regression estimation technique was used to analyse the data. Results By wealth status, the likelihood of purchasing insurance was significantly higher among respondents from the middle, richer and richest households compared to the poorest (reference category) and these differences widened more profoundly in the Northern areas after interacting wealth with zone of residence. Among women at the bottom of household wealth (poorest and poorer), there were no statistically significant differences in insurance subscription in all the areas. Conclusions The results underscore the relevance of geographic and proxy means targeting in identifying populations who may be need of special interventions as part of the efforts to increase enrolment as well as means of social protection against the vulnerable. PMID:23768255
Spelman, Tim; Gray, Orla; Lucas, Robyn; Butzkueven, Helmut
2015-12-09
This report describes a novel Stata-based application of trigonometric regression modelling to 55 years of multiple sclerosis relapse data from 46 clinical centers across 20 countries located in both hemispheres. Central to the success of this method was the strategic use of plot analysis to guide and corroborate the statistical regression modelling. Initial plot analysis was necessary for establishing realistic hypotheses regarding the presence and structural form of seasonal and latitudinal influences on relapse probability and then testing the performance of the resultant models. Trigonometric regression was then necessary to quantify these relationships, adjust for important confounders and provide a measure of certainty as to how plausible these associations were. Synchronization of graphing techniques with regression modelling permitted a systematic refinement of models until best-fit convergence was achieved, enabling novel inferences to be made regarding the independent influence of both season and latitude in predicting relapse onset timing in MS. These methods have the potential for application across other complex disease and epidemiological phenomena suspected or known to vary systematically with season and/or geographic location.