期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Product partition latent variable model for multiple change-point detection in multivariate data

Gift Nyamundanda Avril Hegarty Kevin Hayes 《Journal of applied statistics》2015,42(11):2321-2334

The product partition model (PPM) is a well-established efficient statistical method for detecting multiple change points in time-evolving univariate data. In this article, we refine the PPM for the purpose of detecting multiple change points in correlated multivariate time-evolving data. Our model detects distributional changes in both the mean and covariance structures of multivariate Gaussian data by exploiting a smaller dimensional representation of correlated multiple time series. The utility of the proposed method is demonstrated through experiments on simulated and real datasets. 相似文献

2.

Bayesian emulation of complex multi-output and dynamic computer models 总被引：1，自引：0，他引：1

Stefano Conti Anthony O’Hagan 《Journal of statistical planning and inference》2010

Computer models are widely used in scientific research to study and predict the behaviour of complex systems. The run times of computer-intensive simulators are often such that it is impractical to make the thousands of model runs that are conventionally required for sensitivity analysis, uncertainty analysis or calibration. In response to this problem, highly efficient techniques have recently been developed based on a statistical meta-model (the emulator) that is built to approximate the computer model. The approach, however, is less straightforward for dynamic simulators, designed to represent time-evolving systems. Generalisations of the established methodology to allow for dynamic emulation are here proposed and contrasted. Advantages and difficulties are discussed and illustrated with an application to the Sheffield Dynamic Global Vegetation Model, developed within the UK Centre for Terrestrial Carbon Dynamics. 相似文献

3.

BAYESIAN BETA REGRESSION: APPLICATIONS TO HOUSEHOLD EXPENDITURE DATA AND GENETIC DISTANCE BETWEEN FOOT-AND-MOUTH DISEASE VIRUSES 总被引：2，自引：1，他引：1

Adam J. Branscum Wesley O. Johnson Mark C. Thurmond 《Australian & New Zealand Journal of Statistics》2007,49(3):287-301

There is considerable interest in understanding how factors such as time and geographic distance between isolates might influence the evolutionary direction of foot‐and‐mouth disease. Genetic differences between viruses can be measured as the proportion of nucleotides that differ for a given sequence or gene. We present a Bayesian hierarchical regression model for the statistical analysis of continuous data with sample space restricted to the interval (0, 1). The data are modelled using beta distributions with means that depend on covariates through a link function. We discuss methodology for: (i) the incorporation of informative prior information into an analysis; (ii) fitting the model using Markov chain Monte Carlo sampling; (iii) model selection using Bayes factors; and (iv) semiparametric beta regression using penalized splines. The model was applied to two different datasets. 相似文献

4.

Evaluation of different statistical methods using SAS software: an in silico approach for analysis of real-time PCR data

Mohammadreza Nassiri Mahdi Elahi Torshizi Mohammad Doosti 《Journal of applied statistics》2018,45(2):306-319

Real-time polymerase chain reaction (PCR) is reliable quantitative technique in gene expression studies. The statistical analysis of real-time PCR data is quite crucial for results analysis and explanation. The statistical procedures of analyzing real-time PCR data try to determine the slope of regression line and calculate the reaction efficiency. Applications of mathematical functions have been used to calculate the target gene relative to the reference gene(s). Moreover, these statistical techniques compare C_t (threshold cycle) numbers between control and treatments group. There are many different procedures in SAS for real-time PCR data evaluation. In this study, the efficiency of calibrated model and delta delta C_t model have been statistically tested and explained. Several methods were tested to compare control with treatment means of C_t. The methods tested included t-test (parametric test), Wilcoxon test (non-parametric test) and multiple regression. Results showed that applied methods led to similar results and no significant difference was observed between results of gene expression measurement by the relative method. 相似文献

5.

Classification of multivariate non-stationary signals: The SLEX-shrinkage approach

Hilmar Böhm Hernando Ombao Rainer von Sachs Jerome Sanes 《Journal of statistical planning and inference》2010

相似文献

6.

Some Remarks on Failure-times,Surrogate Markers,Degradation, Wear,and the Quality of Life

Cox DR 《Lifetime data analysis》1999,5(4):307-314

Suppose that data are available on failure-times and on a time-evolving process on each individual, called in some contexts wear or degradation. Four rather different types of relation between such a process and failure are described and brief comments on appropriate analysis sketched. 相似文献

7.

A statistical approach to address the problem of heaping in self-reported income data

S. Zinn A. Würbach 《Journal of applied statistics》2016,43(4):682-703

Self-reported income information particularly suffers from an intentional coarsening of the data, which is called heaping or rounding. If it does not occur completely at random – which is usually the case – heaping and rounding have detrimental effects on the results of statistical analysis. Conventional statistical methods do not consider this kind of reporting bias, and thus might produce invalid inference. We describe a novel statistical modeling approach that allows us to deal with self-reported heaped income data in an adequate and flexible way. We suggest modeling heaping mechanisms and the true underlying model in combination. To describe the true net income distribution, we use the zero-inflated log-normal distribution. Heaping points are identified from the data by applying a heuristic procedure comparing a hypothetical income distribution and the empirical one. To determine heaping behavior, we employ two distinct models: either we assume piecewise constant heaping probabilities, or heaping probabilities are considered to increase steadily with proximity to a heaping point. We validate our approach by some examples. To illustrate the capacity of the proposed method, we conduct a case study using income data from the German National Educational Panel Study. 相似文献

8.

Estimation and Classification of BOLD Responses Over Multiple Trials

Kapur K Roy A Bhaumik DK Gibbons RD Lazar NA Sweeney JA Aryal S Patterson D 《统计学通讯:理论与方法》2009,38(16-17):3099-3113

In this article, we model functional magnetic resonance imaging (fMRI) data for event-related experiment data using a fourth degree spline to fit voxel specific blood oxygenation level-dependent (BOLD) responses. The data are preprocessed for removing long term temporal components such as drifts using wavelet approximations. The spatial dependence is incorporated in the data by the application of 3D Gaussian spatial filter. The methodology assigns an activation score to each trial based on the voxel specific characteristics of the response curve. The proposed procedure has a capability of being fully automated and it produces activation images based on overall scores assigned to each voxel. The methodology is illustrated on real data from an event-related design experiment of visually guided saccades (VGS). 相似文献

9.

A generalization of the uniform association model for assessing rater agreement in ordinal scales

Alireza Akbarzadeh Bagheban Farid Zayeri 《Journal of applied statistics》2010,37(8):1265-1273

相似文献

10.

Structured Spatio-Temporal Shot-Noise Cox Point Process Models, with a View to Modelling Forest Fires 总被引：1，自引：0，他引：1

JESPER MØLLER CARLOS DÍAZ-AVALOS 《Scandinavian Journal of Statistics》2010,37(1):2-25

Abstract. Spatio-temporal Cox point process models with a multiplicative structure for the driving random intensity, incorporating covariate information into temporal and spatial components, and with a residual term modelled by a shot-noise process, are considered. Such models are flexible and tractable for statistical analysis, using spatio-temporal versions of intensity and inhomogeneous K -functions, quick estimation procedures based on composite likelihoods and minimum contrast estimation, and easy simulation techniques. These advantages are demonstrated in connection with the analysis of a relatively large data set consisting of 2796 days and 5834 spatial locations of fires. The model is compared with a spatio-temporal log-Gaussian Cox point process model, and likelihood-based methods are discussed to some extent. 相似文献

11.

The Planning Stage in Statistical Reasoning

Norman L. Chervany P. George Benson Raja K. Iyer 《The American statistician》2013,67(4):222-226

An important step in the statistical problem-solving process is the selection of the appropriate statistical procedure for the real-world situation under analysis. A decision-tree term project has been found to be an effective teaching device to help MBA students understand this step. The project requires the students to construct a decision-tree structure, which, through a series of questions and responses, will lead from the statement of a statistical question to the appropriate sampling distribution to use in addressing the question. 相似文献

12.

Estimating DEA confidence intervals with statistical panel data analysis

Darold T. Barnum John M. Gleason Matthew G. Karlaftis Glen T. Schumock Karen L. Shields Sonali Tandon 《Journal of applied statistics》2012,39(4):815-828

This paper describes a statistical method for estimating data envelopment analysis (DEA) score confidence intervals for individual organizations or other entities. This method applies statistical panel data analysis, which provides proven and powerful methodologies for diagnostic testing and for estimation of confidence intervals. DEA scores are tested for violations of the standard statistical assumptions including contemporaneous correlation, serial correlation, heteroskedasticity and the absence of a normal distribution. Generalized least squares statistical models are used to adjust for violations that are present and to estimate valid confidence intervals within which the true efficiency of each individual decision-making unit occurs. This method is illustrated with two sets of panel data, one from large US urban transit systems and the other from a group of US hospital pharmacies. 相似文献

13.

A study on discriminant analysis techniques applied to multivariate lognormal data

《Journal of Statistical Computation and Simulation》2012,82(1-2):79-100

The purpose of this paper is to examine the multiple group (>2) discrimination problem in which the group sizes are unequal and the variables used in the classification are correlated with skewed distributions. Using statistical simulation based on data from a clinical study, we compare the performances, in terms of misclassification rates, of nine statistical discrimination methods. These methods are linear and quadratic discriminant analysis applied to untransformed data, rank transformed data, and inverse normal scores data, as well as fixed kernel discriminant analysis, variable kernel discriminant analysis, and variable kernel discriminant analysis applied to inverse normal scores data. It is found that the parametric methods with transformed data generally outperform the other methods, and the parametric methods applied to inverse normal scores usually outperform the parametric methods applied to rank transformed data. Although the kernel methods often have very biased estimates, the variable kernel method applied to inverse normal scores data provides considerable improvement in terms of total nonerror rate. 相似文献

14.

Tell me why I don't like Mondays: investigating day of the week effects on job satisfaction and psychological well-being

Mark P. Taylor 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2006,169(1):127-142

Summary. The paper explores the relationship between the day of the week on which a survey respondent is interviewed and their self-reported job satisfaction and mental health scores by using data from the British Household Panel Survey. Evidence presented here confirms that self-reported levels of job satisfaction and subjective levels of mental distress systematically vary according to the day of the week on which respondents are interviewed even when controlling for other observed and unobserved characteristics. However, we find that the main conclusions from previous studies of the determinants of job satisfaction and mental well-being are robust to the inclusion of day-of-interview controls. 相似文献

15.

Self-modelling warping functions

Daniel Gervini Theo Gasser 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(4):959-971

Summary. The paper introduces a semiparametric model for functional data. The warping functions are assumed to be linear combinations of q common components, which are estimated from the data (hence the name 'self-modelling'). Even small values of q provide remarkable model flexibility, comparable with nonparametric methods. At the same time, this approach avoids overfitting because the common components are estimated combining data across individuals. As a convenient by-product, component scores are often interpretable and can be used for statistical inference (an example of classification based on scores is given). 相似文献

16.

Testing for bioequivalence of highly variable drugs from TR‐RT crossover designs with heterogeneous residual variances

下载免费PDF全文

Qing Kang Christopher I. Vahl 《Pharmaceutical statistics》2017,16(5):361-377

Traditional bioavailability studies assess average bioequivalence (ABE) between the test (T) and reference (R) products under the crossover design with TR and RT sequences. With highly variable (HV) drugs whose intrasubject coefficient of variation in pharmacokinetic measures is 30% or greater, assertion of ABE becomes difficult due to the large sample sizes needed to achieve adequate power. In 2011, the FDA adopted a more relaxed, yet complex, ABE criterion and supplied a procedure to assess this criterion exclusively under TRR‐RTR‐RRT and TRTR‐RTRT designs. However, designs with more than 2 periods are not always feasible. This present work investigates how to evaluate HV drugs under TR‐RT designs. A mixed model with heterogeneous residual variances is used to fit data from TR‐RT designs. Under the assumption of zero subject‐by‐formulation interaction, this basic model is comparable to the FDA‐recommended model for TRR‐RTR‐RRT and TRTR‐RTRT designs, suggesting the conceptual plausibility of our approach. To overcome the distributional dependency among summary statistics of model parameters, we develop statistical tests via the generalized pivotal quantity (GPQ). A real‐world data example is given to illustrate the utility of the resulting procedures. Our simulation study identifies a GPQ‐based testing procedure that evaluates HV drugs under practical TR‐RT designs with desirable type I error rate and reasonable power. In comparison to the FDA's approach, this GPQ‐based procedure gives similar performance when the product's intersubject standard deviation is low (≤0.4) and is most useful when practical considerations restrict the crossover design to 2 periods. 相似文献

17.

Comparison of several imputation methods for missing baseline data in propensity scores analysis of binary outcome

Brenda J. Crowe Ilya A. Lipkovich Ouhong Wang 《Pharmaceutical statistics》2010,9(4):269-279

We performed a simulation study comparing the statistical properties of the estimated log odds ratio from propensity scores analyses of a binary response variable, in which missing baseline data had been imputed using a simple imputation scheme (Treatment Mean Imputation), compared with three ways of performing multiple imputation (MI) and with a Complete Case analysis. MI that included treatment (treated/untreated) and outcome (for our analyses, outcome was adverse event [yes/no]) in the imputer's model had the best statistical properties of the imputation schemes we studied. MI is feasible to use in situations where one has just a few outcomes to analyze. We also found that Treatment Mean Imputation performed quite well and is a reasonable alternative to MI in situations where it is not feasible to use MI. Treatment Mean Imputation performed better than MI methods that did not include both the treatment and outcome in the imputer's model. Copyright © 2009 John Wiley & Sons, Ltd. 相似文献

18.

On estimating the distribution of data envelopment analysis efficiency scores: an application to nursing homes’ care planning process

B. J. Gajewski R. Lee M. Bott U. Piamjariyakul R. L. Taunton 《Journal of applied statistics》2009,36(9):933-944

Data envelopment analysis (DEA) is a deterministic econometric model for calculating efficiency by using data from an observed set of decision-making units (DMUs). We propose a method for calculating the distribution of efficiency scores. Our framework relies on estimating data from an unobserved set of DMUs. The model provides posterior predictive data for the unobserved DMUs to augment the frontier in the DEA that provides a posterior predictive distribution for the efficiency scores. We explore the method on a multiple-input and multiple-output DEA model. The data for the example are from a comprehensive examination of how nursing homes complete a standardized mandatory assessment of residents. 相似文献

19.

For what use are tests of hypotheses and tests of significance

Harry V. Roberts 《统计学通讯:理论与方法》2013,42(8):753-761

Tests of sharp null hypotheses, although frequently computed, are.rarely appropriate as the major end product of statistical analyses, except possibly in some, areas of the natural sciences. But procedures closely akin to tests, although often less formal, are needed in almost every investigation in which exploratory data analysis is used to help to decide upon the statistical model appropriate for the final analysis. The term “diagnostic check”has been suggested by Box and Jenkins for these procedures. Traditional statistical tests often suggest useful diagnostic checks -and this, in my view, is what tests are mainly good for-but visual examination and interpretation of data plots are often equally important. Biere is also much to be gained by the development of new diagnostic checks, and testing theory may be useful as one guide to this development. 相似文献

20.

A multivariate time series model for the analysis and prediction of carbon monoxide atmospheric concentrations 总被引：2，自引：0，他引：2

Stefano F. Tonellato 《Journal of the Royal Statistical Society. Series C, Applied statistics》2001,50(2):187-200

We use a Bayesian multivariate time series model for the analysis of the dynamics of carbon monoxide atmospheric concentrations. The data are observed at four sites. It is assumed that the logarithm of the observed process can be represented as the sum of unobservable components: a trend, a daily periodicity, a stationary autoregressive signal and an erratic term. Bayesian analysis is performed via Gibbs sampling. In particular, we consider the problem of joint temporal prediction when data are observed at a few sites and it is not possible to fit a complex space–time model. A retrospective analysis of the trend component is also given, which is important in that it explains the evolution of the variability in the observed process. 相似文献