期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Influence diagnostics for the structural errors-in-variables model under the Student-t distribution 总被引：1，自引：1，他引：0

Manuel Galea Heleno Bolfarine Filidor Vilcalabra 《Journal of applied statistics》2002,29(8):1191-1204

The influence of observations on the parameter estimates for the simple structural errors-in-variables model with no equation error, under the Student-t distribution, is investigated using the local influence approach. The main conclusion is that the Student-t model with small degrees of freedom is able to incorporate possible outliers and influential observations in the data. The likelihood displacement approach is useful for outlier detection, especially when a masking phenomenon is present and the degrees of freedom parameter is large. The diagnostics are illustrated with two examples. 相似文献

2.

Robust methods for the analysis of spatially autocorrelated data

Andrea Cerioli Marco Riani 《Statistical Methods and Applications》2002,11(3):335-358

In this paper we propose a new robust technique for the analysis of spatial data through simultaneous autoregressive (SAR) models, which extends the Forward Search approach of Cerioli and Riani (1999) and Atkinson and Riani (2000). Our algorithm starts from a subset of outlier-free observations and then selects additional observations according to their degree of agreement with the postulated model. A number of useful diagnostics which are monitored along the search help to identify masked spatial outliers and high leverage sites. In contrast to other robust techniques, our method is particularly suited for the analysis of complex multidimensional systems since each step is performed through statistically and computationally efficient procedures, such as maximum likelihood. The main contribution of this paper is the development of joint robust estimation of both trend and autocorrelation parameters in spatial linear models. For this purpose we suggest a novel definition of the elemental sets of the Forward Search, which relies on blocks of contiguous spatial locations. 相似文献

3.

A procedure for testing suspected observations

Nirpeksh Kumar 《Statistical Papers》2013,54(2):471-478

The problem of multiple outliers detection in one-parameter exponential family is considered. The outlier detection procedure involves two estimates of scale parameter which are obtained by maximizing two log-likelihoods; the complete data log-likelihood and its conditional expectation given suspected observations. The procedure is also applied to the exponential and normal samples. 相似文献

4.

Spuriosity and outliers in directional data

Parthasarathy Bagchi Irwin Guttman 《Journal of applied statistics》1990,17(3):341-350

This paper discusses inference regarding the mean direction and the concentration parameters based on data from the von Mises distribution from a Bayesian point of view, when k(k < n/2) of the n observations are spurious, that is, are from a von Mises population with a shifted mean direction. The Bayesian analysis for this spuriosity case provides both detection, identification, and estimation for the mean direction and the concentration parameter when indeed spurious observations are present, possibly giving rise to outliers. 相似文献

5.

Influence diagnostics for censored regression models with autoregressive errors

下载免费PDF全文

Fernanda L. Schumacher Victor H. Lachos Filidor E. Vilca‐Labra Luis M. Castro 《Australian & New Zealand Journal of Statistics》2018,60(2):209-229

Observations collected over time are often autocorrelated rather than independent, and sometimes include observations below or above detection limits (i.e. censored values reported as less or more than a level of detection) and/or missing data. Practitioners commonly disregard censored data cases or replace these observations with some function of the limit of detection, which often results in biased estimates. Moreover, parameter estimation can be greatly affected by the presence of influential observations in the data. In this paper we derive local influence diagnostic measures for censored regression models with autoregressive errors of order p (hereafter, AR(p)‐CR models) on the basis of the Q‐function under three useful perturbation schemes. In order to account for censoring in a likelihood‐based estimation procedure for AR(p)‐CR models, we used a stochastic approximation version of the expectation‐maximisation algorithm. The accuracy of the local influence diagnostic measure in detecting influential observations is explored through the analysis of empirical studies. The proposed methods are illustrated using data, from a study of total phosphorus concentration, that contain left‐censored observations. These methods are implemented in the R package ARCensReg. 相似文献

6.

Regression Diagnostic under Model Misspecification

Li-Chu Chien Tsung-Shan Tsou 《Journal of applied statistics》2007,34(5):563-575

We propose two novel diagnostic measures for the detection of influential observations for regression parameters in linear regression. Traditional diagnostic statistics focus on the effect of deletion of data points either on parameter estimates, or on predicted values. A data point is regarded as influential by the new methods if its inclusion determines a significantly different likelihood function for the parameter of interest. The concerned likelihood function is asymptotically valid for practically all underlying distributions whose second moments exist. 相似文献

7.

Outliers and influential observations in the structural errors-in-variables model

Myung Geun Kim 《Journal of applied statistics》2000,27(4):451-460

The influence of observations on the parameter estimates for the simple structural errors-in-variables model with no equation error is investigated using the local influence method. Residuals themselves are not sufficient for detecting outliers. The likelihood displacement approach is useful for outlier detection especially when a masking phenomenon is present. An illustrative example is provided. 相似文献

8.

Continuous monitoring for changepoints in data streams using adaptive estimation

Dean A. Bodenham Niall M. Adams 《Statistics and Computing》2017,27(5):1257-1270

Data streams are characterised by a potentially unending sequence of high-frequency observations which are subject to unknown temporal variation. Many modern streaming applications demand the capability to sequentially detect changes as soon as possible after they occur, while continuing to monitor the stream as it evolves. We refer to this problem as continuous monitoring. Sequential algorithms such as CUSUM, EWMA and their more sophisticated variants usually require a pair of parameters to be selected for practical application. However, the choice of parameter values is often based on the anticipated size of the changes and a given choice is unlikely to be optimal for the multiple change sizes which are likely to occur in a streaming data context. To address this critical issue, we introduce a changepoint detection framework based on adaptive forgetting factors that, instead of multiple control parameters, only requires a single parameter to be selected. Simulated results demonstrate that this framework has utility in a continuous monitoring setting. In particular, it reduces the burden of selecting parameters in advance. Moreover, the methodology is demonstrated on real data arising from Foreign Exchange markets. 相似文献

9.

From grouped to de-grouped data: a new approach in distribution fitting for grouped data

Ying-Ju Chen Tatjana Miljkovic 《Journal of Statistical Computation and Simulation》2019,89(2):272-291

Sampling within a given interval with a constraint has not been previously considered. Standard parametric simulation engines require knowledge of the parameters of the distribution from which a sample is drawn. These methods are limited if additional constrains are required for the simulated data. We propose a method that generates the targeted number of individual observations within a given interval with a constraint that the average value of observations is known. This method is further extended to a grouped data setting, as a way of data de-grouping, when the frequency and average value of observations are provided for each group. Several simulation studies are employed to evaluate the performance of the proposed method, in case of both a single interval and grouped data, for different simulation settings. Furthermore, the proposed method is evaluated in the parameter recovery when different distributions are fitted to the de-grouped data. This method is found to be superior to the uniform method previously used in data de-grouping. The results of the simulation study are promising and they show that this method can be used successfully in the applications where data de-grouping requires that the average value of observations is maintained in each group. The application of the proposed method is illustrated on a real data of insurance losses for bodily injury claims. 相似文献

10.

Bayesian covariance estimation and inference in latent Gaussian process models

《Statistical Methodology》2014

This paper describes inference methods for functional data under the assumption that the functional data of interest are smooth latent functions, characterized by a Gaussian process, which have been observed with noise over a finite set of time points. The methods we propose are completely specified in a Bayesian environment that allows for all inferences to be performed through a simple Gibbs sampler. Our main focus is in estimating and describing uncertainty in the covariance function. However, these models also encompass functional data estimation, functional regression where the predictors are latent functions, and an automatic approach to smoothing parameter selection. Furthermore, these models require minimal assumptions on the data structure as the time points for observations do not need to be equally spaced, the number and placement of observations are allowed to vary among functions, and special treatment is not required when the number of functional observations is less than the dimensionality of those observations. We illustrate the effectiveness of these models in estimating latent functional data, capturing variation in the functional covariance estimate, and in selecting appropriate smoothing parameters in both a simulation study and a regression analysis of medfly fertility data. 相似文献

11.

Accounting for plasma levels below detection limits in a one-compartment zero-order absorption pharmacokinetics model

Alan D. Hutson Lauren Fishbein Patricia O'Brien Peter W. Stacpoole 《Journal of applied statistics》2002,29(8):1181-1190

We provide a simple method for fitting a one-compartment, zero-order absorption pharmacokinetics model in the presence of observations below the detection limit. This method may be extended to more complex pharmacokinetics models. We demonstrate, using a small simulation study, that the method provides accurate parameter estimates over a range of detection limits and we compare it to an ad hoc midpoint method. An applied example is provided from a pharmacokinetic investigation of a nicotine nasal spray. 相似文献

12.

Tests for exponentiality

Girish Patwardhan 《统计学通讯:理论与方法》2013,42(11):3705-3722

The idea of measuring the departure of data bu a plot of obeserved observations against their expectation has been expeetations has been exploited in this paper to develop tests for exponentiality the tests are for parameter two parameter exponential distribution with complete sample and one parameter exponential distribution with complete sample and one large sample distributions of the test statistics critical points have been computed for different levels of significance and applications of these have been computed for differents levels of significance and applications of these tests have been discussed in case of three data sets. 相似文献

13.

Dynamic generalized extreme value modeling via particle filters

Yonghua Wei 《统计学通讯:模拟与计算》2017,46(8):6324-6341

This article addresses how particle filters compare to MCMC methods for posterior density approximations of a model that allows for a dynamic state with fixed parameters and where the observation equation is nonlinear. This is a problem that was not been well studied in the specialized literature. We prove that these state and parameter estimations can be achieved via particle filter methods without the need of more expensive Forward Filtering Backward Sampling (FFBS) simulation. Estimation of a time-varying extreme value model via the generalized extreme value distribution is considered using these particle filter methods and compared to a MCMC algorithm that involves a variety of Metropolis-Hastings steps. We illustrate and compare the different methodologies with simulated data and some minimum daily stock returns occurring monthly from January 4, 1990 to December 28, 2007 using the Tokyo Stock Price Index (TOPIX). 相似文献

14.

A comparative study on detection of influential observations in linear regression

A. Hossain D. N. Naik 《Statistical Papers》1991,32(1):55-69

A large number of statistics are used in the literature to detect outliers and influential observations in the linear regression model. In this paper comparison studies have been made for determining a statistic which performs better than the other. This includes: (i) a detailed simulation study, and (ii) analyses of several data sets studied by different authors. Different choices of the design matrix of regression model are considered. Design A studies the performance of the various statistics for detecting the scale shift type outliers, and designs B and C provide information on the performance of the statistics for identifying the influential observations. We have used cutoff points using the exact distributions and Bonferroni's inequality for each statistic. The results show that the studentized residual which is used for detection of mean shift outliers is appropriate for detection of scale shift outliers also, and the Welsch's statistic and the Cook's distance are appropriate for detection of influential observations. 相似文献

15.

Trends in smoking cessation: a Markov model approach

Charles G. Minard David W. Wetter Carol J. Etzel 《Journal of applied statistics》2012,39(1):113-127

Intervention trials such as studies on smoking cessation may observe multiple, discrete outcomes over time. When the outcome is binary, participant observations may alternate between two states over the course of the study. The generalized estimating equation (GEE) approach is commonly used to analyze binary, longitudinal data in the context of independent variables. However, the sequence of observations may be assumed to follow a Markov chain with stationary transition probabilities when observations are made at fixed time points. Participants favoring the transition to one particular state over the other would be evidence of a trend in the observations. Using a log-transformed trend parameter, the determinants of a trend in a binary, longitudinal study may be evaluated by maximizing the likelihood function. A new methodology is presented here to test for the presence and determinants of a trend in binary, longitudinal observations. Empirical studies are evaluated and comparisons are made with the GEE approach. Practical application of the proposed method is made to the data available from an intervention study on smoking cessation. 相似文献

16.

On robustness of the Shiryaev–Roberts change-point detection procedure under parameter misspecification in the post-change distribution

Wenyu Du Grigory Sokolov 《统计学通讯:模拟与计算》2017,46(3):2185-2206

The gist of the quickest change-point detection problem is to detect the presence of a change in the statistical behavior of a series of sequentially made observations, and do so in an optimal detection-speed-versus-“false-positive”-risk manner. When optimality is understood either in the generalized Bayesian sense or as defined in Shiryaev's multi-cyclic setup, the so-called Shiryaev–Roberts (SR) detection procedure is known to be the “best one can do”, provided, however, that the observations’ pre- and post-change distributions are both fully specified. We consider a more realistic setup, viz. one where the post-change distribution is assumed known only up to a parameter, so that the latter may be misspecified. The question of interest is the sensitivity (or robustness) of the otherwise “best” SR procedure with respect to a possible misspecification of the post-change distribution parameter. To answer this question, we provide a case study where, in a specific Gaussian scenario, we allow the SR procedure to be “out of tune” in the way of the post-change distribution parameter, and numerically assess the effect of the “mistuning” on Shiryaev's (multi-cyclic) Stationary Average Detection Delay delivered by the SR procedure. The comprehensive quantitative robustness characterization of the SR procedure obtained in the study can be used to develop the respective theory as well as to provide a rational for practical design of the SR procedure. The overall qualitative conclusion of the study is an expected one: the SR procedure is less (more) robust for less (more) contrast changes and for lower (higher) levels of the false alarm risk. 相似文献

17.

An elastic net penalized small area model combining unit- and area-level data for regional hypertension prevalence estimation

J. P. Burgard J. Krause R. Münnich 《Journal of applied statistics》2021,48(9):1659

Hypertension is a highly prevalent cardiovascular disease. It marks a considerable cost factor to many national health systems. Despite its prevalence, regional disease distributions are often unknown and must be estimated from survey data. However, health surveys frequently lack in regional observations due to limited resources. Obtained prevalence estimates suffer from unacceptably large sampling variances and are not reliable. Small area estimation solves this problem by linking auxiliary data from multiple regions in suitable regression models. Typically, either unit- or area-level observations are considered for this purpose. But with respect to hypertension, both levels should be used. Hypertension has characteristic comorbidities and is strongly related to lifestyle features, which are unit-level information. It is also correlated with socioeconomic indicators that are usually measured on the area-level. But the level combination is challenging as it requires multi-level model parameter estimation from small samples. We use a multi-level small area model with level-specific penalization to overcome this issue. Model parameter estimation is performed via stochastic coordinate gradient descent. A jackknife estimator of the mean squared error is presented. The methodology is applied to combine health survey data and administrative records to estimate regional hypertension prevalence in Germany. 相似文献

18.

Sensitivity analysis in ordered and restricted parameter models

Enrique Castillo Carmen Castillo Ali S. Hadi 《Journal of statistical planning and inference》2008

This paper deals with the problem of local sensitivity analysis in ordered parameter models. In addition to order restrictions, some constraints imposed on the parameters by the model and/or the data are considered. Measures for assessing how much a change in the data modifies the results and conclusions of a statistical analysis of these models are presented. The sensitivity measures are derived using recent results in mathematical programming. The estimation problem is formulated as a primal nonlinear programming problem, and the sensitivities of the parameter estimates as well as the objective function sensitivities with respect to data are obtained. They are very effective in revealing the influential observations in this type of models and in evaluating the changes due to changes in data values. The methods are illustrated by their application to a wide variety of examples of order-restricted models including ordered exponential family parameters, ordered multinomial parameters, ordered linear model parameters, ordered and data constrained parameters, and ordered functions of parameters. 相似文献

19.

ARL-unbiased control charts for the monitoring of exponentially distributed characteristics based on type-II censored samples

《Journal of Statistical Computation and Simulation》2012,82(12):2734-2747

In this paper, the problem of monitoring process data that can be modelled by exponential distribution is considered when observations are from type-II censoring. Such data are common in many practical inspection environment. An average run length unbiased (ARL-unbiased) control scheme is developed when the in-control scale parameter is known. The performance of the proposed control charts are investigated in terms of the ARL and standard deviation of the run length. The effects of parameter estimation on the proposed control charts are also evaluated. Then, we consider the design of the ARL-unbiased control charts when the in-control scale parameter is estimated. Finally, an example is used to illustrate the implementation of the proposed control charts. 相似文献

20.

Influence Diagnostics on Testing Linear Hypothesis in Linear Models with Correlated Errors

Hadi Emami 《统计学通讯:理论与方法》2014,43(6):1050-1060

To assess the influence of observations on the parameter estimates, case deletion diagnostics are commonly used in linear regression models. For linear models with correlated errors we study the influence of observations on testing a linear hypothesis using single and multiple case deletions. The change in likelihood ratio test and F test theoretically is derived and it is shown these tests to be completely determined by two proposed generalized externally studentized residuals. An illustrative example of a real data set is also reported. 相似文献