期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Ishapathik Das Siuli Mukhopadhyay 《统计学通讯:模拟与计算》2013,42(10):2998-3021

Abstract

Model misspecification in generalized linear models (GLMs) occurs usually when the linear predictor and/or the link function assumed are incorrect. This article discusses the effect of such misspecification on design selection for multinomial GLMs and proposes the use of quantile dispersion graphs to select robust designs. Due to misspecification in the model, parameter estimates are usually biased and the designs are compared on the basis of their mean squared error of prediction. Several numerical examples including a real data set are presented to illustrate the proposed methodology. 相似文献

2.

Comparison of designs for multivariate generalized linear models

S. Mukhopadhyay A.I. Khuri 《Journal of statistical planning and inference》2008

The purpose of this paper is to discuss response surface designs for multivariate generalized linear models (GLMs). Such models are considered whenever several response variables can be measured for each setting of a group of control variables, and the response variables are adequately represented by GLMs. The mean-squared error of prediction (MSEP) matrix is used to assess the quality of prediction associated with a given design. The MSEP incorporates both the prediction variance and the prediction bias, which results from using maximum likelihood estimates of the parameters of the fitted linear predictor. For a given design, quantiles of a scalar-valued function of the MSEP are obtained within a certain region of interest. The quantiles depend on the unknown parameters of the linear predictor. The dispersion of these quantiles over the space of the unknown parameters is determined and then depicted by the so-called quantile dispersion graphs. An application of the proposed methodology is presented using the special case of the bivariate binary distribution. 相似文献

3.

Response surface design evaluation and comparison

Christine M. Anderson-Cook Connie M. Borror Douglas C. Montgomery 《Journal of statistical planning and inference》2009

Designing an experiment to fit a response surface model typically involves selecting among several candidate designs. There are often many competing criteria that could be considered in selecting the design, and practitioners are typically forced to make trade-offs between these objectives when choosing the final design. Traditional alphabetic optimality criteria are often used in evaluating and comparing competing designs. These optimality criteria are single-number summaries for quality properties of the design such as the precision with which the model parameters are estimated or the uncertainty associated with prediction. Other important considerations include the robustness of the design to model misspecification and potential problems arising from spurious or missing data. Several qualitative and quantitative properties of good response surface designs are discussed, and some of their important trade-offs are considered. Graphical methods for evaluating design performance for several important response surface problems are discussed and we show how these techniques can be used to compare competing designs. These graphical methods are generally superior to the simplistic summaries of alphabetic optimality criteria. Several special cases are considered, including robust parameter designs, split-plot designs, mixture experiment designs, and designs for generalized linear models. 相似文献

4.

Parametric simultaneous robust inferences for regression coefficient under generalized linear models

《Journal of Statistical Computation and Simulation》2012,82(4):850-867

In this article, the parametric robust regression approaches are proposed for making inferences about regression parameters in the setting of generalized linear models (GLMs). The proposed methods are able to test hypotheses on the regression coefficients in the misspecified GLMs. More specifically, it is demonstrated that with large samples, the normal and gamma regression models can be properly adjusted to become asymptotically valid for inferences about regression parameters under model misspecification. These adjusted regression models can provide the correct type I and II error probabilities and the correct coverage probability for continuous data, as long as the true underlying distributions have finite second moments. 相似文献

5.

Robust Designs in Generalized Linear Models: A Quantile Dispersion Graphs Approach

I. Das M. Aggarwal 《统计学通讯:模拟与计算》2015,44(9):2348-2370

This article studies design selection for generalized linear models (GLMs) using the quantile dispersion graphs (QDGs) approach in the presence of misspecification in the link and/or linear predictor. The uncertainty in the linear predictor is represented by a unknown function and estimated using kriging. For addressing misspecified link functions, a generalized family of link functions is used. Numerical examples are shown to illustrate the proposed methodology. 相似文献

6.

Graphical evaluation of robust parameter designs based on extended scaled prediction variance and extended spherical average prediction variance

Jin H. Oh Sung H. Park Soon S. Kwon 《统计学通讯:理论与方法》2018,47(14):3523-3531

For any response surface design, there are locations in the design region where responses are estimated well and locations where estimation is relatively poor. Consequently, graphical evaluation—such as variance dispersion graphs and the fraction of design space—is used as an alternative to a single-valued criterion. Such plots are used to investigate and compare the prediction capabilities of certain response surface designs currently available to the researcher. In this article, we propose the extended scaled prediction variance and extended spherical average prediction variance as prediction methods. We also illustrate how graphical methods can be employed to evaluate robust parameter designs. 相似文献

7.

Model misspecification in parametric dual modeling

《Journal of Statistical Computation and Simulation》2012,82(2):113-126

In typical normal theory regression, the assumption of homogeneity of variances is often not appropriate. Instead of treating the variances as a nuisance and transforming away the heterogeneity, the structure of the variances may be of interest and it is desirable to model the variances. Simultaneous modeling of the mean and variance of a response is known as dual modeling. When parametric models for the mean and variance are prescribed, estimation of the mean and variance parameters are interrelated. One commonly used dual model assumes a linear model for the mean and a log-linear variance model (Aitkin, 1987). This paper considers the impact of model misspecification (mean and variance) on the dual model estimation procedure. Asymptotic expressions for the mean and variance estimates, graphical illustrations of the impact of model misspecification, and simulation results are presented. 相似文献

8.

Minimax A-, c-, and I-optimal regression designs for models with heteroscedastic errors

Hanan Abousaleh Julie Zhou 《Revue canadienne de statistique》2023,51(1):258-274

It is well known that it is difficult to construct minimax optimal designs. Furthermore, since in practice we never know the true error variance, it is important to allow small deviations and construct robust optimal designs. We investigate a class of minimax optimal regression designs for models with heteroscedastic errors that are robust against possible misspecification of the error variance. Commonly used A-, c-, and I-optimality criteria are included in this class of minimax optimal designs. Several theoretical results are obtained, including a necessary condition and a reflection symmetry for these minimax optimal designs. In this article, we focus mainly on linear models and assume that an approximate error variance function is available. However, we also briefly discuss how the methodology works for nonlinear models. We then propose an effective algorithm to solve challenging nonconvex optimization problems to find minimax designs on discrete design spaces. Examples are given to illustrate minimax optimal designs and their properties. 相似文献

9.

Bias in Small-Sample Inference With Count-Data Models

McKinley L. Blackburn 《The American statistician》2020,74(3):267-273

Abstract

Both Poisson and negative binomial regression can provide quasi-likelihood estimates for coefficients in exponential-mean models that are consistent in the presence of distributional misspecification. It has generally been recommended, however, that inference be carried out using asymptotically robust estimators for the parameter covariance matrix. As with linear models, such robust inference tends to lead to over-rejection of null hypotheses in small samples. Alternative methods for estimating coefficient estimator variances are considered. No one approach seems to remove all test bias, but the results do suggest that the use of the jackknife with Poisson regression tends to be least biased for inference. 相似文献

10.

Constructing Efficient Experimental Designs for Generalized Linear Models

Moein Saleh 《统计学通讯:模拟与计算》2016,45(8):2827-2845

Exchange algorithms are popular for finding optimal or efficient designs for linear models, but there are few discussions of this type of algorithm for generalized linear models (GLMs) in literature. A new algorithm, generalized Coordinate Exchange Algorithm (gCEA), is developed in this article to construct efficient designs for GLMs. We compare the performance of the proposed algorithm with other optimization algorithms, including point exchange algorithm, columnwise-pairwise algorithm, simulated annealing and generic algorithm, and demonstrate the superior performance of this new algorithm. 相似文献

11.

The robustness of response surface designs with errors in factor levels

Juntao Fang Zhen He Shuguang He Guodong Wang 《统计学通讯:理论与方法》2020,49(10):2365-2383

ABSRTACT

Since errors in factor levels affect the traditional statistical properties of response surface designs, an important question to consider is robustness of design to errors. However, when the actual design could be observed in the experimental settings, its optimality and prediction are of interest. Various numerical and graphical methods are useful tools for understanding the behavior of the designs. The D- and G-efficiencies and the fraction of design space plot are adapted to assess second-order response surface designs where the predictor variables are disturbed by a random error. Our study shows that the D-efficiencies of the competing designs are considerably low for big variance of the error, while the G-efficiencies are quite good. Fraction of design space plots display the distribution of the scaled prediction variance through the design space with and without errors in factor levels. The robustness of experimental designs against factor errors is explored through comparative study. The construction and use of the D- and G-efficiencies and the fraction of design space plots are demonstrated with several examples of different designs with errors. 相似文献

12.

AN IMPROVED COMPOUND ESTIMATOR FOR ROBUST REGRESSION

《统计学通讯:模拟与计算》2013,42(4):653-672

ABSTRACT

Advances in statistical computing software have led to a substantial increase in the use of ordinary least squares (OLS) regression models in the engineering and applied statistics communities. Empirical evidence suggests that data sets can routinely have 10% or more outliers in many processes. Unfortunately, these outliers typically will render the OLS parameter estimates useless. The OLS diagnostic quantities and graphical plots can reliably identify a few outliers; however, they significantly lose power with increasing dimension and number of outliers. Although there have been recent advances in the methods that detect multiple outliers, improvements are needed in regression estimators that can fit well in the presence of outliers. We introduce a robust regression estimator that performs well regardless of outlier quantity and configuration. Our studies show that the best available estimators are vulnerable when the outliers are extreme in the regressor space (high leverage). Our proposed compound estimator modifies recently published methods with an improved initial estimate and measure of leverage. Extensive performance evaluations indicate that the proposed estimator performs the best and consistently fits the bulk of the data when outliers are present. The estimator, implemented in standard software, provides researchers and practitioners a tool for the model-building process to protect against the severe impact from multiple outliers. 相似文献

13.

for misspecified regression models

Peilin Shi Jane J. Ye Julie Zhou 《Revue canadienne de statistique》2003,31(4):397-414

The authors propose minimax robust designs for regression models whose response function is possibly misspecified. These designs, which minimize the maximum of the mean squared error matrix, can control the bias caused by model misspecification and provide the desired efficiency through one parameter. The authors call on a nonsmooth optimization technique to derive these designs analytically. Their results extend those of Heo, Schmuland & Wiens (2001). The authors also discuss several examples for approximately polynomial regression. 相似文献

14.

Missing data techniques for multilevel data: implications of model misspecification

Anne C. Black Ofer Harel D. Betsy McCoach 《Journal of applied statistics》2011,38(9):1845-1865

When modeling multilevel data, it is important to accurately represent the interdependence of observations within clusters. Ignoring data clustering may result in parameter misestimation. However, it is not well established to what degree parameter estimates are affected by model misspecification when applying missing data techniques (MDTs) to incomplete multilevel data. We compare the performance of three MDTs with incomplete hierarchical data. We consider the impact of imputation model misspecification on the quality of parameter estimates by employing multiple imputation under assumptions of a normal model (MI/NM) with two-level cross-sectional data when values are missing at random on the dependent variable at rates of 10%, 30%, and 50%. Five criteria are used to compare estimates from MI/NM to estimates from MI assuming a linear mixed model (MI/LMM) and maximum likelihood estimation to the same incomplete data sets. With 10% missing data (MD), techniques performed similarly for fixed-effects estimates, but variance components were biased with MI/NM. Effects of model misspecification worsened at higher rates of MD, with the hierarchical structure of the data markedly underrepresented by biased variance component estimates. MI/LMM and maximum likelihood provided generally accurate and unbiased parameter estimates but performance was negatively affected by increased rates of MD. 相似文献

15.

A mixture-based approach to robust analysis of generalised linear models

Ken J. Beath 《Journal of applied statistics》2018,45(12):2256-2268

A method for robustness in linear models is to assume that there is a mixture of standard and outlier observations with a different error variance for each class. For generalised linear models (GLMs) the mixture model approach is more difficult as the error variance for many distributions has a fixed relationship to the mean. This model is extended to GLMs by changing the classes to one where the standard class is a standard GLM and the outlier class which is an overdispersed GLM achieved by including a random effect term in the linear predictor. The advantages of this method are it can be extended to any model with a linear predictor, and outlier observations can be easily identified. Using simulation the model is compared to an M-estimator, and found to have improved bias and coverage. The method is demonstrated on three examples. 相似文献

16.

Efficient design of experiments in the Monod model

Holger Dette Viatcheslav B. Melas rey Pepelyshev Nikolai Strigul 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2003,65(3):725-742

Summary. Estimation and experimental design in a non-linear regression model that is used in microbiology are studied. The Monod model is defined implicitly by a differential equation and has numerous applications in microbial growth kinetics, water research, pharmacokinetics and plant physiology. It is proved that least squares estimates are asymptotically unbiased and normally distributed. The asymptotic covariance matrix of the estimator is the basis for the construction of efficient designs of experiments. In particular locally D -, E - and c -optimal designs are determined and their properties are studied theoretically and by simulation. If certain intervals for the non-linear parameters can be specified, locally optimal designs can be constructed which are robust with respect to a misspecification of the initial parameters and which allow efficient parameter estimation. Parameter variances can be decreased by a factor of 2 by simply sampling at optimal times during the experiment. 相似文献

17.

Optimal parametric design with applications to pharmacokinetic and pharmacodynamic trials

Jixian Wang 《Journal of applied statistics》2006,33(8):837-852

This paper considers optimal parametric designs, i.e. designs represented by probability measures determined by a set of parameters, for nonlinear models and illustrates their use in designs for pharmacokinetic (PK) and pharmacokinetic/pharmacodynamic (PK/PD) trials. For some practical problems, such as designs for modelling PK/PD relationship, this is often the only feasible type of design, as the design points follow a PK model and cannot be directly controlled. Even for ordinary design problems the parametric designs have some advantages over the traditional designs, which often have too few design points for model checking and may not be robust to model and parameter misspecifications. We first describe methods and algorithms to construct the parametric design for ordinary nonlinear design problems and show that the parametric designs are robust to parameter misspecification and have good power for model discrimination. Then we extend this design method to construct optimal repeated measurement designs for nonlinear mixed models. We also use this parametric design for modelling a PK/PD relationship and propose a simulation based algorithm. The application of parametric designs is illustrated with a three-parameter open one-compartment PK model for the ordinary design and repeated measurement design, and an Emax model for the phamacokinetic/pharmacodynamic trial design. 相似文献

18.

Computing A-optimal and E-optimal designs for regression models via semidefinite programming

Jane J. Ye Wenjie Zhou 《统计学通讯:模拟与计算》2017,46(3):2011-2024

In semidefinite programming (SDP), we minimize a linear objective function subject to a linear matrix being positive semidefinite. A powerful program, SeDuMi, has been developed in MATLAB to solve SDP problems. In this article, we show in detail how to formulate A-optimal and E-optimal design problems as SDP problems and solve them by SeDuMi. This technique can be used to construct approximate A-optimal and E-optimal designs for all linear and nonlinear regression models with discrete design spaces. In addition, the results on discrete design spaces provide useful guidance for finding optimal designs on any continuous design space, and a convergence result is derived. Moreover, restrictions in the designs can be easily incorporated in the SDP problems and solved by SeDuMi. Several representative examples and one MATLAB program are given. 相似文献

19.

A semi-parametric approach to robust parameter design

Stephanie M. Pickle Timothy J. Robinson Jeffrey B. Birch Christine M. Anderson-Cook 《Journal of statistical planning and inference》2008

Parameter design or robust parameter design (RPD) is an engineering methodology intended as a cost-effective approach for improving the quality of products and processes. The goal of parameter design is to choose the levels of the control variables that optimize a defined quality characteristic. An essential component of RPD involves the assumption of well estimated models for the process mean and variance. Traditionally, the modeling of the mean and variance has been done parametrically. It is often the case, particularly when modeling the variance, that nonparametric techniques are more appropriate due to the nature of the curvature in the underlying function. Most response surface experiments involve sparse data. In sparse data situations with unusual curvature in the underlying function, nonparametric techniques often result in estimates with problematic variation whereas their parametric counterparts may result in estimates with problematic bias. We propose the use of semi-parametric modeling within the robust design setting, combining parametric and nonparametric functions to improve the quality of both mean and variance model estimation. The proposed method will be illustrated with an example and simulations. 相似文献

20.

A graphical evaluation of logistic ridge estimator in mixture experiments

Kadri Ulas Akay 《Journal of applied statistics》2014,41(6):1217-1232

In comparison to other experimental studies, multicollinearity appears frequently in mixture experiments, a special study area of response surface methodology, due to the constraints on the components composing the mixture. In the analysis of mixture experiments by using a special generalized linear model, logistic regression model, multicollinearity causes precision problems in the maximum-likelihood logistic regression estimate. Therefore, effects due to multicollinearity can be reduced to a certain extent by using alternative approaches. One of these approaches is to use biased estimators for the estimation of the coefficients. In this paper, we suggest the use of logistic ridge regression (RR) estimator in the cases where there is multicollinearity during the analysis of mixture experiments using logistic regression. Also, for the selection of the biasing parameter, we use fraction of design space plots for evaluating the effect of the logistic RR estimator with respect to the scaled mean squared error of prediction. The suggested graphical approaches are illustrated on the tumor incidence data set. 相似文献