首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
The results of analyzing experimental data using a parametric model may heavily depend on the chosen model for regression and variance functions, moreover also on a possibly underlying preliminary transformation of the variables. In this paper we propose and discuss a complex procedure which consists in a simultaneous selection of parametric regression and variance models from a relatively rich model class and of Box-Cox variable transformations by minimization of a cross-validation criterion. For this it is essential to introduce modifications of the standard cross-validation criterion adapted to each of the following objectives: 1. estimation of the unknown regression function, 2. prediction of future values of the response variable, 3. calibration or 4. estimation of some parameter with a certain meaning in the corresponding field of application. Our idea of a criterion oriented combination of procedures (which usually if applied, then in an independent or sequential way) is expected to lead to more accurate results. We show how the accuracy of the parameter estimators can be assessed by a “moment oriented bootstrap procedure", which is an essential modification of the “wild bootstrap” of Härdle and Mammen by use of more accurate variance estimates. This new procedure and its refinement by a bootstrap based pivot (“double bootstrap”) is also used for the construction of confidence, prediction and calibration intervals. Programs written in Splus which realize our strategy for nonlinear regression modelling and parameter estimation are described as well. The performance of the selected model is discussed, and the behaviour of the procedures is illustrated, e.g., by an application in radioimmunological assay.  相似文献   

2.
Several alternative methods for derivation of the restricted ridge regression estimator (RRRE) are provided. Theoretical comparison and relationship of RRRE with related methods for regression with the multicollinearity problem are described. We also find inter-connections among RRRE, ordinary ridge regression estimator (ORRE), restricted least squares estimator (RLSE), modified ridge regression estimator (MRRE) and restricted modified generalized ridge estimator (RMGRE). Finally, numerical comparison, in addition to theoretical derivation, is also conducted with a Monte Carlo simulation and a real data example.  相似文献   

3.
We consider a regression of yy on xx given by a pair of mean and variance functions with a parameter vector θθ to be estimated that also appears in the distribution of the regressor variable xx. The estimation of θθ is based on an extended quasi-score (QS) function. We show that the QS estimator is optimal within a wide class of estimators based on linear-in-yy unbiased estimating functions. Of special interest is the case where the distribution of xx depends only on a subvector αα of θθ, which may be considered a nuisance parameter. In general, αα must be estimated simultaneously together with the rest of θθ, but there are cases where αα can be pre-estimated. A major application of this model is the classical measurement error model, where the corrected score (CS) estimator is an alternative to the QS estimator. We derive conditions under which the QS estimator is strictly more efficient than the CS estimator.  相似文献   

4.
Collapsibility with respect to a measure of association implies that the measure of association can be obtained from the marginal model. We first discuss model collapsibility and collapsibility with respect to regression coefficients for linear regression models. For parallel regression models, we give simple and different proofs of some of the known results and obtain also certain new results. For random coefficient regression models, we define (average) AA-collapsibility and obtain conditions under which it holds. We consider Poisson regression and logistic regression models also, and derive conditions for collapsibility and AA-collapsibility, respectively. These results generalize some of the results available in the literature. Some suitable examples are also discussed.  相似文献   

5.
The Dantzig selector (Candès & Tao, 2007) is a popular $\ell^{1}$ ‐regularization method for variable selection and estimation in linear regression. We present a very weak geometric condition on the observed predictors which is related to parallelism and, when satisfied, ensures the uniqueness of Dantzig selector estimators. The condition holds with probability 1, if the predictors are drawn from a continuous distribution. We discuss the necessity of this condition for uniqueness and also provide a closely related condition which ensures the uniqueness of lasso estimators (Tibshirani, 1996). Large sample asymptotics for the Dantzig selector, that is, almost sure convergence and the asymptotic distribution, follow directly from our uniqueness results and a continuity argument. The limiting distribution of the Dantzig selector is generally non‐normal. Though our asymptotic results require that the number of predictors is fixed (similar to Knight & Fu, 2000), our uniqueness results are valid for an arbitrary number of predictors and observations. The Canadian Journal of Statistics 41: 23–35; 2013 © 2012 Statistical Society of Canada  相似文献   

6.
7.
We study the distribution of the adaptive LASSO estimator [Zou, H., 2006. The adaptive LASSO and its oracle properties. J. Amer. Statist. Assoc. 101, 1418–1429] in finite samples as well as in the large-sample limit. The large-sample distributions are derived both for the case where the adaptive LASSO estimator is tuned to perform conservative model selection as well as for the case where the tuning results in consistent model selection. We show that the finite-sample as well as the large-sample distributions are typically highly nonnormal, regardless of the choice of the tuning parameter. The uniform convergence rate is also obtained, and is shown to be slower than n-1/2n-1/2 in case the estimator is tuned to perform consistent model selection. In particular, these results question the statistical relevance of the ‘oracle’ property of the adaptive LASSO estimator established in Zou [2006. The adaptive LASSO and its oracle properties. J. Amer. Statist. Assoc. 101, 1418–1429]. Moreover, we also provide an impossibility result regarding the estimation of the distribution function of the adaptive LASSO estimator. The theoretical results, which are obtained for a regression model with orthogonal design, are complemented by a Monte Carlo study using nonorthogonal regressors.  相似文献   

8.
The simple linear regression model with measurement error has been subject to much research. In this work we will focus on this model when the error in the explanatory variable is correlated with the error in the regression equation. Specifically, we are interested in the comparison between the ordinary errors-in-variables estimator of the regression coefficient ββ and the estimator that takes account of the correlation between the errors. Based on large sample approximations, we compare the estimators and find that the estimator that takes account of the correlation should be preferred in most situations. We also compare the estimators in small sample situations. This is done by stochastic simulation. The results show that the estimators behave quite similarly in most of the simulated situations, but that the ordinary errors-in-variables estimator performs considerably worse than the estimator that takes account of the correlation for certain parameter combinations. In addition, we look briefly into the bias introduced by ignoring correlated errors when computing sample correlations, and in predictions.  相似文献   

9.
In this article we investigate the problem of ascertaining A- and D-optimal designs in a cubic regression model with random coefficients. Our interest lies in estimation of all the parameters or in only those except the intercept term. Assuming the variance ratios to be known, we tabulate D-optimal designs for various combinations of the variance ratios. A-optimality does not pose any new problem in the random coefficients situation.  相似文献   

10.
A supersaturated design is a design whose run size is not enough for estimating all the main effects. It is commonly used in screening experiments, where the goals are to identify sparse and dominant active factors with low cost. In this paper, we study a variable selection method via the Dantzig selector, proposed by Candes and Tao [2007. The Dantzig selector: statistical estimation when pp is much larger than nn. Annals of Statistics 35, 2313–2351], to screen important effects. A graphical procedure and an automated procedure are suggested to accompany with the method. Simulation shows that this method performs well compared to existing methods in the literature and is more efficient at estimating the model size.  相似文献   

11.
General linear models with a common design matrix and with various structures of the variance–covariance matrix are considered. We say that a model is perfect for a linearly estimable parametric function, or the function is perfect in the model, if there exists the best linear unbiased estimator. All perfect models for a given function and all perfect functions in a given model are characterized.  相似文献   

12.
We investigate optimal designs for discriminating between exponential regression models of different complexity, which are widely used in the biological sciences; see, e.g., Landaw [1995. Robust sampling designs for compartmental models under large prior eigenvalue uncertainties. Math. Comput. Biomed. Appl. 181–187] or Gibaldi and Perrier [1982. Pharmacokinetics. Marcel Dekker, New York]. We discuss different approaches for the construction of appropriate optimality criteria, and find sharper upper bounds on the number of support points of locally optimal discrimination designs than those given by Caratheodory's Theorem. These results greatly facilitate the numerical construction of optimal designs. Various examples of optimal designs are then presented and compared to different other designs. Moreover, to protect the experiment against misspecifications of the nonlinear model parameters, we adapt the design criteria such that the resulting designs are robust with respect to such misspecifications and, again, provide several examples, which demonstrate the advantages of our approach.  相似文献   

13.
In the paper the problem of nonlinear unbiased estimation of expectation in linear models is considered. The considerations are restricted to linear plus quadratic estimators with quadratic parts invariant under a group of translations. The one way classification model is considered in detail, for which an explicit formula for the locally best estimators is presented. A numerical evaluation of variances of the best estimators is given for some unbalanced one way classification models and compared with the variance of the ordinary linear estimators.  相似文献   

14.
In this paper we consider linear sufficiency and linear completeness in the context of estimating the estimable parametric function KβKβ under the general Gauss–Markov model {y,Xβ2V}{y,Xβ,σ2V}. We give new characterizations for linear sufficiency, and define and characterize linear completeness in a case of estimation of KβKβ. Also, we consider a predictive approach for obtaining the best linear unbiased estimator of KβKβ, and subsequently, we give the linear analogues of the Rao–Blackwell and Lehmann–Scheffé Theorems in the context of estimating KβKβ.  相似文献   

15.
Scientific experiments commonly result in clustered discrete and continuous data. Existing methods for analyzing such data include the use of quasi-likelihood procedures and generalized estimating equations to estimate marginal mean response parameters. In applications to areas such as developmental toxicity studies, where discrete and continuous measurements are recorded on each fetus, or clinical ophthalmologic trials, where different types of observations are made on each eye, the assumption that data within cluster are exchangeable is often very reasonable. We use this assumption to formulate fully parametric regression models for clusters of bivariate data with binary and continuous components. The regression models proposed have marginal interpretations and reproducible model structures. Tractable expressions for likelihood equations are derived and iterative schemes are given for computing efficient estimates (MLEs) of the marginal mean, correlations, variances and higher moments. We demonstrate the use the ‘exchangeable’ procedure with an application to a developmental toxicity study involving fetal weight and malformation data.  相似文献   

16.
Zhijun Liu 《Statistics》2013,47(2):109-119
In this paper, the robustness of the least distances (LD) estimate in multivariate linear models, as defined by Bai, Chen, Miao and Rao (1990), is discussed in terms of the influence function as well as the breakdown point. The LD estimate is shown to be more robust than the least squares (LS) estimate. The robustness of the LD is similar to that of the least absolute deviations (LAD) estimate, a well studied robust estimate in the univariate case. In particular, if there are no outliers in the design matrices, the breakdown point of the LD estimate reaches the highest value, 1/2.  相似文献   

17.
We study a randomized adaptive design to assign one of the LL treatments to patients who arrive sequentially by means of an urn model. At each stage nn, a reward is distributed between treatments. The treatment applied is rewarded according to its response, 0?Yn?10?Yn?1, and 1-Yn1-Yn is distributed among the other treatments according to their performance until stage n-1n-1. Patients can be classified in K+1K+1 levels and we assume that the effect of this level in the response to the treatments is linear. We study the asymptotic behavior of the design when the ordinary least square estimators are used as a measure of performance until stage n-1n-1.  相似文献   

18.
Pairwise comparisons for proportions estimated by pooled testing   总被引:1,自引:0,他引:1  
When estimating the prevalence of a rare trait, pooled testing can confer substantial benefits when compared to individual testing. In addition to screening experiments for infectious diseases in humans, pooled testing has also been exploited in other applications such as drug testing, epidemiological studies involving animal disease, plant disease assessment, and screening for rare genetic mutations. Within a pooled-testing context, we consider situations wherein different strata or treatments are to be compared with the goals of assessing significant and practical differences between strata and ranking strata in terms of prevalence. To achieve these goals, we first present two simultaneous pairwise interval estimation procedures for use with pooled data. Our procedures rely on asymptotic results, so we investigate small-sample behavior and compare the two procedures in terms of simultaneous coverage probability and mean interval length. We then present a unified approach to determine pool sizes which deliver desired coverage properties while taking testing costs and interval precision into account. We illustrate our methods using data from an observational HIV study involving heterosexual males who use intravenous drugs.  相似文献   

19.
We consider the problem of selecting variables in factor analysis models. The $L_1$ regularization procedure is introduced to perform an automatic variable selection. In the factor analysis model, each variable is controlled by multiple factors when there are more than one underlying factor. We treat parameters corresponding to the multiple factors as grouped parameters, and then apply the group lasso. Furthermore, the weight of the group lasso penalty is modified to obtain appropriate estimates and improve the performance of variable selection. Crucial issues in this modeling procedure include the selection of the number of factors and a regularization parameter. Choosing these parameters can be viewed as a model selection and evaluation problem. We derive a model selection criterion for evaluating the factor analysis model via the weighted group lasso. Monte Carlo simulations are conducted to investigate the effectiveness of the proposed procedure. A real data example is also given to illustrate our procedure. The Canadian Journal of Statistics 40: 345–361; 2012 © 2012 Statistical Society of Canada  相似文献   

20.
The problem of constructing nonlinear regression models is investigated to analyze data with complex structure. We introduce radial basis functions with hyperparameter that adjusts the amount of overlapping basis functions and adopts the information of the input and response variables. By using the radial basis functions, we construct nonlinear regression models with help of the technique of regularization. Crucial issues in the model building process are the choices of a hyperparameter, the number of basis functions and a smoothing parameter. We present information-theoretic criteria for evaluating statistical models under model misspecification both for distributional and structural assumptions. We use real data examples and Monte Carlo simulations to investigate the properties of the proposed nonlinear regression modeling techniques. The simulation results show that our nonlinear modeling performs well in various situations, and clear improvements are obtained for the use of the hyperparameter in the basis functions.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号