共查询到20条相似文献,搜索用时 15 毫秒
1.
Confidence bands in nonparametric regression have been studied for a long time with data assumed to be generated from independent and identically distributed (iid) random variables. The methods and theoretical results for iid data, however, do not directly apply to data from stratified multistage samples. In this paper, we extend the confidence bands introduced by Zhang and Lu (2008) for iid case to complex surveys based on an entirely data-driven procedure; the proposed confidence bands incorporate both the sampling weights and the kernel weights. Simulation studies show that the proposed method works well. 相似文献
2.
Abstract. The focus of this article is on simultaneous confidence bands over a rectangular covariate region for a linear regression model with k>1 covariates, for which only conservative or approximate confidence bands are available in the statistical literature stretching back to Working & Hotelling (J. Amer. Statist. Assoc. 24 , 1929; 73–85). Formulas of simultaneous confidence levels of the hyperbolic and constant width bands are provided. These involve only a k‐dimensional integral; it is unlikely that the simultaneous confidence levels can be expressed as an integral of less than k‐dimension. These formulas allow the construction for the first time of exact hyperbolic and constant width confidence bands for at least a small k(>1) by using numerical quadrature. Comparison between the hyperbolic and constant width bands is then addressed under both the average width and minimum volume confidence set criteria. It is observed that the constant width band can be drastically less efficient than the hyperbolic band when k>1. Finally it is pointed out how the methods given in this article can be applied to more general regression models such as fixed‐effect or random‐effect generalized linear regression models. 相似文献
3.
Suppose that we have a nonparametric regression model Y = m(X) + ε with X ∈ Rp, where X is a random design variable and is observed completely, and Y is the response variable and some Y-values are missing at random. Based on the “complete” data sets for Y after nonaprametric regression imputation and inverse probability weighted imputation, two estimators of the regression function m(x0) for fixed x0 ∈ Rp are proposed. Asymptotic normality of two estimators is established, which is used to construct normal approximation-based confidence intervals for m(x0). We also construct an empirical likelihood (EL) statistic for m(x0) with limiting distribution of χ21, which is used to construct an EL confidence interval for m(x0). 相似文献
4.
Several variations of monotone nonparametric regression have been developed over the past 30 years. One approach is to first apply nonparametric regression to data and then monotone smooth the initial estimates to “iron out” violations to the assumed order. Here, such estimators are considered, where local polynomial regression is first used, followed by either least squares isotonic regression or a monotone method using simple averages. The primary focus of this work is to evaluate different types of confidence intervals for these monotone nonparametric regression estimators through Monte Carlo simulation. Most of the confidence intervals use bootstrap or jackknife procedures. Estimation of a response variable as a function of two continuous predictor variables is considered, where the estimation is performed at the observed values of the predictors (instead of on a grid). The methods are then applied to data involving subjects that worked at plants that use beryllium metal who have developed chronic beryllium disease. 相似文献
5.
In this article, we develop a new and novel kernel density estimator for a sum of weighted averages from a single population based on utilizing the well defined kernel density estimator in conjunction with classic inversion theory. This idea is further developed for a kernel density estimator for the difference of weighed averages from two independent populations. The resulting estimator is “bootstrap-like” in terms of its properties with respect to the derivation of approximate confidence intervals via a “plug-in” approach. This new approach is distinct from the bootstrap methodology in that it is analytically and computationally feasible to provide an exact estimate of the distribution function through direct calculation. Thus, our approach eliminates the error due to Monte Carlo resampling that arises within the context of simulation based approaches that are oftentimes necessary in order to derive bootstrap-based confidence intervals for statistics involving weighted averages of i.i.d. random variables. We provide several examples and carry forth a simulation study to show that our kernel density estimator performs better than the standard central limit theorem based approximation in term of coverage probability. 相似文献
6.
N. S. Altman 《The American statistician》2013,67(3):175-185
7.
P. Ah-Kine 《统计学通讯:理论与方法》2013,42(3):441-452
A simultaneous confidence band provides useful information on the plausible range of an unknown regression model function, just as a confidence interval gives the plausible range of an unknown parameter. For a multiple linear regression model, confidence bands of different shapes, such as the hyperbolic band and the constant width band, can be constructed and the predictor variable region over which a confidence band is constructed can take various forms. One interesting but unsolved problem is to find the optimal (shape) confidence band over an ellipsoidal region χE under the Minimum Volume Confidence Set (MVCS) criterion of Liu and Hayter (2007) and Liu et al. (2009). This problem is challenging as it involves optimization over an unknown function that determines the shape of the confidence band over χE. As a step towards solving this difficult problem, in this paper, we introduce a family of confidence bands over χE, called the inner-hyperbolic bands, which includes the hyperbolic and constant-width bands as special cases. We then search for the optimal confidence band within this family under the MVCS criterion. The conclusion from this study is that the hyperbolic band is not optimal even within this family of inner-hyperbolic bands and so cannot be the overall optimal band. On the other hand, the constant width band can be optimal within the family of inner-hyperbolic bands when the region χE is small and so might be the overall optimal band. 相似文献
8.
This article addresses the problem of confidence band construction for a standard multiple linear regression model. An “independence point” method of construction is developed which generalizes the method of Gafarian (1964) for a simple linear regression model to a multiple linear regression model. Wynn (1984) pioneered the approach of basing confidence bands for a polynomial regression on a set of nodes where the function estimates are independent, and this approach is exploited in this article. This method requires only critical points from t-distributions so that the confidence bands are easy to construct. Both one-sided and two-sided confidence bands can be constructed using this method. An illustration of the new method is provided, and comparisons are made with other procedures. 相似文献
9.
Principal components regression (PCR) is used in resolving the multicollinearity problem but specification bias occurs due to the selection only of the important principal components to be included resulting in the deterioration of predictive ability of the model. We propose the PCR in a nonparametric framework to address the multicollinearity problem while minimizing the specification bias that affects predictive ability of the model. The simulation study illustrated that nonparametric PCR addresses the multicollinearity problem while retaining higher predictive ability relative to parametric principal components regression model. 相似文献
10.
Walter W. Hauck 《The American statistician》2013,67(2):158-160
A problem in logit analysis is the interval estimation of the logistic response curve. Scheffé's method is used to obtain confidence bands for the logistic response function for any number of explanatory variables. This method is computationally easier and more general than a previously reported method. 相似文献
11.
Nicolas W. Hengartner Marten H. Wegkamp Eric Matzner-Løber 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2002,64(4):791-804
Summary. The paper presents a general strategy for selecting the bandwidth of nonparametric regression estimators and specializes it to local linear regression smoothers. The procedure requires the sample to be divided into a training sample and a testing sample. Using the training sample we first compute a family of regression smoothers indexed by their bandwidths. Next we select the bandwidth by minimizing the empirical quadratic prediction error on the testing sample. The resulting bandwidth satisfies a finite sample oracle inequality which holds for all bounded regression functions. This permits asymptotically optimal estimation for nearly any regression function. The practical performance of the method is illustrated by a simulation study which shows good finite sample behaviour of our method compared with other bandwidth selection procedures. 相似文献
12.
John D. Spurrier 《统计学通讯:理论与方法》2013,42(9):969-973
Exact uniform confidence bands are presented for a sinusoidal regression model with known period. A table of probability points is presented for an important special case. A comparsion is made with the Working - Hotelling - Scheffé confidence bands. 相似文献
13.
Nonparametric regression can be considered as a problem of model choice. In this article, we present the results of a simulation study in which several nonparametric regression techniques including wavelets and kernel methods are compared with respect to their behavior on different test beds. We also include the taut-string method whose aim is not to minimize the distance of an estimator to some “true” generating function f but to provide a simple adequate approximation to the data. Test beds are situations where a “true” generating f exists and in this situation it is possible to compare the estimates of f with f itself. The measures of performance we use are the L2- and the L∞-norms and the ability to identify peaks. 相似文献
14.
Guaranteed Local Maximum Likelihood Detection of a Change Point in Nonparametric Logistic Regression
We consider nonparametric logistic regression and propose a generalized likelihood test for detecting a threshold effect that indicates a relationship between some risk factor and a defined outcome above the threshold but none below it. One important field of application is occupational medicine and in particular, epidemiological studies. In epidemiological studies, segmented fully parametric logistic regression models are often threshold models, where it is assumed that the exposure has no influence on a response up to a possible unknown threshold, and has an effect beyond that threshold. Finding efficient methods for detection and estimation of a threshold is a very important task in these studies. This article proposes such methods in a context of nonparametric logistic regression. We use a local version of unknown likelihood functions and show that under rather common assumptions the asymptotic power of our test is one. We present a guaranteed non asymptotic upper bound for the significance level of the proposed test. If applying the test yields the acceptance of the conclusion that there was a change point (and hence a threshold limit value), we suggest using the local maximum likelihood estimator of the change point and consider the asymptotic properties of this estimator. 相似文献
15.
Srinjoy Das 《The American statistician》2020,74(3):233-242
AbstractNonparametric regression is a standard statistical tool with increased importance in the Big Data era. Boundary points pose additional difficulties but local polynomial regression can be used to alleviate them. Local linear regression, for example, is easy to implement and performs quite well both at interior and boundary points. Estimating the conditional distribution function and/or the quantile function at a given regressor point is immediate via standard kernel methods but problems ensue if local linear methods are to be used. In particular, the distribution function estimator is not guaranteed to be monotone increasing, and the quantile curves can “cross.” In the article at hand, a simple method of correcting the local linear distribution estimator for monotonicity is proposed, and its good performance is demonstrated via simulations and real data examples. Supplementary materials for this article are available online. 相似文献
16.
Y. Xia 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1998,60(4):797-811
Bias-corrected confidence bands for general nonparametric regression models are considered. We use local polynomial fitting to construct the confidence bands and combine the cross-validation method and the plug-in method to select the bandwidths. Related asymptotic results are obtained. Our simulations show that confidence bands constructed by local polynomial fitting have much better coverage than those constructed by using the Nadaraya–Watson estimator. The results are also applicable to nonparametric autoregressive time series models. 相似文献
17.
This paper addresses the problem of confidence band construction for a standard multiple linear regression model. A “ray” method of construction is developed which generalizes the method of Graybill and Bowden [1967. Linear segment confidence bands for simple linear regression models. J. Amer. Statist. Assoc. 62, 403–408] for a simple linear regression model to a multiple linear regression model. By choosing suitable directions for the rays this method requires only critical points from t-distributions so that the confidence bands are easy to construct. Both one-sided and two-sided confidence bands can be constructed using this method. An illustration of the new method is provided. 相似文献
18.
Motivated by covariate-adjusted regression (CAR) proposed by Sentürk and Müller (2005) and an application problem, in this article we introduce and investigate a covariate-adjusted partially linear regression model (CAPLM), in which both response and predictor vector can only be observed after being distorted by some multiplicative factors, and an additional variable such as age or period is taken into account. Although our model seems to be a special case of covariate-adjusted varying coefficient model (CAVCM) given by Sentürk (2006), the data types of CAPLM and CAVCM are basically different and then the methods for inferring the two models are different. In this article, the estimate method motivated by Cui et al. (2008) is employed to infer the new model. Furthermore, under some mild conditions, the asymptotic normality of estimator for the parametric component is obtained. Combined with the consistent estimate of asymptotic covariance, we obtain confidence intervals for the regression coefficients. Also, some simulations and a real data analysis are made to illustrate the new model and methods. 相似文献
19.
20.
Na Li 《统计学通讯:理论与方法》2013,42(22):3972-3987
In this article, procedures are proposed to test the hypothesis of equality of two or more regression functions. Tests are proposed by p-values, first under homoscedastic regression model, which are derived using fiducial method based on cubic spline interpolation. Then, we construct a test in the heteroscedastic case based on Fisher's method of combining independent tests. We study the behaviors of the tests by simulation experiments, in which comparisons with other tests are also given. The proposed tests have good performances. Finally, an application to a data set are given to illustrate the usefulness of the proposed test in practice. 相似文献