首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到19条相似文献,搜索用时 885 毫秒
1.
文章在响应变量随机缺失下,基于分位数回归研究了半参数模型的稳健估计问题。首先基于B样条基函数近似技术,将模型非参数函数的估计问题转化为样条系数向量估计问题;其次,在响应变量随机缺失下,提出了一种新的插补方法,对缺失的响应变量进行多重插补;再次,基于插补后的数据集,构造出新的分位数目标函数,得到模型非参数函数以及参数向量的稳健估计;最后给出了有效算法计算多重插补估计量。通过模拟研究验证了所提方法的有效性和稳健性。  相似文献   

2.
文章研究了半参数变系数EV模型在线性约束条件下的估计和检验问题,当响应变量缺失、非参数部分协变量带有测量误差时,利用局部纠偏的Profile最小二乘估计、Lagrange乘子方法和借补技术构造了回归模型参数分量两类纠偏约束估计量。此外,为了检验线性约束条件,构造了借补的Profile Lagrange乘子检验统计量,并通过蒙特卡洛数值模拟验证估计量和检验统计量的有效性。  相似文献   

3.
在基于抽样调查数据对总体参数进行估计的方法中,小域估计方法能够借助于辅助信息对小样本乃至无样本区域的参数进行有效的估计,并被广泛应用于抽样估计领域。单元水平模型作为小域估计的基本模型之一,是处理单元级别数据估计的有力工具之一。在单元水平模型的应用条件中,需假定区域随机误差和模型随机误差均服从正态分布。然而,在抽样调查中,满足这一条件的调查数据是很少的,尤其是在观测数据中出现离群值时。不满足正态性假设条件下的小域估计量会产生较大的偏差和均方误,因此有必要研究针对正态性假设和离群观测值不敏感的稳健估计方法。通过引入γ散度和γ似然函数,构建了基于单元水平模型的小域稳健估计方法,得到了模型参数的稳健估计和小域目标变量的稳健估计。与现有的稳健估计方法相比,所提新方法能更好地处理区域随机误差和模型随机误差非正态的情形,对于目标变量存在离群观测的情形,具有更好的稳健性,估计均方误更小。在利用模拟数据进行验证中,比较了不同误差分布情形下几类常用估计方法得到的估计量的均方误差,并进一步探究了随着污染分布的方差和比率变化,所得估计量的均方误差变化情形。最后,通过应用于经典的小域估计数据,进一步验证了所提新...  相似文献   

4.
邰凌楠等 《统计研究》2018,35(9):115-128
数据缺失问题普遍存在于应用研究中。在随机缺失机制假定下,本文从模型推断角度出发,针对线性缺失分位回归模型,提出一种新的有效估计方法——逆概率多重加权(IPMW)估计。该方法是在逆概率加权(IPW)估计的基础上,结合倾向得分匹配及模型平均思想,经过多次估计,加权确定最终参数估计结果。该方法适用于响应变量是独立同分布或独立非同分布的情形,并适用于绝大多数缺失场景。经过理论推导及模拟研究发现,IPMW估计量在继承IPW估计量的优势上具有更稳健的性质。最后,将该方法应用于含有缺失数据的微观调查数据中,研究了经济较发达的准一线城市中等收入群体消费水平的影响因素,对比两种估计方法的估计结果及置信带,发现逆概率多重加权估计量的标准偏差更小,估计结果更稳健。  相似文献   

5.
文章研究了仅自变量可重复观测的变系数线性结构型EV模型,利用重复观测数据和局部加权的方法,构造了模型参数的估计量,并证明了在一定条件下估计量是相合估计。  相似文献   

6.
文章讨论响应变量和部分协变量含测量误差的重复测量数据的建模和估计问题,获得参数极大似然估计的EM迭代算法以及估计量的渐近协方差矩阵,并利用Monte-Carlo模拟说明估计的有效性和模型的价值.最后,将研究理论用于处理气象数据的测量误差校正问题.  相似文献   

7.
缺失值是调查中普遍存在的问题,对缺失值进行插补是处理缺失值的较好方法.如果变量之间存在相关关系,可以通过正态线形模型利用不存在缺失值的变量对有存在缺失值的变量进行插补.较之单一插补,多重插补更能有效地估计总体方差,因此更多地被使用.文章借助Bootstrap法,让模型的参数和残差来自完全观测的Bootstrap样本的最小平法估计,可进一步准确估计总体方差.通过大量模拟试验,发现Bootstrap多重插补较之单一插补和一般多重插补能构建更宽的置信区间从而有更准确的总体参数覆盖率,这点在数据缺失比重很大时优势更明显.  相似文献   

8.
毕画  伍业锋 《统计研究》2017,(9):120-128
在超总体模型中,一般用于构建模型的辅助变量多为连续型变量,对混合类型辅助变量的模型研究较少.为了同时利用与研究变量相关的连续型和离散型辅助变量的信息,本文提出在模型校准的框架下,利用非参数核回归方法,得到混合类型辅助变量下的模型校准估计量.研究证明,该估计量是渐进设计无偏、设计一致和渐进正态的,并给出了估计量的方差和方差的估计量.数值模拟的结果显示,本文在总体回归函数为线性和非线性的情况下,估计效果均有所提高.此外,通过CLHLS数据的验证也表明该估计量的效果优于仅利用连续型辅助变量的估计量.  相似文献   

9.
王芝皓等 《统计研究》2021,38(7):127-139
在实际数据分析中经常会遇到零膨胀计数数据作为响应变量与函数型随机变量和随机向量作为预测变量相关联。本文考虑函数型部分变系数零膨胀模型 (FPVCZIM),模型中无穷维的斜率函数用函数型主成分基逼近,系数函数用B-样条进行拟合。通过EM 算法得到估计量,讨论其理论性质,在一些正则条件下获得了斜率函数和系数函数估计量的收敛速度。有限样本的Monte Carlo 模拟研究和真实数据分析被用来解释本文提出的方法。  相似文献   

10.
普通最小二乘法的几何分析   总被引:2,自引:0,他引:2  
普通最小二乘估计法是在目标函数残差平方和的达到最小的条件下求得参数估计量,从向量的角度来说,普通最小二乘法将被解释变量分解成了相互正交的两部分,通过空间向量理论和几何分析方法,可以在欧氏空间内对普通最小二乘估计量进行求解,这种分析过程使普通最小二乘法变得更直观。  相似文献   

11.
In this paper, we introduce a fresh methodology for imputing missing values by making use of sensible constraints on both a study variable and auxiliary variables that are correlated with the variable of interest. The resultant estimator based on these imputed values is shown to lead to the regression type method of imputation in survey sampling. Furthermore, when the data are hybrid of both that missing at random and missing complexly at random, the resultant estimator is shown to be a consistent estimator that has asymptotic mean squared error equal to that of the linear regression method of imputation. A generalization to any type of method of imputation is possible and has been included at the end.  相似文献   

12.
Abstract.  Theory on semi-parametric efficient estimation in missing data problems has been systematically developed by Robins and his coauthors. Except in relatively simple problems, semi-parametric efficient scores cannot be expressed in closed forms. Instead, the efficient scores are often expressed as solutions to integral equations. Neumann series was proposed in the form of successive approximation to the efficient scores in those situations. Statistical properties of the estimator based on the Neumann series approximation are difficult to obtain and as a result, have not been clearly studied. In this paper, we reformulate the successive approximation in a simple iterative form and study the statistical properties of the estimator based on the reformulation. We show that a doubly robust locally efficient estimator can be obtained following the algorithm in robustifying the likelihood score. The results can be applied to, among others, parametric regression, marginal regression and Cox regression when data are subject to missing values and the data are missing at random. A simulation study is conducted to evaluate the performance of the approach and a real data example is analysed to demonstrate the use of the approach.  相似文献   

13.
This paper considers linear and nonlinear regression with a response variable that is allowed to be “missing at random”. The only structural assumptions on the distribution of the variables are that the errors have mean zero and are independent of the covariates. The independence assumption is important. It enables us to construct an estimator for the response density that uses all the observed data, in contrast to the usual local smoothing techniques, and which therefore permits a faster rate of convergence. The idea is to write the response density as a convolution integral which can be estimated by an empirical version, with a weighted residual-based kernel estimator plugged in for the error density. For an appropriate class of regression functions, and a suitably chosen bandwidth, this estimator is consistent and converges with the optimal parametric rate n1/2. Moreover, the estimator is proved to be efficient (in the sense of Hájek and Le Cam) if an efficient estimator is used for the regression parameter.  相似文献   

14.
We consider the estimation of a regression coefficient in a linear regression when observations are missing due to nonresponse. Response is assumed to be determined by a nonobservable variable which is linearly related to an observable variable. The values of the observable variable are assumed to be available for the whole sample but the variable is not includsd in the regression relationship of interest . Several alternative estimators have been proposed for this situation under various simplifying assumptions. A sampling theory approach provides three alternative estimatrs by considering the observatins as obtained from a sub-sample, selected on the basis of the fully observable variable , as formulated by Nathan and Holt (1980). Under an econometric approach, Heckman (1979) proposed a two-stage (probit and OLS) estimator which is consistent under specificconditions. A simulation comparison of the four estimators and the ordinary least squares estimator , under multivariate normality of all the variables involved, indicates that the econometric approach estimator is not robust to departures from the conditions underlying its derivation, while two of the other estimators exhibit a similar degree of stable performance over a wide range of conditions. Simulations for a non-normal distribution show that gains in performance can be obtained if observations on the independent variable are available for the whole population.  相似文献   

15.
In this paper, we propose an empirical likelihood-based weighted estimator of regression parameter in quantile regression model with non ignorable missing covariates. The proposed estimator is computationally simple and achieves semiparametric efficiency if the probability of missingness on the fully observed variables is correctly specified. The efficiency gain of the proposed estimator over the complete-case-analysis estimator is quantified theoretically and illustrated via simulation and a real data application.  相似文献   

16.
Consider estimation of a population mean of a response variable when the observations are missing at random with respect to the covariate. Two common approaches to imputing the missing values are the nonparametric regression weighting method and the Horvitz-Thompson (HT) inverse weighting approach. The regression approach includes the kernel regression imputation and the nearest neighbor imputation. The HT approach, employing inverse kernel-estimated weights, includes the basic estimator, the ratio estimator and the estimator using inverse kernel-weighted residuals. Asymptotic normality of the nearest neighbor imputation estimators is derived and compared to kernel regression imputation estimator under standard regularity conditions of the regression function and the missing pattern function. A comprehensive simulation study shows that the basic HT estimator is most sensitive to discontinuity in the missing data patterns, and the nearest neighbors estimators can be insensitive to missing data patterns unbalanced with respect to the distribution of the covariate. Empirical studies show that the nearest neighbor imputation method is most effective among these imputation methods for estimating a finite population mean and for classifying the species of the iris flower data.  相似文献   

17.
This paper discusses the estimation of regression parameters after summarizing the data by a covariance matrix of the concatenated vector of explanatory variables and response variable. A robust estimate of the covariance matrix leads to a robust regression estimator. An M-estimator at the covariance estimation step is studied in the paper, and the resulting regression estimator is compared to a few previously proposed robust regression estimators.  相似文献   

18.
This paper addresses the problem of detecting a mixture of parallel regression lines when information about group member¬ship of individual cases is not given. The problem is approached as a missing variable problem, with the missing variables being the dummy variables that code for groups. If a mixture of par¬allel regression lines with normally distributed error terms is present, a simple regression model without dummy variables will produce residuals that follow approximately a mixed normal dis¬tribution. In a simulation studyr several goodness-of-fit tests of normality were used to test the residuals obtained from mis-specified models that excluded dummy variables, Factors varied in the simulation included the number and the separation of the parallel lines and the sample size, The goodness-of-fit test based on the sample kurtosis (82) was overall most powerful in detecting mixtures of parallel regression lines, Applications are discussed.  相似文献   

19.
In survival studies, current status data are frequently encountered when some individuals in a study are not successively observed. This paper considers the problem of simultaneous variable selection and parameter estimation in the high-dimensional continuous generalized linear model with current status data. We apply the penalized likelihood procedure with the smoothly clipped absolute deviation penalty to select significant variables and estimate the corresponding regression coefficients. With a proper choice of tuning parameters, the resulting estimator is shown to be a root n/pn-consistent estimator under some mild conditions. In addition, we show that the resulting estimator has the same asymptotic distribution as the estimator obtained when the true model is known. The finite sample behavior of the proposed estimator is evaluated through simulation studies and a real example.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号