Similar Articles
20 similar articles were found.
1.
Abstract. Testing for parametric structure is an important issue in non‐parametric regression analysis. A standard approach is to measure the distance between a parametric and a non‐parametric fit with a squared deviation measure. These tests inherit the curse of dimensionality from the non‐parametric estimator. This results in a loss of power in finite samples and against local alternatives. This article proposes to circumvent the curse of dimensionality by projecting the residuals under the null hypothesis onto the space of additive functions. To estimate this projection, the smooth backfitting estimator is used. The asymptotic behaviour of the test statistic is derived and the consistency of a wild bootstrap procedure is established. The finite sample properties are investigated in a simulation study.  相似文献   
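As a rough illustration of the wild-bootstrap calibration of a squared-deviation specification test (not the additive-projection statistic with smooth backfitting proposed in the article), the Python sketch below tests a linear null model against a Nadaraya-Watson fit; the kernel, the bandwidth and the Rademacher multipliers are illustrative choices.

import numpy as np

def nw_fit(x, y, grid, h):
    # Nadaraya-Watson estimate of E[Y|X] at the points in `grid` (Gaussian kernel).
    k = np.exp(-0.5 * ((grid[:, None] - x[None, :]) / h) ** 2)
    return (k @ y) / k.sum(axis=1)

def spec_test(x, y, h=0.3, B=499, rng=np.random.default_rng(0)):
    # Null model: linear regression; alternative: nonparametric fit.
    X = np.column_stack([np.ones_like(x), x])
    beta = np.linalg.lstsq(X, y, rcond=None)[0]
    fit0 = X @ beta
    resid = y - fit0
    stat = np.mean((nw_fit(x, y, x, h) - fit0) ** 2)   # squared-deviation statistic
    stats_b = np.empty(B)
    for b in range(B):
        v = rng.choice([-1.0, 1.0], size=len(y))        # Rademacher wild-bootstrap multipliers
        yb = fit0 + resid * v                           # resample under the null hypothesis
        beta_b = np.linalg.lstsq(X, yb, rcond=None)[0]
        fit0_b = X @ beta_b
        stats_b[b] = np.mean((nw_fit(x, yb, x, h) - fit0_b) ** 2)
    return stat, np.mean(stats_b >= stat)               # statistic and bootstrap p-value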

2.
To reduce nonresponse bias in sample surveys, a method of nonresponse weighting adjustment is often used which consists of multiplying the sampling weight of the respondent by the inverse of the estimated response probability. The authors examine the asymptotic properties of this estimator. They prove that it is generally more efficient than an estimator which uses the true response probability, provided that the parameters which govern this probability are estimated by maximum likelihood. The authors discuss variance estimation methods that account for the effect of using the estimated response probability; they compare their performances in a small simulation study. They also discuss extensions to the regression estimator.  相似文献   
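A minimal sketch of the nonresponse-weighting adjustment described above, with the response probability estimated by maximum-likelihood logistic regression; the covariates, the logistic working model and the Hajek form of the weighted mean are illustrative assumptions, and the variance estimators discussed in the article are not shown.

import numpy as np
import statsmodels.api as sm

def ipw_adjusted_mean(y, X, respond, sampling_w):
    # respond: 1 if the unit answered, 0 otherwise; y is observed only where respond == 1.
    # Maximum-likelihood logistic regression for the response probabilities.
    Xc = sm.add_constant(X)
    phat = sm.Logit(respond, Xc).fit(disp=0).predict(Xc)
    r = respond == 1
    w = sampling_w[r] / phat[r]          # sampling weight times inverse estimated response probability
    return np.sum(w * y[r]) / np.sum(w)  # Hajek-type weighted mean over respondents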

3.
Various nonparametric and parametric estimators of extremal dependence have been proposed in the literature. Nonparametric methods commonly suffer from the curse of dimensionality and have been mostly implemented in extreme-value studies up to three dimensions, whereas parametric models can tackle higher-dimensional settings. In this paper, we assess, through a vast and systematic simulation study, the performance of classical and recently proposed estimators in multivariate settings. In particular, we first investigate the performance of nonparametric methods and then compare them with classical parametric approaches under symmetric and asymmetric dependence structures within the commonly used logistic family. We also explore two different ways to make nonparametric estimators satisfy the necessary dependence function shape constraints, finding a general improvement in estimator performance either (i) by substituting the estimator with its greatest convex minorant, developing a computational tool to implement this method for dimensions \(D\ge 2\) or (ii) by projecting the estimator onto a subspace of dependence functions satisfying such constraints and taking advantage of Bernstein–Bézier polynomials. Implementing the convex minorant method leads to better estimator performance as the dimensionality increases.  相似文献   

4.
A nonparametric approach is considered for estimating a probability density function (pdf) supported on (0, ∞): the inverse gamma kernel. We show that it has the same properties as the gamma, reciprocal inverse Gaussian, and inverse Gaussian kernels, in that it is free of boundary bias, nonnegative, and achieves the optimal rate of convergence for the mean integrated squared error. Further properties of the estimator, such as its bias and variance, are also established, and bandwidth selection methods for inverse gamma kernel estimation of the pdf are compared.
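The sketch below illustrates an inverse-gamma-kernel density estimate in Python under one convenient parameterization (shape 1/b + 1, scale x/b, so that the kernel mean equals the evaluation point x); the exact parameterization used in the article may differ, and no bandwidth-selection refinements are included.

import numpy as np
from scipy.stats import invgamma

def inverse_gamma_kde(data, grid, b):
    # Asymmetric-kernel density estimate on (0, inf): average, over the sample,
    # of inverse-gamma densities evaluated at the data points.  The kernel at
    # evaluation point x uses shape 1/b + 1 and scale x/b, so its mean equals x.
    dens = np.empty_like(grid, dtype=float)
    for j, x in enumerate(grid):
        dens[j] = invgamma.pdf(data, a=1.0 / b + 1.0, scale=x / b).mean()
    return dens

# Illustrative use on positive data.
rng = np.random.default_rng(1)
sample = rng.gamma(shape=2.0, scale=1.5, size=500)
xs = np.linspace(0.05, 10.0, 200)
fhat = inverse_gamma_kde(sample, xs, b=0.2)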

5.
Using a comprehensive simulation study based on empirical data, this article investigates the finite sample properties of different classes of parametric and semiparametric estimators of (natural) direct and indirect causal effects used in mediation analysis under sequential conditional independence assumptions. The estimators are based on regression, inverse probability weighting, and combinations thereof. Our simulation design uses a large population of Swiss jobseekers and considers variations of several features of the data-generating process (DGP) and the implementation of the estimators that are of practical relevance. We find that no estimator performs uniformly best (in terms of root mean squared error) in all simulations. Overall, so-called “g-computation” dominates. However, differences between estimators are often (but not always) minor in the various setups and the relative performance of the methods often (but not always) varies with the features of the DGP.  相似文献   
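For orientation, a toy sketch of parametric g-computation for natural direct and indirect effects under linear mediator and outcome models with no treatment-mediator interaction, in which case the estimates reduce to the familiar product-of-coefficients decomposition; the variable layout is illustrative, and the semiparametric and weighting-based estimators compared in the article are not reproduced.

import numpy as np
import statsmodels.api as sm

def mediation_gcomp(y, t, m, X):
    # Mediator model: M ~ T + X ; outcome model: Y ~ T + M + X (no T*M interaction).
    Zm = sm.add_constant(np.column_stack([t, X]))
    Zy = sm.add_constant(np.column_stack([t, m, X]))
    alpha = sm.OLS(m, Zm).fit().params   # [const, T, X...]
    beta = sm.OLS(y, Zy).fit().params    # [const, T, M, X...]
    nde = beta[1]                # natural direct effect: outcome coefficient on T
    nie = alpha[1] * beta[2]     # natural indirect effect: (T -> M) times (M -> Y)
    return nde, nie, nde + nie   # total effect decomposes additively here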

6.
To estimate parameters defined by estimating equations with covariates missing at random, we consider three bias-corrected nonparametric approaches based on inverse probability weighting, regression and augmented inverse probability weighting. However, when the dimension of covariates is not low, the estimation efficiency will be affected due to the curse of dimensionality. To address this issue, we propose a two-stage estimation procedure by using the dimension-reduced kernel estimation in conjunction with bias-corrected estimating equations. We show that the resulting three estimators are asymptotically equivalent and achieve the desirable properties. The impact of dimension reduction in nonparametric estimation of parameters is also investigated. The finite-sample performance of the proposed estimators is studied through simulation, and an application to an automobile data set is also presented.  相似文献   

7.
Consider estimation of a population mean of a response variable when the observations are missing at random with respect to the covariate. Two common approaches to imputing the missing values are the nonparametric regression weighting method and the Horvitz-Thompson (HT) inverse weighting approach. The regression approach includes kernel regression imputation and nearest neighbor imputation. The HT approach, employing inverse kernel-estimated weights, includes the basic estimator, the ratio estimator and the estimator using inverse kernel-weighted residuals. Asymptotic normality of the nearest neighbor imputation estimators is derived and compared with that of the kernel regression imputation estimator under standard regularity conditions on the regression function and the missingness pattern function. A comprehensive simulation study shows that the basic HT estimator is the most sensitive to discontinuity in the missing data patterns, while the nearest neighbor estimators can be insensitive to missing data patterns that are unbalanced with respect to the distribution of the covariate. Empirical studies show that the nearest neighbor imputation method is the most effective of these imputation methods for estimating a finite population mean and for classifying the species in the iris flower data.
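A compact numpy sketch of the HT-type estimators with an inverse kernel-estimated missingness probability; only the basic and ratio versions are shown, the Gaussian kernel and fixed bandwidth are illustrative, and the nearest neighbor and residual-based variants are omitted.

import numpy as np

def kernel_response_prob(x, delta, h):
    # Nadaraya-Watson estimate of P(delta = 1 | X = x_i) at every sample point.
    k = np.exp(-0.5 * ((x[:, None] - x[None, :]) / h) ** 2)
    return (k @ delta) / k.sum(axis=1)

def ht_mean(y, x, delta, h=0.3):
    # delta: 1 if y is observed, 0 if missing; y is arbitrary (e.g. 0) where delta == 0.
    phat = kernel_response_prob(x, delta, h)
    w = delta / phat
    basic = np.mean(w * y)             # basic HT estimator
    ratio = np.sum(w * y) / np.sum(w)  # ratio (Hajek) version, usually more stable
    return basic, ratio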

8.
Mixtures of factor analyzers are a useful model-based clustering method that can avoid the curse of dimensionality in high-dimensional clustering. However, the approach is sensitive both to diverse non-normalities of the marginal variables and to outliers, which are commonly observed in multivariate experiments. We propose mixtures of Gaussian copula factor analyzers (MGCFA) for clustering high-dimensional data. The model has two advantages: (1) it allows different marginal distributions, giving the mixture model greater fitting flexibility, and (2) it avoids the curse of dimensionality by embedding the factor-analytic structure in the component correlation matrices of the mixture distribution. An EM algorithm is developed for fitting MGCFA. The proposed method is free of the curse of dimensionality and allows whichever parametric marginal distribution fits the data best. Applied to both synthetic data and a microarray gene expression data set for clustering, it shows better performance than several existing methods.

9.
To address parametric misspecification in tests of the Kuznets "inverted-U hypothesis", the nonlinear effect of economic growth on inequality is examined nonparametrically while the control variables enter through a local-linear specification, yielding a semiparametric model that can be used to test the inverted-U hypothesis. With this semiparametric model, the inverted-U hypothesis relating economic growth and income inequality is re-examined for urban and rural residents in China, and the results are compared with those from a parametric model. The empirical results show that the relationship between the urban Gini coefficient and per capita GDP follows an approximately inverted-U-shaped curve and is still on its rising branch, whereas the relationship between the rural Gini coefficient and per capita GDP exhibits a weak U shape, rejecting the inverted-U hypothesis.

10.
Mean survival time is often of inherent interest in medical and epidemiologic studies. In the presence of censoring and when covariate effects are of interest, Cox regression is the strong default, largely due to convenience and familiarity. When survival times are uncensored, covariate effects can be estimated as differences in mean survival through linear regression. Tobit regression can validly be performed through maximum likelihood when the censoring times are fixed (i.e., known for each subject, even in cases where the outcome is observed). However, Tobit regression is generally inapplicable when the response is subject to random right censoring. We propose Tobit regression methods based on weighted maximum likelihood which are applicable to survival times subject to both fixed and random censoring times. Under the proposed approach, known right censoring is handled naturally through the Tobit model, with inverse probability of censoring weighting used to overcome random censoring. Essentially, the re-weighted data are intended to represent those that would have been observed in the absence of random censoring. We develop methods for estimating the Tobit regression parameter, and then the population mean survival time. A closed-form large-sample variance estimator is proposed for the regression parameter estimator, with a semiparametric bootstrap standard error estimator derived for the population mean. The proposed methods are easily implementable using standard software. Finite-sample properties are assessed through simulation. The methods are applied to a large cohort of patients wait-listed for kidney transplantation.
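A rough sketch of a weighted Tobit fit: the standard Tobit log-likelihood with a per-observation weight (for example, an inverse probability of censoring weight for the randomly censored cases), maximized numerically; the column layout, starting values and optimizer are illustrative, and the article's exact weighting scheme and variance estimators are not reproduced.

import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def weighted_tobit(y, X, event, w):
    # y: observed time (or the known right-censoring point when event == 0)
    # event: 1 if y is uncensored, 0 if right-censored at y
    # w: per-observation weights (e.g. inverse probability of random censoring)
    X1 = np.column_stack([np.ones(len(y)), X])

    def negloglik(theta):
        beta, log_sigma = theta[:-1], theta[-1]
        sigma = np.exp(log_sigma)
        z = (y - X1 @ beta) / sigma
        ll_unc = norm.logpdf(z) - log_sigma      # density contribution, uncensored
        ll_cen = norm.logsf(z)                   # survival contribution, right-censored
        return -np.sum(w * np.where(event == 1, ll_unc, ll_cen))

    theta0 = np.zeros(X1.shape[1] + 1)
    res = minimize(negloglik, theta0, method="BFGS")
    return res.x[:-1], np.exp(res.x[-1])         # regression coefficients and scale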

11.
We propose a new weighting (WT) method to handle missing categorical outcomes in longitudinal data analysis using generalized estimating equations (GEE). The proposed WT provides a valid GEE estimator when the data are missing at random (MAR), has more stable weights, and shows an efficiency advantage over the inverse probability weighting method in the presence of small observation probabilities. The WT estimator is similar to the stabilized weighting (SWT) estimator under mild conditions, but it is more stable and efficient than SWT when the associations of the outcome with the observation probabilities and the covariate are strong.

12.
Although the response model has been frequently applied to nonresponse weighting adjustment and imputation, estimation under callbacks has remained relatively underdeveloped within this framework. We propose an estimator under callbacks that uses both the response probability and ratio imputation, together with a replication variance estimator for it. We also study the estimation of the response probability. A simulation study illustrates the technique.

13.
How to draw valid statistical inferences from volunteer web-access panel surveys is a serious challenge facing web surveys in the big-data era. To address this problem, we propose combining the web-access panel sample with a probability sample and constructing pseudo-weights, via inverse propensity-score weighting and weighting-class adjustment, to estimate the target population. Variances are then estimated with the Vwr method based on with-replacement probability sampling, the Vgreg method based on generalized regression estimation, and the jackknife method, and the performance of the different estimators is compared. The results show that, whether the probability sample is large or small, the proposed population-mean estimators perform well, and among the variance estimators the jackknife performs best.
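One common pseudo-weighting recipe of this kind, sketched under strong simplifications: stack the web sample with the reference probability sample, fit a design-weighted logistic propensity model for web-sample membership, and weight web respondents by the inverse of the estimated propensity. The function and variable names are illustrative, and the weighting-class adjustment and the Vwr, Vgreg and jackknife variance estimators studied in the article are not shown.

import numpy as np
import statsmodels.api as sm

def pseudo_weights(X_web, X_ref, ref_weights):
    # X_web, X_ref: covariate matrices for the web sample and the reference probability sample.
    # Stack the two samples (z = 1 for web units), fit a weighted logistic regression for
    # membership in the web sample, and weight each web respondent by 1 / estimated propensity.
    X = sm.add_constant(np.vstack([X_web, X_ref]))
    z = np.concatenate([np.ones(len(X_web)), np.zeros(len(X_ref))])
    w = np.concatenate([np.ones(len(X_web)), ref_weights])   # reference units carry design weights
    fit = sm.GLM(z, X, family=sm.families.Binomial(), freq_weights=w).fit()
    p_web = fit.predict(sm.add_constant(X_web, has_constant="add"))
    return 1.0 / p_web                                        # pseudo-weights for web units

def pseudo_weighted_mean(y_web, pw):
    return np.sum(pw * y_web) / np.sum(pw)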

14.
In this article, we present a new empirical likelihood ratio for constructing the confidence interval of the response mean of generalized linear models with responses missing at random. Compared with existing methods, the proposal avoids the so-called "curse of dimensionality" when the dimension of the covariates is high, while remaining asymptotically chi-squared distributed. Simulation studies illustrate the performance of the developed method.
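For reference, a compact sketch of Owen-style empirical likelihood for a scalar mean, with the chi-squared(1) calibration inverted over a grid to form a confidence interval; the article's ratio for the response mean under missing-at-random responses involves additional estimating functions not represented here.

import numpy as np
from scipy.optimize import brentq
from scipy.stats import chi2

def neg2_log_elr(x, mu):
    # -2 log empirical likelihood ratio for H0: E[X] = mu (Owen's construction).
    d = x - mu
    if d.min() >= 0 or d.max() <= 0:
        return np.inf                        # mu outside the convex hull of the data
    n = len(x)
    lo = (1.0 / n - 1.0) / d.max() + 1e-10   # lambda must keep all weights in (0, 1)
    hi = (1.0 / n - 1.0) / d.min() - 1e-10
    lam = brentq(lambda l: np.sum(d / (1.0 + l * d)), lo, hi)
    return 2.0 * np.sum(np.log(1.0 + lam * d))

def el_confint(x, level=0.95, grid=None):
    # Invert the chi-squared(1) calibration over a grid of candidate means.
    cut = chi2.ppf(level, df=1)
    grid = np.linspace(x.min(), x.max(), 400) if grid is None else grid
    keep = [m for m in grid if neg2_log_elr(x, m) <= cut]
    return min(keep), max(keep)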

15.
Assessing dose response from flexible-dose clinical trials is problematic. The true dose effect may be obscured, or even reversed, in observed data because dose is related to both previous and subsequent outcomes. To remove this selection bias, we propose marginal structural model (MSM) methodology with inverse probability of treatment weighting (IPTW). Potential clinical outcomes are compared across dose groups using an MSM based on a weighted pooled repeated measures analysis (generalized estimating equations with robust standard errors), with the dose effect represented by current dose and recent dose history, and with weights estimated from the data (via logistic regression) as products of (i) the inverse probability of receiving the dose assignments actually received and (ii) the inverse probability of remaining on treatment by that time. In simulations, this method led to almost unbiased estimates of the true dose effect under various scenarios. Results were compared with those obtained by unweighted analyses and by weighted analyses under various model specifications. The simulations also showed that the IPTW MSM methodology is highly sensitive to model misspecification, even when the weights are known. Practitioners applying MSMs should be cautious about the challenges of implementing them with real clinical data. Clinical trial data are used to illustrate the methodology. Copyright © 2012 John Wiley & Sons, Ltd.
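A simplified sketch of how such weights can be assembled (binary high/low dose, pooled logistic models, no weight stabilization); the column names and predictors are illustrative assumptions, and the resulting per-subject cumulative weights would then be passed to a weighted GEE fit.

import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

def iptw_weights(df):
    # df: one row per subject-visit while on treatment, with columns id, visit,
    # high_dose (0/1), stay (1 = remains on treatment through this visit), and
    # prev_dose and baseline_sev as illustrative predictors.
    df = df.sort_values(["id", "visit"]).copy()
    # (i) inverse probability of the dose level actually received at this visit
    p_dose = smf.logit("high_dose ~ prev_dose + baseline_sev + visit",
                       data=df).fit(disp=0).predict(df).to_numpy()
    p_received = np.where(df["high_dose"].to_numpy() == 1, p_dose, 1.0 - p_dose)
    # (ii) inverse probability of remaining on treatment through this visit
    p_stay = smf.logit("stay ~ prev_dose + baseline_sev + visit",
                       data=df).fit(disp=0).predict(df).to_numpy()
    # accumulate the per-visit contributions within each subject
    df["w_visit"] = 1.0 / (p_received * p_stay)
    df["iptw"] = df.groupby("id")["w_visit"].cumprod()
    return df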

16.
Abstract. Although generalized cross‐validation (GCV) has been frequently applied to select bandwidth when kernel methods are used to estimate non‐parametric mixed‐effect models in which non‐parametric mean functions are used to model covariate effects, and additive random effects are applied to account for overdispersion and correlation, the optimality of the GCV has not yet been explored. In this article, we construct a kernel estimator of the non‐parametric mean function. An equivalence between the kernel estimator and a weighted least square type estimator is provided, and the optimality of the GCV‐based bandwidth is investigated. The theoretical derivations also show that kernel‐based and spline‐based GCV give very similar asymptotic results. This provides us with a solid base to use kernel estimation for mixed‐effect models. Simulation studies are undertaken to investigate the empirical performance of the GCV. A real data example is analysed for illustration.  相似文献   
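A small sketch of GCV-based bandwidth selection for a Nadaraya-Watson smoother of the mean function only (the additive random effects of the mixed-effect model are not represented); the Gaussian kernel and the candidate bandwidth grid are illustrative.

import numpy as np

def gcv_bandwidth(x, y, bandwidths):
    # Choose the bandwidth minimizing GCV(h) = mean squared residual
    # divided by (1 - trace(S_h)/n)^2, where S_h is the smoother matrix.
    n = len(y)
    best_h, best_gcv = None, np.inf
    for h in bandwidths:
        K = np.exp(-0.5 * ((x[:, None] - x[None, :]) / h) ** 2)
        S = K / K.sum(axis=1, keepdims=True)     # Nadaraya-Watson smoother matrix
        resid = y - S @ y
        gcv = np.mean(resid ** 2) / (1.0 - np.trace(S) / n) ** 2
        if gcv < best_gcv:
            best_h, best_gcv = h, gcv
    return best_h, best_gcv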

17.
When data are missing, analyzing only the records that are completely observed may cause bias or inefficiency. Existing approaches to handling missing data include likelihood, imputation and inverse probability weighting. In this paper, we propose three estimators inspired by deleting some completely observed data in the regression setting. First, we generate artificial observation indicators that are independent of the outcome given the observed data and draw inferences conditioning on the artificial observation indicators. Second, we propose a closely related weighting method. The proposed weighting method has more stable weights than those of the inverse probability weighting method (Zhao, L., Lipsitz, S., 1992. Designs and analysis of two-stage studies. Statistics in Medicine 11, 769–782). Third, we improve the efficiency of the proposed weighting estimator by subtracting the projection of the estimating function onto the nuisance tangent space. When data are missing completely at random, we show that the proposed estimators have asymptotic variances smaller than or equal to the variance of the estimator obtained from using completely observed records only. Asymptotic relative efficiency computations and simulation studies indicate that the proposed weighting estimators are more efficient than the inverse probability weighting estimators under a wide range of practical situations, especially when the missingness proportion is large.

18.
Summary.  Multiple imputation is now a well-established technique for analysing data sets where some units have incomplete observations. Provided that the imputation model is correct, the resulting estimates are consistent. An alternative, weighting by the inverse probability of observing complete data on a unit, is conceptually simple and involves fewer modelling assumptions, but it is known to be both inefficient (relative to a fully parametric approach) and sensitive to the choice of weighting model. Over the last decade, there has been a considerable body of theoretical work to improve the performance of inverse probability weighting, leading to the development of 'doubly robust' or 'doubly protected' estimators. We present an intuitive review of these developments and contrast these estimators with multiple imputation from both a theoretical and a practical viewpoint.  相似文献   
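For concreteness, a short sketch of the canonical augmented IPW ("doubly robust") estimator of a mean outcome under missingness at random, with a logistic working model for the probability of being observed and a linear working model for the outcome; both working models and the variable layout are illustrative assumptions.

import numpy as np
import statsmodels.api as sm

def aipw_mean(y, X, observed):
    # observed: 1 if y is seen, 0 otherwise; y is arbitrary where observed == 0.
    Xc = sm.add_constant(X)
    # working model for the probability of observing y
    pi = sm.Logit(observed, Xc).fit(disp=0).predict(Xc)
    # working outcome regression fitted on complete cases, predicted for everyone
    obs = observed == 1
    m = sm.OLS(y[obs], Xc[obs]).fit().predict(Xc)
    # doubly robust estimating function: consistent if either working model is correct
    return np.mean(observed * y / pi - (observed - pi) / pi * m)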

19.
Traditionally, Rao's score (RS) tests are constructed under a parametric specification of the probability density function. We estimate the density function by a non-parametric estimator and consider a semi-parametric Rao's score (SPRS) test for a set of hypotheses concerning the parametric model. The asymptotic distribution of the SPRS test is analyzed. Further, for the regression model, we carry out a set of Monte Carlo experiments to analyze the size and power of the SPRS test in small samples. The robustness of the SPRS test to the choice of the density estimator is also analyzed.

20.
Inverse probability weighting (IPW) can deal with confounding in nonrandomized studies. The inverse weights are probabilities of treatment assignment (propensity scores), estimated by regressing assignment on predictors. Problems arise if some predictors are missing. Solutions previously proposed include assuming that assignment depends only on observed predictors, and multiple imputation (MI) of the missing predictors. For the MI approach, it has been recommended that missingness indicators be used alongside the other predictors. We determine when the two MI approaches (with and without missingness indicators) yield consistent estimators and compare their efficiencies. We find that, although including indicators can reduce bias when predictors are missing not at random, it can induce bias when they are missing at random. We propose a consistent variance estimator and investigate the performance of the simpler Rubin's rules variance estimator. In simulations we find that both estimators perform well. IPW is also used to correct bias when an analysis model is fitted to incomplete data by restricting to complete cases. Here, weights are inverse probabilities of being a complete case. We explain how the same MI methods can be used in this situation to deal with missing predictors in the weight model, and illustrate this approach using data from the National Child Development Survey.
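A schematic of the complete-case IPW analysis with multiply imputed weight-model predictors: each imputed data set yields a weighted estimate and a (crude) within-imputation variance, which are then pooled by Rubin's rules. The imputation step itself, and the choice of whether to include missingness indicators (the question the article examines), are left outside the sketch; all names are illustrative.

import numpy as np
import statsmodels.api as sm

def ipw_complete_case_estimate(y, Z_imputed, complete):
    # One imputed data set: model P(complete case) from the (imputed) predictors,
    # then estimate the mean of y among complete cases with inverse-probability weights.
    Zc = sm.add_constant(Z_imputed)
    p = sm.Logit(complete, Zc).fit(disp=0).predict(Zc)
    cc = complete == 1
    w = 1.0 / p[cc]
    est = np.sum(w * y[cc]) / np.sum(w)
    # crude within-imputation variance for the weighted mean (ignores estimation of p)
    var = np.sum(w ** 2 * (y[cc] - est) ** 2) / np.sum(w) ** 2
    return est, var

def rubins_rules(estimates, variances):
    # Pool M point estimates and their within-imputation variances.
    q = np.mean(estimates)
    W = np.mean(variances)                      # within-imputation variance
    B = np.var(estimates, ddof=1)               # between-imputation variance
    T = W + (1.0 + 1.0 / len(estimates)) * B    # Rubin's total variance
    return q, T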
