期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

On kernel machine learning for propensity score estimation under complex confounding structures

Baiming Zou Xinlei Mi Patrick J. Tighe Gary G. Koch Fei Zou 《Pharmaceutical statistics》2021,20(4):752-764

Post marketing data offer rich information and cost-effective resources for physicians and policy-makers to address some critical scientific questions in clinical practice. However, the complex confounding structures (e.g., nonlinear and nonadditive interactions) embedded in these observational data often pose major analytical challenges for proper analysis to draw valid conclusions. Furthermore, often made available as electronic health records (EHRs), these data are usually massive with hundreds of thousands observational records, which introduce additional computational challenges. In this paper, for comparative effectiveness analysis, we propose a statistically robust yet computationally efficient propensity score (PS) approach to adjust for the complex confounding structures. Specifically, we propose a kernel-based machine learning method for flexibly and robustly PS modeling to obtain valid PS estimation from observational data with complex confounding structures. The estimated propensity score is then used in the second stage analysis to obtain the consistent average treatment effect estimate. An empirical variance estimator based on the bootstrap is adopted. A split-and-merge algorithm is further developed to reduce the computational workload of the proposed method for big data, and to obtain a valid variance estimator of the average treatment effect estimate as a by-product. As shown by extensive numerical studies and an application to postoperative pain EHR data comparative effectiveness analysis, the proposed approach consistently outperforms other competing methods, demonstrating its practical utility. 相似文献

2.

Comparison of various machine learning algorithms for estimating generalized propensity score

Chunhao Tu 《Journal of Statistical Computation and Simulation》2019,89(4):708-719

In this paper, we conducted a simulation study to evaluate the performance of four algorithms: multinomial logistic regression (MLR), bagging (BAG), random forest (RF), and gradient boosting (GB), for estimating generalized propensity score (GPS). Similar to the propensity score (PS), the ultimate goal of using GPS is to estimate unbiased average treatment effects (ATEs) in observational studies. We used the GPS estimates computed from these four algorithms with the generalized doubly robust (GDR) estimator to estimate ATEs in observational studies. We evaluated these ATE estimates in terms of bias and mean squared error (MSE). Simulation results show that overall, the GB algorithm produced the best ATE estimates based on these evaluation criteria. Thus, we recommend using the GB algorithm for estimating GPS in practice. 相似文献

3.

An improved comorbidity summary score for measuring disease burden and predicting mortality with applications to two national cohorts

Ralph C. Ward Leonard Egede Viswanathan Ramakrishnan Lewis Frey Robert Neal Axon Clara Libby E. Dismuke 《统计学通讯:理论与方法》2013,42(18):4642-4655

Abstract

Research involving administrative healthcare data to study patient outcomes requires the investigator to account for the patient’s disease burden in order to reduce the potential for biased results. Here we develop a comorbidity summary score based on variable importance measures derived from several statistical and machine learning methods and show it has superior predictive performance to the Elixhauser and Charlson indices when used to predict 1-year, 5-year, and 10-year mortality. We used two large Veterans Administration cohorts to develop and validate the summary score and compared predictive performance using the area under ROC curve (AUC) and the Brier score. 相似文献

4.

Performance evaluation of propensity score methods for estimating average treatment effects with multi-level treatments*

Hui Nian Juan Ding Huiyun Wu William D. Dupont Steve Brunwasser 《Journal of applied statistics》2019,46(5):853-873

The propensity score (PS) method is widely used to estimate the average treatment effect (TE) in observational studies. However, it is generally confined to the binary treatment assignment. In an extension to the settings of a multi-level treatment, Imbens proposed a generalized propensity score which is the conditional probability of receiving a particular level of the treatment given pre-treatment variables. The average TE can then be estimated by conditioning solely on the generalized PS under the assumption of weak unconfoundedness. In the present work, we adopted this approach and conducted extensive simulations to evaluate the performance of several methods using the generalized PS, including subclassification, matching, inverse probability of treatment weighting (IPTW), and covariate adjustment. Compared with other methods, IPTW had the preferred overall performance. We then applied these methods to a retrospective cohort study of 228,876 pregnant women. The impact of the exposure to different types of the antidepressant medications (no exposure, selective serotonin reuptake inhibitor (SSRI) only, non-SSRI only, and both) during pregnancy on several important infant outcomes (birth weight, gestation age, preterm labor, and respiratory distress) were assessed. 相似文献

5.

Estimation of the average causal effect via multiple propensity score stratification

Takanobu Nomura 《统计学通讯:模拟与计算》2018,47(1):48-62

Suppose we are interested in estimating the average causal effect (ACE) for the population mean from observational study. Because of simplicity and ease of interpretation, stratification by a propensity score (PS) is widely used to adjust for influence of confounding factors in estimation of the ACE. Appropriateness of the estimation by the PS stratification relies on correct specification of the PS. We propose an estimator based on stratification with multiple PS models by clustering techniques instead of model selection. If one of them correctly specifies, the proposed estimator removes bias and thus is more robust than the standard PS stratification. 相似文献

6.

Comparing different propensity score estimation methods for estimating the marginal causal effect through standardization to propensity scores

Safoora Gharibzadeh Mohammad Ali Mansournia Abbas Rahimiforoushani Ahad Alizadeh Atieh Amouzegar Kamran Mehrabani-Zeinabad 《统计学通讯:模拟与计算》2018,47(4):964-976

Hernan and Robins proposed a method for calculating marginal causal effect of treatment using standardization to propensity scores.

?Data adaptive methods have been suggested as alternatives to logistic regression for the estimation of propensity scores. We examined the performance of various data mining methods using simulated data. The estimators' performance was evaluated in terms of relative bias, 95% CI coverage rate, and mean squared error.

?All methods (except CART and GBM) displayed generally acceptable performance. However, under the conditions of moderate non-additivity and moderate nonlinearity, ANN and SL outperformed logistic regression with better bias reduction and more consistent 95% CI coverage. 相似文献

7.

Subsemble: an ensemble method for combining subset-specific algorithm fits

Stephanie Sapp Mark J. van der Laan John Canny 《Journal of applied statistics》2014,41(6):1247-1259

Ensemble methods using the same underlying algorithm trained on different subsets of observations have recently received increased attention as practical prediction tools for massive data sets. We propose Subsemble: a general subset ensemble prediction method, which can be used for small, moderate, or large data sets. Subsemble partitions the full data set into subsets of observations, fits a specified underlying algorithm on each subset, and uses a clever form of V-fold cross-validation to output a prediction function that combines the subset-specific fits. We give an oracle result that provides a theoretical performance guarantee for Subsemble. Through simulations, we demonstrate that Subsemble can be a beneficial tool for small- to moderate-sized data sets, and often has better prediction performance than the underlying algorithm fit just once on the full data set. We also describe how to include Subsemble as a candidate in a SuperLearner library, providing a practical way to evaluate the performance of Subsemble relative to the underlying algorithm fit just once on the full data set. 相似文献

8.

Propensity score applied to survival data analysis through proportional hazards models: a Monte Carlo study

Etienne Gayat Matthieu Resche‐Rigon Jean‐Yves Mary Raphaël Porcher 《Pharmaceutical statistics》2012,11(3):222-229

Propensity score methods are increasingly used in medical literature to estimate treatment effect using data from observational studies. Despite many papers on propensity score analysis, few have focused on the analysis of survival data. Even within the framework of the popular proportional hazard model, the choice among marginal, stratified or adjusted models remains unclear. A Monte Carlo simulation study was used to compare the performance of several survival models to estimate both marginal and conditional treatment effects. The impact of accounting or not for pairing when analysing propensity‐score‐matched survival data was assessed. In addition, the influence of unmeasured confounders was investigated. After matching on the propensity score, both marginal and conditional treatment effects could be reliably estimated. Ignoring the paired structure of the data led to an increased test size due to an overestimated variance of the treatment effect. Among the various survival models considered, stratified models systematically showed poorer performance. Omitting a covariate in the propensity score model led to a biased estimation of treatment effect, but replacement of the unmeasured confounder by a correlated one allowed a marked decrease in this bias. Our study showed that propensity scores applied to survival data can lead to unbiased estimation of both marginal and conditional treatment effect, when marginal and adjusted Cox models are used. In all cases, it is necessary to account for pairing when analysing propensity‐score‐matched data, using a robust estimator of the variance. Copyright © 2012 John Wiley & Sons, Ltd. 相似文献

9.

Controlling for confounding via propensity score methods can result in biased estimation of the conditional AUC: A simulation study

Hadiza I. Galadima Donna K. McClish 《Pharmaceutical statistics》2019,18(5):568-582

In the medical literature, there has been an increased interest in evaluating association between exposure and outcomes using nonrandomized observational studies. However, because assignments to exposure are not random in observational studies, comparisons of outcomes between exposed and nonexposed subjects must account for the effect of confounders. Propensity score methods have been widely used to control for confounding, when estimating exposure effect. Previous studies have shown that conditioning on the propensity score results in biased estimation of conditional odds ratio and hazard ratio. However, research is lacking on the performance of propensity score methods for covariate adjustment when estimating the area under the ROC curve (AUC). In this paper, AUC is proposed as measure of effect when outcomes are continuous. The AUC is interpreted as the probability that a randomly selected nonexposed subject has a better response than a randomly selected exposed subject. A series of simulations has been conducted to examine the performance of propensity score methods when association between exposure and outcomes is quantified by AUC; this includes determining the optimal choice of variables for the propensity score models. Additionally, the propensity score approach is compared with that of the conventional regression approach to adjust for covariates with the AUC. The choice of the best estimator depends on bias, relative bias, and root mean squared error. Finally, an example looking at the relationship of depression/anxiety and pain intensity in people with sickle cell disease is used to illustrate the estimation of the adjusted AUC using the proposed approaches. 相似文献

10.

Propensity score model specification for estimation of average treatment effects

Ingeborg Waernbaum 《Journal of statistical planning and inference》2010

Treatment effect estimators that utilize the propensity score as a balancing score, e.g., matching and blocking estimators are robust to misspecifications of the propensity score model when the misspecification is a balancing score. Such misspecifications arise from using the balancing property of the propensity score in the specification procedure. Here, we study misspecifications of a parametric propensity score model written as a linear predictor in a strictly monotonic function, e.g. a generalized linear model representation. Under mild assumptions we show that for misspecifications, such as not adding enough higher order terms or choosing the wrong link function, the true propensity score is a function of the misspecified model. Hence, the latter does not bring bias to the treatment effect estimator. It is also shown that a misspecification of the propensity score does not necessarily lead to less efficient estimation of the treatment effect. The results of the paper are highlighted in simulations where different misspecifications are studied. 相似文献

11.

基于协变量平衡加权的平均处理效应的稳健有效估计

吴浩彭非《统计研究》2020,37(4):114-128

倾向性得分是估计平均处理效应的重要工具。但在观察性研究中,通常会由于协变量在处理组与对照组分布的不平衡性而导致极端倾向性得分的出现,即存在十分接近于0或1的倾向性得分,这使得因果推断的强可忽略假设接近于违背,进而导致平均处理效应的估计出现较大的偏差与方差。Li等(2018a)提出了协变量平衡加权法,在无混杂性假设下通过实现协变量分布的加权平衡,解决了极端倾向性得分带来的影响。本文在此基础上,提出了基于协变量平衡加权法的稳健且有效的估计方法,并通过引入超级学习算法提升了模型在实证应用中的稳健性;更进一步,将前一方法推广至理论上不依赖于结果回归模型和倾向性得分模型假设的基于协变量平衡加权的稳健有效估计。蒙特卡洛模拟表明,本文提出的两种方法在结果回归模型和倾向性得分模型均存在误设时仍具有极小的偏差和方差。实证部分将两种方法应用于右心导管插入术数据,发现右心导管插入术大约会增加患者6. 3%死亡率。相似文献

12.

大数据背景下网络调查样本的随机森林倾向得分模型推断研究

刘展等《统计研究》2021,38(11):130-140

随着大数据与互联网技术的迅猛发展,网络调查的应用越来越广泛。本文提出网络调查样本的随机森林倾向得分模型推断方法,通过构建若干棵分类决策树组成随机森林,对网络调查样本单元的倾向得分进行估计,从而实现对总体的推断。模拟分析和实证研究结果表明:基于随机森林倾向得分模型的总体均值估计的相对偏差、方差与均方误差均比基于Logistic倾向得分模型的总体均值估计的相对偏差、方差与均方误差小,提出的方法估计效果更好。相似文献

13.

Comparison of clustering algorithms on generalized propensity score in observational studies: a simulation study

Chunhao Tu Shuo Jiao Woon Yuen Koh 《Journal of Statistical Computation and Simulation》2013,83(12):2206-2218

In observational studies, unbalanced observed covariates between treatment groups often cause biased inferences on the estimation of treatment effects. Recently, generalized propensity score (GPS) has been proposed to overcome this problem; however, a practical technique to apply the GPS is lacking. This study demonstrates how clustering algorithms can be used to group similar subjects based on transformed GPS. We compare four popular clustering algorithms: k-means clustering (KMC), model-based clustering, fuzzy c-means clustering and partitioning around medoids based on the following three criteria: average dissimilarity between subjects within clusters, average Dunn index and average silhouette width under four various covariate scenarios. Simulation studies show that the KMC algorithm has overall better performance compared with the other three clustering algorithms. Therefore, we recommend using the KMC algorithm to group similar subjects based on the transformed GPS. 相似文献

14.

Propensity score matching and stratification using multiparty data without pooling

Jixian Wang Roland Marion-Gallois 《Pharmaceutical statistics》2023,22(1):4-19

Matching and stratification based on confounding factors or propensity scores (PS) are powerful approaches for reducing confounding bias in indirect treatment comparisons. However, implementing these approaches requires pooled individual patient data (IPD). The research presented here was motivated by an indirect comparison between a single-armed trial in acute myeloid leukemia (AML), and two external AML registries with current treatments for a control. For confidentiality reasons, IPD cannot be pooled. Common approaches to adjusting confounding bias, such as PS matching or stratification, cannot be applied as 1) a model for PS, for example, a logistic model, cannot be fitted without pooling covariate data; 2) pooling response data may be necessary for some statistical inference (e.g., estimating the SE of mean difference of matched pairs) after PS matching. We propose a set of approaches that do not require pooling IPD, using a combination of methods including a linear discriminant for matching and stratification, and secure multiparty computation for estimation of within-pair sample variance and for calculations involving multiple control sources. The approaches only need to share aggregated data offline, rather than real-time secure data transfer, as required by typical secure multiparty computation for model fitting. For survival analysis, we propose an approach using restricted mean survival time. A simulation study was conducted to evaluate this approach in several scenarios, in particular, with a mixture of continuous and binary covariates. The results confirmed the robustness and efficiency of the proposed approach. A real data example is also provided for illustration. 相似文献

15.

Using generalized doubly robust estimator to estimate average treatment effects of multiple treatments in observational studies

Chunhao Tu Woon Yuen Koh Shuo Jiao 《Journal of Statistical Computation and Simulation》2013,83(8):1518-1526

The generalized doubly robust estimator is proposed for estimating the average treatment effect (ATE) of multiple treatments based on the generalized propensity score (GPS). In medical researches where observational studies are conducted, estimations of ATEs are usually biased since the covariate distributions could be unbalanced among treatments. To overcome this problem, Imbens [The role of the propensity score in estimating dose-response functions, Biometrika 87 (2000), pp. 706–710] and Feng et al. [Generalized propensity score for estimating the average treatment effect of multiple treatments, Stat. Med. (2011), in press. Available at: http://onlinelibrary.wiley.com/doi/10.1002/sim.4168/abstract] proposed weighted estimators that are extensions of a ratio estimator based on GPS to estimate ATEs with multiple treatments. However, the ratio estimator always produces a larger empirical sample variance than the doubly robust estimator, which estimates an ATE between two treatments based on the estimated propensity score (PS). We conduct a simulation study to compare the performance of our proposed estimator with Imbens’ and Feng et al.’s estimators, and simulation results show that our proposed estimator outperforms their estimators in terms of bias, empirical sample variance and mean-squared error of the estimated ATEs. 相似文献

16.

Modeling road traffic crashes with zero-inflation and site-specific random effects

Helai Huang Hong Chor Chin 《Statistical Methods and Applications》2010,19(3):445-462

Zero-inflated count models are increasingly employed in many fields in case of “zero-inflation”. In modeling road traffic crashes, it has also shown to be useful in obtaining a better model-fitting when zero crash counts are over-presented. However, the general specification of zero-inflated model can not account for the multilevel data structure in crash data, which may be an important source of over-dispersion. This paper examines zero-inflated Poisson regression with site-specific random effects (REZIP) with comparison to random effect Poisson model and standard zero-inflated poison model. A practical and flexible procedure, using Bayesian inference with Markov Chain Monte Carlo algorithm and cross-validation predictive density techniques, is applied for model calibration and suitability assessment. Using crash data in Singapore (1998–2005), the illustrative results demonstrate that the REZIP model may significantly improve the model-fitting and predictive performance of crash prediction models. This improvement can contribute to traffic safety management and engineering practices such as countermeasure design and safety evaluation of traffic treatments. 相似文献

17.

Bayesian misclassification and propensity score methods for clustered observational studies

Qi Zhou Yoo-Mi Chin James D. Stamey 《Journal of applied statistics》2018,45(9):1547-1560

Bayesian propensity score regression analysis with misclassified binary responses is proposed to analyse clustered observational data. This approach utilizes multilevel models and corrects for misclassification in the responses. Using the deviance information criterion (DIC), the performance of the approach is compared with approaches without correcting for misclassification, multilevel structure specification, or both in the study of the impact of female employment on the likelihood of physical violence. The smallest DIC confirms that our proposed model best fits the data. We conclude that female employment has an insignificant impact on the likelihood of physical spousal violence towards women. In addition, a simulation study confirms that the proposed approach performed best in terms of bias and coverage rate. Ignoring misclassification in response or multilevel structure of data would yield biased estimation of the exposure effect. 相似文献

18.

A comparison of balancing scores for estimating rate ratios of count data in observational studies

Chunhao Tu Woon Yuen Koh 《统计学通讯:模拟与计算》2017,46(1):772-778

In this article, we conduct a Monte Carlo study to examine four balancing scores (BS1: propensity score, BS2: prognostic score, BS3: adjusted propensity score estimated by the estimated prognostic score, and BS4: adjusted propensity score estimated by the estimated prognostic score and other covariates) for adjusting bias in estimating the marginal and the conditional rate ratios of count data in observational studies. Simulation results show that BS1–BS4 are not much different in terms of estimating the marginal and the conditional rate ratios, however, choosing the appropriate matching algorithm is more important than selecting a balancing scores. 相似文献

19.

Using Inverse Probability Weighting Estimators to Evaluate Various Propensity Scores When Treatment Switching Exists

Chunhao Tu Woon Yuen Koh 《统计学通讯:模拟与计算》2016,45(6):2182-2190

In this paper, we conduct a Monte Carlo simulation study to evaluate three propensity score (PS) scenarios for estimating an average treatment effect (ATE) in observational studies when treatment switching exists: (a) ignoring treatment switching in subjects (UPS), (b) removing subjects with treatment switching (RPS), and (c) adjusting for treatment switching effect (APS) with two inverse probability weighting estimators, IPW1 and IPW2. We evaluate these six estimators in terms of bias, mean squared error (MSE), empirical standard error (ESE), and coverage probability (CP) under various simulation scenarios. Simulation results show that the IPW2 estimator with RPS has relatively good performance. 相似文献

20.

Inference for proportional hazard model with propensity score

Bo Lu Luheng Wang Xingwei Tong Huiyun Xiang 《统计学通讯:理论与方法》2018,47(12):2908-2918

Since the publication of the seminal paper by Cox (1972), proportional hazard model has become very popular in regression analysis for right censored data. In observational studies, treatment assignment may depend on observed covariates. If these confounding variables are not accounted for properly, the inference based on the Cox proportional hazard model may perform poorly. As shown in Rosenbaum and Rubin (1983), under the strongly ignorable treatment assignment assumption, conditioning on the propensity score yields valid causal effect estimates. Therefore we incorporate the propensity score into the Cox model for causal inference with survival data. We derive the asymptotic property of the maximum partial likelihood estimator when the model is correctly specified. Simulation results show that our method performs quite well for observational data. The approach is applied to a real dataset on the time of readmission of trauma patients. We also derive the asymptotic property of the maximum partial likelihood estimator with a robust variance estimator, when the model is incorrectly specified. 相似文献