Similar articles
 20 similar articles found (search time: 125 ms)
1.
Propensity score matching has a long tradition in handling confounding in causal inference, but it requires stringent model assumptions. In this article, we propose a novel double score matching (DSM) method that uses both the propensity score and the prognostic score. To guard against possible model misspecification, we posit multiple candidate models for each score. We show that the debiasing DSM estimator is multiply robust in that it is consistent if any one of the score models is correctly specified. We characterize the asymptotic distribution of the DSM estimator, requiring only one correct model specification, based on the martingale representations of matching estimators and the theory of local normal experiments. We also provide a two-stage replication method for variance estimation and extend DSM to quantile estimation. Simulation demonstrates that DSM outperforms single-score matching and prevailing multiply robust weighting estimators in the presence of extreme propensity scores.

2.
We compared the efficacy of five imputation algorithms readily available in SAS for the quadratic discriminant function. We generated training data with missing values under several parametric configurations, including monotone missing-at-random observations, and used a Monte Carlo simulation to examine the expected probabilities of misclassification for the two-class quadratic statistical discrimination problem under five different imputation methods. Specifically, we compared the efficacy of the complete observation-only method and the mean substitution, regression, predictive mean matching, propensity score, and Markov Chain Monte Carlo (MCMC) imputation methods. We found that the MCMC and propensity score multiple imputation approaches are, in general, superior to the other imputation methods for the configurations and training-sample sizes we considered.

3.
Spatial data and nonparametric methods arise frequently in studies across many fields, and it is common practice to analyze such data with semiparametric spatial autoregressive (SPSAR) models. We propose estimators for SPSAR models based on maximum likelihood estimation (MLE) and kernel estimation. The spatial regression coefficient ρ is estimated by optimizing the concentrated log-likelihood function with respect to ρ. Furthermore, under appropriate conditions, we derive the limiting distributions of our estimators for both the parametric and nonparametric components of the model.

4.
Sample quantile, rank, and outlyingness functions play long-established roles in univariate exploratory data analysis. In recent years, various multivariate generalizations have been formulated, among which the “spatial” approach has become especially well developed, including fully affine equivariant/invariant versions with but modest computational burden (24, 6, 34, 32 and 25). The only shortcoming of the spatial approach is that its robustness decreases to zero as the quantile or outlyingness level is chosen farther out from the center (Dang and Serfling, 2010). This is especially detrimental to exploratory data analysis procedures such as detection of outliers and delineation of the “middle” 50%, 75%, or 90% of the data set, for example. Here we develop suitably robust versions using a trimming approach. The improvements in robustness are illustrated and characterized using simulated and actual data. Also, as a byproduct of the investigation, a new robust, affine equivariant, and computationally easy scatter estimator is introduced.

5.
Spatial outliers are spatially referenced objects whose nonspatial attribute values differ significantly from the corresponding values in their spatial neighborhoods. In other words, a spatial outlier is a local instability or an extreme observation that deviates significantly within its spatial neighborhood, but possibly not within the entire dataset. In this article, we propose a novel spatial outlier detection algorithm, location quotient (LQ), for multi-attribute spatial datasets and compare its performance with the well-known mean and median algorithms for multi-attribute spatial datasets. In particular, we apply the mean, median, and LQ algorithms to a real dataset and to simulated spatial datasets of 13 different sizes to compare their performance. In addition, we calculate area under the curve values in all cases, which show that our proposed algorithm is more powerful than the mean and median algorithms in almost all the cases considered, and we plot receiver operating characteristic curves in some cases.

6.
In this article, we propose an outlier detection approach for the multiple regression model that uses the properties of a difference-based variance estimator. This type of difference-based variance estimator was originally used to estimate the error variance in a nonparametric regression model without estimating the nonparametric function. This article is the first to employ a difference-based error variance estimator to study the outlier detection problem in a multiple regression model. Our approach uses a leave-one-out type method based on the difference-based error variance. Existing outlier detection approaches that use the leave-one-out approach are highly affected by other outliers, while ours is not, because our approach does not use the regression coefficient estimator. We compared our approach with several existing methods in a simulation study, which suggests that our approach performs better. The advantages of our approach are demonstrated using a real data application. Our approach can be extended to the nonparametric regression model for outlier detection.

7.
The Weibull distribution is widely used due to its versatility and relative simplicity. In this paper, we provide noninformative priors for the ratio of the scale parameters of two Weibull models. We consider the asymptotic matching of the coverage probabilities of Bayesian credible intervals with the corresponding frequentist coverage probabilities. We develop various priors for the ratio of the two scale parameters using the following matching criteria: quantile matching, matching of the distribution function, highest posterior density matching, and inversion of test statistics. One particular prior, which meets all the matching criteria, is found. Next, we derive the reference priors for groups of ordering. We see that all the reference priors satisfy a first-order matching criterion and that the one-at-a-time reference prior is a second-order matching prior. A simulation study is performed and an example is given.

8.
In the present communication, we consider the estimation of the common hazard rate of several exponential distributions with unknown and unequal location parameters and a common scale parameter, under a general class of bowl-shaped scale-invariant loss functions. We show that the best affine equivariant estimator (BAEE) is inadmissible by deriving a nonsmooth improved estimator. Further, we obtain a smooth estimator that improves upon the BAEE. As an application, we obtain explicit expressions for the improved estimators under special loss functions. Finally, a simulation study is carried out to compare numerically the risk performance of the various estimators.

9.
Testing the homogeneity of several rival models is of practical interest. In this article, we consider a nonparametric multiple test for nonnested distributions in the context of model selection. Based on the linear sign rank test and the well-known union–intersection principle, we let the magnitude of the data enter the test statistic to improve its performance. We treat the sample and the nonnested rival models as blocks and treatments, respectively, and introduce an extended version of the Friedman test to compare its results with those of the test based on the linear sign rank test. A real dataset on the waiting time to earthquakes is used to illustrate the results.

10.
Recent evidence indicates that multiple forward rates sharply predict future excess returns on U.S. Treasury bonds, with R² values of around 30%. The projection coefficients in these regressions exhibit a distinct pattern that relates to the maturity of the forward rate. These dimensions of the data, in conjunction with the transition dynamics of bond yields, offer a serious challenge to term structure models. In this article we show that a regime-shifting term structure model can empirically account for these challenging data features. Alternative models, such as affine specifications, fail to account for these important features. We find that regimes in the model are intimately related to bond risk premia and real business cycles.

11.
We apply statistical selection theory to multiple target detection problems by analyzing the Mahalanobis distances between multivariate normal populations and a desired standard (a known characteristic of a target). Our goal is to select a subset that contains no nontarget (negative) sites, which entails screening out all nontargets. Correct selection (CS) is defined according to this goal. We consider two cases: (1) all covariance matrices are known; and (2) all covariance matrices are unknown, including both heteroscedastic and homoscedastic cases. Optimal selection procedures are proposed to reach the selection goal. The least favorable configurations (LFC) are found. Tables and figures are presented to illustrate the properties of our proposed procedures. Simulation examples are given to show that our procedures work well. The log-concavity results of the operating characteristic functions are also given.

12.
To increase the efficiency of comparisons between treatments in clinical trials, we may consider a multiple matching design, in which each patient receiving the experimental treatment is matched with more than one patient receiving the standard treatment. To assess the efficacy of the experimental treatment, the risk ratio (RR) of patient responses between the two treatments is one of the most commonly used measures. Because the probability of patient response in clinical trials is often not small, the odds ratio (OR), whose practical interpretation is not easily understood, cannot approximate the RR well. Thus, sample size formulae in terms of the OR for case-control studies with multiple matched controls per case can be of limited use here. In this paper, we develop three sample size formulae based on the RR for randomized trials with multiple matching. We propose a test statistic for testing the equality of RR under multiple matching. On the basis of Monte Carlo simulation, we evaluate the performance of the proposed test statistic with respect to Type I error. To evaluate the accuracy and usefulness of the three sample size formulae developed in this paper, we further calculate their simulated powers and compare them with those of the sample size formula ignoring matching and the sample size formula based on the OR for multiple matching published elsewhere. Finally, we include an example with a multiple matching study design, on the use of supplemental ascorbate in the supportive treatment of terminal cancer patients, to illustrate the use of these formulae.
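The abstract's remark that the OR approximates the RR poorly when responses are not rare can be illustrated with a minimal Python sketch. The response probabilities below are hypothetical values chosen for illustration, not data from the paper, and the function names are assumptions.

```python
def risk_ratio(p_exp, p_std):
    """Ratio of response probabilities: experimental vs. standard treatment."""
    return p_exp / p_std


def odds_ratio(p_exp, p_std):
    """Ratio of response odds: experimental vs. standard treatment."""
    return (p_exp / (1 - p_exp)) / (p_std / (1 - p_std))


# Hypothetical (experimental, standard) response probabilities.
scenarios = {
    "rare response": (0.02, 0.01),
    "common response": (0.60, 0.30),
}

for label, (p_exp, p_std) in scenarios.items():
    rr = risk_ratio(p_exp, p_std)
    oratio = odds_ratio(p_exp, p_std)
    print(f"{label}: RR = {rr:.3f}, OR = {oratio:.3f}")
```

In both scenarios the RR is 2, but the OR is about 2.02 when the response is rare and 3.5 when it is common, which is why sample size formulae stated in terms of the OR may not transfer to trials with frequent responses.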

13.
It is often of interest to use regression analysis to study the relationship between the occurrence of events in space and spatially indexed covariates. One model for such regression analysis is the Poisson point process. Here, we develop a regularization method that performs covariate selection and parameter estimation simultaneously for this model. We assess the finite-sample properties of our method with a simulation study. In addition, we propose a variant of our method that allows the selection of covariates at multiple pixel resolutions. For illustration, we consider the locations of a tree species, Beilschmiedia pendula, in a study plot at Barro Colorado Island in central Panama. We find that Beilschmiedia pendula occurs in greater abundance at locations with higher elevation and steeper slope. Also, we identify three species to which Beilschmiedia pendula tends to be attracted, two species by which it appears to be repelled, and a species with no apparent relationship.

14.
We develop a discrete-time affine stochastic volatility model with time-varying conditional skewness (SVS). Importantly, we disentangle the dynamics of conditional volatility and conditional skewness in a coherent way. Our approach allows current asset returns to be asymmetric conditional on current factors and past information, which we term contemporaneous asymmetry. Conditional skewness is an explicit combination of the conditional leverage effect and contemporaneous asymmetry. We derive analytical formulas for various return moments that are used for generalized method of moments (GMM) estimation. Applying our approach to S&P500 index daily returns and option data, we show that one- and two-factor SVS models provide a better fit for both the historical and the risk-neutral distribution of returns, compared to existing affine generalized autoregressive conditional heteroscedasticity (GARCH) and stochastic volatility with jumps (SVJ) models. Our results are not due to an overparameterization of the model: the one-factor SVS models have the same number of parameters as their one-factor GARCH competitors and fewer than the SVJ benchmark.

15.
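Hu Haipeng. Statistical Research (《统计研究》), 2002, 19(10): 33-36
I. Introduction. The traditional bond pricing method, the discounting of future cash flows, was derived from present value theory by the American Williams (Williams, John Henry) and was once widely used by investors to measure the investment value of bonds. However, the model provides no definite standard for choosing the discount rate, so the choice is fairly arbitrary; the computed bond price is therefore fairly arbitrary as well, and the model's shortcomings have gradually become apparent. As the theory of the term structure of interest rates has developed, bond pricing methods have advanced considerably. In particular, the last ten years or so have seen the emergence of stochastic-process, no-arbitrage analysis of the term structure of interest rates, which holds that the term structure and bond prices are related to certain stochastic factors (that is, state variables)…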
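The sensitivity of the discounted cash flow price to the chosen discount rate, which the introduction above criticizes, can be seen in a minimal Python sketch. It assumes a plain bond with annual coupons and a single flat discount rate; the function name and all numbers are hypothetical and do not come from the article.

```python
def bond_price(face, coupon_rate, years, discount_rate):
    """Present value of annual coupons plus the face value repaid at maturity."""
    coupon = face * coupon_rate
    pv_coupons = sum(
        coupon / (1 + discount_rate) ** t for t in range(1, years + 1)
    )
    pv_face = face / (1 + discount_rate) ** years
    return pv_coupons + pv_face


# Hypothetical 10-year bond, face value 100, 5% annual coupon,
# priced under three candidate discount rates.
for rate in (0.04, 0.05, 0.06):
    price = bond_price(100, 0.05, 10, rate)
    print(f"discount rate {rate:.0%}: price = {price:.2f}")
```

The bond prices at par only when the discount rate equals the coupon rate; moving the rate by one percentage point in either direction shifts the price by several points, so an arbitrary choice of rate yields an equally arbitrary price.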

16.
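Li Jiancheng et al. Statistical Research (《统计研究》), 2021, 38(11): 115-129
In the stage of high-quality economic development, the spatial reallocation of knowledge resources is of great practical significance. However, little research has examined how, under current geographic constraints, improvements in transport costs affect cross-space knowledge collaboration among skilled workers in different locations, or how their spatial preferences, tendencies, and strategies for allocating knowledge among collaborators in different locations change. To this end, this paper first constructs a location choice model that incorporates different geographic constraints, changes in transport costs, and knowledge collaboration among heterogeneous workers. In the model, improvements in intercity transport network infrastructure reduce the time cost of face-to-face communication among skilled workers and raise the matching quality of knowledge collaboration, thereby promoting collaboration over longer distances and improving its performance, accompanied by a shift in knowledge allocation from within the region to outside it. In addition, a city's spatial preference in allocating knowledge resources first decreases and then increases as the size of the market it can reach for exchange grows, and shows a trend toward concentrated allocation of knowledge resources. Further, using data on co-authored papers in China from 2006 to 2015 and a multi-period difference-in-differences approach, the paper estimates how the reduction in labor transport costs brought about by high-speed rail connections affects the spatial allocation of knowledge resources. The empirical results are consistent with the theoretical model.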

17.
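The technical problems of record linkage are closely tied to statistical theory; in particular, establishing classification rules for record linkage requires building statistical models and identifying key variables in order to complete data matching. Building a hierarchical model in a Bayesian framework to integrate administrative records makes it possible to estimate matching error rates through multiple regression. Moreover, record linkage under a one-to-one restriction allows changes in the source of record information to be reflected through modules, and posterior distributions based on MCMC simulation are convenient to compute, which helps improve the efficiency of data integration.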

18.
The affine dynamic term structure model (DTSM) is the canonical empirical finance representation of the yield curve. However, the possibility that DTSM estimates may be distorted by small-sample bias has been largely ignored. We show that conventional estimates of DTSM coefficients are indeed severely biased, and this bias results in misleading estimates of expected future short-term interest rates and of long-maturity term premia. We provide a variety of bias-corrected estimates of affine DTSMs, for both maximally flexible and overidentified specifications. Our estimates imply interest rate expectations and term premia that are more plausible from a macrofinance perspective. This article has supplementary material online.

19.
In geophysical and environmental problems, it is common to have multiple variables of interest measured at the same location and time. These variables typically exhibit dependence over space (and/or time). As a consequence, there is growing interest in developing models for multivariate spatial processes, in particular cross-covariance models. In addition, many data sets these days, such as satellite data, cover a large portion of the Earth and therefore require valid covariance models on a globe. We present a class of parametric covariance models for multivariate processes on a globe. The covariance models are flexible in capturing non-stationarity in the data, yet computationally feasible, and require a moderate number of parameters. We apply our covariance model to surface temperature and precipitation data from an NCAR climate model output. We compare our model to the multivariate version of the Matérn cross-covariance function and to models based on coregionalization, and demonstrate the superior performance of our model in terms of AIC (and/or maximum log-likelihood values) and predictive skill. We also present some challenges in modelling the cross-covariance structure of the temperature and precipitation data. Based on the results fitted to the full data, we give the estimated cross-correlation structure between the two variables.

20.
The Prime Power Conjecture asserts that the order of an affine difference set is an integral power of a prime number. K.T. Arasu and D. Jungnickel have shown that if the order of an affine difference set is even, then the order is either two or four or is divisible by eight. This paper extends this direction of research by studying cyclic affine difference sets whose orders are congruent to eight modulo sixteen. In particular, we give several numerical constraints that support the Prime Power Conjecture in this case. We conclude with a computer search for cyclic affine difference sets of order eight modulo sixteen that satisfy these new conditions.
