Damage models for natural hazards are used for decision making on reducing and transferring risk. The damage estimates from these models depend on many variables and their complex sometimes nonlinear relationships with the damage. In recent years, data‐driven modeling techniques have been used to capture those relationships. The available data to build such models are often limited. Therefore, in practice it is usually necessary to transfer models to a different context. In this article, we show that this implies the samples used to build the model are often not fully representative for the situation where they need to be applied on, which leads to a “sample selection bias.” In this article, we enhance data‐driven damage models by applying methods, not previously applied to damage modeling, to correct for this bias before the machine learning (ML) models are trained. We demonstrate this with case studies on flooding in Europe, and typhoon wind damage in the Philippines. Two sample selection bias correction methods from the ML literature are applied and one of these methods is also adjusted to our problem. These three methods are combined with stochastic generation of synthetic damage data. We demonstrate that for both case studies, the sample selection bias correction techniques reduce model errors, especially for the mean bias error this reduction can be larger than 30%. The novel combination with stochastic data generation seems to enhance these techniques. This shows that sample selection bias correction methods are beneficial for damage model transfer. 相似文献
Motivated by a breast cancer research program, this paper is concerned with the joint survivor function of multiple event times when their observations are subject to informative censoring caused by a terminating event. We formulate the correlation of the multiple event times together with the time to the terminating event by an Archimedean copula to account for the informative censoring. Adapting the widely used two-stage procedure under a copula model, we propose an easy-to-implement pseudo-likelihood based procedure for estimating the model parameters. The approach yields a new estimator for the marginal distribution of a single event time with semicompeting-risks data. We conduct both asymptotics and simulation studies to examine the proposed approach in consistency, efficiency, and robustness. Data from the breast cancer program are employed to illustrate this research.
We employ two population‐level experiments to accurately measure opposition to immigration before and after the economic crisis of 2008. Our design explicitly addresses social desirability bias, which is the tendency to give responses that are seen favorably by others and can lead to substantial underreporting of opposition to immigration. We find that overt opposition to immigration, expressed as support for a closed border, increases slightly after the crisis. However, once we account for social desirability bias, no significant increase remains. We conclude that the observed increase in anti‐immigration sentiment in the post‐crisis United States is attributable to greater expression of opposition rather than any underlying change in attitudes. 相似文献
In this article, we propose a novel approach for testing the equality of two log-normal populations using a computational approach test (CAT) that does not require explicit knowledge of the sampling distribution of the test statistic. Simulation studies demonstrate that the proposed approach can perform hypothesis testing with satisfying actual size even at small sample sizes. Overall, it is superior to other existing methods. Also, a CAT is proposed for testing about reliability of two log-normal populations when the means are the same. Simulations show that the actual size of this new approach is close to nominal level and better than the score test. At the end, the proposed methods are illustrated using two examples. 相似文献
Lifetime Data Analysis - Frailty models are generally used to model heterogeneity between the individuals. The distribution of the frailty variable is often assumed to be continuous. However, there... 相似文献
This article considers statistical inference for the heteroscedastic partially linear varying coefficient models. We construct an efficient estimator for the parametric component by applying the weighted profile least-squares approach, and show that it is semiparametrically efficient in the sense that the inverse of the asymptotic variance of the estimator reaches the semiparametric efficiency bound. Simulation studies are conducted to illustrate the performance of the proposed method. 相似文献