Damage models for natural hazards are used for decision making on reducing and transferring risk. The damage estimates from these models depend on many variables and their complex sometimes nonlinear relationships with the damage. In recent years, data‐driven modeling techniques have been used to capture those relationships. The available data to build such models are often limited. Therefore, in practice it is usually necessary to transfer models to a different context. In this article, we show that this implies the samples used to build the model are often not fully representative for the situation where they need to be applied on, which leads to a “sample selection bias.” In this article, we enhance data‐driven damage models by applying methods, not previously applied to damage modeling, to correct for this bias before the machine learning (ML) models are trained. We demonstrate this with case studies on flooding in Europe, and typhoon wind damage in the Philippines. Two sample selection bias correction methods from the ML literature are applied and one of these methods is also adjusted to our problem. These three methods are combined with stochastic generation of synthetic damage data. We demonstrate that for both case studies, the sample selection bias correction techniques reduce model errors, especially for the mean bias error this reduction can be larger than 30%. The novel combination with stochastic data generation seems to enhance these techniques. This shows that sample selection bias correction methods are beneficial for damage model transfer. 相似文献
VOLUNTAS: International Journal of Voluntary and Nonprofit Organizations - Age has long been understood as a strong demographic determinant of volunteering. However, to date, limited literature... 相似文献
Motivated by a breast cancer research program, this paper is concerned with the joint survivor function of multiple event times when their observations are subject to informative censoring caused by a terminating event. We formulate the correlation of the multiple event times together with the time to the terminating event by an Archimedean copula to account for the informative censoring. Adapting the widely used two-stage procedure under a copula model, we propose an easy-to-implement pseudo-likelihood based procedure for estimating the model parameters. The approach yields a new estimator for the marginal distribution of a single event time with semicompeting-risks data. We conduct both asymptotics and simulation studies to examine the proposed approach in consistency, efficiency, and robustness. Data from the breast cancer program are employed to illustrate this research.
A common assertion in the nonprofit literature is that nonprofit organizations can become more efficient, effective, and sustainable by embracing social entrepreneurship in their operational and strategic posture. In this article, we examine whether the mere label of social entrepreneurship results—with no actual organizational differences—in an increase in positive attributions associated with a nonprofit organization, an effect we call the social entrepreneurship bias. We experimentally test for the existence of a social entrepreneurship bias by examining how the label of social entrepreneurship alters how people judge a nonprofit’s effectiveness and decide how to allocate scarce donation funds.
Lifetime Data Analysis - Frailty models are generally used to model heterogeneity between the individuals. The distribution of the frailty variable is often assumed to be continuous. However, there... 相似文献
Sense of community (SOC) is associated with the quality of community life and the building of social capital. While its linkage to informal social behavior, such as neighboring, is inherent in discussions regarding theory, empirical evidence remains scarce. Moreover, the degree to which neighboring behavior influences SOC over time is largely unknown. Using a latent transition analysis, the effect of neighboring on SOC was investigated over a 5-year span from 2006 to 2011 among a sample of adults (n?=?165) in Arizona. Initially, a latent class analysis identified two SOC subgroups: Low SOC and High SOC. The likelihood of shifts in SOC class membership over 5 years was generally stable, with most individuals staying in the same group (82.3% Low SOC; 92.4% High SOC). Neighboring behavior and socio-demographic covariates impacted the likelihood that individuals changed classes, with 25.3% of Low SOC individuals transitioning to High SOC in 2011 and 55.4% of High SOC individuals moving to Low SOC in 2011. Specifically, having an income greater than $60,000 and visiting with neighbors lessened the likelihood of being in the Low SOC class in 2006; and length of residence and exchanging favors with neighbors lessened the likelihood of being in the Low SOC class in 2011. These findings have implications for both community design and community development practice. Design and development interventions that promote greater social interaction may help build and sustain SOC over time.
Researchers have been developing various extensions and modified forms of the Weibull distribution to enhance its capability for modeling and fitting different data sets. In this note, we investigate the potential usefulness of the new modification to the standard Weibull distribution called odd Weibull distribution in income economic inequality studies. Some mathematical and statistical properties of this model are proposed. We obtain explicit expressions for the first incomplete moment, quantile function, Lorenz and Zenga curves and related inequality indices. In addition to the well-known stochastic order based on Lorenz curve, the stochastic order based on Zenga curve is considered. Since the new generalized Weibull distribution seems to be suitable to model wealth, financial, actuarial and especially income distributions, these findings are fundamental in the understanding of how parameter values are related to inequality. Also, the estimation of parameters by maximum likelihood and moment methods is discussed. Finally, this distribution has been fitted to United States and Austrian income data sets and has been found to fit remarkably well in compare with the other widely used income models. 相似文献
Urban Ecosystems - The development of urban areas imposes challenges that wildlife must adapt to in order to persist in these new habitats. One of the greatest changes brought by urbanization has... 相似文献
ABSTRACTThe cost and time of pharmaceutical drug development continue to grow at rates that many say are unsustainable. These trends have enormous impact on what treatments get to patients, when they get them and how they are used. The statistical framework for supporting decisions in regulated clinical development of new medicines has followed a traditional path of frequentist methodology. Trials using hypothesis tests of “no treatment effect” are done routinely, and the p-value < 0.05 is often the determinant of what constitutes a “successful” trial. Many drugs fail in clinical development, adding to the cost of new medicines, and some evidence points blame at the deficiencies of the frequentist paradigm. An unknown number effective medicines may have been abandoned because trials were declared “unsuccessful” due to a p-value exceeding 0.05. Recently, the Bayesian paradigm has shown utility in the clinical drug development process for its probability-based inference. We argue for a Bayesian approach that employs data from other trials as a “prior” for Phase 3 trials so that synthesized evidence across trials can be utilized to compute probability statements that are valuable for understanding the magnitude of treatment effect. Such a Bayesian paradigm provides a promising framework for improving statistical inference and regulatory decision making. 相似文献
Theory and Society - The massive expansion of US higher education after World War II is a sociological puzzle: a spectacular feat of state capacity-building in a highly federated polity. Prior... 相似文献