Damage models for natural hazards are used for decision making on reducing and transferring risk. The damage estimates from these models depend on many variables and their complex sometimes nonlinear relationships with the damage. In recent years, data‐driven modeling techniques have been used to capture those relationships. The available data to build such models are often limited. Therefore, in practice it is usually necessary to transfer models to a different context. In this article, we show that this implies the samples used to build the model are often not fully representative for the situation where they need to be applied on, which leads to a “sample selection bias.” In this article, we enhance data‐driven damage models by applying methods, not previously applied to damage modeling, to correct for this bias before the machine learning (ML) models are trained. We demonstrate this with case studies on flooding in Europe, and typhoon wind damage in the Philippines. Two sample selection bias correction methods from the ML literature are applied and one of these methods is also adjusted to our problem. These three methods are combined with stochastic generation of synthetic damage data. We demonstrate that for both case studies, the sample selection bias correction techniques reduce model errors, especially for the mean bias error this reduction can be larger than 30%. The novel combination with stochastic data generation seems to enhance these techniques. This shows that sample selection bias correction methods are beneficial for damage model transfer. 相似文献
VOLUNTAS: International Journal of Voluntary and Nonprofit Organizations - While many scholars have postulated the decline of membership influence as an important consequence of the... 相似文献
Empirical Bayes is a versatile approach to “learn from a lot” in two ways: first, from a large number of variables and, second, from a potentially large amount of prior information, for example, stored in public repositories. We review applications of a variety of empirical Bayes methods to several well‐known model‐based prediction methods, including penalized regression, linear discriminant analysis, and Bayesian models with sparse or dense priors. We discuss “formal” empirical Bayes methods that maximize the marginal likelihood but also more informal approaches based on other data summaries. We contrast empirical Bayes to cross‐validation and full Bayes and discuss hybrid approaches. To study the relation between the quality of an empirical Bayes estimator and p, the number of variables, we consider a simple empirical Bayes estimator in a linear model setting. We argue that empirical Bayes is particularly useful when the prior contains multiple parameters, which model a priori information on variables termed “co‐data”. In particular, we present two novel examples that allow for co‐data: first, a Bayesian spike‐and‐slab setting that facilitates inclusion of multiple co‐data sources and types and, second, a hybrid empirical Bayes–full Bayes ridge regression approach for estimation of the posterior predictive interval. 相似文献
A simple connected graph G with 2n vertices is said to be k-extendable for an integer k with \(0<k<n\) if G contains a perfect matching and every matching of cardinality k in G is a subset of some perfect matching. Lakhal and Litzler (Inf Process Lett 65(1):11–16, 1998) discovered a polynomial algorithm that decides whether a bipartite graph is k-extendable. For general graphs, however, it has been an open problem whether there exists a polynomial algorithm. The new result presented in this paper is that the extendability problem is co-NP-complete. 相似文献
Population Research and Policy Review - The welfare state can be perceived as a safety net which helps individuals adjust to situations of risk or transition. Starting from this idea of the welfare... 相似文献
For large cohort studies with rare outcomes, the nested case-control design only requires data collection of small subsets of the individuals at risk. These are typically randomly sampled at the observed event times and a weighted, stratified analysis takes over the role of the full cohort analysis. Motivated by observational studies on the impact of hospital-acquired infection on hospital stay outcome, we are interested in situations, where not necessarily the outcome is rare, but time-dependent exposure such as the occurrence of an adverse event or disease progression is. Using the counting process formulation of general nested case-control designs, we propose three sampling schemes where not all commonly observed outcomes need to be included in the analysis. Rather, inclusion probabilities may be time-dependent and may even depend on the past sampling and exposure history. A bootstrap analysis of a full cohort data set from hospital epidemiology allows us to investigate the practical utility of the proposed sampling schemes in comparison to a full cohort analysis and a too simple application of the nested case-control design, if the outcome is not rare.
Social Indicators Research - This paper analyses the Human Development Index (HDI) time series from 2010 to 2017. An alternative index is studied, which combines the same components of the HDI by... 相似文献
A review of the US ‘program evaluation standards’ (PES), undertaken in a series of workshops and meetings of networks of evaluators in Africa, resulted in modifications to those standards. The result was presented to a plenary session of the Inaugural Conference of the African Evaluation Association in September 1999, attended by over 300 evaluators from 35 countries. The AfrEA Conference decided that a systematic effort should be made to produce a list of African evaluation guidelines, similar to the PES, and that this checklist should be reviewed by national evaluation associations and networks in Africa and field tested in several countries. Ten national and regional networks and associations suggested modifications to the text and endorsed the final version of the guidelines. 相似文献
National Park of Tijuca in Rio de Janeiro (Brazil) is about 3,300 ha and considered the largest urban forest in the world. Its floristic composition is typical of Atlantic Rain Forest. The reserve is being altered because of fire occurrences and urban expansion. This study identified locations and causes of forest fires, and makes management recommendations to restore damaged areas. From 1991 to 2000, forest firefighters recorded an average of 75-fire occurrences/year. Identified causes included hot air balloons (24%), intentional (24%), rubbish burning (21%) and religious practices (17%). Primary fuels included invasive grasses and ferns. Although hot air balloons destroyed larger areas of forest in each occurrence, a greater number of fires started in the invasive vegetation along roads that bisect the forest. In response to recurrent forests, invasive vegetation has spread gradually into the forest increasing forest degradation. To decrease fire damage, sites with high fire frequencies and density of invasive vegetation were planted with less flammable species. Results indicate that fire frequency decreased and density of invasive vegetation declined. This approach appears to prevent fire incidence, reduce the need for fire fighting, and preserve existing biodiversity. 相似文献