期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A Bayesian predictive inference for small area means incorporating covariates and sampling weights

Ma. Criselda S. Toto Balgobin Nandram 《Journal of statistical planning and inference》2010

The main goal in small area estimation is to use models to ‘borrow strength’ from the ensemble because the direct estimates of small area parameters are generally unreliable. However, model-based estimates from the small areas do not usually match the value of the single estimate for the large area. Benchmarking is done by applying a constraint, internally or externally, to ensure that the ‘total’ of the small areas matches the ‘grand total’. This is particularly useful because it is difficult to check model assumptions owing to the sparseness of the data. We use a Bayesian nested error regression model, which incorporates unit-level covariates and sampling weights, to develop a method to internally benchmark the finite population means of small areas. We use two examples to illustrate our method. We also perform a simulation study to further assess the properties of our method. 相似文献

2.

Updating small area population estimates in England and Wales

Simpson S Diamond I Tonkin P Tye R 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》1996,159(2):235-247

"Population estimates have important implications for resource allocation within government and commerce, and are often assumed to be without error. Currently, central government provides annual population estimates for all the local and health authority districts in Britain, but estimates are needed for smaller areas, typically for electoral wards and postal sectors. Small area estimates are provided by some local authorities and commercial organizations, using different methods; the accuracy of these estimates is modelled here within a multilevel framework. Certain characteristics of the small area and of the method of estimation are included as explanatory variables. Results show that the method of estimation used is of great importance." 相似文献

3.

The use of power transformations in small area estimation

Getachew Asfaw Dagne 《Journal of applied statistics》2003,30(4):411-423

Sample surveys are usually designed and analysed to produce estimates for larger areas. Nevertheless, sample sizes are often not large enough to give adequate precision for small area estimates of interest. To overcome such difficulties, borrowing strength from related small areas via modelling becomes essential. In line with this, we propose components of variance models with power transformations for small area estimation. This paper reports the results of a study aimed at incorporating the power transformation in small area estimation for improving the quality of small area predictions. The proposed methods are demonstrated on satellite data in conjunction with survey data to estimate mean acreage under a specified crop for counties in Iowa. 相似文献

4.

Small Area Estimation for Zero-Inflated Data

Hukum Chandra U. C. Sud 《统计学通讯:模拟与计算》2013,42(5):632-643

The commonly used method of small area estimation (SAE) under a linear mixed model may not be efficient if data contain substantial proportion of zeros than would be expected under standard model assumptions (hereafter zero-inflated data). The authors discuss the SAE for zero-inflated data under a two-part random effects model that account for excess zeros in the data. Empirical results show that proposed method for SAE works well and produces an efficient set of small area estimates. An application to real survey data from the National Sample Survey Office of India demonstrates the satisfactory performance of the method. The authors describe a parametric bootstrap method to estimate the mean squared error (MSE) of the proposed estimator of small areas. The bootstrap estimates of the MSE are compared to the true MSE in simulation study. 相似文献

5.

On estimating census undercount in small areas

Isaki CT Schultz LK Diffendal GJ Huang ET 《Journal of official statistics》1988,4(2):95-112

"Net undercount rates in the U.S. decennial census have been steadily declining over the last several censuses. Differential undercounts among race groups and geographic areas, however, appear to persist. In the following, we examine and compare several methodologies for providing small area estimates of census coverage by constructing artificial populations. Measures of performance are also introduced to assess the various small area estimates. Synthetic estimation in combination with regression modelling provide the best results over the methods considered. Sampling error effects are also simulated. The results form the basis for determining coverage evaluation survey small area estimates of the 1900 decennial census." 相似文献

6.

Assessing different uncertainty measures of EBLUP: a resampling-based approach

《Journal of Statistical Computation and Simulation》2012,82(7):713-727

The empirical best linear unbiased prediction approach is a popular method for the estimation of small area parameters. However, the estimation of reliable mean squared prediction error (MSPE) of the estimated best linear unbiased predictors (EBLUP) is a complicated process. In this paper we study the use of resampling methods for MSPE estimation of the EBLUP. A cross-sectional and time-series stationary small area model is used to provide estimates in small areas. Under this model, a parametric bootstrap procedure and a weighted jackknife method are introduced. A Monte Carlo simulation study is conducted in order to compare the performance of different resampling-based measures of uncertainty of the EBLUP with the analytical approximation. Our empirical results show that the proposed resampling-based approaches performed better than the analytical approximation in several situations, although in some cases they tend to underestimate the true MSPE of the EBLUP in a higher number of small areas. 相似文献

7.

A framework for progressively improving small area population estimates 总被引：1，自引：0，他引：1

Philip Rees Paul Norman Dominic Brown 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2004,167(1):5-36

Summary. The paper presents a framework for small area population estimation that enables users to select a method that is fit for the purpose. The adjustments to input data that are needed before use are outlined, with emphasis on developing consistent time series of inputs. We show how geographical harmonization of small areas, which is crucial to comparisons over time, can be achieved. For two study regions, the East of England and Yorkshire and the Humber, the differences in output and consequences of adopting different methods are illustrated. The paper concludes with a discussion of how data, on stream since 1998, might be included in future small area estimates. 相似文献

8.

M-quantile models with application to poverty mapping 总被引：1，自引：0，他引：1

Nikos Tzavidis Nicola Salvati Monica Pratesi Ray Chambers 《Statistical Methods and Applications》2008,17(3):393-411

Over the last decade there has been growing demand for estimates of population characteristics at small area level. Unfortunately, cost constraints in the design of sample surveys lead to small sample sizes within these areas and as a result direct estimation, using only the survey data, is inappropriate since it yields estimates with unacceptable levels of precision. Small area models are designed to tackle the small sample size problem. The most popular class of models for small area estimation is random effects models that include random area effects to account for between area variations. However, such models also depend on strong distributional assumptions, require a formal specification of the random part of the model and do not easily allow for outlier robust inference. An alternative approach to small area estimation that is based on the use of M-quantile models was recently proposed by Chambers and Tzavidis (Biometrika 93(2):255–268, 2006) and Tzavidis and Chambers (Robust prediction of small area means and distributions. Working paper, 2007). Unlike traditional random effects models, M-quantile models do not depend on strong distributional assumption and automatically provide outlier robust inference. In this paper we illustrate for the first time how M-quantile models can be practically employed for deriving small area estimates of poverty and inequality. The methodology we propose improves the traditional poverty mapping methods in the following ways: (a) it enables the estimation of the distribution function of the study variable within the small area of interest both under an M-quantile and a random effects model, (b) it provides analytical, instead of empirical, estimation of the mean squared error of the M-quantile small area mean estimates and (c) it employs a robust to outliers estimation method. The methodology is applied to data from the 2002 Living Standards Measurement Survey (LSMS) in Albania for estimating (a) district level estimates of the incidence of poverty in Albania, (b) district level inequality measures and (c) the distribution function of household per-capita consumption expenditure in each district. Small area estimates of poverty and inequality show that the poorest Albanian districts are in the mountainous regions (north and north east) with the wealthiest districts, which are also linked with high levels of inequality, in the coastal (south west) and southern part of country. We discuss the practical advantages of our methodology and note the consistency of our results with results from previous studies. We further demonstrate the usefulness of the M-quantile estimation framework through design-based simulations based on two realistic survey data sets containing small area information and show that the M-quantile approach may be preferable when the aim is to estimate the small area distribution function. 相似文献

9.

Local stationarity in small area estimation models

Roberto Benedetti Monica Pratesi Nicola Salvati 《Statistical Methods and Applications》2013,22(1):81-95

Small area estimators are often based on linear mixed models under the assumption that relationships among variables are stationary across the area of interest (Fay–Herriot models). This hypothesis is patently violated when the population is divided into heterogeneous latent subgroups. In this paper we propose a local Fay–Herriot model assisted by a Simulated Annealing algorithm to identify the latent subgroups of small areas. The value minimized through the Simulated Annealing algorithm is the sum of the estimated mean squared error (MSE) of the small area estimates. The technique is employed for small area estimates of erosion on agricultural land within the Rathbun Lake Watershed (IA, USA). The results are promising and show that introducing local stationarity in a small area model may lead to useful improvements in the performance of the estimators. 相似文献

10.

Small area estimates of labour force participation under a multinomial logit mixed model

Isabel Molina Ayoub Saei M. José Lombardía 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2007,170(4):975-1000

Summary. A new methodology is developed for estimating unemployment or employment characteristics in small areas, based on the assumption that the sample totals of unemployed and employed individuals follow a multinomial logit model with random area effects. The method is illustrated with UK labour force data aggregated by sex–age groups. For these data, the accuracy of direct estimates is poor in comparison with estimates that are derived from the multinomial logit model. Furthermore, two different estimators of the mean-squared errors are given: an analytical approximation obtained by Taylor linearization and an estimator based on bootstrapping. A simulation study for comparison of the two estimators shows the good performance of the bootstrap estimator. 相似文献

11.

Small area estimation with auxiliary survey data

Sharon L. Lohr N. G. N. Prasad 《Revue canadienne de statistique》2003,31(4):383-396

Large governmental surveys typically provide accurate national statistics. To decrease the mean squared error of estimates for small areas, i.e., domains in which the sample size is small, auxiliary variables from administrative records are often used as covariates in a mixed linear model. It is generally assumed that the auxiliary information is available for every small area. In many cases, though, such information is available for only some of the small areas, either from another survey or from a previous administration of the same survey. The authors propose and study small area estimators that use multivariate models to combine information from several surveys. They discuss computational algorithms, and a simulation study indicates that if quantities in the different surveys are sufficiently correlated, substantial gains in efficiency can be achieved. 相似文献

12.

Properties of the beta regression model for small area estimation of proportions and application to estimation of poverty rates

Ryan Janicki 《统计学通讯:理论与方法》2020,49(9):2264-2284

Abstract

Linear mixed effects models have been popular in small area estimation problems for modeling survey data when the sample size in one or more areas is too small for reliable inference. However, when the data are restricted to a bounded interval, the linear model may be inappropriate, particularly if the data are near the boundary. Nonlinear sampling models are becoming increasingly popular for small area estimation problems when the normal model is inadequate. This paper studies the use of a beta distribution as an alternative to the normal distribution as a sampling model for survey estimates of proportions which take values in (0, 1). Inference for small area proportions based on the posterior distribution of a beta regression model ensures that point estimates and credible intervals take values in (0, 1). Properties of a hierarchical Bayesian small area model with a beta sampling distribution and logistic link function are presented and compared to those of the linear mixed effect model. Propriety of the posterior distribution using certain noninformative priors is shown, and behavior of the posterior mean as a function of the sampling variance and the model variance is described. An example using 2010 Small Area Income and Poverty Estimates (SAIPE) data is given, and a numerical example studying small sample properties of the model is presented. 相似文献

13.

Empirical best linear unbiased and empirical Bayes prediction in multivariate small area estimation

《Journal of statistical planning and inference》1999,75(2):269-279

Small area estimation plays a prominent role in survey sampling due to a growing demand for reliable small area estimates from both public and private sectors. Popularity of model-based inference is increasing in survey sampling, particularly, in small area estimation. The estimates of the small area parameters can profitably ‘borrow strength’ from data on related multiple characteristics and/or auxiliary variables from other neighboring areas through appropriate models. Fay (1987, Small Area Statistics, Wiley, New York, pp. 91–102) proposed multivariate regression for small area estimation of multiple characteristics. The success of this modeling rests essentially on the strength of correlation of these dependent variables. To estimate small area mean vectors of multiple characteristics, multivariate modeling has been proposed in the literature via a multivariate variance components model. We use this approach to empirical best linear unbiased and empirical Bayes prediction of small area mean vectors. We use data from Battese et al. (1988, J. Amer. Statist. Assoc. 83, 28 –36) to conduct a simulation which shows that the multivariate approach may achieve substantial improvement over the usual univariate approach. 相似文献

14.

Information theoretic methods in small domain estimation

Rosa Bernardini Papalia Esteban Fernandez-Vazquez 《Econometric Reviews》2018,37(4):347-359

Small area estimation techniques are becoming increasingly used in survey applications to provide estimates for local areas of interest. The objective of this article is to develop and apply Information Theoretic (IT)-based formulations to estimate small area business and trade statistics. More specifically, we propose a Generalized Maximum Entropy (GME) approach to the problem of small area estimation that exploits auxiliary information relating to other known variables on the population and adjusts for consistency and additivity. The GME formulations, combining information from the sample together with out-of-sample aggregates of the population of interest, can be particularly useful in the context of small area estimation, for both direct and model-based estimators, since they do not require strong distributional assumptions on the disturbances. The performance of the proposed IT formulations is illustrated through real and simulated datasets. 相似文献

15.

Small Area Estimation Using Estimated Population Level Auxiliary Data

Hukum Chandra U. C. Sud Yogita Gharde 《统计学通讯:模拟与计算》2015,44(5):1197-1209

Unit level linear mixed models are often used in small area estimation (SAE), and the empirical best linear unbiased prediction (EBLUP) is widely used for the estimation of small area means under such models. However, EBLUP requires population level auxiliary data, atleast area specific aggregated values. Sometimes population level auxiliary data is either not available or not consistent with the survey data. We describe a SAE method that uses estimated population auxiliary information. Empirical results show that proposed method for SAE produces an efficient set of small area estimates. 相似文献

16.

SMALL‐AREA ESTIMATION OF POVERTY: THE AID INDUSTRY STANDARD AND ITS ALTERNATIVES

S. J. Haslett G. Jones 《Australian & New Zealand Journal of Statistics》2010,52(4):341-362

Small‐area estimation of poverty‐related variables is an increasingly important analytical tool in targeting the delivery of food and other aid in developing countries. We compare two methods for the estimation of small‐area means and proportions, namely empirical Bayes and composite estimation, with what has become the international standard method of Elbers, Lanjouw & Lanjouw (2003) . In addition to differences among the sets of estimates and associated estimated standard errors, we discuss data requirements, design and model selection issues and computational complexity. The Elbers, Lanjouw and Lanjouw (ELL) method is found to produce broadly similar estimates but to have much smaller estimated standard errors than the other methods. The question of whether these standard error estimates are downwardly biased is discussed. Although the question cannot yet be answered in full, as a precautionary measure it is strongly recommended that the ELL model be modified to include a small‐area‐level error component in addition to the cluster‐level and household‐level errors it currently contains. This recommendation is particularly important because the allocation of billions of dollars of aid funding is being determined and monitored via ELL. Under current aid distribution mechanisms, any downward bias in estimates of standard error may lead to allocations that are suboptimal because distinctions are made between estimated poverty levels at the small‐area level that are not significantly different statistically. 相似文献

17.

Spatial robust small area estimation 总被引：1，自引：0，他引：1

Timo Schmid Ralf T. Münnich 《Statistical Papers》2014,55(3):653-670

The accuracy of recent applications in small area statistics in many cases highly depends on the assumed properties of the underlying models and the availability of micro information. In finite population sampling, small sample sizes may increase the sensitivity of the modeling with respect to single units. In these cases, area-specific sample sizes tend to be small such that normal assumptions, even of area means, seem to be violated. Hence, applying robust estimation methods is expected to yield more reliable results. In general, two robust small area methods are applied, the robust EBLUP and the M-quantile method. Additionally, the use of adequate auxiliary information may further increase the accuracy of the estimates. In prediction based approaches where information is needed on universe level, in general, only few variables are available which can be used for modeling. In addition to variables from the dataset, in many cases further information may be available, e.g. geographical information which could indicate spatial dependencies between neighboring areas. This spatial information can be included in the modeling using spatially correlated area effects. Within the paper the classical robust EBLUP is extended to cover spatial area effects via a simultaneous autoregressive model. The performance of the different estimators are compared in a model-based simulation study. 相似文献

18.

Generating synthetic data to produce public-use microdata for small geographic areas based on complex sample survey data with application to the National Health Interview Survey

Joseph W. Sakshaug Trivellore E. Raghunathan 《Journal of applied statistics》2014,41(10):2103-2122

Small area statistics obtained from sample survey data provide a critical source of information used to study health, economic, and sociological trends. However, most large-scale sample surveys are not designed for the purpose of producing small area statistics. Moreover, data disseminators are prevented from releasing public-use microdata for small geographic areas for disclosure reasons; thus, limiting the utility of the data they collect. This research evaluates a synthetic data method, intended for data disseminators, for releasing public-use microdata for small geographic areas based on complex sample survey data. The method replaces all observed survey values with synthetic (or imputed) values generated from a hierarchical Bayesian model that explicitly accounts for complex sample design features, including stratification, clustering, and sampling weights. The method is applied to restricted microdata from the National Health Interview Survey and synthetic data are generated for both sampled and non-sampled small areas. The analytic validity of the resulting small area inferences is assessed by direct comparison with the actual data, a simulation study, and a cross-validation study. 相似文献

19.

On Measuring Uncertainty of Benchmarked Predictors with Application to Disease Risk Estimate

Tatsuya Kubokawa Mana Hasukawa Kunihiko Takahashi 《Scandinavian Journal of Statistics》2014,41(2):394-413

Empirical Bayes (EB) estimates in general linear mixed models are useful for the small area estimation in the sense of increasing precision of estimation of small area means. However, one potential difficulty of EB is that the overall estimate for a larger geographical area based on a (weighted) sum of EB estimates is not necessarily identical to the corresponding direct estimate such as the overall sample mean. Another difficulty is that EB estimates yield over‐shrinking, which results in the sampling variance smaller than the posterior variance. One way to fix these problems is the benchmarking approach based on the constrained empirical Bayes (CEB) estimators, which satisfy the constraints that the aggregated mean and variance are identical to the requested values of mean and variance. In this paper, we treat the general mixed models, derive asymptotic approximations of the mean squared error (MSE) of CEB and provide second‐order unbiased estimators of MSE based on the parametric bootstrap method. These results are applied to natural exponential families with quadratic variance functions. As a specific example, the Poisson‐gamma model is dealt with, and it is illustrated that the CEB estimates and their MSE estimates work well through real mortality data. 相似文献

20.

Small area housing stress estimation in Australia: Calculating confidence intervals for a spatial microsimulation model

Azizur Rahman 《统计学通讯:模拟与计算》2017,46(9):7466-7484

This study provides small area housing stress estimates by tenure type in Australia with a way of calculating confidence intervals for a spatial microsimulation model. Findings reveal that prevalence of housing stress for private-renter, buyer, public-renter and owner households are 59.6%, 33.2%, 6.9%, and 0.3%, respectively. Almost two-thirds of these households are located in statistical local areas (SLAs) in eight capital cities, and a large number of them are in Sydney and Melbourne. Estimates for private renters and buyers are significantly high in some capitals and southeast coastal regions. About 95.7% of SLAs show accurate estimates with narrow confidence intervals. 相似文献