期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Hierarchical Bayes GLMs for the analysis of spatial data: An application to disease mapping

《Journal of statistical planning and inference》1999,75(2):305-318

This paper considers estimation of cancer incidence rates for local areas. The raw estimates usually are based on small sample sizes, and hence are usually unreliable. A hierarchical Bayes generalized linear model approach is taken which connects the local areas, thereby enabling one to ‘borrow strength’. Random effects with pairwise difference priors model the spatial structure in the data. The methods are applied to cancer incidence estimation for census tracts in a certain region of the state of New York. 相似文献

2.

Modelling map positional error to infer true feature location

Jarrett J. Barber Alan E. Gelfand John A. Silander 《Revue canadienne de statistique》2006,34(4):659-676

The authors consider the issue of map positional error, or the difference between location as represented in a spatial database (i.e., a map) and the corresponding unobservable true location. They propose a fully model‐based approach that incorporates aspects of the map registration process commonly performed by users of geographic informations systems, including rubber‐sheeting. They explain how estimates of positional error can be obtained, hence estimates of true location. They show that with multiple maps of varying accuracy along with ground truthing data, suitable model averaging offers a strategy for using all of the maps to learn about true location. 相似文献

3.

Direct parametric inference for the cumulative incidence function

Jong-Hyeon Jeong Jason Fine 《Journal of the Royal Statistical Society. Series C, Applied statistics》2006,55(2):187-200

Summary. In survival data that are collected from phase III clinical trials on breast cancer, a patient may experience more than one event, including recurrence of the original cancer, new primary cancer and death. Radiation oncologists are often interested in comparing patterns of local or regional recurrences alone as first events to identify a subgroup of patients who need to be treated by radiation therapy after surgery. The cumulative incidence function provides estimates of the cumulative probability of locoregional recurrences in the presence of other competing events. A simple version of the Gompertz distribution is proposed to parameterize the cumulative incidence function directly. The model interpretation for the cumulative incidence function is more natural than it is with the usual cause-specific hazard parameterization. Maximum likelihood analysis is used to estimate simultaneously parametric models for cumulative incidence functions of all causes. The parametric cumulative incidence approach is applied to a data set from the National Surgical Adjuvant Breast and Bowel Project and compared with analyses that are based on parametric cause-specific hazard models and nonparametric cumulative incidence estimation. 相似文献

4.

Exploring spatial dependence in area-level random effect model for disaggregate-level crop yield estimation

Hukum Chandra 《Journal of applied statistics》2013,40(4):823-842

This paper describes an application of small area estimation (SAE) techniques under area-level spatial random effect models when only area (or district or aggregated) level data are available. In particular, the SAE approach is applied to produce district-level model-based estimates of crop yield for paddy in the state of Uttar Pradesh in India using the data on crop-cutting experiments supervised under the Improvement of Crop Statistics scheme and the secondary data from the Population Census. The diagnostic measures are illustrated to examine the model assumptions as well as reliability and validity of the generated model-based small area estimates. The results show a considerable gain in precision in model-based estimates produced applying SAE. Furthermore, the model-based estimates obtained by exploiting spatial information are more efficient than the one obtained by ignoring this information. However, both of these model-based estimates are more efficient than the direct survey estimate. In many districts, there is no survey data and therefore it is not possible to produce direct survey estimates for these districts. The model-based estimates generated using SAE are still reliable for such districts. These estimates produced by using SAE will provide invaluable information to policy-analysts and decision-makers. 相似文献

5.

Accounting for inaccuracies in population counts and case registration in cancer mapping studies 总被引：1，自引：0，他引：1

N. Best & J. Wakefield 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》1999,162(3):363-382

相似文献

6.

Small area estimation of proportions under a spatial dependent aggregated level random effects model

Hukum Chandra Nicola Salvati 《统计学通讯:理论与方法》2018,47(5):1234-1255

This paper describes small area estimation (SAE) of proportions under a spatial dependent generalized linear mixed model using aggregated level data. The SAE is also applied to produce reliable district level estimates and mapping of incidence of indebtedness in the State of Uttar Pradesh in India using debt and investment survey data collected by National Sample Survey Office (NSSO) and the secondary data from the Census. The results show a significant improvement in precision of model-based estimates generated by SAE as compared to direct estimates. The estimates generated by incorporating spatial information are more efficient than the one generated by ignoring this information. 相似文献

7.

Multivariate meta-analysis for data consortia,individual patient meta-analysis,and pooling projects

John Ritz Eugene Demidenko Donna Spiegelman 《Journal of statistical planning and inference》2008

We discuss maximum likelihood and estimating equations methods for combining results from multiple studies in pooling projects and data consortia using a meta-analysis model, when the multivariate estimates with their covariance matrices are available. The estimates to be combined are typically regression slopes, often from relative risk models in biomedical and epidemiologic applications. We generalize the existing univariate meta-analysis model and investigate the efficiency advantages of the multivariate methods, relative to the univariate ones. We generalize a popular univariate test for between-studies homogeneity to a multivariate test. The methods are applied to a pooled analysis of type of carotenoids in relation to lung cancer incidence from seven prospective studies. In these data, the expected gain in efficiency was evident, sometimes to a large extent. Finally, we study the finite sample properties of the estimators and compare the multivariate ones to their univariate counterparts. 相似文献

8.

Methods for Quantitative Risk Assessment Using Occupational Studies

Kenny S. Crump Bruce C. Allen 《The American statistician》2013,67(4):442-450

Although carcinogenic risk assessment is frequently based on animal bioassay data, occupational studies are generally considered the best source of data for quantitative risk estimation. The model selected for use with occupational study data is required to extrapolate on age, exposure level, and temporal exposure pattern. Relative and absolute risk models are examined, as are alternatives for the definition of a dose (exposure) variable. The models express disease incidence as a function of the chosen exposure variable and convert incidence into estimates of lifetime risk. In this form, predictions of the models can be compared. The methods are illustrated using three examples: arsenic exposure and respiratory cancer, leukemia associated with benzene exposure, and asbestos-induced mesothelioma. 相似文献

9.

Spatial Modeling of Habitat Preferences of Biological Species using Markov Random Fields 总被引：1，自引：0，他引：1

Carlos Díaz Avalos 《Journal of applied statistics》2007,34(7):807-821

Spatial modeling has gained interest in ecology during the past two decades, especially in the area of biodiversity, where reliable distribution maps are required. Several methods have been proposed to construct distribution maps, most of them acknowledging the presence of spatial interactions. In many cases, a key problem is the lack of true absence data. We present here a model suitable for use when true absence data are missing. The quality of the estimates obtained from the model is evaluated using ROC curve analysis as well as a quadratic cost function, computed from the false positive and false negative error rates. The model is also tested under random and clustered scattering of the presence records. We also present an application of the model to the construction of distribution maps of two endemic bird species in México. 相似文献

10.

Influence of light level on the incidence of road casualties and the predicted effect of changing 'summertime'

J. Broughton M. Hazelton & M. Stone 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》1999,162(2):137-175

Previous studies of the apparent influence of daylight level and hour changes on the incidence of road casualties are reviewed and refined, by analysis of official databases for Great Britain (1969–1973 and 1985–1994) and the USA (1991–1995). New statistical methods, based on precisely computed altitudes of the sun for each accident location, are used to model casualty frequencies aggregated by week and hour of day, and locally evaluated associations between individual casualty incidence and solar altitude. Estimates of the altitude factor are interpreted causally to give counterfactual estimates of the effect of different clock time schedules on countrywide casualty numbers. 相似文献

11.

A Case for Better Graphics: The Unclassed Choropleth Map

Nathan Gale William C. Halperin 《The American statistician》2013,67(4):330-336

In recent years a number of researchers have shown a strong interest in statistical graphics. One widely used graphical method is the “statistical map,” or what is better known in cartography as the choropleth map. A factor in the use of these maps has been the need to group data into classes, raising the obvious question of defining optimum class intervals. Computer technology now makes it possible to produce unclassed choropleth maps, minimizing quantization error. An example of the two methods is given using criminal justice statistics. The unclassed maps are more accurate with regard to quantization error and appear to be of superior visual quality 相似文献

12.

Numerical experiments with the thin plate histospline

Grace Wahba 《统计学通讯:理论与方法》2013,42(24):2475-2514

The thin plate volume matching and volume smoothing histosplines are described. These histosplines are suitable for estimating densities or incidence rates as a function of position on the plane when data is aggregated by area, for example by county. We give a numerical algorithm for the volume matching histospline and for the volume smoothing histospline using generalized cross validation to estimate the smoothing parameter. Some numerical experiments were performed using synthetic data, population data and SMR's (standardized mortality ratios) aggregated by county over the state of Wisconsin. The method turns out to be not particularly suited for obtaining population density maps where the population density can vary by two orders of magnitude, because the histospline can be negative in 相似文献

13.

Generalised Variance Function Estimation for Binary Variables in Large‐Scale Sample Surveys

Ricardo Cao José A. Vilar Juan M. Vilar 《Australian & New Zealand Journal of Statistics》2012,54(3):301-324

Generalised variance function (GVF) models are data analysis techniques often used in large‐scale sample surveys to approximate the design variance of point estimators for population means and proportions. Some potential advantages of the GVF approach include operational simplicity, more stable sampling errors estimates and providing a convenient method of summarising results when a high number of survey variables is considered. In this paper, several parametric and nonparametric methods for GVF estimation with binary variables are proposed and compared. The behavior of these estimators is analysed under heteroscedasticity and in the presence of outliers and influential observations. An empirical study based on the annual survey of living conditions in Galicia (a region in the northwest of Spain) illustrates the behaviour of the proposed estimators. 相似文献

14.

Estimating Lifetime and Age-Conditional Probabilities of Developing Cancer 总被引：1，自引：0，他引：1

Lap-Ming Wun Ray M. Merrill Eric J. Feuer 《Lifetime data analysis》1998,4(2):169-186

相似文献

15.

Simple methods for ecological inference in 2×2 tables

R. L. Chambers & D. G. Steel 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2001,164(1):175-192

This paper considers inference about the individual level relationship between two dichotomous variables based on aggregated data. It is known that such analyses suffer from 'ecological bias', caused by the lack of homogeneity of this relationship across the groups over which the aggregation occurs. Two new methods for overcoming this bias, one based on local smoothing and the other a simple semiparametric approach, are developed and evaluated. The local smoothing approach performs best when it is used with a covariate which accounts for some of the variation in the relationships across groups. The semiparametric approach performed well in our evaluation even without such auxiliary information 相似文献

16.

The Risk of a Mortality Catastrophe

Daniel Bauer Florian Kramer 《商业与经济统计学杂志》2016,34(3):391-405

We develop a continuous-time model for analyzing and valuing catastrophe mortality contingent claims based on stochastic modeling of the force of mortality. We derive parameter estimates from a 105-year time series of U.S. population mortality data using a simulated maximum likelihood approach based on a particle filter. Relying on the resulting parameters, we calculate loss profiles for a representative catastrophe mortality transaction and compare them to the “official” loss profiles that are provided by the issuers to investors and rating agencies. We find that although the loss profiles are subject to great uncertainties, the official figures fall significantly below the corresponding risk statistics based on our model. In particular, we find that the annualized incidence probability of a mortality catastrophe, defined as a 15% increase in aggregated mortality probabilities, is about 1.4%—compared to about 0.1% according to the official loss profiles. 相似文献

17.

Models for surveillance data under reporting delay: applications to US veteran first-time suicide attempters

Y. Xia N. Lu I. Katz R. Bossarte J. Arora H. He 《Journal of applied statistics》2015,42(9):1861-1876

Surveillance data provide a vital source of information for assessing the spread of a health problem or disease of interest and for planning for future health-care needs. However, the use of surveillance data requires proper adjustments of the reported caseload due to underreporting caused by reporting delays within a limited observation period. Although methods are available to address this classic statistical problem, they are largely focused on inference for the reporting delay distribution, with inference about caseload of disease incidence based on estimates for the delay distribution. This approach limits the complexity of models for disease incidence to provide reliable estimates and projections of incidence. Also, many of the available methods lack robustness since they require parametric distribution assumptions. We propose a new approach to overcome such limitations by allowing for separate models for the incidence and the reporting delay in a distribution-free fashion, but with joint inference for both modeling components, based on functional response models. In addition, we discuss inference about projections of future disease incidence to help identify significant shifts in temporal trends modeled based on the observed data. This latter issue on detecting ‘change points’ is not sufficiently addressed in the literature, despite the fact that such warning signs of potential outbreak are critically important for prevention purposes. We illustrate the approach with both simulated and real data, with the latter involving data for suicide attempts from the Veteran Healthcare Administration. 相似文献

18.

Bayesian age-stratified joinpoint regression model: an application to lung and brain cancer mortality

Ram C. Kafle Netra Khanal Chris P. Tsokos 《Journal of applied statistics》2014,41(12):2727-2742

Joinpoint regression model identifies significant changes in the trends of the incidence, mortality, and survival of a specific disease in a given population. The purpose of the present study is to develop an age-stratified Bayesian joinpoint regression model to describe mortality trend assuming that the observed counts are probabilistically characterized by the Poisson distribution. The proposed model is based on Bayesian model selection criteria with the smallest number of joinpoints that are sufficient to explain the Annual Percentage Change. The prior probability distributions are chosen in such a way that they are automatically derived from the model index contained in the model space. The proposed model and methodology estimates the age-adjusted mortality rates in different epidemiological studies to compare the trends by accounting the confounding effects of age. In developing the subject methods, we use the cancer mortality counts of adult lung and bronchus cancer, and brain and other Central Nervous System cancer patients obtained from the Surveillance Epidemiology and End Results data base of the National Cancer Institute. 相似文献

19.

Random sliced inverse regression

Haileab Hilafu 《统计学通讯:模拟与计算》2017,46(5):3516-3526

Sliced Inverse Regression (SIR; 1991) is a dimension reduction method for reducing the dimension of the predictors without losing regression information. The implementation of SIR requires inverting the covariance matrix of the predictors—which has hindered its use to analyze high-dimensional data where the number of predictors exceed the sample size. We propose random sliced inverse regression (rSIR) by applying SIR to many bootstrap samples, each using a subset of randomly selected candidate predictors. The final rSIR estimate is obtained by aggregating these estimates. A simple variable selection procedure is also proposed using these bootstrap estimates. The performance of the proposed estimates is studied via extensive simulation. Application to a dataset concerning myocardial perfusion diagnosis from cardiac Single Proton Emission Computed Tomography (SPECT) images is presented. 相似文献

20.

Small area estimates of labour force participation under a multinomial logit mixed model

Isabel Molina Ayoub Saei M. José Lombardía 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2007,170(4):975-1000

Summary. A new methodology is developed for estimating unemployment or employment characteristics in small areas, based on the assumption that the sample totals of unemployed and employed individuals follow a multinomial logit model with random area effects. The method is illustrated with UK labour force data aggregated by sex–age groups. For these data, the accuracy of direct estimates is poor in comparison with estimates that are derived from the multinomial logit model. Furthermore, two different estimators of the mean-squared errors are given: an analytical approximation obtained by Taylor linearization and an estimator based on bootstrapping. A simulation study for comparison of the two estimators shows the good performance of the bootstrap estimator. 相似文献