期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Multivariate modelling of spatial extremes based on copulas

Raymond K. S. Chan 《Journal of Statistical Computation and Simulation》2018,88(12):2404-2424

To model extreme spatial events, a general approach is to use the generalized extreme value (GEV) distribution with spatially varying parameters such as spatial GEV models and latent variable models. In the literature, this approach is mostly used to capture spatial dependence for only one type of event. This limits the applications to air pollutants data as different pollutants may chemically interact with each other. A recent advancement in spatial extremes modelling for multiple variables is the multivariate max-stable processes. Similarly to univariate max-stable processes, the multivariate version also assumes standard distributions such as unit-Fréchet as margins. Additional modelling is required for applications such as spatial prediction. In this paper, we extend the marginal methods such as spatial GEV models and latent variable models into a multivariate setting based on copulas so that it is capable of handling both the spatial dependence and the dependence among multiple pollutants. We apply our proposed model to analyse weekly maxima of nitrogen dioxide, sulphur dioxide, respirable suspended particles, fine suspended particles, and ozone collected in Pearl River Delta in China. 相似文献

2.

Fast and robust bootstrap 总被引：1，自引：0，他引：1

Matías Salibián-Barrera Stefan Van Aelst Gert Willems 《Statistical Methods and Applications》2008,17(1):41-71

In this paper we review recent developments on a bootstrap method for robust estimators which is computationally faster and more resistant to outliers than the classical bootstrap. This fast and robust bootstrap method is, under reasonable regularity conditions, asymptotically consistent. We describe the method in general and then consider its application to perform inference based on robust estimators for the linear regression and multivariate location-scatter models. In particular, we study confidence and prediction intervals and tests of hypotheses for linear regression models, inference for location-scatter parameters and principal components, and classification error estimation for discriminant analysis. 相似文献

3.

Bayes model averaging with selection of regressors

P. J. Brown M. Vannucci T. Fearn 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2002,64(3):519-536

Summary. When a number of distinct models contend for use in prediction, the choice of a single model can offer rather unstable predictions. In regression, stochastic search variable selection with Bayesian model averaging offers a cure for this robustness issue but at the expense of requiring very many predictors. Here we look at Bayes model averaging incorporating variable selection for prediction. This offers similar mean-square errors of prediction but with a vastly reduced predictor space. This can greatly aid the interpretation of the model. It also reduces the cost if measured variables have costs. The development here uses decision theory in the context of the multivariate general linear model. In passing, this reduced predictor space Bayes model averaging is contrasted with single-model approximations. A fast algorithm for updating regressions in the Markov chain Monte Carlo searches for posterior inference is developed, allowing many more variables than observations to be contemplated. We discuss the merits of absolute rather than proportionate shrinkage in regression, especially when there are more variables than observations. The methodology is illustrated on a set of spectroscopic data used for measuring the amounts of different sugars in an aqueous solution. 相似文献

4.

Small Area Estimation via Multivariate Fay–Herriot Models with Latent Spatial Dependence

下载免费PDF全文

Aaron T. Porter Christopher K. Wikle Scott H. Holan 《Australian & New Zealand Journal of Statistics》2015,57(1):15-29

The Fay–Herriot model is a standard model for direct survey estimators in which the true quantity of interest, the superpopulation mean, is latent and its estimation is improved through the use of auxiliary covariates. In the context of small area estimation, these estimates can be further improved by borrowing strength across spatial regions or by considering multiple outcomes simultaneously. We provide here two formulations to perform small area estimation with Fay–Herriot models that include both multivariate outcomes and latent spatial dependence. We consider two model formulations. In one of these formulations the outcome‐by‐space dependence structure is separable. The other accounts for the cross dependence through the use of a generalized multivariate conditional autoregressive (GMCAR) structure. The GMCAR model is shown, in a state‐level example, to produce smaller mean square prediction errors, relative to equivalent census variables, than the separable model and the state‐of‐the‐art multivariate model with unstructured dependence between outcomes and no spatial dependence. In addition, both the GMCAR and the separable models give smaller mean squared prediction error than the state‐of‐the‐art model when conducting small area estimation on county level data from the American Community Survey. 相似文献

5.

Goodness-of-link tests for multivariate regression models

José M. R. Murteira 《统计学通讯:理论与方法》2013,42(24):7367-7375

ABSTRACT

This note presents an approximation to multivariate regression models which is obtained from a first-order series expansion of the multivariate link function. The proposed approach yields a variable-addition approximation of regression models that enables a multivariate generalization of the well-known goodness-of-link specification test, available for univariate generalized linear models. Application of this general methodology is illustrated with models of multinomial discrete choice and multivariate fractional data, in which context it is shown to lead to well-established approximation and testing procedures. 相似文献

6.

Hierarchical Bayesian bivariate disease mapping: analysis of children and adults asthma visits to hospital

Mahmoud Torabi 《Journal of applied statistics》2014,41(3):612-621

In spatial epidemiology, detecting areas with high ratio of disease is important as it may lead to identifying risk factors associated with disease. This in turn may lead to further epidemiological investigations into the nature of disease. Disease mapping studies have been widely performed with considering only one disease in the estimated models. Simultaneous modelling of different diseases can also be a valuable tool both from the epidemiological and also from the statistical point of view. In particular, when we have several measurements recorded at each spatial location, one can consider multivariate models in order to handle the dependence among the multivariate components and the spatial dependence between locations. In this paper, spatial models that use multivariate conditionally autoregressive smoothing across the spatial dimension are considered. We study the patterns of incidence ratios and identify areas with consistently high ratio estimates as areas for further investigation. A hierarchical Bayesian approach using Markov chain Monte Carlo techniques is employed to simultaneously examine spatial trends of asthma visits by children and adults to hospital in the province of Manitoba, Canada, during 2000–2010. 相似文献

7.

Bayesian joint modeling of correlated counts data with application to adverse birth outcomes

Cindy Xin Feng 《Journal of applied statistics》2015,42(6):1206-1222

In disease mapping, health outcomes measured at the same spatial locations may be correlated, so one can consider joint modeling the multivariate health outcomes accounting for their dependence. The general approaches often used for joint modeling include shared component models and multivariate models. An alternative way to model the association between two health outcomes, when one outcome can naturally serve as a covariate of the other, is to use ecological regression model. For example, in our application, preterm birth (PTB) can be treated as a predictor for low birth weight (LBW) and vice versa. Therefore, we proposed to blend the ideas from joint modeling and ecological regression methods to jointly model the relative risks for LBW and PTBs over the health districts in Saskatchewan, Canada, in 2000–2010. This approach is helpful when proxy of areal-level contextual factors can be derived based on the outcomes themselves when direct information on risk factors are not readily available. Our results indicate that the proposed approach improves the model fit when compared with the conventional joint modeling methods. Further, we showed that when no strong spatial autocorrelation is present, joint outcome modeling using only independent error terms can still provide a better model fit when compared with the separate modeling. 相似文献

8.

Bayesian Analysis of Latent Threshold Dynamic Models

Jouchi Nakajima Mike West 《商业与经济统计学杂志》2013,31(2):151-164

We discuss a general approach to dynamic sparsity modeling in multivariate time series analysis. Time-varying parameters are linked to latent processes that are thresholded to induce zero values adaptively, providing natural mechanisms for dynamic variable inclusion/selection. We discuss Bayesian model specification, analysis and prediction in dynamic regressions, time-varying vector autoregressions, and multivariate volatility models using latent thresholding. Application to a topical macroeconomic time series problem illustrates some of the benefits of the approach in terms of statistical and economic interpretations as well as improved predictions. Supplementary materials for this article are available online. 相似文献

9.

Semiparametric model average prediction in panel data analysis

Tao Huang 《Journal of nonparametric statistics》2018,30(1):125-144

Forecasting in economic data analysis is dominated by linear prediction methods where the predicted values are calculated from a fitted linear regression model. With multiple predictor variables, multivariate nonparametric models were proposed in the literature. However, empirical studies indicate the prediction performance of multi-dimensional nonparametric models may be unsatisfactory. We propose a new semiparametric model average prediction (SMAP) approach to analyse panel data and investigate its prediction performance with numerical examples. Estimation of individual covariate effect only requires univariate smoothing and thus may be more stable than previous multivariate smoothing approaches. The estimation of optimal weight parameters incorporates the longitudinal correlation and the asymptotic properties of the estimated results are carefully studied in this paper. 相似文献

10.

Maximum likelihood estimates in the multivariate normal with patterned mean and covariance via the em algorithm

Dal ton F Andrade Ronald W Helms 《统计学通讯:理论与方法》2013,42(18):2239-2251

The maximum likelihood equations for a multivariate normal model with structured mean and structured covariance matrix may not have an explicit solution. In some cases the model's error term may be decomposed as the sum of two independent error terms, each having a patterned covariance matrix, such that if one of the unobservable error terms is artificially treated as "missing data", the EM algorithm can be used to compute the maximum likelihood estimates for the original problem. Some decompositions produce likelihood equations which do not have an explicit solution at each iteration of the EM algorithm, but within-iteration explicit solutions are shown for two general classes of models including covariance component models used for analysis of longitudinal data. 相似文献

11.

Strong consistency of presmoothed Kaplan–Meier integrals when covariables are present

Jacobo de Uña-Álvarez M. Celia Rodríguez-Campos 《Statistics》2013,47(6):483-496

It is known that the Kaplan–Meier estimation may be improved via presmoothing methods. In this article, we introduce an extended presmoothed Kaplan–Meier estimator in the presence of covariates. The main result is the strong consistency of general empirical integrals based on such an estimator. As applications, one can obtain a consis-tent multivariate empirical distribution under censoring, and also can obtain a consistent estimation of regression parameters. We illustrate the new estimation methods through simulations and real data analysis. 相似文献

12.

Diagnostics in multivariate generalized Birnbaum-Saunders regression models

Carolina Marchant Francisco José A. Cysneiros Juan F. Vivanco 《Journal of applied statistics》2016,43(15):2829-2849

Birnbaum–Saunders (BS) models are receiving considerable attention in the literature. Multivariate regression models are a useful tool of the multivariate analysis, which takes into account the correlation between variables. Diagnostic analysis is an important aspect to be considered in the statistical modeling. In this paper, we formulate multivariate generalized BS regression models and carry out a diagnostic analysis for these models. We consider the Mahalanobis distance as a global influence measure to detect multivariate outliers and use it for evaluating the adequacy of the distributional assumption. We also consider the local influence approach and study how a perturbation may impact on the estimation of model parameters. We implement the obtained results in the R software, which are illustrated with real-world multivariate data to show their potential applications. 相似文献

13.

Invariant co-ordinate selection

David E. Tyler Frank Critchley Lutz Dümbgen Hannu Oja 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2009,71(3):549-592

Summary. A general method for exploring multivariate data by comparing different estimates of multivariate scatter is presented. The method is based on the eigenvalue–eigenvector decomposition of one scatter matrix relative to another. In particular, it is shown that the eigenvectors can be used to generate an affine invariant co-ordinate system for the multivariate data. Consequently, we view this method as a method for invariant co-ordinate selection . By plotting the data with respect to this new invariant co-ordinate system, various data structures can be revealed. For example, under certain independent components models, it is shown that the invariant co- ordinates correspond to the independent components. Another example pertains to mixtures of elliptical distributions. In this case, it is shown that a subset of the invariant co-ordinates corresponds to Fisher's linear discriminant subspace, even though the class identifications of the data points are unknown. Some illustrative examples are given. 相似文献

14.

A family of models for uniform and serial dependence in repeated measurements studies

J. K. Lindsey 《Journal of the Royal Statistical Society. Series C, Applied statistics》2000,49(3):343-357

Data arising from a randomized double-masked clinical trial for multiple sclerosis have provided particularly variable longitudinal repeated measurements responses. Specific models for such data, other than those based on the multivariate normal distribution, would be a valuable addition to the applied statistician's toolbox. A useful family of multivariate distributions can be generated by substituting the integrated intensity of one distribution into a second (outer) distribution. The parameters in the second distribution are then used to create a dependence structure among observations on a unit. These may either be a form of serial dependence for longitudinal data or of uniform dependence within clusters. These are respectively analogous to the Kalman filter of state space models and to copulas, but they have the major advantage that they do not require any explicit integration. One useful outer distribution for constructing such multivariate distributions is the Pareto distribution. Certain special models based on it have previously been used in event history analysis, but those considered here have much wider application. 相似文献

15.

A test of the missing data mechanism for repeated measures data

Taesung Park Seungyeoun Lee Robert F. Woolson 《统计学通讯:理论与方法》2013,42(10):2813-2829

The occurrence of missing data is an often unavoidable consequence of repeated measures studies. Fortunately, multivariate general linear models such as growth curve models and linear mixed models with random effects have been well developed to analyze incomplete normally-distributed repeated measures data. Most statistical methods have assumed that the missing data occur at random. This assumption may include two types of missing data mechanism: missing completely at random (MCAR) and missing at random (MAR) in the sense of Rubin (1976). In this paper, we develop a test procedure for distinguishing these two types of missing data mechanism for incomplete normally-distributed repeated measures data. The proposed test is similar in spiril to the test of Park and Davis (1992). We derive the test for incomplete normally-distribrlted repeated measures data using linear mixed models. while Park and Davis (1992) cleirved thr test for incomplete repeatctl categorical data in the framework of Grizzle Starmer. and Koch (1969). Thr proposed procedure can be applied easily to any other multivariate general linear model which allow for missing data. The test is illustrated using the hip-replacernent patient.data from Crowder and Hand (1990). 相似文献

16.

Distributional aspects in latent variable models

M. Kukuk 《Statistical Papers》1994,35(1):231-242

For observable indicators with ordered categories one can assume underlying latent variables following certain marginal distributions. Transforming the latent variables changes its marginal distributions but not the observable qualitative indicators. The joint distribution of the latent variables can be constructed from the marginal distributions. There is a broad class of multivariate distributions for which the observable indicators are equivalent. By choosing the multivariate normal distribution from this class we can analyse a linear relationship between the transformed latent variables. This leads to latent structural equation models. Estimation of these latter models is therefore more general than the distributional assumption might initially suggest. Robustness of the estimation procedure is also discussed for deviations from this distribution family. Using ordinal business survey data of the German Ifo-institute we test the efficiency of firms' price expectations implied by the rational expectation hypothesis. 相似文献

17.

EXPONENTIAL SMOOTHING AND NON‐NEGATIVE DATA

Muhammad Akram Rob J. Hyndman J. Keith Ord 《Australian & New Zealand Journal of Statistics》2009,51(4):415-432

The most common forecasting methods in business are based on exponential smoothing, and the most common time series in business are inherently non‐negative. Therefore it is of interest to consider the properties of the potential stochastic models underlying exponential smoothing when applied to non‐negative data. We explore exponential smoothing state space models for non‐negative data under various assumptions about the innovations, or error, process. We first demonstrate that prediction distributions from some commonly used state space models may have an infinite variance beyond a certain forecasting horizon. For multiplicative error models that do not have this flaw, we show that sample paths will converge almost surely to zero even when the error distribution is non‐Gaussian. We propose a new model with similar properties to exponential smoothing, but which does not have these problems, and we develop some distributional properties for our new model. We then explore the implications of our results for inference, and compare the short‐term forecasting performance of the various models using data on the weekly sales of over 300 items of costume jewelry. The main findings of the research are that the Gaussian approximation is adequate for estimation and one‐step‐ahead forecasting. However, as the forecasting horizon increases, the approximate prediction intervals become increasingly problematic. When the model is to be used for simulation purposes, a suitably specified scheme must be employed. 相似文献

18.

Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models 总被引：1，自引：0，他引：1

Zhang HH Cheng G Liu Y 《Journal of the American Statistical Association》2011,106(495):1099-1112

Partially linear models provide a useful class of tools for modeling complex data by naturally incorporating a combination of linear and nonlinear effects within one framework. One key question in partially linear models is the choice of model structure, that is, how to decide which covariates are linear and which are nonlinear. This is a fundamental, yet largely unsolved problem for partially linear models. In practice, one often assumes that the model structure is given or known and then makes estimation and inference based on that structure. Alternatively, there are two methods in common use for tackling the problem: hypotheses testing and visual screening based on the marginal fits. Both methods are quite useful in practice but have their drawbacks. First, it is difficult to construct a powerful procedure for testing multiple hypotheses of linear against nonlinear fits. Second, the screening procedure based on the scatterplots of individual covariate fits may provide an educated guess on the regression function form, but the procedure is ad hoc and lacks theoretical justifications. In this article, we propose a new approach to structure selection for partially linear models, called the LAND (Linear And Nonlinear Discoverer). The procedure is developed in an elegant mathematical framework and possesses desired theoretical and computational properties. Under certain regularity conditions, we show that the LAND estimator is able to identify the underlying true model structure correctly and at the same time estimate the multivariate regression function consistently. The convergence rate of the new estimator is established as well. We further propose an iterative algorithm to implement the procedure and illustrate its performance by simulated and real examples. Supplementary materials for this article are available online. 相似文献

19.

MDL convergence speed for Bernoulli sequences

Jan Poland Marcus Hutter 《Statistics and Computing》2006,16(2):161-175

相似文献

20.

Bayesian integrative analysis for multi-fidelity computer experiments

Yunfei Wei 《Journal of applied statistics》2019,46(11):1973-1987

This paper proposes a Bayesian integrative analysis method for linking multi-fidelity computer experiments. Instead of assuming covariance structures of multivariate Gaussian process models, we handle the outputs from different levels of accuracy as independent processes and link them via a penalization method that controls the distance between their overall trends. Based on the priors induced by the penalty, we build Bayesian prediction models for the output at the highest accuracy. Simulated and real examples show that the proposed method is better than existing methods in terms of prediction accuracy for many cases. 相似文献