期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Graphical Techniques for Selecting Explanatory Variables for Time Series Data

J. M. Marriott & A. N. Pettitt 《Journal of the Royal Statistical Society. Series C, Applied statistics》1997,46(2):253-264

Bayesian model building techniques are developed for data with a strong time series structure and possibly exogenous explanatory variables that have strong explanatory and predictive power. The emphasis is on finding whether there are any explanatory variables that might be used for modelling if the data have a strong time series structure that should also be included. We use a time series model that is linear in past observations and that can capture both stochastic and deterministic trend, seasonality and serial correlation. We propose the plotting of absolute predictive error against predictive standard deviation. A series of such plots is utilized to determine which of several nested and non-nested models is optimal in terms of minimizing the dispersion of the predictive distribution and restricting predictive outliers. We apply the techniques to modelling monthly counts of fatal road crashes in Australia where economic, consumption and weather variables are available and we find that three such variables should be included in addition to the time series filter. The approach leads to graphical techniques to determine strengths of relationships between the dependent variable and covariates and to detect model inadequacy as well as determining useful numerical summaries. 相似文献

2.

Forecasting the Distribution of Economic Variables in a Data-Rich Environment

Sebastiano Manzan 《商业与经济统计学杂志》2013,31(1):144-164

This article investigates the relevance of considering a large number of macroeconomic indicators to forecast the complete distribution of a variable. The baseline time series model is a semiparametric specification based on the quantile autoregressive (QAR) model that assumes that the quantiles depend on the lagged values of the variable. We then augment the time series model with macroeconomic information from a large dataset by including principal components or a subset of variables selected by LASSO. We forecast the distribution of the h-month growth rate for four economic variables from 1975 to 2011 and evaluate the forecast accuracy relative to a stochastic volatility model using the quantile score. The results for the output and employment measures indicate that the multivariate models outperform the time series forecasts, in particular at long horizons and in tails of the distribution, while for the inflation variables the improved performance occurs mostly at the 6-month horizon. We also illustrate the practical relevance of predicting the distribution by considering forecasts at three dates during the last recession. 相似文献

3.

Seemingly unrelated regression tree

Jaeoh Kim 《Journal of applied statistics》2019,46(7):1177-1195

Nonparametric seemingly unrelated regression provides a powerful alternative to parametric seemingly unrelated regression for relaxing the linearity assumption. The existing methods are limited, particularly with sharp changes in the relationship between the predictor variables and the corresponding response variable. We propose a new nonparametric method for seemingly unrelated regression, which adopts a tree-structured regression framework, has satisfiable prediction accuracy and interpretability, no restriction on the inclusion of categorical variables, and is less vulnerable to the curse of dimensionality. Moreover, an important feature is constructing a unified tree-structured model for multivariate data, even though the predictor variables corresponding to the response variable are entirely different. This unified model can offer revelatory insights such as underlying economic meaning. We propose the key factors of tree-structured regression, which are an impurity function detecting complex nonlinear relationships between the predictor variables and the response variable, split rule selection with negligible selection bias, and tree size determination solving underfitting and overfitting problems. We demonstrate our proposed method using simulated data and illustrate it using data from the Korea stock exchange sector indices. 相似文献

4.

Identifying key factors in momentum in basketball games

Tao Chen Qingliang Fan Kai Liu Lingshan Le 《Journal of applied statistics》2021,48(16):3116

Momentum as elaborated under a recent novel definition has been shown quantitatively to have a significant impact on basketball game outcomes. This paper makes two contributions to the analytical literature on sports momentum: (1) two aspects of the new definition are operationalized so that its practicality becomes evident; and (2) through a dimension-reduction technique (elastic net), key factors associated with momentum are identified. Both technical variables such as field goals, assists, rebounds, etc. and environmental variables such as the spectator attendance rate and player salary dispersion are considered, and the potential for useful real-time analyzes is illustrated. 相似文献

5.

Semiparametric estimation of employment duration models

Joel L. Horowitz George R. Neumann 《Econometric Reviews》2013,32(1):5-40

There are a variety of economic areas, such as studies of employment duration and of the durability of capital goods, in which data on important variables typically are censored. The standard techinques for estimating a model from censored data require the distributions of unobservable random components of the model to be specified a priori up to a finite set of parameters, and misspecification of these distributions usually leads to inconsistent parameter estimates. However, economic theory rarely gives guidance about distributions and the standard estimation techniques do not provide convenient methods for identifying distributions from censored data. Recently, several distribution-free or semiparametric methods for estimating censored regression models have been developed. This paper presents the results of using two such methods to estimate a model of employment duration. The paper reports the operating characteristics of the semiparametric estimators and compares the semiparametric estimates with those obtained from a standard parametric model. 相似文献

6.

Analysis of the importance of on-field covariates in the German Bundesliga

Gunther Schauberger Andreas Groll Gerhard Tutz 《Journal of applied statistics》2018,45(9):1561-1578

In modern football, various variables as, for example, the distance a team runs or its percentage of ball possession, are collected throughout a match. However, there is a lack of methods to make use of these on-field variables simultaneously and to connect them with the final result of the match. This paper considers data from the German Bundesliga season 2015/2016. The objective is to identify the on-field variables that are connected to the sportive success or failure of the single teams. An extended Bradley–Terry model for football matches is proposed that is able to take into account on-field covariates. Penalty terms are used to reduce the complexity of the model and to find clusters of teams with equal covariate effects. The model identifies the running distance to be the on-field covariate that is most strongly connected to the match outcome. 相似文献

7.

INDEX TO VOLUME 8 (1 990)

James H. Stock Mark W. Watson 《商业与经济统计学杂志》2013,31(4):481-482

This article provides a simple shrinkage representation that describes the operational characteristics of various forecasting methods designed for a large number of orthogonal predictors (such as principal components). These methods include pretest methods, Bayesian model averaging, empirical Bayes, and bagging. We compare empirically forecasts from these methods with dynamic factor model (DFM) forecasts using a U.S. macroeconomic dataset with 143 quarterly variables spanning 1960–2008. For most series, including measures of real economic activity, the shrinkage forecasts are inferior to the DFM forecasts. This article has online supplementary material. 相似文献

8.

Forecasting and conditional projection using realistic prior distributions 总被引：2，自引：0，他引：2

Thomas Doan Robert Litterman Christopher Sims 《Econometric Reviews》1984,3(1):1-100

This paper develops a forecasting procedure based on a Bayesian method for estimating vector autoregressions. The procedure is applied t o 10 macroeconomic variables and is shown to improve out-of-sample forecasts relative to univariate equations. Although cross-variable responses are damped by the prior, considerable interaction among the variables is shown to be captured by the estimates

We provide unconditional forecasts as of 1982:12 and 1983:3. We also describe how a model such as this can be used to make conditional projections and to analyze policy alternatives. As an example, we analyze a Congressional Budget Office forecast made in 1982: 12

Although no automatic causal interpretations arise from models like ours, they provide a detailed characterization of the dynamic statistical interdependence of a set of economic variables, information that may help in evaluating causal hypotheses without containing any such hypotheses. 相似文献

9.

Mark-resighting analysis of a California gull population

Bruce H. Pugesek Chris Nations Kenneth L. Diem Roger Pradel 《Journal of applied statistics》1995,22(5-6):625-640

California gulls ( Larus californicus ) of known age and sex were censused on their breeding colony in 1979, 1980 and 1984 through 1993. Ages of 235 males and 196 females ranged from 4 to 27 years. Age classes used in the analysis were limited to 17, 4 through 19, and 20 or more as a final age category because data on gulls over 20 were sparse. Survival declined with age in a way that was parsimoniously modelled with a quadratic function. Other factors, sex and time, did not explain any variation in survival. Resighting depended on age, sex and time. Younger adults skipped breeding more frequently than did older adults, and females skipped breeding more frequently than did males. There was also good evidence for time dependence in resighting probability, but its inclusion in the model occurred at the expense of interpretability and precision. In a data set such as this, resighting probability may assume more importance than a mere 'nuisance parameter'. In this study, resighting history measured attendance at the breeding ground. In turn, attendance rates may be a manifestation of reproductive strategy, which can also have consequences for survival. In this situation, there may be heterogeneity in both survival and resighting probability that is unexplained by the model. While such complexity may well be a nuisance to deal with, it can also point to important biological questions. 相似文献

10.

Mark-resighting analysis of a California gull population

Bruce H. Pugesek Chris Nations Kenneth L. Diem Roger Pradel 《Journal of applied statistics》1995,22(5):625-640

California gulls ( Larus californicus ) of known age and sex were censused on their breeding colony in 1979, 1980 and 1984 through 1993. Ages of 235 males and 196 females ranged from 4 to 27 years. Age classes used in the analysis were limited to 17, 4 through 19, and 20 or more as a final age category because data on gulls over 20 were sparse. Survival declined with age in a way that was parsimoniously modelled with a quadratic function. Other factors, sex and time, did not explain any variation in survival. Resighting depended on age, sex and time. Younger adults skipped breeding more frequently than did older adults, and females skipped breeding more frequently than did males. There was also good evidence for time dependence in resighting probability, but its inclusion in the model occurred at the expense of interpretability and precision. In a data set such as this, resighting probability may assume more importance than a mere 'nuisance parameter'. In this study, resighting history measured attendance at the breeding ground. In turn, attendance rates may be a manifestation of reproductive strategy, which can also have consequences for survival. In this situation, there may be heterogeneity in both survival and resighting probability that is unexplained by the model. While such complexity may well be a nuisance to deal with, it can also point to important biological questions. 相似文献

11.

Designing stratified sampling in economic and business surveys

M.G.M. Khan D.K. Rao 《Journal of applied statistics》2015,42(10):2080-2099

In most economic and business surveys, the target variables (e.g. turnover of enterprises, income of households, etc.) commonly resemble skewed distributions with many small and few large units. In such surveys, if a stratified sampling technique is used as a method of sampling and estimation, the convenient way of stratification such as the use of demographical variables (e.g. gender, socioeconomic class, geographical region, religion, ethnicity, etc.) or other natural criteria, which is widely practiced in economic surveys, may fail to form homogeneous strata and is not much useful in order to increase the precision of the estimates of variables of interest. In this paper, a stratified sampling design for economic surveys based on auxiliary information has been developed, which can be used for constructing optimum stratification and determining optimum sample allocation to maximize the precision in estimate. 相似文献

12.

Imposing and Testing for Shape Restrictions in Flexible Parametric Models

Hendrik Wolff 《Econometric Reviews》2016,35(6):1013-1039

In many economic models, theory restricts the shape of functions, such as monotonicity or curvature conditions. This article reviews and presents a framework for constrained estimation and inference to test for shape conditions in parametric models. We show that “regional” shape-restricting estimators have important advantages in terms of model fit and flexibility (as opposed to standard “local” or “global” shape-restricting estimators). In our empirical illustration, this is the first article to impose and test for all shape restrictions required by economic theory simultaneously in the “Berndt and Wood” data. We find that this dataset is consistent with “duality theory,” whereas previous studies have found violations of economic theory. We discuss policy consequences for key parameters, such as whether energy and capital are complements or substitutes. 相似文献

13.

中国能源需求的经济计量分析 总被引：105，自引：1，他引：104

下载免费PDF全文

林伯强《统计研究》2001,18(10):34-37

一、引言世纪之交 ,中国已实现了现代化建设的前两步战略目标。从新世纪开始 ,中国将努力实现第三步战略目标 ,2 0 10年的ＧＤＰ比 2 0 0 0年翻一番。但实践表明 :能源短缺是经济发展的主要障碍之一。基于能源在国民经济中的重要地位 ,中国政府对诸如能源等基础设施建设始终予以高度重视。在学术研究方面 ,文献 [1]总结了经济学家给出的 2 0 0 0年中国能源需求预测的 7种结果 :最小的是13 9亿吨标准煤 ,最大的是 2 4亿吨标准煤。这些研究主要使用两类方法 :密度分析和回归分析。但几种使用密度法的预测结果都高估了 80年代初期的能源需求水… 相似文献

14.

Dynamic Forecasts of Qualitative Variables

《商业与经济统计学杂志》2013,31(1):96-104

This article presents a new Qual VAR model for incorporating information from qualitative and/or discrete variables in vector autoregressions. With a Qual VAR, it is possible to create dynamic forecasts of the qualitative variable using standard VAR projections. Previous forecasting methods for qualitative variables, in contrast, produce only static forecasts. I apply the Qual VAR to forecasting the 2001 business recession out of sample and to analyzing the Romer and Romer narrative measure of monetary policy contractions as an endogenous variable in a VAR. Out of sample, the model predicts the timing of the 2001 recession quite well relative to the recession probabilities put forth at the time by professional forecasters. Qual VARs—which include information about the qualitative variable—can also enhance the quality of density forecasts of the other variables in the system. 相似文献

15.

Bivariate transition model for analysing ordinal and nominal categorical responses: an application to the Labour Force Survey data

Z. Rezaei Ghahroodi F. Harandi D. Berridge 《Journal of applied statistics》2011,38(4):817-832

In many panel studies, bivariate ordinal–nominal responses are measured and the aim is to investigate the effects of explanatory variables on these responses. A regression analysis for these types of data must allow for the correlation among responses of the same individual. To analyse such ordinal–nominal responses using a proper weighting approach, an ordinal–nominal bivariate transition model is proposed and maximum likelihood is used to find the parameter estimates. We propose a method in which the likelihood function can be partitioned to make possible the use of existing software. The approach is applied to the Labour Force Survey data in Iran, where the ordinal response, at the first period, is the duration of unemployment for unemployed people and the nominal response, in the second period, is economic activity status of these individuals. The interest is to find the reasons for staying unemployed or moving to another status of economic activity. 相似文献

16.

Bayesian hierarchical models for the prediction of volleyball results

Andrea Gabrio 《Journal of applied statistics》2021,48(2):301

Statistical modelling of sports data has become more and more popular in the recent years and different types of models have been proposed to achieve a variety of objectives: from identifying the key characteristics which lead a team to win or lose to predicting the outcome of a game or the team rankings in national leagues. Although not as popular as football or basketball, volleyball is a team sport with both national and international level competitions in almost every country. However, there is almost no study investigating the prediction of volleyball game outcomes and team rankings in national leagues. We propose a Bayesian hierarchical model for the prediction of the rankings of volleyball national teams, which also allows to estimate the results of each match in the league. We consider two alternative model specifications of different complexity which are validated using data from the women''s volleyball Italian Serie A1 2017–2018 season. 相似文献

17.

Sparsity identification in ultra-high dimensional quantile regression models with longitudinal data

Xianli Gao 《统计学通讯:理论与方法》2020,49(19):4712-4736

Abstract

In this paper, we propose a variable selection method for quantile regression model in ultra-high dimensional longitudinal data called as the weighted adaptive robust lasso (WAR-Lasso) which is double-robustness. We derive the consistency and the model selection oracle property of WAR-Lasso. Simulation studies show the double-robustness of WAR-Lasso in both cases of heavy-tailed distribution of the errors and the heavy contaminations of the covariates. WAR-Lasso outperform other methods such as SCAD and etc. A real data analysis is carried out. It shows that WAR-Lasso tends to select fewer variables and the estimated coefficients are in line with economic significance. 相似文献

18.

Graphical Presentation of a Basketball Game

Peter H. Westfall 《The American statistician》2013,67(4):305-307

A graphical summary of the scoring activity for a basketball game is presented. The summary is a real-time plot of the score difference versus the elapsed time. In contrast to the usual score by quarters (or by half in college play), the graph easily allows complete identification of items of interest, such as largest leads, lead changes, come-from-behind activity, and percentage of game time for which a particular team of interest held a lead. The technique is applied to professional and collegiate games, and several interesting features are noted. 相似文献

19.

EM Algorithm Estimation of Simultaneous Equation Model with Limited Variables: An Example of Cigarette Consumption

Yu-Wen Wen 《统计学通讯:模拟与计算》2013,42(9):1914-1929

The two-part model and Heckman's sample selection model are often used in economic studies which involve analyzing the demand for limited variables. This study proposed a simultaneous equation model (SEM) and used the expectation-maximization algorithm to obtain the maximum likelihood estimate. We then constructed a simulation to compare the performance of estimates of price elasticity using SEM with those estimates from the two-part model and the sample selection model. The simulation shows that the estimates of price elasticity by SEM are more precise than those by the sample selection model and the two-part model when the model includes limited independent variables. Finally, we analyzed a real example of cigarette consumption as an application. We found an increase in cigarette price associated with a decrease in both the propensity to consume cigarettes and the amount actually consumed. 相似文献

20.

Regression for non-Euclidean data using distance matrices

Julian J. Faraway 《Journal of applied statistics》2014,41(11):2342-2357

Regression methods for common data types such as measured, count and categorical variables are well understood but increasingly statisticians need ways to model relationships between variable types such as shapes, curves, trees, correlation matrices and images that do not fit into the standard framework. Data types that lie in metric spaces but not in vector spaces are difficult to use within the usual regression setting, either as the response and/or a predictor. We represent the information in these variables using distance matrices which requires only the specification of a distance function. A low-dimensional representation of such distance matrices can be obtained using methods such as multidimensional scaling. Once these variables have been represented as scores, an internal model linking the predictors and the responses can be developed using standard methods. We call scoring as the transformation from a new observation to a score, whereas backscoring is a method to represent a score as an observation in the data space. Both methods are essential for prediction and explanation. We illustrate the methodology for shape data, unregistered curve data and correlation matrices using motion capture data from an experiment to study the motion of children with cleft lip. 相似文献