期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Nonparametric Bayesian analysis for multi-site hidden Markov model

Dal Ho Kim Aejung Jo 《统计学通讯:模拟与计算》2017,46(6):4896-4907

The hidden Markov model (HMM) provides an attractive framework for modeling long-term persistence in a variety of applications including pattern recognition. Unlike typical mixture models, hidden Markov states can represent the heterogeneity in data and it can be extended to a multivariate case using a hierarchical Bayesian approach. This article provides a nonparametric Bayesian modeling approach to the multi-site HMM by considering stick-breaking priors for each row of an infinite state transition matrix. This extension has many advantages over a parametric HMM. For example, it can provide more flexible information for identifying the structure of the HMM than parametric HMM analysis, such as the number of states in HMM. We exploit a simulation example and a real dataset to evaluate the proposed approach. 相似文献

2.

Nonparametric inference and uniqueness for periodically observed progressive disease models

Beth Ann Griffin Stephen W. Lagakos 《Lifetime data analysis》2010,16(2):157-175

In many studies examining the progression of HIV and other chronic diseases, subjects are periodically monitored to assess their progression through disease states. This gives rise to a specific type of panel data which have been termed “chain-of-events data”; e.g. data that result from periodic observation of a progressive disease process whose states occur in a prescribed order and where state transitions are not observable. Using a discrete time semi-Markov model, we develop an algorithm for nonparametric estimation of the distribution functions of sojourn times in a J state progressive disease model. Issues of uniqueness for chain-of-events data are not well-understood. Thus, a main goal of this paper is to determine the uniqueness of the nonparametric estimators of the distribution functions of sojourn times within states. We develop sufficient conditions for uniqueness of the nonparametric maximum likelihood estimator, including situations where some but not all of its components are unique. We illustrate the methods with three examples. 相似文献

3.

Switching nonparametric regression models

Camila P. E. de Souza Nancy E. Heckman 《Journal of nonparametric statistics》2014,26(4):617-637

We propose a methodology to analyse data arising from a curve that, over its domain, switches among J states. We consider a sequence of response variables, where each response y depends on a covariate x according to an unobserved state z. The states form a stochastic process and their possible values are j=1,?…?, J. If z equals j the expected response of y is one of J unknown smooth functions evaluated at x. We call this model a switching nonparametric regression model. We develop an Expectation–Maximisation algorithm to estimate the parameters of the latent state process and the functions corresponding to the J states. We also obtain standard errors for the parameter estimates of the state process. We conduct simulation studies to analyse the frequentist properties of our estimates. We also apply the proposed methodology to the well-known motorcycle dataset treating the data as coming from more than one simulated accident run with unobserved run labels. 相似文献

4.

Estimating heritability for cause specific mortality based on twin studies

Thomas H. Scheike Klaus K. Holst Jacob B. Hjelmborg 《Lifetime data analysis》2014,20(2):210-233

There has been considerable interest in studying the magnitude and type of inheritance of specific diseases. This is typically derived from family or twin studies, where the basic idea is to compare the correlation for different pairs that share different amount of genes. We here consider data from the Danish twin registry and discuss how to define heritability for cancer occurrence. The key point is that this should be done taking censoring as well as competing risks due to e.g. death into account. We describe the dependence between twins on the probability scale and show that various models can be used to achieve sensible estimates of the dependence within monozygotic and dizygotic twin pairs that may vary over time. These dependence measures can subsequently be decomposed into a genetic and environmental component using random effects models. We here present several novel models that in essence describe the association in terms of the concordance probability, i.e., the probability that both twins experience the event, in the competing risks setting. We also discuss how to deal with the left truncation present in the Nordic twin registries, due to sampling only of twin pairs where both twins are alive at the initiation of the registries. 相似文献

5.

A Bayesian HMM with random effects and an unknown number of states for DNA copy number analysis

Oscar M. Rueda Cristina Rueda Ramon Diaz-Uriarte 《Journal of Statistical Computation and Simulation》2013,83(1):82-96

Hidden Markov models (HMMs) have been shown to be a flexible tool for modelling complex biological processes. However, choosing the number of hidden states remains an open question and the inclusion of random effects also deserves more research, as it is a recent addition to the fixed-effect HMM in many application fields. We present a Bayesian mixed HMM with an unknown number of hidden states and fixed covariates. The model is fitted using reversible-jump Markov chain Monte Carlo, avoiding the need to select the number of hidden states. We show through simulations that the estimations produced are more precise than those from a fixed-effect HMM and illustrate its practical application to the analysis of DNA copy number data, a field where HMMs are widely used. 相似文献

6.

部分状态可见的隐马尔可夫模型状态序列的估计方法

楼振凯等《统计研究》2019,36(6):107-114

本文考虑了部分状态可见的隐马尔可夫模型的状态序列估计问题,在分析了现有算法无法合理估计状态路径之后,以状态转移概率、观测概率和可见状态作为先验信息,通过贝叶斯分析计算可见状态前后向状态的后验概率,并给出初始条件和递推公式,运用动态规划递推得到每个观测值对应的最可能状态以及最可能的状态路径。最后,本文给出一个系统故障识别的应用例子,验证了所设计算法的可行性。相似文献

7.

A Markov regime‐switching model for crude‐oil markets: Comparison of composite likelihood and full likelihood

Wei Zou Jiahua Chen 《Revue canadienne de statistique》2013,41(2):353-367

We use the two‐state Markov regime‐switching model to explain the behaviour of the WTI crude‐oil spot prices from January 1986 to February 2012. We investigated the use of methods based on the composite likelihood and the full likelihood. We found that the composite‐likelihood approach can better capture the general structural changes in world oil prices. The two‐state Markov regime‐switching model based on the composite‐likelihood approach closely depicts the cycles of the two postulated states: fall and rise. These two states persist for on average 8 and 15 months, which matches the observed cycles during the period. According to the fitted model, drops in oil prices are more volatile than rises. We believe that this information can be useful for financial officers working in related areas. The model based on the full‐likelihood approach was less satisfactory. We attribute its failure to the fact that the two‐state Markov regime‐switching model is too rigid and overly simplistic. In comparison, the composite likelihood requires only that the model correctly specifies the joint distribution of two adjacent price changes. Thus, model violations in other areas do not invalidate the results. The Canadian Journal of Statistics 41: 353–367; 2013 © 2013 Statistical Society of Canada 相似文献

8.

Hierarchical Bayesian Approach to a Multi-Site Hidden Markov Model

Dal Ho Kim Gyuhyeong Goh Yongku Kim 《统计学通讯:模拟与计算》2013,42(6):1241-1252

Multivariate data with a sequential or temporal structure occur in various fields of study. The hidden Markov model (HMM) provides an attractive framework for modeling long-term persistence in areas of pattern recognition through the extension of independent and identically distributed mixture models. Unlike in typical mixture models, the heterogeneity of data is represented by hidden Markov states. This article extends the HMM to a multi-site or multivariate case by taking a hierarchical Bayesian approach. This extension has many advantages over a single-site HMM. For example, it can provide more information for identifying the structure of the HMM than a single-site analysis. We evaluate the proposed approach by exploiting a spatial correlation that depends on the distance between sites. 相似文献

9.

Confidence intervals for impulse responses under departures from normality

Lutz Kilian 《Econometric Reviews》1998,17(1):1-29

Monte Carlo evidence shows that in structural VAR models with fat-tailed or skewed innovations the coverage accuracy of impulse response confidence intervals may deterorate substantially compared to the same model with Gaussian innovations. Empirical evidance suggests that such departures from normality are quite plausible for economic time series. The simulation results suggest that applied researchers are best off using nonparametric bootstrap intervals for impulse responses, regardless of whether or not there is evidence of fat tails or skewness in the error distribution. Allowing for departures from normality is shown to considerably weaken the evidence of the delayed overshooting puzzle in Eichenbaum and Evans (1995). 相似文献

10.

Extended Bayesian model averaging for heritability in twin studies

Miao-Yu Tsai 《Journal of applied statistics》2010,37(6):1043-1058

Family studies are often conducted to examine the existence of familial aggregation. Particularly, twin studies can model separately the genetic and environmental contribution. Here we estimate the heritability of quantitative traits via variance components of random-effects in linear mixed models (LMMs). The motivating example was a myopia twin study containing complex nesting data structures: twins and siblings in the same family and observations on both eyes for each individual. Three models are considered for this nesting structure. Our proposal takes into account the model uncertainty in both covariates and model structures via an extended Bayesian model averaging (EBMA) procedure. We estimate the heritability using EBMA under three suggested model structures. When compared with the results under the model with the highest posterior model probability, the EBMA estimate has smaller variation and is slightly conservative. Simulation studies are conducted to evaluate the performance of variance-components estimates, as well as the selections of risk factors, under the correct or incorrect structure. The results indicate that EBMA, with consideration of uncertainties in both covariates and model structures, is robust in model misspecification than the usual Bayesian model averaging (BMA) that considers only uncertainty in covariates selection. 相似文献

11.

Clustered Survival Data with Left‐truncation

下载免费PDF全文

Frank Eriksson Torben Martinussen Thomas H. Scheike 《Scandinavian Journal of Statistics》2015,42(4):1149-1166

Left‐truncation occurs frequently in survival studies, and it is well known how to deal with this for univariate survival times. However, there are few results on how to estimate dependence parameters and regression effects in semiparametric models for clustered survival data with delayed entry. Surprisingly, existing methods only deal with special cases. In this paper, we clarify different kinds of left‐truncation and suggest estimators for semiparametric survival models under specific truncation schemes. The large‐sample properties of the estimators are established. Small‐sample properties are investigated via simulation studies, and the suggested estimators are used in a study of prostate cancer based on the Finnish twin cohort where a twin pair is included only if both twins were alive in 1974. 相似文献

12.

Decriminalization and Marijuana Smoking Prevalence: Evidence From Australia

《商业与经济统计学杂志》2013,31(3):344-356

This article uses the 2001 National Drug Strategy Household Survey to assess the impact of marijuana decriminalization policy on marijuana smoking prevalence in Australia. Both parametric and nonparametric methods are used. The parametric approach includes endogenous probit switching, two-part, sample selection, and standard dummy variable models, while the nonparametric approach uses propensity score stratification matching. Specification analyses are also conducted. A nonparametric kernel-based test is constructed to select between parametric and nonparametric models, and the likelihood ratio test is used to choose among parametric models. Our analyses favor the endogenous switching model where decriminalization increases the probability of smoking by 16.2%. 相似文献

13.

Nonparametric estimation of a switching regression model

Ruffy S. Guilatco 《统计学通讯:模拟与计算》2017,46(2):840-854

High-dimensional data often exhibit multi-collinearity, leading to unstable regression coefficients. To address sample selection bias and problems associated with high dimensionality, principal components were extracted and used as predictors in a switching regression model. Since principal component regression often results to decline in predictive ability due to the selection of few principal components, we formulate the model with nonparametric function of principal components in lieu of individual predictors. Simulation studies indicated better predictive ability for nonparametric principal component switching regression over the parametric counterpart while mitigating the adverse effects of multi-collinearity and high dimensionality. 相似文献

14.

Robust tests for the equality of variances for clustered data

《Journal of Statistical Computation and Simulation》2012,82(4):365-377

Tests for the equality of variances are often needed in applications. In genetic studies the assumption of equal variances of continuous traits, measured in identical and fraternal twins, is crucial for heritability analysis. To test the equality of variances of traits, which are non-normally distributed, Levene [H. Levene, Robust tests for equality of variances, in Contributions to Probability and Statistics, I. Olkin, ed. Stanford University Press, Palo Alto, California, 1960, pp. 278–292] suggested a method that was surprisingly robust under non-normality, and the procedure was further improved by Brown and Forsythe [M.B. Brown and A.B. Forsythe, Robust tests for the equality of variances, J. Amer. Statis. Assoc. 69 (1974), pp. 364–367]. These tests assumed independence of observations. However, twin data are clustered – observations within a twin pair may be dependent due to shared genes and environmental factors. Uncritical application of the tests of Brown and Forsythe to clustered data may result in much higher than nominal Type I error probabilities. To deal with clustering we developed an extended version of Levene's test, where the ANOVA step is replaced with a regression analysis followed by a Wald-type test based on a clustered version of the robust Huber–White sandwich estimator of the covariance matrix. We studied the properties of our procedure using simulated non-normal clustered data and obtained Type I error rates close to nominal as well as reasonable powers. We also applied our method to oral glucose tolerance test data obtained from a twin study of the metabolic syndrome and related components and compared the results with those produced by the traditional approaches. 相似文献

15.

MAP segmentation in Bayesian hidden Markov models: a case study

Alexey Koloydenko Kristi Kuljus Jüri Lember 《Journal of applied statistics》2022,49(5):1203

We consider the problem of estimating the maximum posterior probability (MAP) state sequence for a finite state and finite emission alphabet hidden Markov model (HMM) in the Bayesian setup, where both emission and transition matrices have Dirichlet priors. We study a training set consisting of thousands of protein alignment pairs. The training data is used to set the prior hyperparameters for Bayesian MAP segmentation. Since the Viterbi algorithm is not applicable any more, there is no simple procedure to find the MAP path, and several iterative algorithms are considered and compared. The main goal of the paper is to test the Bayesian setup against the frequentist one, where the parameters of HMM are estimated using the training data. 相似文献

16.

Hypothesis Testing Problems in an Unbalanced Longitudinal Ophthalmology Study

Jonghyeon Kim 《统计学通讯:理论与方法》2013,42(3):461-476

ABSTRACT

The analysis of clustered data in a longitudinal ophthalmology study is complicated by correlations between repeatedly measured visual outcomes of paired eyes in a participant and missing observations due to the loss of follow-up. In the present article we consider hypothesis testing problems in an ophthalmology study, where eligible eyes are randomized to two treatments (when two eyes of a participant are eligible, the paired eyes are assigned to different treatments), and vision function outcomes are repeatedly measured over time. A large sample-based nonparametric test statistic and a nonparametric Bootstrap test analog are proposed for testing an interaction effect of two factors and testing an effect of a eye-specific factor within a level of the other person-specific factor on visual function outcomes. Both test statistics allow for missing observations, correlations between repeatedly measured outcomes on individual eyes, and correlations between repeatedly measured outcomes on both eyes of each participant. A simulation study shows that these proposed test statistics maintain nominal significance levels approximately and comparable powers to each other, as well as higher powers than the naive test statistic ignoring correlations between repeated bilateral measurements of both eyes in the same person. For illustration, we apply the proposed test statistics to the changes of visual field defect score in the Advanced Glaucoma Intervention Study. 相似文献

17.

Real-Time Forecasting With a Mixed-Frequency VAR

Frank Schorfheide Dongho Song 《商业与经济统计学杂志》2013,31(3):366-380

This article develops a vector autoregression (VAR) for time series which are observed at mixed frequencies—quarterly and monthly. The model is cast in state-space form and estimated with Bayesian methods under a Minnesota-style prior. We show how to evaluate the marginal data density to implement a data-driven hyperparameter selection. Using a real-time dataset, we evaluate forecasts from the mixed-frequency VAR and compare them to standard quarterly frequency VAR and to forecasts from MIDAS regressions. We document the extent to which information that becomes available within the quarter improves the forecasts in real time. This article has online supplementary materials. 相似文献

18.

Reply

Stephen K. McNees 《商业与经济统计学杂志》2013,31(1):126-127

This article develops a new identification procedure to estimate the contemporaneous relation between monetary policy and the stock market within a vector autoregression (VAR) framework. The approach combines high-frequency data from the futures market with the VAR methodology to circumvent exclusion restrictions and achieve identification. Our analysis casts doubt on VAR models imposing a recursive structure between innovations in policy rates and stock returns. We find that a tightening in policy rates has a negative impact on stock prices and that the Federal Reserve (Fed) has responded significantly to movements in the stock market. Estimates are robust to various model specifications. 相似文献

19.

A dynamic analysis of stock markets using a hidden Markov model

Luca De Angelis Leonard J. Paas 《Journal of applied statistics》2013,40(8):1682-1700

This paper proposes a framework to detect financial crises, pinpoint the end of a crisis in stock markets and support investment decision-making processes. This proposal is based on a hidden Markov model (HMM) and allows for a specific focus on conditional mean returns. By analysing weekly changes in the US stock market indexes over a period of 20 years, this study obtains an accurate detection of stable and turmoil periods and a probabilistic measure of switching between different stock market conditions. The results contribute to the discussion of the capabilities of Markov-switching models of analysing stock market behaviour. In particular, we find evidence that HMM outperforms threshold GARCH model with Student-t innovations both in-sample and out-of-sample, giving financial operators some appealing investment strategies. 相似文献

20.

Stochastic Model Specification Search for Time-Varying Parameter VARs 总被引：1，自引：1，他引：0

Eric Eisenstat Rodney W. Strachan 《Econometric Reviews》2016,35(8-10):1638-1665

This article develops a new econometric methodology for performing stochastic model specification search (SMSS) in the vast model space of time-varying parameter vector autoregressions (VARs) with stochastic volatility and correlated state transitions. This is motivated by the concern of overfitting and the typically imprecise inference in these highly parameterized models. For each VAR coefficient, this new method automatically decides whether it is constant or time-varying. Moreover, it can be used to shrink an otherwise unrestricted time-varying parameter VAR to a stationary VAR, thus providing an easy way to (probabilistically) impose stationarity in time-varying parameter models. We demonstrate the effectiveness of the approach with a topical application, where we investigate the dynamic effects of structural shocks in government spending on U.S. taxes and gross domestic product (GDP) during a period of very low interest rates. 相似文献