共查询到20条相似文献,搜索用时 0 毫秒
1.
A table of expected success rates under normally distributed success logit, used in conjunction with logistic regression analysis, enables easy calculation of expected win for betting on success of a future dichotomous trial. 相似文献
2.
《Journal of Statistical Computation and Simulation》2012,82(8):765-779
We present a variational estimation method for the mixed logistic regression model. The method is based on a lower bound approximation of the logistic function [Jaakkola, J.S. and Jordan, M.I., 2000, Bayesian parameter estimation via variational methods. Statistics & Computing, 10, 25–37.]. Based on the approximation, an EM algorithm can be derived that results in a considerable simplification of the maximization problem in that it does not require the numerical evaluation of integrals over the random effects. We assess the performance of the variational method for the mixed logistic regression model in a simulation study and an empirical data example, and compare it to Laplace's method. The results indicate that the variational method is a viable choice for estimating the fixed effects of the mixed logistic regression model under the condition that the number of outcomes within each cluster is sufficiently high. 相似文献
3.
4.
Faisal Nawaz Faisal Shahzad Ijaz Ur Rehman Muhammad Shujahat Shabir Hyder Sameer Al Barghouthi 《统计学通讯:理论与方法》2019,48(13):3360-3376
The developing markets are more volatile, unstable illiquid, and more prone to the external shocks. The non Gaussian VaR model gives more accurate risk models than Gaussian VaR models. Hence, the purpose of this study is to test if and how non Gaussian VaR models are comparatively better fit for risk modeling of developing markets than the Gaussian VaR models. The study measures the market risk for the daily closing price of Karachi Stock Exchange 100 index over the period of 2004–2016. The evaluation of VaR models suggests that non Gaussian dynamic model outperformed the Gaussian VaR models in developing markets. 相似文献
5.
This paper develops alternatives to maximum likelihood estimators (MLE) for logistic regression models and compares the mean squared error (MSE) of the estimators. The MLE for the vector of underlying success probabilities has low MSE only when the true probabilities are extreme (i.e., near 0 or 1). Extreme probabilities correspond to logistic regression parameter vectors which are large in norm. A competing “restricted” MLE and an empirical version of it are suggested as estimators with better performance than the MLE for central probabilities. An approximate EM-algorithm for estimating the restriction is described. As in the case of normal theory ridge estimators, the proposed estimators are shown to be formally derivable by Bayes and empirical Bayes arguments. The small sample operating characteristics of the proposed estimators are compared to the MLE via a simulation study; both the estimation of individual probabilities and of logistic parameters are considered. 相似文献
6.
Gwowen Shieh 《统计学通讯:模拟与计算》2013,42(3):763-791
Whittemore (1981) proposed an approach for calculating the sample size needed to test hypotheses with specified significance and power against a given alternative for logistic regression with small response probability. Based on the distribution of covariate, which could be either discrete or continuous, this approach first provides a simple closed-form approximation to the asymptotic covariance matrix of the maximum likelihood estimates, and then uses it to calculate the sample size needed to test a hypothesis about the parameter. Self et al. (1992) described a general approach for power and sample size calculations within the framework of generalized linear models, which include logistic regression as a special case. Their approach is based on an approximation to the distribution of the likelihood ratio statistic. Unlike the Whittemore approach, their approach is not limited to situations of small response probability. However, it is restricted to models with a finite number of covariate configurations. This study compares these two approaches to see how accurate they would be for the calculations of power and sample size in logistic regression models with various response probabilities and covariate distributions. The results indicate that the Whittemore approach has a slight advantage in achieving the nominal power only for one case with small response probability. It is outperformed for all other cases with larger response probabilities. In general, the approach proposed in Self et al. (1992) is recommended for all values of the response probability. However, its extension for logistic regression models with an infinite number of covariate configurations involves an arbitrary decision for categorization and leads to a discrete approximation. As shown in this paper, the examined discrete approximations appear to be sufficiently accurate for practical purpose. 相似文献
7.
Three regression models for ordinal data, those of Fienberg, McCullagh, and Anderson, are applied to an analysis of kidney function among transplant recipients. The conclusions arising from each model are presented and contrasted. 相似文献
8.
The problem of spuriousity has been dealt with from a Bayesian perspective by, among others, Box and Taio (1968) and in several papers by Guttman with various co-authors, beginning with Guttman (1973), The main objective of these papers has been to obtain posterior distributions of parameters, and to base inference on these distributions. In the current paper, the Bayesian argument is carried one step further by deriving predictive distributions of future observations. Inferences are then based on these distributions. We will obtain predictive results for several models, First, we consider the univariate normal case with one spurious observation, This is then generalized to several spurious observations. The multivariate normal situation is studied next. Finally, we consider the general linear model with normal errors. 相似文献
9.
K. Vaitheeswaran M. Subbiah R. Ramakrishnan T. Kannan 《Journal of applied statistics》2016,43(12):2254-2260
Estimating the risk factors of a disease such as diabetic retinopathy (DR) is one of the important research problems among bio-medical and statistical practitioners as well as epidemiologists. Incidentally many studies have focused in building models with binary outcomes, that may not exploit the available information. This article has investigated the importance of retaining the ordinal nature of the response variable (e.g. severity level of a disease) while determining the risk factors associated with DR. A generalized linear model approach with appropriate link functions has been studied using both Classical and Bayesian frameworks. From the result of this study, it can be observed that the ordinal logistic regression with probit link function could be more appropriate approach in determining the risk factors of DR. The study has emphasized the ways to handle the ordinal nature of the response variable with better model fit compared to other link functions. 相似文献
10.
Hierarchical related regression for combining aggregate and individual data in studies of socio-economic disease risk factors 总被引:2,自引:0,他引:2
Christopher Jackson Nicky Best Sylvia Richardson 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2008,171(1):159-178
Summary. To obtain information about the contribution of individual and area level factors to population health, it is desirable to use both data collected on areas, such as censuses, and on individuals, e.g. survey and cohort data. Recently developed models allow us to carry out simultaneous regressions on related data at the individual and aggregate levels. These can reduce 'ecological bias' that is caused by confounding, model misspecification or lack of information and increase power compared with analysing the data sets singly. We use these methods in an application investigating individual and area level sociodemographic predictors of the risk of hospital admissions for heart and circulatory disease in London. We discuss the practical issues that are encountered in this kind of data synthesis and demonstrate that this modelling framework is sufficiently flexible to incorporate a wide range of sources of data and to answer substantive questions. Our analysis shows that the variations that are observed are mainly attributable to individual level factors rather than the contextual effect of deprivation. 相似文献
11.
Gustavo H. A. Pereira Juliana Scudilio Manoel Santos-Neto Denise A. Botter Mnica C. Sandoval 《Journal of applied statistics》2020,47(10):1833
Zero adjusted regression models are used to fit variables that are discrete at zero and continuous at some interval of the positive real numbers. Diagnostic analysis in these models is usually performed using the randomized quantile residual, which is useful for checking the overall adequacy of a zero adjusted regression model. However, it may fail to identify some outliers. In this work, we introduce a class of residuals for outlier identification in zero adjusted regression models. Monte Carlo simulation studies and two applications suggest that one of the residuals of the class introduced here has good properties and detects outliers that are not identified by the randomized quantile residual. 相似文献
12.
L. R. Kerward 《Revue canadienne de statistique》1976,4(1):51-64
This paper presents a new test statistic for dynamic or stochastic mis-specification for the dynamic demand or dynamic adjustment class of economic models. The test statistic is based on residual autocorrelations, asymptotically X2 and is suspected to be of low power. The test is illustrated with an example from recent econometric literature. 相似文献
13.
Classification models can demonstrate apparent prediction accuracy even when there is no underlying relationship between the predictors and the response. Variable selection procedures can lead to false positive variable selections and overestimation of true model performance. A simulation study was conducted using logistic regression with forward stepwise, best subsets, and LASSO variable selection methods with varying total sample sizes (20, 50, 100, 200) and numbers of random noise predictor variables (3, 5, 10, 15, 20, 50). Using our critical values can help reduce needless follow-up on variables having no true association with the outcome. 相似文献
14.
We propose to perform model check for the Cox and Aalen regression models using martingale residual processes grouped after the risk score. Asymptotic distributions of the grouped martingale residual processes are deduced, so both formal and graphical model check can be performed. The method is validated by stochastic simulation. A data example with patients with primary biliary cirrhosis of the liver is discussed. 相似文献
15.
16.
This paper describes a Bayesian approach to mixture modelling and a method based on predictive distribution to determine the number of components in the mixtures. The implementation is done through the use of the Gibbs sampler. The method is described through the mixtures of normal and gamma distributions. Analysis is presented in one simulated and one real data example. The Bayesian results are then compared with the likelihood approach for the two examples. 相似文献
17.
Willian Luís de Oliveira Carlos Alberto Ribeiro Diniz Maria Durbán 《统计学通讯:模拟与计算》2013,42(8):2359-2383
ABSTRACTA general class of models for discrete and/or continuous responses is proposed in which joint distributions are constructed via the conditional approach. It is assumed that the distributions of one response and of the other response given the first one belong to exponential family of distributions. Furthermore, the marginal means are related to the covariates by link functions and a dependency structure between the responses is inserted into the model. Estimation methods, diagnostic analysis and a simulation study considering a Bernoulli-exponential model, a particular case of the class, are presented. Finally, this model is used in a real data set. 相似文献
18.
C.K. Narodia 《统计学通讯:理论与方法》2013,42(3):303-310
This paper investigates the characterizations of certain discrete distributions within the framework of multivariate additive damage models. The univariate case for such models appoared in an article by N. Krishnaii (1974) and Rao and Rubin (1964). In this paper the survival distriution in specified and it is shown that linearity of the regression of the undamaged part on the damaged part, or the damaged part on the undamaged part leads to the characterizations of independent binomials, independent negative binomials, independent Poissons, multinomial and negative multinomial for the original p-dimensional observation. 相似文献
19.
Qing Li Inyoung Kim Sheila G. Klauer Bruce G. Simons-Morton 《Journal of applied statistics》2018,45(4):604-625
The driving risk during the initial period after licensure for novice teenage drivers is typically the highest but decreases rapidly right after. The change-point of driving risk is a critical parameter for evaluating teenage driving risk, which also varies substantially among drivers. This paper presents latent class recurrent-event change-point models for detecting the change-points. The proposed model is applied to the Naturalist Teenage Driving Study, which continuously recorded the driving data of 42 novice teenage drivers for 18 months using advanced in-vehicle instrumentation. We propose a hierarchical BFMM to estimate the change-points by clusters of drivers with similar risk profiles. The model is based on a non-homogeneous Poisson process with piecewise-constant intensity functions. Latent variables which identify the membership of the subjects are used to detect potential clusters among subjects. Application to the Naturalistic Teenage Driving Study identifies three distinct clusters with change-points at 52.30, 108.99 and 150.20?hours of driving after first licensure, respectively. The overall intensity rate and the pattern of change also differ substantially among clusters. The results of this research provide more insight in teenagers' driving behaviour and will be critical to improve young drivers' safety education and parent management programs, as well as provide crucial reference for the GDL regulations to encourage safer driving. 相似文献
20.
Osvaldo Loquiha Niel Hens Emilia Martins-Fonteyn Herman Meulemans Edwin Wouters Marleen Temmerman 《Journal of applied statistics》2018,45(10):1781-1798
Two types of bivariate models for categorical response variables are introduced to deal with special categories such as ‘unsure’ or ‘unknown’ in combination with other ordinal categories, while taking additional hierarchical data structures into account. The latter is achieved by the use of different covariance structures for a trivariate random effect. The models are applied to data from the INSIDA survey, where interest goes to the effect of covariates on the association between HIV risk perception (quadrinomial with an ‘unknown risk’ category) and HIV infection status (binary). The final model combines continuation-ratio with cumulative link logits for the risk perception, together with partly correlated and partly shared trivariate random effects for the household level. The results indicate that only age has a significant effect on the association between HIV risk perception and infection status. The proposed models may be useful in various fields of application such as social and biomedical sciences, epidemiology and public health. 相似文献