期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Assessing cumulative logit models via a score test in random effect models

Kuo-Chin Lin 《Journal of applied statistics》2011,38(2):247-259

The purpose of this article is to develop a goodness-of-fit test based on score test statistics for cumulative logit models with extra variation of random effects. Two main theorems for the proposed score test statistics are derived. In simulation studies, the powers of the proposed tests are discussed and the power curve against a variety of dispersion parameters and bandwidths is depicted. The proposed method is illustrated by an ordinal data set from Mosteller and Tukey [23]. 相似文献

2.

A class of mixed models for recurrent event data

Liuquan Sun Xingqiu Zhao Jie Zhou 《Revue canadienne de statistique》2011,39(4):578-590

In this article, we propose a class of mixed models for recurrent event data. The new models include the proportional rates model and Box–Cox transformation rates models as special cases, and allow the effects of covariates on the rate functions of counting processes to be proportional or convergent. For inference on the model parameters, estimating equation approaches are developed. The asymptotic properties of the resulting estimators are established and the finite sample performance of the proposed procedure is evaluated through simulation studies. A real example with data taken from a clinic study on chronic granulomatous disease (CGD) is also illustrated for the use of the proposed methodology. The Canadian Journal of Statistics 39: 578–590; 2011. © 2011 Statistical Society of Canada 相似文献

3.

General purpose multiply robust data integration procedures for handling nonprobability samples

Sixia Chen David Haziza 《Scandinavian Journal of Statistics》2023,50(2):697-724

In recent years, there has been an increased interest in combining probability and nonprobability samples. Nonprobability sample are cheaper and quicker to conduct but the resulting estimators are vulnerable to bias as the participation probabilities are unknown. To adjust for the potential bias, estimation procedures based on parametric or nonparametric models have been discussed in the literature. However, the validity of the resulting estimators relies heavily on the validity of the underlying models. Also, nonparametric approaches may suffer from the curse of dimensionality and poor efficiency. We propose a data integration approach by combining multiple outcome regression models and propensity score models. The proposed approach can be used for estimating general parameters including totals, means, distribution functions, and percentiles. The resulting estimators are multiply robust in the sense that they remain consistent if all but one model are misspecified. The asymptotic properties of point and variance estimators are established. The results from a simulation study show the benefits of the proposed method in terms of bias and efficiency. Finally, we apply the proposed method using data from the Korea National Health and Nutrition Examination Survey and data from the National Health Insurance Sharing Services. 相似文献

4.

The Extensively Corrected Score for Measurement Error Models

下载免费PDF全文

Yih‐Huei Huang Chi‐Chung Wen Yu‐Hua Hsu 《Scandinavian Journal of Statistics》2015,42(4):911-924

In measurement error problems, two major and consistent estimation methods are the conditional score and the corrected score. They are functional methods that require no parametric assumptions on mismeasured covariates. The conditional score requires that a suitable sufficient statistic for the mismeasured covariate can be found, while the corrected score requires that the object score function can be estimated without bias. These assumptions limit their ranges of applications. The extensively corrected score proposed here is an extension of the corrected score. It yields consistent estimations in many cases when neither the conditional score nor the corrected score is feasible. We demonstrate its constructions in generalized linear models and the Cox proportional hazards model, assess its performances by simulation studies and illustrate its implementations by two real examples. 相似文献

5.

Score test for testing zero-inflated Poisson regression against zero-inflated generalized Poisson alternatives

Hossein Zamani 《Journal of applied statistics》2013,40(9):2056-2068

In several cases, count data often have excessive number of zero outcomes. This zero-inflated phenomenon is a specific cause of overdispersion, and zero-inflated Poisson regression model (ZIP) has been proposed for accommodating zero-inflated data. However, if the data continue to suggest additional overdispersion, zero-inflated negative binomial (ZINB) and zero-inflated generalized Poisson (ZIGP) regression models have been considered as alternatives. This study proposes the score test for testing ZIP regression model against ZIGP alternatives and proves that it is equal to the score test for testing ZIP regression model against ZINB alternatives. The advantage of using the score test over other alternative tests such as likelihood ratio and Wald is that the score test can be used to determine whether a more complex model is appropriate without fitting the more complex model. Applications of the proposed score test on several datasets are also illustrated. 相似文献

6.

Latent variable techniques for categorical data

Lancaster Gillian Green Mick 《Statistics and Computing》2002,12(2):153-161

Two useful statistical methods for generating a latent variable are described and extended to incorporate polytomous data and additional covariates. Item response analysis is not well-known outside its area of application, mainly because the procedures to fit the models are computer intensive and not routinely available within general statistical software packages. The linear score technique is less computer intensive, straightforward to implement and has been proposed as a good approximation to item response analysis. Both methods have been implemented in the standard statistical software package GLIM 4.0, and are compared to determine their effectiveness. 相似文献

7.

Semiparametric score test for varying copula parameter in Markov time series

Fentaw Abegaz U. V. Naik-Nimbalkar 《Statistics》2013,47(3):209-222

This article examines a semiparametric test for checking the constancy of serial dependence via copula models for Markov time series. A semiparametric score test is proposed for testing the constancy of the copula parameter against stochastically varying copula parameter. The asymptotic null distribution of the test is established. A semiparametric bootstrap procedure is employed for the estimation of the variance of the proposed score test. Illustrations are given based on simulated series and historic interest rate data. 相似文献

8.

Quasi-binomial zero-inflated regression model suitable for variables with bounded support

E. Gmez&#x;Dniz D. I. Gallardo H. W. Gmez 《Journal of applied statistics》2020,47(12):2208

In recent years, a variety of regression models, including zero-inflated and hurdle versions, have been proposed to explain the case of a dependent variable with respect to exogenous covariates. Apart from the classical Poisson, negative binomial and generalised Poisson distributions, many proposals have appeared in the statistical literature, perhaps in response to the new possibilities offered by advanced software that now enables researchers to implement numerous special functions in a relatively simple way. However, we believe that a significant research gap remains, since very little attention has been paid to the quasi-binomial distribution, which was first proposed over fifty years ago. We believe this distribution might constitute a valid alternative to existing regression models, in situations in which the variable has bounded support. Therefore, in this paper we present a zero-inflated regression model based on the quasi-binomial distribution, taking into account the moments and maximum likelihood estimators, and perform a score test to compare the zero-inflated quasi-binomial distribution with the zero-inflated binomial distribution, and the zero-inflated model with the homogeneous model (the model in which covariates are not considered). This analysis is illustrated with two data sets that are well known in the statistical literature and which contain a large number of zeros. 相似文献

9.

Comparison of Shared Frailty Models for Kidney Infection Data under Exponential Power Baseline Distribution

David D. Hanagal Alok D. Dabade 《统计学通讯:理论与方法》2013,42(23):5091-5108

Shared frailty models are often used to model heterogeneity in survival analysis. There are certain assumptions about the baseline distribution and distribution of frailty. In this paper, four shared frailty models with frailty distribution gamma, inverse Gaussian, compound Poisson, and compound negative binomial with exponential power as baseline distribution are proposed. These models are fitted using Markov Chain Monte Carlo methods. These models are illustrated with a real life bivariate survival data set of McGilchrist and Aisbett (1991) related to kidney infection, and the best model is suggested for the data using different model comparison criteria. 相似文献

10.

Events per variable for risk differences and relative risks using pseudo-observations

Stefan Nygaard Hansen Per Kragh Andersen Erik Thorlund Parner 《Lifetime data analysis》2014,20(4):584-598

A method based on pseudo-observations has been proposed for direct regression modeling of functionals of interest with right-censored data, including the survival function, the restricted mean and the cumulative incidence function in competing risks. The models, once the pseudo-observations have been computed, can be fitted using standard generalized estimating equation software. Regression models can however yield problematic results if the number of covariates is large in relation to the number of events observed. Guidelines of events per variable are often used in practice. These rules of thumb for the number of events per variable have primarily been established based on simulation studies for the logistic regression model and Cox regression model. In this paper we conduct a simulation study to examine the small sample behavior of the pseudo-observation method to estimate risk differences and relative risks for right-censored data. We investigate how coverage probabilities and relative bias of the pseudo-observation estimator interact with sample size, number of variables and average number of events per variable. 相似文献

11.

Variable selection in linear measurement error models via penalized score functions

Xianzheng Huang Hongmei Zhang 《Journal of statistical planning and inference》2013

We propose variable selection procedures based on penalized score functions derived for linear measurement error models. To calibrate the selection procedures, we define new tuning parameter selectors based on the scores. Large-sample properties of these new tuning parameter selectors are established for the proposed procedures. These new methods are compared in simulations and a real-data application with competing methods where one ignores measurement error or uses the Bayesian information criterion to choose the tuning parameter. 相似文献

12.

Two-Part Models for Analysis of Agatston Scores with Possible Proportionality Constraints

Cong Han Richard Kronmal 《统计学通讯:理论与方法》2013,42(1):99-111

ABSTRACT

Logit-linear and probit-linear two-part models can be used to analyze data that are a mixture of zeros and positive continuous responses. The slopes in the linear part of a model can be constrained to be proportional to the slopes in the logit or probit part. In this article, it is shown that implementing such a constraint will decrease (in Loewner ordering) the asymptotic covariance matrix of the maximum likelihood estimates. A case study is provided using coronary artery calcification data from the Multi-Ethnic Study of Atherosclerosis. 相似文献

13.

Diagnostics for elliptical linear mixed models with first-order autoregressive errors

《Journal of Statistical Computation and Simulation》2012,82(10):1281-1296

For longitudinal time series data, linear mixed models that contain both random effects across individuals and first-order autoregressive errors within individuals may be appropriate. Some statistical diagnostics based on the models under a proposed elliptical error structure are developed in this work. It is well known that the class of elliptical distributions offers a more flexible framework for modelling since it contains both light- and heavy-tailed distributions. Iterative procedures for the maximum-likelihood estimates of the model parameters are presented. Score tests for the presence of autocorrelation and the homogeneity of autocorrelation coefficients among individuals are constructed. The properties of test statistics are investigated through Monte Carlo simulations. The local influence method for the models is also given. The analysed results of a real data set illustrate the values of the models and diagnostic statistics. 相似文献

14.

Bayesian Poisson models for the graphical combination of dependent expert information

Jim Q. Smith & Álvaro E. Faria Jr 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2000,62(3):525-544

A supra-Bayesian (SB) wants to combine the information from a group of k experts to produce her distribution of a probability θ. Each expert gives his counts of what he thinks are the numbers of successes and failures in a sequence of independent trials, each with probability θ of success. These counts, used as a surrogate for each expert's own individual probability assessment (together with his associated level of confidence in his estimate), allow the SB to build various plausible conjugate models. Such models reflect her beliefs about the reliability of different experts and take account of different possible patterns of overlap of information between them. Corresponding combination rules are then obtained and compared with other more established rules and their properties examined. 相似文献

15.

Tests for Regression Models Fitted to Survey Data

Thomas Lumley Alastair Scott 《Australian & New Zealand Journal of Statistics》2014,56(1):1-14

Data from complex surveys are being used increasingly to build the same sort of explanatory and predictive models as those used in the rest of statistics. Unfortunately the assumptions underlying standard statistical methods are not even approximately valid for most survey data. The problem of parameter estimation has been largely solved, at least for routine data analysis, through the use of weighted estimating equations, and software for most standard analytical procedures is now available in the major statistical packages. One notable omission from standard software is an analogue of the likelihood ratio test. An exception is the Rao–Scott test for loglinear models in contingency tables. In this paper we show how the Rao–Scott test can be extended to handle arbitrary regression models. We illustrate the process of fitting a model to survey data with an example from NHANES. 相似文献

16.

Diagnostics for non-linear regression

《Journal of Statistical Computation and Simulation》2012,82(9):1109-1128

Sensitivity analysis in regression is concerned with assessing the sensitivity of the results of a regression model (e.g., the objective function, the regression parameters, and the fitted values) to changes in the data. Sensitivity analysis in least squares linear regression has seen a great surge of research activities over the last three decades. By contrast, sensitivity analysis in non-linear regression has received very little attention. This paper deals with the problem of local sensitivity analysis in non-linear regression. Closed-form general formulas are provided for the sensitivities of three standard methods for the estimation of the parameters of a non-linear regression model based on a set of data. These methods are the least squares, the minimax, and the least absolute value methods. The effectiveness of the proposed measures is illustrated by application to several non-linear models including the ultrasonic data and the onion yield data. The proposed sensitivity measures are shown to deal effectively with the detection of influential observations in non-linear regression models. 相似文献

17.

Efficient estimation for case‐cohort studies

Bin Nan 《Revue canadienne de statistique》2004,32(4):403-419

The author considers time‐to‐event data from case‐cohort designs. As existing methods are either inefficient or based on restrictive assumptions concerning the censoring mechanism, he proposes a semi‐parametrically efficient estimator under the usual assumptions for Cox regression models. The estimator in question is obtained by a one‐step Newton‐Raphson approximation that solves the efficient score equations with initial value obtained from an existing method. The author proves that the estimator is consistent, asymptotically efficient and normally distributed in the limit. He also resorts to simulations to show that the proposed estimator performs well in finite samples and that it considerably improves the efficiency of existing pseudo‐likelihood estimators when a correlate of the missing covariate is available. Although he focuses on the situation where covariates are discrete, the author also explores how the method can be applied to models with continuous covariates. 相似文献

18.

Testing for the Presence of a Cure Fraction in Clustered Interval‐Censored Survival Data

Xiangmei Ma Liming Xiang 《Australian & New Zealand Journal of Statistics》2013,55(2):173-190

Clustered interval‐censored survival data are often encountered in clinical and epidemiological studies due to geographic exposures and periodic visits of patients. When a nonnegligible cured proportion exists in the population, several authors in recent years have proposed to use mixture cure models incorporating random effects or frailties to analyze such complex data. However, the implementation of the mixture cure modeling approaches may be cumbersome. Interest then lies in determining whether or not it is necessary to adjust the cured proportion prior to the mixture cure analysis. This paper mainly focuses on the development of a score for testing the presence of cured subjects in clustered and interval‐censored survival data. Through simulation, we evaluate the sampling distribution and power behaviour of the score test. A bootstrap approach is further developed, leading to more accurate significance levels and greater power in small sample situations. We illustrate applications of the test using data sets from a smoking cessation study and a retrospective study of early breast cancer patients. 相似文献

19.

Geometric consistency of principal component scores for high-dimensional mixture models and its application

Kazuyoshi Yata Makoto Aoshima 《Scandinavian Journal of Statistics》2020,47(3):899-921

In this article, we consider clustering based on principal component analysis (PCA) for high-dimensional mixture models. We present theoretical reasons why PCA is effective for clustering high-dimensional data. First, we derive a geometric representation of high-dimension, low-sample-size (HDLSS) data taken from a two-class mixture model. With the help of the geometric representation, we give geometric consistency properties of sample principal component scores in the HDLSS context. We develop ideas of the geometric representation and provide geometric consistency properties for multiclass mixture models. We show that PCA can cluster HDLSS data under certain conditions in a surprisingly explicit way. Finally, we demonstrate the performance of the clustering using gene expression datasets. 相似文献

20.

Bayesian misclassification and propensity score methods for clustered observational studies

Qi Zhou Yoo-Mi Chin James D. Stamey 《Journal of applied statistics》2018,45(9):1547-1560

Bayesian propensity score regression analysis with misclassified binary responses is proposed to analyse clustered observational data. This approach utilizes multilevel models and corrects for misclassification in the responses. Using the deviance information criterion (DIC), the performance of the approach is compared with approaches without correcting for misclassification, multilevel structure specification, or both in the study of the impact of female employment on the likelihood of physical violence. The smallest DIC confirms that our proposed model best fits the data. We conclude that female employment has an insignificant impact on the likelihood of physical spousal violence towards women. In addition, a simulation study confirms that the proposed approach performed best in terms of bias and coverage rate. Ignoring misclassification in response or multilevel structure of data would yield biased estimation of the exposure effect. 相似文献