期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Using logistic regression for semiparametric comparison of population means and variances

Shuwen Wan Binrong Xu Biao Zhang 《统计学通讯:理论与方法》2013,42(9):2485-2503

Abstract

We propose to compare population means and variances under a semiparametric density ratio model. The proposed method is easy to implement by employing logistic regression procedures in many statistical software, and it often works very well when data are not normal. In this paper, we construct semiparametric estimators of the differences of two population means and variances, and derive their asymptotic distributions. We prove that the proposed semiparametric estimators are asymptotically more efficient than the corresponding non parametric ones. In addition, a simulation study and the analysis of two real data sets are presented. Finally, a short discussion is provided. 相似文献

2.

Semiparametric ROC surface estimation for continuous diagnostic tests via polytomous logistic regression procedures

Shuwen Wan Biao Zhang 《Journal of Statistical Computation and Simulation》2013,83(12):2195-2205

In this paper, we propose a semiparametric method of estimating receiver operating characteristic (ROC) surfaces for continuous diagnostic tests under density ratio models. Implementation of our method is easy since the usual polytomous logistic regression procedures in many statistical software packages can be employed. A simulated example is provided to facilitate the implementation of our method. Simulation results show that the proposed semiparametric ROC surface estimator is more efficient than the nonparametric counterpart and the parametric counterpart whether the normality assumption of data holds or not. Moreover, some simulation results on the underlying semiparametric distribution function estimators are also reported. In addition, some discussions on the proposed method as well as analysis of a real data set are provided. 相似文献

3.

Comparison of empirical likelihood and its dual likelihood under density ratio model

Huapeng Li Yang Liu Riquan Zhang 《Journal of nonparametric statistics》2018,30(3):581-597

Density ratio models (DRMs) are commonly used semiparametric models to link related populations. Empirical likelihood (EL) under DRM has been demonstrated to be a flexible and useful platform for semiparametric inferences. Since DRM-based EL has the same maximum point and maximum likelihood as its dual form (dual EL), EL-based inferences under DRM are usually made through the latter. A natural question comes up: is there any efficiency loss of doing so? We make a careful comparison of the dual EL and DRM-based EL estimation methods from theory and numerical simulations. We find that their point estimators for any parameter are exactly the same, while they may have different performances in interval estimation. In terms of coverage accuracy, the two intervals are comparable for non- or moderate skewed populations, and the DRM-based EL interval can be much superior for severely skewed populations. A real data example is analysed for illustration purpose. 相似文献

4.

Shrinkage and pretest estimators for longitudinal data analysis under partially linear models

S. Hossain S. Ejaz Ahmed Grace Y. Yi B. Chen 《Journal of nonparametric statistics》2016,28(3):531-549

In this paper, we develop marginal analysis methods for longitudinal data under partially linear models. We employ the pretest and shrinkage estimation procedures to estimate the mean response parameters as well as the association parameters, which may be subject to certain restrictions. We provide the analytic expressions for the asymptotic biases and risks of the proposed estimators, and investigate their relative performance to the unrestricted semiparametric least-squares estimator (USLSE). We show that if the dimension of association parameters exceeds two, the risk of the shrinkage estimators is strictly less than that of the USLSE in most of the parameter space. On the other hand, the risk of the pretest estimator depends on the validity of the restrictions of association parameters. A simulation study is conducted to evaluate the performance of the proposed estimators relative to that of the USLSE. A real data example is applied to illustrate the practical usefulness of the proposed estimation procedures. 相似文献

5.

ASYMPTOTIC DISTRIBUTIONS OF SEMIPARAMETRIC MAXIMUM LIKELIHOOD ESTIMATORS WITH ESTIMATING EQUATIONS FOR GROUP-CENSORED DATA

Di Chen Jye-Chyi Lu Shu Chuan Lin 《Australian & New Zealand Journal of Statistics》2005,47(2):173-192

Semiparametric maximum likelihood estimation with estimating equations (SMLE) is more flexible than traditional methods; it has fewer restrictions on distributions and regression models. The required information about distribution and regression structures is incorporated in estimating equations of the SMLE to improve the estimation quality of non‐parametric methods. The likelihood of SMLE for censored data involves complicated implicit functions without closed‐form expressions, and the first derivatives of the log‐profile‐likelihood cannot be expressed as summations of independent and identically distributed random variables; it is challenging to derive asymptotic properties of the SMLE for censored data. For group‐censored data, the paper shows that all the implicit functions are well defined and obtains the asymptotic distributions of the SMLE for model parameters and lifetime distributions. With several examples the paper compares the SMLE, the regular non‐parametric likelihood estimation method and the parametric MLEs in terms of their asymptotic efficiencies, and illustrates application of SMLE. Various asymptotic distributions of the likelihood ratio statistics are derived for testing the adequacy of estimating equations and a partial set of parameters equal to some known values. 相似文献

6.

Semiparametric estimation of treatment effect with density ratio model

Cunjie Lin Yong Zhou 《统计学通讯:理论与方法》2018,47(14):3338-3359

In this study we propose a unified semiparametric approach to estimate various indices of treatment effect under the density ratio model, which connects two density functions by an exponential tilt. For each index, we construct two estimating functions based on the model and apply the generalized method of moments to improve the estimates. The estimating functions are allowed to be non smooth with respect to parameters and hence make the proposed method more flexible. We establish the asymptotic properties of the proposed estimators and illustrate the application with several simulations and two real data sets. 相似文献

7.

Count Data Models with Correlated Unobserved Heterogeneity

STEFAN BOES 《Scandinavian Journal of Statistics》2010,37(3):382-402

Abstract. As previously argued, the correlation between included and omitted regressors generally causes inconsistency of standard estimators for count data models. Non‐linear instrumental variables estimation of an exponential model under conditional moment restrictions is one of the proposed remedies. This approach is extended here by fully exploiting the model assumptions and thereby improving efficiency of the resulting estimator. Empirical likelihood in particular has favourable properties in this setting compared with the two‐step generalized method of moments, as demonstrated in a Monte Carlo experiment. The proposed method is applied to the estimation of a cigarette demand function. 相似文献

8.

Cox regression of clustered event times with covariates missing not at random

Li Liu Yanyan Liu Yi Xiong X. Joan Hu 《Scandinavian Journal of Statistics》2019,46(4):1315-1346

Motivated by a recent tuberculosis (TB) study, this paper is concerned with covariates missing not at random (MNAR) and models the potential intracluster correlation by a frailty. We consider the regression analysis of right‐censored event times from clustered subjects under a Cox proportional hazards frailty model and present the semiparametric maximum likelihood estimator (SPMLE) of the model parameters. An easy‐to‐implement pseudo‐SPMLE is then proposed to accommodate more realistic situations using readily available supplementary information on the missing covariates. Algorithms are provided to compute the estimators and their consistent variance estimators. We demonstrate that both the SPMLE and the pseudo‐SPMLE are consistent and asymptotically normal by the arguments based on the theory of modern empirical processes. The proposed approach is examined numerically via simulation and illustrated with an analysis of the motivating TB study data. 相似文献

9.

Using Adjacent-category Logits Procedure for Estimating Receiver Operating Characteristic Surface

Arnaud D. Nze Ossima Mohamed C. Belkacemi Jean-Pierre Daurès 《统计学通讯:模拟与计算》2016,45(3):902-919

This article presents a semiparametric method for estimating receiver operating characteristic surface under density ratio model. The construction of the proposed method is based on the adjacent-category logit model and the empirical likelihood approach. A bootstrap approach for the VUS estimator inference is presented. In a simulation study, the proposed estimator is compared with the existing parametric and nonparametric estimators in terms of bias, standard error, and mean square error. Finally, a real data example and some discussions on the proposed method are provided. 相似文献

10.

A doubly restricted exponential dispersion model

Luis Fernando Grajales Raydonal Ospina Luis A. López Oscar O. Melo 《统计学通讯:理论与方法》2019,48(11):2827-2841

In this paper, we propose and develop a doubly restricted exponential dispersion model, i.e. a varying dispersion generalized linear model with two sets of restrictions, a set of linear restrictions for the mean response, and at the same time, for another set of linear restrictions for the dispersion of the distribution. This model would be useful to consider several situations where it is necessary to control/analyze drug-doses, active effects in factorial experiments, mean-variance relationships, among other situations. A penalized likelihood function is proposed and developed in order to achieve the restricted parameters and to develop the inferential results. Several special cases from the literature are commented on. A simply restricted varying dispersion beta regression model is exemplified by means of real and simulated data. Satisfactory and promising results are found. 相似文献

11.

Pseudo‐likelihood ratio tests for semiparametric multivariate copula model selection

Xiaohong Chen Yanqin Fan 《Revue canadienne de statistique》2005,33(3):389-414

The authors propose pseudo‐likelihood ratio tests for selecting semiparametric multivariate copula models in which the marginal distributions are unspecified, but the copula function is parameterized and can be misspecified. For the comparison of two models, the tests differ depending on whether the two copulas are generalized nonnested or generalized nested. For more than two models, the procedure is built on the reality check test of White (2000). Unlike White (2000), however, the test statistic is automatically standardized for generalized nonnested models (with the benchmark) and ignores generalized nested models asymptotically. The authors illustrate their approach with American insurance claim data. 相似文献

12.

Profile likelihood approaches for semiparametric copula and frailty models for clustered survival data

Il Do Ha Jong-Min Kim Takeshi Emura 《Journal of applied statistics》2019,46(14):2553-2571

ABSTRACT

In clustered survival data, the dependence among individual survival times within a cluster has usually been described using copula models and frailty models. In this paper we propose a profile likelihood approach for semiparametric copula models with different cluster sizes. We also propose a likelihood ratio method based on profile likelihood for testing the absence of association parameter (i.e. test of independence) under the copula models, leading to the boundary problem of the parameter space. For this purpose, we show via simulation study that the proposed likelihood ratio method using an asymptotic chi-square mixture distribution performs well as sample size increases. We compare the behaviors of the two models using the profile likelihood approach under a semiparametric setting. The proposed method is demonstrated using two well-known data sets. 相似文献

13.

Empirical likelihood confidence intervals for the mean of a population containing many zero values

Jiahua Chen Shun‐Yi Chen J. N. K. Rao 《Revue canadienne de statistique》2003,31(1):53-68

If a population contains many zero values and the sample size is not very large, the traditional normal approximation‐based confidence intervals for the population mean may have poor coverage probabilities. This problem is substantially reduced by constructing parametric likelihood ratio intervals when an appropriate mixture model can be found. In the context of survey sampling, however, there is a general preference for making minimal assumptions about the population under study. The authors have therefore investigated the coverage properties of nonparametric empirical likelihood confidence intervals for the population mean. They show that under a variety of hypothetical populations, these intervals often outperformed parametric likelihood intervals by having more balanced coverage rates and larger lower bounds. The authors illustrate their methodology using data from the Canadian Labour Force Survey for the year 2000. 相似文献

14.

On the asymptotic non‐equivalence of efficient‐GMM and MEL estimators in models with missing data

Xuerong Chen Yan Chen Alan T.K. Wan Yong Zhou 《Scandinavian Journal of Statistics》2019,46(2):361-388

The generalized method of moments (GMM) and empirical likelihood (EL) are popular methods for combining sample and auxiliary information. These methods are used in very diverse fields of research, where competing theories often suggest variables satisfying different moment conditions. Results in the literature have shown that the efficient‐GMM (GMM_E) and maximum empirical likelihood (MEL) estimators have the same asymptotic distribution to order n^?1/2 and that both estimators are asymptotically semiparametric efficient. In this paper, we demonstrate that when data are missing at random from the sample, the utilization of some well‐known missing‐data handling approaches proposed in the literature can yield GMM_E and MEL estimators with nonidentical properties; in particular, it is shown that the GMM_E estimator is semiparametric efficient under all the missing‐data handling approaches considered but that the MEL estimator is not always efficient. A thorough examination of the reason for the nonequivalence of the two estimators is presented. A particularly strong feature of our analysis is that we do not assume smoothness in the underlying moment conditions. Our results are thus relevant to situations involving nonsmooth estimating functions, including quantile and rank regressions, robust estimation, the estimation of receiver operating characteristic (ROC) curves, and so on. 相似文献

15.

Semiparametric models: a generalized self-consistency approach 总被引：1，自引：0，他引：1

Tsodikov A 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2003,65(3):759-774

Summary. In semiparametric models, the dimension d of the maximum likelihood problem is potentially unlimited. Conventional estimation methods generally behave like O ( d ³). A new O ( d ) estimation procedure is proposed for a large class of semiparametric models. Potentially unlimited dimension is handled in a numerically efficient way through a Nelson–Aalen-like estimator. Discussion of the new method is put in the context of recently developed minorization–maximization algorithms based on surrogate objective functions. The procedure for semiparametric models is used to demonstrate three methods to construct a surrogate objective function: using the difference of two concave functions, the EM way and the new quasi-EM (QEM) approach. The QEM approach is based on a generalization of the EM-like construction of the surrogate objective function so it does not depend on the missing data representation of the model. Like the EM algorithm, the QEM method has a dual interpretation, a result of merging the idea of surrogate maximization with the idea of imputation and self-consistency. The new approach is compared with other possible approaches by using simulations and analysis of real data. The proportional odds model is used as an example throughout the paper. 相似文献

16.

Semiparametric models of longitudinal and time-to-event data with applications to HIV viral dynamics and CD4 counts

Xiaobing Zhao Xian Zhou 《Journal of applied statistics》2015,42(11):2461-2477

We propose a semiparametric approach based on proportional hazards and copula method to jointly model longitudinal outcomes and the time-to-event. The dependence between the longitudinal outcomes on the covariates is modeled by a copula-based times series, which allows non-Gaussian random effects and overcomes the limitation of the parametric assumptions in existing linear and nonlinear random effects models. A modified partial likelihood method using estimated covariates at failure times is employed to draw statistical inference. The proposed model and method are applied to analyze a set of progression to AIDS data in a study of the association between the human immunodeficiency virus viral dynamics and the time trend in the CD4/CD8 ratio with measurement errors. Simulations are also reported to evaluate the proposed model and method. 相似文献

17.

Evaluating functional covariate‐environment interactions in the Cox regression model

Ling Zhou Haoqi Li Huazhen Lin Peter X.‐K. Song 《Revue canadienne de statistique》2019,47(2):204-221

Children exposed to mixtures of endocrine disrupting compounds such as phthalates are at high risk of experiencing significant friction in their growth and sexual maturation. This article is primarily motivated by a study that aims to assess the toxicants‐modified effects of risk factors related to the hazards of early or delayed onset of puberty among children living in Mexico City. To address the hypothesis of potential nonlinear modification of covariate effects, we propose a new Cox regression model with multiple functional covariate‐environment interactions, which allows covariate effects to be altered nonlinearly by mixtures of exposed toxicants. This new class of models is rather flexible and includes many existing semiparametric Cox models as special cases. To achieve efficient estimation, we develop the global partial likelihood method of inference, in which we establish key large‐sample results, including estimation consistency, asymptotic normality, semiparametric efficiency and the generalized likelihood ratio test for both parameters and nonparametric functions. The proposed methodology is examined via simulation studies and applied to the analysis of the motivating data, where maternal exposures to phthalates during the third trimester of pregnancy are found to be important risk modifiers for the age of attaining the first stage of puberty. The Canadian Journal of Statistics 47: 204–221; 2019 © 2019 Statistical Society of Canada 相似文献

18.

Pattern–mixture and selection models for analysing longitudinal data with monotone missing patterns

Jolene Birmingham rea Rotnitzky Garrett M. Fitzmaurice 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2003,65(1):275-297

Summary. We examine three pattern–mixture models for making inference about parameters of the distribution of an outcome of interest Y that is to be measured at the end of a longitudinal study when this outcome is missing in some subjects. We show that these pattern–mixture models also have an interpretation as selection models. Because these models make unverifiable assumptions, we recommend that inference about the distribution of Y be repeated under a range of plausible assumptions. We argue that, of the three models considered, only one admits a parameterization that facilitates the examination of departures from the assumption of sequential ignorability. The three models are nonparametric in the sense that they do not impose restrictions on the class of observed data distributions. Owing to the curse of dimensionality, the assumptions that are encoded in these models are sufficient for identification but not for inference. We describe additional flexible and easily interpretable assumptions under which it is possible to construct estimators that are well behaved with moderate sample sizes. These assumptions define semiparametric models for the distribution of the observed data. We describe a class of estimators which, up to asymptotic equivalence, comprise all the consistent and asymptotically normal estimators of the parameters of interest under the postulated semiparametric models. We illustrate our methods with the analysis of data from a randomized clinical trial of contracepting women. 相似文献

19.

Semiparametric methods for response-selective and missing data problems in regression

J. F. Lawless J. D. Kalbfleisch & C. J. Wild 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1999,61(2):413-438

Suppose that data are generated according to the model f ( y | x ; θ ) g ( x ), where y is a response and x are covariates. We derive and compare semiparametric likelihood and pseudolikelihood methods for estimating θ for situations in which units generated are not fully observed and in which it is impossible or undesirable to model the covariate distribution. The probability that a unit is fully observed may depend on y , and there may be a subset of covariates which is observed only for a subsample of individuals. Our key assumptions are that the probability that a unit has missing data depends only on which of a finite number of strata that ( y , x ) belongs to and that the stratum membership is observed for every unit. Applications include case–control studies in epidemiology, field reliability studies and broad classes of missing data and measurement error problems. Our results make fully efficient estimation of θ feasible, and they generalize and provide insight into a variety of methods that have been proposed for specific problems. 相似文献

20.

The Copula Information Criteria

Steffen Grønneberg Nils Lid Hjort 《Scandinavian Journal of Statistics》2014,41(2):436-459

We derive two types of Akaike information criterion (AIC)‐like model‐selection formulae for the semiparametric pseudo‐maximum likelihood procedure. We first adapt the arguments leading to the original AIC formula, related to empirical estimation of a certain Kullback–Leibler information distance. This gives a significantly different formula compared with the AIC, which we name the copula information criterion. However, we show that such a model‐selection procedure cannot exist for copula models with densities that grow very fast near the edge of the unit cube. This problem affects most popular copula models. We then derive what we call the cross‐validation copula information criterion, which exists under weak conditions and is a first‐order approximation to exact cross validation. This formula is very similar to the standard AIC formula but has slightly different motivation. A brief illustration with real data is given. 相似文献