期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Inference on Polychotomous Responses in Finite Populations

SUMANTA ADHYA TATHAGATA BANERJEE GAURANGADEB CHATTOPADHYAY 《Scandinavian Journal of Statistics》2011,38(4):788-800

Abstract. A model‐based predictive estimator is proposed for the population proportions of a polychotomous response variable, based on a sample from the population and on auxiliary variables, whose values are known for the entire population. The responses for the non‐sample units are predicted using a multinomial logit model, which is a parametric function of the auxiliary variables. A bootstrap estimator is proposed for the variance of the predictive estimator, its consistency is proved and its small sample performance is compared with that of an analytical estimator. The proposed predictive estimator is compared with other available estimators, including model‐assisted ones, both in a simulation study involving different sampling designs and model mis‐specification, and using real data from an opinion survey. The results indicate that the prediction approach appears to use auxiliary information more efficiently than the model‐assisted approach. 相似文献

2.

Response shrinkage estimation in multinomial logit models

Faisal Maqbool Zahid Christian Heumann 《Journal of statistical planning and inference》2012,142(1):95-109

A shrinkage estimation method for multinomial logit models is developed. The proposed method is based on shrinking the responses for each category towards the underlying probabilities. The estimator is also used in combination with Pregibon's resistant fitting. The resulting estimator can also be used to control the over-estimation of Pregibon's resistant estimator. The proposed method handles not only the problem of separation in multinomial logit models but estimates also exist when the number of covariates is large relative to the sample size. Estimates exist even when the MLE does not exist. Estimates can be easily computed with all commonly used statistical packages supporting the fitting procedures with weights. Estimates are compared with the usual MLE and Firth's bias reduction technique in a simulation study and an application. 相似文献

3.

Efficient Estimation of Nested Logit models

David Brownstone Kenneth A. Small 《商业与经济统计学杂志》2013,31(1):67-74

This article examines the sequential, full information maximum likelihood (FIML), and linearized maximum likelihood (LML) estimators for a nested logit model of time-of-day choice for work trips. These estimators are compared using a Monte Carlo study based on specification and data from a previously published empirical study. The sequential estimator is found to be much less efficient than LML or FIML, and its uncorrected second-stage standard-error estimates are strongly downward biased. LML is only slightly less efficient than FIML, but it is often easier to compute. There are cases in which the sequential and LML estimators do not exist, but FIML still performs well. 相似文献

4.

Small area estimation of mean price of habitation transaction using time-series and cross-sectional area-level models

Luìs Nobre Pereira Pedro Simões Coelho 《Journal of applied statistics》2010,37(4):651-666

In this paper, a new small domain estimator for area-level data is proposed. The proposed estimator is driven by a real problem of estimating the mean price of habitation transaction at a regional level in a European country, using data collected from a longitudinal survey conducted by a national statistical office. At the desired level of inference, it is not possible to provide accurate direct estimates because the sample sizes in these domains are very small. An area-level model with a heterogeneous covariance structure of random effects assists the proposed combined estimator. This model is an extension of a model due to Fay and Herriot [5], but it integrates information across domains and over several periods of time. In addition, a modified method of estimation of variance components for time-series and cross-sectional area-level models is proposed by including the design weights. A Monte Carlo simulation, based on real data, is conducted to investigate the performance of the proposed estimators in comparison with other estimators frequently used in small area estimation problems. In particular, we compare the performance of these estimators with the estimator based on the Rao–Yu model [23]. The simulation study also accesses the performance of the modified variance component estimators in comparison with the traditional ANOVA method. Simulation results show that the estimators proposed perform better than the other estimators in terms of both precision and bias. 相似文献

5.

Some New Results on the Multinomial Randomized Response Model

《统计学通讯:理论与方法》2013,42(4):847-856

ABSTRACT

The randomized response technique is an effective survey method designed to elicit sensitive information while ensuring the privacy of the respondents. In this article, we present some new results on the randomization response model in situations wherein one or two response variables are assumed to follow a multinomial distribution. For a single sensitive question, we use the well-known Hopkins randomization device to derive estimates, both under the assumption of truthful and untruthful responses, and present a technique for making pairwise comparisons. When there are two sensitive questions of interest, we derive a Pearson product moment correlation estimator based on the multinomial model assumption. This estimator may be used to quantify the linear relationship between two variables when multinomial response data are observed according to a randomized-response protocol. 相似文献

6.

Estimating multiple-membership logit models with mixed effects: indirect inference versus data cloning

Anna Gottard Giorgio Calzolari 《Journal of Statistical Computation and Simulation》2017,87(12):2334-2348

Multiple-membership logit models with random effects are models for clustered binary data, where each statistical unit can belong to more than one group. The likelihood function of these models is analytically intractable. We propose two different approaches for parameter estimation: indirect inference and data cloning (DC). The former is a non-likelihood-based method which uses an auxiliary model to select reasonable estimates. We propose an auxiliary model with the same dimension of parameter space as the target model, which is particularly convenient to reach good estimates very fast. The latter method computes maximum likelihood estimates through the posterior distribution of an adequate Bayesian model, fitted to cloned data. We implement a DC algorithm specifically for multiple-membership models. A Monte Carlo experiment compares the two methods on simulated data. For further comparison, we also report Bayesian posterior mean and Integrated Nested Laplace Approximation hybrid DC estimates. Simulations show a negligible loss of efficiency for the indirect inference estimator, compensated by a relevant computational gain. The approaches are then illustrated with two real examples on matched paired data. 相似文献

7.

Model selection and parameter estimation of a multinomial logistic regression model

《Journal of Statistical Computation and Simulation》2012,82(7):1412-1426

In the multinomial regression model, we consider the methodology for simultaneous model selection and parameter estimation by using the shrinkage and LASSO (least absolute shrinkage and selection operation) [R. Tibshirani, Regression shrinkage and selection via the LASSO, J. R. Statist. Soc. Ser. B 58 (1996), pp. 267–288] strategies. The shrinkage estimators (SEs) provide significant improvement over their classical counterparts in the case where some of the predictors may or may not be active for the response of interest. The asymptotic properties of the SEs are developed using the notion of asymptotic distributional risk. We then compare the relative performance of the LASSO estimator with two SEs in terms of simulated relative efficiency. A simulation study shows that the shrinkage and LASSO estimators dominate the full model estimator. Further, both SEs perform better than the LASSO estimators when there are many inactive predictors in the model. A real-life data set is used to illustrate the suggested shrinkage and LASSO estimators. 相似文献

8.

The negative multinomial logit model

Douglas G. Bonett 《统计学通讯:理论与方法》2013,42(7):1713-1717

A polychotomous logit model is defined for negative multinomial frequency counts within independent populations. An efficient estimator of the model parameters and estimator covariance matrix is given in closed form. Minimum chi-square and Wald tests are presented. 相似文献

9.

Data augmentation,frequentist estimation,and the Bayesian analysis of multinomial logit models

Steven L. Scott 《Statistical Papers》2011,52(1):87-109

This article describes a convenient method of selecting Metropolis– Hastings proposal distributions for multinomial logit models. There are two key ideas involved. The first is that multinomial logit models have a latent variable representation similar to that exploited by Albert and Chib (J Am Stat Assoc 88:669–679, 1993) for probit regression. Augmenting the latent variables replaces the multinomial logit likelihood function with the complete data likelihood for a linear model with extreme value errors. While no conjugate prior is available for this model, a least squares estimate of the parameters is easily obtained. The asymptotic sampling distribution of the least squares estimate is Gaussian with known variance. The second key idea in this paper is to generate a Metropolis–Hastings proposal distribution by conditioning on the estimator instead of the full data set. The resulting sampler has many of the benefits of so-called tailored or approximation Metropolis–Hastings samplers. However, because the proposal distributions are available in closed form they can be implemented without numerical methods for exploring the posterior distribution. The algorithm converges geometrically ergodically, its computational burden is minor, and it requires minimal user input. Improvements to the sampler’s mixing rate are investigated. The algorithm is also applied to partial credit models describing ordinal item response data from the 1998 National Assessment of Educational Progress. Its application to hierarchical models and Poisson regression are briefly discussed. 相似文献

10.

Using Adjacent-category Logits Procedure for Estimating Receiver Operating Characteristic Surface

Arnaud D. Nze Ossima Mohamed C. Belkacemi Jean-Pierre Daurès 《统计学通讯:模拟与计算》2016,45(3):902-919

This article presents a semiparametric method for estimating receiver operating characteristic surface under density ratio model. The construction of the proposed method is based on the adjacent-category logit model and the empirical likelihood approach. A bootstrap approach for the VUS estimator inference is presented. In a simulation study, the proposed estimator is compared with the existing parametric and nonparametric estimators in terms of bias, standard error, and mean square error. Finally, a real data example and some discussions on the proposed method are provided. 相似文献

11.

Estimation of the Moment Generating Function

Edward E. Gbur Robert A. Collins 《统计学通讯:模拟与计算》2013,42(3):1113-1134

A number of statistical problems use the moment generating function (mgf) for purposes other than determining the moments of a distribution. If the distribution is not completely specified, then the mgf must be estimated from available data. The empirical mgf makes no assumptions concerning the underlying distribution except for the existence of the mgf. In contrast to the nonparametric approach provided by the empirical mgf, alternative estimators can be formed based on an assumed parametric model. Comparison of these approaches is considered for two parametric models; the normal and a one parameter gamma. Comparison criteria are efficiency and empirical confidence interval coverage. In general the parametric estimators outperform the empirical mgf when the model is correct. The comparisons are extended to underlying models which are two component mixtures from the distributional family assumed by the parametric estimators. Under the mixture models the superiority of the parametric estimator depends upon the model, value of the argument of the mgf, and the comparison criterion. The empirical mgf is the better estimator in some cases. 相似文献

12.

Pseudo-bayes estimation of multinomial proportions

James Albert 《统计学通讯:理论与方法》2013,42(16):1587-1611

In the simultaneous estimation of multinomial proportions, two estimators are developed which incorporate prior means and a prior parameter which reflects the accuracy of the prior means. These estimators possess substantially smaller risk than the standard estimator in a region of the parameter space and are much more robust than the conjugate Bayes estimator with respect to parameter values far from the prior mean. When vague prior information is available, these estimators and confidence regions derived from them appear to be attractive alternatives to the procedures based on the standard estimator. 相似文献

13.

Effects of non-normality and mild heteroscedasticity on estimators in regression

Jeffrey B. Birch Doris A. Binkley 《统计学通讯:模拟与计算》2013,42(3):331-354

Several estimators are examined for the simple linear regression model under a controlled, experimental situation with multiple observations at each design point. The model is examined under normal and non-normal error distributions and mild heterogeneity of variances across the chosen design points. We consider the ordinary, generalized, and estimated generalized least squares estimators and several examples of M estimators. The asymptotic properties of the M estimator using the Huber ψ are presented under these conditions for the multiple regression model. A simulation study is also presented which indicates that the M estimator possesses strong robustness properties under the presence of both non-normality and mild heteroscedasticity o£ errors. Finally, the M estimates are compared to the least squares estimates in two examples. 相似文献

14.

Two-parameter ridge estimator in the binary logistic regression

Yasin Asar Aşır Genç 《统计学通讯:模拟与计算》2017,46(9):7088-7099

The binary logistic regression is a commonly used statistical method when the outcome variable is dichotomous or binary. The explanatory variables are correlated in some situations of the logit model. This problem is called multicollinearity. It is known that the variance of the maximum likelihood estimator (MLE) is inflated in the presence of multicollinearity. Therefore, in this study, we define a new two-parameter ridge estimator for the logistic regression model to decrease the variance and overcome multicollinearity problem. We compare the new estimator to the other well-known estimators by studying their mean squared error (MSE) properties. Moreover, a Monte Carlo simulation is designed to evaluate the performances of the estimators. Finally, a real data application is illustrated to show the applicability of the new method. According to the results of the simulation and real application, the new estimator outperforms the other estimators for all of the situations considered. 相似文献

15.

Fixed Effects and Bias Due to the Incidental Parameters Problem in the Tobit Model 总被引：2，自引：0，他引：2

William Greene 《Econometric Reviews》2004,23(2):125-147

The maximum likelihood estimator (MLE) in nonlinear panel data models with fixed effects is widely understood (with a few exceptions) to be biased and inconsistent when T, the length of the panel, is small and fixed. However, there is surprisingly little theoretical or empirical evidence on the behavior of the estimator on which to base this conclusion. The received studies have focused almost exclusively on coefficient estimation in two binary choice models, the probit and logit models. In this note, we use Monte Carlo methods to examine the behavior of the MLE of the fixed effects tobit model. We find that the estimator's behavior is quite unlike that of the estimators of the binary choice models. Among our findings are that the location coefficients in the tobit model, unlike those in the probit and logit models, are unaffected by the “incidental parameters problem.” But, a surprising result related to the disturbance variance emerges instead - the finite sample bias appears here rather than in the slopes. This has implications for estimation of marginal effects and asymptotic standard errors, which are also examined in this paper. The effects are also examined for the probit and truncated regression models, extending the range of received results in the first of these beyond the widely cited biases in the coefficient estimators. 相似文献

16.

Minimum phi-divergence estimators for multinomial logistic regression with complex sample design

Elena Castilla Nirian Martín Leandro Pardo 《AStA Advances in Statistical Analysis》2018,102(3):381-411

This article develops the theoretical framework needed to study the multinomial regression model for complex sample design with pseudo-minimum phi-divergence estimators. The numerical example and the simulation study propose new estimators for the parameter of the logistic regression with overdispersed multinomial distributions for the response variables, the pseudo-minimum Cressie–Read divergence estimators, as well as new estimators for the intra-cluster correlation coefficient. The simulation study shows that the Binder’s method for the intra-cluster correlation coefficient exhibits an excellent performance when the pseudo-minimum Cressie–Read divergence estimator, with \(\lambda =\frac{2}{3}\), is plugged. 相似文献

17.

Estimation of a population size through capture-mark-recapture method: a comparison of various point and interval estimators

《Journal of Statistical Computation and Simulation》2012,82(3):335-354

This article deals with the estimation of a fixed population size through capture-mark-recapture method that gives rise to hypergeometric distribution. There are a few well-known and popular point estimators available in the literature, but no good comprehensive comparison is available about their merits. Apart from the available estimators, an empirical Bayes (EB) estimator of the population size is proposed. We compare all the point estimators in terms of relative bias and relative mean squared error. Next, two new interval estimators – (a) an EB highest posterior distribution interval and (b) a frequentist interval estimator based on a parametric bootstrap method, are proposed. The comparison is then carried among the two proposed interval estimators and interval estimators derived from the currently available estimators in terms of coverage probability and average length (AL). Based on comprehensive numerical results, we rank and recommend the point estimators as well as interval estimators for practical use. Finally, a real-life data set for a green treefrog population is used as a demonstration for all the methods discussed. 相似文献

18.

Clustering Discrete Data Through the Multinomial Mixture Model

J. Portela 《统计学通讯:理论与方法》2013,42(20):3250-3263

In this work, the multinomial mixture model is studied, through a maximum likelihood approach. The convergence of the maximum likelihood estimator to a set with characteristics of interest is shown. A method to select the number of mixture components is developed based on the form of the maximum likelihood estimator. A simulation study is then carried out to verify its behavior. Finally, two applications on real data of multinomial mixtures are presented. 相似文献

19.

Approaches to regression analysis with multiple measurements from individual sampling units

《Journal of Statistical Computation and Simulation》2012,82(3-4):149-175

Application of ordinary least-squares regression to data sets which contain multiple measurements from individual sampling units produces an unbiased estimator of the parameters but a biased estimator of the covariance matrix of the parameter estimates. The present work considers a random coefficient, linear model to deal with such data sets: this model permits many senses in which multiple measurements are taken from a sampling unit, not just when it is measured at several times. Three procedures to estimate the covariance matrix of the error term of the model are considered. Given these, three procedures to estimate the parameters of the model and their covariance matrix are considered; these are ordinary least-squares, generalized least-squares, and an adjusted ordinary least-squares procedure which produces an unbiased estimator of the covariance matrix of the parameters with small samples. These various procedures are compared in simulation studies using three examples from the biological literature. The possibility of testing hypotheses about the vector of parameters is also considered. It is found that all three procedures for regression estimation produce estimators of the parameters with bias of no practical consequence, Both generalized least-squares and adjusted ordinary least-squares generally produce estimators of the covariance matrix of the parameter estimates with bias of no practical consequence, while ordinary least-squares produces a negatively biased estimator. Neither ordinary nor generalized least-squares provide satisfactory hypothesis tests of the vector of parameter estimates. It is concluded that adjusted ordinary least-squares, when applied with either of two of the procedures used to estimate the error coveriance matrix, shows promise for practical application with data sets of the nature considered here. 相似文献

20.

Efficient Estimation of an Additive Quantile Regression Model

YEBIN CHENG JAN G. DE GOOIJER DAWIT ZEROM 《Scandinavian Journal of Statistics》2011,38(1):46-62

Abstract. In this paper, two non‐parametric estimators are proposed for estimating the components of an additive quantile regression model. The first estimator is a computationally convenient approach which can be viewed as a more viable alternative to existing kernel‐based approaches. The second estimator involves sequential fitting by univariate local polynomial quantile regressions for each additive component with the other additive components replaced by the corresponding estimates from the first estimator. The purpose of the extra local averaging is to reduce the variance of the first estimator. We show that the second estimator achieves oracle efficiency in the sense that each estimated additive component has the same variance as in the case when all other additive components were known. Asymptotic properties are derived for both estimators under dependent processes that are strictly stationary and absolutely regular. We also provide a demonstrative empirical application of additive quantile models to ambulance travel times. 相似文献