In this article, we derive general matrix formulae for second-order biases of maximum likelihood estimators (MLEs) in a class of heteroscedastic symmetric nonlinear regression models, thus generalizing some results in the literature. This class of regression models includes all symmetric continuous distributions, and has a wide range of practical applications in fields such as engineering, biology, medicine and economics, among others. The availability of distributions whose kurtosis coefficients differ from that of the normal gives more flexibility in the choice of an appropriate distribution, particularly to accommodate outlying and influential observations. We derive a joint iterative process for estimating the mean and dispersion parameters. We also present simulation studies for the biases of the MLEs.

An extension of some standard likelihood-based procedures to heteroscedastic nonlinear regression models under scale mixtures of skew-normal (SMSN) distributions is developed. This novel class of models provides a useful generalization of the heteroscedastic symmetrical nonlinear regression models (Cysneiros et al., 2010), since the random term distributions cover symmetric as well as asymmetric and heavy-tailed distributions such as skew-t, skew-slash, skew-contaminated normal, among others. A simple EM-type algorithm for iteratively computing maximum likelihood estimates of the parameters is presented and the observed information matrix is derived analytically. In order to examine the performance of the proposed methods, some simulation studies are presented to show the robustness of this flexible class against outlying and influential observations and that the maximum likelihood estimates based on the EM-type algorithm have good asymptotic properties. Furthermore, local influence measures and the one-step approximations of the estimates in the case-deletion model are obtained. Finally, an illustration of the methodology is given considering a data set previously analyzed under the homoscedastic skew-t nonlinear regression model.

Observations collected over time are often autocorrelated rather than independent, and sometimes include observations below or above detection limits (i.e. censored values reported as less or more than a level of detection) and/or missing data. Practitioners commonly disregard censored data cases or replace these observations with some function of the limit of detection, which often results in biased estimates. Moreover, parameter estimation can be greatly affected by the presence of influential observations in the data. In this paper we derive local influence diagnostic measures for censored regression models with autoregressive errors of order p (hereafter, AR(p)‐CR models) on the basis of the Q‐function under three useful perturbation schemes. In order to account for censoring in a likelihood‐based estimation procedure for AR(p)‐CR models, we use a stochastic approximation version of the expectation‐maximisation algorithm. The accuracy of the local influence diagnostic measure in detecting influential observations is explored through the analysis of empirical studies. The proposed methods are illustrated using data from a study of total phosphorus concentration that contain left‐censored observations. These methods are implemented in the R package ARCensReg.

In this paper we consider the Capital Asset Pricing Model under Elliptical (symmetric) Distributions. This class of distributions, which contains the normal, t, contaminated normal and power exponential distributions, among others, offers a more flexible framework for modelling asset prices or returns. In order to analyze the sensitivity of the maximum likelihood estimators to possible outliers and/or atypical returns, the local influence method was implemented. The results are illustrated by using a set of shares from companies that trade in the Chilean Stock Market. Our main conclusion is that symmetric distributions having heavier tails than those of the normal distribution, especially the t distribution with small degrees of freedom, show a better fit and allow the reduction of the influence of atypical returns on the maximum likelihood estimators.

Two results for Dθ-optimal designs for nonlinear regression models are shown to follow directly from approximate design theory. The first result considered is one concerning the replication of exact designs with minimum support, first established by Atkinson and Hunter and by M.J. Box in 1968, while the second pertains to a heteroscedastic model introduced by Velilla and Llosa in 1992. An illustrative example is provided.

Conditionally autoregressive (CAR) models are often used to analyze a spatial process observed over a lattice or a set of irregular regions. The neighborhoods within a CAR model are generally formed deterministically using the inter-distances or boundaries between the regions. To accommodate directional variation and inherent anisotropy, a new class of spatial models is proposed that adaptively determines neighbors based on a bivariate kernel using the distances and angles between the centroids of the regions. The newly proposed model generalizes the usual CAR model in the sense that it accounts for adaptively determined weights. Maximum likelihood estimators are derived and simulation studies are presented for the sampling properties of the estimates under the new model, which is compared to the CAR model. Finally, the method is illustrated using a data set on the elevated blood lead levels of children under the age of 72 months observed in Virginia in 2000.

For longitudinal time series data, linear mixed models that contain both random effects across individuals and first-order autoregressive errors within individuals may be appropriate. Some statistical diagnostics based on the models under a proposed elliptical error structure are developed in this work. It is well known that the class of elliptical distributions offers a more flexible framework for modelling since it contains both light- and heavy-tailed distributions. Iterative procedures for the maximum-likelihood estimates of the model parameters are presented. Score tests for the presence of autocorrelation and the homogeneity of autocorrelation coefficients among individuals are constructed. The properties of the test statistics are investigated through Monte Carlo simulations. The local influence method for the models is also given. The analysis of a real data set illustrates the value of the models and diagnostic statistics.

The purpose of this paper is to develop diagnostic analysis for nonlinear regression models (NLMs) under scale mixtures of skew-normal (SMSN) distributions introduced by Garay et al. [Nonlinear regression models based on SMSN distributions. J. Korean Statist. Soc. 2011;40:115–124]. This novel class of models provides a useful generalization of the symmetrical NLM [Vanegas LH, Cysneiros FJA. Assessment of diagnostic procedures in symmetrical nonlinear regression models. Comput. Statist. Data Anal. 2010;54:1002–1016] since the random term distributions cover symmetric as well as asymmetric and heavy-tailed distributions such as the skew-t, skew-slash, skew-contaminated normal distributions, among others. Motivated by the results given in Garay et al. [Nonlinear regression models based on SMSN distributions. J. Korean Statist. Soc. 2011;40:115–124], we present a score test for the homogeneity of the scale parameter, and its properties are investigated through Monte Carlo simulation studies. Furthermore, local influence measures and the one-step approximations of the estimates in the case-deletion model are obtained. The newly developed procedures are illustrated considering a real data set.

A spatial process observed over a lattice or a set of irregular regions is usually modeled using a conditionally autoregressive (CAR) model. The neighborhoods within a CAR model are generally formed using only the inter-distances or boundaries between the regions. To accommodate directional spatial variation, a new class of spatial models is proposed using different weights given to neighbors in different directions. The proposed model generalizes the usual CAR model by accounting for spatial anisotropy. Maximum likelihood estimators are derived and shown to be consistent under some regularity conditions. Simulation studies are presented to evaluate the finite sample performance of the new model as compared to the CAR model. Finally, the method is illustrated using a data set on the crime rates of Columbus, OH and a data set on the elevated blood lead levels of children under the age of 72 months observed in Virginia in 2000.

In some fields, we are forced to work with missing data in multivariate time series. Unfortunately, the data analysis in this context cannot be carried out in the same way as in the case of complete data. To deal with this problem, a Bayesian analysis of multivariate threshold autoregressive models with exogenous inputs and missing data is carried out. In this paper, Markov chain Monte Carlo methods are used to obtain samples from the involved posterior distributions, including threshold values and missing data. In order to identify autoregressive orders, we adapt the Bayesian variable selection method to this class of multivariate processes. The number of regimes is estimated using marginal likelihood or product parameter-space strategies.

Summary: This paper presents a selective survey on panel data methods. The focus is on new developments. In particular, linear multilevel models and specific nonlinear, nonparametric and semiparametric models are at the center of the survey. In contrast to linear models, there are no unified methods for nonlinear approaches. In this case conditional maximum likelihood methods dominate for fixed effects models. Under random effects assumptions it is sometimes possible to employ conventional maximum likelihood methods using Gaussian quadrature to reduce a T-dimensional integral. Alternatives are generalized methods of moments and simulated estimators. If the nonlinear function is not exactly known, nonparametric or semiparametric methods should be preferred. Helpful comments and suggestions from an unknown referee are gratefully acknowledged.
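The quadrature idea mentioned in the survey abstract above can be sketched as follows. This is only an illustration under assumptions that are not in the abstract: a random-intercept logit model, a normally distributed intercept, and a crude 3-point Gauss-Hermite rule (practical work would use many more nodes). The names `expect_normal` and `individual_likelihood` are hypothetical. Conditional on the intercept, the T outcomes of one individual are independent, so the T-dimensional integral collapses to a one-dimensional integral that the rule approximates.

```python
import math

# 3-point Gauss-Hermite rule for integrals of exp(-x^2) * f(x) over the real line.
NODES = [-math.sqrt(1.5), 0.0, math.sqrt(1.5)]
WEIGHTS = [math.sqrt(math.pi) / 6, 2 * math.sqrt(math.pi) / 3, math.sqrt(math.pi) / 6]


def expect_normal(g, sigma):
    """Approximate E[g(b)] for b ~ N(0, sigma^2) via the change of variable b = sqrt(2)*sigma*x."""
    total = sum(w * g(math.sqrt(2) * sigma * x) for x, w in zip(NODES, WEIGHTS))
    return total / math.sqrt(math.pi)


def individual_likelihood(y, xb, sigma):
    """Likelihood of one individual's T binary outcomes in an assumed random-intercept logit.

    y  : list of 0/1 outcomes
    xb : list of linear predictors (hypothetical values, one per period)
    """
    def conditional(b):
        # Given the intercept b, the T outcomes are independent Bernoulli draws.
        p = 1.0
        for y_t, xb_t in zip(y, xb):
            prob = 1.0 / (1.0 + math.exp(-(xb_t + b)))
            p *= prob if y_t == 1 else 1.0 - prob
        return p

    return expect_normal(conditional, sigma)


lik = individual_likelihood([1, 0, 1], [0.2, -0.1, 0.4], sigma=1.0)
```

The 3-point rule integrates polynomials up to degree five exactly against the normal density, which gives a quick sanity check on the weights: the constant function integrates to one and b squared integrates to the variance.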

The class of nonlinear reproductive dispersion mixed models (NRDMMs) is an extension of nonlinear reproductive dispersion models and generalized linear mixed models. This paper discusses the influence analysis of the model based on Laplace approximation. The equivalence of case-deletion models and mean-shift outlier models in NRDMMs is investigated, and some diagnostic measures are proposed via the case-deletion method. We also investigate the assessment of local influence of various perturbation schemes. The proposed method is illustrated with an example.

Maximum likelihood (ML) estimation with spatial econometric models is a long-standing problem that finds application in several areas of economic importance. The problem is particularly challenging in the presence of missing data, since there is an implied dependence between all units, irrespective of whether they are observed or not. Out of the several approaches adopted for ML estimation in this context, that of LeSage and Pace [Models for spatially dependent missing data. J Real Estate Financ Econ. 2004;29(2):233–254] stands out as one of the most commonly used with spatial econometric models due to its ability to scale with the number of units. Here, we review their algorithm, and consider several similar alternatives that are also suitable for large datasets. We compare the methods through an extensive empirical study and conclude that, while the approximate approaches are suitable for large sampling ratios, for small sampling ratios the only reliable algorithms are those that yield exact ML or restricted ML estimates.

Spatial regression models are important tools for many scientific disciplines including economics, business, and social science. In this article, we investigate postmodel selection estimators that apply least squares estimation to the model selected by penalized estimation in high-dimensional regression models with spatial autoregressive errors. We show that by separating the model selection and estimation process, the postmodel selection estimator performs at least as well as the simultaneous variable selection and estimation method in terms of the rate of convergence. Moreover, under perfect model selection, the ℓ2 rate of convergence is the oracle rate of s/n, compared with the convergence rate of √(s log p / n) in the general case. Here, n is the sample size and p, s are the model dimension and number of significant covariates, respectively. We further provide the convergence rate of the estimation error in the form of sup norm, and ideally the rate can reach as fast as √(log s / n).

This paper discusses tests for departures from nominal dispersion in the framework of generalized nonlinear models with varying dispersion and/or additive random effects. We consider two classes of exponential family distributions. The first is discrete exponential family distributions, such as Poisson, binomial, and negative binomial distributions. The second is continuous exponential family distributions, such as normal, gamma, and inverse Gaussian distributions. Correspondingly, we develop a unifying approach and propose several tests for departures from nominal dispersion in the two classes of generalized nonlinear models. The score test statistics are constructed and expressed in simple, easy-to-use matrix formulas, so that the tests can easily be implemented using existing statistical software. The properties of the test statistics are investigated through Monte Carlo simulations.

The method of estimated generalized least squares estimation of multiple response models is extended to the randomly missing data case. This estimation procedure is computationally simple when there are many missing data but the number of distinct patterns of missing data for the response vectors is small. The consistency and asymptotic normality of the proposed estimators are established.

We discuss and evaluate bootstrap algorithms for obtaining confidence intervals for parameters in Generalized Linear Models when the data are correlated. The methods are based on a stratified bootstrap and are suited to correlation occurring within “blocks” of data (e.g., individuals within a family, teeth within a mouth, etc.). Application of the intervals to data from a Dutch follow-up study on preterm infants shows the corroborative usefulness of the intervals, while the intervals are seen to be a powerful diagnostic in studying annual measles data. In a simulation study, we compare the coverage rates of the proposed intervals with existing methods (e.g., via Generalized Estimating Equations). In most cases, the bootstrap intervals are seen to perform better than current methods, and are produced in an automatic fashion, so that the user need not know (or have to guess) the dependence structure within a block.
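As a rough illustration of resampling correlated data by block, the sketch below draws whole blocks with replacement and forms a percentile interval. It is not the stratified algorithm of the abstract above, whose details are not given here; the statistic is a plain mean rather than a Generalized Linear Model coefficient, and the function name `block_bootstrap_ci` and the data are hypothetical.

```python
import random
from statistics import mean


def block_bootstrap_ci(blocks, stat, n_boot=2000, alpha=0.05, seed=0):
    """Percentile confidence interval for stat, resampling whole blocks with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Draw blocks, not single observations, so within-block dependence is kept intact.
        sample = [rng.choice(blocks) for _ in blocks]
        pooled = [y for block in sample for y in block]
        estimates.append(stat(pooled))
    estimates.sort()
    lower = estimates[int((alpha / 2) * n_boot)]
    upper = estimates[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Hypothetical data: three correlated measurements per family.
families = [[1.0, 1.2, 0.9], [2.1, 2.0, 2.3], [0.5, 0.7, 0.6], [1.8, 1.6, 1.9]]
low, high = block_bootstrap_ci(families, mean)
```

Because entire blocks are resampled, the user does not have to specify the dependence structure inside a block, which is the practical advantage the abstract points to.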

In this work, we develop some diagnostics for nonlinear regression models with scale mixtures of skew-normal (SMSN) and first-order autoregressive errors. The SMSN distribution class covers symmetric as well as asymmetric and heavy-tailed distributions, and thus offers a more flexible framework for modelling. Maximum-likelihood (ML) estimates are computed via an expectation–maximization-type algorithm. Local influence diagnostics and a score test for the correlation are also derived. The performance of the ML estimates and the test statistic is investigated through Monte Carlo simulations. Finally, a real data set is used to illustrate our diagnostic methods.

Time-series data are often subject to measurement error, usually the result of needing to estimate the variable of interest. Generally, however, the relationship between the surrogate variables and the true variables can be rather complicated compared to the classical additive error structure usually assumed. In this article, we address the estimation of the parameters in autoregressive models in the presence of function measurement errors. We first develop a parameter estimation method with the help of validation data; this estimation method does not depend on the functional form or the distribution of the measurement error. The proposed estimator is proved to be consistent. Moreover, the asymptotic representation and the asymptotic normality of the estimator are also derived. Simulation results indicate that the proposed method works well in practical situations.

When a two-level multilevel model (MLM) is used for repeated growth data, the individuals constitute level 2 and the successive measurements constitute level 1, which is nested within the individuals that make up level 2. The heterogeneity among individuals is represented by either the random-intercept or random-coefficient (slope) model. The variance components at level 1 involve serial effects and measurement errors under constant variance or heteroscedasticity. This study hypothesizes that missing serial effects and/or heteroscedasticity may bias the results obtained from two-level models. To illustrate this effect, we conducted two simulation studies, where the simulated data were based on the characteristics of an empirical mouse tumour data set. The results suggest that for repeated growth data with constant variance (measurement error) and misspecified serial effects (ρ > 0.3), the proportion of level-2 variation (intra-class correlation coefficient) increases with ρ and the two-level random-coefficient model is the minimum AIC (or AICc) model when compared with the fixed model, heteroscedasticity model, and random-intercept model. In addition, when the serial effect (ρ > 0.1) and heteroscedasticity are both misspecified, the two-level random-coefficient model is the minimum AIC (or AICc) model when compared with the fixed model and random-intercept model. This study demonstrates that missing serial effects and/or heteroscedasticity may indicate heterogeneity among individuals in repeated growth data (mixed or two-level MLM). This issue is critical in biomedical research.
