期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Functional quasi-likelihood regression models with smooth random effects

Jeng-Min Chiou Hans-Georg Müller Jane-Ling Wang 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2003,65(2):405-423

Summary. We propose a class of semiparametric functional regression models to describe the influence of vector-valued covariates on a sample of response curves. Each observed curve is viewed as the realization of a random process, composed of an overall mean function and random components. The finite dimensional covariates influence the random components of the eigenfunction expansion through single-index models that include unknown smooth link and variance functions. The parametric components of the single-index models are estimated via quasi-score estimating equations with link and variance functions being estimated nonparametrically. We obtain several basic asymptotic results. The functional regression models proposed are illustrated with the analysis of a data set consisting of egg laying curves for 1000 female Mediterranean fruit-flies (medflies). 相似文献

2.

Quasi-Likelihood Regression with Multiple Indices and Smooth Link and Variance Functions

Jeng-Min Chiou Hans-Georg Müller 《Scandinavian Journal of Statistics》2004,31(3):367-386

Abstract. A flexible semi-parametric regression model is proposed for modelling the relationship between a response and multivariate predictor variables. The proposed multiple-index model includes smooth unknown link and variance functions that are estimated non-parametrically. Data-adaptive methods for automatic smoothing parameter selection and for the choice of the number of indices M are considered. This model adapts to complex data structures and provides efficient adaptive estimation through the variance function component in the sense that the asymptotic distribution is the same as if the non-parametric components are known. We develop iterative estimation schemes, which include a constrained projection method for the case where the regression parameter vectors are mutually orthogonal. The proposed methods are illustrated with the analysis of data from a growth bioassay and a reproduction experiment with medflies. Asymptotic properties of the estimated model components are also obtained. 相似文献

3.

Asymptotic results in gamma kernel regression

Jianhong Shi 《统计学通讯:理论与方法》2013,42(12):3489-3509

Abstract

Based on the Gamma kernel density estimation procedure, this article constructs a nonparametric kernel estimate for the regression functions when the covariate are nonnegative. Asymptotic normality and uniform almost sure convergence results for the new estimator are systematically studied, and the finite performance of the proposed estimate is discussed via a simulation study and a comparison study with an existing method. Finally, the proposed estimation procedure is applied to the Geyser data set. 相似文献

4.

Assessing the performance of variational methods for mixed logistic regression models

《Journal of Statistical Computation and Simulation》2012,82(8):765-779

We present a variational estimation method for the mixed logistic regression model. The method is based on a lower bound approximation of the logistic function [Jaakkola, J.S. and Jordan, M.I., 2000, Bayesian parameter estimation via variational methods. Statistics & Computing, 10, 25–37.]. Based on the approximation, an EM algorithm can be derived that results in a considerable simplification of the maximization problem in that it does not require the numerical evaluation of integrals over the random effects. We assess the performance of the variational method for the mixed logistic regression model in a simulation study and an empirical data example, and compare it to Laplace's method. The results indicate that the variational method is a viable choice for estimating the fixed effects of the mixed logistic regression model under the condition that the number of outcomes within each cluster is sufficiently high. 相似文献

5.

Variable selection for semiparametric varying coefficient partially linear model based on modal regression with missing data

Yafeng Xia Yarong Qu Nailing Sun 《统计学通讯:理论与方法》2013,42(20):5121-5137

Abstract

In this article, we focus on the variable selection for semiparametric varying coefficient partially linear model with response missing at random. Variable selection is proposed based on modal regression, where the non parametric functions are approximated by B-spline basis. The proposed procedure uses SCAD penalty to realize variable selection of parametric and nonparametric components simultaneously. Furthermore, we establish the consistency, the sparse property and asymptotic normality of the resulting estimators. The penalty estimation parameters value of the proposed method is calculated by EM algorithm. Simulation studies are carried out to assess the finite sample performance of the proposed variable selection procedure. 相似文献

6.

Decomposition of Prediction Error in Multilevel Models

D. Afshartous J. de Leeuw 《统计学通讯:模拟与计算》2013,42(4):909-928

ABSTRACT

We present a decomposition of prediction error for the multilevel model in the context of predicting a future observable y _*j in the jth group of a hierarchical dataset. The multilevel prediction rule is used for prediction and the components of prediction error are estimated via a simulation study that spans the various combinations of level-1 (individual) and level-2 (group) sample sizes and different intraclass correlation values. Additionally, analytical results present the increase in predicted mean square error (PMSE) with respect to prediction error bias. The components of prediction error provide information with respect to the cost of parameter estimation versus data imputation for predicting future values in a hierarchical data set. Specifically, the cost of parameter estimation is very small compared to data imputation. 相似文献

7.

Bayesian Estimation of the Number of Change Points in Simple Linear Regression Models

Tsai-Hung Fan Kuo-Ching Chang Chung-Bow Lee 《统计学通讯:理论与方法》2013,42(4):689-710

A Bayesian approach is considered to detect the number of change points in simple linear regression models. A normal-gamma empirical prior for the regression parameters based on maximum likelihood estimator (MLE) is employed in the analysis. Under mild conditions, consistency for the number of change points and boundedness between the estimated location and the true location of the change points are established. The Bayesian approach to the detection of the number of change points is suitable whether the switching simple regression is continuous or discontinuous. Some simulation results are given to confirm the accuracy of the proposed estimator. 相似文献

8.

The Extremal Dependence Measure and Asymptotic Independence

《随机性模型》2013,29(2):205-227

Abstract

Extremal dependence analysis assesses the tendency of large values of components of a random vector to occur simultaneously. This kind of dependence information can be qualitatively different than what is given by correlation which averages over the total body of the joint distribution. Also, correlation may be completely inappropriate for heavy tailed data. We study the extremal dependence measure (EDM), a measure of the tendency of large values of components of a random vector to occur simultaneously and show consistency of an estimator of the EDM. We also show asymptotic normality of an idealized estimator in a restricted case of multivariate regular variation where scaling functions do not have to be estimated. 相似文献

9.

Regression with an infinite number of observations applied to estimating the parameters of the stable distribution using the empirical characteristic function

J. Martin Van Zyl 《统计学通讯:理论与方法》2013,42(11):3323-3331

ABSTRACT

The parameters of stable law parameters can be estimated using a regression based approach involving the empirical characteristic function. One approach is to use a fixed number of points for all parameters of the distribution to estimate the characteristic function. In this work the results are derived where all points in an interval is used to estimate the empirical characteristic function, thus least squares estimators of a linear function of the parameters, using an infinite number of observations. It was found that the procedure performs very good in small samples. 相似文献

10.

Invariant Subspaces and Regression

Patrick Lang Ann Gironella Rieken Venema 《统计学通讯:理论与方法》2013,42(3):491-504

相似文献

11.

A semi-parametric cox’s regression model for zero-inflated left-censored time to event data

Roel Braekers Yves Grouwels 《统计学通讯:理论与方法》2013,42(7):1969-1988

Abstract

In some clinical, environmental, or economical studies, researchers are interested in a semi-continuous outcome variable which takes the value zero with a discrete probability and has a continuous distribution for the non-zero values. Due to the measuring mechanism, it is not always possible to fully observe some outcomes, and only an upper bound is recorded. We call this left-censored data and observe only the maximum of the outcome and an independent censoring variable, together with an indicator. In this article, we introduce a mixture semi-parametric regression model. We consider a parametric model to investigate the influence of covariates on the discrete probability of the value zero. For the non-zero part of the outcome, a semi-parametric Cox’s regression model is used to study the conditional hazard function. The different parameters in this mixture model are estimated using a likelihood method. Hereby the infinite dimensional baseline hazard function is estimated by a step function. As results, we show the identifiability and the consistency of the estimators for the different parameters in the model. We study the finite sample behaviour of the estimators through a simulation study and illustrate this model on a practical data example. 相似文献

12.

Treatment Effects With Heterogeneous Externalities

Tiziano Arduini Edoardo Rainone 《商业与经济统计学杂志》2020,38(4):826-838

Abstract

This article proposes a new method for estimating heterogeneous externalities in policy analysis when social interactions take the linear-in-means form. We establish that the parameters of interest can be identified and consistently estimated using specific functions of the share of the eligible population. We also study the finite sample performance of the proposed estimators using Monte Carlo simulations. The method is illustrated using data on the PROGRESA program. We find that more than 50% of the effects of the program on schooling attendance are due to externalities, which are heterogeneous within and between poor and nonpoor households. 相似文献

13.

The asymptotic behaviour of the residual sum of squares in models with multiple break points

Alastair R. Hall Denise R. Osborn Nikolaos Sakkas 《Econometric Reviews》2017,36(6-9):667-698

ABSTRACT

Models with multiple discrete breaks in parameters are usually estimated via least squares. This paper, first, derives the asymptotic expectation of the residual sum of squares and shows that the number of estimated break points and the number of regression parameters affect the expectation differently. Second, we propose a statistic for testing the joint hypothesis that the breaks occur at specified points in the sample. Our analytical results cover models estimated by the ordinary, nonlinear, and two-stage least squares. An application to U.S. monetary policy rejects the assumption that breaks are associated with changes in the chair of the Fed. 相似文献

14.

Sparsity identification in ultra-high dimensional quantile regression models with longitudinal data

Xianli Gao 《统计学通讯:理论与方法》2020,49(19):4712-4736

Abstract

In this paper, we propose a variable selection method for quantile regression model in ultra-high dimensional longitudinal data called as the weighted adaptive robust lasso (WAR-Lasso) which is double-robustness. We derive the consistency and the model selection oracle property of WAR-Lasso. Simulation studies show the double-robustness of WAR-Lasso in both cases of heavy-tailed distribution of the errors and the heavy contaminations of the covariates. WAR-Lasso outperform other methods such as SCAD and etc. A real data analysis is carried out. It shows that WAR-Lasso tends to select fewer variables and the estimated coefficients are in line with economic significance. 相似文献

15.

A new information criterion-based bandwidth selection method for non-parametric regressions

《Journal of Statistical Computation and Simulation》2012,82(17):3446-3455

ABSTRACT

Local linear estimator is a popularly used method to estimate the non-parametric regression functions, and many methods have been derived to estimate the smoothing parameter, or the bandwidth in this case. In this article, we propose an information criterion-based bandwidth selection method, with the degrees of freedom originally derived for non-parametric inferences. Unlike the plug-in method, the new method does not require preliminary parameters to be chosen in advance, and is computationally efficient compared to the cross-validation (CV) method. Numerical study shows that the new method performs better or comparable to existing plug-in method or CV method in terms of the estimation of the mean functions, and has lower variability than CV selectors. Real data applications are also provided to illustrate the effectiveness of the new method. 相似文献

16.

Unsupervised learning of regression mixture models with unknown number of components

《Journal of Statistical Computation and Simulation》2012,82(12):2308-2334

ABSTRACT

We propose a new unsupervised learning algorithm to fit regression mixture models with unknown number of components. The developed approach consists in a penalized maximum likelihood estimation carried out by a robust expectation–maximization (EM)-like algorithm. We derive it for polynomial, spline, and B-spline regression mixtures. The proposed learning approach is unsupervised: (i) it simultaneously infers the model parameters and the optimal number of the regression mixture components from the data as the learning proceeds, rather than in a two-fold scheme as in standard model-based clustering using afterward model selection criteria, and (ii) it does not require accurate initialization unlike the standard EM for regression mixtures. The developed approach is applied to curve clustering problems. Numerical experiments on simulated and real data show that the proposed algorithm performs well and provides accurate clustering results, and confirm its benefit for practical applications. 相似文献

17.

Composite quantile regression for varying-coefficient single-index models

Yan Fan Maozai Tian 《统计学通讯:理论与方法》2013,42(10):3027-3047

ABSTRACT

The varying-coefficient single-index model (VCSIM) is a very general and flexible tool for exploring the relationship between a response variable and a set of predictors. Popular special cases include single-index models and varying-coefficient models. In order to estimate the index-coefficient and the non parametric varying-coefficients in the VCSIM, we propose a two-stage composite quantile regression estimation procedure, which integrates the local linear smoothing method and the information of quantile regressions at a number of conditional quantiles of the response variable. We establish the asymptotic properties of the proposed estimators for the index-coefficient and varying-coefficients when the error is heterogeneous. When compared with the existing mean-regression-based estimation method, our simulation results indicate that our proposed method has comparable performance for normal error and is more robust for error with outliers or heavy tail. We illustrate our methodologies with a real example. 相似文献

18.

Inference in Approximately Sparse Correlated Random Effects Probit Models With Panel Data

《商业与经济统计学杂志》2012,30(1):1-18

Abstract

We propose a simple procedure based on an existing “debiased” l₁-regularized method for inference of the average partial effects (APEs) in approximately sparse probit and fractional probit models with panel data, where the number of time periods is fixed and small relative to the number of cross-sectional observations. Our method is computationally simple and does not suffer from the incidental parameters problems that come from attempting to estimate as a parameter the unobserved heterogeneity for each cross-sectional unit. Furthermore, it is robust to arbitrary serial dependence in underlying idiosyncratic errors. Our theoretical results illustrate that inference concerning APEs is more challenging than inference about fixed and low-dimensional parameters, as the former concerns deriving the asymptotic normality for sample averages of linear functions of a potentially large set of components in our estimator when a series approximation for the conditional mean of the unobserved heterogeneity is considered. Insights on the applicability and implications of other existing Lasso-based inference procedures for our problem are provided. We apply the debiasing method to estimate the effects of spending on test pass rates. Our results show that spending has a positive and statistically significant average partial effect; moreover, the effect is comparable to found using standard parametric methods. 相似文献

19.

Some theoretical results regarding the polygonal distribution

Hien D. Nguyen Geoffrey J. McLachlan 《统计学通讯:理论与方法》2018,47(20):5083-5095

Polygonal distributions are a class of distributions that can be defined via the mixture of triangular distributions over the unit interval. We demonstrate that the densities of polygonal distributions are dense in the class of continuous and concave densities with bounded second derivatives. Furthermore, we prove that polygonal density functions provide O(g^{? 2}) approximations (where g is the number of triangular distribution components), in the supremum distance, to any density function from the hypothesized class. Parametric consistency and Hellinger consistency results for the maximum likelihood (ML) estimator are obtained. A result regarding model selection via penalized ML estimation is proved. 相似文献

20.

Random balanced resampling: A new method for estimating variance components in unbalanced designs

《Journal of Statistical Computation and Simulation》2012,82(3):181-194

Variance components in factorial designs with balanced data are commonly estimated by equating mean squares to expected mean squares. For unbalanced data, the usual extensions of this approach are the Henderson methods, which require formulas that are rather involved. Alternatively, maximum likelihood estimation based on normality has been proposed. Although the algorithm for maximum likelihood is computationally complex, programs exist in some statistical packages. This article introduces a simpler method, that of creating a balanced data set by resampling from the original one. Revised formulas for expected mean squares are presented for the two-way case; they are easily generalized to larger factorial designs. The results of a number of simulation studies indicate that, in certain types of designs, the proposed method has performance advantages over both the Henderson Method I and maximum likelihood estimators. 相似文献