This article discusses estimation of the parameter function in a functional linear regression model under heavy-tailed error distributions and in the presence of outliers. Standard approaches to reducing the high dimensionality inherent in functional data are considered. After the functional model is reduced to a standard multiple linear regression model, a weighted rank-based procedure is used to estimate the regression parameters. A Monte Carlo simulation and a real-world example illustrate the performance of the proposed estimator and compare it with the least-squares and least absolute deviation estimators.
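
The reduction step described above can be sketched as follows. This is a minimal illustration only: the abstract does not say which basis is used, so the Fourier basis on [0,1], the trapezoidal quadrature, and the names `fourier_basis` and `basis_scores` are all assumptions. Each curve is replaced by a short vector of basis scores, which can then serve as ordinary regressors.

```python
import math


def trapezoid(values, grid):
    """Trapezoidal approximation of the integral of sampled values over grid."""
    return sum(
        0.5 * (values[j] + values[j + 1]) * (grid[j + 1] - grid[j])
        for j in range(len(grid) - 1)
    )


def fourier_basis(k, t):
    """Hypothetical choice of basis: orthonormal Fourier functions on [0,1].

    k = 1, 2, 3, ... alternates sine and cosine terms of increasing frequency.
    """
    freq = (k + 1) // 2
    if k % 2 == 1:
        return math.sqrt(2) * math.sin(2 * math.pi * freq * t)
    return math.sqrt(2) * math.cos(2 * math.pi * freq * t)


def basis_scores(curve, grid, n_basis):
    """Project one sampled curve onto the first n_basis basis functions."""
    return [
        trapezoid([x * fourier_basis(k, t) for x, t in zip(curve, grid)], grid)
        for k in range(1, n_basis + 1)
    ]


# Example: a curve equal to the first basis function has scores close to (1, 0, 0).
grid = [j / 200 for j in range(201)]
curve = [math.sqrt(2) * math.sin(2 * math.pi * t) for t in grid]
scores = basis_scores(curve, grid, 3)
print(scores)
```

With the scores in hand, the scalar response can be regressed on them by any multiple regression estimator; the weighted rank-based procedure of the article is not reproduced here.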

In this paper the most commonly used diagnostic criteria for the identification of outliers or leverage points in the ordinary regression model are reviewed. Their use in the context of the errors-in-variables (e.v.) linear model is discussed and evidence is given that under the e.v. model assumptions the distinction between outliers and leverage points no longer exists.

This article studies the computational problem of estimating the parameters of a linear mixed model for massive data. Our algorithms combine the factored spectrally transformed linear mixed model method with a sequential singular value decomposition algorithm. This combination overcomes the operational limitation of the method and makes the algorithm feasible for big datasets, especially when the data have a tall and thin design matrix. Our simulation studies show that our algorithms make fitting linear mixed models to massive data feasible on an ordinary desktop and achieve the same estimation accuracy as the method based on the whole data.

E. Brunel, A. Roche. Statistics, 2015, 49(6): 1298-1321
Our aim is to estimate the unknown slope function in the functional linear model when the response Y is real and the random function X is a second-order stationary and periodic process. We obtain our estimator by minimizing a standard (and very simple) mean-square contrast on finite-dimensional linear spaces spanned by trigonometric bases. Our approach provides a penalization procedure that automatically selects an adequate dimension, from a non-asymptotic point of view. In fact, we show that our penalized estimator reaches the optimal (minimax) rate of convergence in the sense of the prediction error. We complete the theoretical results with a simulation study and a real example that illustrates how the procedure works in practice.

The construction of confidence sets for the parameters of a flexible simple linear regression model for interval-valued random sets is addressed. For that purpose, the asymptotic distribution of the least-squares estimators is analyzed. A simulation study is conducted to investigate the performance of those confidence sets. In particular, the empirical coverages are examined for various interval linear models. The applicability of the procedure is illustrated by means of a real-life case study.

In practice, it is not uncommon for a discrete response to be related to both a functional random variable and multiple real-valued random variables whose impact on the response is nonlinear. In this paper, we consider the generalized partial functional linear additive models (GPFLAM) and present the estimation procedure. In GPFLAM, the nonparametric functions are approximated by polynomial splines and the infinite-dimensional slope function is estimated based on principal component basis function approximations. We obtain the estimator by maximizing the quasi-likelihood function. We investigate the finite sample properties of the estimation procedure via Monte Carlo simulation studies and illustrate our proposed model with a real data analysis.

When studying associations between a functional covariate and a scalar response using a functional linear model (FLM), scientific knowledge may indicate possible monotonicity of the unknown parameter curve. In this context, we propose an F-type test of monotonicity, based on a full versus reduced nested model structure, where the reduced model with a monotonically constrained parameter curve is nested within an unconstrained FLM. For estimation under the unconstrained FLM, we consider two approaches: penalised least-squares and linear mixed model effects estimation. We use a smooth-then-monotonise approach to estimate the reduced model, within the null space of monotone parameter curves. A bootstrap procedure is used to simulate the null distribution of the test statistic. We present a simulation study of the power of the proposed test, and illustrate the test using data from a head and neck cancer study.


In this paper, we extend a variance shift model, previously considered for linear mixed models, to linear mixed measurement error models using the corrected likelihood of Nakamura (1990, Biometrika 77:127-137). This model assumes that a single outlier arises from an observation with inflated variance. We derive the score test and the analogue of the likelihood ratio test to assess whether the ith observation has inflated variance. A parametric bootstrap procedure is implemented to obtain empirical distributions of the test statistics. Finally, results of a simulation study and a real data example are presented to illustrate the performance of the proposed tests.

In this paper, we consider empirical likelihood inference for the partial functional linear model with missing responses. Two empirical log-likelihood ratios for the parameters of interest are constructed, and the corresponding maximum empirical likelihood estimators of the parameters are derived. Under some regularity conditions, we show that the two proposed empirical log-likelihood ratios are asymptotically standard chi-squared. Thus, the asymptotic results can be used to construct confidence intervals/regions for the parameters of interest. We also establish the asymptotic distribution theory of the corresponding maximum empirical likelihood estimators. A simulation study indicates that the proposed methods are comparable in terms of coverage probabilities and average lengths of confidence intervals. A real data example is also used to illustrate our proposed methods.

We empirically illustrate how concepts and methods involved in a grade of membership (GoM) analysis can be used to sort individuals by competence. Our study relies on a data set compiled from the international survey on higher education graduates called REFLEX. We focus on the subset of data related to the perception of one's own competencies. It is first decomposed into fuzzy clusters that form a hierarchical fuzzy partition. Then, we calculate a scalar measure of competencies for each fuzzy cluster, and subsequently use the individual GoM scores to combine cluster-based competencies to position individuals on a scale from 0 to 1.
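
One plausible reading of the final combination step is a membership-weighted average of the cluster-level measures. The abstract does not give the exact formula, so the sketch below is an assumption, and the function name `competence_position` and the example numbers are hypothetical. If the GoM scores of an individual are non-negative and sum to 1, and each cluster measure lies in [0, 1], the result also lies in [0, 1].

```python
def competence_position(gom_scores, cluster_competence):
    """Membership-weighted average of cluster competence measures.

    Assumed formula (not taken from the article): sum_k g_k * c_k, where
    g_k are the individual's GoM scores and c_k the cluster-level measures.
    """
    if len(gom_scores) != len(cluster_competence):
        raise ValueError("one GoM score is needed per cluster")
    if any(g < 0 for g in gom_scores) or abs(sum(gom_scores) - 1.0) > 1e-9:
        raise ValueError("GoM scores must be non-negative and sum to 1")
    if any(c < 0 or c > 1 for c in cluster_competence):
        raise ValueError("cluster measures must lie in [0, 1]")
    return sum(g * c for g, c in zip(gom_scores, cluster_competence))


# Hypothetical individual belonging mostly to the first of three fuzzy clusters.
position = competence_position([0.5, 0.3, 0.2], [0.9, 0.5, 0.2])
print(position)
```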

The research described herein was motivated by a study of the relationship between the performance of students in senior high schools and at universities in China. A special linear structural equation model is established, in which some parameters are known and both the responses and the covariables are measured with errors. To explore the relationship between the true responses and latent covariables and to estimate the parameters, we suggest a non-iterative estimation approach that can account for the external dependence between the true responses and latent covariables. This approach can also deal with the collinearity problem because the use of dimension-reduction techniques can remove redundant variables. Combining this with the information that some of the parameters are given, we can estimate the other unknown parameters. An easily implemented algorithm is provided. A simulation is carried out to provide evidence of the performance of the approach and to compare it with existing methods. The approach is applied to the education example for illustration, and it can be readily extended to more general models.

I suggest an extension of the semiparametric transformation model that specifies a time-varying regression structure for the transformation, and thus allows time-varying structure in the data. Special cases include a stratified version of the usual semiparametric transformation model. The model can be thought of as specifying a first order Taylor expansion of a completely flexible baseline. Large sample properties are derived and estimators of the asymptotic variances of the regression coefficients are given. The method is illustrated by a worked example and a small simulation study. A goodness of fit procedure for testing if the regression effects lead to a satisfactory fit is also suggested.

In this article, we propose a novel approach to fit a functional linear regression in which both the response and the predictor are functions. We consider the case where the response and the predictor processes are both sparsely sampled at random time points and are contaminated with random errors. In addition, the random times are allowed to be different for the measurements of the predictor and the response functions. This situation often occurs in longitudinal data settings. To estimate the covariance and the cross‐covariance functions, we use a regularization method over a reproducing kernel Hilbert space. The estimate of the cross‐covariance function is used to obtain estimates of the regression coefficient function and of the functional singular components. We derive the convergence rates of the proposed cross‐covariance, regression coefficient, and singular component function estimators. Furthermore, we show that, under some regularity conditions, the estimator of the coefficient function has a minimax optimal rate. We conduct a simulation study and demonstrate the merits of the proposed method by comparing it with existing methods in the literature. We illustrate the method with an application to a real‐world air quality dataset. The Canadian Journal of Statistics 47: 524–559; 2019 © 2019 Statistical Society of Canada

Cluster analysis, in which homogeneous subgroups are identified in a heterogeneous population, is one of the most widely used methods in statistical analysis. Because mixed continuous and discrete data arise in many applications, some ordinary clustering methods, such as hierarchical methods, k-means and model-based methods, have been extended to the analysis of mixed data. However, in the available model-based clustering methods, the number of parameters grows with the number of continuous variables, and identifying and fitting an appropriate model may be difficult. In this paper, to reduce the number of parameters, a set of parsimonious models is introduced for model-based clustering of mixed continuous (normal) and nominal data. Models in this set are extended using the general location model approach to model the distribution of mixed variables and by applying a factor analyzer structure to the covariance matrices. The ECM algorithm is used to estimate the parameters of these models. To show the performance of the proposed models for clustering, results from simulation studies and the analysis of two real data sets are presented.

Abrupt changes often occur in environmental and financial time series. Most often, these changes are due to human intervention. Change point analysis is a statistical tool used to analyze sudden changes in observations along a time series. In this paper, we propose a Bayesian model for extreme values for environmental and economic datasets that present typical change point behavior. The proposed model addresses the situation in which more than one change point can occur in a time series. Because maxima are analyzed, the distribution of each regime is a generalized extreme value distribution. In this model, the change points are unknown and are treated as parameters to be estimated. Simulations of extremes with two change points showed that the proposed algorithm can recover the true values of the parameters, in addition to detecting the true change points in different configurations. The number of change points was also a problem to be considered, and the Bayesian estimation identified the correct number of change points for each application. Environmental and financial data were analyzed, and the results showed the importance of considering the change point in the data and revealed that this change of regime brought about an increase in the return levels, increasing the number of floods in cities around the rivers. Stock market levels showed the necessity of a model with three different regimes.

Overdispersion is a problem encountered in the analysis of count data that can lead to invalid inference if unaddressed. The decision about whether data are overdispersed is often reached by checking whether the ratio of the Pearson chi-square statistic to its degrees of freedom is greater than one; however, there is currently no fixed threshold for declaring the need for statistical intervention. We consider simulated cross-sectional and longitudinal datasets containing varying magnitudes of overdispersion caused by outliers or zero inflation, as well as real datasets, to determine an appropriate threshold value of this statistic which indicates when overdispersion should be addressed.
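
The ratio mentioned above can be computed as in the following minimal sketch, assuming a Poisson model, for which the variance equals the mean so that each Pearson term is (y - mu)^2 / mu. The function name `pearson_dispersion` and the toy data are hypothetical; the fitted means here come from an intercept-only model (every fitted value is the sample mean), which uses one parameter.

```python
def pearson_dispersion(counts, fitted, n_params):
    """Pearson chi-square statistic divided by its residual degrees of freedom.

    Assumes a Poisson variance function, so each term is (y - mu)^2 / mu.
    """
    if len(counts) != len(fitted):
        raise ValueError("counts and fitted must have the same length")
    df = len(counts) - n_params
    if df <= 0:
        raise ValueError("not enough observations for the number of parameters")
    chi_square = sum((y - mu) ** 2 / mu for y, mu in zip(counts, fitted))
    return chi_square / df


# Toy data with an intercept-only Poisson fit: fitted mean is the sample mean.
counts = [0, 1, 2, 3, 4]
mean = sum(counts) / len(counts)
ratio = pearson_dispersion(counts, [mean] * len(counts), n_params=1)
print(ratio)  # 1.25
```

A value above one suggests more variability than the Poisson model implies; how far above one it must be before intervention is warranted is the question the article studies, and no threshold is assumed here.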

In this paper, we investigate testing for serial correlation in a linear model with validation data. We apply the empirical likelihood method to construct the test statistic and derive its asymptotic distribution under the null hypothesis. Simulation results show that our method performs well in both size and power for finite sample sizes.

We consider a functional linear model where the explanatory variables are known stochastic processes taking values in a Hilbert space; the main example is given by Gaussian processes in L^2([0,1]). We propose estimators of the Sobol indices in this functional linear model. Our estimators are based on U-statistics. We prove the asymptotic normality and the efficiency of our estimators, and we compare them from a theoretical and practical point of view with classical estimators of Sobol indices.
