Similar articles
20 similar articles found (search time: 15 ms)
1.
In this paper we consider logspline density estimation for data that may be left-truncated or right-censored. For randomly left-truncated and right-censored data the product-limit estimator is known to be a consistent estimator of the survivor function, having a faster rate of convergence than many density estimators. The product-limit estimator and B-splines are used to construct the logspline density estimate for possibly censored or truncated data. Rates of convergence are established when the log-density function is assumed to be in a Besov space. An algorithm involving a procedure similar to maximum likelihood, stepwise knot addition, and stepwise knot deletion is proposed for the estimation of the density function based upon sample data. Numerical examples are used to show the finite-sample performance of inference based on the logspline density estimation.
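As a rough illustration of the product-limit (Kaplan-Meier) estimator mentioned above, here is a minimal sketch. It assumes right-censored data only, so the left-truncation adjustment and the logspline fit from the abstract are not covered. The function name `product_limit` and the toy data are hypothetical.

```python
from collections import Counter

def product_limit(times, events):
    """Return [(t, S(t))] at each distinct observed event time.

    times  : observed times (event time or censoring time)
    events : 1 if the event was observed, 0 if right-censored
    """
    # Number of observed events at each distinct time.
    deaths = Counter(t for t, e in zip(times, events) if e)
    surv = 1.0
    curve = []
    for t in sorted(deaths):
        # Subjects still under observation just before time t.
        at_risk = sum(1 for u in times if u >= t)
        surv *= 1.0 - deaths[t] / at_risk
        curve.append((t, surv))
    return curve

# Toy data: five subjects, two of them censored (at times 3 and 8).
curve = product_limit([2, 3, 3, 5, 8], [1, 1, 0, 1, 0])
```

On the toy data the survivor estimate steps down to about 0.8, 0.6, and 0.3 at times 2, 3, and 5.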

2.
The author considers the estimation of the common probability density of independent and identically distributed random variables observed with added white noise. She assumes that the unknown density belongs to some class of supersmooth functions, and that the error distribution is ordinarily smooth, meaning that its characteristic function decays polynomially asymptotically. In this context, the author evaluates the minimax rate of convergence of the pointwise risk and describes a kernel estimator having this rate. She computes upper bounds for the L2 risk of this estimator.

3.
In conditional logspline modelling, the logarithm of the conditional density function, log f(y|x), is modelled by using polynomial splines and their tensor products. The parameters of the model (coefficients of the spline functions) are estimated by maximizing the conditional log-likelihood function. The resulting estimate is a density function (positive and integrating to one) and is twice continuously differentiable. The estimate is used further to obtain estimates of regression and quantile functions in a natural way. An automatic procedure for selecting the number of knots and knot locations based on minimizing a variant of the AIC is developed. An example with real data is given. Finally, extensions and further applications of conditional logspline models are discussed.

4.
The problem of nonparametric estimation of a probability density function is studied when the sample observations are contaminated with random noise. Previous authors have proposed estimators which use kernel density and deconvolution techniques. The appearance and properties of the previously proposed estimators are affected by constants Mn and hn which the user may choose. However, the optimal choices of these constants depend on the sample size n, the noise distribution and the unknown distribution which is being estimated. Hence, in practice, Mn and hn are optimally selected as functions of the data. In this paper it is shown that a class of the proposed estimators are uniformly, strongly consistent when Mn and hn are allowed to be random variables. Even when Mn and hn are constants, these results are new findings.

5.
In general, the precise date of onset of pregnancy is unknown and may only be estimated from ultrasound biometric measurements of the embryo. We want to estimate the density of the random variables corresponding to the interval between last menstrual period and true onset of pregnancy. The observations correspond to the variables of interest up to an additive noise. We suggest an estimation procedure based on deconvolution. It requires knowledge of the density of the noise, which is not available. But we have at our disposal another specific sample with replicate observations for twin pregnancies. This allows us both to estimate the noise density and to improve the deconvolution step. Convergence rates of the final estimator are studied and compared with other settings. Our estimator involves a cut-off parameter for which we propose a cross-validation type procedure. Lastly, we estimate the target density in spontaneous pregnancies with an estimate of the noise obtained from replicate observations in twin pregnancies.

6.
Nonparametric density function estimation using sample observations which are contaminated with random noise is studied. The particular form of contamination under consideration is Y = X + Z, where Y is an observable random variable, Z is a random noise variable with known distribution, and X is an absolutely continuous random variable which cannot be observed directly. The finite sample size performance of a strongly consistent estimator for the density function of the random variable X is illustrated for different distributions. The estimator uses Fourier and kernel function estimation techniques and allows the user to choose constants which relate to bandwidth windows and limits on integration and which greatly affect the appearance and properties of the estimates. Numerical techniques for computation of the estimated densities and for optimal selection of the constants are given.
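For orientation, one common textbook form of a Fourier-based deconvolution kernel estimator for the model Y = X + Z is sketched below. It is an assumption for illustration and may differ from the estimator studied in the paper. The symbols $\varphi_K$ (Fourier transform of the kernel), $\varphi_Z$ (characteristic function of the noise, assumed known and nonvanishing), $h_n$ (bandwidth), and $M_n$ (integration limit) are notation introduced here.

```latex
% Independence of X and Z gives \varphi_Y(t) = \varphi_X(t)\,\varphi_Z(t).
\hat{\varphi}_Y(t) = \frac{1}{n}\sum_{j=1}^{n} e^{\mathrm{i} t Y_j},
\qquad
\hat{f}_X(x) = \frac{1}{2\pi}\int_{-M_n}^{M_n}
  e^{-\mathrm{i} t x}\,\varphi_K(h_n t)\,
  \frac{\hat{\varphi}_Y(t)}{\varphi_Z(t)}\,\mathrm{d}t .
```

Dividing by $\varphi_Z(t)$ undoes the convolution with the noise, while the kernel factor and the truncation at $\pm M_n$ damp the high-frequency region where that division amplifies sampling error.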

7.
Grouped data are commonly encountered in applications. All data from a continuous population are grouped due to rounding of the individual observations. The Bernstein polynomial model is proposed as an approximate model in this paper for estimating a univariate density function based on grouped data. The coefficients of the Bernstein polynomial, as the mixture proportions of beta distributions, can be estimated using an EM algorithm. The optimal degree of the Bernstein polynomial can be determined using a change-point estimation method. The rate of convergence of the proposed density estimate to the true density is proved to be almost parametric by an acceptance–rejection argument used for generating random numbers. The proposed method is compared with some existing methods in a simulation study and is applied to the Chicken Embryo Data.
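To make the "mixture of beta distributions" reading concrete, the sketch below evaluates a Bernstein polynomial density of degree m on [0, 1] as a mixture of Beta(i + 1, m - i + 1) densities. It shows only the density evaluation. The EM fit for grouped data and the change-point choice of degree are not attempted, and the name `bernstein_density` is hypothetical.

```python
import math

def bernstein_density(x, weights):
    """Evaluate a Bernstein polynomial density at x in [0, 1].

    weights : mixture proportions p_0..p_m (nonnegative, summing to one);
              component i is the Beta(i + 1, m - i + 1) density, which equals
              (m + 1) * C(m, i) * x**i * (1 - x)**(m - i).
    """
    m = len(weights) - 1
    return sum(
        p * (m + 1) * math.comb(m, i) * x**i * (1 - x) ** (m - i)
        for i, p in enumerate(weights)
    )

# Equal weights recover the uniform density on [0, 1].
uniform_value = bernstein_density(0.3, [0.25] * 4)
```

With equal weights the binomial terms sum to one, so the mixture reduces to the uniform density, which is a convenient sanity check.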

8.
Stochastic differential equations have been shown to be useful in describing random continuous time processes. Biomedical experiments often imply repeated measurements on a series of experimental units and differences between units can be represented by incorporating random effects into the model. When both system noise and random effects are considered, stochastic differential mixed-effects models ensue. This class of models enables the simultaneous representation of randomness in the dynamics of the phenomena being considered and variability between experimental units, thus providing a powerful modelling tool with immediate applications in biomedicine and pharmacokinetic/pharmacodynamic studies. In most cases the likelihood function is not available, and thus maximum likelihood estimation of the unknown parameters is not possible. Here we propose a computationally fast approximated maximum likelihood procedure for the estimation of the non-random parameters and the random effects. The method is evaluated on simulations from some famous diffusion processes and on real data sets.

9.
The Tweedie compound Poisson distribution is a subclass of the exponential dispersion family with a power variance function, in which the value of the power index lies in the interval (1,2). It is well known that the Tweedie compound Poisson density function is not analytically tractable, and numerical procedures that allow the density to be evaluated accurately and quickly did not appear until fairly recently. Unsurprisingly, there has been little statistical literature devoted to full maximum likelihood inference for Tweedie compound Poisson mixed models. To date, the focus has been on estimation methods in the quasi-likelihood framework. Further, Tweedie compound Poisson mixed models involve an unknown variance function, which has a significant impact on hypothesis tests and predictive uncertainty measures. The estimation of the unknown variance function is thus of independent interest in many applications. However, quasi-likelihood-based methods are not well suited to this task. This paper presents several likelihood-based inferential methods for the Tweedie compound Poisson mixed model that enable estimation of the variance function from the data. These algorithms include the likelihood approximation method, in which both the integral over the random effects and the compound Poisson density function are evaluated numerically; and the latent variable approach, in which maximum likelihood estimation is carried out via the Monte Carlo EM algorithm, without the need for approximating the density function. In addition, we derive the corresponding Markov Chain Monte Carlo algorithm for a Bayesian formulation of the mixed model. We demonstrate the use of the various methods through a numerical example, and conduct an array of simulation studies to evaluate the statistical properties of the proposed estimators.

10.
A search design searches for and estimates a few nonzero effects in a large set of effects, along with estimating the elements of a set of unknown parameters. In the presence of noise, the probability of discriminating between the true nonzero effect and an alternative one depends on the design and an unknown parameter, say ρ. We develop a new criterion for design comparison which is independent of ρ and, for a family of density weight functions, show that it discriminates and ranks the designs precisely. This criterion is invariant to the variable noise which may be present between designs due to noise factors. This allows us to extend the design comparison to classes of equivalent designs.

11.
The problem of nonparametric estimation of the spectral density function of a partially observed homogeneous random field is addressed. In particular, a class of estimators with favorable asymptotic performance (bias, variance, rate of convergence) is proposed. The proposed estimators are actually shown to be √N-consistent if the autocovariance function of the random field is supported on a compact set, and close to √N-consistent if the autocovariance function decays to zero sufficiently fast for increasing lags.

12.
In this paper we investigate nonparametric estimation of some functionals of the conditional distribution of a scalar response variable Y given a random variable X taking values in a semi-metric space. These functionals include the regression function, the conditional cumulative distribution, the conditional density and some others. The literature on nonparametric functional statistics concerns only pointwise consistency results, and our main aim is to prove the uniform almost complete convergence (with rate) of the kernel estimators of these nonparametric models. Unlike in standard multivariate cases, the gap between pointwise and uniform results is not immediate. So, suitable topological considerations are needed, implying changes in the rates of convergence which are quantified by entropy considerations. These theoretical uniform consistency results are (or will be) key tools for many further developments in functional data analysis.

13.
The Cramér-von Mises test methodology is applied to build a goodness-of-fit test for the mixed Rasch model. The mixed Rasch model is a probability model of a multivariate discrete random variable driven by an unknown latent continuous variable. The problem of estimation of the unknown fixed difficulty parameters and the latent density function is also considered. The theoretical results are illustrated through simulations and an application to real Quality of Life data.

14.
This paper deals with the problem of predicting a real-valued response variable using explanatory variables containing both a multivariate random variable and a random curve. The proposed functional partial linear single-index model treats the multivariate random variable as the linear part and the random curve as the functional single-index part. To estimate the non-parametric link function, the functional single-index and the parameters in the linear part, a two-stage estimation procedure is proposed. Compared with existing semi-parametric methods, the proposed approach requires no initial estimation and iteration. Asymptotic properties are established for both the parameters in the linear part and the functional single-index. The convergence rate for the non-parametric link function is also given. In addition, asymptotic normality of the error variance is obtained, which facilitates the construction of confidence regions and hypothesis testing for the unknown parameter. Numerical experiments including simulation studies and a real-data analysis are conducted to evaluate the empirical performance of the proposed method.

15.
Methods for estimating a nonparametric regression function are quite common in statistical applications. In this paper, a new Bayesian wavelet thresholding estimation method is considered. New mixture prior distributions for estimating a nonparametric regression function via the wavelet transform are investigated. The reversible jump algorithm is used to obtain the appropriate prior distributions and threshold value. The performance of the proposed estimator is assessed with simulated data from well-known test functions by comparing its convergence rate with that of another estimator, evaluating the average mean square error and standard deviations. Finally, the developed method is applied to estimate the density function of galaxy data.

16.
This paper addresses the problem of probability density estimation in the presence of covariates when data are missing at random (MAR). The inverse probability weighted method is used to define nonparametric and semiparametric weighted probability density estimators. A regression calibration technique is also used to define an imputed estimator. It is shown that all the estimators are asymptotically normal with the same asymptotic variance as that of the inverse probability weighted estimator with known selection probability function and weights. Also, we establish the mean squared error (MSE) bounds and obtain the MSE convergence rates. A simulation is carried out to assess the proposed estimators in terms of the bias and standard error.
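The inverse probability weighting idea can be sketched as a kernel density estimate in which each observed response is weighted by the reciprocal of its selection probability. This is a minimal sketch that assumes a Gaussian kernel and known selection probabilities. The paper's nonparametric and semiparametric estimators of those probabilities are not reproduced, and `ipw_kde` is a hypothetical name.

```python
import math

def ipw_kde(x, ys, observed, probs, h):
    """Inverse-probability-weighted Gaussian kernel density estimate at x.

    ys       : responses (the value is ignored where observed[i] is 0)
    observed : 1 if ys[i] was observed, 0 if it is missing
    probs    : selection probabilities P(observed | covariates), assumed known
    h        : bandwidth (positive)
    """
    n = len(ys)
    total = 0.0
    for y, d, p in zip(ys, observed, probs):
        if d:
            u = (x - y) / h
            # Gaussian kernel value, up-weighted by 1 / selection probability.
            total += math.exp(-0.5 * u * u) / (math.sqrt(2.0 * math.pi) * p)
    return total / (n * h)

# Two subjects, one missing; each had a 50% chance of being observed.
value = ipw_kde(0.0, [0.0, None], [1, 0], [0.5, 0.5], 1.0)
```

Dividing by the full sample size n rather than by the number of observed cases is what lets the 1/p weights compensate for the missing responses.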

17.
Methods for comparing designs for a random (or mixed) linear model have focused primarily on criteria based on single-valued functions. In general, these functions are difficult to use, because of their complex forms, in addition to their dependence on the model's unknown variance components. In this paper, a graphical approach is presented for comparing designs for random models. The one-way model is used for illustration. The proposed approach is based on using quantiles of an estimator of a function of the variance components. The dependence of these quantiles on the true values of the variance components is depicted by plotting the so-called quantile dispersion graphs (QDGs), which provide a comprehensive picture of the quality of estimation obtained with a given design. The QDGs can therefore be used to compare several candidate designs. Two methods of estimation of variance components are considered, namely analysis of variance and maximum-likelihood estimation.

18.
We extend the family of multivariate generalized linear mixed models to include random effects that are generated by smooth densities. We consider two such families of densities, the so-called semi-nonparametric (SNP) and smooth nonparametric (SMNP) densities. Maximum likelihood estimation, under either the SNP or the SMNP densities, is carried out using a Monte Carlo EM algorithm. This algorithm uses rejection sampling and automatically increases the MC sample size as it approaches convergence. In a simulation study we investigate the performance of these two densities in capturing the true underlying shape of the random effects distribution. We also examine the implications of misspecification of the random effects distribution on the estimation of the fixed effects and their standard errors. The impact of the assumed random effects density on the estimation of the random effects themselves is investigated in a simulation study and also in an application to a real data set.

19.
In this paper, we consider partially linear additive models with an unknown link function, which include single-index models and additive models as special cases. We use the polynomial spline method for estimating the unknown link function as well as the component functions in the additive part. We establish that convergence rates for all nonparametric functions are the same as in one-dimensional nonparametric regression. For a faster rate of the parametric part, we need to define an appropriate 'projection' that is more complicated than that defined previously for partially linear additive models. Compared to previous approaches, a distinct advantage of our estimation approach in implementation is that estimation directly reduces to estimation in the single-index model and can thus deal with much larger dimensional problems than previous approaches for additive models with unknown link functions. Simulations and a real dataset are used to illustrate the proposed model.

20.
In this note we consider estimation of a mixture model of count data which is composed of two discrete random variables. Conditional and unconditional estimation procedures are given for estimating the unknown parameter(s) of interest using the likelihood function. Asymptotic relative efficiencies are given to examine the amount of information loss in using the two estimation procedures. Specifically, we study the change in asymptotic relative efficiency, if any, in different parameter settings.
