期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Estimation of Conditional Ranks and Tests of Exogeneity in Nonparametric Nonseparable Models

Frédérique Fève Jean-Pierre Florens Ingrid Van Keilegom 《商业与经济统计学杂志》2018,36(2):334-345

Consider a nonparametric nonseparable regression model Y = ?(Z, U), where ?(Z, U) is strictly increasing in U and U ～ U[0, 1]. We suppose that there exists an instrument W that is independent of U. The observable random variables are Y, Z, and W, all one-dimensional. We construct test statistics for the hypothesis that Z is exogenous, that is, that U is independent of Z. The test statistics are based on the observation that Z is exogenous if and only if V = F_Y|Z(Y|Z) is independent of W, and hence they do not require the estimation of the function ?. The asymptotic properties of the proposed tests are proved, and a bootstrap approximation of the critical values of the tests is shown to be consistent and to work for finite samples via simulations. An empirical example using the U.K. Family Expenditure Survey is also given. As a byproduct of our results we obtain the asymptotic properties of a kernel estimator of the distribution of V, which equals U when Z is exogenous. We show that this estimator converges to the uniform distribution at faster rate than the parametric n^{? 1/2}-rate. 相似文献

2.

Multicovariate-adjusted regression models

《Journal of Statistical Computation and Simulation》2012,82(9):813-827

We introduce multicovariate-adjusted regression (MCAR), an adjustment method for regression analysis, where both the response (Y) and predictors (X ₁, …, X _p) are not directly observed. The available data have been contaminated by unknown functions of a set of observable distorting covariates, Z ₁, …, Z _s, in a multiplicative fashion. The proposed method substantially extends the current contaminated regression modelling capability, by allowing for multiple distorting covariate effects. MCAR is a flexible generalisation of the recently proposed covariate-adjusted regression method, an effective adjustment method in the presence of a single covariate, Z. For MCAR estimation, we establish a connection between the MCAR models and adaptive varying coefficient models. This connection leads to an adaptation of a hybrid backfitting estimation algorithm. Extensive simulations are used to study the performance and limitations of the proposed iterative estimation algorithm. In particular, the bias and mean square error of the proposed MCAR estimators are examined, relative to a baseline and a consistent benchmark estimator. The method is also illustrated with a Pima Indian diabetes data set, where the response and predictors are potentially contaminated by body mass index and triceps skin fold thickness. Both distorting covariates measure aspects of obesity, an important risk factor in type 2 diabetes. 相似文献

3.

Simulations and computations of nonparametric density estimates for the deconvolution problem

《Journal of Statistical Computation and Simulation》2012,82(3-4):145-167

The nonparametric density function estimation using sample observations which are contaminated with random noise is studied. The particular form of contamination under consideration is Y = X + Z, where Y is an observable random variableZ is a random noise variable with known distribution, and X is an absolutely continuous random variable which cannot be observed directly. The finite sample size performance of a strongly consistent estimator for the density function of the random variable X is illustrated for different distributions. The estimator uses Fourier and kernel function estimation techniques and allows the user to choose constants which relate to bandwidth windows and limits on integration and which greatly affect the appearance and properties of the estimates. Numerical techniques for computation of the estimated densities and for optimal selection of the constant are given. 相似文献

4.

Nonparametric two‐step regression estimation when regressors and error are dependent

Jons Pinkse 《Revue canadienne de statistique》2000,28(2):289-300

This paper considers estimation of the function g in the model Y_t = g(X_t ) + ?_t when E(?_t|Xt) ≠ 0 with nonzero probability. We assume the existence of an instrumental variable Z_t that is independent of ?_t, and of an innovation ηt = X_t — E(X_t|Z_t). We use a nonparametric regression of X_t on Z_t to obtain residuals η_t, which in turn are used to obtain a consistent estimator of g. The estimator was first analyzed by Newey, Powell & Vella (1999) under the assumption that the observations are independent and identically distributed. Here we derive a sample mean‐squared‐error convergence result for independent identically distributed observations as well as a uniform‐convergence result under time‐series dependence. 相似文献

5.

Laplace random variables with application to price indices

Saralees Nadarajah 《AStA Advances in Statistical Analysis》2009,93(3):345-369

Laplace distributions are becoming increasingly popular models in economics and finance. In this note, the exact distribution of the ratio Z=|X/Y| is derived when X and Y are independent Laplace random variables. This distribution arises when one is interested in comparing the performances of two economic or financial entities. We consider estimation issues of the distribution and illustrate an application for consumer price indices from the six major economics. Several computer programs are given for implementation of the methods used. 相似文献

6.

Consistent deconvolution in density estimation

Luc Devroye 《Revue canadienne de statistique》1989,17(2):235-239

Suppose we have n observations from X = Y + Z, where Z is a noise component with known distribution, and Y has an unknown density f. When the characteristic function of Z is nonzero almost everywhere, we show that it is possible to construct a density estimate f_n such that for all f, Iim_n| |=0. 相似文献

7.

Nonparametric estimation of survival functions for incomplete observations when the life time distribution is proportionally related ot the censoring time distribution

Nader Ebrahimi 《统计学通讯:理论与方法》2013,42(12):2887-2898

Let X₁, X₂…,X_n be a random sample from [ILM0001] and let Y₁, …,Y_n be a random sample from [ILM0002]. Then instead of observing a complete sample X₁,…X_n, we can only observe the pairs Z_i. = min(X_i.,Y_i) and [ILM0003] In this paper, we consider estimation of survival function [ILM0004] when [ILM0005], where β is an unknown positive real number.

相似文献

8.

Kshirsagar-Type Lower Bounds for Mean Squared Error of Prediction

Min Qin 《统计学通讯:理论与方法》2013,42(6):861-872

Let Y be an observable random vector and Z be an unobserved random variable with joint density f(y, z | θ), where θ is an unknown parameter vector. Considering the problem of predicting Z based on Y, we derive Kshirsagar type lower bounds for the mean squared error of any predictor of Z. These bounds do not require the regularity conditions of Bhattacharyya bounds and hence are more widely applicable. Moreover, the new bounds are shown to be sharper than the corresponding Bhattacharyya bounds. The conditions for attaining the new lower bounds are useful for easy derivation of best unbiased predictors, which we illustrate with some examples. 相似文献

9.

Model fitting and inference under latent equilibrium processes

Bhattacharya S Gelfand AE Holsinger KE 《Statistics and Computing》2007,17(2):193-208

This paper presents a methodology for model fitting and inference in the context of Bayesian models of the type f(Y | X,θ)f(X|θ)f(θ), where Y is the (set of) observed data, θ is a set of model parameters and X is an unobserved (latent) stationary stochastic process induced by the first order transition model f(X ^(t+1)|X ^(t),θ), where X ^(t) denotes the state of the process at time (or generation) t. The crucial feature of the above type of model is that, given θ, the transition model f(X ^(t+1)|X ^(t),θ) is known but the distribution of the stochastic process in equilibrium, that is f(X|θ), is, except in very special cases, intractable, hence unknown. A further point to note is that the data Y has been assumed to be observed when the underlying process is in equilibrium. In other words, the data is not collected dynamically over time. We refer to such specification as a latent equilibrium process (LEP) model. It is motivated by problems in population genetics (though other applications are discussed), where it is of interest to learn about parameters such as mutation and migration rates and population sizes, given a sample of allele frequencies at one or more loci. In such problems it is natural to assume that the distribution of the observed allele frequencies depends on the true (unobserved) population allele frequencies, whereas the distribution of the true allele frequencies is only indirectly specified through a transition model. As a hierarchical specification, it is natural to fit the LEP within a Bayesian framework. Fitting such models is usually done via Markov chain Monte Carlo (MCMC). However, we demonstrate that, in the case of LEP models, implementation of MCMC is far from straightforward. The main contribution of this paper is to provide a methodology to implement MCMC for LEP models. We demonstrate our approach in population genetics problems with both simulated and real data sets. The resultant model fitting is computationally intensive and thus, we also discuss parallel implementation of the procedure in special cases. 相似文献

10.

Partial and ecological correlation: a common three-term covariance decomposition

Renato Guseo 《Statistical Methods and Applications》2010,19(1):31-46

相似文献

11.

The linear combination,product and ratio of Laplace random variables

Saralees Nadarajah 《Statistics》2013,47(6):535-545

The distributions of linear combinations, products and ratios of random variables arise in many areas of engineering. In this paper, the exact distributions of the linear combination α X+β Y, the product |X Y| and the ratio |X/Y| are derived when X and Y are independent Laplace random variables. The Laplace distribution, being the oldest model for continuous data, has been one of the most popular models for measurement errors in engineering. 相似文献

12.

Synthetic data method to incorporate external information into a current study

Tian Gu Jeremy M. G. Taylor Wenting Cheng Bhramar Mukherjee 《Revue canadienne de statistique》2019,47(4):580-603

We consider the situation where there is a known regression model that can be used to predict an outcome, Y, from a set of predictor variables X . A new variable B is expected to enhance the prediction of Y. A dataset of size n containing Y, X and B is available, and the challenge is to build an improved model for Y| X ,B that uses both the available individual level data and some summary information obtained from the known model for Y| X . We propose a synthetic data approach, which consists of creating m additional synthetic data observations, and then analyzing the combined dataset of size n + m to estimate the parameters of the Y| X ,B model. This combined dataset of size n + m now has missing values of B for m of the observations, and is analyzed using methods that can handle missing data (e.g., multiple imputation). We present simulation studies and illustrate the method using data from the Prostate Cancer Prevention Trial. Though the synthetic data method is applicable to a general regression context, to provide some justification, we show in two special cases that the asymptotic variances of the parameter estimates in the Y| X ,B model are identical to those from an alternative constrained maximum likelihood estimation approach. This correspondence in special cases and the method's broad applicability makes it appealing for use across diverse scenarios. The Canadian Journal of Statistics 47: 580–603; 2019 © 2019 Statistical Society of Canada 相似文献

13.

Two graphical displays for the detection of potentially influential subsets in regression

Ali S. Hadi 《Journal of applied statistics》1990,17(3):313-327

In the context of the general linear model Y=Xβ+ε, the matrix P_z =Z(Z^TZ)^?1 Z^T , where Z=(X: Y), plays an important role in determining least squares results. In this article we propose two graphical displays for the off-diagonal as well as the diagonal elements of P_Z . The two graphs are based on simple ideas and are useful in the detection of potentially influential subsets of observations in regression. Since P_Z is invariant with respect to permutations of the columns of Z, an added advantage of these graphs is that they can be used to detect outliers in multivariate data where the rows of Z are usually regarded as a random sample from a multivariate population. We also suggest two calibration points, one for the diagonal elements of P_Z and the other for the off-diagonal elements. The advantage of these calibration points is that they take into consideration the variability of the off-diagonal as well as the diagonal elements of P_Z . They also do not suffer from masking. 相似文献

14.

Prediction based on linear combinations of order statistics and bivariate concomitants in the case of multivariate elliptical distributions

《Journal of Statistical Computation and Simulation》2012,82(5):1079-1098

In this paper, by considering a (3n+1) -dimensional random vector (X₀, X^T, Y^T, Z^T)^T having a multivariate elliptical distribution, we derive the exact joint distribution of (X₀, a^TX_(n), b^TY_[n], c^TZ_[n])^T, where a, b, c∈?ⁿ, X_(n)=(X₍₁₎, …, X_(n))^T, X₍₁₎<···<X_(n), is the vector of order statistics arising from X, and Y_[n]=(Y_[1], …, Y_[n])^T and Z_[n]=(Z_[1], …, Z_[n])^T denote the vectors of concomitants corresponding to X_(n) ((Y_[r], Z_[r])^T, for r=1, …, n, is the vector of bivariate concomitants corresponding to X_(r)). We then present an alternate approach for the derivation of the exact joint distribution of (X₀, X_(r), Y_[r], Z_[r])^T, for r=1, …, n. We show that these joint distributions can be expressed as mixtures of four-variate unified skew-elliptical distributions and these mixture forms facilitate the prediction of X_(r), say, based on the concomitants Y_[r] and Z_[r]. Finally, we illustrate the usefulness of our results by a real data. 相似文献

15.

Logistic regression with a partially observed covariate

Dawn W. Blackhurst Mark D. Schluchter 《统计学通讯:模拟与计算》2013,42(1):163-177

We present results of a Monte Carlo study comparing four methods of estimating the parameters of the logistic model logit (pr (Y = 1 | X, Z)) = α₀ + α ₁ X + α ₂ Z where X and Z are continuous covariates and X is always observed but Z is sometimes missing. The four methods examined are 1) logistic regression using complete cases, 2) logistic regression with filled-in values of Z obtained from the regression of Z on X and Y, 3) logistic regression with filled-in values of Z and random error added, and 4) maximum likelihood estimation assuming the distribution of Z given X and Y is normal. Effects of different percent missing for Z and different missing value mechanisms on the bias and mean absolute deviation of the estimators are examined for data sets of N = 200 and N = 400. 相似文献

16.

Empirical Likelihood in a Semi-Parametric Model for Missing Response Data

Lichun Wang Noel Veraverbeke 《统计学通讯:理论与方法》2013,42(4):625-639

Let Y be a response and, given covariate X,Y has a conditional density f(y | x, θ), where θ is a unknown p-dimensional vector of parameters and the marginal distribution of X is unknown. When responses are missing at random, with auxiliary information and imputation, we define an adjusted empirical log-likelihood ratio for the mean of Y and obtain its asymptotic distribution. A simulation study is conducted to compare the adjusted empirical log-likelihood and the normal approximation method in terms of coverage accuracies. 相似文献

17.

CHARACTERIZATIONS OF SHIFT-INVARIANT DISTRIBUTIONS BASED ON SUMMATION MODULO ONE

Roel J.G. Wilms Jan G.F. Thiemann 《Australian & New Zealand Journal of Statistics》1994,36(3):351-354

Let X₁Y_1,…, Y_n be independent random variables. We characterize the distributions of X and Y_j satisfying the equation {X+Y₁++Y_n}=_dX, where {Z} denotes the fractional part of a random variable Z. In the case of full generality, either X is uniformly distributed on [0,1), or Y_j has.a shifted lattice distribution and X is shift-invariant. We also give a characterization of shift-invariant distributions. Finally, we consider some special cases of this equation. 相似文献

18.

A new class of bivariate lifetime distributions

Serkan Eryilmaz 《统计学通讯:理论与方法》2017,46(24):12324-12335

This paper introduces a new class of bivariate lifetime distributions. Let {X_i}_{i ? 1} and {Y_i}_{i ? 1} be two independent sequences of independent and identically distributed positive valued random variables. Define T₁ = min?(X₁, …, X_M) and T₂ = min?(Y₁, …, Y_N), where (M, N) has a discrete bivariate phase-type distribution, independent of {X_i}_{i ? 1} and {Y_i}_{i ? 1}. The joint survival function of (T₁, T₂) is studied. 相似文献

19.

Adaptive Warped Kernel Estimators

下载免费PDF全文

Gaëlle Chagny 《Scandinavian Journal of Statistics》2015,42(2):336-360

In this work, we develop a method of adaptive non‐parametric estimation, based on ‘warped’ kernels. The aim is to estimate a real‐valued function s from a sample of random couples (X,Y). We deal with transformed data (Φ(X),Y), with Φ a one‐to‐one function, to build a collection of kernel estimators. The data‐driven bandwidth selection is performed with a method inspired by Goldenshluger and Lepski (Ann. Statist., 39, 2011, 1608). The method permits to handle various problems such as additive and multiplicative regression, conditional density estimation, hazard rate estimation based on randomly right‐censored data, and cumulative distribution function estimation from current‐status data. The interest is threefold. First, the squared‐bias/variance trade‐off is automatically realized. Next, non‐asymptotic risk bounds are derived. Lastly, the estimator is easily computed, thanks to its simple expression: a short simulation study is presented. 相似文献

20.

Adaptive Deconvolution on the Non‐negative Real Line

下载免费PDF全文

Gwennaëlle Mabon 《Scandinavian Journal of Statistics》2017,44(3):707-740

In this paper, we consider the problem of adaptive density or survival function estimation in an additive model defined by Z=X+Y with X independent of Y, when both random variables are non‐negative. This model is relevant, for instance, in reliability fields where we are interested in the failure time of a certain material that cannot be isolated from the system it belongs. Our goal is to recover the distribution of X (density or survival function) through n observations of Z, assuming that the distribution of Y is known. This issue can be seen as the classical statistical problem of deconvolution that has been tackled in many cases using Fourier‐type approaches. Nonetheless, in the present case, the random variables have the particularity to be supported. Knowing that, we propose a new angle of attack by building a projection estimator with an appropriate Laguerre basis. We present upper bounds on the mean squared integrated risk of our density and survival function estimators. We then describe a non‐parametric data‐driven strategy for selecting a relevant projection space. The procedures are illustrated with simulated data and compared with the performances of a more classical deconvolution setting using a Fourier approach. Our procedure achieves faster convergence rates than Fourier methods for estimating these functions. 相似文献