期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Estimation of scale functions to model heteroscedasticity by regularised kernel-based quantile methods

R. Hable A. Christmann 《Journal of nonparametric statistics》2014,26(2):219-239

A main goal of regression is to derive statistical conclusions on the conditional distribution of the output variable Y given the input values x. Two of the most important characteristics of a single distribution are location and scale. Regularised kernel methods (RKMs) – also called support vector machines in a wide sense – are well established to estimate location functions like the conditional median or the conditional mean. We investigate the estimation of scale functions by RKMs when the conditional median is unknown, too. Estimation of scale functions is important, e.g. to estimate the volatility in finance. We consider the median absolute deviation (MAD) and the interquantile range as measures of scale. Our main result shows the consistency of MAD-type RKMs. 相似文献

2.

A fully nonparametric diagnostic test for homogeneity of variances

Lan Wang Xiao‐Hua Zhou 《Revue canadienne de statistique》2005,33(4):545-558

The authors propose a new nonparametric diagnostic test for checking the constancy of the conditional variance function σ²(x) in the regression model Y_i = m(x_i) + σ(x_i)?_i, i = 1,…, m. Their test, which does not assume a known parametric form for the conditional mean function m(x), is inspired by a recent asymptotic theory in the analysis of variance when the number of factor levels is large. The authors demonstrate through simulations the good finite‐sample properties of the test and illustrate its use in a study on the effect of drug utilization on health care costs. 相似文献

3.

On Semiparametric Mode Regression Estimation

Ali Gannoun Jerome Saracco Keming Yu 《统计学通讯:理论与方法》2013,42(7):1141-1157

It has been found that, for a variety of probability distributions, there is a surprising linear relation between mode, mean, and median. In this article, the relation between mode, mean, and median regression functions is assumed to follow a simple parametric model. We propose a semiparametric conditional mode (mode regression) estimation for an unknown (unimodal) conditional distribution function in the context of regression model, so that any m-step-ahead mean and median forecasts can then be substituted into the resultant model to deliver m-step-ahead mode prediction. In the semiparametric model, Least Squared Estimator (LSEs) for the model parameters and the simultaneous estimation of the unknown mean and median regression functions by the local linear kernel method are combined to infer about the parametric and nonparametric components of the proposed model. The asymptotic normality of these estimators is derived, and the asymptotic distribution of the parameter estimates is also given and is shown to follow usual parametric rates in spite of the presence of the nonparametric component in the model. These results are applied to obtain a data-based test for the dependence of mode regression over mean and median regression under a regression model. 相似文献

4.

Within groups analysis of covariance: multiple comparisons at specified design points using a robust measure location when there is curvature

《Journal of Statistical Computation and Simulation》2012,82(16):3236-3246

Consider the situation where measurements are taken at two different times and let M_j(x) be some conditional robust measure of location associated with the random variable Y at time j, given that some covariate X=x. The goal is to test H₀: M₁(x)=M₂(x) for each x∈ x₁,?…?, x_K such that the probability of one or more Type I errors is less than α, where x₁,?…?, x_K are K specified values of the covariate. The paper reports simulation results comparing two methods aimed at accomplishing this goal without specifying some parametric form for the regression line. The first method is based on a simple modification of the method in Wilcox [Introduction to robust estimation and hypothesis testing. 3rd ed. San Diego, CA: Academic Press; 2012, Section 11.11.1]. The main result here is that the second method, which has never been studied, can have higher power, sometimes substantially so. Data from the Well Elderly 2 study, which motivated this paper, are used to illustrate that the alternative approach can make a practical difference. Here, the estimate of M_j(x) is based in part on either a 20% trimmed mean or the Harrell–Davis quantile estimator, but in principle the more successful method can be used with any robust location estimator. 相似文献

5.

Box-Cox transformed linear models: A parameter-based asymptotic approach

Gemai Chen Richard A. Lockhart 《Revue canadienne de statistique》1997,25(4):517-529

A Box-Cox transformed linear model usually has the form y(λ) = μ + β₁x₁ +… + β_px_p + oe, where y(λ) is the power transform of y. Although widely used in practice, the Fisher information matrix for the unknown parameters and, in particular, its inverse have not been studied seriously in the literature. We obtain those two important matrices to put the Box-Cox transformed linear model on a firmer ground. The question of how to make inference on β = (β₁,…,β_p)^T when λ; is estimated from the data is then discussed for large but finite sample size by studying some parameter-based asymptotics. Both unconditional and conditional inference are studied from the frequentist point of view. 相似文献

6.

A comparison of bayes and maximum likelihood estimation of the intraclass correlation coefficient

Judy L. Palmer Ph.D Lyle D. Broemeling Ph.D 《统计学通讯:理论与方法》2013,42(3):953-975

Two methods of estimating the intraclass correlation coefficient (p) for the one-way random effects model were compared in several simulation experiments using balanced and unbalanced designs. Estimates based on a Bayes approach and a maximum likelihood approach were compared on the basis of their biases (differences between estimates and true values of p) and mean square errors (mean square errors of estimates of p) in each of the simulation experiments. The Bayes approach used the median of a conditional posterior density as its estimator. 相似文献

7.

On the validation of fiducial techniques

Andr Plante 《Revue canadienne de statistique》1979,7(2):217-226

A structured model is essentially a family of random vectors X_θ defined on a probability space with values in a sample space. If, for a given sample value x and for each ω in the probability space, there is at most one parameter value θ for which X_θ(ω) is equal to x, then the model is called additive at x. When a certain conditional distribution exists, a frequency interpretation specific to additive structured models holds, and is summarized in a unique structured distribution for the parameter. Many of the techniques used by Fisher in deriving and handling his fiducial probability distribution are shown to be valid when dealing with a structured distribution. 相似文献

8.

Selection of regress or va riables when E(Y) is an unknown honlinear function

J.R. Green M.F. Al-bayatti 《Statistics》2013,47(1):15-33

We consider the problem of deciding which of a set of p independent variables x₁ X₂J x_s we are to regard as being functionally involved in the mean of a dependent normal random variable Y and estimating E( Y) in terms of the chosen x's. This mean is an unknown function (assumed to be doubly differentiable) of some or all of the x's, so that the problem is of wide relevance. We approximate to the hypersurface in two different ways, and select within each approximation:

(a)For the situation where the mean of Y is assumed to be a linear function of the x's, we use ono of the optimum methods of selection.

(b)More generally, in the space of the X's the function will be approximately linear in a relatively small region. Accordingly this p-dimensional space is subdivided into smaller regions by a clustering procedure, and a hyperplane if fitted with in each region to aproximate to the unknown responce surface.An adaption of an optimum-regressor-selection procedure is then used to assist in the selection of the regressors

Approximate F tests are given to choose between models, including deciding how many x's to retain. Alternatively: the application of Akaike's Extended Maximum Likelihood Principle provides another way of choosing between the models and of selecting regressor variables. The methods are applied to data on glass manufacture. 相似文献

9.

Estimation of the linear-plateau segmented regression model in the presence of measurement error

Scott D. Grimshaw 《统计学通讯:理论与方法》2013,42(8):2399-2413

It is well known that when the true values of the independent variable are unobservable due to measurement error, the least squares estimator for a regression model is biased and inconsistent. When repeated observations on each x_i are taken, consistent estimators for the linear-plateau model can be formed. The repeated observations are required to classify each observation to the appropriate line segment. Two cases of repeated observations are treated in detail. First, when a single value of y_i is observed with the repeated observations of x_i the least squares estimator using the mean of the repeated x_i observations is consistent and asymptotically normal. Second, when repeated observations on the pair (x_i, y_i ) are taken the least squares estimator is inconsistent, but two consistent estimators are proposed: one that consistently estimates the bias of the least squares estimator and adjusts accordingly; the second is the least squares estimator using the mean of the repeated observations on each pair. 相似文献

10.

A synthetic control chart for monitoring the small shifts in a process mean based on an attribute inspection

Wenhui Zhou Na Liu 《统计学通讯:理论与方法》2020,49(9):2189-2204

Abstract

In this paper, a synthetic control chart is proposed by integrating the salient features of the np_x chart and the CRL chart. The synthetic chart achieves higher detection effectiveness on both small and large mean shifts while retaining the operational simplicity of the attribute charts owing to only using attribute inspection. Both statistical and economic design of the synthetic chart are considered and numerical tests have indicated that the synthetic chart has a higher power for detecting mean shifts than the np_x chart, MON chart and CUSUM chart. In addition, sensitivity analyses are also performed under both the statistical and economic design model. 相似文献

11.

Semiparametric Estimators for Limited Dependent Variable (LDV) Models with Endogenous Regressors

Myoung-Jae Lee 《Econometric Reviews》2013,32(2):171-214

This article reviews semiparametric estimators for limited dependent variable (LDV) models with endogenous regressors, where nonlinearity and nonseparability pose difficulties. We first introduce six main approaches in the linear equation system literature to handle endogenous regressors with linear projections: (i) ‘substitution’ replacing the endogenous regressors with their projected versions on the system exogenous regressors x, (ii) instrumental variable estimator (IVE) based on E{(error) × x} = 0, (iii) ‘model-projection’ turning the original model into a model in terms of only x-projected variables, (iv) ‘system reduced form (RF)’ finding RF parameters first and then the structural form (SF) parameters, (v) ‘artificial instrumental regressor’ using instruments as artificial regressors with zero coefficients, and (vi) ‘control function’ adding an extra term as a regressor to control for the endogeneity source. We then check if these approaches are applicable to LDV models using conditional mean/quantiles instead of linear projection. The six approaches provide a convenient forum on which semiparametric estimators in the literature can be categorized, although there are a few exceptions. The pros and cons of the approaches are discussed, and a small-scale simulation study is provided for some reviewed estimators. 相似文献

12.

Conditional residual lifetimes of coherent systems under double monitoring

A. Parvardeh N. Balakrishnan Azam Arshadipour 《统计学通讯:理论与方法》2017,46(7):3401-3410

In this paper, we obtain a mixture representation for the reliability function of the conditional residual lifetime of a coherent system with n independent and identically distributed (i.i.d.) components under double monitoring. We suppose that at time t₁, j components have failed while at time t₂ the system is still alive. Based on these mixture representation, we then study stochastic comparisons of the conditional residual lifetimes of two coherent systems with independent and identical components. 相似文献

13.

Moments of Order Statistics from Weibull Distribution in the Presence of Multiple Outliers

Khalaf S. Sultan Mohamed E. Moshref 《统计学通讯:理论与方法》2014,43(10-12):2214-2226

In this article, we derive exact expressions for the single and product moments of order statistics from Weibull distribution under the contamination model. We assume that X₁, X₂, …, X_{n ? p} are independent with density function f(x) while the remaining, p observations (outliers) X_{n ? p + 1}, …, X_n are independent with density function arises from some modified version of f(x), which is called g(x), in which the location and/or scale parameters have been shifted in value. Next, we investigate the effect of the outliers on the BLUE of the scale parameter. Finally, we deduce some special cases. 相似文献

14.

Simultaneous estimation of several CDF’s: homogeneity constraint

A. K. Md. Ehsanes Saleh B. M. Golam Kibria Florence George 《统计学通讯:理论与方法》2018,47(12):2813-2826

Let {x_ij(1 ? j ? n_i)|i = 1, 2, …, k} be k independent samples of size n_j from respective distributions of functions F_j(x)(1 ? j ? k). A classical statistical problem is to test whether these k samples came from a common distribution function, F(x) whose form may or may not be known. In this paper, we consider the complementary problem of estimating the distribution functions suspected to be homogeneous in order to improve the basic estimator known as “empirical distribution function” (edf), in an asymptotic setup. Accordingly, we consider four additional estimators, namely, the restricted estimator (RE), the preliminary test estimator (PTE), the shrinkage estimator (SE), and the positive rule shrinkage estimator (PRSE) and study their characteristic properties based on the mean squared error (MSE) and relative risk efficiency (RRE) with tables and graphs. We observed that for k ? 4, the positive rule SE performs uniformly better than both shrinkage and the unrestricted estimator, while PTEs works reasonably well for k < 4. 相似文献

15.

Cumulative or adjacent logits: Which choice for an ordinal logistic latent variable model?

Petan Dossar 《统计学通讯:理论与方法》2018,47(11):2563-2575

With ordinal response items, a graded response model (GRM) is of cumulative logits type, while the polytomous Rasch model (PRM) is based on adjacent logits. In this work, we compare the two approaches. We show that the PRM is superior to the GRM, with interesting properties that we prove. Note S_ν the sum of item responses of individual ν and Θ_ν its latent parameter; we show i) S_ν is a sufficient statistic for θ_ν and ii) a property of “stochastic ordering” of the conditional distributions G_θ/S. The second property, less known, is, to our knowledge, nowhere satisfactorily demonstrated. 相似文献

16.

Geometric ergodicity of nonlinear autoregressive models with changing conditional variances

Min Chen Gemai Chen 《Revue canadienne de statistique》2000,28(3):605-614

The authors give easy‐to‐check sufficient conditions for the geometric ergodicity and the finiteness of the moments of a random process x_t = ?(x_t‐1,…, x_t‐p) + ?_tσ(x_t‐1,…, x_t‐q) in which ?: R^p → R, σ R^q → R and (?_t) is a sequence of independent and identically distributed random variables. They deduce strong mixing properties for this class of nonlinear autoregressive models with changing conditional variances which includes, among others, the ARCH(p), the AR(p)‐ARCH(p), and the double‐threshold autoregressive models. 相似文献

17.

Marginally restricted sequential D‐optimal designs

Jesús López‐Fidalgo Raul Martín‐Martín Douglas P. Wiens 《Revue canadienne de statistique》2008,36(3):397-410

In many experiments, not all explanatory variables can be controlled. When the units arise sequentially, different approaches may be used. The authors study a natural sequential procedure for “marginally restricted” D‐optimal designs. They assume that one set of explanatory variables (x₁) is observed sequentially, and that the experimenter responds by choosing an appropriate value of the explanatory variable x₂. In order to solve the sequential problem a priori, the authors consider the problem of constructing optimal designs with a prior marginal distribution for x₁. This eliminates the influence of units already observed on the next unit to be designed. They give explicit designs for various cases in which the mean response follows a linear regression model; they also consider a case study with a nonlinear logistic response. They find that the optimal strategy often consists of randomizing the assignment of the values of x₂. 相似文献

18.

ASYMPTOTIC AND SMALL SAMPLE STATISTICAL PROPERTIES OF RANDOM FRAILTY VARIANCE ESTIMATES FOR SHARED GAMMA FRAILTY MODELS

《统计学通讯:模拟与计算》2013,42(3):581-595

This paper concerns maximum likelihood estimation for the semiparametric shared gamma frailty model; that is the Cox proportional hazards model with the hazard function multiplied by a gamma random variable with mean 1 and variance θ. A hybrid ML-EM algorithm is applied to 26 400 simulated samples of 400 to 8000 observations with Weibull hazards. The hybrid algorithm is much faster than the standard EM algorithm, faster than standard direct maximum likelihood (ML, Newton Raphson) for large samples, and gives almost identical results to the penalised likelihood method in S-PLUS 2000. When the true value θ₀ of θ is zero, the estimates of θ are asymptotically distributed as a 50–50 mixture between a point mass at zero and a normal random variable on the positive axis. When θ₀ > 0, the asymptotic distribution is normal. However, for small samples, simulations suggest that the estimates of θ are approximately distributed as an x ? (100 ? x)% mixture, 0 ≤ x ≤ 50, between a point mass at zero and a normal random variable on the positive axis even for θ₀ > 0. In light of this, p-values and confidence intervals need to be adjusted accordingly. We indicate an approximate method for carrying out the adjustment. 相似文献

19.

Partly linear models on Riemannian manifolds

Wenceslao Gonzalez-Manteiga Guillermo Henry 《Journal of applied statistics》2012,39(8):1797-1809

In partly linear models, the dependence of the response y on (x ^T, t) is modeled through the relationship y=x ^T β+g(t)+?, where ? is independent of (x ^T, t). We are interested in developing an estimation procedure that allows us to combine the flexibility of the partly linear models, studied by several authors, but including some variables that belong to a non-Euclidean space. The motivating application of this paper deals with the explanation of the atmospheric SO₂ pollution incidents using these models when some of the predictive variables belong in a cylinder. In this paper, the estimators of β and g are constructed when the explanatory variables t take values on a Riemannian manifold and the asymptotic properties of the proposed estimators are obtained under suitable conditions. We illustrate the use of this estimation approach using an environmental data set and we explore the performance of the estimators through a simulation study. 相似文献

20.

Nonparametric tests for conditional independence using conditional distributions

Taoufik Bouezmarni 《Journal of nonparametric statistics》2014,26(4):697-719

The concept of causality is naturally defined in terms of conditional distribution, however almost all the empirical works focus on causality in mean. This paper aims to propose a nonparametric statistic to test the conditional independence and Granger non-causality between two variables conditionally on another one. The test statistic is based on the comparison of conditional distribution functions using an L₂ metric. We use Nadaraya–Watson method to estimate the conditional distribution functions. We establish the asymptotic size and power properties of the test statistic and we motivate the validity of the local bootstrap. We ran a simulation experiment to investigate the finite sample properties of the test and we illustrate its practical relevance by examining the Granger non-causality between S&P 500 Index returns and VIX volatility index. Contrary to the conventional t-test which is based on a linear mean-regression, we find that VIX index predicts excess returns both at short and long horizons. 相似文献