首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
While it is often argued that a p-value is a probability; see Wasserstein and Lazar, we argue that a p-value is not defined as a probability. A p-value is a bijection of the sufficient statistic for a given test which maps to the same scale as the Type I error probability. As such, the use of p-values in a test should be no more a source of controversy than the use of a sufficient statistic. It is demonstrated that there is, in fact, no ambiguity about what a p-value is, contrary to what has been claimed in recent public debates in the applied statistics community. We give a simple example to illustrate that rejecting the use of p-values in testing for a normal mean parameter is conceptually no different from rejecting the use of a sample mean. The p-value is innocent; the problem arises from its misuse and misinterpretation. The way that p-values have been informally defined and interpreted appears to have led to tremendous confusion and controversy regarding their place in statistical analysis.  相似文献   

2.
We consider the properties of the trimmed mean, as regards minimax-variance L-estimation of a location parameter in a Kolmogorov neighbourhood K() of the normal distribution: We first review some results on the search for an L-minimax estimator in this neighbourhood, i.e. a linear combination of order statistics whose maximum variance in Kt() is a minimum in the class of L-estimators. The natural candidate – the L-estimate which is efficient for that member of Kt,() with minimum Fisher information – is known not to be a saddlepoint solution to the minimax problem. We show here that it is not a solution at all. We do this by showing that a smaller maximum variance is attained by an appropriately trimmed mean. We argue that this trimmed mean, as well as being computationally simple – much simpler than the efficient L-estimate referred to above, and simpler than the minimax M- and R-estimators – is at least “nearly” minimax.  相似文献   

3.
It is well known that Yates' algorithm can be used to estimate the effects in a factorial design. We develop a modification of this algorithm and call it modified Yates' algorithm and its inverse. We show that the intermediate steps in our algorithm have a direct interpretation as estimated level-specific mean values and effects. Also we show how Yates' or our modified algorithm can be used to construct the blocks in a 2 k factorial design and to generate the layout sheet of a 2 k−p fractional factorial design and the confounding pattern in such a design. In a final example we put together all these methods by generating and analysing a 26-2 design with 2 blocks.  相似文献   

4.
The conceptual predictive statistic, Cp, is a widely used criterion for model selection in linear regression. Cp serves as an estimator of a discrepancy, a measure that reflects the disparity between the generating model and a fitted candidate model. This discrepancy, based on scaled squared error loss, is asymmetric: an alternate measure is obtained by reversing the roles of the two models in the definition of the measure. We propose a variant of the Cp statistic based on estimating a symmetrized version of the discrepancy targeted by Cp. We claim that the resulting criterion provides better protection against overfitting than Cp, since the symmetric discrepancy is more sensitive towards detecting overspecification than its asymmetric counterpart. We illustrate our claim by presenting simulation results. Finally, we demonstrate the practical utility of the new criterion by discussing a modeling application based on data collected in a cardiac rehabilitation program at University of Iowa Hospitals and Clinics.  相似文献   

5.
Maximum likelihood estimation is investigated in the context of linear regression models under partial independence restrictions. These restrictions aim to assume a kind of completeness of a set of predictors Z in the sense that they are sufficient to explain the dependencies between an outcome Y and predictors X: ?(Y|Z, X) = ?(Y|Z), where ?(·|·) stands for the conditional distribution. From a practical point of view, the former model is particularly interesting in a double sampling scheme where Y and Z are measured together on a first sample and Z and X on a second separate sample. In that case, estimation procedures are close to those developed in the study of double‐regression by Engel & Walstra (1991) and Causeur & Dhorne (1998) . Properties of the estimators are derived in a small sample framework and in an asymptotic one, and the procedure is illustrated by an example from the food industry context.  相似文献   

6.
A system subject to a point process of shocks is considered. The shocks occur in accordance with a renewal process or a nonhomogeneous Poisson process. Each shock independently of the previous history leads to a system failure with probability θ and is survived with a complimentary probability θ̄. A number of problems in reliability and safety analysis can be interpreted by means of this model. The exact solution for the probability of survival W̄(t,θ) can be obtained only in the form of infinite series (renewal process of shocks). Approximate solutions and new simple bounds for the probability of survival are obtained. The introduced method is based on the notion of a stochastic hazard rate process. A supplementary characteristic in this analysis is the mean of the hazard rate process. This method makes it possible to consider a generalization important in practical applications when the probability of a system failure under the effect of a current shock depends on the time since the previous one.  相似文献   

7.
When process data follow a particular curve in quality control, profile monitoring is suitable and appropriate for assessing process stability. Previous research in profile monitoring focusing on nonlinear parametric (P) modeling, involving both fixed and random-effects, was made under the assumption of an accurate nonlinear model specification. Lately, nonparametric (NP) methods have been used in the profile monitoring context in the absence of an obvious linear P model. This study introduces a novel technique in profile monitoring for any nonlinear and auto-correlated data. Referred to as the nonlinear mixed robust profile monitoring (NMRPM) method, it proposes a semiparametric (SP) approach that combines nonlinear P and NP profile fits for scenarios in which a nonlinear P model is adequate over part of the data but inadequate of the rest. These three methods (P, NP, and NMRPM) account for the auto-correlation within profiles and treats the collection of profiles as a random sample with a common population. During Phase I analysis, a version of Hotelling’s T2 statistic is proposed for each approach to identify abnormal profiles based on the estimated random effects and obtain the corresponding control limits. The performance of the NMRPM method is then evaluated using a real data set. Results reveal that the NMRPM method is robust to model misspecification and performs adequately against a correctly specified nonlinear P model. Control charts with the NMRPM method have excellent capability of detecting changes in Phase I data with control limits that are easily computable.  相似文献   

8.
Abstract

This paper examines the high dimensional asymptotics of the naive Hotelling T2 statistic. Naive Bayes has been utilized in high dimensional pattern recognition as a method to avoid singularities in the estimated covariance matrix. The naive Hotelling T2 statistic, which is equivalent to the estimator of the naive canonical correlation, is a statistically important quantity in naive Bayes and its high dimensional behavior has been studied under several conditions. In this paper, asymptotic normality of the naive Hotelling T2 statistic under a high dimension low sample size setting is developed using the central limit theorem of a martingale difference sequence.  相似文献   

9.
Elvia Flores 《Statistics》2013,47(5):431-454
In this work, we consider a non-parametric estimator of the variance in one-dimensional diffusion models or, more generally, in Itô processes with a deterministic diffusion term and a general non-anticipative drift. The estimation is based on the quadratic variation of discrete time observations over a finite interval. In particular, a central limit theorem (CLT) is proved for the deviation in L p norm (p≥; 1) between the variance and this estimator. The method of the proof consists in writing the L p norm of the deviation, when the drift term is equal to zero, as a sum of 4-dependent random variables. The moments are then computed by means of a Gaussian approximation and a CLT for m-dependent random variables is applied. The convergence is stable in law, this allows the result for processes with general drifts to be obtained, by using Girsanov's formula.  相似文献   

10.
11.
In the present article, we discuss the regression of a point on the surface of a unit sphere in d dimensions given a point on the surface of a unit sphere in p dimensions, where p may not be equal to d. Point projection is added to the rotation and linear transformation for regression link function. The identifiability of the model is proved. Then, parameter estimation in this set up is discussed. Simulation studies and data analyses are done to illustrate the model.  相似文献   

12.
Tim Fischer  Udo Kamps 《Statistics》2013,47(1):142-158
There are several well-known mappings which transform the first r common order statistics in a sample of size n from a standard uniform distribution to a full vector of dimension r of order statistics in a sample of size r from a uniform distribution. Continuing the results reported in a previous paper by the authors, it is shown that transformations of these types do not lead to order statistics from an i.i.d. sample of random variables, in general, when being applied to order statistics from non-uniform distributions. By accepting the loss of one dimension, a structure-preserving transformation exists for power function distributions.  相似文献   

13.
Superefficiency of a projection density estimator The author constructs a projection density estimator with a data‐driven truncation index. This estimator reaches the superoptimal rates 1/n in mean integrated square error and {In ln(n/n}1/2 in uniform almost sure convergence over a given subspace which is dense in the class of all possible densities; the rate of the estimator is quasi‐optimal everywhere else. The subspace in question may be chosen a priori by the statistician.  相似文献   

14.
The hierarchically orthogonal functional decomposition of any measurable function η of a random vector X=(X1,?…?, Xp) consists in decomposing η(X) into a sum of increasing dimension functions depending only on a subvector of X. Even when X1,?…?, Xp are assumed to be dependent, this decomposition is unique if the components are hierarchically orthogonal. That is, two of the components are orthogonal whenever all the variables involved in one of the summands are a subset of the variables involved in the other. Setting Y=η(X), this decomposition leads to the definition of generalized sensitivity indices able to quantify the uncertainty of Y due to each dependent input in X [Chastaing G, Gamboa F, Prieur C. Generalized Hoeffding–Sobol decomposition for dependent variables – application to sensitivity analysis. Electron J Statist. 2012;6:2420–2448]. In this paper, a numerical method is developed to identify the component functions of the decomposition using the hierarchical orthogonality property. Furthermore, the asymptotic properties of the components estimation is studied, as well as the numerical estimation of the generalized sensitivity indices of a toy model. Lastly, the method is applied to a model arising from a real-world problem.  相似文献   

15.
In this study, a new method for the estimation of the shrinkage and biasing parameters of Liu-type estimator is proposed. Because k is kept constant and d is optimized in Liu’s method, a (k, d) pair is not guaranteed to be the optimal point in terms of the mean square error of the parameters. The optimum (k, d) pair that minimizes the mean square error, which is a function of the parameters k and d, should be estimated through a simultaneous optimization process rather than through a two-stage process. In this study, by utilizing a different objective function, the parameters k and d are optimized simultaneously with the particle swarm optimization technique.  相似文献   

16.
This paper examines the existence of time trends in the infant mortality rates in a number of countries in the twentieth century. We test for the presence of deterministic trends by adopting a linear model for the log-transformed data. Instead of assuming that the error term is a stationary I(0), or alternatively, a non-stationary I(1) process, we allow for the possibility of fractional integration and hence for a much greater degree of flexibility in the dynamic specification of the series. Indeed, once the linear trend is removed, all series appear to be I(d) with 0<d<1, implying long-range dependence. As expected, the time trend coefficients are significantly negative, although of a different magnitude from those obtained assuming integer orders of differentiation.  相似文献   

17.
A structured model is essentially a family of random vectors Xθ defined on a probability space with values in a sample space. If, for a given sample value x and for each ω in the probability space, there is at most one parameter value θ for which Xθ(ω) is equal to x, then the model is called additive at x. When a certain conditional distribution exists, a frequency interpretation specific to additive structured models holds, and is summarized in a unique structured distribution for the parameter. Many of the techniques used by Fisher in deriving and handling his fiducial probability distribution are shown to be valid when dealing with a structured distribution.  相似文献   

18.
Suppose that when a unit operatesin a certain environment, its lifetime has distribution G,and when the unit operates in another environment, its lifetimehas a different distribution, say F. Moreover, supposethe unit is operated for a certain period of time in the firstenvironment and is then transferred to the second environment.Thus we observe a censored lifetime in the first environmentand a failure time of a ``used' unit in the second environment.We propose an EM algorithm approach for obtaining a self-consistentestimator of F. Moreover, suppose using observations from both environments.The case where failure times are subject to right censoring isconsidered as well. We also establish the maximum likelihoodestimator of F. Moreover, suppose when the unit is repairable. Applicationand simulation studies are presented to illustrate the methodsderived.  相似文献   

19.
7 and 8 introduce a power max-autoregressive process, in short pARMAX, as an alternative to heavy tailed ARMA when modeling rare events. In this paper, an extension of pARMAX is considered, by including a random component which makes the model more applicable to real data. We will see conditions under which this new model, here denoted as pRARMAX, has unique stationary distribution and we analyze its extremal behavior. Based on Bortot and Tawn (1998), we derive a threshold-dependent extremal index which is a functional of the coefficient of tail dependence of 14 and 15 which in turn relates with the pRARMAX parameter. In order to fit a pRARMAX model to an observed data series, we present a methodology based on minimizing the Bayes risk in classification theory and analyze this procedure through a simulation study. We illustrate with an application to financial data.  相似文献   

20.
Consider a population the individuals in which can be classified into groups. Let y, the number of individuals in a group, be distributed according to a probability function f(y;øo) where the functional form f is known. The random variable y cannot be observed directly, and hence a random sample of groups cannot be obtained. Consider a random sample of N individuals from the population. Suppose the N individuals are distributed into S groups with x1, x2, …, xS representatives respectively. The random variable x, the number of individuals in a group in the sample, will be a fraction of its population counterpart y, and the distributions of x and y need not have the same functional form. If the two random variables x and y have the same functional form for their distributions, then the particular common distribution is called an invariant abundance distribution. The paper provides a characterization of invariant abundance distributions in the class of power-series distributions.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号