首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In this paper, we will investigate the nonparametric estimation of the distribution function F of an absolutely continuous random variable. Two methods are analyzed: the first one based on the empirical distribution function, expressed in terms of i.i.d. lattice random variables and, secondly, the kernel method, which involves nonlattice random vectors dependent on the sample size n; this latter procedure produces a smooth distribution estimator that will be explicitly corrected to reduce the effect of bias or variance. For both methods, the non-Studentized and Studentized statistics are considered as well as their bootstrap counterparts and asymptotic expansions are constructed to approximate their distribution functions via the Edgeworth expansion techniques. On this basis, we will obtain confidence intervals for F(x) and state the coverage error order achieved in each case.  相似文献   

2.
In this article, we develop a series estimation method for unknown time-inhomogeneous functionals of Lévy processes involved in econometric time series models. To obtain an asymptotic distribution for the proposed estimators, we establish a general asymptotic theory for partial sums of bivariate functionals of time and nonstationary variables. These results show that the proposed estimators in different situations converge to quite different random variables. In addition, the rates of convergence depend on various factors rather than just the sample size. Finite sample simulations are provided to evaluate the finite sample performance of the proposed model and estimation method.  相似文献   

3.
The theory of acceptance sampling by variables is well known when the underlying distribution is normal. When the normality assumption is not true, using the usual normal case method can be quite misleading. In this paper we deal with the Laplace distribution for both the standard deviation known and then unknown. We establish a decision rule for accepting a lot of product containing a defective proportion p. We determine the density function of the decision rule statistic, for small and large sample sizes. We give some practical ways to choose the sample size and the acceptance constant to obtain a desired operating characteristic curve  相似文献   

4.
A sequence of possibly dependent random variables is maximally dependent if all the sample maxima in the sequence have stochastically maximal distributions in the class of all distributions with the same marginals. For a sequence of maximally dependent standard uniform random variables, we determine the distribution functions of record times and values. We show that the distribution of the record occurrence times coincides with the respective distribution for the i.i.d. sequence, and the distributions of the record values are stochastically maximal in the class of sequences with the same record times distributions, containing all the exchangeable sequences. We also derive analytic formulae for the moments of record values from the maximally dependent sequence, and compare them with those of the i.i.d. case.  相似文献   

5.
《统计学通讯:理论与方法》2012,41(13-14):2321-2341
For the case where at least two sets have an odd number of variables we do not have the exact distribution of the generalized Wilks Lambda statistic in a manageable form, adequate for manipulation. In this article, we develop a family of very accurate near-exact distributions for this statistic for the case where two or three sets have an odd number of variables. We first express the exact characteristic function of the logarithm of the statistic in the form of the characteristic function of an infinite mixture of Generalized Integer Gamma distributions. Then, based on truncations of this exact characteristic function, we obtain a family of near-exact distributions, which, by construction, match the first two exact moments. These near-exact distributions display an asymptotic behaviour for increasing number of variables involved. The corresponding cumulative distribution functions are obtained in a concise and manageable form, relatively easy to implement computationally, allowing for the computation of virtually exact quantiles. We undertake a comparative study for small sample sizes, using two proximity measures based on the Berry-Esseen bounds, to assess the performance of the near-exact distributions for different numbers of sets of variables and different numbers of variables in each set.  相似文献   

6.
We study the correlation structure for a mixture of ordinal and continuous repeated measures using a Bayesian approach. We assume a multivariate probit model for the ordinal variables and a normal linear regression for the continuous variables, where latent normal variables underlying the ordinal data are correlated with continuous variables in the model. Due to the probit model assumption, we are required to sample a covariance matrix with some of the diagonal elements equal to one. The key computational idea is to use parameter-extended data augmentation, which involves applying the Metropolis-Hastings algorithm to get a sample from the posterior distribution of the covariance matrix incorporating the relevant restrictions. The methodology is illustrated through a simulated example and through an application to data from the UCLA Brain Injury Research Center.  相似文献   

7.
The speed of convergence of the distribution of the normalized maximum, of a sample of independent and identically distributed random variables, to its asymptotic distribution is considered in this article. Assuming that the cumulative distribution function of the random variables is known, the error committed replacing the actual distribution of the normalized maximum by its asympotic distribution is studied. Instead of using the arithmetical scale of probabilities, we measure the difference between the actual and asympotic distribution in terms of the double-log scale used for building the probability plotting paper for the the latter. We demonstrate that the difference between the double-log values corresponding to two probabilities in the upper tail is almost exactly equal to the logarithm of the distribution may not be uniform in this double-log scale and that the difference between the actual and asymptotic distributions, on the probebility plotting paper, may be a logarithmic, power, or even exponential function in the upper tail when the latter distribution is of Fisher-Tippett type I, but that difference is at most logarithmic in the upper tail for type II and III distributions. This fact is exploited to obtain transformed variables that converge tothe asymptotic distribution faster than the original variable on the probabilites plotting paper  相似文献   

8.
闫懋博  田茂再 《统计研究》2021,38(1):147-160
Lasso等惩罚变量选择方法选入模型的变量数受到样本量限制。文献中已有研究变量系数显著性的方法舍弃了未选入模型的变量含有的信息。本文在变量数大于样本量即p>n的高维情况下,使用随机化bootstrap方法获得变量权重,在计算适应性Lasso时构建选择事件的条件分布并剔除系数不显著的变量,以得到最终估计结果。本文的创新点在于提出的方法突破了适应性Lasso可选变量数的限制,当观测数据含有大量干扰变量时能够有效地识别出真实变量与干扰变量。与现有的惩罚变量选择方法相比,多种情境下的模拟研究展示了所提方法在上述两个问题中的优越性。实证研究中对NCI-60癌症细胞系数据进行了分析,结果较以往文献有明显改善。  相似文献   

9.
ABSTRACT

In this paper we consider the tail behavior of a two-dimensional dependent renewal risk model with two dependent classes of insurance business, in which the claim sizes are governed by a common renewal counting process, and their inter-arrival times are dependent, identically distributed. For the case that the claim size distribution belongs to the intersection of long-tailed distribution class and dominant variation class, we obtain an asymptotic formula, which holds uniformly for all time in an infinite interval. Moreover, we point out that the formula still holds uniformly for all time in an infinite interval for widely dependent random variables (r.v.s) under some conditions.  相似文献   

10.
In this paper, we consider a mixed compound Poisson process, that is, a random sum of independent and identically distributed (i.i.d.) random variables where the number of terms is a Poisson process with random intensity. We study nonparametric estimators of the jump density by specific deconvolution methods. Firstly, assuming that the random intensity has exponential distribution with unknown expectation, we propose two types of estimators based on the observation of an i.i.d. sample. Risks bounds and adaptive procedures are provided. Then, with no assumption on the distribution of the random intensity, we propose two non‐parametric estimators of the jump density based on the joint observation of the number of jumps and the random sum of jumps. Risks bounds are provided, leading to unusual rates for one of the two estimators. The methods are implemented and compared via simulations.  相似文献   

11.
12.
In this article the authors show how by adequately decomposing the null hypothesis of the multi-sample block-scalar sphericity test it is possible to obtain the likelihood ratio test statistic as well as a different look over its exact distribution. This enables the construction of well-performing near-exact approximations for the distribution of the test statistic, whose exact distribution is quite elaborate and non-manageable. The near-exact distributions obtained are manageable and perform much better than the available asymptotic distributions, even for small sample sizes, and they show a good asymptotic behavior for increasing sample sizes as well as for increasing number of variables and/or populations involved.  相似文献   

13.
Well-known characterizations of the geometric distribution via the independence of some contrast and the minimum in a sample of i.i.d. random variables are illustrated and supplemented.  相似文献   

14.
Weed, Bradley and Grovindarajulu (1974) propose one-sample probability ratio tests based on Lehmann alternatives. They also study the finite sure termination of the stopping times. Motivated by Stein's proof of (1946) of the termination of a sequential probability ratio test (SPRT) in the case of independent and identically distributed (i.i.d.) random variables and the work of Sethuraman (1970) for the two- sample rank order SPRT, we obtain a very mild condition (namely, that a certain random variable U(Z) is not identically zero) for the finite sure termination of the existence of the moment generating function (m.g.f.) for the stopping time of one- sample rank order SPRT's.  相似文献   

15.
In order to describe or generate so-called outliers in univariate statistical data, contamination models are often used. These models assume that k out of n independent random variables are shifted or multiplicated by some constant, whereas the other observations still come i.i.d. from some common target distribution. Of course, these contaminants do not necessarily stick out as the extremes in the sample. Moreover, it is the amount and magnitude of ‘contamination” which determines the number of obvious outliers. Using the concept of Davies and Gather (1993) to formalize the outlier notion we quantify the amount of contamination needed to produce a prespecified expected number of ‘genuine’ outliers. In particular, we demonstrate that for sample of moderate size from a normal target distribution a rather large shift of the contaminants is necessary to yield a certain expected number of outliers. Such an insight is of interest when designing simulation studies where outliers shoulod occur as well as in theoretical investigations on outliers.  相似文献   

16.
In this paper, we study the strong law of large numbers for the generalized sample relative entropy of non homogeneous Markov chains taking values from a finite state space. First, we introduce the definitions of generalized sample relative entropy and generalized sample relative entropy rate. Then, using a strong limit theorem for the delayed sums of the functions of two variables and a strong law of large numbers for non homogeneous Markov chains, we obtain the strong law of large numbers for the generalized sample relative entropy of non homogeneous Markov chains. As corollaries, we obtain some important results.  相似文献   

17.
In this article, we generalize the partially linear single-index models to the scenario with some endogenous covariates variables. It is well known that the estimators based on the existing methods are often inconsistent because of the endogeneity of covariates. To deal with the endogenous variables, we introduce some auxiliary instrumental variables. A three-stage estimation procedure is proposed for partially linear single-index instrumental variables models. The first stage is to obtain a linear projection of endogenous variables on a set of instrumental variables, the second stage is to estimate the link function by using local linear smoother for given constant parameters, and the last stage is to obtain the estimators of constant parameters based on the estimating equation. Asymptotic normality is established for the proposed estimators. Some simulation studies are undertaken to assess the finite sample performance of the proposed estimation procedure.  相似文献   

18.
In this paper, we introduce a new distribution generated by gamma random variables. We show that this distribution includes as a special case the distribution of the lower record value from a sequence of i.i.d. random variables from a population with the exponentiated (generalized) exponential distribution. The properties of this distribution are derived and the estimation of the model parameters is discussed. Some applications to real data sets are finally presented for illustration.  相似文献   

19.
In this article, we consider sequences of i.i.d. random variables and, under suitable conditions on the (common) distribution function, we prove large deviation principles for sequences of maxima, minima and pairs formed by maxima and minima. The i.i.d. random variables can be either unbounded or bounded; in the first case maxima and minima have to be suitably normalized.  相似文献   

20.
In real‐data analysis, deciding the best subset of variables in regression models is an important problem. Akaike's information criterion (AIC) is often used in order to select variables in many fields. When the sample size is not so large, the AIC has a non‐negligible bias that will detrimentally affect variable selection. The present paper considers a bias correction of AIC for selecting variables in the generalized linear model (GLM). The GLM can express a number of statistical models by changing the distribution and the link function, such as the normal linear regression model, the logistic regression model, and the probit model, which are currently commonly used in a number of applied fields. In the present study, we obtain a simple expression for a bias‐corrected AIC (corrected AIC, or CAIC) in GLMs. Furthermore, we provide an ‘R’ code based on our formula. A numerical study reveals that the CAIC has better performance than the AIC for variable selection.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号