首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Let π1 and π2 be two exponential populations with location parameters α and β respectively. It is shown that the probability of misclassification for the optimal rule, RE, depends on α and β only through their ratio α/β=r, and that the smaller the ratio r (i.e. the smaller α is compared to β), the greater the superiority of the optimal rule RE over the commonly used rule RN. When α and β are unknown, the sample-based version RE(s) of RE exhibits the same pattern of superiority over the sample-based RN(s) of RN.  相似文献   

2.
The effects of applying the normal classificatory rule to a nonnormal population are studied here. These are assessed through the distribution of the misclassification errors in the case of the Edgeworth type distribution. Both theoretical and empirical results are presented. An examination of the latter shows that the effects of this type of nonnormality are marginal. The probability of misclassification of an observation from ∏1, using the appropriate LR rule, is always larger than one using the normal approximation (μ1<μ2). Converse condition holds for the misclassification of an observation from ∏2. Overall error rates are not affected by the skewness factor to any great extent.  相似文献   

3.
Let X1…, Xm and Y1…, Yn be two independent sequences of i.i.d. random variables with distribution functions Fx(.|θ) and Fy(. | φ) respectively. Let g(θ, φ) be a real-valued function of the unknown parameters θ and φ. The purpose of this paper is to suggest a sequential procedure which gives a fixed-width confidence interval for g(θ, φ) so that the coverage probability is approximately α (preas-signed). Certain asymptotic optimality properties of the sequential procedure are established. A Monte Carlo study is presented.  相似文献   

4.
Summary The problem of predicting the number of change points in a piecewise linear model is studied from a Bayesian viewpoint. For a given a priori joint probability functionf R,C=fRf C/R, whereR is the number of change points andC=C′(R)=(C1,…,CR) is the change-point epoch vector, the marginal posterior probability functionf R.C/Y is obtained, and then used to find predictors forR andC(R).  相似文献   

5.
Consider k( ? 2) normal populations with unknown means μ1, …, μk, and a common known variance σ2. Let μ[1] ? ??? ? μ[k] denote the ordered μi.The populations associated with the t(1 ? t ? k ? 1) largest means are called the t best populations. Hsu and Panchapakesan (2004) proposed and investigated a procedure RHPfor selecting a non empty subset of the k populations whose size is at most m(1 ? m ? k ? t) so that at least one of the t best populations is included in the selected subset with a minimum guaranteed probability P* whenever μ[k ? t + 1] ? μ[k ? t] ? δ*, where P*?and?δ* are specified in advance of the experiment. This probability requirement is known as the indifference-zone probability requirement. In the present article, we investigate the same procedure RHP for the same goal as before but when k ? t < m ? k ? 1 so that at least one of the t best populations is included in the selected subset with a minimum guaranteed probability P* whatever be the configuration of the unknown μi. The probability requirement in this latter case is termed the subset selection probability requirement. Santner (1976) proposed and investigated a different procedure (RS) based on samples of size n from each of the populations, considering both cases, 1 ? m ? k ? t and k ? t < m ? k. The special case of t = 1 was earlier studied by Gupta and Santner (1973) and Hsu and Panchapakesan (2002) for their respective procedures.  相似文献   

6.
Suppose that we have a nonparametric regression model Y = m(X) + ε with XRp, where X is a random design variable and is observed completely, and Y is the response variable and some Y-values are missing at random. Based on the “complete” data sets for Y after nonaprametric regression imputation and inverse probability weighted imputation, two estimators of the regression function m(x0) for fixed x0Rp are proposed. Asymptotic normality of two estimators is established, which is used to construct normal approximation-based confidence intervals for m(x0). We also construct an empirical likelihood (EL) statistic for m(x0) with limiting distribution of χ21, which is used to construct an EL confidence interval for m(x0).  相似文献   

7.
A. Berlinet 《Statistics》2013,47(5):479-495
This paper deals with a special adaptive estimation problem, namely how can one select for each set of i.i.d. data X 1, …, X n the better of two given estimates of the data-generating probability density. Such a problem was studied by Devroye and Lugosi [Combinatorial Methods in Density Estimation, Springer, Berlin, 2001] who proposed a feasible suboptimal selection (called the Scheffé selection) as an alternative to the optimal but nonfeasible selection which minimizes the L1-error. In many typical situations, the L1-error of the Scheffé selection was shown to tend to zero for n→∞ as fast as the L1-error of the optimal estimate. This asymptotic result was based on an inequality between the total variation errors of the Scheffé and optimal selections. The present paper extends this inequality to the class of φ-divergence errors containing the L1-error as a special case. The first extension compares the φ-divergence errors of the mentioned Scheffé and optimal selections of Devroye and Lugosi. The second extension deals with a class of generalized Scheffé selections adapted to the φ-divergence error criteria and reducing to the classical Scheffé selection for the L1-criterion. It compares the φ-divergence errors of these feasible selections and the optimal nonfeasible selections minimizing the φ-divergence errors. Both extensions are motivated and illustrated by examples.  相似文献   

8.
In this paper, the three-decision procedures to classify p treatments as better than or worse than one control, proposed for normal/symmetric probability models [Bohrer, Multiple three-decision rules for parametric signs. J. Amer. Statist. Assoc. 74 (1979), pp. 432–437; Bohrer et al., Multiple three-decision rules for factorial simple effects: Bonferroni wins again!, J. Amer. Statist. Assoc. 76 (1981), pp. 119–124; Liu, A multiple three-decision procedure for comparing several treatments with a control, Austral. J. Statist. 39 (1997), pp. 79–92 and Singh and Mishra, Classifying logistic populations using sample medians, J. Statist. Plann. Inference 137 (2007), pp. 1647–1657]; in the literature, have been extended to asymmetric two-parameter exponential probability models to classify p(p≥1) treatments as better than or worse than the best of q(q≥1) control treatments in terms of location parameters. Critical constants required for the implementation of the proposed procedure are tabulated for some pre-specified values of probability of no misclassification. Power function of the proposed procedure is also defined and a common sample size necessary to guarantee various pre-specified power levels are tabulated. Optimal allocation scheme is also discussed. Finally, the implementation of the proposed methodology is demonstrated through numerical example.  相似文献   

9.
When classification rules are constructed using sample estimatest it is known that the probability of misclassification is not minimized. This article introduces a biased minimum X2 rule to classify items from a multivariate normal population. Using the principle of variance reduction, the probability of misclassification is reduced when the biased procedure is employed. Results of sampling experiments over a broad range of conditions are provided to demonstrate this improvement.  相似文献   

10.
R-squared (R2) and adjusted R-squared (R2Adj) are sometimes viewed as statistics detached from any target parameter, and sometimes as estimators for the population multiple correlation. The latter interpretation is meaningful only if the explanatory variables are random. This article proposes an alternative perspective for the case where the x’s are fixed. A new parameter is defined, in a similar fashion to the construction of R2, but relying on the true parameters rather than their estimates. (The parameter definition includes also the fixed x values.) This parameter is referred to as the “parametric” coefficient of determination, and denoted by ρ2*. The proposed ρ2* remains stable when irrelevant variables are removed (or added), unlike the unadjusted R2, which always goes up when variables, either relevant or not, are added to the model (and goes down when they are removed). The value of the traditional R2Adj may go up or down with added (or removed) variables, either relevant or not. It is shown that the unadjusted R2 overestimates ρ2*, while the traditional R2Adj underestimates it. It is also shown that for simple linear regression the magnitude of the bias of R2Adj can be as high as the bias of the unadjusted R2 (while their signs are opposite). Asymptotic convergence in probability of R2Adj to ρ2* is demonstrated. The effects of model parameters on the bias of R2 and R2Adj are characterized analytically and numerically. An alternative bi-adjusted estimator is presented and evaluated.  相似文献   

11.
A sequence of independent lifetimes X 1,…, X m , X m+1,…, X n were observed from inverse Weibull distribution with mean stress θ1 and reliability R 1(t 0) at time t 0 but later it was found that there was a change in the system at some point of time m and it is reflected in the sequence after X m by change in mean stress θ1 and in reliability R 2(t 0) at time t 0. The Bayes estimators of m, R 1(t 0) and R 2(t 0) are derived when a poor and a more detailed prior information is introduced into the inferential procedure. The effects of correct and wrong prior information on the Bayes estimators are studied.  相似文献   

12.
In this paper we consider a sequence of independent continuous symmetric random variables X1, X2, …, with heavy-tailed distributions. Then we focus on limiting behavior of randomly weighted averages Sn = R(n)1X1 + ??? + R(n)nXn, where the random weights R(n)1, …, Rn(n) which are independent of X1, X2, …, Xn, are the cuts of (0, 1) by the n ? 1 order statistics from a uniform distribution. Indeed we prove that cnSn converges in distribution to a symmetric α-stable random variable with cn = n1 ? 1/α1/α(α + 1).  相似文献   

13.
For a confidence interval (L(X),U(X)) of a parameter θ in one-parameter discrete distributions, the coverage probability is a variable function of θ. The confidence coefficient is the infimum of the coverage probabilities, inf  θ P θ (θ∈(L(X),U(X))). Since we do not know which point in the parameter space the infimum coverage probability occurs at, the exact confidence coefficients are unknown. Beside confidence coefficients, evaluation of a confidence intervals can be based on the average coverage probability. Usually, the exact average probability is also unknown and it was approximated by taking the mean of the coverage probabilities at some randomly chosen points in the parameter space. In this article, methodologies for computing the exact average coverage probabilities as well as the exact confidence coefficients of confidence intervals for one-parameter discrete distributions are proposed. With these methodologies, both exact values can be derived.  相似文献   

14.
We consider the Gibbs sampler as a tool for generating an absolutely continuous probability measure ≥ on Rd. When an appropriate irreducibility condition is satisfied, the Gibbs Markov chain (Xn;n ≥ 0) converges in total variation to its target distribution ≥. Sufficient conditions for geometric convergence have been given by various authors. Here we illustrate, by means of simple examples, how slow the convergence can be. In particular, we show that given a sequence of positive numbers decreasing to zero, say (bn;n ≥ 1), one can construct an absolutely continuous probability measure ≥ on Rd which is such that the total variation distance between ≥ and the distribution of Xn, converges to 0 at a rate slower than that of the sequence (bn;n ≥ 1). This can even be done in such a way that ≥ is the uniform distribution over a bounded connected open subset of Rd. Our results extend to hit-and-run samplers with direction distributions having supports with symmetric gaps.  相似文献   

15.
A Bayesian analysis is provided for the Wilcoxon signed-rank statistic (T+). The Bayesian analysis is based on a sign-bias parameter φ on the (0, 1) interval. For the case of a uniform prior probability distribution for φ and for small sample sizes (i.e., 6 ? n ? 25), values for the statistic T+ are computed that enable probabilistic statements about φ. For larger sample sizes, approximations are provided for the asymptotic likelihood function P(T+|φ) as well as for the posterior distribution P(φ|T+). Power analyses are examined both for properly specified Gaussian sampling and for misspecified non Gaussian models. The new Bayesian metric has high power efficiency in the range of 0.9–1 relative to a standard t test when there is Gaussian sampling. But if the sampling is from an unknown and misspecified distribution, then the new statistic still has high power; in some cases, the power can be higher than the t test (especially for probability mixtures and heavy-tailed distributions). The new Bayesian analysis is thus a useful and robust method for applications where the usual parametric assumptions are questionable. These properties further enable a way to do a generic Bayesian analysis for many non Gaussian distributions that currently lack a formal Bayesian model.  相似文献   

16.
17.
Consider k( ? 2) normal populations whose means are all known or unknown and whose variances are unknown. Let σ2[1] ? ??? ? σ[k]2 denote the ordered variances. Our goal is to select a non empty subset of the k populations whose size is at most m(1 ? m ? k ? 1) so that the population associated with the smallest variance (called the best population) is included in the selected subset with a guaranteed minimum probability P* whenever σ2[2][1]2 ? δ* > 1, where P* and δ* are specified in advance of the experiment. Based on samples of size n from each of the populations, we propose and investigate a procedure called RBCP. We also derive some asymptotic results for our procedure. Some comparisons with an earlier available procedure are presented in terms of the average subset sizes for selected slippage configurations based on simulations. The results are illustrated by an example.  相似文献   

18.
The restricted minimum φ-divergence estimator, [Pardo, J.A., Pardo, L. and Zografos, K., 2002, Minimum φ-divergence estimators with constraints in multinomial populations. Journal of Statistical Planning and Inference, 104, 221–237], is employed to obtain estimates of the cell frequencies of an I×I contingency table under hypotheses of symmetry, marginal homogeneity or quasi-symmetry. The associated φ-divergence statistics are distributed asymptotically as chi-squared distributions under the null hypothesis. The new estimators and test statistics contain, as particular cases, the classical estimators and test statistics previously presented in the literature for the cited problems. A simulation study is presented, for the symmetry problem, to choose the best function φ2 for estimation and the best function φ1 for testing.  相似文献   

19.
In this article a natural extension of the beta-binomial distribution is developed. Forced binary choice situations are modeled such that each individual has a probability p of knowing the correct answer. (This probability is distributed f(p) across the population.) Hence each individual will guess at the correct answer with probability 1 – p. The observable random variable R, the total number of correct answers (both by knowing and guessing) out of k trials has a rather complicated distribution. However, when f(p) is distributed beta with parameters m and n, the distribution P(r; k, m, n) can be expressed in terms of the well-known Gaussian hypergeometric function. This distribution has implications for true-false tests, taste tests, and virtually every other forced binary choice situation.  相似文献   

20.
The present article discusses the statistical distribution for the estimator of Rosenthal's ‘file-drawer’ number NR, which is an estimator of unpublished studies in meta-analysis. We calculate the probability distribution function of NR. This is achieved based on the central limit theorem and the proposition that certain components of the estimator NR follow a half-normal distribution, derived from the standard normal distribution. Our proposed distributions are supported by simulations and investigation of convergence.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号