首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 142 毫秒
1.
The problem of detecting influential observations in principalcomponent analysis was discussed by several authors. Radhakrishnan and kshirsagar ( 1981 ), Critchley ( 1985 ), jolliffe ( 1986 )among others discussed this topicby using the influence functions I(X;θs)and I(X;Vs)of eigenvalues and eigenvectors, which wwere derived under the assumption that the eigenvalues of interest were simple. In this paper we propose the influence functionsI(X;∑q s=1θsVsVs T)and I(x;∑q s=1VsVs t)(q<p;p:number of variables) to investigate the influence onthe subspace spanned by principal components. These influence functions are applicable not only to the case where the edigenvalues of interst are all simple but also to the case where there are some multiple eigenvalues among those of interest.  相似文献   

2.
3.
Two consistent nonexact-confidence-interval estimation methods, both derived from the consistency-equivalence theorem in Plante (1991), are suggested for estimation of problematic parametric functions with no consistent exact solution and for which standard optimal confidence procedures are inadequate or even absurd, i.e., can provide confidence statements with a 95% empty or all-inclusive confidence set. A belt C(·) from a consistent nonexact-belt family, used with two confidence coefficients (γ = infθ Pθ [ θ ? C(X)] and γ+ = supθ Pθ[θ ? C(X)], is shown to provide a consistent nonexact-belt solution for estimating μ21 in the Behrens-Fisher problem. A rule for consistent behaviour enables any confidence belt to be used consistently by providing each sample point with best upper and lower confidence levels [δ+(x) ≥ γ+, δ(x) ≤ γ], which give least-conservative consistent confidence statements ranging from practically exact through informative to noninformative. The rule also provides a consistency correction L(x) = δ+(x)-δ(X) enabling alternative confidence solutions to be compared on grounds of adequacy; this is demonstrated by comparing consistent conservative sample-point-wise solutions with inconsistent standard solutions for estimating μ21 (Creasy-Fieller-Neyman problem) and $\sqrt {\mu _1^2 + \mu _2^2 }$, a distance-estimation problem closely related to Stein's 1959 example  相似文献   

4.
In this paper we assume that in a random sample of size ndrawn from a population having the pdf f(x; θ) the smallest r1 observations and the largest r2 observations are censored (r10, r20). We consider the problem of estimating θ on the basis of the middle n-r1-r2 observations when either f(x;θ)=θ-1f(x/θ) or f(x;θ) = (aθ)1f(x-θ)/aθ) where f(·) is a known pdf, a (<0) is known and θ (>0) is unknown. The minimum mean square error (MSE) linear estimator of θ proposed in this paper is a “shrinkage” of the minimum variance linear unbiased estimator of θ. We obtain explicit expressions of these estimators and their mean square errors when (i) f(·) is the uniform pdf defined on an interval of length one and (ii) f(·) is the standard exponential pdf, i.e., f(x) = exp(–x), x0. Various special cases of censoring from the left (right) and no censoring are considered.  相似文献   

5.
The largest value of the constant c for which holds over the class of random variables X with non-zero mean and finite second moment, is c=π. Let the random variable (r.v.) X with distribution function F(·) have non-zero mean and finite second moment. In studying a certain random walk problem (Daley, 1976) we sought a bound on the characteristic function of the form for some positive constant c. Of course the inequality is non-trivial only provided that . This note establishes that the best possible constant c =π. The wider relevance of the result is we believe that it underlines the use of trigonometric inequalities in bounding the (modulus of a) c.f. (see e.g. the truncation inequalities in §12.4 of Loève (1963)). In the present case the bound thus obtained is the best possible bound, and is better than the bound (2) |1-?(θ)| ≥ |θEX|-θ2EX2\2 obtained by applying the triangular inequality to the relation which follows from a two-fold integration by parts in the defining equation (*). The treatment of the counter-example furnished below may also be of interest. To prove (1) with c=π, recall that sin u > u(1-u/π) (all real u), so Since |E sinθX|-|E sin(-θX)|, the modulus sign required in (1) can be inserted into (4). Observe that since sin u > u for u < 0, it is possible to strengthen (4) to (denoting max(0,x) by x+) To show that c=π is the best possible constant in (1), assume without loss of generality that EX > 0, and take θ > 0. Then (1) is equivalent to (6) c < θEX2/{EX-|1-?(θ)|/θ} for all θ > 0 and all r.v.s. X with EX > 0 and EX2. Consider the r.v. where 0 < x < 1 and 0 < γ < ∞. Then EX=1, EX2=1+γx2, From (4) it follows that |1-?(θ)| > 0 for 0 < |θ| <π|EX|/EX2 but in fact this positivity holds for 0 < |θ| < 2π|EX|/EX2 because by trigonometry and the Cauchy-Schwartz inequality, |1-?(θ)| > |Re(1-?(θ))| = |E(1-cosθX)| = 2|E sin2θX/2| (10) >2(E sinθX/2)2 (11) >(|θEX|-θ2EX2/2π)2/2 > 0, the inequality at (11) holding provided that |θEX|-θ2EX2/2π > 0, i.e., that 0 < |θ| < 2π|EX|/EX2. The random variable X at (7) with x= 1 shows that the range of positivity of |1-?(θ)| cannot in general be extended. If X is a non-negative r.v. with finite positive mean, then the identity shows that (1-?(θ))/iθEX is the c.f. of a non-negative random variable, and hence (13) |1-?(θ)| < |θEX| (all θ). This argument fans if pr{X < 0}pr{X> 0} > 0, but as a sharper alternative to (14) |1-?(θ)| < |θE|X||, we note (cf. (2) and (3)) first that (15) |1-?(θ)| < |θEX| +θ2EX2/2. For a bound that is more precise for |θ| close to 0, |1-?(θ)|2= (Re(1-?(θ)))2+ (Im?(θ))2 <(θ2EX2/2)2+(|θEX| +θ2EX2-/π)2, so (16) |1-?(θ)| <(|θEX| +θ2EX2-/π) + |θ|3(EX2)2/8|EX|.  相似文献   

6.
Consider the randomly weighted sums Sm(θ) = ∑mi = 1θiXi, 1 ? m ? n, and their maxima Mn(θ) = max?1 ? m ? nSm(θ), where Xi, 1 ? i ? n, are real-valued and dependent according to a wide type of dependence structure, and θi, 1 ? i ? n, are non negative and arbitrarily dependent, but independent of Xi, 1 ? i ? n. Under some mild conditions on the right tails of the weights θi, 1 ? i ? n, we establish some asymptotic equivalence formulas for the tail probabilities of Sn(θ) and Mn(θ) in the case where Xi, 1 ? i ? n, are dominatedly varying, long-tailed and subexponential distributions, respectively.  相似文献   

7.
In this paper, we obtain some results for the asymptotic behavior of the tail probability of a random sum Sτ = ∑τk = 1Xk, where the summands Xk, k = 1, 2, …, are conditionally dependent random variables with a common subexponential distribution F, and the random number τ is a non negative integer-valued random variable, independent of {Xk: k ? 1}.  相似文献   

8.
Let X1, X2,…,Xn be independent, indentically distributed random variables with density f(x,θ) with respect to a σ-finite measure μ. Let R be a measurable set in the sample space X. The value of X is observable if X ? (X?R) and not otherwise. The number J of observable X’s is binomial, N, Q, Q = 1?P(X ? R). On the basis of J observations, it is desired to estimate N and θ. Estimators considered are conditional and unconditional maximum likelihood and modified maximum likelihood using a prior weight function to modify the likelihood before maximizing. Asymptotic expansions are developed for the [Ncirc]’s of the form [Ncirc] = N + α√N + β + op(1), where α and β are random variables. All estimators have the same α, which has mean 0, variance σ2 (a function of θ) and is asymptotically normal. Hence all are asymptotically equivalent by the usual limit distributional theory. The β’s differ and Eβ can be considered an “asymptotic bias”. Formulas are developed to compare the asymptotic biases of the various estimators. For a scale parameter family of absolutely continuous distributions with X = (0,∞) and R = (T,∞), special formuli are developed and a best estimator is found.  相似文献   

9.
Let X1, …,Xn be a random sample from a normal distribution with mean θ and variance σ2 The problem is to estimate θ with loss function L(θ,e) = v(e-θ) where v(x) = b(exp(ax)-ax-l) and where a, b are constants with b>0, a¦0. Zellner (1986), showed that [Xbar] ? σ2a/2n dominates [Xbar] and hence [Xbar] is inadmissible. The question of what values of c and d render c[Xbar]+ d admissible is studied here.  相似文献   

10.
The first stochastic model is based upon two urns A and B, where A contains a fixed number of white and black balls and B is empty. The player selects an integer β ≥, 2 and draws the balls one by one (with replacement) from urn A and balls of the same colour are put in urn B. The process is continued as long as the number of white balls in B exceeds (β-1) times the number of black balls in B. The player stops after drawing β(x-1) balls and is declared to be a winner if urn B has (x-1) black balls. It is shown that x has the Geeta distribution.

Assuming that the mean μ is a function of two parameters θ and β it has been shown that for small changes inthe value of θ there exists a difference-differential equation which leads to the Geeta distribution.  相似文献   

11.
Let (X, Y) be a bivariate random vector with joint distribution function FX, Y(x, y) = C(F(x), G(y)), where C is a copula and F and G are marginal distributions of X and Y, respectively. Suppose that (Xi, Yi), i = 1, 2, …, n is a random sample from (X, Y) but we are able to observe only the data consisting of those pairs (Xi, Yi) for which Xi ? Yi. We denote such pairs as (X*i, Yi*), i = 1, 2, …, ν, where ν is a random variable. The main problem of interest is to express the distribution function FX, Y(x, y) and marginal distributions F and G with the distribution function of observed random variables X* and Y*. It is shown that if X and Y are exchangeable with marginal distribution function F, then F can be uniquely determined by the distributions of X* and Y*. It is also shown that if X and Y are independent and absolutely continuous, then F and G can be expressed through the distribution functions of X* and Y* and the stress–strength reliability P{X ? Y}. This allows also to estimate P{X ? Y} with the truncated observations (X*i, Yi*). The copula of bivariate random vector (X*, Y*) is also derived.  相似文献   

12.
Let X =(x)ij=(111, …, X,)T, i = l, …n, be an n X random matrix having multivariate symmetrical distributions with parameters μ, Σ. The p-variate normal with mean μ and covariance matrix is a member of this family. Let be the squared multiple correlation coefficient between the first and the succeeding p1 components, and let p2 = + be the squared multiple correlation coefficient between the first and the remaining p1 + p2 =p – 1 components of the p-variate normal vector. We shall consider here three testing problems for multivariate symmetrical distributions. They are (A) to test p2 =0 against; (B) to test against =0, 0; (C) to test against p2 =0, We have shown here that for problem (A) the uniformly most powerful invariant (UMPI) and locally minimax test for the multivariate normal is UMPI and is locally minimax as p2 0 for multivariate symmetrical distributions. For problem (B) the UMPI and locally minimax test is UMPI and locally minimax as for multivariate symmetrical distributions. For problem (C) the locally best invariant (LBI) and locally minimax test for the multivariate normal is also LBI and is locally minimax as for multivariate symmetrical distributions.  相似文献   

13.
Consider n independent random variables Zi,…, Zn on R with common distribution function F, whose upper tail belongs to a parametric family F(t) = Fθ(t),t ≥ x0, where θ ∈ ? ? R d. A necessary and sufficient condition for the family Fθ, θ ∈ ?, is established such that the k-th largest order statistic Zn?k+1:n alone constitutes the central sequence yielding local asymptotic normality ( LAN ) of the loglikelihood ratio of the vector (Zn?i+1:n)1 i=kof the k largest order statistics. This is achieved for k = k(n)→n→∞∞ with k/n→n→∞ 0.

In the case of vectors of central order statistics ( Zr:n, Zr+1:n,…, Zs:n ), with r/n and s/n both converging to q ∈ ( 0,1 ), it turns out that under fairly general conditions any order statistic Zm:n with r ≤ m ≤s builds the central sequence in a pertaining LAN expansion.These results lead to asymptotically optimal tests and estimators of the underlying parameter, which depend on single order statistics only  相似文献   

14.
Let X be a normally distributed p-dimensional column vector with mean μ and positive definite covariance matrix σ. and let X α, α = 1,…, N, be a random sample of size N from this distribution. Partition X as ( X 1, X (2)', X '(3))', where X1 is one-dimension, X(2) is p2- dimensional, and so 1 + p1 + p2 = p. Let ρ1 and ρ be the multiple correlation coefficients of X1 with X(2) and with ( X '(2), X '(3))', respectively. Write ρ2/2 = ρ2 - ρ2/1. We shall cosider the following two problems  相似文献   

15.
Consider an ergodic Markov chain X(t) in continuous time with an infinitesimal matrix Q = (qij) defined on a finite state space {0, 1,…, N}. In this note, we prove that if X(t) is skip-free positive (negative, respectively), i.e., qij, = 0 for j > i+ 1 (i > j+ 1), then the transition probability pij(t) = Pr[X(t)=j | X(0) =i] can be represented as a linear combination of p0N(t) (p(m)(N0)(t)), 0 ≤ m ≤N, where f(m)(t) denotes the mth derivative of a function f(t) with f(0)(t) =f(t). If X(t) is a birth-death process, then pij(t) is represented as a linear combination of p0N(m)(t), 0 ≤mN - |i-j|.  相似文献   

16.
Let (θ1,x1),…,(θn,xn) be independent and identically distributed random vectors with E(xθ) = θ and Var(x|θ) = a + bθ + cθ2. Let ti be the linear Bayes estimator of θi and θ~i be the linear empirical Bayes estimator of θi as proposed in Robbins (1983). When Ex and Var x are unknown to the statistician. The regret of using θ~i instead of ti because of ignorance of the mean and the variance is ri = E(θi ? θi)2 ?E(tii)2. Under appropriate conditions cumulative regret Rn = r1+…rn is shown to have a finite limit even when n tends to infinity. The limit can be explicitly computed in terms of a,b,c and the first four moments of x.  相似文献   

17.
Consider K(>2) independent populations π1,..,π k such that observations obtained from π k are independent and normally distributed with unknown mean µ i and unknown variance θ i i = 1,…,k. In this paper, we provide lower percentage points of Hartley's extremal quotient statistic for testing an interval hypothesisH 0 θ [k] θ [k] > δ vs. H a : θ [k] θ [1] ≤ δ , where δ ≥ 1 is a predetermined constant and θ [k](θ [1]) is the max (min) of the θi,…,θ k . The least favorable configuration (LFC) for the test under H 0 is determined in order to obtain the lower percentage points. These percentage points can also be used to construct an upper confidence bound for θ[k][1].  相似文献   

18.
The asymptotic expansions for the coverage probability of a confidence set centred at the James–Stein estimator presented in our previous publications show that this probability depends on the non-centrality parameter τ2 (the sum of the squares of the means of normal distributions). In this paper we establish how these expansions can be used for a construction of confidence region with constant confidence level, which is asymptotically (the same formula for both case τ→0 and τ→∞) equal to some fixed value 1?α. We establish the shrinkage rate for the confidence region according to the growth of the dimension p and also the value of τ for which we observe quick decreasing of the coverage probability to the nominal level 1?α. When p→∞ this value of τ increases as O(p1/4). The accuracy of the results obtained is shown by the Monte-Carlo statistical simulations.  相似文献   

19.
This paper offers a predictive approach for the selection of a fixed number (= t) of treatments from k treatments with the goal of controlling for predictive losses. For the ith treatment, independent observations X ij (j = 1,2,…,n) can be observed where X ij ’s are normally distributed N(θ i ; σ 2). The ranked values of θ i ’s and X i ’s are θ (1) ≤ … ≤ θ (k) and X [1] ≤ … ≤ X [k] and the selected subset S = {[k], [k? 1], … , [k ? t+1]} will be considered. This paper distinguishes between two types of loss functions. A type I loss function associated with a selected subset S is the loss in utility from the selector’s view point and is a function of θ i with i ? S. A type II loss function associated with S measures the unfairness in the selection from candidates’ viewpoint and is a function of θ i with i ? S. This paper shows that under mild assumptions on the loss functions S is optimal and provides the necessary formulae for choosing n so that the two types of loss can be controlled individually or simultaneously with a high probability. Predictive bounds for the losses are provided, Numerical examples support the usefulness of the predictive approach over the design of experiment approach.  相似文献   

20.
Consider the problem of inference about a parameter θ in the presence of a nuisance parameter v. In a Bayesian framework, a number of posterior distributions may be of interest, including the joint posterior of (θ, ν), the marginal posterior of θ, and the posterior of θ conditional on different values of ν. The interpretation of these various posteriors is greatly simplified if a transformation (θ, h(θ, ν)) can be found so that θ and h(θ, v) are approximately independent. In this article, we consider a graphical method for finding this independence transformation, motivated by techniques from exploratory data analysis. Some simple examples of the use of this method are given and some of the implications of this approximate independence in a Bayesian analysis are discussed.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号