We study the behavior of the bivariate empirical copula process $\mathbb{G}_n(\cdot, \cdot)$ on pavements $[0, k_n/n]^2$ of $[0, 1]^2$, where $k_n$ is a sequence of positive constants fulfilling some conditions. We provide an upper bound for the strong approximation of $\mathbb{G}_n(\cdot, \cdot)$ by a Gaussian process when $k_n/n \searrow \gamma$ as $n \to \infty$, where $0 \le \gamma \le 1$.

For the Bose-Einstein statistics, where $n$ indistinguishable balls are distributed in $m$ urns such that all the arrangements are equally likely, define the random variables

$M_k$ = number of urns containing exactly $k$ balls each;

$N_k$ = number of urns containing at least $k$ balls each.

We consider the approximation of the distributions of $M_k$ and $N_k$ by suitable normal distributions, for large but finite $m$. Estimates are found for the error in the approximation to both the probability mass function and the distribution function in each case. These results apply also to the alternative model where no urn is allowed to be empty. The results are illustrated by some numerical examples.
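The occupancy model above can be explored numerically. The following is a minimal sketch, not the paper's method: it draws uniform Bose-Einstein arrangements with the stars-and-bars construction and evaluates $M_k$ and $N_k$. The function names (`sample_bose_einstein`, `m_k`, `n_k`, `expected_m_k`) are hypothetical, and the closed form for $E[M_k]$ is the standard stars-and-bars count, not a formula taken from the abstract.

```python
import math
import random


def sample_bose_einstein(n, m, rng):
    """Draw occupancy counts for n indistinguishable balls in m urns,
    uniformly over all C(n+m-1, m-1) arrangements (stars and bars)."""
    # Pick the positions of the m-1 bars among the n+m-1 slots.
    bars = sorted(rng.sample(range(n + m - 1), m - 1))
    counts, prev = [], -1
    for b in bars + [n + m - 1]:
        counts.append(b - prev - 1)  # number of stars between two bars
        prev = b
    return counts


def m_k(counts, k):
    """M_k: number of urns containing exactly k balls."""
    return sum(1 for c in counts if c == k)


def n_k(counts, k):
    """N_k: number of urns containing at least k balls."""
    return sum(1 for c in counts if c >= k)


def expected_m_k(n, m, k):
    """Exact E[M_k] for m >= 2: m * C(n-k+m-2, m-2) / C(n+m-1, m-1)."""
    if k < 0 or k > n:
        return 0.0
    return m * math.comb(n - k + m - 2, m - 2) / math.comb(n + m - 1, m - 1)


if __name__ == "__main__":
    rng = random.Random(1)
    n, m, k, reps = 30, 20, 1, 5000
    avg = sum(m_k(sample_bose_einstein(n, m, rng), k) for _ in range(reps)) / reps
    print("simulated E[M_k]:", avg, "exact:", expected_m_k(n, m, k))
```

Comparing a histogram of simulated $M_k$ values with a normal density of matching mean and variance gives a rough feel for the approximation the abstract quantifies.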

Consider a semi-Markov process $\{X(t), t > 0\}$ with transition epochs $T_0, T_1, T_2, \ldots$. Suppose that at each of the epochs $\{T_n\}$ one of $R$ possible events $E_1, E_2, \ldots, E_R$ can happen, where the occurrences of successive events form a Markov chain. For a fixed $r$, let the times at which the event $E_r$ happens be $U_0, U_1, U_2, \ldots$. In this paper we are interested in the process $\{Y(t), t > 0\}$, where $Y(t) = X(U_k)$ if and only if $U_k \le t < U_{k+1}$. It will be shown that $\{Y(t)\}$ is a semi-Markov process, and its properties with respect to those of $\{X(t)\}$ will be examined.

In this paper, we obtain some results for the asymptotic behavior of the tail probability of a random sum $S_\tau = \sum_{k=1}^{\tau} X_k$, where the summands $X_k$, $k = 1, 2, \ldots$, are conditionally dependent random variables with a common subexponential distribution $F$, and the random number $\tau$ is a nonnegative integer-valued random variable, independent of $\{X_k : k \ge 1\}$.

The linear model $Y \sim N(Xb, \sigma^2 \Sigma)$ with a restriction $R'b = M'u + c$ is considered, where $X$, $R$, $M$, $\Sigma$ and $c$ are known. Explicit formulae are obtained for the best linear unbiased estimator of $K'b$, for the $F$-test of the hypothesis $K'b = W'v + a$, and for the simultaneous confidence intervals of the parameters $K_i'b$, where $K = [K_1, K_2, \ldots, K_s]$, $W$, and $a$ are known. None of the matrices $X$, $\Sigma$, $R$, $M$, $K$, and $W$ is required to have full rank, and the design $X$ can be one- or multi-way, complete or incomplete, balanced or unbalanced, connected or disconnected.

Let $T_i^2 = z_i' S^{-1} z_i$, $i = 1, \ldots, k$, be correlated Hotelling's $T^2$ statistics under normality, where $z = (z_1', \ldots, z_k')'$ and $nS$ are independently distributed as $N_{kp}(0, \rho \otimes \Sigma)$ and the Wishart distribution $W_p(\Sigma, n)$, respectively. The purpose of this paper is to study the distribution function $F(x_1, \ldots, x_k)$ of $(T_1^2, \ldots, T_k^2)$ when $n$ is large. First we derive an asymptotic expansion of the characteristic function of $(T_1^2, \ldots, T_k^2)$ up to the order $n^{-2}$. Next we give asymptotic expansions for $(T_1^2, \ldots, T_k^2)$ in two cases, (i) $\rho = I_k$ and (ii) $k = 2$, by inverting the expanded characteristic function up to the orders $n^{-2}$ and $n^{-1}$, respectively. Our results can be applied to the distribution function of $\max(T_1^2, \ldots, T_k^2)$ as a special case.

The problem of inference in Bayesian Normal mixture models is known to be difficult. In particular, direct Bayesian inference (via quadrature) suffers from a combinatorial explosion in having to consider every possible partition of $n$ observations into $k$ mixture components, resulting in a computation time which is $O(k^n)$. This paper explores the use of discretised parameters and shows that for equal-variance mixture models, direct computation time can be reduced to $O(D^k n^k)$, where relevant continuous parameters are each divided into $D$ regions. As a consequence, direct inference is now possible on genuine data sets for small $k$, where the quality of approximation is determined by the level of discretisation. For large problems, where the computational complexity is still too great in $O(D^k n^k)$ time, discretisation can provide a convergence diagnostic for a Markov chain Monte Carlo analysis.

Consider a family of square-integrable $\mathbb{R}^d$-valued statistics $S_k = S_k(X_{1,k_1}; X_{2,k_2}; \ldots; X_{m,k_m})$, where the independent samples $X_{i,k_i}$ respectively have $k_i$ i.i.d. components taking values in some separable metric space $\mathfrak{X}_i$. We prove a strong law of large numbers, a central limit theorem and a law of the iterated logarithm for the sequence $\{S_k\}$, covering both the situation where the sample sizes tend to infinity while $m$ is fixed and the situation where the sample sizes remain small while $m$ tends to infinity. We also obtain two almost sure convergence results in both these contexts, under the additional assumption that $S_k$ is symmetric in the coordinates of each sample $X_{i,k_i}$. Some extensions to row-exchangeable and conditionally independent observations are provided. Applications to an estimator of the dimension of a data set and to the Henze-Schilling test statistic for equality of two densities are also presented.

Let $X$ and $Y$ be two arbitrary $k$-dimensional discrete random vectors, for $k \ge 1$. We prove that there exists a coupling method which minimizes $P(X \ne Y)$. This result is used to find the least upper bound for the metric $d(X, Y) = \sup_A |P(X \in A) - P(Y \in A)|$ and to derive the inequality $d(\sum X_i, \sum Y_i) \le \sum d(X_i, Y_i)$. We thus obtain a unified method to measure the disparity between the distributions of sums of independent random vectors. Several examples are given.
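The metric and the sum inequality above can be checked on small examples. The sketch below is an illustration under stated assumptions, not the paper's construction: it handles the scalar case ($k = 1$) with probability mass functions stored as dictionaries, and uses the standard fact that for discrete distributions the supremum over sets $A$ equals half the $L_1$ distance between the mass functions. The names `tv_distance` and `convolve` are hypothetical.

```python
from itertools import product


def tv_distance(p, q):
    """d(X, Y) = sup_A |P(X in A) - P(Y in A)| for pmfs given as dicts.

    For discrete laws the supremum is attained at A = {x : p(x) > q(x)},
    which gives half the L1 distance between the two pmfs.
    """
    support = set(p) | set(q)
    return 0.5 * sum(abs(p.get(x, 0.0) - q.get(x, 0.0)) for x in support)


def convolve(p, q):
    """pmf of the sum of two independent variables with pmfs p and q."""
    out = {}
    for (x, px), (y, qy) in product(p.items(), q.items()):
        out[x + y] = out.get(x + y, 0.0) + px * qy
    return out


if __name__ == "__main__":
    # Two independent pairs (X1, Y1) and (X2, Y2) of Bernoulli variables.
    p1, q1 = {0: 0.7, 1: 0.3}, {0: 0.6, 1: 0.4}
    p2, q2 = {0: 0.5, 1: 0.5}, {0: 0.2, 1: 0.8}
    lhs = tv_distance(convolve(p1, p2), convolve(q1, q2))
    rhs = tv_distance(p1, q1) + tv_distance(p2, q2)
    print("d(sum X, sum Y) =", lhs, "<= sum of d(X_i, Y_i) =", rhs)
```

Working with dictionaries keeps the supports arbitrary; extending the sketch to vectors would need a componentwise sum in `convolve`.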

Let $\{X_t\}$ be a strictly stationary multiple time series with values in $\mathbb{R}^d$ and with a common density $f$. Let $X_1, \ldots, X_n$ be $n$ consecutive observations of the series. Let $k = k_n$ be a sequence of positive integers, and let $H_{ni}$ be the distance from $X_i$ to its $k$th nearest neighbour among $X_j$, $j \ne i$. The multivariate variable-kernel estimate $f_n$ of $f$ is constructed from the distances $H_{ni}$ and a kernel $K$, where $K$ is a given density. The complete convergence of $f_n$ to $f$ on compact sets is established for time series satisfying a dependence condition (referred to as the strong mixing condition in the locally transitive sense) weaker than the strong mixing condition. Appropriate choices of $k$ are explicitly given. The results apply to autoregressive processes and bilinear time-series models.
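The defining formula of the estimate did not survive in the text above. As a hedged illustration only, the sketch below assumes the commonly used variable-kernel form $f_n(x) = n^{-1} \sum_{i=1}^{n} H_{ni}^{-d} K((x - X_i)/H_{ni})$ with a standard Gaussian density for $K$; both the formula and the kernel choice are assumptions, and the function names are hypothetical.

```python
import math


def knn_distances(points, k):
    """H_i: Euclidean distance from points[i] to its k-th nearest
    neighbour among the other points (requires k <= len(points) - 1)."""
    out = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        out.append(dists[k - 1])
    return out


def gaussian_kernel(u):
    """Standard normal density on R^d evaluated at the vector u."""
    d = len(u)
    return math.exp(-0.5 * sum(c * c for c in u)) / (2.0 * math.pi) ** (d / 2.0)


def variable_kernel_estimate(x, points, k, kernel=gaussian_kernel):
    """Assumed form: (1/n) * sum_i H_i^(-d) * K((x - X_i) / H_i).

    Duplicate observations can give H_i = 0; this sketch does not
    handle that case.
    """
    d = len(x)
    h = knn_distances(points, k)
    total = 0.0
    for p, hi in zip(points, h):
        u = [(a - b) / hi for a, b in zip(x, p)]
        total += kernel(u) / hi ** d
    return total / len(points)


if __name__ == "__main__":
    data = [(0.1, 0.2), (0.4, 0.1), (0.3, 0.5), (0.9, 0.8), (0.6, 0.4)]
    print(variable_kernel_estimate((0.4, 0.3), data, k=2))
```

Each observation gets its own bandwidth, so the estimate smooths less where the data are dense and more where they are sparse.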

This paper investigates the tail behavior of the randomly weighted sum $\sum_{k=1}^{n} \theta_k X_k$ and reaches an asymptotic formula, where $X_k$, $1 \le k \le n$, are real-valued linearly wide quadrant-dependent (LWQD) random variables with a common heavy-tailed distribution, and $\theta_k$, $1 \le k \le n$, independent of $X_k$, $1 \le k \le n$, are $n$ non-negative random variables without any dependence assumptions. The LWQD structure includes the linearly negative quadrant-dependent structure, the negatively associated structure, and hence the independence structure. On the other hand, it also includes some positively dependent random variables and some other random variables. The obtained result coincides with the existing ones.

As the sample size increases, the coefficient of skewness of Fisher's transformation $z = \tanh^{-1} r$ of the correlation coefficient decreases much more rapidly than the excess of its kurtosis. Hence, the distribution of standardized $z$ can be approximated more accurately by the $t$ distribution with matching kurtosis than by the unit normal distribution. This $t$ distribution can, in turn, be subjected to Wallace's approximation, resulting in a new normal approximation for Fisher's $z$ transform. This approximation, which can be used to estimate probabilities as well as percentiles, compares favorably in both accuracy and simplicity with the two best earlier approximations, namely those due to Ruben (1966) and Kraemer (1973).

Fisher (1921) suggested approximating the distribution of the variance-stabilizing transform $z = (1/2)\log((1 + r)/(1 - r))$ of the correlation coefficient $r$ by the normal distribution with mean $(1/2)\log((1 + \rho)/(1 - \rho))$ and variance $1/(n - 3)$. This approximation is generally recognized as being remarkably accurate when $|\rho|$ is moderate but not so accurate when $|\rho|$ is large, even when $n$ is not small (David (1938)). Among various alternatives to Fisher's approximation, the normalizing transformation due to Ruben (1966) and a $t$ approximation due to Kraemer (1973) are interesting on the grounds of novelty, accuracy and/or aesthetics. If $\tilde{r} = r/\sqrt{1 - r^2}$ and $\tilde{\rho} = \rho/\sqrt{1 - \rho^2}$, then Ruben (1966) showed that

$$g_n(r, \rho) = \frac{\{(2n - 5)/2\}^{1/2}\,\tilde{r} - \{(2n - 3)/2\}^{1/2}\,\tilde{\rho}}{\{1 + (1/2)(\tilde{r}^2 + \tilde{\rho}^2)\}^{1/2}} \tag{1}$$

is approximately unit normal. Kraemer (1973) suggests approximating

$$t_n(r, \rho) = \frac{(r - \rho_1)\sqrt{n - 2}}{\sqrt{1 - r^2}\,\sqrt{1 - \rho_1^2}} \tag{2}$$

by a Student's $t$ variable with $(n - 2)$ degrees of freedom, where, after considering various valid choices for $\rho_1$, she recommends taking $\rho_1 = \rho^*$, the median of $r$ given $n$ and $\rho$.
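The three classical approximations discussed above are easy to evaluate numerically. The following is a minimal sketch under stated assumptions: the function names are hypothetical, Ruben's and Kraemer's statistics are coded in their usual published forms, and Kraemer's centering value `rho1` is left as an argument because computing the median $\rho^*$ of $r$ is outside the scope of this illustration. The new approximation proposed in the text is not implemented here.

```python
import math
from statistics import NormalDist


def fisher_z(r):
    """Fisher's transform z = atanh(r) = (1/2) log((1 + r) / (1 - r))."""
    return math.atanh(r)


def fisher_cdf(r, rho, n):
    """Approximate P(sample correlation <= r) via Fisher (1921):
    z is treated as normal with mean atanh(rho) and variance 1/(n - 3)."""
    return NormalDist(math.atanh(rho), 1.0 / math.sqrt(n - 3)).cdf(math.atanh(r))


def ruben_g(r, rho, n):
    """Ruben's (1966) statistic, approximately unit normal."""
    rt = r / math.sqrt(1.0 - r * r)
    pt = rho / math.sqrt(1.0 - rho * rho)
    num = math.sqrt((2 * n - 5) / 2.0) * rt - math.sqrt((2 * n - 3) / 2.0) * pt
    return num / math.sqrt(1.0 + 0.5 * (rt * rt + pt * pt))


def kraemer_t(r, rho1, n):
    """Kraemer's (1973) statistic, referred to Student's t with n - 2
    degrees of freedom; rho1 is supplied by the caller (an assumption)."""
    return (r - rho1) * math.sqrt(n - 2) / math.sqrt((1.0 - r * r) * (1.0 - rho1 * rho1))


if __name__ == "__main__":
    r, rho, n = 0.6, 0.5, 25
    print("Fisher:", fisher_cdf(r, rho, n))
    print("Ruben: ", NormalDist().cdf(ruben_g(r, rho, n)))
    print("Kraemer t statistic (rho1 = rho):", kraemer_t(r, rho, n))
```

Using `rho1 = rho` in the last call is only a placeholder for the median-based choice recommended in the text.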

Consider $k\ (\ge 2)$ normal populations whose means are all known or unknown and whose variances are unknown. Let $\sigma^2_{[1]} \le \cdots \le \sigma^2_{[k]}$ denote the ordered variances. Our goal is to select a nonempty subset of the $k$ populations whose size is at most $m\ (1 \le m \le k - 1)$ so that the population associated with the smallest variance (called the best population) is included in the selected subset with a guaranteed minimum probability $P^*$ whenever $\sigma^2_{[2]}/\sigma^2_{[1]} \ge \delta^* > 1$, where $P^*$ and $\delta^*$ are specified in advance of the experiment. Based on samples of size $n$ from each of the populations, we propose and investigate a procedure called RBCP. We also derive some asymptotic results for our procedure. Some comparisons with an earlier available procedure are presented in terms of the average subset sizes for selected slippage configurations based on simulations. The results are illustrated by an example.


The least squares estimator of the stability parameter $\varrho := |\alpha| + |\beta|$ for a spatial unilateral autoregressive process $X_{k,\ell} = \alpha X_{k-1,\ell} + \beta X_{k,\ell-1} + \varepsilon_{k,\ell}$ is investigated, and asymptotic normality with a scaling factor $n^{5/4}$ is shown in the unstable case $\varrho = 1$. The result is in contrast to the unit root case of the AR($p$) model $X_k = \alpha_1 X_{k-1} + \cdots + \alpha_p X_{k-p} + \varepsilon_k$, where the limiting distribution of the least squares estimator of the unit root parameter $\varrho := \alpha_1 + \cdots + \alpha_p$ is not normal.

A class of invariant Bayes rules is derived for testing homogeneity of $k\ (\ge 2)$ different populations against the $\binom{k}{t}$ slippage alternatives that some (unknown) subset of size $t$ of the given populations has parameters larger than those of the remaining $k - t$, where $t$ is a given integer between 1 and $k - 1$. For a similar problem in nonparametric situations, locally best tests based on ranks are derived.

Let $\pi_1, \ldots, \pi_k$ denote $k\ (\ge 2)$ populations with unknown means $\mu_1, \ldots, \mu_k$ and variances $\sigma_1^2, \ldots, \sigma_k^2$, respectively, and let $\pi_0$ denote the control population having mean $\mu_0$ and variance $\sigma_0^2$. It is assumed that these populations are normally distributed with correlation matrix $\{\rho_{ij}\}$. The goal is to select a subset of the populations $\pi_1, \ldots, \pi_k$ which contains all the populations with means larger than or equal to the mean of the control one. Procedures are given for selecting such a subset so that the probability that all the populations with means larger than or equal to the mean of the control one are included in the selected subset is at least equal to a predetermined value $P^*$ ($1/k < P^* < 1$). The goal treated here is a first-step screening procedure that allows the experimenter to choose a subset and withhold judgement about which population has the largest mean. Then, if the one with the largest mean is desired, it can be chosen from the selected subset on the basis of cost and other considerations. Percentage points are also included.

In pattern classification of sampled vector-valued random variables it is often essential, due to computational and accuracy considerations, to consider certain measurable transformations of the random variable. These transformations are generally of a dimension-reducing nature. In this paper we consider the class of linear dimension-reducing transformations, i.e., the $k \times n$ matrices of rank $k$, where $k < n$ and $n$ is the dimension of the range of the sampled vector random variable.

In this connection, we use certain results (Decell and Quirein, 1973) that guarantee, relative to various class separability criteria, the existence of an extremal transformation. These results also guarantee that the extremal transformation can be expressed in the form $(I_k \mid Z)U$, where $I_k$ is the $k \times k$ identity matrix and $U$ is an orthogonal $n \times n$ matrix. These results limit the search for the extremal linear transformation to a search over the smaller class of $k \times n$ matrices of the form $(I_k \mid Z)U$. In this paper these results are refined in the sense that any extremal transformation can be expressed in the form $(I_k \mid Z)H_p \cdots H_1$, where $p \le \min\{k, n - k\}$ and $H_i$ is a Householder transformation, $i = 1, \ldots, p$. The latter result allows one to construct a sequence of transformations $(I_k \mid Z)H_1$, $(I_k \mid Z)H_2 H_1, \ldots$ such that the values of the class separability criterion evaluated along this sequence form a bounded, monotone sequence of real numbers. The construction of the $i$-th element of the sequence of transformations requires the solution of an $n$-dimensional optimization problem. The solution of the optimization problem, for various class separability criteria, will be the subject of later papers. We have conjectured (with supporting theorems and empirical results) that, since the bounded monotone sequence of real class separability values converges to its least upper bound, this least upper bound is an extremal value of the class separability criterion.

Several open questions are stated and the practical implications of the results are discussed.
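The factored form described above, a block matrix $(I_k \mid Z)$ multiplied by a product of Householder transformations, can be assembled in a few lines. This is a minimal sketch only: the function names are hypothetical, the reflection vectors are arbitrary inputs, and the optimization that would choose them for a given class separability criterion is not attempted.

```python
def householder_matrix(v):
    """Householder reflection H = I - 2 v v' / (v'v) for a nonzero vector v."""
    n = len(v)
    norm_sq = sum(c * c for c in v)
    if norm_sq == 0.0:
        raise ValueError("v must be nonzero")
    return [[(1.0 if i == j else 0.0) - 2.0 * v[i] * v[j] / norm_sq
             for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Plain product of two matrices stored as lists of rows."""
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def compose_transform(z, vectors):
    """Build (I_k | Z) H_p ... H_1, where vectors = [v_1, ..., v_p]
    generate H_1, ..., H_p and Z is a k x (n - k) list of rows."""
    k = len(z)
    b = [[1.0 if i == j else 0.0 for j in range(k)] + list(z[i]) for i in range(k)]
    # Right-multiply by H_p first and H_1 last to obtain (I_k | Z) H_p ... H_1.
    for v in reversed(vectors):
        b = matmul(b, householder_matrix(v))
    return b


if __name__ == "__main__":
    z = [[0.5], [-1.0]]  # k = 2, n = 3
    b = compose_transform(z, [[1.0, 2.0, 2.0]])
    for row in b:
        print(row)
```

Because each Householder matrix is orthogonal, multiplying by it leaves the Gram matrix of the rows of $(I_k \mid Z)$ unchanged, which is a convenient sanity check on any implementation.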

Consider $k$ independent random samples with different sample sizes such that the $i$th sample comes from the cumulative distribution function (cdf) $F_i = 1 - (1 - F)^{\alpha_i}$, where $\alpha_i$ is a known positive constant and $F$ is an absolutely continuous cdf. Also, suppose that we have observed the maximum and minimum of the first $k$ samples. This article shows how one can construct nonparametric prediction intervals for the order statistics of future samples on the basis of this information. Three schemes are studied, and in each case exact expressions for the prediction coefficients of the prediction intervals are derived. Numerical computations are given to illustrate the results. A comparison study is also carried out for the case when the complete samples are available.

Suppose that data $\{(x_{l,i,n}, y_{l,i,n}) : l = 1, \ldots, k;\ i = 1, \ldots, n\}$ are observed from the regression models $Y_{l,i,n} = m_l(x_{l,i,n}) + \varepsilon_{l,i,n}$, $l = 1, \ldots, k$, where the regression functions $\{m_l\}_{l=1}^{k}$ are unknown and the random errors $\{\varepsilon_{l,i,n}\}$ are dependent, following an MA($\infty$) structure. A new test is proposed for testing the hypothesis $H_0: m_1 = \cdots = m_k$, without assuming that $\{m_l\}_{l=1}^{k}$ are in a parametric family. The criterion of the test derives from a Cramér-von Mises-type functional based on different distances between $\hat{m}_l$ and $\hat{m}_s$, $l \ne s$, $l, s = 1, \ldots, k$, where $\{\hat{m}_l\}_{l=1}^{k}$ are nonparametric Gasser-Müller estimators of $\{m_l\}_{l=1}^{k}$. A generalization of the test to the case of unequal design points, with different sample sizes $\{n_l\}_{l=1}^{k}$ and different design densities $\{f_l\}_{l=1}^{k}$, is also considered. The asymptotic normality of the test statistic is obtained under general conditions. Finally, a simulation study and an analysis with real data show a good behavior of the proposed test.

