首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The first aim of this paper is to introduce a modular test for the three-way contingency table (TT). The second aim is to describe the procedure of generating TT using the bar method. The third aim is on the one hand to suggest the measure of untruthfulness of H0 and on the other hand to compare the quality of independence tests by using their power. Critical values for analyzed statistics were determined by simulating the Monte Carlo method.  相似文献   

2.
Let X (n) and X (1) be the largest and smallest order statistics, respectively, of a random sample of fixed size n. Quite generally, X (1) and X (n) are approximately independent for n sufficiently large. In this article, we study the dependence properties of random extremes in terms of their copula, when the sample size has a left-truncated binomial distribution and show that they tend to be more dependent in this case. We also give closed-form formulas for the measures of association Kendall's τ and Spearman's ρ to measure the amount of dependence between two extremes.  相似文献   

3.
For location–scale families, we consider a random distance between the sample order statistics and the quasi sample order statistics derived from the null distribution as a measure of discrepancy. The conditional qth quantile and expectation of the random discrepancy on the given sample are chosen as test statistics. Simulation results of powers against various alternatives are illustrated under the normal and exponential hypotheses for moderate sample size. The proposed tests, especially the qth quantile tests with a small or large q, are shown to be more powerful than other prominent goodness-of-fit tests in most cases.  相似文献   

4.
For two independent populations X and Y we develop the empirical distribution function estimator for the difference of order statistics of the form X (i)Y (j). The key practical application for this estimator pertains to inference between quantiles from two independent populations.  相似文献   

5.
Two processes of importance in statistics and probability are the empirical and partial-sum processes. Based on d-dimensional data X1, … Xa the empirical measure is defined for any ARd by the sample proportion of observations in A. When normalized, Fn yields the empirical process Wn: = n1/2 (Fn - F), where F denotes the “true” probability measure. To define partial-sum processes, one needs data that are assigned to specified locations (in contrast to the above, where specified unit masses are assigned to random locations). A suitable context for many applications is that of data attached to points of a lattice, say {Xj:j ϵ Jd} where J = {1, 2,…}, for which the partial sums are defined for any ARd by Thus S(A) is the sum of the data contained in A. When normalized, S yields the partial-sum process. This paper provides an overview of asymptotic results for empirical and partial-sum processes, including strong laws and central limit theorems, together with some indications of their inferential implications.  相似文献   

6.
Suppose that a density fθ (x) belongs to an exponential family, but that inference about θ must be based on data that are obtained from a density that is proportional to W(x)fθ(x). The authors study the Fisher information about θ in observations obtained from such weighted distributions and give conditions under which this information is greater than under the original density. These conditions involve the hazard- and reversed-hazard-rate functions.  相似文献   

7.
In this paper, we study the Pitman measure of closeness of order statistics of two independent samples from the same distribution to population quantiles. We then derive various exact expressions of the probability closeness of order statistics from the X and Y samples. Some distribution-free results for the median of the sampling distribution are obtained. Exact and explicit expressions are presented for Uniform(?1, 1) and exponential distributions. Numerical results for illustrative purposes are also provided.  相似文献   

8.
In this paper we give some properties of the expected values of any order statistic when one of its adjacent order statistics is known (order mean function) from a sequence of sizen of independent and identically distributed random variables with discrete distribution. Furthermore, we obtain the explicit expressions of the distribution from these order mean functions, and finally, we show the necessary and sufficient conditions for any real function to be an order mean function. We also add some examples of characterization of discrete distributions from the order mean functions. Partially supported by Consejería de Cultura y Educación (C.A.R.M.), under Grant PIB 95/90.  相似文献   

9.
The process of detection of outliers is an interesting and important aspect in the analysis of data, as it could impact the inference. There are various methods available in the literature for detection of outliers in multivariate data [V. Barnett and T. Lewis, Outliers in Statistical Data, John Wiley & Sons, Chichester, 1994] using the Mahalanobis distance measure. An attempt is made to propose an alternate method of outlier detection based on the comedian introduced by Falk [On MAD and Comedians, Ann. Inst. Statist. Math. 49 (1997), pp. 615–644]. The proposed method is computationally efficient with high breakdown value and low computation time. Further, important properties, namely, success rates (SR) and false detection rates (FDR) are studied and compared with some of the well-known outlier detection methods through a simulation study. The Comedian method has high SR and low FDR for all combination of parameters. On removal of the detected outliers or down weighing, the same, highly robust and approximately affine equivariant estimators of multivariate location and scatter can be obtained. Finally, the method is applied to well-known real data sets to evaluate its performance.  相似文献   

10.
In this paper, we consider the auto-odds ratio function (AORF) as a measure of serial association for a stationary time series process of categorical data at two different time points. Numerical measures such as the autocorrelation function (ACF) have no meaningful interpretation, unless the time series data are numerical. Instead, we use the AORF as a measure of association to study the serial dependency of the categorical time series for both ordinal and nominal categories. Biswas and Song [Discrete-valued ARMA processes. Stat Probab Lett. 2009;79(17):1884–1889] provided some results on this measure for Pegram's operator-based AR(1) process with binary responses. Here, we extend this measure to more general set-ups, i.e. for AR(p) and MA(q) processes and for a general number of categories. We discuss how this method can effectively be used in parameter estimation and model selection. Following Weiß [Empirical measures of signed serial dependence in categorical time series. J Stat Comput Simul. 2011;81(4):411–429], we derive the large sample distribution of the estimator of the AORF under independent and identically distributed (iid) set-up. Some simulation results and two categorical data examples (one is ordinal and other nominal) are presented to illustrate the proposed method.  相似文献   

11.
A necessary and sufficient condition that two distributions having finite means are identical is that for any fixed integer r > 0, the expected values of their rth (n ? r) order statistics are equal [or the expected values of their (n-r)th (n > r ? 0) order statistics are equal] for all n where n is the sample size.  相似文献   

12.
SenGupta (1987) proposed a locally most powerful test which is globally (one sided) unbiased, and an estimator of p, the equicorrelation coefficient of a standard symmetric multivariate normal (SSMN) distribution. Here we use the idea in Williams (1984) to illustrate the construction and use of ancillary statistics to make inference about p. The test and confidence intervals based on this construction are conditionally optimal.  相似文献   

13.
The Gini coefficient is used to measure inequality in populations. However, shifts in the population distribution may affect subgroups differently. Consequently, it can be informative to examine inequality separately for these segments. Consider an independently and identically distributed sample split based on ranking and compute the Gini coefficient for each partition. These coefficients, calculated from post-stratified data, are not functions of U-statistics. Therefore, previous theoretical and methodological results cannot be applied. In this article, the asymptotic joint distribution is derived for the partitioned coefficients and bootstrap methods for inference are developed. Finally, an application to per capita income across census tracts is examined.  相似文献   

14.
Abstract

In this paper, we consider a k-out-of-n system consisting of n identical components with independent lifetimes. We show that when the underlying distribution function F(t) is absolutely continuous, then it can be univocally determined by some particular mean residual lives or mean inactivity times of the system. It is then shown that these results may be extended to coherent (or mixed) systems.  相似文献   

15.
Fantasy sports, particularly the daily variety in which new lineups are selected each day, are a rapidly growing industry. The two largest companies in the daily fantasy business, DraftKings and Fanduel, have been valued as high as $2 billion. This research focuses on the development of a complete system for daily fantasy basketball, including both the prediction of player performance and the construction of a team. First, a Bayesian random effects model is used to predict an aggregate measure of daily NBA player performance. The predictions are then used to construct teams under the constraints of the game, typically related to a fictional salary cap and player positions. Permutation based and K-nearest neighbors approaches are compared in terms of the identification of “successful” teams—those who would be competitive more often than not based on historical data. We demonstrate the efficacy of our system by comparing our predictions to those from a well-known analytics website, and by simulating daily competitions over the course of the 2015–2016 season. Our results show an expected profit of approximately $9,000 on an initial $500 investment using the K-nearest neighbors approach, a 36% increase relative to using the permutation-based approach alone. Supplementary materials for this article are available online.  相似文献   

16.
ABSTRACT

Based on the observed dual generalized order statistics drawn from an arbitrary unknown distribution, nonparametric two-sided prediction intervals as well as prediction upper and lower bounds for an ordinary and a dual generalized order statistic from another iid sequence with the same distribution are developed. The prediction intervals for dual generalized order statistics based on the observed ordinary generalized order statistics are also developed. The coverage probabilities of these prediction intervals are exact and free of the parent distribution, F. Finally, numerical computations and real examples of the coverage probabilities are presented for choosing the appropriate limits of the prediction.  相似文献   

17.
This paper proposes an optimal estimation method for the shape parameter, probability density function and upper tail probability of the Pareto distribution. The new method is based on a weighted empirical distribution function. The exact efficiency functions of the estimators relative to the existing estimators are derived. The paper gives L 1-optimal and L 2-optimal weights for the new weighted estimator. Monte Carlo simulation results confirm the theoretical conclusions. Both theoretical and simulation results show that the new estimation method is more efficient relative to several existing methods in many situations.  相似文献   

18.
In this paper, we investigate the effect of a cold standby component on the mean residual life (MRL) of a system. When the system fails, a cold standby component is immediately put in operation. We particularly focus on the coherent systems in which, after putting the standby component into operation, the failure of the system is due to the next component failure. For these systems, we define MRL functions and obtain their explicit expressions. Also some stochastic ordering results are provided. Such systems include k-out-of-n systems. Hence, our results extend some results in literature.  相似文献   

19.
Often, in reliability theory, risk analysis, renewal processes and actuarial studies, mean residual life function or life expectancy plays an important role in studying the conditional tail measure of lifetime data. In this paper, we introduce the notion of the mean residual waiting time of records and present some monotonic and aging properties. Sharp bounds for the mean residual waiting time of records are also investigated.  相似文献   

20.
Kurt Hoffmann 《Statistics》2013,47(4):559-566
The problem of characterizing an exponential family by sufficiency of certain statistics is considered. In distinction to most of the papers on this subject we do not want to characterize an exponential family of order less than or equal to k by the existence of a-dimensional sufficient statis tics in.GepenoenX' oi inc sample size. Since such a characterization is only valid under regularity assumptions, which is shown in the paper, we consider a stronger property of an exponential family which turns out to be a characteristic one. At this the concept of generalized likelihood functions will play an important role.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号