期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Power analysis of independence testing for three-way contingency tables of small sizes

Piotr Sulewski 《Journal of applied statistics》2018,45(13):2481-2498

The first aim of this paper is to introduce a modular test for the three-way contingency table (TT). The second aim is to describe the procedure of generating TT using the bar method. The third aim is on the one hand to suggest the measure of untruthfulness of H₀ and on the other hand to compare the quality of independence tests by using their power. Critical values for analyzed statistics were determined by simulating the Monte Carlo method. 相似文献

2.

On Dependence Properties of Random Minima and Maxima

Ali Dolati 《统计学通讯:理论与方法》2013,42(3):393-399

Let X _(n) and X ₍₁₎ be the largest and smallest order statistics, respectively, of a random sample of fixed size n. Quite generally, X ₍₁₎ and X _(n) are approximately independent for n sufficiently large. In this article, we study the dependence properties of random extremes in terms of their copula, when the sample size has a left-truncated binomial distribution and show that they tend to be more dependent in this case. We also give closed-form formulas for the measures of association Kendall's τ and Spearman's ρ to measure the amount of dependence between two extremes. 相似文献

3.

Goodness-of-fit tests for location–scale families based on random distance

《Journal of Statistical Computation and Simulation》2012,82(4):739-752

For location–scale families, we consider a random distance between the sample order statistics and the quasi sample order statistics derived from the null distribution as a measure of discrepancy. The conditional qth quantile and expectation of the random discrepancy on the given sample are chosen as test statistics. Simulation results of powers against various alternatives are illustrated under the normal and exponential hypotheses for moderate sample size. The proposed tests, especially the qth quantile tests with a small or large q, are shown to be more powerful than other prominent goodness-of-fit tests in most cases. 相似文献

4.

A distribution function estimator for the difference of order statistics from two independent samples

Alan D. Hutson 《Statistical Papers》2009,50(1):203-208

For two independent populations X and Y we develop the empirical distribution function estimator for the difference of order statistics of the form X _(i)−Y _(j). The key practical application for this estimator pertains to inference between quantiles from two independent populations. 相似文献

5.

Asymptotic results for empirical and partial-sum processes: A review

Ronald Pyke 《Revue canadienne de statistique》1984,12(4):241-264

Two processes of importance in statistics and probability are the empirical and partial-sum processes. Based on d-dimensional data X₁, … X_a the empirical measure is defined for any A ⊂ R^d by the sample proportion of observations in A. When normalized, F_n yields the empirical process W_n: = n^1/2 (F_n - F), where F denotes the “true” probability measure. To define partial-sum processes, one needs data that are assigned to specified locations (in contrast to the above, where specified unit masses are assigned to random locations). A suitable context for many applications is that of data attached to points of a lattice, say {X_j:j ϵ J^d} where J = {1, 2,…}, for which the partial sums are defined for any A ⊂ R^d by Thus S(A) is the sum of the data contained in A. When normalized, S yields the partial-sum process. This paper provides an overview of asymptotic results for empirical and partial-sum processes, including strong laws and central limit theorems, together with some indications of their inferential implications. 相似文献

6.

Fisher information in weighted distributions

Satish Iyengar Paul Kvam Harshinder Singh 《Revue canadienne de statistique》1999,27(4):833-841

Suppose that a density f_θ (x) belongs to an exponential family, but that inference about θ must be based on data that are obtained from a density that is proportional to W(x)f_θ(x). The authors study the Fisher information about θ in observations obtained from such weighted distributions and give conditions under which this information is greater than under the original density. These conditions involve the hazard- and reversed-hazard-rate functions. 相似文献

7.

Comparison of order statistics in two-sample problem in the sense of Pitman closeness

Jafar Ahmadi Mohammad Z. Raqab 《Statistics》2013,47(4):729-743

In this paper, we study the Pitman measure of closeness of order statistics of two independent samples from the same distribution to population quantiles. We then derive various exact expressions of the probability closeness of order statistics from the X and Y samples. Some distribution-free results for the median of the sampling distribution are obtained. Exact and explicit expressions are presented for Uniform(?1, 1) and exponential distributions. Numerical results for illustrative purposes are also provided. 相似文献

8.

Characterization of discrete populations through conditional expectations of order statistics

Manuel Franco José M. Ruiz 《Statistical Papers》1998,39(3):249-262

In this paper we give some properties of the expected values of any order statistic when one of its adjacent order statistics is known (order mean function) from a sequence of sizen of independent and identically distributed random variables with discrete distribution. Furthermore, we obtain the explicit expressions of the distribution from these order mean functions, and finally, we show the necessary and sufficient conditions for any real function to be an order mean function. We also add some examples of characterization of discrete distributions from the order mean functions. Partially supported by Consejería de Cultura y Educación (C.A.R.M.), under Grant PIB 95/90. 相似文献

9.

Outlier detection for high dimensional data using the Comedian approach

《Journal of Statistical Computation and Simulation》2012,82(5):745-757

The process of detection of outliers is an interesting and important aspect in the analysis of data, as it could impact the inference. There are various methods available in the literature for detection of outliers in multivariate data [V. Barnett and T. Lewis, Outliers in Statistical Data, John Wiley & Sons, Chichester, 1994] using the Mahalanobis distance measure. An attempt is made to propose an alternate method of outlier detection based on the comedian introduced by Falk [On MAD and Comedians, Ann. Inst. Statist. Math. 49 (1997), pp. 615–644]. The proposed method is computationally efficient with high breakdown value and low computation time. Further, important properties, namely, success rates (SR) and false detection rates (FDR) are studied and compared with some of the well-known outlier detection methods through a simulation study. The Comedian method has high SR and low FDR for all combination of parameters. On removal of the detected outliers or down weighing, the same, highly robust and approximately affine equivariant estimators of multivariate location and scatter can be obtained. Finally, the method is applied to well-known real data sets to evaluate its performance. 相似文献

10.

Time series analysis of categorical data using auto-odds ratio function

Raju Maiti Atanu Biswas 《Statistics》2018,52(2):426-444

In this paper, we consider the auto-odds ratio function (AORF) as a measure of serial association for a stationary time series process of categorical data at two different time points. Numerical measures such as the autocorrelation function (ACF) have no meaningful interpretation, unless the time series data are numerical. Instead, we use the AORF as a measure of association to study the serial dependency of the categorical time series for both ordinal and nominal categories. Biswas and Song [Discrete-valued ARMA processes. Stat Probab Lett. 2009;79(17):1884–1889] provided some results on this measure for Pegram's operator-based AR(1) process with binary responses. Here, we extend this measure to more general set-ups, i.e. for AR(p) and MA(q) processes and for a general number of categories. We discuss how this method can effectively be used in parameter estimation and model selection. Following Weiß [Empirical measures of signed serial dependence in categorical time series. J Stat Comput Simul. 2011;81(4):411–429], we derive the large sample distribution of the estimator of the AORF under independent and identically distributed (iid) set-up. Some simulation results and two categorical data examples (one is ordinal and other nominal) are presented to illustrate the proposed method. 相似文献

11.

An alternative proof of order statistics moment problem

M. M. Ali 《Revue canadienne de statistique》1976,4(1):151-153

A necessary and sufficient condition that two distributions having finite means are identical is that for any fixed integer r > 0, the expected values of their rth (n ? r) order statistics are equal [or the expected values of their (n-r)th (n > r ? 0) order statistics are equal] for all n where n is the sample size. 相似文献

12.

CONDITIONAL INFERENCE ABOUT AN EQUICORRELATION COEFFICIENT

Evan J. Williams Paul Yip 《Australian & New Zealand Journal of Statistics》1989,31(1):138-142

SenGupta (1987) proposed a locally most powerful test which is globally (one sided) unbiased, and an estimator of p, the equicorrelation coefficient of a standard symmetric multivariate normal (SSMN) distribution. Here we use the idea in Williams (1984) to illustrate the construction and use of ancillary statistics to make inference about p. The test and confidence intervals based on this construction are conditionally optimal. 相似文献

13.

Theory and methods for partitioned Gini coefficients computed on post-stratified data

Chaitra H. Nagaraja 《统计学通讯:理论与方法》2017,46(10):4809-4823

The Gini coefficient is used to measure inequality in populations. However, shifts in the population distribution may affect subgroups differently. Consequently, it can be informative to examine inequality separately for these segments. Consider an independently and identically distributed sample split based on ranking and compute the Gini coefficient for each partition. These coefficients, calculated from post-stratified data, are not functions of U-statistics. Therefore, previous theoretical and methodological results cannot be applied. In this article, the asymptotic joint distribution is derived for the partitioned coefficients and bootstrap methods for inference are developed. Finally, an application to per capita income across census tracts is examined. 相似文献

14.

Some representation theorems in terms of mean residual lifetime and mean inactivity time of coherent systems

Mahdi Tavangar 《统计学通讯:理论与方法》2013,42(2):226-235

Abstract

In this paper, we consider a k-out-of-n system consisting of n identical components with independent lifetimes. We show that when the underlying distribution function F(t) is absolutely continuous, then it can be univocally determined by some particular mean residual lives or mean inactivity times of the system. It is then shown that these results may be extended to coherent (or mixed) systems. 相似文献

15.

A Starting Point for Navigating the World of Daily Fantasy Basketball

Charles South Ryan Elmore Andrew Clarage Rob Sickorez Jing Cao 《The American statistician》2019,73(2):179-185

Fantasy sports, particularly the daily variety in which new lineups are selected each day, are a rapidly growing industry. The two largest companies in the daily fantasy business, DraftKings and Fanduel, have been valued as high as $2 billion. This research focuses on the development of a complete system for daily fantasy basketball, including both the prediction of player performance and the construction of a team. First, a Bayesian random effects model is used to predict an aggregate measure of daily NBA player performance. The predictions are then used to construct teams under the constraints of the game, typically related to a fictional salary cap and player positions. Permutation based and K-nearest neighbors approaches are compared in terms of the identification of “successful” teams—those who would be competitive more often than not based on historical data. We demonstrate the efficacy of our system by comparing our predictions to those from a well-known analytics website, and by simulating daily competitions over the course of the 2015–2016 season. Our results show an expected profit of approximately $9,000 on an initial $500 investment using the K-nearest neighbors approach, a 36% increase relative to using the permutation-based approach alone. Supplementary materials for this article are available online. 相似文献

16.

Prediction intervals for ordinary and dual generalized order statistics from two independent sequences

M. M. Mohie El-Din 《统计学通讯:模拟与计算》2013,42(9):2703-2723

ABSTRACT

Based on the observed dual generalized order statistics drawn from an arbitrary unknown distribution, nonparametric two-sided prediction intervals as well as prediction upper and lower bounds for an ordinary and a dual generalized order statistic from another iid sequence with the same distribution are developed. The prediction intervals for dual generalized order statistics based on the observed ordinary generalized order statistics are also developed. The coverage probabilities of these prediction intervals are exact and free of the parent distribution, F. Finally, numerical computations and real examples of the coverage probabilities are presented for choosing the appropriate limits of the prediction. 相似文献

17.

Optimal estimation for the Pareto distribution

《Journal of Statistical Computation and Simulation》2012,82(12):2059-2076

This paper proposes an optimal estimation method for the shape parameter, probability density function and upper tail probability of the Pareto distribution. The new method is based on a weighted empirical distribution function. The exact efficiency functions of the estimators relative to the existing estimators are derived. The paper gives L ₁-optimal and L ₂-optimal weights for the new weighted estimator. Monte Carlo simulation results confirm the theoretical conclusions. Both theoretical and simulation results show that the new estimation method is more efficient relative to several existing methods in many situations. 相似文献

18.

A note on the mean residual life of a coherent system with a cold standby component

Ahmad Mirjalili Majid Rezaei 《统计学通讯:理论与方法》2017,46(20):10348-10358

In this paper, we investigate the effect of a cold standby component on the mean residual life (MRL) of a system. When the system fails, a cold standby component is immediately put in operation. We particularly focus on the coherent systems in which, after putting the standby component into operation, the failure of the system is due to the next component failure. For these systems, we define MRL functions and obtain their explicit expressions. Also some stochastic ordering results are provided. Such systems include k-out-of-n systems. Hence, our results extend some results in literature. 相似文献

19.

Some results on the mean residual waiting time of records

Mohammad Z. Raqab Majid Asadi 《Statistics》2013,47(5):493-504

Often, in reliability theory, risk analysis, renewal processes and actuarial studies, mean residual life function or life expectancy plays an important role in studying the conditional tail measure of lifetime data. In this paper, we introduce the notion of the mean residual waiting time of records and present some monotonic and aging properties. Sharp bounds for the mean residual waiting time of records are also investigated. 相似文献

20.

Sufficiency,generalized likelihood function and exponential family

Kurt Hoffmann 《Statistics》2013,47(4):559-566

The problem of characterizing an exponential family by sufficiency of certain statistics is considered. In distinction to most of the papers on this subject we do not want to characterize an exponential family of order less than or equal to k by the existence of a-dimensional sufficient statis tics in.GepenoenX' oi inc sample size. Since such a characterization is only valid under regularity assumptions, which is shown in the paper, we consider a stronger property of an exponential family which turns out to be a characteristic one. At this the concept of generalized likelihood functions will play an important role. 相似文献