Similar Articles
1.
Summary. Multiple-hypothesis testing involves guarding against much more complicated errors than single-hypothesis testing. Whereas we typically control the type I error rate for a single-hypothesis test, a compound error rate is controlled for multiple-hypothesis tests. For example, controlling the false discovery rate (FDR) traditionally involves intricate sequential p-value rejection methods based on the observed data. Whereas a sequential p-value method fixes the error rate and estimates its corresponding rejection region, we propose the opposite approach—we fix the rejection region and then estimate its corresponding error rate. This new approach offers increased applicability, accuracy and power. We apply the methodology to both the positive false discovery rate (pFDR) and FDR, and provide evidence for its benefits. It is shown that pFDR is probably the quantity of interest over FDR. Also discussed is the calculation of the q-value, the pFDR analogue of the p-value, which eliminates the need to set the error rate beforehand as is traditionally done. Some simple numerical examples are presented that show that this new approach can yield an increase of over eight times in power compared with the Benjamini–Hochberg FDR method.
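A minimal sketch of the q-value computation this abstract describes, with the null proportion pi0 conservatively set to 1 (the paper estimates it from the data, which is where the extra power comes from) and synthetic p-values:

```python
import numpy as np

def q_values(p):
    """q-values from p-values, with the null proportion pi0 set to 1
    (the paper estimates pi0 from the data, which increases power)."""
    p = np.asarray(p)
    m = len(p)
    order = np.argsort(p)
    q = np.empty(m)
    running_min = 1.0
    for rank in range(m, 0, -1):          # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, p[i] * m / rank)
        q[i] = running_min                # enforce monotone q-values
    return q

rng = np.random.default_rng(0)
p = np.concatenate([rng.uniform(0, 1e-3, 5), rng.uniform(size=95)])
print(np.round(q_values(p)[:5], 4))       # small q-values for the 5 signals
```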

2.
In this paper, we introduce a new risk measure, the so-called conditional tail moment. It is defined as the moment of order a ≥ 0 of the loss distribution above the upper α-quantile, where α ∈ (0,1). Estimating the conditional tail moment permits us to estimate all risk measures based on conditional moments, such as the conditional tail expectation, conditional value at risk or conditional tail variance. Here, we focus on the estimation of these risk measures in the case of extreme losses (where α ↓ 0 is no longer fixed). It is moreover assumed that the loss distribution is heavy tailed and depends on a covariate. The estimation method thus combines non-parametric kernel methods with extreme-value statistics. The asymptotic distribution of the estimators is established, and their finite-sample behaviour is illustrated both on simulated data and on a real data set of daily rainfalls.
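For a fixed quantile level, the empirical conditional tail moment is just an average over exceedances; a naive sketch (ignoring the paper's covariate smoothing and the α ↓ 0 extreme-value asymptotics) on simulated Pareto losses:

```python
import numpy as np

def conditional_tail_moment(x, a, alpha):
    """Empirical conditional tail moment of order a:
    E[X^a | X > q_{1-alpha}], estimated by averaging over exceedances."""
    x = np.asarray(x)
    q = np.quantile(x, 1 - alpha)          # upper alpha-quantile
    return np.mean(x[x > q] ** a)

rng = np.random.default_rng(1)
losses = rng.pareto(3.0, size=10_000) + 1.0        # heavy-tailed sample

cte = conditional_tail_moment(losses, a=1, alpha=0.05)          # tail expectation
ctv = conditional_tail_moment(losses, a=2, alpha=0.05) - cte**2  # tail variance
print("CTE:", round(cte, 3), "CTV:", round(ctv, 3))
```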

3.
The existing process capability indices (PCIs) assume that the distribution of the process being investigated is normal. For non-normal distributions, PCIs become unreliable in that they may indicate the process is capable when in fact it is not. In this paper, we propose a new index which can be applied to any distribution. The proposed index, Cf, is directly related to the probability of non-conformance of the process. For a given random sample, the estimation of Cf boils down to estimating non-parametrically the tail probabilities of an unknown distribution. The approach discussed in this paper is based on the works of Pickands (1975) and Smith (1987). We also discuss the construction of bootstrap confidence intervals for Cf based on the so-called accelerated bias-correction method (BCa). Several simulations are carried out to demonstrate the flexibility and applicability of Cf. Two real-life data sets are analysed using the proposed index.
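A sketch of the tail-probability idea behind such an index, approximating the Pickands/Smith approach by an ML fit of the generalized Pareto distribution to exceedances over a high threshold (the specification limit `usl` and the process data are hypothetical):

```python
import numpy as np
from scipy.stats import genpareto

rng = np.random.default_rng(2)
x = rng.lognormal(mean=0.0, sigma=0.5, size=5_000)   # non-normal process data
usl = 4.0                                            # hypothetical upper spec limit

u = np.quantile(x, 0.90)                 # high threshold
exc = x[x > u] - u                       # exceedances over the threshold
shape, _, scale = genpareto.fit(exc, floc=0)

# P(X > usl) ~= P(X > u) * P(GPD exceedance > usl - u)
p_nc = (len(exc) / len(x)) * genpareto.sf(usl - u, shape, loc=0, scale=scale)
print("estimated non-conformance probability:", p_nc)
```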

4.
One method of assessing the fit of an event history model is to plot the empirical standard deviation of standardised martingale residuals. We develop an alternative procedure which is valid also in the presence of measurement error and applicable to both longitudinal and recurrent event data. Since the covariance between martingale residuals at times t0 and t > t0 is independent of t, a plot of these covariances should, for fixed t0, have no time trend. A test statistic is developed from the increments in the estimated covariances, and we investigate its properties under various types of model misspecification. Applications of the approach are presented using two Brazilian studies measuring daily prevalence and incidence of infant diarrhoea and a longitudinal study into treatment of schizophrenia.
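The flat-covariance property is easy to see numerically; a toy check with simulated martingale-like residual paths (hypothetical data, not the paper's test statistic):

```python
import numpy as np

# Paths M[i, j] for subject i on time grid j: cumulative sums of independent
# increments are martingales, so cov(M(t0), M(t)) should not trend in t > t0.
rng = np.random.default_rng(3)
n, T = 200, 50
M = np.cumsum(rng.normal(0, 1, size=(n, T)) / np.sqrt(T), axis=1)

j0 = 10                                  # index of the fixed time t0
cov_t = [np.cov(M[:, j0], M[:, j])[0, 1] for j in range(j0, T)]
# Under a correct model these fluctuate around a constant;
# a trend in the increments signals misspecification.
print(np.round(cov_t[:5], 3), "...", np.round(cov_t[-5:], 3))
```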

5.
Emrah Altun, Statistics, 2019, 53(2):364–386
In this paper, we introduce a new distribution, called the generalized Gudermannian (GG) distribution, and its skew extension for GARCH models in modelling daily Value-at-Risk (VaR). Basic structural properties of the proposed distribution are obtained, including the probability density and cumulative distribution functions, moments, and a stochastic representation. The maximum likelihood method is used to estimate the unknown parameters of the proposed model, and the finite-sample performance of the maximum likelihood estimates is evaluated by means of a Monte Carlo simulation study. A real-data application to the Nikkei 225 index is given to demonstrate the performance of the GARCH model specified under the skew extension of the GG innovation distribution against the normal, Student's t, skew-normal, generalized error and skew generalized error distributions in terms of the accuracy of VaR forecasts. The empirical results show that the GARCH model with GG innovation distribution produces the most accurate VaR forecasts for all confidence levels.
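The GARCH-VaR pipeline itself is standard; a minimal sketch with normal innovations standing in for the paper's GG/skew-GG density (which would replace `norm` in the likelihood below) and a placeholder return series:

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def garch_filter(theta, r):
    """Recursive GARCH(1,1) conditional variances."""
    omega, alpha, beta = theta
    s2 = np.empty_like(r)
    s2[0] = np.var(r)
    for t in range(1, len(r)):
        s2[t] = omega + alpha * r[t - 1] ** 2 + beta * s2[t - 1]
    return s2

def neg_loglik(theta, r):
    # Swap norm for the innovation density of interest (e.g. skew-GG).
    return -np.sum(norm.logpdf(r, scale=np.sqrt(garch_filter(theta, r))))

rng = np.random.default_rng(4)
r = rng.standard_t(df=6, size=2_000) * 0.01        # placeholder return series

res = minimize(neg_loglik, x0=[1e-6, 0.05, 0.90], args=(r,),
               bounds=[(1e-12, None), (0.0, 1.0), (0.0, 1.0)])
omega, alpha, beta = res.x

s2 = garch_filter(res.x, r)
s2_next = omega + alpha * r[-1] ** 2 + beta * s2[-1]   # one-step-ahead variance
print("one-day 1% VaR:", np.sqrt(s2_next) * norm.ppf(0.01))
```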

6.
Data envelopment analysis (DEA) is the most commonly used approach for evaluating healthcare efficiency [B. Hollingsworth, The measurement of efficiency and productivity of health care delivery. Health Economics 17(10) (2008), pp. 1107–1128], but a long-standing concern is that DEA assumes that data are measured without error. This is quite unlikely, and DEA and other efficiency analysis techniques may yield biased efficiency estimates if measurement error is not accounted for [B.J. Gajewski, R. Lee, M. Bott, U. Piamjariyakul, and R.L. Taunton, On estimating the distribution of data envelopment analysis efficiency scores: an application to nursing homes’ care planning process. Journal of Applied Statistics 36(9) (2009), pp. 933–944; J. Ruggiero, Data envelopment analysis with stochastic data. Journal of the Operational Research Society 55 (2004), pp. 1008–1012]. We propose to address measurement error systematically using a Bayesian method (Bayesian DEA). We apply Bayesian DEA to data from the National Database of Nursing Quality Indicators® to estimate nursing units’ efficiency. Several external reliability studies inform the posterior distribution of the measurement error on the DEA variables. We also discuss how to generalize the approach to situations where an external reliability study is not feasible.
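The deterministic building block is the input-oriented CCR DEA linear program; a sketch on simulated units (the Bayesian layer, which propagates measurement error through this step, is omitted):

```python
import numpy as np
from scipy.optimize import linprog

def dea_efficiency(X, Y, j0):
    """Input-oriented CCR DEA efficiency of unit j0.
    X: (n_units, n_inputs), Y: (n_units, n_outputs)."""
    n, m = X.shape
    s = Y.shape[1]
    c = np.zeros(1 + n)
    c[0] = 1.0                                   # minimize theta
    # Inputs:  sum_j lambda_j x_ij - theta * x_{i,j0} <= 0
    A_in = np.hstack([-X[j0].reshape(m, 1), X.T])
    # Outputs: -sum_j lambda_j y_rj <= -y_{r,j0}
    A_out = np.hstack([np.zeros((s, 1)), -Y.T])
    A_ub = np.vstack([A_in, A_out])
    b_ub = np.concatenate([np.zeros(m), -Y[j0]])
    bounds = [(None, None)] + [(0, None)] * n    # theta free, lambdas >= 0
    return linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds).x[0]

rng = np.random.default_rng(5)
X = rng.uniform(1, 10, size=(20, 2))             # hypothetical inputs
Y = rng.uniform(1, 10, size=(20, 1))             # hypothetical outputs
print([round(dea_efficiency(X, Y, j), 2) for j in range(3)])
```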

7.
The analysis of data using a stable probability distribution with tail parameter α < 2 (sometimes called a Pareto–Lévy distribution) seems to have been avoided in the past, in part because of the lack of a significance test for the mean, even though it appears to be the correct distribution to use for describing returns in the financial markets. A z test for the significance of the mean of a stable distribution with tail parameter 1 < α ≤ 2 is defined. Tables are calculated and displayed for the 5% and 1% significance levels for a range of tail and skew parameters α and β. Through the use of maximum likelihood estimates, the test becomes a practical tool even when α and β are not that accurately determined. As an example, the z test is applied to the daily closing prices for the Dow Jones Industrial average from 2 January 1940 to 19 March 2010.
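A sketch of the stability argument underlying such a test, with (α, β, γ) treated as known (in practice they come from an ML fit, and scipy's numerical stable quantiles play the role of the paper's tables; this is not the paper's exact statistic):

```python
import numpy as np
from scipy.stats import levy_stable

# If X_i are iid stable(alpha, beta, loc=mu, scale=gamma) with alpha != 1,
# the sample mean is stable with the same alpha, beta and scale
# gamma * n**(1/alpha - 1), so z below is standard stable under H0: mu = mu0.
alpha, beta, gamma, mu0 = 1.7, 0.0, 1.0, 0.0
rng = np.random.default_rng(6)
x = levy_stable.rvs(alpha, beta, loc=mu0, scale=gamma, size=500, random_state=rng)

n = len(x)
z = (x.mean() - mu0) / (gamma * n ** (1 / alpha - 1))
lo, hi = levy_stable.ppf([0.025, 0.975], alpha, beta)   # 5% two-sided critical values
print(f"z = {z:.3f}, reject H0: {not lo < z < hi}")
```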

8.
The multivariate Student-t copula family is used in statistical finance and other areas when there is tail dependence in the data. It is often a good-fitting copula but can be improved on when there is tail asymmetry. Multivariate skew-t copula families can be considered when there is tail dependence and tail asymmetry, and we show how a fast numerical implementation for maximum likelihood estimation is possible. For the copula implicit in a multivariate skew-t distribution, the fast implementation makes use of (i) monotone interpolation of the univariate marginal quantile function and (ii) a re-parametrization of the correlation matrix. Our numerical approach is tested with simulated data with data-driven parameters. A real data example involves the daily returns of three stock indices: the Nikkei225, S&P500 and DAX. With both unfiltered returns and GARCH/EGARCH filtered returns, we compare the fits of the Azzalini–Capitanio skew-t, generalized hyperbolic skew-t, Student-t, skew-Normal and Normal copulas.
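The symmetric Student-t copula that these skew-t families extend has a likelihood that can be written down directly; a sketch that profiles the degrees of freedom with the correlation matrix held fixed (simulated pseudo-observations, not the paper's fast skew-t implementation):

```python
import numpy as np
from scipy.optimize import minimize_scalar
from scipy.stats import multivariate_t, t as student_t

def t_copula_loglik(u, df, corr):
    """Log-likelihood of a Student-t copula at pseudo-observations u in (0,1)."""
    z = student_t.ppf(u, df)                       # marginal quantile transform
    mvt = multivariate_t(shape=corr, df=df)
    return np.sum(mvt.logpdf(z) - student_t.logpdf(z, df).sum(axis=1))

rng = np.random.default_rng(7)
corr = np.array([[1.0, 0.6], [0.6, 1.0]])
z = multivariate_t(shape=corr, df=5).rvs(size=1_000, random_state=rng)
u = student_t.cdf(z, 5)                            # pseudo-observations

res = minimize_scalar(lambda d: -t_copula_loglik(u, d, corr),
                      bounds=(2.1, 50), method="bounded")
print("df estimate:", round(res.x, 2))
```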

9.
To summarize a set of data by a distribution function in Johnson's translation system, we use a least-squares approach to parameter estimation wherein we seek to minimize the distance between the vector of "uniformized" order statistics and the corresponding vector of expected values. We use the software package FITTR1 to apply this technique to three problems arising respectively in medicine, applied statistics, and civil engineering. Compared to traditional methods of distribution fitting based on moment matching, percentile matching, L1 estimation, and L∞ estimation, the least-squares technique is seen to yield fits of similar accuracy and to converge more rapidly and reliably to a set of acceptable parameter estimates.
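A sketch of the least-squares criterion for a Johnson S_U fit: the "uniformized" order statistics F(x_(i)) are pushed toward their expected values i/(n+1), with an ML fit as the starting point (the software package mentioned above adds weighting and numerical safeguards; this is only the bare idea):

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import johnsonsu

rng = np.random.default_rng(8)
x = np.sort(rng.lognormal(0.0, 0.4, size=300))     # hypothetical data
n = len(x)
targets = np.arange(1, n + 1) / (n + 1)            # E[U_(i)] for uniform order stats

def ls_objective(theta):
    a, b, loc, scale = theta
    if b <= 0 or scale <= 0:
        return np.inf
    return np.sum((johnsonsu.cdf(x, a, b, loc=loc, scale=scale) - targets) ** 2)

start = johnsonsu.fit(x)                           # ML fit as starting point
res = minimize(ls_objective, start, method="Nelder-Mead")
print("LS parameter estimates:", np.round(res.x, 3))
```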

10.

Two-piece location-scale models are used for modelling data presenting departures from symmetry. In this paper, we propose an objective Bayesian methodology for the tail parameter of two particular distributions of the above family: the skewed exponential power distribution and the skewed generalised logistic distribution. We apply the proposed objective approach to time series models and linear regression models where the error terms follow the distributions under study. The performance of the proposed approach is illustrated through simulation experiments and real data analysis. The methodology yields improvements in density forecasts, as shown by the analysis we carry out on the electricity prices in Nordpool markets.


11.
The inverse Gaussian distribution provides a flexible model for analyzing positive, right-skewed data. The generalized variable test for equality of several inverse Gaussian means with unknown and arbitrary variances has satisfactory Type-I error rate when the number of samples (k) is small (Tian, 2006). However, the Type-I error rate tends to be inflated when k goes up. In this article, we propose a parametric bootstrap (PB) approach for this problem. Simulation results show that the proposed test performs very satisfactorily regardless of the number of samples and sample sizes. This method is illustrated by an example.

12.
This article is devoted to the study of tail index estimation based on i.i.d. multivariate observations drawn from a standard heavy-tailed distribution, that is, one whose Pareto-like marginals share the same tail index. A multivariate central limit theorem for a random vector, whose components correspond to (possibly dependent) Hill estimators of the common tail index α, is established under mild conditions. We introduce the concept of a (standard) heavy-tailed random vector of tail index α and show how this limit result can be used to build an estimator of α with small asymptotic mean squared error, through a proper convex linear combination of the coordinates. Beyond the asymptotic results, simulation experiments illustrating the relevance of the proposed approach are also presented.
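A sketch of the combination idea: coordinate-wise Hill estimators pooled with minimum-variance weights w ∝ Σ⁻¹1, where the covariance Σ of the (possibly dependent) estimators is here estimated crudely by a bootstrap rather than from the paper's asymptotic theory:

```python
import numpy as np

def hill(x, k):
    """Hill estimator of the tail index alpha from the k largest order statistics."""
    xs = np.sort(x)[::-1]
    return 1.0 / np.mean(np.log(xs[:k]) - np.log(xs[k]))

rng = np.random.default_rng(9)
n, d, k = 5_000, 3, 200
z = rng.pareto(2.0, size=(n, d)) + 1.0        # common tail index alpha = 2

ests = np.array([hill(z[:, j], k) for j in range(d)])

# Bootstrap over rows (same indices for all coordinates, preserving
# cross-component dependence) to get a crude covariance matrix.
boot = []
for _ in range(200):
    idx = rng.integers(0, n, n)
    boot.append([hill(z[idx, j], k) for j in range(d)])
Sigma = np.cov(np.array(boot).T)

w = np.linalg.solve(Sigma, np.ones(d))        # minimum-variance weights
w /= w.sum()                                  # normalise to a convex combination
print("coordinate estimates:", np.round(ests, 3),
      "combined:", np.round(w @ ests, 3))
```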

13.
In practice, it often happens that we have a number of base classification methods and cannot clearly determine which of them is optimal in the sense of the smallest error rate. A combined method then allows us to consolidate information from multiple sources into a better classifier. We propose a different, sequential approach. Sequentiality is understood here in the sense of appending posterior probabilities to the original data set, and the data so created are used during the classification process. We combine the posterior probabilities obtained from the base classifiers using all the combining methods, and then combine these probabilities using a mean combining method. The resulting posterior probabilities are added to the original data set as additional features. In each step we update the additional probabilities so as to achieve the minimum error rate of the base methods. Experimental results on different data sets demonstrate that the method is efficient and that this approach outperforms the base methods, providing a reduction in the mean classification error rate.
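A sketch of the feature-augmentation step at the heart of the approach: out-of-fold posterior probabilities from two base classifiers are appended to the inputs before a final fit (the sequential updating and mean combining rules described above are not reproduced):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict, train_test_split
from sklearn.naive_bayes import GaussianNB

X, y = make_classification(n_samples=1_000, n_features=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

bases = [GaussianNB(), RandomForestClassifier(random_state=0)]
# Out-of-fold posterior probabilities avoid leaking training labels.
extra_tr = np.hstack([cross_val_predict(c, X_tr, y_tr, method="predict_proba")
                      for c in bases])
extra_te = np.hstack([c.fit(X_tr, y_tr).predict_proba(X_te) for c in bases])

final = LogisticRegression(max_iter=1_000)
final.fit(np.hstack([X_tr, extra_tr]), y_tr)
print("augmented-feature accuracy:",
      final.score(np.hstack([X_te, extra_te]), y_te))
```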

14.
In this paper we consider the problem of estimating the parameters of the generalized Pareto distribution. Both the method of moments and probability-weighted moments do not guarantee that their respective estimates will be consistent with the observed data. We present simple programs to predict the probability of obtaining such nonfeasible estimates. Our estimation techniques are based on results from intensive simulations and the successful modelling of the lower tail of the distribution of the upper bound of the support.  More simulations are performed to validate the new procedure.
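A sketch of the feasibility problem this abstract refers to, using the method-of-moments estimates: when the estimated shape is negative, the fitted GPD has a finite upper endpoint, and the estimates are non-feasible whenever the sample maximum exceeds it (a simulation in the same spirit, not the paper's programs):

```python
import numpy as np

def gpd_mom(x):
    """Method-of-moments estimates (shape xi, scale sigma) for the GPD,
    from mean = sigma/(1-xi) and var = sigma^2/((1-xi)^2 (1-2 xi))."""
    m, v = np.mean(x), np.var(x, ddof=1)
    xi = 0.5 * (1.0 - m * m / v)
    sigma = 0.5 * m * (m * m / v + 1.0)
    return xi, sigma

rng = np.random.default_rng(10)
nonfeasible = 0
for _ in range(1_000):
    x = rng.uniform(0, 1, size=30)        # Uniform(0,1) = GPD with xi = -1
    xi, sigma = gpd_mom(x)
    # Non-feasible: sample max beyond the estimated endpoint -sigma/xi.
    if xi < 0 and x.max() > -sigma / xi:
        nonfeasible += 1
print("estimated probability of non-feasible MOM estimates:",
      nonfeasible / 1_000)
```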

15.
In this paper, we study inference in a heteroscedastic measurement error model with known error variances. Instead of the normal distribution for the random components, we develop a model that assumes a skew-t distribution for the true covariate and a centred Student's t distribution for the error terms. The proposed model is able to accommodate skewness and heavy-tailedness in the data, while the degrees of freedom of the two distributions may differ. Maximum likelihood estimates are computed via an EM-type algorithm. The behaviour of the estimators is also assessed in a simulation study. Finally, the approach is illustrated with a real data set from a methods comparison study in Analytical Chemistry.

16.
We consider estimation of the unknown parameters of the Chen distribution [Chen Z. A new two-parameter lifetime distribution with bathtub shape or increasing failure rate function. Statist Probab Lett. 2000;49:155–161] with bathtub shape using progressively censored samples. We obtain maximum likelihood estimates by making use of an expectation–maximization algorithm. Different Bayes estimates are derived under squared error and balanced squared error loss functions. Since the associated posterior distribution appears in an intractable form, we use an approximation method to compute these estimates. A Metropolis–Hastings algorithm is also proposed, and some further approximate Bayes estimates are obtained. An asymptotic confidence interval is constructed using the observed Fisher information matrix, and bootstrap intervals are proposed as well. Samples generated from the MH algorithm are further used in the construction of HPD intervals. We also obtain prediction intervals and estimates for future observations in one- and two-sample situations. A numerical study is conducted to compare the performance of the proposed methods using simulations. Finally, we analyse real data sets for illustration purposes.

17.
In this article, we deal with a two-parameter exponentiated half-logistic distribution. We consider the estimation of the unknown parameters, the associated reliability function and the hazard rate function under progressive Type II censoring. Maximum likelihood estimates (MLEs) are proposed for the unknown quantities. Bayes estimates are derived with respect to squared error, linex and entropy loss functions. Approximate explicit expressions for all Bayes estimates are obtained using the Lindley method. We also use an importance sampling scheme to compute the Bayes estimates. Markov chain Monte Carlo samples are further used to produce credible intervals for the unknown parameters. Asymptotic confidence intervals are constructed using the normality property of the MLEs. For comparison purposes, bootstrap-p and bootstrap-t confidence intervals are also constructed. A comprehensive numerical study is performed to compare the proposed estimates. Finally, a real-life data set is analysed to illustrate the proposed methods of estimation.

18.
Stochastic Models, 2013, 29(2):173–191
We propose a new approximation formula for the waiting time tail probability of the M/G/1 queue with FIFO discipline and unlimited waiting space. The aim is to address the difficulty of obtaining good estimates when the tail probability has non-exponential asymptotics. We show that the waiting time tail probability can be expressed in terms of the waiting time tail probability of a notional M/G/1 queue with truncated service time distribution plus the tail probability of an extreme order statistic. The Cramér–Lundberg approximation is applied to approximate the tail probability of the notional queue. In essence, our technique extends the applicability of the Cramér–Lundberg approximation to cases where the standard Lundberg condition does not hold. We propose a simple moment-based technique for estimating the parameters of the approximation; numerical results demonstrate that our approximation can yield very good estimates over the whole range of the argument.
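The Cramér–Lundberg step can be sketched for the M/M/1 special case, where the adjustment coefficient has a closed form to check the numerical root against (hypothetical rates; the truncation and extreme-order-statistic terms of the approximation above are not shown):

```python
import numpy as np
from scipy.optimize import brentq

# Stable M/G/1: P(W > x) ~ C * exp(-R x), where the adjustment coefficient
# R > 0 solves the Lundberg condition lam * (M_S(R) - 1) = R, with M_S the
# service-time MGF. With Exp(mu) service this gives R = mu - lam exactly.
lam, mu = 0.7, 1.0                       # arrival rate, service rate (rho = 0.7)
M_S = lambda r: mu / (mu - r)            # MGF of Exp(mu), valid for r < mu

lundberg = lambda r: lam * (M_S(r) - 1.0) - r
R = brentq(lundberg, 1e-9, mu - 1e-9)    # skip the trivial root at r = 0
print("adjustment coefficient:", R, "(exact mu - lam =", mu - lam, ")")

# Exact M/M/1 waiting-time tail for comparison: P(W > x) = rho * exp(-R x).
rho = lam / mu
print("P(W > 5) ~", rho * np.exp(-R * 5.0))
```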

19.
Censoring frequently occurs in survival analysis, yet the observed lifetimes are usually not large in number. Inferences based on the popular maximum likelihood (ML) estimation, which often gives biased estimates in such settings, should therefore be corrected for bias. Here, we investigate the biases of ML estimates under the progressive type-II censoring scheme (pIIcs). We use a method proposed in Efron and Johnstone [Fisher's information in terms of the hazard rate. Technical Report No. 264. Stanford (CA): Stanford University; 1987] to derive general expressions for bias-corrected ML estimates under the pIIcs. This requires derivation of the Fisher information matrix under the pIIcs. As an application, exact expressions are given for bias-corrected ML estimates of the Weibull distribution under the pIIcs. The performance of the bias-corrected ML estimates and the ML estimates is compared by simulations and a real data application.

20.
In this paper, we study the multi-class differential gene expression detection for microarray data. We propose a likelihood-based approach to estimating an empirical null distribution to incorporate gene interactions and provide a more accurate false-positive control than the commonly used permutation or theoretical null distribution-based approach. We propose to rank important genes by p-values or local false discovery rate based on the estimated empirical null distribution. Through simulations and application to lung transplant microarray data, we illustrate the competitive performance of the proposed method.
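A sketch of the empirical-null idea in its simplest Efron-style form (a robust normal fit to the centre of the z-values plus a kernel density for the mixture; the likelihood-based estimator accounting for gene interactions described above is more involved):

```python
import numpy as np
from scipy.stats import gaussian_kde, norm

rng = np.random.default_rng(12)
z = np.concatenate([rng.normal(0, 1.2, 9_500),     # nulls, overdispersed
                    rng.normal(3.0, 1.0, 500)])    # differentially expressed

# Robust normal fit to the centre: the empirical null N(mu0, sigma0).
q25, q50, q75 = np.percentile(z, [25, 50, 75])
mu0, sigma0 = q50, (q75 - q25) / 1.349
f = gaussian_kde(z)                                # mixture density estimate

def local_fdr(zi, pi0=1.0):
    """Local fdr(z) = pi0 * f0(z) / f(z), capped at 1."""
    return np.minimum(pi0 * norm.pdf(zi, mu0, sigma0) / f(zi), 1.0)

print("local fdr at z = 0, 2, 4:",
      np.round(local_fdr(np.array([0.0, 2.0, 4.0])), 3))
```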
