期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Bayesian ROC curve estimation under binormality using a rank likelihood

Jiezhun Gu Subhashis Ghosal 《Journal of statistical planning and inference》2009

There are various methods to estimate the parameters in the binormal model for the ROC curve. In this paper, we propose a conceptually simple and computationally feasible Bayesian estimation method using a rank-based likelihood. Posterior consistency is also established. We compare the new method with other estimation methods and conclude that our estimator generally performs better than its competitors. 相似文献

2.

Hypothesis tests to determine if all true positives have been identified on a receiver operating characteristic curve

Laurens Beran 《Journal of applied statistics》2014,41(6):1332-1341

For classification problems where the test data are labeled sequentially, the point at which all true positives are first identified is often of critical importance. This article develops hypothesis tests to assess whether all true positives have been labeled in the test data. The tests use a partial receiver operating characteristic (ROC) that is generated from a labeled subset of the test data. These methods are developed in the context of unexploded ordnance (UXO) classification, but are applicable to any binary classification problem. First, the likelihood of the observed ROC given binormal model parameters is derived using order statistics, leading to a nonlinear parameter estimation problem. I then derive the approximate distribution of the point on the ROC at which all true instances are found. Using estimated binormal parameters, this distribution can be integrated up to a desired confidence level to define a critical false alarm rate (FAR). If the selected operating point is before this critical point, then additional labels out to the critical point are required. A second test uses the uncertainty in binormal parameters to determine the critical FAR. These tests are demonstrated with UXO classification examples and both approaches are recommended for testing operating points. 相似文献

3.

Improving an estimator of Hsieh and Turnbull for the binormal ROC curve

Ori DavidovYuval Nov 《Journal of statistical planning and inference》2012,142(4):872-877

The estimator of Hsieh and Turnbull (1996) for the binormal receiver operating characteristic (ROC) curve is extended from grouped to ungrouped data. The new estimator is shown to be consistent and asymptotically normally distributed, and simulation results show that it outperforms Hsieh and Turnbull's original estimator. 相似文献

4.

Confidence Bands for ROC Curves With Serially Dependent Data

Kajal Lahiri Liu Yang 《商业与经济统计学杂志》2018,36(1):115-130

We propose serial correlation-robust asymptotic confidence bands for the receiver operating characteristic (ROC) curve and its functional, viz., the area under ROC curve (AUC), estimated by quasi-maximum likelihood in the binormal model. Our simulation experiments confirm that this new method performs fairly well in finite samples, and confers an additional measure of robustness to nonnormality. The conventional procedure is found to be markedly undersized in terms of yielding empirical coverage probabilities lower than the nominal level, especially when the serial correlation is strong. An example from macroeconomic forecasting demonstrates the importance of accounting for serial correlation when the probability forecasts for real GDP declines are evaluated using ROC. Supplementary materials for this article are available online. 相似文献

5.

An application of lomax distributions in receiver operating characteristic(roc)curve analysis

Gregory Campbell Makarand V. Ratnaparkhi 《统计学通讯:理论与方法》2013,42(6):1681-1687

Receiver operating characteristic(ROC)curves are useful for studying the performance of diagnostic tests. ROC curves occur in many fields of applications including psychophysics, quality control and medical diagnostics. In practical situations, often the responses to a diagnostic test are classified into a number of ordered categories. Such data are referred to as ratings data. It is typically assumed that the underlying model is based on a continuous probability distribution. The ROC curve is then constructed from such data using this probability model. Properties of the ROC curve are inherited from the model. Therefore, understanding the role of different probability distributions in ROC modeling is an interesting and important area of research. In this paper the Lomax distribution is considered as a model for ratings data and the corresponding ROC curve is derived. The maximum likelihood estimation procedure for the related parameters is discussed. This procedure is then illustrated in the analysis of a neurological data example. 相似文献

6.

Testing the difference between two Kolmogorov–Smirnov values in the context of receiver operating characteristic curves

Wojtek J. Krzanowski David J. Hand 《Journal of applied statistics》2011,38(3):437-450

The maximum vertical distance between a receiver operating characteristic (ROC) curve and its chance diagonal is a common measure of effectiveness of the classifier that gives rise to this curve. This measure is known to be equivalent to a two-sample Kolmogorov–Smirnov statistic; so the absolute difference D between two such statistics is often used informally as a measure of difference between the corresponding classifiers. A significance test of D is of great practical interest, but the available Kolmogorov–Smirnov distribution theory precludes easy analytical construction of such a significance test. We, therefore, propose a Monte Carlo procedure for conducting the test, using the binormal model for the underlying ROC curves. We provide Splus/R routines for the computation, tabulate the results for a number of illustrative cases, apply the methods to some practical examples and discuss some implications. 相似文献

7.

Adjusting ROC curves for covariates in the presence of verification bias

Ronen Fluss Benjamin ReiserDavid Faraggi 《Journal of statistical planning and inference》2012,142(1):1-11

The ROC (receiver operating characteristic) curve is frequently used for describing effectiveness of a diagnostic marker or test. Classical estimation of the ROC curve uses independent identically distributed samples taken randomly from the healthy and diseased populations. Frequently not all subjects undergo a definitive gold standard assessment of disease status (verification). Estimation of the ROC curve based on data only from subjects with verified disease status may be badly biased (verification bias). In this work we investigate the properties of the doubly robust (DR) method for estimating the ROC curve adjusted for covariates (ROC regression) under verification bias. We develop the estimator's asymptotic distribution and examine its finite sample size properties via a simulation study. We apply this procedure to fingerstick postprandial blood glucose measurement data adjusting for age. 相似文献

8.

Bayes estimation of Moran–Downton bivariate exponential distribution based on censored samples

Yu-Jau Lin Y. L. Lio 《Journal of Statistical Computation and Simulation》2013,83(5):837-852

In this paper, we discuss a Bayesian estimation procedure for the parameters in a Moran–Downton bivariate exponential distribution based on complete and censored samples. A Markov-chain Monte Carlo method is used to obtain the Bayes estimates of the parameters. An intensive simulation experiment is conducted to study the performance of the proposed Bayesian estimation procedure. Discussions and suggestions are provided based on the simulation results. A numerical example is presented to illustrate the Bayesian estimation procedure developed here and some concluding remarks are provided. 相似文献

9.

带线性约束的多元线性回归模型参数估计

李小胜王申令《统计研究》2016,33(11):85-92

本文首先构造线性约束条件下的多元线性回归模型的样本似然函数,利用Lagrange法证明其合理性。其次,从似然函数的角度讨论线性约束条件对模型参数的影响,对由传统理论得出的参数估计作出贝叶斯与经验贝叶斯的改进。做贝叶斯改进时,将矩阵正态-Wishart分布作为模型参数和精度阵的联合共轭先验分布,结合构造的似然函数得出参数的后验分布,计算出参数的贝叶斯估计;做经验贝叶斯改进时,将样本分组,从方差的角度讨论由子样得出的参数估计对总样本的参数估计的影响,计算出经验贝叶斯估计。最后,利用Matlab软件生成的随机矩阵做模拟。结果表明,这两种改进后的参数估计均较由传统理论得出的参数估计更精确,拟合结果的误差比更小,可信度更高,在大数据的情况下,这种计算方法的速度更快。相似文献

10.

Compare diagnostic tests using transformation-invariant smoothed ROC curves()

Tang L Du P Wu C 《Journal of statistical planning and inference》2010,140(11):3540-3551

Receiver operating characteristic (ROC) curve, plotting true positive rates against false positive rates as threshold varies, is an important tool for evaluating biomarkers in diagnostic medicine studies. By definition, ROC curve is monotone increasing from 0 to 1 and is invariant to any monotone transformation of test results. And it is often a curve with certain level of smoothness when test results from the diseased and non-diseased subjects follow continuous distributions. Most existing ROC curve estimation methods do not guarantee all of these properties. One of the exceptions is Du and Tang (2009) which applies certain monotone spline regression procedure to empirical ROC estimates. However, their method does not consider the inherent correlations between empirical ROC estimates. This makes the derivation of the asymptotic properties very difficult. In this paper we propose a penalized weighted least square estimation method, which incorporates the covariance between empirical ROC estimates as a weight matrix. The resulting estimator satisfies all the aforementioned properties, and we show that it is also consistent. Then a resampling approach is used to extend our method for comparisons of two or more diagnostic tests. Our simulations show a significantly improved performance over the existing method, especially for steep ROC curves. We then apply the proposed method to a cancer diagnostic study that compares several newly developed diagnostic biomarkers to a traditional one. 相似文献

11.

Bayesian Estimators for Small Area Models Shrinking Both Means and Variances

下载免费PDF全文

Shonosuke Sugasawa Hiromasa Tamae Tatsuya Kubokawa 《Scandinavian Journal of Statistics》2017,44(1):150-167

For small area estimation of area‐level data, the Fay–Herriot model is extensively used as a model‐based method. In the Fay–Herriot model, it is conventionally assumed that the sampling variances are known, whereas estimators of sampling variances are used in practice. Thus, the settings of knowing sampling variances are unrealistic, and several methods are proposed to overcome this problem. In this paper, we assume the situation where the direct estimators of the sampling variances are available as well as the sample means. Using this information, we propose a Bayesian yet objective method producing shrinkage estimation of both means and variances in the Fay–Herriot model. We consider the hierarchical structure for the sampling variances, and we set uniform prior on model parameters to keep objectivity of the proposed model. For validity of the posterior inference, we show under mild conditions that the posterior distribution is proper and has finite variances. We investigate the numerical performance through simulation and empirical studies. 相似文献

12.

Detection and Localization in Test Accuracy: A Bayesian Perspective

Lyle D. Broemeling 《统计学通讯:理论与方法》2013,42(8):1555-1564

In assessing the area under the ROC curve for the accuracy of a diagnostic test, it is imperative to detect and locate multiple abnormalities per image. This approach takes that into account by adopting a statistical model that allows for correlation between the reader scores of several regions of interest (ROI).

The ROI method of partitioning the image is taken. The readers give a score to each ROI in the image and the statistical model takes into account the correlation between the scores of the ROI's of an image in estimating test accuracy. The test accuracy is given by Pr[Y > Z] + (1/2)Pr[Y = Z], where Y is an ordinal diagnostic measurement of an affected ROI, and Z is the diagnostic measurement of an unaffected ROI. This way of measuring test accuracy is equivalent to the area under the ROC curve. The parameters are the parameters of a multinomial distribution, then based on the multinomial distribution, a Bayesian method of inference is adopted for estimating the test accuracy.

Using a multinomial model for the test results, a Bayesian method based on the predictive distribution of future diagnostic scores is employed to find the test accuracy. By resampling from the posterior distribution of the model parameters, samples from the posterior distribution of test accuracy are also generated. Using these samples, the posterior mean, standard deviation, and credible intervals are calculated in order to estimate the area under the ROC curve. This approach is illustrated by estimating the area under the ROC curve for a study of the diagnostic accuracy of magnetic resonance angiography for diagnosis of arterial atherosclerotic stenosis. A generalization to multiple readers and/or modalities is proposed.

A Bayesian way to estimate test accuracy is easy to perform with standard software packages and has the advantage of employing the efficient inclusion of information from prior related imaging studies. 相似文献

13.

Generalized fiducial inference for generalized exponential distribution

Liang Yan 《Journal of Statistical Computation and Simulation》2018,88(7):1369-1381

This article mainly considers interval estimation of the scale and shape parameters of the generalized exponential (GE) distribution. We adopt the generalized fiducial method to construct a kind of new confidence intervals for the parameters of interest and compare them with the frequentist and Bayesian methods. In addition, we give the comparison of the point estimation based on the frequentist, generalized fiducial and Bayesian methods. Simulation results show that a new procedure based on generalized fiducial inference is more applicable than the non-fiducial methods for the point and interval estimation of the GE distribution. Finally, two lifetime data sets are used to illustrate the application of our new procedure. 相似文献

14.

Bayesian Estimation of the Parameters of Bivariate Exponential Distributions

David D. Hanagal K. A. Ahmadi 《统计学通讯:模拟与计算》2013,42(7):1391-1413

In this article, we develop an empirical Bayesian approach for the Bayesian estimation of parameters in four bivariate exponential (BVE) distributions. We have opted for gamma distribution as a prior for the parameters of the model in which the hyper parameters have been estimated based on the method of moments and maximum likelihood estimates (MLEs). A simulation study was conducted to compute empirical Bayesian estimates of the parameters and their standard errors. We use moment estimators or MLEs to estimate the hyper parameters of the prior distributions. Furthermore, we compare the posterior mode of parameters obtained by different prior distributions and the Bayesian estimates based on gamma priors are very close to the true values as compared to improper priors. We use MCMC method to obtain the posterior mean and compared the same using the improper priors and the classical estimates, MLEs. 相似文献

15.

Nonparametric estimation of the ROC curve for length-biased and right-censored data

Shanshan Song 《统计学通讯:理论与方法》2020,49(19):4648-4668

Abstract

ROC curve is a fundamental evaluation tool in medical researches and survival analysis. The estimation of ROC curve has been studied extensively with complete data and right-censored survival data. However, these methods are not suitable to analyze the length-biased and right-censored data. Since this kind of data includes the auxiliary information that truncation time and residual time share the same distribution, the two new estimators for the ROC curve are proposed by taking into account this auxiliary information to improve estimation efficiency. Numerical simulation studies with different assumed cases and real data analysis are conducted. 相似文献

16.

Bayesian analysis of joint mean and covariance models for longitudinal data

Dengke Xu Zhongzhan Zhang Liucang Wu 《Journal of applied statistics》2014,41(11):2504-2514

Efficient estimation of the regression coefficients in longitudinal data analysis requires a correct specification of the covariance structure. If misspecification occurs, it may lead to inefficient or biased estimators of parameters in the mean. One of the most commonly used methods for handling the covariance matrix is based on simultaneous modeling of the Cholesky decomposition. Therefore, in this paper, we reparameterize covariance structures in longitudinal data analysis through the modified Cholesky decomposition of itself. Based on this modified Cholesky decomposition, the within-subject covariance matrix is decomposed into a unit lower triangular matrix involving moving average coefficients and a diagonal matrix involving innovation variances, which are modeled as linear functions of covariates. Then, we propose a fully Bayesian inference for joint mean and covariance models based on this decomposition. A computational efficient Markov chain Monte Carlo method which combines the Gibbs sampler and Metropolis–Hastings algorithm is implemented to simultaneously obtain the Bayesian estimates of unknown parameters, as well as their standard deviation estimates. Finally, several simulation studies and a real example are presented to illustrate the proposed methodology. 相似文献

17.

Analysing nonlinear time series with central subspace

《Journal of Statistical Computation and Simulation》2012,82(7):1073-1084

Traditionally, time series analysis involves building an appropriate model and using either parametric or nonparametric methods to make inference about the model parameters. Motivated by recent developments for dimension reduction in time series, an empirical application of sufficient dimension reduction (SDR) to nonlinear time series modelling is shown in this article. Here, we use time series central subspace as a tool for SDR and estimate it using mutual information index. Especially, in order to reduce the computational complexity in time series, we propose an efficient estimation method of minimal dimension and lag using a modified Schwarz–Bayesian criterion, when either of the dimensions and the lags is unknown. Through simulations and real data analysis, the approach presented in this article performs well in autoregression and volatility estimation. 相似文献

18.

Pharmacokinetic parameters estimation using adaptive Bayesian P-splines models

Jullion A Lambert P Beck B Vandenhende F 《Pharmaceutical statistics》2009,8(2):98-112

In preclinical and clinical experiments, pharmacokinetic (PK) studies are designed to analyse the evolution of drug concentration in plasma over time i.e. the PK profile. Some PK parameters are estimated in order to summarize the complete drug's kinetic profile: area under the curve (AUC), maximal concentration (C(max)), time at which the maximal concentration occurs (t(max)) and half-life time (t(1/2)).Several methods have been proposed to estimate these PK parameters. A first method relies on interpolating between observed concentrations. The interpolation method is often chosen linear. This method is simple and fast. Another method relies on compartmental modelling. In this case, nonlinear methods are used to estimate parameters of a chosen compartmental model. This method provides generally good results. However, if the data are sparse and noisy, two difficulties can arise with this method. The first one is related to the choice of the suitable compartmental model given the small number of data available in preclinical experiment for instance. Second, nonlinear methods can fail to converge. Much work has been done recently to circumvent these problems (J. Pharmacokinet. Pharmacodyn. 2007; 34:229-249, Stat. Comput., to appear, Biometrical J., to appear, ESAIM P&S 2004; 8:115-131).In this paper, we propose a Bayesian nonparametric model based on P-splines. This method provides good PK parameters estimation, whatever be the number of available observations and the level of noise in the data. Simulations show that the proposed method provides better PK parameters estimations than the interpolation method, both in terms of bias and precision. The Bayesian nonparametric method provides also better AUC and t(1/2) estimations than a correctly specified compartmental model, whereas this last method performs better in t(max) and C(max) estimations.We extend the basic model to a hierarchical one that treats the case where we have concentrations from different subjects. We are then able to get individual PK parameter estimations. Finally, with Bayesian methods, we can get easily some uncertainty measures by obtaining credibility sets for each PK parameter. 相似文献

19.

Bayesian Bandwidth Estimation in Nonparametric Time-Varying Coefficient Models

Tingting Cheng Jiti Gao Xibin Zhang 《商业与经济统计学杂志》2019,37(1):1-12

Bandwidth plays an important role in determining the performance of nonparametric estimators, such as the local constant estimator. In this article, we propose a Bayesian approach to bandwidth estimation for local constant estimators of time-varying coefficients in time series models. We establish a large sample theory for the proposed bandwidth estimator and Bayesian estimators of the unknown parameters involved in the error density. A Monte Carlo simulation study shows that (i) the proposed Bayesian estimators for bandwidth and parameters in the error density have satisfactory finite sample performance; and (ii) our proposed Bayesian approach achieves better performance in estimating the bandwidths than the normal reference rule and cross-validation. Moreover, we apply our proposed Bayesian bandwidth estimation method for the time-varying coefficient models that explain Okun’s law and the relationship between consumption growth and income growth in the U.S. For each model, we also provide calibrated parametric forms of the time-varying coefficients. Supplementary materials for this article are available online. 相似文献

20.

Bayesian composite quantile regression for linear mixed-effects models

Yuzhu Tian Heng Lian Maozai Tian 《统计学通讯:理论与方法》2017,46(15):7717-7731

Longitudinal data are commonly modeled with the normal mixed-effects models. Most modeling methods are based on traditional mean regression, which results in non robust estimation when suffering extreme values or outliers. Median regression is also not a best choice to estimation especially for non normal errors. Compared to conventional modeling methods, composite quantile regression can provide robust estimation results even for non normal errors. In this paper, based on a so-called pseudo composite asymmetric Laplace distribution (PCALD), we develop a Bayesian treatment to composite quantile regression for mixed-effects models. Furthermore, with the location-scale mixture representation of the PCALD, we establish a Bayesian hierarchical model and achieve the posterior inference of all unknown parameters and latent variables using Markov Chain Monte Carlo (MCMC) method. Finally, this newly developed procedure is illustrated by some Monte Carlo simulations and a case analysis of HIV/AIDS clinical data set. 相似文献