期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Assessing naïve Bayes as a method for screening credit applicants

A. C. Antonakis 《Journal of applied statistics》2009,36(5):537-545

The naïve Bayes rule (NBR) is a popular and often highly effective technique for constructing classification rules. This study examines the effectiveness of NBR as a method for constructing classification rules (credit scorecards) in the context of screening credit applicants (credit scoring). For this purpose, the study uses two real-world credit scoring data sets to benchmark NBR against linear discriminant analysis, logistic regression analysis, k-nearest neighbours, classification trees and neural networks. Of the two aforementioned data sets, the first one is taken from a major Greek bank whereas the second one is the Australian Credit Approval data set taken from the UCI Machine Learning Repository (available at http://www.ics.uci.edu/~mlearn/MLRepository.html). The predictive ability of scorecards is measured by the total percentage of correctly classified cases, the Gini coefficient and the bad rate amongst accepts. In each of the data sets, NBR is found to have a lower predictive ability than some of the other five methods under all measures used. Reasons that may negatively affect the predictive ability of NBR relative to that of alternative methods in the context of credit scoring are examined. 相似文献

2.

The Wilcoxon–Mann–Whitney Procedure Fails as a Test of Medians

George W. Divine H. James Norton Anna E. Barón Elizabeth Juarez-Colunga 《The American statistician》2018,72(3):278-286

相似文献

3.

Using the Inverse Maximum Ratio-Λ as a Technique to Quantify Surface Uniformity

Avinash Chandran Derek Brown Jerome Danoff Loretta DiPietro 《统计学通讯:模拟与计算》2016,45(3):1129-1142

Techniques used in variability assessment are subsequently used to draw conclusions regarding the “spread”/uniformity of data curves. Due to the limitations of these techniques, they are not adequate for circumstances where data manifest with multiple peaks. Examples of these manifestations (in three-dimensional space) include under-foot pressure distributions recorded for different types of footwear (Becerro-de-Bengoa-Vallejo et al., 2014 Biau, D.J. (2011). In brief: Standard deviation and standard error. Clinical Orthopaedics and Related Research 469(9):2661–2664.[Crossref], [PubMed], [Web of Science ®] , [Google Scholar]; Cibulka et al., 1994 Cibulka, M.T., Sinacore, D.R., Mueller, M.J. (1994). Shin splints and forefoot contact running: A case report. Journal of Orthopaedic &; Sports Physical Therapy 20(2):98–102.[Crossref], [PubMed], [Web of Science ®] , [Google Scholar]; Davies et al., 2003 Davies, M.B., Betts, R.P., Scott, I.R. (2003). Optical plantar pressure analysis following internal fixation for displaced intra-articular os calcis fractures. Foot &; Ankle International 24(11):851–856.[PubMed], [Web of Science ®] , [Google Scholar]), surface textures and interfaces designed to impact friction, and and and molecular surface structures such as viral epitopes (Torras and Garcia-Valls, 2004 Torras, C., Garcia-Valls, R. (2004). Quantification of membrane morphology by interpretation of scanning electron microscopy images. Journal of Membrane Science 233(1–2):119–127.[Crossref], [Web of Science ®] , [Google Scholar]; Pacejka, 1997; Fustaffson, 1997). This article proposes a technique for generating a single variable – Λ that will quantify the uniformity of such surfaces. We define and validate this technique using several mathematical and graphical models. 相似文献

4.

Robust BCa–JaB method as a diagnostic tool for linear regression models

Ufuk Beyaztas Michael A. Martin 《Journal of applied statistics》2014,41(7):1593-1610

The Jackknife-after-bootstrap (JaB) technique originally developed by Efron [8 B. Efron, Jackknife-after-bootstrap standard errors and influence functions, J. R. Stat. Soc. 54 (1992), pp. 83–127. [Google Scholar]] has been proposed as an approach to improve the detection of influential observations in linear regression models by Martin and Roberts [12 M.A. Martin and S. Roberts, Jackknife-after-bootstrap regression influence diagnostics, J. Nonparametr. Stat. 22 (2010), pp. 257–269. doi: 10.1080/10485250903287906[Taylor &; Francis Online], [Web of Science ®] , [Google Scholar]] and Beyaztas and Alin [2 U. Beyaztas and A. Alin, Jackknife-after-bootstrap method for detection of influential observations in linear regression model, Comm. Statist. Simulation Comput. 42 (2013), pp. 1256–1267. doi: 10.1080/03610918.2012.661908[Taylor &; Francis Online], [Web of Science ®] , [Google Scholar]]. The method is based on the use of percentile-method confidence intervals to provide improved cut-off values for several single case-deletion influence measures. In order to improve JaB, we propose using robust versions of Efron [7 B. Efron, Better bootstrap confidence intervals, J. Amer. Statist. Assoc. 82 (1987), pp. 171–185. doi: 10.1080/01621459.1987.10478410[Taylor &; Francis Online], [Web of Science ®] , [Google Scholar]]’s bias-corrected and accelerated (BCa) bootstrap confidence intervals. In this study, the performances of robust BCa–JaB and conventional JaB methods are compared in the cases of DFFITS, Welsch's distance and modified Cook's distance influence diagnostics. Comparisons are based on both real data examples and through a simulation study. Our results reveal that under a variety of scenarios, our proposed method provides more accurate and reliable results, and it is more robust to masking effects. 相似文献

5.

Heine process as a q-analog of the Poisson process—waiting and interarrival times

Andreas Kyriakoussis 《统计学通讯:理论与方法》2017,46(8):4088-4102

In this study, we introduce the Heine process, {X_q(t), t > 0}, 0 < q < 1, where the random variable X_q(t), for every t > 0, represents the number of events (occurrences or arrivals) during a time interval (0, t]. The Heine process is introduced as a q-analog of the basic Poisson process. Also, in this study, we prove that the distribution of the waiting time W_{ν, q}, ν ? 1, up to the νth arrival, is a q-Erlang distribution and the interarrival times T_{k, q} = W_{k, q} ? W_{k ? 1, q},?k = 1, 2, …, ν with W_{0, q} = 0 are independent and equidistributed with a q-Exponential distribution. 相似文献

6.

The Yule–Walker equations as a weighted least-squares problem and the association with tapering

Joan Parrish Bee Leng Lee 《统计学通讯:理论与方法》2013,42(17):5112-5122

ABSTRACT

A common method for estimating the time-domain parameters of an autoregressive process is to use the Yule–Walker equations. Tapering has been shown intuitively and proven theoretically to reduce the bias of the periodogram in the frequency domain, but the intuition for the similar bias reduction in the time-domain estimates has been lacking. We provide insightful reasoning for why tapering reduces the bias in the Yule–Walker estimates by showing them to be equivalent to a weighted least-squares problem. This leads to the derivation of an optimal taper which behaves similarly to commonly used tapers. 相似文献

7.

Soccer matches as experiments: how often does the ‘best’ team win?

G. K. Skinner G. H. Freeman 《Journal of applied statistics》2009,36(10):1087-1095

Models in which the number of goals scored by a team in a soccer match follow a Poisson distribution, or a closely related one, have been widely discussed. We here consider a soccer match as an experiment to assess which of two teams is superior and examine the probability that the outcome of the experiment (match) truly represents the relative abilities of the two teams. Given a final score, it is possible by using a Bayesian approach to quantify the probability that it was or was not the case that ‘the best team won’. For typical scores, the probability of a misleading result is significant. Modifying the rules of the game to increase the typical number of goals scored would improve the situation, but a level of confidence that would normally be regarded as satisfactory could not be obtained unless the character of the game was radically changed. 相似文献

8.

Unrestricted consumption under a deterministic wealth and an Ornstein–Uhlenbeck process as a discount rate

Julia Eisenberg 《随机性模型》2018,34(2):139-153

We consider an individual or household endowed with an initial capital and an income, modeled as a linear function of time. Assuming that the discount rate evolves as an Ornstein–Uhlenbeck process, we target to find an unrestricted consumption strategy such that the value of the expected discounted consumption is maximized. Differently than in the case with restricted consumption rates, we can determine the optimal strategy and the value function. 相似文献

9.

“Super-covariates”: Using predicted control group outcome as a covariate in randomized clinical trials

Björn Holzhauer Emmanuel Taiwo Adewuyi 《Pharmaceutical statistics》2023,22(6):1062-1075

The power of randomized controlled clinical trials to demonstrate the efficacy of a drug compared with a control group depends not just on how efficacious the drug is, but also on the variation in patients' outcomes. Adjusting for prognostic covariates during trial analysis can reduce this variation. For this reason, the primary statistical analysis of a clinical trial is often based on regression models that besides terms for treatment and some further terms (e.g., stratification factors used in the randomization scheme of the trial) also includes a baseline (pre-treatment) assessment of the primary outcome. We suggest to include a “super-covariate”—that is, a patient-specific prediction of the control group outcome—as a further covariate (but not as an offset). We train a prognostic model or ensembles of such models on the individual patient (or aggregate) data of other studies in similar patients, but not the new trial under analysis. This has the potential to use historical data to increase the power of clinical trials and avoids the concern of type I error inflation with Bayesian approaches, but in contrast to them has a greater benefit for larger sample sizes. It is important for prognostic models behind “super-covariates” to generalize well across different patient populations in order to similarly reduce unexplained variability whether the trial(s) to develop the model are identical to the new trial or not. In an example in neovascular age-related macular degeneration we saw efficiency gains from the use of a “super-covariate”. 相似文献

10.

An alternative measure of ordinal association as a value-validity correction of the Goodman–Kruskal gamma

Tarald O. Kvålseth 《统计学通讯:理论与方法》2017,46(21):10582-10593

相似文献

11.

Nonparametric estimation of 100(1 − p)% expected shortfall: p → 0 as sample size is increased

Santanu Dutta Suparna Biswas 《统计学通讯:模拟与计算》2018,47(2):338-352

Expected shortfall (ES) is a well-known measure of extreme loss associated with a risky asset or portfolio. For any 0 < p < 1, the 100(1 ? p) percent ES is defined as the mean of the conditional loss distribution, given the event that the loss exceeds (1 ? p)th quantile of the marginal loss distribution. Estimation of ES based on asset return data is an important problem in finance. Several nonparametric estimators of the expected shortfall are available in the literature. Using Monte Carlo simulations, we compare the accuracy of these estimators under the condition that p → 0 as n → ∞ for several asset return time series models, where n is the sample size. Not much seems to be known regarding the properties of the ES estimators under this condition. For p close to zero, the ES measures an extreme loss in the right tail of the loss distribution of the asset or portfolio. Our simulations and real-data analysis provide insight into the effect of varying p with n on the performance of nonparametric ES estimators. 相似文献

12.

The SLA DBIO 100 Poll: 100 Journals Voted by SLA’s BioMedical and Life Sciences Division as the Most Influential over the Last 100 Years

Tony Stankus Sarah E. Spiegel 《Serials Review》2013,39(4):202-212

Abstract

An expert opinion poll of the members of the BioMedical and Life Sciences Division (DBIO) of the Special Libraries Association (SLA), conducted on the occasion of the SLA's Centennial (Washington, DC, June 14–17, 2009), identified 100 journals across three major categories (Clinical Medicine and Allied Health Sciences, Journals Primarily Reporting Molecular and Cellular Biology, and Journals of Natural History) as the most influential over the last 100 years. The Top Ten, containing winners from all three of these groups, were also named, as were a “Journal” and a “Publisher” of the Centennial.” 相似文献