期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Non-parametric maximum likelihood estimation of interval-censored failure time data subject to misclassification

Andrew C. Titman 《Statistics and Computing》2017,27(6):1585-1593

The paper considers non-parametric maximum likelihood estimation of the failure time distribution for interval-censored data subject to misclassification. Such data can arise from two types of observation scheme; either where observations continue until the first positive test result or where tests continue regardless of the test results. In the former case, the misclassification probabilities must be known, whereas in the latter case, joint estimation of the event-time distribution and misclassification probabilities is possible. The regions for which the maximum likelihood estimate can only have support are derived. Algorithms for computing the maximum likelihood estimate are investigated and it is shown that algorithms appropriate for computing non-parametric mixing distributions perform better than an iterative convex minorant algorithm in terms of time to absolute convergence. A profile likelihood approach is proposed for joint estimation. The methods are illustrated on a data set relating to the onset of cardiac allograft vasculopathy in post-heart-transplantation patients. 相似文献

2.

Divergence-based confidence intervals in false-positive misclassification model

《Journal of Statistical Computation and Simulation》2012,82(6):527-540

In this article, we introduce minimum divergence estimators of parameters of a binary response model when data are subject to false-positive misclassification and obtained using a double-sampling plan. Under this set up, the problem of goodness-of-fit is considered and divergence-based confidence intervals (CIs) for a population proportion parameter are derived. A simulation experiment is carried out to compare the coverage probabilities of the new CIs. An application to real data is also given. 相似文献

3.

Estimation of misclassification probabilities by bootstrap methods

Samprit Chatterjee Sangit Chatterjee 《统计学通讯:模拟与计算》2013,42(6):645-656

Several methods have been proposed to estimate the misclassification probabilities when a linear discriminant function is used to classify an observation into one of several populations. We describe the application of bootstrap sampling to the above problem. The proposed method has the advantage of not only furnishing the estimates of misclassification probabilities but also provides an estimate of the standard error of estimate. The method is illustrated by a small simulation experiment. It is then applied to three published, well accessible data sets, which are typical of large, medium and small data sets encountered in practice. 相似文献

4.

Discriminant Analysis for the von Mises-Fisher Distribution

Adelaide Figueiredo 《统计学通讯:模拟与计算》2013,42(9):1991-2003

The von Mises-Fisher distribution is widely used for modeling directional data. In this article, we derive the discriminant rules based on this distribution to assign objects into pre-existing classes. We determine a distance between two von Mises-Fisher populations and we calculate estimates of the misclassification probabilities. We also analyze the behavior of the distance between two von Mises-Fisher populations and of the estimates of the misclassification probabilities when we modify the parameters of the populations or the samples size or the dimension of the sphere. Finally, we present an example with real spherical data available in the literature. 相似文献

5.

Simple computation of a bivariate normal integral arising from a problem of misclassification with applications to the diagnosis of hypertension

M.D. Mohamed el Lozy 《统计学通讯:理论与方法》2013,42(19):2195-2205

The joint distribution of the true and observed values of a variable that is subject to measurement error is bivariate normal.An important special case occurs when we want the joint probability of the true value being below a cutoff point and the observed value above it.In that case the required integral can be simply evaluated using a Gaussian quadrature formula, which can easily be evaluated using a calculator.This formula is used to estimate the probabilities of misclassification of participants in screening programs for hypertension.It shows that basing a diagnosis on a single visit, at which a single measurement was made leads to a very high risk of misclassification.The probability of a subject having a blood pressure below the cutoff point, given that the observed pressure is above it, would be 0.45.Increasing the number of visits to three, and measuring the blood pressure twice at each visit, as advocated by Rosner and Polk (1979), would bring the probability down to 0.29. 相似文献

6.

Qdf misclassification probabilities for known population parameters

C. K. Bayne W. Y. Tan 《统计学通讯:理论与方法》2013,42(22):2315-2326

Approximated QDF misclassification probabilities have been derived for bivariate normal populations with known parameter values. Tne effect of unequal covariances and population distance on the misclassification probabilities are examined 相似文献

7.

Bayesian Mixture Labeling and Clustering

Weixin Yao 《统计学通讯:理论与方法》2013,42(3):403-421

Label switching is one of the fundamental issues for Bayesian mixture modeling. It occurs due to the nonidentifiability of the components under symmetric priors. Without solving the label switching, the ergodic averages of component specific quantities will be identical and thus useless for inference relating to individual components, such as the posterior means, predictive component densities, and marginal classification probabilities. The author establishes the equivalence between the labeling and clustering and proposes two simple clustering criteria to solve the label switching. The first method can be considered as an extension of K-means clustering. The second method is to find the labels by minimizing the volume of labeled samples and this method is invariant to the scale transformation of the parameters. Using a simulation example and the application of two real data sets, the author demonstrates the success of these new methods in dealing with the label switching problem. 相似文献

8.

ACCOUNTING FOR NON-COMPLIANCE IN THE ANALYSIS OF RANDOMIZED RESPONSE DATA

Ardo van den Hout Irene Klugkist 《Australian & New Zealand Journal of Statistics》2009,51(3):353-372

The randomized response model is a misclassification design that is used to protect the privacy of respondents with respect to sensitive questions. Conditional misclassification probabilities are specified by the researcher and are therefore considered to be known. It is to be expected that some of the respondents do not comply with respect to the misclassification design. These respondents induce extra perturbation, which is not accounted for in the standard randomized response model. An extension of the randomized response model is presented that takes into account assumptions with respect to non-compliance under simple random sampling. The extended model is investigated using Bayesian inference. The research is motivated by randomized response data concerning violations of regulations for social benefit. 相似文献

9.

Misclassification of current status data

Karen McKeown Nicholas P. Jewell 《Lifetime data analysis》2010,16(2):215-230

We describe a simple method for nonparametric estimation of a distribution function based on current status data where observations of current status information are subject to misclassification. Nonparametric maximum likelihood techniques lead to use of a straightforward set of adjustments to the familiar pool-adjacent-violators estimator used when misclassification is assumed absent. The methods consider alternative misclassification models and are extended to regression models for the underlying survival time. The ideas are motivated by and applied to an example on human papilloma virus (HPV) infection status of a sample of women examined in San Francisco. 相似文献

10.

Linear discrimination for three known normal populations

Mark J. Schervish 《Journal of statistical planning and inference》1984,10(2):167-175

A random vector is assumed to have one of three known multivariate normal distributions with equal covariance matrices. It is desired to separate the three distributions by means of a single linear discriminant function. Such a function can lead to a classification rule. The function whose classification rule minimizes the average of the three probabilities of misclassification is found. Also the function is found whose rule minimizes the maximum of the three probabilities of misclassification. 相似文献

11.

A Bayesian mixture of experts approach to covariate misclassification

Michelle Xia P. Richard Hahn Paul Gustafson 《Revue canadienne de statistique》2020,48(4):731-750

This article considers misclassification of categorical covariates in the context of regression analysis; if unaccounted for, such errors usually result in mis-estimation of model parameters. With the presence of additional covariates, we exploit the fact that explicitly modelling non-differential misclassification with respect to the response leads to a mixture regression representation. Under the framework of mixture of experts, we enable the reclassification probabilities to vary with other covariates, a situation commonly caused by misclassification that is differential on certain covariates and/or by dependence between the misclassified and additional covariates. Using Bayesian inference, the mixture approach combines learning from data with external information on the magnitude of errors when it is available. In addition to proving the theoretical identifiability of the mixture of experts approach, we study the amount of efficiency loss resulting from covariate misclassification and the usefulness of external information in mitigating such loss. The method is applied to adjust for misclassification on self-reported cocaine use in the Longitudinal Studies of HIV-Associated Lung Infections and Complications. 相似文献

12.

A three-population constrained discrimination procedure

David Patterson 《统计学通讯:理论与方法》2013,42(16):4771-4787

ABSTRACT

Classification rules with a reserve judgment option provide a way to satisfy constraints on the misclassification probabilities when there is a high degree of overlap among the populations. Constructing rules which maximize the probability of correct classification while satisfying such constraints is a difficult optimization problem. This paper uses the form of the optimal solution to develop a relatively simple and computationally fast method for three populations which has a non parametric quality in controlling the misclassification probabilities. Simulations demonstrate that this procedure performs well. 相似文献

13.

Bayesian sample size determination for estimating binomial parameters from data subject to misclassification 总被引：1，自引：0，他引：1

E. Rahme L. Joseph & T. W. Gyorkos 《Journal of the Royal Statistical Society. Series C, Applied statistics》2000,49(1):119-128

We investigate the sample size problem when a binomial parameter is to be estimated, but some degree of misclassification is possible. The problem is especially challenging when the degree to which misclassification occurs is not exactly known. Motivated by a Canadian survey of the prevalence of toxoplasmosis infection in pregnant women, we examine the situation where it is desired that a marginal posterior credible interval for the prevalence of width w has coverage 1−α, using a Bayesian sample size criterion. The degree to which the misclassification probabilities are known a priori can have a very large effect on sample size requirements, and in some cases achieving a coverage of 1−α is impossible, even with an infinite sample size. Therefore, investigators must carefully evaluate the degree to which misclassification can occur when estimating sample size requirements. 相似文献

14.

Combinatoric classification of multivariate normal variates

C.L. Dunn W.B. Smith 《统计学通讯:理论与方法》2013,42(13):1317-1340

Consider classifying an n × I observation vector as coming from one of two multivariate normal distributions which differ both in mean vectors and covariance matrices. A class of dis-crimination rules based upon n independent univariate discrim-inate functions is developed yielding exact misclassification probabilities when the population parameters are known. An efficient search of this class to select the procedure with minimum expected misclassification is made by employing an algorithm of the implicit enumeration type used in integer programming. The procedure is applied to the classification of male twins as either monozygotic or dizygotic. 相似文献

15.

Bayesian analysis of complementary Poisson rate parameters with data subject to misclassification

《Journal of statistical planning and inference》2005,134(1):36-48

We formulate closed-form Bayesian estimators for two complementary Poisson rate parameters using double sampling with data subject to misclassification and error free data. We also derive closed-form Bayesian estimators for two misclassification parameters in the modified Poisson model we assume. We use our results to determine credible sets for the rate and misclassification parameters. Additionally, we use MCMC methods to determine Bayesian estimators for three or more rate parameters and the misclassification parameters. We also perform a limited Monte Carlo simulation to examine the characteristics of these estimators. We demonstrate the efficacy of the new Bayesian estimators and highest posterior density regions with examples using two real data sets. 相似文献

16.

An asymptotic approximation for EPMC in linear discriminant analysis based on monotone missing data

Nobumichi Shutoh 《Journal of statistical planning and inference》2012,142(1):110-125

In this paper, we propose an asymptotic approximation for the expected probabilities of misclassification (EPMC) in the linear discriminant function on the basis of k-step monotone missing training data for general k. We derive certain relations of the statistics in order to obtain the approximation. Finally, we perform Monte Carlo simulation to evaluate the accuracy of our result and to compare it with existing approximations. 相似文献

17.

A Classification Method for Directional Data with Application to the Human Skull

Ashis Sengupta 《统计学通讯:理论与方法》2013,42(3):457-466

There are many well-known methods applied in classification problem for linear data with both known and unknown distribution. Here, we deal with classification involving data on torus and cylinder. A new method involving a generalized likelihood ratio test is developed for classifying in two populations using directional data. The approach assumes that one of the probabilities of misclassification is known. The procedure is constructed by applying Gibbs sampler on the conditionally specified distribution. A parametric bootstrap approach is also presented. An application to data involving linear and circular measurements on human skull from two tribal populations is given. 相似文献

18.

Bayesian adjustment for unidirectional misclassification in ordinal covariates

Liangrui Sun Michelle Xia Yuanyuan Tang Philip G. Jones 《Journal of Statistical Computation and Simulation》2017,87(18):3440-3468

In this paper, we study the identification of Bayesian regression models, when an ordinal covariate is subject to unidirectional misclassification. Xia and Gustafson [Bayesian regression models adjusting for unidirectional covariate misclassification. Can J Stat. 2016;44(2):198–218] obtained model identifiability for non-binary regression models, when there is a binary covariate subject to unidirectional misclassification. In the current paper, we establish the moment identifiability of regression models for misclassified ordinal covariates with more than two categories, based on forms of observable moments. Computational studies are conducted that confirm the theoretical results. We apply the method to two datasets, one from the Medical Expenditure Panel Survey (MEPS), and the other from Translational Research Investigating Underlying Disparities in Acute Myocardial infarction Patients Health Status (TRIUMPH). 相似文献

19.

On a distributionfree method in discriminant analysis

K. Fischer Chr Thiele 《Statistics》2013,47(2):281-289

Linear discriminant rules for two symmetrical distributions, which only need the first and second moments of these distributions, are presented. The rules are based on Zhezhel's idea using the most unfavourable probabilities of misclassification as an optimality criterion. Also a rule is considered which deals with distributions differing in a location and scale parameter. 相似文献

20.

On triple sampling scremes for estimating from binomial data with misclassification errors ∗

Yosef Hochbeg Aaron Tenenbein 《统计学通讯:理论与方法》2013,42(13):1523-1533

Previous work has been carried out on the use of double sampling schemes for inference from binomial data which are subject to misclassification. The double sampling scheme utilizes a sample of n units which are classified by both a fallible and a true device and another sample of n₂ units which are classified only by a fallible device. A triple sampljng scheme incorporates an additional sample of n_l units which are classified only by the true device. In this paper we apply this triple sampling to estimation from binomialdata. First estimation of a binomial proportion is discussed under different misclassification structures. Then, the problem of optimal allocation of sample sizes is discussed. 相似文献