期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Likelihood-based Estimation of the Regression Model with Scrambled Responses

Rodney Strachan Maxwell King & Sarjinder Singh 《Australian & New Zealand Journal of Statistics》1998,40(3):279-290

A significant problem in the collection of responses to potentially sensitive questions, such as relating to illegal, immoral or embarrassing activities, is non-sampling error due to refusal to respond or false responses. Eichhorn & Hayre (1983) suggested the use of scrambled responses to reduce this form of bias. This paper considers a linear regression model in which the dependent variable is unobserved but for which the sum or product with a scrambling random variable of known distribution, is known. The performance of two likelihood-based estimators is investigated, namely of a Bayesian estimator achieved through a Markov chain Monte Carlo (MCMC) sampling scheme, and a classical maximum-likelihood estimator. These two estimators and an estimator suggested by Singh, Joarder & King (1996) are compared. Monte Carlo results show that the Bayesian estimator out-performs the classical estimators in all cases, and the relative performance of the Bayesian estimator improves as the responses become more scrambled. 相似文献

2.

Scrambled randomized response methods for obtaining sensitive quantitative data

Benjamin H. Eichhorn Lakhbir S. Hayre 《Journal of statistical planning and inference》1983,7(4):307-316

We study a multiplicative randomized response method for obtaining responses to sensitive questions when the answers are quantitative. The method involves the respondent multiplying his sensitive answer by a random number from a known distribution, and giving the product to the interviewer, who does not know the value of the random number and thus receives a scrambled response. Some particular distributions for the random scrambling number are proposed and studied, and ways of generating the scrambling numbers are discussed. Some modifications for increasing the efficiency of the method are proposed, and numerical results are given that show the scrambled response method is generally superior to the previously used method of randomizing questions. 相似文献

3.

Mean and sensitivity estimation in optional randomized response models

Sat Gupta Javid Shabbir Supriti Sehra 《Journal of statistical planning and inference》2010

Optional randomized response models were introduced by Gupta et al. (2002). These models are based on the basic premise that a question may be sensitive for one respondent but may not be sensitive for another. In an optional RRT (randomized response technique) model, a respondent is asked to provide a scrambled response only if the respondent considers the question sensitive. Otherwise, the respondent provides a truthful response. The researcher does not know which type of response is provided. The proportion of respondents who provide a scrambled response is known as the sensitivity level of the question. In this paper, we estimate simultaneously the mean and the sensitivity level of a quantitative-response sensitive question using a two stage optional RRT model. The estimators are unbiased and asymptotically normally distributed. We discuss the advantages and disadvantages of using additive and multiplicative scrambling. 相似文献

4.

REGRESSION ANALYSIS USING SCRAMBLED RESPONSES

Sarjinder Singh A.H. Joarder Maxwell L. King 《Australian & New Zealand Journal of Statistics》1996,38(2):201-211

This paper investigates the general linear regression model Y = Xβ+e assuming the dependent variable is observed as a scrambled response using Eichhorn & Hayre's (1983) approach to collecting sensitive personal information. The estimates of the parameters in the model remain unbiased, but the variances of the estimates increase due to scrambling. The Wald test of the null hypothesis H₀: β=β₀, against the alternative hypothesis H_a: β#β₀, is also investigated. Parameter estimates obtained from scrambled responses are compared to those from conventional or direct-question surveys, using simulation. The coverage by nominal 95% confidence intervals is also reported. 相似文献

5.

An improved randomized response model: estimation of mean

Christopher R. Gjestvang Sarjinder Singh 《Journal of applied statistics》2009,36(12):1361-1367

In this paper, we suggest a new randomized response model useful for collecting information on quantitative sensitive variables such as drug use and income. The resultant estimator has been found to be better than the usual additive randomized response model. An interesting feature of the proposed model is that it is free from the known parameters of the scrambling variable unlike the additive model due to Himmelfarb and Edgell [S. Himmelfarb and S.E. Edgell, Additive constant model: a randomized response technique for eliminating evasiveness to quantitative response questions, Psychol. Bull. 87(1980), 525–530]. Relative efficiency of the proposed model has also been studied with the corresponding competitors. At the end, an application of the proposed model has been discussed. 相似文献

6.

Goodness-of-Fit Testing for the Newcomb-Benford Law With Application to the Detection of Customs Fraud

Lucio Barabesi Andrea Cerasa Andrea Cerioli Domenico Perrotta 《商业与经济统计学杂志》2018,36(2):346-358

The Newcomb-Benford law for digit sequences has recently attracted interest in antifraud analysis. However, most of its applications rely either on diagnostic checks of the data, or on informal decision rules. We suggest a new way of testing the Newcomb-Benford law that turns out to be particularly attractive for the detection of frauds in customs data collected from international trade. Our approach has two major advantages. The first one is that we control the rate of false rejections at each stage of the procedure, as required in antifraud applications. The second improvement is that our testing procedure leads to exact significance levels and does not rely on large-sample approximations. Another contribution of our work is the derivation of a simple expression for the digit distribution when the Newcomb-Benford law is violated, and a bound for a chi-squared type of distance between the actual digit distribution and the Newcomb-Benford one. 相似文献

7.

Efficient estimation of population mean of sensitive variable in presence of scrambled responses

Housila P. Singh 《统计学通讯:理论与方法》2017,46(19):9557-9565

This paper addresses the problem of estimating the population mean of a sensitive variable in presence of scrambled response. We have suggested a randomized response model which uses known values of mean and variance of scrambling variable. We have shown that the proposed randomized response model is always better than that of Gjestvang and Singh's (2009) and Singh and Tarray's (2014) randomized response models. A numerical study for efficiency comparison is also presented. 相似文献

8.

Some criteria for projection pursuit

G. Eslava F. H. C. Marriott 《Statistics and Computing》1994,4(1):13-20

The idea of searching for orthogonal projections, from a multidimensional space into a linear subspace, as an aid to detecting non-linear structure has been named exploratory projection pursuit.Most approaches are tied to the idea of searching for interesting projections. Typically, an interesting projection is one where the distribution of the projected data differs from the normal distribution. In this paper we define two projection indices which are aimed specifically at finding projections that best show grouped structure in the plane, if this exists in the multi-dimensional space. These involve a numerical optimization problem which is tackled in two stages, the projection and the pursuit; the first is based on a procedure to generate pseudo-random rotation matrices in the sense of the grand tour by D. Asimov (1985), and the second is a local numerical optimization procedure. One artificial and one real example illustrate the performance of the suggested indices. 相似文献

9.

Mean and sensitivity estimation of a sensitive variable through additive scrambling

Zawar Hussain Bander Al-Zahrani 《统计学通讯:理论与方法》2013,42(1):182-193

Abstract

This article focuses on reducing the additional variance due to randomization of the responses. The idea of additive scrambling and its inverse has been used along with (i) split sample approach and (ii) double response approach. Specifically, our proposal is based on Gupta et al. (2006) randomized response model. We selected this model for improvement because it provides estimator of mean and sensitivity level of a sensitive variable and is better than all of its competitors proposed earlier to it and even Gupta et al. (2006) sensitivity estimator is better than that of Gupta et al. (2010). Our suggested estimators are unbiased estimators and perform better than Gupta et al. (2006) estimator. The issue of privacy protection is also discussed. 相似文献

10.

A Potts‐mixture spatiotemporal joint model for combined magnetoencephalography and electroencephalography data

Yin Song Farouk Nathoo Arif Babul 《Revue canadienne de statistique》2019,47(4):688-711

We develop a new methodology for determining the location and dynamics of brain activity from combined magnetoencephalography (MEG) and electroencephalography (EEG) data. The resulting inverse problem is ill‐posed and is one of the most difficult problems in neuroimaging data analysis. In our development we propose a solution that combines the data from three different modalities, magnetic resonance imaging (MRI), MEG and EEG, together. We propose a new Bayesian spatial finite mixture model that builds on the mesostate‐space model developed by Daunizeau & Friston [Daunizeau and Friston, NeuroImage 2007; 38, 67–81]. Our new model incorporates two major extensions: (i) We combine EEG and MEG data together and formulate a joint model for dealing with the two modalities simultaneously; (ii) we incorporate the Potts model to represent the spatial dependence in an allocation process that partitions the cortical surface into a small number of latent states termed mesostates. The cortical surface is obtained from MRI. We formulate the new spatiotemporal model and derive an efficient procedure for simultaneous point estimation and model selection based on the iterated conditional modes algorithm combined with local polynomial smoothing. The proposed method results in a novel estimator for the number of mixture components and is able to select active brain regions, which correspond to active variables in a high‐dimensional dynamic linear model. The methodology is investigated using synthetic data and simulation studies and then demonstrated on an application examining the neural response to the perception of scrambled faces. R software implementing the methodology along with several sample datasets are available at the following GitHub repository https://github.com/v2south/PottsMix . The Canadian Journal of Statistics 47: 688–711; 2019 © 2019 Statistical Society of Canada 相似文献

11.

Hansen and Hurwitz estimator with scrambled response on the second call

Giancarlo Diana Javid Shabbir 《Journal of applied statistics》2014,41(3):596-611

In this paper we propose a modified version of the estimator of Hansen and Hurwitz [12] in the case of quantitative sensitive variable and consider a randomization mechanism on the second call that provides privacy protection to the respondents to get truthful information. We use variance of the modified estimator as a tool to measure privacy protection and it is observed that the higher is the variance, the lower is the efficiency but the higher is the privacy protection. To overcome this efficiency loss, we consider a linear regression estimator using known non-sensitive auxiliary information. With consideration of four scrambled models, we try to make a trade-off between efficiency and privacy protection. To show this compromise, analytical and numerical comparisons are obtained. 相似文献

12.

Spatial Matching of Multiple Configurations of Points with a Bioinformatics Application

J. M. Marín C. Nieto 《统计学通讯:理论与方法》2013,42(12):1977-1995

In this article, we present a general model to deal with the problem of matching multiple objects or configurations of points from a Bayesian point of view. We study both labeled and non labeled cases. Our model generalizes, in terms of non singular affine transformations and multiple configurations, previous two-terms matching models. As a practical application in Bioinformatics, we consider data from a microarray experiment of gorilla, bonobo, and human-cultured fibroblasts. We find out the matchings and the best affine transformation between the projections of genes on a two-dimensional space, obtained by a multidimensional scaling technique. 相似文献

13.

Projection techniques for nonlinear principal component analysis 总被引：4，自引：0，他引：4

Richard J. Bolton David J. Hand Andrew R. Webb 《Statistics and Computing》2003,13(3):267-276

Principal Components Analysis (PCA) is traditionally a linear technique for projecting multidimensional data onto lower dimensional subspaces with minimal loss of variance. However, there are several applications where the data lie in a lower dimensional subspace that is not linear; in these cases linear PCA is not the optimal method to recover this subspace and thus account for the largest proportion of variance in the data.Nonlinear PCA addresses the nonlinearity problem by relaxing the linear restrictions on standard PCA. We investigate both linear and nonlinear approaches to PCA both exclusively and in combination. In particular we introduce a combination of projection pursuit and nonlinear regression for nonlinear PCA. We compare the success of PCA techniques in variance recovery by applying linear, nonlinear and hybrid methods to some simulated and real data sets.We show that the best linear projection that captures the structure in the data (in the sense that the original data can be reconstructed from the projection) is not necessarily a (linear) principal component. We also show that the ability of certain nonlinear projections to capture data structure is affected by the choice of constraint in the eigendecomposition of a nonlinear transform of the data. Similar success in recovering data structure was observed for both linear and nonlinear projections. 相似文献

14.

TWO-STAGE SUPPORT ESTIMATION

Samuel Müller 《Australian & New Zealand Journal of Statistics》2005,47(4):463-472

This paper presents a two‐stage procedure for estimating the conditional support curve of a random variable X, given the information of a random vector X. Quantile estimation is followed by an extremal analysis on the residuals for problems which can be written as regression models. The technique is applied to data from the National Bureau of Economic Research and US Census Bureau's Center for Economic Studies which contain all four‐digit manufacturing industries. Simulation results show that in linear regression models the proposed estimation procedure is more efficient than the extreme linear regression quantile. 相似文献

15.

Diagnostics for repeated measurements in generalized linear mixed effects models

Minkyung Oh 《Journal of applied statistics》2019,46(14):2666-2676

ABSTRACT

As there is an extensive body of research on diagnostics in regression models, various outlier detection methods have been developed. These methods have been extended to mixed effects models and generalized linear models, but there exist intrinsic drawbacks and limitations. This paper presents two-dimensional plots to identify discordant subjects and observations in generalized linear mixed effects models, displaying discordance in two directions. The sTudentized Residual Sum of Squares is not an extension of any regression tools but a new approach designed to efficiently reflect the characteristics of repeated measures. And this noteworthy clustering of outliers is identified in the plot. Applications to real-life examples are presented to illustrate the favorable/beneficial performance of the new tool. 相似文献

16.

Reversible jump methods for generalised linear models and generalised linear mixed models

Jonathan J. Forster Roger C. Gill Antony M. Overstall 《Statistics and Computing》2012,22(1):107-120

A reversible jump algorithm for Bayesian model determination among generalised linear models, under relatively diffuse prior distributions for the model parameters, is proposed. Orthogonal projections of the current linear predictor are used so that knowledge from the current model parameters is used to make effective proposals. This idea is generalised to moves of a reversible jump algorithm for model determination among generalised linear mixed models. Therefore, this algorithm exploits the full flexibility available in the reversible jump method. The algorithm is demonstrated via two examples and compared to existing methods. 相似文献

17.

A composite class of estimators using scrambled response mechanism for sensitive population mean in successive sampling

Kumari Priyanka Pidugu Trisandhya 《统计学通讯:理论与方法》2019,48(4):1009-1032

This paper considers the problem of estimation of population mean of a sensitive characteristics using non-sensitive auxiliary variable at current move in two move successive sampling. The proposed estimator is studied under five different scrambled response models. Various estimators have been elaborated to be the member of the proposed class of estimators. The properties of the proposed estimators have been analysed. Many estimators belonging to the proposed class have been explored under five scrambled response models. In order to identify the scrambled model effect, the proposed composite class of estimators is compared to the direct methods. Respondents privacy protection have also been elaborated under different models. Theoretical results are supplemented with numerical demonstrations using real data. Simulation has been carried out to show the applicability of proposed estimators and hence suitable recommendations are forwarded. 相似文献

18.

Functional linear discriminant analysis for irregularly sampled curves

Gareth M. James & Trevor J. Hastie 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2001,63(3):533-550

We introduce a technique for extending the classical method of linear discriminant analysis (LDA) to data sets where the predictor variables are curves or functions. This procedure, which we call functional linear discriminant analysis ( FLDA ), is particularly useful when only fragments of the curves are observed. All the techniques associated with LDA can be extended for use with FLDA. In particular FLDA can be used to produce classifications on new (test) curves, give an estimate of the discriminant function between classes and provide a one- or two-dimensional pictorial representation of a set of curves. We also extend this procedure to provide generalizations of quadratic and regularized discriminant analysis. 相似文献

19.

Projection pursuit by involution

Donal P. Krouse 《统计学通讯:模拟与计算》2013,42(4):1517-1546

A novel projection pursuit method based on projecting the data onto itself is proposed. Using a number of real datasets it is shown how to obtain interesting one and two-dimensional projections using only O(n) evaluations of a one-dimensional projection index. 相似文献

20.

Estimation of Finite Population Variance Using Scrambled Responses in the Presence of Auxiliary Information

Sarjinder Singh Stephen A. Sedory Raghunath Arnab 《统计学通讯:模拟与计算》2015,44(4):1050-1065

In this article, a new estimator for estimating the finite population variance of a sensitive variable based on scrambled responses collected using a randomization device is introduced. The estimator is then improved by using known auxiliary information. The estimators due to Das and Tripathi (1978: Sankhya) and Isaki (1983: JASA) are shown to be special cases of the proposed estimator. Numerical simulations are performed to study the magnitude of the gain in efficiency when using the estimator with auxiliary information with respect to the estimator based only on the scrambled responses. An idea to extend the present work from SRSWOR design to more complex design is also given. 相似文献