首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A family of methods for statistical disclosure control
Authors:Andreas Quatember  Monika Cornelia Hausner
Institution:1. Department for Applied Statistics , Johannes Kepler University Linz , Altenberger Str. 69, Linz , 4040 , Austria;2. Statistical Office of the Federal State of Salzburg , Rainerstr. 27, Salzburg , 5020 , Austria
Abstract:Statistical disclosure control (SDC) is a balancing act between mandatory data protection and the comprehensible demand from researchers for access to original data. In this paper, a family of methods is defined to ‘mask’ sensitive variables before data files can be released. In the first step, the variable to be masked is ‘cloned’ (C). Then, the duplicated variable as a whole or just a part of it is ‘suppressed’ (S). The masking procedure's third step ‘imputes’ (I) data for these artificial missings. Then, the original variable can be deleted and its masked substitute has to serve as the basis for the analysis of data. The idea of this general ‘CSI framework’ is to open the wide field of imputation methods for SDC. The method applied in the I-step can make use of available auxiliary variables including the original variable. Different members of this family of methods delivering variance estimators are discussed in some detail. Furthermore, a simulation study analyzes various methods belonging to the family with respect to both, the quality of parameter estimation and privacy protection. Based on the results obtained, recommendations are formulated for different estimation tasks.
Keywords:statistical disclosure control  data quality  masking  imputation methods  post-randomization method
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号