Disclosure Risk and Disclosure Avoidance for Microdata |
| |
Authors: | Gerhard Paass |
| |
Affiliation: | Institute for Applied Information Technology, German National Research Center for Computer Science , Sankt Augustin , Federal Republic of Germany |
| |
Abstract: | Under given concrete exogenous conditions, the fraction of identifiable records in a microdata file without positive identifiers such as name and address is estimated. The effect of possible noise in the data, as well as the sample property of microdata files, is taken into account. Using real microdata files, it is shown that there is no risk of disclosure if the information content of characteristics known to the investigator (additional knowledge) is limited. Files with additional knowledge of large information content yield a high risk of disclosure. This can be eliminated only by massive modifications of the data records, which, however, involve large biases for complex statistical evaluations. In this case, the requirement for privacy protection and high-quality data perhaps may be fulfilled only if the linkage of such files with extensive additional knowledge is prevented by appropriate organizational and legal restrictions. |
| |
Keywords: | Anonymization methods Discriminant analysis Identification of individual records Noise Privacy protection Sample surveys |
|
|