BAYESIAN INSIGHTS ON DISCLOSURE LIMITATION: MASK OR IMPUTE?
Abstract
Statistical agencies seek to disseminate useful data while keeping low the risk of statistical confidentiality disclosure. Recognizing that reidentification of data is generally inadequate to protect its confidentiality against attack by a data snooper, agencies restrict the data they release for general use. Typically, these restricted data procedures have involved transformation or masking of the original, collected data through such devices as adding noise, topcoding, data swapping, and recoding. Recently, proposals have been put forth for the release of synthetic data, simulated from models constructed from the original data. This paper gives a framework for the comparison of masking and synthetic data as two approaches to disclosure limitation. Particular attention is paid to data utility and disclosure risk. Examples of instantiation of masking and of synthetic data construction are provided to illustrate the concepts. Particular attention is paid to data swapping. Insights drawn from the Bayesian paxadigm are provided.
- Authors:
- Publication Date:
- Research Org.:
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Sponsoring Org.:
- US Department of Energy (US)
- OSTI Identifier:
- 766734
- Report Number(s):
- LA-UR-00-3771
TRN: AH200038%%610
- DOE Contract Number:
- W-7405-ENG-36
- Resource Type:
- Conference
- Resource Relation:
- Conference: Conference title not supplied, Conference location not supplied, Conference dates not supplied; Other Information: PBD: 1 Oct 2000
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE; CONSTRUCTION; TRANSFORMATIONS; STATISTICS; DATA-FLOW PROCESSING
Citation Formats
KELLER-MCNULTY, S, and DUNCAN, G. BAYESIAN INSIGHTS ON DISCLOSURE LIMITATION: MASK OR IMPUTE?. United States: N. p., 2000.
Web.
KELLER-MCNULTY, S, & DUNCAN, G. BAYESIAN INSIGHTS ON DISCLOSURE LIMITATION: MASK OR IMPUTE?. United States.
KELLER-MCNULTY, S, and DUNCAN, G. 2000.
"BAYESIAN INSIGHTS ON DISCLOSURE LIMITATION: MASK OR IMPUTE?". United States. https://www.osti.gov/servlets/purl/766734.
@article{osti_766734,
title = {BAYESIAN INSIGHTS ON DISCLOSURE LIMITATION: MASK OR IMPUTE?},
author = {KELLER-MCNULTY, S and DUNCAN, G},
abstractNote = {Statistical agencies seek to disseminate useful data while keeping low the risk of statistical confidentiality disclosure. Recognizing that reidentification of data is generally inadequate to protect its confidentiality against attack by a data snooper, agencies restrict the data they release for general use. Typically, these restricted data procedures have involved transformation or masking of the original, collected data through such devices as adding noise, topcoding, data swapping, and recoding. Recently, proposals have been put forth for the release of synthetic data, simulated from models constructed from the original data. This paper gives a framework for the comparison of masking and synthetic data as two approaches to disclosure limitation. Particular attention is paid to data utility and disclosure risk. Examples of instantiation of masking and of synthetic data construction are provided to illustrate the concepts. Particular attention is paid to data swapping. Insights drawn from the Bayesian paxadigm are provided.},
doi = {},
url = {https://www.osti.gov/biblio/766734},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Sun Oct 01 00:00:00 EDT 2000},
month = {Sun Oct 01 00:00:00 EDT 2000}
}