This paper focuses on preserving the privacy of sensitive patterns when inducing decision trees. Our record augmentation approach for hiding sensitive classification rules in binary datasets is preferred over other heuristic solutions like output perturbation or cryptographic techniques since the raw data itself is readily available for public use. We describe the process and an indicative experiment using a prototype hiding tool.
展开▼