Learning Private Representations through Entropy-based Adversarial Training

cs.LG arXiv:2507.10194
View PDF arXiv JSON

Abstract

How can we learn a representation with high predictive power while preserving user privacy? We present an adversarial representation learning method for sanitizing sensitive content from the learned representation. Specifically, we introduce a variant of entropy - focal entropy, which mitigates the potential information leakage of the existing entropy-based approaches. We showcase feasibility on multiple benchmarks. The results suggest high target utility at moderate privacy leakage.

PDF Viewer