A New Linear Scaling Rule for Private Adaptive Hyperparameter Optimization

A New Linear Scaling Rule for Private Adaptive Hyperparameter Optimization
By: Ashwinee Panda, Xinyu Tang, Saeed Mahloujifar, Vikash Sehwag, Prateek Mittal Posted: May 8, 2024
arXiv:2212.04486v3 Announce Type: replace-cross
Abstract: An open problem in differentially private deep learning is hyperparameter optimization (HPO). DP-SGD introduces new hyperparameters and complicates existing ones, forcing researchers to painstakingly tune hyperparameters with hundreds of trials, which in turn makes it impossible to account for the privacy cost of HPO without destroying the utility. We propose an adaptive HPO method that uses cheap trials (in terms of privacy cost and runtime) to estimate optimal hyperparameters and scales them up. We obtain state-of-the-art performance on 22 benchmark tasks, across computer vision and natural language processing, across pretraining and finetuning, across architectures and a wide range of $varepsilon in [0.01,8.0]$, all while accounting for the privacy cost of HPO.
Provided by:

DoctorMorDi

DoctorMorDi

Moderator and Editor