Out-of-Distribution Generalization

Smoothness-Adaptive Sharpness-Aware Minimization for Finding Flatter Minima