out-of-distribution generalization

Smoothness-Adaptive Sharpness-Aware Minimization for Finding Flatter Minima