Tags

inexact deflation

power iteration

principal component analysis

adaptive optimization

local smoothness

out-of-distribution generalization

sharpness-aware minimization

frank-wolfe algorithm

lotterty ticket hypothesis

machine learning