2

How Much Pre-training Is Enough to Discover a Good Subnetwork?

When is Momentum Extragradient Optimal? A Polynomial-Based Analysis

Fast Quantum State Tomography via Accelerated Non-convex Programming

Local Stochastic Factored Gradient Descent for Distributed Quantum State Tomography