*Equal contribution
(α) Alphabetical order
Preconditioning via Randomized Range Deflation (RandRAND)
Strongly-polynomial time and validation analysis of policy gradient methods
Policy optimization over general state and action spaces
Dual dynamic programming for stochastic programs over an infinite horizon
Learning a Local Trading Strategy: Deep Reinforcement Learning for Grid-scale Renewable Energy Integration
Reinforcement Learning-Based Control for Waste Biorefining Processes Under Uncertainty
Efficient parallel implementation of the multiplicative weight update method for graph-based linear programs
Implicit regularization of Bregman proximal point algorithm and mirror descent on separable data
Communication lower bounds for nested bilinear algorithms via rank expansion of Kronecker products