Pass@K Policy Optimization: Solving Harder Reinforcement Learning Problems

Christian Walder
Christian Walder
Research Scientist
Honorary Visting Fellow

My research interests include Bayesian machine learning, sequence models for music and reinforcement learning.