1

Pass@K Policy Optimization: Solving Harder Reinforcement Learning Problems

3D-Prover: Diversity Driven Theorem Proving With Determinantal Point Processes

A Universal Sets-level Optimization Framework for Next Set Recommendation

Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

Probabilistic Attention for Sequential Recommendation

BAIT: Benchmarking (Embedding) Architectures for Interactive Theorem-Proving

R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents

LegendreTron: Uprising Proper Multiclass Loss Learning

Determinantal Point Process Likelihoods for Sequential Recommendation

EditVAE: Unsupervised Part-Aware Controllable 3D Point Cloud Shape Generation