denotes first/co-first author
Date
Title
Venue
Aug. 2025
Heuristics Considered Harmful: RL With Random Rewards Should Not Make LLMs Reason
NYRL 2024
May 2025
Scaling Offline RL via Efficient and Expressive Shortcut Models
NeurIPS 2025
May 2025
Efficient Controllable Diffusion via Optimal Classifier Guidance
ArXiv
May 2025
Improved Bounds for Delay Dependent Bandits
Preprint
May 2025
Convergence Of Consistency Model With Multistep Sampling Under General Data Assumptions
ICML 2025
Oct. 2024
TurboHopp: Accelerated Molecule Scaffold Hopping with Consistency Models
NeurIPS 2024
April 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
NeurIPS 2024
March 2024
RL for Consistency Models: Reward Guided Text-to-Image Generation with Fast Inference
RLC 2024
Feb. 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
ICML 2024
Feb. 2024
Dataset Reset Policy Optimization for RLHF
ArXiv
Dec. 2023
A Kernel Method Approach to Orbital Debris Blast Point Determination
AIAA 2024
May 2023
Overdetermined Eigenvector Approach to Passive Angles-Only Relative Orbit Determination
JGCD 2023