Redistribution-based Cost Inference for Safe Offline RL
2024-06-01
·
1 mins read
·
completed
Offline RL
RLHF
Safe AI
Constrained RL
Writeup pending