Exploration Posteriors for Generative Modeling Using Only Negative Rewards arxiv.org 1 points by numeri 6 hours ago