Train transformer language models with reinforcement learning.
[](/packages/trl)
<a href="/packages/trl"><img src="/api/badges/trl?period=month" alt="PyPI Stats"></a>