Agentic execution environments for RL training with simple Gymnasium-style APIs
An e2e framework for creating, deploying and using isolated execution environments for agentic RL training, with examples for training LLMs to play games.
Sign in to leave a comment.
No comments yet.