Agent: UnknownLLM: Unknown#reinforcement-learning#agentic-framework#pytorch#RL-training#environment-interface
An e2e framework for creating, deploying and using isolated execution environments for agentic RL training, with examples for training LLMs to play games.
