Trainer For MARL That Fits With PettingZoo
After 9 months of work I finally got my first successful run in a simple RL environment where the agent learns to find a target 🎉 I’m still validating more SARL scenarios, but I’m now thinking ahead toward MARL and wanted some advice on architecture and trainer choice. Current RL engine structure: 1. SimulationEngine • Handles both logic and physics orchestration • Calls the other layers internally 2. EnvironmentEngine • Handles environment logic 3. BulletWorld • Builds and […]