MetaDrive – Topdown – Test Time Adaptation – No Backprop, Online-ish local weight learning
This is a very premature and early proof of concept. But I figured I would show it because this is how I imagine the future of OOD autnomous driving to be solved, albeit with a lot more complexity and engineering. While this is still IID training, and not an OOD task, the concept still works in OOD tasks. Model was trained on racetrack oval, racetrack, and racetrack large. Conv3D encoder. Some intrinsic motivation magic. And uhh… PPO. 5120 […]