Experiments


run-id: exp_0
epochs: 1,000,000
Reward Function: None
Training Setup: Collision Reset
Avg 10 Laps Time: N/A
Average # Crashes/Lap: N/A
Comments: The kart seems to perform random actions and never makes it to the first checkpoint. One interesting observation is that it appears to slow down and turn away when it is about to collide with a wall; it is not clear why.
Run Script: mlagents-learn ./Assets/Karting/Prefabs/AI/train_config.yaml --train --run-id=exp_1 --results-dir ./Assets/results/

run-id: exp_1
epochs: 1,000,000
Reward Function:
  +1 for passing a checkpoint
  -1 for hitting a wall
Training Setup: Collision Reset
Avg 10 Laps Time: N/A
Average # Crashes/Lap: N/A
Comments: The training “mean reward” starts negative and converges to 0, and the model performs very poorly. This is somewhat expected: most of the time the kart does not pass a checkpoint, so it receives no reward. There has to be a reward that guides the kart towards the checkpoint, which is added in exp_2.
Run Script: mlagents-learn ./Assets/Karting/Prefabs/AI/train_config.yaml --train --run-id=exp_1 --results-dir ./Assets/results/

run-id: exp_2
epochs: 1,000,000
Reward Function:
  +1 for passing a checkpoint
  -1 for hitting a wall
  +0.05 for driving towards checkpoint
Training Setup: Collision Reset
Avg 10 Laps Time: 00:01:57:92
Average # Crashes/Lap: 313
Comments: Simply adding a reward for heading towards the checkpoint greatly improves the model’s performance.
Run Script: mlagents-learn ./Assets/Karting/Prefabs/AI/train_config.yaml --train --run-id=exp_2 --results-dir ./Assets/results/

run-id: exp_3
epochs: 1,000,000
Reward Function:
  +1 for passing a checkpoint
  -1 for hitting a wall
  +0.05 for driving towards checkpoint
  +0.05 for local speed
Training Setup: Collision Reset
Avg 10 Laps Time: 00:01:26:45
Average # Crashes/Lap: 600
Comments: The model runs faster (expected) but also crashes more often (also expected).
Run Script: mlagents-learn ./Assets/Karting/Prefabs/AI/train_config.yaml --train --run-id=exp_3 --results-dir ./Assets/results/

run-id: exp_4
epochs: 1,000,000
Reward Function:
  -1 for hitting a wall
  +0.05 for driving towards checkpoint
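
The reward terms listed for exp_1 to exp_3 can be sketched as a single per-step function. This is only an illustrative Python sketch, not the project's actual agent code: the names StepInfo, RewardWeights and step_reward are hypothetical, and it assumes that the "driving towards checkpoint" term is scaled by how well the kart's heading lines up with the next checkpoint and that "local speed" is normalised to the range 0 to 1. The real implementation may compute these terms differently.

```python
from dataclasses import dataclass


@dataclass
class StepInfo:
    """Hypothetical per-step signals (names are illustrative, not from the project)."""
    passed_checkpoint: bool
    hit_wall: bool
    heading_alignment: float  # assumed: 1.0 = driving straight at the next checkpoint, -1.0 = away
    local_speed: float        # assumed: forward speed normalised to [0, 1]


@dataclass
class RewardWeights:
    """Weights for each reward term; 0.0 disables a term."""
    checkpoint: float = 0.0
    wall: float = 0.0
    towards: float = 0.0
    speed: float = 0.0


# Weights as listed in the experiment log.
EXP_1 = RewardWeights(checkpoint=1.0, wall=-1.0)
EXP_2 = RewardWeights(checkpoint=1.0, wall=-1.0, towards=0.05)
EXP_3 = RewardWeights(checkpoint=1.0, wall=-1.0, towards=0.05, speed=0.05)


def step_reward(info: StepInfo, w: RewardWeights) -> float:
    """Sum the enabled reward terms for one simulation step."""
    reward = 0.0
    if info.passed_checkpoint:
        reward += w.checkpoint
    if info.hit_wall:
        reward += w.wall  # the wall weight is negative, so this is a penalty
    # Dense shaping terms: these give a signal on every step, not only at events.
    reward += w.towards * info.heading_alignment
    reward += w.speed * info.local_speed
    return reward
```

Under these assumptions, a step in which the kart neither passes a checkpoint nor hits a wall yields exactly 0 with the exp_1 weights, which is consistent with the observation that exp_1 gives the agent little to learn from. The exp_2 and exp_3 weights return a small non-zero value on almost every step.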

Plan


Demo Script