The DiffSim Trinity contains algorithms based on differentiable simulation for end-to-end control, planning, and search, within the domain of autonomous vehicles. We use Waymax and the Waymo Open Motion Dataset (WOMD). The codebase contains:
- Analytic World Models (AWM) for jointly learning a policy and diverse predictive models (odometry, planner, inverse state).
- Differentiable Simulation Search (DSS) for gradient-based search at inference time using learned policies.
This repository is based on the following papers:
- Autonomous Vehicle Controllers From End-to-End Differentiable Simulation (IROS 2025)
- Unlocking Efficient Vehicle Dynamics Modeling via Analytic World Models (AAAI 2026)
- Autonomous Vehicle Path Planning by Searching With Differentiable Simulation (AAAI 2026)
- `configs/` – training hyperparameters for each variant plus shared constants.
- `train/` – entrypoints for AWM and DSS training and the corresponding trainers.
- `eval/` – evaluation entrypoints and metrics loops for AWM and DSS.
- `models/` – feature extractors, state-processing utilities, and RNN actor-critic architectures.
- `utils/` – data loading, observations, plotting, and state manipulation.
- Use Python 3.11 with a CUDA-enabled GPU. The packages in `requirements.txt` target CUDA 12.x.
- (Optional) Create and activate a virtual environment, for example using `micromamba`.
- Install dependencies:

  ```shell
  python3 -m pip install -r requirements.txt
  ```

- Obtain access to the Waymo Open Motion Dataset and configure Waymax credentials as described in the Waymax setup guide.
Default training/validation paths point to the public GCS URIs:
- Training: `gs://waymo_open_dataset_motion_v_1_1_0/uncompressed/tf_example/training/training_tfexample.tfrecord@1000`
- Validation: `gs://waymo_open_dataset_motion_v_1_1_0/uncompressed/tf_example/validation/validation_tfexample.tfrecord@150`
If you mirror the TFRecords locally, update `training_path` / `validation_path` in the relevant config under `configs/`. Control dataset caching via the `dataset_cache` keyword in the config (`True` for in-memory, `False` for none, or a path for on-disk caching).
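The exact config layout is not shown here, so the following is only an illustrative sketch of the dataset-related fields described above; any field beyond `training_path`, `validation_path`, and `dataset_cache` is an assumption.

```python
# Illustrative sketch of dataset-related config fields; the real configs
# under configs/ may structure these differently.

# Public GCS URIs used by default (from this README):
TRAIN_URI = (
    "gs://waymo_open_dataset_motion_v_1_1_0/uncompressed/"
    "tf_example/training/training_tfexample.tfrecord@1000"
)
VALID_URI = (
    "gs://waymo_open_dataset_motion_v_1_1_0/uncompressed/"
    "tf_example/validation/validation_tfexample.tfrecord@150"
)

config = {
    # Point these at a local mirror if you have one,
    # e.g. "/data/womd/training_tfexample.tfrecord@1000".
    "training_path": TRAIN_URI,
    "validation_path": VALID_URI,
    # dataset_cache: True -> in-memory cache, False -> no cache,
    # or a filesystem path (str) -> on-disk cache.
    "dataset_cache": False,
}
```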
Trains the policy to select actions while still learning the planner and odometry heads.

```shell
python3 train/main_train_awm.py
```

- Uses `configs/conf_train_awm.py`.
- Checkpoints to `logs/train_awm/` by default; change `log_folder` near the top of the script if needed.
- Key toggles: `use_planner_for_train=False`, `num_envs` (batch size).
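The joint objective sketched above — a policy trained alongside predictive heads (odometry, planner, inverse state) — can be illustrated in miniature. The function name, head names as dictionary keys, and the loss weighting are illustrative assumptions, not the repo's actual trainer API.

```python
import numpy as np

def joint_awm_loss(policy_loss, head_preds, head_targets, head_weight=0.5):
    """Illustrative only: total loss = policy loss plus a weighted sum of
    mean-squared errors over the predictive heads (odometry, planner,
    inverse state). The real trainer's loss shaping may differ."""
    head_loss = sum(
        np.mean((head_preds[name] - head_targets[name]) ** 2)
        for name in head_preds
    )
    return policy_loss + head_weight * head_loss

# Toy example: when every head matches its target, only the policy loss remains.
preds = {"odometry": np.zeros(3), "planner": np.ones(2)}
targets = {"odometry": np.zeros(3), "planner": np.ones(2)}
total = joint_awm_loss(1.25, preds, targets)  # -> 1.25
```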
Uses the planner to select actions during training.

```shell
python3 train/main_train_awm_planner_driven.py
```

- Uses `configs/conf_train_awm_planner_driven.py` (contains the setting `use_planner_for_train=True`).
- Outputs to `logs/train_awm_planner_driven/` by default.
At inference time the DSS agent requires two policies: one for the ego vehicle, which uses route conditioning (the last waypoint), and one for the other agents, which uses no route conditioning.
```shell
# Ego policy (route-conditioned)
python3 train/main_train_search_ego.py

# Other-agents policy (no waypoint conditioning)
python3 train/main_train_search_other.py
```

- Configs live in `configs/conf_train_search_ego.py` and `configs/conf_train_search_other.py`.
- Checkpoints land in `logs/train_search_ego/` and `logs/train_search_other/`.
- Adjust batch sizes via `num_envs` if you hit OOM.
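The route-conditioning distinction between the two policies can be sketched as follows. This is a minimal illustration, assuming a flat observation vector; the actual observation layout in `utils/` and `models/` may differ.

```python
import numpy as np

def build_obs(base_obs, route_waypoints=None):
    """Illustrative: the ego observation appends the final route waypoint
    (route conditioning); other agents use the base observation alone."""
    if route_waypoints is None:        # other agents: no waypoint conditioning
        return base_obs
    last_wp = route_waypoints[-1]      # ego: condition on the last waypoint
    return np.concatenate([base_obs, last_wp])

base = np.zeros(8)                               # toy base observation
route = np.array([[1.0, 2.0], [3.0, 4.0]])       # toy route, (x, y) waypoints
ego_obs = build_obs(base, route)                 # shape (10,)
other_obs = build_obs(base)                      # shape (8,)
```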
```shell
python3 eval/main_eval_awm.py --expe_id train_awm --epochs 79 \
    --use_planner_for_eval 0 --planning_horizon 10 --num_imagined_rollouts 1
```

- `--expe_id` must match the folder under `logs/`.
- If the model was trained planner-driven, set `--use_planner_for_eval 1`.
- Useful flags: `--use_mpc` to toggle model-predictive control, `--num_imagined_rollouts` (how many rollouts to imagine), `--planning_horizon` (how far ahead to imagine each trajectory), and `--top_k` (how many candidate actions to aggregate when using MPC).
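The interaction of the MPC-related flags above can be sketched in miniature: sample candidate action sequences, score the imagined rollouts, and aggregate the first actions of the top-k candidates. Everything here (function name, Gaussian sampling, mean aggregation) is an illustrative assumption, not the repo's implementation.

```python
import numpy as np

def mpc_select_action(score_rollout, num_rollouts=8, horizon=10, top_k=3,
                      action_dim=2, seed=0):
    """Illustrative MPC step: sample `num_rollouts` candidate action
    sequences of length `horizon`, score each imagined rollout, and
    average the first actions of the `top_k` best candidates.
    `score_rollout(actions) -> scalar`; higher is better."""
    rng = np.random.default_rng(seed)
    candidates = rng.normal(size=(num_rollouts, horizon, action_dim))
    scores = np.array([score_rollout(a) for a in candidates])
    best = np.argsort(scores)[-top_k:]        # indices of the top_k rollouts
    return candidates[best, 0].mean(axis=0)   # aggregate their first actions

# Toy scoring function: prefer action sequences with small magnitude.
action = mpc_select_action(lambda a: -np.sum(a ** 2))
```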
Requires both trained policies (ego + others).
```shell
python3 eval/main_eval_search.py --expe_id train_search_other --epochs 39 \
    --do_search True --ego_policy_weights train_search_ego/params_39.pkl \
    --num_modes 4 --num_actions_to_commit_to 3 \
    --imagination_length 15 --step_size 1000. 0.01
```

`utils/custom_plots.py` contains `plot_animated` and `plot_animated_awm` for visualizing scenarios and imagined trajectories. Call them inside the eval scripts (see the inline docstring examples) to dump GIFs of the agent behavior.
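The gradient-based search that `--do_search` enables can be sketched in miniature: repeatedly adjust the action sequence along the gradient of the rollout return. In the repo this gradient presumably comes from differentiating through the simulator; here a hand-written toy gradient stands in, and the function name and defaults are illustrative assumptions.

```python
import numpy as np

def search_actions(grad_fn, init_actions, step_size=0.01, num_steps=500):
    """Illustrative gradient-based search at inference time: step the
    action sequence along grad_fn, the gradient of the (differentiable)
    rollout return with respect to the actions."""
    actions = init_actions.copy()
    for _ in range(num_steps):
        actions += step_size * grad_fn(actions)  # gradient ascent on return
    return actions

# Toy differentiable objective: return = -||actions - target||^2, so the
# gradient is 2 * (target - actions) and the search converges to target.
target = np.array([0.5, -0.2])
final = search_actions(lambda a: 2.0 * (target - a), np.zeros(2))
```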
If you use this work, consider citing:
- Autonomous Vehicle Controllers From End-to-End Differentiable Simulation
@inproceedings{nachkov2024autonomous,
title={Autonomous Vehicle Controllers From End-to-End Differentiable Simulation},
author={Nachkov, Asen and Paudel, Danda Pani and Van Gool, Luc},
booktitle={2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
year={2025},
}
- Unlocking Efficient Vehicle Dynamics Modeling via Analytic World Models
@inproceedings{nachkov2025unlocking,
title={Unlocking Efficient Vehicle Dynamics Modeling via Analytic World Models},
author={Nachkov, Asen and Paudel, Danda Pani and Zaech, Jan-Nico and Scaramuzza, Davide and Van Gool, Luc},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
year={2026}
}
- Autonomous Vehicle Path Planning by Searching With Differentiable Simulation
@inproceedings{nachkov2025search,
title={Autonomous Vehicle Path Planning by Searching With Differentiable Simulation},
author={Nachkov, Asen and Paudel, Danda Pani and Zaech, Jan-Nico and Scaramuzza, Davide and Van Gool, Luc},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
year={2026}
}