Queuing

Driverless AI automatically queues experiments to avoid overloading the system. You can launch multiple experiments at the same time; each one is queued and starts running when the resources it needs become available.

Note: By default, each node runs two experiments at a time. This limit is controlled by the worker_remote_processors option in the config.toml file. The same file contains additional options that control resource allocation.
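
As a rough illustration of the option named in the note above, an entry in config.toml might look like the following sketch. The value 2 simply mirrors the default described here; the actual default and the accepted values may differ between versions, so check the config.toml shipped with your installation before changing it.

```toml
# Sketch only: number of experiments each node runs at the same time.
# 2 mirrors the default described in the note above (an assumption for
# illustration); adjust it to match the resources available on the node.
worker_remote_processors = 2
```

With a setting like this, experiments launched beyond the limit would wait in the queue until a running experiment finishes and frees its resources.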

Also see Multinode Training (Alpha)