Scheduling workloads to AWS placement groups
Run:ai supports AWS placement groups when building and submitting a job. AWS placement groups are used to maximize the throughput and performance of distributed training workloads.
To enable and configure this feature:
- Press Jobs | New job.
- In Scheduling and lifecycle, enable Topology aware scheduling.
- In Topology key, enter the node label that identifies the topology of the node.
- In Scheduling rule, choose Required or Preferred from the drop-down.
    - Required: when enabled, all pods must be scheduled to the same placement group.
    - Preferred: when enabled, scheduling is best-effort and places as many pods as possible in the same placement group.
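
To make the difference between the two scheduling rules concrete, the sketch below builds the plain Kubernetes pod-affinity stanza that expresses each rule for a given topology key. It is an illustration of the semantics only, under the assumption that the rules behave like standard Kubernetes required and preferred pod affinity; it is not taken from Run:ai and may not match how Run:ai implements the feature. The function `build_pod_affinity`, the label key `example.com/placement-group`, and the `job-name: dist-train` selector are all hypothetical names chosen for the example.

```python
import json
from typing import Dict

# Hypothetical node label key, used only for illustration. Substitute the
# label that actually identifies the placement group on your nodes.
TOPOLOGY_KEY = "example.com/placement-group"


def build_pod_affinity(topology_key: str, rule: str, match_labels: Dict[str, str]) -> dict:
    """Return a Kubernetes `affinity` fragment for the given scheduling rule.

    rule == "Required":  pods may only land in a topology domain that already
                         hosts pods matching `match_labels` (hard constraint).
    rule == "Preferred": the scheduler favors such a domain but may fall back
                         to another one (best effort).
    """
    term = {
        "labelSelector": {"matchLabels": dict(match_labels)},
        "topologyKey": topology_key,
    }
    if rule == "Required":
        return {
            "podAffinity": {
                "requiredDuringSchedulingIgnoredDuringExecution": [term]
            }
        }
    if rule == "Preferred":
        return {
            "podAffinity": {
                "preferredDuringSchedulingIgnoredDuringExecution": [
                    # Weight ranges from 1 to 100; 100 is the strongest preference.
                    {"weight": 100, "podAffinityTerm": term}
                ]
            }
        }
    raise ValueError(f"rule must be 'Required' or 'Preferred', got {rule!r}")


if __name__ == "__main__":
    # Hypothetical selector that matches the other pods of the same job.
    selector = {"job-name": "dist-train"}
    for rule in ("Required", "Preferred"):
        print(rule)
        print(json.dumps(build_pod_affinity(TOPOLOGY_KEY, rule, selector), indent=2))
```

With a hard constraint, a pod that cannot fit in the same topology domain stays pending. With a soft preference, it is scheduled elsewhere, which keeps the job running at the cost of cross-group network performance.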