streaming.base.partition.get_partitions(algo, num_samples, num_canonical_nodes, num_physical_nodes, ranks_per_node, workers_per_rank, batch_size, drop_first=0, initial_physical_nodes=None)[source]#

Partition the given number of samples to nodes, ranks, and workers.

Either canonical or physical nodes must be evenly divisible by the other.

It is suggested to set num_canonical_nodes higher than your expected number of physical nodes, because scaling your number of nodes below that level may result in more shards being used across node boundaries due to preserving the same global sample order.

  • algo (str) – Partition algorithm name.

  • num_samples (int) – Dataset size.

  • num_canonical_nodes (int) – Number of canonical nodes.

  • num_physical_nodes (int) – Number of physical nodes.

  • ranks_per_node (int) – Number of ranks per node.

  • workers_per_rank (int) – Number of worker partitions per rank.

  • batch_size (int) – Batch size of DataLoader and dataset, which affects how the dataset is partitioned over the workers.

  • drop_first (int) – Number of samples seen already, which are dropped. Defaults to 0.

  • initial_physical_nodes (int, optional) – Number of physical nodes at the start of training. Defaults to None.


NDArray[np.int64] – Partitions of shape (physical nodes, ranks per node, workers per rank, batches per worker, batch size).