streaming#
MosaicML Streaming Datasets for cloud-native model training.
Classes
Writes a streaming CSV dataset. |
|
Writes a streaming JSON dataset. |
|
A streaming dataset whose shards reside locally as a pytorch Dataset. |
|
Writes a streaming MDS dataset. |
|
A dataset, or sub-dataset if mixing, from which we stream/cache samples. |
|
A streaming data loader. |
|
A mid-epoch-resumable streaming/caching pytorch IterableDataset. |
|
Writes a streaming TSV dataset. |
|
Writes a streaming XSV dataset. |