streaming.text#
Natively supported NLP datasets.
Classes
Implementation of the C4 (Colossal Cleaned Common Crawl) dataset using StreamingDataset. |
|
Implementation of the English Wikipedia 2020-01-01 streaming dataset. |
|
Implementation of the the Pile using StreamingDataset. |