composer.algorithms.functional.apply_seq_length_warmup
- composer.algorithms.functional.apply_seq_length_warmup(batch: Dict[str, Tensor], curr_seq_len: int, truncate: bool) -> Union[Tuple[Union[Tensor, Tuple[Tensor, ...], List[Tensor]], Union[Tensor, Tuple[Tensor, ...], List[Tensor]]], List[Tensor], Dict[str, Tensor], Tensor]
Progressively increases the sequence length during training.
Changes the sequence length of all tensors in the provided dictionary to
curr_seq_len, by either truncating the tensors (truncate=True) or reshaping the tensors to create new examples from the extra tokens (truncate=False). The schedule for
curr_seq_len over training time should be managed outside of this function.
- Parameters
- Returns
batch – a Mapping of input tensors to the model, where all tensors have curr_seq_len in the second dimension.
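The two modes described above can be sketched in plain Python, with nested lists standing in for 2-D tensors. This is only an illustration of the idea, not Composer's implementation: the names resize_rows, resize_batch, and linear_seq_len are hypothetical, dropping leftover tokens that do not fill a complete row is an assumption, and the linear schedule is just one way a caller might choose curr_seq_len outside this function.

```python
from typing import Dict, List

Rows = List[List[int]]  # stand-in for a 2-D tensor of shape (batch, seq_len)


def resize_rows(rows: Rows, curr_seq_len: int, truncate: bool) -> Rows:
    """Illustrative only; assumes every row has at least curr_seq_len tokens."""
    if truncate:
        # truncate=True: keep the first curr_seq_len tokens of each example.
        return [row[:curr_seq_len] for row in rows]
    # truncate=False: flatten all tokens and cut them into new examples.
    flat = [tok for row in rows for tok in row]
    # Assumption: tokens that do not fill a complete row are dropped.
    usable = (len(flat) // curr_seq_len) * curr_seq_len
    return [flat[i:i + curr_seq_len] for i in range(0, usable, curr_seq_len)]


def resize_batch(batch: Dict[str, Rows], curr_seq_len: int, truncate: bool) -> Dict[str, Rows]:
    """Apply the same resizing to every entry of the batch dictionary."""
    return {name: resize_rows(rows, curr_seq_len, truncate) for name, rows in batch.items()}


def linear_seq_len(step: int, warmup_steps: int, min_len: int, max_len: int) -> int:
    """Hypothetical caller-side schedule: grow linearly from min_len to max_len."""
    if step >= warmup_steps:
        return max_len
    return min_len + (max_len - min_len) * step // warmup_steps


batch = {"input_ids": [[1, 2, 3, 4, 5, 6, 7, 8], [9, 10, 11, 12, 13, 14, 15, 16]]}
truncated = resize_batch(batch, curr_seq_len=4, truncate=True)
reshaped = resize_batch(batch, curr_seq_len=4, truncate=False)
```

With truncate=True the batch keeps its two examples and loses the tokens past position 4; with truncate=False the same 16 tokens become four examples of length 4, so no data is discarded but the batch size changes.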