Transformer

The Transformer estimator is a transformer neural network that forecasts probability distributions for the next "forecast horizon" values, given the preceding "context length" values. It also uses lagged values and time features, which are computed automatically from the selected time frequency.
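To make the relationship between context length and forecast horizon concrete, here is a minimal sketch of how a series could be cut into (context, target) windows. The function name split_windows and the sliding-window scheme are assumptions for illustration only; the estimator's actual sampling logic, lagged values, and time features are not reproduced here.

```python
from typing import List, Tuple


def split_windows(
    series: List[float], context_length: int, forecast_horizon: int
) -> List[Tuple[List[float], List[float]]]:
    """Return every (context, target) pair that fits inside the series.

    Hypothetical helper: the context holds `context_length` past values,
    the target holds the `forecast_horizon` values that follow them.
    """
    window = context_length + forecast_horizon
    pairs = []
    for start in range(len(series) - window + 1):
        context = series[start : start + context_length]
        target = series[start + context_length : start + window]
        pairs.append((context, target))
    return pairs


series = [float(v) for v in range(10)]
pairs = split_windows(series, context_length=4, forecast_horizon=2)
print(len(pairs))  # number of windows that fit in the series
print(pairs[0])    # first (context, target) pair
```

Each window covers context length + forecast horizon consecutive time steps, which is why longer settings need longer input series.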

Training parameters
The size of the batches to be used for training and prediction.
By default, the number of batches per epoch is set so that, on average, each time step appears in one sample per epoch.
Using a fixed random seed allows for reproducible results.
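One possible reading of the default number of batches can be sketched as follows. This is an assumption, not the tool's confirmed formula: it supposes that one sample spans context length + forecast horizon time steps, so covering every time step once per epoch on average requires roughly total time steps divided by sample length samples. The function name default_num_batches_per_epoch is hypothetical.

```python
import math
from typing import List


def default_num_batches_per_epoch(
    series_lengths: List[int],
    batch_size: int,
    context_length: int,
    forecast_horizon: int,
) -> int:
    """Estimate how many batches make each time step appear about once per epoch.

    Assumed formula, for illustration only.
    """
    # Assumption: one training sample covers this many time steps.
    sample_length = context_length + forecast_horizon
    total_steps = sum(series_lengths)
    # Samples needed so that, on average, each step is seen once.
    num_samples = total_steps / sample_length
    # Round up to whole batches and always train on at least one batch.
    return max(1, math.ceil(num_samples / batch_size))


num_batches = default_num_batches_per_epoch(
    series_lengths=[1000, 500, 420],
    batch_size=32,
    context_length=24,
    forecast_horizon=24,
)
print(num_batches)
```

Under this reading, a larger batch size or longer windows lowers the number of batches per epoch, while more or longer series raises it.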
Model parameters
Set the context length to the model's default value (context length = forecast horizon).
Distribution to use to evaluate observations and sample predictions.
Encode time series identifiers and use them as external features.
Number of evaluation samples per time series, to increase parallelism during inference.
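The roles of the output distribution and of the evaluation samples can be illustrated with a small standard-library sketch. A Gaussian is assumed here purely for simplicity; the distributions actually offered by the model may differ, and all function names below are hypothetical. The distribution is used in two ways: to score an observation (log-likelihood) and to draw sample forecasts, which are then summarized as quantiles. The seed argument shows how a fixed random seed makes the draws repeatable.

```python
import math
import random
from typing import List


def gaussian_log_likelihood(observation: float, mean: float, std: float) -> float:
    """Log-density of one observation under an assumed Gaussian output."""
    variance = std ** 2
    return (
        -0.5 * math.log(2 * math.pi * variance)
        - (observation - mean) ** 2 / (2 * variance)
    )


def sample_forecasts(mean: float, std: float, num_samples: int, seed: int) -> List[float]:
    """Draw `num_samples` forecast values; the same seed gives the same draws."""
    rng = random.Random(seed)
    return [rng.gauss(mean, std) for _ in range(num_samples)]


def empirical_quantile(samples: List[float], q: float) -> float:
    """Simple order-statistic quantile of the sampled forecasts."""
    ordered = sorted(samples)
    index = min(len(ordered) - 1, int(q * len(ordered)))
    return ordered[index]


samples = sample_forecasts(mean=10.0, std=2.0, num_samples=100, seed=42)
print(gaussian_log_likelihood(10.0, mean=10.0, std=2.0))
print(empirical_quantile(samples, 0.1), empirical_quantile(samples, 0.9))
```

More samples per time series give smoother quantile estimates at the cost of more computation.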