You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
During the pre-training, is the used loss function a combination of the MSE loss calculated on the first [mean] output head and the quantile loss calculated on other 9 [quantile] output heads?
The text was updated successfully, but these errors were encountered:
Thank you very much for your immediate reply. I have another question about the experiment results in the paper. Why are the MAE of the fine-tuned TimesFM (Table 2) is higher than the zero-shot results (Table 5) , on ETTm1 and ETTm2?
During the pre-training, is the used loss function a combination of the MSE loss calculated on the first [mean] output head and the quantile loss calculated on other 9 [quantile] output heads?
The text was updated successfully, but these errors were encountered: