Hello!
I noticed that the code seems to calculate the metrics (mae, mape, rmse) for each mini-batch and then average them during the test phase.
However, due to the unbalanced distribution of nulls in the test data, I find that such an approach gives different results compared to the canonical approaches used in previous work (e.g. DCRNN and GWNet), even if the dataset is padded.
Hello!
I noticed that the code seems to calculate the metrics (mae, mape, rmse) for each mini-batch and then average them during the test phase.
However, due to the unbalanced distribution of nulls in the test data, I find that such an approach gives different results compared to the canonical approaches used in previous work (e.g. DCRNN and GWNet), even if the dataset is padded.