-
Hi! I have a question regarding the .train() stage and .eval() stage. Did you skip the validation set for simplicity (which in this case, replaces the test data in the training stage) |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
In training stage, preds for train dataloader is outputted while on inference stage its prediction on test dataloader . During model.train(), all the weights at each layers including dropout, batchnorm, layernorm are initialized and gives raw logits on train dataloader which is then transformed into probabilities and hence we get predictions During model.eval(), weights of train mode are saved but dropout and normalization layers are excluded(turned off), and then generate predictions |
Beta Was this translation helpful? Give feedback.
In training stage, preds for train dataloader is outputted while on inference stage its prediction on test dataloader .
Its not same.
During model.train(), all the weights at each layers including dropout, batchnorm, layernorm are initialized and gives raw logits on train dataloader which is then transformed into probabilities and hence we get predictions
During model.eval(), weights of train mode are saved but dropout and normalization layers are excluded(turned off), and then generate predictions