Why was torch.softmax not needed on logits for ComputerVision Model_0 on FashionMNIST data? #798
ankitgooner asked this question in Q&A
Answered by mrdbourke
While preparing the accuracy function's parameters on the test data, argmax was applied directly to the logits without applying softmax first, unlike in the earlier multi-class model training.
Answered by mrdbourke · Jan 11, 2024
Hey @ankitgooner,

In short, softmax is a transformation from logits to prediction probabilities that range from 0 to 1. Softmax makes the logits easier to understand from a human perspective. However, it's not strictly necessary to apply softmax before taking the argmax of the logits to get the predictions, because softmax preserves the ordering of the logits.

See a fuller explanation here: #314
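A quick sketch of this point (using randomly generated logits as a stand-in for a model's output on a 10-class problem like FashionMNIST):

```python
import torch

# Hypothetical logits for a batch of 4 samples across 10 classes
logits = torch.randn(4, 10)

# Softmax rescales each row into probabilities that sum to 1
probs = torch.softmax(logits, dim=1)

# Softmax is monotonic, so the largest logit in a row is also the
# largest probability in that row - argmax agrees either way
preds_from_logits = logits.argmax(dim=1)
preds_from_probs = probs.argmax(dim=1)

print(torch.equal(preds_from_logits, preds_from_probs))  # True
```

So applying softmax first only matters when you want the actual prediction probabilities, not just the predicted class labels.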
Answer selected by ankitgooner