You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We should have this implemented.
The name varies a bit in related literature and other implementations. Sometimes it is called stochastic layers, sometimes drop path, layer drop, even drop connect although that usually means sth different (dropout on weights).
We should have this implemented.
The name varies a bit in related literature and other implementations. Sometimes it is called stochastic layers, sometimes drop path, layer drop, even drop connect although that usually means sth different (dropout on weights).
This was introduced in this paper: Deep Networks with Stochastic Depth
This is commonly used for Transformers. For example:
External example implementations:
DropPath
This needs to depend (conditional logic, #24) on the train flag (#18).
The text was updated successfully, but these errors were encountered: