Skip to content

Stochastic depth #99

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
albertz opened this issue Feb 4, 2022 · 0 comments · Fixed by #106
Closed

Stochastic depth #99

albertz opened this issue Feb 4, 2022 · 0 comments · Fixed by #106

Comments

@albertz
Copy link
Member

albertz commented Feb 4, 2022

We should have this implemented.
The name varies a bit in related literature and other implementations. Sometimes it is called stochastic layers, sometimes drop path, layer drop, even drop connect although that usually means sth different (dropout on weights).

This was introduced in this paper: Deep Networks with Stochastic Depth

This is commonly used for Transformers. For example:

External example implementations:

This needs to depend (conditional logic, #24) on the train flag (#18).

@albertz albertz changed the title Stochastic layers Stochastic depth Feb 7, 2022
albertz added a commit that referenced this issue Feb 8, 2022
albertz added a commit that referenced this issue Feb 8, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant