SplitDimsLayer fix feature_dim_axis on feature-dim split #705
Test for #704
8f1d12c to ad95110
Git bisecting: it is bad in current master (ea3e91f).
So yes, as expected, #683 introduced this bug.
Note on the potential wrong behavior
This did not cause an error, just wrong behavior!
A subsequent 2D convolution operates on the two spatial dims. The spatial dims are those which are neither the feature dim nor the batch dim. So it matters which of the split dims, 40 -> (40,1), is actually the feature dim. Earlier, the last axis (with dim 1) was the new feature dim.
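To make the point about spatial dims concrete, here is a minimal NumPy sketch. It does not use the RETURNN API; `spatial_axes` is a hypothetical helper, and the batch and time sizes (3 and 7) are made up for illustration. It only shows that, after splitting 40 -> (40,1), the set of spatial dims depends on which axis is treated as the feature dim.

```python
import numpy as np


def spatial_axes(ndim, batch_axis, feature_axis):
    """Hypothetical helper (not RETURNN API): axes that are neither batch nor feature."""
    feature_axis %= ndim  # allow negative indices such as -1
    return [a for a in range(ndim) if a not in (batch_axis, feature_axis)]


x = np.zeros((3, 7, 40))      # assumed layout: (batch, time, feature)
y = x.reshape(3, 7, 40, 1)    # split the feature dim 40 -> (40, 1)

# Case A: the last axis (dim 1) is the feature dim.
axes_a = spatial_axes(y.ndim, batch_axis=0, feature_axis=-1)
shape_a = [y.shape[a] for a in axes_a]   # spatial dims: time and the 40-dim axis

# Case B: the axis with dim 40 is the feature dim instead.
axes_b = spatial_axes(y.ndim, batch_axis=0, feature_axis=2)
shape_b = [y.shape[a] for a in axes_b]   # spatial dims: time and the dim-1 axis

print(axes_a, shape_a)
print(axes_b, shape_b)
```

In case A a 2D convolution would see spatial sizes (7, 40), while in case B it would see (7, 1), so the same config silently computes something different without raising any error.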
In some follow-up 2D conv with options:
you then see:
So the second spatial dim this operated on was |
Fix #704.
WARNING: This potentially changes the behavior of configs.
E.g. the one reported in #703.
This affects most of the attention and transducer encoders which use initial convolutional layers.
Related is #596. Maybe the fix for that in #683 actually introduced this new bug.