Skip to content

Conversation

Judyxujj
Copy link
Contributor

@Judyxujj Judyxujj commented May 5, 2023

No description provided.

from torch import nn


class ConformerFeedForwardv1(nn.Module):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In our earlier implementation in RETURNN-common, we decided to call this ConformerPositionwiseFeedForward. Why this name change?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The name used here is decided in the last implementation meeting

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So why was it decided that way? Why the name change?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think most of the people including me are not aware of the name ConformerPositionwiseFeedForward is used in RETURNN-common. If we want to use the consistent name as in RETURNN-common, I can change the name here

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mmz33 was working on that, and then me as well. We had lots of earlier discussions on all of this, like naming of modules, etc. So I wonder a bit why we ignore all of that now.

Copy link
Member

@albertz albertz May 5, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For reference: rwth-i6/returnn_common#58

Although this was only the starting point. Since then, it was also further developed.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the reference, I have now also made activation configurable as in RETURNN-common

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would have expected that we stick to our earlier discussions and decisions, and not redo all of those discussions, which are all not really so important anyway.

And introducing inconsistencies to our other existing implementation for no good reason is also not nice.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@albertz in earlier discussions and decisions nearly no one took part. We specifically started over having as much people involved as possible. I definitely see it as a problem that you can not join the Tuesday meeting, and we should find a solution for that, e.g. have another date.

Also if the discussions are not important, then I do not understand the large number of comments from you here.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also if the discussions are not important, then I do not understand the large number of comments from you here.

It should not be inconsistent to the existing implementation.

Also, we already had some experience with the earlier decisions, and some things were changed later on. So now re-deciding that again first takes takes time, and then also ignores that earlier experience.

So, I would stick to the original decisions, and be consistent to the existing implementation, unless there are really good reasons to change some parts.

@JackTemaki
Copy link
Contributor

@albertz please stop adding each comment as itself, and collect them in one review and send them at once.

@albertz
Copy link
Member

albertz commented May 5, 2023

@albertz please stop adding each comment as itself, and collect them in one review and send them at once.

I think that the way I do it is easier, as you can work already as you go. At least, I would prefer it that way, both as the reviewer, and also as the developer.

@vieting
Copy link
Contributor

vieting commented May 5, 2023

@albertz please stop adding each comment as itself, and collect them in one review and send them at once.

I think that the way I do it is easier, as you can work already as you go. At least, I would prefer it that way, both as the reviewer, and also as the developer.

A plus for Nick's approach is that other subscribers don't get spammed with mails for every comment.

@albertz albertz mentioned this pull request May 5, 2023
@SimBe195
Copy link
Contributor

SimBe195 commented May 9, 2023

Some tests should also be added, similar to #4.

@Judyxujj Judyxujj requested review from Atticus1806 and albertz May 12, 2023 13:26
@Judyxujj
Copy link
Contributor Author

Some tests should also be added, similar to #4.

@Judyxujj Judyxujj closed this May 12, 2023
@Judyxujj Judyxujj reopened this May 12, 2023

def __init__(
self,
input_dim: int,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please change the input to take some kind of ConformerFeedForwardConfig (c.f. #8 for inspiration) and also define that Config in this file, as agreed on in the last meeting.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The inputs are defined as Config class now

Copy link
Contributor

@Atticus1806 Atticus1806 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot for the extensive work, looks good!

@Atticus1806 Atticus1806 self-requested a review May 16, 2023 07:01
@curufinwe curufinwe merged commit 0f20821 into main May 18, 2023
@curufinwe curufinwe deleted the jingjing_conformer_FFN branch May 18, 2023 07:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants