Design philosophy behind removal of EMA from Corrdiff #1342
Unanswered
joshdorrington asked this question in Q&A
Replies: 0 comments
Hi,
I see that commit #604 removed EMA from the training loop for the Corrdiff model. The paper still mentions the use of EMA, which is the same training approach used in the ESD paper, so would anyone be able to explain the reasoning behind dropping it? I looked for discussion of this in the GitHub history but didn't find anything.
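For context, by EMA I mean keeping an exponential moving average of the model weights alongside the trained weights and using the averaged copy at inference time. Below is a minimal sketch in plain Python of that general idea. It is not the Corrdiff code that was removed; the `ema_update` name, the list-of-floats "weights", and the `decay` value are all illustrative assumptions.

```python
def ema_update(ema_params, params, decay=0.999):
    """Blend current weights into the running average.

    Hypothetical helper: new_ema = decay * ema + (1 - decay) * current.
    """
    return [decay * e + (1.0 - decay) * p for e, p in zip(ema_params, params)]


# Toy "training loop": the raw weights change every step,
# while the EMA copy follows them more slowly.
params = [0.0, 0.0]
ema_params = list(params)  # EMA copy starts equal to the initial weights
for step in range(3):
    params = [p + 1.0 for p in params]  # stand-in for an optimizer step
    ema_params = ema_update(ema_params, params, decay=0.5)
```

With a decay close to 1 the averaged weights lag the raw weights more, which is the sensitivity I would like to understand.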
Thanks for any thoughts! I am applying Corrdiff to different problems from those in the original paper, so I'm trying to get as good a sense as I can of the sensitivities and rationale behind the parameter choices :)