Hi @fkryan,
Thank you all for this interesting project :)
I have gone through all the demos, including the Colab Demo Notebook, and obtained the following metrics on the GazeFollow dataset: AUC = 0.9584406567323296, Avg L2 = 0.0984195497739822, Min L2 = 0.04108351483186137. However, I am wondering how you generated the video of the three actors in 'The Office', i.e., the video clip (not the .png).
Thanks