What is the Top-1 accuracy of gen mean and gen tail across the 10 scales of the goal image in both stages? Also, what is the interval between sampled frames?