Table 1 in the paper lists the weights of the reward function. May I ask if these are the weights used during training?