How can the cutoff be determined for a different dataset? Could you suggest any documentation or a repo that covers this?