按照TinyLLM的readme教程,对模型进行训练,到20000多steps就出现了nan,修改了lr好像还是有问题 <img width="696" height="547" alt="Image" src="https://github.com/user-attachments/assets/129d6fab-5e42-46fc-8d01-3b85ef7ec27c" />