suggestion: train.py启动参数优化 #14

yuanzhoulvpi2017 · 2024-12-19T11:52:44Z

Megatron-lm的那套配置config，真的非常让人看懂。model_args、data_args、train_args 这种如果不熟悉的话，很难搞清楚。
我看了一下你们的代码：有create_config.py部分，然后在训练的时候，加载config.json文件？

当前思路确实很不错，但这种多个步骤，还是有点麻烦的。

可以按照hf的这种代码形式，

把相关的参数都是用dataclass定义好：可以写清楚每个变量是用来干嘛的。
相关约束，使用__post_init__约束好：各个参数会互相调整影响，在这里都非常容易做好调整。

# code copy from https://github.com/huggingface/transformers/blob/66ab300aaff9ef509f8736cf186ab9b6a0ef4f3b/examples/pytorch/language-modeling/run_clm.py#L242-L249

    parser = HfArgumentParser((ModelArguments, DataTrainingArguments, TrainingArguments))
    if len(sys.argv) == 2 and sys.argv[1].endswith(".json"):
        # If we pass only one argument to the script and it's the path to a json file,
        # let's parse it to get our arguments.
        model_args, data_args, training_args = parser.parse_json_file(json_file=os.path.abspath(sys.argv[1]))
    else:
        model_args, data_args, training_args = parser.parse_args_into_dataclasses()

期待这个仓库越来越好～

The text was updated successfully, but these errors were encountered:

zzhhjjj · 2024-12-19T13:48:50Z

Thanks for your advice! I agree that we can make it clearer and easier to use

3outeille added the enhancement New feature or request label Dec 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

suggestion: train.py启动参数优化 #14

suggestion: train.py启动参数优化 #14

yuanzhoulvpi2017 commented Dec 19, 2024

zzhhjjj commented Dec 19, 2024

suggestion: train.py启动参数优化 #14

suggestion: train.py启动参数优化 #14

Comments

yuanzhoulvpi2017 commented Dec 19, 2024

zzhhjjj commented Dec 19, 2024