DTA_task2_baseline

Baseline method for DTA Dialogue Summarization task.

Setup

Install dependencies

Please install all the dependency packages using the following command:

pip install -r requirements.txt

Quick Start

Download dataset (passord: 7mvn) and unzip it under the data folder.
Download the pretrained model mbart-large-50.
Execute command python3 preprocess.py to generate data for model training. This will generate train.jsonl and dev.jsonl in the data folder. Note that it will take few minutes.

You can execute bash train.sh or the following command to train an baseline model.

python3 -u pipeline.py \
    --do_train \
    --do_eval \
    --src_lang zh_CN \
    --tgt_lang zh_CN \
    --train_filename data/train.jsonl \
    --val_filename data/dev.jsonl \
    --max_src_len ${max_src_len} \
    --max_tgt_len ${max_tgt_len} \
    --remark ${remark} \
    --pretrained_model_path ${save_dir} \
    --vocab_path ${vocab_dir} \
    --save_dir ${save_dir} \
    --batch_size ${batch_size} \
    --num_train_epochs ${iter} \
    --skip_eval_epochs ${skip_iter} \
    --learning_rate ${learning_rate}

You can execute bash test.sh or the following command to generate dialogue summary by the model trained before. And you will get your generated results in the test.pred file.

python3 -u pipeline.py \
    --do_test \
    --src_lang zh_CN \
    --tgt_lang zh_CN \
    --test_filename data/dev.jsonl \
    --max_src_len ${max_src_len} \
    --max_tgt_len ${max_tgt_len} \
    --remark ${remark} \
    --pretrained_model_path ${model_dir} \
    --vocab_path ${vocab_dir} \
    --save_dir ${save_dir} \
    --batch_size ${batch_size} \
    --num_train_epochs ${iter} \
    --skip_eval_epochs ${skip_iter} \
    --learning_rate ${learning_rate}

Bug or Questions?

If you have any questions about the code, please open an issue or contact us by [email protected].

Please try to specify the problem with details so we can help you better and quicker!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DTA_task2_baseline

Setup

Install dependencies

Quick Start

Bug or Questions?

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
ModelUtils.py		ModelUtils.py
README.md		README.md
args.py		args.py
pipeline.py		pipeline.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
test.sh		test.sh
train.sh		train.sh

Folders and files

Latest commit

History

Repository files navigation

DTA_task2_baseline

Setup

Install dependencies

Quick Start

Bug or Questions?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages