Add PRC models to test_text_generation_example.py #1695

Open · wants to merge 3 commits into base: main
Conversation

wenbinc-Bin (Contributor)

What does this PR do?

Add the following Chinese models to test_text_generation_example.py:

Qwen/Qwen1.5-14B
Qwen/Qwen1.5-32B
THUDM/chatglm2-6b
Qwen/Qwen2.5-7B
Qwen/Qwen2.5-14B
Qwen/Qwen2.5-32B
Qwen/Qwen1.5-72B
Qwen/Qwen2-72B
Qwen/Qwen2.5-72B

wenbinc-Bin requested a review from regisss as a code owner on January 14, 2025 at 05:49
yafshar (Contributor) commented Jan 14, 2025

@wenbinc-Bin, could you please post the test results for these additions?

yafshar (Contributor) left a comment

LGTM!

Hi @regisss, this PR is ready for your final review. Could you please take a look?

yafshar (Contributor) commented Jan 14, 2025

>>> RUN_SLOW=true GAUDI2_CI=1 python -m pytest tests/test_text_generation_example.py -s -v -k qwen --token $(TOKEN)

12 passed, 64 deselected, 1 warning in 640.85s (0:10:40)

wenbinc-Bin (Contributor Author)

Also chatglm2:

>>> GAUDI2_CI=1 python -m pytest tests/test_text_generation_example.py -s -v -k "THUDM/chatglm2-6b"

1 passed, 75 deselected, 1 warning in 26.50s

yafshar (Contributor) commented Jan 16, 2025

@wenbinc-Bin, do we need to add all of these tests? Maybe we can remove Qwen/Qwen1.5-14B and Qwen/Qwen2.5-14B if they are not necessary.

Signed-off-by: Chen, Wenbin <[email protected]>
wenbinc-Bin (Contributor Author)

> @wenbinc-Bin do we need to add all the tests? Maybe we can remove Qwen/Qwen1.5-14B & Qwen/Qwen2.5-14B if not necessary

I can remove them. The 14B models are relatively less important.

yafshar (Contributor) left a comment

Hi @regisss, this PR is ready for your final review. Could you please take a look?

@@ -41,6 +41,7 @@
("bigcode/starcoder2-3b", 1, False, 261.07213776344133, True),
("adept/persimmon-8b-base", 4, False, 366.73968820698406, False),
("Qwen/Qwen1.5-7B", 4, False, 490.8621617893209, False),
("Qwen/Qwen1.5-32B", 4, False, 120, False),
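For context on how the `-k qwen` run above picks up these new entries: pytest matches the keyword against each parameterized test ID, which includes the model name from the tuple. A minimal self-contained sketch of that selection behavior (the tuple field names below are assumptions for illustration, not the test file's actual layout):

```python
# Sketch: how a parameterized model table can be filtered the way
# `pytest -k qwen` filters test IDs. Field names are assumptions.
MODELS = [
    # (model_name, world_size, use_deepspeed, baseline_throughput)
    ("Qwen/Qwen1.5-7B", 4, False, 490.8621617893209),
    ("Qwen/Qwen1.5-32B", 4, False, 120.0),
    ("THUDM/chatglm2-6b", 1, False, 100.0),  # baseline here is illustrative
]

def select(models, keyword):
    """Return entries whose model name contains `keyword`,
    case-insensitively, mirroring pytest's `-k` substring match."""
    return [m for m in models if keyword.lower() in m[0].lower()]

print([m[0] for m in select(MODELS, "qwen")])
```

This substring matching is why the single `-k qwen` invocation above exercised 12 tests at once, while `-k "THUDM/chatglm2-6b"` selected exactly one.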
Collaborator:

Why do we need both the 7B and 32B variants of Qwen1.5 if we already have Qwen2.5?

Contributor Author:

Because we said we support these models, we need to make sure there are no regressions. We don't know which models users are using.

Collaborator:

One model of each type is enough; we can't test all of them. But if you want to test each of them yourself for every release, you can do that.

Comment on lines 61 to 62
("Qwen/Qwen2.5-7B", 4, False, 490, False),
("Qwen/Qwen2.5-32B", 4, False, 120, False),
Collaborator:

Can you remove the 7B model from the multi test?

Contributor Author:

Do you mean remove "Qwen/Qwen2.5-7B"? I think we need to test small models: customers use small models to prototype and build their apps.

@@ -87,6 +91,9 @@
("meta-llama/Meta-Llama-3-70B-Instruct", 8, 1, 64),
("facebook/opt-66b", 2, 1, 28.48069266504111),
("google/gemma-2-9b", 8, 1, 110.12610917383735),
("Qwen/Qwen1.5-72B", 2, 1, 26),
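The numeric field in each tuple above (e.g. 26 for Qwen/Qwen1.5-72B) appears to serve as a throughput baseline. A hedged sketch of the kind of regression check such a baseline implies; the tolerance value and the comparison rule here are assumptions, not the test file's actual logic:

```python
# Sketch: compare a measured throughput against a stored baseline,
# the kind of regression check a baseline number in the table implies.
# TOLERANCE and the comparison rule are assumptions for illustration.
TOLERANCE = 0.95  # allow measured throughput up to 5% below baseline

def meets_baseline(measured_tps: float, baseline_tps: float,
                   tolerance: float = TOLERANCE) -> bool:
    """Return True if measured tokens/s is within tolerance of baseline."""
    return measured_tps >= baseline_tps * tolerance

# Example using the assumed baseline of 26 tokens/s for Qwen/Qwen1.5-72B
print(meets_baseline(25.0, 26.0))  # 25.0 >= 24.7, so True
print(meets_baseline(20.0, 26.0))  # 20.0 < 24.7, so False
```

A check of this shape is what turns the hard-coded numbers into regression guards: a run that dips below the stored value fails the test.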
Collaborator:

Why do we need Qwen1.5 and Qwen2 if we have Qwen2.5?

Contributor Author:

Because we said we support these models, we need to make sure there are no regressions. We don't know which models users are using.

yafshar (Contributor) commented Jan 30, 2025

@wenbinc-Bin, can you please remove the extra ones as Libin suggested and keep only one model of each type, so we can merge and close this PR? Thanks!

wenbinc-Bin (Contributor Author)

OK, I have updated the PR.
