Dynamically assert model temperature value in argparser #856
base: main
Conversation
Hi @DonggeLiu, could you please review this when you get a chance? Many thanks in advance!
Thanks @dberardi99.
I left a comment to clarify a bit more about the task.
Please let me know if that makes sense : )
llm_toolkit/models.py (outdated)

@@ -46,7 +46,7 @@
 # Model hyper-parameters.
 MAX_TOKENS: int = 2000
 NUM_SAMPLES: int = 1
-TEMPERATURE: float = 0.4
+TEMPERATURE: float = 1.0
Oops, let's keep the default temperature the same for now to avoid causing surprising results in other people's recent experiments.
We can grid search for the best default values for our use case later.
Perfect, I got it. I'll reset it to its prior value.
run_all_experiments.py (outdated)

@@ -279,6 +279,9 @@ def parse_args() -> argparse.Namespace:

   if args.temperature:
     assert 2 >= args.temperature >= 0, '--temperature must be within 0 and 2.'

+  if args.temperature == TEMPERATURE and args.model in models.LLM.all_llm_names():
+    args.temperature = run_one_experiment.get_model_temperature(args)
I reckon the issue we are solving has two tasks:
- Main: Different models have different temperature ranges. We want to assert that, if the user specified a temperature in the args, it falls within the corresponding model's range.
- Minor: Define a default temperature for each model.
Let's solve the main task first:
1.1. Define the temperature range for each model class in https://github.com/google/oss-fuzz-gen/blob/main/llm_toolkit/models.py. Use inheritance to minimize the changes needed.
1.2. Replace this hardcoded assertion with a dynamic assertion based on the model name:
oss-fuzz-gen/run_all_experiments.py, lines 280 to 281 (at 33bddff):

if args.temperature:
  assert 2 >= args.temperature >= 0, '--temperature must be within 0 and 2.'
Then we can work on the minor task:
- Add default temperatures under each class.
- Set that default temperature as the value here, if the user did not specify one.
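The two-step plan above could be sketched roughly as follows. This is a hedged illustration, not the real models.py: the subclass names, ranges, and the `check_temperature` helper are hypothetical stand-ins.

```python
# Step 1.1: per-model temperature ranges via inheritance.
# Step 1.2: a dynamic assertion keyed on the model name.
TEMPERATURE: float = 0.4
TEMPERATURE_RANGE: list[float] = [0.0, 2.0]


class LLM:
  name = 'base-llm'
  temperature_range: list[float] = TEMPERATURE_RANGE  # default range


class WideRangeModel(LLM):
  name = 'wide-range-model'
  # Inherits the default [0.0, 2.0] range from LLM unchanged.


class NarrowRangeModel(LLM):
  name = 'narrow-range-model'
  temperature_range = [0.0, 1.0]  # this family only accepts [0, 1]


def check_temperature(model_name: str, temperature: float) -> None:
  """Dynamic replacement for the hardcoded 0 <= t <= 2 assertion."""
  ranges = {c.name: c.temperature_range
            for c in (WideRangeModel, NarrowRangeModel)}
  low, high = ranges.get(model_name, TEMPERATURE_RANGE)
  assert low <= temperature <= high, (
      f'--temperature must be within {low} and {high}.')
```

With this, `check_temperature('narrow-range-model', 1.5)` raises an AssertionError while the wide-range model accepts the same value.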
Everything's clear! Just one thing: do you prefer to solve only the different temperature ranges in this PR, or to implement both the temperature ranges and the default temperatures here?
Thanks @dberardi99, some nits.
@@ -61,6 +61,8 @@ class LLM:

   _max_attempts = 5  # Maximum number of attempts to get prediction response

+  temperature_range: list[float] = [0.0, 2.0]  # Default model temperature range
Let's keep this attribute but make the value a constant like TEMPERATURE
defined at the top of the file:
temperature_range: list[float] = TEMPERATURE_RANGE
if (hasattr(subcls, 'temperature_range') and hasattr(subcls, 'name')
    and subcls.name != AIBinaryModel.name):
  ranges[subcls.name] = subcls.temperature_range
return ranges
IIUC, you are replicating all_llm_names.
Would it be more maintainable and extensible to extract and reuse the repeated logic of these two functions? E.g., a function that returns all models, so that we can reuse it to acquire other attributes in the future.
@classmethod
def _all_llm_models(cls):
  """Returns a list of LLM model classes that have a `name` attribute
  and are not `AIBinaryModel`."""
  models = []
  for subcls in cls.all_llm_subclasses():
    # May need a different filter logic here.
    if subcls.name != AIBinaryModel.name:
      models.append(subcls)
  return models

@classmethod
def all_llm_names(cls) -> list[str]:
  """Returns the current model name and all child model names."""
  return [m.name for m in cls._all_llm_models()]

@classmethod
def all_llm_temperature_ranges(cls) -> dict[str, list[float]]:
  """Returns the current model name and all child model temperature ranges."""
  return {
      m.name: m.temperature_range
      for m in cls._all_llm_models()
      if hasattr(m, 'temperature_range')
  }
Feel free to adjust/simplify these functions as you see fit, particularly the filtering logic.
The above are just examples.
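Put together, a self-contained version of the suggested refactor might look like this. The `all_llm_subclasses` walk, `AIBinaryModel`, and the sample subclass below are assumed stand-ins for the real ones in models.py, included only so the sketch runs on its own.

```python
class LLM:
  name = 'base-llm'
  temperature_range: list[float] = [0.0, 2.0]

  @classmethod
  def all_llm_subclasses(cls):
    """Yield this class and all transitive subclasses."""
    yield cls
    for sub in cls.__subclasses__():
      yield from sub.all_llm_subclasses()

  @classmethod
  def _all_llm_models(cls):
    """All LLM classes except AIBinaryModel (the shared filter logic)."""
    return [s for s in cls.all_llm_subclasses()
            if s.name != AIBinaryModel.name]

  @classmethod
  def all_llm_names(cls) -> list[str]:
    """Returns the current model name and all child model names."""
    return [m.name for m in cls._all_llm_models()]

  @classmethod
  def all_llm_temperature_ranges(cls) -> dict[str, list[float]]:
    """Returns each model's name mapped to its temperature range."""
    return {m.name: m.temperature_range
            for m in cls._all_llm_models()
            if hasattr(m, 'temperature_range')}


class AIBinaryModel(LLM):
  name = 'ai-binary-model'  # excluded by the shared filter


class SampleModel(LLM):
  name = 'sample-model'
  temperature_range = [0.0, 1.0]  # hypothetical narrower range
```

Both public classmethods now share `_all_llm_models`, so a future attribute lookup (e.g. per-model defaults) only needs one more small classmethod.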
ranges = models.LLM.all_llm_temperature_ranges()
assert ranges[args.model][1] >= args.temperature >= ranges[args.model][0], (
    f'--temperature must be within {ranges[args.model][0]} and '
    f'{ranges[args.model][1]}.')
Add "... for model {args.model}" to the assertion message, to specify the model name we parsed.
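With that nit applied, the assertion might read as follows. This is a sketch only; the `validate_temperature` helper name is hypothetical, and `ranges` follows the `all_llm_temperature_ranges` shape discussed above.

```python
import argparse


def validate_temperature(args: argparse.Namespace,
                         ranges: dict[str, list[float]]) -> None:
  """Assert the parsed temperature falls in the model's range,
  naming the model in the error message.

  `ranges` maps model name -> [low, high].
  """
  low, high = ranges[args.model]
  assert low <= args.temperature <= high, (
      f'--temperature must be within {low} and {high} '
      f'for model {args.model}.')
```

A failing check then reports, e.g., "--temperature must be within 0.0 and 1.0 for model sample-model.", which tells the user exactly which model's range was violated.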
A new method get_model_temperature(args: argparse.Namespace) has been added to hardcode the model temperature value based on the model name. The chat-session models (namely the ones whose names end with "-chat" or "-azure") are treated as their corresponding base models. The temperature values can be found in the data table in #366 (comment).

The temperature value is automatically aligned to the chosen model's value only if no temperature has been set (args.temperature == TEMPERATURE). If an unrecognized model name is supplied (args.model in models.LLM.all_llm_names() is false), the above method is skipped and the temperature is left unchanged at its default value.

In addition, the default temperature has been changed to 1.0, since that is the value for the default model (vertex_ai_gemini-1-5).
Fix #366
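The suffix-stripping fallback described in the PR summary could look roughly like this. It is a hedged sketch: the real get_model_temperature lives in run_one_experiment and takes the full args namespace, and the defaults table here is hypothetical.

```python
def get_model_temperature(model_name: str,
                          defaults: dict[str, float]) -> float:
  """Look up a per-model default temperature.

  Chat-session variants (names ending in '-chat' or '-azure') fall
  back to their base model, as described in the PR summary.
  """
  base = model_name
  for suffix in ('-chat', '-azure'):
    if base.endswith(suffix):
      base = base[:-len(suffix)]
      break
  # 1.0 is the fallback, matching the default model's temperature.
  return defaults.get(base, 1.0)
```

For example, a '-chat' variant of a known base model resolves to the base model's temperature, while an unknown name falls back to 1.0.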