
fix bug when resuming generation with OpenAI models #775

Open · wants to merge 4 commits into base: main
Conversation

RaoulDrake

I've come across what I think is a bug when resuming generation with OpenAI models, i.e., letting a model complete a partial response. Without the fix in this pull request, the partial response is discarded, which is likely not the intended behaviour and can also lead to an exception being raised a little further down the line. With the fix, everything works for me: any partial response is included in the OpenAI API request, and the model successfully completes it.
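For anyone skimming, the essence of the fix can be sketched as follows: any pending partial assistant text has to be sent as the trailing message of the Chat Completions request rather than dropped. This is an illustrative sketch only — `build_messages`, `history`, and `partial_assistant_text` are hypothetical names, not the actual guidance internals:

```python
# Sketch: build a Chat Completions `messages` payload that keeps a
# partial assistant response as the trailing message instead of
# discarding it. Names are illustrative, not guidance internals.

def build_messages(history, partial_assistant_text):
    """history: list of (role, content) tuples for completed turns."""
    messages = [{"role": role, "content": content} for role, content in history]
    if partial_assistant_text:
        # Without the fix, this partial text was dropped; with it, the
        # model is asked to continue from the partial assistant turn.
        messages.append({"role": "assistant", "content": partial_assistant_text})
    return messages

history = [
    ("system", "You are a helpful assistant."),
    ("user", "What is the capital of France?"),
]
payload = build_messages(history, "The capital of France is ")
```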

@Harsha-Nori
Collaborator

@RaoulDrake, we would love this feature, but unfortunately the OpenAI API prohibits pre-filling/partially completing an assistant message for the last turn in a conversation, which significantly hampers our ability to enforce constraints.

There's a chance this has changed recently, but to my understanding, this will run into failures on the OpenAI API for now.

@RaoulDrake
Author

RaoulDrake commented Apr 23, 2024

@Harsha-Nori, thanks for the quick feedback.

Here's a minimal example that works for me, i.e., no troubles on the OpenAI API side and the model successfully completes the response with "Paris":

import os
from guidance import models, gen, system, user, assistant

api_key = os.environ.get('OPENAI_API_KEY')
gpt = models.OpenAI("gpt-3.5-turbo", api_key=api_key, echo=False)

with system():
    lm = gpt + "You are a helpful assistant."

with user():
    lm += "What is the capital of France?"

# Start the assistant turn with a partial response and let the model
# complete it; gen() stops once it has produced the "." suffix.
with assistant():
    lm += "The capital of France is " + gen(name="capital", temperature=0.7, suffix=".")

print(lm, end="\n\n")
print(lm["capital"])

Output:

<|im_start|>system
You are a helpful assistant.<|im_end|><|im_start|>user
What is the capital of France?<|im_end|><|im_start|>assistant
The capital of France is Paris.<|im_end|>

Paris

@riedgar-ms
Collaborator

@RaoulDrake can you add a test to show how this works, please?

@RaoulDrake
Author

@riedgar-ms Sure — would turning the minimal example above into a test case in tests/models/test_openai.py suffice? For anything beyond that, I'm afraid I'm a little short on time at the moment, so I'd have to get back to you in a couple of weeks, if it's still of interest then.

@riedgar-ms
Collaborator

That would be great, thanks!

@codecov-commenter

codecov-commenter commented Apr 25, 2024

Codecov Report

Attention: Patch coverage is 0%, with 3 lines in your changes missing coverage. Please review.

Project coverage is 62.34%. Comparing base (f5ad01d) to head (12ab89b).

Files                         Patch %   Lines
guidance/models/_openai.py    0.00%     3 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #775      +/-   ##
==========================================
- Coverage   69.04%   62.34%   -6.71%     
==========================================
  Files          55       55              
  Lines        4071     4074       +3     
==========================================
- Hits         2811     2540     -271     
- Misses       1260     1534     +274     


@RaoulDrake
Author

@riedgar-ms I have added the minimal example as a test case test_openai_prefill to test_openai.py.

@slundberg
Collaborator

@RaoulDrake as @Harsha-Nori said, we would love to add the ability to set a prefix for OpenAI calls, but unfortunately OpenAI does not support that currently. They do allow you to end your request with an assistant block, but any generation from the model begins a new assistant block (it seems); it does not continue the last one given. This means we sometimes get continuation-like behaviour, but often not:
[Screenshot: example output where the model starts a new assistant turn instead of continuing the prefilled one]
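To make that distinction concrete at the payload level, here is an illustrative sketch (plain data, not the guidance implementation): the request may end with a prefilled assistant message, but the completion comes back as a fresh assistant turn, so the client has to stitch the two together and hope the model actually continued:

```python
# Illustrative only: what "resuming" means at the Chat Completions
# payload level. The request may end with an assistant message, but the
# API returns a *new* assistant message rather than extending the last
# one, so the client must concatenate the two itself.

request_messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the capital of France?"},
    {"role": "assistant", "content": "The capital of France is "},  # prefill
]

# Suppose the API happens to return this as a brand-new assistant turn:
returned_text = "Paris."

# Client-side stitching to approximate a continuation:
resumed = request_messages[-1]["content"] + returned_text
```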
