We should limit the length of content sent to the OpenAI model based on the supported token limit. #113

@eamonoreilly

Description

What language are you using?

Dotnet (OOP)

Expected Behavior

Can perform a chat session without receiving a token length limit error. I would expect the binding to trim the content so that it stays under the limit supported by the LLM deployment model.

Actual Behavior

```
Exception while executing function: Functions.chatQuery This model's maximum context length is 4096 tokens. However, your messages resulted in 4109 tokens (4046 in the messages, 63 in the functions). Please reduce the length of the messages or functions.
Status: 400 (model_error)
ErrorCode: context_length_exceeded

Content:
{
  "error": {
    "message": "This model's maximum context length is 4096 tokens. However, your messages resulted in 4109 tokens (4046 in the messages, 63 in the functions). Please reduce the length of the messages or functions.",
    "type": "invalid_request_error",
    "param": "messages",
    "code": "context_length_exceeded"
  }
}
```
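For context, the failure is simple arithmetic: 4046 message tokens plus 63 function-definition tokens is 4109, which already exceeds the 4096-token window before any room is reserved for the completion. The extension could catch this before calling the service by counting tokens itself. A minimal sketch, assuming the SharpToken NuGet package and that `cl100k_base` is the right encoding for the deployed model; the variable names here are illustrative, not part of the extension:

```csharp
using System;
using SharpToken;

// Count tokens roughly the way the model will, so the budget can be
// checked before the request is sent. Note: OpenAI adds a few tokens of
// per-message overhead, so treat this as an approximation.
var encoding = GptEncoding.GetEncoding("cl100k_base");

string chatHistoryText = "...all chat messages, serialized...";  // illustrative placeholder
int messageTokens = encoding.Encode(chatHistoryText).Count;
int functionTokens = 63;          // from the error above: tokens used by function definitions
const int ContextWindow = 4096;

if (messageTokens + functionTokens > ContextWindow)
{
    Console.WriteLine(
        $"Over budget: {messageTokens + functionTokens} > {ContextWindow}; trim history first.");
}
```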

Host.json

No response

Steps to Reproduce

1. Create a long chat session.
2. At some point, the request fails with the `context_length_exceeded` error shown above.

It appears that we retrieve the entire chat history and send it to OpenAI for conversation context. We need to limit how much history we send so that the request stays below the model's context length. A possible trimming approach is sketched below.
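A minimal sketch of what that trimming could look like. The type and helper names below (`ChatMessageEntry`, `TrimToTokenBudget`) are hypothetical, not part of the extension's API, and the character-based token estimate is a stand-in for a real tokenizer such as SharpToken:

```csharp
using System.Collections.Generic;
using System.Linq;

public record ChatMessageEntry(string Role, string Content);

public static class ChatHistoryTrimmer
{
    // Rough heuristic: ~4 characters per token for English text, plus a
    // small per-message overhead. A production fix should count tokens
    // with the model's actual tokenizer instead of estimating.
    private static int EstimateTokens(ChatMessageEntry m) =>
        (m.Role.Length + m.Content.Length) / 4 + 4;

    // Keep the most recent messages that fit within the budget, reserving
    // headroom for the function definitions and the model's response.
    public static IReadOnlyList<ChatMessageEntry> TrimToTokenBudget(
        IReadOnlyList<ChatMessageEntry> history,
        int contextWindow = 4096,
        int reservedTokens = 1024)
    {
        int budget = contextWindow - reservedTokens;
        var kept = new List<ChatMessageEntry>();
        int used = 0;

        // Walk newest-first so the latest turns survive...
        foreach (var message in history.Reverse())
        {
            int cost = EstimateTokens(message);
            if (used + cost > budget)
                break;
            kept.Add(message);
            used += cost;
        }

        // ...then restore chronological order before sending to the model.
        kept.Reverse();
        return kept;
    }
}
```

A smarter variant would pin the initial system message and possibly summarize dropped turns instead of discarding them, but even simple truncation like this would avoid the 400 above.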

Relevant code being tried

No response

Relevant log output

No response

Where are you facing this problem?

Local - Core Tools

Additional Information

No response

Labels

bug (Something isn't working), func-ai
