(EAI-1187) Instruct chatbot on handling fetch_page fallback #897

hschawe · 2025-08-22T16:22:18Z

Jira: https://jira.mongodb.org/browse/EAI-1187

Changes

Let user know that we searched after fetch_page fallback
Also added a couple new prompt adherence eval cases for the new instruction
Summary of the prompt changes & why they're needed:
1. OpenAI's GPT-4.1 prompting guide recommends to put all tool instructions in the tool descriptions, and I saw a notable increase across all metrics after doing this (for fetch_page and prompt adherence eval cases)
2. There were still some issues with selecting the right tool & formatting the responses, so I put the "coordination" instructions in the system prompt, which improved prompt adherence & the tool call metrics
3. Repeating instructions greatly helps the LLM follow instructions when it comes to tool instructions. I tested out some search fallback changes (which did not make it into this PR) and the fetch_page assumption instructions (which did) and in both cases having an extra line of instruction really helped the model use the tools correctly

Notes

Evaluation runs for this change compared to main
The LLM isn't always great at always giving the fallback response instruction in only the {fallback_to_search} case. Sometimes when fetch_page has to truncate the page, the LLM will sometimes say it couldn't use the page, even if it didn't call the search_tool.

…h_page calls

mongodben

small request for change

packages/chatbot-server-mongodb-public/src/systemPrompt.ts

mongodben

LGTM, great prompt engineering work here!

1 small suggestion

mongodben · 2025-08-28T18:40:26Z

OpenAI's GPT-4.1 prompting guide recommends to put all tool instructions in the tool descriptions, and I saw a notable increase across all metrics after doing this (for fetch_page and prompt adherence eval cases)

great, in the prompting guide and evals i trust.

interesting that this is the optimal pattern now. it actually used to be that the API had a hard cap on how long the description could be, and you were supposed to put all this info in the system prompt. wonder when it changed.

There were still some issues with selecting the right tool & formatting the responses, so I put the "coordination" instructions in the system prompt, which improved prompt adherence & the tool call metrics

sounds good

Repeating instructions greatly helps the LLM follow instructions when it comes to tool instructions. I tested out some search fallback changes (which did not make it into this PR) and the fetch_page assumption instructions (which did) and in both cases having an extra line of instruction really helped the model use the tools correctly

👍

The LLM isn't always great at always giving the fallback response instruction in only the {fallback_to_search} case. Sometimes when fetch_page has to truncate the page, the LLM will sometimes say it couldn't use the page, even if it didn't call the search_tool.

sorry i dont really understand what you mean here. can you explain a bit more?

and, is this an issue that we should address? if so, with what priority? should there be a follow up ticket?

Co-authored-by: Ben Perlmutter <[email protected]>

hschawe · 2025-08-28T19:08:34Z

The LLM isn't always great at always giving the fallback response instruction in only the {fallback_to_search} case. Sometimes when fetch_page has to truncate the page, the LLM will sometimes say it couldn't use the page, even if it didn't call the search_tool.

sorry i dont really understand what you mean here. can you explain a bit more?

and, is this an issue that we should address? if so, with what priority? should there be a follow up ticket?

when fetch_page is used on a long page (>150,000 characters), the page is truncated and we do an on-page search for relevant content. the LLM knows that this search is happening, so sometimes - not always - it will give the fallback disclaimer, even though it didn't fall back to the search_content tool.

i don't think this is that big of an issue especially since it doesn't affect how the LLM uses the tools, but it would be good to dedicate some time to this on a separate ticket (EAI-1288)

hschawe added 2 commits August 22, 2025 09:05

fetch_page fallback disclaimer instruction

a5a89b9

Remove unused str & tweaks to make prompt encapsulate successful fetc…

d65ddcb

…h_page calls

hschawe marked this pull request as ready for review August 22, 2025 19:07

mongodben requested changes Aug 25, 2025

View reviewed changes

packages/chatbot-server-mongodb-public/src/systemPrompt.ts Outdated Show resolved Hide resolved

PR review - importantNote shouldn't be in the systemPromptContent

20d1717

hschawe requested a review from mongodben August 25, 2025 20:17

mongodben reviewed Aug 25, 2025

View reviewed changes

packages/chatbot-server-mongodb-public/src/systemPrompt.ts Outdated Show resolved Hide resolved

hschawe added 9 commits August 26, 2025 10:08

Add 1 prompt adherence eval case for fallback instruction

b675ff5

Merge branch 'main' of github.com:mongodb/chatbot into EAI-1187

cc3f35b

Test run: Tool instructions in tool descriptions

06ed18c

Making progress

81758d1

Add missing file

24e0023

No matching URL fallback case only

0d1cf60

Minor but impactful wording change

255c998

Preface instructions & run all conversation evals

50cbf61

Clarify fetch_page vs. search_content confusion

bf440dc

mongodben reviewed Aug 28, 2025

View reviewed changes

packages/chatbot-server-mongodb-public/src/systemPrompt.ts Outdated Show resolved Hide resolved

mongodben approved these changes Aug 28, 2025

View reviewed changes

PR review: string interpolation of tool names

e129f81

Co-authored-by: Ben Perlmutter <[email protected]>

mongodben mentioned this pull request Aug 28, 2025

(EAI-1285): [docs] Custom prompt guidance + custom tool instructions update #906

Merged

hschawe merged commit a00268b into main Aug 28, 2025
2 checks passed

hschawe deleted the EAI-1187 branch August 28, 2025 19:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(EAI-1187) Instruct chatbot on handling fetch_page fallback #897

(EAI-1187) Instruct chatbot on handling fetch_page fallback #897

Uh oh!

hschawe commented Aug 22, 2025 •

edited

Loading

Uh oh!

mongodben left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mongodben left a comment

Uh oh!

mongodben commented Aug 28, 2025

Uh oh!

hschawe commented Aug 28, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

(EAI-1187) Instruct chatbot on handling fetch_page fallback #897

(EAI-1187) Instruct chatbot on handling fetch_page fallback #897

Uh oh!

Conversation

hschawe commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Notes

Uh oh!

mongodben left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mongodben left a comment

Choose a reason for hiding this comment

Uh oh!

mongodben commented Aug 28, 2025

Uh oh!

hschawe commented Aug 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hschawe commented Aug 22, 2025 •

edited

Loading

hschawe commented Aug 28, 2025 •

edited

Loading