New Feature: Function calling w/ Context #9135

brandonh-msft · 2024-10-07T15:18:09Z

Scenario

Using auto-invoke tool calls
User uploads files into a chat with a multi-agent system (implemented via Function Calling between Orchestrator and multiple, described, "agents" as Functions)
Orchestrator determines Agent X is best suited to answer the user
Agent X requires the files from the user to fully complete its job

Today, it seems the only things I can pass to a function call are JSON-serializable objects. However, in this scenario, I can send the LLM the image data via ImageContent message items but the LLM can only come back with, say, a filename that I surround this content with - not the content itself. This means that even if my Agent Function takes BinaryData as an input, the LLM doesn't provide it in its function call request. It can provide something like "Filename: X.png" which I could correlate back to the user's input, though, but I am unable to get back to that input to use it in the actual call to the agent.

So, enabling some way for me to get back to the original message which triggered the function call request is necessary for this scenario. I'm thinking something along the lines of an auto-injected parameter to the function, if the developer provides it on the signature (a la CancellationToken)

The text was updated successfully, but these errors were encountered:

markwallace-microsoft · 2024-10-08T06:33:29Z

@brandonh-msft you could try create an Auto Function Invocation Filter, this will give you access to the function that is being called and also the associated Chat History, see https://github.com/microsoft/semantic-kernel/blob/main/dotnet/samples/Concepts/Filtering/AutoFunctionInvocationFiltering.cs, you could pass the ImageContent via KernelArguments for your specific function.

Let me know how this goes.

brandonh-msft · 2024-10-08T15:56:36Z

@brandonh-msft you could try create an Auto Function Invocation Filter, this will give you access to the function that is being called and also the associated Chat History, see https://github.com/microsoft/semantic-kernel/blob/main/dotnet/samples/Concepts/Filtering/AutoFunctionInvocationFiltering.cs, you could pass the ImageContent via KernelArguments for your specific function.

Let me know how this goes.

I was with you until "pass the ImageContent via KernelArguments for your specific function"

When observing context.ChatHistory, I see the message from the user w/ all the files attached as ImageContent items. So they're already there. This leaves me with a few questions:

Where would I put these same items so they are accessible from the function that is the target of context.Function? If, as you suggest, I copy them over to context.Arguments then...
How would I get to the items from within the Function? As far as I know, the only things I have access to are the parameter inputs to the function which are just primitives.
What does this mean for scale & concurrency? e.g. if I'm adding them to kernel arguments and my Kernel is a singleton in the app, am I corrupting other requests?

It seems to me like we need a pattern akin to Azure Functions' FunctionContext as an always-available parameter on a function call. As it is right now, when SK gets the request from the LLM to make a tool call, it loses context of the conversation as a whole when making the call, leaving only what the LLM deems necessary for the function to execute; this seems to be the gap. Said another way, if I could add the AutoFunctionInvocationContext as a parameter on my Function so that I can get to everything that was in it, that would be great.

Thanks

brandonh-msft · 2024-10-08T16:45:24Z

Here's what I came up with, would love your thoughts:

Filter:

internal class ContextPopulateInvocationFilter : IAutoFunctionInvocationFilter
{
    private const string FunctionContextKey = "__functionContext";
    public Task OnAutoFunctionInvocationAsync(AutoFunctionInvocationContext context, Func<AutoFunctionInvocationContext, Task> next)
    {
        if (context.Arguments is not null)
        {
            context.Arguments[FunctionContextKey] = context;
        }

        return next(context);
    }
}

Function registration:

_kernel.CreateFunctionFromMethod(async (Guid conversationId, AutoFunctionInvocationContext __functionContext) => {
...
}, parameters: [new ("conversationId") { IsRequired = true, ParameterType = typeof(Guid), Description = "The ID of the Conversation as specified by the initial context to the system" }]);

I have access to the entirety of the invocation context in this manner. Would there be any unintended side effects of this approach?

The biggest caveat is that I am dependent upon .Arguments being not null, though it is declared as nullable. I cannot set its value to a new KernelArguments() because it's an init-only setter. So if it comes in null and the target function needs __functionContext, it's SOL.

brandonh-msft · 2024-10-08T17:08:03Z

The above approach resulted in me getting a serialization error on cancellationToken - so there must be one buried in the context somewhere. Trying to pull just Chat History off the arguments (since that's all I need) resulted in a serialization overflow. Still trying.

brandonh-msft · 2024-10-08T17:16:38Z

So, adding the whole Chat History resulted in recursive function invocations - I'm guessing because the history included items that requested tool calls. So, instead of passing the whole thing I simply pulled out all assistant and user messages that didn't have tool calls requested. This seems to have done the trick. If I need anything else from the Context, I'll have to manually add it to arguments as well.

If we could get Context to be serializable (not hit the cancellationToken issue) or create a purpose-built context for this, it seems that'd be much more ideal.

github-actions · 2025-01-08T02:02:02Z

This issue is stale because it has been open for 90 days with no activity.

github-actions · 2025-01-24T02:00:25Z

This issue was closed because it has been inactive for 14 days since being marked as stale.

brandonh-msft · 2025-01-28T00:56:58Z

brandonh-msft changed the title ~~New Feature: Function calling w/ File Content~~ New Feature: Function calling w/ Context Oct 7, 2024

markwallace-microsoft added the triage label Oct 7, 2024

evchaki added Agent Framework and removed triage labels Oct 8, 2024

crickman added agents and removed Agent Framework labels Oct 9, 2024

github-actions bot added the stale Issue is stale because it has been open for 90 days with no activity label Jan 8, 2025

github-actions bot closed this as completed Jan 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Feature: Function calling w/ Context #9135

New Feature: Function calling w/ Context #9135

brandonh-msft commented Oct 7, 2024

markwallace-microsoft commented Oct 8, 2024

brandonh-msft commented Oct 8, 2024 •

edited

Loading

brandonh-msft commented Oct 8, 2024 •

edited

Loading

brandonh-msft commented Oct 8, 2024

brandonh-msft commented Oct 8, 2024 •

edited

Loading

github-actions bot commented Jan 8, 2025

github-actions bot commented Jan 24, 2025

brandonh-msft commented Jan 28, 2025

New Feature: Function calling w/ Context #9135

New Feature: Function calling w/ Context #9135

Comments

brandonh-msft commented Oct 7, 2024

Scenario

markwallace-microsoft commented Oct 8, 2024

brandonh-msft commented Oct 8, 2024 • edited Loading

brandonh-msft commented Oct 8, 2024 • edited Loading

brandonh-msft commented Oct 8, 2024

brandonh-msft commented Oct 8, 2024 • edited Loading

github-actions bot commented Jan 8, 2025

github-actions bot commented Jan 24, 2025

brandonh-msft commented Jan 28, 2025

brandonh-msft commented Oct 8, 2024 •

edited

Loading

brandonh-msft commented Oct 8, 2024 •

edited

Loading

brandonh-msft commented Oct 8, 2024 •

edited

Loading