api: RequestCost configurations #103

mathetake · 2025-01-16T02:26:16Z

This adds the RequestCost field to AIGatewayRoute,
which will allow users to do rate limiting etc.
based on the calculated "token usage".

This is based on the new feature introduced in

and because of that feature, the only thing we have to do
on the AI Gateway side is to set dynamic metadata as per
the comment in the API.

Signed-off-by: Takeshi Yoneda <[email protected]>
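
To make the description concrete, here is a minimal sketch of how a request cost could be derived from token usage and paired with a metadata key. It is an illustration only: the struct layout, the `CostType` values, `TokenUsage`, and `Compute` are hypothetical names chosen for this example and are not taken from the PR's `api/v1alpha1/api.go`.

```go
package main

import "fmt"

// CostType is a hypothetical enum naming which token count a cost is derived from.
type CostType string

const (
	CostInputToken  CostType = "InputToken"
	CostOutputToken CostType = "OutputToken"
	CostTotalToken  CostType = "TotalToken"
)

// RequestCost is a hypothetical stand-in for the RequestCost configuration:
// it names the dynamic metadata key and how the value is calculated.
type RequestCost struct {
	MetadataKey string
	Type        CostType
}

// TokenUsage is an assumed container for the token counts of one request.
type TokenUsage struct {
	Input  uint32
	Output uint32
}

// Compute returns the cost value that would be stored under MetadataKey.
func (c RequestCost) Compute(u TokenUsage) (uint32, error) {
	switch c.Type {
	case CostInputToken:
		return u.Input, nil
	case CostOutputToken:
		return u.Output, nil
	case CostTotalToken:
		return u.Input + u.Output, nil
	default:
		return 0, fmt.Errorf("unknown cost type %q", c.Type)
	}
}

func main() {
	rc := RequestCost{MetadataKey: "token_cost", Type: CostTotalToken}
	v, err := rc.Compute(TokenUsage{Input: 12, Output: 30})
	if err != nil {
		panic(err)
	}
	// A rate limiter could then read this key/value pair from dynamic metadata.
	fmt.Printf("%s=%d\n", rc.MetadataKey, v)
}
```

In this sketch the gateway only computes a number and publishes it under a key; the rate limiting itself is assumed to happen elsewhere, which matches the description's point that setting the metadata is the only work needed on the AI Gateway side.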

api/v1alpha1/api.go

mathetake · 2025-01-16T02:31:34Z

cc @envoyproxy/ai-gateway-assignable

mathetake · 2025-01-16T02:31:51Z

will do the e2e tests in another PR

mathetake · 2025-01-17T00:28:21Z

marking as draft until everything is ready, including tests

mathetake · 2025-01-17T01:17:18Z

ok ready - I would like to unblock #111, so I will defer the end-to-end tests to a subsequent PR.

mathetake · 2025-01-17T01:17:53Z

also, the CEL implementation is a TODO, as it will be a relatively large change

mathetake · 2025-01-17T01:18:42Z

tests/testupstream/main.go

+	for k, v := range r.Header {
+		fmt.Printf("header %q: %s\n", k, v)
+	}
 	if v := r.Header.Get(expectedHeadersKey); v != "" {


note: intentionally left in, as the testupstream already does this everywhere else

internal/extproc/processor.go

mathetake · 2025-01-17T20:31:20Z

I made it plural so that it can be used to have separate rate limits per metric. PTAL @aabchoo @yuzisun
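
As a rough illustration of why a plural field helps, the sketch below turns a list of costs into one metadata entry per metric, so each key could back its own rate limit. The `Cost` type, the string type names, and `BuildMetadata` are hypothetical and not from this PR; the duplicate-key check is likewise an assumption about what a sensible validation might look like.

```go
package main

import (
	"fmt"
	"sort"
)

// Cost is a hypothetical entry in a plural "request costs" list.
type Cost struct {
	MetadataKey string
	Type        string // assumed values: "InputToken", "OutputToken", "TotalToken"
}

// BuildMetadata computes one value per configured cost and rejects
// duplicate keys so that every metric maps to a distinct entry.
func BuildMetadata(costs []Cost, input, output uint32) (map[string]uint32, error) {
	md := make(map[string]uint32, len(costs))
	for _, c := range costs {
		if _, dup := md[c.MetadataKey]; dup {
			return nil, fmt.Errorf("duplicate metadata key %q", c.MetadataKey)
		}
		switch c.Type {
		case "InputToken":
			md[c.MetadataKey] = input
		case "OutputToken":
			md[c.MetadataKey] = output
		case "TotalToken":
			md[c.MetadataKey] = input + output
		default:
			return nil, fmt.Errorf("unknown cost type %q", c.Type)
		}
	}
	return md, nil
}

func main() {
	costs := []Cost{
		{MetadataKey: "input_cost", Type: "InputToken"},
		{MetadataKey: "output_cost", Type: "OutputToken"},
	}
	md, err := BuildMetadata(costs, 12, 30)
	if err != nil {
		panic(err)
	}
	// Sort keys so the printed order is deterministic.
	keys := make([]string, 0, len(md))
	for k := range md {
		keys = append(keys, k)
	}
	sort.Strings(keys)
	for _, k := range keys {
		fmt.Printf("%s=%d\n", k, md[k])
	}
}
```

With a single cost, input and output tokens would have to share one budget; with a list, a rate limit on `input_cost` and another on `output_cost` can be configured independently.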

mathetake · 2025-01-18T03:47:36Z

ready 🎵

filterconfig/filterconfig.go

api: RequestCost configuratinos

1a7bba7

Signed-off-by: Takeshi Yoneda <[email protected]>

mathetake commented Jan 16, 2025

api/v1alpha1/api.go (resolved)

mathetake marked this pull request as ready for review January 16, 2025 02:31

mathetake requested a review from a team as a code owner January 16, 2025 02:31

mathetake commented Jan 16, 2025

api/v1alpha1/api.go (outdated, resolved)

typoe

9d72325

Signed-off-by: Takeshi Yoneda <[email protected]>

yuzisun reviewed Jan 16, 2025

api/v1alpha1/api.go (outdated, resolved)

Merge remote-tracking branch 'origin/main' into requestcosts

dac62bf

mathetake mentioned this pull request Jan 16, 2025

Extract Input/Output token usage from request. #111

Open

mathetake added 3 commits January 16, 2025 16:16

more

4c2a180

Signed-off-by: Takeshi Yoneda <[email protected]>

more

6b3f0d1

Signed-off-by: Takeshi Yoneda <[email protected]>

more

e862d46

Signed-off-by: Takeshi Yoneda <[email protected]>

mathetake marked this pull request as draft January 17, 2025 00:28

mathetake added 6 commits January 16, 2025 16:39

more

c723388

Signed-off-by: Takeshi Yoneda <[email protected]>

Merge remote-tracking branch 'origin/main' into requestcosts

2a9905b

more

9360d3d

Signed-off-by: Takeshi Yoneda <[email protected]>

cumulative

840d31c

Signed-off-by: Takeshi Yoneda <[email protected]>

more

efd6742

Signed-off-by: Takeshi Yoneda <[email protected]>

more

80b2cbd

Signed-off-by: Takeshi Yoneda <[email protected]>

mathetake marked this pull request as ready for review January 17, 2025 01:16

mathetake requested review from yuzisun, wengyao04 and aabchoo January 17, 2025 01:17

mathetake commented Jan 17, 2025

yuzisun reviewed Jan 17, 2025

internal/extproc/processor.go (outdated, resolved)

fuzz

3b5b6e3

Signed-off-by: Takeshi Yoneda <[email protected]>

yuzisun reviewed Jan 17, 2025

api/v1alpha1/api.go (outdated, resolved)

yuzisun reviewed Jan 17, 2025

api/v1alpha1/api.go (resolved)

mathetake requested a review from arkodg January 17, 2025 02:01

mathetake added 4 commits January 16, 2025 18:12

review: llm prefix

3ac2978

Signed-off-by: Takeshi Yoneda <[email protected]>

more

7bfce7d

Signed-off-by: Takeshi Yoneda <[email protected]>

more

52b1231

Signed-off-by: Takeshi Yoneda <[email protected]>

more

27fca7b

Signed-off-by: Takeshi Yoneda <[email protected]>

mathetake requested a review from yuzisun January 17, 2025 04:29

fuzz

21bf4ca

Signed-off-by: Takeshi Yoneda <[email protected]>

aabchoo reviewed Jan 17, 2025

api/v1alpha1/api.go (outdated, resolved)

mathetake added 3 commits January 17, 2025 11:07

Merge remote-tracking branch 'origin/main' into requestcosts

249931b

make it plural

9c4f19a

Signed-off-by: Takeshi Yoneda <[email protected]>

fix unit tests

ca73791

Signed-off-by: Takeshi Yoneda <[email protected]>

mathetake requested a review from aabchoo January 17, 2025 20:30

more unit tests

d3be1cd

Signed-off-by: Takeshi Yoneda <[email protected]>

mathetake commented Jan 17, 2025

api/v1alpha1/api.go (resolved)

mathetake added 2 commits January 17, 2025 19:43

more

d7a7cbe

Signed-off-by: Takeshi Yoneda <[email protected]>

Distinct

e6370a8

Signed-off-by: Takeshi Yoneda <[email protected]>

mathetake commented Jan 18, 2025

filterconfig/filterconfig.go (outdated, resolved)

mathetake added 3 commits January 17, 2025 19:59

tweatk

721eacf

Signed-off-by: Takeshi Yoneda <[email protected]>

fix

7fedc18

Signed-off-by: Takeshi Yoneda <[email protected]>

Merge remote-tracking branch 'origin/main' into requestcosts

54ed443

yuzisun approved these changes Jan 18, 2025

yuzisun merged commit f4ba5cc into main Jan 18, 2025
11 checks passed

yuzisun deleted the requestcosts branch January 18, 2025 18:04

This was referenced Jan 18, 2025

Allow customizing token cost calculation #97

Open

feat: implements CEL expression API for costs #153

Open
