
feat(gen ai): showcase different options for computation-based metric #12756


Open
wants to merge 1 commit into base: main

Conversation

Valeriy-Burlaka (Member)

Description

Fixes #

Note: Before submitting a pull request, please open an issue for discussion if you are not associated with Google.

Checklist

@Valeriy-Burlaka Valeriy-Burlaka self-assigned this Nov 8, 2024
@Valeriy-Burlaka Valeriy-Burlaka requested review from a team as code owners November 8, 2024 15:05
@Valeriy-Burlaka Valeriy-Burlaka marked this pull request as draft November 8, 2024 15:05
@product-auto-label product-auto-label bot added the samples Issues that are directly related to samples. label Nov 8, 2024
@@ -37,7 +39,37 @@ def get_rouge_score() -> EvalResult:
life, including endangered species, it faces serious threats from
climate change, ocean acidification, and coral bleaching."""

# Compare pre-generated model responses against the reference (ground truth).
# Option1: Run model inference and evaluate model response against the reference (ground truth)
Member

The code sample looks too big now!

Valeriy-Burlaka (Member Author)

Yep, I understand

Valeriy-Burlaka (Member Author), Nov 8, 2024

@msampathkumar, I'm thinking about showcasing two different ways of using the computation-based metrics: bring-your-own-response (BYOR) and running model inference as part of the evaluation.
The reason is that, for me as a developer, the line between these options wasn't immediately obvious (hence this issue with the "prompt" column being silently unused), so I want to make it crystal clear.
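
Roughly, the two options I have in mind (a minimal sketch, assuming the `vertexai.preview.evaluation` SDK; the column names "prompt"/"response"/"reference", the `rouge_l_sum` metric, and the `gemini-1.5-pro` model name are illustrative here, not the final sample code):

```python
# Sketch only, not the code from this PR: assumes google-cloud-aiplatform
# with the preview Gen AI evaluation SDK installed.
import pandas as pd
import vertexai
from vertexai.generative_models import GenerativeModel
from vertexai.preview.evaluation import EvalTask

vertexai.init(project="your-project-id", location="us-central1")  # placeholder values

reference = (
    "The Great Barrier Reef is the world's largest coral reef system and is "
    "home to an incredible diversity of marine life."
)

# Option 1: bring-your-own-response (BYOR). The dataset already contains
# model responses, so no inference is run; a "prompt" column would be ignored.
byor_dataset = pd.DataFrame(
    {
        "response": ["The Great Barrier Reef is the largest coral reef system."],
        "reference": [reference],
    }
)
byor_result = EvalTask(dataset=byor_dataset, metrics=["rouge_l_sum"]).evaluate()

# Option 2: run model inference as part of the evaluation. The dataset
# provides prompts, and passing a model to evaluate() makes the SDK generate
# the responses before computing the metric.
inference_dataset = pd.DataFrame(
    {
        "prompt": [f"Summarize in one sentence: {reference}"],
        "reference": [reference],
    }
)
inference_result = EvalTask(
    dataset=inference_dataset, metrics=["rouge_l_sum"]
).evaluate(model=GenerativeModel("gemini-1.5-pro"))

print(byor_result.summary_metrics)
print(inference_result.summary_metrics)
```

Showing both side by side makes the distinction explicit: the "prompt" column only takes effect when a model is passed to `evaluate()`, while in BYOR mode it is silently ignored.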

Member

While I understand your point, this code sample is still too big (100 lines). Let me check with the tech writing team.

Member

Also note: I don't see an example response section for this part of the code.

@msampathkumar msampathkumar self-assigned this Feb 4, 2025
@msampathkumar msampathkumar marked this pull request as ready for review February 14, 2025 09:32
@msampathkumar msampathkumar requested a review from a team as a code owner February 14, 2025 09:32
msampathkumar (Member)

Waiting for Kokoro CI - Python 3.13 to complete

msampathkumar (Member)

@Valeriy-Burlaka, can you check and address the unresolved comments?

@msampathkumar msampathkumar added the waiting-response Waiting for the author's response. label Feb 14, 2025
@msampathkumar msampathkumar added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Feb 24, 2025
@kokoro-team kokoro-team removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Feb 24, 2025