Add Kaggle Benchmarks SDK notebook with belief update task and temporal control results by RCSharm07 · Pull Request #2 · arjunvad123/the-observer-hypothesis

RCSharm07 · 2026-04-13T05:15:34Z

SDK-compatible notebook for the Learning track submission. Runs
belief_update episodes in canonical and shuffled-control conditions, evaluates against
frontier models (Gemini 2.5 Flash, GPT-5.4 Mini), and computes temporal sensitivity
gaps. Includes retry logic for API flakiness and multi-model comparison output.

master1223347 and others added 3 commits April 8, 2026 15:02

created learning_benchmark

bf8fe90

feat: add kaggle-benchmarks SDK notebook with temporal control results

b2fdbe5

push

4a78253

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Kaggle Benchmarks SDK notebook with belief update task and temporal control results#2

Add Kaggle Benchmarks SDK notebook with belief update task and temporal control results#2
RCSharm07 wants to merge 3 commits intoarjunvad123:mainfrom
master1223347:Kaggle/benchmark_Creation

RCSharm07 commented Apr 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

RCSharm07 commented Apr 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants