Unofficial: Interactive dashboard visualizing all 352 submissions #747
Dashboard Update - v4 (March 31, 2026)
Major update to the dashboard:
Data:
New sections:
Chart fixes:
Live dashboard: https://nathanmaine.github.io/parameter-golf-experiment-lab/
PRs for all 7 research directions: #1191, #1192, #1193, #1194, #1195, #1196, #1197
@Ribin545 Glad the dashboard helped! I've needed it many times myself! The RTX 3090 is solid, hitting 1.84. What are you getting now without it?
Dashboard Update - v5 (April 2, 2026)
Major update focused on data completeness and TTT legality filtering.
Data:
New: TTT Legality Filtering
Following the TTT legality discussion in issue #402 and the rulings from @0hq and @valerio-oai, the leaderboard now includes TTT compliance classification:
The leaderboard now defaults to "Legal Only" so the realistic competition state is visible immediately. All submissions are still accessible via the filter dropdown or search bar. A disclaimer block under the leaderboard heading explains that this classification is our best interpretation of the current rules and may not be 100% accurate.
@0hq @valerio-oai - if any of these classifications are off, happy to adjust. If you think your submission is miscategorized, open an issue on the dashboard repo or let me know here.
Why this matters: Without filtering, the top ~33 submissions are dominated by n-gram cache approaches scoring below 0.5 BPB. Many of these have been closed by organizers. The "Legal Only" view shows the actual state of the neural modeling competition, where the real innovation is happening in the 1.05-1.12 BPB range.
Other updates:
Live dashboard: https://nathanmaine.github.io/parameter-golf-experiment-lab/
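For anyone curious how a "Legal Only" default view can work, here is a minimal sketch. It is not the dashboard's actual code: the `Submission` fields, the `leaderboard` function, and the sample rows are all hypothetical, made up for illustration. The only things taken from this thread are the three labels (Legal/Illegal/Suspect, named in a later update) and the fact that lower BPB ranks higher.

```python
from dataclasses import dataclass


@dataclass
class Submission:
    # Hypothetical record shape; the real dashboard data may differ.
    name: str
    bpb: float      # bits per byte, lower is better
    legality: str   # "Legal", "Illegal", or "Suspect"


def leaderboard(submissions, view="Legal Only"):
    """Return submissions ranked by BPB, filtered by the selected view."""
    if view == "Legal Only":
        rows = [s for s in submissions if s.legality == "Legal"]
    elif view == "All":
        rows = list(submissions)
    else:
        raise ValueError(f"unknown view: {view!r}")
    # Ascending sort puts the lowest (best) BPB first.
    return sorted(rows, key=lambda s: s.bpb)


# Made-up rows for illustration only; these are not real submissions.
SAMPLE = [
    Submission("ngram-cache-example", 0.45, "Illegal"),
    Submission("neural-example-a", 1.08, "Legal"),
    Submission("neural-example-b", 1.11, "Legal"),
    Submission("ttt-example", 0.98, "Suspect"),
]

if __name__ == "__main__":
    for row in leaderboard(SAMPLE):
        print(f"{row.name}: {row.bpb:.2f} BPB")
```

With the default view only the two "Legal" rows remain, while passing `view="All"` puts the n-gram cache style entry back at the top, which is the effect described above.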
Dashboard Update - v8 (April 3, 2026)
This is NOT an official OpenAI resource. This dashboard is an independent, unofficial project by one participant. All classifications (Legal/Illegal/Suspect) are our best interpretation of the current rules based on issue #402 and the illegal submissions megathread #677. They may not be 100% accurate.
Data:
New filters (per community feedback from @samquiring):
Updated sections:
New record attempts submitted:
Both are pure neural submissions with no n-gram cache, no multi-epoch TTT. Full details and reproduction commands in the PRs.
Corrections welcome: If your submission's size, BPB, or legality status is showing incorrectly, open an issue at https://github.com/NathanMaine/parameter-golf-experiment-lab/issues or comment here.
Live dashboard: https://nathanmaine.github.io/parameter-golf-experiment-lab/
Dashboard Update - v9 (April 6, 2026)
This is NOT an official OpenAI resource. Independent, unofficial project by one participant.
Data
Bug Fixes
New: DGX Spark PROTEUS Ablation Data
New: SLOT Legality Context
All Sections Updated
New Key Discoveries (Section 7)
Competition Landscape
Feedback welcome - open an issue on the dashboard repo.
I built an interactive dashboard that visualizes data from all 352 submissions with BPB scores:
Live Dashboard → https://nathanmaine.github.io/parameter-golf-experiment-lab/
What you can do:
Data includes:
Also includes a technique effectiveness matrix showing what worked and what didn't across 46+ experiments, plus cost analysis for anyone budgeting their RunPod spend.
Source: github.com/NathanMaine/parameter-golf-experiment-lab
If your submission data looks wrong, let me know — happy to fix it. The data was pulled from submission.json files as of March 24.
Good luck everyone! 🏌️