Sakana AI

All

39 repositories

ShinkaEvolve
Public
ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution
Python
•
Apache License 2.0
•150•810•7•9•Updated Jan 25, 2026Jan 25, 2026
Kamon
Public
Data and code for understanding and generation of Kamon.
Python
•
Creative Commons Attribution Share Alike 4.0 International
•6•31•0•0•Updated Jan 24, 2026Jan 24, 2026
drq
Public
Digital Red Queen: Adversarial Program Evolution in Core War with LLMs
Red
•
Apache License 2.0
•20•170•1•0•Updated Jan 13, 2026Jan 13, 2026
DroPE
Public
Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding
Python
•17•194•2•1•Updated Jan 12, 2026Jan 12, 2026
ALE-Bench
Public
The official repository of ALE-Bench
Python
•
Apache License 2.0
•20•153•2•0•Updated Jan 5, 2026Jan 5, 2026
IASC
Public
LLMs for Constructed Languages
HTML
•
Creative Commons Attribution Share Alike 4.0 International
•4•42•1•0•Updated Jan 2, 2026Jan 2, 2026
continuous-thought-machines
Public
Continuous Thought Machines, because thought takes time and reasoning is a process.
Python
•
Apache License 2.0
•269•1.7k•1•2•Updated Dec 29, 2025Dec 29, 2025
repo
Public
RePo: Language Models with Context Re-Positioning
Python
•7•64•1•0•Updated Dec 24, 2025Dec 24, 2025
AI-Scientist-v2
Public
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Python
•
Other
•383•2.1k•30•9•Updated Dec 19, 2025Dec 19, 2025
AI-Scientist
Public
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Jupyter Notebook
•
Other
•1.7k•12k•89•20•Updated Dec 19, 2025Dec 19, 2025
Sudoku-Bench
Public
An AI benchmark for creative, human-like problem solving using Sudoku variants
JavaScript
•
MIT License
•15•157•1•0•Updated Dec 13, 2025Dec 13, 2025
treequest
Public
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
Python
•
Apache License 2.0
•65•513•1•1•Updated Dec 9, 2025Dec 9, 2025
TinySwallow-ChatUI
Public
Browser-based chat UI for TinySwallow-1.5B that runs without API calls.
CSS
•
Apache License 2.0
•8•130•0•0•Updated Dec 1, 2025Dec 1, 2025
robust-kbench
Public
Python
•
Apache License 2.0
•11•80•4•1•Updated Nov 22, 2025Nov 22, 2025
neuroevolution-for-ai
Public
Neuroevolution Community
2•6•0•0•Updated Nov 17, 2025Nov 17, 2025
mle-bench-shinka-agent
Public
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Python
•
Other
•205•0•0•0•Updated Nov 12, 2025Nov 12, 2025
petri-dish-nca
Public
Python
•6•51•0•2•Updated Nov 6, 2025Nov 6, 2025
google-code-golf-2025
Public
Python
•
Apache License 2.0
•0•4•0•0•Updated Oct 31, 2025Oct 31, 2025
asal
Public
Automating the Search for Artificial Life with Foundation Models!
Jupyter Notebook
•
Apache License 2.0
•52•449•1•0•Updated Oct 23, 2025Oct 23, 2025
shachi
Public
Reimagining Agent-based Modeling with Large Language Model Agents via Shachi
Python
•
Apache License 2.0
•3•27•0•0•Updated Oct 10, 2025Oct 10, 2025
TAID
Public
Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
Python
•
Apache License 2.0
•8•120•3•0•Updated Oct 6, 2025Oct 6, 2025
natural_niches
Public
The code repository of the paper: Competition and Attraction Improve Model Fusion
Jupyter Notebook
•
Apache License 2.0
•33•168•1•0•Updated Aug 25, 2025Aug 25, 2025
BALROG
Public
Benchmarking Agentic LLM and VLM Reasoning On Games
Python
•
MIT License
•41•1•0•0•Updated Aug 19, 2025Aug 19, 2025
EDINET-Bench
Public
Evaluating the performance of LLMs on Japanese challenging financial tasks.
Python
•
Apache License 2.0
•3•29•0•0•Updated Jul 28, 2025Jul 28, 2025
TransEvalnia
Public
Reasoning-based Evaluation and Ranking of Translations.
Python
•
Apache License 2.0
•4•18•1•0•Updated Jul 18, 2025Jul 18, 2025
ab-mcts-arc2
Public
Python
•
Apache License 2.0
•18•106•1•0•Updated Jun 30, 2025Jun 30, 2025
RLT
Public
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
Python
•
Apache License 2.0
•54•358•3•0•Updated Jun 23, 2025Jun 23, 2025
edinet2dataset
Public
edinet2dataset is a tool to construct financial dataset using EDINET.
Python
•
Apache License 2.0
•8•30•0•0•Updated Jun 11, 2025Jun 11, 2025
text-to-lora
Public
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
machine-learning lora fine-tuning hypernetworks llm
Python
•
Apache License 2.0
•65•939•2•0•Updated Jun 8, 2025Jun 8, 2025
L2D
Public
Large language models to diffusion finetuning code
Python
•
Apache License 2.0
•3•23•0•0•Updated Jun 2, 2025Jun 2, 2025