Source: ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution
Summary
ShinkaEvolve is an open-source framework from Sakana AI that uses LLMs as mutation operators for evolutionary program optimization. It addresses the two main limitations of prior work (like AlphaEvolve): sample inefficiency (requiring thousands of evaluations) and closed-source implementations. Through three algorithmic innovations — adaptive parent sampling, code novelty rejection-sampling, and bandit-based LLM ensemble selection — ShinkaEvolve achieves state-of-the-art results using orders of magnitude fewer evaluations.
Key Claims
- Unprecedented sample efficiency. ShinkaEvolve discovered a new state-of-the-art circle packing solution using only 150 program evaluations, versus the thousands AlphaEvolve required. The authors attribute this to the framework's explicit balancing of exploration and exploitation during search.
- Three innovations drive efficiency. (1) Power-law parent sampling balances exploitation of top solutions with exploration of novel ones. (2) Novelty rejection-sampling uses code embeddings to filter out mutations too similar to existing programs. (3) UCB1-based bandit selection dynamically allocates work to the most productive LLMs in a multi-model ensemble.
- Broad applicability across domains. Demonstrated on mathematical optimization (circle packing), agentic scaffold design (AIME math reasoning), competitive programming (ALE-Bench), and LLM training design (MoE load balancing loss). Each domain showed consistent improvements.
- Evolved agent scaffolds generalize. The agentic scaffold discovered for AIME tasks (three specialized expert personas + peer review + synthesis) generalized across different LLM backends (GPT-4.1-mini, GPT-4.1, o4-mini) and unseen problem sets (AIME 2025).
- Novel MoE loss function discovered. ShinkaEvolve evolved a new mixture-of-experts load balancing loss that adds a regularization term targeting under-specialized experts. This “safety net” activates only for experts below a usage floor, avoiding over-regularization of well-balanced layers.
- Meta-scratchpad accelerates learning. A periodic meta-analysis summarizes successful optimization strategies from recent generations and appends insights to future mutation prompts, providing knowledge diffusion across the evolutionary process.
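The three innovations listed above can be sketched as small, self-contained routines. This is an illustrative reconstruction from the descriptions here, not ShinkaEvolve's actual implementation; the function names, the rank-based power law, the cosine threshold, and the reward bookkeeping are all assumptions.

```python
import math
import random

def sample_parent(archive, alpha=1.0):
    """Power-law parent sampling (illustrative): rank programs by fitness
    and draw with probability proportional to rank^(-alpha), so top
    solutions are exploited often while weaker ones still get explored."""
    ranked = sorted(archive, key=lambda p: p["fitness"], reverse=True)
    weights = [(i + 1) ** -alpha for i in range(len(ranked))]
    return random.choices(ranked, weights=weights, k=1)[0]

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def is_novel(candidate_emb, archive_embs, threshold=0.95):
    """Novelty rejection-sampling (illustrative): reject a proposed
    mutation whose code embedding is too similar to any program
    already in the archive."""
    return all(cosine(candidate_emb, e) < threshold for e in archive_embs)

def pick_llm_ucb1(stats, t):
    """UCB1 bandit over an LLM ensemble: choose the model with the best
    mean reward plus exploration bonus; untried models go first."""
    best, best_score = None, -float("inf")
    for name, (pulls, total_reward) in stats.items():
        if pulls == 0:
            return name  # try every arm at least once
        score = total_reward / pulls + math.sqrt(2 * math.log(t) / pulls)
        if score > best_score:
            best, best_score = name, score
    return best
```

In an evolutionary loop these would compose naturally: pick an LLM via `pick_llm_ucb1`, mutate a parent drawn by `sample_parent`, and discard the child if `is_novel` returns `False` before ever paying for an evaluation.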
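The "safety net" behavior of the evolved MoE loss can be illustrated with a minimal sketch. The exact loss ShinkaEvolve discovered is not reproduced here; the squared-hinge form, the `floor` parameter, and the per-expert usage threshold are assumptions chosen only to show a penalty that activates solely for under-used experts and vanishes for well-balanced layers.

```python
def safety_net_penalty(expert_usage, floor=0.5):
    """Hypothetical regularizer (illustrative, not the paper's loss):
    penalize only experts whose routing fraction falls below a floor of
    floor/num_experts; a well-balanced layer incurs zero extra penalty."""
    n = len(expert_usage)
    tau = floor / n  # per-expert usage floor
    # Squared hinge: contributes nothing once usage clears the floor.
    return sum(max(tau - u, 0.0) ** 2 for u in expert_usage)
```

With four experts and `floor=0.5`, the floor is 0.125: a perfectly balanced layer (`[0.25, 0.25, 0.25, 0.25]`) pays nothing, while a layer with a starved expert (`[0.4, 0.4, 0.15, 0.05]`) is penalized only for the expert at 0.05, matching the claim that the term avoids over-regularizing already-balanced layers.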
Relevance and Implications
ShinkaEvolve extends the Autonomous Code Optimization paradigm from single-codebase optimization to general-purpose program evolution. The framework’s ability to discover novel mathematical solutions, design effective agent architectures, and even improve neural network training loss functions demonstrates that LLM-guided evolution is a general-purpose discovery tool. The open-source release (Apache 2.0) makes this accessible to any team with a fitness function and a benchmark. The MoE load balancing result is particularly notable — it shows that evolutionary search can discover improvements to the training process of LLMs themselves.