CodeEvolve
An open-source framework that combines large language models with evolutionary algorithms to discover and optimize high-performing code solutions.
This is a companion repository to science-codeevolve and contains the complete experimental setup and results for the CodeEvolve paper.
This repository provides:
- Experimental configurations for reproducing all results
- Raw experimental data from paper runs (`.pkl`, `.py`, and `.txt` files)
- Analysis notebooks with visualizations and statistical tests
The benchmark problems themselves are implemented in the main science-codeevolve repository.
```
science-codeevolve-experiments/
├── experiments/                  # Raw experimental results
├── notebooks/                    # Analysis and visualization
│   ├── experiment_analysis.ipynb # Main analysis notebook
│   └── figs/                     # Generated figures from paper
└── README.md
```
- `experiments/`: Results from the paper's experiments, including:
  - Solution histories (`.py` files)
  - Checkpoints (`.pkl` files)
  - Logs and metadata (`.txt` files)
  - Multiple runs with different seeds/configurations
- `notebooks/`: Jupyter notebooks for analysis
  - `experiment_analysis.ipynb`: Statistical analysis and comparisons
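The `.pkl` checkpoints can be inspected with Python's standard `pickle` module. The sketch below uses a hypothetical checkpoint schema purely for illustration; the actual structure of the checkpoints in `experiments/` is defined by the main science-codeevolve repository.

```python
import pickle
from pathlib import Path

# Hypothetical checkpoint-like dict; real checkpoints follow the
# schema used by the main science-codeevolve framework.
checkpoint = {"generation": 12, "best_score": 0.87, "seed": 0}

# Write it out, then reload it the way you would load a real
# checkpoint from experiments/.
path = Path("checkpoint_example.pkl")
path.write_bytes(pickle.dumps(checkpoint))

loaded = pickle.loads(path.read_bytes())
print(loaded["generation"], loaded["best_score"])  # → 12 0.87
```

When exploring the real artifacts, replace the example path with a file from `experiments/` and inspect the loaded object's keys first.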
First, install the main CodeEvolve framework:

```bash
# Clone and install the CodeEvolve framework
git clone https://github.com/inter-co/science-codeevolve.git
cd science-codeevolve
conda env create -f environment.yml
conda activate codeevolve
cd ..

# Clone this experiments repository
git clone https://github.com/inter-co/science-codeevolve-experiments.git
cd science-codeevolve-experiments
```

Set your LLM API credentials as environment variables:

```bash
export API_KEY=your_api_key_here
export API_BASE=your_api_base_url
```

This repository supports two distinct notions of reproducibility:
The folder experiments/ contains the raw artifacts used in the paper (checkpoints, histories, logs). The notebook(s) in notebooks/ analyze those artifacts to generate the plots and comparisons. Re-running the analysis should reproduce the reported figures/tables as long as your analysis environment is compatible.
Exact numerical reproduction of a full evolutionary run is not guaranteed when using hosted LLM APIs.
Why:
- Many commercial LLM providers do not support deterministic sampling or do not honor `seed`.
- Even when a provider accepts `seed`, outputs can vary due to backend nondeterminism (load balancing, infrastructure-level randomness, model version rollouts).
This is not a limitation of CodeEvolve's evolutionary framework: CodeEvolve is seedable for its internal stochastic decisions, and it forwards the model seed to OpenAI-compatible endpoints when supported. The remaining nondeterminism comes from the LLM backbone/provider.
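To illustrate the distinction, an evolutionary agent's internal stochastic decisions (e.g., which parent solutions to select for mutation) can be made fully reproducible with an ordinary seeded RNG, independent of the LLM backbone. This is a minimal sketch of that idea, not CodeEvolve's actual implementation:

```python
import random

def select_parents(population, k, seed):
    # Seeded RNG: the same seed always yields the same selection,
    # so this part of the pipeline is fully deterministic.
    rng = random.Random(seed)
    return rng.sample(population, k)

pop = ["sol_a", "sol_b", "sol_c", "sol_d"]
first = select_parents(pop, 2, seed=42)
second = select_parents(pop, 2, seed=42)
assert first == second  # deterministic given the seed

# By contrast, the LLM call that mutates the selected parents is the
# step that remains nondeterministic when using hosted APIs, even if
# a seed is forwarded to the provider.
```

The same pattern applies to any internal randomness (crossover points, island migration, etc.): seeding it removes the agent-side variance, leaving only the provider-side variance discussed above.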
```bibtex
@article{assumpção2025codeevolveopensourceevolutionary,
  title={CodeEvolve: An open source evolutionary coding agent for algorithm discovery and optimization},
  author={Henrique Assumpção and Diego Ferreira and Leandro Campos and Fabricio Murai},
  year={2025},
  eprint={2510.14150},
  archivePrefix={arXiv},
  primaryClass={cs.AI},
  url={https://arxiv.org/abs/2510.14150},
}
```

Experiments are versioned to match the main repository:
- v0.1.0: Initial release; corresponds to v1 of the CodeEvolve paper and v0.1.0 of the main repo.
- v0.2.0: Corresponds to v3 of the CodeEvolve paper and v0.2.0 of the main repo.
- v0.2.1: Current release; corresponds to v4 of the CodeEvolve paper and v0.2.0 of the main repo.
The authors thank Bruno Grossi for his continuous support during the development of this project. We thank Fernando Augusto and Tiago Machado for useful conversations about possible applications of CodeEvolve. We also thank the OpenEvolve community for their inspiration and discussion about evolutionary coding agents.
All software is licensed under the Apache License, Version 2.0 (Apache 2.0); you may not use these files except in compliance with the Apache 2.0 license. You may obtain a copy of the license at: https://www.apache.org/licenses/LICENSE-2.0.
This is not an official Inter product.