SVRP-RL: Reinforcement Learning for Stochastic Vehicle Routing Problem

This repository contains an implementation of the paper "Reinforcement Learning for Solving Stochastic Vehicle Routing Problem" by Iklassov, Sobirov, Solozabal, and Takáč.

Overview

The Stochastic Vehicle Routing Problem (SVRP) extends the classic Vehicle Routing Problem by introducing uncertainty in customer demands and travel costs. This implementation uses a Reinforcement Learning approach to tackle this challenging optimization problem.

Key features:

Neural network architecture with state and memory embeddings
Attention mechanism for node selection
REINFORCE algorithm with baseline for training
Multiple inference strategies (greedy, random sampling, beam search)
Support for varying levels of stochasticity in the environment

Project Structure

svrp_rl/
├── main.py               # Main entry point for training/inference
├── config.py             # Configuration parameters
├── models/
│   ├── __init__.py
│   ├── attention.py      # Attention layer implementation
│   ├── embedding.py      # State and memory embedding components
│   └── policy.py         # Full policy model architecture
├── env/
│   ├── __init__.py
│   ├── svrp_env.py       # SVRP environment implementation
│   └── weather.py        # Weather simulation for stochastic variables
├── training/
│   ├── __init__.py
│   ├── reinforce.py      # REINFORCE algorithm implementation
│   └── baseline.py       # Baseline model implementation  
├── inference/
│   ├── __init__.py
│   └── inference.py      # Inference strategies (greedy, random, beam search)

Installation

Install dependencies

pip install -r requirements.txt


## Usage

### Training

To train a model with default parameters:

```bash
python main.py

To customize training parameters:

python main.py --num_nodes 50 --num_vehicles 2 --embedding_dim 256 --epochs 200 --batch_size 16

Testing

To evaluate a trained model:

python main.py --test --load_model checkpoints/model_final

To compare different inference strategies:

python main.py --test --load_model checkpoints/model_final --inference greedy
python main.py --test --load_model checkpoints/model_final --inference random --num_samples 32
python main.py --test --load_model checkpoints/model_final --inference beam --beam_width 5

Key Parameters

--num_nodes: Number of customer nodes plus depot
--num_vehicles: Number of vehicles available
--capacity: Maximum vehicle capacity
--a_ratio, --b_ratio, --gamma_ratio: Signal ratios for stochastic components
--embedding_dim: Dimension of state and memory embeddings
--epochs: Number of training epochs
--batch_size: Batch size for training
--inference: Inference strategy (greedy, random, beam)

Environment Settings

The SVRP environment models stochasticity through three components:

Constant component (a_ratio): Fixed part of the stochastic variables
Weather component (b_ratio): Part influenced by weather variables
Noise component (gamma_ratio): Random noise

The model learns to leverage weather information to predict stochastic variables and make better routing decisions.

Results

The implementation achieves competitive results compared to classical methods:

3.43% improvement over Ant Colony Optimization
Superior performance in correlated environments where weather affects both demand and travel costs
Efficient inference suitable for real-time industrial applications

Citation

If you use this code in your research, please cite the original paper:

@article{iklassov2023reinforcement,
  title={Reinforcement Learning for Solving Stochastic Vehicle Routing Problem},
  author={Iklassov, Zangir and Sobirov, Ikboljon and Solozabal, Ruben and Tak{\'a}{\v{c}}, Martin},
  journal={arXiv preprint arXiv:2311.07708},
  year={2023}
}

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SVRP-RL: Reinforcement Learning for Stochastic Vehicle Routing Problem

Overview

Project Structure

Installation

Install dependencies

Testing

Key Parameters

Environment Settings

Results

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
env		env
inference		inference
models		models
training		training
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
config.py		config.py
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

SVRP-RL: Reinforcement Learning for Stochastic Vehicle Routing Problem

Overview

Project Structure

Installation

Install dependencies

Testing

Key Parameters

Environment Settings

Results

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages