Model-based, model-free Learning Simulator

This project was presented on RLDM 2019 as a poster. See http://rldm.org/papers/abstracts.pdf Poster Session 1, Poster 61.

System requirements and required libraries

Tested Platform:

Ubuntu 18.04 LTS
GCC 7.3.0
Clang 6.0.0
Python: 3.6.7

Required Python Packages:

Package Name	Tested Version
Pytorch	0.4.1
dill	0.2.8.2
numpy	1.15.1
matplotlib	2.2.3
scipy	1.1.0
scikit-learn	0.20.0
tqdm	4.26.0
gym	0.10.5
ggplot	0.11.5
pandas	0.23.4

One should be able to run the script (see Running simulation section) once these packages installed. We also provided a C++ library to improve the performance. Its only dependency is boost::python library. If you already have boost::python installed, simply run

make -C ./lib

It is recommended to build boost library following the official website instructions. But I will give a specific instructions here which may be too specific that will fail depending on your machine configuration.

Download Boost library source code https://www.boost.org/. I am using 1.67 version. Direct link. Extract the archive and save it to ~/Downloads/

Copy header files

sudo cp -r ~/Downloads/boost_1_67_0/boost /usr/local/include/boost/

Configure which python interpreter to build against

echo "using python : 3.6 : /usr/bin/python3 : /usr/include/python3.6m : /usr/lib ;" > ~/user-config.jam

Build Boost libraries

cd ~/Downloads/boost_1_67_0/
./bootstrap.sh –prefix=/usr/local
./b2
sudo ./b2 install

Something to check before building our C++ FORWARD extension
- Run ldconfig -v 2>/dev/null | grep -v ^$'\t' to see your linking path contains /usr/local/lib
- Run echo | gcc -E -Wp,-v – to see if your include path contains /usr/local/include
- Check if you have clang installed. If not you can install it using apt or change variable $(CC) from clang++ to g++ in lib/Makefile

boost::python bulid complete. You can build the library now.

Running simulation

python main.py [options]

A recommended simulation configuration

python main.py -d --episodes 1000 --trials 20 --all-mode --disable-detail-plot

Generate learning curve plots

python main.py --re-analysis [pickle file name] --learning-curve-plot --disable-action-compare --use-confidence-interval

Generate P_mf probability density function plots

python main.py --re-analysis [pickle file name] --cross-compare

To view the complete list of options, run

python main.py -h

Known issue

sklearn may have some trouble with cloudpickle, see solutions https://stackoverflow.com/questions/52596204/the-imp-module-is-deprecated
ggplot may have some issue with pandas, see solutions https://stackoverflow.com/questions/50591982/importerror-cannot-import-name-timestamp

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
lib		lib
README.md		README.md
analysis.py		analysis.py
arbitrator.py		arbitrator.py
common.py		common.py
ddqn.py		ddqn.py
forward.py		forward.py
main.py		main.py
mdp.py		mdp.py
regdata.csv		regdata.csv
rltest.py		rltest.py
sarsa.py		sarsa.py
simulation.py		simulation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Model-based, model-free Learning Simulator

System requirements and required libraries

Running simulation

Known issue

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Model-based, model-free Learning Simulator

System requirements and required libraries

Running simulation

Known issue

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages