-
1.a) open terminal and execute the following commands in the root directory of the project to create a virtual environment with required project dependencies
python -m venv finalprojectsource finalproject/bin/activatepip install -r requirements.txt
2. cd into scripts folder and execute the training script for all 6 algorithms to generate the first 3 metrics results, and weights for all models, one by one
- uncomment lines 490-493 for DQN Model-Free model
- Execute script:
- `python q_dqn_doubledqn.py`
- After script is finished, comment back lines 490-493 and uncomment lines 496-499 for DQN Hyperparameters model
- Execute script:
- `python q_dqn_doubledqn.py`
- repeat this procedure for remaining models until all 3 metrics and weights are generated for all models: DQN Exploitation, DQN Exploration, Double DQN, and Q-learning models
3. Execute the evaluation script to generate average reward metric results and all model evaluations
- python evaluate_dqn.py
4. Execute the training script for the first fine tuned DQN model to generate its first 3 metrics results, and weights
- python train_finetuned_dqn.py
5. Execute the evaluation script to generate average reward metric results and evaluations for first fine tuned model
- python finetuned_evaluation.py
6. Execute the training script for the second fine tuned DQN model to generate its first 3 metrics results, and weights
- python train_finetuned_dqn.py
7. Execute the evaluation script to generate average reward metric results and evaluations for second fine tuned model
- python finetuned_evaluation2.py
Note: this requires large computation resources, I ran the script for several days in both local and google doc and it resulted in a killed file zsh: killed python train_finetuned_dqn.py locally and runtime stop on google doc
9. run evaluation script to generate average reward metric results and all (A2C, PPO, DQN) model evaluations for new algorithms and environment implementation
- python train_finetuned_dqn2.py