This project aims to implement and evaluate quantization techniques on machine learning models deployed to Jetson Nano edge devices.
Install dependencies: pip install -r requirements.txt
Create quantized model from scratch: make
- Extract unquantized parameters:
make setup - Calculate quantized parameters:
make quantize_params - Create calibration prep model:
make prep_model - Calculate quantized activations:
make quantize_activations - Calculate quantized biases:
make quantize_biases - Statically quantize model:
make quantize_model_symm - Dynamically quantize model:
make quantize_model_dyn_symm
- Extract unquantized parameters:
make setup - Calculate quantized parameters:
make quantize_params_asymm - Create calibration prep model:
make prep_model_asymm - Calculate quantized activations:
make quantize_activations_asymm - Calculate quantized biases:
make quantize_biases_asymm - Statically quantize model:
make quantize_model_asymm - Dynamically quantize model:
make quantize_model_dyn_asymm
Validate quantized parameters: make validate
Clear activations/, biases/, params/, and models/ directories: make clean