diff --git a/docs/performance_tuning.md b/docs/performance_tuning.md index ddf0de20d4..74a890eb1b 100644 --- a/docs/performance_tuning.md +++ b/docs/performance_tuning.md @@ -8,7 +8,7 @@ This document gives an overview of various parameters that can be configured to Download ResNet50 model -```bash +```text mkdir models docker run -u $(id -u):$(id -g) -v ${PWD}/models:/models openvino/ubuntu20_dev:2024.6.0 omz_downloader --name resnet-50-tf --output_dir /models docker run -u $(id -u):$(id -g) -v ${PWD}/models:/models:rw openvino/ubuntu20_dev:2024.6.0 omz_converter --name resnet-50-tf --download_dir /models --output_dir /models --precisions FP32 @@ -25,7 +25,7 @@ To enable Performance Hints for your application, use the following command: CPU -```bash +```text docker run --rm -d -v ${PWD}/models/public/resnet-50-tf:/opt/model -p 9001:9001 openvino/model_server:latest \ --model_path /opt/model --model_name resnet --port 9001 \ --plugin_config "{\"PERFORMANCE_HINT\": \"THROUGHPUT\"}" \ @@ -34,7 +34,7 @@ docker run --rm -d -v ${PWD}/models/public/resnet-50-tf:/opt/model -p 9001:9001 GPU -```bash +```text docker run --rm -d --device=/dev/dri --group-add=$(stat -c "%g" /dev/dri/render* | head -n 1) -u $(id -u):$(id -g) \ -v ${PWD}/models/public/resnet-50-tf:/opt/model -p 9001:9001 openvino/model_server:latest-gpu \ --model_path /opt/model --model_name resnet --port 9001 \ @@ -50,7 +50,7 @@ To enable Performance Hints for your application, use the following command: CPU -```bash +```text docker run --rm -d -v ${PWD}/models/public/resnet-50-tf:/opt/model -p 9001:9001 openvino/model_server:latest \ --model_path /opt/model --model_name resnet --port 9001 \ --plugin_config "{\"PERFORMANCE_HINT\": \"LATENCY\"}" \ @@ -59,7 +59,7 @@ docker run --rm -d -v ${PWD}/models/public/resnet-50-tf:/opt/model -p 9001:9001 GPU -```bash +```text docker run --rm -d --device=/dev/dri --group-add=$(stat -c "%g" /dev/dri/render* | head -n 1) -u $(id -u):$(id -g) \ -v ${PWD}/models/public/resnet-50-tf:/opt/model -p 9001:9001 openvino/model_server:latest-gpu \ --model_path /opt/model --model_name resnet --port 9001 \ @@ -129,7 +129,7 @@ In case of using CPU plugin to run the inference, it might be also beneficial to - Example: Following docker command will set `NUM_STREAMS` parameter to a value `1`: -```bash +```text docker run --rm -d --cpuset-cpus 0,1,2,3 -v ${PWD}/models/public/resnet-50-tf:/opt/model -p 9001:9001 openvino/model_server:latest \ --model_path /opt/model --model_name resnet --port 9001 \ --plugin_config "{\"NUM_STREAMS\": \"1\"}" @@ -190,7 +190,7 @@ Model's plugin configuration is a dictionary of param:value pairs passed to Open Following docker command sets a parameter `NUM_STREAMS` to a value `32` and disables CPU pinning. -```bash +```text docker run --rm -d -v ${PWD}/models/public/resnet-50-tf:/opt/model -p 9001:9001 openvino/model_server:latest \ --model_path /opt/model --model_name resnet --port 9001 --grpc_workers 8 --nireq 32 \ --plugin_config "{\"NUM_STREAMS\": 32, \"ENABLE_CPU_PINNING\": false}"