-
Notifications
You must be signed in to change notification settings - Fork 2
Expand file tree
/
Copy pathtrain.sh
More file actions
executable file
·13 lines (10 loc) · 776 Bytes
/
train.sh
File metadata and controls
executable file
·13 lines (10 loc) · 776 Bytes
1
2
3
4
5
6
7
8
9
10
11
12
13
# Please note that the number of GPUs should be consistant with parameter `nproc_per_node`.
# Single-Node Multi-Process.
CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch --nproc_per_node=4 train.py --config=config/cityscapes_pspnet.yaml
# TODO(xwd): Multi-Node Multi-Process. Put this codebase to other nodes and run the script below:
# Please note that you should assign the master node specifically.
# Any other information please refer: `https://zhuanlan.zhihu.com/p/52110617`
# python -m torch.distributed.launch --nproc_per_node=NUM_GPUS_YOU_HAVE
# --nnodes=2 --node_rank=0 --master_addr="192.168.1.1"
# --master_port=1234 YOUR_TRAINING_SCRIPT.py (--arg1 --arg2 --arg3
# and all other arguments of your training script)