I trained the model on APNEWS on my server with 56 cpus, it took almost all of them to train the model, as well as 20g+ memory
I trained the model on APNEWS on my server with 56 cpus, it took almost all of them to train the model, as well as 20g+ memory