Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #3519
+12 −3