Hi TorchSpec team, is there any interest in contributing the vLLM Mooncake hidden-states connector upstream? I maintain speculative decoding in vLLM and I think it would be very helpful for Speculators and other specdec training projects to have this maintained natively in vLLM
Hi TorchSpec team, is there any interest in contributing the vLLM Mooncake hidden-states connector upstream? I maintain speculative decoding in vLLM and I think it would be very helpful for Speculators and other specdec training projects to have this maintained natively in vLLM