Pre-compiled custom CUDA extension for Block Sparse Attention (Python 3.11 / PyTorch 2.6.0+cu124).
python machine-learning cuda pytorch pytorch-extension llm-optimization cuda-12 block-sparse-attention precompiled-wheel
Updated May 4, 2026