[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a 2-5x speedup over FlashAttention without degrading end-to-end metrics across language, image, and video models.
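The core idea is to quantize Q and K to low precision before the score matmul and dequantize before the softmax, so the expensive inner product runs on INT8 tensor cores while accuracy is preserved. The sketch below is a minimal illustration of that idea, not the repo's CUDA kernel; the per-tensor symmetric scaling scheme and function names are assumptions.

```python
# Minimal sketch of quantized attention (illustrative, not the repo's kernel):
# quantize Q and K to INT8 with per-tensor scales, compute scores, dequantize
# before softmax, and keep the P @ V product in higher precision.
import torch
import torch.nn.functional as F

def int8_quantize(x):
    """Symmetric per-tensor INT8 quantization; returns int8 tensor and its scale."""
    scale = x.abs().amax().clamp(min=1e-6) / 127.0
    q = torch.clamp(torch.round(x / scale), -127, 127).to(torch.int8)
    return q, scale

def quantized_attention(q, k, v):
    # q, k, v: (batch, heads, seq, dim) in fp16/fp32
    q_i8, q_scale = int8_quantize(q)
    k_i8, k_scale = int8_quantize(k)
    # Integer matmul emulated in fp32 here; a real kernel would use INT8 tensor cores.
    scores = (q_i8.float() @ k_i8.float().transpose(-1, -2)) * (q_scale * k_scale)
    scores = scores / q.shape[-1] ** 0.5
    probs = F.softmax(scores, dim=-1)
    return probs.to(v.dtype) @ v  # V kept in higher precision
```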
PyTorch implementation of Efficient Infinite Context Transformers with Infini-attention, plus a QwenMoE implementation, a training script, and 1M-context passkey retrieval.
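Infini-attention processes the sequence segment by segment: each segment runs standard local softmax attention, retrieves long-range context from a compressive linear-attention memory, and mixes the two with a learned per-head gate before the memory is updated with the segment's keys and values. A minimal single-segment sketch follows; the variable names (`memory`, `norm_term`, `gate`) are illustrative, not the repo's API.

```python
# Minimal single-segment sketch of the Infini-attention mechanism (assumed names).
import torch
import torch.nn.functional as F

def infini_attention_step(q, k, v, memory, norm_term, gate):
    # q, k, v: (batch, heads, seq, dim); memory: (batch, heads, dim, dim)
    # norm_term: (batch, heads, dim, 1); gate: (heads,) learned scalar per head
    sigma_q, sigma_k = F.elu(q) + 1.0, F.elu(k) + 1.0

    # Retrieve long-term context from the compressive memory (linear-attention read).
    mem_out = (sigma_q @ memory) / (sigma_q @ norm_term).clamp(min=1e-6)

    # Standard causal softmax attention over the current segment.
    local_out = F.scaled_dot_product_attention(q, k, v, is_causal=True)

    # Learned gate mixes long-term (memory) and local context.
    g = torch.sigmoid(gate).view(1, -1, 1, 1)
    out = g * mem_out + (1.0 - g) * local_out

    # Update memory and normalizer with the current segment's keys/values.
    memory = memory + sigma_k.transpose(-1, -2) @ v
    norm_term = norm_term + sigma_k.sum(dim=-2, keepdim=True).transpose(-1, -2)
    return out, memory, norm_term
```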
The official PyTorch implementation for CascadedGaze: Efficiency in Global Context Extraction for Image Restoration, TMLR'24.
Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".
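cosFormer replaces the softmax with ReLU feature maps plus a cosine positional re-weighting cos((π/2)(i−j)/M), which decomposes via cos(a−b) = cos(a)cos(b) + sin(a)sin(b) so attention stays linear in sequence length. Below is a minimal non-causal sketch following the paper's formulation, not the unofficial repo's exact code.

```python
# Minimal non-causal cosFormer sketch: ReLU kernel + cos re-weighting, O(N * d^2).
import math
import torch
import torch.nn.functional as F

def cosformer_attention(q, k, v):
    # q, k, v: (batch, heads, seq, dim)
    seq_len = q.shape[-2]
    q, k = F.relu(q), F.relu(k)

    idx = torch.arange(seq_len, device=q.device, dtype=q.dtype)
    angle = (math.pi / 2) * idx / seq_len              # (pi/2) * i / M
    cos_w = torch.cos(angle).view(1, 1, -1, 1)
    sin_w = torch.sin(angle).view(1, 1, -1, 1)

    q_cos, q_sin = q * cos_w, q * sin_w
    k_cos, k_sin = k * cos_w, k * sin_w

    # Linear attention on each decomposed term, then sum.
    num = q_cos @ (k_cos.transpose(-1, -2) @ v) + q_sin @ (k_sin.transpose(-1, -2) @ v)
    den = q_cos @ k_cos.sum(-2, keepdim=True).transpose(-1, -2) \
        + q_sin @ k_sin.sum(-2, keepdim=True).transpose(-1, -2)
    return num / den.clamp(min=1e-6)
```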
PyTorch implementation of "Compact Global Descriptor for Neural Networks" (CGD).
Implementation of: Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484)
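Hydra attention takes multi-head attention to the extreme of one head per feature channel; with a cosine-similarity kernel the whole operation collapses to elementwise products and a single sum over tokens, giving O(N·d) cost. A minimal sketch of the global (non-causal) form, following the paper rather than this repo's code:

```python
# Minimal sketch of Hydra attention's global form: heads == feature channels.
import torch
import torch.nn.functional as F

def hydra_attention(q, k, v):
    # q, k, v: (batch, seq, dim)
    q = F.normalize(q, dim=-1)                 # cosine-similarity feature map
    k = F.normalize(k, dim=-1)
    kv = (k * v).sum(dim=-2, keepdim=True)     # (batch, 1, dim): global mixing
    return q * kv                              # (batch, seq, dim)
```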
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
Nonparametric Modern Hopfield Models
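For context, the dense retrieval rule that modern Hopfield models (and the nonparametric variants here) build on is a single attention-like update over the stored patterns, ξ_new = X softmax(β Xᵀ ξ). A minimal sketch with an assumed inverse temperature `beta`:

```python
# Minimal sketch of the modern Hopfield retrieval update (illustrative shapes).
import torch

def hopfield_retrieve(patterns, query, beta=8.0, steps=1):
    # patterns: (num_stored, dim); query: (dim,)
    xi = query
    for _ in range(steps):
        attn = torch.softmax(beta * (patterns @ xi), dim=0)  # (num_stored,)
        xi = attn @ patterns                                  # retrieved pattern
    return xi
```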
Official repository for "SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space"
O(N) attention with a bounded inference KV cache. D4 Daubechies wavelet field + content-gated Q·K gather at dyadic offsets.
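As a heavily simplified illustration of the "gather at dyadic offsets" part (the D4 wavelet field and bounded KV cache of the repo are omitted, and all names are assumptions): each position attends only to positions 1, 2, 4, 8, ... steps behind it, so the number of gathered keys per token is logarithmic rather than linear.

```python
# Simplified sketch: content-gated softmax over keys gathered at dyadic offsets.
import torch

def dyadic_gather_attention(q, k, v):
    # q, k, v: (batch, seq, dim)
    batch, seq, dim = q.shape
    offsets = [1 << p for p in range((seq - 1).bit_length()) if (1 << p) < seq]
    idx = torch.arange(seq, device=q.device)
    gathered_k, gathered_v, masks = [], [], []
    for off in offsets:
        src = (idx - off).clamp(min=0)                  # position `off` steps back
        gathered_k.append(k[:, src, :])
        gathered_v.append(v[:, src, :])
        masks.append((idx - off >= 0).view(1, seq, 1))
    K = torch.stack(gathered_k, dim=2)                  # (batch, seq, n_off, dim)
    V = torch.stack(gathered_v, dim=2)
    mask = torch.stack(masks, dim=2)                    # (1, seq, n_off, 1)
    scores = (K * q.unsqueeze(2)).sum(-1, keepdim=True) / dim ** 0.5
    scores = scores.masked_fill(~mask, float("-inf"))
    probs = torch.nan_to_num(torch.softmax(scores, dim=2))
    return (probs * V).sum(dim=2)
```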
Minimal implementation of Samba by Microsoft in PyTorch
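Samba interleaves Mamba (state-space) layers with sliding-window attention and MLP blocks. The sketch below covers only the sliding-window attention half, as a minimal illustration; the window size is an assumed hyperparameter and this is not the repo's module.

```python
# Minimal sketch of the sliding-window causal attention used in Samba-style stacks.
import torch
import torch.nn.functional as F

def sliding_window_attention(q, k, v, window=2048):
    # q, k, v: (batch, heads, seq, dim)
    seq = q.shape[-2]
    i = torch.arange(seq, device=q.device)
    delta = i.view(-1, 1) - i.view(1, -1)
    # Token i may attend to tokens j with i - window < j <= i.
    mask = (delta >= 0) & (delta < window)
    return F.scaled_dot_product_attention(q, k, v, attn_mask=mask)
```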
Resources and references on solved and unsolved problems in attention mechanisms.
🤖 Build a customizable, reliable Discord bot with Sage, designed for flexibility to enhance your server's interaction and engagement.