Popular repositories

  1. any-precision-llm (Public)

    [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

    Python, 123 stars, 6 forks

  2. flashTP (Public)

    Torch-native C++/CUDA library to accelerate tensor-product layers in MLIPs

    Cuda, 57 stars, 5 forks

  3. flashneuron (Public)

    C++, 41 stars, 6 forks

  4. Ginex (Public)

    Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching

    Python, 41 stars, 8 forks

  5. OpenDNN (Public)

    OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library

    C++, 27 stars, 5 forks

  6. DecDEC (Public)

    [OSDI 2025] DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization

    Python, 22 stars, 3 forks

Repositories

Showing 10 of 80 repositories

