Popular repositories
-
bitnet-mlx-engine (Public)
BitNet 1-bit LLM inference on Apple Silicon: 89 tok/s with a custom Metal kernel. Includes CoreML/ANE benchmarks and speculative decoding.
Python 1
-
ccrun-benchmark (Public)
Does the language your container runtime is written in affect workload performance? We built the same runtime in Python, Go, Rust, and C, then ran 540+ benchmarks to find out.
Python 1
-
gemma4-eval-kit (Public)
🧪 70-test, 6-model AI benchmark: Gemma 4 vs Gemini Pro vs Flash vs Qwen. 420 verified runs across 13 categories. All prompts, rubrics, runner code & raw results included. Code executed, constraints…
Python