asadvendor-boop

Popular repositories

bitnet-mlx-engine Public

BitNet 1-bit LLM inference on Apple Silicon: 89 tok/s with custom Metal kernel. Includes CoreML/ANE benchmarks and speculative decoding.

Python 1

ccrun-benchmark Public

Does the language your container runtime is written in affect workload performance? We built the same runtime in Python, Go, Rust, and C, then ran 540+ benchmarks to find out.

Python 1

gemma4-eval-kit Public

🧪 70-test, 6-model AI benchmark: Gemma 4 vs Gemini Pro vs Flash vs Qwen. 420 verified runs across 13 categories. All prompts, rubrics, runner code & raw results included. Code executed, constraints…

Python