A high-performance, local-first intelligent agent for Android. All inference runs entirely on-device — no cloud APIs, no telemetry, no data leakage. Built on Google's Gemma-4 model family via LiteRT.
The app operates on a Brain–Memory–Action triad:
- The Brain: A three-model cascade that adapts to your hardware. FunctionGemma (270M) handles instant intent routing, while the larger Gemma-4 E-4B or E-2B is loaded on demand for complex reasoning — delivering flagship-class AI on 8–12GB devices.
- The Memory: A local RAG (Retrieval-Augmented Generation) system using sqlite-vec and EmbeddingGemma-300M. The assistant remembers personal facts, preferences, and conversation history across sessions with zero data leaving the device.
- The Action: A modular skill framework. Native Kotlin skills handle high-privilege OS integrations (SMS, device controls, media), while community-contributed WebAssembly skills run in a sandboxed Chicory runtime for safe extensibility.
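The Action layer described above can be sketched as a small dispatch loop. The `Skill` interface, `SkillRegistry`, and intent strings below are illustrative assumptions, not the project's actual API:

```kotlin
// Hypothetical sketch of the skill framework: names and shapes are
// illustrative, not the project's real interfaces.
interface Skill {
    val name: String
    fun canHandle(intent: String): Boolean
    fun execute(args: Map<String, String>): String
}

class FlashlightSkill : Skill {
    override val name = "flashlight"
    override fun canHandle(intent: String) = intent == "toggle_flashlight"
    // A real implementation would call CameraManager.setTorchMode().
    override fun execute(args: Map<String, String>) =
        "Flashlight ${args["state"] ?: "on"}"
}

class SkillRegistry(private val skills: List<Skill>) {
    // The intent string would come from the FunctionGemma router.
    fun dispatch(intent: String, args: Map<String, String> = emptyMap()): String? =
        skills.firstOrNull { it.canHandle(intent) }?.execute(args)
}
```

Native Kotlin skills and sandboxed Wasm skills would both sit behind the same interface, so the router never needs to know which kind it is invoking.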
| Component | Technology |
|---|---|
| Language | Kotlin |
| UI | Jetpack Compose, Material 3 Dynamic Color |
| Inference | Google AI Edge (LiteRT + LiteRT-LM) |
| Reasoning | Gemma-4 E-4B / E-2B (INT4 quantized) |
| Intent Router | FunctionGemma-270M (Mobile Actions fine-tune) |
| Embeddings | EmbeddingGemma-300M (768-dim) |
| Vector Search | sqlite-vec (NDK) |
| Wasm Runtime | Chicory (pure JVM) |
| DI | Hilt |
| Persistence | Room |
| Min SDK | API 35 (Android 15) |
- 🧠 On-device reasoning — Gemma-4 running on GPU/NPU via LiteRT, no internet required
- 🗣️ Voice + text input — tap-to-talk with auto-stop, or type
- 💾 Persistent memory — RAG-powered recall of personal facts and conversation history
- 🔧 Native skills — Flashlight, DND, Bluetooth, Alarms, SMS, Email, Media Control, Notes
- 🧩 Wasm skill store — Community-extensible plugins (Rust → Wasm) with sandboxed execution
- 🏠 Home Assistant — Smart home control via Wasm skill
- 🔒 100% private — No cloud APIs, no telemetry, all data stays on device
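The persistent-memory feature reduces to nearest-neighbor search over stored embeddings. A minimal in-memory sketch, standing in for what sqlite-vec does natively with a `MATCH` / `ORDER BY distance` query (the types and function names here are illustrative):

```kotlin
// Illustrative in-memory stand-in for the sqlite-vec + EmbeddingGemma pipeline.
data class MemoryEntry(val text: String, val embedding: FloatArray)

// Cosine similarity between two embedding vectors of equal length.
fun cosineSimilarity(a: FloatArray, b: FloatArray): Float {
    var dot = 0f; var na = 0f; var nb = 0f
    for (i in a.indices) { dot += a[i] * b[i]; na += a[i] * a[i]; nb += b[i] * b[i] }
    return dot / (kotlin.math.sqrt(na) * kotlin.math.sqrt(nb))
}

// Return the top-k stored facts most similar to the query embedding.
fun recall(query: FloatArray, store: List<MemoryEntry>, k: Int = 3): List<String> =
    store.sortedByDescending { cosineSimilarity(query, it.embedding) }
        .take(k)
        .map { it.text }
```

In the real pipeline the 768-dim query vector would come from EmbeddingGemma-300M, and the retrieved facts would be prepended to the reasoning model's prompt.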
| Phase | Description | Status |
|---|---|---|
| 1 | Core LiteRT-LM integration + GPU/NPU acceleration + Chat UI (+ #59 active model display, #60 model selection) | 🚧 |
| 2 | sqlite-vec + EmbeddingGemma for local RAG | ⬜ |
| 3 | FunctionGemma intent router + Native Skills + Voice I/O (Live Mode #64, "Hey Jandal" wake word #65) | ⬜ |
| 4 | Chicory Wasm runtime + GitHub Skill Store | ⬜ |
| 5 | 8GB device optimization (dynamic weight loading) | ⬜ |
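Phase 5's hardware-adaptive loading amounts to picking a model variant from available RAM headroom. A hedged sketch of such a policy — the thresholds below are illustrative guesses, not the project's actual tuning:

```kotlin
// Hypothetical model-selection policy for the three-model cascade.
// Thresholds are illustrative assumptions, not measured values.
enum class ReasoningModel { GEMMA4_E4B, GEMMA4_E2B, ROUTER_ONLY }

fun selectModel(availableRamGb: Double): ReasoningModel = when {
    availableRamGb >= 6.0 -> ReasoningModel.GEMMA4_E4B // full E-4B fits comfortably
    availableRamGb >= 3.0 -> ReasoningModel.GEMMA4_E2B // fall back to the lighter E-2B
    else -> ReasoningModel.ROUTER_ONLY                 // only FunctionGemma-270M resident
}
```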
- Android Studio Ladybug (2024.2.1) or newer
- Android NDK (for sqlite-vec)
- JDK 17
- Physical Android device with 8–12GB RAM (emulators cannot exercise the GPU/NPU delegates)
git clone https://github.com/NickMonrad/kernel-ai-assistant.git
cd kernel-ai-assistant
./scripts/download-models.sh   # Download AI models from HuggingFace (~3GB)

Open in Android Studio, connect your device via USB, and run.
- Inference: Google AI Edge (LiteRT) with INT4 quantization, GPU + NPU delegates
- Extensibility: GitHub-indexed Skill Store (Rust → Wasm) + Native Kotlin skills
- Context: Recursive context window management with semantic summarization
- Privacy: 100% offline-capable; no telemetry or external LLM API dependencies
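The context-management bullet can be sketched as a token-budget loop: when the history exceeds the budget, the oldest turns are recursively folded into a summary. The `summarize` stub below stands in for an on-device LLM call, and token counting is approximated by word count — both are assumptions for illustration:

```kotlin
// Sketch of recursive context trimming. `summarize` is a stub standing in
// for an on-device summarization call; tokens are approximated by words.
data class Turn(val role: String, val text: String)

fun tokenCount(t: Turn): Int = t.text.split(" ").size

// Stub summarizer: keeps the first three words of each turn.
fun summarize(turns: List<Turn>): Turn =
    Turn("summary", turns.joinToString("; ") { it.text.split(" ").take(3).joinToString(" ") })

// Repeatedly fold the two oldest turns into one summary turn until the
// history fits the token budget. Recent turns are preserved verbatim.
fun fitToBudget(history: List<Turn>, budget: Int): List<Turn> {
    var turns = history
    while (turns.sumOf { tokenCount(it) } > budget && turns.size > 1) {
        turns = listOf(summarize(turns.take(2))) + turns.drop(2)
    }
    return turns
}
```

Because each fold shrinks the history by one turn, the loop always terminates, and the most recent user turn is the last thing to be summarized away.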
Apache License 2.0 — see LICENSE for details.
This project adapts code from Google AI Edge Gallery and uses the LiteRT-LM library. See NOTICE for attribution.