Skip to content
View wuwangzhang1216's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing
  • San Francisco

Organizations

@Lightning-Goods

Block or report wuwangzhang1216

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
wuwangzhang1216/README.md

Hey, I'm Steve Wu ๐Ÿ‘‹

AI Engineer & LLM Researcher

Building AI-native tools that actually ship. Obsessed with making large language models more useful, more controllable, and more open.


๐Ÿš€ What I'm Working On

  • ๐Ÿ”ฌ Abliterix โ€” Fully automated LLM abliteration framework. LoRA + Optuna TPE optimization, 135+ model configs, 9 peer-reviewed techniques (NeurIPS/ACL/ICLR). 0โ€“1.5% refusal rate with 0.01 KL divergence.
  • ๐Ÿ—„๏ธ OpenDB โ€” AI-native database & long-term memory for AI agents. 93.6% on LongMemEval (#3 on leaderboard), zero embeddings, zero vector DBs โ€” just SQLite FTS5. 12 MCP tools, works with every major agent framework.
  • ๐Ÿฆฌ OpenYak โ€” Open-source local-first AI desktop app supporting 100+ models
  • ๐Ÿง  LLM alignment research โ€” abliteration, representation engineering & steering vectors
  • ๐Ÿ” Vector search engines & RAG pipelines at scale
  • ๐Ÿค– Multi-agent orchestration โ€” built from scratch, no heavy frameworks
  • ๐Ÿ“ AI-native writing & knowledge systems
  • ๐Ÿ”ฌ Publishing fine-tuned & abliterated models on HuggingFace

๐Ÿ›  Tech Stack

Languages
Python C++ Rust Go CUDA TypeScript

AI/ML & Deep Learning
PyTorch Transformers HuggingFace PEFT DeepSpeed vLLM

Research & Techniques
RepEng Abliteration RLHF Quantization FlashAttention KnowledgeDistillation MoE

LLM APIs & Inference
OpenAI Anthropic Ollama

Vector Databases & Retrieval
FAISS pgvector Milvus

System Design & Distributed Systems
Microservices DistributedSystems EventDriven APIDesign Kafka RabbitMQ LoadBalancing Caching Sharding HA Scalability Observability

Full Stack
FastAPI Next.js React PostgreSQL Redis

Infra
Docker Kubernetes AWS Linux


๐ŸŽ“ Background

  • ๐ŸŽ“ Honours BSc in Computer Science & Mathematics, University of Toronto (3.95/4.0)
  • ๐Ÿ’ผ 10+ years in Databases, LLMs & AI Agent Systems

HuggingFace

Pinned Loading

  1. openyak/openyak openyak/openyak Public

    Open-source local-first AI agent for desktop work. No account, no telemetry: use local models with Ollama/Rapid-MLX or bring your own provider key.

    Python 777 56

  2. abliterix abliterix Public

    Automated alignment adjustment for LLMs โ€” direct steering, LoRA, and MoE expert-granular abliteration, optimized via multi-objective Optuna TPE.

    Python 214 40

  3. claude-code-source-all-in-one claude-code-source-all-in-one Public

    Always up-to-date open-source mirror of Claude Code (currently v2.1.123). Run from source with Claude subscription/API, ChatGPT subscription (GPT-5.5 / GPT-5.4), OpenAI-compatible providers, or locโ€ฆ

    TypeScript 79 94

  4. openDB openDB Public

    AI-native local database for files, search, and long-term agent memory.

    Python 79 20

  5. ora ora Public

    Real-time on-device speech translation for macOS. Silero VAD + Qwen3-ASR-1.7B + Qwen3.5 (MLX) on Apple Silicon. No cloud, no API keys, no telemetry.

    Swift 36 4