Zhen Ee
I'm a backend engineer building AI systems. Most of my career has been on distributed services and proxy infrastructure; more recently I've been going deep on local LLM inference, RAG, and computer vision.
// currently (May 2026): shipping the EU AI Footprint Scanner as a GitHub App — first product under Argus Intelligence, for EU SMEs ahead of the 2 August 2026 GPAI deadline. More on /now.
Selected projects
all →
- aegis-rag AI/ML
A modular RAG framework for benchmarking retrieval strategies — vector vs. keyword, with and without rerankers — before anything touches an LLM.
- Local LLM orchestration and benchmarking AI/ML · Systems
Running Qwen and Llama variants on 16GB consumer hardware via llama.cpp, measuring the real trade-off between throughput (tokens per second) and reasoning quality on quantized GGUF models.
Qwen 4B at ~50 TPS on 16GB · 8B hits the swap wall at default quants
- EU AI Footprint Scanner AI/ML · Tools
AST-based static analysis that detects AI/ML library use across a Python codebase, classified into simplified EU AI Act risk tiers. The first product I'm shipping under Argus Intelligence.
- Edge routing proxy Backend · Systems
A multi-layer reverse proxy for upstream-resilient routing — OpenResty, Couchbase, and a TypeScript control plane. Serves ~10M requests/day in production from a 2 vCPU / 2 GB node.
10M req/day in production · 2 vCPU GCP e2-small · 70k concurrent at 100% in load test