Lucky7andOne
AI Apr 18, 2026 9 min read

Running LLMs at the Edge: Production Lessons

Quantization, KV-cache tricks, and how we shipped a 7B model inside a factory gateway.

Aisha Khan
Head of AI