Stitched a small inference cache into Notaify
Spent the morning wiring an in-memory LRU cache in front of the model router so identical stack traces don't re-bill us every time. Saw cost drop visibly by lunch. The boring kind of win that I like the most.