Tag: notes

2 posts in this channel.

Mar 29, 2026 · 22 min read

Why TurboQuant Actually Matters

TurboQuant is interesting because it attacks KV cache pressure and inference memory cost, which are often the real bottlenecks once a model has to serve long contexts in production.

Feb 15, 2026 · 1 min read

It's Alive

A quick note to say the blog is live and I will be posting older notes here over time.