mullama
/blog

notes from the runtime

Engineering notes on local inference, native bindings, GGUF, and what it takes to embed a model in a real application.