Private drafts mirror — not public, not indexed

Local inference · agentic systems · Tartu

I build agentic AI systems on hardware I own.

This is where I write about that work — what I'm testing, what breaks, and what I find. Diary-shaped, evidence-first. No conclusions I haven't run.

4× RTX 6000 Pro Blackwell · 384 GB VRAM · a workshop in Tartu
The rig mid-build: a wooden-frame chassis with a Noctua fan on top, cables everywhere, on a stand in a dark workshop.
where the work happens — Tartu, June 2026

Writing

newest first
Jun 1, 2026 Origin

How this started

Summer 2025, one GPU bought on a hunch, a €9,000 receipt and the regret that came with it — and the project that turned it all into four Blackwells. The build, in photos.

Read the origin story →
May 31, 2026 Validated

Dense is byte-reproducible; MoE flips at near-ties

I went looking for decode speed and found something else: quantized MoE inference loses determinism in proportion to how aggressively you quantize — and a leaked timestamp in my own harness was faking half the signal. A controlled walk across seven configurations.

Read the finding →