Exploring how complex systems can work better.

Practical engineering and thoughtful research on the problems that sit between people and technology.

Research · Open Source · Fractional & Advisory

Current Work

pacabench

Benchmarking LLM agents shouldn't mean wrestling with brittle scripts and lost progress. pacabench is a local-first tool that provides reproducible, reliable benchmarks with isolated execution, persistent state, and built-in metrics tracking—no SDK required.

Working Together

I work with companies at critical technical inflection points.

If you're facing a hard problem at the intersection of technology and organisation, let's talk.

Archive

context-store

Users expect full message history, but LLMs have hard limits. Most teams rebuild the same Redis + Postgres stack; context-store extracts that pattern into a reliable Elixir service with Raft consensus, horizontal scaling, and deterministic compaction.