About me

I’m Miguel Brandão, a Python-focused engineer working on agent evaluation platforms and tasks for large language models (LLMs). Recent work includes:

ControlArena (UK Government): task/setting development with production merges across core and infra repos.
LinuxBench (Redwood Research): platform tooling and task development from project inception (internal/private).
METR: human baselines, task development, and reliability work, acknowledged in HCAST.

Previously, I was a Senior Software Developer at Softinsa–IBM, building backend systems in Python (FastAPI/Flask), Docker, and Kubernetes.

I'm always happy to connect. Feel free to reach out via LinkedIn or email.