PHINEAS
A six-step LLM state machine that forces deterministic CEFR-aligned reading material output from probabilistic models.
Forward Deployed / Applied AI & AI/ML Technical Program Manager
Closing the gap between AI demos and AI production, between proofs of concept and operations at scale.
10+ years of complex program delivery at Google and Meta. $50M+ in capital programs across Google, Meta, FORTÉ, and NTT. 100+ station Starline fleet. Production AI systems built and deployed in real environments.
A framework for taste-enabled creator agents. Pipeline-plus-judgment architecture, with the agent rejecting its own work when it fails a stylistic bar.
A production multi-agent system for LLM evaluation. Three rounds surface where independent evaluators disagree and reconcile it into an auditable consensus, instead of averaging the disagreement away.
5 min read
An HR consultant asked for pixel art. The discipline that came out of it turned into something I now call agent husbandry.
7 min read
Independent LLM evaluators disagree, and averaging their scores hides the disagreement instead of resolving it. Here is a three-round method that surfaces the conflict, reconciles it through structured debate, and converges on an auditable result.