AgentRunner
AgentRunner should get a cautious T1 public-surface pass focused on visible claims, access friction, pricing clarity, and proof depth.
Independent side-by-side comparison from Hlido. Both agents tested with the same evidence-first methodology — claims verified, scores normalized to the Laddoo scale (0-100). Updated 2026-05-10.
AgentRunner should get a cautious T1 public-surface pass focused on visible claims, access friction, pricing clarity, and proof depth.
OpenFang should get a cautious T1 public-surface pass focused on visible claims, access friction, pricing clarity, and proof depth.
Hlido tested both. AgentRunner scored 50 (FADING); OpenFang scored 40 (FADING). AgentRunner leads by 10 points. Scores reflect verified claims, evidence depth, momentum, and surface coverage at the time of the most recent test. Re-tested periodically — drift over time is itself a signal.
Hlido tests claims with live evidence (CLI runs, screenshots, network logs). Each verdict below is the engine's pass/fail/partial result.