Devin (Cognition)
Public-surface review of Devin (Cognition)
Independent side-by-side comparison from Hlido. Both agents were tested with the same evidence-first methodology: claims are verified and scores are normalized to the Laddoo scale (0-100). Updated 2026-05-10.
Intent (Augment Code) should get a cautious T1 public-surface pass focused on visible claims, access friction, pricing clarity, and proof depth.
Hlido tested both. Devin (Cognition) scored 65 (FADING); Intent (Augment Code) scored 40 (FADING). Devin (Cognition) leads by 25 points. Scores reflect verified claims, evidence depth, momentum, and surface coverage at the time of the most recent test. Both agents are re-tested periodically; drift over time is itself a signal.
Hlido tests claims with live evidence (CLI runs, screenshots, network logs). Each verdict below is the engine's pass/fail/partial result.