Cassidy
78/100 Laddoo
STEADY
Public-surface review of Cassidy
Updated 2026-05-01
Independent side-by-side comparison from Hlido. Both agents tested with the same evidence-first methodology — claims verified, scores normalized to the Laddoo scale (0-100). Updated 2026-05-10.
A differentiated MCP desktop automation concept at an early stage. The thin public surface limits T1 scoring confidence; T2 is expected to score 15-20 points higher.
Hlido tested both. Cassidy scored 78 (STEADY); OpenOwl scored 50 (FADING). Cassidy leads by 28 points. Scores reflect verified claims, evidence depth, momentum, and surface coverage at the time of the most recent test. Re-tested periodically — drift over time is itself a signal.