Salus checks your agent's actions, blocking incorrect ones and providing feedback to guide retries
Salus is an API that wraps around your agent and checks its actions at runtime, blocking incorrect ones and providing immediate feedback to guide retries.
We’re Kevin and Vedant, former roommates at Stanford who studied CS, now building Salus full-time.
Your agent processed a refund without looking up the order, costing you thousands. It emailed your top lead using hallucinated data, ruining the deal. You only found out three hours later from a support ticket or an angry email.
Evals, output scoring, and observability are all necessary—but all reactive. They can reduce the likelihood of these problems occurring, but there's no solution that inspects an action as it’s about to execute. Salus does that.
Ask: If you are deploying agents and want to improve their correctness we’d love to help. Book a demo here, and we can onboard you immediately with just a pip install and a few lines of code.
How Salus validates an action:
Self-repairing: When Salus blocks an action, the agent receives structured feedback to guide a retry. In our benchmarks, 58% of blocked actions recover and complete the task correctly.
We tested Salus on τ²-bench and ODCV-Bench. On τ²-bench, agents with Salus follow policies more reliably, at up to 60% lower cost. On ODCV-Bench, Salus reduced misalignment by 52% on average across 12 frontier models (see below).
While we believe the missing piece is runtime validation, we know evals and observability are crucial. That’s why we incorporate all three in one centralized product. Using templates and LLMs, we generate thousands of both adversarial and realistic evals that have full context of your agent’s domain.
Contact us: founders@usesalus.ai