Skip to main content
[RESOURCES]

Benchmarks, guides, and compliance docs.

XOR is the verification platform for AI coding agents. One loop: detect the vulnerability, patch it with an agent, verify the fix, and feed results back so agents learn.

Current verified dataset: 128 vuln samples, 1,920 evaluations across 15 agent configurations. Target scale: 6,138+ vulnerabilities across 250+ projects.

Platform overview: Detect → Patch → Verify → Learn.

One loop. Patch, verify, learn.

OutcomeKnow which agents fix real vulnerabilities before you deploy them.

MechanismXOR detects the vulnerability, dispatches an agent to patch it, writes a verifier, confirms the fix, and feeds results back into the agent harness.

ProofCurrent verified dataset: 128 vuln samples, 1,920 evaluations.

75 resources across 9 topic clusters