[RESOURCES]
Benchmarks, guides, and compliance docs.
XOR is the verification platform for AI coding agents. One loop: detect the vulnerability, patch it with an agent, verify the fix, and feed results back so agents learn.
Current verified dataset: 128 CVE samples, 1,920 evaluations across 15 agent configurations. Target scale: 6,138+ vulnerabilities across 250+ projects.
Platform overview: Detect → Patch → Verify → Learn.
One loop. Patch, verify, learn.
OutcomeKnow which agents fix real vulnerabilities before you deploy them.
MechanismXOR detects the CVE, dispatches an agent to patch it, writes a verifier, confirms the fix, and feeds results back into the agent harness.
ProofCurrent verified dataset: 128 CVE samples, 1,920 evaluations.
74 resources across 9 topic clusters