Data Compressor in Vulnerability-Agent-Bench — 5 vulnerabilities tested

5 vulnerability samples from a data compressor, generating 75 evaluations across 15 agents.

Overview

This data compressor is a high-performance compression library used by HDF5, Zarr, and scientific computing frameworks to compress large datasets. The library implements multiple compression algorithms and is optimized for throughput, handling gigabytes of data efficiently. Compression algorithms are complex state machines with tight loop performance requirements, creating tension between safety and speed.

Benchmark coverage

5 vulnerability samples from this data compressor are included in Vulnerability-Agent-Bench, generating 75 individual evaluations across 15 agent configurations. These samples focus on buffer overflow vulnerabilities in decompression and integer overflow bugs in size calculations.

Vulnerability classes

Data compressor samples cover vulnerability patterns in high-performance data transformation:

Heap buffer overflows during decompression when output size estimates are too small
Integer overflow in compression buffer allocation where size calculations wrap around
Out-of-bounds writes in decompression loops when boundary conditions are not checked
Buffer underflow in format parsing where read pointers are not bounds-checked
Resource exhaustion where decompression of crafted data triggers excessive memory allocation
Null pointer dereference when compression codec parameters are invalid or missing

Why data compressor bugs are interesting for agent evaluation

Data compressor vulnerabilities test an agent's ability to understand compression algorithm implementation and memory safety during data transformation. The codebase requires careful handling of compressed data streams and format validation. Bugs often involve boundary conditions between compression blocks or incorrect size calculations in decompression loops. Agents must generate fixes that validate compressed data correctly while preserving the performance characteristics that make the library valuable in scientific computing.

Scientific computing frameworks often process untrusted data files, and decompression bugs can lead to memory corruption that silently corrupts numerical results. This makes decompression one of the most security-critical but often overlooked components in data processing pipelines.

Agent performance on data compressor

Per-project performance data is not yet published. Overall agent performance is available at the full results page, where you can view pass rates and costs by agent. The benchmark methodology explains how agents were evaluated.

Codebases with similar compression and algorithm-heavy code:

Archive Library, archive format parsing with embedded compression handling
Image Codec, image codec with complex decoding pipelines
Git Library, Git object compression with variable-length encoding

Explore more

Full benchmark results
Agent profiles
Methodology
Economics analysis, cost per verified patch

FAQ

What does data compressor testing reveal about agents?

Compression libraries require understanding compression algorithms and memory safety in decompression. 5 samples test agents on data transformation and boundary condition handling.

Benchmark Results

62.7% pass rate. $2.64 per fix. Real data from 1,920 evaluations.

Benchmark Methodology

How XOR benchmarks AI coding agents on real security vulnerabilities. Reproducible, deterministic, and transparent.