A new framework measures the value of AI by how much money a human would demand to do a job without it.
April 26, 2026
Original Paper
WAGE-Bench: Measuring the Economic Value of AI in Real Work
SSRN · 6605518
The Takeaway
AI evaluation is moving away from abstract benchmarks toward a concrete metric based on human hourly wages. This system calculates the actual monetary productivity boost an AI provides for a specific task. Most tests focus on whether an AI can pass a bar exam or a math quiz. This framework measures the real-world economic worth of the tool in a professional setting. It provides companies with a clear way to see if an AI investment is actually paying for itself.
From the abstract
This paper introduces WAGE-Bench, a new framework for evaluating artificial intelligence based on the practical economic value it creates in human work. Standard benchmarks measure technical capability but reveal little about how much AI actually reduces the cost of completing real tasks. WAGE-Bench addresses this gap by eliciting individuals' willingness to accept compensation for performing realistic tasks under alternative AI conditions-without AI, with AI assistance, or with alternative mode