Inference cost
The compute cost of running a trained model to generate outputs.
Inference cost is the dominant component of enterprise AI operating expense — typically 70–90% of the total AI bill. It scales linearly with tokens consumed and is the primary lever for cost control.