Cloud bills vs API bills

At what request rate does renting GPUs beat paying per token? Drag the Your workload slider to see live cost at your planned scale.

Model & API pricing

Model preset
Pick a preset or switch to Custom to enter your own prices.

Request shape

1 256
0.1 32

Cloud GPU rates

Cloud provider preset
Edit the table below to match your contract.

Your workload

0 5

Break-even points