I Tracked LLM Pricing for 8 Weeks. Here's What the Data Shows.
Real pricing data from 1,111 models across 33 providers, collected daily since April 24, 2026.
The Problem That Started This
A few months ago, I was building an AI product and hit a wall: Which LLM should we use?
The question sounds simple. But the pricing landscape for AI models shifts constantly and without warning. OpenAI has 69 models listed on their pricing page. Google has 18. Anthropic launched three new Claude versions in the span of weeks. By the time I had compared options and made a decision, I had no confidence the numbers were still current.
I checked pricing pages. I checked docs. I found no single place that told me:
- What did this cost last week vs. today?
- Which providers actually changed their prices recently?
- How wide is the spread for comparable models?
So I automated it. Starting April 24, 2026, I built a daily scraper that pulls pricing data from every major provider. Here is what 8 weeks of data actually shows.
Finding 1: The Big Providers Did Not Move
The headline finding is not dramatic: Anthropic, OpenAI, Google, Mistral, and xAI all held their prices flat for the entire 59-day period. Not a single price change detected across those five providers.
That is useful information on its own. If you are budgeting around these providers, your cost model from April is still accurate today.
Here is the current pricing snapshot for the most-used models, accurate as of June 22:
| Model | Input / 1M tokens | Output / 1M tokens | Note |
|---|---|---|---|
| openai / gpt-4-turbo | $10.00 | $30.00 | Stable since Apr 24 |
| openai / gpt-4o (chatgpt-4o-latest) | $5.00 | $15.00 | Stable since May 12 |
| anthropic / Claude Sonnet 4.5 | $3.00 | $15.00 | Stable since Apr 24 |
| anthropic / Claude Opus 4.5 | $5.00 | $25.00 | Stable since Apr 24 |
| anthropic / Claude Haiku 4.5 | $1.00 | $5.00 | Stable since Apr 24 |
| google / Gemini 2.5 Pro | $1.25 | $10.00 | Stable since Apr 26 |
| google / Gemini 2.5 Flash | $0.30 | $2.50 | Stable since Apr 26 |
| google / Gemini 2.0 Flash | $0.10 | $0.40 | Stable since Apr 26 |
| mistral / Codestral | $1.00 | $3.00 | Stable since Apr 24 |
| xai / Grok 4.3 | $1.25 | $2.50 | Stable since May 16 |
Finding 2: The Pricing Spread is Enormous
The bigger story is not volatility. It is the 600x price range that now exists across the tracked catalog, from sub-cent inference to frontier flagship models.
| Model | Input / 1M tokens | Output / 1M tokens | Note |
|---|---|---|---|
| groq / llama-3.1-8b-instant | $0.05 | $0.08 | Cheapest tracked |
| deepseek / deepseek-v4-flash | $0.14 | $0.28 | |
| google / Gemini 2.0 Flash | $0.10 | $0.40 | |
| deepseek / deepseek-v4-pro | $0.44 | $0.87 | |
| xai / Grok 4.3 | $1.25 | $2.50 | |
| anthropic / Claude Sonnet 4.5 | $3.00 | $15.00 | |
| openai / gpt-4-turbo | $10.00 | $30.00 | |
| openai / gpt-5.4-pro | $30.00 | $180.00 | Highest-cost frontier tracked |
Groq's Llama 3.1 8B costs $0.05 per million input tokens. GPT-4 Turbo costs $10.00 (200x). GPT-5.4 Pro costs $30.00 (600x). All three are in our dataset today. DeepSeek V4 Pro at $0.44 per million input tokens consistently ranks near the top on reasoning benchmarks at a fraction of the flagship price.
The question teams should be asking is not just “which model is best?” but “which model is best for this specific task given the cost?” For many production workloads, the answer is not the model you started with.
Finding 3: All 5 Price Changes Came from One Provider
Over 59 days, our scraper detected 5 pricing changes across all 1,111 models. Every single one was on Together AI, a hosting aggregator that runs third-party open-source models.
| Date | Model | Before | After | Change |
|---|---|---|---|---|
| May 27 | qwen37-max | $2.50 / $7.50 | $1.25 / $3.75 | -50% |
| Jun 2 | Qwen3.5-9B | $0.10 / $0.15 | $0.17 / $0.25 | +70% |
| Jun 2 | Llama-3.3-70B-Turbo | $0.88 / $0.88 | $1.04 / $1.04 | +18% |
| Jun 2 | Meta-Llama-3-8B-Lite | $0.10 / $0.10 | $0.14 / $0.14 | +40% |
| Jun 17 | DeepSeek-V4-Pro | $2.10 / $4.40 | $1.74 / $3.48 | -17% / -21% |
The biggest move: Qwen37-max dropped 50% overnight on May 27, with no public announcement that reached mainstream channels. Teams running that model through Together AI saw their costs cut in half. Teams not monitoring pricing had no idea.
Three models saw price increases on June 2: Qwen3.5-9B (+70%), Llama-3.3-70B-Turbo (+18%), and Meta-Llama-3-8B-Lite (+40%). Not all pricing movement favors the buyer.
Finding 4: No Provider Notifies You Directly
Across all 5 detected changes, the announcement path was the same: nothing sent directly to users. One change (DeepSeek-V4-Pro via Together) appeared in a provider blog post days later. The rest were silent.
Compare that to how other infrastructure pricing works:
- AWS sends email notifications for pricing changes to affected customers
- Stripe gives 30 days notice before any fee changes
- Twilio posts a public changelog and emails account owners
LLM providers are operating more like spot markets than enterprise software. Prices move when they move. Most teams find out on their invoice.
What I Built
After a few weeks of running this manually, I automated it and opened it up. Token Prices tracks 1,100+ models across 33 providers daily and surfaces changes the moment they happen.
The tool gives you:
- 1A live pricing dashboard across all tracked providers and models
- 2A price change feed showing every detected move with before/after prices
- 3Historical data going back to April 24, 2026 (the full dataset)
- 4A REST API so you can pull pricing into your own FinOps tooling
The free tier covers the 10 major providers. Paid plans add historical depth, more providers, and API access. tokenprices.io
Open Questions
- 1.Are there pricing changes I missed? If you spotted a move that did not show up in this data, I want to know.
- 2.Which providers should I add next? The scraper can cover more platforms. What is on your list?
- 3.How do you handle model selection today? Dashboard? Spreadsheet? Gut feeling?
- 4.What would make this data actionable for your team? Alerts? Cost projections? A comparison view?
Let us know at support@tokenprices.io