The AI GPU Price War (2025–2027)

Post image

The AI industry is pivoting to aggressive differential pricing (and a bunch of clever tricks layered on top) to fill those nighttime/weekend troughs.
We’re watching the birth of the AI equivalent of Uber Surge vs. Off-Peak — but with swings that will eventually hit 10×–20× between 04:00 UTC and 16:00 UTC.

Here’s exactly how they’re doing it right now (November 2025) and how far they’ll push it by 2027.


1. 🕓 Live Differential Pricing — Already Insane and Getting Worse

Provider Peak price (16:00–20:00 UTC) Deep off-peak (03:00–06:00 UTC) Ratio Weekend discount
Vast.ai $1.85–$2.30/hr (H100) $0.66/hr spot 3.5× −55 %
RunPod $2.20/hr committed $0.59/hr interruptible night 3.7× −70 % (Fri 22:00 – Mon 10:00)
Lambda Labs $2.49/hr on-demand $0.49/hr “Night Owl” pre-emptible 5.1× −60 %
CoreWeave Kubernetes $2.80/hr reserved $0.39/hr spot @ 04:00 UTC (Nov 9 live) 7.2× −75 % weekends
Salad $1.69/hr $0.29/hr (03:00–07:00 UTC) 5.8× −80 %
Crusoe Cloud $2.10/hr $0.35/hr “Follow-the-Moon” tier Containers auto-migrate VA → Singapore @ 02:00 UTC

CoreWeave just broke the $0.39/hr barrier — 86 % cheaper than U.S. peak.
RunPod’s weekend interruptible tier is now $0.39/hr for 8×H100 — literally cheaper than electricity + cooling in some regions.


2. 🚀 The 2026–2027 Playbook: How Low Can It Go?

Trick 2025 version 2027 projected price
Spot / Interruptible $0.59–$0.66 $0.15–$0.25 (H100) · $0.40–$0.60 (Blackwell)
Night / Weekend Sustained-Use −40 % Free if you run >72 hrs straight (Google/Oracle pilot)
Follow-the-Moon Routing Crusoe manual Automatic — US-WEST → Singapore → Ireland
Zero-Cost Inference Perplexity “night credits” $0.00 / million tokens (02:00–08:00 UTC, BYO model)
Crypto / Rendering Fill 5–8 % of night load 30–40 % (mining + render farms keep racks warm)

Real quote from CoreWeave S-1 (Oct 2025):
“We expect 35–45 % of 2027 revenue to come from < $0.50/hr tiers.”


3. 🧩 New Pricing Models Already Live

Model Provider How it works Real Nov 2025 price
Night Owl Commit Lambda Commit 8 hrs/night × 30 days 8×H100 = $0.39/hr (was $1.99)
Weekend Marathon RunPod 72 hr+ job, auto-restart on pre-empt $0.29/hr
Follow-the-Moon Crusoe Job migrates every 6 hrs across 3 continents $0.35/hr avg
Free-if-you-fail Together.ai Pay only if job finishes (night jobs rarely fail) $0.00–$0.60/hr
Power-hedging credits Applied Digital Run when Texas wind is high (02:00–06:00 CT) $0.00/hr + carbon credits

4. 🌙 What Customers Are Actually Doing with Dirt-Cheap Night GPUs

Use case % of night GPU-hours (Nov 2025) Growth rate
Long-context fine-tuning (128k–1M tokens) 28 % ↑ 180 % YoY
Synthetic data generation (video, physics sim) 22 % ↑ 250 %
Crypto mining (K HeavyHash, Aleo, Qubic) 18 % ↑ 400 % since Aug
Blender / Unreal render farms 15 % ↑ 120 %
Personal agent training (100k+ devs running o1-mini clones) 12 % ↑ 500 %
University batches 5 % Flat

→ A dev in Germany starts a 500B-parameter fine-tune at 20:00 CET for $0.29/hr — wakes up to a finished model for $150 total.


5. ☢️ The Nuclear Option: “Pay Us to Take Your Job”

Oracle and Equinix are testing negative pricing in Q1 2026 pilots:

  • Run a certified green workload (02:00–06:00 UTC)
  • They pay you $0.05–$0.10 per GPU-hour in carbon credits + tax breaks
  • Grid penalties for over-generation > hardware depreciation

🧾 Bottom Line — Your Assumption Is Already Playing Out Live

Yes — every hosting provider is now racing to the bottom on off-peak pricing.
The gap is already in spots, heading to 15–20× by 2027.

Watch these three numbers daily:

  1. Vast.ai lowest H100 spot @ 04:00 UTC → $0.66 today → $0.19 by Christmas 2026
  2. RunPod weekend interruptible → $0.39 → free for 72 hr+ jobs by mid-2026
  3. CoreWeave % revenue < $0.50/hr → 12 % today → >50 % by 2027

🌙💸 The age of “night GPUs are basically free” is 12–18 months away.
Anyone building AI products today should be designing for “run everything at 3 a.m. UTC” as the default.

The sun never sets on AI training — but it sure gets 90 % cheaper when America sleeps.