EU data residency · zero retention

Unlimited AI inference.
One flat price.
Hosted in Europe.

A drop-in replacement for the OpenAI and Anthropic APIs, running near-SOTA open weights on our own GPUs in Finland. €20 a month, flat — no per-token billing, no data retention. We pin one model, measure how it behaves, and publish the results — failures included. Point your existing tools at it and keep working.

Get started — €20/month Read the manifesto →
# Drop-in replacement. Same endpoints, same tools.
from openai import OpenAI
client = OpenAI(
  base_url="https://api.affordableai.eu/v1"
)

# Claude Code, Cursor, Continue, aider — all compatible
export ANTHROPIC_BASE_URL="https://api.affordableai.eu"
Powered by NVIDIA B300 EU data centres · Finland SOC 2 Type II infra 100% renewable energy MIT-licensed open weights
Capabilities

Frontier AI without the meter running.

Everything a developer needs, priced like a utility — and built for teams that care where their data lives.

Zero retention

Prompts and completions live only in GPU memory. Nothing is written to disk or logged.

No token billing

Twenty euros a month, used as much as fair-use allows. No per-token meter, and no surprise on the invoice at the end of the month.

Drop-in replacement

The same endpoints your tools already speak: OpenAI SDKs, Cursor, Claude Code, Continue, aider. Change the base URL, nothing else.

One million token context

Entire codebases, long histories, and large documents in one session. Hybrid attention keeps it practical, and flat pricing means a long context costs the same as a short one.

Fast at any load

A single B300 holds sub-second time-to-first-token even under concurrent load. Speculative decoding and KV caching keep decode fast as users pile on.

Streaming by default

Tokens stream as they're generated over server-sent events. No polling, no batch waits.

Why this model

DeepSeek V4 Flash vs frontier models.

Published benchmarks from the HuggingFace model card, V4 Flash in Max reasoning mode (no external tools) against the leading closed-source models of 2026. Source: DeepSeek V4 Flash.

BenchmarkV4 Flash (Max)Opus 4.8GPT-5.5
LiveCodeBench91.688.8
GPQA Diamond88.193.693.6
HLE34.849.841.4
SWE Verified79.088.6

V4 Flash leads on code generation: its LiveCodeBench score (91.6) tops both Opus 4.8 and GPT-5.5. On the broader reasoning benchmarks it trails the newest frontier models, but it's a 284B-parameter mixture-of-experts with just 13B active, so it runs on a single GPU and costs a fraction per token.

It's MIT-licensed with open weights, so there's no vendor lock-in: the same model that runs DeepSeek's own API, served from our GPUs in Finland instead of theirs.

Performance

Faster than the official API.

We benchmarked a single B300 against DeepSeek's own API on identical weights:

175 tok/s
Single-user decode
12,300 tok/s
Aggregate @ 256 concurrent
48 tok/s/user
At quality ceiling
86 ms
TTFT with cached prefix

Measured on a single NVIDIA B300, June 2026. Full config and data →

Why Europe got AI wrong

Three reasons the EU needs a different approach.

1. Europe can't out-train Silicon Valley

The US hosts ~75% of the world's AI supercomputer capacity; the EU holds under 5% (Epoch AI, 2025). America's largest clusters are scaling past a gigawatt; xAI's Colossus alone runs 200,000 H100s at 300 MW. Meanwhile, OpenAI raised $122 billion in a single round (March 2026) — more than the EU's entire InvestAI target, most of which is reallocated from existing programmes. In May 2026, Mistral's CEO told the French National Assembly Europe has two years before it becomes a US "vassal state." Training models from scratch is a game Europe already lost. The opening is deployment: run the best open-weight models on European GPUs and compete on operations, price, and trust.

2. Token billing makes AI a luxury good

Per-token pricing turns a developer tool into a budget line that gets capped and cut. Teams are rationing prompts and restricting access after burning through annual budgets in months, and some startups now exist only to track and reduce token spend. Inference should be priced like a utility, not metered like a luxury.

3. US-hosted models are one directive away from disappearing

On June 12, 2026, the US issued its first-ever export control on AI models, forcing Anthropic to disable Claude Fable 5 and Mythos 5 for all foreign users overnight. This wasn't a chip restriction — it was a deployed model pulled by government order. Europe already relies on non-EU providers for over 80% of its digital infrastructure. Any application built on US-hosted AI is one directive away from going dark. If your inference runs outside the EU, you don't control it.

  • EU compute only
    Finland and Germany. No third-country transfers, and no US export controls apply.
  • Flat price, unlimited use
    €20/month, no token meter. Use as much as fair-use allows.
  • Best open weights, zero lock-in
    MIT-licensed weights, public and portable. The infrastructure is ours.
  • EU AI Act ready
    Downstream provider under Article 50 transparency rules. Not high-risk. Full compliance page.
Pricing

One plan. Every feature.

Developer
€20/mo

Everything included. No surprises.

  • DeepSeek V4 Flash
  • 1M token context window
  • API + all tool integrations
  • Up to 10 concurrent requests
  • No token billing
  • Email support
Get early access
Teams · 5+ seats
€16/seat

Volume pricing for engineering teams.

  • Everything in Developer
  • Centralised billing
  • Usage dashboard
  • Priority routing
  • Dedicated support
Contact us

One email when we launch. That's it.

hi@affordableai.eu

Who runs this.

EU
Independently operated · Dutch BV

AffordableAI is operated by a privately held Dutch company (BV). Bootstrapped and independently funded, with no outside investors. The infrastructure is built and run in Europe.

🇪🇺 EU-operated🇳🇱 NetherlandsBootstrappedIndependent