Skip to main content

Risk Management

Risk Matrix

RiskProbabilityImpactMitigation
OpenRouter price increaseMediumMediumMulti-provider routing, DashScope fallback
OpenRouter rate limitsLowHighAggressive caching, tier upgrade
Vast.ai interruption (Phase 4)MediumMediumOpenRouter as automatic fallback
Token costs out of controlMediumHighHard budget cap, alerts, prompt caching
EC2 downtimeLowHighCloudWatch auto-recovery + Lambda
Low user adoptionMediumHighIterate on use cases, pivot pricing, open-source core
Provider API breaking changesMediumMediumProvider abstraction layer isolates impact

Break-Even Analysis

Self-hosted H200 becomes cheaper than OpenRouter when:

Monthly OpenRouter spend > 545 EUR (Vast.ai GPU cost)

With Qwen3 30B at $0.08/$0.28 per 1M tokens, this means:

  • ~7.8 billion input tokens/month at pure cost parity
  • ~260 million tokens/day — enterprise-level usage

Conclusion: The switch to self-hosted is driven by capabilities (fine-tuning, privacy, latency) not by token cost savings alone.

Financial Summary

Phase 1 (M1-2)      Phase 2 (M2-4)      Phase 3 (M4-6)      Phase 4 (M6+)
42 EUR/mo 42-100 EUR/mo 100-300 EUR/mo 625+ EUR/mo

Break-even: Break-even: Break-even: Break-even:
5 users @ 10 EUR 15 users or Revenue > 300 EUR Revenue > 1000 EUR
2-3 Pro @ 100 EUR