Routeur 9
7.8

Routeur 9

  • Arrêtez de gaspiller votre budget API — Optimisez le routage, codez plus longtemps.
  • L'open-source AI Un proxy qui assure le fonctionnement continu de votre environnement de développement.

9Router Key Insights

Modèle de tarification : Gratuit (Open Source)
Niveau gratuit : Oui  
Marqué comme : AI Gateway / LLM Router / Token Saver
Prix: $0
Multi-Provider Routing:
3-Tier Smart Fallback:
RTK Token Saver:
Caveman Mode:
OpenAI-Compatible Endpoint:
Format Translation:
Multi-Account Round Robin:
Real-Time Quota Tracking:
OAuth Auto Token Refresh:
Custom Model Combos:
Usage Analytics Dashboard:
Ollama Support:
Context Window Transparency:

What is 9Router?

Routeur 9

Routeur 9 is a free, open-source local AI proxy and token optimisation gateway built for developers who use AI outils de codage like Claude Code, Cursor, Codex, Cline, and GitHub Copilot. It sits between your CLI tools and over 40 upstream LLM providers, exposing a single OpenAI-compatible endpoint at localhost:20128/v1. Its core function is to eliminate coding downtime caused by quota exhaustion or rate limits through a 3-tier automatic fallback system that cascades from premium subscriptions to budget API tiers to genuinely free providers. 

On top of routing, its built-in RTK Token Saver compresses tool outputs such as git diff and log dumps before they reach the LLM, cutting input token costs by 20 to 40 percent on every single request. For teams and solo developers tired of mid-session interruptions and runaway API bills, 9Router is the infrastructure layer that keeps the code flowing at near-zero cost.

Key Features of 9Router
RTK Token Saver — Compress Before You Send
RTK Token Saver 9Router

Routeur 9's RTK (Rust Token Kompressor, ported to JavaScript) intercepts every tool_result in your prompt before it is dispatched to the LLM. It peeks at the first kilobyte of data, identifies the content type (git diff, log dump, file tree, grep output), and applies targeted lossless filters. The result is 20 to 40 percent fewer input tokens per request with absolutely no loss of context. If a filter fails or makes output larger, RTK silently discards it so your request is never broken.

Smart 3-Tier Fallback — Zero Downtime Routing

The fallback engine is the beating heart of 9Router. You configure a “Combo” that chains up to five model tiers: your active subscription (e.g. Claude Code Pro at $20 per month), a cheap backup (e.g. GLM at $0.60 per million tokens), and a free unlimited provider (e.g. Kiro AI). The instant a rate-limit or quota-exhausted error is detected, 9Router catches it silently and re-routes the exact same request to the next tier. Your coding tool never sees the switch.

Caveman Mode — Slash Output Token Costs
Caveman Mode 9Router

Inspired by the viral “why use many token when few token do trick” prompt by Julius Brussee, Caveman Mode injects a system-level instruction that forces the LLM to respond in terse, stripped-down language. The technical substance of the reply is preserved in full, but verbose filler is dropped, delivering up to 65 percent fewer output tokens. For high-volume agentic coding workflows, this alone can eliminate a meaningful chunk of monthly API expenditure.

Format Translation Engine — One Endpoint for All Tools
Format Translation Engine 9Router

9Router acts as a universal protocol translator. Your CLI tool sends a standard OpenAI JSON request to localhost, and 9Router unpacks it, restructures it into the native format of the destination provider (Claude, Gemini, Vertex, Kiro, Cursor), fires it off, then translates the response back into OpenAI format before handing it to your tool. The result is that Claude Code, Codex, Cline, Roo, and 12 other supported CLI tools all route through a single configuration point.

Real-Time Quota Tracking and Multi-Account Support

Routeur 9's dashboard provides live token consumption data, reset countdowns (5-hour, daily, weekly), and per-model cost estimation. Multi-account round-robin per provider lets you load-balance across multiple accounts so one hitting its limit does not stall the entire stack. OAuth tokens are refreshed automatically in the background so there is no manual re-authentication during active sessions.

Free Provider Ecosystem — Genuine Zero-Cost AI

9Router ships with first-class support for three genuinely free providers: Kiro AI (free unlimited Claudie 4.5, GLM-5, MiniMax via AWS/Google OAuth), OpenCode Free (no auth, models auto-fetched), and Vertex AI ($300 Google Cloud credits for new accounts). Combined with RTK, a developer using only the free tier pays exactly $0 per month while still accessing production-grade models like Claude Sonnet 4.5.

9Router Pricing Plans

NiveauCoût (USD)Ce que vous obtenez
9Router Software$ 0 pour toujoursFull proxy, all features, open-source MIT licence
Free Providers (Kiro, OpenCode)$0Unlimited Claude 4.5, GLM-5, MiniMax, no API key
IA des sommets$0 (new GCP: $300 credit)Gemini 3 Pro, DeepSeek, GLM-5 via Google Cloud
Cheap Backup (GLM-5.1)0.60 $ pour 1 M de jetonsDaily reset, great for overflow routing
Cheap Backup (MiniMax M2.7)0.20 $ pour 1 M de jetons5-hour reset, cheapest per-token option
Kimi K2.5 Flat9 $ par mois10M tokens monthly at $0.90 per 1M effective
Claude Code Pro/Max20 $ à 200 $ par moisPremium subscription maximised via 9Router
Copilote GitHub10 $ à 19 $ par moisRouted via MITM for model flexibility

Deployment Flexibility — Local, VPS, Docker

9Router is not locked to a single machine. For shared teams or remote workflows, it deploys on any VPS with a straightforward npm run build && npm run start process and a handful of environment variables. Docker images are published to both Docker Hub (decolua/9router) and GitHub Container Registry for multi-platform linux/amd64 and linux/arm64 support. 

Cloudflare Tunnel integration means remote tools like Cursor on a laptop can route through a server-hosted 9Router instance without opening firewall ports. The SQLite database persists all provider configs, combos, and usage history via a mounted volume.

Avantages et inconvénients

Avantages
  • Free forever, MIT licence.
  • 20 to 40% token savings via RTK.
  • 65% output token reduction with Caveman.
  • 12 CLI tools supported natively.
  • Genuine zero-cost free provider support.
  • Docker, VPS, and Cloudflare deployment ready.
Inconvénients
  • Requires Node.js setup (not GUI-only).
  • No per-model latency benchmarking dashboard.
  • Some free providers (iFlow, Qwen) discontinued in 2026.

Best 9Router Alternatives

AI Gateway / LLM Router / Token SaverMulti-Provider FallbackToken Optimisation
OmniRoute4-tier, 36+ providers, TypeScriptSemantic cache only
LiteLLM100+ providers, enterprise focusNo built-in compression
OuvrirRouterCloud-based, no local installNo token saving features
LobeChat GatewayLimited provider listNo RTK-equivalent
Verdict: 9Router wins on local-first, zero-cost token compression and free tier depth.pythonlibraries.
  • Get unlimited free AI coding with smart fallback routing
  • Gratuit
  • chaque AI coder needs this local token saver
7.0
Sécurité de la plateforme
9.0
Sans risque et remboursement
8.0
Services et fonctionnalités
7.0
Assistance Clients
7.8 Note générale

Laissez un commentaire

Votre adresse courriel n'apparaitra pas. Les champs obligatoires sont marqués *

Ce site utilise Akismet pour réduire les spams. Découvrez comment vos données de commentaire sont traitées.

Routeur 9
7.8/10
© Copyright 2023 - 2026 | Devenez un AI Pro | Fait avec ♥