9Router
7.8

9Router

  • Fermate l'emorragia di budget API: instradate in modo più intelligente, scrivete più a lungo.
  • L'open-source AI Un proxy che mantiene il tuo stack di sviluppo attivo 24 ore su 24.

9Router Key Insights

Modello di tariffazione: Gratuito (Open Source)
Livello gratuito: Si  
Contrassegnato come: AI Gateway / LLM Router / Token Saver
Prezzo: $0
Multi-Provider Routing:
3-Tier Smart Fallback:
RTK Token Saver:
Caveman Mode:
OpenAI-Compatible Endpoint:
Format Translation:
Multi-Account Round Robin:
Real-Time Quota Tracking:
OAuth Auto Token Refresh:
Custom Model Combos:
Usage Analytics Dashboard:
Ollama Support:
Context Window Transparency:

What is 9Router?

9Router

9Router is a free, open-source local AI proxy and token optimisation gateway built for developers who use AI strumenti di codifica like Claude Code, Cursor, Codex, Cline, and GitHub Copilot. It sits between your CLI tools and over 40 upstream LLM providers, exposing a single OpenAI-compatible endpoint at localhost:20128/v1. Its core function is to eliminate coding downtime caused by quota exhaustion or rate limits through a 3-tier automatic fallback system that cascades from premium subscriptions to budget API tiers to genuinely free providers. 

On top of routing, its built-in RTK Token Saver compresses tool outputs such as git diff and log dumps before they reach the LLM, cutting input token costs by 20 to 40 percent on every single request. For teams and solo developers tired of mid-session interruptions and runaway API bills, 9Router is the infrastructure layer that keeps the code flowing at near-zero cost.

Key Features of 9Router
RTK Token Saver — Compress Before You Send
RTK Token Saver 9Router

9Router's RTK (Rust Token Kompressor, ported to JavaScript) intercepts every tool_result in your prompt before it is dispatched to the LLM. It peeks at the first kilobyte of data, identifies the content type (git diff, log dump, file tree, grep output), and applies targeted lossless filters. The result is 20 to 40 percent fewer input tokens per request with absolutely no loss of context. If a filter fails or makes output larger, RTK silently discards it so your request is never broken.

Smart 3-Tier Fallback — Zero Downtime Routing

The fallback engine is the beating heart of 9Router. You configure a “Combo” that chains up to five model tiers: your active subscription (e.g. Claude Code Pro at $20 per month), a cheap backup (e.g. GLM at $0.60 per million tokens), and a free unlimited provider (e.g. Kiro AI). The instant a rate-limit or quota-exhausted error is detected, 9Router catches it silently and re-routes the exact same request to the next tier. Your coding tool never sees the switch.

Caveman Mode — Slash Output Token Costs
Caveman Mode 9Router

Inspired by the viral “why use many token when few token do trick” prompt by Julius Brussee, Caveman Mode injects a system-level instruction that forces the LLM to respond in terse, stripped-down language. The technical substance of the reply is preserved in full, but verbose filler is dropped, delivering up to 65 percent fewer output tokens. For high-volume agentic coding workflows, this alone can eliminate a meaningful chunk of monthly API expenditure.

Format Translation Engine — One Endpoint for All Tools
Format Translation Engine 9Router

9Router acts as a universal protocol translator. Your CLI tool sends a standard OpenAI JSON request to localhost, and 9Router unpacks it, restructures it into the native format of the destination provider (Claude, Gemini, Vertex, Kiro, Cursor), fires it off, then translates the response back into OpenAI format before handing it to your tool. The result is that Claude Code, Codex, Cline, Roo, and 12 other supported CLI tools all route through a single configuration point.

Real-Time Quota Tracking and Multi-Account Support

9Router's dashboard provides live token consumption data, reset countdowns (5-hour, daily, weekly), and per-model cost estimation. Multi-account round-robin per provider lets you load-balance across multiple accounts so one hitting its limit does not stall the entire stack. OAuth tokens are refreshed automatically in the background so there is no manual re-authentication during active sessions.

Free Provider Ecosystem — Genuine Zero-Cost AI

9Router ships with first-class support for three genuinely free providers: Kiro AI (free unlimited Claudio 4.5, GLM-5, MiniMax via AWS/Google OAuth), OpenCode Free (no auth, models auto-fetched), and Vertex AI ($300 Google Cloud credits for new accounts). Combined with RTK, a developer using only the free tier pays exactly $0 per month while still accessing production-grade models like Claude Sonnet 4.5.

9Router Pricing Plans

FilaCosto (USD)What You Get
9Router Software$ 0 per sempreFull proxy, all features, open-source MIT licence
Free Providers (Kiro, OpenCode)$0Unlimited Claude 4.5, GLM-5, MiniMax, no API key
Vertice AI$0 (new GCP: $300 credit)Gemini 3 Pro, DeepSeek, GLM-5 via Google Cloud
Cheap Backup (GLM-5.1)$ 0.60 per 1 milione di tokenDaily reset, great for overflow routing
Cheap Backup (MiniMax M2.7)$ 0.20 per 1 milione di token5-hour reset, cheapest per-token option
Kimi K2.5 Flat$ 9 al mese10M tokens monthly at $0.90 per 1M effective
Claude Code Pro/Max$ 20– $ 200 al mesePremium subscription maximised via 9Router
Copilota GitHub$ 10– $ 19 al meseRouted via MITM for model flexibility

Deployment Flexibility — Local, VPS, Docker

9Router is not locked to a single machine. For shared teams or remote workflows, it deploys on any VPS with a straightforward npm run build && npm run start process and a handful of environment variables. Docker images are published to both Docker Hub (decolua/9router) and GitHub Container Registry for multi-platform linux/amd64 and linux/arm64 support. 

Cloudflare Tunnel integration means remote tools like Cursor on a laptop can route through a server-hosted 9Router instance without opening firewall ports. The SQLite database persists all provider configs, combos, and usage history via a mounted volume.

Pro e contro

Pro
  • Free forever, MIT licence.
  • 20 to 40% token savings via RTK.
  • 65% output token reduction with Caveman.
  • 12 CLI tools supported natively.
  • Genuine zero-cost free provider support.
  • Docker, VPS, and Cloudflare deployment ready.
Contro
  • Requires Node.js setup (not GUI-only).
  • No per-model latency benchmarking dashboard.
  • Some free providers (iFlow, Qwen) discontinued in 2026.

Best 9Router Alternatives

AI Gateway / LLM Router / Token SaverMulti-Provider FallbackToken Optimisation
omniRoute4-tier, 36+ providers, TypeScriptSemantic cache only
LiteLLM100+ providers, enterprise focusNo built-in compression
OpenRouterCloud-based, no local installNo token saving features
LobeChat GatewayLimited provider listNo RTK-equivalent
Verdetto: 9Router wins on local-first, zero-cost token compression and free tier depth.pythonlibraries.
  • Get unlimited free AI coding with smart fallback routing
  • Gratis
  • Ogni AI coder needs this local token saver
7.0
Sicurezza della piattaforma
9.0
Senza rischi e rimborsati
8.0
Tutti i servizi
7.0
Servizio clienti
7.8 Valutazione

Lascia un Commento

L'indirizzo email non verrà pubblicato. I campi obbligatori sono contrassegnati con *

Questo sito utilizza Akismet per ridurre lo spam. Scopri come vengono elaborati i dati dei tuoi commenti.

9Router
7.8/10
© Copyright 2023 - 2026 | Diventa un AI Pro | Fatto con ♥