AISmush is a drop-in proxy that makes Claude Code 90% cheaper — without changing how you work.
Real users. Real savings. $0.03 sessions that used to cost $30.
A heavy coding session burns through $20-50 in API costs. Most of those tokens are spent on mechanical tasks — reading files, processing tool results, making simple edits — that don't need Claude's $15/M token brain.
One command scans your codebase, sends it to AI for deep analysis, and generates Claude Code agents customized to YOUR project — your patterns, your frameworks, your architecture.
Not generic templates. Agents that know your specific file structure, your naming conventions, your test framework, your build commands.
AISmush automatically detects what kind of work each turn requires and routes it to the cheapest model that can handle it.
Planning and architecture? Claude ($15/M). Reading files and making edits? DeepSeek ($0.27/M). That's a 55x cost difference on the turns that matter most.
Every tool result passes through our compression engine before reaching the AI. We strip what the AI doesn't need while keeping what it does.
Content-type aware — we know the difference between code (strip comments), JSON (never touch), and logs (deduplicate aggressively). Inspired by RTK's approach.
Every developer's frustration: "I already told you this yesterday."
AISmush automatically captures what files you worked on, what tools you used, what decisions were made. Next session, it injects that context — your AI remembers.
Claude handles 200K tokens. DeepSeek handles 64K. Long sessions blow past DeepSeek's limit, causing failures and lost work.
AISmush automatically manages the mismatch. Old tool results get trimmed, large contexts route to Claude, and your work is never blocked.
See exactly what you're saving. Every request tracked: which provider, how many tokens, what it cost, what it would have cost on Claude alone.
One command. 3.6MB binary. No dependencies.
aismush --scan generates agents for your project.
aismush-start launches Claude Code. You save 90%.
Routes between Claude + DeepSeek. Max savings (~90%). Needs a free DeepSeek API key.
aismush-start
No DeepSeek needed. Still get compression, memory, agents, and tracking. Dashboard shows potential savings.
aismush-start --direct
Linux & macOS. Works with or without a DeepSeek key.
Route to Ollama/llama.cpp for zero-cost execution on your own hardware.
Add Gemini, GPT-4, Mistral as routing targets. Best model for each task.
Shared savings dashboard for engineering teams. Track ROI across developers.