The Intelligence Layer
for AI Coding Agents

One proxy for Claude Code, Goose, or any AI agent that speaks the Anthropic or OpenAI API format. Cut costs by up to 90% without changing how you work.

Any agent. Any API format. One binary. Real savings.

Works with every AI coding agent

Claude Code · Goose · Any Anthropic API client · Any OpenAI API client

v1.0: Major Release

Now Works With Every AI Coding Agent

Claude Code, Goose, Aider, or anything that talks to an LLM API. One proxy gives every agent compression, caching, memory, smart routing, and cost tracking — automatically.

💻 Claude Code: Just run aismush-start
🦪 Goose: One env var, done
🔧 Any Agent: Anthropic or OpenAI API

Install AISmush, start it, point your agent at localhost:1849. That's it.
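For an agent that is launched from a script, pointing it at the proxy might look like the sketch below. It assumes the agent is built on an SDK that honors the ANTHROPIC_BASE_URL or OPENAI_BASE_URL environment variables; whether a given agent reads them, and whether it expects a path suffix such as /v1, is an assumption to check against that agent's own documentation. The launch command in the comment is hypothetical.

```python
import os

# Address taken from the text above; the scheme (http) is an assumption.
PROXY_URL = "http://localhost:1849"


def proxy_env(base_url: str = PROXY_URL) -> dict[str, str]:
    """Return a copy of the current environment with API base URLs
    redirected to the local proxy."""
    env = dict(os.environ)
    env["ANTHROPIC_BASE_URL"] = base_url  # read by Anthropic-format clients
    env["OPENAI_BASE_URL"] = base_url     # read by OpenAI-format clients
    return env


# Hypothetical usage:
#   import subprocess
#   subprocess.run(["my-agent"], env=proxy_env())
```

Building a copy of the environment, instead of mutating os.environ, keeps the redirect scoped to the one child process.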

Every AI coding agent has the same cost problem.

Whether you use Claude Code, Goose, or any other agent, a heavy coding session burns $20-50 in API costs. Most tokens go to mechanical work — reading files, processing tool results, making simple edits — that doesn't need frontier-model pricing.

AISmush sits between your agent and the API. It doesn't care which agent you're using. It just makes every session cheaper.

$30+  Typical session, any AI agent
$3    Same session, with AISmush

Everything You Need to Code Faster, Smarter, Cheaper

Game Changer

AI-Generated Project Agents

One command scans your codebase, sends it to AI for deep analysis, and generates agents customized to YOUR project — your patterns, your frameworks, your architecture.

Not generic templates. Agents that know your specific file structure, your naming conventions, your test framework, your build commands.

  • Scans your codebase in seconds
  • 5-7 AI calls for deep analysis (~$0.03)
  • Generates agents, skills, and CLAUDE.md
  • Each agent assigned the cheapest model that can do the job
  • Resumes where it left off if interrupted
  • Incremental re-scans — only re-analyzes changed files (~$0.003)
$ aismush --scan


# Analyzing your codebase...
Detected: Rust + TypeScript + React
Type: fullstack web app (complex)

# Generating project-specific agents:
├─ rust-expert (sonnet) ✓
├─ frontend-engineer (sonnet) ✓
├─ test-runner (haiku) ✓
├─ debugger (sonnet) ✓
└─ explorer (haiku) ✓

Created 5 agents, 8 skills, CLAUDE.md
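
The page does not say how incremental re-scans decide which files changed. One common approach, shown here purely as an illustration and not as AISmush's actual mechanism, is to keep a manifest of content hashes from the previous scan and re-analyze only the files whose hash differs. The function names and the manifest shape are hypothetical.

```python
import hashlib


def file_digest(data: bytes) -> str:
    """Content hash used to detect changes between scans."""
    return hashlib.sha256(data).hexdigest()


def changed_files(current: dict[str, bytes], manifest: dict[str, str]) -> list[str]:
    """Paths that are new or whose content differs from the stored manifest.

    `current` maps path -> file bytes; `manifest` maps path -> digest
    recorded by the previous scan (hypothetical format).
    """
    return sorted(
        path
        for path, data in current.items()
        if manifest.get(path) != file_digest(data)
    )
```

After a scan completes, the manifest would be rewritten from the current digests so the next run starts from a fresh baseline.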
Included Free

21 Battle-Tested Engineering Skills — Auto-Installed

aismush --scan now auto-installs 21 production-grade engineering workflow skills from Addy Osmani's agent-skills library alongside your project-specific agents.

Osmani is a well-known engineering leader at Google. These skills encode the entire professional development lifecycle — from first idea to production ship — as reusable, agent-ready workflows.

  • Covers the full cycle: define → plan → build → verify → review → ship
  • TDD, code review, security audits, performance profiling, CI/CD setup
  • Debugging, documentation, refactoring, incident response, and more
  • MIT licensed — auto-cached locally, auto-updates on rescan
  • Works with Claude Code, Goose, or any compatible agent
  • Combined with your project agents = a complete engineering team in one command
$ aismush --scan


# Project agents generated (as before)...

# Installing engineering workflow skills:
addyosmani/agent-skills (MIT)

├─ tdd · code-review · security-audit
├─ debug-issue · performance · refactor
├─ plan-feature · write-docs · ci-cd
├─ incident-response · api-design
└─ + 10 more covering the full dev lifecycle

21 skills installed. Cached locally.

Done. Project agents + engineering skills ready.
Core Feature

Smart Model Routing + Blast-Radius Analysis

AISmush automatically detects what kind of work each turn requires and routes it to the cheapest model that can handle it.

Planning and architecture? Claude ($15/M). Reading files and making edits? DeepSeek ($0.27/M). That is roughly a 55x per-token price difference, applied to the mechanical turns that make up most of a session.

  • Zero latency overhead — pure heuristic routing
  • Claude for reasoning, DeepSeek for execution
  • Blast-radius aware — parses imports to know which files are critical
  • Editing a shared type? Claude. Editing a leaf file? DeepSeek.
  • Automatic failover between providers
  • Error recovery detection (3+ errors → Claude)
# Works the same for any agent:

Claude Code or Goose or any client
       ↓
AISmush proxy
       ↓

# Routing decisions per turn:
"Plan the auth system" → Claude ($0.45)
Tool result: Read file → DeepSeek ($0.001)
Tool result: Edit file → DeepSeek ($0.001)
Tool result: Run tests → DeepSeek ($0.001)
"Debug this error" → Claude ($0.12)

Session: $0.58 instead of $12.40
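
The routing rules listed above can be sketched as a small heuristic. This is a minimal illustration, not AISmush's implementation: the turn kinds, the import-map format, and the CRITICAL_DEPENDENTS threshold are all assumptions. Only the "3+ errors → Claude" rule and the idea of using imports to judge blast radius come from the text.

```python
from collections import defaultdict

# Hypothetical threshold: a file with this many dependents counts as "shared".
CRITICAL_DEPENDENTS = 3


def count_dependents(imports: dict[str, list[str]], target: str) -> int:
    """Number of files that import `target`, directly or transitively.

    `imports` maps each file to the files it imports.
    """
    reverse = defaultdict(set)
    for file, deps in imports.items():
        for dep in deps:
            reverse[dep].add(file)
    seen, stack = set(), [target]
    while stack:
        for parent in reverse[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    seen.discard(target)  # ignore import cycles that lead back to the target
    return len(seen)


def route(kind: str, recent_errors: int = 0, dependents: int = 0) -> str:
    """Pick a provider for one turn using cheap, local checks only."""
    if kind in {"plan", "debug"}:
        return "claude"      # reasoning-heavy turns
    if recent_errors >= 3:
        return "claude"      # error recovery: 3+ errors escalate
    if kind == "edit" and dependents >= CRITICAL_DEPENDENTS:
        return "claude"      # wide blast radius
    return "deepseek"        # mechanical work
```

Because every check is a dictionary lookup or a comparison, a router of this shape adds no network round trip before the request is forwarded.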
Biggest Token Saver

Structural Summarization — 3-5x Fewer Tokens

The biggest single improvement to token usage. Older tool results in your conversation get replaced with compact structural summaries — just function signatures, type definitions, and imports.

Your last 4 messages stay fully intact. Only older code results get summarized. JSON, YAML, and error results are never touched.

  • 200-line file becomes ~30 lines (3-5x fewer tokens)
  • Saves thousands of tokens per request in long sessions
  • Content-type aware — only summarizes code, never data
  • Supports Rust, TypeScript, Python, Go, and more
  • Combined with standard compression: 60-80% total reduction
# Old message tool_result (6,000 tokens):
use std::collections::HashMap;
use crate::db::Db;
// ... 180 lines of implementation ...
// comments, function bodies, tests ...

# After structural summary (1,200 tokens):
[Structural summary (200 lines -> 28 lines)]
use std::collections::HashMap;
use crate::db::Db;
pub struct ProxyState { ... }
impl ProxyState { ... }
pub async fn handle() -> Response { ... }
fn compress_text() -> String { ... }

5x fewer tokens (6,000 → 1,200). API surface preserved.
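
To make the idea concrete, here is a minimal sketch of structural summarization for Python source, using the standard-library ast module. It is an illustration of the technique rather than AISmush's code: the message shape (a dict with "kind" and "text" keys) is hypothetical, and only the "keep the last 4 messages, summarize older code results, never touch data" policy is taken from the text.

```python
import ast


def _signature(fn) -> str:
    """Render a function header with its body replaced by '...'."""
    prefix = "async def" if isinstance(fn, ast.AsyncFunctionDef) else "def"
    returns = f" -> {ast.unparse(fn.returns)}" if fn.returns else ""
    return f"{prefix} {fn.name}({ast.unparse(fn.args)}){returns}: ..."


def structural_summary(source: str) -> str:
    """Keep imports, class names, and function signatures; drop bodies."""
    funcs = (ast.FunctionDef, ast.AsyncFunctionDef)
    out = []
    for node in ast.parse(source).body:
        if isinstance(node, (ast.Import, ast.ImportFrom)):
            out.append(ast.unparse(node))
        elif isinstance(node, funcs):
            out.append(_signature(node))
        elif isinstance(node, ast.ClassDef):
            out.append(f"class {node.name}: ...")
            out.extend("    " + _signature(m) for m in node.body if isinstance(m, funcs))
    return "\n".join(out)


def compact_history(messages: list[dict], keep_last: int = 4) -> list[dict]:
    """Summarize older code results; leave recent and non-code messages alone."""
    cutoff = max(len(messages) - keep_last, 0)
    result = []
    for i, msg in enumerate(messages):
        if i < cutoff and msg.get("kind") == "code":
            try:
                msg = {**msg, "text": structural_summary(msg["text"])}
            except SyntaxError:
                pass  # not parseable as Python: keep the original text
        result.append(msg)
    return result
```

A production version would need one parser per supported language; the Python-only parser here just keeps the example self-contained.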
Nobody Has Solved This

Deep Memory — Full Conversation Capture

Every developer's frustration: "I already told you this yesterday."

Other tools remember tool names. AISmush captures entire conversations: your questions, the AI's answers, the reasoning, the decisions. Searchable by keyword out of the box, and by meaning when embeddings are enabled.

  • Full conversation capture (not just tool names)
  • Keyword search always on (FTS5, zero overhead)
  • Semantic search opt-in (--embeddings) — finds "JWT validation" when you search "auth bug"
  • Search via dashboard, CLI, or API
  • Auto-injects relevant past context into new sessions
  • Lightweight by default — no 90MB model unless you want it
$ aismush --search "auth token bug"

# Found 3 relevant conversations:

1. [3 days ago] score: 0.92
  You: "Fix the JWT token expiry bug"
  AI: "The issue is in validate_token().
      The expiry check uses > instead of >=..."
  Tools: Read(jwt.rs), Edit(jwt.rs), Bash(cargo test)

2. [5 days ago] score: 0.87
  You: "Add refresh token support"
  AI: "I'll create a refresh endpoint that..."
  - Added WebSocket broadcast server
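
The always-on keyword search is described as FTS5, SQLite's full-text index. A minimal sketch of that kind of lookup is below. The table name, columns, and helper functions are hypothetical and say nothing about AISmush's real schema; the example also assumes the Python build ships SQLite with FTS5 enabled, which is common but not guaranteed.

```python
import sqlite3


def open_memory() -> sqlite3.Connection:
    """Create an in-memory store with a full-text index over turns."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE VIRTUAL TABLE turns USING fts5(speaker, body)")
    return conn


def remember(conn: sqlite3.Connection, speaker: str, body: str) -> None:
    """Record one conversation turn."""
    conn.execute("INSERT INTO turns (speaker, body) VALUES (?, ?)", (speaker, body))


def search(conn: sqlite3.Connection, query: str) -> list[tuple[str, str]]:
    """Return matching turns, best match first (FTS5's built-in rank)."""
    return conn.execute(
        "SELECT speaker, body FROM turns WHERE turns MATCH ? ORDER BY rank",
        (query,),
    ).fetchall()
```

Keyword matching requires every query term to appear, so a query such as "token bug" finds a turn containing both words but not one that only mentions "JWT validation". That gap is what the opt-in semantic search is meant to close.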
Reliability

Context Window Management

Claude handles 200K tokens. DeepSeek handles 64K. Long sessions blow past DeepSeek's limit, causing failures and lost work.

AISmush automatically manages the mismatch. Old tool results get trimmed, large contexts route to Claude, and your work is never blocked.

  • Under 55K: both providers work fine
  • 55-64K: trim old tool results for DeepSeek
  • Over 64K: auto-route to Claude (200K window)
  • Never breaks tool_use/tool_result pairing
# Context growing during long session:

Turn 1: 5K tokens → DeepSeek
Turn 10: 25K tokens → DeepSeek
Turn 20: 48K tokens → DeepSeek
Turn 25: 58K tokens → compress + DeepSeek
Turn 30: 72K tokens → auto-route to Claude

# Without AISmush: DeepSeek fails at 64K.
# With AISmush: seamless handoff.
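
The three bands above reduce to a threshold check. The sketch below treats "K" as 1,000 tokens and treats exactly 64K as still fitting DeepSeek; both are assumptions, as are the function name and return labels.

```python
TRIM_THRESHOLD = 55_000   # below this, both providers work untouched
DEEPSEEK_LIMIT = 64_000   # DeepSeek's context window, per the text above


def context_action(tokens: int) -> str:
    """Decide how to handle a request given its context size in tokens."""
    if tokens < TRIM_THRESHOLD:
        return "deepseek"
    if tokens <= DEEPSEEK_LIMIT:
        return "trim+deepseek"   # trim old tool results first
    return "claude"              # too large for DeepSeek: use the 200K window
```

Applied to the session above, turns 1 through 20 map to "deepseek", turn 25 (58K) to "trim+deepseek", and turn 30 (72K) to "claude".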
Transparency

Real-Time Cost Dashboard

See exactly what you're saving. Every request tracked: which provider, how many tokens, what it cost, what it would have cost at full frontier-model pricing.

  • Live dashboard at localhost:1849/dashboard
  • Per-request cost breakdown
  • Savings percentage with all-Claude comparison
  • Request history with full detail
  • Memory viewer
  • Stats persist across sessions in SQLite
Session Stats

Requests: 142
Claude turns: 12 (planning/debugging)
DeepSeek turns: 130 (execution)

Actual cost: $1.82
All-Claude cost: $18.40
Saved: $16.58 (90.1%)
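
The savings line follows from the two cost figures above it. As a quick check of the arithmetic (the function name is illustrative only):

```python
def savings(actual: float, baseline: float) -> tuple[float, float]:
    """Return (dollars saved, percent saved) against an all-Claude baseline."""
    saved = baseline - actual
    return round(saved, 2), round(100 * saved / baseline, 1)


# Figures from the Session Stats example: $1.82 actual vs $18.40 all-Claude.
print(savings(1.82, 18.40))  # (16.58, 90.1)
```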
Autonomous

Plan Orchestrator — Say "Go" and Walk Away

Ask your agent to make a plan, then say "run plan". AISmush analyzes every step, maps each one to the best specialized agent, works out which steps can run in parallel, and, once you confirm, executes the whole plan autonomously.

  • Reads plans agents already generate — no custom format needed
  • Maps steps to specialized agents (rust-expert, data-engineer, etc.)
  • Independent steps run in parallel for maximum speed
  • Context from completed steps feeds forward automatically
  • Verifies results with cargo check/test after completion
  • Always asks for confirmation before executing
You: make a plan to add auth

Agent writes plan with 5 steps...

You: go

PLAN: Add authentication (5 steps)
Wave 1: Step 1 → rust-expert
        Step 2 → data-engineer
Wave 2: Step 3 → backend-engineer
Wave 3: Step 4,5 → test-runner

Ready to execute? [Go / No]
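
Grouping steps into waves is a layered topological sort: each wave holds every step whose prerequisites have already finished. The sketch below shows the technique under an assumed dependency map that reproduces the example above. How AISmush actually infers dependencies from a plan is not described here, so the input format is hypothetical.

```python
def plan_waves(deps: dict[int, set[int]]) -> list[list[int]]:
    """Group steps into waves; steps in the same wave can run in parallel.

    `deps` maps each step to the set of steps it must wait for.
    """
    remaining = {step: set(needs) for step, needs in deps.items()}
    waves = []
    while remaining:
        ready = sorted(step for step, needs in remaining.items() if not needs)
        if not ready:
            raise ValueError("plan has cyclic or unknown dependencies")
        waves.append(ready)
        for step in ready:
            del remaining[step]
        for needs in remaining.values():
            needs.difference_update(ready)
    return waves


# Assumed dependencies: steps 1 and 2 are independent, step 3 needs both,
# and steps 4 and 5 need step 3.
auth_plan = {1: set(), 2: set(), 3: {1, 2}, 4: {3}, 5: {3}}
print(plan_waves(auth_plan))  # [[1, 2], [3], [4, 5]]
```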

Three Steps. That's It.

1. Install

One command. ~15MB RAM. Won't slow your machine.

2. Scan

aismush --scan generates project agents + 21 engineering skills.

3. Code

Point any agent at localhost:1849 and start saving 90%.

Works With Any Agent, Three Ways

Same proxy, same savings — choose the setup that fits your workflow.

Claude Code (Default)

Routes between Claude + DeepSeek. Max savings (~90%). Needs a free DeepSeek API key.

aismush-start

Direct Mode (Any Agent, No DeepSeek)

No DeepSeek needed. Still get compression, memory, agents, and tracking. Dashboard shows potential savings.

aismush-start --direct

Goose / Other Agents

Auto-detects installed agents. If both Claude Code and Goose are present, it asks which to launch. All routing, compression, and memory features apply automatically.

aismush-start --goose

Install in 10 Seconds

One command on any platform. Works with or without a DeepSeek key.

Linux & macOS:

curl -fsSL https://raw.githubusercontent.com/Skunk-Tech/aismush/main/install.sh | bash

Windows — Scoop (recommended):

scoop bucket add aismush https://github.com/Skunk-Tech/aismush && scoop install aismush

Windows — winget:

winget install SkunkTech.AISmush

Windows — PowerShell one-liner:

irm https://raw.githubusercontent.com/Skunk-Tech/aismush/main/install.ps1 | iex