The Claude Code Router Pattern

I had 78 custom slash commands in my Claude Code vault. Every prompt I sent was paying for all 78, even when I only used one. Here's what was happening and how I cut the catalog to 40 skills without losing a single workflow.

The hidden cost

Every custom skill in Claude Code ships with a name and a description field. Both get injected into the system prompt on every turn, regardless of whether you ever invoke that skill in the session. That's how the model knows the skill exists and what it does, but it's also how a quiet folder full of helpers turns into a tax you didn't realize you were paying.

The math is simple and unflattering:

Average description length: ~40 tokens.
78 skills times 40 tokens = ~3,100 tokens per turn.
A 100-turn session: ~310,000 tokens spent on skill metadata before Claude does any actual work.

Per turn

~3,100 tokens

Per 100-turn session

~310,000 tokens

The catalog tax you never see

You don't see this in your editor. The skills sit in claude/skills/<name>/SKILL.md. Adding one costs nothing visible. Adding ten doesn't feel different. By the time you notice the catalog is dominating the system prompt, you have 78 of them and no obvious place to cut.

The fix in one sentence

Replace clusters of near-identical skills with one router skill that dispatches to sub-files.

The router's description is the only thing in the system prompt. The sub-files, the actual procedures, are read on invocation, not at session start. One router replaces N near-clones, and the ambient token cost drops to that of a single skill.

Worked example: `/p`, project briefing

Before consolidation I had 12 skills, one per project: /p-cto, /p-rankrush, and ten others. Every one of them ran the same 6-step briefing algorithm: load task cache, find today's focus, pull recent decisions from the project index, identify the last session's edits, optionally compute billable hours, render the briefing. The only thing that differed was the project slug and a handful of file path patterns.

So I collapsed them: one dispatcher (claude/skills/p/SKILL.md, a table of 12 alias blocks) plus one canonical, parameterized algorithm (claude/skills/_p-template/SKILL.md).

claude/skills/

p/ router, 1 slot in the catalog
SKILL.md dispatch table, loaded every turn
_p-template/ underscore-prefixed, never loads ambiently
SKILL.md the shared 6-step algorithm, read on invocation
eo/ another router, 6 period workflows
lifecycle/ another router, 4 workflows
... 40 skills total, down from 78

One router plus an underscore-prefixed template replaces 12 near-identical skills. The dispatcher is the only thing the model sees every turn.

Invocation: /p cto looks up cto in the dispatcher, reads _p-template, substitutes the cto parameters, runs the algorithm, and renders the briefing. The model never sees _p-template in the system prompt: its filename starts with _, and the skill list excludes underscore-prefixed entries. The dispatcher's description is all that loads ambiently, about 40 tokens, regardless of how many aliases exist.

Before: 12 skills, every turn 480 tokens/turn

92% less, every turn

After: 1 router 40 tokens/turn

Same 12 workflows. 480 tokens per turn down to 40, a 92 percent cut in ambient cost.

The four consolidations I shipped

I applied the same pattern across the catalog. Here's the scorecard:

Router	Replaced	Skills removed
/p <alias>	12 per-project briefing skills	11
/eo <s\|d\|w\|m\|q\|y>	6 end-of-period skills	5
/so <d\|w\|m\|q\|y>	5 start-of-period skills	4
/lifecycle <close\|reopen>	4 lifecycle skills	3
/update <machine\|skills\|tools-index>	3 refresh skills	2

78 to 40 skills. Zero workflows lost.

Net: 25 fewer top-level skills. Zero workflows lost. Every original slash command still works, it just dispatches through a router now.

The /eo consolidation is a good second example: end-of-session, end-of-day, end-of-week, end-of-month, end-of-quarter, end-of-year. Six distinct procedures that share the "wrap up the period, write a retrospective, update caches" skeleton but differ in exactly how. Six sub-files under claude/skills/eo/, one dispatcher table at the top of eo/SKILL.md. Same pattern, different domain.

The 40-skill cap

The cap isn't arbitrary. It's the line where the catalog stops dominating the system prompt. I codified it in a vault rules file and wrote a /skill-audit skill that runs monthly. Every new skill proposal runs through one gate: does it replace an existing skill, belong to an existing router, or scope to a single project? If any of those is true, it should route, consolidate, or scope, not take a fresh top-level slot. Only when nothing routes it away, and it is used often, and the catalog is under the cap, does it earn its own skill. Otherwise the workflow stays a manual checklist in the relevant project. The catalog stays lean because /skill-audit blocks the regrowth, the same way a linter blocks the import sprawl you'd otherwise accumulate.

When NOT to consolidate

Two anti-patterns I had to learn the hard way:

Distinct workflows that share early steps but diverge. /tdd and /debug both start with "read the code first." But the dispatch logic afterwards is genuinely different: TDD is RED-GREEN-REFACTOR; debug is reproduce-pattern-hypothesize-fix. One router with a 200-line if tree would be worse than two skills. Keep them separate.

One-shot rituals you invoke once a year. /eo y doesn't need to merge with /eo m even though both are end-of-period retrospectives. The procedures differ by more than the alias: annual review pulls quarterly notes, monthly review pulls weekly notes. The router pattern is for near-clones, not for anything that shares a vague theme.

How to apply this to your setup

1. Find

Spot the clones

skills whose descriptions share their first 5 words

2. Route

Pick one router

near-identical siblings, one shared skeleton

3. Collapse

Dispatcher, template, sub-files

preserve each procedure verbatim

Three steps to fold a clone cluster into one router.

Grep your claude/skills/ folder for description fields starting with the same 5 words. That's the cleanest signal of clone candidates. If 12 skill descriptions all start with "Generate a briefing for", those 12 want to be a router.
Look for adjacent skill names with the same prefix or suffix. /p-foo, /p-bar, /p-baz is the obvious signal. So is /close-client, /close-project, /reopen-client, /reopen-project, that's the /lifecycle shape.
One dispatcher plus one template plus N sub-files. Preserve the original procedures verbatim inside the sub-files; only the entry point changes. Don't rewrite logic while you consolidate, that's two refactors in one PR and a recipe for regressions.

Not sure it is worth it yet? Run this gut check:

Is your catalog carrying clones?

Three or more skills share a name prefix Several descriptions open with the same words Your catalog is past 40 skills You add skills faster than you retire them

Catalog is lean

Time to consolidate

0 of 4 ticked. No clone clusters yet. Keep the cap and re-check monthly.

Result

Catalog at 40. System prompt ~1k tokens lighter per turn. New skills default to routers, not top-level entries. The audit runs monthly and surfaces drift before it compounds. If your Claude Code setup feels heavier than it should, this is usually where the weight is hiding.

If you found this useful, you might also want an effective AI strategy for your team and the case for letting your developers work without pull requests, same lens, different parts of the engineering stack.

I help engineering teams set up Claude Code so it actually compounds: production-grade CLAUDE.md, memory rules, a lean skill catalog that stays lean. If you have 50+ custom skills and want to know which to consolidate first, here's where to start.

The hidden cost

The fix in one sentence

Worked example: /p, project briefing

The four consolidations I shipped

The 40-skill cap

When NOT to consolidate

How to apply this to your setup

Result

Stay in the loop

Worked example: `/p`, project briefing