Token Savior replaces raw file reads with structural queries. Measured across 60 real coding tasks against baseline Claude Code tools.
Token Savior parses your codebase into a structural graph of functions, classes, and dependencies. One call to switch_project.
The agent queries specific symbols instead of reading entire files. get_function_source, get_dependents, get_call_chain.
84% fewer characters but better-structured information. The signal-to-noise ratio improves, and accuracy jumps from 56% to 96%.
Click a row to expand details.
| Category | N | Score A | Score B | Delta |
|---|
get_call_chain queries are slower per-call than Grep heuristics. Score still climbs from 5/8 to 8/8, so the trade-off is correctness over latency.switch_project + find_symbol ~6s vs one Grep at ~3s. Score still improves (1.17 → 1.83) thanks to fewer false negatives.
tsbench is open and reproducible. python generate.py --seed 42 gives the same 60-task project every time.
Run it on your agent:
If you run the benchmark on another agent and want to submit results, open a PR or issue. We'll add your results to the leaderboard.
Structural code navigation for Claude Code.
Less context, better answers.