Tokscale

CLI token usage tracker for AI coding agents

Tokscale is a CLI tool that tracks token usage and costs across AI coding agents including Claude Code, Codex, OpenCode, Gemini CLI, Cursor, and more. Built with a native Rust core for high-performance processing, it provides detailed breakdowns of input, output, cache, and reasoning tokens with real-time pricing calculations via LiteLLM data. Features include interactive 2D/3D contribution graphs, web visualization dashboards, global leaderboards, and JSON export for cost analysis.

Tokscale brings visibility to the often-opaque world of AI coding agent costs. As developers adopt tools like Claude Code, Codex, Gemini CLI, and Cursor, token consumption can escalate quickly without clear insight into where the spend is going. Tokscale reads JSONL conversation logs from these tools and breaks down usage into input, output, cache read, cache write, and reasoning tokens with precise cost calculations using LiteLLM pricing data.
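As a rough illustration of the accounting described above, the following Python sketch sums token counts from JSONL records and prices them per token type. The record field names (`model`, `input_tokens`, `cache_read_tokens`, and so on), the `PRICES_PER_MILLION` table, and its numbers are assumptions made for illustration; they are not Tokscale's actual log schema or real LiteLLM pricing data.

```python
import json
from collections import defaultdict

# Hypothetical prices in USD per one million tokens, keyed by model name.
# Real tools would load these from a pricing source instead of hard-coding them.
PRICES_PER_MILLION = {
    "example-model": {
        "input": 3.00,
        "output": 15.00,
        "cache_read": 0.30,
        "cache_write": 3.75,
        "reasoning": 15.00,
    },
}

TOKEN_TYPES = ("input", "output", "cache_read", "cache_write", "reasoning")


def summarize_usage(jsonl_text):
    """Return per-model token totals and estimated cost from JSONL text."""
    totals = defaultdict(lambda: {t: 0 for t in TOKEN_TYPES})
    for line in jsonl_text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        record = json.loads(line)
        model = record.get("model", "unknown")
        for token_type in TOKEN_TYPES:
            # Assumed field naming: "<type>_tokens"; missing fields count as 0.
            totals[model][token_type] += record.get(f"{token_type}_tokens", 0)

    summary = {}
    for model, counts in totals.items():
        prices = PRICES_PER_MILLION.get(model, {})
        cost = sum(counts[t] * prices.get(t, 0.0) / 1_000_000 for t in TOKEN_TYPES)
        summary[model] = {"tokens": dict(counts), "cost_usd": round(cost, 6)}
    return summary


# Two made-up log records, one JSON object per line.
SAMPLE_LOG = "\n".join([
    json.dumps({"model": "example-model", "input_tokens": 1000, "output_tokens": 500}),
    json.dumps({"model": "example-model", "input_tokens": 2000, "cache_read_tokens": 10000}),
])

if __name__ == "__main__":
    print(json.dumps(summarize_usage(SAMPLE_LOG), indent=2))
```

Keeping each token type as a separate counter matters because the types are typically priced differently, so a single combined total would not be enough to estimate cost.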

The tool is built with a native Rust core that delivers processing speeds roughly ten times faster than pure JavaScript alternatives, making it practical even for developers with months of accumulated conversation logs. A web-based visualization layer renders interactive contribution graphs in both 2D and 3D, along with filterable dashboards that help identify the most expensive sessions, models, and time periods. JSON export supports integration with spreadsheets or BI tools for deeper analysis.
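To show how an exported JSON file could feed a spreadsheet, the sketch below flattens a list of usage records into CSV text. The export shape (a JSON array of objects with `date`, `model`, `tokens`, and `cost_usd` keys) is a hypothetical schema chosen for illustration; Tokscale's real export format may differ.

```python
import csv
import io
import json

# Hypothetical export: a JSON array of per-day, per-model usage records.
SAMPLE_EXPORT = json.dumps([
    {"date": "2025-01-01", "model": "example-model", "tokens": 1500, "cost_usd": 0.0165},
    {"date": "2025-01-02", "model": "example-model", "tokens": 12000, "cost_usd": 0.009},
])

# Columns to keep, in spreadsheet order; other keys in a record are ignored.
FIELDS = ["date", "model", "tokens", "cost_usd"]


def export_to_csv(export_json):
    """Convert the assumed JSON export into CSV text with a header row."""
    records = json.loads(export_json)
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDS, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


if __name__ == "__main__":
    print(export_to_csv(SAMPLE_EXPORT), end="")
```

The resulting text can be saved with a `.csv` extension and opened in a spreadsheet or loaded by a BI tool.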

Beyond individual tracking, Tokscale includes community features like a global leaderboard where developers can compare token usage and a profile system with contribution statistics. The tool supports flexible time-period filtering covering all-time, monthly, and weekly views. For teams and individuals looking to optimize their AI tooling budgets, Tokscale provides the data foundation needed to make informed decisions about which models and agents deliver the best value for their workflows.

Pricing

Free and open source

Platforms

CLI tool (Rust/Node.js), cross-platform

Alternatives

LiteLLM

Unified API proxy for 100+ LLMs

Drop-in OpenAI-compatible proxy supporting 100+ LLM providers with load balancing, spend tracking, rate limiting, and fallback routing. Acts as a unified gateway for all your AI model calls, letting teams switch between providers, enforce budgets, and add reliability layers without changing application code. Essential infrastructure for multi-model AI architectures.

Open Source
Portkey

AI gateway with observability, routing, and guardrails

Portkey is an AI gateway and observability platform providing a unified API for 200+ LLM providers with intelligent routing, caching, rate limiting, and guardrails. Route requests across OpenAI, Anthropic, Google, and more with automatic failover, load balancing, and cost optimization. Features request logging, prompt management, evaluation tools, and real-time monitoring. The open-source gateway can be self-hosted; Portkey Cloud adds managed observability and team features.

freemium · Open Source
Helicone

Open-source LLM observability through a single-line proxy

Helicone is an open-source LLM observability platform that monitors AI applications through a single-line proxy integration. Change your API base URL to route requests through Helicone and instantly get logging, cost tracking, latency monitoring, caching, rate limiting, and user analytics. Supports OpenAI, Anthropic, Google, and 300+ model providers. Has processed over 2 billion LLM interactions. Features prompt experimentation, evaluation tools, and a gateway for request management.

freemium · Open Source

Related Tools

Grok Build

Top Pick

xAI's terminal coding agent with parallel subagents and worktree-aware automation

Grok Build is xAI's terminal-first coding agent for planning, editing, testing, and reviewing code from a local CLI. The early beta exposes subagent controls, worktree mode, headless JSON output, best-of-N parallel attempts, sandbox profiles, and experimental memory. It fits developers comparing Claude Code, Codex, and Gemini CLI for local agentic workflows with deeper parallel execution.

paid
Warp

Top Pick

The modern terminal with AI

GPU-accelerated terminal built in Rust, now evolved into an Agentic Development Environment (ADE) used by 700K+ developers. Features block-based output navigation, AI command suggestions via the Oz orchestration engine, multi-line editing with syntax highlighting, and a built-in code editor with LSP support. Available on macOS, Linux, and Windows. Includes Warp Drive for sharing workflows, real-time session collaboration, and BYOK support for OpenAI, Anthropic, and Google API keys.

freemium · Telemetry
Ghostty

Top Pick

Fast, native terminal emulator

GPU-accelerated terminal emulator written in Zig by Mitchell Hashimoto (HashiCorp co-founder). Native UI rendering on macOS and Linux. Supports ligatures, true color, Kitty graphics protocol, and splits/tabs. Configurable via a simple key-value file with sensible defaults. Open-source with 20K+ GitHub stars and a focus on correctness, speed, and minimal resource usage. Growing as a modern alternative to iTerm2, Alacritty, and WezTerm.

Open Source
Claude Code

Top Pick

Anthropic's agentic coding CLI

Anthropic's agentic CLI coding tool that delegates complex tasks to Claude directly from the terminal. Understands entire codebases via automatic context gathering, edits multiple files, runs shell commands, and manages Git workflows autonomously. Supports CLAUDE.md for persistent project instructions, integrates with VS Code and JetBrains, and uses Claude Opus/Sonnet with extended thinking for complex architectural decisions. Built for terminal-first developers.

paid · Open Source
Hermes Agent

Top Pick

Open-source AI agent framework with persistent memory, reusable skills, tools, and messaging gateways

Hermes Agent is an open-source AI agent framework with persistent memory, reusable skills, 40+ tools, cron jobs, and messaging gateways.

Open Source
Pi

Top Pick

Minimal terminal coding harness

Pi is an open-source minimalist coding agent built by Mario Zechner that runs in the terminal with an intentionally lean design: just four core tools (read, write, edit, bash) and a ~300-word system prompt. It treats the context window as scarce, omits sub-agents and MCP by default, and pushes extensibility to TypeScript extensions, skills, and prompt templates that developers build for their own needs. A radical counterpoint to bloated AI coding tools.

Open Source