aicoolies logo

llmfit

Find which AI models actually run on your hardware in one command

Share
open-sourceOpen Source
Visit Website →

llmfit is a Rust-based terminal tool that matches over 200 LLM models from 30+ providers against your exact hardware specs. The interactive TUI scores each model on fit, speed, VRAM usage, and context length, helping you avoid downloading models that won't run on your machine. It supports Ollama, llama.cpp, MLX, Docker Model Runner, and LM Studio backends.

llmfit solves one of the most frustrating steps in local LLM workflows: figuring out which model will actually run on your hardware before wasting time downloading it. The interactive TUI presents a scored list of compatible models ranked by how well they fit your available VRAM, RAM, and compute capabilities. Each model shows detailed metrics including expected tokens per second, memory requirements, and maximum context length at different quantization levels.

Built in Rust for speed and reliability, llmfit detects your GPU type, VRAM capacity, system RAM, and CPU capabilities automatically. It then cross-references this hardware profile against its database of over 200 models across providers including Ollama, llama.cpp, MLX, Docker Model Runner, LM Studio, and more. The tool has been adopted as a core dependency by Hugging Face's hf-agents extension, which uses llmfit to automatically select the best model for coding agent workflows.

With over 20,000 GitHub stars and MIT license, llmfit has become the standard hardware-model matching tool in the local LLM ecosystem. It pairs naturally with llmserve, a sister project that handles serving the selected model. The tool fills a gap that no other listed tool addresses: the pre-download decision of whether a model will actually perform well on your specific hardware configuration.

Pricing

Free and open-source (MIT license)

Platforms

Rust binary; macOS, Linux, Windows; detects GPU/CPU automatically

Categories

Tags

Use Cases

Alternatives

Related Tools

Ghostty logo

Ghostty

Top Pick

Fast, native terminal emulator

GPU-accelerated terminal emulator written in Zig by Mitchell Hashimoto (HashiCorp co-founder). Native UI rendering on macOS and Linux. Supports ligatures, true color, Kitty graphics protocol, and splits/tabs. Configurable via a simple key-value file with sensible defaults. Open-source with 20K+ GitHub stars and a focus on correctness, speed, and minimal resource usage. Growing as a modern alternative to iTerm2, Alacritty, and WezTerm.

open-sourceOpen Source
Claude Code logo

Claude Code

Top Pick

Anthropic's agentic coding CLI

Anthropic's agentic CLI coding tool that delegates complex tasks to Claude directly from the terminal. Understands entire codebases via automatic context gathering, edits multiple files, runs shell commands, and manages Git workflows autonomously. Supports CLAUDE.md for persistent project instructions, integrates with VS Code and JetBrains, and uses Claude Opus/Sonnet with extended thinking for complex architectural decisions. Built for terminal-first developers.

paidOpen Source
Hermes Agent logo

Hermes Agent

Top Pick

Open-source AI agent framework with persistent memory, reusable skills, tools, and messaging gateways

Hermes Agent is an open-source AI agent framework with persistent memory, reusable skills, 40+ tools, cron jobs, and messaging gateways.

open-sourceOpen Source
pi dev code

Pi

Top Pick

Minimal terminal coding harness

Pi Coding Agent is an MIT-licensed Node.js CLI from earendil-works for building and running coding agents in a local terminal. The current package describes a read/bash/edit/write toolset and session management, while the repo positions Pi as a unified LLM API, agent loop, TUI, and coding-agent CLI. It is best framed as a lean, self-extensible BYO-model toolkit rather than a managed IDE.

open-sourceOpen Source
OpenCode logo

OpenCode

Top Pick

Open-source AI coding agent for the terminal

Open-source terminal-based AI coding agent built in Go by the SST team, with a rich TUI (Bubble Tea) supporting 75+ model providers including OpenAI, Anthropic, Gemini, Bedrock, Groq, and OpenRouter. Features vim-like editing, persistent SQLite sessions, and LSP integration for 40+ languages. Fully free with no vendor lock-in, it has rapidly grown to 95k+ GitHub stars.

open-source
Codex logo

Codex

Top Pick

OpenAI coding agent for app, editor, terminal, and cloud work

Codex is OpenAI's coding agent for software development across the Codex app, editor, terminal, and cloud tasks. It helps write, review, debug, refactor, and automate code, with ChatGPT plan access for managed surfaces and API-key usage for CLI, SDK, and IDE workflows. The open-source CLI and SDK support local repository work, while cloud features add GitHub review, Slack/Linear integrations, worktrees, skills, MCP, and automations.

freemiumOpen Source

Comparisons