aicoolies logo
MediaPipe logo

MediaPipe

On-device ML solutions for mobile and edge AI

Share
open-sourceOpen Source
Visit Website →

MediaPipe is Google's open-source framework for building on-device machine learning pipelines across mobile, web, desktop, and edge platforms. It provides pre-built solutions for face detection, hand tracking, pose estimation, object detection, image classification, text classification, and on-device LLM inference. MediaPipe runs entirely locally without cloud dependencies, supporting Android, iOS, Python, and web browsers.

MediaPipe is Google's production-grade framework for deploying machine learning models directly on user devices without requiring network connectivity or cloud processing. Originally developed internally at Google for products like Google Lens, Nest cameras, and YouTube, it was open-sourced to give developers access to the same optimized ML pipeline infrastructure. The framework handles the entire lifecycle from model optimization and quantization to efficient inference on CPUs, GPUs, and specialized accelerators.

The framework ships with ready-to-use solutions covering the most common on-device ML tasks: face detection and face mesh with 468 landmarks, hand tracking with 21 key points, full-body pose estimation, object detection and tracking, image segmentation, text classification and embedding, and audio classification. Each solution includes pre-trained models optimized for real-time performance on mobile hardware. MediaPipe also supports custom model deployment through its Task API, which wraps TensorFlow Lite models with pre- and post-processing logic.

With over 34,000 GitHub stars, MediaPipe has become the standard toolkit for developers building privacy-preserving, low-latency AI features into applications. Recent updates added on-device LLM inference capabilities and support for Google's Gemma models running locally. The framework is available for Android, iOS, web browsers via WebAssembly, and Python desktop applications under the Apache-2.0 license, making it accessible across virtually every platform where on-device ML is relevant.

Pricing

Free and open source under Apache-2.0 license

Platforms

Android, iOS, web, Python — on-device inference

Categories

Tags

Use Cases

Alternatives

Related Tools

Hermes Agent logo

Hermes Agent

Top Pick

Open-source AI agent framework with persistent memory, reusable skills, tools, and messaging gateways

Hermes Agent is an open-source AI agent framework with persistent memory, reusable skills, 40+ tools, cron jobs, and messaging gateways.

open-sourceOpen Source
BeeAI Framework logo

BeeAI Framework

Python and TypeScript framework for production multi-agent systems

BeeAI Framework is an Apache-2.0 toolkit for building production-ready AI agents and multi-agent systems in Python and TypeScript. Its docs cover agents, tools, RAG, memory, workflows, backend providers, serving, and A2A/MCP integration surfaces, making it a vendor-neutral option for teams comparing LangGraph, CrewAI, Mastra, and related agent runtimes.

open-sourceOpen SourceTelemetry
Superserve logo

Superserve

Open-source Firecracker sandboxes for long-running AI agents

Superserve is an open-source sandbox infrastructure layer for AI agents that need durable computers instead of short-lived shells. It runs isolated Firecracker microVMs, supports pause, resume, snapshot, fork, preview URLs, MCP connectivity, SDK/API control, Docker workloads, and self-hosting, while the hosted service adds pay-as-you-go agent sandboxes for teams.

open-sourceOpen Source

Anthropic Agent Skills

Official Claude Agent Skills examples, spec, and plugin marketplace for reusable agent capabilities

Anthropic Agent Skills is Anthropic's official reference repo and Claude Code plugin marketplace for reusable Skill folders. It packages example SKILL.md workflows, document skills, a Claude API skill, templates, and the Agent Skills spec so teams can turn repeatable instructions, scripts, and resources into on-demand Claude capabilities instead of copying prompts across sessions.

freeTelemetry
agmsg logo

agmsg

Cross-agent messaging for CLI coding agents

agmsg is an MIT-licensed Bash and SQLite messaging layer for CLI coding agents. It lets Claude Code, Codex, Gemini CLI, GitHub Copilot CLI, Antigravity, OpenCode, Hermes, and other terminal agents exchange messages through a shared local database instead of relying on a human copy-paste relay. It is intentionally not MCP, not a broker, and not a subagent framework.

open-sourceOpen Source
eve vercel

eve by Vercel

Filesystem-first framework for durable AI agents

Eve is Vercel's filesystem-first TypeScript framework for building durable AI agents as ordinary project files. It combines Markdown instructions and skills, typed tools, channels, connections, subagents, schedules, sandboxes, and evals with Vercel's agent runtime so teams can ship deployable agents without hand-rolling orchestration. The current beta fits Vercel-native backend agent projects.

open-sourceOpen Source