aicoolies logo
TaskWeaver logo

TaskWeaver

Code-first agent framework for data analytics tasks

Share
open-sourceOpen Source
Visit Website →

TaskWeaver is Microsoft's open-source code-first agent framework that converts natural language requests into executable Python code for data analytics and workflow automation. Unlike text-based agent frameworks, it preserves rich in-memory data structures like DataFrames across conversation turns, supports custom algorithm plugins as callable functions, and verifies generated code before execution. It includes a Planner for task decomposition and a Code Interpreter for generation and execution.

TaskWeaver addresses a specific limitation of most agent frameworks: they treat everything as text strings, losing the ability to work with native Python data structures like pandas DataFrames, numpy arrays, and dictionaries across conversation rounds. When a user asks TaskWeaver to pull data from a SQL database, run anomaly detection, and visualize results, the framework generates Python code that operates on actual in-memory objects rather than serializing everything to strings or files between steps. This makes it practical for real data analytics workflows where a business analyst wants to interact with data using natural language while the system handles the code generation and execution transparently.

The architecture has three core components. The Planner acts as the entry point, breaking user requests into subtasks and managing execution with self-reflection — if something goes wrong, it adjusts the plan rather than failing. The Code Generator produces Python code for each subtask, considering available plugins and domain-specific examples. The Code Executor runs the generated code in isolated processes with session management to keep different users' data separate. Plugins are standard Python functions that encapsulate custom algorithms — developers write functions for their domain-specific operations (SQL queries, ML models, API calls) and TaskWeaver treats them as callable tools the LLM can orchestrate. Domain knowledge is incorporated through configurable examples that teach the planner how to approach specific types of tasks.

TaskWeaver ships with a Docker image for containerized deployment and supports code execution in separate processes for security isolation. The framework works with OpenAI models and can be configured for local LLMs as well. The research paper behind TaskWeaver details a benchmark of 258 test cases for evaluating data analytics agent performance. While the project's release cadence has slowed compared to its initial launch in late 2023, it remains a reference implementation for the code-first approach to data agent design — particularly relevant as the industry shifts from simple chat-based interactions toward agents that can genuinely manipulate and analyze structured data.

Pricing

Free and open-source under MIT license

Platforms

Python, Docker, supports OpenAI and local LLMs

Categories

Tags

Use Cases

Alternatives

Browser Use logo

Browser Use

AI agent framework for web browser automation

Browser Use is an open-source AI agent framework with 99K+ GitHub stars enabling LLMs to control web browsers via natural language. Y Combinator-backed, it lets agents navigate sites, fill forms, extract data, and complete multi-step tasks autonomously. Built on Playwright with vision-based element detection, multi-tab management, cookie persistence, and self-correcting actions. Supports OpenAI, Anthropic, and local models with a simple Python API for building custom browser agents.

open-sourceOpen Source
Agno logo

Agno

Lightweight multi-modal agent framework

Fast, lightweight Python framework for building multi-modal AI agents, formerly known as Phidata. Includes built-in memory, knowledge bases, tools, and reasoning capabilities with 40K+ GitHub stars. Designed for developers who want to build production-ready agents quickly with minimal boilerplate, supporting structured outputs and multi-agent coordination out of the box.

open-sourceOpen Source

Claude-Flow

Multi-agent orchestration platform for Claude Code

Claude-Flow is an open-source multi-agent orchestration platform that deploys dozens of concurrent Claude Code agents with shared memory and coordinated workflows. It enables parallel task execution, hierarchical agent coordination, and persistent context across sessions. Run via npx with zero setup. Described as the leading agent orchestration platform for Claude by industry analysts, it has 9,100+ GitHub stars and is used for complex codebase-wide refactoring and multi-file development tasks.

open-sourceOpen Source

Related Tools

Hermes Agent logo

Hermes Agent

Top Pick

Open-source AI agent framework with persistent memory, reusable skills, tools, and messaging gateways

Hermes Agent is an open-source AI agent framework with persistent memory, reusable skills, 40+ tools, cron jobs, and messaging gateways.

open-sourceOpen Source

Accomplish Coworker

Open-source desktop AI coworker for browsing and code execution.

Accomplish Coworker is an MIT-licensed open-source AI coworker that runs on the desktop, combining computer-use style browsing with code execution so agents can research, implement, run, and debug workflows in one local environment.

open-sourceOpen SourceTelemetry

Headroom

Context compression for LLM apps and coding agents

Headroom is an Apache-2.0 context compression layer for LLM apps and coding agents. It compresses tool output, logs, files, RAG chunks, and agent history through a local library, proxy, wrapper, or MCP server, with retrieval hooks for bringing originals back when needed. Treat its savings numbers as Headroom-reported benchmarks, not independent aicoolies measurements.

open-sourceOpen SourceTelemetry

Codebase Memory MCP

Codebase knowledge graph MCP server for AI coding agents

Codebase Memory MCP is an MIT-licensed MCP server that turns a repository into a persistent code knowledge graph for AI coding agents. It gives Claude Code, Cursor, Codex-style agents, and other MCP clients structural queries for functions, classes, call chains, routes, and architecture, helping them explore large projects without repeatedly rereading files or relying only on broad search.

open-sourceOpen SourceTelemetry
BeeAI Framework logo

BeeAI Framework

Python and TypeScript framework for production multi-agent systems

BeeAI Framework is an Apache-2.0 toolkit for building production-ready AI agents and multi-agent systems in Python and TypeScript. Its docs cover agents, tools, RAG, memory, workflows, backend providers, serving, and A2A/MCP integration surfaces, making it a vendor-neutral option for teams comparing LangGraph, CrewAI, Mastra, and related agent runtimes.

open-sourceOpen SourceTelemetry
Klavis AI logo

Klavis AI

MCP integration platform for agent tool use at scale

Klavis AI is an Apache-2.0 MCP integration platform for teams connecting AI agents to external SaaS tools and APIs. The public repo and official docs position it as infrastructure for reliable tool access at scale, so it fits teams that want reusable MCP connectors without treating every integration as a one-off script or custom OAuth maintenance project.

open-sourceOpen SourceTelemetry