Stacks

Curated, opinionated tool combinations for specific use cases, roles, and budgets.

Showing 12 of 108 stacks

Agent Eval Observability Stack: Self-Hosted Tracing, Testing, and Debugging

A self-hostable agent evaluation and observability stack for teams replacing ad hoc LangSmith-style dashboards with open dev loops: Judgeval scores behavior, Laminar traces and tests workflows, TraceRoot debugs failures, Prompt Flow organizes eval runs, and OpenAI Agents SDK provides a runnable agent surface.

JudgevalLaminarTraceRootPrompt FlowOpenAI Agents SDK

RAG Production Retrieval Stack: Multimodal Ingestion, Indexing, and Evaluation

$0/mo

A production RAG retrieval stack for teams that need more than a demo chatbot: LlamaIndex coordinates indexing and retrieval, RAGFlow and RAG-Anything handle difficult multimodal documents, and Judgeval keeps retrieval quality measurable before and after launch.

LlamaIndexRAGFlowRAG-AnythingJudgeval

Production LLM Evaluation Stack

varies

A production LLM evaluation stack should catch regressions before release, probe security failures, and close the loop with real traces and user feedback. This stack combines Promptfoo for CI gates, DeepEval/OpenAI Evals for metric-heavy test suites, and Langfuse or Helicone for observability and production datasets.

PromptfooDeepEvalOpenAI EvalsLangfuseHelicone

Multi-Agent Coding Orchestration Stack

varies

A stack for teams adopting multi-agent autonomous development. Covers daemon-based issue processing with Symphony, parallel agent fleet management with Agent Orchestrator, Claude-native swarm intelligence with ruflo, and lightweight multi-engine looping with ralphy. All tools are open source and composable for different team sizes and workflows.

SymphonyAgent Orchestratorrufloralphy

Zero-Trust Self-Hosted Infrastructure Stack

$20/mo

A modern self-hosted infrastructure stack for teams and homelabbers who want secure remote access, container management, SSH workflows, and lightweight authentication without enterprise complexity or subscription costs. All tools are open source and deployable on minimal hardware.

NetBirdTermixApple ContainerizationAuthentik

AI Agent Browser Automation Stack

$0/mo

A production stack for giving AI agents the ability to browse, interact with, and extract data from the web. Combines Browserless for headless browser infrastructure, Page Agent for in-page AI interaction, and Nango for API integrations — enabling AI agents to operate across both web interfaces and APIs.

BrowserlessPage AgentNango

Open-Source Feature Flag and Experimentation Stack

$0/mo

A complete open-source stack for feature flags, A/B testing, and progressive rollouts using GrowthBook for experimentation, Flagsmith for simple feature toggles, and supporting infrastructure. All tools are self-hostable and replace paid platforms like LaunchDarkly and Statsig at zero software cost.

GrowthBookFlagsmithValkey

Chinese AI Model Ecosystem Stack

$0/mo

A complete open-source toolkit for working with Chinese AI models covering fine-tuning, inference serving, agent development, and voice synthesis from the leading Chinese AI research labs.

ms-swiftXinferenceQwen-AgentFlashMLAGPT-SoVITS

MCP Production Infrastructure Stack

$0/mo

A complete toolkit for deploying and managing Model Context Protocol servers at production scale, from browser automation through server management to gateway routing and agent orchestration.

Playwright MCPMCPMmcp-proxyMCPJungleNanobot

API Documentation Stack

$0/mo

A complete API documentation toolkit combining modern reference rendering, developer portal generation, and property-based API testing for teams that treat their API docs as a product.

ScalarZudokuSchemathesis

Modern Frontend Component Stack

$0/mo

A curated frontend toolkit combining headless primitives, zero-runtime styling, animated components, and number transitions for building polished React applications with full design control.

Ark UIPanda CSSMagic UIMotionNumberFlow

AI-Powered Security Testing Stack

varies

A comprehensive security testing toolkit that combines AI-powered vulnerability discovery, LLM security assessment, API fuzzing, and supply chain analysis to protect modern applications across their entire attack surface.

StackHawkFuzzyAISchemathesisSonatype Lifecycle