Stacks
Curated, opinionated tool combinations for specific use cases, roles, and budgets.
Agent Eval Observability Stack: Self-Hosted Tracing, Testing, and Debugging
Cost: $0/mo
A self-hostable agent evaluation and observability stack for teams replacing ad hoc LangSmith-style dashboards with open dev loops: Judgeval scores behavior, Laminar traces and tests workflows, TraceRoot debugs failures, Prompt Flow organizes eval runs, and OpenAI Agents SDK provides a runnable agent surface.
RAG Production Retrieval Stack: Multimodal Ingestion, Indexing, and Evaluation
Cost: $0/mo
A production RAG retrieval stack for teams that need more than a demo chatbot: LlamaIndex coordinates indexing and retrieval, RAGFlow and RAG-Anything handle difficult multimodal documents, and Judgeval keeps retrieval quality measurable before and after launch.
Production LLM Evaluation Stack
Cost: varies
A production LLM evaluation stack should catch regressions before release, probe security failures, and close the loop with real traces and user feedback. This stack combines Promptfoo for CI gates, DeepEval/OpenAI Evals for metric-heavy test suites, and Langfuse or Helicone for observability and production datasets.
Multi-Agent Coding Orchestration Stack
Cost: varies
A stack for teams adopting multi-agent autonomous development. It covers daemon-based issue processing with Symphony, parallel agent fleet management with Agent Orchestrator, Claude-native swarm intelligence with ruflo, and lightweight multi-engine looping with ralphy. All tools are open source and composable for different team sizes and workflows.
Zero-Trust Self-Hosted Infrastructure Stack
Cost: $20/mo
A modern self-hosted infrastructure stack for teams and homelabbers who want secure remote access, container management, SSH workflows, and lightweight authentication without enterprise complexity or subscription costs. All tools are open source and deployable on minimal hardware.
AI Agent Browser Automation Stack
Cost: $0/mo
A production stack for giving AI agents the ability to browse, interact with, and extract data from the web. Combines Browserless for headless browser infrastructure, Page Agent for in-page AI interaction, and Nango for API integrations, so that AI agents can operate across both web interfaces and APIs.
Open-Source Feature Flag and Experimentation Stack
Cost: $0/mo
A complete open-source stack for feature flags, A/B testing, and progressive rollouts using GrowthBook for experimentation, Flagsmith for simple feature toggles, and supporting infrastructure. All tools are self-hostable and replace paid platforms like LaunchDarkly and Statsig at zero software cost.
Chinese AI Model Ecosystem Stack
Cost: $0/mo
A complete open-source toolkit for working with AI models from the leading Chinese AI research labs, covering fine-tuning, inference serving, agent development, and voice synthesis.
MCP Production Infrastructure Stack
Cost: $0/mo
A complete toolkit for deploying and managing Model Context Protocol servers at production scale, from browser automation through server management to gateway routing and agent orchestration.
API Documentation Stack
Cost: $0/mo
A complete API documentation toolkit combining modern reference rendering, developer portal generation, and property-based API testing for teams that treat their API docs as a product.
Modern Frontend Component Stack
Cost: $0/mo
A curated frontend toolkit combining headless primitives, zero-runtime styling, animated components, and number transitions for building polished React applications with full design control.
AI-Powered Security Testing Stack
Cost: varies
A comprehensive security testing toolkit that combines AI-powered vulnerability discovery, LLM security assessment, API fuzzing, and supply chain analysis to protect modern applications across their entire attack surface.