2 tools tagged
Showing 2 of 2 tools
Open-source post-building layer for agents — tracing, evals, and online monitoring
Judgeval is the open-source post-building layer for AI agents from Judgment Labs, providing OpenTelemetry-based tracing, hosted and custom evaluation scorers, and online behavior monitoring for LLM-powered applications. Instrument any function with a single decorator, score live production traffic against faithfulness and instruction-adherence checks, and feed real-world failures back into reinforcement learning or supervised fine-tuning loops.
Microsoft's zero-code-change RL trainer for AI agents
Agent Lightning is Microsoft Research's open-source framework that makes AI agents trainable through reinforcement learning with virtually zero code changes. Supports RL, Automatic Prompt Optimization, and Supervised Fine-tuning across any agent framework including LangChain, OpenAI Agents SDK, AutoGen, and CrewAI. 14K+ GitHub stars, ranked among Microsoft's top 50 most-starred projects.