Loading...
Loading...
Training, fine-tuning, and evaluating AI/ML models for specific use cases
Showing 24 of 81 tools
ML experiment tracking and model monitoring
Weights and Biases is the AI developer platform for experiment tracking, model monitoring, and ML workflow orchestration. Weave extends W&B with LLM ops capabilities for prompt engineering, evaluation, and deployment. Enables teams to track experiments, monitor model performance in production, manage datasets, log LLM application traces, and collaborate on ML projects with visualization dashboards, automated logging, and enterprise SSO and RBAC compliance.
Data factory for AI teams and model training
Labelbox is a comprehensive data platform for AI teams handling reinforcement learning, evaluations, robotics, and human feedback workflows. Core capabilities include RL data generation with knowledge work rubrics, custom evaluations for private benchmarks and model comparisons, robotics data with full-stack video and trajectories, and an expert network of 1.5M+ knowledge workers including 50K+ PhDs. Trusted by 80% of leading AI labs for production data operations.
ML inference platform for production AI models
Baseten is the inference platform for deploying AI models at scale with dedicated and pre-optimized model APIs and performance-optimized infrastructure. Specializes in image generation, transcription, text-to-speech, LLM serving, embeddings, and compound AI workloads. Delivers 75% latency reduction with 415ms cold starts and 3000+ concurrent scaling. Available as managed cloud or self-hosted, trusted by Cursor, Notion, Descript, and Sourcegraph for production inference.
AI Lakehouse with Feature Store for real-time ML
Hopsworks is a data-intensive AI platform combining a Python-centric Feature Store with MLOps capabilities for production ML systems. Provides sub-millisecond feature retrieval powered by RonDB, dual offline and online storage for batch and real-time inference, experiment tracking, model registry, and deployment pipelines. Available as managed cloud on AWS, Azure, and GCP, self-hosted on Kubernetes, or serverless platform.
3D reconstruction without camera parameters
DUSt3R is Naver's breakthrough 3D reconstruction method that generates dense 3D scenes from unconstrained image pairs without known camera intrinsics or extrinsics. It casts pairwise reconstruction as pointmap regression, removing hard geometric constraints of projective camera models. Supports multi-view alignment, depth estimation, visual localization, and extends to MASt3R and MUSt3R for large-scale applications.
ByteDance multimodal document image parser
Dolphin is ByteDance's multimodal document parsing model that handles intertwined text, tables, formulas, and figures in complex documents. Using a two-stage analyze-then-parse approach with a Swin Transformer vision encoder and MBart decoder, it performs layout analysis and parallel element parsing with heterogeneous anchor prompts. Dolphin-v2 adds document-type awareness for invoices, papers, and forms.
Declarative multimodal AI data infrastructure
Pixeltable is a declarative data infrastructure for multimodal AI that stores video, audio, images, and documents as first-class column types. Define Python computed columns for inference and transformations, and Pixeltable auto-orchestrates execution with incremental updates. Built-in vector search eliminates the need for separate vector databases while supporting RAG and semantic search workflows.
NVIDIA's optimized AI model serving platform
Triton Inference Server is NVIDIA's open-source inference serving platform that deploys AI models from TensorRT, PyTorch, ONNX, TensorFlow, OpenVINO, Python, and more across cloud, data center, and edge environments. It supports dynamic batching, model ensembles, concurrent model execution on GPUs and CPUs, and real-time, streaming, and batch inference patterns. Includes Model Analyzer for profiling and Model Navigator for automated optimization.
Fast memory-efficient GPU attention kernels
FlashAttention is a fast and memory-efficient exact attention implementation that reduces GPU memory usage from quadratic to linear in sequence length. Created by Tri Dao, it achieves 3-4x speedups over baseline implementations through IO-aware tiling that minimizes HBM reads and writes. Versions include FlashAttention-2 with improved parallelism, FlashAttention-3 optimized for Hopper H100 GPUs, and FlashAttention-4 targeting Hopper and Blackwell architectures.
Open-source toolkit for audio, music, and speech generation
Amphion is an open-source audio generation toolkit from OpenMMLab designed for reproducible research in speech synthesis, voice conversion, singing voice synthesis, and text-to-audio generation. It implements state-of-the-art models including MaskGCT, DualCodec, VITS, and VALL-E with built-in architecture visualizations for educational use. The project ships with the Emilia-Large dataset of 200,000 hours of speech data and includes multiple vocoders and evaluation metrics for benchmarking.
Tokenizer-free multilingual TTS with voice cloning
VoxCPM is an open-source text-to-speech system from OpenBMB generating continuous speech across 30 languages without traditional tokenization. Its 2B parameter end-to-end diffusion architecture produces 48kHz studio-quality audio with natural prosody and emotion. Key capabilities include voice design from text descriptions, few-shot voice cloning, and multilingual synthesis without language-specific modules. The Apache 2.0 project has 8,700 GitHub stars.
Serverless AI inference for generative media at scale
fal.ai is a serverless AI inference platform providing ultra-low-latency APIs for generating images, videos, audio, and 3D models. With 600+ production-ready models and native Python and JavaScript SDKs, it eliminates GPU management while delivering 30-50% lower costs than alternatives. Automatic scaling with no cold starts and real-time streaming support make it ideal for interactive AI applications.
Open-source metadata platform for data discovery
DataHub is an open-source metadata platform for data discovery, governance, and observability, originally developed at LinkedIn. It provides a centralized catalog with 80+ integrations for data warehouses, lakes, dashboards, and ML platforms. DataHub offers real-time metadata ingestion, column-level lineage tracking, automated quality checks, and fine-grained access policies. Used by 3,000+ organizations in production. Apache 2.0 licensed with 11.8K+ GitHub stars.
Lightweight mobile and edge AI inference engine
MNN is a lightweight, high-performance deep learning inference engine developed by Alibaba and battle-tested across 30+ Alibaba apps including Taobao, DingTalk, and Youku. It supports TensorFlow, ONNX, PyTorch, and Caffe models with optimized backends for CPU, GPU, and NPU on mobile and edge devices. MNN includes on-device LLM inference, an OpenCV-like image processing library, and Python bindings for rapid prototyping. Apache 2.0 licensed with 15K+ stars.
AI-powered data quality for ML datasets
Cleanlab is a data-centric AI library that automatically detects and fixes label errors, outliers, and data quality issues in machine learning datasets. It works with any ML model and any data type including text, images, tabular, and audio by analyzing model predictions to identify mislabeled examples, near-duplicates, and ambiguous data points. Cleanlab helps teams improve model accuracy by cleaning training data rather than tuning model architecture.
Lightning-fast DataFrame library in Rust
Polars is an extremely fast DataFrame library written in Rust that provides a powerful query engine for data manipulation in Python, Node.js, and R. Built on Apache Arrow columnar format, Polars delivers performance that outpaces Pandas by 10-100x on common operations through parallel execution and SIMD optimizations. It features lazy evaluation with automatic query optimization, streaming for out-of-core processing, and an expressive API for filtering, joining, and aggregating datasets.
Open-source platform for managing AI coding agents as teammates
Multica is an open-source managed agents platform that lets you assign coding tasks to AI agents like Claude Code and Codex as if they were team members. It provides a unified dashboard for task assignment, execution monitoring, and skill reuse across local and cloud compute environments. With multi-workspace support, team-level isolation, and reusable skill compounding, Multica turns autonomous coding agents into organized, trackable development resources.
Modular AI prompt framework for everyday tasks
Fabric is an open-source framework that organizes AI prompts into reusable patterns for solving everyday tasks like summarizing content, explaining code, extracting insights from videos, and generating social media posts. Written in Go with support for 20+ AI providers including OpenAI, Claude, Gemini, and Ollama, it runs from the command line and can serve as a REST API. With 40,000+ GitHub stars, Fabric bridges the gap between AI capabilities and practical workflow automation.
Open-source cross-platform file sharing over local network
LocalSend is a free, open-source application for secure peer-to-peer file and message sharing between nearby devices over your local network. It works on Windows, macOS, Linux, Android, iOS, and Fire OS without requiring an internet connection or third-party servers. Each device generates TLS/SSL certificates for encrypted HTTPS communication, making it a privacy-first alternative to AirDrop that works across all operating systems.
AI agent that builds other AI agents
Archon is an open-source AI meta-agent created by Cole Medin that autonomously builds, refines, and optimizes other AI agents. Now evolving into Archon OS, it serves as a knowledge and task management backbone for AI coding assistants. The system uses an agentic coding workflow with framework knowledge bases for Pydantic AI, LangGraph, and other agent frameworks, enabling developers to describe what they need and let Archon generate the agent code, test it, and iterate until it works.
npm for design engineers — shadcn/ui component marketplace
21st.dev is the largest open-source marketplace of shadcn/ui-based React Tailwind components, blocks, and hooks. Used by 1.4M developers with 200K monthly active users. Functions as an MCP server enabling AI coding agents to discover and compose UI components via API. Features a Magic AI feature that generates components from natural language prompts. All components installable with npx shadcn.
Autonomous scientific discovery via agentic tree search
AI Scientist v2 is Sakana AI's open-source system for fully autonomous scientific research using LLM-powered agentic tree search. It generates hypotheses, designs experiments, writes and executes code, analyzes results, and produces publishable manuscripts without human intervention. The system uses progressive exploration with backtracking to navigate the research space efficiently.
Lightweight C++ inference for Google Gemma models
gemma.cpp is Google's standalone C++ inference engine built specifically for running Gemma language models without Python or CUDA dependencies. It provides optimized CPU inference using SIMD instructions and Highway library, supports Gemma 2 and Gemma 3 models, and runs on x86 and ARM architectures. Designed for embedded systems, edge devices, and server deployments needing minimal overhead.
State-of-the-art open-source code language models
DeepSeek Coder is a family of open-source code language models trained from scratch on 2 trillion tokens of code and natural language data. Available in sizes from 1B to 33B parameters, these models support 80+ programming languages with 16K context windows and fill-in-the-blank capabilities. DeepSeek Coder outperforms CodeLlama-34B on HumanEval and MBPP benchmarks while being commercially licensable under MIT.