aicoolies logo

Best tools for AI Model Training

Training, fine-tuning, and evaluating AI/ML models for specific use cases

Showing 24 of 82 tools

Deep Lake logo

Deep Lake

AI data runtime for multimodal datasets and vector search

Deep Lake is an open-source AI data runtime from Activeloop for storing, versioning, and querying multimodal data and embeddings. It fits teams building RAG, training, evaluation, or dataset-heavy agent workflows that need a bridge between vector search, structured metadata, and large image, text, audio, or video collections.

open-sourceOpen Source
Weights & Biases logo

Weights & Biases

ML experiment tracking and model monitoring

Weights & Biases is an AI developer platform for experiment tracking, artifact and model lineage, model monitoring, and Weave-based LLM evaluation. It helps teams log runs, compare metrics, manage datasets and model artifacts, and collaborate through dashboards, reports, alerts, SSO/RBAC controls, and hosted or self-managed deployment options.

freemium
Labelbox logo

Labelbox

Data factory for AI teams and model training

Labelbox is a comprehensive data platform for AI teams handling reinforcement learning, evaluations, robotics, and human feedback workflows. Core capabilities include RL data generation with knowledge work rubrics, custom evaluations for private benchmarks and model comparisons, robotics data with full-stack video and trajectories, and an expert network of 1.5M+ knowledge workers including 50K+ PhDs. Trusted by 80% of leading AI labs for production data operations.

paid
Baseten logo

Baseten

ML inference platform for production AI models

Baseten is the inference platform for deploying AI models at scale with dedicated and pre-optimized model APIs and performance-optimized infrastructure. Specializes in image generation, transcription, text-to-speech, LLM serving, embeddings, and compound AI workloads. Delivers 75% latency reduction with 415ms cold starts and 3000+ concurrent scaling. Available as managed cloud or self-hosted, trusted by Cursor, Notion, Descript, and Sourcegraph for production inference.

api-usage-based
Hopsworks logo

Hopsworks

AI Lakehouse with Feature Store for real-time ML

Hopsworks is a data-intensive AI platform combining a Python-centric Feature Store with MLOps capabilities for production ML systems. Provides sub-millisecond feature retrieval powered by RonDB, dual offline and online storage for batch and real-time inference, experiment tracking, model registry, and deployment pipelines. Available as managed cloud on AWS, Azure, and GCP, self-hosted on Kubernetes, or serverless platform.

freemiumOpen Source
DUSt3R logo

DUSt3R

3D reconstruction without camera parameters

DUSt3R is Naver's breakthrough 3D reconstruction method that generates dense 3D scenes from unconstrained image pairs without known camera intrinsics or extrinsics. It casts pairwise reconstruction as pointmap regression, removing hard geometric constraints of projective camera models. Supports multi-view alignment, depth estimation, visual localization, and extends to MASt3R and MUSt3R for large-scale applications.

open-sourceOpen Source

Dolphin

ByteDance multimodal document image parser

Dolphin is ByteDance's multimodal document parsing model that handles intertwined text, tables, formulas, and figures in complex documents. Using a two-stage analyze-then-parse approach with a Swin Transformer vision encoder and MBart decoder, it performs layout analysis and parallel element parsing with heterogeneous anchor prompts. Dolphin-v2 adds document-type awareness for invoices, papers, and forms.

open-sourceOpen Source
Pixeltable logo

Pixeltable

Declarative multimodal AI data infrastructure

Pixeltable is a declarative data infrastructure for multimodal AI that stores video, audio, images, and documents as first-class column types. Define Python computed columns for inference and transformations, and Pixeltable auto-orchestrates execution with incremental updates. Built-in vector search eliminates the need for separate vector databases while supporting RAG and semantic search workflows.

open-sourceOpen Source
Triton Inference Server logo

Triton Inference Server

NVIDIA's optimized AI model serving platform

Triton Inference Server is NVIDIA's open-source inference serving platform that deploys AI models from TensorRT, PyTorch, ONNX, TensorFlow, OpenVINO, Python, and more across cloud, data center, and edge environments. It supports dynamic batching, model ensembles, concurrent model execution on GPUs and CPUs, and real-time, streaming, and batch inference patterns. Includes Model Analyzer for profiling and Model Navigator for automated optimization.

open-sourceOpen Source

FlashAttention

Fast memory-efficient GPU attention kernels

FlashAttention is a fast and memory-efficient exact attention implementation that reduces GPU memory usage from quadratic to linear in sequence length. Created by Tri Dao, it achieves 3-4x speedups over baseline implementations through IO-aware tiling that minimizes HBM reads and writes. Versions include FlashAttention-2 with improved parallelism, FlashAttention-3 optimized for Hopper H100 GPUs, and FlashAttention-4 targeting Hopper and Blackwell architectures.

open-sourceOpen Source

Amphion

Open-source toolkit for audio, music, and speech generation

Amphion is an open-source audio generation toolkit from OpenMMLab designed for reproducible research in speech synthesis, voice conversion, singing voice synthesis, and text-to-audio generation. It implements state-of-the-art models including MaskGCT, DualCodec, VITS, and VALL-E with built-in architecture visualizations for educational use. The project ships with the Emilia-Large dataset of 200,000 hours of speech data and includes multiple vocoders and evaluation metrics for benchmarking.

open-sourceOpen Source
VoxCPM logo

VoxCPM

Tokenizer-free multilingual TTS with voice cloning

VoxCPM is an open-source text-to-speech system from OpenBMB generating continuous speech across 30 languages without traditional tokenization. Its 2B parameter end-to-end diffusion architecture produces 48kHz studio-quality audio with natural prosody and emotion. Key capabilities include voice design from text descriptions, few-shot voice cloning, and multilingual synthesis without language-specific modules. The Apache 2.0 project has 8,700 GitHub stars.

open-sourceOpen Source
fal.ai logo

fal.ai

Serverless AI inference for generative media at scale

fal.ai is a serverless AI inference platform providing ultra-low-latency APIs for generating images, videos, audio, and 3D models. With 600+ production-ready models and native Python and JavaScript SDKs, it eliminates GPU management while delivering 30-50% lower costs than alternatives. Automatic scaling with no cold starts and real-time streaming support make it ideal for interactive AI applications.

api-usage-based
DataHub logo

DataHub

Open-source metadata platform for data discovery

DataHub is an open-source metadata platform for data discovery, governance, and observability, originally developed at LinkedIn. It provides a centralized catalog with 80+ integrations for data warehouses, lakes, dashboards, and ML platforms. DataHub offers real-time metadata ingestion, column-level lineage tracking, automated quality checks, and fine-grained access policies. Used by 3,000+ organizations in production. Apache 2.0 licensed with 11.8K+ GitHub stars.

freemiumOpen Source

MNN

Lightweight mobile and edge AI inference engine

MNN is a lightweight, high-performance deep learning inference engine developed by Alibaba and battle-tested across 30+ Alibaba apps including Taobao, DingTalk, and Youku. It supports TensorFlow, ONNX, PyTorch, and Caffe models with optimized backends for CPU, GPU, and NPU on mobile and edge devices. MNN includes on-device LLM inference, an OpenCV-like image processing library, and Python bindings for rapid prototyping. Apache 2.0 licensed with 15K+ stars.

open-sourceOpen Source
Cleanlab logo

Cleanlab

AI-powered data quality for ML datasets

Cleanlab is a data-centric AI library that automatically detects and fixes label errors, outliers, and data quality issues in machine learning datasets. It works with any ML model and any data type including text, images, tabular, and audio by analyzing model predictions to identify mislabeled examples, near-duplicates, and ambiguous data points. Cleanlab helps teams improve model accuracy by cleaning training data rather than tuning model architecture.

freemiumOpen Source
Polars logo

Polars

Lightning-fast DataFrame library in Rust

Polars is an extremely fast DataFrame library written in Rust that provides a powerful query engine for data manipulation in Python, Node.js, and R. Built on Apache Arrow columnar format, Polars delivers performance that outpaces Pandas by 10-100x on common operations through parallel execution and SIMD optimizations. It features lazy evaluation with automatic query optimization, streaming for out-of-core processing, and an expressive API for filtering, joining, and aggregating datasets.

open-sourceOpen Source

Multica

Open-source platform for managing AI coding agents as teammates

Multica is an open-source managed agents platform that lets you assign coding tasks to AI agents like Claude Code and Codex as if they were team members. It provides a unified dashboard for task assignment, execution monitoring, and skill reuse across local and cloud compute environments. With multi-workspace support, team-level isolation, and reusable skill compounding, Multica turns autonomous coding agents into organized, trackable development resources.

open-sourceOpen Source

Fabric

Modular AI prompt framework for everyday tasks

Fabric is an open-source framework that organizes AI prompts into reusable patterns for solving everyday tasks like summarizing content, explaining code, extracting insights from videos, and generating social media posts. Written in Go with support for 20+ AI providers including OpenAI, Claude, Gemini, and Ollama, it runs from the command line and can serve as a REST API. With 40,000+ GitHub stars, Fabric bridges the gap between AI capabilities and practical workflow automation.

open-sourceOpen Source

LocalSend

Open-source cross-platform file sharing over local network

LocalSend is a free, open-source application for secure peer-to-peer file and message sharing between nearby devices over your local network. It works on Windows, macOS, Linux, Android, iOS, and Fire OS without requiring an internet connection or third-party servers. Each device generates TLS/SSL certificates for encrypted HTTPS communication, making it a privacy-first alternative to AirDrop that works across all operating systems.

open-sourceOpen Source

Archon

AI agent that builds other AI agents

Archon is an open-source AI meta-agent created by Cole Medin that autonomously builds, refines, and optimizes other AI agents. Now evolving into Archon OS, it serves as a knowledge and task management backbone for AI coding assistants. The system uses an agentic coding workflow with framework knowledge bases for Pydantic AI, LangGraph, and other agent frameworks, enabling developers to describe what they need and let Archon generate the agent code, test it, and iterate until it works.

open-sourceOpen Source
21st.dev logo

21st.dev

npm for design engineers — shadcn/ui component marketplace

21st.dev is the largest open-source marketplace of shadcn/ui-based React Tailwind components, blocks, and hooks. Used by 1.4M developers with 200K monthly active users. Functions as an MCP server enabling AI coding agents to discover and compose UI components via API. Features a Magic AI feature that generates components from natural language prompts. All components installable with npx shadcn.

freemiumOpen Source

AI Scientist v2

Autonomous scientific discovery via agentic tree search

AI Scientist v2 is Sakana AI's open-source system for fully autonomous scientific research using LLM-powered agentic tree search. It generates hypotheses, designs experiments, writes and executes code, analyzes results, and produces publishable manuscripts without human intervention. The system uses progressive exploration with backtracking to navigate the research space efficiently.

open-sourceOpen Source

gemma.cpp

Lightweight C++ inference for Google Gemma models

gemma.cpp is Google's standalone C++ inference engine built specifically for running Gemma language models without Python or CUDA dependencies. It provides optimized CPU inference using SIMD instructions and Highway library, supports Gemma 2 and Gemma 3 models, and runs on x86 and ARM architectures. Designed for embedded systems, edge devices, and server deployments needing minimal overhead.

open-sourceOpen Source